Abstract
Style transfer is one of the most intriguing hallmarks of natural language processing. It involves the semantic preserving conversion of artistic 'style'. Style transfer of the Hebrew language is an exceptionally challenging task due to the language's intricate morphology, inflectional structure, and orthography, which have undergone significant transformations throughout its history. In this work, we present the first generative language model for unsupervised textual style transfer for modern Hebrew, which rewrites sentences in a target style in the absence of parallel style corpora. We create a pseudo-parallel corpus through back translation, fine-tunes a pre-trained Hebrew language model, and leverages zero-shot learning. Our results demonstrate the first significant results in Hebrew style transfer in terms of transfer accuracy, semantic similarity, and fluency.
Original language | English |
---|---|
Title of host publication | Proceedings - 2023 IEEE International Conference on Big Data, BigData 2023 |
Editors | Jingrui He, Themis Palpanas, Xiaohua Hu, Alfredo Cuzzocrea, Dejing Dou, Dominik Slezak, Wei Wang, Aleksandra Gruca, Jerry Chun-Wei Lin, Rakesh Agrawal |
Publisher | Institute of Electrical and Electronics Engineers Inc. |
Pages | 4646-4653 |
Number of pages | 8 |
ISBN (Electronic) | 9798350324457 |
DOIs | |
State | Published - 2023 |
Event | 2023 IEEE International Conference on Big Data, BigData 2023 - Sorrento, Italy Duration: 15 Dec 2023 → 18 Dec 2023 |
Publication series
Name | Proceedings - 2023 IEEE International Conference on Big Data, BigData 2023 |
---|
Conference
Conference | 2023 IEEE International Conference on Big Data, BigData 2023 |
---|---|
Country/Territory | Italy |
City | Sorrento |
Period | 15/12/23 → 18/12/23 |
Bibliographical note
Publisher Copyright:© 2023 IEEE.
Keywords
- Computational Literary Studies
- Hebrew Language
- Language Model
- Machine Learning
- Modern Hebrew Literature
- Natural Language Processing
- Style Transfer