Unsupervised Style Transfer of Modern Hebrew using Generative Language Modeling and Zero-Shot Prompting

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Style transfer is one of the most intriguing hallmarks of natural language processing. It involves the semantic preserving conversion of artistic 'style'. Style transfer of the Hebrew language is an exceptionally challenging task due to the language's intricate morphology, inflectional structure, and orthography, which have undergone significant transformations throughout its history. In this work, we present the first generative language model for unsupervised textual style transfer for modern Hebrew, which rewrites sentences in a target style in the absence of parallel style corpora. We create a pseudo-parallel corpus through back translation, fine-tunes a pre-trained Hebrew language model, and leverages zero-shot learning. Our results demonstrate the first significant results in Hebrew style transfer in terms of transfer accuracy, semantic similarity, and fluency.

Original languageEnglish
Title of host publicationProceedings - 2023 IEEE International Conference on Big Data, BigData 2023
EditorsJingrui He, Themis Palpanas, Xiaohua Hu, Alfredo Cuzzocrea, Dejing Dou, Dominik Slezak, Wei Wang, Aleksandra Gruca, Jerry Chun-Wei Lin, Rakesh Agrawal
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages4646-4653
Number of pages8
ISBN (Electronic)9798350324457
DOIs
StatePublished - 2023
Event2023 IEEE International Conference on Big Data, BigData 2023 - Sorrento, Italy
Duration: 15 Dec 202318 Dec 2023

Publication series

NameProceedings - 2023 IEEE International Conference on Big Data, BigData 2023

Conference

Conference2023 IEEE International Conference on Big Data, BigData 2023
Country/TerritoryItaly
CitySorrento
Period15/12/2318/12/23

Bibliographical note

Publisher Copyright:
© 2023 IEEE.

Keywords

  • Computational Literary Studies
  • Hebrew Language
  • Language Model
  • Machine Learning
  • Modern Hebrew Literature
  • Natural Language Processing
  • Style Transfer

Fingerprint

Dive into the research topics of 'Unsupervised Style Transfer of Modern Hebrew using Generative Language Modeling and Zero-Shot Prompting'. Together they form a unique fingerprint.

Cite this