Style Transfer of Modern Hebrew Literature Using Text Simplification and Generative Language Modeling

Research output: Contribution to journal › Conference article › peer-review

Abstract

The task of Style Transfer (ST) in Natural Language Processing (NLP) involves altering the style of a given sentence to match a target style while preserving its semantics. Currently, Hebrew models for NLP, and generative models in particular, are scarce. Developing such models is non-trivial because of the complex nature of Hebrew. The language presents notable challenges to NLP as a result of its rich morphology, intricate inflectional structure, and orthography, which have undergone significant transformations throughout its history. In this work, we propose a generative ST model for Modern Hebrew that rewrites sentences in a target style in the absence of parallel style corpora. Our focus is on the domain of Modern Hebrew literature, which presents unique challenges for the ST task. To overcome the lack of parallel data, we first create a pseudo-parallel corpus using back translation (BT) techniques for the purpose of text simplification. Subsequently, we fine-tune a pre-trained Hebrew language model (LM) and leverage a zero-shot learning (ZSL) approach for ST. Our study demonstrates significant achievements in terms of transfer accuracy, semantic similarity, and fluency when transferring a source sentence to a target style using our model. Notably, to the best of our knowledge, no prior research has focused on the development of ST models specifically for Modern Hebrew literature. As such, our proposed model constitutes a novel and valuable contribution to the fields of Hebrew NLP, Modern Hebrew literature, and, more generally, computational literary studies.
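The back-translation step mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function name `build_pseudo_parallel_corpus`, the injected translator callables, and the toy stand-in translators are all assumptions made for illustration, and the abstract does not specify the pivot language, the translation system, or which side of each pair serves as model input.

```python
from typing import Callable, Iterable, List, Tuple

# A translator is any function mapping a sentence to a sentence.
# In practice this would wrap a machine translation system (an assumption;
# the abstract does not name one).
Translator = Callable[[str], str]


def build_pseudo_parallel_corpus(
    sentences: Iterable[str],
    to_pivot: Translator,
    from_pivot: Translator,
) -> List[Tuple[str, str]]:
    """Pair each original sentence with its round-trip (back-translated) version."""
    pairs: List[Tuple[str, str]] = []
    for sentence in sentences:
        original = sentence.strip()
        if not original:
            continue  # skip empty lines
        round_trip = from_pivot(to_pivot(original)).strip()
        # Keep only pairs where the round trip actually changed the text;
        # identical pairs carry no rewriting signal.
        if round_trip and round_trip != original:
            pairs.append((original, round_trip))
    return pairs


# Toy stand-ins for real translators (hypothetical, for demonstration only):
# the "pivot language" is simply upper-cased text.
def toy_to_pivot(text: str) -> str:
    return text.upper()


def toy_from_pivot(text: str) -> str:
    return text.lower()


if __name__ == "__main__":
    corpus = ["Hello World", "already plain", "  "]
    for original, rewritten in build_pseudo_parallel_corpus(
        corpus, toy_to_pivot, toy_from_pivot
    ):
        print(f"{original!r} -> {rewritten!r}")
```

Passing the translators in as callables keeps the sketch independent of any particular translation library; the resulting pairs could then serve as training data for fine-tuning a language model.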

Original language: English
Pages (from-to): 391-412
Number of pages: 22
Journal: CEUR Workshop Proceedings
Volume: 3558
State: Published - 2023
Event: 2023 Computational Humanities Research Conference, CHR 2023 - Paris, France
Duration: 6 Dec 2023 - 8 Dec 2023

Bibliographical note

Publisher Copyright:
© 2023 Copyright for this paper by its authors.

Keywords

  • Computational Literary Studies
  • Hebrew Language
  • Language Model
  • Modern Hebrew Literature
  • Natural Language Processing
  • Style Transfer
