Multilingual Sequence-to-Sequence Models for Hebrew NLP

Matan Eyal, Hila Noga, Roee Aharoni, Idan Szpektor, Reut Tsarfaty

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

Abstract

Recent work attributes progress in NLP to large language models (LMs) with increased model size and large quantities of pretraining data. Despite this, current state-of-the-art LMs for Hebrew are both under-parameterized and under-trained compared to LMs in other languages. Additionally, previous work on pretrained Hebrew LMs focused on encoder-only models. While the encoder-only architecture is beneficial for classification tasks, it does not cater well to sub-word prediction tasks, such as Named Entity Recognition, when considering the morphologically rich nature of Hebrew. In this paper we argue that sequence-to-sequence generative architectures are more suitable for large LMs in morphologically rich languages (MRLs) such as Hebrew. We demonstrate this by casting tasks in the Hebrew NLP pipeline as text-to-text tasks, for which we can leverage powerful multilingual, pretrained sequence-to-sequence models such as mT5, eliminating the need for a separate, specialized, morpheme-based decoder. Using this approach, our experiments show substantial improvements over previously published results on all existing Hebrew NLP benchmarks. These results suggest that multilingual sequence-to-sequence models present a promising building block for NLP for MRLs.
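As a rough illustration of what casting a pipeline task as text-to-text can look like, the sketch below serializes a Named Entity Recognition example into a source string and a target string, and parses a generated target back into entity pairs. The task prefix, the `span [LABEL]` target layout, the function names, and the transliterated example sentence are all assumptions made for illustration; the abstract does not specify the input and output formats used in the paper. The sketch only shows why a generative target can name an entity that sits inside a larger surface token (here `toronto` under the prefix `l`), without a separate morpheme-level decoder.

```python
from typing import List, Tuple

# Hypothetical serialization; not the format used in the paper.
SEPARATOR = " ; "
EMPTY_TARGET = "none"


def to_text_to_text(
    task_prefix: str, sentence: str, entities: List[Tuple[str, str]]
) -> Tuple[str, str]:
    """Build (source, target) strings for a sequence-to-sequence model."""
    source = f"{task_prefix}: {sentence}"
    if not entities:
        return source, EMPTY_TARGET
    # Each entity is written as "span [LABEL]"; the span may be a morpheme
    # that is only part of a whitespace-delimited token in the sentence.
    target = SEPARATOR.join(f"{span} [{label}]" for span, label in entities)
    return source, target


def parse_target(target: str) -> List[Tuple[str, str]]:
    """Recover (span, label) pairs from a generated target string."""
    target = target.strip()
    if target == EMPTY_TARGET or not target:
        return []
    pairs = []
    for chunk in target.split(SEPARATOR):
        # Split on the last " [" so spans containing brackets still parse.
        span, _, label = chunk.rpartition(" [")
        pairs.append((span, label.rstrip("]")))
    return pairs


# Transliterated toy example: "ltoronto" = prefix "l" (to) + "toronto".
source, target = to_text_to_text(
    "ner", "hgati ltoronto", [("toronto", "GPE")]
)
print(source)
print(target)
print(parse_target(target))
```

In such a setup, the `(source, target)` pairs would be used to fine-tune a pretrained sequence-to-sequence model such as mT5, and `parse_target` would be applied to the generated text at evaluation time.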

Original language: English
Title of host publication: Findings of the Association for Computational Linguistics, ACL 2023
Publisher: Association for Computational Linguistics (ACL)
Pages: 7700-7708
Number of pages: 9
ISBN (Electronic): 9781959429623
Publication status: Published - 2023
Externally published: Yes
Event: 61st Annual Meeting of the Association for Computational Linguistics, ACL 2023 - Toronto, Canada
Duration: 9 Jul 2023 to 14 Jul 2023

Publication series

Name: Proceedings of the Annual Meeting of the Association for Computational Linguistics
ISSN (Print): 0736-587X

Conference

Conference: 61st Annual Meeting of the Association for Computational Linguistics, ACL 2023
Country/Territory: Canada
City: Toronto
Period: 9/07/23 to 14/07/23

Bibliographical note

Publisher Copyright:
© 2023 Association for Computational Linguistics.
