تخطي إلى التنقل الرئيسي تخطي إلى البحث تخطي إلى المحتوى الرئيسي

Multilingual Instruction Tuning With Just a Pinch of Multilinguality

  • Uri Shaham
  • , Jonathan Herzig
  • , Roee Aharoni
  • , Idan Szpektor
  • , Reut Tsarfaty
  • , Matan Eyal

نتاج البحث: فصل من :كتاب / تقرير / مؤتمرمنشور من مؤتمرمراجعة النظراء

ملخص

As instruction-tuned large language models (LLMs) gain global adoption, their ability to follow instructions in multiple languages becomes increasingly crucial. In this work, we investigate how multilinguality during instruction tuning of a multilingual LLM affects instruction-following across languages from the pre-training corpus. We first show that many languages transfer some instruction-following capabilities to other languages from even monolingual tuning. Furthermore, we find that only 40 multilingual examples integrated in an English tuning set substantially improve multilingual instruction-following, both in seen and unseen languages during tuning. In general, we observe that models tuned on multilingual mixtures exhibit comparable or superior performance in multiple languages compared to monolingually tuned models, despite training on 10x fewer examples in those languages. Finally, we find that diversifying the instruction tuning set with even just 2-4 languages significantly improves cross-lingual generalization. Our results suggest that building massively multilingual instruction-tuned models can be done with only a very small set of multilingual instruction-responses.

اللغة الأصليةالإنجليزيّة
عنوان منشور المضيفThe 62nd Annual Meeting of the Association for Computational Linguistics
العنوان الفرعي لمنشور المضيفFindings of the Association for Computational Linguistics, ACL 2024
المحررونLun-Wei Ku, Andre Martins, Vivek Srikumar
ناشرAssociation for Computational Linguistics (ACL)
الصفحات2304-2317
عدد الصفحات14
رقم المعيار الدولي للكتب (الإلكتروني)9798891760998
المعرِّفات الرقمية للأشياء
حالة النشرنُشِر - 2024
منشور خارجيًانعم
الحدثFindings of the 62nd Annual Meeting of the Association for Computational Linguistics, ACL 2024 - Hybrid, Bangkok, تايلند
المدة: ١١ أغسطس ٢٠٢٤١٦ أغسطس ٢٠٢٤

سلسلة المنشورات

الاسمProceedings of the Annual Meeting of the Association for Computational Linguistics
رقم المعيار الدولي للدوريات (المطبوع)0736-587X

!!Conference

!!ConferenceFindings of the 62nd Annual Meeting of the Association for Computational Linguistics, ACL 2024
الدولة/الإقليمتايلند
المدينةHybrid, Bangkok
المدة١١/٠٨/٢٤١٦/٠٨/٢٤

ملاحظة ببليوغرافية

Publisher Copyright:
© 2024 Association for Computational Linguistics.

بصمة

أدرس بدقة موضوعات البحث “Multilingual Instruction Tuning With Just a Pinch of Multilinguality'. فهما يشكلان معًا بصمة فريدة.

قم بذكر هذا