Abstract
Large language models hold significant promise in multilingual applications. However, inherent biases stemming from predominantly English-centric pre-training have led to the widespread practice of pre-translation, i.e., translating non-English inputs to English before inference, which adds complexity and causes information loss. This study re-evaluates the need for pre-translation in the context of PaLM2 models (Anil et al., 2023), which have been established as highly performant in multilingual tasks. We offer a comprehensive investigation across 108 languages and 6 diverse benchmarks, including open-ended generative tasks, which were excluded from previous similar studies. Our findings challenge the pre-translation paradigm established in prior research, highlighting the advantages of direct inference in PaLM2. Specifically, direct inference with PaLM2-L consistently outperforms pre-translation in 94 out of 108 languages. These findings pave the way for more efficient and effective multilingual applications, alleviating the limitations associated with pre-translation and unlocking linguistic authenticity.
| Original language | English |
|---|---|
| Title of host publication | Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics |
| Subtitle of host publication | Human Language Technologies, NAACL 2024 |
| Publisher | Association for Computational Linguistics (ACL) |
| Pages | 829-844 |
| Number of pages | 16 |
| ISBN (electronic) | 9798891761155 |
| Digital Object Identifiers (DOIs) | |
| Publication status | Published - 2024 |
| Externally published | Yes |
| Event | 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL 2024 - Hybrid, Mexico City, Mexico. Duration: 16 June 2024 → 21 June 2024 |
Publication series
| Name | Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL 2024 |
|---|---|
| Volume | 2 |
Conference
| Conference | 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL 2024 |
|---|---|
| Country/Territory | Mexico |
| City | Hybrid, Mexico City |
| Period | 16/06/24 → 21/06/24 |
Bibliographical note
Publisher Copyright: © 2024 Association for Computational Linguistics.
Fingerprint
The research topics of the publication 'Breaking the Language Barrier: Can Direct Inference Outperform Pre-Translation in Multilingual LLM Applications?' together form a unique fingerprint.