UCxn: Typologically Informed Annotation of Constructions Atop Universal Dependencies

Leonie Weissweiler, Nina Böbel, Kirian Guiller, Santiago Herrera, Wesley Scivetti, Arthur Lorenzi, Nurit Melnik, Archna Bhatia, Hinrich Schütze, Lori Levin, Amir Zeldes, Joakim Nivre, William Croft, Nathan Schneider

نتاج البحث: فصل من :كتاب / تقرير / مؤتمرمنشور من مؤتمرمراجعة النظراء

ملخص

The Universal Dependencies (UD) project has created an invaluable collection of treebanks with contributions in over 140 languages. However, the UD annotations do not tell the full story. Grammatical constructions that convey meaning through a particular combination of several morphosyntactic elements-for example, interrogative sentences with special markers and/or word orders-are not labeled holistically. We argue for (i) augmenting UD annotations with a “UCxn” annotation layer for such meaning-bearing grammatical constructions, and (ii) approaching this in a typologically informed way so that morphosyntactic strategies can be compared across languages. As a case study, we consider five construction families in ten languages, identifying instances of each construction in UD treebanks through the use of morphosyntactic patterns. In addition to findings regarding these particular constructions, our study yields important insights on methodology for describing and identifying constructions in language-general and language-particular ways, and lays the foundation for future constructional enrichment of UD treebanks.

اللغة الأصليةالإنجليزيّة
عنوان منشور المضيف2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation, LREC-COLING 2024 - Main Conference Proceedings
المحررونNicoletta Calzolari, Min-Yen Kan, Veronique Hoste, Alessandro Lenci, Sakriani Sakti, Nianwen Xue
ناشرEuropean Language Resources Association (ELRA)
الصفحات16919-16932
عدد الصفحات14
رقم المعيار الدولي للكتب (الإلكتروني)9782493814104
حالة النشرنُشِر - 2024
الحدثJoint 30th International Conference on Computational Linguistics and 14th International Conference on Language Resources and Evaluation, LREC-COLING 2024 - Hybrid, Torino, إيطاليا
المدة: ٢٠ مايو ٢٠٢٤٢٥ مايو ٢٠٢٤

سلسلة المنشورات

الاسم2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation, LREC-COLING 2024 - Main Conference Proceedings

!!Conference

!!ConferenceJoint 30th International Conference on Computational Linguistics and 14th International Conference on Language Resources and Evaluation, LREC-COLING 2024
الدولة/الإقليمإيطاليا
المدينةHybrid, Torino
المدة٢٠/٠٥/٢٤٢٥/٠٥/٢٤

ملاحظة ببليوغرافية

Publisher Copyright:
© 2024 ELRA Language Resource Association: CC BY-NC 4.0.

بصمة

أدرس بدقة موضوعات البحث “UCxn: Typologically Informed Annotation of Constructions Atop Universal Dependencies'. فهما يشكلان معًا بصمة فريدة.

قم بذكر هذا