Opening the knowledge dam: Speech recognition for video search

Vered Silber-Varod, Amir Winer, Nitza Geri

Research output: Contribution to journal, Article, peer-reviewed


Automatic Speech Recognition (ASR) may increase access to spoken information captured in videos. ASR is especially needed for online academic video lectures, which are gradually replacing class lectures and traditional textbooks. This conceptual article examines how technological barriers to ASR in under-resourced languages impair accessibility to video content, and demonstrates this with the empirical findings of Hebrew ASR evaluations. We compare ASR with Optical Character Recognition (OCR) as technologies that facilitate access to speech and textual content, respectively, and show their current performance in under-resourced languages. We identify ASR of under-resourced languages as the main barrier to searching academic video lectures. We further show that information retrieval technologies, such as smart video players that combine both ASR and OCR capacities, must come to the fore once ASR technologies have matured. We therefore suggest that the current state of information retrieval from video lectures in under-resourced languages is equivalent to a knowledge dam.

Original language: English
Pages (from-to): 106-111
Number of pages: 6
Journal: Journal of Computer Information Systems
Volume: 57
Issue number: 2
Publication status: Published - 2017

Bibliographical note

Funding Information:
The authors gratefully acknowledge that this research was supported by the Open University of Israel’s research fund (grant no. 502532).

Publisher Copyright:
© 2017 International Association for Computer Information Systems.

Copyright 2017 Elsevier B.V., All rights reserved.

