תקציר
We present an end-to-end system for aligning transcript letters to their coordinates in a manuscript image. An intuitive GUI and an automatic line detection method enable the user to perform an exact alignment of parts of document pages. In order to bridge large regions in between annotation, and augment the manual effort, the system employs an optical-flow engine for directly matching at the pixel level the image of a line of a historical text with a synthetic image created from the transcript's matching line. Meanwhile, by accumulating aligned letters, and performing letter spotting, the system is able to bootstrap a rapid semi-automatic transcription of the remaining text. Thus, the amount of manual work is greatly diminished and the transcript alignment task becomes practical regardless of the corpus size.
שפה מקורית | אנגלית |
---|---|
כותר פרסום המארח | 13th IAPR International Conference on Document Analysis and Recognition, ICDAR 2015 - Conference Proceedings |
מוציא לאור | IEEE Computer Society |
עמודים | 711-715 |
מספר עמודים | 5 |
מסת"ב (אלקטרוני) | 9781479918058 |
מזהי עצם דיגיטלי (DOIs) | |
סטטוס פרסום | פורסם - 20 נוב׳ 2015 |
אירוע | 13th International Conference on Document Analysis and Recognition, ICDAR 2015 - Nancy, צרפת משך הזמן: 23 אוג׳ 2015 → 26 אוג׳ 2015 |
סדרות פרסומים
שם | Proceedings of the International Conference on Document Analysis and Recognition, ICDAR |
---|---|
כרך | 2015-November |
ISSN (מודפס) | 1520-5363 |
כנס
כנס | 13th International Conference on Document Analysis and Recognition, ICDAR 2015 |
---|---|
מדינה/אזור | צרפת |
עיר | Nancy |
תקופה | 23/08/15 → 26/08/15 |
הערה ביבליוגרפית
Publisher Copyright:© 2015 IEEE.