Abstract
We present an end-to-end system for aligning transcript letters to their coordinates in a manuscript image. An intuitive GUI and an automatic line detection method enable the user to perform an exact alignment of parts of document pages. In order to bridge large regions in between annotation, and augment the manual effort, the system employs an optical-flow engine for directly matching at the pixel level the image of a line of a historical text with a synthetic image created from the transcript's matching line. Meanwhile, by accumulating aligned letters, and performing letter spotting, the system is able to bootstrap a rapid semi-automatic transcription of the remaining text. Thus, the amount of manual work is greatly diminished and the transcript alignment task becomes practical regardless of the corpus size.
Original language | English |
---|---|
Title of host publication | 13th IAPR International Conference on Document Analysis and Recognition, ICDAR 2015 - Conference Proceedings |
Publisher | IEEE Computer Society |
Pages | 711-715 |
Number of pages | 5 |
ISBN (Electronic) | 9781479918058 |
DOIs | |
State | Published - 20 Nov 2015 |
Event | 13th International Conference on Document Analysis and Recognition, ICDAR 2015 - Nancy, France Duration: 23 Aug 2015 → 26 Aug 2015 |
Publication series
Name | Proceedings of the International Conference on Document Analysis and Recognition, ICDAR |
---|---|
Volume | 2015-November |
ISSN (Print) | 1520-5363 |
Conference
Conference | 13th International Conference on Document Analysis and Recognition, ICDAR 2015 |
---|---|
Country/Territory | France |
City | Nancy |
Period | 23/08/15 → 26/08/15 |
Bibliographical note
Publisher Copyright:© 2015 IEEE.