TY - JOUR
T1 - Ocr-free transcript alignment
AU - Hassner, Tal
AU - Wolf, Lior
AU - Dershowitz, Nachum
N1 - Copyright:
Copyright 2013 Elsevier B.V., All rights reserved.
PY - 2013
Y1 - 2013
N2 - Recent large-scale digitization and preservation efforts have made images of original manuscripts, accompanied by transcripts, commonly available. An important challenge, for which no practical system exists, is that of aligning transcript letters to their coordinates in manuscript images. Here we propose a system that directly matches the image of a historical text with a synthetic image created from the transcript for the purpose. This, rather than attempting to recognize individual letters in the manuscript image using optical character recognition (OCR). Our method matches the pixels of the two images by employing a dedicated dense flow mechanism coupled with novel local image descriptors designed to spatially integrate local patch similarities. Matching these pixel representations is performed using a message passing algorithm. The various stages of our method make it robust with respect to document degradation, to variations between script styles and to non-linear image transformations. Robustness, as well as practicality of the system, are verified by comprehensive empirical experiments.
AB - Recent large-scale digitization and preservation efforts have made images of original manuscripts, accompanied by transcripts, commonly available. An important challenge, for which no practical system exists, is that of aligning transcript letters to their coordinates in manuscript images. Here we propose a system that directly matches the image of a historical text with a synthetic image created from the transcript for the purpose. This, rather than attempting to recognize individual letters in the manuscript image using optical character recognition (OCR). Our method matches the pixels of the two images by employing a dedicated dense flow mechanism coupled with novel local image descriptors designed to spatially integrate local patch similarities. Matching these pixel representations is performed using a message passing algorithm. The various stages of our method make it robust with respect to document degradation, to variations between script styles and to non-linear image transformations. Robustness, as well as practicality of the system, are verified by comprehensive empirical experiments.
UR - http://www.scopus.com/inward/record.url?scp=84889578536&partnerID=8YFLogxK
U2 - 10.1109/ICDAR.2013.265
DO - 10.1109/ICDAR.2013.265
M3 - ???researchoutput.researchoutputtypes.contributiontojournal.conferencearticle???
AN - SCOPUS:84889578536
SN - 1520-5363
SP - 1310
EP - 1314
JO - Proceedings of the International Conference on Document Analysis and Recognition, ICDAR
JF - Proceedings of the International Conference on Document Analysis and Recognition, ICDAR
M1 - 6628826
T2 - 12th International Conference on Document Analysis and Recognition, ICDAR 2013
Y2 - 25 August 2013 through 28 August 2013
ER -