Adaptive Hindi OCR using generalized Hausdorff image comparison
ACM Transactions on Asian Language Information Processing
In this paper, we present an adaptive Hindi OCR using generalized Hausdorff image comparison implemented as part of a rapidly retargetable language tool effort. The system includes: script identification, character segmentation, training sample creation and character recognition. The OCR design (completed in one month) was applied to a complete Hindi-English bilingual dictionary (with 1083 pages) and a collection of ideal images extracted from Hindi documents in PDF format. Experimental results
doi:10.1145/979872.979875 fatcat:c7ktgcciy5dqbl4qw6hky5tbda
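
The abstract names generalized Hausdorff image comparison but does not define it here. As a rough illustration only, the sketch below implements the rank-based (partial) Hausdorff distance commonly used for comparing binary images, which may differ from the measure used in the paper. The function names, the `fraction` parameter, and the toy 3x3 glyphs are all assumptions made for this example and do not come from the paper.

```python
import math

def foreground_points(image):
    """Return (row, col) coordinates of nonzero pixels in a 2D list."""
    return [(r, c) for r, row in enumerate(image)
            for c, value in enumerate(row) if value]

def directed_partial_hausdorff(a_pts, b_pts, fraction=1.0):
    """Ranked distance from point set A to point set B.

    For each point in A, take the distance to its nearest point in B,
    sort those distances, and return the value at rank
    ceil(fraction * len(A)). fraction=1.0 gives the classical directed
    Hausdorff distance; smaller fractions ignore the worst outliers.
    """
    if not a_pts:
        return 0.0
    if not b_pts:
        return math.inf
    nearest = sorted(min(math.dist(a, b) for b in b_pts) for a in a_pts)
    rank = max(1, math.ceil(fraction * len(nearest)))
    return nearest[rank - 1]

def partial_hausdorff(image_a, image_b, fraction=1.0):
    """Symmetric version: the larger of the two directed distances."""
    a_pts = foreground_points(image_a)
    b_pts = foreground_points(image_b)
    return max(directed_partial_hausdorff(a_pts, b_pts, fraction),
               directed_partial_hausdorff(b_pts, a_pts, fraction))

# Hypothetical toy data: a template glyph and a sample with one noise pixel.
template = [[1, 1, 0],
            [0, 0, 0],
            [0, 0, 0]]
sample = [[1, 1, 0],
          [0, 0, 0],
          [0, 0, 1]]

full = partial_hausdorff(template, sample)          # dominated by the noise pixel
robust = partial_hausdorff(template, sample, 0.6)   # noise pixel is ranked out
```

With `fraction=1.0` the single stray pixel determines the distance, while a lower fraction discards it. That tolerance to isolated noise is the usual motivation for rank-based variants when matching segmented characters against templates.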