The file type is application/pdf.
Font adaptive word indexing of modern printed documents
2006
IEEE Transactions on Pattern Analysis and Machine Intelligence
We propose an approach for the word-level indexing of modern printed documents which are difficult to recognize using current OCR engines. By means of word-level indexing, it is possible to retrieve the position of words in a document, enabling queries involving proximity of terms. Web search engines implement this kind of indexing, allowing users to retrieve Web pages on the basis of their textual content. Nowadays, digital libraries hold collections of digitized documents that can be
doi:10.1109/tpami.2006.162
pmid:16886856
fatcat:qw7bx6blcncl7cnr73l3rvi3va