Autotag: A tool for creating structured document collections from printed materials [chapter]

Kazem Taghva, Allen Condit, Julie Borsack
1998 Lecture Notes in Computer Science  
We report on the design and implementation of a system that automates the capture of structured documents from the optically recognized form of printed materials. The system is intended to convert printed collections into their corresponding tagged electronic versions with little or no manual intervention. This conversion process poses some unique problems, which we discuss along with our attempts to solve them. This system also establishes a mapping
between the bitmap image and its corresponding ASCII representation that can be used to design flexible image-based interfaces for IR-related applications.
doi:10.1007/bfb0053288 fatcat:x2o72wqwfnfutmgomfoyzyllpy