Semantics-based content extraction in typewritten historical documents
2005
Eighth International Conference on Document Analysis and Recognition (ICDAR'05)
This paper presents a flexible approach to extracting content from scanned historical documents using semantic information. The final electronic document is the result of a "digital historical document lifecycle" process, where the expert knowledge of the historian/archivist user is incorporated at different stages. Results show that such a conversion strategy aided by (expert) user-specified semantic information and which enables the processing of individual parts of the document in a
doi:10.1109/icdar.2005.215
dblp:conf/icdar/AntonacopoulosK05
fatcat:ezw4rc6bibforlpyumwdvcxoii