Layout analysis and content enrichment of digitized books

Costantino Grana, Giuseppe Serra, Marco Manfredi, Dalia Coppi, Rita Cucchiara
2014 Multimedia tools and applications  
In this paper we describe a system for automatically analyzing old documents and creating hyper linking between different epochs, thus opening ancient documents to young people and to make them available on the web with old and current content. We propose a supervised learning approach to segment text and illustration of digitized old documents using a texture feature based on local correlation aimed at detecting the repeating patterns of text regions and differentiate them from pictorial
more » ... ts. Moreover we present a solution to help the user in finding contemporary content connected to what is automatically extracted from the ancient documents.
doi:10.1007/s11042-014-2360-0 fatcat:mywxvsu53jhfxd7bqjc26xh2ve