A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2017; you can also visit the original URL.
The file type is application/pdf
.
Two complementary techniques for digitized document analysis
1988
Proceedings of the ACM conference on Document processing systems - DOCPROCS '88
Two complementary methods are proposed for characterizing the spatial structure of digitized technical documents and labelling various logical components without using optical character recognition. The top-down method segments and labels the page image simultaneously using publication-specific information in the form of a page-grammar. The bottom-up method naively segments the document into rectangles that contain individual connected components, combines blocks using knowledge about generic
doi:10.1145/62506.62539
fatcat:mj7vn652orezzczcuxacldd4zu