A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2017; you can also visit the original URL.
The file type is application/pdf.
Integration of Text Information and Graphic Composite for PDF Document Analysis
[chapter]
2012
Communications in Computer and Information Science
The trend toward large-scale digitization has greatly motivated research on processing PDF documents that carry little structural information. Challenging problems, such as graphic segmentation integrated with text, remain unsolved and hinder the practical application of PDF layout analysis. To cope with such documents, a hybrid method incorporating text information and graphic composite is proposed to segment pages that are difficult for traditional methods to handle. Specifically, the text
doi:10.1007/978-3-642-34456-5_2
fatcat:6g5b2vseg5bzlckgsuijfnkc5e