Extraction of Logical Structure from Articles in Mathematics [chapter]

Koji Nakagawa, Akihiro Nomura, Masakazu Suzuki
2004 Lecture Notes in Computer Science  
We propose a mathematical knowledge browser which helps people to read mathematical documents. By the browser printed mathematical documents can be scanned and recognized by OCR (Optical Character Recognition). Then the meta-information (e.g. title, author) and the logical structure (e.g. section, theorem) of the documents are automatically extracted. The purpose of this paper is to show the extraction method of logical structure specialized for mathematical documents. We implemented this
more » ... in INFTY which is an integrated OCR system for mathematical documents. In order to show the feasibility of the method we made a correct database from an existing mathematical OCR database, and made an experiment.
doi:10.1007/978-3-540-27818-4_20 fatcat:bljntj4l5nb6bn3bhkv2pcvfqu