A complex document information processing prototype

S. Argamon, G. Agam, O. Frieder, D. Grossman, D. Lewis, G. Sohn, K. Voorhees
2006 Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval - SIGIR '06  
We developed a prototype for integrated retrieval and aggregation of diverse information contained in scanned paper documents. Such complex document information processing combines several forms of image processing together with textual/linguistic processing to enable effective analysis of complex document collections, a necessity for a wide range of applications. This is the first system to attempt integrated retrieval from complex documents; we report its current capabilities.
doi:10.1145/1148170.1148274 dblp:conf/sigir/ArgamonAFGLSV06 fatcat:f6qhnzyggreibiq6vddlzk27du