Document Image Processing - A Review

Shazia Akram, Mehraj-Ud-Din Dar, Aasia Quyoum
2010 International Journal of Computer Applications  
The field of a digital-image processing has experienced dramatic growth and increasingly widespread applicability in recent years. Fortunately, advances in computer technology have kept pace with the rapid growth in volume of image data in these and other applications. Digital-image processing has become economical in many fields of research and in industrial and military applications. While each application has requirements unique from the others, all are concerned with faster, cheaper, more
more » ... curate, and more extensive computation. Analysis of document images for information extraction has become very prominent in recent past. Wide variety of information, which has been conventionally stored on paper, is now being converted into electronic form for better storage and intelligent processing. This needs processing of documents using image analysis, processing methods. This article provides an overview of various methods used for digital image processing using three main components: Pre-processing, Feature extraction and the Classification. Pre-processing includes Image acquisition, Binarization, identification, Layout analysis, feature extraction and classification. Classification is an important step in Office Automation, Digital Libraries, and other document image analysis applications. This article examines the various methods used for document image processing in order to achieve a processed document having high quality, accuracy and fast retrieval.
doi:10.5120/1475-1991 fatcat:hbcl73h53fd3lhzhtcioht4wby