Application of Phase-Based Features and Denoising in Postprocessing and Binarization of Historical Document Images

Hossein Ziaei Nafchi, Reza Farrahi Moghaddam, Mohamed Cheriet
2013 2013 12th International Conference on Document Analysis and Recognition  
Preprocessing and postprocessing steps significantly improve the performance of binarization methods, especially in the case of severely-degraded historical documents. In this paper, an unsupervised postprocessing method is introduced based on the phase-preserved denoised image and also phase congruency features extracted from the input image. The core of the method consists of two robust mask images that can be used to cross out false positive pixels on the output of the binarization method.
more » ... arization method. First, a mask with a high recall value is obtained from the denoised image using morphological operations. In parallel, a second mask is obtained based on phase congruency features. Then, a median filter is used to remove noise on these two masks, which then are used to correct the output of any binarization method. This approach was tested along with several state-ofthe-art binarization methods on the DIBCO'09, H-DIBCO'10, DIBCO'11 and H-DIBCO'12 datasets with promising and robust results. Furthermore, the high performance of the proposed masks shows their potential use as unsupervised semi-ground truth generator for learning-based binarization methods.
doi:10.1109/icdar.2013.51 dblp:conf/icdar/NafchiMC13 fatcat:sb7hydljajgizczhb5as7xv5nq