A Steerable Directional Local Profile Technique for Extraction of Handwritten Arabic Text Lines

Zhixin Shi, Srirangaraj Setlur, Venu Govindaraju
2009 10th International Conference on Document Analysis and Recognition
In this paper, we present a new text line extraction method for handwritten Arabic documents. The proposed technique is based on a generalized adaptive local connectivity map (ALCM) computed with a steerable directional filter. The algorithm is designed to handle the particularly difficult cases seen in handwritten documents, such as fluctuating, touching, or crossing text lines. It consists of three steps. First, a steerable filter is used to probe and determine foreground [...] along multiple directions at each pixel while generating the ALCM; the ALCM is then binarized using an adaptive thresholding algorithm to get a rough estimate of the location of the text lines. Second, connected component analysis is used to classify text and non-text patterns in the generated ALCM to refine the location of the text lines. Finally, the text lines are separated by superimposing the text line patterns in the ALCM on the original document image and extracting the connected components covered by the pattern mask. Analysis of experimental results on the DARPA MADCAT Arabic handwritten document data indicates that the method is robust and capable of correctly isolating handwritten text lines even on challenging document images.
doi:10.1109/icdar.2009.79 dblp:conf/icdar/ShiSG09 fatcat:wysbtgfabrhjdhnde22bxh2bxq
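
The three steps described in the abstract can be illustrated with a toy sketch in pure Python. This is not the authors' implementation. It makes several assumptions that the abstract does not confirm: the directional response is taken to be a simple count of foreground pixels along a short segment at each probed angle, the paper's adaptive thresholding is replaced by a single global threshold relative to the map's peak, and the text versus non-text classification is replaced by a minimum-area test. All function names and parameters (directional_map, label, extract_lines, half_len, rel_thresh, min_area) are hypothetical.

```python
import math
from collections import Counter, deque


def directional_map(img, half_len=6, angles_deg=(-10, 0, 10)):
    """Toy ALCM stand-in: per pixel, the largest foreground count found
    along a short segment centred on the pixel, over the probed angles."""
    h, w = len(img), len(img[0])
    out = [[0] * w for _ in range(h)]
    for angle in angles_deg:
        dx, dy = math.cos(math.radians(angle)), math.sin(math.radians(angle))
        offsets = {(round(t * dy), round(t * dx))
                   for t in range(-half_len, half_len + 1)}
        for y in range(h):
            for x in range(w):
                count = sum(img[y + oy][x + ox] for oy, ox in offsets
                            if 0 <= y + oy < h and 0 <= x + ox < w)
                out[y][x] = max(out[y][x], count)
    return out


def label(binary):
    """4-connected component labelling by breadth-first search.
    Returns (label grid, number of components); labels start at 1."""
    h, w = len(binary), len(binary[0])
    lab = [[0] * w for _ in range(h)]
    n = 0
    for y in range(h):
        for x in range(w):
            if binary[y][x] and not lab[y][x]:
                n += 1
                lab[y][x] = n
                queue = deque([(y, x)])
                while queue:
                    cy, cx = queue.popleft()
                    for ny, nx in ((cy + 1, cx), (cy - 1, cx),
                                   (cy, cx + 1), (cy, cx - 1)):
                        if (0 <= ny < h and 0 <= nx < w
                                and binary[ny][nx] and not lab[ny][nx]):
                            lab[ny][nx] = n
                            queue.append((ny, nx))
    return lab, n


def extract_lines(img, half_len=6, rel_thresh=0.5, min_area=20):
    """Return {line pattern id: [(row, col), ...]} for a binary image
    (1 = ink). Ink components not covered by any kept pattern are dropped."""
    # Step 1: directional map, then a global threshold
    # (assumption: the paper uses adaptive thresholding instead).
    alcm = directional_map(img, half_len)
    peak = max(max(row) for row in alcm)
    if peak == 0:
        return {}
    mask = [[int(v >= rel_thresh * peak) for v in row] for row in alcm]

    # Step 2: connected components of the mask; small patterns are
    # treated as non-text (a stand-in for the paper's classification).
    mask_lab, _ = label(mask)
    area = Counter(v for row in mask_lab for v in row if v)

    # Step 3: superimpose the patterns on the image; each ink component
    # goes to the pattern that covers most of its pixels.
    ink_lab, n_ink = label(img)
    votes = [Counter() for _ in range(n_ink + 1)]
    for y, row in enumerate(img):
        for x, v in enumerate(row):
            m = mask_lab[y][x]
            if v and m and area[m] >= min_area:
                votes[ink_lab[y][x]][m] += 1
    lines = {}
    for y, row in enumerate(ink_lab):
        for x, c in enumerate(row):
            if c and votes[c]:
                winner = votes[c].most_common(1)[0][0]
                lines.setdefault(winner, []).append((y, x))
    return lines
```

Calling extract_lines on a small binary image (a list of rows of 0 and 1) returns one pixel list per detected line pattern. Assigning whole ink components by majority vote, rather than pixel by pixel, keeps a stroke that slightly overshoots its mask in a single line. It does not attempt to split components that touch two lines, which is one of the hard cases the paper targets.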