Robust line segmentation for handwritten documents

Kamal Kuzhinjedathu, Harish Srinivasan, Sargur Srihari, Kathrin Berkner, Berrin A. Yanikoglu
2008 Document Recognition and Retrieval XV  
Line segmentation is the first and the most critical pre-processing step for a document recognition/analysis task. Complex handwritten documents with lines running into each other impose a great challenge for the line segmentation problem due to the absence of online stroke information. This paper describes a method to disentangle lines running into each other, by splitting and associating the correct character strokes to the appropriate lines. The proposed method can be used along with the
more » ... along with the existing algorithm 1 that identifies such overlapping lines in documents. A stroke tracing method is used to intelligently segment the overlapping components. The method uses slope and curvature information of the stroke to disambiguate the course of the stroke at cross points. Once the overlapping components are segmented into strokes, a statistical method is used to associate the strokes with appropriate lines.
doi:10.1117/12.766701 dblp:conf/drr/KuzhinjedathuSS08 fatcat:jx6ejlz54feu3gdnj6y2jonqta