Scene Text Segmentation with Multi-level Maximally Stable Extremal Regions

Shangxuan Tian, Shijian Lu, Bolan Su, Chew Lim Tan
2014 2014 22nd International Conference on Pattern Recognition  
The segmentation of scene text from the image background has shown great importance in scene text recognition. In this paper, we propose a multi-level MSER technology that identifies the best-quality text candidates from a set of stable regions that are extracted from different color channel images. In order to identify the best-quality text candidates, a segmentation score is defined which exploits four measures to evaluate the text probability of each stable region including: 1) Stroke width
more » ... hat measures the small stroke width variation of the text; 2) Boundary curvature that measures the smoothness of the stable region boundary; 3) Character confidence that measures the likelihood of a stable region being text based on a pre-trained support vector classifier; 4) Color constancy that measures the global color consistency of each selected text candidate. Finally, the MSERs with the best segmentation score from each channel are combined to form the final segmentation. The proposed method is evaluated on the ICDAR2003 and SVT datasets and experiments show that it outperforms both popular document image binarization methods and state of the art scene text segmentation methods.
doi:10.1109/icpr.2014.467 dblp:conf/icpr/TianLST14 fatcat:nzi7x24smjdrfk552whhlwenry