Efficient Text Extraction Algorithm Using Color Clustering for Language Translation in Mobile Phone

Adrián Canedo-Rodríguez, Jung Hyoun Kim, Soo-Hyung Kim, John Kelly, Jung Hee Kim, Sun Yi, Sai Kiran Veeramachaneni, Yolanda Blanco-Fernández
2012 Journal of Signal and Information Processing  
Many Text Extraction methodologies have been proposed, but none of them are suitable to be part of a real system implemented on a device with low computational resources, either because their accuracy is insufficient, or because their performance is too slow. In this sense, we propose a Text Extraction algorithm for the context of language translation of scene text images with mobile phones, which is fast and accurate at the same time. The algorithm uses very efficient computations to calculate
more » ... ations to calculate the Principal Color Components of a previously quantized image, and decides which ones are the main foreground-background colors, after which it extracts the text in the image. We have compared our algorithm with other algorithms using commercial OCR, achieving accuracy rates more than 12% higher, and performing two times faster. Also, our methodology is more robust against common degradations, such as uneven illumination, or blurring. Thus, we developed a very attractive system to accurately separate foreground and background from scene text images, working over low computational resources devices. In this paper, we propose a simple, fast, and accurate algorithm to separate foreground and background in text detected within natural scene images, so it can be im-
doi:10.4236/jsip.2012.32031 fatcat:gzsb5lphsfay3b4jz5p6swbiae