Separating Chinese Character from Noisy Background Using GAN

Bin Huang, Jiaqi Lin, Jinming Liu, Jie Chen, Jiemin Zhang, Yendo Hu, Erkang Chen, Jingwen Yan, Philippe Fournier-Viger
2021 Wireless Communications and Mobile Computing  
Separating printed or handwritten characters from a noisy background is valuable for many applications including test paper autoscoring. The complex structure of Chinese characters makes it difficult to obtain the goal because of easy loss of fine details and overall structure in reconstructed characters. This paper proposes a method for separating Chinese characters based on generative adversarial network (GAN). We used ESRGAN as the basic network structure and applied dilated convolution and
more » ... novel loss function that improve the quality of reconstructed characters. Four popular Chinese fonts (Hei, Song, Kai, and Imitation Song) on real data collection were tested, and the proposed design was compared with other semantic segmentation approaches. The experimental results showed that the proposed method effectively separates Chinese characters from noisy background. In particular, our methods achieve better results in terms of Intersection over Union (IoU) and optical character recognition (OCR) accuracy.
doi:10.1155/2021/9922017 fatcat:gx4ofxf33rg2lgcun2bdlvmg3m