DeepErase: Weakly Supervised Ink Artifact Removal in Document Text Images
[article]
W. Ronny Huang, Yike Qi, Qianqian Li, Jonathan Degange
2020
arXiv
pre-print
Paper-intensive industries like insurance, law, and government have long leveraged optical character recognition (OCR) to automatically transcribe hordes of scanned documents into text strings for downstream ...
We devise a method to programmatically assemble real text images and real artifacts into realistic-looking "dirty" text images, and use them to train an artifact segmentation network in a weakly supervised ...
The results discussed in this letter and references to terms architecture, robustness, efficient, accurate, and bias are with respect to the letters mathematical treatment of a generalized methodology ...
arXiv:1910.07070v3
fatcat:acann2xosndwddx2kqlseuehwa