NENET: An Edge Learnable Network for Link Prediction in Scene Text [article]

Mayank Kumar Singh, Sayan Banerjee, Shubhasis Chaudhuri
2020 arXiv   pre-print
Text detection in scenes based on deep neural networks have shown promising results. Instead of using word bounding box regression, recent state-of-the-art methods have started focusing on character bounding box and pixel-level prediction. This necessitates the need to link adjacent characters, which we propose in this paper using a novel Graph Neural Network (GNN) architecture that allows us to learn both node and edge features as opposed to only the node features under the typical GNN. The
more » ... n advantage of using GNN for link prediction lies in its ability to connect characters which are spatially separated and have an arbitrary orientation. We show our concept on the well known SynthText dataset, achieving top results as compared to state-of-the-art methods.
arXiv:2005.12147v1 fatcat:aowflbv6wbaurosdmnaskzpft4