Mathematical Symbol Indexing Using Topologically Ordered Clusters of Shape Contexts

Simone Marinai, Beatrice Miotti, Giovanni Soda
2009 2009 10th International Conference on Document Analysis and Recognition  
This paper addresses the indexing and retrieval of mathematical symbols from digitized documents. The proposed approach exploits Shape Contexts (SC) to describe the shape of mathematical symbols. Starting from the vector space method, that is based on SC clustering, we explore the use of topological ordered clusters to improve the retrieval performance. The clustering is computed by means of Self-Organizing Maps that organize the clusters in two dimensional topologically ordered feature maps.
more » ... e retrieval performance are compared with those obtained using the K-means clustering on a large collection of mathematical symbols gathered from the widely used INFTY database.
doi:10.1109/icdar.2009.120 dblp:conf/icdar/MarinaiMS09 fatcat:p5docdx5bnelvis6d23mstgdsm