Word image matching using dynamic time warping

T.M. Rath, R. Manmatha
2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings.  
Libraries and other institutions are interested in providing access to scanned versions of their large collections of handwritten historical manuscripts on electronic media. Convenient access to a collection requires an index, which is manually created at great labour and expense. Since current handwriting recognizers do not perform well on historical documents, a technique called word spotting has been developed: clusters with occurrences of the same word in a collection are established using image matching. By annotating "interesting" clusters, an index can be built automatically. We present an algorithm for matching handwritten words in noisy historical documents. The segmented word images are preprocessed to create sets of 1-dimensional features, which are then compared using dynamic time warping. We present experimental results on two different data sets from the George Washington collection. Our experiments show that this algorithm performs better and is faster than competing matching techniques.
doi:10.1109/cvpr.2003.1211511 dblp:conf/cvpr/RathM03 fatcat:6vgqtmx2rjcv5k2vxaaqyvubda
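
The abstract describes comparing sequences of per-column features with dynamic time warping. As a rough illustration of that comparison step only, here is a minimal sketch of textbook DTW in Python. It is not the authors' implementation: the function name `dtw_distance`, the squared Euclidean local cost, the absence of any warping-band constraint or length normalization, and the toy feature values are all assumptions made for this example.

```python
from typing import Sequence


def dtw_distance(a: Sequence[Sequence[float]], b: Sequence[Sequence[float]]) -> float:
    """Textbook DTW cost between two sequences of feature vectors.

    Each element of `a` and `b` is the feature vector of one image column.
    The local cost (squared Euclidean distance) is an assumption of this sketch.
    """
    if not a or not b:
        raise ValueError("both sequences must be non-empty")
    n, m = len(a), len(b)
    inf = float("inf")
    # cost[i][j] holds the minimal accumulated cost of aligning a[:i] with b[:j].
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            local = sum((x - y) ** 2 for x, y in zip(a[i - 1], b[j - 1]))
            # Extend the cheapest of the three allowed predecessor alignments.
            cost[i][j] = local + min(cost[i - 1][j], cost[i][j - 1], cost[i - 1][j - 1])
    return cost[n][m]


# Hypothetical one-feature-per-column profiles for three word images.
word_a = [[0.0], [1.0], [2.0], [1.0], [0.0]]
word_b = [[0.0], [1.0], [1.0], [2.0], [1.0], [0.0]]  # word_a with one column repeated
word_c = [[2.0], [2.0], [0.0], [0.0], [2.0]]         # a different profile

print(dtw_distance(word_a, word_b))
print(dtw_distance(word_a, word_c))
```

Because DTW may match one column to several columns of the other sequence, the horizontally stretched profile `word_b` aligns with `word_a` at no cost, while the differently shaped `word_c` does not. That tolerance to local stretching is what makes the technique a natural fit for handwriting, where the same word is rarely written at the same width twice.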