Mining Historical Documents for Near-Duplicate Figures
2011 IEEE 11th International Conference on Data Mining
The increasing interest in archiving all of humankind's cultural artifacts has resulted in the digitization of millions of books, and soon a significant fraction of the world's books will be online. Most of the data in historical manuscripts is text, but there is also a significant fraction devoted to images. This fact has driven much of the recent increase in interest in query-by-content systems for images. While querying/indexing systems can undoubtedly be useful, we believe that the ...
doi:10.1109/icdm.2011.102 dblp:conf/icdm/RakthanmanonZK11 fatcat:wkwzoiw2rfexzo742jjzulusda