Indexing Text and Visual Features for WWW Images [chapter]

Heng Tao Shen, Xiaofang Zhou, Bin Cui
2005 Lecture Notes in Computer Science  
In this paper, we present a novel indexing technique called Multiscale Similarity Indexing (MSI) to index image's multi-features into a single one-dimensional structure. Both for text and visual feature spaces, the similarity between a point and a local partition's center in individual space is used as the indexing key, where similarity values in different features are distinguished by different scale. Then a single indexing tree can be built on these keys. Based on the property that relevant
more » ... ages haves similar similarity values from the center of the same local partition in any feature space, certain number of irrelevant images can be fast pruned based on the triangle inequity on indexing keys. To remove the "dimensionality curse" existing in high dimensional structure, we propose a new technique called Local Bit Stream (LBS). LBS transforms image's text and visual feature representations into simple, uniform and effective bit stream (BS) representations based on local partition's center. Such BS representations are small in size and fast for comparison since only bit operation are involved. By comparing common bits existing in two BSs, most of irrelevant images can be immediately filtered. Our extensive experiment showed that single one-dimensional index on multi-features improves multiindices on multi-features greatly. Our LBS method outperforms sequential scan on high dimensional space by an order of magnitude.
doi:10.1007/978-3-540-31849-1_85 fatcat:in2bkrfv4be6jlognxtmzzkdnu