MESH: A Flexible Manifold-Embedded Semantic Hashing for Cross-Modal Retrieval

Fangming Zhong, Guangze Wang, Zhikui Chen, Feng Xia
2020 IEEE Access  
Hashing based methods for cross-modal retrieval has been widely explored in recent years. However, most of them mainly focus on the preservation of neighborhood relationship and label consistency, while ignore the proximity of neighbors and proximity of classes, which degrades the discrimination of hash codes. And most of them learn hash codes and hashing functions simultaneously, which limits the flexibility of algorithms. To address these issues, in this article, we propose a two-step
more » ... dal retrieval method named Manifold-Embedded Semantic Hashing (MESH). It exploits Local Linear Embedding to model the neighborhood proximity and uses class semantic embeddings to consider the proximity of classes. By so doing, MESH can not only extract the manifold structure in different modalities, but also can embed the class semantic information into hash codes to further improve the discrimination of learned hash codes. Moreover, the two-step scheme makes MESH flexible to various hashing functions. Extensive experimental results on three datasets show that MESH is superior to 10 state-of-the-art cross-modal hashing methods. Moreover, MESH also demonstrates superiority on deep features compared with the deep cross-modal hashing method. INDEX TERMS Cross-modal hashing, manifold embedding, semantic, discrete optimization.
doi:10.1109/access.2020.3015528 fatcat:k42zalqde5afbk5hlw3sj736im