Cross-Modal Information Retrieval – A Case Study on Chinese Wikipedia [chapter]

Yonghui Cong, Zengchang Qin, Jing Yu, Tao Wan
2012 Lecture Notes in Computer Science  
Probabilistic models have recently been used in cross-modal multimedia information retrieval by building conjunctive models that bridge the text and image components. Previous studies have shown that a cross-modal information retrieval system using the topic correlation model (TCM) outperforms state-of-the-art models on English corpora. In this paper, we focus on the Chinese language, which differs from Western languages written with alphabets. Words and characters are each chosen in turn as the basic structural units of Chinese. We also set up a test database, named Ch-Wikipedia, in which documents with paired images and text are extracted from the Chinese-language Wikipedia website. We investigate the problems of retrieving texts (ranked by semantic closeness) given an image query, and vice versa. The capabilities of the TCM are verified by experiments on the Ch-Wikipedia dataset.
doi:10.1007/978-3-642-35527-1_2 fatcat:4mjzgdhvfrax7nmb4vawi6ppva