Fusion vs. Two-Stage for Multimodal Retrieval [chapter]

Avi Arampatzis, Konstantinos Zagoris, Savvas A. Chatzichristofis
2011 Lecture Notes in Computer Science  
We compare two methods for retrieval from multimodal collections. The first is a score-based fusion of results, retrieved visually and textually. The second is a two-stage method that visually re-ranks the top-K results textually retrieved. We discuss their underlying hypotheses and practical limitations, and contact a comparative evaluation on a standardized snapshot of Wikipedia. Both methods are found to be significantly more effective than single-modality baselines, with no clear winner but
more » ... with different robustness features. Nevertheless, two-stage retrieval provides efficiency benefits over fusion.
doi:10.1007/978-3-642-20161-5_88 fatcat:uwmh3s5vzndsxmh33vg23z4oya