Fusion vs. Two-Stage for Multimodal Retrieval

Arampatzis, Avi; Zagoris, Konstantinos; Chatzichristofis, Savvas A.

Fusion vs. Two-Stage for Multimodal Retrieval

Files

Primary C18.pdf (81.01 KB)

Date

2011

Authors

Arampatzis, Avi

Zagoris, Konstantinos

Chatzichristofis, Savvas A.

Publisher

Springer

Abstract

We compare two methods for retrieval from multimodal collections. The first is a score-based fusion of results, retrieved visually and textually. The second is a two-stage method that visually re-ranks the top-K results textually retrieved. We discuss their underlying hypotheses and practical limitations, and contact a comparative evaluation on a standardized snapshot of Wikipedia. Both methods are found to be significantly more effective than single-modality baselines, with no clear winner but with different robustness features. Nevertheless, two-stage retrieval provides efficiency benefits over fusion.