Cross-specificity: modelling data semantics for cross-modal matching and retrieval
, A. Jha, C.V. Jawahar
Published in Springer London
Volume: 7
Issue: 2
Pages: 139 - 146
While dealing with multi-modal data such as pairs of images and text, though individual samples may demonstrate inherent heterogeneity in their content, they are usually coupled with each other based on some higher-level concepts such as their categories. This shared information can be useful in measuring semantics of samples across modalities in a relative manner. In this paper, we investigate the problem of analysing the degree of specificity in the semantic content of a sample in one modality with respect to semantically similar samples in another modality. Samples that have high similarity with semantically similar samples from another modality are considered to be specific, while others are considered to be relatively ambiguous. To model this property, we propose a novel notion of “cross-specificity”. We present two mechanisms to measure cross-specificity: one based on human judgement and the other based on an automated approach. We analyse different aspects of cross-specificity and demonstrate its utility in the cross-modal retrieval task. Experiments show that, though conceptually simple, it can benefit several existing cross-modal retrieval techniques and provide a significant boost in their performance. © 2017, Springer-Verlag London Ltd., part of Springer Nature.
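The abstract's informal definition (a sample is specific when it is highly similar to semantically similar samples in the other modality) can be illustrated with a toy sketch. This is not the paper's automated measure, which the abstract does not describe. It assumes, purely for illustration, that images and texts are already embedded in a shared vector space, that "semantically similar" means "same category label", and that similarity is cosine similarity. The names `cosine` and `cross_specificity` and the toy vectors are hypothetical.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length, non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def cross_specificity(sample, sample_label, other_modality):
    """Illustrative score (an assumption, not the published method).

    Returns the mean cosine similarity between `sample` and every sample
    of the other modality that carries the same category label.
    `other_modality` is a list of (vector, label) pairs.
    """
    same_category = [vec for vec, label in other_modality if label == sample_label]
    if not same_category:
        raise ValueError("no sample with this label in the other modality")
    return sum(cosine(sample, vec) for vec in same_category) / len(same_category)


# Toy text embeddings in an assumed shared 2-D space.
texts = [([1.0, 0.0], "dog"), ([0.0, 1.0], "cat"), ([1.0, 0.0], "dog")]

# One image aligned with the "dog" texts, one lying between the two categories.
specific_image = [1.0, 0.0]
ambiguous_image = [1.0, 1.0]

score_specific = cross_specificity(specific_image, "dog", texts)
score_ambiguous = cross_specificity(ambiguous_image, "dog", texts)
```

Under these assumptions the aligned image scores higher than the in-between one, which matches the abstract's distinction between specific and relatively ambiguous samples.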
About the journal
Journal: International Journal of Multimedia Information Retrieval
Publisher: Springer London