|
ABSTRACT
In this paper we present the results of our work on the analysis of multi-modal data for video Information Retrieval, where we exploit the properties of this data for query-time, automatic generation of weights for multi-modal data fusion. Through empirical testing we have observed that for a given topic, a high performing feature,that is one which achieves high relevance, will have a different distribution of document scores when compared against those that do not perform as well. These observations form the basis for our initial fusion model, which generates weights based on these properties, without the need for prior training.Our model can be used to not only combine feature data,but to also combine the results of multiple example query images and apply weights to these.Our analysis and experiments were conducted on the TRECVid 2004 and 2005 collections,making use of multiple MPEG-7 low-level features and automatic speech recognition (ASR)transcripts.Results achieved from our model achieve performance on a par with that of 'oracle' determined weights,and demonstrate the applicability of our model whilst advancing the case for further investigation of score distributions.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
The AceMedia Project,available at http://www.acemedia.org.
|
| |
2
|
Learning of personal visual impressions for image database systems.In In Proceedings of the Second International Conference on Document Analysis and Recognition pages 547--552, 1993.
|
| |
3
|
|
| |
4
|
S.-F.Chang, T. Sikora, and A. Puri. Overview of the MPEG-7 standard.IEEE Transactions on Circuits and Systems for Video Technology 11(6): 688--695, June 2001.
|
| |
5
|
E. Cooke, P. Ferguson, G. Gaughan, C. Gurrin, G. J. F. Jones, H. L. Borgne, H. Lee, S. Marlow, K. McDonald, M. McHugh, N. Murphy, N. E. O 'Connor, N.O 'Hare, S. Rothwell, A. F. Smeaton, and P. Wilkins. TRECVID 2004 experiments in Dublin City University. In Proceedings of TRECVID 2004 November 2004.
|
| |
6
|
A. P. Dempster. A generalization of the Bayesian inference. Journal of Royal Statistical Society 30:205--447, 1968.
|
| |
7
|
P. Ferguson, C. Gurrin, P. Wilkins, and A. F. Smeaton.Físréál: A Low Cost Terabyte Search Engine. In Proceedings of European Conference in IR March 2005.
|
| |
8
|
E. A. Fox and J. A. Shaw. Combination of multiple searches. In Proceedings of the 2nd Text REtrieval Conference 1994.
|
 |
9
|
|
 |
10
|
|
| |
11
|
B. S. Manjunath, J. R. Ohm, V. Vasudevan, and A. Yamada. Color and texture description. In IEEE Trans. On Circuits and Systems for Video Technology June 2001.
|
 |
12
|
|
| |
13
|
R. W. Picard, T. P. Minka, and M. Szummer. Modeling user subjectivity in image libraries. In IEEE International Conference On Image Processing 1996.
|
| |
14
|
|
| |
15
|
|
| |
16
|
Y. Rui and T. Huang. Optimizing learning in image retrieval.In Proceeding of IEEE int. Conf. On Computer Vision and Pattern Recognition pages 236--245, 2000.
|
| |
17
|
Y. Rui, T. S. Huang, M. Ortega, and S. Mehrotra. Relevance feedback:A power tool for interactive content-based image retrieval relevance feedback: A power tool for interactive content-based image retrieval. IEEE Transactions on Circuits and Systems for Video Technology 8(5): 644--655 1998.
|
| |
18
|
|
| |
19
|
G. Shafer. A Mathematical Theory of Evidence. Princeton University Press 1976.
|
| |
20
|
A. F. Smeaton. Large Scale Evaluations of Multimedia Information Retrieval: The TRECVid Experience volume 3568 /2006,pages 11--17.Springer, 2005.
|
 |
21
|
|
| |
22
|
P. Wilkins, P. Ferguson, C. Gurrin, and A. F. Smeaton. Automatic determination of feature weights for mult-feature CBIR. In ECIR 2006 - European Conference on Information Retrieval. Lalmas M et al. (Eds.): (LNCS Series 3936) pages 527--530. Springer,
|
CITED BY 5
|
|
|
|
|
|
|
|
|
|
|
Thierry Urruty , Frank Hopfgartner , David Hannah , Desmond Elliott , Joemon M. Jose, Supporting aspect-based video browsing: analysis of a user study, Proceeding of the ACM International Conference on Image and Video Retrieval, July 08-10, 2009, Santorini, Fira, Greece
|
|
|
|
|