|
ABSTRACT
We address the problem of searching multimedia by semantic similarity in a keyword space. In contrast to previous research we represent multimedia content by a vector of keywords instead of a vector of low-level features. This vector of keywords can be obtained through user manual annotations or computed by an automatic annotation algorithm. In this setting, we studied the influence of two aspects of the search by semantic similarity process: (1) accuracy of user keywords versus automatic keywords and (2) functions to compute semantic similarity between keyword vectors of two multimedia documents. We consider these two aspects to be crucial in the design of a keyword space that can exploit social-media information and can enrich applications such as Flickr and YouTube. Experiments were performed on an image and a video dataset with a large number of keywords, with different similarity functions and with two annotation methods. Surprisingly, we found that multimedia semantic similarity with automatic keywords performs as good as or better than 95% accurate user keywords.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
| |
2
|
S.-F. Chang, W. Chen, and H. Sundaram, "Semantic visual templates: linking visual features to semantics," in Int'l Conference on Image Processing, Chicago, IL, USA, 1998.
|
| |
3
|
|
| |
4
|
S. L. Feng, V. Lavrenko, and R. Manmatha, "Multiple Bernoulli relevance models for image and video annotation," in IEEE Conf. on Computer Vision and Pattern Recognition, Cambridge, UK, 2004, pp. 1002--1009.
|
| |
5
|
Myron Flickner , Harpreet Sawhney , Wayne Niblack , Jonathan Ashley , Qian Huang , Byron Dom , Monika Gorkani , Jim Hafner , Denis Lee , Dragutin Petkovic , David Steele , Peter Yanker, Query by Image and Video Content: The QBIC System, Computer, v.28 n.9, p.23-32, September 1995
[doi> 10.1109/2.410146]
|
| |
6
|
A. Haubold, A. Natsev, and M. Naphade, "Semantic multimedia retrieval using lexical query expansion and model-based re-ranking," in IEEE Int'l Conference on Multimedia and Expo Toronto, Canada, 2006.
|
 |
7
|
|
| |
8
|
X. He, O. King, W.-Y. Ma, M. Li, and H.-J. Zhang, "Learning a semantic space from user's relevance feedback for image retrieval," IEEE Transactions on Circuits and Systems for Video Technology, vol. 13, pp. 39--48, Jan 2003.
|
| |
9
|
P. Howarth and S. Rüger, "Fractional distance measures for content-based image retrieval," in European Conference on Information Retrieval, Santiago de Compostela, Spain, 2005.
|
| |
10
|
P. Howarth and S. Rüger, "Trading accuracy for speed," in Int'l Conf. on Image and Video Retrieval Singapore, 2005.
|
 |
11
|
Michael S. Lew , Nicu Sebe , Chabane Djeraba , Ramesh Jain, Content-based multimedia information retrieval: State of the art and challenges, ACM Transactions on Multimedia Computing, Communications, and Applications (TOMCCAP), v.2 n.1, p.1-19, February 2006
[doi> 10.1145/1126004.1126005]
|
| |
12
|
J. Lin, "Divergence measures based on the Shannon entropy," IEEE Trans. Inform. Theory, vol. 37, pp. 145--151, January 1991.
|
 |
13
|
Ye Lu , Chunhui Hu , Xingquan Zhu , HongJiang Zhang , Qiang Yang, A unified framework for semantics and feature based relevance feedback in image retrieval systems, Proceedings of the eighth ACM international conference on Multimedia, p.31-37, October 2000, Marina del Rey, California, United States
[doi> 10.1145/354384.354403]
|
| |
14
|
J. Magalhães, "Statistical models for semantic-multimedia information retrieval," PhD Thesis, University of London, Imperial College of Science, Technology and Medicine, 2008.
|
 |
15
|
|
 |
16
|
|
| |
17
|
A. McCallum and K. Nigam, "A comparison of event models for naive Bayes text classification," in AAAI Workshop on Learning for Text Categorization, 1998.
|
| |
18
|
|
| |
19
|
Milind Naphade , John R. Smith , Jelena Tesic , Shih-Fu Chang , Winston Hsu , Lyndon Kennedy , Alexander Hauptmann , Jon Curtis, Large-Scale Concept Ontology for Multimedia, IEEE MultiMedia, v.13 n.3, p.86-91, July 2006
[doi> 10.1109/MMUL.2006.63]
|
 |
20
|
Apostol (Paul) Natsev , Alexander Haubold , Jelena Tešić , Lexing Xie , Rong Yan, Semantic concept-based query expansion and re-ranking for multimedia retrieval, Proceedings of the 15th international conference on Multimedia, September 25-29, 2007, Augsburg, Germany
[doi> 10.1145/1291233.1291448]
|
| |
21
|
N. Rasiwasia, P. Moreno, and N. Vasconcelos, "Bridiging the gap: Query by semantic example," IEEE Transactions on Multimedia, vol. 9, pp. 923--938, August 2007.
|
| |
22
|
N. Rasiwasia, N. Vasconcelos, and P. Moreno, "Query by semantic example," in CIVR, Phoenix, AZ, USA, 2006.
|
| |
23
|
|
 |
24
|
|
| |
25
|
Cees G. M. Snoek , Marcel Worring , Jan-Mark Geusebroek , Dennis C. Koelma , Frank J. Seinstra , Arnold W. M. Smeulders, The Semantic Pathfinder: Using an Authoring Metaphor for Generic Multimedia Indexing, IEEE Transactions on Pattern Analysis and Machine Intelligence, v.28 n.10, p.1678-1689, October 2006
[doi> 10.1109/TPAMI.2006.212]
|
 |
26
|
|
| |
27
|
|
 |
28
|
|
| |
29
|
A. Yavlinsky, E. Schofield, and S. Rüger, "Automated image annotation using global features and robust nonparametric density estimation," in Int'l Conf. on Image and Video Retrieval, Singapore, 2005.
|
| |
30
|
C. Zhang and T. Chen, "An active learning framework for content-based information retrieval," IEEE Transactions on Multimedia, vol. 4, pp. 260--268, Jun 2002.
|
| |
31
|
|
|