|
ABSTRACT
In this paper we unify two supposedly distinct tasks in multimedia retrieval. One task involves answering queries with a few examples. The other involves learning models for semantic concepts, also with a few examples. In our view these two tasks are identical with the only differentiation being the number of examples that are available for training. Once we adopt this unified view, we then apply identical techniques for solving both problems and evaluate the performance using the NIST TRECVID benchmark evaluation data [15]. We propose a combination hypothesis of two complementary classes of techniques, a nearest neighbor model using only positive examples and a discriminative support vector machine model using both positive and negative examples. In case of queries, where negative examples are rarely provided to seed the search, we create pseudo-negative samples. We then combine the ranked lists generated by evaluating the test database using both methods, to create a final ranked list of retrieved multimedia items. We evaluate this approach for rare concept and query topic modeling using the NIST TRECVID video corpus.In both tasks we find that applying the combination hypothesis across both modeling techniques and a variety of features results in enhanced performance over any of the baseline models, as well as in improved robustness with respect to training examples and visual features. In particular, we observe an improvement of 6% for rare concept detection and 17% for the search task.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
TREC Video Retrieval. National Institute of Standards and Technology, http://www-nlpir.nist.gov/projects/trecvid/.
|
| |
2
|
|
| |
3
|
T. S. Chua, S.-Y. Neo, K.-Y. Li, G. Wang, R. Shi, M. Zhao, and H. Xu. TREC VID 2004 search and feature extraction task by NUSPRIS. In TRECVID 2004 Workshop, Gaithersburg, MD, Nov. 2004.
|
| |
4
|
|
 |
5
|
|
| |
6
|
|
 |
7
|
|
| |
8
|
C. Lin, B. Tseng, and J. Smith. Video collaborative annotation forum: Establishing ground-truth labels on large multimedia datasets. In Proc. Text Retrieval Conference (TREC), Gaithersburg, MD, Nov 2003.
|
 |
9
|
|
| |
10
|
M. R. Naphade, S. Basu, J. Smith, C. Y. Lin, and B. Tseng. Modeling semantic concepts to support query by keywords in video. In Proc. IEEE Intl. Conference on Image Processing (ICIP'02), Rochester, NY, Sep. 2002.
|
| |
11
|
S. Nepal and M. V. Ramakrishna. Single feature query by multi examples in image databases. In Proc. SPIE Photonics East Intl. Symposium on Voice, Data and Communications, volume 4210, pages 424--435, 2000.
|
| |
12
|
|
| |
13
|
Y. Rui, T. S. Huang, M. Ortega, and S. Mehrotra. Relevance feedback: A power tool for interactive content-based image retrieval. IEEE Trans. on Circuits and Systems for Video Technology, 8:644--656, Sep. 1998.
|
| |
14
|
R. Singh and R. Kothari. Relevance feedback algorithm based on learning from labeled and unlabeled data. In IEEE ICME 2003, Baltimore, MD, July 2003.
|
 |
15
|
|
| |
16
|
D. M. J. Tax. One-Class Classification: Concept-Learning in the Absence of Counter-Examples. PhD thesis, Delft University of Technology, June 2001.
|
 |
17
|
|
| |
18
|
|
 |
19
|
|
| |
20
|
T. Westerveld and A. P. de Vries. Multimedia retrieval using multiple examples. In CIVR, pages 344--352, 2004.
|
 |
21
|
|
 |
22
|
|
CITED BY 25
|
|
Dong Wang , Xiaobing Liu , Linjie Luo , Jianmin Li , Bo Zhang, Video diver: generic video indexing with diverse features, Proceedings of the international workshop on Workshop on multimedia information retrieval, September 24-29, 2007, Augsburg, Bavaria, Germany
|
|
|
|
|
|
|
|
|
|
|
|
Xun Yuan , Xian-Sheng Hua , Meng Wang , Xiu-Qing Wu, Manifold-ranking based video concept detection on large database and feature pool, Proceedings of the 14th annual ACM international conference on Multimedia, October 23-27, 2006, Santa Barbara, CA, USA
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Lyndon Kennedy , Mor Naaman , Shane Ahern , Rahul Nair , Tye Rattenbury, How flickr helps us make sense of the world: context and content in community-contributed media collections, Proceedings of the 15th international conference on Multimedia, September 25-29, 2007, Augsburg, Germany
|
|
|
|
|
|
|
|
|
Ritendra Datta , Dhiraj Joshi , Jia Li , James Z. Wang, Image retrieval: Ideas, influences, and trends of the new age, ACM Computing Surveys (CSUR), v.40 n.2, p.1-60, April 2008
|
|
|
Apostol (Paul) Natsev , Alexander Haubold , Jelena Tešić , Lexing Xie , Rong Yan, Semantic concept-based query expansion and re-ranking for multimedia retrieval, Proceedings of the 15th international conference on Multimedia, September 25-29, 2007, Augsburg, Germany
|
|
|
Xirong Li , Dong Wang , Jianmin Li , Bo Zhang, Video search in concept subspace: a text-like paradigm, Proceedings of the 6th ACM international conference on Image and video retrieval, p.603-610, July 09-11, 2007, Amsterdam, The Netherlands
|
|
|
|
|
|
|
|
|
|
|
|
Jingjing Liu , Wei Lai , Xian-Sheng Hua , Yalou Huang , Shipeng Li, Video search re-ranking via multi-graph propagation, Proceedings of the 15th international conference on Multimedia, September 25-29, 2007, Augsburg, Germany
|
|
|
|
|
|
|
|
|
Yanan Liu , Fei Wu , Yueting Zhuang , Jun Xiao, Active post-refined multimodality video semantic concept detection with tensor representation, Proceeding of the 16th ACM international conference on Multimedia, October 26-31, 2008, Vancouver, British Columbia, Canada
|
|
|
Dong Wang , Zhikun Wang , Jianmin Li , Bo Zhang , Xirong Li, Query representation by structured concept threads with application to interactive video retrieval, Journal of Visual Communication and Image Representation, v.20 n.2, p.104-116, February, 2009
|
|
|
Klimis Ntalianis , Anastasios Doulamis , Nicolas Tsapatsoulis , Nikolaos Doulamis, Automatic annotation of multimedia content by user clickthroughs: enhancing the performance of multimedia search engines, Proceedings of the 10th WSEAS international conference on Mathematical methods, computational techniques and intelligent systems, p.439-446, October 26-28, 2008, Corfu, Greece
|
|
|
|
|