ABSTRACT
Combining retrieval results from multiple modalities plays a crucial role in video retrieval systems, especially automatic systems that operate without any user feedback or query expansion. However, most current systems either use query-independent combination or rely on explicit user weighting. In this work, we propose using query-class dependent weights within a hierarchical mixture-of-experts framework to combine multiple retrieval results. We first classify each user query into one of four predefined categories and then aggregate the retrieval results with the weights associated with that query class; these weights can be learned efficiently from development data and generalize readily to unseen queries. Our experimental results demonstrate that query-class dependent weights considerably outperform query-independent weights.
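As a rough illustration of the combination step described above, the following minimal sketch aggregates per-modality retrieval scores with a weight vector selected by the query's class. It is not the authors' implementation: the class names (`class_0`, `class_1`), the modality names (`text`, `image`), the weight values, and the functions `combine_scores` and `rank_shots` are all hypothetical, the query classifier is assumed to exist elsewhere, and only two of the four classes are shown. The weights are fixed by hand here rather than learned from development data.

```python
from typing import Dict, List

# Hypothetical per-class modality weights. The abstract states only that
# there are four predefined query classes; two illustrative ones are shown,
# and the numbers are placeholders, not learned values.
CLASS_WEIGHTS: Dict[str, Dict[str, float]] = {
    "class_0": {"text": 0.75, "image": 0.25},
    "class_1": {"text": 0.25, "image": 0.75},
}


def combine_scores(
    query_class: str,
    modality_scores: Dict[str, Dict[str, float]],
) -> Dict[str, float]:
    """Linearly combine per-modality scores using the weights of one query class.

    modality_scores maps modality name -> {shot_id: score}.
    Modalities without a weight for this class contribute nothing.
    """
    weights = CLASS_WEIGHTS[query_class]
    combined: Dict[str, float] = {}
    for modality, scores in modality_scores.items():
        w = weights.get(modality, 0.0)
        for shot_id, score in scores.items():
            combined[shot_id] = combined.get(shot_id, 0.0) + w * score
    return combined


def rank_shots(combined: Dict[str, float]) -> List[str]:
    """Return shot ids sorted by descending combined score (ties broken by id)."""
    return sorted(combined, key=lambda shot_id: (-combined[shot_id], shot_id))


if __name__ == "__main__":
    scores = {
        "text": {"shot_a": 1.0, "shot_b": 0.0},
        "image": {"shot_a": 0.0, "shot_b": 1.0},
    }
    # The same retrieval outputs are ranked differently depending on query class.
    print(rank_shots(combine_scores("class_0", scores)))
    print(rank_shots(combine_scores("class_1", scores)))
```

With query-independent combination, a single weight vector would be used for every query; the only change here is that the weight vector is looked up by the predicted query class before the scores are summed.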