ACM Home Page
Please provide us with feedback. Feedback
Learning query-class dependent weights in automatic video retrieval
Full text PdfPdf (415 KB)
Source International Multimedia Conference archive
Proceedings of the 12th annual ACM international conference on Multimedia table of contents
New York, NY, USA
SESSION: Technical best paper contest session table of contents
Pages: 548 - 555  
Year of Publication: 2004
ISBN:1-58113-893-8
Authors
Rong Yan  Carnegie Mellon University, Pittsburgh, PA
Jun Yang  Carnegie Mellon University, Pittsburgh, PA
Alexander G. Hauptmann  Carnegie Mellon University, Pittsburgh, PA
Sponsors
SIGMULTIMEDIA: ACM Special Interest Group on Multimedia
ACM: Association for Computing Machinery
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 3,   Downloads (12 Months): 59,   Citation Count: 22
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1027527.1027661
What is a DOI?

ABSTRACT

Combining retrieval results from multiple modalities plays a crucial role for video retrieval systems, especially for automatic video retrieval systems without any user feedback and query expansion. However, most of current systems only utilize query independent combination or rely on explicit user weighting. In this work, we propose using query-class dependent weights within a hierarchial mixture-of-expert framework to combine multiple retrieval results. We first classify each user query into one of the four predefined categories and then aggregate the retrieval results with query-class associated weights, which can be learned from the development data efficiently and generalized to the unseen queries easily. Our experimental results demonstrate that the performance with query-class dependent weights can considerably surpass that with the query independent weights.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
A. Amir, M. Berg, S.-F. Chang, and et al. IBM research TRECVID-2003 video retrieval system. In NIST TRECVID-2003, Nov 2003.
 
2
M. Beigi, A. B. Benitez, and S.-F. Chang. Metaseek: A content-based meta-search engine for images. In Proc. of SPIE, 1997.
 
3
 
4
 
5
P. Clarkson and R. Rosenfeld. Statistical language modeling using the CMU-Cambridge toolkit. In Proc. Eurospeech'97, 1997.
 
6
 
7
A. Dempster, N. Laird, and D. Rubin. Maximum likelihood from incomplete data via the EM algorithm. Journal of the Royal Statistical Society, Series B, 39(1):1--38, 1977.
8
 
9
D. Grinberg, J. Lafferty, and D. Sleator. A robust parsing algorithm for link grammars. In Proc. of the 4th Int'l Workshop on Parsing Technologies, 1995.
 
10
A. G. Hauptmann and et al. Informedia at TRECVID 2003: Analyzing and searching broadcast news video. In Proc. of TRECVID 2003, Gaithersburg, MD, 2003.
 
11
12
 
13
14
 
15
L. Ramshaw and M. Marcus. Text chunking using transformation-based learning. In Proc. of the ACL Third Workshop on Very Large Corpora, 1995.
 
16
S. E. Robertson, S. Walker, M. Hancock-Beaulieu, A. Gull, and M. Lau. Okapi at TREC4. In Text REtrieval Conference, pages 21--30, 1992.
 
17
 
18
TRECVID: TREC Video Retrieval Evaluation. http://www-nlpir.nist.gov/projects/trecvid.
 
19
T. Westerveld, T. Ianeva, L. Boldareva, A. P. de Vries, and D. Hiemstra. Combining infomation sources for video retrieval: The lowlands team at TRECVID 2003. In NIST TRECVID-2003, Nov 2003.
 
20
R. Yan and A. Hauptmann. Co-retrieval: A boosted reranking approach for multimedia retrieval. In Proc. of Intl. Conf. on Image and Video Retrieval, 2004.
21
 
22
23

CITED BY  22

Collaborative Colleagues:
Rong Yan: colleagues
Jun Yang: colleagues
Alexander G. Hauptmann: colleagues