| Bayesian video search reranking |
| Full text |
Pdf
(605 KB)
|
Source
|
International Multimedia Conference
archive
Proceeding of the 16th ACM international conference on Multimedia
table of contents
Vancouver, British Columbia, Canada
SESSION: Content track C4: video sp81-wei.pdfearch
table of contents
Pages 131-140
Year of Publication: 2008
ISBN:978-1-60558-303-7
|
|
Authors
|
|
Xinmei Tian
|
Univ. of Sci. & Tech. of China, Hefei, China
|
|
Linjun Yang
|
Microsoft Research Asia, Beijing, China
|
|
Jingdong Wang
|
Microsoft Research Asia, Beijing, China
|
|
Yichen Yang
|
Zhejiang University, Hangzhou, China
|
|
Xiuqing Wu
|
Univ. of Sci. & Tech. of China, Hefei, China
|
|
Xian-Sheng Hua
|
Microsoft Research Asia, Beijing, China
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 21, Downloads (12 Months): 236, Citation Count: 1
|
|
|
ABSTRACT
Content-based video search reranking can be regarded as a process that uses visual content to recover the "true" ranking list from the noisy one generated based on textual information. This paper explicitly formulates this problem in the Bayesian framework, i.e., maximizing the ranking score consistency among visually similar video shots while minimizing the ranking distance, which represents the disagreement between the objective ranking list and the initial text-based. Different from existing point-wise ranking distance measures, which compute the distance in terms of the individual scores, two new methods are proposed in this paper to measure the ranking distance based on the disagreement in terms of pair-wise orders. Specifically, hinge distance penalizes the pairs with reversed order according to the degree of the reverse, while preference strength distance further considers the preference degree. By incorporating the proposed distances into the optimization objective, two reranking methods are developed which are solved using quadratic programming and matrix computation respectively. Evaluation on TRECVID video search benchmark shows that the performance improvement up to 21% on TRECVID 2006 and 61.11% on TRECVID 2007 are achieved relative to text search baseline.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
Trec-10 proceedings appendix on common evaluation measures. http://trec.nist.gov/pubs/trec10/appendices/measures.pdf.
|
 |
2
|
Zhe Cao , Tao Qin , Tie-Yan Liu , Ming-Feng Tsai , Hang Li, Learning to rank: from pairwise approach to listwise approach, Proceedings of the 24th international conference on Machine learning, p.129-136, June 20-24, 2007, Corvalis, Oregon
[doi> 10.1145/1273496.1273513]
|
| |
3
|
J. G. Carbonell, Y. Yang, R. E. Frederking, R. D. Brown, Y. Geng, and D. Lee. Translingual information retrieval: A comparative evaluation. In IJCAI (1), pages 708--715, 1997.
|
| |
4
|
L. B. Cremeant and R. M. Murra. Stability analysis of interconnected nonlinear systems under matrix feedback. In Proceedings of the 42nd IEEE Conference on Decision and Control, Maui, Hawaii USA, 2003.
|
| |
5
|
R. Fergus, P. Perona, and A. Zisserman. A visual category filter for google images. In ECCV (1), pages 242--256, 2004.
|
| |
6
|
R. Herbrich, T. Graepel, and K. Obermayer. Large margin rank boundaries for ordinal regression. In Advances in Large Margin Classifiers, pages 115--132. MIT Press, Cambridge, MA, 2000.
|
 |
7
|
|
 |
8
|
|
 |
9
|
|
 |
10
|
Jingjing Liu , Wei Lai , Xian-Sheng Hua , Yalou Huang , Shipeng Li, Video search re-ranking via multi-graph propagation, Proceedings of the 15th international conference on Multimedia, September 25-29, 2007, Augsburg, Germany
[doi> 10.1145/1291233.1291279]
|
| |
11
|
Y. Liu, T. Mei, J. Tang, X. Wu, and X.-S. Hua. Learning to video search rerank via pseudo preference feedback. In ICME, 2008.
|
| |
12
|
T. Mei, X.-S. Hua, W. Lai, L. Yang, Z.-J. Zha, Y. Liu, Z. Gu, G.-J. Qi, M. Wang, J. Tang, X. Yuan, Z. Lu, and J. Liu. Msra-ustc-sjtu at trecvid 2007: High-level feature extraction and search. In TREC Video Retrieval Evaluation Online Proceddings, 2007.
|
 |
13
|
Apostol (Paul) Natsev , Alexander Haubold , Jelena Tešić , Lexing Xie , Rong Yan, Semantic concept-based query expansion and re-ranking for multimedia retrieval, Proceedings of the 15th international conference on Multimedia, September 25-29, 2007, Augsburg, Germany
[doi> 10.1145/1291233.1291448]
|
| |
14
|
S. E. Robertson, S. Walker, M. Hancock-Beaulieu, M. Gatford, and A. Payne. Simple, proven approaches to text retrieval. In Cambridge University Computer Laboratory Technical Report TR356, 1997.
|
 |
15
|
|
| |
16
|
A. J. Smola. Pr_LOQO, http://www.kernel-machines.org/code/prloqo.tar.gz.
|
| |
17
|
TRECVID. TREC Video Retrieval Evaluation. http://www-nlpir.nist.gov/projects/trecvid/.
|
| |
18
|
R. Yan and A. G. Hauptmann. Co-retrieval: A boosted reranking approach for video retrieval. In CIVR, pages 60--69, 2004.
|
| |
19
|
R. Yan, A. G. Hauptmann, and R. Jin. Multimedia search with pseudo-relevance feedback. In CIVR, pages 238--247, 2003.
|
| |
20
|
L. Yang and R. Jin. Distance metric learning: A comprehensive survey. http://www.cse.msu.edu/~yangliu1/frame_ survey_v2.pdf. 2006.
|
| |
21
|
D. Zhou, O. Bousquet, T. N. Lal, J. Weston, and B. Scholkopf. Learning with local and global consistency. In NIPS, 2003.
|
| |
22
|
X. Zhu, Z. Ghahramani, and J. D. Lafferty. Semi-supervised learning using gaussian fields and harmonic functions. In ICML, pages 912--919, 2003.
|
|