ACM Home Page
Please provide us with feedback. Feedback
Learning structured concept-segments for interactive video retrieval
Full text PdfPdf (958 KB)
Source
Conference On Image And Video Retrieval archive
Proceedings of the 2008 international conference on Content-based image and video retrieval table of contents
Niagara Falls, Canada
SESSION: Objects, events and concepts table of contents
Pages 57-66  
Year of Publication: 2008
ISBN:978-1-60558-070-8
Authors
Zhikun Wang  Tsinghua University, Beijing, China
Dong Wang  Tsinghua University, Beijing, China
Jianmin Li  Tsinghua University, Beijing, China
Bo Zhang  Tsinghua University, Beijing, China
Sponsors
SIGIR: ACM Special Interest Group on Information Retrieval
SIGMULTIMEDIA: ACM Special Interest Group on Multimedia
ACM: Association for Computing Machinery
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 12,   Downloads (12 Months): 137,   Citation Count: 2
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1386352.1386364
What is a DOI?

ABSTRACT

Now with a large lexicon of over 300 semantic concepts available for indexing purpose, video retrieval can be made easier by leveraging on the available semantic indices. However, any successful concept-based video retrieval approach must take the following into account: though improving continuously, these concept indexing results are still far from perfect; more concepts are awaiting for detection instead of being detected due to the limited amount of annotated data. If possible, a structured query formulation other than a simple AND logic of some chosen concepts is more desirable to model the complex query need with the fixed concept lexicon. In this paper, we propose a concept-based interactive video retrieval approach to tackle these problems. To better represent the query information need, the proposed approach learns through the feedback information a structured formulation which consists of multiple semantic concept combination terms. Instead of taking the top-ranked items from the selected concepts, it leverages on a simple mining algorithm to drill down to concept-segments where the positive examples are most densely populated than the negative examples. We evaluate the proposed method on the large scale TRECVid 05&06 data sets, and achieve promising results. Retrieval in concept-segment level has a 14% improvement upon the concept-level. Structured query formulation improves around 13% compared with the simple logical AND formulation. The learning and retrieval process only takes 300ms, satisfying the real-time interactive search need.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
H. Bay, T. Tuytelaars, and L. Gool. 2006. Surf: Speeded up robust features. In Proc. of ECCV, 2006.
 
2
M. Campbell and et al.2006. IBM research trecvid-2006 video retrieval system. In Proc. Of TRECVid, 2006.
 
3
T. Hastie, R. Tibshirani, and J. Friedman.2001. The Elements of Statistical Learning. Springer series in statistics. Springer, New York, 2001.
 
4
L. Hollink, G.P. Nguyen, D.C. Koelma, A. Th. Schreiber, and M. Worring.2004. User Strategies in Video Retrieval: a Case Study. In Proc. of CIVR, 2004.
 
5
J. Kender.2007. A Large Scale Concept Ontology for News Stories: Empirical Methods, Analysis, and Improvements. In Proc of ICME, 2007
6
 
7
 
8
D. Metzler and R. Manmatha.2004. An inference network approach to image retrieval. In Proc. of CIVR, 2004.
 
9
10
11
 
12
S.-Y. Neo, J. Zhao, M.-Y. Kan, and T.-S. Chua.2006. Video retrieval using high level features: Exploiting query matching and condence-based weighting. In Proc. of CIVR, 2006.
 
13
Y. Rui, T. S. Huang, M. Ortega, and S. Mehrotra.1998. Relevance Feedback: A Power Tool for Interactive Content-Based Image Retrieval. IEEE Trans. on Circuits and Systems for Video Technology 8(5) (Sep. 1998):644--655
14
 
15
C. G. Snoek, M. Worring, D. C. Koelma, and A. W. Smeulders.2007. A learned lexicon-driven paradigm for interactive video retrieval. IEEE Trans. Multimedia, 2007.
 
16
D. Wang, J. Li, and B. Zhang.2006. Relay boost fusion for learning rare concepts in multimedia. In Proc. of CIVR, 2006.
17
18
 
19
J. Yuan et al.2007. THU and ICRC at TRECVid 2007, In Proc. of TRECVid, 2007
 
20
W. Zheng, J. Li, Z. Si, F. Lin, and B. Zhang.2006. Using high-level semantic features in video retrieval. In Proc. of CIVR, 2006.
 
21
X. S. Zhou, T. S. Huang.2003. Relevance feedback in image retrieval: A comprehensive review. Multimedia Systems, Vol. 8, No. 6. (1 April 2003), pp. 536--544.


Collaborative Colleagues:
Zhikun Wang: colleagues
Dong Wang: colleagues
Jianmin Li: colleagues
Bo Zhang: colleagues