|
ABSTRACT
Information linkage is becoming more and more important in this digital age. In this paper, we propose a concept tracking method, which links news stories on the same topic across multiple sources. The semantic linkage between the news stories is reflected in combination of both of their visual content and their spoken language content. Visually, each news story is represented by a set of key-frames with or without detected faces. The facial key-frames are linked based on the analysis of the extended facial regions, and the non-facial key-frames are correlated using the global Affine matching. The language similarity is expressed in terms of the normalized text similarity between the stories' keywords. The output results of the story linking are further used in a story ranking task, which indicate the interesting level of the stories. The proposed semantic linking framework and the story ranking method have been tested on a set of 60 hours open-benchmark TRECVID video data, and very satisfactory results for both tasks have been obtained.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
J. Allan, J.G. Carbonell, G. Doddington, J. Yamron and Y. Yang, "Topic Detection and Tracking Pilot Study Final Report", Broadcast News Transcription and Understanding Workshop, 1998.
|
| |
2
|
W.G. Cheng and D. Xu, "Content-Based Video Retrieval Using Shot Cluster Tree", International Conference on Machine Learning and Cybermetics, 2003.
|
| |
3
|
|
| |
4
|
I. Ide, H. Mo, N. Katayama and S. Satoh, "Topic Threading for Structuring a Large-Scale News Video Archive", International Conference on Image and Video Retrieval, 2004.
|
| |
5
|
|
| |
6
|
C-W. Ngo, T-C. Pong and H-J. Zhang, "On Clustering and Retrieval of Video Shots Through Temporal Slices Analysis", IEEE Transactions on Multimedia, Vol.4, No.4, 2002.
|
| |
7
|
J-M. Odobez, D.G. Perez and M. Guillemot, "Video Shot Clustering Using Spectral Methods", International Working on Content-Based Multimedia Indexing, 2003.
|
| |
8
|
J. Sivic, F. Schaffalitzky and A. Zisserman, "Object Level Grouping for Video Shots", European Conference on Computer Vision, 2003.
|
| |
9
|
|
| |
10
|
W. Tavanapong and J.Y. Zhou, "Shot Clustering Techniques for Story Browsing", IEEE Transactions on Multimedia, 2004.
|
| |
11
|
Y. Zhai, A. Yilmaz and M. Shah, "Story Segmentation in News Videos Using Visual and Text Cues", International Conference on Image and Video Retrieval, 2005.
|
| |
12
|
D-Q. Zhang, C-Y. Lin, S-F Chang and J.R. Smith, "Semantic Video Clustering Across Sources Using Bipartitie Spectral Clustering", International Conference on Multimedia and Expo, 2004.
|
CITED BY 8
|
|
Lu Liu , Lifeng Sun , Yong Rui , Yao Shi , Shiqiang Yang, Web video topic discovery and tracking via bipartite graph reinforcement model, Proceeding of the 17th international conference on World Wide Web, April 21-25, 2008, Beijing, China
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Xiaomeng Wu , Masao Takimoto , Shin'ichi Satoh , Jun Adachi, Scene duplicate detection based on the pattern of discontinuities in feature point trajectories, Proceeding of the 16th ACM international conference on Multimedia, October 26-31, 2008, Vancouver, British Columbia, Canada
|
|