|
ABSTRACT
Event detection is essential for sports video summarization, indexing and retrieval and extensive research efforts have been devoted to this area. However, the previous approaches are heavily relying on video content itself and require the whole video content for event detection. Due to the semantic gap between low-level features and high-level events, it is difficult to come up with a generic framework to achieve a high accuracy of event detection. In addition, the dynamic structures from different sports domains further complicate the analysis and impede the implementation of live event detection systems. In this paper, we present a novel approach for event detection from the live sports game using web-casting text and broadcast video. Web-casting text is a text broadcast source for sports game and can be live captured from the web. Incorporating web-casting text into sports video analysis significantly improves the event detection accuracy. Compared with previous approaches, the proposed approach is able to: (1) detect live event only based on the partial content captured from the web and TV; (2) extract detailed event semantics and detect exact event boundary, which are very difficult or impossible to be handled by previous approaches; and (3) create personalized summary related to certain event, player or team according to user's preference. We present the framework of our approach and details of text analysis, video analysis and text/video alignment. We conducted experiments on both live games and recorded games. The results are encouraging and comparable to the manually detected events. We also give scenarios to illustrate how to apply the proposed solution to professional and consumer services.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
 |
1
|
|
| |
2
|
|
| |
3
|
|
| |
4
|
A. Ekin, A. M. Tekalp, and R. Mehrotra, "Automatic soccer video analysis and summarization", IEEE Trans. on Image Processing, vol. 12:7, no. 5, pp. 796--807, 2003.
|
 |
5
|
|
| |
6
|
|
| |
7
|
R. Radhakrishan, Z. Xiong, A. Divakaran, Y. Ishikawa, "Generation of sports highlights using a combination of supervised & unsupervised learning in audio domain", In Proc. of International Conference on Pacific Rim Conference on Multimedia, Vol. 2, pp. 935--939, December 2003.
|
| |
8
|
K. Wan, and C. Xu, "Robust soccer highlight generation with a novel dominant-speech feature extractor", In Proc. of IEEE International Conference on Multimedia and Expo, Taipei, Taiwan, pp.591--594, 27-30 Jun. 2004.
|
| |
9
|
M. Xu, L. Duan, C. Xu, and Q. Tian, "A fusion scheme of visual and auditory modalities for event detection in sports video", In Proc. of IEEE International Conference on Acoustics, Speech, & Signal Processing, Hong Kong, China, Vol.3, pp.189--192, 2003.
|
| |
10
|
|
| |
11
|
M. Xu, L. Duan, C. Xu, M.S. Kankanhalli, and Q. Tian, "Event detection in basketball video using multi-modalities", In Proc. of IEEE Pacific Rim Conference on Multimedia, Singapore, Vol.3, pp.1526--1530, 15-18 Dec, 2003.
|
 |
12
|
|
 |
13
|
|
 |
14
|
Jinjun Wang , Changsheng Xu , Engsiong Chng , Lingyu Duan , Kongwah Wan , Qi Tian, Automatic generation of personalized music sports video, Proceedings of the 13th annual ACM international conference on Multimedia, November 06-11, 2005, Hilton, Singapore
[doi> 10.1145/1101149.1101309]
|
| |
15
|
N. Nitta and N. Babaguchi, "Automatic story segmentation of closed-caption text for semantic content analysis of broadcasted sports video," In Proc. of 8th International Workshop on Multimedia Information Systems '02, pp. 110--116, 2002.
|
| |
16
|
N. Babaguchi, Y. Kawai, and T. Kitahashi, "Event based indexing of broadcasted sports video by intermodal collaboration," IEEE Trans. on Multimedia, Vol. 4, pp. 68--75, March 2002.
|
| |
17
|
|
 |
18
|
|
| |
19
|
|
| |
20
|
|
| |
21
|
|
| |
22
|
|
| |
23
|
|
| |
24
|
|
| |
25
|
Y. Tan and et al, "Rapid estimation of camera motion from compressed video with application to video annotation," IEEE Trans. on Circuits and Systems for Video Technology, vol. 10- 1, pp. 133--146, 2000.
|
| |
26
|
|
CITED BY 12
|
|
Ling-Yu Duan , Jinqiao Wang , Yantao Zheng , Jesse S. Jin , Hanqing Lu , Changsheng Xu, Segmentation, categorization, and identification of commercial clips from TV streams using multimodal analysis, Proceedings of the 14th annual ACM international conference on Multimedia, October 23-27, 2006, Santa Barbara, CA, USA
|
|
|
|
|
|
Guangyu Zhu , Changsheng Xu , Yi Zhang , Qingming Huang , Hanqing Lu, Event tactic analysis based on player and ball trajectory in broadcast video, Proceedings of the 2008 international conference on Content-based image and video retrieval, July 07-09, 2008, Niagara Falls, Canada
|
|
|
|
|
|
Guangyu Zhu , Qingming Huang , Changsheng Xu , Yong Rui , Shuqiang Jiang , Wen Gao , Hongxun Yao, Trajectory based event tactics analysis in broadcast sports video, Proceedings of the 15th international conference on Multimedia, September 25-29, 2007, Augsburg, Germany
|
|
|
Yifan Zhang , Xiaoyu Zhang , Changsheng Xu , Hanqing Lu, Personalized retrieval of sports video, Proceedings of the international workshop on Workshop on multimedia information retrieval, September 24-29, 2007, Augsburg, Bavaria, Germany
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Guangyu Zhu , Changsheng Xu , Qingming Huang , Yong Rui , Shuqiang Jiang , Wen Gao , Hongxun Yao, Event tactic analysis based on broadcast sports video, IEEE Transactions on Multimedia, v.11 n.1, p.49-67, January 2009
|
|