|
ABSTRACT
Advances in the media and entertainment industries, for example streaming audio and digital TV, present new challenges for managing large audio-visual collections. Efficient and effective retrieval from large content collections forms an important component of the business models for content holders and this is driving a need for research in audio-visual search and retrieval. Current content management systems support retrieval using low-level features, such as motion, colour, texture, beat and loudness. However, low-level features often have little meaning for the human users of these systems, who much prefer to identify content using high-level semantic descriptions or concepts. This creates a gap between the system and the user that must be bridged for these systems to be used effectively. The research presented in this paper describes our approach to bridging this gap in a specific content domain, sports video. Our approach is based on a number of automatic techniques for feature detection used in combination with heuristic rules determined through manual observations of sports footage. This has led to a set of models for interesting sporting events-goal segments-that have been implemented as part of an information retrieval system. The paper also presents results comparing output of the system against manually identified goals.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
Myron Flickner , Harpreet Sawhney , Wayne Niblack , Jonathan Ashley , Qian Huang , Byron Dom , Monika Gorkani , Jim Hafner , Denis Lee , Dragutin Petkovic , David Steele , Peter Yanker, Query by Image and Video Content: The QBIC System, Computer, v.28 n.9, p.23-32, September 1995
[doi> 10.1109/2.410146]
|
 |
2
|
Shih-Fu Chang , William Chen , Horace J. Meng , Hari Sundaram , Di Zhong, VideoQ: an automated content based video search system using visual cues, Proceedings of the fifth ACM international conference on Multimedia, p.313-324, November 09-13, 1997, Seattle, Washington, United States
[doi> 10.1145/266180.266382]
|
| |
3
|
|
| |
4
|
Arun Hampapur and Ramesh Jain. Virage Video Engine. SPIE Vol. 3022. pp 188-197.
|
| |
5
|
|
| |
6
|
|
| |
7
|
|
| |
8
|
Hari Sundaram and Shih-Fu Chang. Efficient Video Sequence Retrieval in Large Repositories. SPIE '99 Storage and Retrieval of Image and Video Databases VII, San Jose CA, Jan 24-29, 1999.
|
| |
9
|
|
| |
10
|
C.Carson and V.E. Ogle. Storage and retrieval of feature data for a very large online image collection. Bulletin of the IEEE Computer Society Technical Committee on Data Engineering, I9(4):19-27, December 1996.
|
 |
11
|
|
| |
12
|
|
| |
13
|
M. Naphade, T. Kristjansson, B. Frey, and T.S. Huang. Probabilistic multimedia objects (multijects): A novel approach to indexing and retrieval in multimedia systems. In Proceedings of the fifth IEEE International Conference on Image Processing, Volume 3, pages 536-540, Chicago, IL, Ott 1998.
|
| |
14
|
Drew D. Sam-, Yap-Peng Tan, Sanjeev R. Kulkarni, and Peter J. Ramadge. Automatic analysis and Annotation of Basketball Video. In Storage and Retrieval for Image and Video Databases V, volume SPIE-3022, pages 176-187, Feb. 1997.
|
| |
15
|
G. Sudhir, John CM. Lee, and Anil K. Jam. Automatic Classification of Tennis Video for High-level Content-based Retrieval. Technical Report HKUST-CS97-2, The Hong Kong University of Science and Technology, Hong Kong, August 7, 1997.
|
| |
16
|
|
| |
17
|
Peter Nell. MPEG Digital Audio Coding. IEEE Signal Processing Magazine. Sept. 1997. Pages 59-8
|
| |
18
|
Surya Nepal, Uma Srinivasan and Graham Reynolds. Semantic-Based Retrieval Model for Digital Audio/Video. CSIRO Mathematical and Information Sciences. Technical Report No 2000/174. October 2000.
|
| |
19
|
Mediaware solutions. http://www.mediaware.com.au/
|
| |
20
|
MPEGMaaate: http://www.cmis.csiro.au/dmis/Maaate/
|
| |
21
|
Lifang Gu. Scene Analysis of Video Sequences in the MPEG Domain. Proceedings of the IASTED International Conference Signal and Image Processing October 28-31, Las Vegas, Nevada, U.S.A.
|
| |
22
|
|
 |
23
|
|
| |
24
|
Basketball Rules http:Nwww.basketball.coml
|
| |
25
|
Uma Srinivasan, Jordi Robert-Ribes and Graham Reynolds. Querying Video Content Using Multi-Modal Features. CSIRO Mathematical and Information Sciences. Technical Report 1997.
|
CITED BY 38
|
|
Min Xu , Ling-Yu Duan , Liang-Tien Chia , Chang-sheng Xu, Audio keyword generation for sports video analysis, Proceedings of the 12th annual ACM international conference on Multimedia, October 10-16, 2004, New York, NY, USA
|
|
|
J. Assfalg , M. Bertini , C. Colombo , A. Del Bimbo , W. Nunziati, Automatic interpretation of soccer video for highlights extraction and annotation, Proceedings of the 2003 ACM symposium on Applied computing, March 09-12, 2003, Melbourne, Florida
|
|
|
|
|
|
Ling-Yu Duan , Min Xu , Tat-Seng Chua , Qi Tian , Chang-Sheng Xu, A mid-level representation framework for semantic sports video analysis, Proceedings of the eleventh ACM international conference on Multimedia, November 02-08, 2003, Berkeley, CA, USA
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Songyang Lao , Alan F. Smeaton , Gareth J. F. Jones , Hyowon Lee, A query description model based on basic semantic unit composite petri-nets for soccer video analysis, Proceedings of the 6th ACM SIGMM international workshop on Multimedia information retrieval, October 15-16, 2004, New York, NY, USA
|
|
|
|
|
|
|
|
|
Marco Bertini , Alberto Del Bimbo , Rita Cucchiara , Andrea Prati, Semantic video adaptation based on automatic annotation of sport videos, Proceedings of the 6th ACM SIGMM international workshop on Multimedia information retrieval, October 15-16, 2004, New York, NY, USA
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Alan F. Smeaton , Bart Lehane , Noel E. O'Connor , Conor Brady , Gary Craig, Automatically selecting shots for action movie trailers, Proceedings of the 8th ACM international workshop on Multimedia information retrieval, October 26-27, 2006, Santa Barbara, California, USA
|
|
|
|
|
|
Changsheng Xu , Jinjun Wang , Kongwah Wan , Yiqun Li , Lingyu Duan, Live sports event detection based on broadcast video and web-casting text, Proceedings of the 14th annual ACM international conference on Multimedia, October 23-27, 2006, Santa Barbara, CA, USA
|
|
|
|
|
|
|
|
|
|
|
|
Uma Srinivasan , Silvia Pfeiffer , Surya Nepal , Michael Lee , Lifang Gu , Stephen Barrass, A Survey of MPEG-1 Audio, Video and Semantic Analysis Techniques, Multimedia Tools and Applications, v.27 n.1, p.105-141, September 2005
|
|
|
|
|
|
Chunxi Liu , Qingming Huang , Shuqiang Jiang , Liyuan Xing , Qixiang Ye , Wen Gao, A framework for flexible summarization of racquet sports video using multiple modalities, Computer Vision and Image Understanding, v.113 n.3, p.415-424, March, 2009
|
|
|
|
|
|
Min Xu , Changsheng Xu , Lingyu Duan , Jesse S. Jin , Suhuai Luo, Audio keywords generation for sports video analysis, ACM Transactions on Multimedia Computing, Communications, and Applications (TOMCCAP), v.4 n.2, p.1-23, May 2008
|
|
|
Yifan Zhang , Xiaoyu Zhang , Changsheng Xu , Hanqing Lu, Personalized retrieval of sports video, Proceedings of the international workshop on Workshop on multimedia information retrieval, September 24-29, 2007, Augsburg, Bavaria, Germany
|
|
|
Surong Wang , Manoranjan Dash , Liang-Tien Chia , Min Xu, Efficient sampling of training set in large and noisy multimedia data, ACM Transactions on Multimedia Computing, Communications, and Applications (TOMCCAP), v.3 n.3, p.14-es, August 2007
|
|
|
|
|
|
|
|
|
|
|
|
|
|