|
ABSTRACT
Like other types of digital information, video sequences must be classified based on the semantics of their contents. A more-precise and completer extraction of semantic information will result in a more-effective classification. The most-discernible difference between still images and moving pictures stems from movements and variations. Thus, to go from the realm of still-image repositories to video databases, we must be able to deal with motion. Particularly, we need the ability to classify objects appearing in a video sequence based on their characteristics and features such as shape or color, as well as their movements. By describing the movements that we derive from the process of motion analysis, we introduce a dual hierarchy consisting of spatial and temporal parts for video sequence representation. This gives us the flexibility to examine arbitrary sequences of frames at various levels of abstraction and to retrieve the associated temporal information (say, object trajectories) in addition to the spatial representation. Our algorithm for motion detection uses the motion compensation component of the MPEG video-encoding scheme and then computes trajectories for objects of interest. The specification of a language for retrieval of video based on the spatial as well as motion characteristics is presented.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
 |
2
|
F. Arman , R. Depommier , A. Hsu , M.-Y. Chiu, Content-based browsing of video sequences, Proceedings of the second ACM international conference on Multimedia, p.97-103, October 15-20, 1994, San Francisco, California, United States
[doi> 10.1145/192593.192630]
|
| |
3
|
BOBICK, A.F. 1993. Representational frames in video annotation. In Proceedzngs of the 27th Annual Conference on Signals. Systems and Computers. IEEE Computer Society Press, Los Alamitos, Cahf.
|
| |
4
|
BUCHANAN, M. C. AND ZELLWEGER, P.T. 1993. Automatically generating consistent schedules for multimedia documents. Multzrned~a Syst. J. 1, 2 (Sept.).
|
| |
5
|
DAVIS, M. 1993. Media streams: An icomc visual language for video annotation. In Proceedings of the IEEE Symposzum on Wsual Languages (Bergen, Norway). IEEE, New York, 196 202.
|
| |
6
|
|
 |
7
|
|
| |
8
|
DUDA, R. O. AND HART, P.E. 1973 Pattern Classification and Scene Analys~s. John Wiley and Sons, New York.
|
| |
9
|
|
| |
10
|
|
| |
11
|
GODD^RD, N. 1992. The perception of articulated motion: Recognizing moving light displays. Ph.D. thesis, Univ. of Rochester, Rochester, N.Y.
|
| |
12
|
GOLSHANI, F. AND DIMITROVA, N. 1994. Retrieval and delivery of information in multimedia database systems. Inf. Softw Tech. 36, 4 (May), 235 242.
|
| |
13
|
|
| |
14
|
|
| |
15
|
|
| |
16
|
|
 |
17
|
A. Hampapur , T. Weymouth , R. Jain, Digital video segmentation, Proceedings of the second ACM international conference on Multimedia, p.357-364, October 15-20, 1994, San Francisco, California, United States
[doi> 10.1145/192593.192699]
|
| |
18
|
HORN, B. K. AND SCHUNCK, B. G. 1981. Determining optical flow. Artzfi Intell. 17, 1-3, 185-203.
|
| |
19
|
JOHANSSON, G. 1976. Spatio-temporal differentiation and lntegTation in visual motion perception. Psychol. Res. 38, 4, 379 393.
|
| |
20
|
|
| |
21
|
|
 |
22
|
|
 |
23
|
T. D. C. Little , G. Ahanger , R. J. Folz , J. F. Gibbon , F. W. Reeve , D. H. Schelleng , D. Venkatesh, A digital on-demand video service supporting content-based queries, Proceedings of the first ACM international conference on Multimedia, p.427-436, August 02-06, 1993, Anaheim, California, United States
[doi> 10.1145/166266.168450]
|
| |
24
|
|
| |
25
|
|
| |
26
|
MICHAEL, N. 1994. VEENA--a visual query language. M.S. thesis, Arizona State Univ., Tempe, Ariz.
|
| |
27
|
|
| |
28
|
|
 |
29
|
|
 |
30
|
Ketan Patel , Brian C. Smith , Lawrence A. Rowe, Performance of a software MPEG video decoder, Proceedings of the first ACM international conference on Multimedia, p.75-82, August 02-06, 1993, Anaheim, California, United States
[doi> 10.1145/166266.166274]
|
| |
31
|
RASURE, J., ARGIRO, D., SAUER, T., AND WILLIAMS, C. 1990. Visual language and software development environment for image processing. Int. J. Imaging Syst. Tech. 2, 2, 183-199.
|
| |
32
|
|
| |
33
|
|
| |
34
|
RowE, L. A., BORECZKY, J. S., AND EADS, C.A. 1994. Indexes for user access to large video databases. In Proceedings of SPIE IS and Symposium on Storage and Retmeval for Image and Video Databases (San Jose, Calif.). SPIE.
|
| |
35
|
|
| |
36
|
SWANBERG, D., SHU, C.-F., AND JAIN, R. 1993. Knowledge guided parsing in video databases. In Image and Video Processtng Conference; Symposium on Electromc hnagtng: Sczence and Technology. Vol. 1908. IS & T/SPIE, 13-24.
|
 |
37
|
|
 |
38
|
|
| |
39
|
WEISS, R. 1994. Content-based access to algebraic video. Tech. Rep., Massachusetts Inst. of Technology, Cambridge, Mass.
|
| |
40
|
ZttANG, H., GONG, Y., SMOLIAR, S., AND TAN, S.Y. 1994. Automatic parsing of news video. In Proceedings of the Internattonal Conference on Multimedia Computing and Systems (Boston, Mass.). IEEE Computer Society Press. Los Alarnltos, Ca}i~, 45-54.
|
CITED BY 20
|
|
|
|
|
|
|
|
|
|
|
Nevenka Dimitrova , Thomas McGee , Herman Elenbaas, Video keyframe extraction and filtering: a keyframe is not a keyframe to everyone, Proceedings of the sixth international conference on Information and knowledge management, p.113-120, November 10-14, 1997, Las Vegas, Nevada, United States
|
|
|
Wensheng Zhou , Asha Vellaikal , C. C. Jay Kuo, Rule-based video classification system for basketball video indexing, Proceedings of the 2000 ACM workshops on Multimedia, p.213-216, October 30-November 03, 2000, Los Angeles, California, United States
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|