Designing annotation before it's needed
Source: International Multimedia Conference; Vol. 9
Proceedings of the ninth ACM international conference on Multimedia
Ottawa, Canada
Session: Authoring Support
Pages: 251 - 260  
Year of Publication: 2001
ISBN:1-58113-394-4
Authors
Frank Nack  CWI Amsterdam, Amsterdam, The Netherlands
Wolfgang Putz  GMD-IPSI, Darmstadt, Germany
Sponsors
SIGMULTIMEDIA: ACM Special Interest Group on Multimedia
SIGCOMM: ACM Special Interest Group on Data Communication
SIGGRAPH: ACM Special Interest Group on Computer Graphics and Interactive Techniques
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 5,   Downloads (12 Months): 27,   Citation Count: 13
DOI: http://doi.acm.org/10.1145/500141.500180

ABSTRACT

This paper considers the automated and semi-automated annotation of audiovisual media in a new type of production framework, A4SM (Authoring System for Syntactic, Semantic and Semiotic Modelling). We present the architecture of the framework and outline the underlying XML-Schema based content description structures of A4SM. We then describe tools for news production and demonstrate how video material can be annotated in real time and how this information can be used not only for retrieval but also during the different phases of the production process itself. Finally, we discuss the pros and cons of our approach of evolving semantic networks as the basis for audiovisual content description.

