ACM Home Page
Please provide us with feedback. Feedback
Automated authoring of coherent multimedia discourse in conversation systems
Full text PdfPdf (2.26 MB)
Source International Multimedia Conference; Vol. 9 archive
Proceedings of the ninth ACM international conference on Multimedia table of contents
Ottawa, Canada
Session: Posters and Short Papers table of contents
Pages: 555 - 558  
Year of Publication: 2001
ISBN:1-58113-394-4
Authors
Michelle X. Zhou  IBM T. J. Watson Research Center, Hawthorne, NY
Shimei Pan  IBM T. J. Watson Research Center, Hawthorne, NY
Sponsors
SIGMULTIMEDIA: ACM Special Interest Group on Multimedia
SIGCOMM: ACM Special Interest Group on Data Communication
SIGGRAPH: ACM Special Interest Group on Computer Graphics and Interactive Techniques
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 0,   Downloads (12 Months): 17,   Citation Count: 5
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/500141.500241
What is a DOI?

ABSTRACT

We are building a full-fledged multimedia conversation framework called Responsive Information Architect (RIA), using a combination of AI and multimedia techniques. Here we describe RIA's capability of automated authoring of a coherent multimedia discourse, which is used by RIA to express itself when conversing with a user. Specifically, we focus on explaining three unique features of our automated authoring approach: automated authoring of multimedia inter¿action acts, dynamic insertion of multimedia punctuation acts, and systematic design of cross-media acts.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
D. Arijon. Grammar of the Film Language. Silman-James Press, 1976.
 
2
3
4
5
 
6
7
 
8
J. Levin, G. Anglin, and R. Carney. On empirically validating functions of pictures in prose. In D. Willows and H. Houghton, editors, The Psychology of Illustration: Basic Research, volume 1, chapter 2, pages 5 l-86. Springer-Verlag, New York, 1987.
 
9
 
10
K. McKeown, S. Feiner, J. Robin, D. Seligmann, and M. Tanenblatt. Generating cross-references for multimedia explanation. In Proc. AAAI '92, pages 12-17, 1992.
 
11
K. McKeown and S. Pan. Prosody modeling in concept-to-speech generation: Methodological issues. Philosophical Trunsacrions of the Royal Society, Series A, 358( 1769), 2000.
 
12
S. Pan and K. McKeown. Spoken language generation in a multimedia system. In Proc. ICSLP '96, pages 374-377, 1996.
13
 
14
 
15
 
16
M. Zhou and S. Ma. Representing and retrieving visual presentations for example-based graphics generation. In Proc. 1st Intl. Syp. on Smart Graphics 2001, pages 87-94, 2001.


Collaborative Colleagues:
Michelle X. Zhou: colleagues
Shimei Pan: colleagues