ABSTRACT
The Behavior Expression Animation Toolkit (BEAT) allows animators to input typed text that they wish to be spoken by an animated human figure, and to obtain as output appropriate and synchronized nonverbal behaviors and synthesized speech in a form that can be sent to a number of different animation systems. The nonverbal behaviors are assigned on the basis of actual linguistic and contextual analysis of the typed text, relying on rules derived from extensive research into human conversational behavior. The toolkit is extensible, so that new rules can be quickly added. It is designed to plug into larger systems that may also assign personality profiles, motion characteristics, scene constraints, or the animation styles of particular animators.
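The extensibility described above, where new rules can be added quickly, can be pictured as a registry of independent rules that each annotate the typed text with behaviors. The following is only a minimal sketch under stated assumptions, not BEAT's actual implementation: the names `Token`, `rule`, `annotate`, and both toy rules (`nod_on_affirmation`, `gaze_away_at_start`) are hypothetical and chosen purely for illustration.

```python
from dataclasses import dataclass, field
from typing import Callable, List


@dataclass
class Token:
    """One word of the typed text plus the behaviors assigned to it."""
    text: str
    behaviors: List[str] = field(default_factory=list)


# A rule inspects the whole utterance and may annotate tokens in place.
Rule = Callable[[List[Token]], None]

# Hypothetical registry: rules run in the order they were registered.
RULES: List[Rule] = []


def rule(fn: Rule) -> Rule:
    """Register a rule so it runs without any change to the pipeline."""
    RULES.append(fn)
    return fn


@rule
def nod_on_affirmation(tokens: List[Token]) -> None:
    # Toy rule (an assumption, not taken from the paper): nod on affirmations.
    for t in tokens:
        if t.text.lower().strip(".,!?") in {"yes", "okay", "right"}:
            t.behaviors.append("head_nod")


@rule
def gaze_away_at_start(tokens: List[Token]) -> None:
    # Toy rule (an assumption): look away at the start of an utterance.
    if tokens:
        tokens[0].behaviors.append("gaze_away")


def annotate(text: str) -> List[Token]:
    """Split the text on whitespace and apply every registered rule."""
    tokens = [Token(w) for w in text.split()]
    for r in RULES:
        r(tokens)
    return tokens


if __name__ == "__main__":
    for tok in annotate("Yes, the toolkit is extensible."):
        print(tok.text, tok.behaviors)
```

In this sketch, adding a behavior means writing one more decorated function; the `annotate` loop itself stays unchanged. A real system would replace the whitespace split with proper linguistic and contextual analysis and would also handle timing against the synthesized speech, which this example leaves out.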