ACM Home Page
Please provide us with feedback. Feedback
An automatic lip-synchronization algorithm for synthetic faces
Full text PdfPdf (787 KB)
Source International Multimedia Conference archive
Proceedings of the second ACM international conference on Multimedia table of contents
San Francisco, California, United States
Pages: 149 - 156  
Year of Publication: 1994
ISBN:0-89791-686-7
Authors
K. Waters  Digital Equipment Corporation, Cambridge Research Lab, One Kendall Square, Cambridge, MA
T. Levergood  Digital Equipment Corporation, Cambridge Research Lab, One Kendall Square, Cambridge, MA
Sponsors
SIGGRAPH: ACM Special Interest Group on Computer Graphics and Interactive Techniques
SIGMIS: ACM Special Interest Group on Management Information Systems
SIGGROUP: ACM Special Interest Group on Supporting Group Work
SIGCHI: ACM Special Interest Group on Computer-Human Interaction
SIGCOMM: ACM Special Interest Group on Data Communication
SIGLINK: Hypertext, Hypermedia, and Web
SIGMULTIMEDIA: ACM Special Interest Group on Multimedia
SIGIR: ACM Special Interest Group on Information Retrieval
SIGBIO: ACM Special Interest Group on Biomedical Computing
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 8,   Downloads (12 Months): 35,   Citation Count: 5
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/192593.192644
What is a DOI?

ABSTRACT

This paper addresses the problem of automatically synchronizing computer-generated faces with synthetic speech. The complete process provides a novel form of face-to-face communication and the ability to create a new range of talking personable synthetic characters. Based on plain ASCII text input, a synthetic speech segment is generated and synchronized in real-time to a graphical display of an articulating mouth and face. The key component of the algorithm is the run-time facility that adaptively synchronizes the graphical display of the face to the audio.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
P. Bergeron and P. Lachapelle. Techniques for animating characters. In Advanced Computer Animation, volume 2 of SIGGIiAPH '85 Tutorials, pages 61-79. ACM, 1985.
 
2
H. Choi, S.C. Harashima and T. Takebe. Analysis and synthesis of facial expressions in know}edge-based coding of facial image sequences. In International Conference on Acoustics Speech and Signal Processing, pages 2737-2740, 1991.
 
3
J. Destandes. Histoire Comparee du Cinema, volume 1. Castarman, Paris, 1966.
 
4
P. Ekman and W.V. Friesen. Manual for the Facial Action Coding System. Consulting Psychologists Press, Palo Alto CA, 1977.
5
 
6
 
7
V. Fromkin. Lip positions in American English vowels. Language and Speech, 7(3):215-225, 1964.
 
8
H. Gouraud. Continuous shading of curved surfaces. IEEE Trans on Computers, 20(6), 1971.
 
9
 
10
D.R. Hill, A. Pearce, and B. WyviU. Animating speech: An automated aproach using speech synthesis by rules. The Visual Computer, 3:277-289, 1988.
 
11
Dennis H. Klatt. Software for a cascade/parallel formant synthesizer. J. Acoust. Soc. Am., 67(3):971-995, 1980.
 
12
Dennis H. Ktatt. Review of text-to-speech conversion for English. J. Acoust. Soc. Am., 82(3):737-793, 1987.
13
 
14
Thomas M. Levergood, Andrew C. Payne, James Gettys, G. Win_6eld Treese, and Lawrence C. Stewart. AudioFile' A network-transparent system for distributed audio applications. In Proceedings o/the USENIX Summet Conference, June 1993.
15
 
16
N. Magnenat-Thalmann, N.E. Primeau, and D. Thaimann. Abstract muscle actions procedures for human face animation. Visual Computer, 3(5):290-297, 1988.
 
17
M. McGrath. An gzamination o! Cues /or Visual and Audio-Visual Speech Perception using Natural and Computer Generated Faces. PhD thesis, University of Nottingham, England, November 1985.
 
18
H. McGurk and J. MacDonald. Heating lips and seeing voices. Nature, 264:126-130, 1986.
19
 
20
John K. Ousterhout. Tel: An embeddable command language. In Proceedings of the USENIX Winter Conference, January 1990.
 
21
John K. Ousterhout. An Xll toolkit based on the Tcl language. In Proceedings of the USENIX Winter Conference, January 1991.
 
22
F.I. Parke. Computer generated animation of faces. Master's thesis, University of Utah, Salt Lake City, June 1972. UTEC-CSc-72-120.
 
23
F.I. Parke. Parameterized models for facial animation. IEEE Computer Graphics and Applications, 2(9):61-68, 1982.
 
24
F.I. Parke. State of the art in facial animation. A CM SIGGRAPH Course Notes, 26, 1990.
25
 
26
 
27
F. Thomas and O. Johnson. Disney Animation: the Illusion of Life. Abbeville Press, New York, 1981.
28
 
29
C.T. Waite. The facial action control editor, face: A parametric facial expression editor for computer generated animation. Master's thesis, MIT, Feb 1989.
30
 
31
E.F. Walther. Lipreading. Nelson-Hall Inc, Chicago, 1982.
32
 
33
K. Waters and T. Levergood. DECface: An automatic lip-synchronization algorithm for synthetic faces. Technical Report 93/4, Digital Equipment Corp, Cambridge Research Laboratory, Sept 1993.
 
34
P. Weil. About face. Master's thesis, MIT, Aug 1982.
35