eFASE: expressive facial animation synthesis and editing with phoneme-isomap controls
Source: Symposium on Computer Animation
Proceedings of the 2006 ACM SIGGRAPH/Eurographics Symposium on Computer Animation
Vienna, Austria
SESSION: Facial animation
Pages: 251-260
Year of Publication: 2006
ISSN: 1727-5288, ISBN: 3-905673-34-7
Authors
Zhigang Deng  University of Houston
Ulrich Neumann  University of Southern California
Sponsors
SIGGRAPH: ACM Special Interest Group on Computer Graphics and Interactive Techniques
Eurographics: Eurographics
Publisher
Eurographics Association, Aire-la-Ville, Switzerland

ABSTRACT

This paper presents a novel data-driven system for expressive facial animation synthesis and editing. Given novel phoneme-aligned speech input and its emotion modifiers (specifications), this system automatically generates expressive facial animation by concatenating captured motion data while animators establish constraints and goals. A constrained dynamic programming algorithm is used to search for best-matched captured motion nodes by minimizing a cost function. Users optionally specify "hard constraints" (motion-node constraints for expressing phoneme utterances) and "soft constraints" (emotion modifiers) to guide the search process. Users can also edit the processed facial motion node database by inserting and deleting motion nodes via a novel phoneme-Isomap interface. Novel facial animation synthesis experiments and objective trajectory comparisons between synthesized facial motion and captured motion demonstrate that this system is effective for producing realistic expressive facial animations.
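
The constrained search described in the abstract can be illustrated with a small Viterbi-style dynamic program. This is only a sketch under stated assumptions, not the authors' implementation: the `MotionNode` fields, the 1-D "pose" values, the two cost terms, and the fixed emotion penalty are all hypothetical stand-ins, since the abstract does not define the actual cost function. Hard constraints pin a specific node at a position; soft constraints add a penalty when a node's emotion label differs from the requested modifier.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional, Sequence, Tuple


@dataclass(frozen=True)
class MotionNode:
    """Hypothetical captured motion node (fields are illustrative only)."""
    node_id: int
    phoneme: str
    emotion: str
    start_pose: float  # toy 1-D stand-in for the facial pose at node start
    end_pose: float    # toy 1-D stand-in for the facial pose at node end


def transition_cost(prev: MotionNode, cur: MotionNode) -> float:
    # Assumed smoothness term: pose discontinuity at the joint.
    return abs(prev.end_pose - cur.start_pose)


def emotion_cost(node: MotionNode, wanted: Optional[str],
                 penalty: float = 1.0) -> float:
    # Soft constraint: a mismatch is penalized, not forbidden.
    if wanted is None or node.emotion == wanted:
        return 0.0
    return penalty


def search_motion_nodes(
    phonemes: Sequence[str],
    emotions: Sequence[Optional[str]],
    database: Sequence[MotionNode],
    hard: Optional[Dict[int, int]] = None,
) -> Tuple[List[MotionNode], float]:
    """Return the minimum-cost node sequence and its total cost.

    `hard` maps a position in `phonemes` to a required node_id.
    """
    if len(phonemes) != len(emotions):
        raise ValueError("phonemes and emotions must have equal length")
    if not phonemes:
        return [], 0.0
    hard = hard or {}

    # Build one candidate layer per phoneme; hard constraints prune the layer.
    layers: List[List[MotionNode]] = []
    for i, ph in enumerate(phonemes):
        cands = [n for n in database if n.phoneme == ph]
        if i in hard:
            cands = [n for n in cands if n.node_id == hard[i]]
        if not cands:
            raise ValueError(f"no candidate node for position {i} ({ph!r})")
        layers.append(cands)

    # Forward pass: best[k] is the cheapest cost of any path ending at
    # candidate k of the current layer.
    best = [emotion_cost(n, emotions[0]) for n in layers[0]]
    back: List[List[int]] = []
    for i in range(1, len(layers)):
        new_best: List[float] = []
        ptrs: List[int] = []
        for cur in layers[i]:
            costs = [best[j] + transition_cost(prev, cur)
                     for j, prev in enumerate(layers[i - 1])]
            j_min = min(range(len(costs)), key=costs.__getitem__)
            new_best.append(costs[j_min] + emotion_cost(cur, emotions[i]))
            ptrs.append(j_min)
        best = new_best
        back.append(ptrs)

    # Backward pass: follow the stored pointers from the cheapest end node.
    k = min(range(len(best)), key=best.__getitem__)
    total = best[k]
    path = [k]
    for ptrs in reversed(back):
        k = ptrs[k]
        path.append(k)
    path.reverse()
    return [layers[i][k] for i, k in enumerate(path)], total
```

Calling `search_motion_nodes(["h", "ay"], ["happy", "happy"], database)` returns the node sequence together with its accumulated cost; passing `hard={0: some_id}` forces the first phoneme to use that node while the rest of the path is still optimized around it.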
