|
ABSTRACT
This paper presents a novel data-driven system for expressive facial animation synthesis and editing. Given novel phoneme-aligned speech input and its emotion modifiers (specifications), this system automatically generates expressive facial animation by concatenating captured motion data while animators establish constraints and goals. A constrained dynamic programming algorithm is used to search for best-matched captured motion nodes by minimizing a cost function. Users optionally specify "hard constraints" (motion-node constraints for expressing phoneme utterances) and "soft constraints" (emotion modifiers) to guide the search process. Users can also edit the processed facial motion node database by inserting and deleting motion nodes via a novel phoneme-Isomap interface. Novel facial animation synthesis experiments and objective trajectory comparisons between synthesized facial motion and captured motion demonstrate that this system is effective for producing realistic expressive facial animations.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
{BBPV03} Blanz V., Basso C., Poggio T., Vetter T.: Reanimating faces in images and video. Computer Graphics Forum 22, 3 (2003).
|
| |
2
|
|
| |
3
|
|
| |
4
|
|
| |
5
|
|
 |
6
|
|
| |
7
|
|
| |
8
|
|
| |
9
|
{CG00} Cosatto E., Graf H. P.: Audio-visual unit selection for the synthesis of photo-realistic talking-heads. In Proc. of ICME (2000), pp. 619--622.
|
| |
10
|
{CM93} Cohen M. M., Massaro D. W.: Modeling coarticulation in synthetic visual speech. Models and Techniques in Computer Animation, Springer Verlag (1993), 139--156.
|
| |
11
|
|
 |
12
|
|
| |
13
|
{DLN05} Deng Z., Lewis J. P., Neumann U.: Synthesizing speech animation by learning compact speech co-articulation models. In Proc. of Computer Graphics International (2005), pp. 19--25.
|
| |
14
|
Zhigang Deng , Ulrich Neumann , J. P. Lewis , Tae-Yong Kim , Murtaza Bulut , Shrikanth Narayanan, Expressive Facial Animation Synthesis by Learning Speech Coarticulation and Expression Spaces, IEEE Transactions on Visualization and Computer Graphics, v.12 n.6, p.1523-1534, November 2006
[doi> 10.1109/TVCG.2006.90]
|
 |
15
|
|
| |
16
|
{fes04} http://www.cstr.ed.ac.uk/projects/festival/, 2004.
|
 |
17
|
|
| |
18
|
|
| |
19
|
|
| |
20
|
Kolja Kähler , Jörg Haber , Hans-Peter Seidel, Geometry-based muscle modeling for facial animation, No description on Graphics interface 2001, p.37-46, June 07-09, 2001, Ottawa, Ontario, Canada
|
| |
21
|
|
| |
22
|
{KT03} Kshirsagar S., Thalmann N. M.: Visyllable based speech animation. Computer Graphics Forum 22, 3 (2003).
|
| |
23
|
{Lew91} Lewis J. P.: Automated lip-sync: Background and techniques. Journal of Visualization and Computer Animation (1991), 118--122.
|
 |
24
|
|
| |
25
|
|
 |
26
|
|
| |
27
|
|
 |
28
|
|
| |
29
|
Hyewon Pyun , Yejin Kim , Wonseok Chae , Hyung Woo Kang , Sung Yong Shin, An example-based approach for facial expression cloning, Proceedings of the 2003 ACM SIGGRAPH/Eurographics symposium on Computer animation, July 26-27, 2003, San Diego, California
|
| |
30
|
|
 |
31
|
|
 |
32
|
|
| |
33
|
{TSL00} Tenenbaum J., Silva V. D., Langford J.: A global geometric framework for nonlinear dimensionality reduction. Science 290, 5500 (2000), 2319--2333.
|
 |
34
|
|
| |
35
|
{WF95} Waters K., Frisble J.: A coordinated muscle model for speech animation. Proc. of Graphics Interface '95 (1995), 163--170.
|
 |
36
|
|
| |
37
|
|
 |
38
|
|
CITED BY
|
|
Tanasai Sucontphunt , Zhenyao Mo , Ulrich Neumann , Zhigang Deng, Interactive 3D facial expression posing through 2D portrait manipulation, Proceedings of graphics interface 2008, May 28-30, 2008, Windsor, Ontario, Canada
|
|