Real-time speech motion synthesis from recorded motions
Source
Symposium on Computer Animation
Proceedings of the 2004 ACM SIGGRAPH/Eurographics Symposium on Computer Animation
Grenoble, France
SESSION: Reality-based animation
Pages: 345-353
Year of Publication: 2004
ISSN: 1727-5288, ISBN: 3-905673-14-2
Authors
Yong Cao  University of California at Los Angeles
Petros Faloutsos  University of California at Los Angeles
Eddie Kohler  University of California at Los Angeles
Frédéric Pighin  University of Southern California
Sponsors
SIGGRAPH: ACM Special Interest Group on Computer Graphics and Interactive Techniques
Eurographics: Eurographics Association
Publisher
Eurographics Association, Aire-la-Ville, Switzerland
DOI: http://doi.acm.org/10.1145/1028523.1028570


ABSTRACT

Data-driven approaches have been successfully used for realistic visual speech synthesis. However, little effort has been devoted to real-time lip-synching for interactive applications. In particular, algorithms that are based on a graph of motions are notorious for their exponential complexity. In this paper, we present a greedy graph search algorithm that yields vastly superior performance and allows real-time motion synthesis from a large database of motions. The time complexity of the algorithm is linear with respect to the size of an input utterance. In our experiments, the synthesis time for an input sentence of average length is under a second.
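
The abstract does not spell out the algorithm, so the following is only a minimal sketch of what a greedy, linear-time selection over a motion database could look like, not the authors' method. It assumes a hypothetical database that maps each phoneme to a bounded list of recorded segments, and it reduces each facial pose to a single number so that the transition cost is a simple absolute difference. The names `Segment`, `greedy_synthesize`, `start_pose`, and `end_pose` are all illustrative.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional


@dataclass(frozen=True)
class Segment:
    """Hypothetical recorded motion clip for one phoneme."""
    phoneme: str
    start_pose: float  # 1-D stand-in for the facial pose at clip start
    end_pose: float    # 1-D stand-in for the facial pose at clip end


def greedy_synthesize(phonemes: List[str],
                      database: Dict[str, List[Segment]]) -> List[Segment]:
    """Pick one segment per phoneme, left to right, never backtracking.

    Each step keeps only the candidate whose start pose is closest to the
    end pose of the previously chosen segment. With a bounded number of
    candidates per phoneme, the work grows linearly with len(phonemes).
    """
    path: List[Segment] = []
    prev: Optional[Segment] = None
    for p in phonemes:
        candidates = database.get(p, [])
        if not candidates:
            raise KeyError(f"no recorded motion for phoneme {p!r}")
        if prev is None:
            best = candidates[0]  # assumption: any first clip is acceptable
        else:
            last_end = prev.end_pose
            best = min(candidates, key=lambda s: abs(s.start_pose - last_end))
        path.append(best)
        prev = best
    return path


if __name__ == "__main__":
    # Toy database, invented for illustration only.
    db = {
        "h": [Segment("h", 0.0, 0.2)],
        "e": [Segment("e", 0.9, 1.0), Segment("e", 0.25, 0.5)],
        "l": [Segment("l", 0.45, 0.6), Segment("l", 0.95, 0.7)],
    }
    for seg in greedy_synthesize(["h", "e", "l"], db):
        print(seg.phoneme, seg.start_pose, seg.end_pose)
```

An exhaustive search would compare every combination of candidates, which grows exponentially with utterance length. The greedy version commits to one choice per phoneme, trading global optimality for a running time proportional to the input size.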

