| Emotional speech synthesis by XML file using interactive genetic algorithms |
| Full text |
Pdf
(460 KB)
|
Source
|
ACM/SIGEVO Summit on Genetic and Evolutionary Computation
archive
Proceedings of the first ACM/SIGEVO Summit on Genetic and Evolutionary Computation
table of contents
Shanghai, China
POSTER SESSION: Poster sessions
table of contents
Pages 907-910
Year of Publication: 2009
ISBN:978-1-60558-326-6
|
|
Authors
|
|
Siliang Lv
|
Department of Computer Science and Technology, Key Laboratory of Software in Computing and Communication in Anhui, USTC, Hefei, China
|
|
Shangfei Wang
|
Department of Computer Science and Technology, Key Laboratory of Software in Computing and Communication in Anhui, USTC, Hefei, China
|
|
Xufa Wang
|
Department of Computer Science and Technology, Key Laboratory of Software in Computing and Communication in Anhui, USTC, Hefei, China
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 15, Downloads (12 Months): 47, Citation Count: 0
|
|
|
ABSTRACT
As a technique that can "let computer speak", speech synthesis is drawing more and more attention. Today, much speech synthesis software can synthesize neutral speech naturally and knowingly. However, it is hard to make computers speak with "emotion" as that in our daily life, because of the complexity of emotion model. Interactive Genetic Algorithms which can be acted self-organizingly, adaptively and self-learningly can just resolve the problem of difficulty in modeling emotional speech synthesis. As a result, this paper designs an emotional speech synthesis process, which adjusts the parameters (XML-tags) used to synthesize emotional speech dynamically, using interactive Genetic Algorithms, to optimize the quality of emotional speech. Also, the paper includes an evaluation experiment, which proves the feasibility of the algorithms.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
Marc Schroder,Emotional Speech Synthesis: A Review, DFKI, Saarbrucken, Germany Institute of Phonetics, University of the Saarland
|
| |
2
|
Paul Ekrnan and Harriet Oster Facial Expressions Of Emotion, Department of Psychiatry, University of California, San Francisco, California 94143, Ann. Rev. Psvchol. 1979. 30:527--54
|
| |
3
|
Jianhua Tao, Member, IEEE, Yongguo Kang, and Aijun Li, Prosody Conversion From Neutral Speech to Emotional Speech, IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL. 14, NO. 4, JULY 2006, 1558--7916/$20.00 @ 2006 IEEE
|
| |
4
|
|
| |
5
|
Yuji Sato. Voice quality conversion using interactive evolution of prosodic control. Faculty of Computer and Information Sciences, Hosei University. Applied Soft Computing 5 (2005) 181--192
|
| |
6
|
Murtaza Bulut etc. A Statistical Approach For Modeling Prosody Features Using POS Tags For Emotional Speech Synthesis. 1-4244-0728-1/07/$20.00 @ 2007 IEEE
|
| |
7
|
Shinya Mori etc. Emotional Speech Synthesis Using Subspace Constraints In Prosody. 1424403677/06/$20.00 @ 2006 IEEE
|
| |
8
|
Mariet Theune etc. Generating Expressive Speech for Storytelling Applications. 1558--7916/$20.00 @ 2006 IEEE
|
| |
9
|
Janet E. Cahn. Generating Expression in Synthesized Speech. Massachusetts Institute of Technology 1990
|
| |
10
|
|
| |
11
|
|
| |
12
|
Marc Schroder, Expressing Degree of Activation in Synthetic Speech, IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL. 14, NO. 4, JULY 2006
|
| |
13
|
|
| |
14
|
H Scheffe, An analysis of variance for paired comparisons, Journal of the American Statistical Association, 1952 -- JSTOR
|
| |
15
|
Microsoft Corporation, Microsoft Speech SDK 5.1 SAPI
|
|