| Challenges in adopting speech recognition |
| Full text |
Html
(27 KB),
Pdf
(160 KB)
|
Source
|
Communications of the ACM
archive
Volume 47 , Issue 1 (January 2004)
table of contents
Multimodal interfaces that flex, adapt, and persist
SPECIAL ISSUE: Multimodal interfaces that flex, adapt, and persist
table of contents
Pages: 69 - 75
Year of Publication: 2004
ISSN:0001-0782
|
|
Authors
|
|
Li Deng
|
Microsoft Research, Redmond, WA
|
|
Xuedong Huang
|
Microsoft .NET Speech Technologies Group, Redmond, WA
|
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 24, Downloads (12 Months): 217, Citation Count: 9
|
|
|
ABSTRACT
Although progress has been impressive, there are still several hurdles that speech recognition technology must clear before ubiquitous adoption can be realized. R&D in spontaneous and free-flowing speech style is critical to its success.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
DARPA's EARS Conference (Boston, MA, May 21--22, 2003).
|
| |
2
|
DARPA's EARS Kickoff Meeting (Vienna, VA, May 9--10, 2002).
|
| |
3
|
Datamonitor. Voice Automation---Past, Present, and Future. White Paper (July 2003).
|
| |
4
|
Deng, L., and O'Shaughnessy, D. Speech Processing---A Dynamic and Optimization-Oriented Approach. Marcel Dekker, NY, 2003.
|
| |
5
|
Deng, L. Wang, K., Acero, A., Hon, H., Droppo, J., Boulis, C., Wang, Y., Jacoby, D., Mahajan, M., Chelba, C., and Huang, X.D. Distributed speech processing in MiPad's multimodal user interface. IEEE Transactions on Speech and Audio 10 (2002), 605--619.
|
| |
6
|
Furui, S. Recent progress in spontaneous speech recognition and understanding. In Proceedings of the IEEE Workshop on Multimedia Signal Processing (Dec. 2002).
|
| |
7
|
Hirsch, H., and Pearce, D. The AURORA experimental framework for the performance evaluations of speech recognition systems under noisy conditions. ISCA ITRW Workshop on Automatic Speech Recognition (Paris, 2000).
|
| |
8
|
|
| |
9
|
Neti, C., Iyengar, G., Potamianos, G., Senior, A., and Maison, B. Perceptual interfaces for information interaction: Joint processing of audio and visual information for human-computer interaction. In the ICSLP Proceedings 1. (Beijing, 2000), 11--14.
|
| |
10
|
Oviatt, S. Breaking the robustness barrier: Recent progress on the design of robust multimodal systems. Advances in Computers. M. Zelkowitz, Ed. Academic Press, 2002, 305--341.
|
| |
11
|
Zhang, Y. et al. Air- and bone-conductive integrated microphones for robust speech detection and enhancement. In Proceedings of the IEEE Workshop on Automatic Speech Recognition and Understanding. (St. Thomas, U.S. Virgin Islands, Dec, 2003.)
|
CITED BY 9
|
|
Lee Hoi Leong , Shinsuke Kobayashi , Noboru Koshizuka , Ken Sakamura, CASIS: a context-aware speech interface system, Proceedings of the 10th international conference on Intelligent user interfaces, January 10-13, 2005, San Diego, California, USA
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Maja Pantic , Alex Pentland , Anton Nijholt , Thomas Huang, Human computing and machine understanding of human behavior: a survey, Proceedings of the 8th international conference on Multimodal interfaces, November 02-04, 2006, Banff, Alberta, Canada
|
|
|
|
|
|
|
|
|
|
|