Challenges in adopting speech recognition
Source
Communications of the ACM archive
Volume 47, Issue 1 (January 2004)
SPECIAL ISSUE: Multimodal interfaces that flex, adapt, and persist
Pages: 69-75
Year of Publication: 2004
ISSN: 0001-0782
Authors
Li Deng, Microsoft Research, Redmond, WA
Xuedong Huang, Microsoft .NET Speech Technologies Group, Redmond, WA
Publisher
ACM, New York, NY, USA
DOI: http://doi.acm.org/10.1145/962081.962108

ABSTRACT

Although progress has been impressive, there are still several hurdles that speech recognition technology must clear before ubiquitous adoption can be realized. R&D in spontaneous and free-flowing speech style is critical to its success.


