Taming recognition errors with a multimodal interface
Full text: HTML (36 KB), PDF (494 KB)
Source
Communications of the ACM archive
Volume 43, Issue 9 (September 2000)
Pages: 45-51
Year of Publication: 2000
ISSN: 0001-0782
Author
Sharon Oviatt, Oregon Graduate Institute of Science and Technology, Beaverton
Publisher
ACM, New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 16,   Downloads (12 Months): 139,   Citation Count: 34
DOI: http://doi.acm.org/10.1145/348941.348979

REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
 
2
 
3
Junqua, J. The Lombard reflex and its role on human listeners and automatic speech recognizers. J. Acoustic. Soc. Amer. 93, 1 (1993), 510-524.
 
4
Martin, A., Fiscus, J., Fisher, B., Pallet, D., and Przybocki, M. System descriptions and performance summary. In Proceedings of the Conversational Speech Recognition Workshop/DARPA Hub-5E Evaluation (Johns Hopkins University, Baltimore, 1997).
 
5
Oviatt, S. Multimodal system processing in mobile environments. In Proceedings of the International Conference on Spoken Language Processing (Beijing, Oct. 2000).
6
 
7
Oviatt, S., Cohen, P., Wu, L., Vergo, J., Duncan, L., Suhm, B., Bers, J., Holzman, T., Winograd, T., Landay, J., Larson, J., and Ferro, D. Designing the user interface for multimodal speech and gesture applications: State-of-the-art systems and research directions. Human Computer Interaction, in press. To be reprinted in Human Computer Interaction in the New Millennium, J. Carroll, Ed. Addison-Wesley Press, Boston, in press.
 
8
 
9
Pick, H., Siegel, G., Fox, P., Garber, S., and Kearney, J. Inhibiting the Lombard effect. J. Acoustic. Soc. Amer. 85, 2 (1989), 894-900.
 
10
Rubin, P., Vatikiotis-Bateson, E., and Benoit, C., Eds. Special issue on audio-visual speech processing. Speech Commun. 26, 1-2 (1998), 1-2.
 
11
Weintraub, M., Taussig, K., Hunicke, K., and Snodgrass, A. Effect of speaking style on LVCSR performance. In Proceedings of the International Conference on Spoken Language Processing (Philadelphia, 1996), 16-19.
 
12
Wu, L., Oviatt, S., and Cohen, P. Multimodal integration: A statistical view. IEEE Transact. Multimedia 1, 4 (1999), 334-342.

CITED BY  34


REVIEW

Reviewer: Graeme J. Hirst

Contemporary speech recognition systems are still prone to much error, especially when dealing with difficult conditions, such as the background noise of public places or the accents of nonnative speakers. Analogous problems occur with other r...