|
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
 |
1
|
Philip R. Cohen , Michael Johnston , David McGee , Sharon Oviatt , Jay Pittman , Ira Smith , Liang Chen , Josh Clow, QuickSet: multimodal interaction for distributed applications, Proceedings of the fifth ACM international conference on Multimedia, p.31-40, November 09-13, 1997, Seattle, Washington, United States
[doi> 10.1145/266180.266328]
|
| |
2
|
|
| |
3
|
Junqua, J. The Lombard reflex and its role on human listeners and automatic speech recognizers. J. Acoustic. Soc. Amer. 93, 1 (1993), 510-524.
|
| |
4
|
Martin, A., Fiscus, J., Fisher, B., Pallet, D., and Przybocki, M. System descriptions and performance summary. In Proceedings of the Conversational Speech Recognition Workshop/DARPA Hub-5E Evaluation 0ohns Hopkins University, Baltimore, 1997).
|
| |
5
|
Oviatt, S. Multimodal system processing in mobile environments. In Proceedings of the International Conference on Spoken Language Proc essing (Beijing, Oct. 2000).
|
 |
6
|
|
| |
7
|
Oviatt, S., Cohen, P., Wu, L., Vergo, J., Duncan, L., Suhm, B., Bers, J., Holzman, T., Winograd, T., Landay, J., Larson, J., and Ferro, D. Designing the user interface for multimodal speech and gesture applications: State-of-the-art systems and research directions. Human Computer Interaction, in press. To be reprinted in Human Computer Interaction in the New Millennium, J. Carroll, Ed. Addison-Wesley Press, Boston, in press.
|
| |
8
|
|
| |
9
|
Pick, H., Siegel, G., Fox, P., Garber, S., and Kearney, J. Inhibiting the Lombard effect. J. Acoustic. Soc. Amer. 85, 2 (1989), 894-900.
|
| |
10
|
Rubin, P., Vatikiotis-Bateson, E., and Benoit, C., Eds. Special issue on audio-visual speech processing. Speech Commun. 26, 1-2 (1998), 1-2.
|
| |
11
|
Weintraub, M., Taussig, K., Hunicke, K., and Snodgrass, A. Effect of speaking style on LVCSR performance. In Proceedings ofthe International Conference on Spoken Language Processing (Philadelphia, 1996), 16-19.
|
| |
12
|
Wu, L., Oviatt, S., and Cohen, P. Multimodal integration: A statistical view. IEEE Transact. Multimedia i, 4 (1999), 334-342.
|
CITED BY 34
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Jeffrey Heer , Nathaniel S. Good , Ana Ramirez , Marc Davis , Jennifer Mankoff, Presiding over accidents: system direction of human action, Proceedings of the SIGCHI conference on Human factors in computing systems, p.463-470, April 24-29, 2004, Vienna, Austria
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Xugang Wang , Junfeng Li , Xiang Ao , Gang Wang , Guozhong Dai, Multimodal error correction for continuous handwriting recognition in pen-based user interfaces, Proceedings of the 11th international conference on Intelligent user interfaces, January 29-February 01, 2006, Sydney, Australia
|
|
|
|
|
|
|
|
|
|
|
|
Xiang Ao , Xugang Wang , Feng Tian , Guozhong Dai , Hongan Wang, Crossmodal error dorrection of continuous handwriting recognition by speech, Proceedings of the 12th international conference on Intelligent user interfaces, January 28-31, 2007, Honolulu, Hawaii, USA
|
|
|
|
|
|
|
|
|
Chreston Miller , Ashley Robinson , Rongrong Wang , Pak Chung , Francis Quek, Interaction techniques for the analysis of complex data on high-resolution displays, Proceedings of the 10th international conference on Multimodal interfaces, October 20-22, 2008, Chania, Crete, Greece
|
|
|
|
|
|
Sharon Oviatt , Phil Cohen , Lizhong Wu , John Vergo , Lisbeth Duncan , Bernhard Suhm , Josh Bers , Thomas Holzman , Terry Winograd , James Landay , Jim Larson , David Ferro, Designing the user interface for multimodal speech and pen-based gesture applications: state-of-the-art systems and future research directions, Human-Computer Interaction, v.15 n.4, p.263-322, December 2000
|
|
|
|
|
|
Andrew Sears , Jinjuan Feng , Kwesi Oseitutu , Clare-Marie Karat, Hands-free, speech-based navigation during dictation: difficulties, consequences, and solutions, Human-Computer Interaction, v.18 n.3, p.229-257, September 2003
|
|
|
|
|
|
|
|
|
|
|
|
Tim Paek , Bo Thiesson , Yun-Cheng Ju , Bongshin Lee, Search Vox: leveraging multimodal refinement and partial knowledge for mobile voice search, Proceedings of the 21st annual ACM symposium on User interface software and technology, October 19-22, 2008, Monterey, CA, USA
|
|
|
|
|
|
|
|
|
Americo Talarico Neto , Thiago Jabur Bittar , Renata Pontin , Mattos Fortes , Kátia Felizardo, Abordagem para o desenvolvimento e avaliação de interfaces multimodais web pautada em princípios de usabilidade, Proceedings of the VIII Brazilian Symposium on Human Factors in Computing Systems, October 21-24, 2008, Porto Alegre, RS, Brazil
|
|
|
|
|
|
|
|
|
|
|
|
|
REVIEW
"Graeme J. Hirst : Reviewer"
Contemporary speech recognition systems are still prone to much
error, especially when dealing with difficult conditions, such as the
background noise of public places or the accents of nonnative speakers.
Analogous problems occur with other r
more...
|