Patterns of entry and correction in large vocabulary continuous speech recognition systems
Source: Conference on Human Factors in Computing Systems
Proceedings of the SIGCHI conference on Human factors in computing systems: the CHI is the limit
Pittsburgh, Pennsylvania, United States
Pages: 568 - 575
Year of Publication: 1999
ISBN: 0-201-48559-1
ABSTRACT
A study was conducted to evaluate user performance and
satisfaction in completion of a set of text creation tasks using
three commercially available continuous speech recognition systems.
The study also compared user performance on similar tasks using
keyboard input. One part of the study (Initial Use) involved 24
users who enrolled, received training and carried out practice
tasks, and then completed a set of transcription and composition
tasks in a single session. In a parallel effort (Extended Use),
four researchers used speech recognition to carry out real work
tasks over 10 sessions with each of the three speech recognition
software products. This paper presents results from the Initial Use
phase of the study along with some preliminary results from the
Extended Use phase. We present details of the kinds of usability
and system design problems likely in current systems and several
common patterns of error correction that we found.