| Putting people first: specifying proper names in speech interfaces |
| Full text |
Pdf
(936 KB)
|
| Source
|
Symposium on User Interface Software and Technology
archive
Proceedings of the 7th annual ACM symposium on User interface software and technology
table of contents
Marina del Rey, California, United States
Pages: 29 - 37
Year of Publication: 1994
ISBN:0-89791-657-3
|
|
Authors
|
|
Matt Marx
|
Speech Research Group, MIT Media Laboratory, 20 Ames St., Cambridge, MA
|
|
Chris Schmandt
|
Speech Research Group, MIT Media Laboratory, 20 Ames St., Cambridge, MA
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 4, Downloads (12 Months): 17, Citation Count: 8
|
|
|
ABSTRACT
Communication is about people, not machines. But as firms and families alike spread out geographically, we rely increasingly on telecommunications tools to keep us “connected”. The challenge of such systems is to enable conversation between individuals without computational infrastructure getting in the way. This paper compares two speech-based communication systems, Phoneshell and Chatter, in how they deal with the keys to communication: proper names. Chatter, a conversational system using speech-recognition, improves upon the hierarchical nature of the touch-tone based Phoneshell by maintaining context and enabling use of anaphora. Proper names can present particular problems for speech recognizers, so an interface algorithm for reliable name specification by spelling is offered. Since individual letter recognition is non-robust, Chatter implicitly disambiguates strings of letters based on context. We hypothesize that the right interface can make faulty speech recognition as usable as TouchTones—even more so.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
Bach, E. Informal Lectures on Formal Semantics, State University of New York Press, p. 97.
|
 |
2
|
Barbara L. Chalfonte , Robert S. Fish , Robert E. Kraut, Expressive richness: a comparison of speech and text as media for revision, Proceedings of the SIGCHI conference on Human factors in computing systems: Reaching through technology, p.21-26, April 27-May 02, 1991, New Orleans, Louisiana, United States
[doi> 10.1145/108844.108848]
|
| |
3
|
Davis, J. "Let Your Fingers Do the Spelling' Implicit disambiguation of words spelled with the telephone keypad" in proceedings of the American Voice Input/Output Society, 1990.
|
| |
4
|
|
| |
5
|
Ly, E. "Chatter: A Conversational Learning Speech Interface" in proceedings of AAAI Spring Symposium on Intelligent Multi-Media Multi-Modal Systems, March 1994.
|
 |
6
|
|
| |
7
|
Spiegel, M. "Pronouncing Surnames Automatically", In Proceedings of the 1985 Conference. San Jose, CA: American Voice I/O Society, September 1985.
|
| |
8
|
|
CITED BY 8
|
|
|
|
|
Jennifer Mankoff , Scott E. Hudson , Gregory D. Abowd, Interaction techniques for ambiguity resolution in recognition-based interfaces, Proceedings of the 13th annual ACM symposium on User interface software and technology, p.11-20, November 06-08, 2000, San Diego, California, United States
|
|
|
Dan R. Olsen, Jr. , Sean Jefferies , Travis Nielsen , William Moyes , Paul Fredrickson, Cross-modal interaction using XWeb, Proceedings of the 13th annual ACM symposium on User interface software and technology, p.191-200, November 06-08, 2000, San Diego, California, United States
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
INDEX TERMS
Primary Classification:
I.
Computing Methodologies
I.2
ARTIFICIAL INTELLIGENCE
I.2.7
Natural Language Processing
Subjects:
Speech recognition and synthesis
Additional Classification:
C.
Computer Systems Organization
C.2
COMPUTER-COMMUNICATION NETWORKS
C.2.0
General
Subjects:
Data communications
H.
Information Systems
H.4
INFORMATION SYSTEMS APPLICATIONS
H.4.3
Communications Applications
Subjects:
Electronic mail
H.5
INFORMATION INTERFACES AND PRESENTATION (I.7)
H.5.2
User Interfaces (D.2.2, H.1.2, I.3.6)
Subjects:
Interaction styles (e.g., commands, menus, forms, direct manipulation)
General Terms:
Algorithms,
Design,
Human Factors,
Languages
Keywords:
conversational systems,
error-repair,
speech recognition,
user interface
|