ACM Home Page
Please provide us with feedback. Feedback
Putting people first: specifying proper names in speech interfaces
Full text PdfPdf (936 KB)
Source Symposium on User Interface Software and Technology archive
Proceedings of the 7th annual ACM symposium on User interface software and technology table of contents
Marina del Rey, California, United States
Pages: 29 - 37  
Year of Publication: 1994
ISBN:0-89791-657-3
Authors
Matt Marx  Speech Research Group, MIT Media Laboratory, 20 Ames St., Cambridge, MA
Chris Schmandt  Speech Research Group, MIT Media Laboratory, 20 Ames St., Cambridge, MA
Sponsors
SIGGRAPH: ACM Special Interest Group on Computer Graphics and Interactive Techniques
SIGSOFT: ACM Special Interest Group on Software Engineering
SIGCHI: ACM Special Interest Group on Computer-Human Interaction
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 2,   Downloads (12 Months): 13,   Citation Count: 8
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/192426.192439
What is a DOI?

ABSTRACT

Communication is about people, not machines. But as firms and families alike spread out geographically, we rely increasingly on telecommunications tools to keep us “connected”. The challenge of such systems is to enable conversation between individuals without computational infrastructure getting in the way. This paper compares two speech-based communication systems, Phoneshell and Chatter, in how they deal with the keys to communication: proper names. Chatter, a conversational system using speech-recognition, improves upon the hierarchical nature of the touch-tone based Phoneshell by maintaining context and enabling use of anaphora. Proper names can present particular problems for speech recognizers, so an interface algorithm for reliable name specification by spelling is offered. Since individual letter recognition is non-robust, Chatter implicitly disambiguates strings of letters based on context. We hypothesize that the right interface can make faulty speech recognition as usable as TouchTones—even more so.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
Bach, E. Informal Lectures on Formal Semantics, State University of New York Press, p. 97.
2
 
3
Davis, J. "Let Your Fingers Do the Spelling' Implicit disambiguation of words spelled with the telephone keypad" in proceedings of the American Voice Input/Output Society, 1990.
 
4
 
5
Ly, E. "Chatter: A Conversational Learning Speech Interface" in proceedings of AAAI Spring Symposium on Intelligent Multi-Media Multi-Modal Systems, March 1994.
6
 
7
Spiegel, M. "Pronouncing Surnames Automatically", In Proceedings of the 1985 Conference. San Jose, CA: American Voice I/O Society, September 1985.
 
8

CITED BY  8

Collaborative Colleagues:
Matt Marx: colleagues
Chris Schmandt: colleagues