VoiceNotes: a speech interface for a hand-held voice notetaker
Source
Conference on Human Factors in Computing Systems
Proceedings of the INTERACT '93 and CHI '93 conference on Human factors in computing systems
Amsterdam, The Netherlands
Pages: 179-186
Year of Publication: 1993
ISBN: 0-89791-575-5
Authors
Lisa J. Stifelman: Speech Research Group, MIT Media Lab, 20 Ames Street, Cambridge, MA and Human Interface Group/ATG, Apple Computer, Inc., 20525 Mariani Ave., MS 301-3H, Cupertino, CA
Barry Arons: Speech Research Group, MIT Media Lab, 20 Ames Street, Cambridge, MA
Chris Schmandt: Speech Research Group, MIT Media Lab, 20 Ames Street, Cambridge, MA
Eric A. Hulteen: Human Interface Group/ATG, Apple Computer, Inc., 20525 Mariani Ave., MS 301-3H, Cupertino, CA
Bibliometrics
Downloads (6 Weeks): 11, Downloads (12 Months): 58, Citation Count: 30
ABSTRACT
VoiceNotes is an application for a voice-controlled hand-held computer that allows the creation, management, and retrieval of user-authored voice notes—small segments of digitized speech containing thoughts, ideas, reminders, or things to do. Iterative design and user testing helped to refine the initial user interface design. VoiceNotes explores the problem of capturing and retrieving spontaneous ideas, the use of speech as data, and the use of speech input and output in the user interface for a hand-held computer without a visual display. In addition, VoiceNotes serves as a step toward new uses of voice technology and interfaces for future portable devices.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
2. Arons, B. Techniques, perception, and applications of time-compressed speech. In Proceedings of AVIOS '92, pp. 169-177. American Voice I/O Society, 1992.
3. Barbara L. Chalfonte, Robert S. Fish, Robert E. Kraut, Expressive richness: a comparison of speech and text as media for revision, Proceedings of the SIGCHI conference on Human factors in computing systems: Reaching through technology, p.21-26, April 27-May 02, 1991, New Orleans, Louisiana, United States. doi: 10.1145/108844.108848
4. Cypher, A. The structure of users' activities. In Norman, D.A. and Draper, S.W., editors, User Centered System Design, chapter 12, pp. 243-263. Lawrence Erlbaum Associates, 1986.
5. Leo Degen, Richard Mander, Gitta Salomon, Working with audio: integrating personal tape recorders and desktop computers, Proceedings of the SIGCHI conference on Human factors in computing systems, p.413-418, May 03-07, 1992, Monterey, California, United States. doi: 10.1145/142750.142877
6. Ericsson, K.A. and Simon, H.A. Protocol Analysis. The MIT Press, 1984.
7. Gaver, W.W. The SonicFinder: An interface that uses auditory icons. Human-Computer Interaction, 4(1):67-94, 1989.
8. Gould, J.D. An experimental study of writing, dictating, and speaking. In Requin, J., editor, Attention & Performance VII, pp. 299-319. Lawrence Erlbaum, 1978.
9. Hayes, P.J. and Reddy, D.R. Steps toward graceful interaction in spoken and written man-machine communication. International Journal of Man-Machine Studies, 19:231-284, 1983.
10. Heiman, G.W., Leo, R.J., Leighbody, G. and Bowler, K. Word intelligibility decrements and the comprehension of time-compressed speech. Perception and Psychophysics, 40(6):407-411, 1986.
15. Stifelman, L.J. VoiceNotes: An application for a voice-controlled hand-held computer. Master's Thesis. Massachusetts Institute of Technology, 1992.
16. Voor, J.B. and Miller, J.M. The effect of practice upon the comprehension of time-compressed speech. Speech Monographs, 32:452-455, 1965.
17. Waterworth, J.A. Interaction with machines by voice: A telecommunications perspective. Behaviour and Information Technology, 3(2):163-177, 1984.
CITED BY 30
T. Imai, K. Yamaguchi, T. Muranaga, Hypermedia conversation recording to preserve informal artifacts in realtime collaboration, Proceedings of the second ACM international conference on Multimedia, p.417-424, October 15-20, 1994, San Francisco, California, United States
Dan R. Olsen, Jr., Sean Jefferies, Travis Nielsen, William Moyes, Paul Fredrickson, Cross-modal interaction using XWeb, Proceedings of the 13th annual ACM symposium on User interface software and technology, p.191-200, November 06-08, 2000, San Diego, California, United States
Nosa Omoigui, Liwei He, Anoop Gupta, Jonathan Grudin, Elizabeth Sanocki, Time-compression: systems concerns, usage, and benefits, Proceedings of the SIGCHI conference on Human factors in computing systems: the CHI is the limit, p.136-143, May 15-20, 1999, Pittsburgh, Pennsylvania, United States
Liwei He, Elizabeth Sanocki, Anoop Gupta, Jonathan Grudin, Comparing presentation summaries: slides vs. reading vs. listening, Proceedings of the SIGCHI conference on Human factors in computing systems, p.177-184, April 01-06, 2000, The Hague, The Netherlands
Scott R. Klemmer, Anoop K. Sinha, Jack Chen, James A. Landay, Nadeem Aboobaker, Annie Wang, Suede: a Wizard of Oz prototyping tool for speech user interfaces, Proceedings of the 13th annual ACM symposium on User interface software and technology, p.1-10, November 06-08, 2000, San Diego, California, United States
Liwei He, Elizabeth Sanocki, Anoop Gupta, Jonathan Grudin, Auto-summarization of audio-video presentations, Proceedings of the seventh ACM international conference on Multimedia (Part 1), p.489-498, October 30-November 05, 1999, Orlando, Florida, United States
Peter J. Thomas, John F. Meech, Robert D. Macredie, A framework for the development of information appliances, Proceedings of the 1995 ACM symposium on Applied computing, p.396-400, February 26-28, 1995, Nashville, Tennessee, United States
William M. Newman, Better or just different? On the benefits of designing interactive systems in terms of critical parameters, Proceedings of the conference on Designing interactive systems: processes, practices, methods, and techniques, p.239-245, August 18-20, 1997, Amsterdam, The Netherlands
Steve Whittaker, Patrick Hyland, Myrtle Wiley, FILOCHAT: handwritten notes provide access to recorded conversations, Proceedings of the SIGCHI conference on Human factors in computing systems: celebrating interdependence, p.271-277, April 24-28, 1994, Boston, Massachusetts, United States
Nicole Yankelovich, Gina-Anne Levow, Matt Marx, Designing SpeechActs: issues in speech user interfaces, Proceedings of the SIGCHI conference on Human factors in computing systems, p.369-376, May 07-11, 1995, Denver, Colorado, United States
Hirotaka Ueda, Masahiko Tsukamoto, Shojiro Nishio, W-mail: an electronic mail system for wearable computing environments, Proceedings of the 6th annual international conference on Mobile computing and networking, p.284-291, August 06-11, 2000, Boston, Massachusetts, United States
Kent Lyons, Christopher Skeels, Thad Starner, Cornelis M. Snoeck, Benjamin A. Wong, Daniel Ashbrook, Augmenting conversations using dual-purpose speech, Proceedings of the 17th annual ACM symposium on User interface software and technology, October 24-27, 2004, Santa Fe, NM, USA
Chris Schmandt, Kwan Hong Lee, Jang Kim, Mark Ackerman, Impromptu: managing networked audio applications for mobile users, Proceedings of the 2nd international conference on Mobile systems, applications, and services, June 06-09, 2004, Boston, MA, USA
Shengdong Zhao, Pierre Dragicevic, Mark Chignell, Ravin Balakrishnan, Patrick Baudisch, Earpod: eyes-free menu selection using touch input and reactive audio feedback, Proceedings of the SIGCHI conference on Human factors in computing systems, April 28-May 03, 2007, San Jose, California, USA
INDEX TERMS
Primary Classification:
I. Computing Methodologies
I.2 ARTIFICIAL INTELLIGENCE
I.2.1 Applications and Expert Systems
Subjects: Natural language interfaces
Additional Classification:
B. Hardware
B.4 INPUT/OUTPUT AND DATA COMMUNICATIONS
B.4.2 Input/Output Devices
Subjects: Voice
H. Information Systems
H.1 MODELS AND PRINCIPLES
H.1.2 User/Machine Systems
Subjects: Human factors
H.5 INFORMATION INTERFACES AND PRESENTATION (I.7)
H.5.2 User Interfaces (D.2.2, H.1.2, I.3.6)
Subjects: Interaction styles (e.g., commands, menus, forms, direct manipulation)
General Terms: Design, Human Factors
Keywords: hand-held computers, non-speech audio, speech as data, speech interfaces, speech recognition