| Speech and sketching for multimodal design |
| Full text |
Pdf
(141 KB)
|
| Source
|
International Conference on Intelligent User Interfaces
archive
Proceedings of the 9th international conference on Intelligent user interfaces
table of contents
Funchal, Madeira, Portugal
SESSION: Short Papers
table of contents
Pages: 214 - 216
Year of Publication: 2004
ISBN:1-58113-815-6
|
|
Authors
|
|
Aaron Adler
|
MIT Computer Science and Artificial Intelligence Laboratory, Cambridge, MA
|
|
Randall Davis
|
MIT Computer Science and Artificial Intelligence Laboratory, Cambridge, MA
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 8, Downloads (12 Months): 34, Citation Count: 12
|
|
|
ABSTRACT
While sketches are commonly and effectively used in the early stages of design, some information is far more easily conveyed verbally than by sketching. In response, we have combined sketching with speech, enabling a more natural form of communication. We studied the behavior of people sketching and speaking, and from this derived a set of rules for segmenting and aligning the signals from both modalities. Once the inputs are aligned, we use both modalities in interpretation. The result is a more natural interface to our system.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
A. Adler. Segmentation and Alignment of Speech and Sketching in a Design Environment. Master's Thesis, Massachusetts Institute of Technology, 2003.
|
| |
2
|
C. Alvarado and R. Davis. Resolving ambiguities to create a natural computer-based sketching environment. In Proceedings of the Seventeenth International Joint Conference on Artificial Intelligence, pages 1365--1374, 2001.
|
 |
3
|
Kenneth D. Forbus , Ronald W. Ferguson , Jeffery M. Usher, Towards a computational model of sketching, Proceedings of the 6th international conference on Intelligent user interfaces, p.77-83, January 14-17, 2001, Santa Fe, New Mexico, United States
[doi> 10.1145/359784.360278]
|
| |
4
|
T. J. Hazen, S. Seneff, and J. Polifroni. Recognition confidence scoring and its use in speech understanding systems. Computer Speech and Language, 16:49--67, 2002.
|
| |
5
|
Michael Johnston , Srinivas Bangalore , Gunaranjan Vasireddy , Amanda Stent , Patrick Ehlen , Marilyn Walker , Steve Whittaker , Preetam Maloor, MATCH: an architecture for multimodal dialogue systems, Proceedings of the 40th Annual Meeting on Association for Computational Linguistics, July 07-12, 2002, Philadelphia, Pennsylvania
[doi> 10.3115/1073083.1073146]
|
| |
6
|
M. Oltmans. Understanding Naturally Conveyed Explanations of Device Behavior. Master's Thesis, Massachusetts Institute of Technology, 2001.
|
| |
7
|
S. Oviatt, P. Cohen, L. Wu, J. Vergo, L. Duncan, B. Suhm, J. Bers, T. Holzman, T. Winograd, J. Landay, J. Larson, and D. Ferro. Designing the user interface for multimodal speech and pen-based gesture applications: State-of-the-art systems and future research directions. Human Computer Interaction, 15(4):263--322, 2000.
|
CITED BY 12
|
|
|
|
|
Richard Anderson , Crystal Hoyer , Craig Prince , Jonathan Su , Fred Videon , Steve Wolfman, Speech, ink, and slides: the interaction of content channels, Proceedings of the 12th annual ACM international conference on Multimedia, October 10-16, 2004, New York, NY, USA
|
|
|
|
|
|
|
|
|
|
|
|
Christine Alvarado , Ned Burns , Howard Chen , Jason Fennell , Sarah Harris , Max Pfleuger , Devin Smith , Paul Wais , Matt Weiner , Aaron Wolin, Sketch understanding systems, ACM SIGGRAPH 2007 courses, August 05-09, 2007, San Diego, California
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Diane J. Cook , Juan C. Augusto , Vikramaditya R. Jakkula, Review: Ambient intelligence: Technologies, applications, and opportunities, Pervasive and Mobile Computing, v.5 n.4, p.277-298, August, 2009
|
INDEX TERMS
Primary Classification:
H.
Information Systems
H.5
INFORMATION INTERFACES AND PRESENTATION (I.7)
H.5.2
User Interfaces (D.2.2, H.1.2, I.3.6)
Subjects:
Natural language
Additional Classification:
H.
Information Systems
H.5
INFORMATION INTERFACES AND PRESENTATION (I.7)
H.5.2
User Interfaces (D.2.2, H.1.2, I.3.6)
Subjects:
Voice I/O;
Evaluation/methodology;
Interaction styles (e.g., commands, menus, forms, direct manipulation);
Input devices and strategies (e.g., mouse, touchscreen);
User-centered design;
Graphical user interfaces (GUI)
General Terms:
Design,
Experimentation,
Human Factors,
Performance
Keywords:
multimodal interaction,
sketch,
speech
|