ACM Home Page
Please provide us with feedback. Feedback
Speech and sketching for multimodal design
Full text PdfPdf (141 KB)
Source International Conference on Intelligent User Interfaces archive
Proceedings of the 9th international conference on Intelligent user interfaces table of contents
Funchal, Madeira, Portugal
SESSION: Short Papers table of contents
Pages: 214 - 216  
Year of Publication: 2004
ISBN:1-58113-815-6
Authors
Aaron Adler  MIT Computer Science and Artificial Intelligence Laboratory, Cambridge, MA
Randall Davis  MIT Computer Science and Artificial Intelligence Laboratory, Cambridge, MA
Sponsors
SIGCHI: ACM Special Interest Group on Computer-Human Interaction
ACM: Association for Computing Machinery
SIGART: ACM Special Interest Group on Artificial Intelligence
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 8,   Downloads (12 Months): 34,   Citation Count: 12
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/964442.964483
What is a DOI?

ABSTRACT

While sketches are commonly and effectively used in the early stages of design, some information is far more easily conveyed verbally than by sketching. In response, we have combined sketching with speech, enabling a more natural form of communication. We studied the behavior of people sketching and speaking, and from this derived a set of rules for segmenting and aligning the signals from both modalities. Once the inputs are aligned, we use both modalities in interpretation. The result is a more natural interface to our system.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
A. Adler. Segmentation and Alignment of Speech and Sketching in a Design Environment. Master's Thesis, Massachusetts Institute of Technology, 2003.
 
2
C. Alvarado and R. Davis. Resolving ambiguities to create a natural computer-based sketching environment. In Proceedings of the Seventeenth International Joint Conference on Artificial Intelligence, pages 1365--1374, 2001.
3
 
4
T. J. Hazen, S. Seneff, and J. Polifroni. Recognition confidence scoring and its use in speech understanding systems. Computer Speech and Language, 16:49--67, 2002.
 
5
 
6
M. Oltmans. Understanding Naturally Conveyed Explanations of Device Behavior. Master's Thesis, Massachusetts Institute of Technology, 2001.
 
7
S. Oviatt, P. Cohen, L. Wu, J. Vergo, L. Duncan, B. Suhm, J. Bers, T. Holzman, T. Winograd, J. Landay, J. Larson, and D. Ferro. Designing the user interface for multimodal speech and pen-based gesture applications: State-of-the-art systems and future research directions. Human Computer Interaction, 15(4):263--322, 2000.

CITED BY  12

Collaborative Colleagues:
Aaron Adler: colleagues
Randall Davis: colleagues