ACM Home Page
Please provide us with feedback. Feedback
Modality fusion for graphic design applications
Full text PdfPdf (411 KB)
Source International Conference on Multimodal Interfaces archive
Proceedings of the 6th international conference on Multimodal interfaces table of contents
State College, PA, USA
POSTER SESSION: Poster session 1 table of contents
Pages: 167 - 174  
Year of Publication: 2004
ISBN:1-58113-995-0
Author
André D. Milota  MindGrip Consulting, Kensington, CA
Sponsors
SIGCHI: ACM Special Interest Group on Computer-Human Interaction
ACM: Association for Computing Machinery
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 11,   Downloads (12 Months): 43,   Citation Count: 1
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1027933.1027963
What is a DOI?

ABSTRACT

Users must enter a complex mix of spatial and abstract information when operating a graphic design application. Speech / language provides a fluid and natural method for specifying abstract information while a spatial input device is often most intuitive for the entry of spatial information. Thus, the combined speech / gesture interface is ideally suited to this application domain. While some research has been conducted on multimodal graphic design applications, advanced research on modality fusion has typically focused on map related applications. This paper considers the particular demands of graphic design applications and what impact these demands will have on the general strategies employed when combining the speech and gesture channels. We also describe initial work on our own multimodal graphic design application (DPD) which uses these strategies.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
2
3
4
 
5
6
 
7
 
8
 
9
 
10
Kaiser, C., and Cohen, P.R. Implementation testing of a hybrid symbolic/statistical multimodal architecture, In Proceedings of the International Conference on spoken language processing (Denver, September 2002,) pp. 173--176.
 
11
 
12
 
13
Nishimoto, T., Shida, N., Kobayashi T., and Shirai, K. Multimodal Drawing Tool Using Speech, Mouse and Key-Board. In Proceedings 1994 International Conference on Spoken Language Processing (ICSLP), vol. 3:1287--1290.
14
15
16
 
17
Pausch, R. and Leatherby, J.H. An Empirical Study: Adding Voice Input to a Graphical Editor. Journal of the American Voice Input/Output Society, 9:2, July, 1991, pp 55--66.
 
18
Pitel, G., Sansonnet, J. A differential representation of predicates for extensional reference resolution. In proceedings of ARQAS, (Venice 2003).
 
19
Rowe J. Speech recognition takes us a giant step closer to more natural interaction with CAD software Computer Graphics World February, 2001
 
20
Sato, T., and Tojo, A. recognition and understanding of hand-drawn diagrams. In Proceedings of 6th International Conference on pattern recognition. IEEE Computer Society Press, New Jersey, 1982.
21
 
22
 
23
Vo, T., and Wood, C. Building an application framework for speech and pen input integration in multimodal learning interfaces. In Proceedings of the International Conference on acoustics speech and signal processing (IEEE-ICAASP 1996), Vol. 6, 3545--3548, IEEE Press.
24