ABSTRACT
Users must enter a complex mix of spatial and abstract information when operating a graphic design application. Speech/language provides a fluid and natural method for specifying abstract information, while a spatial input device is often the most intuitive means of entering spatial information. A combined speech/gesture interface is therefore ideally suited to this application domain. While some research has been conducted on multimodal graphic design applications, advanced research on modality fusion has typically focused on map-related applications. This paper considers the particular demands of graphic design applications and the impact these demands have on the general strategies used to combine the speech and gesture channels. We also describe initial work on our own multimodal graphic design application (DPD), which uses these strategies.