ABSTRACT
Multimodal interaction enables the user to employ different modalities, such as voice, gesture, and typing, for communicating with a computer. This paper analyzes the integration of multiple communication modalities within an interactive system from a software engineering perspective. First, the notion of “multimodal system” is clarified. We aim to show that two main features of a multimodal system are the concurrency of processing and the fusion of input/output data. On the basis of these two features, we then propose a design space and a method for classifying multimodal systems. In the last section, we present a software architecture model of multimodal systems that supports these two salient properties. Two multimodal systems developed by our team, VoicePaint and NoteBook, are used to illustrate the discussion.
CITED BY 13
Alice Smith , John Dunaway , Patrick Demasco , Denise Peischl, Multimodal input for computer access and augmentative communication, Proceedings of the second annual ACM conference on Assistive technologies, p.80-85, April 11-12, 1996, Vancouver, British Columbia, Canada
INDEX TERMS
Primary Classification:
H. Information Systems
H.5 INFORMATION INTERFACES AND PRESENTATION (I.7)
H.5.2 User Interfaces (D.2.2, H.1.2, I.3.6)
Subjects: Interaction styles (e.g., commands, menus, forms, direct manipulation)
Additional Classification:
I. Computing Methodologies
I.2 ARTIFICIAL INTELLIGENCE
I.2.1 Applications and Expert Systems
Subjects: Natural language interfaces
I.2.7 Natural Language Processing
Subjects: Speech recognition and synthesis
I.3 COMPUTER GRAPHICS
I.3.4 Graphics Utilities
Subjects: Graphics editors
General Terms: Algorithms, Design, Human Factors
Keywords: concurrency, data fusion, design space, modality, multimodal interaction, software architecture, taxonomy