|
ABSTRACT
This article presents a User Interface (UI) framework for multimodal interactions targeted at immersive virtual environments. Its configurable input and gesture processing components provide an advanced behavior graph capable of routing continuous data streams asynchronously. The framework introduces a Knowledge Representation Layer which augments objects of the simulated environment with Semantic Entities as a central object model that bridges and interfaces Virtual Reality (VR) and Artificial Intelligence (AI) representations. Specialized node types use these facilities to implement required processing tasks like gesture detection, preprocessing of the visual scene for multimodal integration, or translation of movements into multimodally initialized gestural interactions. A modified Augmented Transition Nettwork (ATN) approach accesses the knowledge layer as well as the preprocessing components to integrate linguistic, gestural, and context information in parallel. The overall framework emphasizes extensibility, adaptivity and reusability, e.g., by utilizing persistent and interchangeable XML-based formats to describe its processing stages.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
 |
1
|
Frank Althoff , Gregor McGlaun , Björn Schuller , Peter Morguet , Manfred Lang, Using multimodal interaction to navigate in arbitrary virtual VRML worlds, Proceedings of the 2001 workshop on Perceptive user interfaces, November 15-16, 2001, Orlando, Florida
[doi> 10.1145/971478.971494]
|
| |
2
|
|
| |
3
|
K. Böhm, W. Hübner, and K. Väänänen. Given: Gesture driven interactions in virtual environments; a toolkit approach to 3D interactions. In Interfaces to Real and Virtual Worlds, 1992.
|
 |
4
|
|
| |
5
|
R. Carey, G. Bell, and C. Marrin. ISO/IEC 14772-1:1997 virtual reality modeling language (VRML). Technical report, The VRML Consortium Incorporated, 1997.
|
| |
6
|
M. Cavazza, X. Pouteau, and D. Pernel. Multimodal communication in virtual environments. In Symbiosis of Human and Artifact, pages 597--604. Elsevier Science B. V., 1995.
|
| |
7
|
Philip Cohen , David McGee , Sharon Oviatt , Lizhong Wu , Joshua Clow , Robert King , Simon Julier , Lawrence Rosenblum, Multimodal Interaction for 2D and 3D Environments, IEEE Computer Graphics and Applications, v.19 n.4, p.10-13, July 1999
[doi> 10.1109/38.773958]
|
| |
8
|
|
| |
9
|
|
| |
10
|
|
| |
11
|
M. Johnston and S. Bangalore. Finite-state methods for multimodal parsing and integration. In Finite-state Methods Workshop, ESSLLI Summer School on Logic Language and Information,Helsinki, Finland, august 2001.
|
| |
12
|
Michael Johnston , Philip R. Cohen , David McGee , Sharon L. Oviatt , James A. Pittman , Ira Smith, Unification-based multimodal integration, Proceedings of the eighth conference on European chapter of the Association for Computational Linguistics, p.281-288, July 07-12, 1997, Madrid, Spain
|
 |
13
|
Ed Kaiser , Alex Olwal , David McGee , Hrvoje Benko , Andrea Corradini , Xiaoguang Li , Phil Cohen , Steven Feiner, Mutual disambiguation of 3D multimodal interaction in augmented and virtual reality, Proceedings of the 5th international conference on Multimodal interfaces, November 05-07, 2003, Vancouver, British Columbia, Canada
[doi> 10.1145/958432.958438]
|
| |
14
|
David B. Koons , Carlton J. Sparrell , Kristinn R. Thorisson, Integrating simultaneous input from speech, gaze, and hand gestures, Intelligent multimedia interfaces, American Association for Artificial Intelligence, Menlo Park, CA, 1993
|
| |
15
|
|
 |
16
|
|
| |
17
|
|
| |
18
|
M. E. Latoschik and M. Schilling. Incorporating VR Databases into AI Knowledge Representations: A Framework for Intelligent Graphics Applications. In Proceedings of the Sixth International Conference on Computer Graphics and Imaging. IASTED, ACTA Press, 2003.
|
| |
19
|
B. Lenzmann. Benutzeradaptive und multimodale Interface-Agenten. PhD thesis, Technische Fakultät, Universität Bielefeld, 1998.
|
| |
20
|
M. Lucente, G.-J. Zwart, and A. D. George. Visualization space: A testbed for deviceless multimodal user interface. In Intelligent Environments Symposium, American Assoc. for Artificial Intelligence Spring Symposium Series, Mar. 1998.
|
| |
21
|
|
 |
22
|
|
| |
23
|
|
| |
24
|
|
| |
25
|
C. J. Sparrell and D. B. Koons. Interpretation of coverbal depictive gestures. In AAAI Spring Symposium Series, pages 8--12. Stanford University, March 1994.
|
 |
26
|
|
 |
27
|
|
| |
28
|
|
| |
29
|
|
| |
30
|
M. Vo and C. Wood. Building an application framework for speech and pen input integration in multimodal learning interfaces. In Proceedings of International Conference on Acoustics, Speech, and Signal Processing, 1996.
|
| |
31
|
|
CITED BY 6
|
|
Yong Sun , Fang Chen , Yu (David) Shi , Vera Chung, A novel method for multi-sensory data fusion in multimodal human computer interaction, Proceedings of the 20th conference of the computer-human interaction special interest group (CHISIG) of Australia on Computer-human interaction: design: activities, artefacts and environments, November 20-24, 2006, Sydney, Australia
|
|
|
|
|
|
Yong Sun , Yu Shi , Fang Chen , Vera Chung, An efficient unification-based multimodal language processor in multimodal input fusion, Proceedings of the 2007 conference of the computer-human interaction special interest group (CHISIG) of Australia on Computer-human interaction: design: activities, artifacts and environments, November 28-30, 2007, Adelaide, Australia
|
|
|
Konstantinos Moustakas , Dimitrios Tzovaras , Michael G. Strintzis , Sebastien Carbini , Olivier Bernier , Jean Emmanuel Viallet , Stephan Raidt , Matei Mancas , Mariella Dimiccoli , Enver Yagci , Serdar Balci , Eloisa Ibanez Leon, Masterpiece: Physical Interaction and 3D Content-Based Search in VR Applications, IEEE MultiMedia, v.13 n.3, p.92-100, July 2006
|
|
|
Corey Manders , Farzam Farbiz , Chong Jyh Herng , Tang Ka Yin, A 3D interactive kiosk system, Proceedings of the 5th international conference on Computer graphics, virtual reality, visualisation and interaction in Africa, October 29-31, 2007, Grahamstown, South Africa
|
|
|
Konstantinos Moustakas , Dimitrios Tzovaras , Michael G. Strintzis , Sebastien Carbini , Olivier Bernier , Jean Emmanuel Viallet , Stephan Raidt , Matei Mancas , Mariella Dimiccoli , Enver Yagci , Serdar Balci , Eloisa Ibanez Leon, Masterpiece: Physical Interaction and 3D Content-Based Search in VR Applications, IEEE MultiMedia, v.13 n.3, p.92-100, July 2006
|
|