| Skipping spare information in multimodal inputs during multimodal input fusion |
| Full text |
Pdf
(515 KB)
|
Source
|
International Conference on Intelligent User Interfaces
archive
Proceedings of the 13th international conference on Intelligent user interfaces
table of contents
Sanibel Island, Florida, USA
SESSION: Short papers
table of contents
Pages 451-456
Year of Publication: 2009
ISBN:978-1-60558-168-2
|
|
Authors
|
|
Yong Sun
|
The University of Sydney, Sydney, Australia and National ICT Australia, Eveleigh, Australia
|
|
Yu Shi
|
National ICT Australia, Eveleigh, Australia
|
|
Fang Chen
|
National ICT Australia, Eveleigh, Australia and National ICT Australia, Eveleigh, Australia
|
|
Vera Chung
|
The University of Sydney, Sydney, Australia
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 8, Downloads (12 Months): 78, Citation Count: 0
|
|
|
ABSTRACT
In a multimodal interface, a user can use multiple modalities, such as speech, gesture, and eye gaze etc., to communicate with a system. As a critical component in a multimodal interface, multimodal input fusion explores the ways to effectively interpret the combined semantic interpretation of user's multimodal inputs. Although multimodal inputs may contain spare information, few multimodal input fusion approaches have tackled how to deal with spare information in multimodal inputs. This paper proposes a novel multimodal input fusion approach to flexibly skip spare information in multimodal inputs and derive semantic interpretation of them. The evaluation about the proposed approach confirms that the approach makes human-computer interaction more natural and smooth.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
Bangalore S., and Johnston M. Integrating multimodal language processing with speech recognition. In Proceedings of Sixth International Conference on Spoken Language Processing. October 16-20, Beijing, China, 2000.
|
 |
2
|
|
| |
3
|
Michael Johnston , Philip R. Cohen , David McGee , Sharon L. Oviatt , James A. Pittman , Ira Smith, Unification-based multimodal integration, Proceedings of the eighth conference on European chapter of the Association for Computational Linguistics, p.281-288, July 07-12, 1997, Madrid, Spain
|
 |
4
|
Ed Kaiser , David Demirdjian , Alexander Gruenstein , Xiaoguang Li , John Niekrasz , Matt Wesson , Sanjeev Kumar, A multimodal learning interface for sketch, speak and point creation of a schedule chart, Proceedings of the 6th international conference on Multimodal interfaces, October 13-15, 2004, State College, PA, USA
[doi> 10.1145/1027933.1027992]
|
 |
5
|
|
| |
6
|
Rudzicz F. Clavius: Bi-Directional Parsing for Generic Multimodal Interaction. In Proceedings of Joint Conference of the International Committee on Computational Linguistics and the Association for Computational Linguistics (COLING/ACL-06 SRW), July 17-21, Sydney, Australia 2006.
|
 |
7
|
Yong Sun , Helmut Prendinger , Yu (David) Shi , Fang Chen , Vera Chung , Mitsuru Ishizuka, The hinge between input and output: understanding the multimodal input fusion results in an agent-based multimodal presentation system, CHI '08 extended abstracts on Human factors in computing systems, April 05-10, 2008, Florence, Italy
[doi> 10.1145/1358628.1358878]
|
 |
8
|
|
|