| A probabilistic approach to reference resolution in multimodal user interfaces |
| Full text |
Pdf
(336 KB)
|
| Source
|
International Conference on Intelligent User Interfaces
archive
Proceedings of the 9th international conference on Intelligent user interfaces
table of contents
Funchal, Madeira, Portugal
SESSION: Dialogue
table of contents
Pages: 70 - 77
Year of Publication: 2004
ISBN:1-58113-815-6
|
|
Authors
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 12, Downloads (12 Months): 57, Citation Count: 12
|
|
|
ABSTRACT
Multimodal user interfaces allow users to interact with computers through multiple modalities, such as speech, gesture, and gaze. To be effective, multimodal user interfaces must correctly identify all objects which users refer to in their inputs. To systematically resolve different types of references, we have developed a probabilistic approach that uses a graph-matching algorithm. Our approach identifies the most probable referents by optimizing the satisfaction of semantic, temporal, and contextual constraints simultaneously. Our preliminary user study results indicate that our approach can successfully resolve a wide variety of referring expressions, ranging from simple to complex and from precise to ambiguous ones.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
 |
1
|
|
 |
2
|
J. Cassell , T. Bickmore , M. Billinghurst , L. Campbell , K. Chang , H. Vilhjálmsson , H. Yan, Embodiment in conversational interfaces: Rea, Proceedings of the SIGCHI conference on Human factors in computing systems: the CHI is the limit, p.520-527, May 15-20, 1999, Pittsburgh, Pennsylvania, United States
[doi> 10.1145/302979.303150]
|
| |
3
|
|
 |
4
|
Philip R. Cohen , Michael Johnston , David McGee , Sharon Oviatt , Jay Pittman , Ira Smith , Liang Chen , Josh Clow, QuickSet: multimodal interaction for distributed applications, Proceedings of the fifth ACM international conference on Multimedia, p.31-40, November 09-13, 1997, Seattle, Washington, United States
[doi> 10.1145/266180.266328]
|
| |
5
|
|
| |
6
|
|
| |
7
|
Gustafson, J., Bell, L., Beskow, J., Boye J., Carlson, R., Edlund, J., Granstrom, B., House D., and Wiren, M. AdApt -- a Multimodal Conversational Dialogue System in an Apartment Domain. Proceedings of 6th International Conference on Spoken Language Processing (ICSLP), 2000.
|
| |
8
|
|
| |
9
|
Michael Johnston , Philip R. Cohen , David McGee , Sharon L. Oviatt , James A. Pittman , Ira Smith, Unification-based multimodal integration, Proceedings of the eighth conference on European chapter of the Association for Computational Linguistics, p.281-288, July 07-12, 1997, Madrid, Spain
|
| |
10
|
|
| |
11
|
|
| |
12
|
|
| |
13
|
David B. Koons , Carlton J. Sparrell , Kristinn R. Thorisson, Integrating simultaneous input from speech, gaze, and hand gestures, Intelligent multimedia interfaces, American Association for Artificial Intelligence, Menlo Park, CA, 1993
|
 |
14
|
|
| |
15
|
J. G. Neal , C. Y. Thielman , Z. Dobes , S. M. Haller , S. C. Shapiro, Natural language with integrated deictic and graphic gestures, Readings in intelligent user interfaces, Morgan Kaufmann Publishers Inc., San Francisco, CA, 1998
|
 |
16
|
|
 |
17
|
Sharon Oviatt , Antonella DeAngeli , Karen Kuhn, Integration and synchronization of input modes during multimodal human-computer interaction, Proceedings of the SIGCHI conference on Human factors in computing systems, p.415-422, March 22-27, 1997, Atlanta, Georgia, United States
[doi> 10.1145/258549.258821]
|
 |
18
|
|
 |
19
|
|
| |
20
|
Amanda Stent , John Dowding , Jean Mark Gawron , Elizabeth Owen Bratt , Robert Moore, The CommandTalk spoken dialogue system, Proceedings of the 37th annual meeting of the Association for Computational Linguistics on Computational Linguistics, p.183-190, June 20-26, 1999, College Park, Maryland
[doi> 10.3115/1034678.1034713]
|
| |
21
|
|
| |
22
|
Tsai, W.H. and Fu, K.S. Error-correcting isomorphism of attributed relational graphs for pattern analysis. IEEE Trans. Sys., Man and Cyb., vol. 9, 1979, pp. 757--768.
|
| |
23
|
|
| |
24
|
Zancanaro, M., Stock, O., and Strapparava, C. 1997. Multimodal Interaction for Information Access: Exploiting Cohesion. Computational Intelligence 13(7):439--464.
|
 |
25
|
|
CITED BY 12
|
|
Joyce Y. Chai , Zahar Prasov , Joseph Blaim , Rong Jin, Linguistic theories in efficient multimodal reference resolution: an empirical investigation, Proceedings of the 10th international conference on Intelligent user interfaces, January 10-13, 2005, San Diego, California, USA
|
|
|
|
|
|
Shimei Pan , Siwei Shen , Michelle X. Zhou , Keith Houck, Two-way adaptation for robust input interpretation in practical multimodal conversation systems, Proceedings of the 10th international conference on Intelligent user interfaces, January 10-13, 2005, San Diego, California, USA
|
|
|
Michelle X. Zhou , Keith Houck , Shimei Pan , James Shaw , Vikram Aggarwal , Zhen Wen, Enabling context-sensitive information seeking, Proceedings of the 11th international conference on Intelligent user interfaces, January 29-February 01, 2006, Sydney, Australia
|
|
|
|
|
|
|
|
|
Joyce Y. Chai , Pengyu Hong , Michelle X. Zhou , Zahar Prasov, Optimization in multimodal interpretation, Proceedings of the 42nd Annual Meeting on Association for Computational Linguistics, p.1-es, July 21-26, 2004, Barcelona, Spain
|
|
|
|
|
|
|
|
|
|
|
|
Michelle X. Zhou , Keith Houck , Shimei Pan , James Shaw , Vikram Aggarwal , Zhen Wen, Responsive information architect: enabling context-sensitive information seeking, proceedings of the 21st national conference on Artificial intelligence, p.1691-1694, July 16-20, 2006, Boston, Massachusetts
|
|
|
|
|