|
ABSTRACT
In this paper we describe a Wizard of Oz (WOz) user study of an Augmented Reality (AR) interface that uses multimodal input (MMI) with natural hand interaction and speech commands. Our goal is to use a WOz study to help guide the creation of a multimodal AR interface which is most natural to the user. In this study we used three virtual object arranging tasks with two different display types (a head mounted display, and a desktop monitor) to see how users used multimodal commands, and how different AR display conditions affect those commands. The results provided valuable insights into how people naturally interact in a multimodal AR scene assembly task. For example, we discovered the optimal time frame for fusing speech and gesture commands into a single command. We also found that display type did not produce a significant difference in the type of commands used. Using these results, we present design recommendations for multimodal interaction in AR environments.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
Swan II, J.E. and Gabbard, J. L. 2005. Survey of User-Based Experimentation in Augmented Reality. In Proc. of 1st Int'l Conf. on Virtual Reality. HCI International 2005, 2005.
|
| |
2
|
Broll, W., Stoerring, M., and Mottram, C. 2003. The Augmented Round Table -- a New Interface to Urban Planning and Architectural Design, In Proc. INTERACT'03, pp. 1103--1104.
|
 |
3
|
Kousuke Nakashima , Takashi Machida , Kiyoshi Kiyokawa , Haruo Takemura, A 2D-3D integrated environment for cooperative work, Proceedings of the ACM symposium on Virtual reality software and technology, November 07-09, 2005, Monterey, CA, USA
[doi> 10.1145/1101616.1101621]
|
 |
4
|
|
 |
5
|
|
 |
6
|
|
 |
7
|
|
 |
8
|
Philip R. Cohen , Michael Johnston , David McGee , Sharon Oviatt , Jay Pittman , Ira Smith , Liang Chen , Josh Clow, QuickSet: multimodal interaction for distributed applications, Proceedings of the fifth ACM international conference on Multimedia, p.31-40, November 09-13, 1997, Seattle, Washington, United States
[doi> 10.1145/266180.266328]
|
 |
9
|
Ingmar Rauschert , Pyush Agrawal , Rajeev Sharma , Sven Fuhrmann , Isaac Brewer , Alan MacEachren, Designing a human-centered, multimodal GIS interface to support emergency management, Proceedings of the 10th ACM international symposium on Advances in geographic information systems, November 08-09, 2002, McLean, Virginia, USA
[doi> 10.1145/585147.585172]
|
 |
10
|
|
| |
11
|
|
 |
12
|
|
 |
13
|
|
| |
14
|
Lucente, M., Zwart, G. J., and George, A. D. 1998. Visualization Space: A Testbed for Deviceless Multimodal User Interface. In AAAI Spring Symposium on Intelligent Environments. AAAI TR SS-98-02.
|
| |
15
|
Irawati, S., Green, S., Billinghurst, M., Duenser, A., and Ko, H. 2006. "Move the Couch Where?": Developing an Augmented Reality Multimodal Interface". In Proc. ISMAR'06, 183 -- 186.
|
| |
16
|
Billinghurst, M., Kato, H., Poupyrev, I., Imamoto, K., and Tachibana, K. 2000. Virtual Object Manipulation on a Table-Top AR Environment. In Proc. ISAR'00, 111--119.
|
| |
17
|
Irawati, S., Green, S., Billinghurst, M., Duenser, A., and Ko, H. 2006. An Evaluation of an Augmented Reality Multimodal Interface Using Speech and Paddle Gestures. In Proc. ICAT'06, 272--283.
|
| |
18
|
|
| |
19
|
Kölsch, M., Turk, M., and Tobias, H. 2004. Vision-Based Interfaces for Mobility. In Proc. MobiQuitous'04, 86-- 94.
|
| |
20
|
|
 |
21
|
Joëlle Coutaz , Daniel Salber , Eric Carraux , Nathalie Portolan, NEIMO, a multiworkstation usability lab for observing and analyzing multimodal interaction, Conference companion on Human factors in computing systems: common ground, p.402-403, April 13-18, 1996, Vancouver, British Columbia, Canada
[doi> 10.1145/257089.257904]
|
| |
22
|
Oviatt, S. L., Cohen, P. R., Fong, M. W., and Frank, M. P. 1992. A rapid semi-automatic simulation technique for interactive speech and handwriting. In Proc. Int'l Conf. on Spoken Language Processing, 2, 1351--1354.
|
| |
23
|
|
| |
24
|
Corradini, A. and Cohen, P. R. 2002. On the Relationships among Speech, Gestures, and Object Manipulation in Virtual Environments: Initial Evidence. In Proc. Int'l CLASS Workshop on Natural, Intelligent and Effective Interaction in Multimodal Dialogue Systems, 52--61.
|
 |
25
|
|
| |
26
|
OpenCV Library. 2008. http://sourceforge.net/projects/opencvlibrary/
|
 |
27
|
|
| |
28
|
|
| |
29
|
Looser, J., Grasset, R., Seichter, H., and Billinghurst, M. 2006. OSGART -- A Pragmatic Approach to MR. In Proc. Industrial AR Workshop, ISMAR'06.
|
| |
30
|
Billinghurst, M., Campbell, S., Chinthammit, W., Hendrickson, D., Poupyrev, I., Takahashi, K., and Kato, H. 2000. Magic book: Exploring transitions in collaborative AR interfaces. Emerging Technologies Proposal, SIGGRAPH'00.
|
| |
31
|
McNeil, D. 1992. Hand and Mind: What gestures reveal about thought. University of Chicago Press, Chicago, IL, USA.
|
|