| Elvis: situated speech and gesture understanding for a robotic chandelier |
| Full text |
Pdf
(404 KB)
|
| Source
|
International Conference on Multimodal Interfaces
archive
Proceedings of the 6th international conference on Multimodal interfaces
table of contents
State College, PA, USA
SESSION: Multimodal applications
table of contents
Pages: 90 - 96
Year of Publication: 2004
ISBN:1-58113-995-0
|
|
Authors
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 3, Downloads (12 Months): 18, Citation Count: 5
|
|
|
ABSTRACT
We describe a home lighting robot that uses directional spotlights to create complex lighting scenes. The robot senses its visual environment using a panoramic camera and attempts to maintain its target goal state by adjusting the positions and intensities of its lights. Users can communicate desired changes in the lighting environment through speech and gesture (e.g., "Make it brighter over there"). Information obtained from these two modalities are combined to form a goal, a desired change in the lighting of the scene. This goal is then incorporated into the system's target goal state. When the target goal state and the world are out of alignment, the system formulates a sensorimotor plan that acts on the world to return the system to homeostasis.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
P. Lamere, P. Kwok, et al. Design of the CMU Sphinx-4 Decoder. Eurospeech, September 2003.
|
| |
2
|
M. Kuperstein. Neural model of adaptive hand-eye coordination for single postures. Science, 239. 1308--1311, 1988.
|
| |
3
|
M. Petrou, L. Shafarenko, and J. Kittler. Histogram-based segmentation in a perceptually uniform color space. IEEE Transactions on Image Processing, vol. 7, pp. 1354--1358, September 1998.
|
| |
4
|
A. Kendon. Conducting Interaction. Cambridge: Cambridge University Press 1990.
|
| |
5
|
D. McNeill. Hand and Mind. The University of Chicago Press, Chicago, 1992.
|
| |
6
|
|
| |
7
|
S. Kettebekov and R. Sharma. Understanding Gestures in Multimodal Human-Computer Interaction. International Journal on Artificial Intelligence Tools, vol. 9, no. 2, pp. 205--224, June 2000.
|
 |
8
|
|
| |
9
|
|
 |
10
|
Sharon Oviatt , Antonella DeAngeli , Karen Kuhn, Integration and synchronization of input modes during multimodal human-computer interaction, Proceedings of the SIGCHI conference on Human factors in computing systems, p.415-422, March 22-27, 1997, Atlanta, Georgia, United States
[doi> 10.1145/258549.258821]
|
INDEX TERMS
Primary Classification:
H.
Information Systems
H.1
MODELS AND PRINCIPLES
H.1.2
User/Machine Systems
Subjects:
Human information processing
Additional Classification:
H.
Information Systems
H.1
MODELS AND PRINCIPLES
H.1.2
User/Machine Systems
Subjects:
Human factors
H.5
INFORMATION INTERFACES AND PRESENTATION (I.7)
H.5.2
User Interfaces (D.2.2, H.1.2, I.3.6)
Subjects:
Input devices and strategies (e.g., mouse, touchscreen);
Interaction styles (e.g., commands, menus, forms, direct manipulation)
General Terms:
Design,
Human Factors
Keywords:
gesture,
grounded,
input methods,
lighting,
multimodal,
natural interaction,
situated,
speech
|