ACM Home Page
Please provide us with feedback. Feedback
Elvis: situated speech and gesture understanding for a robotic chandelier
Full text PdfPdf (404 KB)
Source International Conference on Multimodal Interfaces archive
Proceedings of the 6th international conference on Multimodal interfaces table of contents
State College, PA, USA
SESSION: Multimodal applications table of contents
Pages: 90 - 96  
Year of Publication: 2004
ISBN:1-58113-995-0
Authors
Joshua Juster  MIT Media Laboratory, Cambridge, MA
Deb Roy  MIT Media Laboratory, Cambridge, MA
Sponsors
SIGCHI: ACM Special Interest Group on Computer-Human Interaction
ACM: Association for Computing Machinery
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 3,   Downloads (12 Months): 18,   Citation Count: 5
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1027933.1027950
What is a DOI?

ABSTRACT

We describe a home lighting robot that uses directional spotlights to create complex lighting scenes. The robot senses its visual environment using a panoramic camera and attempts to maintain its target goal state by adjusting the positions and intensities of its lights. Users can communicate desired changes in the lighting environment through speech and gesture (e.g., "Make it brighter over there"). Information obtained from these two modalities are combined to form a goal, a desired change in the lighting of the scene. This goal is then incorporated into the system's target goal state. When the target goal state and the world are out of alignment, the system formulates a sensorimotor plan that acts on the world to return the system to homeostasis.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
P. Lamere, P. Kwok, et al. Design of the CMU Sphinx-4 Decoder. Eurospeech, September 2003.
 
2
M. Kuperstein. Neural model of adaptive hand-eye coordination for single postures. Science, 239. 1308--1311, 1988.
 
3
M. Petrou, L. Shafarenko, and J. Kittler. Histogram-based segmentation in a perceptually uniform color space. IEEE Transactions on Image Processing, vol. 7, pp. 1354--1358, September 1998.
 
4
A. Kendon. Conducting Interaction. Cambridge: Cambridge University Press 1990.
 
5
D. McNeill. Hand and Mind. The University of Chicago Press, Chicago, 1992.
 
6
 
7
S. Kettebekov and R. Sharma. Understanding Gestures in Multimodal Human-Computer Interaction. International Journal on Artificial Intelligence Tools, vol. 9, no. 2, pp. 205--224, June 2000.
8
 
9
10