ACM Home Page
Please provide us with feedback. Feedback
Attention driven visual processing for an interactive dialog robot
Full text PdfPdf (755 KB)
Source
Symposium on Applied Computing archive
Proceedings of the 2009 ACM symposium on Applied Computing table of contents
Honolulu, Hawaii
SESSION: Intelligent robotic systems track table of contents
Pages 1151-1155  
Year of Publication: 2009
ISBN:978-1-60558-166-8
Authors
Thomas Müller  Technische Universtiät München, Garching/München/Germany
Alois Knoll  Technische Universtiät München, Garching/München/Germany
Sponsor
SIGAPP: ACM Special Interest Group on Applied Computing
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 5,   Downloads (12 Months): 36,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1529282.1529538
What is a DOI?

ABSTRACT

In this paper we propose an attention-based vision system for the JAST interactive dialog robot. The robotic vision system incorporates three submodules: object recognition, gesture recognition and self recognition.

The performance boost of our biologically inspired vision system is based on two assumptions: first, generally attention is attracted by regions of high intensity or hue gardients as well as scene dynamics (bottom-up attention attraction), and second, attentioninal focus can be directed by higher level modules, whether volitional or not, in an inhibitory or reinforcing way (top-down attention control). The system proposed in this paper is able to utilize these assumptions and organize its computational efforts accordingly.

Integrated into an efficient data management architecture, the vision system is capable of continuously publishing results to the cognitive layer of the robot and thus enables operations in realtime. Furthermore, the modular system structure and the asynchronous communication paradigm allows for efficient integration of additional modules, be it visual or any other sensory input data.

The main contribution of this work is the application of neuroscience findings and biologically plausible theories of attention based visual processing to a real-world robotic setup. Here, our experimental results show tremendous speed-ups using either the bottom-up attention attractors or the principle of top-down attention control as input data filters for further visual analysis, reaching the peak in a combination of the two.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
P. Reinagel and A. M. Zador, "Natural scene statistics at the center of gaze," Network: Computation in Neural Systems, vol. 10, no. 4, pp. 341--350, 1999.
 
2
 
3
S. Gilles, "Robust description and matching of images," Ph.D. dissertation, University of Oxford, 1998.
 
4
W. Kienzle, F. A. Wichmann, B. Schölkopf, and M. O. Franz, "A nonparametric approach to bottom-up visual saliency," in Proc. NIPS '06, 2006.
 
5
J. K. O'Regan and A. Noë, "A sensorimotor account of vision and visual consciousness," Behavioural and Brain Sciences, vol. 24, pp. 939--1031, 2001.
 
6
M. Posner and Y. Cohen, "Components of visual orienting," Attention and Performance, vol. 10, pp. 531--556, 1984.
 
7
L. Itti and C. Koch, "Computational modelling of visual attention," Nature Reviews Neuroscience, vol. 2, pp. 194--203, March 2001.
 
8
 
9
10
 
11
M. Rickert, M. E. Foster, M. Giuliani, T. By, G. Panin, and A. Knoll, "Integrating language, vision, and action for human robot dialog systems," in Proc. ICMI, 2007.
 
12
M. Giuliani and A. Knoll, "Integrating multimodal cues using grammar based models," in Proc. ICMI, 2007.
 
13
 
14
 
15
T. Müller, P. Ziaie, and A. Knoll, "A wait-free realtime system for optimal distribution of vision tasks on multicore architectures," in Proc. ICINCO '08, 2008.
16
 
17
P. Ziaie, T. Müller, M. E. Foster, and A. Knoll, "A naïve bayes classifier with distance weighting for hand-gesture recognition," in Proc. Intl. CSI Computer Conference '08, 2008.
 
18
 
19
 
20
 
21
W. Li, V. Piëch, and C. D. Gilbert, "Perceptual learning and top-down influences in primary visual cortex," Nature Neuroscience, vol. 7, pp. 651--657, May 2004.
 
22
M. I. Posner, R. D. Rafal, L. S. Choate, and J. Vaughan, "Inhibition of return: Neural basis function," Cognitive Neuropsychology, vol. 2, no. 3, pp. 211--228, August 1985.
 
23
J. R. Stroop, "Studies of interference in serial verbal reactions," Journal of Experimental Pyschology, vol. 18, pp. 643--662, 1935.
 
24
J. D. Cohen, G. Aston-Jones, and M. S. Gilzenrat, "A systems-level perspective on attention and cognitive control: Guided activation, adaptive gating, conflict monitoring, and exploitation versus exploration," in Cognitive Neuroscience of Attention. Guilford Publications, 2004, pp. 71--90.

Collaborative Colleagues:
Thomas Müller: colleagues
Alois Knoll: colleagues