Interactive learning of mappings from visual percepts to actions
Source
ACM International Conference Proceeding Series; Vol. 119
Proceedings of the 22nd International Conference on Machine Learning
Bonn, Germany
Pages: 393-400
Year of Publication: 2005
ISBN: 1-59593-180-5
ABSTRACT
We introduce flexible algorithms that can automatically learn mappings from images to actions by interacting with their environment. They work by introducing an image classifier in front of a Reinforcement Learning algorithm. The classifier partitions the visual space according to the presence or absence of highly informative local descriptors. The image classifier is incrementally refined by selecting new local descriptors when perceptual aliasing is detected. Thus, we reduce the visual input domain down to a size manageable by Reinforcement Learning, permitting us to learn direct percept-to-action mappings. Experimental results on a continuous visual navigation task illustrate the applicability of the framework.
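The loop described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: percepts are modeled here as plain sets of feature identifiers rather than images with local descriptors, the learner is ordinary tabular Q-learning, and the feature-selection score (difference in mean temporal-difference error with and without a feature) is a hypothetical stand-in for whatever criterion the paper uses. All names (FeatureClassifier, q_update, most_informative_feature) are invented for illustration.

```python
from collections import defaultdict


class FeatureClassifier:
    """Maps a percept (a set of feature ids) to a discrete class.

    The class is the tuple of presence/absence bits of the features
    selected so far. With no features, every percept falls in one class.
    """

    def __init__(self):
        self.features = []  # selected features, initially none

    def classify(self, percept):
        return tuple(f in percept for f in self.features)

    def refine(self, feature):
        # Adding a feature splits existing classes on its presence/absence.
        if feature not in self.features:
            self.features.append(feature)


def q_update(q, state, action, reward, next_state, actions,
             alpha=0.1, gamma=0.9):
    """One tabular Q-learning step on classifier outputs; returns the TD error."""
    best_next = max(q[(next_state, a)] for a in actions)
    td_error = reward + gamma * best_next - q[(state, action)]
    q[(state, action)] += alpha * td_error
    return td_error


def most_informative_feature(samples, candidates):
    """Pick the candidate feature that best separates TD errors in one class.

    samples: list of (percept, td_error) pairs gathered in a single class.
    The score is an assumed criterion: the absolute difference between the
    mean TD error of percepts with the feature and those without it.
    A large score suggests the class lumps together percepts that need
    different values, i.e. perceptual aliasing.
    """
    best, best_score = None, 0.0
    for f in candidates:
        with_f = [e for p, e in samples if f in p]
        without_f = [e for p, e in samples if f not in p]
        if not with_f or not without_f:
            continue  # feature does not split this class at all
        score = abs(sum(with_f) / len(with_f)
                    - sum(without_f) / len(without_f))
        if score > best_score:
            best, best_score = f, score
    return best, best_score
```

In use, an agent would classify each percept, apply q_update on the resulting class, and store the (percept, TD error) pair per class. When the errors in a class stay inconsistent, most_informative_feature proposes a feature and refine adds it, so the number of states grows only where the task demands a distinction.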