ACM Home Page
Please provide us with feedback. Feedback
Adaptive Kanerva-based function approximation for multi-agent systems
Full text PdfPdf (571 KB)
Source
International Conference on Autonomous Agents archive
Proceedings of the 7th international joint conference on Autonomous agents and multiagent systems - Volume 3 table of contents
Estoril, Portugal
SESSION: Agent and multi-agent learning table of contents
Pages 1361-1364  
Year of Publication: 2008
ISBN:978-0-9817381-2-X
Authors
Sponsors
ACM: Association for Computing Machinery
AAAI : Association for the Advancement of Artifical Intelligence
Publisher
Bibliometrics
Downloads (6 Weeks): n/a,   Downloads (12 Months): n/a,   Citation Count: 1
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Review this Article  

ABSTRACT

In this paper, we show how adaptive prototype optimization can be used to improve the performance of function approximation based on Kanerva Coding when solving largescale instances of classic multi-agent problems. We apply our techniques to the predator-prey pursuit problem. We first demonstrate that Kanerva Coding applied within a reinforcement learner does not give good results. We then describe our new adaptive Kanerva-based function approximation algorithm, based on prototype deletion and generation. We show that probabilistic prototype deletion with random prototype generation increases the fraction of test instances that are solved from 45% to 90%, and that prototype splitting increases that fraction to 94%. We also show that optimizing prototypes reduces the number of prototypes, and therefore the number of features, needed to achieve a 90% solution rate by up to 87%. These results demonstrate that our approach can dramatically improve the quality of the results obtained and reduce the number of prototypes required. We conclude that adaptive prototype optimization can greatly improve a Kanerva-based reinforcement learner's ability to solve large-scale multi-agent problems.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
 
3
 
4
M. Benda, V. Jagannathan, and R. Rodhiawalla. On optimal cooperation of knowledge sources. Technical Report, Boeing Computer Services, 1985.
 
5
T. Haynes and S. Sen. The evolution of multiagent coordination strategies. Adaptive Behavior, 1997.
 
6
G. Hinton. Distributed representations. Technical Report, Department of Computer Science, Carnegie-Mellon University, Pittsburgh, 1984.
 
7
 
8
 
9
K. Kostiadis and H. Hu. KaBaGe-RL: Kanerva-based generalisation and reinforcement learning for possession football. In Proc. of IEEE/RSJ Intl. Conf. on Intelligent Robots and Systems, 2001.
 
10
B. Ratitch and D. Precup. Sparse distributed memories for on-line value-based reinforcement learning. In Proc. of the European Conf. on Machine Learning, 2004.
 
11
 
12
 
13


Collaborative Colleagues:
Cheng Wu: colleagues
Waleed M. Meleis: colleagues