ACM Home Page
Please provide us with feedback. Feedback
Using multi-agent systems for learning optimal policies for complex problems
Full text PdfPdf (481 KB)
Source ACM Southeast Regional Conference archive
Proceedings of the 45th annual southeast regional conference table of contents
Winston-Salem, North Carolina
SESSION: Papers table of contents
Pages: 244 - 249  
Year of Publication: 2007
ISBN:978-1-59593-629-5
Authors
Andreas Lommatzsch  TU-Berlin, Germany
Sahin Albayrak  TU-Berlin, Germany
Sponsor
SIGAPP: ACM Special Interest Group on Applied Computing
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 3,   Downloads (12 Months): 23,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1233341.1233385
What is a DOI?

ABSTRACT

The automatic computation of an optimal solution for a complex problem is a challenging task if no additional knowledge is available. For bounded sized problems there are universally applicable algorithms (e.g. genetic algorithms, branch and bound, reinforcement learning). The disadvantage of these algorithms is their high computational complexity so that real world problems can only be solved efficiently, if the search space is reduced dramatically.

In this paper we present an approach that enables the automatic computation of the parameter dependencies of a complex problem without any additional information. The basic idea is to apply reinforcement learning and to incrementally acquire knowledge about the implicit parameters dependencies. Based on the obtained data an optimal strategy is learned. For speeding up the learning process a multiagent architecture is applied, that supports the simultaneous analysis of alternative strategies. We prove the advantages of our approach by successfully learning a control strategy for a model helicopter.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
 
2
E. N. Johnson and S. Kannan. Adaptive flight control for an autonomous unmanned helicopter. In gnc, number AIAA-2002-4439, Monterey, CA, aug 2002.
 
3
J. Leishman. Principles of Helicopter Aerodynamics. Cambridge University Press, Cambridge, MA, 2000.
 
4
M. Musial, G. Hommel, U. W. Brandenburg, E. Berg, M. Christmann, C. Fleischer, C. Reinicke, V. Remuß, S. Rönneke, and A. Wege. MARVIN - Technische Universität Berlin's Flying Robot Competing at the IARC'99.
 
5
A. Y. Ng, A. Coates, and e. a. Mark Diel. Inverted autonomous helicopter flight via reinforcement learning. In International Symposium on Experimental Robotics, 2004.
 
6
A. Y. Ng, H. J. Kim, M. I. Jordan, and S. Sastry. Autonomous helicopter flight via reinforcement learning. In S. Thrun, L. Saul, and B. Schölkopf, editors, Advances in Neural Information Processing Systems 16. MIT Press, Cambridge, MA, 2004.
 
7
8
 
9
 
10

Collaborative Colleagues:
Andreas Lommatzsch: colleagues
Sahin Albayrak: colleagues