ACM Home Page
Please provide us with feedback. Feedback
A task specification language for bootstrap learning
Full text PdfPdf (216 KB)
Source
International Conference on Autonomous Agents archive
Proceedings of The 8th International Conference on Autonomous Agents and Multiagent Systems - Volume 2 table of contents
Budapest, Hungary
SESSION: Environments table of contents
Pages 1169-1170  
Year of Publication: 2009
ISBN:978-0-9817381-7-8
Authors
Ian Fasel  The University of Texas at Austin
Michael Quinlan  The University of Texas at Austin
Peter Stone  The University of Texas at Austin
Sponsors
: The Foundation for Intelligent Physical Agents
Microsoft Research : Microsoft Research
: Whitestein Technologies
: European Office of Aerospace Research and Development, Air Force Office of Scientific Research, United States Air Force Research Laboratory
: Drexel University
: Wiley -- Blackwell Ltd
Publisher
Bibliometrics
Downloads (6 Weeks): 7,   Downloads (12 Months): 15,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Review this Article  

ABSTRACT

Traditionally, research in the reinforcement learning (RL) community has been devoted to developing domain-independent algorithms such as SARSA [13], Q-learning [16], prioritized sweeping [8], or LSPI [6], that are designed to work for any given state space and action space. However, the modus operandi in RL research has been for a human expert to re-code each learning environment, including defining the actions and state features, as well as specifying the algorithm to be used. Typically each new RL experiment is run by explicitly calling a new program (even when learning can be biased by previous learning experiences, as in transfer learning [10, 15, 14]). Thus, while standards have developed for describing and testing individual RL algorithms (e.g., RL-Glue [17]), no such standards have developed for the problem of describing complete tasks to a preexisting agent.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
Bootstrapped learning proposer information pamphlet. http://www.darpa.mil/IPTO/solicit/closed/BAA-07-04_PIP.pdf.
 
2
 
3
 
4
T. Jung and D. Polani. Learning robocup-keepaway with kernels. In JMLR Workshop and Conference Proceedings ("Gaussian Processes in Practice"), volume 1. JMLR, 2007.
 
5
G. Kuhlmann, P. Stone, R. Mooney, and J. Shavlik. Guiding a reinforcement learner with natural language advice: Initial results in robocup soccer. In AAAI-2004 Workshop on Supervisory Control of Learning and Adaptive Systems, 2004.
 
6
 
7
 
8
 
9
S. Schaal. Learning from demonstration. NIPS 9, 1997.
 
10
 
11
P. Stone, R. S. Sutton, and G. Kuhlmann. Reinforcement learning for RoboCup-soccer keepaway. Adaptive Behavior, 13(3):165--188, 2005.
 
12
 
13
14
 
15
L. Torrey, T. Walker, J. Shavlik, and R. Maclin. Using advice to transfer knowledge acquired in one reinforcement learning task to another. In ECML 2005. Porto, Portugal, 2005.
 
16
C. J. C. H. Watkins. Learning from Delayed Rewards. PhD thesis, King's College, Cambridge, UK, 1989.
 
17
A. White, M. Lee, A. Butcher, B. Tanner, L. Hackman, and R. Sutton. Rl-glue distribution, http://rlai.cs.ualberta.ca/rlbb/top.html, 2007.

Collaborative Colleagues:
Ian Fasel: colleagues
Michael Quinlan: colleagues
Peter Stone: colleagues