ACM Home Page
Please provide us with feedback. Feedback
Discovering options from example trajectories
Full text PdfPdf (637 KB)
Source ACM International Conference Proceeding Series; Vol. 382 archive
Proceedings of the 26th Annual International Conference on Machine Learning table of contents
Montreal, Quebec, Canada
Pages 1217-1224  
Year of Publication: 2009
ISBN:978-1-60558-516-1
Authors
Peng Zang  Georgia Institute of Technology, Atlanta, GA
Peng Zhou  Georgia Institute of Technology, Atlanta, GA
David Minnen  Georgia Institute of Technology, Atlanta, GA
Charles Isbell  Georgia Institute of Technology, Atlanta, GA
Sponsors
: MITACS
: NSF
Microsoft Research : Microsoft Research
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 14,   Downloads (12 Months): 32,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1553374.1553529
What is a DOI?

ABSTRACT

We present a novel technique for automated problem decomposition to address the problem of scalability in reinforcement learning. Our technique makes use of a set of near-optimal trajectories to discover options and incorporates them into the learning process, dramatically reducing the time it takes to solve the underlying problem. We run a series of experiments in two different domains and show that our method offers up to 30 fold speedup over the baseline.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
 
3
4
 
5
6
 
7
 
8
 
9
 
10
 
11

Collaborative Colleagues:
Peng Zang: colleagues
Peng Zhou: colleagues
David Minnen: colleagues
Charles Isbell: colleagues