ACM Home Page
Please provide us with feedback. Feedback
Constraint relaxation in approximate linear programs
Full text PdfPdf (713 KB)
Source ACM International Conference Proceeding Series; Vol. 382 archive
Proceedings of the 26th Annual International Conference on Machine Learning table of contents
Montreal, Quebec, Canada
Pages 809-816  
Year of Publication: 2009
ISBN:978-1-60558-516-1
Authors
Marek Petrik  University of Massachusetts Amherst, Amherst, MA
Shlomo Zilberstein  University of Massachusetts Amherst, Amherst, MA
Sponsors
: MITACS
: NSF
Microsoft Research : Microsoft Research
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 8,   Downloads (12 Months): 26,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1553374.1553478
What is a DOI?

ABSTRACT

Approximate Linear Programming (ALP) is a reinforcement learning technique with nice theoretical properties, but it often performs poorly in practice. We identify some reasons for the poor quality of ALP solutions in problems where the approximation induces virtual loops. We then introduce two methods for improving solution quality. One method rolls out selected constraints of the ALP, guided by the dual information. The second method is a relaxation of the ALP, based on external penalty methods. The latter method is applicable in domains in which rolling out constraints is impractical. Both approaches show promising empirical results for simple benchmark problems as well as for a realistic blood inventory management problem.



Collaborative Colleagues:
Marek Petrik: colleagues
Shlomo Zilberstein: colleagues