ACM Home Page
Please provide us with feedback. Feedback
Approximate inference for planning in stochastic relational worlds
Full text PdfPdf (771 KB)
Source ACM International Conference Proceeding Series; Vol. 382 archive
Proceedings of the 26th Annual International Conference on Machine Learning table of contents
Montreal, Quebec, Canada
Pages 585-592  
Year of Publication: 2009
ISBN:978-1-60558-516-1
Authors
Tobias Lang  TU Berlin, Berlin, Germany
Marc Toussaint  TU Berlin, Berlin, Germany
Sponsors
: MITACS
: NSF
Microsoft Research : Microsoft Research
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 9,   Downloads (12 Months): 27,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1553374.1553450
What is a DOI?

ABSTRACT

Relational world models that can be learned from experience in stochastic domains have received significant attention recently. However, efficient planning using these models remains a major issue. We propose to convert learned noisy probabilistic relational rules into a structured dynamic Bayesian network representation. Predicting the effects of action sequences using approximate inference allows for planning in complex worlds. We evaluate the effectiveness of our approach for online planning in a 3D simulated blocksworld with an articulated manipulator and realistic physics. Empirical results show that our method can solve problems where existing methods fail.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
Botvinick, M. M., & An, J. (2009). Goal-directed decision making in prefrontal cortex: a computational framework. Advances in Neural Information Processing Systems (NIPS) (pp. 169--176).
 
2
Croonenborghs, T., Ramon, J., Blockeel, H., & Bruynooghe, M. (2007). Online learning and exploiting relational models in reinforcement learning. Proc. of the Int. Conf. on Artificial Intelligence (IJ-CAI) (pp. 726--731).
 
3
 
4
Gardiol, N. H., & Kaelbling, L. P. (2007). Action-space partitioning for planning. Proc. of the Nat. Conf. on Artificial Intelligence (AAAI) (pp. 980--986).
 
5
 
6
7
 
8
 
9
Pasula, H. M., Zettlemoyer, L. S., & Kaelbling, L. P. (2007). Learning symbolic models of stochastic domains. Artificial Intelligence Research, 29, 309--352.
 
10
Sanghai, S., Domingos, P., & Weld, D. (2005). Relational dynamic Bayesian networks. Artificial Intelligence Research, 24, 759--797.
 
11
Sanner, S., & Boutilier, C. (2007). Approximate solution techniques for factored first-order MDPs. Proc. of the Int. Conf. on Automated Planning and Scheduling (ICAPS) (pp. 288--295).
12
 
13
van Otterlo, M. (2009). The logic of adaptive behavior. IOS Press, Amsterdam.

Collaborative Colleagues:
Tobias Lang: colleagues
Marc Toussaint: colleagues