| Approximate inference for planning in stochastic relational worlds |
| Full text |
Pdf
(771 KB)
|
| Source
|
ACM International Conference Proceeding Series; Vol. 382
archive
Proceedings of the 26th Annual International Conference on Machine Learning
table of contents
Montreal, Quebec, Canada
Pages 585-592
Year of Publication: 2009
ISBN:978-1-60558-516-1
|
|
Authors
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 9, Downloads (12 Months): 27, Citation Count: 0
|
|
|
ABSTRACT
Relational world models that can be learned from experience in stochastic domains have received significant attention recently. However, efficient planning using these models remains a major issue. We propose to convert learned noisy probabilistic relational rules into a structured dynamic Bayesian network representation. Predicting the effects of action sequences using approximate inference allows for planning in complex worlds. We evaluate the effectiveness of our approach for online planning in a 3D simulated blocksworld with an articulated manipulator and realistic physics. Empirical results show that our method can solve problems where existing methods fail.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
Botvinick, M. M., & An, J. (2009). Goal-directed decision making in prefrontal cortex: a computational framework. Advances in Neural Information Processing Systems (NIPS) (pp. 169--176).
|
| |
2
|
Croonenborghs, T., Ramon, J., Blockeel, H., & Bruynooghe, M. (2007). Online learning and exploiting relational models in reinforcement learning. Proc. of the Int. Conf. on Artificial Intelligence (IJ-CAI) (pp. 726--731).
|
| |
3
|
|
| |
4
|
Gardiol, N. H., & Kaelbling, L. P. (2007). Action-space partitioning for planning. Proc. of the Nat. Conf. on Artificial Intelligence (AAAI) (pp. 980--986).
|
| |
5
|
|
| |
6
|
|
 |
7
|
Kristian Kersting , Martijn Van Otterlo , Luc De Raedt, Bellman goes relational, Proceedings of the twenty-first international conference on Machine learning, p.59, July 04-08, 2004, Banff, Alberta, Canada
[doi> 10.1145/1015330.1015401]
|
| |
8
|
|
| |
9
|
Pasula, H. M., Zettlemoyer, L. S., & Kaelbling, L. P. (2007). Learning symbolic models of stochastic domains. Artificial Intelligence Research, 29, 309--352.
|
| |
10
|
Sanghai, S., Domingos, P., & Weld, D. (2005). Relational dynamic Bayesian networks. Artificial Intelligence Research, 24, 759--797.
|
| |
11
|
Sanner, S., & Boutilier, C. (2007). Approximate solution techniques for factored first-order MDPs. Proc. of the Int. Conf. on Automated Planning and Scheduling (ICAPS) (pp. 288--295).
|
 |
12
|
|
| |
13
|
van Otterlo, M. (2009). The logic of adaptive behavior. IOS Press, Amsterdam.
|
|