| Rule value reinforcement learning for cognitive agents |
| Full text |
Pdf
(230 KB)
|
| Source
|
International Conference on Autonomous Agents
archive
Proceedings of the fifth international joint conference on Autonomous agents and multiagent systems
table of contents
Hakodate, Japan
SESSION: Learning and evolution
table of contents
Pages: 792 - 794
Year of Publication: 2006
ISBN:1-59593-303-4
|
|
Authors
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 5, Downloads (12 Months): 29, Citation Count: 0
|
|
|
ABSTRACT
RVRL (Rule Value Reinforcement Learning) is a new algorithm which extends an existing learning framework that models the environment of a situated agent using a probabilistic rule representation. The algorithm attaches values to learned rules by adapting reinforcement learning. Structure captured by the rules is used to form a policy. The resulting rule values represent the utility of taking an action if the rule's conditions are present in the agent's current percept. Advantages of the new framework are demonstrated, through examples in a predator-prey environment.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
Child, C. and Stathis, K. "Learning to Act with RVRL agents". City University Technical Report. (2006).
|
| |
2
|
Child, C. and Stathis, K. "The Apriori Stochastic Dependency Detection (ASDD) Algorithm for Learning Stochastic Logic Rules", In Proceedings of the 4th International Workshop on Computation Logic in Multi-agent Systems (CLIMA-04), J. Dix, J. Leiter (Eds), Florida, Jan. (2004).
|
| |
3
|
Pasula, H. M. Zettlemoyer, L. S. and Kaelbling, L. P. "Learning Probabilistic Relational Planning Rules." Proceedings of the Fourteenth International Conference on Automated Planning and Scheduling, ICAPS, 73--82, (2004).
|
| |
4
|
|
| |
5
|
Watkins, C. J. C. H. "Learning from Delayed Rewards." PhD thesis, Cambridge University, (1989).
|
|