ACM Home Page
Please provide us with feedback. Feedback
Rule value reinforcement learning for cognitive agents
Full text PdfPdf (230 KB)
Source International Conference on Autonomous Agents archive
Proceedings of the fifth international joint conference on Autonomous agents and multiagent systems table of contents
Hakodate, Japan
SESSION: Learning and evolution table of contents
Pages: 792 - 794  
Year of Publication: 2006
ISBN:1-59593-303-4
Authors
Chris Child  City University, London, UK
Kostas Stathis  City University, London, UK
Sponsors
IFMAS : The International Foundation for Multiagent Systems
ATAL : The International Workshop on Agent Theories, Architectures, and Languages
SIGART: ACM Special Interest Group on Artificial Intelligence
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 5,   Downloads (12 Months): 30,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1160633.1160773
What is a DOI?

ABSTRACT

RVRL (Rule Value Reinforcement Learning) is a new algorithm which extends an existing learning framework that models the environment of a situated agent using a probabilistic rule representation. The algorithm attaches values to learned rules by adapting reinforcement learning. Structure captured by the rules is used to form a policy. The resulting rule values represent the utility of taking an action if the rule's conditions are present in the agent's current percept. Advantages of the new framework are demonstrated, through examples in a predator-prey environment.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
Child, C. and Stathis, K. "Learning to Act with RVRL agents". City University Technical Report. (2006).
 
2
Child, C. and Stathis, K. "The Apriori Stochastic Dependency Detection (ASDD) Algorithm for Learning Stochastic Logic Rules", In Proceedings of the 4th International Workshop on Computation Logic in Multi-agent Systems (CLIMA-04), J. Dix, J. Leiter (Eds), Florida, Jan. (2004).
 
3
Pasula, H. M. Zettlemoyer, L. S. and Kaelbling, L. P. "Learning Probabilistic Relational Planning Rules." Proceedings of the Fourteenth International Conference on Automated Planning and Scheduling, ICAPS, 73--82, (2004).
 
4
 
5
Watkins, C. J. C. H. "Learning from Delayed Rewards." PhD thesis, Cambridge University, (1989).

Collaborative Colleagues:
Chris Child: colleagues
Kostas Stathis: colleagues