| Towards adaptive programming: integrating reinforcement learning into a programming language |
| Full text |
Pdf
(192 KB)
|
Source
|
Conference on Object Oriented Programming Systems Languages and Applications
archive
Proceedings of the 23rd ACM SIGPLAN conference on Object-oriented programming systems languages and applications
table of contents
Nashville, TN, USA
SESSION: Onward!
table of contents
Pages 603-614
Year of Publication: 2008
ISBN:978-1-60558-215-3
Also published in ...
|
|
Authors
|
|
Christopher Simpkins
|
Georgia Institute of Technology, Atlanta, GA, USA
|
|
Sooraj Bhat
|
Georgia Institute of Technology, Atlanta, GA, USA
|
|
Charles Isbell, Jr.
|
Georgia Institute of Technology, Atlanta, GA, USA
|
|
Michael Mateas
|
University of California, Santa Cruz, Santa Cruz, CA, USA
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 21, Downloads (12 Months): 193, Citation Count: 0
|
|
|
ABSTRACT
Current programming languages and software engineering paradigms are proving insufficient for building intelligent multi-agent systems--such as interactive games and narratives--where developers are called upon to write increasingly complex behavior for agents in dynamic environments. A promising solution is to build adaptive systems; that is, to develop software written specifically to adapt to its environment by changing its behavior in response to what it observes in the world. In this paper we describe a new programming language, An Adaptive Behavior Language (A2BL), that implements adaptive programming primitives to support partial programming, a paradigm in which a programmer need only specify the details of behavior known at code-writing time, leaving the run-time system to learn the rest. Partial programming enables programmers to more easily encode software agents that are difficult to write in existing languages that do not offer language-level support for adaptivity. We motivate the use of partial programming with an example agent coded in a cutting-edge, but non-adaptive agent programming language (ABL), and show how A2BL can encode the same agent much more naturally.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
David Andre and Stuart Russell. Programmable reinforcement learning agents. In Advances in Neural Information Processing Systems, volume 13, 2001.
|
| |
2
|
|
| |
3
|
Sooraj Bhat, Charles Isbell, and Michael Mateas. On the difficulty of modular reinforcement learning for real-world partial programming. In Proceedings of the Twenty-First National Conference on Artificial Intelligence (AAAI-06), Boston, MA, USA, July 2006.
|
| |
4
|
|
| |
5
|
Leslie Pack Kaelbling, Michael L. Littman, and Andrew P. Moore. Reinforcement learning: A survey. Journal of Artificial Intelligence Research}, 237--285, 1996.
|
| |
6
|
A. B. Loyall and J. Bates. Hap: A reactive adaptive architecture for agents. Technical Report CMU-CS-91-147, 1991.
|
| |
7
|
Michael Mateas and Andrew Stern. Facade: An experiment in building a fully-realized interactive drama. In Game Developers Conference: Game Design Track, San Jose, CA, March 2003.
|
| |
8
|
Michael Mateas and Andrew Stern. Life-like Characters. Tools, Affective Functions and Applications, chapter A Behavior Language: Joint Action and Behavioral Idioms. Springer, 2004.
|
| |
9
|
|
| |
10
|
Peter Norvig. Decision theory: The language of adaptive agent software. Presentation, March 1998. http://www.norvig.com/adaptive/index.htm
|
| |
11
|
|
| |
12
|
|
| |
13
|
|
| |
14
|
|
| |
15
|
Sprague, N., and Ballard, D. Multiple-Goal Reinforcement Learning with Modular Sarsa(0). In Proceedings of the Eighteenth International Joint Conference on Artificial Intelligence, 2003. Workshop paper.
|
|