|
ABSTRACT
Recent work in transfer learning has succeeded in making reinforcement learning algorithms more efficient by incorporating knowledge from previous tasks. However, such methods typically must be provided either a full model of the tasks or an explicit relation mapping one task into the other. An autonomous agent may not have access to such high-level information, but would be able to analyze its experience to find similarities between tasks. In this paper we introduce Modeling Approximate State Transitions by Exploiting Regression (MASTER), a method for automatically learning a mapping from one task to another through an agent's experience. We empirically demonstrate that such learned relationships can significantly improve the speed of a reinforcement learning algorithm in a series of Mountain Car tasks. Additionally, we demonstrate that our method may also assist with the difficult problem of task selection for transfer.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
A. Agnar and P. Enric. Case-based reasoning: Foundational issues, methodological variations, and system approaches, 1994.
|
| |
2
|
|
| |
3
|
|
 |
4
|
|
| |
5
|
M. Genesereth and N. Love. General game playing: Overview of the AAAI competition. AI Magazine, 26(2), 2005.
|
| |
6
|
|
| |
7
|
Y. Liu and P. Stone. Value-function-based transfer for reinforcement learning using structure mapping. In Proc. of the 21st National Conf. on Artificial Intelligence, July 2006.
|
| |
8
|
R. Maclin, J. Shavlik, L. Torrey, T. Walker, and E. Wild. Giving advice about preferred actions to reinforcement learners via knowledge-based kernel regression. In Proceedings of the 20th National Conference on Artificial Intelligence, 2005.
|
| |
9
|
|
| |
10
|
M. J. D. Powell. An efficient method for finding the minimum of a function of several variables without calculating derivatives. Computer Journal, 7:155--162, 1964.
|
| |
11
|
|
| |
12
|
G. Rummery and M. Niranjan. On-line Q-learning using connectionist systems. Technical Report CUED/F-INFENG-RT 116, Engineering Department, Cambridge University, 1994.
|
| |
13
|
M. Sharma, M. Holmes, J. C. Santamaria, A. Irani, C. Isbell, and A. Ram. Transfer learning in real-time strategy games using hybrid cbr/rl. In Proceedings of the Twentieth International Joint Conference on Artificial Intelligence, 2007.
|
| |
14
|
|
| |
15
|
V. Soni and S. Singh. Using homomorphisms to transfer options across continuous reinforcement learning domains. In Proc. of the Twenty First National Conf. on Artificial Intelligence, July 2006.
|
| |
16
|
|
| |
17
|
|
| |
18
|
E. Talvitie and S. Singh. An experts algorithm for transfer learning. In Proceedings of the Twentieth International Joint Conference on Artificial Intelligence, 2007.
|
 |
19
|
|
| |
20
|
|
 |
21
|
|
| |
22
|
L. Torrey, T. Walker, J. W. Shavlik, and R. Maclin. Using advice to transfer knowledge acquired in one reinforcement learning task to another. In The 16th European Conf. on Machine Learning, 2005.
|
| |
23
|
|
|