| General methodology 1: optimising discrete event simulation models using a reinforcement learning agent |
| Full text |
Pdf
(355 KB)
|
| Source
|
Winter Simulation Conference
archive
Proceedings of the 34th conference on Winter simulation: exploring new frontiers
table of contents
San Diego, California
SESSION: General applications and methodology
table of contents
Pages: 1945 - 1950
Year of Publication: 2002
ISBN:0-7803-7615-3
|
|
Authors
|
|
| Sponsors |
|
| Publisher |
Winter Simulation Conference
|
| Bibliometrics |
Downloads (6 Weeks): 0, Downloads (12 Months): 7, Citation Count: 1
|
|
|
ABSTRACT
A reinforcement learning agent has been developed to determine optimal operating policies in a multi-part serial line. The agent interacts with a discrete event simulation model of a stochastic production facility. This study identifies issues important to the simulation developer who wishes to optimise a complex simulation or develop a robust operating policy. Critical parameters pertinent to 'tuning' an agent quickly and enabling it to rapidly learn the system were investigated.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
Ashkin, R. G., and C. R. Standridge. 1993. Modelling and analysis of Manufacturing Systems Cambridge, MA: The MIT Press.
|
| |
2
|
Aydin, M. E., and E. Öztemel. 2000. Dynamic jobshop scheduling using reinforcement learning agents. Robotics and Autonomous Systems 33: 169--178.
|
| |
3
|
Campbell, P., P. Hodgson, and M. Cardew-Hall. 2001. Optimisation of Production Decisions with Stochastic Process Failures. Proceedings of the 16th International Conference On Production Research.
|
| |
4
|
Das, T. K., and S. Sarkar. 1999. Optimal preventitive maintenance in a production inventory system. IIE Transactions 31: 537--551.
|
| |
5
|
Harp, S. A., and T. Samad. 2000. Adaptive agents and artificial life: Insights for the power industry. Soft Computing and Intelligent Systems: Theory and Applications. eds. Sinha, N. K., and M. G. Madan, San Diego: Academic Press.
|
| |
6
|
Jeong, K. 2000. Conceptual frame for development of optimized simulation-based scheduling systems. Expert Systems with Applications 18: 299--306.
|
| |
7
|
Kaelbling, L. P., M. L. Littman, and A W. Moore. 1996. Reinforcement learning: a survey Journal of Artificial Intelligence Research 4: 237--285.
|
| |
8
|
|
| |
9
|
Mahadevan, S., N. Marchalleck, T. K. Das, and A. Gosavi. 1997. Self-improving factory simulation using continuous-time average-reward reinforcement learning. Proceedings of the 14th International Conference on Machine Learning 202--210.
|
| |
10
|
|
| |
11
|
Paternina-Arboleda, C. D., and T. K. Das. 2001. Intelligent dynamic control policies for serial production lines. IIE Transactions 33: 65--77.
|
| |
12
|
|
| |
13
|
|
| |
14
|
|
| |
15
|
Zhang, W., and T. G. Dietterich. 1995. A reinforcement learning approach to job-shop scheduling. Proceedings of the Fourteenth International Joint Conference on Artificial Intelligence. 1114--1120.
|
|