ACM Home Page
Please provide us with feedback. Feedback
General methodology 1: optimising discrete event simulation models using a reinforcement learning agent
Full text PdfPdf (355 KB)
Source Winter Simulation Conference archive
Proceedings of the 34th conference on Winter simulation: exploring new frontiers table of contents
San Diego, California
SESSION: General applications and methodology table of contents
Pages: 1945 - 1950  
Year of Publication: 2002
ISBN:0-7803-7615-3
Authors
Douglas C. Creighton  Deakin University, Geelong, Victoria, Australia
Saeid Nahavandi  Deakin University, Geelong, Victoria, Australia
Sponsors
IEEE/CS : Institute of Electrical and Electronics Engineers/Computer Society
ASA : American Statistical Association
IEEE/SMCS : Institute of Electrical and Electronics Engineers/Systems, Man, and Cybernetics Society
INFORMS/CS : Institute for Operations Research and the Management Sciences/College on Simulation
NIST : National Institute of Standards and Technology
ACM: Association for Computing Machinery
(SCS) : The Society for Modeling and Simulation International
SIGSIM: ACM Special Interest Group on Simulation and Modeling
IIE : Institute of Industrial Engineers
Publisher
Winter Simulation Conference 
Bibliometrics
Downloads (6 Weeks): 1,   Downloads (12 Months): 12,   Citation Count: 1
Additional Information:

abstract   references   cited by   collaborative colleagues  

Tools and Actions: Review this Article  

ABSTRACT

A reinforcement learning agent has been developed to determine optimal operating policies in a multi-part serial line. The agent interacts with a discrete event simulation model of a stochastic production facility. This study identifies issues important to the simulation developer who wishes to optimise a complex simulation or develop a robust operating policy. Critical parameters pertinent to 'tuning' an agent quickly and enabling it to rapidly learn the system were investigated.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
Ashkin, R. G., and C. R. Standridge. 1993. Modelling and analysis of Manufacturing Systems Cambridge, MA: The MIT Press.
 
2
Aydin, M. E., and E. Öztemel. 2000. Dynamic jobshop scheduling using reinforcement learning agents. Robotics and Autonomous Systems 33: 169--178.
 
3
Campbell, P., P. Hodgson, and M. Cardew-Hall. 2001. Optimisation of Production Decisions with Stochastic Process Failures. Proceedings of the 16th International Conference On Production Research.
 
4
Das, T. K., and S. Sarkar. 1999. Optimal preventitive maintenance in a production inventory system. IIE Transactions 31: 537--551.
 
5
Harp, S. A., and T. Samad. 2000. Adaptive agents and artificial life: Insights for the power industry. Soft Computing and Intelligent Systems: Theory and Applications. eds. Sinha, N. K., and M. G. Madan, San Diego: Academic Press.
 
6
Jeong, K. 2000. Conceptual frame for development of optimized simulation-based scheduling systems. Expert Systems with Applications 18: 299--306.
 
7
Kaelbling, L. P., M. L. Littman, and A W. Moore. 1996. Reinforcement learning: a survey Journal of Artificial Intelligence Research 4: 237--285.
 
8
 
9
Mahadevan, S., N. Marchalleck, T. K. Das, and A. Gosavi. 1997. Self-improving factory simulation using continuous-time average-reward reinforcement learning. Proceedings of the 14th International Conference on Machine Learning 202--210.
 
10
 
11
Paternina-Arboleda, C. D., and T. K. Das. 2001. Intelligent dynamic control policies for serial production lines. IIE Transactions 33: 65--77.
 
12
 
13
 
14
 
15
Zhang, W., and T. G. Dietterich. 1995. A reinforcement learning approach to job-shop scheduling. Proceedings of the Fourteenth International Joint Conference on Artificial Intelligence. 1114--1120.

Collaborative Colleagues:
Douglas C. Creighton: colleagues
Saeid Nahavandi: colleagues