ACM Home Page
Please provide us with feedback. Feedback
Constructing optimal policies for agents with constrained architectures
Full text PdfPdf (49 KB)
Source International Conference on Autonomous Agents archive
Proceedings of the second international joint conference on Autonomous agents and multiagent systems table of contents
Melbourne, Australia
POSTER SESSION: Posters table of contents
Pages: 974 - 975  
Year of Publication: 2003
ISBN:1-58113-683-8
Authors
Dmitri A. Dolgov  University of Michigan, Ann Arbor, MI
Edmund H. Durfee  University of Michigan, Ann Arbor, MI
Sponsors
SIGART: ACM Special Interest Group on Artificial Intelligence
ACM: Association for Computing Machinery
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 2,   Downloads (12 Months): 15,   Citation Count: 1
Additional Information:

references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/860575.860752
What is a DOI?

REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
J. Abadie, editor. Nonlinear Programming. North-Holland, 1967.
 
2
E. Altman. Constrained Markov Decision Processes. Chapman and HALL/CRC, 1999.
 
3
D'Epenoux. A probabilistic production and inventory problem. Management Science, 10:98--108, 1963.
 
4
D. A. Dolgov and E. H. Durfee. Approximating optimal policies for agents with limited execution resources. In Proceedings of IJCAI-2003, 2003.
 
5
D. A. Dolgov and E. H. Durfee. Constructing optimal polcies for agents with constrained architecture. Technical Report University of Michigan CSE-TR-476-03, 2003.
 
6
 
7
L. Kallenberg. Linear Programming and Finite Markovian Control Problems. Math. Centrum, Amsterdam, 1983.
 
8
M. L. Littman, T. L. Dean, and L. P. Kaelbling. On the complexity of solving Markov decision problems. In UAI--95, pages 394--402, Montreal, 1995.
 
9
D. Musliner, E. Durfee, and K. Shin. CIRCA: A cooperative intelligent real time control architecture. IEEE Transactions on Systems, Man, and Cybernetics, 23(6):1561--1574, - 1993.
 
10
 
11
M. L. Puterman. Markov Decision Processes. John Wiley & Sons, New York, 1995.
 
12
K. Ross and R. Varadarajan. Markov decision processes with sample path constraints: the communicating case. OR, 37:780--790, 1989.
 
13
 
14
M. Sobel. Maximal mean/standard deviation ratio in undiscounted mdp. OR Letters, 4:157--188, 1985.


Collaborative Colleagues:
Dmitri A. Dolgov: colleagues
Edmund H. Durfee: colleagues