ACM Home Page
Please provide us with feedback. Feedback
Resource allocation among agents with preferences induced by factored MDPs
Full text PdfPdf (418 KB)
Source International Conference on Autonomous Agents archive
Proceedings of the fifth international joint conference on Autonomous agents and multiagent systems table of contents
Hakodate, Japan
SESSION: Agent planning and search table of contents
Pages: 297 - 304  
Year of Publication: 2006
ISBN:1-59593-303-4
Authors
Dmitri Dolgov  University of Michigan, Ann Arbor, MI
Edmund Durfee  University of Michigan, Ann Arbor, MI
Sponsors
IFMAS : The International Foundation for Multiagent Systems
ATAL : The International Workshop on Agent Theories, Architectures, and Languages
SIGART: ACM Special Interest Group on Artificial Intelligence
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 4,   Downloads (12 Months): 35,   Citation Count: 2
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1160633.1160684
What is a DOI?

ABSTRACT

Distributing scarce resources among agents in a way that maximizes the social welfare of the group is a computationally hard problem when the value of a resource bundle is not linearly decomposable. Furthermore, the problem of determining the value of a resource bundle can be a significant computational challenge in itself, such as for an agent operating in a stochastic environment, where the value of a resource bundle is the expected payoff of the optimal policy realizable given these resources. Recent work has shown that the structure in agents' preferences induced by stochastic policy-optimization problems (modeled as MDPs) can be exploited to solve the resource-allocation and the policy-optimization problems simultaneously, leading to drastic (often exponential) improvements in computational efficiency. However, previous work used a flat MDP model that scales very poorly. In this work, we present and empirically evaluate a resource-allocation mechanism that achieves much better scaling by using factored MDP models, thus exploiting both the structure in agents' MDP-induced preferences, as well as the structure within agents' MDPs.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
R. Bellman. Adaptive Control Processes: A Guided Tour. Princeton University Press, 1961.
 
3
C. Boutilier, R. Dearden, and M. Goldszmidt. Exploiting structure in policy construction. In Proceedings of the 14th International Joint Conference on AI, pages 1104--1111, 1995.
 
4
 
5
 
6
 
7
 
8
D. A. Dolgov and E. H. Durfee. Optimal resource allocation and policy formulation in loosely-coupled Markov decision processes. In Proceedings of the 14th International Conference on Automated Planning and Scheduling, pages 315--324, 2004.
9
 
10
D. A. Dolgov and E. H. Durfee. Symmetric primal-dual approximate linear programming for factored MDPs. In Proc. of the Ninth Int. Symposium on AI and Math. (AI&M-06), January 2006.
 
11
 
12
C. Guestrin, D. Koller, R. Parr, and S. Venkataraman. Efficient solution algorithms for factored MDPs. Journal of AI Research, 19:399--468, 2003.
 
13
M. I. Jordan. Graphical models. Statistical Science (Special Issue on Bayesian Stat), 19:140--155, 2004.
 
14
15
 
16
 
17
 
18
M. L. Puterman. Markov Decision Processes. John Wiley & Sons, New York, 1994.
 
19
 
20
P. Schweitzer and A. Seidmann. Generalized polynomial approximations in Markovian decision processes. Journal of Mathematical Analysis and Applications, 110:568--582, 1985.


Collaborative Colleagues:
Dmitri Dolgov: colleagues
Edmund Durfee: colleagues