| Resource allocation among agents with preferences induced by factored MDPs |
| Full text |
Pdf
(418 KB)
|
| Source
|
International Conference on Autonomous Agents
archive
Proceedings of the fifth international joint conference on Autonomous agents and multiagent systems
table of contents
Hakodate, Japan
SESSION: Agent planning and search
table of contents
Pages: 297 - 304
Year of Publication: 2006
ISBN:1-59593-303-4
|
|
Authors
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 4, Downloads (12 Months): 35, Citation Count: 2
|
|
|
ABSTRACT
Distributing scarce resources among agents in a way that maximizes the social welfare of the group is a computationally hard problem when the value of a resource bundle is not linearly decomposable. Furthermore, the problem of determining the value of a resource bundle can be a significant computational challenge in itself, such as for an agent operating in a stochastic environment, where the value of a resource bundle is the expected payoff of the optimal policy realizable given these resources. Recent work has shown that the structure in agents' preferences induced by stochastic policy-optimization problems (modeled as MDPs) can be exploited to solve the resource-allocation and the policy-optimization problems simultaneously, leading to drastic (often exponential) improvements in computational efficiency. However, previous work used a flat MDP model that scales very poorly. In this work, we present and empirically evaluate a resource-allocation mechanism that achieves much better scaling by using factored MDP models, thus exploiting both the structure in agents' MDP-induced preferences, as well as the structure within agents' MDPs.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
| |
2
|
R. Bellman. Adaptive Control Processes: A Guided Tour. Princeton University Press, 1961.
|
| |
3
|
C. Boutilier, R. Dearden, and M. Goldszmidt. Exploiting structure in policy construction. In Proceedings of the 14th International Joint Conference on AI, pages 1104--1111, 1995.
|
| |
4
|
|
| |
5
|
|
| |
6
|
|
| |
7
|
|
| |
8
|
D. A. Dolgov and E. H. Durfee. Optimal resource allocation and policy formulation in loosely-coupled Markov decision processes. In Proceedings of the 14th International Conference on Automated Planning and Scheduling, pages 315--324, 2004.
|
 |
9
|
|
| |
10
|
D. A. Dolgov and E. H. Durfee. Symmetric primal-dual approximate linear programming for factored MDPs. In Proc. of the Ninth Int. Symposium on AI and Math. (AI&M-06), January 2006.
|
| |
11
|
|
| |
12
|
C. Guestrin, D. Koller, R. Parr, and S. Venkataraman. Efficient solution algorithms for factored MDPs. Journal of AI Research, 19:399--468, 2003.
|
| |
13
|
M. I. Jordan. Graphical models. Statistical Science (Special Issue on Bayesian Stat), 19:140--155, 2004.
|
| |
14
|
|
 |
15
|
|
| |
16
|
Nicolas Meuleau , Milos Hauskrecht , Kee-Eung Kim , Leonid Peshkin , Leslie Pack Kaelbling , Thomas Dean , Craig Boutilier, Solving very large weakly coupled Markov decision processes, Proceedings of the fifteenth national/tenth conference on Artificial intelligence/Innovative applications of artificial intelligence, p.165-172, July 1998, Madison, Wisconsin, United States
|
| |
17
|
Relu Patrascu , Pascal Poupart , Dale Schuurmans , Craig Boutilier , Carlos Guestrin, Greedy linear value-approximation for factored Markov decision processes, Eighteenth national conference on Artificial intelligence, p.285-291, July 28-August 01, 2002, Edmonton, Alberta, Canada
|
| |
18
|
M. L. Puterman. Markov Decision Processes. John Wiley & Sons, New York, 1994.
|
| |
19
|
|
| |
20
|
P. Schweitzer and A. Seidmann. Generalized polynomial approximations in Markovian decision processes. Journal of Mathematical Analysis and Applications, 110:568--582, 1985.
|
|