|
ABSTRACT
The capabilities of an autonomous agent are often determined by the resources that are available to it. We examine the problem of allocating scarce resources among multiple self-interested agents, operating in complex, stochastic environments (modeled as MDPs), where the value of a particular resource bundle to an agent is the expected payoff of the best control policy the agent can implement using these resources. Combinatorial auctions are popular mechanisms for allocating resource bundles among agents with such complex preferences. In particular, generalized Vickrey auctions (GVAs) yield socially optimal outcomes and have excellent strategic complexity, but suffer from high computational complexity for the agents' preference-valuation and the auctioneer's winner-determination problems. We describe a GVA-based mechanism that exploits knowledge of the agents' MDP-based resource preferences, and we show analytically and demonstrate empirically that this leads to an exponential improvement in computational efficiency over a straightforward GVA implementation with flat preferences. We also present a distributed implementation of the winner-determination problem that leads to further speedup while maintaining strategy-proofness of the mechanism, and without revealing agents' private MDP information.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
| |
2
|
C. Boutilier, R. Dearden, and M. Goldszmidt. Exploiting structure in policy construction. In Proc. of the 14th Int. Joint Conf. on Artificial Intelligence, pages 1104--1111. Morgan Kaufmann, 1995.
|
| |
3
|
C. Boutilier and H. H. Hoos. Bidding languages for combinatorial auctions. In Proceedings of the Seventeenth International Joint Conference on Artificial Intelligence, pages 1211--1217, 2001.
|
| |
4
|
E. H. Clarke. Multipart pricing of public goods. Public Choice, 18:19--33, 1971.
|
| |
5
|
|
| |
6
|
D. A. Dolgov and E. H. Durfee. Optimal resource allocation and policy formulation in loosely-coupled Markov decision processes. In Proc. of the 14th Int. Conf. on Automated Planning and Scheduling, 2004.
|
| |
7
|
Donald F. Ferguson , Christos Nikolaou , Jakka Sairamesh , Yechiam Yemini, Economic models for allocating resources in computer systems, Market-based control: a paradigm for distributed resource allocation, World Scientific Publishing Co., Inc., River Edge, NJ, 1996
|
| |
8
|
T. Groves. Incentives in teams. Econometrica, 41(4):617--631, 1973.
|
| |
9
|
L. Kallenberg. Linear Programming and Finite Markovian Control Problems. Math. Centrum, Amsterdam, 1983.
|
| |
10
|
J. K. MacKie-Mason and H. Varian. Generalized Vickrey auctions. Technical report, University of Michigan, 1994.
|
| |
11
|
Nicolas Meuleau , Milos Hauskrecht , Kee-Eung Kim , Leonid Peshkin , Leslie Pack Kaelbling , Thomas Dean , Craig Boutilier, Solving very large weakly coupled Markov decision processes, Proceedings of the fifteenth national/tenth conference on Artificial intelligence/Innovative applications of artificial intelligence, p.165-172, July 1998, Madison, Wisconsin, United States
|
| |
12
|
|
| |
13
|
|
| |
14
|
M. L. Puterman. Markov Decision Processes. John Wiley & Sons, New York, 1994.
|
| |
15
|
|
| |
16
|
P. Schweitzer and A. Seidmann. Generalized polynomial approximations in Markovian decision processes. Journal of Math. Analysis and Applications, 110:568--582, 1985.
|
| |
17
|
|
| |
18
|
W. Vickrey. Counterspeculation, auctions and competitive sealed tenders. J. of Finance, 1961.
|
| |
19
|
M. Wellman, W. Walsh, P. Wurman, and J. MacKie-Mason. Auction protocols for decentralized scheduling. Games and Economic Behavior, 35:271--303, 2001.
|
| |
20
|
L. Wolsey. Integer Programming. John Wiley & Sons, 1998.
|
|