| Exceeding expectations and clustering uncertain data |
| Full text |
Pdf
(537 KB)
|
Source
|
Symposium on Principles of Database Systems
archive
Proceedings of the twenty-eighth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
table of contents
Providence, Rhode Island, USA
SESSION: Uncertain data
table of contents
Pages 269-278
Year of Publication: 2009
ISBN:978-1-60558-553-6
|
|
Authors
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 10, Downloads (12 Months): 59, Citation Count: 1
|
|
|
ABSTRACT
Database technology is playing an increasingly important role in understanding and solving large-scale and complex scientific and societal problems and phenomena, for instance, understanding biological networks, climate modeling, electronic markets, etc. In these settings, uncertainty or imprecise information is a pervasive issue that becomes a serious impediment to understanding and effectively utilizing such systems. Clustering is one of the key problems in this context. In this paper we focus on the problem of clustering, specifically the k-center problem. Since the problem is NP-Hard in deterministic setting, a natural avenue is to consider approximation algorithms with a bounded performance ratio. In an earlier paper Cormode and McGregor had considered certain variants of this problem, but failed to provide approximations that preserved the number of centers. In this paper we remedy the situation and provide true approximation algorithms for a wider class of these problems. However, the key aspect of this paper is to devise general techniques for optimization under uncertainty. We show that a particular formulation which uses the contribution of a random variable above its expectation is useful in this context. We believe these techniques will find wider applications in optimization under uncertainty.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
Barbara M. Anthony , Vineet Goyal , Anupam Gupta , Viswanath Nagarajan, A plant location guide for the unsure, Proceedings of the nineteenth annual ACM-SIAM symposium on Discrete algorithms, p.1164-1173, January 20-22, 2008, San Francisco, California
|
| |
2
|
Vijay Arya , Naveen Garg , Rohit Khandekar , Adam Meyerson , Kamesh Munagala , Vinayaka Pandit, Local Search Heuristics for k-Median and Facility Location Problems, SIAM Journal on Computing, v.33 n.3, p.544-562, 2004
[doi> 10.1137/S0097539702416402]
|
| |
3
|
|
 |
4
|
Moses Charikar , Sudipto Guha , Éva Tardos , David B. Shmoys, A constant-factor approximation algorithm for the k-median problem (extended abstract), Proceedings of the thirty-first annual ACM symposium on Theory of computing, p.1-10, May 01-04, 1999, Atlanta, Georgia, United States
[doi> 10.1145/301250.301257]
|
| |
5
|
Moses Charikar , Samir Khuller , David M. Mount , Giri Narasimhan, Algorithms for facility location problems with outliers, Proceedings of the twelfth annual ACM-SIAM symposium on Discrete algorithms, p.642-651, January 07-09, 2001, Washington, D.C., United States
|
 |
6
|
|
| |
7
|
|
| |
8
|
|
 |
9
|
|
| |
10
|
A. Goel, S. Guha, and K. Munagala. How to probe for an extreme value. ACM Trans. Algorithms (to appear), 2008. Preliminary version appeared in Proc. of the ACM Symp. on Principles of Database Systems (PODS), 2006.
|
| |
11
|
|
| |
12
|
|
 |
13
|
Anupam Gupta , Martin Pál , R. Ravi , Amitabh Sinha, Boosted sampling: approximation algorithms for stochastic optimization, Proceedings of the thirty-sixth annual ACM symposium on Theory of computing, June 13-16, 2004, Chicago, IL, USA
[doi> 10.1145/1007352.1007419]
|
| |
14
|
A. Gupta and M. Pál. Stochastic steiner trees without a root. Proc. of ICALP, pages 1051--1063, 2005.
|
| |
15
|
A. Gupta, M. Pál, R. Ravi, and A. Sinha. What about wednesday? Approximation algorithms for multistage stochastic optimization. Proc. of APPROX-RANDOM, pages 86--98, 2005.
|
| |
16
|
|
| |
17
|
D. Hochbaum and D. Shmoys. A best possible heuristic for the k-center problem. Mathematics of Operations Research, 10(2):180--184, May 1985.
|
| |
18
|
|
 |
19
|
|
| |
20
|
|
| |
21
|
A. Krause and C. Guestrin. Near-optimal nonmyopic value of information in graphical models. Twenty-first Conference on Uncertainty in Artificial Intelligence (UAI 2005), 2005.
|
 |
22
|
|
| |
23
|
|
| |
24
|
|
| |
25
|
|
|