|
ABSTRACT
We investigate ways in which an algorithm can improve its expected performance by fine-tuning itself automatically with respect to an arbitrary, unknown input distribution. We give such self-improving algorithms for sorting and clustering. The highlights of this work: (i) a sorting algorithm with optimal expected limiting running time; and (ii) a k-median algorithm over the Hamming cube with linear expected limiting running time. In all cases, the algorithm begins with a learning phase during which it adjusts itself to the input distribution (typically in a logarithmic number of rounds), followed by a stationary regime in which the algorithm settles to its optimized incarnation.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
 |
2
|
|
| |
3
|
|
| |
4
|
N. Alon and J. Spencer. The probabilistic method. John Wiley, 2nd edition, 2000.
|
| |
5
|
|
| |
6
|
|
| |
7
|
|
 |
8
|
|
 |
9
|
|
| |
10
|
J. R. Bitner. Heuristics that dynamically organize data structures, SIAM J. Comput., 8:82--110, 1979.
|
| |
11
|
A. Blum. On-line algorithms in machine learning, Online Algorithms: The state of the art, 1441, 1998.
|
| |
12
|
|
| |
13
|
|
 |
14
|
Nicolò Cesa-Bianchi , Yoav Freund , David Haussler , David P. Helmbold , Robert E. Schapire , Manfred K. Warmuth, How to use expert advice, Journal of the ACM (JACM), v.44 n.3, p.427-485, May 1997
[doi> 10.1145/258128.258179]
|
 |
15
|
W. Fernandez de la Vega , Marek Karpinski , Claire Kenyon , Yuval Rabani, Approximation schemes for clustering problems, Proceedings of the thirty-fifth annual ACM symposium on Theory of computing, June 09-11, 2003, San Diego, CA, USA
[doi> 10.1145/780542.780550]
|
| |
16
|
|
 |
17
|
|
| |
18
|
|
| |
19
|
Fredman, M. L. How good is the information theory bound in sorting?, Theoretical Computer Science 1 (1976), 355--361.
|
| |
20
|
Y. Freund and R. E. Schapire. Adaptive game playing using multiplicative weights, Games and Economic Behavior, 29:79--103, 1999.
|
| |
21
|
G. H. Gonnet, J. I. Munro, and H. Suwanda. Exegesis of self-organizing linear search, SIAM J. Comput., 10:613--637, 1981.
|
| |
22
|
|
 |
23
|
|
 |
24
|
|
 |
25
|
|
 |
26
|
Mary Inaba , Naoki Katoh , Hiroshi Imai, Applications of weighted Voronoi diagrams and randomization to variance-based k-clustering: (extended abstract), Proceedings of the tenth annual symposium on Computational geometry, p.332-339, June 06-08, 1994, Stony Brook, New York, United States
[doi> 10.1145/177424.178042]
|
| |
27
|
A. Kalai and S. Vempala. Efficient algorithms for the online decision problem, In Proc. 16 COLT, 2003.
|
| |
28
|
|
| |
29
|
|
 |
30
|
Jon Kleinberg , Christos Papadimitriou , Prabhakar Raghavan, Segmentation problems, Proceedings of the thirtieth annual ACM symposium on Theory of computing, p.473-482, May 24-26, 1998, Dallas, Texas, United States
[doi> 10.1145/276698.276860]
|
| |
31
|
|
| |
32
|
|
| |
33
|
|
| |
34
|
|
| |
35
|
J. McCabe. On serial files with relocatable records, Operations Research, 13:609--618, 1965.
|
| |
36
|
B. McMahan and A. Blum. Online geometric optimization in the bandit setting against an adaptive adversary, In Proc. 17th COLT, 2004.
|
| |
37
|
|
| |
38
|
Nina Mishra , Dan Oblinger , Leonard Pitt, Sublinear time approximate clustering, Proceedings of the twelfth annual ACM-SIAM symposium on Discrete algorithms, p.439-447, January 07-09, 2001, Washington, D.C., United States
|
| |
39
|
|
 |
40
|
|
 |
41
|
|
| |
42
|
Albers S. and M. Mitzenmacher. Average case analyses of list update algorithms, Algorithmica, 21:312--329, 1998.
|
 |
43
|
|
 |
44
|
|
| |
45
|
F. Takens. Detecting strange attractors in turbulence, Dynamical Systems and Turbulence, 898:366--381, 1981.
|
| |
46
|
|
REVIEW
"Apostolos N Papadopoulos : Reviewer"
Sorting and clustering are considered very important operations. As a consequence, the existence of efficient algorithms for these operations is of vital importance in several scientific fields, such as database systems and data mining. The author
more...
|