ACM Home Page
Please provide us with feedback. Feedback
Sampling algorithms: lower bounds and applications
Full text PdfPdf (276 KB)
Source Annual ACM Symposium on Theory of Computing archive
Proceedings of the thirty-third annual ACM symposium on Theory of computing table of contents
Hersonissos, Greece
Pages: 266 - 275  
Year of Publication: 2001
ISBN:1-58113-349-9
Authors
Ziv Bar-Yossef  Computer Science Division, U.C. Berkeley, Berkeley, CA
Ravi Kumar  IBM Almaden, 650 Harry Road, San Jose, CA
D. Sivakumar  IBM Almaden, 650 Harry Road, San Jose, CA
Sponsor
SIGACT: ACM Special Interest Group on Algorithms and Computation Theory
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 5,   Downloads (12 Months): 54,   Citation Count: 15
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/380752.380810
What is a DOI?

ABSTRACT

We develop a framework to study probabilistic sampling algorithms that approximate general functions of the form \genfunc, where \domain and \range are arbitrary sets. Our goal is to obtain lower bounds on the query complexity of functions, namely the number of input variables x_i that any sampling algorithm needs to query to approximate f(x_1,\ldots,x_n).We define two quantitative properties of functions --- the it block sensitivity and the minimum Hellinger distance --- that give us techniques to prove lower bounds on the query complexity. These techniques are quite general, easy to use, yet powerful enough to yield tight results. Our applications include the mean and higher statistical moments, the median and other selection functions, and the frequency moments, where we obtain lower bounds that are close to the corresponding upper bounds.We also point out some connections between sampling and streaming algorithms and lossy compression schemes.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
 
3
 
4
J. O. Berger. Statistical Decision Theory and Bayesian Analysis. Springer-Verlag, 1985.
5
 
6
H. Buhrman and R. de Wolf. Complexity measures and decision tree complexity: A survey, 1999. Available at http://www.cwi.nl/rdewolf.
 
7
 
8
9
 
10
H. Chernoff. A measure of asymptotic efficiency for tests of a hypothesis based on the sum of observations. American Mathematical Society, 23:493-507, 1952.
 
11
 
12
P. Diaconis. Group Representation in Probability and Statistics. IMS Lecture Series 11, Institute of Mathematical Statistics, 1999.
 
13
 
14
15
 
16
O. Goldreich and D. Ron. On testing expansion in bounded-degree graphs. Electronic Colloquium on Computational Complexity (ECCC), 2000. TR00-020.
 
17
 
18
W. Hoeding. Probability inequalities for sums of bounded random variables. Journal of the American Statistical Association, 58:13-30, 1963.
 
19
 
20
 
21
L. Le Cam and G. Lo Yang. Asymptotics in Statistics - Some Basic Concepts, pages 24-30. Springer-Verlag, 1990.
22
 
23
 
24
25
 
26
D. Siegmund. Sequential Analysis - Tests and Confidence Intervals. Springer-Verlag, 1985.
27
 
28
 
29
 
30
A.-C. Yao. Probabilistic computations: toward a unified measure of complexity. InProceedings of the 18th IEEE Annual Symposium on Foundations of Computer Science (FOCS), pages 222-227, 1977.

CITED BY  15

Collaborative Colleagues:
Ziv Bar-Yossef: colleagues
Ravi Kumar: colleagues
D. Sivakumar: colleagues