ACM Home Page
Please provide us with feedback. Feedback
Stable distributions, pseudorandom generators, embeddings, and data stream computation
Full text PdfPdf (147 KB)
Source Journal of the ACM (JACM) archive
Volume 53 ,  Issue 3  (May 2006) table of contents
Pages: 307 - 323  
Year of Publication: 2006
ISSN:0004-5411
Author
Piotr Indyk  MIT, Cambridge, Massachusetts, Cambridge, MA
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 39,   Downloads (12 Months): 270,   Citation Count: 13
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1147954.1147955
What is a DOI?

ABSTRACT

In this article, we show several results obtained by combining the use of stable distributions with pseudorandom generators for bounded space. In particular:---We show that, for any p ∈ (0, 2], one can maintain (using only O(log n2) words of storage) a sketch C(q) of a point qlnp under dynamic updates of its coordinates. The sketch has the property that, given C(q) and C(s), one can estimate ‖qsp up to a factor of (1 + ε) with large probability. This solves the main open problem of Feigenbaum et al. [1999].---We show that the aforementioned sketching approach directly translates into an approximate algorithm that, for a fixed linear mapping A, and given x ∈ ℜn and y ∈ ℜm, estimates ‖Axyp in O(n + m) time, for any p ∈ (0, 2]. This generalizes an earlier algorithm of Wasserman and Blum [1997] which worked for the case p = 2.---We obtain another sketch function C′ which probabilistically embeds ln1 into a normed space lm1. The embedding guarantees that, if we set m = log(1/δ)O(1/ε), then for any pair of points q, sln1, the distance between q and s does not increase by more than (1 + ε) with constant probability, and it does not decrease by more than (1 − ε) with probability 1 − δ. This is the only known dimensionality reduction theorem for the l1 norm. In fact, stronger theorems of this type (i.e., that guarantee very low probability of expansion as well as of contraction) cannot exist [Brinkman and Charikar 2003].---We give an explicit embedding of ln2 into lnO(log n)1 with distortion (1 + 1/nΘ(1)).


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
2
 
3
 
4
 
5
Broder, A. 1998. Filtering near-duplicate documents. In Proceedings of FUN.
 
6
 
7
Chambers, J. M., Mallows, C. L., and Stuck, B. W. 1976. A method for simulating stable random variables. J. Amer. Statist. Assoc. 71, 340--344.
 
8
 
9
Cormode, G. 2003. Stable distributions for stream computations: It's as easy as 0,1,2. In Proceedings of the Workshop on Management and Processing of Data Streams.
 
10
Cormode, G., Datar, M., Indyk, P., and Muthukrishnan, S. 2002a. Comparing data streams using hamming norms. In Proceedings of the International Conference on Very Large Databases (VLDB).
 
11
 
12
Cormode, G., and Muthukrishnan, S. 2003. Estimating dominance norms of multiple data streams. In Proceedings of the European Symposium on Algorithms.
 
13
14
 
15
 
16
 
17
Feigenbaum, J., Ishai, Y., Malkin, T., Nissim, K., Strauss, M. J., and Wright, R. N. 2001b. Secure multiparty computation of approximations. http://eprint.iacr.org/2001/024/.
 
18
 
19
Figiel, T., Lindenstrauss, J., and Milman, V. D. 1977. The dimension of almost spherical sections of convex bodies. Acta Math. 139, 53--94.
 
20
Freivalds, R. 1979. Fast probabilistic algorithms. In Proceedings of the Mathematical Foundations of Computer Science. Lecture Notes in Computer Science, vol. 74, Springer-Verlag, New York.
21
 
22
Henzinger, M., Raghavan, P., and Rajagopalan, S. 1998. Computing on data streams. Technical Note 1998-011, Digital Systems Research Center, Palo Alto, CA.
 
23
 
24
25
 
26
27
28
 
29
Johnson, W., and Lindenstrauss, J. 1984. Extensions of lipshitz mapping into hilbert space. Contemp. Math. 26, 189--206.
 
30
Johnson, W., and Schechtman, G. 1982. Embedding lmp into ln1. Acta Math. 149, 71--85.
 
31
Lindenstrauss, J., and Milman, V. D. 1993. The local theory of normed spaces and its applications to convexity. In Handbook of Convex Geometry, P. M. Gruber and J. M. Wills, eds Elsevier, Amsterdam, The Netherlands, 1149--1220.
 
32
Linial, N., London, E., and Rabinovich, Y. 1994. The geometry of graphs and some of its algorithmic applications. In Proceedings of 35th Annual IEEE Symposium on Foundations of Computer Science. IEEE Computer Society Press, Los Alamitos, CA, 577--591.
 
33
Matoušek, J. 2004. Collection of open problems on low-distortion embeddings of finite metric spaces. Available at http://kam.mff.cuni.cz/~matousek/metrop.ps.gz|.
 
34
Maurer, A. 2003. A bound on the deviation probability for sums of non-negative random variables. J. Ineq. Pure Applied Math. 4, 1, Art. 15.
 
35
Muthukrishnan, S. 2003. Data streams: Algorithms and applications (invited talk at SODA'03). Available at http://athos.rutgers.edu/~muthu/stream-1-1.ps.
36
37
38
39
40
 
41
Zolotarev, V. 1986. One-Dimensional Stable Distributions. Vol. 65 of Translations of Mathematical Monographs, American Mathematical Society.

CITED BY  13