ACM Home Page
Please provide us with feedback. Feedback
Overcoming the l1 non-embeddability barrier: algorithms for product metrics
Full text PdfPdf (577 KB)
Source Symposium on Discrete Algorithms archive
Proceedings of the Nineteenth Annual ACM -SIAM Symposium on Discrete Algorithms table of contents
New York, New York
Pages 865-874  
Year of Publication: 2009
Authors
Alexandr Andoni  MIT
Piotr Indyk  MIT
Robert Krauthgamer  Weizmann Institute of Science
Publisher
Society for Industrial and Applied Mathematics  Philadelphia, PA, USA
Bibliometrics
Downloads (6 Weeks): 9,   Downloads (12 Months): 53,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Review this Article  

ABSTRACT

A common approach for solving computational problems over a difficult metric space is to embed the "hard" metric into L1 which admits efficient algorithms and is thus considered an "easy" metric. This approach has proved successful or partially successful for important spaces such as the edit distance, but it also has inherent limitations: it is provably impossible to go below certain approximation for some metrics.

We propose a new approach, of embedding the difficult space into richer host spaces, namely iterated products of standard spaces like l1 and l∞. We show that this class is rich since it contains useful metric spaces with only a constant distortion, and, at the same time, it is tractable and admits efficient algorithms. Using this approach, we obtain for example the first nearest neighbor data structure with O(log log d) approximation for edit distance in non-repetitive strings (the Ulam metric). This approximation is exponentially better than the lower bound for embedding into L1. Furthermore, we give constant factor approximation for two other computational problems. Along the way, we answer positively a question posed in [Ajtai, Jayram, Kumar, and Sivakumar, STOC 2002]. One of our algorithms has already found applications for smoothed edit distance over 0--1 strings [Andoni and Krauthgamer, ICALP 2008].


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
2
3
 
4
A. Andoni. Approximate nearest neighbor problem in high dimensions. M. Eng. Thesis, Massachusetts Institute of Technology, June 2005.
 
5
A. Andoni, K. Do Ba, and P. Indyk. Block heavy hitters. MIT Technical Report MIT-CSAIL-TR-2008-024, 2008.
6
 
7
 
8
 
9
10
11
 
12
13
14
15
 
16
M. Charikar and R. Krauthgamer. Embedding the ulam metric into l1. Theory of Computing, 2(11):207--224, 2006.
17
 
18
19
20
 
21
 
22
 
23
 
24
 
25
 
26
27
 
28
 
29
 
30
P. Indyk and J. Matoušek. Discrete metric spaces. CRC Handbook of Discrete and Computational Geometry, 2nd edition, 2003.
31
 
32
P. Indyk and N. Thaper. Fast color image retrieval via embeddings. Workshop on Statistical and Computational Theories of Vision (at ICCV), 2003.
33
 
34
T. Jayram and D. Woodruff. Cascaded stream aggregates. Manuscript, 2008.
 
35
W. B. Johnson and J. Lindenstrauss, editors. Handbook of the geometry of Banach spaces. Vol. I. North-Holland Publishing Co., Amsterdam, 2001.
 
36
 
37
38
 
39
N. Linial. Finite metric spaces - combinatorics, geometry and algorithms. Proceedings of the International Congress of Mathematicians III, pages 573--586, 2002.
 
40
N. Linial, E. London, and Y. Rabinovich. The geometry of graphs and some of its algorithmic applications. Combinatorica, 15(2):215--245, 1995.
 
41
J. I. Marden. Analyzing and Modeling Rank Data. Monographs on Statistics and Applied Probability 64. CRC Press, 1995.
 
42
 
43
44
 
45

Collaborative Colleagues:
Alexandr Andoni: colleagues
Piotr Indyk: colleagues
Robert Krauthgamer: colleagues