ACM Home Page
Please provide us with feedback. Feedback
The communication and streaming complexity of computing the longest common and increasing subsequences
Full text PdfPdf (468 KB)
Source Symposium on Discrete Algorithms archive
Proceedings of the eighteenth annual ACM-SIAM symposium on Discrete algorithms table of contents
New Orleans, Louisiana
Pages: 336 - 345  
Year of Publication: 2007
ISBN:978-0-898716-24-5
Authors
Xiaoming Sun  Tsinghua University
David P. Woodruff  M.I.T.
Sponsors
: SIAM Activity Group on Discrete Mathematics
SIGACT: ACM Special Interest Group on Algorithms and Computation Theory
Publisher
Society for Industrial and Applied Mathematics  Philadelphia, PA, USA
Bibliometrics
Downloads (6 Weeks): 12,   Downloads (12 Months): 55,   Citation Count: 2
Additional Information:

abstract   references   cited by   collaborative colleagues  

Tools and Actions: Review this Article  

ABSTRACT

We consider the communication complexity of finding the longest increasing subsequence (LIS) of a string shared between two parties. We prove tight bounds for the space complexity of randomized one-pass streaming algorithms for this problem. Our bounds are parameterized in terms of the LIS of the inputs. This resolves an open question in [19]. We also give the first bounds for approximating the LIS and its length.

Next, we consider the communication complexity of finding the longest common subsequece (LCS) of two strings held by different parties, as well as the problem of approximating its length. We improve the existing lower bounds for these problems, even in the most difficult case when both parties have a permutation of N symbols. Our results yield tight space bounds for multipass deterministic streaming algorithms. For randomized mutlipass algorithms, our bounds are tight up to a logarithmic factor.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
D. Aldous and P. Diaconis. Longest increasing subsequences: from patience sorting to the Baik-Deift-Johansson theorem, Bull. Amer. Math. Soc., 36, 1999, pp. 413--432.
2
 
3
S. F. Altschul, W. Gish, W. Miller, E. W. Myers, and D. J. Lipman. Basic local alignment search tool. Journal of Molecular Biology, 215:403--410, 1990.
 
4
A. Apostolico and C. Guerra. The longest common subsequence problem revisited. Algorithmica, 2:315--336, 1987.
 
5
J. Baik, P. Deift, and K. Johansson. On the distribution of the length of the longest increasing subsequence of random permutations, J. Amer. Math. Soc. 12 (1999), no. 4, 1119--1178.
 
6
A. Banerjee and J. Ghosh. Clickstream clustering using weighted longest common subsequence. In: ICM Workshop on Web Mining, 2001.
 
7
 
8
P. Deift. Integrable systems and combinatorial theory. Notices Amer. Math. Soc., 47, 2000, pp. 631--640.
 
9
A. L. Delcher, S. Kasif, R. D. Fleischmann, J. Peterson, O. White, and S. L. Salzberg. Alignment of whole genomes. Nucleic Acides Research, 27(11):2369--2376, 1999.
 
10
M. L. Fredman. On computing the length of the longest increasing subsequences.. In Discrete Mathematics 11, 1975, pp. 29--35.
 
11
 
12
 
13
M. R. Henzinger, P. Raghavan, and S. Rajagopalon. Computing on data streams. Technical Report 1998--011, Digital Equipment Corp., Systems Res. Center, 1998.
14
15
 
16
 
17
 
18
 
19
D. Liben-Nowell, E. Vee, and A. Zhu. Finding Longest Increasing and Common Subsequences in Streaming Data. In COCOON, 2005.
 
20
F. J. MacWilliams and N. J. A. Sloane. The Theory of Error-Correcting Codes, North-Holland, Amsterdam, 1977.
 
21
 
22
 
23
A. Razborov. On the distributional complexity of dis-jointness. JCSS 28, 1984.
 
24
D. Sankoff and J. Kruskal. Time warps, string edits, and macromolecules: The theory and practice of sequence comparison, Addison-Wesley, 1983.
 
25
C. Schensted. Longest increasing and decreasing subsequences. Canadian Journal of Mathematics, 13:179--191, 1961.
 
26
A. C.-C. Yao. Lower bounds by probabilistic arguments. In FOCS, 1983.
 
27
H. Zhang. Alignment of BLAST high-scoring segment pairts based on the longest increasing subsequence algorithm. Bioinformatics, 19(11):1391--1396, 2003.

Collaborative Colleagues:
Xiaoming Sun: colleagues
David P. Woodruff: colleagues