ACM Home Page
Please provide us with feedback. Feedback
Trace reconstruction with constant deletion probability and related results
Full text PdfPdf (501 KB)
Source Symposium on Discrete Algorithms archive
Proceedings of the nineteenth annual ACM-SIAM symposium on Discrete algorithms table of contents
San Francisco, California
Pages 389-398  
Year of Publication: 2008
Authors
Thomas Holenstein  Microsoft Research, Silicon Valley
Michael Mitzenmacher  Harvard School of Engineering and Applied Sciences, Cambridge, MA
Rina Panigrahy  Microsoft Research, Silicon Valley
Udi Wieder  Microsoft Research, Silicon Valley
Sponsors
: SIAM Activity Group on Discrete Mathematics
SIGACT: ACM Special Interest Group on Algorithms and Computation Theory
Publisher
Society for Industrial and Applied Mathematics  Philadelphia, PA, USA
Bibliometrics
Downloads (6 Weeks): 7,   Downloads (12 Months): 48,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Review this Article  

ABSTRACT

We provide several new results for the trace reconstruction problem. In this setting, a binary string yields a collection of traces, where each trace is independently obtained by independently deleting each bit with a fixed probability δ. Each trace therefore consists of a random subsequence of the original sequence. Given the traces, we wish to reconstruct the original string with high probability. The questions are how many traces are necessary for reconstruction, and how efficiently can the reconstruction be performed.

Our primary result is that for some universal constant γ and uniformly chosen strings of length n, for any δ < γ reconstruction is possible with poly(n) traces in poly(n) time with high probability. We also obtain algorithms that require a number of traces exponential in Õ (√n) for any δ < 1 even for worst case strings, and we derive lower bound results for simpler classes of algorithms based on summary statistics from the traces.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
H. Alzer. Sharp inequalities for the digamma and polygamma functions Forum Mathematicum, vol. 16, no. 2, pp. 181--221. 2004.
 
3
 
4
R. L. Dobrushin. Shannon's Theorems for Channels with Synchronization Errors. Problems of Information Transmission, 3(4):11--26, 1967. Translated from Problemy Peredachi Informatsii, vol. 3, no. 4, pp 18--36, 1967.
 
5
E. Drinea and M. Mitzenmacher. Improved lower bounds for the capacity of i.i.d. deletion and duplication channels. IEEE Trans. on Information Theory, 53:8, pp. 2693--2714, 2007.
 
6
S. Kannan and A. McGregor. More on reconstructing strings from random traces: insertions and deletions. In Proc. of the Int'l. Symp. on Information Theory, pp. 297--301, 2005.
 
7
V. I. Levenshtein. Efficient reconstruction of sequences. IEEE Transactions on Information Theory, vol. 47, no. 1, pp. 2--22, 2001.
 
8

Collaborative Colleagues:
Thomas Holenstein: colleagues
Michael Mitzenmacher: colleagues
Rina Panigrahy: colleagues
Udi Wieder: colleagues