ACM Home Page
Please provide us with feedback. Feedback
Sequencing a genome by walking with clone-end sequences (abstract): a mathematical analysis
Full text PdfPdf (44 KB)
Source Annual Conference on Research in Computational Molecular Biology archive
Proceedings of the fourth annual international conference on Computational molecular biology table of contents
Tokyo, Japan
Page: 45  
Year of Publication: 2000
ISBN:1-58113-186-0
Authors
Serafim Batzoglou  Laboratory for Computer Science, MIT, Cambridge, MA
Bonnie Berger  Mathematics Department, MIT, Cambridge, MA
Jill Mesirov  Whitehead Institute for Biomedical Research, Nine Cambridge Center, Cambridge, MA
Eric S. Lander  Department of Biology, Massachusetts Institute of Technology, Cambridge, MA
Sponsor
SIGACT: ACM Special Interest Group on Algorithms and Computation Theory
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 2,   Downloads (12 Months): 7,   Citation Count: 0
Additional Information:

abstract   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/332306.332325
What is a DOI?

ABSTRACT

One important approach to sequencing a large genome is (i) to sequence a collection of non-overlapping `seed' chosen from a genomic library of large-insert clones (such as bacterial artificial chromosome (BACs)) and then (ii) to take successive `walking' steps by selecting and sequencing minimally overlapping clones, using information such as clone-end sequences to identify the overlaps. We analyze the strategic issues involved in using this approach. We derive formulas showing how two key factors, the initial density of seed clones and the depth of the genomic library used for walking, affect the cost and time of a sequencing project—that is, the amount of redundant sequencing and the number of steps to cover the vast majority of the genome. We also discuss a variant strategy in which a second genomic library with clones having a somewhat smaller insert size is used to close gaps. This approach can dramatically decrease the amount of redundant sequencing, without affecting the rate at which the genome is covered.

Collaborative Colleagues:
Serafim Batzoglou: colleagues
Bonnie Berger: colleagues
Jill Mesirov: colleagues
Eric S. Lander: colleagues