ACM Home Page
Please provide us with feedback. Feedback
Approximating the true evolutionary distance between two genomes
Full text PdfPdf (2.03 MB)
Source Journal of Experimental Algorithmics (JEA) archive
Volume 12 ,  (June 2008) table of contents
SECTION: SECTION: 3 - Special Issue table of contents
Article No. 3.5  
Year of Publication: 2008
ISSN:1084-6654
Authors
Krister M. Swenson  EPFL, Lausanne, Switzerland
Mark Marron  University of New Mexico, Albuquerque, New Mexico
Joel V. Earnest-Deyoung  University of New Mexico, Albuquerque, New Mexico
Bernard M. E. Moret  EPFL and the Swiss Institute of Bioinformatics, Lausanne, Switzerland
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 7,   Downloads (12 Months): 84,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1227161.1402297
What is a DOI?

ABSTRACT

As more and more genomes are sequenced, evolutionary biologists are becoming increasingly interested in evolution at the level of whole genomes, in scenarios in which the genome evolves through insertions, duplications, deletions, and movements of genes along its chromosomes. In the mathematical model pioneered by Sankoff and others, a unichromosomal genome is represented by a signed permutation of a multiset of genes; Hannenhalli and Pevzner showed that the edit distance between two signed permutations of the same set can be computed in polynomial time when all operations are inversions. El-Mabrouk extended that result to allow deletions and a limited form of insertions (which forbids duplications); in turn we extended it to compute a nearly optimal edit sequence between an arbitrary genome and the identity permutation. In this paper we generalize our approach to compute distances between two arbitrary genomes, but focus on approximating the true evolutionary distance rather than the edit distance. We present experimental results showing that our algorithm produces excellent estimates of the true evolutionary distance up to a (high) threshold of saturation; indeed, the distances thus produced are good enough to enable the simple neighbor-joining procedure to reconstruct our test trees with high accuracy.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
Andelfinger, G., Hitte, C., Etter, L., Guyon, R., Bourque, G., Tesler, G., Pevzner, P., Kirkness, E., Galibert, F., and Benson, D. W. 2004. Detailed four-way comparative mapping and gene order analysis of the canine ctvm locus reveals evolutionary chromosome rearrangements. Genomics 83, 1053--1062.
 
2
Bader, D. A., Moret, B. M. E., and Yan, M. 2001. A fast linear-time algorithm for inversion distance with an experimental comparison. J. Comp. Biol. 8, 5, 483--491.
3
4
 
5
 
6
Downie, S. and Palmer, J. 1992. Use of chloroplast DNA rearrangements in reconstructing plant phylogeny. In Plant Molecular Systematics. P. Soltis, D. Soltis, and J. Doyle, eds. Chapman and Hall, London. 14--35.
 
7
Earnest-deyoung, J., Lerat, E., and Moret, B. M. E. 2004. Reversing gene erosion: reconstructing ancestral bacterial genomes from genecontent and geneorder data. In Proc. 4th Workshop on Algs. in Bioinformatics WABI'04, volume 3240 of Lecture Notes in Computer Science. Springer-Verlag, New York. 1--13.
 
8
9
 
10
Heard, S. B. 1996. Patterns in phylogenetic tree balance with variable and evolving speciation rates. Evolution 50, 2141--2148.
 
11
Li, W.-H. and Graur, D. 2000. Fundamentals of Molecular Evolution. Sinauer and Associates.
 
12
 
13
 
14
Moret, B. M. E., Tang, J., and Warnow, T. 2005. Reconstructing phylogenies from gene-content and gene-order data. In Mathematics of Evolution and Phylogeny. O. Gascuel, ed. Oxford University Press, Oxford. 321--352.
 
15
Nakhleh, L., Moret, B. M. E., Roshan, U., st. John, K., Sun, J., and Vvarnow, T. 2002. The accuracy of fast phylogenetic methods for large datasets In Proc. 7th Pacific Symp. on Biocomputing PSB'O2. World Scientific Pub. 211--222.
 
16
Olmstead, R. and Palmer, J. 1994. Chloroplast DNA systematics: a review of methods and data analysis. Amer. J. Bot. 81, 1205--1224.
 
17
Palmer, J. 1992. Chloroplast and mitochondrial genome evolution in land plants. In Cell Organelles. R. Herrmann, ed. Springer Verlag, New York. 99--133.
 
18
Pattengale, N. D., Swenson, K. M., and Moret, B. M. E. Approximation algorithms for orthology assignment from gene rearrangement data. Journal of Computer and Systems Sciences. Submitted.
 
19
Raubeson L. and Jansen, R. 1992. Chloroplast DNA evidence on the ancient evolutionary split in vascular land plants. Science 255, 1697--1699.
 
20
Robinson, D. R. and Foulds, L. R. 1981. Comparison of phylogenetic trees. Mathematical Biosciences, 53, 131--147.

Collaborative Colleagues:
Krister M. Swenson: colleagues
Mark Marron: colleagues
Joel V. Earnest-Deyoung: colleagues
Bernard M. E. Moret: colleagues