|
ABSTRACT
Given a set of leaf-labeled trees with identical leaf sets, the well-known Maximum Agreement SubTree (MAST) problem consists in finding a subtree homeomorphically included in all input trees and with the largest number of leaves. MAST and its variant called Maximum Compatible Tree (MCT) are of particular interest in computational biology. This article presents a linear-time approximation algorithm to solve the complement version of MAST, namely identifying the smallest set of leaves to remove from input trees to obtain isomorphic trees. We also present an O(n2 + kn) algorithm to solve the complement version of MCT. For both problems, we thus achieve significantly lower running times than previously known algorithms. Fast running times are especially important in phylogenetics where large collections of trees are routinely produced by resampling procedures, such as the nonparametric bootstrap or Bayesian MCMC methods.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
| |
2
|
Berger-Wolf, T. 2004. Consensus and agreement of phylogenetic trees. In Proceedings of the 4th Workshop on Algorithms in Bioinformatics (WABI). Lecture Notes in Computer Science. Springer, 350--361.
|
| |
3
|
Berry, V., and Nicolas, F. 2004. Maximum agreement and compatible supertrees. In Proceedings of the 15th Annual Symposium on Combinatorial Pattern Matching (CPM'04), S. C. Sahinalp et al., Eds. Lecture Notes in Computer Science, vol. 3109. Springer, 205--219.
|
| |
4
|
|
| |
5
|
Bryant, D. 1997. Building trees, hunting for trees and comparing trees: Theory and method in phylogenetic analysis. Ph.D. thesis, University of Canterbury, Department of Mathemathics.
|
| |
6
|
Bryant, D., Steel, M., and MacKenzie, A. 1983. The size of a maximum agreement subtree for random binary trees. In BioConsensus, M. Janowitz et al., Eds. DIMACS AMS, 55--66.
|
| |
7
|
|
| |
8
|
Downey, R. G., Fellows, M. R., and Stege, U. 1999. Computational tractability: The view from Mars. Bull. Eur. Assoc. Theor. Comput. Sci. 69, 73--97.
|
| |
9
|
|
| |
10
|
|
| |
11
|
|
| |
12
|
Guindon, S., and Gascuel, O. 2003. A simple, fast and accurate method to estimate large phylogenies by maximum-likelihood. Syst. Biol. 52, 5, 696--704.
|
| |
13
|
Gupta, A., and Nishimura, N. 1998. Finding largest subtrees and smallest supertrees. Algorithmica 21, 2, 183--210.
|
| |
14
|
Hamel, A. M., and Steel, M. A. 1996. Finding a maximum compatible tree is NP-hard for sequences and trees. Appl. Math. Lett. 9, 2, 55--59.
|
| |
15
|
|
| |
16
|
|
| |
17
|
|
| |
18
|
|
| |
19
|
McMorris, F., Meronik, D., and Neumann, D. 1983. A view of some consensus methods for trees. In Numerical Taxonomy, J. Felsenstein, Ed. Springer, 122--125.
|
| |
20
|
Nishimura, N., Ragde, P., and Thilikos, D. 2004. Smaller kernels for hitting set problems of constant arity. In International Workshop on Parameterized and Exact Computation (IWPEC). Lecture Notes in Computer Science, vol. 3162. 121--126.
|
| |
21
|
Semple, C., and Steel, M. 2003. Phylogenetics. Oxford Lecture Series in Mathematics and its Applications, vol. 24. Oxford University Press.
|
| |
22
|
|
|