Accelerating Parallel Maximum Likelihood-Based Phylogenetic Tree Calculations Using Subtree Equality Vectors
Alexandros Stamatakis,Thomas Ludwig,Harald Meier,Marty J. Wolf +3 more
- 16 Nov 2002
- pp 1-16
TL;DR: This paper describes and uses Subtree Equality Vectors (SEVs) to reduce the number of required floating point operations during topology evaluation and integrated these optimizations into various sequential programs and into parallel fastDNAml, one of the most common and efficient parallel programs for calculating large phylogenetic trees.
read more
Abstract: Heuristics for calculating phylogenetic trees for a large sets of aligned rRNA sequences based on the maximum likelihood method are computationally expensive. The core of most parallel algorithms, which accounts for the greatest part of computation time, is the tree evaluation function, that calculates the likelihood value for each tree topology. This paper describes and uses Subtree Equality Vectors (SEVs) to reduce the number of required floating point operations during topology evaluation. We integrated our optimizations into various sequential programs and into parallel fastDNAml, one of the most common and efficient parallel programs for calculating large phylogenetic trees. Experimental results for our parallel program, which renders exactly the same output as parallel fastDNAml show global runtime improvements of 26% to 65%. The optimization scales best on clusters of PCs, which also implies a substantial cost saving factor for the determination of large trees.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
ARB: a software environment for sequence data
Wolfgang Ludwig,Oliver Strunk,Ralf Westram,Lothar Richter,Harald Meier,Yadhukumar,Arno Buchner,Tina Lai,Susanne Steppi,Gangolf Jobb,Wolfram Förster,Igor Brettske,Stefan Gerber,Anton W. Ginhart,Oliver Gross,Silke Grumann,Stefan Hermann,Ralf Jost,Andreas König,Thomas Liss,Ralph Lüßmann,Michael May,Björn Nonhoff,Boris Reichel,Robert Strehlow,Alexandros Stamatakis,Norbert Stuckmann,Alexander Vilbig,Michael Lenke,Thomas Ludwig,Arndt Bode,Karl-Heinz Schleifer +31 more
TL;DR: The ARB program package comprises a variety of directly interacting software tools for sequence database maintenance and analysis which are controlled by a common graphical user interface.
RAxML-III: a fast program for maximum likelihood-based inference of large phylogenetic trees
TL;DR: This paper presents the latest release of the program RAxML-III for rapid maximum likelihood-based inference of large evolutionary trees which allows for computation of 1.000-taxon trees in less than 24 hours on a single PC processor.
Estimating population parameters using the structured serial coalescent with Bayesian MCMC inference when some demes are hidden.
Greg Ewing,Allen G. Rodrigo +1 more
- 14 Feb 2007
TL;DR: This paper introduces a novel probabilistic method for prediction of protein-protein interactions using a new empirical probabilistic formula describing the loss of interactions between homologous proteins during the course of evolution.
319
Increasing the Efficiency of Searches for the Maximum Likelihood Tree in a Phylogenetic Analysis of up to 150 Nucleotide Sequences
TL;DR: A range of heuristic search strategies are compared at locating the ML tree for 20 empirical data sets with 14 to 158 sequences and 411 to 120,762 aligned nucleotides and a newly developed version of the likelihood ratchet using PAUP* successfully found the peaks of multiple islands, but its speed needs to be improved.
Algorithms, data structures, and numerics for likelihood-based phylogenetic inference of huge trees
TL;DR: A new search strategy is developed that can reduce the time required for tree inferences by more than 50% while yielding equally good trees for well-chosen starting trees and issues pertaining to the numerical stability of the Γ model of rate heterogeneity on very large trees are addressed.
References
Evolutionary trees from DNA sequences: A maximum likelihood approach
TL;DR: A computationally feasible method for finding such maximum likelihood estimates is developed, and a computer program is available that allows the testing of hypotheses about the constancy of evolutionary rates by likelihood ratio tests.
14.5K
fastDNAml: a tool for construction of phylogenetic trees of DNA sequences using maximum likelihood
TL;DR: A new tool, called fastDNAml, for constructing phylogenetic trees from DNA sequences, which is based on version 3.3 of Felsenstein's dnaml program and significantly improve performance and reduce memory usage, making it feasible to construct even very large trees.
1.1K
Top500 Supercomputer Sites
Jack Dongarra,Hans Werner Meuer,Erich Strohmaier +2 more
- 01 Nov 1997
TL;DR: This list lists the sites that have the 500 most powerful computer systems installed and the best Linpack benchmark performance achieved is used as a performance measure in ranking the computers.
TOP500 Supercomputer sites 11/2000
Hans W. Meuer,Erich Strohmaier,Jack Dongarra,Horst D. Simon +3 more
- 02 Nov 2000
TL;DR: This list lists the sites that have the 500 most powerful computer systems installed and the best Linpack benchmark performance achieved is used as a performance measure in ranking the computers.
A Fast Algorithm for the Computation and Enumeration of Perfect Phylogenies
Sampath Kannan,Tandy Warnow +1 more
TL;DR: It is shown how it is possible to efficiently build a structure that implicitly represents the set of all perfect phylogenies and to randomly sample from that set.
76