A Simple, Fast, and Accurate Method to Estimate Large Phylogenies by Maximum Likelihood

Stéphane Guindon 1 Olivier Gascuel 1, *
* Auteur correspondant
1 MAB - Méthodes et Algorithmes pour la Bioinformatique
LIRMM - Laboratoire d'Informatique de Robotique et de Microélectronique de Montpellier
Abstract : The increase in the number of large data sets and the complexity of current probabilistic sequence evolution models necessitates fast and reliable phylogeny reconstruction methods. We describe a new approach, based on the maximumlikelihood principle, which clearly satisfies these requirements. The core of this method is a simple hill-climbing algorithm that adjusts tree topology and branch lengths simultaneously. This algorithm starts from an initial tree built by a fast distance-based method and modifies this tree to improve its likelihood at each iteration. Due to this simultaneous adjustment of the topology and branch lengths, only a few iterations are sufficient to reach an optimum.We used extensive and realistic computer simulations to show that the topological accuracy of this new method is at least as high as that of the existing maximum-likelihood programs and much higher than the performance of distance-based and parsimony approaches. The reduction of computing time is dramatic in comparison with other maximum-likelihood packages, while the likelihood maximization ability tends to be higher. For example, only 12 min were required on a standard personal computer to analyze a data set consisting of 500 rbcL sequences with 1,428 base pairs fromplant plastids, thus reaching a speed of the same order as some popular distance-based and parsimony algorithms. This new method is implemented in the PHYML program, which is freely available on our web page: http://www.lirmm.fr/w3ifa/MAAS/.
Type de document :
Article dans une revue
Systematic Biology, Oxford University Press (OUP), 2003, 52 (5), pp.696-704. 〈10.1080/10635150390235520〉
Liste complète des métadonnées

https://hal-lirmm.ccsd.cnrs.fr/lirmm-00191949
Contributeur : Christine Carvalho de Matos <>
Soumis le : lundi 26 novembre 2007 - 11:43:37
Dernière modification le : jeudi 24 mai 2018 - 15:59:22
Document(s) archivé(s) le : lundi 12 avril 2010 - 05:04:59

Fichier

D152.PDF
Fichiers produits par l'(les) auteur(s)

Identifiants

Collections

Citation

Stéphane Guindon, Olivier Gascuel. A Simple, Fast, and Accurate Method to Estimate Large Phylogenies by Maximum Likelihood. Systematic Biology, Oxford University Press (OUP), 2003, 52 (5), pp.696-704. 〈10.1080/10635150390235520〉. 〈lirmm-00191949〉

Partager

Métriques

Consultations de la notice

382

Téléchargements de fichiers

2140