Modelling Protein Evolution

Olivier Gascuel

Communication Dans Un Congrès Année : 2009

Modelling Protein Evolution

(1)

Olivier Gascuel

Fonction : Auteur correspondant
PersonId : 938491
IdHAL : olivier-gascuel
ORCID : 0000-0002-9412-9723

Connectez-vous pour contacter l'auteur

Méthodes et Algorithmes pour la Bioinformatique

Résumé

Amino-acid substitution models are essential in most methods to infer phylogenies from protein data. These models represent the ways in which proteins evolve and substitutions accumulate along the course of time. It is widely accepted that the substitution processes vary depending on the structural configuration of the protein residues. However, this information is not (or is rarely) used in phylogenetic studies, though the three-dimensional structure of dozens of thousands of proteins has been elucidated. Here we reinvestigate the question in order to fill this gap. We use an improved estimation methodology and a very large database comprising 1,471 non-redundant globular protein alignments with structural annotations to estimate new amino-acid substitution models accounting for the secondary structure and solvent accessibility of the residues. These models incorporate a confidence coefficient which is estimated from the data and reflects the reliability of structural annotations in the analyzed sequences. Our results with 300 independent test alignments show an impressive likelihood gain, compared to standard models such as JTT or WAG. Moreover, the use of these models induces significant topological changes in the inferred trees, which should be of primary interest to phylogeneticists. Our data, models and software are available for download from http://atgc.lirmm.fr/phyml-structure/.

Domaines

Bio-informatique [q-bio.QM] Bio-Informatique, Biologie Systémique [q-bio.QM] Evolution [q-bio.PE]

Olivier Gascuel : Connectez-vous pour contacter le contributeur

https://hal-lirmm.ccsd.cnrs.fr/lirmm-00511807

Soumis le : jeudi 26 août 2010-11:54:44

Dernière modification le : vendredi 24 mars 2023-14:52:53

Dates et versions

lirmm-00511807 , version 1 (26-08-2010)

Identifiants

HAL Id : lirmm-00511807 , version 1

Citer

Olivier Gascuel. Modelling Protein Evolution. Darwin 200 South American Celebration, Uruguay. ⟨lirmm-00511807⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS MAB LIRMM MIPS UNIV-MONTPELLIER

70 Consultations

0 Téléchargements

Modelling Protein Evolution

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager