A Site- and Time-Heterogeneous Model of Amino Acid Replacement

Abstract : We combined the category (CAT) mixture model (Lartillot N, Philippe H. 2004) and the nonstationary break point (BP) model (Blanquart S, Lartillot N. 2006) into a new model, CAT-BP, accounting for variations of the evolutionary process both along the sequence and across lineages. As in CAT, the model implements a mixture of distinct Markovian processes of substitution distributed among sites, thus accommodating site-specific selective constraints induced by protein structure and function. Furthermore, as in BP, these processes are nonstationary, and their equilibrium frequencies are allowed to change along lineages in a correlated way, through discrete shifts in global amino acid composition distributed along the phylogenetic tree. We implemented the CAT-BP model in a Bayesian Markov Chain Monte Carlo framework and compared its predictions with those of 3 simpler models, BP, CAT, and the site- and time-homogeneous general time-reversible (GTR) model, on a concatenation of 4 mitochondrial proteins of 20 arthropod species. In contrast to GTR, BP, and CAT, which all display a phylogenetic reconstruction artifact positioning the bees Apis mellifera and Melipona bicolor among chelicerates, the CAT-BP model is able to recover the monophyly of insects. Using posterior predictive tests, we further show that the CAT-BP combination yields better anticipations of site- and taxon-specific amino acid frequencies and that it better accounts for the homoplasies that are responsible for the artifact. Altogether, our results show that the joint modeling of heterogeneities across sites and along time results in a synergistic improvement of the phylogenetic inference, indicating that it is essential to disentangle the combined effects of both sources of heterogeneity, in order to overcome systematic errors in protein phylogenetic analyses.
Type de document :
Article dans une revue
Molecular Biology and Evolution, Oxford University Press (OUP), 2008, 25, pp.842-858
Liste complète des métadonnées

https://hal-lirmm.ccsd.cnrs.fr/lirmm-00324422
Contributeur : Nicolas Lartillot <>
Soumis le : jeudi 25 septembre 2008 - 01:29:44
Dernière modification le : jeudi 11 janvier 2018 - 06:26:12

Identifiants

  • HAL Id : lirmm-00324422, version 1

Collections

Citation

Samuel Blanquart, Nicolas Lartillot. A Site- and Time-Heterogeneous Model of Amino Acid Replacement. Molecular Biology and Evolution, Oxford University Press (OUP), 2008, 25, pp.842-858. 〈lirmm-00324422〉

Partager

Métriques

Consultations de la notice

146