Skip to Main content Skip to Navigation
Book sections

CAT : Un Modèle Phylogénétique Bayésien permettant de prendre en compte l'Hétérogénéité des Processus de Substitution entre Sites dans les Alignements Protéiques

Nicolas Lartillot 1 Hervé Philippe 2
1 MAB - Méthodes et Algorithmes pour la Bioinformatique
LIRMM - Laboratoire d'Informatique de Robotique et de Microélectronique de Montpellier
Résumé : We propose a Bayesian mixture model, accounting for across-site heterogeneities of the substi- tutional processes in protein sequences. Our model, CAT, is based on the formalism of the Dirichlet processes, in which the total number of classes of the underlying mixture is not specified a priori, but rather, is considered an unknown of the problem, and is directly inferred from the available data. In this paper, we describe the model, and show its connections with the Bayesian non-parametric approach for modeling hetero- geneity. We apply it to a series of alignments of real proteins, and uncover a significant level of heterogene- ity across sites. Finally, by the evaluation of the Bayes factor, we show that the CAT model yields a signifi- cant improvement of the statistical fit over the standard models, based on one single substitution process describing all the sites of the alignment.
Document type :
Book sections
Complete list of metadatas

Cited literature [22 references]  Display  Hide  Download

https://hal-lirmm.ccsd.cnrs.fr/lirmm-00108548
Contributor : Christine Carvalho de Matos <>
Submitted on : Monday, October 23, 2006 - 7:43:10 AM
Last modification on : Wednesday, April 17, 2019 - 12:21:21 PM
Long-term archiving on: : Tuesday, April 6, 2010 - 8:27:47 PM

File

Identifiers

  • HAL Id : lirmm-00108548, version 1

Collections

Citation

Nicolas Lartillot, Hervé Philippe. CAT : Un Modèle Phylogénétique Bayésien permettant de prendre en compte l'Hétérogénéité des Processus de Substitution entre Sites dans les Alignements Protéiques. Biosystema 22, 22, pp.97-104, 2004, Avenir et pertinence des méthodes d’analyse en phylogénie moléculaire, 2-9068922-22-X. ⟨lirmm-00108548⟩

Share

Metrics

Record views

179

Files downloads

336