FLU, an amino acid substitution model for influenza proteins

Cuong Cao Dang 1, Quang Le Si 2, Olivier Gascuel 3,*, Vinh Sy Le 1
* Corresponding author
3 MAB - Méthodes et Algorithmes pour la Bioinformatique
LIRMM - Laboratoire d'Informatique de Robotique et de Microélectronique de Montpellier
Abstract:

Background: The amino acid substitution model is the core component of many protein analysis systems, such as sequence similarity search, sequence alignment, and phylogenetic inference. Although several general amino acid substitution models have been estimated from large and diverse protein databases, they remain inappropriate for analyzing specific species, e.g., viruses. Emerging epidemics of influenza viruses raise the need for comprehensive studies of these dangerous viruses. We propose an influenza-specific amino acid substitution model to enhance the understanding of the evolution of influenza viruses.

Results: A maximum likelihood approach was applied to estimate an amino acid substitution model (FLU) from ∼113,000 influenza protein sequences, consisting of ∼20 million residues. FLU outperforms 14 widely used models in constructing maximum likelihood phylogenetic trees for the majority of influenza protein alignments. On average, FLU gains ∼42 log-likelihood points with an alignment of 300 sites. Moreover, the topologies of trees constructed using FLU and other models are frequently different. FLU therefore affects both the likelihood and the inferred tree topologies. It was implemented in PhyML and can be downloaded from ftp://ftp.sanger.ac.uk/pub/1000genomes/lsq/FLU; it is also included in the PhyML 3.0 server at http://www.atgc-montpellier.fr/phyml/.

Conclusions: FLU should be useful for any influenza protein analysis system that requires an accurate description of amino acid substitutions.
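To make concrete what an amino acid substitution model provides to a likelihood-based analysis, here is a minimal sketch. It assumes the usual time-reversible parameterization used by general models of this kind (symmetric exchangeabilities combined with equilibrium frequencies), which the abstract does not spell out. The 3-state alphabet, all numeric values, and the function names `build_rate_matrix` and `transition_probs` are illustrative assumptions and are not FLU's actual parameters or PhyML's code.

```python
def build_rate_matrix(exch, freqs):
    """Build a normalized time-reversible rate matrix Q.

    exch  : symmetric matrix of exchangeabilities (diagonal is ignored)
    freqs : equilibrium frequencies, summing to 1
    """
    n = len(freqs)
    q = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            if i != j:
                q[i][j] = exch[i][j] * freqs[j]
        # Diagonal makes each row sum to zero (q[i][i] is still 0.0 here).
        q[i][i] = -sum(q[i])
    # Scale so that one time unit equals one expected substitution per site.
    rate = -sum(freqs[i] * q[i][i] for i in range(n))
    return [[x / rate for x in row] for row in q]


def mat_mul(a, b):
    """Plain matrix product for small square matrices."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def transition_probs(q, t, terms=30):
    """Approximate P(t) = exp(Q t) with a truncated Taylor series.

    Adequate for a small toy matrix and short branch lengths; real
    software uses an eigendecomposition instead.
    """
    n = len(q)
    qt = [[x * t for x in row] for row in q]
    result = [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]
    term = [row[:] for row in result]
    for k in range(1, terms):
        # term becomes (Q t)^k / k!
        term = [[x / k for x in row] for row in mat_mul(term, qt)]
        for i in range(n):
            for j in range(n):
                result[i][j] += term[i][j]
    return result


# Toy 3-state example (hypothetical values, not estimated from any data).
exch = [[0.0, 1.0, 2.0],
        [1.0, 0.0, 0.5],
        [2.0, 0.5, 0.0]]
freqs = [0.5, 0.3, 0.2]

Q = build_rate_matrix(exch, freqs)
P = transition_probs(Q, 0.5)  # substitution probabilities along a branch of length 0.5
```

A real amino acid model has the same shape with 20 states: the exchangeabilities and frequencies are the quantities estimated from the sequence data, and P(t) is what a phylogenetic program evaluates along each branch when computing the likelihood of a tree.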
Document type:
Journal article
BMC Evolutionary Biology, BioMed Central, 2010, 10, pp.99. 〈www.lirmm.fr/mab〉. 〈10.1186/1471-2148-10-99〉

Cited literature: 30 references

https://hal-lirmm.ccsd.cnrs.fr/lirmm-00511801
Contributor: Olivier Gascuel
Submitted on: Thursday, 26 August 2010 - 11:39:56
Last modified on: Thursday, 24 May 2018 - 15:59:22
Document(s) archived on: Monday, 29 November 2010 - 11:57:05

File

FLU_BMCversion.pdf
Publisher files allowed on an open archive

Citation

Cuong Cao Dang, Quang Le Si, Olivier Gascuel, Vinh Sy Le. FLU, an amino acid substitution model for influenza proteins. BMC Evolutionary Biology, BioMed Central, 2010, 10, pp.99. 〈www.lirmm.fr/mab〉. 〈10.1186/1471-2148-10-99〉. 〈lirmm-00511801〉

Metrics

Record views: 337
File downloads: 125