Computing Galled Networks from Real Data

Daniel Huson; Regula Rupp; Vincent Berry; Philippe Gambette; Christophe Paul

doi:10.1093/bioinformatics/btp217

Article Dans Une Revue Bioinformatics Année : 2009

Computing Galled Networks from Real Data

(1) , (1) , (2) , (3) , (3)

1
2
3

Daniel Huson

Fonction : Auteur

Institut Wilhelm Schickard

Regula Rupp

Fonction : Auteur

Institut Wilhelm Schickard

Vincent Berry

Fonction : Auteur
PersonId : 4886
IdHAL : vincent-berry
ORCID : 0000-0001-7271-4027
IdRef : 135401925

Méthodes et Algorithmes pour la Bioinformatique

Philippe Gambette

Fonction : Auteur correspondant
PersonId : 148
IdHAL : philippe-gambette
ORCID : 0000-0001-7062-0262
IdRef : 151101248

Connectez-vous pour contacter l'auteur

Algorithmes, Graphes et Combinatoire

Christophe Paul

Fonction : Auteur
PersonId : 4726
IdHAL : christophe-paul
ORCID : 0000-0001-6519-975X
IdRef : 151101345

Algorithmes, Graphes et Combinatoire

Résumé

Developing methods for computing phylogenetic networks from biological data is an important problem posed by molecular evolution and much work is currently being undertaken in this area. Although promising approaches exist, there are no tools available that biologists could easily and routinely use to compute rooted phylogenetic networks on real datasets containing tens or hundreds of taxa. Biologists are interested in clades, that is, groups of monophyletic taxa, and these are usually represented by clusters in a rooted phylogenetic tree. The problem of computing an optimal rooted phylogenetic network from a set of clusters, is hard, in general. Indeed, even the problem of just determining whether a given network contains a given cluster is hard. Hence, some researchers have focused on topologically restricted classes of networks, such as galled trees and level-k networks, that are more tractable, but have the practical draw-back that a given set of clusters will usually not possess such a representation. In this paper we argue that galled networks (a generalization of galled trees) provide a good trade-off between level of generality and tractability. Any set of clusters can be represented by some galled network and the question whether a cluster is contained in such a network is easy to solve. Although the computation of an optimal galled network involves solving instances of two different NPcomplete problems, in practice our algorithm solves this problem exactly on large datasets containing hundreds of taxa and many reticulations in seconds, as illustrated by a dataset containing 279 prokaryotes. We provide a fast, robust and easy-to-use implementation of this work in version 2.0 of our tree-handling software Dendroscope.

Domaines

Bio-informatique [q-bio.QM] Bio-Informatique, Biologie Systémique [q-bio.QM] Algorithme et structure de données [cs.DS] Complexité [cs.CC]

Fichier principal

2009HusonRuppBerryGambettePaul.pdf (616.47 Ko)

Origine	Fichiers éditeurs autorisés sur une archive ouverte

Philippe Gambette : Connectez-vous pour contacter le contributeur

https://hal-lirmm.ccsd.cnrs.fr/lirmm-00368545

Soumis le : samedi 20 juin 2009-15:23:08

Dernière modification le : vendredi 24 mai 2024-09:24:06

Archivage à long terme le : mercredi 22 septembre 2010-12:45:54

Dates et versions

lirmm-00368545 , version 1 (16-03-2009)

lirmm-00368545 , version 2 (20-06-2009)

Identifiants

HAL Id : lirmm-00368545 , version 2
DOI : 10.1093/bioinformatics/btp217

Citer

Daniel Huson, Regula Rupp, Vincent Berry, Philippe Gambette, Christophe Paul. Computing Galled Networks from Real Data. Bioinformatics, 2009, 25 (12), pp.i85-i93. ⟨10.1093/bioinformatics/btp217⟩. ⟨lirmm-00368545v2⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS MAB ALGCO LIRMM MIPS UNIV-MONTPELLIER

733 Consultations

708 Téléchargements

Computing Galled Networks from Real Data

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager