Phylogenetic networks: what can we reconstruct?

Fabio Pardi

Communication Dans Un Congrès Année : 2014

Phylogenetic networks: what can we reconstruct?

(1, 2)

1
2

Fabio Pardi

Fonction : Auteur
PersonId : 742812
IdHAL : fabio-pardi
ORCID : 0000-0001-8084-1464
IdRef : 253167272

Méthodes et Algorithmes pour la Bioinformatique

Institut de Biologie Computationnelle

Résumé

Phylogenies are used to describe the history of evolutionarily related biological entities (e.g. genes, individuals, species) and are central in many biological applications, including functional genomics, epidemiology and biodiversity assessment. Many methods for reconstructing and studying phylogenies have been proposed, almost all of which use trees to represent them. Although in many cases this is reasonable, in many others phylogenies should be represented as networks (more precisely directed acyclic graphs). This is due to a number of biological phenomena collectively known as reticulation events, whereby a species or a gene inherits genetic material from more than one parent organism. This may be caused by events such as hybridization (e.g. in plants), horizontal gene transfer (e.g. in bacteria) or recombination (e.g. in viruses or in genomes of sexually reproducing species). Network inference methods are in their infancy, but they are almost invariably based on the following idea: the goodness of a candidate network is evaluated on the basis of how well the subtrees it contains fit the data. This poses a problem: different networks may contain exactly the same set of subtrees (up to isomorphism), meaning that these networks will be considered "indistinguishable" by most network inference methods, no matter the input data. We propose a novel definition of what constitutes a "uniquely reconstructible" network: for each class of indistinguishable networks, we define a canonical form. Under mild assumptions, the canonical form is unique. Given data coming from any phylogenetic network, only its canonical equivalent can be uniquely reconstructed. This is a fundamental limitation that implies a drastic reduction of the solution space in phylogenetic network inference.

Domaines

Bio-informatique [q-bio.QM]

Fabio Pardi : Connectez-vous pour contacter le contributeur

https://hal-lirmm.ccsd.cnrs.fr/lirmm-01237157

Soumis le : mercredi 2 décembre 2015-18:21:10

Dernière modification le : vendredi 24 mars 2023-14:53:01

Dates et versions

lirmm-01237157 , version 1 (02-12-2015)

Identifiants

HAL Id : lirmm-01237157 , version 1

Citer

Fabio Pardi. Phylogenetic networks: what can we reconstruct?. FILOFOCS: French-Israeli Workshop on Foundations of Computer Science, May 2014, Paris, France. ⟨lirmm-01237157⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS INRIA INRA MAB LIRMM MIPS UNIV-MONTPELLIER INRAE

142 Consultations

0 Téléchargements

Phylogenetic networks: what can we reconstruct?

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager