Identifiability of phylogenetic networks - LIRMM - Laboratoire d’Informatique, de Robotique et de Microélectronique de Montpellier Access content directly
Conference Papers Year : 2014

Identifiability of phylogenetic networks

Abstract

Phylogenies are almost invariably represented as trees. Although in many cases this is reasonable, in many others phylogenies should be represented as networks (more precisely directed acyclic graphs). This is due to a number of biological phenomena collectively known as reticulation events, whereby a species or a gene inherits genetic material from more than one parent organism. This may be caused by events such as hybrid speciation, introgression, horizontal gene transfer, or recombination. Phylogenetic network inference methods are in their infancy, but they are almost invariably based on the following idea: the goodness of a candidate network is evaluated on the basis of how well the trees it contains fit the data. This poses a problem: different networks may contain exactly the same set of trees, meaning that these networks will be considered "indistinguishable" by most network inference methods, no matter the input data. We propose a novel definition of what constitutes a "uniquely reconstructible" network: for each class of indistinguishable networks, we define a canonical form. Under mild assumptions, the canonical form is unique. Given data coming from any phylogenetic network, only its canonical equivalent can be uniquely reconstructed. This is a fundamental limitation that implies a drastic reduction of the solution space in phylogenetic network inference.
No file

Dates and versions

lirmm-01237418 , version 1 (03-12-2015)

Identifiers

  • HAL Id : lirmm-01237418 , version 1

Cite

Fabio Pardi, Celine Scornavacca. Identifiability of phylogenetic networks. MCEB: Mathematical and Computational Evolutionary Biology, Jun 2014, Hameau de l'Etoile, St Martin de Londres, France. ⟨lirmm-01237418⟩
211 View
0 Download

Share

Gmail Facebook X LinkedIn More