Aligning the unalignable: bacteriophage whole genome alignments - LIRMM - Laboratoire d’Informatique, de Robotique et de Microélectronique de Montpellier
Journal article, BMC Bioinformatics. Year: 2016

Aligning the unalignable: bacteriophage whole genome alignments

Abstract

Background: In recent years, many studies have focused on the description and comparison of large sets of related bacteriophage genomes. Because of the peculiar mosaic structure of these genomes, few informative approaches for comparing whole genomes exist: dot plot diagrams give a mostly qualitative assessment of the similarity or dissimilarity between two or more genomes, and clustering techniques are used to classify genomes. Multiple alignments are conspicuously absent from this scene. Indeed, whole genome aligners interpret lack of similarity between sequences as an indication of rearrangements, insertions, or losses. This behavior makes them ill-suited to aligning bacteriophage genomes, where even closely related strains can accomplish the same biological function with highly dissimilar sequences.

Results: In this paper, we propose a multiple alignment strategy that exploits the functional collinearity shared by related strains of bacteriophages, and uses partial orders to capture the mosaicism of sets of genomes. As with classical alignments, the computed alignments can be used to predict that genes have the same biological function, even in the absence of detectable similarity. The Alpha aligner implements these ideas in visual interactive displays, and is used to compute several examples of alignments of Staphylococcus aureus and Mycobacterium bacteriophages, involving up to 29 genomes. Using these datasets, we prove that Alpha alignments are at least as good as those computed by standard aligners. Comparison with the progressiveMauve aligner (which implements a partial order strategy, but whose alignments are linearized) shows a greatly improved interactive graphic display, while avoiding misalignments.

Conclusions: Multiple alignments of whole bacteriophage genomes work, and will become an important conceptual and visual tool in comparative genomics of sets of related strains.
A Python implementation of Alpha, along with installation instructions for Ubuntu and OS X, is available on Bitbucket (https://bitbucket.org/thekswenson/alpha).
Main file
s12859-015-0869-5.pdf (2.5 MB)

Dates and versions

lirmm-01275670, version 1 (17-02-2016)

Cite

Sèverine Bérard, Annie Chateau, Nicolas Pompidor, Paul Guertin, Anne Bergeron, et al. Aligning the unalignable: bacteriophage whole genome alignments. BMC Bioinformatics, 2016, 17 (1), pp. 30-43. ⟨10.1186/s12859-015-0869-5⟩. ⟨lirmm-01275670⟩
