Aligning the unalignable: bacteriophage whole genome alignments

Abstract : Background In recent years, many studies focused on the description and comparison of large sets of related bacteriophage genomes. Due to the peculiar mosaic structure of these genomes, few informative approaches for comparing whole genomes exist: dot plots diagrams give a mostly qualitative assessment of the similarity/dissimilarity between two or more genomes, and clustering techniques are used to classify genomes. Multiple alignments are conspicuously absent from this scene. Indeed, whole genome aligners interpret lack of similarity between sequences as an indication of rearrangements, insertions, or losses. This behavior makes them ill-prepared to align bacteriophage genomes, where even closely related strains can accomplish the same biological function with highly dissimilar sequences. Results In this paper, we propose a multiple alignment strategy that exploits functional collinearity shared by related strains of bacteriophages, and uses partial orders to capture mosaicism of sets of genomes. As classical alignments do, the computed alignments can be used to predict that genes have the same biological function, even in the absence of detectable similarity. The Alpha aligner implements these ideas in visual interactive displays, and is used to compute several examples of alignments of Staphylococcus aureus and Mycobacterium bacteriophages, involving up to 29 genomes. Using these datasets, we prove that Alpha alignments are at least as good as those computed by standard aligners. Comparison with the progressiveMauve aligner – which implements a partial order strategy, but whose alignments are linearized – shows a greatly improved interactive graphic display, while avoiding misalignments. Conclusions Multiple alignments of whole bacteriophage genomes work, and will become an important conceptual and visual tool in comparative genomics of sets of related strains. A python implementation of Alpha, along with installation instructions for Ubuntu and OSX, is available on bitbucket (https://​bitbucket.​org/​thekswenson/​alpha).
Type de document :
Article dans une revue
BMC Bioinformatics, BioMed Central, 2016, 17 (1), pp.1-13. 〈10.1186/s12859-015-0869-5〉
Liste complète des métadonnées

https://hal-lirmm.ccsd.cnrs.fr/lirmm-01275670
Contributeur : Annie Chateau <>
Soumis le : mercredi 17 février 2016 - 21:32:21
Dernière modification le : jeudi 25 janvier 2018 - 17:22:02

Lien texte intégral

Identifiants

Collections

Citation

Sèverine Bérard, Annie Château, Nicolas Pompidor, Paul Guertin, Anne Bergeron, et al.. Aligning the unalignable: bacteriophage whole genome alignments. BMC Bioinformatics, BioMed Central, 2016, 17 (1), pp.1-13. 〈10.1186/s12859-015-0869-5〉. 〈lirmm-01275670〉

Partager

Métriques

Consultations de la notice

97