A Novel Approach for Comparative Genomics & Annotation Transfer

Alban Mancheron; Raluca Uricaru; Eric Rivals

Conference poster, Year: 2010

Alban Mancheron

Role: Corresponding author
PersonId : 6019
IdHAL : alban-mancheron
ORCID : 0000-0001-9249-7592
IdRef : 111581362

Méthodes et Algorithmes pour la Bioinformatique

Raluca Uricaru

Role: Author
PersonId : 1202
IdHAL : ruricaru
ORCID : 0000-0002-5730-6428
IdRef : 151193681

Méthodes et Algorithmes pour la Bioinformatique

Eric Rivals

Role: Corresponding author
PersonId : 2002
IdHAL : eric-rivals
ORCID : 0000-0003-3791-3973
IdRef : 118021850

Méthodes et Algorithmes pour la Bioinformatique

Abstract

With the rapid development of sequencing techniques, situations in which a newly sequenced genome must be annotated using available genomes from closely related species should become more common. However, because of the cost of genome finishing, we may have to handle incomplete or not fully assembled genomes. Undoubtedly, the need for comparative annotation will increase, but the genomics community still lacks computational solutions that are both efficient and sensitive under various conditions. Present approaches are mainly based on sequence similarity detected at the gene or protein level, and these similarities are mostly analysed independently of one another, despite the dependency implied by the genome. Hence, we propose a novel approach to genome comparison and use it to develop a system that transfers annotations between the compared genomes. Besides the sequence similarity of features, it accounts for the synteny it detects across multiple genomes. The approach is simple because it avoids solving the complex questions that make other approaches computationally hard. The underlying idea is to partition a focus genome according to its pairwise similarities with the other compared genomes. The question is formulated as a search for the intervals that are shared across all genomes under consideration and are maximal in length (i.e., not extendible). If a genomic region is covered by at least one interval, it is conserved across all genomes, and the number of such intervals tells how many possibilities exist for aligning it with different regions of the other genomes. Hence, our algorithm partitions the genome into regions following two criteria: (1) being shared or unshared across all genomes, and (2) offering a unique or several alignment possibilities. The annotation transfer procedure crosses the focus genome's annotations with these regions and automatically derives the possible alignments for each feature.
All features falling entirely in a region offering only one alignment possibility are declared potentially transferable, and the user may interactively select among them according to various criteria: alignment percent identity, feature class, etc. We implemented these procedures in an efficient and flexible tool, named QOD, equipped with a user-friendly graphical interface. Graphical and textual representations of the results make it possible both to grasp the overall genome similarity at a glance and to browse the conserved and unshared features in various ways. This enables the investigation of genome-specific genes, rearrangements, and copy number variations, for instance. Because it does not require the genome sequence to be completely assembled, our approach makes it possible to compare and pre-annotate unfinished genomes, as well as assemblies of Next Generation Sequencing data.
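
The partition and transfer steps described in the abstract can be illustrated with a minimal sketch. It is not QOD's implementation: it assumes the maximal intervals shared across all genomes have already been computed and are given as half-open `(start, end)` coordinates on the focus genome. The names `partition_genome`, `classify` and `transferable_features`, the tuple layouts, and the choice to cut the genome at every interval boundary are all hypothetical and serve only as illustration.

```python
from typing import List, Tuple

Interval = Tuple[int, int]       # assumed half-open [start, end) on the focus genome
Region = Tuple[int, int, int]    # (start, end, number of covering shared intervals)
Feature = Tuple[str, int, int]   # (name, start, end), same coordinate convention


def partition_genome(genome_length: int, shared: List[Interval]) -> List[Region]:
    """Cut the focus genome at every interval boundary and count how many
    shared intervals cover each resulting region (assumption: intervals lie
    within [0, genome_length])."""
    cuts = {0, genome_length}
    for start, end in shared:
        cuts.update((start, end))
    bounds = sorted(cuts)
    regions = []
    for lo, hi in zip(bounds, bounds[1:]):
        # Quadratic counting keeps the sketch short; a sweep line would scale better.
        count = sum(1 for s, e in shared if s <= lo and hi <= e)
        regions.append((lo, hi, count))
    return regions


def classify(count: int) -> str:
    """Apply the two criteria: shared or unshared, unique or several alignments."""
    if count == 0:
        return "unshared"
    return "shared-unique" if count == 1 else "shared-multiple"


def transferable_features(regions: List[Region], features: List[Feature]) -> List[str]:
    """Names of features lying entirely inside one region that offers
    exactly one alignment possibility."""
    return [
        name
        for name, f_start, f_end in features
        if any(c == 1 and lo <= f_start and f_end <= hi for lo, hi, c in regions)
    ]


# Toy data, invented for illustration only.
shared_intervals = [(10, 40), (30, 60), (70, 90)]
regions = partition_genome(100, shared_intervals)
features = [("geneA", 12, 28), ("geneB", 25, 35), ("geneC", 72, 88), ("geneD", 62, 68)]

for lo, hi, count in regions:
    print(f"[{lo}, {hi}) -> {classify(count)}")
print(transferable_features(regions, features))
```

In this toy example, `geneB` straddles a boundary between a uniquely and a multiply aligned region, and `geneD` falls in an unshared region, so only `geneA` and `geneC` are reported as potentially transferable. A real tool would additionally let the user filter these candidates by criteria such as percent identity or feature class.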

Keywords

Bioinformatics; comparative genomics; computational tools; evolution; genomic structure

Domains

Data Structures and Algorithms [cs.DS]; Bioinformatics [q-bio.QM]; Bioinformatics, Systems Biology [q-bio.QM]

Main file

HGM2010_PosterPresentation_certification_EricR.pdf (1.02 MB)

qod-poster.pdf (5.39 MB)

Origin: Files produced by the author(s)

Contributor: Alban Mancheron

https://hal-lirmm.ccsd.cnrs.fr/lirmm-00491326

Submitted on: Friday, 11 June 2010, 15:30:33

Last modified on: Monday, 16 December 2024, 15:26:28

Long-term archiving on: Thursday, 1 December 2016, 04:25:33

Dates and versions

lirmm-00491326 , version 1 (11-06-2010)

lirmm-00491326 , version 2 (11-06-2010)

Identifiers

HAL Id : lirmm-00491326 , version 2

Cite

Alban Mancheron, Raluca Uricaru, Eric Rivals. A Novel Approach for Comparative Genomics & Annotation Transfer. 2010. ⟨lirmm-00491326v2⟩

Collections

CNRS MAB LIRMM MIPS UNIV-MONTPELLIER
