Combining SAGE Tags to Predict Genomic Transcribed Regions

Abstract : Analysis of several million expressed gene signatures (tags) revealed an increasing number of different sequences, largely exceeding that of annotated genes in mammalian genomes. Serial Analysis of Gene Expression (SAGE) can reveal new RNAs transcribed from previously unrecognized genomic regions. However, conventional SAGE tags are too short to identify unambiguously unique sites in large genomes. Here, we design a novel strategy with tags anchored on two different restrictions sites of cDNAs. New transcripts are then tentatively defined by the two SAGE tags in tandem and by the spanning sequence read on the genome between these tagged sites. Having developed a new algorithm to locate these tag-delimited genomic sequences, we first validated its capacity to recognize known genes and its ability to reveal new transcripts with two SAGE libraries built in parallel from a single RNA sample. Our algorithm proves fast enough to experiment this strategy at a large scale. We then collected and processed the complete sets of human SAGE tags to predict yet unknown transcripts. A cross-validation with tiling arrays data shows that 47%of these tag-delimited genomic sequences overlap transcriptional active regions. Our method provides a new and complementary approach for complex transcriptome annotation.
Type de document :
Communication dans un congrès
Jacques van Helden, Yves Moreau. JOBIM'08 : Journées Ouvertes en Biologie, Informatique et Mathématiques, Jun 2008, Lille, France, pp.141-146, 2008, 〈http://www2.lifl.fr/jobim2008/〉
Liste complète des métadonnées

Littérature citée [13 références]  Voir  Masquer  Télécharger

https://hal-lirmm.ccsd.cnrs.fr/lirmm-00343895
Contributeur : Eric Rivals <>
Soumis le : mercredi 3 décembre 2008 - 10:01:11
Dernière modification le : mercredi 13 juin 2018 - 18:36:02
Document(s) archivé(s) le : jeudi 11 octobre 2012 - 12:30:09

Fichier

Rivals-tandem-sage-jobim-08.pd...
Fichiers éditeurs autorisés sur une archive ouverte

Identifiants

  • HAL Id : lirmm-00343895, version 1

Collections

Citation

Eric Rivals, Anthony Boureux, Mireille Lejeune, Florence Ottones, Oscar Pecharromàn Pérez, et al.. Combining SAGE Tags to Predict Genomic Transcribed Regions. Jacques van Helden, Yves Moreau. JOBIM'08 : Journées Ouvertes en Biologie, Informatique et Mathématiques, Jun 2008, Lille, France, pp.141-146, 2008, 〈http://www2.lifl.fr/jobim2008/〉. 〈lirmm-00343895〉

Partager

Métriques

Consultations de la notice

300

Téléchargements de fichiers

221