Read indexing

Abstract: The question of read indexing remains broadly unexplored. However, the increase in sequencing throughput calls for new algorithmic solutions to query large read collections efficiently. We propose a solution, named Gk arrays, to index large collections of reads, an algorithm to build the structure, and procedures to query it. Once constructed, the index structure is kept in main memory and is repeatedly accessed to answer various types of queries. We compare our data structure to other possible solutions to investigate its scalability and computational efficiency. Gk arrays are implemented in a general-purpose library, which may prove useful for assembly, for evaluating expression levels in RNA-seq, and for other high-throughput sequencing applications.
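
To make the build-once, query-repeatedly pattern in the abstract concrete, here is a minimal sketch. It is not the Gk arrays structure: it swaps in a plain hash table, and it assumes that queries concern fixed-length substrings (k-mers) of the reads, which the abstract does not state. The function names `build_kmer_index`, `reads_containing`, and `occurrence_count` are hypothetical and are not part of the authors' library.

```python
from collections import defaultdict


def build_kmer_index(reads, k):
    """Map every k-mer to the list of (read_id, offset) pairs where it occurs.

    Naive hash-table stand-in, NOT the Gk arrays construction algorithm.
    """
    index = defaultdict(list)
    for read_id, read in enumerate(reads):
        # Slide a window of length k over the read.
        for offset in range(len(read) - k + 1):
            index[read[offset:offset + k]].append((read_id, offset))
    return index


def reads_containing(index, kmer):
    """Return the sorted ids of the reads in which the k-mer occurs."""
    return sorted({read_id for read_id, _ in index.get(kmer, [])})


def occurrence_count(index, kmer):
    """Return the total number of occurrences of the k-mer in the collection."""
    return len(index.get(kmer, []))


# Toy collection: the index is built once, then kept in memory and queried.
reads = ["ACGTAC", "GTACGT", "TTTTTT"]
index = build_kmer_index(reads, 3)
```

A hash table of this kind stores every k-mer explicitly, so its memory use grows quickly with the collection size; that cost is the sort of scalability concern a dedicated index structure is meant to address.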

https://hal-lirmm.ccsd.cnrs.fr/lirmm-00757983
Contributor: Eric Rivals
Submitted on: Tuesday, 27 November 2012 - 18:14:57
Last modified on: Friday, 12 October 2018 - 22:12:05
Document(s) archived on: Saturday, 17 December 2016 - 16:12:35

File

289-2729-1-PB.pdf
Publisher files allowed on an open archive

Identifiers

  • HAL Id: lirmm-00757983, version 1

Citation

Nicolas Philippe, Mikael Salson, Thierry Lecroq, Martine Léonard, Thérèse Commes, et al. Read indexing. EMBnet.journal, EMBnet, 2011, 17 (Supplement B), p. 1. 〈http://journal.embnet.org/index.php/embnetjournal/article/view/289〉. 〈lirmm-00757983〉

Metrics

Record views

825

File downloads

351