Read indexing

The question of read indexing remains broadly unexplored. However, the increase in sequence throughput urges for new algorithmic solutions to query large read collections efficiently. We pro- pose a solution, named Gk arrays, to index large collections of reads, an algorithm to build the structure, and procedures to query it. Once constructed, the index structure is kept in main memory and is repeatedly accessed to answer various types of queries. We compare our data structure to other possible solutions to investigate its scalability and computational efficiency. Gk arrays are im- plemented in a general purpose library, which may prove useful for assembly purposes, for evaluating the expression level in RNA-seq, and others high throughput sequencing applications.

Mots clés

Domaines

Fichier principal

289-2729-1-PB.pdf (177.97 Ko)

Origine	Fichiers éditeurs autorisés sur une archive ouverte
Licence	Autorisation HAL

Connectez-vous pour contacter le contributeur

https://hal-lirmm.ccsd.cnrs.fr/lirmm-00757983

Soumis le : mardi 27 novembre 2012-18:14:57

Dernière modification le : mercredi 13 août 2025-03:08:04

Archivage à long terme le : samedi 17 décembre 2016-16:12:35

Dates et versions

lirmm-00757983 , version 1 (27-11-2012)

Licence

Autorisation HAL

Identifiants

HAL Id : lirmm-00757983 , version 1
DOI : 10.14806/ej.17.B.289

Citer

Nicolas Philippe, Mikael Salson, Thérèse Commes, Thierry Lecroq, Martine Léonard, et al.. Read indexing. EMBnet.journal, 2011, 17 (Supplement B), pp.1. ⟨10.14806/ej.17.B.289⟩. ⟨lirmm-00757983⟩

Read indexing

Résumé

Mots clés

Domaines

Dates et versions

Licence

Identifiants

Citer

Exporter

Collections

Altmetric

Partager