Read indexing - LIRMM - Laboratoire d’Informatique, de Robotique et de Microélectronique de Montpellier
Journal Articles EMBnet.journal Year : 2011

Read indexing

Abstract

The question of read indexing remains broadly unexplored. However, the increase in sequencing throughput calls for new algorithmic solutions to query large read collections efficiently. We propose a solution, named Gk arrays, to index large collections of reads, an algorithm to build the structure, and procedures to query it. Once constructed, the index structure is kept in main memory and is repeatedly accessed to answer various types of queries. We compare our data structure to other possible solutions to investigate its scalability and computational efficiency. Gk arrays are implemented in a general-purpose library, which may prove useful for assembly, for evaluating expression levels in RNA-seq, and for other high-throughput sequencing applications.
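To make the kind of query described above concrete, here is a minimal sketch of a read index built once in main memory and then queried repeatedly. It is not the Gk arrays structure: it uses a plain Python dictionary, and it assumes that queries are over fixed-length substrings (k-mers), which the abstract does not spell out. The function names `build_kmer_index`, `reads_containing`, and `occurrence_count` are hypothetical and do not come from the authors' library.

```python
from collections import defaultdict


def build_kmer_index(reads, k):
    """Map each k-mer to the (read_id, offset) pairs where it occurs.

    Illustrative only: a hash-based stand-in, not the Gk arrays layout.
    """
    index = defaultdict(list)
    for read_id, read in enumerate(reads):
        # Slide a window of length k over the read.
        for offset in range(len(read) - k + 1):
            index[read[offset:offset + k]].append((read_id, offset))
    return index


def reads_containing(index, kmer):
    """Return the sorted ids of reads that contain the k-mer at least once."""
    return sorted({read_id for read_id, _ in index.get(kmer, [])})


def occurrence_count(index, kmer):
    """Return the total number of occurrences of the k-mer across all reads."""
    return len(index.get(kmer, []))


# Toy collection of three short reads (made-up data).
reads = ["ACGTAC", "GTACGT", "TTTTTT"]
index = build_kmer_index(reads, k=3)

print(reads_containing(index, "GTA"))
print(occurrence_count(index, "TTT"))
```

A dictionary like this stores every k-mer occurrence explicitly, so its memory use grows quickly with the size of the collection. That cost is the sort of scalability concern that motivates comparing a dedicated structure against simpler alternatives.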
Main file
289-2729-1-PB.pdf (177.97 KB)
Origin : Publisher files allowed on an open archive

Dates and versions

lirmm-00757983 , version 1 (27-11-2012)

Cite

Nicolas Philippe, Mikael Salson, Thérèse Commes, Thierry Lecroq, Martine Léonard, et al. Read indexing. EMBnet.journal, 2011, 17 (Supplement B), pp.1. ⟨10.14806/ej.17.B.289⟩. ⟨lirmm-00757983⟩