Correction of long sequencing reads: a novel approach - LIRMM - Laboratoire d’Informatique, de Robotique et de Microélectronique de Montpellier
Conference paper. Year: 2015

Correction of long sequencing reads: a novel approach

Abstract

High-throughput DNA and RNA sequencing has become a routine experiment in molecular biology and the life sciences in general. It is increasingly used in hospitals as a key procedure of personalized medicine. Compared to the second generation, third-generation sequencing technologies produce longer reads with comparatively lower throughput and a higher error rate. These errors include substitutions and indels, and they hinder or at least complicate downstream analyses such as mapping or de novo assembly. However, these long-read data are often used in conjunction with second-generation short reads. I will present a hybrid strategy, which we introduced last year, for correcting the long reads using the short reads. Unlike existing error correction tools, ours, called LoRDEC, avoids aligning short reads on long reads, which is computationally very intensive. Instead, it takes advantage of a succinct graph to represent the short reads, and compares the long reads to paths in the graph. Experiments show that LoRDEC outperforms existing methods in running time and memory while achieving comparable correction performance. In conclusion, I will comment on the impact of read error correction. LoRDEC is available at http://atgc.lirmm.fr/lordec and is joint work with Leena Salmela.
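To make the graph-based idea concrete, here is a minimal Python sketch of correcting a long read against paths in a k-mer graph built from short reads. It is not LoRDEC's implementation: the function names (`solid_kmers`, `best_path`, `correct_read`), the parameters `min_count` and `slack`, and the use of a plain Python set in place of a succinct graph are all assumptions made for illustration. The sketch treats long-read k-mers that also occur in the short reads as trusted anchors, and replaces each region between two anchors with the graph path whose spelled sequence is closest in edit distance.

```python
from collections import Counter

def solid_kmers(short_reads, k, min_count=1):
    """K-mers seen at least min_count times in the short reads (assumed threshold)."""
    counts = Counter(r[i:i + k] for r in short_reads for i in range(len(r) - k + 1))
    return {kmer for kmer, c in counts.items() if c >= min_count}

def successors(kmer, solid):
    """Outgoing edges of a node in the implicit de Bruijn graph over the solid set."""
    return [kmer[1:] + b for b in "ACGT" if kmer[1:] + b in solid]

def edit_distance(a, b):
    """Standard Levenshtein distance, computed row by row."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        cur = [i]
        for j, cb in enumerate(b, 1):
            cur.append(min(prev[j] + 1, cur[j - 1] + 1, prev[j - 1] + (ca != cb)))
        prev = cur
    return prev[-1]

def best_path(src, dst, target, solid, max_len):
    """Bounded depth-first search for the src-to-dst path closest to target."""
    best, best_d = None, None
    stack = [(src, src)]  # (current node, sequence spelled so far)
    while stack:
        node, seq = stack.pop()
        if node == dst and len(seq) > len(src):
            d = edit_distance(seq, target)
            if best_d is None or d < best_d:
                best, best_d = seq, d
        if len(seq) < max_len:
            for nxt in successors(node, solid):
                stack.append((nxt, seq + nxt[-1]))
    return best

def correct_read(long_read, solid, k, slack=2):
    """Replace regions between consecutive solid k-mers with the best graph path."""
    anchors = [i for i in range(len(long_read) - k + 1) if long_read[i:i + k] in solid]
    if not anchors:
        return long_read  # nothing to anchor on; leave the read unchanged
    out = long_read[:anchors[0] + k]
    for a, b in zip(anchors, anchors[1:]):
        if b == a + 1:
            out += long_read[b + k - 1]  # adjacent solid k-mers: keep the base
            continue
        target = long_read[a:b + k]
        path = best_path(long_read[a:a + k], long_read[b:b + k],
                         target, solid, len(target) + slack)
        out += (path if path is not None else target)[k:]
    return out + long_read[anchors[-1] + k:]

# Toy data: three short reads covering the sequence ACGGTCATTGCA.
short_reads = ["ACGGTCAT", "GTCATTGC", "CATTGCA"]
solid = solid_kmers(short_reads, k=4)
corrected = correct_read("ACGGTGATTGCA", solid, k=4)  # long read with one substitution
```

The exhaustive search is exponential in the worst case and only suits toy inputs; the `slack` bound on path length is what keeps it finite when the graph contains cycles. Regions before the first and after the last anchor are left uncorrected in this sketch.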
No file deposited

Dates and versions

lirmm-01185688, version 1 (21-08-2015)

Identifiers

  • HAL Id: lirmm-01185688, version 1

Cite

Eric Rivals. Correction of long sequencing reads: a novel approach. BIG N2N: Bioinformatics Institute Ghent "From Nucleotides to Networks", Ghent University, Bioinformatics Institute Ghent, May 2015, Ghent, Belgium. ⟨lirmm-01185688⟩