LoRDEC: accurate and efficient long read error correction

Leena Salmela 1 Eric Rivals 2, 3
2 MAB - Méthodes et Algorithmes pour la Bioinformatique
LIRMM - Laboratoire d'Informatique de Robotique et de Microélectronique de Montpellier
Abstract : Motivation: PacBio single molecule real-time sequencing is a third-generation sequencing technique producing long reads, with com-paratively lower throughput and higher error rate. Errors include numerous indels and complicate downstream analysis like mapping or de novo assembly. A hybrid strategy that takes advantage of the high accuracy of second-generation short reads has been proposed for correcting long reads. Mapping of short reads on long reads pro-vides sufficient coverage to eliminate up to 99% of errors, however, at the expense of prohibitive running times and considerable amounts of disk and memory space. Results: We present LoRDEC, a hybrid error correction method that builds a succinct de Bruijn graph representing the short reads, and seeks a corrective sequence for each erroneous region in the long reads by traversing chosen paths in the graph. In comparison, LoRDEC is at least six times faster and requires at least 93% less memory or disk space than available tools, while achieving comparable accuracy.
Type de document :
Article dans une revue
Bioinformatics, Oxford University Press (OUP), 2014, 30 (24), pp.3506-3514. 〈http://bioinformatics.oxfordjournals.org/content/30/24/3506〉. 〈10.1093/bioinformatics/btu538〉
Liste complète des métadonnées

Littérature citée [26 références]  Voir  Masquer  Télécharger

https://hal-lirmm.ccsd.cnrs.fr/lirmm-01100451
Contributeur : Eric Rivals <>
Soumis le : mardi 6 janvier 2015 - 14:26:48
Dernière modification le : jeudi 11 janvier 2018 - 06:26:13
Document(s) archivé(s) le : mardi 7 avril 2015 - 11:20:59

Fichier

Bioinformatics-2014-Salmela-35...
Fichiers produits par l'(les) auteur(s)

Identifiants

Collections

Citation

Leena Salmela, Eric Rivals. LoRDEC: accurate and efficient long read error correction. Bioinformatics, Oxford University Press (OUP), 2014, 30 (24), pp.3506-3514. 〈http://bioinformatics.oxfordjournals.org/content/30/24/3506〉. 〈10.1093/bioinformatics/btu538〉. 〈lirmm-01100451〉

Partager

Métriques

Consultations de la notice

193

Téléchargements de fichiers

229