Analysis of genomic data from high throughput sequencing: concepts and basic methods

Eric Rivals

Communication Dans Un Congrès Année : 2011

Analysis of genomic data from high throughput sequencing: concepts and basic methods

(1)

Eric Rivals

Fonction : Auteur correspondant
PersonId : 2002
IdHAL : eric-rivals
ORCID : 0000-0003-3791-3973
IdRef : 118021850

Connectez-vous pour contacter l'auteur

Méthodes et Algorithmes pour la Bioinformatique

Résumé

When a reference genome is available, analysis of Next Generation Sequencing (NGS) reads require to determine the plausible genomic position(s) for all reads. This computational step is termed mapping. This lecture will provide an overview of underlying concepts of the mapping question, algorithms, and pitfalls: alignment, approximate motif searching, filtration, and indexing data structures (like the Burrows Wheeler Transform). The impact of the read length, the background mapping probability, or sequencing errors on the results will be illustrated. This will lead us to present the most current mapping algorithms and their limitations; the results of a comparison will be presented. The cases of genomic and transcriptomic data will be considered in this regard, and finally the question of efficiency will be addressed. In a second time, we will give a short overview of concepts for assembly methods.

Mots clés

Next Generation Sequencing (NGS) algorithms stringology assembly data mining mapping

Domaines

Bio-informatique [q-bio.QM] Bio-Informatique, Biologie Systémique [q-bio.QM]

Eric Rivals : Connectez-vous pour contacter le contributeur

https://hal-lirmm.ccsd.cnrs.fr/lirmm-00832551

Soumis le : lundi 10 juin 2013-23:09:46

Dernière modification le : samedi 15 juillet 2023-04:09:53

Dates et versions

lirmm-00832551 , version 1 (10-06-2013)

Identifiants

HAL Id : lirmm-00832551 , version 1

Citer

Eric Rivals. Analysis of genomic data from high throughput sequencing: concepts and basic methods. Palaeogenomics Summer School, Oct 2011, Cargese, France. ⟨lirmm-00832551⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS MAB LIRMM MIPS UNIV-MONTPELLIER

100 Consultations

0 Téléchargements

Analysis of genomic data from high throughput sequencing: concepts and basic methods

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager