Machine learning meets genome assembly

Kleber Padovani de Souza; João Carlos Setubal; André Carlos Ponce de Leon F. de Carvalho; Guilherme Oliveira; Annie Chateau; Ronnie Alves

doi:10.1093/bib/bby072

Article Dans Une Revue Briefings in Bioinformatics Année : 2019

Machine learning meets genome assembly

(1) , (2) , , , (3) , (4, 3)

1
2
3
4

Kleber Padovani de Souza

Fonction : Auteur

Federal University of Para - Universidade Federal do Pará - UFPA [Belém, Brazil]

João Carlos Setubal

Fonction : Auteur

Universidade de São Paulo = University of São Paulo

André Carlos Ponce de Leon F. de Carvalho

Fonction : Auteur

Guilherme Oliveira

Fonction : Auteur

Annie Chateau

Fonction : Auteur
PersonId : 173624
IdHAL : annie-chateau
ORCID : 0000-0003-4760-8171
IdRef : 227798856

Méthodes et Algorithmes pour la Bioinformatique

Ronnie Alves

Fonction : Auteur
PersonId : 992094
ORCID : 0000-0003-4139-0562

Institut de Biologie Computationnelle

Méthodes et Algorithmes pour la Bioinformatique

Résumé

Motivation: With the recent advances in DNA sequencing technologies, the study of the genetic composition of living organisms has become more accessible for researchers. Several advances have been achieved because of it, especially in the health sciences. However, many challenges which emerge from the complexity of sequencing projects remain unsolved. Among them is the task of assembling DNA fragments from previously unsequenced organisms, which is classified as an NP-hard (nondeterministic polynomial time hard) problem, for which no efficient computational solution with reasonable execution time exists. However, several tools that produce approximate solutions have been used with results that have facilitated scientific discoveries, although there is ample room for improvement. As with other NP-hard problems, machine learning algorithms have been one of the approaches used in recent years in an attempt to find better solutions to the DNA fragment assembly problem, although still at a low scale. Results: This paper presents a broad review of pioneering literature comprising artificial intelligence-based DNA assemblers—particularly the ones that use machine learning—to provide an overview of state-of-the-art approaches and to serve as a starting point for further study in this field.

Mots clés

Machine learning Genome assembly Metagenomics Artificial intelligence De novo assembly

Domaines

Informatique [cs] Bio-informatique [q-bio.QM]

Annie CHATEAU : Connectez-vous pour contacter le contributeur

https://hal-lirmm.ccsd.cnrs.fr/lirmm-02067457

Soumis le : jeudi 14 mars 2019-11:37:28

Dernière modification le : samedi 15 juillet 2023-04:10:03

Dates et versions

lirmm-02067457 , version 1 (14-03-2019)

Identifiants

HAL Id : lirmm-02067457 , version 1
DOI : 10.1093/bib/bby072

Citer

Kleber Padovani de Souza, João Carlos Setubal, André Carlos Ponce de Leon F. de Carvalho, Guilherme Oliveira, Annie Chateau, et al.. Machine learning meets genome assembly. Briefings in Bioinformatics, 2019, 20 (6), pp.2116-2129. ⟨10.1093/bib/bby072⟩. ⟨lirmm-02067457⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS INRIA INRA MAB LIRMM MIPS UNIV-MONTPELLIER INRAE

134 Consultations

0 Téléchargements

Machine learning meets genome assembly

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager