Fast and accurate genome-scale identification of DNA-binding sites

David Martin; Vincent Maillol; Eric Rivals

doi:10.1109/BIBM.2018.8621093

Communication Dans Un Congrès Année : 2019

Fast and accurate genome-scale identification of DNA-binding sites

(1) , (1) , (2, 1)

1
2

David Martin

Fonction : Auteur
PersonId : 1041115

Méthodes et Algorithmes pour la Bioinformatique

Vincent Maillol

Fonction : Auteur

Méthodes et Algorithmes pour la Bioinformatique

Eric Rivals

Fonction : Auteur
PersonId : 2002
IdHAL : eric-rivals
ORCID : 0000-0003-3791-3973
IdRef : 118021850

Institut de Biologie Computationnelle

Méthodes et Algorithmes pour la Bioinformatique

Résumé

Motivation: Discovering DNA binding sites in genome sequences is crucial for understanding genomic regulation. Currently available computational tools for finding binding sites with Position Weight Matrices of known motifs are often used in restricted genomic regions because of their long run times. The ever-increasing number of complete genome sequences points to the need for new generations of algorithms capable of processing large amounts of data. Results: Here we present MOTIF, a new algorithm for seeking transcription factor binding sites in whole genome sequences in a few seconds. We propose a web service that enables the users to search for their own matrix or for multiple JASPAR matrices. Beyond its efficacy , the service properly handles undetermined positions within the genome sequence and provides an adequate output listing for each position the matching word and its score. Availability: MOTIF is freely available for use through an interface at http://www. atgc-montpellier.fr/motif. The source code of the stand-alone search method of MOTIF is freely available at https://gite.lirmm.fr/rivals/motif.git. It is written in C++ and tested on Linux platforms.

Mots clés

transcription factor transcriptome genome efficiency binding sites interactive software interface web tool motif search stringology pattern matching bioinformatics

Domaines

Bio-informatique [q-bio.QM]

Fichier principal

motifs-hal.pdf (337.05 Ko)

Origine	Fichiers produits par l'(les) auteur(s)

Eric Rivals : Connectez-vous pour contacter le contributeur

https://hal-lirmm.ccsd.cnrs.fr/lirmm-01967466

Soumis le : lundi 31 décembre 2018-12:33:37

Dernière modification le : samedi 15 juillet 2023-04:10:03

Archivage à long terme le : lundi 1 avril 2019-12:41:38

Dates et versions

lirmm-01967466 , version 1 (31-12-2018)

Identifiants

HAL Id : lirmm-01967466 , version 1
DOI : 10.1109/BIBM.2018.8621093

Citer

David Martin, Vincent Maillol, Eric Rivals. Fast and accurate genome-scale identification of DNA-binding sites. BIBM: Bioinformatics and Biomedicine, Dec 2018, Madrid, Spain. pp.201-205, ⟨10.1109/BIBM.2018.8621093⟩. ⟨lirmm-01967466⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS INRIA INRA MAB LIRMM MIPS UNIV-MONTPELLIER INRAE ANR

205 Consultations

333 Téléchargements

Fast and accurate genome-scale identification of DNA-binding sites

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager