Journal articles

Automatic Identification of Large Collections of Protein-Coding or rRNA Sequences

Anne-Muriel Arigon Chifolleau 1, 2, *, Guy Perrière 1, Manolo Gouy 1
* Corresponding author
2 MAB - Méthodes et Algorithmes pour la Bioinformatique
LIRMM - Laboratoire d'Informatique de Robotique et de Microélectronique de Montpellier
Abstract : The number of available genomic sequences is growing very fast owing to the development of massive sequencing techniques. Identifying these sequences is necessary and contributes to the assessment of evolutionary relationships among genes and species. Automated bioinformatics tools are therefore needed to carry out identification accurately and quickly. We developed HoSeqI (Homologous Sequence Identification), a software environment that performs this kind of automated sequence identification using databases of homologous gene families. HoSeqI is accessible through a Web interface (http://pbil.univ-lyon1.fr/software/HoSeqI/) that lets users identify one or several sequences and visualize the resulting alignments and phylogenetic trees. We also implemented a second application, MultiHoSeqI, which quickly adds a large set of sequences to a family database in order to identify them, to update the database, or to assist automatic genome annotation. More recently, we developed ChiSeqI (Chimeric Sequence Identification), an application that automates the identification of bacterial 16S ribosomal RNA sequences and the detection of chimeric sequences.

Cited literature [27 references]

https://hal-lirmm.ccsd.cnrs.fr/lirmm-00366131
Contributor : Anne-Muriel Arigon Chifolleau
Submitted on : Thursday, May 23, 2013 - 3:27:47 PM
Last modification on : Monday, February 10, 2020 - 4:36:55 PM
Document(s) archived on : Saturday, August 24, 2013 - 2:25:08 AM

File

Arigon_et_al-Biochimie-2008.pd...
Files produced by the author(s)

Citation

Anne-Muriel Arigon Chifolleau, Guy Perrière, Manolo Gouy. Automatic Identification of Large Collections of Protein-Coding or rRNA Sequences. Biochimie, Elsevier, 2008, 90 (4), pp.609-614. ⟨10.1016/j.biochi.2007.08.006⟩. ⟨lirmm-00366131⟩
