Skip to Main content Skip to Navigation
Journal articles

Automatic Identification of Large Collections of Protein-Coding or rRNA Sequences

Anne-Muriel Arigon Chifolleau 1, 2, * Guy Perrière 1 Manolo Gouy 1 
* Corresponding author
2 MAB - Méthodes et Algorithmes pour la Bioinformatique
LIRMM - Laboratoire d'Informatique de Robotique et de Microélectronique de Montpellier
Abstract : The number of available genomic sequences is growing very fast, due to the development of massive sequencing techniques. Sequence identification is needed and contributes to the assessment of gene and species evolutionary relationships. Automated bioinformatics tools are thus necessary to carry out these identification operations in an accurate and fast way. We developed HoSeqI (Homologous Sequence Identification), a software environment allowing this kind of automated sequence identification using homologous gene family databases. HoSeqI is accessible through a Web interface ( allowing to identify one or several sequences and to visualize resulting alignments and phylogenetic trees. We also implemented another application, MultiHoSeqI, to quickly add a large set of sequences to a family database in order to identify them, to update the database, or to help automatic genome annotation. Lately, we developed an application, ChiSeqI (Chimeric Sequence Identification), to automate the processes of identification of bacterial 16S ribosomal RNA sequences and of detection of chimeric sequences.
Complete list of metadata

Cited literature [27 references]  Display  Hide  Download
Contributor : Anne-Muriel Arigon Chifolleau Connect in order to contact the contributor
Submitted on : Thursday, May 23, 2013 - 3:27:47 PM
Last modification on : Saturday, September 24, 2022 - 2:36:04 PM
Long-term archiving on: : Saturday, August 24, 2013 - 2:25:08 AM


Files produced by the author(s)



Anne-Muriel Arigon Chifolleau, Guy Perrière, Manolo Gouy. Automatic Identification of Large Collections of Protein-Coding or rRNA Sequences. Biochimie, Elsevier, 2008, 90 (4), pp.609-614. ⟨10.1016/j.biochi.2007.08.006⟩. ⟨lirmm-00366131⟩



Record views


Files downloads