Automatic Identification of Large Collections of Protein-Coding or rRNA Sequences - LIRMM - Laboratoire d’Informatique, de Robotique et de Microélectronique de Montpellier
Journal Articles Biochimie Year : 2008

Automatic Identification of Large Collections of Protein-Coding or rRNA Sequences

Abstract

The number of available genomic sequences is growing very fast, due to the development of massive sequencing techniques. Sequence identification is needed and contributes to the assessment of gene and species evolutionary relationships. Automated bioinformatics tools are thus necessary to carry out these identification operations in an accurate and fast way. We developed HoSeqI (Homologous Sequence Identification), a software environment allowing this kind of automated sequence identification using homologous gene family databases. HoSeqI is accessible through a Web interface (http://pbil.univ-lyon1.fr/software/HoSeqI/) allowing to identify one or several sequences and to visualize resulting alignments and phylogenetic trees. We also implemented another application, MultiHoSeqI, to quickly add a large set of sequences to a family database in order to identify them, to update the database, or to help automatic genome annotation. Lately, we developed an application, ChiSeqI (Chimeric Sequence Identification), to automate the processes of identification of bacterial 16S ribosomal RNA sequences and of detection of chimeric sequences.
Fichier principal
Vignette du fichier
Arigon_et_al-Biochimie-2008.pdf (406.23 Ko) Télécharger le fichier
Origin Files produced by the author(s)
Loading...

Dates and versions

lirmm-00366131 , version 1 (23-05-2013)

Identifiers

Cite

Anne-Muriel Arigon Chifolleau, Guy Perrière, Manolo Gouy. Automatic Identification of Large Collections of Protein-Coding or rRNA Sequences. Biochimie, 2008, 90 (4), pp.609-614. ⟨10.1016/j.biochi.2007.08.006⟩. ⟨lirmm-00366131⟩
303 View
436 Download

Altmetric

Share

More