Please use this identifier to cite or link to this item:
https://www.arca.fiocruz.br/handle/icict/38950
Type
Article
Copyright
Restricted access
Embargo date
2025-01-01
Collections
- IOC - Artigos de Periódicos [12968]
PROTOGIM: A NOVEL TOOL TO SEARCH MOTIFS AND DOMAINS IN HYPOTHETICAL PROTEINS OF PROTOZOAN GENOMES
Affiliation
Fundação Oswaldo Cruz. Instituto Oswaldo Cruz. Departamento de Bioquímica e Biologia Molecular. Rio de Janeiro, RJ, Brasil.
Fundação Oswaldo Cruz. Instituto Oswaldo Cruz. Departamento de Bioquímica e Biologia Molecular. Rio de Janeiro, RJ, Brasil.
Universidade Federal de Minas Gerais. Instituto de Ciências Biológicas. Departamento de Bioquímica e Imunologia. Belo Horizonte, MG, Brasil.
Fundação Oswaldo Cruz. Instituto Oswaldo Cruz. Departamento de Bioquímica e Biologia Molecular. Rio de Janeiro, RJ, Brasil.
Fundação Oswaldo Cruz. Instituto Oswaldo Cruz. Departamento de Bioquímica e Biologia Molecular. Rio de Janeiro, RJ, Brasil.
Universidade Federal de Minas Gerais. Instituto de Ciências Biológicas. Departamento de Bioquímica e Imunologia. Belo Horizonte, MG, Brasil.
Fundação Oswaldo Cruz. Instituto Oswaldo Cruz. Departamento de Bioquímica e Biologia Molecular. Rio de Janeiro, RJ, Brasil.
Abstract
Whole-genome sequencing of protozoan trypanosomatids revealed several predicted genes of unknown function that code for hypothetical proteins. The pairwise, alignment-based computational methods available online are unable to identify the function of these sequences. To find clues to the function of hypothetical proteins, a user-friendly bioinformatic tool named PROTOzoan Gene Identification Motifs (PROTOGIM, available at http://www.biowebdb.org/protogim) was developed. It allows the user to search for functional patterns in hypothetical proteins by screening the sequences against regular expressions. Analysis of 1,194 trypanosomatid hypothetical proteins with PROTOGIM identified motifs and domains in 98% of the cases, demonstrating the reliability and accuracy of the method. The added value of this tool is that the user can modify existing regular expressions or insert new ones and run the analysis against one or several sequences at the same time. An in silico strategy combined with biochemical and molecular characterization creates new possibilities for finding the functions of hypothetical proteins in the post-genome era.
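The regular-expression screening described in the abstract can be illustrated with a minimal Python sketch. This is not PROTOGIM's implementation: the function name `scan_sequences`, the `MOTIFS` dictionary, and the example sequences are all hypothetical, and the two patterns are well-known PROSITE-style motifs (N-glycosylation site and P-loop) rewritten as regular expressions, assumed here purely for illustration.

```python
import re

# Illustrative motif patterns written as regular expressions.
# These are assumptions for this sketch, not PROTOGIM's own pattern set.
MOTIFS = {
    "N-glycosylation site": r"N[^P][ST][^P]",
    "P-loop (ATP/GTP-binding motif A)": r"[AG].{4}GK[ST]",
}

def scan_sequences(sequences, motifs=MOTIFS):
    """Return {sequence_id: [(motif_name, 1-based start, matched_text), ...]}."""
    hits = {}
    for seq_id, seq in sequences.items():
        found = []
        for name, pattern in motifs.items():
            # A lookahead with a capture group also reports overlapping matches.
            for m in re.finditer(f"(?=({pattern}))", seq.upper()):
                found.append((name, m.start() + 1, m.group(1)))
        hits[seq_id] = found
    return hits

# Made-up protein sequences, for demonstration only.
example = {
    "hypothetical_1": "MKGESGSGKSTLLNASAR",
    "hypothetical_2": "MPPLLQQ",
}
results = scan_sequences(example)
for seq_id, found in results.items():
    print(seq_id, found)
```

Because the patterns live in a plain dictionary, adding or editing a regular expression and rescanning one or many sequences requires no change to the scanning function, which mirrors the kind of flexibility the abstract attributes to the tool.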