Please use this identifier to cite or link to this item:
https://www.arca.fiocruz.br/handle/icict/38948
Type
Article
Copyright
Open access
Collections
- IFF - Artigos de Periódicos [1290]
- IOC - Artigos de Periódicos [12878]
GENOMYCDB: A DATABASE FOR COMPARATIVE ANALYSIS OF MYCOBACTERIAL GENES AND GENOMES
Affiliation
Fundação Oswaldo Cruz. Instituto Oswaldo Cruz. Departamento de Bioquímica e Biologia Molecular. Rio de Janeiro, RJ, Brasil / Fundação Oswaldo Cruz. Instituto Fernandes Figueira. Departamento de Genética. Rio de Janeiro, RJ, Brasil.
Fundação Oswaldo Cruz. Instituto Oswaldo Cruz. Departamento de Bioquímica e Biologia Molecular. Rio de Janeiro, RJ, Brasil.
Fundação Oswaldo Cruz. Instituto Oswaldo Cruz. Departamento de Bioquímica e Biologia Molecular. Rio de Janeiro, RJ, Brasil.
Fundação Oswaldo Cruz. Instituto Oswaldo Cruz. Departamento de Bioquímica e Biologia Molecular. Rio de Janeiro, RJ, Brasil.
Fundação Oswaldo Cruz. Instituto Oswaldo Cruz. Departamento de Bioquímica e Biologia Molecular. Rio de Janeiro, RJ, Brasil.
Fundação Oswaldo Cruz. Instituto Oswaldo Cruz. Departamento de Bioquímica e Biologia Molecular. Rio de Janeiro, RJ, Brasil.
Fundação Oswaldo Cruz. Instituto Oswaldo Cruz. Departamento de Bioquímica e Biologia Molecular. Rio de Janeiro, RJ, Brasil.
Abstract
Several databases and computational tools have been created with the aim of organizing, integrating and analyzing the wealth of information generated by large-scale sequencing projects of mycobacterial genomes and those of other organisms. However, with very few exceptions, these databases and tools do not allow for massive and/or dynamic comparison of these data. GenoMycDB (http://www.dbbm.fiocruz.br/GenoMycDB) is a relational database built for large-scale comparative analyses of completely sequenced mycobacterial genomes, based on their predicted protein content. Its central structure is composed of the results obtained after pair-wise sequence alignments among all the predicted proteins coded by the genomes of six mycobacteria: Mycobacterium tuberculosis (strains H37Rv and CDC1551), M. bovis AF2122/97, M. avium subsp. paratuberculosis K10, M. leprae TN, and M. smegmatis MC2 155. The database stores the computed similarity parameters of every aligned pair, providing for each protein sequence the predicted subcellular localization, the assigned cluster of orthologous groups, the features of the corresponding gene, and links to several important databases. Tables containing pairs or groups of potential homologs between selected species/strains can be produced dynamically by user-defined criteria, based on one or multiple sequence similarity parameters. In addition, searches can be restricted according to the predicted subcellular localization of the protein, the DNA strand of the corresponding gene and/or the description of the protein. Massive data search and/or retrieval are available, and different ways of exporting the result are offered. GenoMycDB provides an on-line resource for the functional classification of mycobacterial proteins as well as for the analysis of genome structure, organization, and evolution.
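The kind of query the abstract describes (selecting pairs of potential homologs between two genomes by similarity thresholds, optionally restricted by predicted subcellular localization) can be illustrated with a minimal sketch. Everything below is an assumption made for illustration: the table and column names, the function `find_homolog_pairs`, the genome labels, and the toy rows are hypothetical and do not reflect GenoMycDB's actual schema, interface, or data.

```python
import sqlite3

# Hypothetical schema for illustration only; not GenoMycDB's real tables.
SCHEMA = """
CREATE TABLE protein (
    protein_id   TEXT PRIMARY KEY,
    genome       TEXT NOT NULL,
    localization TEXT,           -- predicted subcellular localization
    strand       TEXT,           -- DNA strand of the corresponding gene
    description  TEXT
);
CREATE TABLE alignment (
    query_id   TEXT NOT NULL REFERENCES protein(protein_id),
    subject_id TEXT NOT NULL REFERENCES protein(protein_id),
    identity   REAL,             -- percent identity of the aligned pair
    coverage   REAL,             -- percent of the query covered
    evalue     REAL
);
"""


def build_toy_db():
    """Create an in-memory database filled with made-up example rows."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(SCHEMA)
    conn.executemany(
        "INSERT INTO protein VALUES (?, ?, ?, ?, ?)",
        [
            ("A1", "genomeA", "membrane", "+", "hypothetical transporter"),
            ("A2", "genomeA", "cytoplasm", "-", "hypothetical kinase"),
            ("B1", "genomeB", "membrane", "+", "hypothetical transporter"),
            ("B2", "genomeB", "cytoplasm", "+", "hypothetical kinase"),
        ],
    )
    conn.executemany(
        "INSERT INTO alignment VALUES (?, ?, ?, ?, ?)",
        [
            ("A1", "B1", 92.0, 98.0, 1e-120),
            ("A1", "B2", 25.0, 40.0, 0.5),
            ("A2", "B2", 71.0, 88.0, 1e-60),
            ("A2", "B1", 22.0, 30.0, 2.0),
        ],
    )
    return conn


def find_homolog_pairs(conn, genome_a, genome_b, min_identity=0.0,
                       min_coverage=0.0, max_evalue=10.0, localization=None):
    """Return aligned pairs between two genomes that pass every threshold."""
    sql = """
        SELECT a.query_id, a.subject_id, a.identity, a.coverage, a.evalue
        FROM alignment AS a
        JOIN protein AS q ON q.protein_id = a.query_id
        JOIN protein AS s ON s.protein_id = a.subject_id
        WHERE q.genome = ? AND s.genome = ?
          AND a.identity >= ? AND a.coverage >= ? AND a.evalue <= ?
    """
    params = [genome_a, genome_b, min_identity, min_coverage, max_evalue]
    if localization is not None:
        # Optional restriction on the query protein's predicted localization.
        sql += " AND q.localization = ?"
        params.append(localization)
    sql += " ORDER BY a.identity DESC"
    return conn.execute(sql, params).fetchall()


if __name__ == "__main__":
    conn = build_toy_db()
    for row in find_homolog_pairs(conn, "genomeA", "genomeB",
                                  min_identity=50, min_coverage=80,
                                  max_evalue=1e-10):
        print(row)
```

Storing one row per aligned pair lets several similarity parameters be combined in a single `WHERE` clause, which is one plausible way a relational design could support the dynamic, user-defined criteria mentioned in the abstract.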