Please use this identifier to cite or link to this item:
https://www.arca.fiocruz.br/handle/icict/47751
Type
Article
Copyright
Open access
TSSFINDER—FAST AND ACCURATE AB INITIO PREDICTION OF THE CORE PROMOTER IN EUKARYOTIC GENOMES
Author
Affiliation
Fundação Oswaldo Cruz. Instituto Carlos Chagas. Laboratório de Proteômica Estrutural e Computacional. Curitiba, PR, Brasil.
Technology Company Elo7. São Paulo, SP, Brasil.
Universidade de São Paulo. São Paulo, SP, Brasil.
Universidade de São Paulo. Instituto de Química. São Paulo, SP, Brasil.
Universidade de São Paulo. São Paulo, SP, Brasil.
Technology Company Elo7. São Paulo, SP, Brasil.
Universidade de São Paulo. São Paulo, SP, Brasil.
Universidade de São Paulo. Instituto de Química. São Paulo, SP, Brasil.
Universidade de São Paulo. São Paulo, SP, Brasil.
Abstract
Promoter annotation is an important task in the analysis of a genome. One of the main challenges for this task is locating the border between the promoter region and the transcribed region of the gene, the transcription start site (TSS). The TSS is the reference point used to delimit the DNA sequence responsible for the assembly of the transcription complex. Because the same gene can have more than one TSS, delimiting the promoter region requires locating the TSS closest to the translation start site. This paper presents TSSFinder, a new software tool for predicting the TSS signal of eukaryotic genes that is significantly more accurate than other available software. TSSFinder is currently the only application to offer pre-trained models for six different eukaryotic organisms: Arabidopsis thaliana, Drosophila melanogaster, Gallus gallus, Homo sapiens, Oryza sativa and Saccharomyces cerevisiae. Additionally, the software can be easily customized for specific organisms using a training set of only 125 DNA sequences with a validated TSS signal and their corresponding genomic locations. TSSFinder is a valuable new tool for the annotation of genomes. The TSSFinder source code and Docker container can be downloaded from http://tssfinder.github.io. TSSFinder is also available as a web service at http://sucest-fun.org/wsapp/tssfinder/.
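The selection rule stated in the abstract (when a gene has several TSSs, use the one closest to the start of translation) can be illustrated with a minimal sketch. This is not TSSFinder's prediction method or its API; the function name `closest_upstream_tss`, its parameters, and the coordinate convention are hypothetical and chosen only for illustration. It assumes candidate TSS positions are already known and that a valid TSS lies strictly upstream of the start codon.

```python
from typing import Iterable, Optional


def closest_upstream_tss(
    tss_positions: Iterable[int], start_codon: int, strand: str = "+"
) -> Optional[int]:
    """Return the candidate TSS nearest to the start codon, or None.

    Hypothetical helper, not part of TSSFinder. Only candidates upstream
    of the start codon are considered: smaller coordinates on the "+"
    strand, larger coordinates on the "-" strand.
    """
    if strand not in ("+", "-"):
        raise ValueError("strand must be '+' or '-'")
    if strand == "+":
        upstream = [p for p in tss_positions if p < start_codon]
    else:
        upstream = [p for p in tss_positions if p > start_codon]
    if not upstream:
        return None  # no candidate lies upstream of the start codon
    # Pick the candidate with the smallest distance to the start codon.
    return min(upstream, key=lambda p: abs(start_codon - p))
```

For example, with candidates at 100, 250 and 400 and a start codon at 300 on the "+" strand, the sketch selects 250; on the "-" strand it selects 400.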
Keywords
Transcription Initiation Site
Promoter Regions, Genetic
annotation of genomes
Genomics
conditional random fields