Please use this identifier to cite or link to this item:
https://www.arca.fiocruz.br/handle/icict/56334
Type
ArticleCopyright
Open access
Sustainable Development Goals
04 Educação de qualidadeCollections
- IOC - Artigos de Periódicos [12820]
Metadata
Show full item record
A MACHINE LEARNING-BASED VIRTUAL SCREENING FOR NATURAL COMPOUNDS CAPABLE OF INHIBITING THE HIV-1 INTEGRASE
Affilliation
Faculty of Exact and Natural Sciences, University of Buenos Aires. Buenos Aires, Argentina.
Fundação Oswaldo Cruz. Plataforma Institucional para a Biodiversidade e Saúde da Vida Selvagem. Rio de Janeiro, RJ, Brasil.
Fundação Oswaldo Cruz. Instituto Oswaldo Cruz. Laboratório de Genômica Funcional e Bioinformática. Rio de Janeiro, RJ, Brasil.
Fundação Oswaldo Cruz. Plataforma Institucional para a Biodiversidade e Saúde da Vida Selvagem. Rio de Janeiro, RJ, Brasil.
Fundação Oswaldo Cruz. Instituto Oswaldo Cruz. Laboratório de Genômica Funcional e Bioinformática. Rio de Janeiro, RJ, Brasil.
Abstract
HIV-1 integrase is an essential enzyme for the HIV-1 replication cycle, and
currently, integrase inhibitors are in the first line of treatment in many guidelines.
Despite the discovery of new inhibitors, including a new class of molecules with
different mechanisms of action, resistance is still a relevant problem, and adding
new options to the therapeutic arsenal to fight viral resistance is a Sisyphean
task. Because of the difficulty and cost of in vitro screenings, machine learningdriven
ligand-based virtual screenings are an alternative that can not only cut
costs but also use valuable information about active compounds with yet
unknown mechanisms of action. In this work, we describe a thorough
model exploration and hyperparameter tuning procedure in a dataset with
class imbalance and show several models capable of distinguishing between
compounds that are active or inactive against the HIV-1 integrase. The best of
the models was then used to screen the natural product atlas for active
compounds, resulting in a myriad of molecules that share features with
known integrase inhibitors. Here we also explore the strengths and
shortcomings of our models and discuss the use of the applicability domain
to guide in vitro screenings and differentiate between the “predictable” and
“unknown” regions of the chemical space.
Share