Abstract
HIV-1 protease (PR) inhibitors are crucial for antiviral therapies targeting acquired immunodeficiency syndrome. Hundreds of PR complexes with various ligands have been resolved and deposited in the Protein Data Bank. However, binding affinity measurements for these ligands are not always available. This gap hinders a comprehensive understanding of inhibitor efficacy. To address this challenge, machine learning models are constructed and validated based on the crystallographic coordinates of 291 PR–inhibitor complexes, leveraging over 2500 molecular descriptors. The models achieved accuracy scores exceeding 0.85, and applied to predict the binding affinity of 274 additional complexes for which inhibition constants are not experimentally measured. The analysis is focused on three models, each with 8–9 features, and based on KBest with random forest, recursive feature elimination with random forest, and sequential feature selection with support vector machine. The findings revealed key predictive features, including properties of PR inhibitors like charge distribution, hydrogen-bonding capability, and 3D topology, as well as intrinsic properties of PR, such as active site symmetry and flap mutations. The study highlights the contribution of a comprehensive analysis of accumulated experimental data to enhance the structural understanding of this important molecular system.
| Original language | English |
|---|---|
| Article number | e202500277 |
| Pages (from-to) | e202500277 |
| Journal | ChemMedChem |
| Volume | 20 |
| Issue number | 15 |
| DOIs | |
| State | Published - 2 Aug 2025 |
Bibliographical note
© 2025 The Authors. ChemMedChem published by Wiley‐VCH GmbH.Keywords
- Binding Sites
- HIV Protease Inhibitors/chemistry
- HIV Protease/metabolism
- HIV-1/enzymology
- Humans
- Machine Learning
- Models, Molecular
- Molecular Structure