Machine‐learning scoring functions for structure‐based drug lead optimization

Hongjian Li, Kam‐Heung Sze, Gang Lu, Pedro J. Ballester
2020 Wiley Interdisciplinary Reviews. Computational Molecular Science  
Molecular docking can be used to predict how strongly small-molecule binders and their chemical derivatives bind to a macromolecular target using its available three-dimensional structures. Scoring functions (SFs) are employed to rank these molecules by their predicted binding affinity (potency). A classical SF assumes a predetermined theory-inspired functional form for the relationship between the features characterizing the structure of the protein-ligand complex and its predicted binding
more » ... nity (this relationship is almost always assumed to be linear). Recent years have seen the prosperity of machinelearning SFs, which are fast regression models built instead with contemporary supervised learning algorithms. In this review, we analyzed machine-learning SFs for drug lead optimization in the 2015-2019 period. The performance gap between classical and machine-learning SFs was large and has now broadened owing to methodological improvements and the availability of more training data. Against the expectations of many experts, SFs employing deep learning techniques were not always more predictive than those based on more established machine learning techniques and, when they were, the performance gain was small. More codes and webservers are available and ready to be applied to prospective structure-based drug lead optimization studies. These have exhibited excellent predictive accuracy in compelling retrospective tests, outperforming in some cases much more computationally demanding molecular simulation-based methods. A discussion of future work completes this review.
doi:10.1002/wcms.1465 fatcat:qnrk4qw3h5gjtcxncourqzhkxe