A machine learning free energy functional for the 1D reference interaction site model : towards prediction of solvation free energy for all solvent systems

Conn, Jonathan G. M. and Ahmad, Abdullah and Palmer, David S. (2024) A machine learning free energy functional for the 1D reference interaction site model : towards prediction of solvation free energy for all solvent systems. Liquids, 4 (4). pp. 710-731. ISSN 2673-8015 (https://doi.org/10.3390/liquids4040040)

[thumbnail of Conn-etal-Liquids-2024-A-machine-learning-free-energy-functional-for-the-1D-reference-interaction-site-model]
Preview
Text. Filename: Conn-etal-Liquids-2024-A-machine-learning-free-energy-functional-for-the-1D-reference-interaction-site-model.pdf
Final Published Version
License: Creative Commons Attribution 4.0 logo

Download (3MB)| Preview

Abstract

Understanding the interactions between solutes and solvents is vital in many areas of the chemical sciences. Solvation free energy (SFE) is an important thermodynamic property in characterising molecular solvation and so accurate prediction of this property is sought after. The One-Dimensional Reference Interaction Site Model (RISM) is a well-established method for modelling solvation, but it is known to yield large errors in the calculation of SFE. In this work, we show that a single machine learning free energy functional for RISM can accurately model solvation thermodynamics in multiple solvents. A convolutional neural network is trained on solvation free energy density functions calculated by RISM for small organic molecules in approximately 100 different solvent systems. We achieve an average RMSE of 1.41 kcal/mol and an R2 of 0.89 across all solvent systems. We also compare the performance for the most and least commonly represented solvents and show that higher accuracy is generally seen with higher volumes of data, with RMSE values of 0.69–1.29 kcal/mol and R2 values of 0.78–0.97 for solvents with more than 50 data points. We have shown that machine learning can greatly improve solvation free energy predictions in RISM, while demonstrating that the methodology is generalisable across solvent systems. This represents a significant step towards a universal machine learning SFE functional for RISM.

ORCID iDs

Conn, Jonathan G. M., Ahmad, Abdullah ORCID logoORCID: https://orcid.org/0000-0002-6315-749X and Palmer, David S. ORCID logoORCID: https://orcid.org/0000-0003-4356-9144;