Abstract
We applied the COSMO-RS method to predict the partition coefficient logP between water and 1-octanol for 22 small drug like molecules within the framework of the SAMPL7 blind challenge. We carefully collected a set of thermodynamically meaningful microstates, including tautomeric forms of the neutral species, and calculated the logP using the current COSMOtherm implementation on the most accurate level. With this approach, COSMO-RS was ranked as the 6st most accurate method (Measured by the mean absolute error (MAE) of 0.57) over all 17 ranked submissions. We achieved a root mean square deviation (RMSD) of 0.78. The largest deviations from experimental values are exhibited by five SAMPL molecules (SM), which seem to be shifted in most SAMPL7 contributions. In context with previous SAMPL challenges, COSMO-RS demonstrates a wide range of applicability and one of the best in class reliability and accuracy among the physical methods.
Similar content being viewed by others
Availability of data and materials
The datasets generated during and analyzed during the current study are available from the corresponding author on reasonable request.
Code availability
The code of our software is commercially available.
Notes
Originally, we submitted transfer free energies in the mol-fraction based reference state (Eq. 2 without the additional term of molar volumes). After realizing that the concentration-based framework was asked, we got the opportunity to resubmit our predicted transfer free energies in the relevant reference state by adding the logarithm of the quotient of the molar volumes (like in Eq. 2).
The AB5 and AWI17 analysis was based on the SAMPL7 analysis of Version 0.7 (https://github.com/samplchallenges/SAMPL7/releases/tag/0.7).
This analysis was based on the SAMPL7 analysis of Version 0.7 (https://github.com/samplchallenges/SAMPL7/releases/tag/0.7).
In SAMPL7, TFE MLR was the most accurate method with an RMSD of 0.58 log units.
References
Lipinski CA, Lombardo F, Dominy BW, Feeney PJ (2001) Experimental and computational approaches to estimate solubility and permeability in drug discovery and development settings. Adv Drug Deliv Rev 46:3–26. https://doi.org/10.1016/s0169-409x(00)00129-0
Lipinski CA (2004) Lead- and drug-like compounds: the rule-of-five revolution. Drug Discov Today 1:337–341. https://doi.org/10.1016/j.ddtec.2004.11.007
Nicholls A, Mobley DL, Guthrie JP et al (2008) Predicting small-molecule solvation free energies: an informal blind test for computational chemistry. J Med Chem 51:769–779. https://doi.org/10.1021/jm070549+
(2019) Drug design data resource. In: Drug Des. Data Resour. https://drugdesigndata.org. Accessed 1 Feb 2019
Klamt A, Eckert F, Reinisch J, Wichmann K (2016) Prediction of cyclohexane-water distribution coefficients with COSMO-RS on the SAMPL5 data set. J Comput Aided Mol Des 30:959–967. https://doi.org/10.1007/s10822-016-9927-y
Loschen C, Reinisch J, Klamt A (2020) COSMO-RS based predictions for the SAMPL6 logP challenge. J Comput Aided Mol Des 34:385–392. https://doi.org/10.1007/s10822-019-00259-z
Pracht P, Wilcken R, Udvarhelyi A et al (2018) High accuracy quantum-chemistry-based calculation and blind prediction of macroscopic pKa values in the context of the SAMPL6 challenge. J Comput Aided Mol Des 32:1139–1149. https://doi.org/10.1007/s10822-018-0145-7
Grimme S, Bannwarth C, Dohm S et al (2017) Fully automated quantum-chemistry-based computation of spin-spin-coupled nuclear magnetic resonance spectra. Angew Chem Int Ed. https://doi.org/10.1002/anie.201708266
Sangster J (1989) Octanol-water partition coefficients of simple organic compounds. J Phys Chem Ref Data 18:1111–1229. https://doi.org/10.1063/1.555833
Klamt A (1995) Conductor-like screening model for real solvents: a new approach to the quantitative calculation of solvation phenomena. J Phys Chem 99:2224–2235. https://doi.org/10.1021/j100007a062
Klamt A (2018) The COSMO and COSMO-RS solvation models: COSMO and COSMO-RS. Wiley Interdiscip Rev Comput Mol Sci 8:e1338. https://doi.org/10.1002/wcms.1338
Bannan CC, Calabró G, Kyu DY, Mobley DL (2016) Calculating partition coefficients of small molecules in octanol/water and cyclohexane/water. J Chem Theory Comput 12:4015–4024. https://doi.org/10.1021/acs.jctc.6b00449
COSMOquick, Version 1.8; Dassault Systemes, 2020. https://www.3ds.com/
Hornig M, Klamt A (2005) COSMOfrag: a novel tool for high-throughput ADME property prediction and similarity screening based on quantum chemistry. J Chem Inf Model 45:1169–1177. https://doi.org/10.1021/ci0501948
Loschen C, Klamt A (2012) COSMOquick: a novel interface for fast σ-profile composition and its application to COSMO-RS solvent screening using multiple reference solvents. Ind Eng Chem Res 51:14303–14308. https://doi.org/10.1021/ie3023675
Vainio MJ, Johnson MS (2007) Generating conformer ensembles using a multiobjective genetic algorithm. J Chem Inf Model 47:2462–2474. https://doi.org/10.1021/ci6005646
RDKit: Open-Source Cheminformatics Software. https://www.rdkit.org/ (Release_2016.09). Accessed 4 Nov 2016
Ebejer J-P, Morris GM, Deane CM (2012) Freely available conformer generation methods: how good are they? J Chem Inf Model 52:1146–1158. https://doi.org/10.1021/ci2004658
Becke AD (1988) Density-functional exchange-energy approximation with correct asymptotic behavior. Phys Rev A 38:3098–3100. https://doi.org/10.1103/PhysRevA.38.3098
Perdew JP (1986) Density-functional approximation for the correlation energy of the inhomogeneous electron gas. Phys Rev B 33:8822–8824. https://doi.org/10.1103/PhysRevB.33.8822
Schäfer A, Huber C, Ahlrichs R (1994) Fully optimized contracted Gaussian basis sets of triple zeta valence quality for atoms Li to Kr. J Chem Phys 100:5829. https://doi.org/10.1063/1.467146
Klamt A, Schüürmann G (1993) COSMO: a new approach to dielectric screening in solvents with explicit expressions for the screening energy and its gradient. J Chem Soc Perkin Trans 2(1993):799–805. https://doi.org/10.1039/P29930000799
Rappoport D, Furche F (2010) Property-optimized Gaussian basis sets for molecular response calculations. J Chem Phys 133:134105. https://doi.org/10.1063/1.3484283
TURBOMOLE V7.4 2019, a development of University of Karlsruhe and Forschungszentrum Karlsruhe GmbH, 1989-2007, TURBOMOLE GmbH, since 2007; available from https://www.turbomole.org.
Klamt A, Jonas V, Bürger T, Lohrenz JC (1998) Refinement and parametrization of COSMO-RS. J Phys Chem A 102:5074–5085. https://doi.org/10.1021/jp980017s
Eckert F, Klamt A (2002) Fast solvent screening via quantum chemistry: COSMO-RS approach. AIChE J 48:369–385. https://doi.org/10.1002/aic.690480220
BIOVIA COSMOtherm, Release 2020; Dassault Systèmes. https://www.3ds.com/
Mobley DL, Bergazin D, et al (2020) samplchallenges/SAMPL7: version 0.6: Add logP challenge inputs/experimental results. Zenodo. https://doi.org/10.5281/zenodo.4086044
Bergazin TD, Tielker N, Zhang Y, Mao J, Gunner MR, Francisco K, Ballatore C, Kast SM, Mobley DL. Evaluation of log P, pKa, and log D predictions from the SAMPL7 blind challenge
Acknowledgements
The authors thank David Mobley and the organizers for setting up the SAMPL7 challenge and fruitful discussions. We appreciate the National Institutes of Health for its support of the SAMPL project via R01GM124270 to David L. Mobley (UC Irvine). We thank the Ballatore lab at UCSD for carrying out the experimental logP measurements. We thank Andreas Klamt for helpful discussions. We thank Michael Diedenhofen, Johannes Schwöbel and Frank Eckert for conducting a big part of the tautomer and conformer workflow with COSMOconf and COSMOquick as well as for helping us in refining the data and supporting discussions. Judith Warnau thanks Felix Hanke for revising the manuscript.
Funding
Financial interests: All author are employed and have received research support from Company Dassault Systemes Deutschland GmbH. Apart from that, no funding was received for conducting this study.
Author information
Authors and Affiliations
Contributions
All authors contributed to developing the research design. Judith Warnau performed material preparation, data collection and analysis. All authors contributed to data analysis decisions and interpretation of the results. Judith Warnau wrote the first draft of the manuscript and all authors revised and commented on previous versions of the manuscript. All authors read and approved the final manuscript.
Corresponding author
Ethics declarations
Conflict of interest
All authors are employed by Dassault Systemes Deutschland GmbH. Dassault Systemes commercially distributes the software (COSMOtherm, COSMOconf, COSMOquick and Turbomole) which was used to conduct the calculations for this manuscript. Apart from that, there are no conflicts of interests or competing interests.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Below is the link to the electronic supplementary material.
Rights and permissions
About this article
Cite this article
Warnau, J., Wichmann, K. & Reinisch, J. COSMO-RS predictions of logP in the SAMPL7 blind challenge. J Comput Aided Mol Des 35, 813–818 (2021). https://doi.org/10.1007/s10822-021-00395-5
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10822-021-00395-5