Skip to main content
Log in

COSMO-RS predictions of logP in the SAMPL7 blind challenge

  • Published:
Journal of Computer-Aided Molecular Design Aims and scope Submit manuscript

Abstract

We applied the COSMO-RS method to predict the partition coefficient logP between water and 1-octanol for 22 small drug like molecules within the framework of the SAMPL7 blind challenge. We carefully collected a set of thermodynamically meaningful microstates, including tautomeric forms of the neutral species, and calculated the logP using the current COSMOtherm implementation on the most accurate level. With this approach, COSMO-RS was ranked as the 6st most accurate method (Measured by the mean absolute error (MAE) of 0.57) over all 17 ranked submissions. We achieved a root mean square deviation (RMSD) of 0.78. The largest deviations from experimental values are exhibited by five SAMPL molecules (SM), which seem to be shifted in most SAMPL7 contributions. In context with previous SAMPL challenges, COSMO-RS demonstrates a wide range of applicability and one of the best in class reliability and accuracy among the physical methods.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3

Similar content being viewed by others

Availability of data and materials

The datasets generated during and analyzed during the current study are available from the corresponding author on reasonable request.

Code availability

The code of our software is commercially available.

Notes

  1. Originally, we submitted transfer free energies in the mol-fraction based reference state (Eq. 2 without the additional term of molar volumes). After realizing that the concentration-based framework was asked, we got the opportunity to resubmit our predicted transfer free energies in the relevant reference state by adding the logarithm of the quotient of the molar volumes (like in Eq. 2).

  2. The AB5 and AWI17 analysis was based on the SAMPL7 analysis of Version 0.7 (https://github.com/samplchallenges/SAMPL7/releases/tag/0.7).

  3. This analysis was based on the SAMPL7 analysis of Version 0.7 (https://github.com/samplchallenges/SAMPL7/releases/tag/0.7).

  4. In SAMPL7, TFE MLR was the most accurate method with an RMSD of 0.58 log units.

References

  1. Lipinski CA, Lombardo F, Dominy BW, Feeney PJ (2001) Experimental and computational approaches to estimate solubility and permeability in drug discovery and development settings. Adv Drug Deliv Rev 46:3–26. https://doi.org/10.1016/s0169-409x(00)00129-0

    Article  CAS  PubMed  Google Scholar 

  2. Lipinski CA (2004) Lead- and drug-like compounds: the rule-of-five revolution. Drug Discov Today 1:337–341. https://doi.org/10.1016/j.ddtec.2004.11.007

    Article  CAS  Google Scholar 

  3. Nicholls A, Mobley DL, Guthrie JP et al (2008) Predicting small-molecule solvation free energies: an informal blind test for computational chemistry. J Med Chem 51:769–779. https://doi.org/10.1021/jm070549+

    Article  CAS  PubMed  Google Scholar 

  4. (2019) Drug design data resource. In: Drug Des. Data Resour. https://drugdesigndata.org. Accessed 1 Feb 2019

  5. Klamt A, Eckert F, Reinisch J, Wichmann K (2016) Prediction of cyclohexane-water distribution coefficients with COSMO-RS on the SAMPL5 data set. J Comput Aided Mol Des 30:959–967. https://doi.org/10.1007/s10822-016-9927-y

    Article  CAS  PubMed  Google Scholar 

  6. Loschen C, Reinisch J, Klamt A (2020) COSMO-RS based predictions for the SAMPL6 logP challenge. J Comput Aided Mol Des 34:385–392. https://doi.org/10.1007/s10822-019-00259-z

    Article  CAS  PubMed  Google Scholar 

  7. Pracht P, Wilcken R, Udvarhelyi A et al (2018) High accuracy quantum-chemistry-based calculation and blind prediction of macroscopic pKa values in the context of the SAMPL6 challenge. J Comput Aided Mol Des 32:1139–1149. https://doi.org/10.1007/s10822-018-0145-7

    Article  CAS  PubMed  Google Scholar 

  8. Grimme S, Bannwarth C, Dohm S et al (2017) Fully automated quantum-chemistry-based computation of spin-spin-coupled nuclear magnetic resonance spectra. Angew Chem Int Ed. https://doi.org/10.1002/anie.201708266

    Article  Google Scholar 

  9. Sangster J (1989) Octanol-water partition coefficients of simple organic compounds. J Phys Chem Ref Data 18:1111–1229. https://doi.org/10.1063/1.555833

    Article  CAS  Google Scholar 

  10. Klamt A (1995) Conductor-like screening model for real solvents: a new approach to the quantitative calculation of solvation phenomena. J Phys Chem 99:2224–2235. https://doi.org/10.1021/j100007a062

    Article  CAS  Google Scholar 

  11. Klamt A (2018) The COSMO and COSMO-RS solvation models: COSMO and COSMO-RS. Wiley Interdiscip Rev Comput Mol Sci 8:e1338. https://doi.org/10.1002/wcms.1338

    Article  CAS  Google Scholar 

  12. Bannan CC, Calabró G, Kyu DY, Mobley DL (2016) Calculating partition coefficients of small molecules in octanol/water and cyclohexane/water. J Chem Theory Comput 12:4015–4024. https://doi.org/10.1021/acs.jctc.6b00449

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  13. COSMOquick, Version 1.8; Dassault Systemes, 2020. https://www.3ds.com/

  14. Hornig M, Klamt A (2005) COSMOfrag: a novel tool for high-throughput ADME property prediction and similarity screening based on quantum chemistry. J Chem Inf Model 45:1169–1177. https://doi.org/10.1021/ci0501948

    Article  CAS  PubMed  Google Scholar 

  15. Loschen C, Klamt A (2012) COSMOquick: a novel interface for fast σ-profile composition and its application to COSMO-RS solvent screening using multiple reference solvents. Ind Eng Chem Res 51:14303–14308. https://doi.org/10.1021/ie3023675

    Article  CAS  Google Scholar 

  16. Vainio MJ, Johnson MS (2007) Generating conformer ensembles using a multiobjective genetic algorithm. J Chem Inf Model 47:2462–2474. https://doi.org/10.1021/ci6005646

    Article  CAS  PubMed  Google Scholar 

  17. RDKit: Open-Source Cheminformatics Software. https://www.rdkit.org/ (Release_2016.09). Accessed 4 Nov 2016

  18. Ebejer J-P, Morris GM, Deane CM (2012) Freely available conformer generation methods: how good are they? J Chem Inf Model 52:1146–1158. https://doi.org/10.1021/ci2004658

    Article  CAS  PubMed  Google Scholar 

  19. Becke AD (1988) Density-functional exchange-energy approximation with correct asymptotic behavior. Phys Rev A 38:3098–3100. https://doi.org/10.1103/PhysRevA.38.3098

    Article  CAS  Google Scholar 

  20. Perdew JP (1986) Density-functional approximation for the correlation energy of the inhomogeneous electron gas. Phys Rev B 33:8822–8824. https://doi.org/10.1103/PhysRevB.33.8822

    Article  CAS  Google Scholar 

  21. Schäfer A, Huber C, Ahlrichs R (1994) Fully optimized contracted Gaussian basis sets of triple zeta valence quality for atoms Li to Kr. J Chem Phys 100:5829. https://doi.org/10.1063/1.467146

    Article  Google Scholar 

  22. Klamt A, Schüürmann G (1993) COSMO: a new approach to dielectric screening in solvents with explicit expressions for the screening energy and its gradient. J Chem Soc Perkin Trans 2(1993):799–805. https://doi.org/10.1039/P29930000799

    Article  Google Scholar 

  23. Rappoport D, Furche F (2010) Property-optimized Gaussian basis sets for molecular response calculations. J Chem Phys 133:134105. https://doi.org/10.1063/1.3484283

    Article  CAS  PubMed  Google Scholar 

  24. TURBOMOLE V7.4 2019, a development of University of Karlsruhe and Forschungszentrum Karlsruhe GmbH, 1989-2007, TURBOMOLE GmbH, since 2007; available from https://www.turbomole.org.

  25. Klamt A, Jonas V, Bürger T, Lohrenz JC (1998) Refinement and parametrization of COSMO-RS. J Phys Chem A 102:5074–5085. https://doi.org/10.1021/jp980017s

    Article  CAS  Google Scholar 

  26. Eckert F, Klamt A (2002) Fast solvent screening via quantum chemistry: COSMO-RS approach. AIChE J 48:369–385. https://doi.org/10.1002/aic.690480220

    Article  CAS  Google Scholar 

  27. BIOVIA COSMOtherm, Release 2020; Dassault Systèmes. https://www.3ds.com/

  28. Mobley DL, Bergazin D, et al (2020) samplchallenges/SAMPL7: version 0.6: Add logP challenge inputs/experimental results. Zenodo. https://doi.org/10.5281/zenodo.4086044

  29. Bergazin TD, Tielker N, Zhang Y, Mao J, Gunner MR, Francisco K, Ballatore C, Kast SM, Mobley DL. Evaluation of log P, pKa, and log D predictions from the SAMPL7 blind challenge

Download references

Acknowledgements

The authors thank David Mobley and the organizers for setting up the SAMPL7 challenge and fruitful discussions. We appreciate the National Institutes of Health for its support of the SAMPL project via R01GM124270 to David L. Mobley (UC Irvine). We thank the Ballatore lab at UCSD for carrying out the experimental logP measurements. We thank Andreas Klamt for helpful discussions. We thank Michael Diedenhofen, Johannes Schwöbel and Frank Eckert for conducting a big part of the tautomer and conformer workflow with COSMOconf and COSMOquick as well as for helping us in refining the data and supporting discussions. Judith Warnau thanks Felix Hanke for revising the manuscript.

Funding

Financial interests: All author are employed and have received research support from Company Dassault Systemes Deutschland GmbH. Apart from that, no funding was received for conducting this study.

Author information

Authors and Affiliations

Authors

Contributions

All authors contributed to developing the research design. Judith Warnau performed material preparation, data collection and analysis. All authors contributed to data analysis decisions and interpretation of the results. Judith Warnau wrote the first draft of the manuscript and all authors revised and commented on previous versions of the manuscript. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Judith Warnau.

Ethics declarations

Conflict of interest

All authors are employed by Dassault Systemes Deutschland GmbH. Dassault Systemes commercially distributes the software (COSMOtherm, COSMOconf, COSMOquick and Turbomole) which was used to conduct the calculations for this manuscript. Apart from that, there are no conflicts of interests or competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Below is the link to the electronic supplementary material.

Supplementary file1 (PDF 1313 kb)

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Warnau, J., Wichmann, K. & Reinisch, J. COSMO-RS predictions of logP in the SAMPL7 blind challenge. J Comput Aided Mol Des 35, 813–818 (2021). https://doi.org/10.1007/s10822-021-00395-5

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10822-021-00395-5

Keywords

Navigation