Skip to main content
Log in

FreeSolv: a database of experimental and calculated hydration free energies, with input files

  • Published:
Journal of Computer-Aided Molecular Design Aims and scope Submit manuscript

Abstract

This work provides a curated database of experimental and calculated hydration free energies for small neutral molecules in water, along with molecular structures, input files, references, and annotations. We call this the Free Solvation Database, or FreeSolv. Experimental values were taken from prior literature and will continue to be curated, with updated experimental references and data added as they become available. Calculated values are based on alchemical free energy calculations using molecular dynamics simulations. These used the GAFF small molecule force field in TIP3P water with AM1-BCC charges. Values were calculated with the GROMACS simulation package, with full details given in references cited within the database itself. This database builds in part on a previous, 504-molecule database containing similar information. However, additional curation of both experimental data and calculated values has been done here, and the total number of molecules is now up to 643. Additional information is now included in the database, such as SMILES strings, PubChem compound IDs, accurate reference DOIs, and others. One version of the database is provided in the Supporting Information of this article, but as ongoing updates are envisioned, the database is now versioned and hosted online. In addition to providing the database, this work describes its construction process. The database is available free-of-charge via http://www.escholarship.org/uc/item/6sd403pz.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5

Similar content being viewed by others

Notes

  1. Obtained using essentially the same protocols.

  2. With one exception described below.

  3. The author selected is usually one of those involved in running the calculations represented; for most of these sets, J. Peter Guthrie was key in determining the composition of the set.

  4. It does contain a variety of carboxylic acids which would be expected to be charged in solution at neutral pH, but hydration free energies of these are typically reported for the neutral form of the molecule.

  5. Tetrahydropyran numbering is used here.

  6. Various groups used extremely long names and were abbreviated, while some other groups which were underrepresented were filtered out. We provide statistics only for groups occurring in more than 5 compounds, and we renamed “tertiary aliphatic amine (trialkylamine)” to “trialkylamine”, “halogen derivative” to “halogenated”, “tertiary aliphatic/aromatic amine (alkylarylamine)” to “alkylarylamine (3rd)”, “primary aliphatic amine (alkylamine)” to “alkyl amine”, “phenol or hydroxyhetarene” to “phenolic”, “secondary aliphatic/aromatic amine (alkylarylamine)” to “alkylarylamine (2nd)”, “secondary aliphatic amine (dialkylamine)” to “dialkylamine”, “orthocarboxylic acid derivative” to “ca-ortho”, and “carboxylic acid ester” to “ca-ester”.

  7. As was the case when we examined the average error in our set by functional group, we simplified and shortened a variety of group names, as well as merging some groups and passing over others which contained too few or too many compounds. Specifically, every “carboxylic acid” was abbreviated “ca”, so “carboxylic acid amidine” became “ca-amidine”, etc. Other names were simplified to aid alphabetizing, such as “primary aliphatic amine (alkylamine)” being replaced by “amine, alkyl”, and similar changes for other alcohols and amines. “carbamic acid ester (urethane)” became “urethane”, and “halogen derivative” became “halogenated”. We otherwise retained only groups which occurred in at least 30 compounds in DrugBank, and passed over groups labeled “aromatic”, “heterocyclic”, “anion”, “cation”, and “alkene” because they tended to hit too many compounds or (in the case of “anion” and “cation”) were assigned in error. Other groups were merged to save space, either because they involved sub-categories (i.e. “carboxylic acid imide, N-unsubstituted” and “carboxylic acid imide, N-substituted” just became “carboxylic acid imide”) or to reduce the number of categories (“acetal” and “hemiacetal” became “acetal or hemiacetal”).

References

  1. Aguilar B, Onufriev AV (2012) Efficient computation of the total solvation energy of small molecules via the R6 generalized Born Model. J Chem Theory Comput 8(7):2404–2411

    Article  CAS  Google Scholar 

  2. Baron R, Setny P, McCammon JA (2012) Hydrophobic association and volume-confined water molecules. In: Gohlke H (ed) Protein-ligand interactions. Wiley, New Jersey

    Google Scholar 

  3. Case DA, Cheatham TE, Darden T, Gohlke H, Luo R, Merz KM Jr, Onufriev A, Simmerling C, Wang B, Woods RJ (2005) The Amber biomolecular simulation programs. J Comput Chem 26(16):1668–1688

    Article  CAS  Google Scholar 

  4. ChemAxon: MarvinSketch (2013)

  5. Chodera JD, Mobley DL, Shirts MR, Dixon RW, Branson K, Pande VS (2011) Alchemical free energy methods for drug discovery: progress and challenges. Curr Opin Struct Biol 21(2):150–160

    Article  CAS  Google Scholar 

  6. Chorny I, Dill K, Jacobson MP (2005) Surfaces affect ion pairing. J Phys Chem B 109(50):24,056–24,060

    Article  CAS  Google Scholar 

  7. Christ CD, Mark AE, van Gunsteren WF (2010) Basic ingredients of free energy calculations: a review. J Comput Chem 31(8):1569–1582

    CAS  Google Scholar 

  8. Corbeil CR, Sulea T, Purisima EO (2010) Rapid prediction of solvation free energy. 2. The first-shell hydration (FiSH) continuum model. J Chem Theory Comput 6(5):1622–1637

    Article  CAS  Google Scholar 

  9. Fennel CJ, Bizjak A, Vlachy V (2009) Ion pairing in molecular simulations of aqueous alkali halide solutions. J Phys Chem B 113:6782–6791

    Article  Google Scholar 

  10. Fennell CJ, Kehoe CW, Dill KA (2010) Oil/water transfer is partly driven by molecular shape, not just size. J Am Chem Soc 132(1):234–240

    Article  CAS  Google Scholar 

  11. Fennell CJ, Kehoe CW, Dill KA (2011) Modeling aqueous solvation with semi-explicit assembly. Proc Natl Acad Sci 108(8):3234–3239

    Article  CAS  Google Scholar 

  12. Fennell CJ, Wymer KL, Mobley DL (2014) A fixed-charge model for alcohol polarization in the condensed phase, and its role in small molecule hydration. J Phys Chem B

  13. Gallicchio E, Paris K, Levy RM (2009) The AGBNP2 implicit solvation model. J Chem Theory Comput 5:2544–2564

    Article  CAS  Google Scholar 

  14. Guthrie JP (2009) A blind challenge for computational solvation free energies: introduction and overview. J Phys Chem B 113(14):4501–4507

    Article  CAS  Google Scholar 

  15. Haider N. Checkmol. merian.pch.univie.ac.at

  16. Hine J, Mookerjee PK (1975) Structural effects on rates and equilibriums. XIX. Intrinsic hydrophilic character of organic compounds. Correlations in terms of structural contributions. J Org Chem 40(3):292–298

    Article  CAS  Google Scholar 

  17. Jakalian A, Bush B, Jack D, Bayly CI (2000) Fast, efficient generation of high-quality atomic charges. AM1-BCC model: I. Method. J Comput Chem 21(2):132–146

    Article  CAS  Google Scholar 

  18. Jakalian A, Jack D, Bayly CI (2002) Fast, efficient generation of high-quality atomic charges. AM1-BCC model: II. Parameterization and validation. J Comput Chem 23(16):1623–1641

    Article  CAS  Google Scholar 

  19. Kastenholz M, Hünenberger P (2006) Computation of methodology-independent ionic solvation free energies from molecular simulations. I. The electrostatic potential in molecular liquids. J Chem Phys 124:124 106

    Google Scholar 

  20. Kastenholz M, Hünenberger P (2006) Computation of methodology-independent ionic solvation free energies from molecular simulations. II. The hydration free energy of the sodium cation. J Chem Phys 124:224 501

    Google Scholar 

  21. Kehoe CW, Fennell CJ, Dill KA (2012) Testing the semi-explicit assembly solvation model in the SAMPL3 community blind test. J Comput Aided Mol Des 26(5):563–568

    Article  CAS  Google Scholar 

  22. Klimovich P, Mobley DL (2010) Predicting hydration free energies using all-atom molecular dynamics simulations and multiple starting conformations. J Comput Aided Mol Des 24(4):307–316

    Article  CAS  Google Scholar 

  23. Knight JL, Brooks CL III (2009) Validating CHARMM parameters and exploring charge distribution rules in structure-based drug design. J Chem Theory Comput 5:1680–1691

    Article  CAS  Google Scholar 

  24. Knight JL, Brooks CL III (2011) Surveying implicit solvent models for estimating small molecule absolute hydration free energies. J Comput Chem 32(13):2909–2923

    Article  CAS  Google Scholar 

  25. Knight JL, Yesselman JD, Brooks CL III (2013) Assessing the quality of absolute hydration free energies among CHARMM-compatible ligand parameterization schemes. J Comput Chem 34(11):893–903

    Article  CAS  Google Scholar 

  26. Knox C, Law V, Jewison T, Liu P, Ly, S, Frolkis A, Pon A, Banco K, Mak C, Neveu V, Djoumbou Y, Eisner R, Guo AC, Wishart DS (2011) DrugBank 3.0: a comprehensive resource for ’omics’ research on drugs. Nucleic Acids Res 39(Database issue), D1035–41

  27. Li L, Fennell CJ, Dill KA (2013) Field-SEA: a model for computing the solvation free energies of nonpolar, polar, and charged solutes in water. J Phys Chem B p. 131213113930002

  28. Liu Y, Fu J, Wu J (2013) High-throughput prediction of the hydration free energies of small molecules from a classical density functional theory. J Phys Chem Lett 4(21):3687–3691

    Article  CAS  Google Scholar 

  29. Michel J, Essex JW (2010) Prediction of protein-ligand binding affinity by free energy simulations: assumptions, pitfalls and expectations. J Comput Aided Mol Des 24:649–658

    Article  Google Scholar 

  30. Mobley DL, Barber AE II, Fennell CJ, Dill KA (2008) Charge asymmetries in hydration of polar solutes. J Phys Chem B 112:2404–2414

    Google Scholar 

  31. Mobley DL, Bayly CI, Cooper MD, Dill KA, Dill KA (2009) Predictions of hydration free energies from all-atom molecular dynamics simulations. J Phys Chem B 113:4533–4537

    Article  CAS  Google Scholar 

  32. Mobley DL, Bayly CI, Cooper MD, Shirts MR, Dill KA (2009) Small molecule hydration free energies in explicit solvent: an extensive test of fixed-charge atomistic simulations. J Chem Theory Comput 5(2):350–358

    Article  CAS  Google Scholar 

  33. Mobley DL, Dill K, Chodera JD (2008) Treating entropy and conformational changes in implicit solvent simulations of small molecules. J Phys Chem B 112(3):938

    Article  CAS  Google Scholar 

  34. Mobley DL, Dumont É, Chodera JD, Dill K (2007) Comparison of charge models for fixed-charge force fields: small-molecule hydration free energies in explicit solvent. J Phys Chem B 111(9):2242–2254

    Article  CAS  Google Scholar 

  35. Mobley DL, Liu S, Cerutti DS, Swope WC, Rice JE (2012) Alchemical prediction of hydration free energies for SAMPL. J Comput Aided Mol Des 26(5):551–562

    Article  CAS  Google Scholar 

  36. Mobley DL, Wymer KL, Lim NM (2014) Blind prediction of solvation free energies from the SAMPL4 challenge. J Comput Aided Mol Des 28:135

    Article  CAS  Google Scholar 

  37. Mukhopadhyay A, Fenley AT, Tolokh IS, Onufriev AV (2012) Charge hydration asymmetry: the basic principle and how to use it to test and improve water models. J Phys Chem B 116(32):9776–9783

    Article  CAS  Google Scholar 

  38. Nerenberg PS, Jo B, So C, Tripathy A, Head-Gordon T (2012) Optimizing solute–water van der waals interactions to reproduce solvation free energies. J Phys Chem B p. 120406101304000

  39. Nicholls A, Mobley DL, Guthrie JP, Chodera JD, Bayly CI, Cooper MD, Pande VS (2008) Predicting small-molecule solvation free energies: an informal blind test for computational chemistry. J Med Chem 51(4):769–779

    Article  CAS  Google Scholar 

  40. Nicholls A, Wlodek S, Grant JA (2009) The SAMP1 solvation challenge: further lessons regarding the pitfalls of parametrization. J Phys Chem B 113:4521–4532

    Article  CAS  Google Scholar 

  41. OpenEye Scientific Software (2013) OpenEye Python Toolkits

  42. Ponder JW, Wu C, Ren P, Pande VS, Chodera JD, Schneiders MJ, Haque I, Mobley DL, Lambrecht DS, DiStasio RA Jr, Head-Gordon M, Clark GNI, Johnson ME, Head-Gordon T (2010) Current status of the AMOEBA polarizable force field. J Phys Chem 114:2549

    Article  CAS  Google Scholar 

  43. Ren P, Chun J, Thomas DG, Schnieders MJ, Marucho M, Zhang J, Baker NA (2012) Biomolecular electrostatics and solvation: a computational perspective. Q Rev Biophys 45(04):427–491

    Article  Google Scholar 

  44. Rizzo RC (2013) Hexafluoropropene correction (2013). Personal communication

  45. Rizzo RC, Aynechi T, Case DA, Kuntz ID (2006) Estimation of absolute free energies of hydration using continuum methods: accuracy of partial charge models and optimization of nonpolar contributions. J Chem Theory Comput 2(1):128–139

    Article  CAS  Google Scholar 

  46. Rocklin GJ, Mobley DL, Dill KA, Hünenberger PH (2013) Calculating the binding free energies of charged species based on explicit-solvent simulations employing lattice-sum methods: an accurate correction scheme for electrostatic finite-size effects. J Chem Phys 139(18):184 103

    Article  Google Scholar 

  47. Schmuckler ME, Barefoot AC, Kleier DA, Cobranchi DP (2003) Vapor pressures of sulfonylurea herbicides. Pest Manag Sci 56(6):521–532

    Article  Google Scholar 

  48. Shirts MR, Mobley DL (2013) An Introduction to Best Practices in Free Energy Calculations. In: Biomolecular Simulations. Methods in Molecular Biology

  49. Shirts MR, Pitera JW, Swope WC, Pande VS (2003) Extremely precise free energy calculations of amino acid side chain analogs: comparison of common molecular mechanics force fields for proteins. J Chem Phys 119(11):5740–5761

    Article  CAS  Google Scholar 

  50. Shivakumar D, Deng Y, Roux B (2009) Computations of absolute solvation free energies of small molecules using explicit and implicit solvent model. J Chem Theory Comput 5(4):919–930

    Article  CAS  Google Scholar 

  51. Shivakumar D, Harder E, Damm W, Friesner RA, Sherman W (2012) Improving the prediction of absolute solvation free energies using the next generation OPLS force field. J Chem Theory Comput 8(8):2553–2558

    Article  CAS  Google Scholar 

  52. Shivakumar D, Williams J, Wu Y, Damm W, Shelley J, Sherman W (2010) Prediction of absolute solvation free energies using molecular dynamics free energy perturbation and the OPLS force field. J Chem Theory Comput 6(5):1509–1519

    Article  CAS  Google Scholar 

  53. da Silva Sousa AW, Vranken WF (2012) ACPYPE— AnteChamber PYthon Parser interfacE. BMC Res Notes 5(1):367

    Article  Google Scholar 

  54. van der Spoel D, Lindahl E, Hess B, Groenhof G, Mark AE, Berendsen HJC (2005) GROMACS: fast, flexible, and free. J Comput Chem 26:1701–1718

    Article  Google Scholar 

  55. Sulea T, Corbeil CR, Purisima EO (2010) Rapid prediction of solvation free energy. 1. An extensive test of linear interaction energy (LIE). J Chem Theory Comput 6(5):1608–1621

    Article  CAS  Google Scholar 

  56. Swain M (2013) PubChemPy, https://pypi.python.org/pypi/PubChemPy/1.0

  57. Truchon JF, Pettitt BM, Labute P (2014) A cavity corrected 3D-RISM functional for accurate solvation free energies. J Chem Theory Comput. p. 140114120800002

  58. Wang J, Wang W, Kollman P, Case DA (2006) Automatic atom type and bond type perception in molecular mechanical calculations. J Mol Graph Model 25:247–260

    Article  Google Scholar 

  59. Wang J, Wolf R, Caldwell J, Kollman P, Case D (2004) Development and testing of a general amber force field. J Comput Chem 25(9):1157–1174

    Article  CAS  Google Scholar 

  60. Zheng L, Yang W (2012) Practically efficient and robust free energy calculations: double-integration orthogonal space tempering. J Chem Theory Comput 8:810–823

    Article  CAS  Google Scholar 

Download references

Acknowledgments

We thank Robert C. Rizzo (Stony Brook University) for help tracking down an issue with hexafluoropropene, and many others who have been involved in work on the experimental and calculated values represented in this database, including Élise Dumont, John D. Chodera, Ken A. Dill, Alan E. Barber, II, Anthony Nicholls, Christopher I. Bayly, Matthew D. Cooper, Vijay S. Pande, Michael R. Shirts, Pavel V. Klimovich, Shuai Liu, David S. Cerutti, William C. Swope, Julia E. Rice, Christopher J. Fennell, Nathan M. Lim, and Karisa L. Wymer. We also appreciate work done by Karisa Wymer and Jessica Fuselier towards initial curation of the set. DLM appreciates financial support from the National Institutes of Health (1R15GM096257-01A1), and computing support from the UCI GreenPlanet cluster, supported in part by NSF Grant CHE-0840513.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to David L. Mobley.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary material 1 (pdf 102 KB)

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Mobley, D.L., Guthrie, J.P. FreeSolv: a database of experimental and calculated hydration free energies, with input files. J Comput Aided Mol Des 28, 711–720 (2014). https://doi.org/10.1007/s10822-014-9747-x

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10822-014-9747-x

Keywords

Navigation