Abstract
This work provides a curated database of experimental and calculated hydration free energies for small neutral molecules in water, along with molecular structures, input files, references, and annotations. We call this the Free Solvation Database, or FreeSolv. Experimental values were taken from prior literature and will continue to be curated, with updated experimental references and data added as they become available. Calculated values are based on alchemical free energy calculations using molecular dynamics simulations. These used the GAFF small molecule force field in TIP3P water with AM1-BCC charges. Values were calculated with the GROMACS simulation package, with full details given in references cited within the database itself. This database builds in part on a previous, 504-molecule database containing similar information. However, additional curation of both experimental data and calculated values has been done here, and the total number of molecules is now up to 643. Additional information is now included in the database, such as SMILES strings, PubChem compound IDs, accurate reference DOIs, and others. One version of the database is provided in the Supporting Information of this article, but as ongoing updates are envisioned, the database is now versioned and hosted online. In addition to providing the database, this work describes its construction process. The database is available free-of-charge via http://www.escholarship.org/uc/item/6sd403pz.
Similar content being viewed by others
Notes
Obtained using essentially the same protocols.
With one exception described below.
The author selected is usually one of those involved in running the calculations represented; for most of these sets, J. Peter Guthrie was key in determining the composition of the set.
It does contain a variety of carboxylic acids which would be expected to be charged in solution at neutral pH, but hydration free energies of these are typically reported for the neutral form of the molecule.
Tetrahydropyran numbering is used here.
Various groups used extremely long names and were abbreviated, while some other groups which were underrepresented were filtered out. We provide statistics only for groups occurring in more than 5 compounds, and we renamed “tertiary aliphatic amine (trialkylamine)” to “trialkylamine”, “halogen derivative” to “halogenated”, “tertiary aliphatic/aromatic amine (alkylarylamine)” to “alkylarylamine (3rd)”, “primary aliphatic amine (alkylamine)” to “alkyl amine”, “phenol or hydroxyhetarene” to “phenolic”, “secondary aliphatic/aromatic amine (alkylarylamine)” to “alkylarylamine (2nd)”, “secondary aliphatic amine (dialkylamine)” to “dialkylamine”, “orthocarboxylic acid derivative” to “ca-ortho”, and “carboxylic acid ester” to “ca-ester”.
As was the case when we examined the average error in our set by functional group, we simplified and shortened a variety of group names, as well as merging some groups and passing over others which contained too few or too many compounds. Specifically, every “carboxylic acid” was abbreviated “ca”, so “carboxylic acid amidine” became “ca-amidine”, etc. Other names were simplified to aid alphabetizing, such as “primary aliphatic amine (alkylamine)” being replaced by “amine, alkyl”, and similar changes for other alcohols and amines. “carbamic acid ester (urethane)” became “urethane”, and “halogen derivative” became “halogenated”. We otherwise retained only groups which occurred in at least 30 compounds in DrugBank, and passed over groups labeled “aromatic”, “heterocyclic”, “anion”, “cation”, and “alkene” because they tended to hit too many compounds or (in the case of “anion” and “cation”) were assigned in error. Other groups were merged to save space, either because they involved sub-categories (i.e. “carboxylic acid imide, N-unsubstituted” and “carboxylic acid imide, N-substituted” just became “carboxylic acid imide”) or to reduce the number of categories (“acetal” and “hemiacetal” became “acetal or hemiacetal”).
References
Aguilar B, Onufriev AV (2012) Efficient computation of the total solvation energy of small molecules via the R6 generalized Born Model. J Chem Theory Comput 8(7):2404–2411
Baron R, Setny P, McCammon JA (2012) Hydrophobic association and volume-confined water molecules. In: Gohlke H (ed) Protein-ligand interactions. Wiley, New Jersey
Case DA, Cheatham TE, Darden T, Gohlke H, Luo R, Merz KM Jr, Onufriev A, Simmerling C, Wang B, Woods RJ (2005) The Amber biomolecular simulation programs. J Comput Chem 26(16):1668–1688
ChemAxon: MarvinSketch (2013)
Chodera JD, Mobley DL, Shirts MR, Dixon RW, Branson K, Pande VS (2011) Alchemical free energy methods for drug discovery: progress and challenges. Curr Opin Struct Biol 21(2):150–160
Chorny I, Dill K, Jacobson MP (2005) Surfaces affect ion pairing. J Phys Chem B 109(50):24,056–24,060
Christ CD, Mark AE, van Gunsteren WF (2010) Basic ingredients of free energy calculations: a review. J Comput Chem 31(8):1569–1582
Corbeil CR, Sulea T, Purisima EO (2010) Rapid prediction of solvation free energy. 2. The first-shell hydration (FiSH) continuum model. J Chem Theory Comput 6(5):1622–1637
Fennel CJ, Bizjak A, Vlachy V (2009) Ion pairing in molecular simulations of aqueous alkali halide solutions. J Phys Chem B 113:6782–6791
Fennell CJ, Kehoe CW, Dill KA (2010) Oil/water transfer is partly driven by molecular shape, not just size. J Am Chem Soc 132(1):234–240
Fennell CJ, Kehoe CW, Dill KA (2011) Modeling aqueous solvation with semi-explicit assembly. Proc Natl Acad Sci 108(8):3234–3239
Fennell CJ, Wymer KL, Mobley DL (2014) A fixed-charge model for alcohol polarization in the condensed phase, and its role in small molecule hydration. J Phys Chem B
Gallicchio E, Paris K, Levy RM (2009) The AGBNP2 implicit solvation model. J Chem Theory Comput 5:2544–2564
Guthrie JP (2009) A blind challenge for computational solvation free energies: introduction and overview. J Phys Chem B 113(14):4501–4507
Haider N. Checkmol. merian.pch.univie.ac.at
Hine J, Mookerjee PK (1975) Structural effects on rates and equilibriums. XIX. Intrinsic hydrophilic character of organic compounds. Correlations in terms of structural contributions. J Org Chem 40(3):292–298
Jakalian A, Bush B, Jack D, Bayly CI (2000) Fast, efficient generation of high-quality atomic charges. AM1-BCC model: I. Method. J Comput Chem 21(2):132–146
Jakalian A, Jack D, Bayly CI (2002) Fast, efficient generation of high-quality atomic charges. AM1-BCC model: II. Parameterization and validation. J Comput Chem 23(16):1623–1641
Kastenholz M, Hünenberger P (2006) Computation of methodology-independent ionic solvation free energies from molecular simulations. I. The electrostatic potential in molecular liquids. J Chem Phys 124:124 106
Kastenholz M, Hünenberger P (2006) Computation of methodology-independent ionic solvation free energies from molecular simulations. II. The hydration free energy of the sodium cation. J Chem Phys 124:224 501
Kehoe CW, Fennell CJ, Dill KA (2012) Testing the semi-explicit assembly solvation model in the SAMPL3 community blind test. J Comput Aided Mol Des 26(5):563–568
Klimovich P, Mobley DL (2010) Predicting hydration free energies using all-atom molecular dynamics simulations and multiple starting conformations. J Comput Aided Mol Des 24(4):307–316
Knight JL, Brooks CL III (2009) Validating CHARMM parameters and exploring charge distribution rules in structure-based drug design. J Chem Theory Comput 5:1680–1691
Knight JL, Brooks CL III (2011) Surveying implicit solvent models for estimating small molecule absolute hydration free energies. J Comput Chem 32(13):2909–2923
Knight JL, Yesselman JD, Brooks CL III (2013) Assessing the quality of absolute hydration free energies among CHARMM-compatible ligand parameterization schemes. J Comput Chem 34(11):893–903
Knox C, Law V, Jewison T, Liu P, Ly, S, Frolkis A, Pon A, Banco K, Mak C, Neveu V, Djoumbou Y, Eisner R, Guo AC, Wishart DS (2011) DrugBank 3.0: a comprehensive resource for ’omics’ research on drugs. Nucleic Acids Res 39(Database issue), D1035–41
Li L, Fennell CJ, Dill KA (2013) Field-SEA: a model for computing the solvation free energies of nonpolar, polar, and charged solutes in water. J Phys Chem B p. 131213113930002
Liu Y, Fu J, Wu J (2013) High-throughput prediction of the hydration free energies of small molecules from a classical density functional theory. J Phys Chem Lett 4(21):3687–3691
Michel J, Essex JW (2010) Prediction of protein-ligand binding affinity by free energy simulations: assumptions, pitfalls and expectations. J Comput Aided Mol Des 24:649–658
Mobley DL, Barber AE II, Fennell CJ, Dill KA (2008) Charge asymmetries in hydration of polar solutes. J Phys Chem B 112:2404–2414
Mobley DL, Bayly CI, Cooper MD, Dill KA, Dill KA (2009) Predictions of hydration free energies from all-atom molecular dynamics simulations. J Phys Chem B 113:4533–4537
Mobley DL, Bayly CI, Cooper MD, Shirts MR, Dill KA (2009) Small molecule hydration free energies in explicit solvent: an extensive test of fixed-charge atomistic simulations. J Chem Theory Comput 5(2):350–358
Mobley DL, Dill K, Chodera JD (2008) Treating entropy and conformational changes in implicit solvent simulations of small molecules. J Phys Chem B 112(3):938
Mobley DL, Dumont É, Chodera JD, Dill K (2007) Comparison of charge models for fixed-charge force fields: small-molecule hydration free energies in explicit solvent. J Phys Chem B 111(9):2242–2254
Mobley DL, Liu S, Cerutti DS, Swope WC, Rice JE (2012) Alchemical prediction of hydration free energies for SAMPL. J Comput Aided Mol Des 26(5):551–562
Mobley DL, Wymer KL, Lim NM (2014) Blind prediction of solvation free energies from the SAMPL4 challenge. J Comput Aided Mol Des 28:135
Mukhopadhyay A, Fenley AT, Tolokh IS, Onufriev AV (2012) Charge hydration asymmetry: the basic principle and how to use it to test and improve water models. J Phys Chem B 116(32):9776–9783
Nerenberg PS, Jo B, So C, Tripathy A, Head-Gordon T (2012) Optimizing solute–water van der waals interactions to reproduce solvation free energies. J Phys Chem B p. 120406101304000
Nicholls A, Mobley DL, Guthrie JP, Chodera JD, Bayly CI, Cooper MD, Pande VS (2008) Predicting small-molecule solvation free energies: an informal blind test for computational chemistry. J Med Chem 51(4):769–779
Nicholls A, Wlodek S, Grant JA (2009) The SAMP1 solvation challenge: further lessons regarding the pitfalls of parametrization. J Phys Chem B 113:4521–4532
OpenEye Scientific Software (2013) OpenEye Python Toolkits
Ponder JW, Wu C, Ren P, Pande VS, Chodera JD, Schneiders MJ, Haque I, Mobley DL, Lambrecht DS, DiStasio RA Jr, Head-Gordon M, Clark GNI, Johnson ME, Head-Gordon T (2010) Current status of the AMOEBA polarizable force field. J Phys Chem 114:2549
Ren P, Chun J, Thomas DG, Schnieders MJ, Marucho M, Zhang J, Baker NA (2012) Biomolecular electrostatics and solvation: a computational perspective. Q Rev Biophys 45(04):427–491
Rizzo RC (2013) Hexafluoropropene correction (2013). Personal communication
Rizzo RC, Aynechi T, Case DA, Kuntz ID (2006) Estimation of absolute free energies of hydration using continuum methods: accuracy of partial charge models and optimization of nonpolar contributions. J Chem Theory Comput 2(1):128–139
Rocklin GJ, Mobley DL, Dill KA, Hünenberger PH (2013) Calculating the binding free energies of charged species based on explicit-solvent simulations employing lattice-sum methods: an accurate correction scheme for electrostatic finite-size effects. J Chem Phys 139(18):184 103
Schmuckler ME, Barefoot AC, Kleier DA, Cobranchi DP (2003) Vapor pressures of sulfonylurea herbicides. Pest Manag Sci 56(6):521–532
Shirts MR, Mobley DL (2013) An Introduction to Best Practices in Free Energy Calculations. In: Biomolecular Simulations. Methods in Molecular Biology
Shirts MR, Pitera JW, Swope WC, Pande VS (2003) Extremely precise free energy calculations of amino acid side chain analogs: comparison of common molecular mechanics force fields for proteins. J Chem Phys 119(11):5740–5761
Shivakumar D, Deng Y, Roux B (2009) Computations of absolute solvation free energies of small molecules using explicit and implicit solvent model. J Chem Theory Comput 5(4):919–930
Shivakumar D, Harder E, Damm W, Friesner RA, Sherman W (2012) Improving the prediction of absolute solvation free energies using the next generation OPLS force field. J Chem Theory Comput 8(8):2553–2558
Shivakumar D, Williams J, Wu Y, Damm W, Shelley J, Sherman W (2010) Prediction of absolute solvation free energies using molecular dynamics free energy perturbation and the OPLS force field. J Chem Theory Comput 6(5):1509–1519
da Silva Sousa AW, Vranken WF (2012) ACPYPE— AnteChamber PYthon Parser interfacE. BMC Res Notes 5(1):367
van der Spoel D, Lindahl E, Hess B, Groenhof G, Mark AE, Berendsen HJC (2005) GROMACS: fast, flexible, and free. J Comput Chem 26:1701–1718
Sulea T, Corbeil CR, Purisima EO (2010) Rapid prediction of solvation free energy. 1. An extensive test of linear interaction energy (LIE). J Chem Theory Comput 6(5):1608–1621
Swain M (2013) PubChemPy, https://pypi.python.org/pypi/PubChemPy/1.0
Truchon JF, Pettitt BM, Labute P (2014) A cavity corrected 3D-RISM functional for accurate solvation free energies. J Chem Theory Comput. p. 140114120800002
Wang J, Wang W, Kollman P, Case DA (2006) Automatic atom type and bond type perception in molecular mechanical calculations. J Mol Graph Model 25:247–260
Wang J, Wolf R, Caldwell J, Kollman P, Case D (2004) Development and testing of a general amber force field. J Comput Chem 25(9):1157–1174
Zheng L, Yang W (2012) Practically efficient and robust free energy calculations: double-integration orthogonal space tempering. J Chem Theory Comput 8:810–823
Acknowledgments
We thank Robert C. Rizzo (Stony Brook University) for help tracking down an issue with hexafluoropropene, and many others who have been involved in work on the experimental and calculated values represented in this database, including Élise Dumont, John D. Chodera, Ken A. Dill, Alan E. Barber, II, Anthony Nicholls, Christopher I. Bayly, Matthew D. Cooper, Vijay S. Pande, Michael R. Shirts, Pavel V. Klimovich, Shuai Liu, David S. Cerutti, William C. Swope, Julia E. Rice, Christopher J. Fennell, Nathan M. Lim, and Karisa L. Wymer. We also appreciate work done by Karisa Wymer and Jessica Fuselier towards initial curation of the set. DLM appreciates financial support from the National Institutes of Health (1R15GM096257-01A1), and computing support from the UCI GreenPlanet cluster, supported in part by NSF Grant CHE-0840513.
Author information
Authors and Affiliations
Corresponding author
Electronic supplementary material
Below is the link to the electronic supplementary material.
Rights and permissions
About this article
Cite this article
Mobley, D.L., Guthrie, J.P. FreeSolv: a database of experimental and calculated hydration free energies, with input files. J Comput Aided Mol Des 28, 711–720 (2014). https://doi.org/10.1007/s10822-014-9747-x
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10822-014-9747-x