1 Introduction

Diffusion coefficients of solutes in water and in organic solvents play a very important role in chemical engineering [1], environmental chemistry [2] and in biological chemistry [3]. Therefore it is not surprising that there have been numerous studies on the correlation and prediction of limiting diffusion coefficients, mostly diffusion coefficients in water but also in organic solvents [4]. Many of the older studies on diffusion coefficients have involved a restricted number of solutes, but recently there have been a number of studies which dealt with much larger numbers of solutes. Gharagheizi and Sattari [5] used a data base of diffusion coefficients in water for 300 solutes. They calculated 1664 descriptors for each solute from the Dragon software [6], and reduced these to the best six descriptors to use in a linear equation. Garagheizi et al. [7] later used a huge data base of 4852 solutes to develop a predictive method based on 148 functional groups. Unfortunately, although they give the predicted diffusion coefficients for all 4852 solutes, the authors do not give any actual experimental values. Garagheizi [8] later used the 4852 data base to obtain a linear three parameter equation using descriptors calculated from the Dragon software program.

Use of extensive data bases has not been confined to diffusion coefficients in water. Magalhaes et al. [9] studied 3463 values of diffusion coefficients in water and in organic solvents. However, since 18 solvents were examined, the number of solutes per solvent is not that large. Li et al. [10] used a linear free energy relationship based on the Abraham equation [11], to correlate 743 diffusion coefficients in 25 solvents; again, although a large number of data points was used, the number per solvent is still rather limited.

All these recent studies involve only nonelectrolytes as solutes. Ions and ionic species are excluded. Indeed it seems impossible for methods based on the calculation of Dragon descriptors or on functional group analysis to be extended to ionic molecules. The only study of limiting diffusion coefficients that included ions and ionic molecules has been our own work on solutes in water [12]. It is the aim of the present work to enlarge the data base of ionic and nonelectrolyte solutes in water, and to extend our analysis to as many organic solvents for which data on nonelectrolytes, ions and ionic species are available.

2 Methods

Our general method is based on the linear free energy equation, LFER, Eq. 1.

$$\log_{10} \left( {D^\text{{o}} } \right) = c + e \cdot \varvec{E} + s \cdot \varvec{S} + a \cdot \varvec{A} + b \cdot \varvec{B} + v \cdot \varvec{V} + j^{ + } \cdot \varvec{J}^{ + } + j^{ - } \cdot \varvec{J}^{ - }$$
(1)

The dependent variable is log10 (Do), where Do is the diffusion coefficient at infinite dilution at 298 K for a series of solutes in a given solvent. The independent variables in Eq. 1 (E, S, A, B, V, J+ and J) are properties or descriptors of the solutes and the coefficients (c, e, s, a, b, v, j+ and j) are the complementary properties of the solvent system. Solute and solvent properties have been described in detail in earlier publications [13,14,15,16,17,18,19,20,21], there are several reviews on our method [22,23,24], and a number of applications [25,26,27] and so only an outline will be given here. E is the solute excess molar refraction in cm3·mol−1/10, S is a measure of the solute polarity/polarizability, A and B refer to the hydrogen bond acidity and hydrogen bond basicity of the solute, and V is McGowan’s characteristic molecular volume of the solute in cm3·mol−1/100 [28]. J+ and J are properties of positively charged and negatively charged solutes. Note that for positively charged solutes J = 0, for negatively charges solutes J+ = 0, and for nonelectrolytes J+ =J = 0. For the specific case of zwitterionic aminoacid solutes both J+ and J are necessary to characterize the solutes [29].

3 Results

Diffusion coefficients for ions and ionic species in organic solvents were obtained from data on single-ion conductances \(\varLambda_{\text{i}}^{\text{o}}\) through the Nernst–Einstein equation, Eq. 2, where F is the Faraday constant. We give diffusion coefficients for ions and nonelectrolytes in units of 10−5 cm2·s−1 at 298 K.

$$D_{i}^{\text{o}} = \frac{\text{RT}}{{F^{2} }}\varLambda_{i}^{\text{o}}$$
(2)

3.1 Water

Data for ions and ionic species were those that we used before [12]. For nonelectrolytes we supplemented our previous data with a considerable number of extra values of log10 (Do) [2, 5, 7, 30,31,32,33,34], leading to a total of 377 values, including ions, ionic species, zwitterionic amino acids and neutral nonelectrolytes. The list of solutes, diffusion coefficients and solute descriptors are given in Table S1 of the Supplementary Material. We found that polyaromatic hydrocarbons with three or more condensed rings were systematically out of line, and we included an indicator variable for these solutes, Ipha, that has the value Ipha= 1 for the condensed ring compounds and Ipha= 0 for all other solutes. Our final equation is

$$\log_{10} \left( {D^{\text{o}} } \right) = 0.1803 - 0.0119\,\varvec{E} + 0.0319\,\varvec{S} + 0.0104\,\varvec{A} - 0.0362 - 0.2519\,\varvec{V} - 0.0148\,\varvec{J}^{ + } + 0.0205\,\varvec{J}^{ - } - 0.0798\,I_{\text{pha}}$$
(3)

N = 377, SD = 0.0474, R2 = 0.840, F = 241.20, PRESS = 0.91821, Q2 = 0.823, and PSD = 0.0499.

Here and elsewhere, N is the number of data points (solutes), SD is the standard deviation, R is the equation correlation coefficient, F is the Fisher F-statistic and PRESS and Q2 are the leave-one-out statistics. We use the predictive standard deviation, PSD, as a measure of the predictive capability of the equation [35] rather than the method of division into test and training sets, because PSD is based on the leave-one-out statistics, it takes into account the number of data points and the number of descriptors and it uses data on all the data points. The high value of the leave-one-out Q2 shows that the equation is statistically sound.

3.2 Ethanol

We obtained diffusion coefficients for ions from conductance data through Eq. 2 [36,37,38]. We took our previous data on nonelectrolytes [12] and added further data [39,40,41,42] to give a total of 96 values for ions plus nonelectrolytes. The resulting equation for log10 (Do) is Eq. 4. The term in Ipha was not significant. Both the value of Q2 and PSD show that the equation is statistically sound. The list of solutes, diffusion coefficients and solute descriptors are given in Table S2.

$$\log_{10} \left( {D^{\text{o}} } \right) = 0.3710 - 0.1058\,\varvec{E} + 0.1369\,{\text{S}} - 0.3189\,\varvec{A} - 0.1629 - 0.2215\,\varvec{V} + 0.4084\,\varvec{J}^{ + } + 0.1096\,\varvec{J}^{ - }$$
(4)

N = 96, SD = 0.0666, R2 = 0.932, F = 169.17, PRESS = 0.58308, Q2 = 0.897, and PSD = 0.0823.

3.3 Methanol

There are numerous studies on single ion conductance in methanol (43,44,45,46,47, 48) from which we derived corresponding diffusion coefficients. Diffusion coefficients of nonelectrolytes have been well studied (4, 10, 34, 39,40,41,42, 49,50,51,52,53) and we collected data on a total of 96 solutes, ionic plus nonelectrolytes, that yielded Eq. 5. The list of solutes, diffusion coefficients and solute descriptors are given in Table S3.

$$\log_{10} \left( {D^{\text{o}} } \right) = 0.4601 + 0.0197\,\varvec{E} - 0.0020\,\varvec{S} - 0.2403\,\varvec{A} - 0.0463 - 0.1377\,\varvec{V} + 0.0252\,\varvec{J}^{ + } - 0.0788\,\varvec{J}^{ - }$$
(5)

N = 96, SD = 0.0332, R2 = 0.915, F = 135.98, PRESS = 0.1232, Q2 = 0.893, and PSD = 0.0374.

3.4 Propan-1-ol

Diffusion coefficients for ions were obtained from conductance data through Eq. 2 [30, 37, 43, 45]. There is surprisingly little data on diffusion coefficients of nonelectrolytes [4, 34], not enough to enable us to obtain a robust equation based on Eq. 1. However, there are considerable data on nonelectrolytes in octanol [3, 49, 50] as well as in methanol and ethanol (see above). For any given solute, a plot of log10 (Do) against the number of carbon atoms in the alcohol is almost linear. Taking into account the slight nonlinearity, it is then quite easy to interpolate values of log10 (Do) to obtain estimates in propan-1-ol (and in butan-1-ol). We obtained a total of 73 values of log10 (Do) for ions and nonelectrolytes that yielded Eq. 6. The list of solutes, diffusion coefficients and solute descriptors are given in Table S4.

$$\log_{10} \left( {D^{\text{o}} } \right) = 0.1263 - 0.0353\,\varvec{E} - 0.0144\,\varvec{S} - 0.4433\,\varvec{A} - 0.1063 - 0.0938\,\varvec{V} - 0.1097\,\varvec{J}^{ + } - 0.2016\,\varvec{J}^{ - }$$
(6)

N = 71, SD = 0.0487, R2 = 0.948, F = 164.81, PRESS = 0.20879, Q2 = 0.928, and PSD = 0.0576.

3.5 Butan-1-ol

Data are available for ions [30, 45], but, again, there is a lack of data on nonelectrolytes [4, 10, 39, 40, 42, 47]. We adopted the same stratagem as for propan-1-ol solvent and obtained data on a total of 53 solutes, leading to Eq. 7. The list of solutes, diffusion coefficients and solute descriptors are given in Table S5.

$$\log_{10} \left( {D^{\text{o}} } \right) = 0.0608 - 0.0036\,\varvec{E} - 0.0316\,\varvec{S} - 0.4993\,\varvec{A} - 0.0461 - 0.1822\,\varvec{V} - 0.1777\,\varvec{J}^{ + } - 0.2422\,\varvec{J}^{ - }$$
(7)

N = 53, SD = 0.0600, R2 = 0.926, F = 80.29, PRESS = 0.32266, Q2 = 0.852, and PSD = 0.0846.

3.6 Octan-1-ol

Diffusion coefficients for ions are available from conductance data [46, 54] and there is just enough data on diffusion coefficients for nonelectrolytes [3, 10, 54,55,56,57] to be able to construct Eq. 8 for a total of 61 solutes. The list of solutes, diffusion coefficients and solute descriptors are given in Table S6.

$$\log_{10} \left( {D^{\text{o}} } \right) = -\,0.1233 - 0.0382\,\varvec{E} - 0.0440\,\varvec{S} - 0.5710\,\varvec{A} - 0.2862 - 0.2378\,\varvec{V} - 0.8020\,\varvec{J}^{ + } - 0.2899\,\varvec{J}^{ - }$$
(8)

N = 61, SD = 0.0684, R2 = 0.949, F = 141.41, PRESS = 0.3327, Q2 = 0.932, and PSD = 0.0792.

3.7 Propanone

There is a reasonable amount of data on the conductance of ions in propanone [46, 58], and several studies on the diffusion of nonelectrolytes [10, 50, 59,60,61,62,63]. We did not use the results of Hejtmanek and Schnelder [63] on crown ethers that were completely out of line. Interestingly, these experimental results were not well reproduced by any of the predictive methods used [63]. We compiled a total of 86 values of log10 (Do) including ions and nonelectrolytes, that yielded Eq. 9. The list of solutes, diffusion coefficients and solute descriptors are given in Table S7.

$$\log_{10} \left( {D^{\text{o}} } \right) = 0.6593 - 0.0221\,\varvec{E} - 0.0162\,\varvec{S} - 0.1718\,\varvec{A} - 0.0031 - 0.1102\,\varvec{V} - 0.0167\,\varvec{J}^{ + } - 0.0631\,\varvec{J}^{ - }$$
(9)

N = 86, SD = 0.0366, R2 = 0.837, F = 57.37, PRESS = 0.14773, Q2 = 0.767, and PSD = 0.0435.

3.8 Acetonitrile

There have been a several studies on conductance of ions in acetonitrile solvent [44, 46, 58, 64,65,66,67] so that we were able to obtain log10 (Do) for 18 ions. There are also a considerable number of studies on nonelectrolytes [10, 40, 68,69,70,71,72,73,74] so that we had a combined total of 74 log10 (Do) values for ions and nonelectrolytes. This led to Eq. 10. The list of solutes, diffusion coefficients and solute descriptors are given in Table S8.

$$\log_{10} \left( {D^{\text{o}} } \right) = 0.6612 - 0.0283\,\varvec{E} - 0.0126\,\varvec{S} - 0.2311\,\varvec{A} - 0.0511 - 0.1399\,\varvec{V} + 0.034\,\varvec{J}^{ + } - 0.0287\,\varvec{J}^{ - }$$
(10)

N = 74, SD = 0.0438, R2 = 0.886, F = 73.53, PRESS = 0.15983, Q2 = 0.857, and PSD = 0.0492.

4 Discussion

There are several more solvents than those we have studied for which conductance data exists for a sufficient number of ions to be able to use in Eq. 1. Unfortunately there are not enough data on nonelectrolytes in these solvents to obtain statistically meaningful equations. On the other hand, there are a large number of nonpolar solvents for which log10 (Do) are available, but for which there is no data, or very little data, on log10 (Do) for ions. However, we have results for enough solvents to show that Eq. 1 can be regarded as a general equation for the correlation and prediction of diffusion coefficients of ions and nonelectrolytes in water and in organic solvents. We include diffusion coefficients of zwitterionic aminoacids (in water), of the tetraalkyltins (in most of the solvents) and of the inorganic complexes Fe(acac)3 and Co(acac)3 as well as a number of ferrocenes (in acetonitrile) so that the scope of our equations is very large indeed. The generality of our equations is not limited by functional groups, but depends on the range of the independent variables in the various equations. We give in Table 1 the maximum and minimum values of the independent variables for the nonelectrolytes studied with water and butan-1-ol solvents; that is, the solvents with the largest and smallest numbers of the diffusion coefficients. Even for butan-1-ol, the range of the independent variables for nonelectrolytes is large enough to include a very wide range of solute types. The statistics of our equations for log10 (Do) are all quite good, with SD values ranging from 0.0332 (methanol) to 0.0684 (octan-1-ol). By comparison, our present equation for solvent water has SD = 0.0474 for 377 ions plus nonelectrolytes as compared to our previous statistic [12] of SD = 0.085 for 190 ions plus nonelectrolytes in water.

Table 1 Minimum and maximum values of the independent variables used in Eq. 1 for nonelectrolytes

We have data on diffusion coefficients for the large ions Ph4As+, Ph4P+ and Ph4B but in several solvents the observed values of log10 (Do) were considerably out of line and we then omitted these ions from all of our analyses. We have no explanation for this observation. Our descriptors for these ions have been used on several occasions in partition systems with good results [24] so errors in the descriptors can be ruled out. Our calculated log10 (Do) values for Ph4As+and Ph4P+ were always quite close to the calculated value for Ph4B as is required by the TATB convention [75, 76] that we used to obtain data on single ions [24], so that the log10 (Do) values we use for single ions are compatible with the TATB convention that we use to obtain descriptors for single ions.

It is useful to compare the coefficients in Eq. 1 for the various solvents that we have studied, and these are collected in Table 2. The three most important terms for both ions and nonelectrolytes are A, B and V with the a-, b- and v-coefficients all being negative. Thus increase in solute hydrogen bond acidity, hydrogen bond basicity and volume all lead to a decrease in the limiting diffusion coefficient, the effect of solute volume on diffusion being well known [77, 78]. The decrease in diffusion due to increase in solute hydrogen bond acidity and hydrogen bond basicity can probably be attributed to increases in solute–solvent acid–base interactions. It would be useful if the coefficients in Table 2 could be used to predict coefficients for other solvents. We examined the coefficients for water and the alcohols as a function of the number of carbon atoms in the solvents. Although there are some general trends, we were unable to find statistically satisfactory correlations between any of the coefficients and the number of carbon atoms. It can be seen, almost by inspection, that the coefficients in water and in ethanol are often outliers to the general trends.

Table 2 Summary of coefficients in Eq. 1

5 Conclusions

We have now shown that it is possible to correlate limiting diffusion coefficients for non-electrolytes, ions and ionic species in water and in seven organic solvents with Abraham descriptors through the linear free energy equation, LFER, Eq. 1. We can therefore now regard Eq. 1 as a general equation for the correlation of limiting diffusion coefficients. In principle, it could be applied to any solvent for which diffusion data exists on non-electrolytes, ions and ionic species. Of course, this excludes non-polar solvents where diffusion data are restricted to non-electrolytes, but there are numerous polar or semi-polar solvents to which Eq. 1 could apply. We have not been able to extend our studies to additional solvents simply because of the lack of data on either non-electrolytes or ionic species. Conductance data exists for ions in 2-methoxyethanol [79, 80], in N,N-dimethylformamide [66, 81] and in formic acid [82], and these could be used to obtain corresponding limiting diffusion coefficients through Eq. 2. However, there is not enough data on non-electrolytes in these solvents to apply Eq. 1. Whether Eq. 1 could be applied to diffusion of non-electrolytes and ions through biological or other membranes [3] remains to be seen.