Background

When a chemical enters the body, either through absorption or through direct administration, the relative concentrations found in the blood and other tissues are determined by physiochemical processes that separate the different parts of the body. For example, the blood–brain barrier separates the blood from the brain’s extracellular fluid in the central nervous system and protects the brain from potential neurotoxins and bacteria while allowing passage of essential molecules such as water, glucose, and amino acids that are crucial to neural function.

Knowing or predicting the partition coefficients (ratio of concentrations) of compounds between the bloodstream and various tissues is important in order to study the pharmacokinetic profile of drug candidates. While in vivo measurements are of most value, obtaining them is often not practical. Thus over the years several models have been developed to predict blood–tissue partition coefficients [13], with recent special attention being paid to the blood–brain barrier [4, 5].

Linear free energy relationships, developed by Abraham [6], have been applied directly to blood–tissue partition coefficients by Abraham, Gola, Ibrahim, Acree, and Liu [1] resulting in the model

$${\text{log BB}} = c + e E + s S + a A + b B + v V + i l_{\text{c}}$$
(1)

where log BB is the base ten logarithm of the blood–brain partition coefficient; E, S, A, B, and V are the standard solute descriptors [7, 8] and c, e, s, a, b, v, and i are the process coefficients, see Table 1. The descriptor Ic is an indicator variable for carboxylic acids that is taken to be one if the solute is a carboxylic acid and zero otherwise. This flag is not usually included in a general Abraham-type model but is needed here because the pH of blood is 7.4 and carboxylic acids are ionized at this pH.

Table 1 Coefficients in equation one for in vivo processes at 37 °C [1]

Abraham and Acree have also used Eq. (1) to show that the water–1,9-decadiene system can be used as an excellent model for permeation through egg lecithin bilayers [9]. This suggests that other water–solvent systems could be used as models for blood–tissue coefficients. This would be very useful, because then in vivo blood–tissue partition coefficients could be estimated in vitro.

Methods

Abraham model coefficients have been determined for over 90 organic solvents and can be predicted for others [10]. To find water–solvent systems that could be used to approximate blood–tissue systems we regressed the e, s, a, b, and v coefficients for each of the 90 organic solvents against the e, s, a, b, and v coefficients for each blood–tissue system listed in Table 1 above. The c-coefficient was not included as it is the intercept and could be adjusted separately after the regression had been performed. Specifically, we used linear regression in R (v 3.1.1)—‘lm’ command—and determined the best fit by using ‘regsubsets’ command in the ‘leaps’ package.

For example, the logarithm of partition coefficient for the blood–brain barrier is:

$$\begin{aligned} {\text{log BB}}& = 0.547 + 0.221 {\text{ E}} - 0.604 {\text{ S}} \\ &\quad- 0.641 {\text{ A}} - 0.681 {\text{B}} + 0.635 {\text{V}} - 1.216 {\text{ l}}_{\text{c}} \end{aligned}$$
(2)

Regressing Abraham solvent coefficients against this equation, we find that the water–methylcyclohexane partition system

$$\begin{aligned} {\text{log P}}_{\text{mcy}} &= 0.246 + 0.782 {\text{ E}} -1.982 {\text{S}} \\ &\quad- 3.517 {\text{ A}} - 4.293 {\text{B}} + 4.528 {\text{V}} \end{aligned}$$
(3)

can be used as a good (p < 0.002, R2 = 0.94) model for blood–brain barrier partition coefficients as follows:

$${\text{log BB}} = 0.505 + 0.169 {\text{log P}}_{\text{mcy}} - 1.216 {\text{ I}}_{\text{c}}$$
(4)

where log Pmcy is the measured log P value for methylcyclohexane. For additional details, datasets, and the R-code used, see the Open Notebook lab page [11].

Substituting Eq. (3) into (4) gives:

$$\begin{aligned} {\text{log BB}} &= 0.547 + 0.132 {\text{ E}} - 0.335 {\text{S}} \\ &\quad- 0.594 {\text{A}} - 0.726 {\text{ B}} + 0.765 {\text{ V}} - 1.216 {\text{ l}}_{\text{c}} \end{aligned}$$
(5)

Comparing Eqs. (2) and (5) we see fairly good agreement between coefficients. To validate our model we plotted the predicted log BB values for water, for six inorganic gases and for 13 common organic compounds using both equations, see Table 2; Additional file 1: Appendix Table S1; Fig. 1.

Table 2 Predicted blood–brain barrier partition coefficients
Fig. 1
figure 1

Predicted blood–brain barrier partition coefficients coloured by measured log BB value

The mean-square-error (MSE) between Eqs. (2) and (4) is 0.03 log units. The largest error occurs for styrene (AE 0.93 log units). In fact, without styrene, the MSE would drop to 0.02 log units. The reason why styrene is an outlier is that it is on the edge on the training-set chemical space. It has E and S values of 0.85 and 0.65 respectively as compared to the average values of E and S for the other compounds in the training set of 0.16 and 0.24 respectively. Other solvents that could be used as model systems for the blood–brain barrier include 1,9-decadience and octane.

We have modeled log BB indirectly by comparing the Abraham coefficients for water–solvent systems to the Abraham coefficients for log BB. We found that the water–methylcyclohexane system may be a good system to use to approximate log BB values in vitro, especially for solutes whose descriptor values fall within the range covered by both Abraham models (log BB and log Pmcy). That is, Eq. (4) can be used to predict log BB values from log Pmcy values but should be used with caution when using it with compounds outside the chemical space used to create these models. In addition, the MSE of 0.03 is between Eqs. (2) and (4) and we do not claim that Eq. (4) will have this type of performance when used to predict measured log BB values. Our work indicates that methylcyclohexane is a good candidate for approximating log BB values but future work should focus on modeling log BB directly from log Pmcy when measured values for both log BB and log Pmcy are known for a significant number of compounds. Of particular interest would be experimentally determining both log BB and log Pmcy values for more common organic compounds (including crystalline compounds) that span a larger range of solute descriptors. The 20 compounds that are common to both the log BB and log Pmcy databases are inorganic gases and liquid organic compounds. The organic compounds, while not pharmaceutical compounds, are ones that workers are exposed to in chemical manufacturing processes.

Results and discussion

We have seen that methylcyclohexane can be used to approximate log BB using Eq. (4). In general, we approximate the blood–tissue partition coefficient using the following equation

$${\text{log P}}_{\text{blood/tissue}} = {\text{c}}_{0} + {\text{c}}_{ 1} {\text{X}}_{ 1} + {\text{I}}_{\text{c}}$$
(6)

where c0 is the intercept, c1 is the coefficient multiplier for the log P system corresponding to solvent X1, and Ic is the carboxylic acid flag. Performing a similar analysis as described above and regressing the water–solvent system Abraham coefficients against the blood–tissue systems given in Table 1, we find the following results, presented in tables, see Tables 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, where the p-values are the standard p-values from linear regression—calculated using the ‘lm’ command in R.

Table 3 Top five solvents for blood–brain
Table 4 Top five solvents for blood–muscle
Table 5 Top five solvents for blood–liver
Table 6 Top five solvents for blood–lung
Table 7 Top five solvents for blood–kidney
Table 8 Top five solvents for blood–heart
Table 9 Top five solvents for blood–skin
Table 10 Top five solvents for blood–fat
Table 11 Top five solvents for water–skin
Table 12 Top five solvents for skin-permeation

Examining the results presented in the Tables 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, we see that the blood–brain barrier system can be modeled well with multiple solvents, including methylcyclohexane, octane, and 1,9-decadiene.

The results for blood–muscle and blood–liver were similar, with similar solvents, but very poor R2 values overall. The highest R2 was 0.44, exhibited by 2,2,2-trifluoroethanol for the blood–liver system.

The results for modeling the blood–lung, blood–kidney, and blood–heart partition coefficients were interesting as the top three suggested replacement solvents were identical, namely: 2,2,2-trifluoroethanol, methylcyclohexane, and 1,9-decdiene. The R2 values for these systems ranged between 0.41 for blood–kidney to 0.72 for blood–heart.

The blood–skin barrier model showed very strong results, with all of the top 5 R2 values above 0.95, which is very good. Some previously unseen solvents came up, the various ethanol–water mixtures composed four of the top five solvents.

Modeling the blood–fat system also had some very promising results. The highest was carbon disulfide with an R2 of 0.998. The lowest of the top 5 values was still very good, an R2 value of 0.95 for peanut oil. We suggest using the water/peanut oil system as a replacement system for blood–fat partition coefficients.

The water–skin solvents tested also produced strong results; the lowest of the top five R2 values is over 0.9, much higher than several of the earlier systems. Tetrahydrofuran resulted in the highest R2 value at 0.997.

The top five suggested replacement water–solvent systems for skin-permeation, like many previous blood–tissue systems, show great promise. The top three solvents being methyl tert-butyl ether, tetrahydrofuran, and diethyl ether.

Whilst most blood–tissue systems can be modeled with a single water–solvent system, blood–muscle, blood–liver, and blood–kidney had poor results, with R2 values all below 0.45. This is due to these three solvents having the smallest v values (0.110, 0.337, and 0.410) and highest b values (0.028, 0.181, 0.232) taking them out of the chemical space for single solvents. For these systems we modeled the blood–tissue coefficients using two measured water–solvent partition coefficient values X1 and X2 as follows

$${\text{log P}}_{\text{blood/tissue}} = {\text{c}}_{0} + {\text{c}}_{ 1} {\text{X}}_{1} + {\text{c}}_{ 2} {\text{X}}_{ 2} + {\text{I}}_{\text{c}}$$
(7)

where again c0 is the intercept. The results of these models are again presented in table form, see Tables 13, 14, 15.

Table 13 Top five results for two-variable blood–kidney partition coefficient
Table 14 Top five results for two-variable blood–liver partition coefficient
Table 15 Top five results for two-variable blood–muscle partition coefficient

Blood–kidney regression with 1-variable produced very poor results, the top R2 value was 0.4 for 2,2,2-trifluoroethanol. Two variables can be used to increase the R2 value. This greatly improved all values for blood–kidney, the top value produced by a mixture of ethanol/water (20:80) and DMSO, with an R2 value of 0.997.

Blood–liver also produced very poor 1-variable results, so 2-variables were used to improve the R2 value. The highest R2 with 1-variable was 0.44 with 2,2,2-trifluoroethanol. The highest R2 with 2-variables was 0.99 by ethanol/water (60:40) and N-methyl-2-piperidone.

For the blood–muscle process, the overall 2-variable correlation coefficients were fairly good. The solvents that are best are chloroform and dibutyl ether with an R2 value of 0.97.

Combining two measured water/solvent partition coefficients can also improve the models for approximation the other blood–tissue partition coefficient values. See the Wiki page in the references for a complete list of all two-variable data tables [11].

When looking at the results, we note that the standard 1-octanol/water partition coefficient (log P) does not appear as a top solvent for any of the blood–tissue processes. This is interesting because log P has for a long time been assumed to be useful in estimating the distribution of drugs within the body and is a standard descriptor used in most QSAR modeling. Since the use of log P is prevalent throughout the chemistry community, we calculated how well the Abraham model for every blood–tissue partition coefficient can be modelled by the Abraham model for log P, see Table 16.

Table 16 Equation (6) coefficients for 1-octanol against multiple processes

Examining Table 16, we see that log P can be used to approximate all blood–tissue partition coefficients and actually performs moderately well for estimating log BB, but poorly for blood–muscle and all other organs. However, log P seems like a reasonable measure for processes to do with chemicals entering into the body: blood–skin, blood–fat, water–skin, and skin-permeation. The latter observation is in accord with the published results of Cronin and coworkers [12, 13] who noted that the percutaneous adsorption of organic chemicals through skin is mediated by both the hydrophobicity (log P) and the molecular size of the penetrant.

The water/solvent systems that included methylcyclohexane and 1,9-decadiene were in the top 5 results for multiple regressions. In Tables 17 and 18 we present the Eq. (6) coefficients for methylcyclohexane and 1,9-decadience respectively. In some case the coefficients have low R2 values. Keeping that in mind, we have a two more ways (with better performance than log P for predicting the important log BB partition coefficient) that all blood–tissue partition coefficients can be approximated by a single water–solvent partition coefficient measurement.

Table 17 Equation (6) coefficients for methylcyclohexane against multiple processes
Table 18 Equation (6) coefficients for 1,9-decadiene against multiple processes

As we have seen, methylcyclohexane is a good solvent when used to model the blood–brain barrier process. For other processes, blood–fat and skin-permeation, it showed a reasonably good R2 value (over 0.80). However, blood–muscle, blood–liver, and blood–kidney showed really poor R2 values (all less than 0.33).

1,9-Decadiene was just as good of a solvent as methylcyclohexane for approximating multiple blood–tissue coefficients. Blood–brain, blood–fat, and skin-permeation all showed good R2 values over 0.80. Just as in the methylcyclohexane case, the processes blood–muscle, blood–liver, blood–kidney were not well modeled and 2-solvent models are needed for more accurate approximations.

The research presented in this paper was performed under standard Open Notebook Science conditions, where day-to-day results were posted online in as near to real time as possible. For addition details, the data files, and the R-code used to find model systems, see the Open Lab Notebook page [11].

Conclusions

Replacement solvents for various blood–tissue processes are proposed based upon the Abraham general solvation linear free energy relationship (1). For example, the top five solvents for approximating the blood brain barrier partition coefficient are methylcyclohexane, 1,9-decadiene, octane, cyclohexane, and decane. The five best solvents for the other blood–tissue partition coefficients were also calculated and presented. For three systems: muscle, liver, and lung; two-solvent models were presented to improve accuracy. For 1-solvent models, two solvents regularly came up in the list of best solvents for many processes. The top two recurring solvents were methylcyclohexane and 1,9-decadiene. This suggests that a single water–solvent partition measurement could in either methylcyclohexane or 1,9-decadiene can be used to approximate several blood–tissue partition coefficients.