1 Introduction

The world is increasingly searching for sustainable substitutes to replace fossil fuels. Biomass from plants is one large resource that could be converted into fuels and chemicals [1]. To accomplish this goal, some have proposed the concept of a biorefinery, which would take raw materials such as biomass and convert them into valuable products needed in modern society [2]. Similar to the current petroleum-based supply chain, a bio-based supply chain would involve hundreds of different chemicals, whether as intermediates or final products [1, 2].

Chemicals relevant to the petroleum industry have been thoroughly studied over the past century, but for many bio-based chemicals there is little, if any, data about their properties. Key thermodynamic and physical properties of bio-compounds will need to be determined in order to produce bio-based chemicals. In this study we measured the vapor pressures, densities and refractive indexes of 11 such bio-compounds.

Several of these bio-compounds show up as intermediates or platform chemicals that could be further converted into a variety of products [2]. Many of the chemicals are, or could be, useful products. Dihydrolevoglucosenone is a potential bio-based alternative for dipolar aprotic solvents, such as N-methyl-2-pyrrolidone and dimethylformamide [3, 4]. 2,6-dimethoxyphenol has a smoky aroma and is an ingredient in artificial smoke-flavoring products [5, 6]. It, and its derivatives, could also be used for producing renewable phenolic resins [7]. 2-methoxy-4-methylphenol is used as a flavoring [8]. 2-sec-Butylphenol has been successfully tested as a solvent for furfural extraction in the so-called biphasic reactor concept, and it may be possible to produce it from lignin [9]. It has also been proposed that alkylphenols, such as 2-sec-butylphenol, be used as a solvent in biofuel production to increase the effectiveness of the process [10]. γ-valerolactone can be used as a fuel and has also been identified as a promising platform chemical that can be used to produce a variety of other chemicals [11,12,13,14]. Tetrahydrofuran is used as a solvent and in producing some polymers [15, 16]. 2-methylfuran has received attention as a potential substitute for gasoline due to its impressive combustion performance [17]. Cyclopentyl methyl ether is used as a solvent, and it has been shown to be a promising solvent for extracting compounds from the aqueous streams present in bio-refineries [18, 19]. Furfural can be used as a selective solvent and is a platform chemical that can be processed into other products [20].

Many of these compounds can also be produced from one another. For instance, dihydrolevoglucosenone is produced via hydrogenation of levoglucosenone, which itself is produced from the sugar levoglucosan that is formed in the pyrolysis of lignocellulosic biomass [21]. In addition, there are reactions for converting between 2-methylfuran, tetrahydrofuran, furfural, and 2-pentanone [2, 22]. 1,3-dimethoxy-2-hydroxybenzene and 2-methoxy-4-methylphenol have been found in plants and in pyrolysis oil [23].

For 6 (2-methoxy-4-methylphenol, 2-sec-butylphenol, 2,6-dimethoxyphenol, cyclopentyl methyl ether, dihydrolevoglucosenone, levoglucosenone) of the 11 compounds there is only a small amount of data, if any at all. For 5 bio-compounds (2-methylfuran, 2-pentanone, furfural, tetrahydrofuran, γ-valerolactone) large amounts of data can be found in the literature. However, even for these 5 compounds the data presented here extends beyond the range covered in the literature. For 7 of the bio-compounds (2-methoxy-4-methylphenol, 2-methylfuran, 2-pentanone, cyclopentyl methyl ether, dihydrolevoglucosenone, furfural, tetrahydrofuran) we measured compressed densities at higher pressures (up to 12 or 16 MPa), and for many of the compounds there was no reliable data available at higher pressures. We have placed a file containing both the literature data we found and our experimental data in a repository at the Open Science Framework, and this file can be obtained at (https://osf.io/u9amn/). An overview of the measurements made in this work is presented in Table 1.

Table 1 Overview of the measurements made in this work

2 Methods

2.1 Chemicals

Information about the chemicals used in this study is presented in Tables 2 and 3. The structures of the chemicals are presented in Fig. 1 [24]. The purities were measured using gas chromatography with a flame ionization detector. For most of the chemicals the water content was also measured using a DL38 Karl Fischer Titrator (Mettler Toledo). The purity was calculated by taking the relative peak area from the chromatogram and then accounting for any water by dividing by 1 plus the water content (if measured).

Table 2 Names, identifiers, and suppliers of the chemicals used in this study
Table 3 Purities of the chemicals used in this study
Fig. 1
figure 1

Structures of the chemicals measured in this study. Structures were obtained from PubChem [24]

All samples except one had purities close to 100 %. The one exception was the levoglucosenone used for vapor pressure measurements, which only had a purity of 96.2 wt%. Mass spectroscopy was used to investigate what impurities were present, and most of them were lower weight impurities such as 2-methylpentane, hexane, methylcyclopentane, cyclohexane and acetone. The levoglucosenone also contained one heavier impurity: 2-methoxyphenol. These impurities could be tolerated when measuring the vapor pressure because in the gas saturation method most of the light impurities are removed in the first run. This occurred with levoglucosenone in this study: the vapor pressure was much higher for the first run, and that point was removed. After the vapor pressure measurement, the purity of the levoglucosenone condensed at the outlet of the gas saturation cell was determined to be 98.5 wt%. Later the levoglucosenone was distilled to get a higher purity for the density and refractive index measurements. This distilled sample had a purity of 98.7 wt%.

2.2 Density Measurements

Densities were measured with a DMA HP density meter (Anton Paar). Most of the samples were measured at a range of pressures, and for these measurements a UNIK 5000 pressure sensor (GE) was used (range of 0 to 20 MPa, abs.). The sensor had been calibrated against a MC2-PE calibrator with an EXT600 external pressure module (Beamex). The MC2-PE calibrator had been calibrated by Beamex. The pressure data has a standard uncertainty of 3100 Pa (expanded uncertainty of 6300 Pa at the 95 % level). For three of the samples measurements were only made at atmospheric pressure, and pressure data were taken from the Finnish Meteorological Institute (Tapiola observation station, Espoo, Finland) [25]. For the temperature, the manufacturer of the density meter only states that the accuracy is better than 0.1 K.

The samples were degassed for about 30 min before measuring. This was done by placing the sample in a round-bottomed flask, which was then placed in an ultrasonic bath. Gasses were removed from the system using a vacuum pump.

Water and nitrogen were used to calibrate the density meter. Reference values for these compounds were taken from reference equations of state, [26, 27] and we used the implementations of these equations available in the CoolProp package for Python [28]. Alternatively, these equations of state are also implemented in the NIST thermophysical properties calculator, [29] and we verified the CoolProp implementation by manually comparing the results of the two programs at multiple temperatures and pressures. For optimizing the calibration equation parameters we used the differential evolution solver [30] implemented in the SciPy package [31] for Python. The root mean-squared error between the reference and calculated values was used as the objective function.

Between samples, the performance of the device was checked by measuring air and water. During the study the performance checks indicated that a recalibration was necessary, so for later samples, a second set of calibration parameters were used. For the first calibration the standard uncertainty was estimated to be 0.047 kg·m−3 (expanded uncertainty of 0.092 kg·m−3 at the 95 % level). For the second calibration, the standard uncertainty was estimated to be 0.036 kg·m−3 (expanded uncertainty of 0.072 kg·m−3 at the 95 % level). One major uncertainty component of density is the impurities in the sample. The impurities are sample specific and the effect is included in the uncertainty estimates given in the density results.

One sample, levoglucosenone, was measured at atmospheric pressure using a DMA 5000 M density meter (Anton Paar). The performance of the device was checked with water and air, and based on this the expanded uncertainty at the 95 % level was estimated to be 0.05 kg·m−3.

Bio-compounds are often thermally unstable. The densities of decomposition products often deviate from that of the pure measured component. In some cases, this allows potential decomposition to be detected by just observing density changes during the measurement. When measuring furfural, decomposition was observed at 473 K. Dihydrolevoglucosenone started to react at 423 K. The density value was stable for about the first 20 min at this temperature, but then started to increase. Therefore, the few points from the beginning of the measurement at 423 K are included in the data file, but these points were not included during regression.

2.3 Gas Saturation Measurements

A gas saturation method was used to measure the vapor pressures of 6 of the bio-compounds (2-methoxy-4-methylphenol, 2-sec-Butylphenol, 2,6-dimethoxyphenol, dihydrolevoglucosenone, levoglucosenone, and γ-valerolactone. About 10 ml of each sample was placed in a glass vessel filled with spherical glass beads, and the vessel was put in a gas chromatography oven. The oven maintained a stable temperature (fluctuations were within ± 0.01 K). A flow of nitrogen was introduced, and this gas became saturated with the vaporized compound. To maintain the flow rate of nitrogen, a flow controller (Alicat Scientific, Tucson, AZ, USA) was placed in the nitrogen inlet line. The nitrogen flow rate was measured with a bubble meter, both before and after each run (standard uncertainty of 0.039 ml·min−1, expanded uncertainty of 0.088 ml·min−1 at the 95 % level). The vessel was left in the oven for a period of time (on the order of hours). Afterwards, it was removed and weighed to determine the mass lost. The vapor pressure was then calculated based on Eq. 1

$$ P = \frac{{\frac{m}{W}}}{{\left( {\frac{m}{W} + \frac{{tVP_{atm} }}{{T_{room} R}}} \right)}} \cdot \left( {P_{atm} + \Delta P_{loss} } \right) $$
(1)

where P is the vapor pressure (Pa), m is the mass of the test chemical that leaves the cell (g), W is the molar mass of the test chemical (g·mol−1), t is the duration of the measurement (min), V is the volumetric flow rate of the carrier gas (in this case, nitrogen) in units of L·min−1, Patm is the atmospheric pressure at the place and time the experiment is carried out (Pa), Troom is room temperature (K), R is the ideal gas constant, and ΔPloss is the pressure drop over the gas saturation cell (assumed to be zero for our measurements). More details about the gas saturation method can be found from other references [32, 33].

Atmospheric pressure was taken from values measured by the Finnish Meteorological Institute (Tapiola observation station, Espoo, Finland) [25]. The cell and room temperatures were measured with calibrated Pt-100 temperature probes (Frontec) connected to a Systemteknik S2541 thermometer (Frontec). These probes had an expanded uncertainty of 0.04 K (using a coverage factor of 2).

The uncertainty of the vapor pressures were calculated using a Monte Carlo method (see ISO/IEC Guide 98-3) [34]. 8 different parameters were included that could potentially affect the vapor pressure value, including the purity of each compound. The uncertainty of those 8 parameters was used to specify a distribution for each, and values were then selected from those distributions to calculate a vapor pressure value. This was repeated 1 million times for each experimental data point. The standard uncertainty of each vapor pressure point was taken to be the standard deviation of the distribution from the Monte Carlo calculation. The calculated uncertainties can be found in the vapor pressure data file in the OSF project for this article (https://osf.io/u9amn/). The code we used for performing the uncertainty calculations can also be found in the same OSF project (https://osf.io/u9amn/).

2.4 Refractive Index Measurements

Refractive indexes were measured using a Dr. Kernchen Abbemat digital refractometer (Anton Paar, Graz, Austria), and this refractometer measures at a wavelength of 598.3 nm. Based on measurements with water at 25 °C, the standard uncertainty of the refractometer was calculated to be 0.00 034 (expanded uncertainty of 0.00 078 at the 95 % level). Reference data for water were obtained from Schiebener et al. [35].

2.5 Modeling with PC-SAFT

The PC-SAFT equation of state was used to model the properties of the bio-compounds [36]. Because all of the bio-compounds contain polar and/or associating functional groups, contributions from the dipole and associating terms were also included, as appropriate [37,38,39,40,41,42]. When including the dipole term from Gross and Vrabec, the equation is also called the PCP-SAFT equation of state [38].

De Villiers et al. [43] showed that it can be difficult to find the best fit for polar compounds using pure component data alone. Often there is a large range of parameter values that will give good results for pure component properties, but poor results for mixtures. De Villiers et al. suggested that this could be because it is difficult to disentangle the contribution from polar interactions from the part due to dispersion forces. They proposed including VLE data with a nonpolar component when fitting the pure component parameters for polar compounds. Because such data is not available for many of the bio-compounds studied in this article, we simply set the number of dipoles (a parameter in Gross and Vrabec’s dipole term) equal to the actual number of polar functional groups in the molecule. This was the same strategy originally proposed by Gross and Vrabec [38]. The one exception was for furfural. A good fit could not be achieved unless the number of dipoles was also fitted against experimental data.

The dipole term also uses the dipole moment of the compound as a parameter. For most of the bio-compounds, the dipole moments were found in the literature [44,45,46,47,48,49]. Levoglucosenone was the only compound for which the dipole moment could not be found. Therefore, for use as a parameter in the PC-SAFT equation, the dipole moment was set to be equal to that of dihydrolevoglucosenone, since they are structurally similar compounds.

For some of the compounds in this article PC-SAFT parameters have already been presented in the literature (2-methylfuran, [50, 51]; 2-pentanone [37, 52]; cyclopentyl methyl ether [53]; furfural [37, 54]; tetrahydrofuran [37, 55] and γ-valerolactone [56]. We have refit parameters for these compounds because we had more data to include in the regression, including the new data measured in this article. Both literature data and data from this study were used when optimizing the PC-SAFT parameters. A file containing all the data used in optimization, including references, can be obtained from the OSF page (https://osf.io/u9amn/). It should be noted that in the data from Apaev et al. [57] for 2-pentanone there seems to have been a typo for the point at 376.63 K and 687 bar (probably should have been 799.5 instead of 899.5 kg·m−3), and we made this change in our literature data file. The total number of points used in the optimization and the number of literature data points is presented in Table 4. A total of 969 new measured data points were used in the regression.

Table 4 PC-SAFT parameters obtained for the 11 bio-compounds

Parameters were optimized by minimizing the root mean-squared errors of the vapor pressure and density added together. Optimization was performed using the differential evolution solver implemented in the Scipy package for Python [30, 31]. The resulting PC-SAFT parameters are given in Table 3. Our code for implementing the PC-SAFT equation of state can be found on GitHub: https://github.com/zmeri/PC-SAFT.

3 Results and Discussion

3.1 Compressed Liquid Density, Liquid Vapor Pressure and Refractive Index Measurement Results

The results for the density measurement are presented in Tables 5, 6, 7, 8, 9, 10, 11, 12, 13 and 14. The results for the vapor pressure measurement are presented in Tables 15, 16, 17, 18, 19 and 20. The measurements of the refractive index of the components can be found in Table 21.

Table 5 Compressed liquid density of 2-methoxy-4-methylphenol (creosol)
Table 6 Compressed liquid density of 2-Methylfuran
Table 7 Compressed liquid density of 2-pentanone
Table 8 Liquid density of 2-sec-butylphenol at atmospheric pressure
Table 9 Liquid density of 2,6-dimethoxyphenol (syringol) at atmospheric pressure
Table 10 Compressed liquid density of cyclopentyl methyl ether
Table 11 Compressed liquid density of dihydrolevoglucosenone
Table 12 Compressed liquid density of furfural
Table 13 Liquid density of levoglucosenone
Table 14 Compressed liquid density of tetrahydrofuran
Table 15 Vapor pressure of liquid 2-methoxy-4-methylphenol
Table 16 Vapor pressure of liquid 2-sec-butylphenol
Table 17 Vapor pressure of liquid 2,6-dimethoxyphenol (syringol)
Table 18 Vapor pressure of liquid Dihydrolevoglucosenone (cyrene)
Table 19 Vapor pressure of liquid Levoglucosenone
Table 20 Vapor pressure of liquid γ-valerolactone
Table 21 Refractive index of liquid Dihydrolevoglucosenone (cyrene), γ-valerolactone, 2-methoxy-4-methylphenol (creosol), 2-sec-butylphenol, Levoglucosenone, 2,6-dimethoxyphenol (syringol), Tetrahydrofuran, 2-pentanone, 2-methylfuran at 0.10 MPa

3.2 Comparison with Literature Values

A lot of literature data is available for tetrahydrofuran, and our experimental density values matched well with most of the literature values. For instance, at 293.15 K the value we measured was close to the mean of the literature data (see Table 22) and was well within the standard deviation of the literature values (0.33 kg·m−3, with outliers removed).

Table 22 Comparison of experimental density values from this study with literature values

We found seven literature sources that give compressed densities of tetrahydrofuran, and when comparing we found that the data from many of these literature sources seems to have relatively large errors [58,59,60,61,62,63,64]. Figure 2 compares the literature sources and values from this article, using the PC-SAFT fit for tetrahydrofuran as a reference. The points from Holzapfel et al. [58] and Sato et al. [59] aligned well with our data, as did much of the data from Govender et al. [60]. Data from Vasileva et al. [61] was in the same range as our data, but in general the deviations between the two data sets were larger than the measurement uncertainty of our data. The largest difference is 12 kg·m−3, and this occurs at 473.15 K and about 21 bar. At 293.15 K the value from Vasileva et al. is more than 7 standard deviations higher than the literature mean, and so it seems that there is a higher uncertainty in the data from Vasileva et al. The vapor pressures from Vasileva et al. also had some of the largest deviations from the PC-SAFT equation.

Fig. 2
figure 2

Comparison of the literature sources that give the density of tetrahydrofuran at elevated pressures. This work, [58], [59], [60], [61], [62], [63], [64]

The data from the other three sources [62,63,64] was significantly lower than our values and those from other literature sources. Although these researchers mostly measured at much higher pressures than the other sources, values at atmospheric pressure show that these three data sets have significant errors. For instance, at 323.15 K and atmospheric pressure the value from Schornack and Eckert [62] is more than 35 standard deviations below the mean of the other literature values (ours was within a standard deviation). There may have been some problem with their experimental setup because their value for chlorobenzene at 323 K and atmospheric pressure is 10 kg·m−3 lower than the value from the DIPPR correlation [45], so there seems to be a consistent negative deviation. Zhang and Kiran [63] did not measure at atmospheric pressure, but extrapolating from their data down to atmospheric pressure using a linear pressure dependency shows a similar large negative deviation from most other literature sources. The uncertainty caused by the extrapolation is in this case a magnitude lower than the deviation. For instance, at 302.4 K we got a value of 848.2 kg·m−3 when extrapolating their data, which is about 28 kg·m−3 lower than the literature mean.

For the vapor pressure measurements gamma-valerolactone could be used as a reference compound for validating our gas saturation method. We found the literature data from 11 other sources, and we compared these different data sets in Fig. 3 [56, 65,66,67,68,69,70,71,72,73,74]. Most of the data fall within a few percentage points of each other, but there is one point from Havasi et al. [69] that is about 40 % lower than the other data. It is so far out of line that it is below the y-axis limits in Fig. 3. In general, the data from Havasi et al. show an increasing negative deviation at lower temperatures, which could indicate a problem with the measurement procedure at those temperatures. The 3 points at the lowest temperatures from Havasi et al. were left out when fitting the PC-SAFT parameters.

Fig. 3
figure 3

Comparison of vapor pressure data for gamma-valerolactone from different sources. This work, [65], [66], [67], [68], [69], [70], [56], [71], [72], [73], [74]

Our data falls in line with the literature data, although it has a somewhat larger scatter than some of the literature data sets. In more recent experiments with the gas saturation equipment, we were able to improve the repeatability by making repeat measurements of the cell mass and taking the average of them, [75] so it seems that uncertainty in the sample weight was the largest contribution to the uncertainty for the gas saturation measurements presented here. We did not observe any systematic bias in our data when compared to the model or other data.

3.3 Results and PC-SAFT Modeling

The data accompanying this article has been uploaded to a scientific repository (Open Science Framework), and it can be accessed at https://osf.io/u9amn/. The repository contains the density, vapor pressure, and refractive index data. Here, we give an overview of the results.

Table 22 gives the density at 293 K for each of the 10 compounds measured in this study. Literature values are also given for comparison, where available. In general, our measured results match well with literature values. However, with 2-methoxy-4-methylphenol and 2-sec-butylphenol there are relatively large discrepancies between the experimental and literature values [75,76,77,78,79,80,81,82,83,84,85,86,87,88,89,90,91,92,93,94,95,96,97,98,99,100,101]. For both of these compounds only one literature value could be found, and they were from articles in 1952 and 1896 that used methods with higher uncertainties. The value for 2-pentanone may also seem to be out of line at first glance; however, when looking at the individual literature values one can see large variations between the different sources [82,83,84,85,86,87,88,89,90,91,92,93,94,95,96,97,98,99,100]. There are 5 sources that give a density close to ours (about 806.3 kg·m−3), but 6 of the 19 sources give a higher density of about 809 kg·m−3. So many of the values actually lie more than 1 kg·m−3 away from the mean. It is unclear why there is such a large scatter in the data for 2-pentanone, although all of the highest values were measured more than 50 years ago, and the more recent articles report a lower density.

Using the PC-SAFT equation, we could calculate the normal boiling point (at 101,325 Pa) and enthalpy of vaporization of each compound. These values are given in Table 23. For dihydrolevoglucosenone and levoglucosenone the PC-SAFT equation had to be extrapolated about 100 K above the available vapor pressure data to reach the boiling point, so the boiling points for these compounds contain larger uncertainty. For dihydrolevoglucosenone Sherwood et al. [3] measured an approximate value of 476 K for the boiling point using a TGA, which is about 20 K lower than the value we calculated. We observed that dihydrolevoglucosenone decomposed even at 423 K (see Sect. 2.2), and so it is possible that decomposition occurred in the TGA measurement, leading to an estimated value lower than the actual boiling point.

Table 23 Normal boiling points and enthalpies of vaporization of the bio-compounds

We can also briefly examine the relationships between molecular structure and properties. For instance we can see that hydrogenating levoglucosenone to dihydrolevoglucosenone gives a compound with a somewhat lower vapor pressure and density. This may be useful information in designing processes that produce dihydrolevoglucosenone from levoglucosenone. γ-valerolactone also stands out because it has a much higher boiling point than other compounds with similar molar masses.

4 Conclusions

Vapor pressures, densities, and refractive indexes were measured for a group of bio-compounds. For several of the compounds this is the first publicly available data on these properties. The experimental data measured here showed good agreement with most literature data for tetrahydrofuran and γ-valerolactone. Comparison also showed that density data at higher pressures from several of the literature sources was erroneous, both for tetrahydrofuran and for 2-pentanone. The new density data in this article helps to fill the gaps left when removing those unreliable datasets.