1 Introduction

1.1 PCM, their fields of application, and scope of this work

Materials that are able to store and release a considerable amount of heat while undergoing a phase change are called phase change materials (PCM). To achieve a high heat storage density at small volume and small volume change, most applied PCM undergo a phase change between solid and liquid; the phase change is then called melting and solidification. Some PCM undergo a phase change between two different solid phases. In general, it is thus a phase change on heating and on cooling.

The ability to store and release a considerable amount of heat while undergoing a phase change at small temperature changes results in two main fields of use [1]: heat storage with high storage density (per unit mass or volume) in a small temperature interval, and passive temperature stabilization. Applications focusing on storage of heat are, e.g. in buildings, industry, or power plants, and applications focusing on passive temperature stabilization are, e.g. in transport boxes, textiles, and again buildings. The temperature in current applications covers a range from about − 40 °C to about + 80 °C, while interest for future applications in power plants and industrial applications is up to about 400 °C. Common PCM are water (ice storage), salt hydrates, salts, and eutectic salt-water solutions, as well as paraffins, fatty acids, and sugar alcohols. A PCM can be a pure material or a mixture, and it can also be inhomogeneous due to additives or phase separation. For application, the PCM is often encapsulated, with capsule sizes ranging from about 10 μm (micro-encapsulation) to several 10 cm in cross-section (macro-encapsulation). Calorimetric measurements to determine the ability of PCM to store heat as a function of temperature are thus performed with a variety of methods and instruments. Some of them have been developed only in the last decade, especially to allow characterization of larger samples, even encapsulated PCM with volume in the order of 100 ml and larger. In this paper, we give an introduction, review, and recommendations for new reference materials for calibration and testing, needed to cover the full range of temperatures and sample sizes.

1.2 Basic effects and terms

For the following discussion, it is helpful to define some basic terms. At the center of the discussion is the capacity of a material for storage of heat as a function of temperature. For its description, the specific enthalpy h(T) and specific heat capacity cp(T) = dh(T)/dT are usually used.

Ideally, PCM undergo the phase change at a single phase change temperature Tpc. This case, ideal phase change behavior, is shown in Fig. 1. PCM are selected to have a high phase change enthalpy, better called enthalpy change during phase change Δpch. At an ideal single Tpc the value of cp is infinite due to the step in h. The relevant data to characterize an ideal PCM are then Tpc and Δpch; they are commonly tabulated. The enthalpy change Δpch is also called latent heat, meaning “hidden” heat as the temperature T remains unchanged at Tpc while heat is stored/released during the phase change.

Fig. 1
figure 1

Ideal phase change: heating and cooling T(t), h(T), and cp(T)

Most PCM however show non-ideal phase change behavior. There are different types of it. A type of non-ideal phase change behavior is a phase change temperature range, as Fig. 2 shows.

Fig. 2
figure 2

Non-ideal phase change: phase change temperature range; heating and cooling T(t), h(T), and cp(T)

A different behavior on heating and cooling is also possible, called hysteresis. The case of two different curves for a phase change temperature range shows Fig. 3. Another cause for hysteresis is subcooling, also called supercooling or undercooling. It is due to a metastable state: when cooling, the new phase is not formed until a sufficiently low temperature, the nucleation temperature Tnuc, is reached (Fig. 3). The effect depends e.g. on the sample size [2]; water nucleates between 0 °C in large volumes and about − 40 °C [3] e.g. in small droplets.

Fig. 3
figure 3

Non-ideal phase change: hysteresis incl. subcooling, here with a phase change temperature range; heating and cooling T(t), h(T), and cp(T). In cp(T), the effect of subcooling cannot be depicted

The relevant data to characterize a PCM in general are thus not just, as commonly done, Tpc and Δpch. An accurate way of data presentation/communication is the tabulation of the stored heat in temperature intervals for the case of heating and for the case of cooling, like cp, or plotting the h(T) curves. When using h(T), normalization should be done in the liquid, close to the phase change region; if normalized in the solid, different crystallinity etc. can lead to different h for the same T. The effect of subcooling should be treated separately by giving the nucleation temperature Tnuc and the measurement conditions e.g. sample size; this is because it commonly depends on the sample size and can be affected externally e.g. by the container walls, thus is not a material property.

The following discussion uses preferably h(T) and not cp(T). The reason for this is that h(T) data show not only the temperature variation, but additionally also the integral value of the heat stored directly. In addition, cp(T) is not useful to discuss ideal phase change as its value becomes infinite. This is also the reason why it is common that people refer to cp only in the temperature range outside the phase change; the different use of the term is a frequent source of misunderstandings. Nevertheless, cp data are a common output of many calorimeters. Care must then be taken when integrating cp(T) data to h(T) data in a temperature range where cp changes strongly while there is a low density of data points.

2 Calorimetric methods used for PCM

2.1 Reference materials and their use

The stored heat as a function of temperature, e.g. given as enthalpy change with respect to a reference temperature, is determined by calorimetric measurements. In a calorimetric measurement, the heat flow into or out of a sample is measured in connection with the sample temperature, giving also the resulting temperature change. Heat exchange and temperature are measured by sensors. The relation between the value and the signal of a sensor, thus the input and output, is called signal-value function, transfer function, or response function. Often it is not well known, or not known at all. The procedure of establishing a signal-value function is called calibration. For a calibration, one or several reference points are necessary; these are values of the property to be measured from so-called reference materials. A reference material is a material with a property that is highly constant with time and where a property value is well known. Certified reference materials have a value that has been accepted by experts by definition or from highly accurate measurements; e.g. the kilogram, the meter, the atomic unit, or the temperature fixed points e.g. in ITS-90. From the signal resulting from a measurement and the known property value of one or several reference materials the function between the value to be determined and the signal is found. For calorimetry, reference materials for temperature and amounts of heat are needed. Commonly, they refer to melting or triple point for temperature, and for heat to the related phase change enthalpy or the heat capacity of solids and liquids. Materials, and recommended values, are e.g. found in [4,5,6]. Figure 4 shows an overview on certified reference materials for Tpc, Δpch, and cp in the temperature range from − 40 to +160 °C; thus also includes the high temperatures of immediate interest in PCM R&D.

Fig. 4
figure 4

Overview on certified reference materials in the temperature range from − 40 to +160 °C, above the T axis for cp, below for Tpc and Δpch

Calibration of sensors is only one application of reference materials. Somewhat similar is their use in a relative/comparative measurement. A relative/comparative measurement compares the sample with a reference directly during the measurement process; this allows a result relative to the reference, e.g. the measured sample value is smaller/larger than the reference value. If the signal-value function is linear, the ratio of both values and the well-known value of the reference can be used to calculate the value of the sample. Last, but not least, reference materials are also used for testing of methods, equipment, operating personnel etc. The availability of reference materials is crucial with regard to all these usages, thus must cover not only the temperature range, but also specifics of equipment including sample size, cost, and safety etc. We will now discuss the different methods and equipment.

2.2 Individual calorimetric methods

To discuss the need for new reference materials and the requirements on them, it is necessary to first look at the methods used for calorimetric measurements on PCM. These are hf-DSC, T-history method, 3-layer calorimeter, and AC (adiabatic calorimeter). In addition, methods to determine the thermal conductivity have been modified for calorimetric measurements; these are heat flow meter (HFM), guarded hot plate (GHP), and longitudinal guarded comparative calorimeter. They all are now discussed briefly, with a focus on what is relevant for the use and selection of reference materials.

The stored heat as a function of temperature is, at constant pressure, given as the enthalpy change with respect to a reference temperature

$$ \Delta Q_{p = const} = \Delta H $$
(1)

or as heat capacity at constant pressure

$$ C_{p} (T) = \left. {\frac{dQ}{{dT}}} \right|_{p = const} = \frac{dH}{{dT}} $$
(2)

Specific values, regarding mass or volume, are denoted by c, q, and h.

2.2.1 Calorimetric methods with heat flow via a thermal resistance

In a calorimetric measurement, the heat flow into or out of a sample is measured in connection with the sample temperature. The heat flow into or out of a sample can be realized in different ways. In DSC, T-history, or 3-layer calorimeter it is by a thermal resistance Rth between sample and ambient, so

$$ \Delta Q_{p = const} = \Delta H = \frac{1}{{R_{th} }} \cdot \left( {T_{s} - T_{a} } \right) \cdot \Delta t $$
(3)

(Ts and Ta denote sample and ambient temperature, t the time). This thermal resistance must be calibrated.

The most widespread commercially available calorimetric method is hf-DSC. In a heat flow Differential Scanning Calorimeter (hf-DSC), a difference measurement is performed between a sample and a reference crucible. Both are subject to the same varying temperature in their ambient, commonly at constant heating/cooling rate dTa/dt (dynamic), but also stepwise is possible. The reference crucible remains empty, thus ideally the signal from the difference measurement is due to the sample without its crucible. To determine the heat flow in/out of a sample from the measured temperature difference between sample and reference crucible, the heat flow signal must be calibrated. For this, a sample from a reference material is measured. For calibration, all materials shown in Fig. 4 with melting point and enthalpy are used to calibrate sensors for temperature and heat flow. Materials with cp are only useful to calibrate sensors for heat flow; here Al2O3 is most popular, specifically at high temperatures. Hf-DSC equipment covers the temperature range from below − 50 °C to beyond 1500 °C. Typical sample size is 10–100 μl, thus unsuitable for PCM composites as well as PCM that subcool in such small amounts. Larger samples of 500 μl or more are possible with Calvet-DSC; they can also be calibrated for heat by an electric heat source. Sample crucibles are metallic or ceramic, and metallic ones can also be sealed by cold-welding.

The T-history method is the most widespread self-built calorimetric equipment used for PCM. It is based on observing the T-history T(t) of a sample, e.g. in metallurgy where an eutectic point is indicated by a temperature plateau during cooling; thus, the method originally is purely qualitative. For PCM it was initially proposed by Zhang et al. [7] to avoid the high cost of DSC and the limitation in sample size. In the proposed method, the sample as well as a reference material are contained in test tubes, placed symmetrically in a temperature-controlled ambience, and exposed to a sudden change in temperature. During the temperature adjustment of the sample and the reference, the ambience is kept at constant temperature. From the recorded T(t) of the reference, the thermal resistance of the reference to the ambient is known. Assuming that it is the same for the sample, the heat flow in or out of the sample can be calculated from its T(t). Due to the variable heat transfer in air, the method was initially not very accurate. It was then modified successively, e.g. by Marín et al. [8] and Lázaro et al. [9]. The key was using insulated test tubes for the sample and the reference to increase their thermal resistance to the ambient, making the sample and reference more isothermal, and dominating overall heat transfer to assure that sample and reference have the same heat transfer resistance to the ambient. Existing equipment covers the temperature range from about − 40 °C to 200 °C, and sample size in the range of 10–20 ml, thus about 2 orders of magnitude larger than typical hf-DSC. The sample and reference containers are tubes from metal, plastic, or glass, and closed by a rubber plug or screw cap. As reference material in the measurement, water is commonly used (using its cp). Rathgeber et al. [10] describe a T-History calorimeter covering up to 200 °C and, due to the wide temperature range, used copper (cp) as reference material in the measurement, instead of water. The calibration is refined using, besides copper (cp), also indium (Tpc, Δpch) and h(T) data of three PCM from DSC measurements.

The 3-layer calorimeter is commercially available from just a single company, w&a (wärme- und anwendungstechnische Prüfungen, Fürstenwalde, Germany), and was solely developed for PCM. The measurement principle is similar to that of the T-history method, but instead of cylindrical tubes a box-shaped sample chamber, which is insulated, is used. And there is only a single chamber, so the calibration of the thermal resistance to the ambient must be done before any PCM sample is measured. The equipment measures samples encapsulated in flat bags of 100 ml size, thus about one order of magnitude larger than common T-history equipment, and 3 orders of magnitude larger than for common hf-DSC. The temperature range of the instrument is from − 20 °C to 100 °C. Calibration is done by the manufacturer only, using water and hexadecane (99 %).

2.2.2 Calorimetric methods with electric heating

The determination of the heat flow in or out of a sample via a thermal resistance and the temperature difference between its ends (Eq. 3), as used in hf-DSC, T-history, and 3-layer calorimeter, is not the only possible measurement principle. A completely different approach to realize the heat flow, not by a thermal resistance, is by electric heating. The heating power is Pel = U⋅I with U the voltage difference and I the electric current (el.). Then

$$ \Delta Q_{p = const} = \Delta H = P_{el} \cdot \Delta t = U \cdot I \cdot \Delta t $$
(4)

If adiabatic conditions are realized, heat flow into or out of the sample via the ambient is avoided. Adiabatic conditions are assured with an adiabatic shield, surrounding the sample cell, and following the temperature evolution of the sample cell such that no more relevant temperature difference exists. Calorimeters using this principle are called adiabatic calorimeter (AC). A calibration of a sensor for heat flow is here not needed; the heat input by el. work is known from Pel = U⋅I with high accuracy. To correct for the heat capacity of the sample cell, a calibration of it by a separate experiment is done.

A drawback of AC is that electric heating allows only measurements on heating, not on cooling. However, if small heat losses are allowed instead of completely adiabatic conditions, measurements can also be done on cooling. AC is used in different variations, e.g. depending on the variation of the electric heating with time.

To accurately measure heat capacity and enthalpy changes the classical Nernst heat pulse method [11] was used extensively in the past, using finite electric pulses, thus giving finite steps ΔQ. By adding consecutively small heat inputs, the enthalpy as a function of temperature can be obtained. Although tedious and time consuming, the method has been essential to determine accurate values of specific heat capacity over wide temperature ranges for a set of reference materials, e.g. synthetic sapphire and copper [12]. Often uncertainties of a few tenths of a percent are reported. Determination of latent heats, e.g. heat of fusion, is also possible [12, 13].

Adiabatic scanning calorimetry (ASC) was introduced in the 1970s, applying, under adiabatic conditions a constant electrical power to the sample (a crucial advantage when measuring latent heats compared to DSC, if operated at a constant rate of ambient temperature change). The result of ASC is a continuous heat capacity or enthalpy curve, compared to a discontinuous one when using heat pulses. Previous implementations of the ASC concept (e.g. [14,15,16]) required careful preparation and operation, which made ASC a method largely used in academia, operated by skilled and well-trained personnel. In a modern type of ASC these problems are eliminated by inserting, as a temperature difference detector between the sample and the adiabatic shield, a very sensitive semiconductor-material-based Peltier element (PE) [17]. The µK sensitivity of the PE for temperature differences allows to maintain almost perfect equality of sample and shield temperature in heating mode. In the cooling mode, a preset temperature difference between the sample and the shield can be kept constant with equal resolution. In the older versions of the ASC concept samples from a few grams to a couple of hundred milligrams were used to achieve proper accuracy. In pASC (Peltier-element-based ASC) samples as small as a few tens of mg are used. Routinely liquid (or melting) substances enclosed in typically 100 µl hermetically closed sample cells are used [18]. This size is similar to that of typical hf-DSC equipment. Recently, pASC has been used extensively for measuring reference materials like gallium [18] or biphenyl and phenyl salicylate [19], water and water-sodium chloride solutions [20], the n-alkanes C23 [21] and C26, C27, C28, C29, C30 [22], as well as the commercial PCM RT42 ([21, 23]).

2.2.3 Calorimetric methods based on equipment for thermal conductivity measurement

In addition to the calorimetric methods just discussed, methods to determine the thermal conductivity have been modified for calorimetric measurements, specifically to be able to measure large samples of PCM, PCM composites, or encapsulated PCM. This is necessary e.g. for quality control, as to test PCM content in an insulation material [24] or in macrocapsules. The methods used are heat flow meter (HFM), guarded hot plate (GHP), and longitudinal guarded comparative calorimeter.

In a calorimetric measurement, the heat flow into or out of a sample is measured. Heat flow meter (HFM), guarded hot plate (GHP), and longitudinal guarded comparative calorimeter are all methods to determine the thermal conductivity of a sample, where the heat flow into the sample on one side and out of the sample on the other is measured. In recent years, they have been tested also as calorimeters. For calorimetric measurements the conditions are changed such that the heat flow is in or out from both sides. They are thus non-conventional calorimetric methods, not developed for calorimetric measurements, but useful for the purpose with some modification. Information on calorimetric measurements on PCM by a heat flow meter (HFM), also called dynamic heat flow meter apparatus, is found in [25,26,27]. Wan et al. [28] describe the use of guarded hot plate (GHP). For the Longitudinal Guarded Comparative Calorimeter information is available in [29]. With regard to calibration, if needed, they are calibrated using reference materials for thermal conductivity. However, reference materials for Δpch or cp are still needed for testing (reference materials for thermal conductivity can e.g. not test side losses or gains).

Calorimetric methods used for PCM thus cover a wide variety of equipment as well as measurement, calibration, and evaluation procedures. They differ significantly regarding the sample size, the general sample properties, the parameter range that they can cover, and of course the accuracy.

2.3 State of the art measuring PCM

2.3.1 Basic terms of calorimetry regarding measurement and results

To discuss calorimetric measurements and results a few terms are needed and thus introduced here.

First, to reflect h(T) or cp(T) properly regarding T, the sample must be sufficiently close to isothermal conditions during a measurement; otherwise the heat exchanged cannot be attributed to a single T. In addition, to properly characterize a material, the sample must be representative of the material to be investigated.

Regarding the results of measurements, the “guide to the expression of uncertainty in measurement” (GUM) of the Joint Committee for Guides in Metrology [30] defines further, crucial terms. The true value of a quantity is the value consistent with the definition of the quantity. It would be obtained by a perfect measurement, which is impossible, therefore true values cannot be determined. The accuracy of a measurement refers to the closeness of the agreement between the result of a measurement and the true value in a qualitative way. To be able to assess the reliability of a result, a result should be accompanied by a quantitative indication of its quality by its uncertainty; the uncertainty reflects a lack of exact knowledge of a result, e.g. 10 °C ± 0.5 °C, 150 J/g ± 15 J/g, or 150 J/g ± 10%. Important is if the measurement result is repeatable, which refers to same conditions, and also if it is reproducible, which refers to different conditions. Reproducibility is crucial for using values from a measurement in an application at different conditions, but also when using different measurement methods, equipment etc. where sample size, pressure, or e.g. container walls are different.

2.3.2 Required uncertainty for the results of calorimetric measurements on PCM

The required uncertainty for results of calorimetric measurements on PCM depends on the application. Applications of PCM in buildings and in transport boxes can serve as examples.

The uncertainty of Δpch should be less than 10%, if possible 5%. This is a balance between effort and demand. 10% is a value that is commonly achieved in calorimetric measurements used in R&D on PCM, while 5% is reached by few, with significantly increased effort. At the same time, sensible heat and dynamic effects are present in real applications, and there is also always some variability in properties of different samples of the same material e.g. due to small changes in composition; therefore, the effort to achieve an uncertainty less than 5% in individual measurements is not justified.

The uncertainty in the temperature is important in applications regarding temperature stabilization and heat transfer. Regarding temperature stabilization, e.g. in transport boxes, requirements are usually given in integer values of T/°C. For heat transfer between PCM and ambient, e.g. at a temperature difference of 5 °C an uncertainty of 0.5 °C is needed to limit the uncertainty in heat transfer to 10%. Thus an uncertainty of at least 1 °C, better 0.5 °C, is desired.

2.3.3 Achieved uncertainty, and used reference materials

To discuss the need for new reference materials and the requirements on them it is necessary to look at the achieved uncertainty, and review which reference materials exist and which are used.

Today, activities exist worldwide and have resulted in experience with a number of calorimetric techniques, comprising diverse equipment, and different calibration and measurement procedures. Several standards for measurement, evaluation and data presentation/communication were developed. The oldest standard is the one of the RAL Quality Association PCM (available at http://www.pcm-ral.org/pcm/en/quality-testing-specifications-pcm/). It covers several properties and corresponding measurement methods. It does not specify all details, e.g. calibration, because the monitoring institutes are well experienced in the individual methods used by them. Institutes working in PCM R&D in most cases use hf-DSC for calorimetric measurements on PCM. A standard for them was developed within an IEA activity, the Task 42/Annex 29. The IEA standard comprises [31], specifically for hf-DSC, the determination of the appropriate heating and cooling rate to assure closeness to isothermal conditions in the sample, the calibration, the measurement itself, and the data analysis and representation. For calorimetric measurement of PCM using a HFM, an ASTM standard [26] was developed.

Testing the accuracy of measurement results is crucial to assess the reliability of data for applications. The immediate requirement is testing a specific measurement principle, method, procedure, etc. In addition, intercomparison tests, also called ring tests or round robin tests, test the agreement of results measured by different methods, equipment, procedures (calibration, measurement, evaluation) and/or people or institutes. Independent of the type of testing, a suitable material for testing must fulfill some requirements, to not pose a problem already on its own, such that it allows to just focus on the quality of the measurement. For this, the material must behave repeatable, thus not show any instability or irreversible behavior. It must also behave reproducible under the required conditions varied in the measurements, especially in intercomparison tests. This includes that representative samples can be taken for all sample sizes. In addition, the material must be safe, available in the required purity in sufficient amount and at reasonable price. A simple approach is using a certified reference material, where property values have been determined in intercomparison tests of professional institutes. The measured value can then be compared with the value of the reference material. For intercomparison tests, another option is selecting a material that is not a reference material, sometimes where even no property value is previously known, and just comparing the values resulting from the measurements.

With regard to the calibration and testing of individual equipment, the case of hf-DSC is the easiest. They are commonly calibrated by Ga, In, and Al2O3, and sometimes by further reference materials. Direct testing is then often done by one of these reference materials. For T-history, there is a reference material with known cp used during measurement, usually water; others with Tpc and Δpch are used for testing. Lázaro et al. [9] e.g. describe the testing of a T-history setup designed for the temperature range of − 20 °C to +65 °C, and sample size from 15 to 150 ml, using Ga, water, and hexadecane. Water melts at 0 °C, however is not a certified reference material regarding its melting enthalpy; the respective value of 333.5 J/g was taken from available literature. For hexadecane, 18.1 °C and 236 J/g [32] were used. For higher temperatures two PCM melting around 27 °C and 58 °C were measured and their results compared to results from DSC measurements. To be able to measure water at temperatures T < 0 °C, Al2O3 was used as reference material for cp in the measurement. Rathgeber et al. [10] describe the calibration of a T-History calorimeter for measurements in the temperature range from 40 to 200 °C. Due to the temperature range, instead of water they use copper with its cp as reference material during all measurements. As only a few reference materials undergo a phase change within the operating temperature range, they used In and also three PCM where data from literature and DSC measurements were available. For the 3-layer calorimeter, Δpch of hexadecane is used for calibration, using literature data. To be able to measure even larger samples, Göbel et al. [27] tested a commercially available heat flow meter, modified to be used for calorimetric measurements. The samples are plates of 20 cm × 20 cm and variable thickness. The materials used for testing were stainless steel 1.0332 and a gypsum board with microencapsulated PCM melting around 28 °C; the samples had about 12 mm thickness, thus about 400 ml volume. For both, reference data were taken from DSC measurement. For the stainless steel, using a temperature step between 19 and 29 °C, the mean specific heat capacity in the temperature range differed by less than 2% from the value determined by DSC. For the gypsum board with microencapsulated PCM, using a temperature step between 11 and 33 °C to include the phase change, the enthalpy difference in the temperature range differed by less than 4% from value on the data sheet provided by the manufacturer.

The use of reference data from DSC measurements in several cases when testing within a lab already indicates some demand for reference materials. But before analyzing the situation, let’s first complete things looking at intercomparison tests between labs using same or different calorimetric methods. Gschwander et al. [31] present results of an intercomparison test using hf-DSC within the IEA SHC Task 42/ECES Annex 29, at different laboratories using different devices. The material used for testing was laboratory-grade octadecane (Parafol 18-97, Sasol). After initially having large deviations in results, a measurement standard was developed telling how to take care of temperature gradients in the sample, calibration, and baseline instability. For calibration water, Ga, and In are recommended. With the new procedure, the results of the final test that comprise samples of about 8–300 mg measured in different DSCs, show a standard deviation of 10% in enthalpy and 0.3 °C in temperature. Vidi et al. [33] presents the results of an intercomparison test performed between the monitoring institutes of the RAL Quality Association PCM: the ZAE Bayern, the FhG-ISE, and w&a. The test followed the standard of the RAL Quality Association PCM. The material used for the test was the commercial, paraffin based PCM RT8 HC (Rubitherm). The instruments used were, in order of increasing sample size, a hf-DSC (~ 10μl) at the ZAE Bayern, a Calvet calorimeter (~ 100μl) at the FhG ISE, a T-history setup (~ 10ml) again at the ZAE Bayern, and a 3-layer calorimeter (~ 100ml) by w&a. The results show a very good agreement for the overall phase change enthalpy, ranging between 169 and 184 kJ/kg, thus with a total difference of 9%. The peak temperatures of cp for heating are between 7.7 and 8.0 °C, thus differing by only 0.3 °C (the samples are to be measured close to isothermal conditions, so comparing peak temperatures is a check of that). For cooling, they are between 6.7 and 8.0 °C.

3 Reference materials: demand

At a first glance, these results look quite good. All instruments are calibrated, reference materials for comparative measurements are available, tests could be performed, and test results are within the desired uncertainty. But things are not as satisfactory as they look at a first glance in general.

Let’s start with calibration, as it was performed. Already in DSC measurements water has been used as reference material for Tpc and Δpch, e.g. by Gschwander et al. [31]; this was done to have a third reference point besides Ga and In, so the temperature range around room temperature is included. But Δpch of water is not a certified reference value; the value has just been taken from literature. And the same applies to hexadecane when used to calibrate the 3-layer calorimeter; it is not a certified reference material, and its value of Δpch is just a literature value. In T-history a reference material for cp is measured in parallel with the sample, commonly water; its cp is a certified reference value. But only between 0 and 100 °C. To be able to measure below 0 °C, Lázaro et al. [9] used Al2O3, which is a certified reference material. To be able to measure up to 200 °C, Rathgeber et al. [10] used copper, however copper is not a certified reference material for cp above 300 K. Also, while copper and Al2O3 have volume specific cp similar to water (water 1.0 g/cm3·4.1J/gK = 4.1 J/cm3K, copper 8.9 g/cm3·0.39 J/gK = 3.5 J/cm3K, and Al2O3 3.9 g/cm3·0.78 J/gK = 3.0 J/cm3K), they involve some technical difficulties. Both are solid, therefore have a thermal contact resistance to the test tubes. In addition, Al2O3 as single crystal (synthetic sapphire) is far too expensive, and as powder or granulate has lower density and could absorb water, which could significantly alter the volumetric cp. The situation regarding calibration is thus by far not as good in general as it looks at a first glance. And testing? Regarding testing by using certified reference materials the situation is the same as just discussed, as the materials are the same that are already used for calibration. To test T-history equipment, Lázaro et al. [9] used Ga, and water and hexadecane which are not certified reference materials, while Rathgeber et al. [10] used In. Overall, testing was done where certified reference materials were available, specifically by In and Ga; they were supplemented by water which is not a certified reference material for Δpch, and in intercomparison tests they were supplemented by results from DSC measurements on materials that are not certified reference materials. The results of the tests achieved the desired accuracy. Striking is that no test was done at all below 0 °C, or above 100 °C.

We can summarize from the state of the art of current usage that

  • Specifically for T-history, it would be good to have other reference materials for cp besides water, Cu and Al2O3, specifically materials which are liquid, for below 0 °C and for above 100 °C where water cannot be used.

  • In general, it would be good to have other reference materials for Tpc and Δpch, besides Ga and In, especially as both are not cheap when needed in volumes of 10 ml or more (e.g. it would be good to be able to use water as a reference material not only for Tpc, but also for Δpch).

But there are other, known certified reference materials, that have not been used. Why are they not used? Reference materials are applied for different purposes: for calibration, as reference materials in comparative measurements, and for testing of equipment including also inter-comparison tests. For all these usages, it is recommended that a reference material has similar properties as the samples to be measured, so it is necessary to look at the typical properties of PCM and not of materials in general. When a cp reference material is used for calibration, e.g. in T-history, this rule is violated; the reason is to get a calibration in a temperature range instead of just at temperature points. However, the value of cp per volume should be high, as it is for water. In addition, in comparative measurements the reference is often used for months, so container tightness and corrosive behavior do play a special role. Also, the different methods used to characterize PCM differ in sample/reference size, and suitable reference materials should cover the full size range, e.g. be representative, have low cost, no hazards...

For a reference material for cp in T-history, heptane, shown in Fig. 4, is a certified reference material, but it has a flash point of 4 °C such that it poses some fire danger, especially if test tubes are not sealed perfectly. The flash point is the lowest temperature at which vapor of a material will ignite when given an ignition source. In addition, its cp per volume is just half of that of water. Sabbah et al. [5] list further reference materials for cp, which are also liquids like heptane and water: naphthalene, diphenyl-ether, polystyrene, polyvinyl chloride, benzene, hexafluorobenzene and nitrogen. But naphthalene, being liquid above 80 °C, has a flash point of only 80 °C. Diphenyl-ether, which is also liquid above 80 °C, has a flash point of 115 °C. Polystyrene is only liquid beyond 240 °C, and polyvinyl chloride is only liquid well beyond 100 °C. Benzene is liquid above 5 °C, and has a boiling point of just 80 °C and a flash point of − 11 °C. Hexafluorobenzene has similar properties, and nitrogen is only liquid below − 210 °C. All these materials are either not liquid in the required temperature range or are hazardous.

For Tpc and Δpch there are a number of other certified reference materials. Figure 4 shows mercury, cyclohexane, biphenyl, and naphthalene. No reference material below 0 °C is available, except mercury; but mercury should not be used for safety reasons in amounts of 10 ml or more as is needed for T-history and 3-layer calorimeter. Cyclohexane has a flash point of − 18 °C, is thus also not safe, especially in large amounts and if vapors can be generated by leakage; its boiling point is 81 °C. Biphenyl has a low Δpch of about 125 J/cm3 compared to PCM, but otherwise reasonable properties. Naphthalene has again a low flash point of 80 °C, making it not safe when handled in large amounts. Phenyl salicylate, not shown in Fig. 4, has a low Δpch of about 115 J/cm3, and a flash point of 113 °C.

For comparison, hexadecane and octadecane, the two alkanes that have been used already, have the following values: hexadecane has a boiling point of 287 °C and a flash point of 135 °C, octadecane has a boiling point of 316 °C and a flash point of 166 °C. They are thus much safer to handle. And for water, where Δpch is not a certified reference value, a value of 333.5 ± 0.2 J/g is available in literature [13, 34]. These three materials could thus be potential new reference materials. But even if they are, they are not sufficient as they just cover a narrow temperature range.

4 Potential new reference materials

4.1 Requirements for reference materials

Reference materials are applied for different purposes: for calibration, as reference materials in comparative measurements, and for testing of equipment including also inter-comparison tests. For all these uses, a number of requirements exist in general. The requirements on a material to be suitable as reference material can be split in requirements regarding the selection of the material, the measurement of the property, and the presentation/communication of the property data.

  1. 1.

    Selection of a material

    In general, a material should pose no significant problem on measuring its property. This means it should not be hazardous or corrosive. It must also be available at the required composition, e.g. purity, at an acceptable price. For the measurement of the property by a wide range of instruments, parameters etc., the material must behave repeatable with regard to the property, meaning give the same result at same conditions, as well as reproducible, meaning at somewhat different conditions. E.g. the material must also allow to take representative samples for different sample sizes.

  2. 2.

    Measurement of the property

    The property of the material must have been measured by experts in the field, with highly accurate measurements, testing specifically reproducibility, and a property value or value function has been accepted including uncertainty. The uncertainty of a reference material must be significantly lower than the uncertainty needed in common measurements to be useful for calibration etc.

  3. 3.

    Presentation/communication of the property data

    The presented/communicated property data must include the specification of the material (composition, preparation…), the value or value function of the property including uncertainty, and maybe also the measurement method used and a reference where further information is found.

4.1.1 Requirements for reference materials for calorimetric measurements on PCM

What does this mean specifically for calorimetric measurements on PCM? First, reference materials are needed with regard to amounts of heat and temperature. Useful material properties are Tpc and Δpch of phase changes, and for amounts of heat also the specific heat capacity cp.

  1. 1.

    Selection of a material

    Calorimetric measurements on PCM are done using a wide variety of samples regarding size and even encapsulated PCM (e.g. for quality control), and very different methods and instruments. Reference materials are used for calibration, for comparison, and for testing, and at best are suitable for all these usages. To be useful in comparative measurements and inter-comparison tests they must be useful for many measurements and for long time, meaning be perfectly repeatable. And to be useful for the variety of sample sizes and measurement conditions well reproducible, e.g. the material must allow to take representative samples for a wide range of sample sizes. It is critical that the material is available in the required composition; e.g. how does the value of cp, Tpc, or Δpch change with composition? The el. conductivity of semiconductors is very sensitive to impurities, while the density of paraffins is not. For Tpc, Δpch, and cp, this should be tested by comparing a highly pure sample with one of lower purity, e.g. for paraffins 99% or better to 98%. Also related are the cost, even for sample size of 100 ml or above (500 ml for HFM).

    Because of the sample size, and the construction of the various instruments, a reference material should also have low vapor pressure e.g. to prevent the hazard of combustible vapors in large amounts, not diffuse (diffusion loss in crucibles), not change its composition or that of containers by corrosion, not change its composition if in contact with air (not be hygroscopic, not oxidize). For a specific selection, a reference material should have a similar property value as the samples to be measured. For PCM this means Δpch should be higher than 150 J/g or 150 J/ml, and Tpc in the range of interest. The thermal conductivity can be low; common liquid PCM have below 1 W/mK. Furtheron, materials with close peaks that cannot clearly be resolved into separate peaks by the calorimetric measurements used for PCM are not useful. When a cp reference material is used for calibration, e.g. in T-history, this rule is violated somewhat; the reason is to get a calibration in a temperature range instead of just at temperature points. However, the value of cp per volume should be high, as it is for water.

  2. 2.

    Measurement of the property

    For a reference material for cp, Tpc or Δpch in calorimetric measurements, the respective value or value function needs to be measured by experts in the field with highly accurate calorimeters, testing specifically reproducibility, and result in an accepted property value or value function including uncertainty. Specifically, for PCM, what does that mean?

    For amounts of heat, a measurement should have an uncertainty of 5% as discussed previously. The uncertainty of a reference material must be significantly lower, e.g. 3%; this could comprise a contribution of 1% from the sample (material) e.g. by its composition (e.g. purity), 2% from the measurement of the property, and 0% from the presentation/communication of the property data. For temperature, a measurement should have an uncertainty of 0.5 °C as discussed previously. Let’s assume an uncertainty of 0.3 °C at most for a reference material; this could comprise a contribution of 0.2 °C from the sample (material), 0.1 °C from the measurement of the property, and 0.0 °C from the presentation/communication of the property data.

  3. 3.

    Presentation/communication of the property data

    The presented/communicated property data must include the specification of the material (composition, preparation…), the value or value function of the property including uncertainty, and maybe also the measurement method used and a reference where further information is found. In general, the use of material property data just from data tables is thus not recommended. Usually, neither the precise composition (e.g. purity) is given, nor the uncertainty of the measurement, and also not how values were read from the measured curve, meaning what given data refer to exactly. All these effects are reflected by the range of values found for cp, Tpc, and Δpch of materials in literature.

How cp, Tpc, or Δpch were determined from the measured data is especially critical if the measured data show a phase change temperature range. Let’s first look at the origin, and then on how to deal with it.

If the true behavior of a material is phase change in a temperature range (Fig. 2), the result of a measurement (if having sufficiently dense data points) shows the phase change in a temperature range. In that case, a single value of Tpc does not exist. The RAL and the IEA standard, who focus on data for heat storage applications, therefore require to give the stored heat in temperature intervals, e.g. of 1 °C, thus Δh for each interval. This is the thermal effect in any application and avoids misunderstandings. The reason for a true behavior with a phase change temperature range are impurities in the material, in the extreme case even leading to phase separation during phase change. To avoid this, thus to have for true behavior an ideal phase change (Fig. 1), commonly certified reference materials are highly pure. Their true behavior on phase change is a single value of Tpc, and Δpch is the corresponding step in h(T).

However, even if the true behavior is ideal, this is still often not the direct result of a measurement. One reason is the necessary T-gradient within the sample for heat transfer in or out of its full volume. The sample is thus never fully isothermal. However, its temperature is detected only at a single point. In a scanning measurement with constant dT/dt, as used commonly in hf-DSC, this can lead to significant artificial peak widening, thus that even if the true behavior of the sample material is ideal phase change a significant peak width is observed in the measurement result. This is why many common reference materials have a high thermal conductivity, meaning they are of the class of metals. In addition, the sample temperature is determined e.g. at the surface of the sample crucible (in hf-DSC sometimes even the reference crucible), such that there is an additional temperature shift along the thermal resistance in between. Thus, for measurements with hf-DSC, small dT/dt are recommended to assure close to isothermal conditions, or optionally to use ΔT steps that limit the effect to the step size. The effects are generally well known; Hiebler [35] describes and studies it specifically for hf-DSC. The slow heating used in ASC limits the effect such that it does not become relevant. Only in AC measurements that use ΔQ steps an ideal phase change can be observed at a single temperature Tpc.

Consequently, that measured data show a phase change temperature range is quite common, and practically always the case if using hf-DSC, even if measuring highly pure materials. Thus, it is necessary to discuss how cp, Tpc, or Δpch are determined from the measured data. A measurement result with a finite peak width shows Fig. 5: h(T) and corresponding cp(T), incl. characteristic temperatures.

Fig. 5
figure 5

Measurement result with a finite peak width: h(T), corresponding cp(T), and ways of reading data

Values for Tpc, Δpch, and cp, to be read from the experimental results, are often hard to identify. Most critical are Tpc and Δpch; less critical is cp outside the phase change because there cp(T) changes little. The extrapolated onset, peak, or offset of cp(T) are e.g. used to “define” Tpc (Fig. 5). The onset does not vary with heating rate, thus it is often used if the heating rate strongly affects the peak width. If impurities have a significant effect on the onset of heating, but not the heating rate (e.g. in AC or ASC) other points in the peak are used (e.g. Table 1). For Δpch, one option is to use the whole enthalpy change from start to end, as indicated by the shaded area in Fig. 5 at the top. But if using hf-DSC, with increasing heating rate the end of phase change is shifted further into the range of the liquid phase, as discussed above, thus Δpch between the start and end T increases with the heating rate (and the same for cooling). The same holds if the on- and offset temperature are used. This is quite common when using hf-DSC, including for calibration. For example, at a cp of 2 J/(g⋅K) a peak width of 1 K already gives 2 J/g, thus a difference of 1% for a phase change enthalpy of 200 J/g. It is thus common to integrate the peak just to a peak baseline, as shown in Fig. 5 center or bottom. This results in Δpch values that vary far less with the heating rate. Different methods exist to construct a peak baseline. E.g. based on the assumption of having a true ideal phase change, the peak width in cp(T) reflects a gradual change of phase such that the fraction of the shaded area of the peak on the left and right of a temperature gives the fraction of the phases; this is called the 1/F (extrapolation) method [20, 36]. If the phase change is not ideal, it just reflects the fraction of the peak’s enthalpy. Another option shows Fig. 5 bottom, assuming constant cp to the peak temperature.

Table 1 Water data from Leys et al. [20]

The effect is small if the peak is sharp, thus materials with high thermal conductivity are preferable as reference material. However, this is not the case with materials like water, hexadecane, or octadecane. Also, measurements in DSC or T-history often give different curves on heating and cooling (Fig. 7), and even different curve shapes compared e.g. to pASC (tetradecane in Figs. 7 and 8) due to differences in heat flow and temperature measurement. Data reading and presentation/communication is thus susceptible to significant misinterpretations. A solution of the problem comprises two things. First, measurements must be in a way that assures the sample is sufficiently close to isothermal conditions to reduce the peak width, and materials should be pure. Second, the values of Δpch and Tpc must be accompanied by a description of the way they were determined from the measured curves.

4.1.2 Suitable calorimetric methods, equipment, and available literature data

An uncertainty of 2 % in amounts of heat and 0.1 °C in temperature or better, as needed, cannot be attained with most calorimetric methods and equipment. Hemminger et al. [37] discuss this, and they recommend for high precision calorimetry to use adiabatic calorimetry as the prime choice. As adiabatic calorimeters are rare worldwide, they recommend, if a larger uncertainty is acceptable, DSC calibrated by absolutely measured reference materials, or calibrated by an electrical source, if possible.

Therefore, a search was performed by the first author in journal publications and on institute websites to find well-documented measurements by AC, or people using AC being able to do measurements. While few well-documented measurement results are published, the research group of the coauthors at KU Leuven, which uses adiabatic scanning calorimetry (ASC) was found. The ASC is self-developed, built, and tested, using a Peltier element to assure adiabatic conditions, thus called pASC. Leys et al. [18] describe it in detail, and state that the uncertainty on the absolute values of specific heat capacity and enthalpy is about 2%. The pASC has been tested on several certified reference materials for Tpc and Δpch. For gallium, Leys et al. [18] measured two samples, both giving 29.79 °C while Della Gatta et al. [4] report (certified reference materials, Tab. 2) a value of 29.780 °C, respectively 82.4±2 J/g and 82.1±2 J/g compared to 80.14±0.17 J/g which is reported by [4]. For biphenyl, Leys et al. [19] report values of 68.98 °C, 68.99 °C, and 68.97 °C compared to 68.93 °C given by Della Gatta et al. [4], and respectively 122.9 J/g, 122.5 J/g, 122.1 J/g compared to 120.6±0.8 J/g. Leys et al. [19] report also results for phenyl salicylate and a comparison with values given by Della Gatta et al. [4]. Altogether, all measurements show a deviation from the certified reference value for Tpc of less than 0.1 °C, and for Δpch of about 2 % or less. This is within the required accuracy as discussed previously.

Table 2 Values for dodecane, from the NIST Chemistry WebBook (the values in J/g have been calculated here using the molar weight of 170.3348 from the NIST Chemistry WebBook)

Further measurements on several other materials have been published by the research group at KU Leuven, including uncertainty of the results, composition of the materials, effect of change of composition etc. Some of these materials seem suitable to be used as reference materials for calorimetric measurements on PCM. These serve as a basis for the further discussion; where available, suitable data on other materials and measurement results of other groups are added.

4.2 New options for reference materials for T pc, Δpc h

In general, materials with close peaks that cannot clearly be resolved into separate peaks by the calorimetric measurements used for PCM are not useful; peaks should be clearly at least 5 °C apart. Water, hexadecane, and octadecane have already been used occasionally to test calorimetric equipment used for measuring PCM, so these could be useful candidates. Water seems to be especially suitable as it is already used as reference material, however just for Tpc and cp, and not for Δpch, so it fulfills many requirements. And it is also a PCM and has similar properties to many other PCM. Besides hexadecane and octadecane, the n-alkanes are in general materials that are very promising; they are inert, and they are already used as PCM. Hexadecane and octadecane are just two examples. We will start looking at these candidates.

4.2.1 Water

Water seems to be suitable as it is already used as reference material, however just for Tpc and cp, and not for Δpch, so it fulfills many requirements. And it is used as a PCM, and has similar properties to many other PCM. Data of cp, Tpc, and Δpch can be found in many sources. For cp(T) and Tpc, a detailed discussion is found in [5], including uncertainty and required purity. For Δpch, which is not a certified reference value, Dickinson et al. [38] present an early review. Dickinson et al. [39] then describe a correction of previous measurements on water, which was purified by repeated distillation and measured by several methods; the result was 79.74 cal/g, being equal to 333.9 J/g (1 cal = 4.187 J). Osborne [34] later gives a value of 333.5±0.2 J/g, which is still in close agreement. Later, Archer [13] published new data consistent with the older literature and suggested to use water as reference material also for Δpch.

Leys et al. [20] present pASC measurements (heating) of highly pure (Fisher) water (Fig. 6, left), and comparison with water of different purity (right): Spa water and a diluted salt solution with 0.1388 mass% of NaCl. The results (Table 1) for Δpch all agree within 2% with the literature values given above. The variation of purity shows that within the required uncertainty for Tpc and Δpch, the required purity (composition) is not critical. Tpc, determined from the intersection of the extrapolations of the h(T) data in the upper transition region and the higher temperature phase, also agrees very well.

Fig. 6
figure 6

Source: [20]

Results of pASC measurements on highly pure (Fisher) water (left), and comparison with water of different purities (right): Spa water and a diluted salt solution with 0.1388 mass% of NaCl

Water has a repeatable and reproducible behavior, a suitable value of Δpch and Tpc in the required range, is available in the required composition at acceptable price, and information on how Tpc, and Δpch vary with composition is available. It poses no hazards, has a low vapor pressure, and is compatible with containers and air exposure (hygroscopic, oxidation…). The measurement of the property for Δpch shows a variation by composition < 1% (Fischer and Spa water) and ≤ 2% by uncertainty of the value (comparison of the values from [20] with those of [34]), for Tpc a variation by composition < 0.2 °C and < 0.1 °C by uncertainty of the value. The presentation/communication of the property data includes the specification of the material (composition, preparation…), the value or value of the property including uncertainty, and also the measurement method used and a reference where further information is found. Water, as already suggested by Archer [13], is thus useful as reference material for measurements of Tpc and Δpch, e.g. on PCM.

4.2.2 Overview on n-alkanes

Fig. 7 shows an overview on h(T) curves of a number of n-alkanes that have phase transitions between − 20 and + 60 °C, taken from [40], which is the final report of a public funded project.

Fig. 7
figure 7

Source: [40]

h(T) curves from DSC measurements on n-alkanes with a chain length of C-atoms between 12 and 28, at the top for even numbers, and at the bottom for odd numbers

The data show h(T) curves from heating and cooling measurements using hf-DSC on n-alkanes with a chain length of C-atoms between 12 and 28 (C26 and C27 are missing), at the top for even and at the bottom for odd numbers. The heating and cooling rates were chosen between 0.5 and 2 °C/min (based on the RAL criterion that a sufficiently small rate is reached when, after halving the heating rate and halving it again, the temperatures at the inflection points of the heating measurements deviate no more than 0.2 K from each other).

The results can be used to make three groups:

  • Group 1 shows single transitions; it comprises C12, C14, C16, and C18.

  • Group 2 shows transitions more than 5 K apart; it comprises C13, C15, C17, C19, and C21.

  • Group 3 shows transitions less than 5 K apart; it comprises C20, C22, C23, C24, C25, and C28.

For the search for new reference materials for Tpc and Δpch, the first group with single transitions is most promising. It includes hexadecane, which is already used to calibrate the 3-layer calorimeter, and also octadecane, which was used in the IEA intercomparison test for hf-DSC as the sample material. The second group is less promising, because it is only useful if the user of a potential reference material can clearly separate the peaks; this should still be possible with calorimetric methods used for PCM if used properly. The third group has transitions less than 5 K apart, which are not promising.

The following discussion focuses on the group 1, materials with single transitions and thus most promising. It comprises C12, C14, C16, and C18. In Tables 25 typical values from the NIST Chemistry WebBook are presented; further data can be found in a review by Acree and Chickos [41].

4.2.3 n-alkane dodecane C12

No reliable source for data of Tpc and Δpch with high accuracy was found, including purity and effect of its variation, thus no reliable data for the use of dodecane as reference material for Tpc and Δpch. [42] lists the values shown in Table 2. The values of Δpch vary by 5.74 J/g between max. and min. (209 J/g and 214.74 J/g), thus about 2.7%, while those of Tpc vary by 0.5 K between max. and min. (263.1 K and 263.6 K). For Tpc [42] also lists a value for separate data (without Δpch), which is 263.5 ± 0.3 K (approx. − 9.6 °C, average of 35 values). The data scatter significantly compared to the desired uncertainty of 2% in amounts of heat, and far compared to the 0.1 °C in temperature.

4.2.4 n-alkane tetradecane C14

For tetradecane [43] lists the values shown in Table 3. Values of Δpch vary by 12.18 J/g between max. and min. (227.18 J/g and 215 J/g), thus about 5.5%, while those of Tpc vary by 0.7 K between max. and min. (279.03 K and 278.3 K). For Tpc [43] also lists a value for separate data (without Δpch), which is 278.7 ± 0.9 K (approx. 5.6 °C, average of 31 values). The data scatter rather far, compared to the desired uncertainty of 2% in amounts of heat and the 0.1 °C in temperature.

Table 3 Values for tetradecane, from the NIST Chemistry WebBook (the values in J/g have been calculated here using the molar weight of 198.3880 from the NIST Chemistry WebBook)

The data from [43] do not show if a value of Tpc or Δpch has a high accuracy or not, not the purity of the sample, and thus also not the effect of its variation on Tpc or Δpch.

In an early attempt at KU Leuven in 2013 [44] to apply the novel pASC to first-order phase transitions, measurements were carried out on a C14 sample of 66.56 mg (from Sigma Aldrich, 99.5% pure) in a standard DSC cell. Only data for a heating run were obtained, resulting in a transition heat of 223 ± 2 J/g. Recently, new results were obtained with pASC at KU Leuven on a new C14 (Aldrich 99 + %) 82.0 mg sample, both in heating and cooling at an average heating rate of about 15 mK/min and average cooling rate of about 24 mK/min (average rate because the power is kept constant but the rate is inversely proportional to the heat capacity). In Fig. 8 results for the specific enthalpy h(T) and the corresponding effective specific heat capacity cp(T) are displayed. The transition temperature obtained on heating is 5.60 ± 0.03 °C (278.75 ± 0.03 K). The same transition temperature is obtained on cooling, however, only after first supercooling by 3.7 °C to a temperature of 1.9 °C. To obtain the transition heat the I/F method [18, 36] was applied to the heating data. This resulted in a value of 229.7 J/g for the heat of fusion. This value corresponds well with two values of Table 3, but is also about 5% larger than the DSC value given in that table. If one derives a heat of fusion from the cooling data of Fig. 8 a 2% larger value is obtained, which is acceptable in view of some thermal instability encountered during rapid solidification from supercooling.

Fig. 8
figure 8

Specific enthalpy (top) and specific heat capacity (bottom) as a function of temperature from pASC heating and cooling runs on a 82 mg sample around the solid to liquid phase transition of n-tetradecane (C14)

Can tetradecane now be recommended as a reference material? Tetradecane has a repeatable and reproducible behavior, suitable values of Tpc and Δpch, and is available in the required composition (e.g. purity) at an acceptable price. It has a flash point of 100 °C and a boiling point of 254 °C, and has good compatibility with containers and air exposure (hygroscopic, oxidation…). These properties make tetradecane an interesting candidate. Reliable data for Tpc and Δpch with high accuracy, including purity and effect of its variation, are however only available from KU Leuven. And here the values of Δpch discussed above (223 J/g and 229.7 J/g) vary by about 3%. Thus, it seems advisable to do more research on the effect of purity, before tetradecane could be used as a reference material.

4.2.5 n-alkane hexadecane C16

Hexadecane, as mentioned, has already been used as calibration material for the 3-layer calorimeter, and to test T-history equipment by Lazaro et al. [9].

No reliable source for data of Tpc and Δpch with high accuracy was found, including purity and effect of its variation, thus no reliable data for the use of hexadecane as reference material for Tpc and Δpch. [45] lists the values shown in Table 4. The values of Δpch vary by 25.67 J/g between max. and min. (235.64 J/g and 209.97 J/g), thus about 11.6%, while those of Tpc vary by 1.4 K between max. and min. (292.1 K and 290.7 K). For Tpc [45] also lists a value for separate data (without Δpch), which is 291 ± 1 K (approx. 17.9 °C, average of 52 values). The data scatter rather far compared to the desired uncertainty of 2% in amounts of heat and 0.1 °C in temperature. For comparison, the values used by Lazaro et al. [9], taken from [32], were 18.1 °C, thus 291.25 K, and 236 J/g.

Table 4 Values for hexadecane, from the NIST Chemistry WebBook (the values in J/g have been calculated here using the molar weight of 226.4412 from the NIST Chemistry WebBook)

4.2.6 n-alkane octadecane C18

Octadecane was already used in the IEA intercomparison test for hf-DSC as the sample material. But no reliable source for data of Tpc and Δpch with high accuracy was found, including purity and effect of its variation, thus no reliable data for the use of octadecane as reference material for Tpc and Δpch. [46] lists the values shown in Table 5. The values of Δpch vary by 6.27 J/g between max. and min. (242.47 J/g and 236.2 J/g), thus about 2.6%, while those of Tpc vary by 0.5 K between max. and min. (301.5 K and 301.0 K). For Tpc [46] also lists a value for separate data (without Δpch), which is 301.0 ± 0.7 K (approx. 27.9 °C, average of 35 values). The data scatter significantly compared to the desired uncertainty of 2 % in amounts of heat and rather far compared to 0.1 °C in temperature.

Table 5 Values for octadecane, from the NIST Chemistry WebBook (the values in J/g have been calculated here using the molar weight of 254.4943 from the NIST Chemistry WebBook)

Data from [46] do not show if a value of Tpc or Δpch has a high accuracy or not, not the purity of the sample, and thus also not the effect of its variation on Tpc or Δpch. This is better in other data.

The sample used in the IEA intercomparison test for hf-DSC was a commercial Parafol 18-97 (97%). The Fraunhofer ISE, who took part in the intercomparison test and also organized it, has in the meantime measured samples of various purity: Parafol 18-97 (97 %) from Sasol and n-octadecane 99.5% from Alfa Aesar. The following values shown in Fig. 9 are from their measurements with a MikroDSCIII (Calvet DSC), sample masses 234 mg (99.5%) and 272 mg (97%), 3rd cycle, 0.05 K/min.

Fig. 9
figure 9

Values of h(T) for octadecane of different purity, measured by the Fraunhofer ISE using a MikroDSCIII (Calvet DSC); the values are normalized to 0 J/g at 30 °C

As discussed in section "Suitable calorimetric methods, equipment, and available literature data", DSC calibrated by absolutely measured reference materials, or calibrated by an electrical source, is preferable. The MikroDSCIII is calibrated for heat flow by an electrical source (Joule effect calibration). It was again tested before the measurements on octadecane, using water, Ga, and In, and the result was about 1.5% too high enthalpies (heat of fusion), using a sigmoidal peak baseline.

The results in Fig. 9 show for heating and for cooling a significant difference (hysteresis), indicating that the samples were still not fully isothermal (still lower heating/cooling rates are however prohibitive). There is the expected difference due to purity in the onset of melting. In addition, the sample with the lower purity also seems to have a slightly lower enthalpy change, in the order of 5 J/g. The peak integral, using a peak baseline by extrapolation of cp in the solid, is about 240 J/g. The value, and the variation with purity, corresponds well with the values in the NIST Chemistry WebBook.

Based on these results a discussion of the suitability of octadecane as reference material is possible. Octadecane, as is well known, has a repeatable and reproducible behavior, suitable Δpch above 150 J/g, and Tpc in the required range. It is available in the required purity at acceptable price. Using the new measurements from the Fraunhofer ISE information how Tpc and Δpch vary with purity is available. Octadecane poses no hazards, has low vapor pressure, high flashpoint, and is compatible with containers and air exposure (hygroscopic, oxidation…). The values from the Fraunhofer ISE show a Δpch variation with purity from 97 to 99.5% by about 2% (5 J/g of 240 J/g); a limitation to < 1% at > 99% purity thus seems probable. The uncertainty of a measurement is probably in the order of 2%, as deviations by 1.5% for the enthalpies of water, Ga, and In in the preceding tests show. Still investigated should be how the peak is integrated, due to the significant peak width. In contrast to Δpch, the data from the measurements by the Fraunhofer ISE do not allow a conclusion regarding Tpc. The data in [46] from an average of 35 values show approx. 27.9 ± 0.7 °C, and this range is well within the options when reading a value from the curves in Fig. 9 from an onset or offset. The cause of this should be the variation with composition for the onset, and for the offset (of the heating curve) the fact that the sample might not have been sufficiently close to isothermal conditions. The latter is a general problem when using DSC with a constant heating rate, independent of the material. However it is not a general problem of calorimetric methods used for measuring PCM. Therefore, overall, octadecane could possibly be useful as reference material for Δpch as well as for Tpc.

4.2.7 Summary and further options

The n-alcanes C12, C14, C16, and C18, would probably be suitable as reference materials for Tpc and Δpch at about − 9.6 °C, 5.6 °C, 17.9 °C, and 27.9 °C, as soon as for all of them suitable measurement data are available. In addition comes water at 0 °C. Looking at Fig. 7 it is thus reasonable to skip the n-alcanes C13, C15, C17, and C19, as they would cover the same temperature range while having somewhat more complex melting behavior; they were all summarized in group 2. Left from group 2 is only C21, with a Tpc somewhat higher than those covered by C12, C14, C16, and C18. The group 3 materials were already excluded because of close double peaks.

Nevertheless, the temperature range between about − 10 °C and about 30 °C would be covered well. Below this temperature range there is currently only mercury, which is not safe when large samples are needed, thus there is still demand for options for reference materials at temperatures below − 10 °C. Regarding the temperatures above 30 °C, in section 3 it was discussed why biphenyl might be suitable for larger samples, while naphthalene is not considered safe. Indium in large amounts is too expensive. So there is still demand for options for reference materials at temperatures above + 30 °C.

Other classes of materials that are used as PCM, and available in high purity, are the n-alkanols and the n-carboxylic acids. However, the materials from both groups are polar, thus should be hygroscopic, which can cause trouble in keeping them free of water. In addition, the functional groups make them more reactive. In this first attempt to find new options for reference materials they are thus not investigated. Due to the required reproducibility, no salthydrates, no salt-water eutectics, and no salts for higher temperature are considered as reference materials for Tpc and Δpch; these materials are all hygroscopic and well known for difficulties in controlling their composition.

4.3 New options for reference materials for c p

Besides new options for reference materials for Tpc and Δpch, it is still worth looking again at options for cp, which can also be used to calibrate amounts of heat.

4.3.1 Water-salt mixtures

Water is a certified reference material for cp, but limited by the solid-liquid and liquid-vapor phase transition to temperatures between 0 and 100 °C (Fig. 4). Addition of salts is known to reduce the freezing point and to raise the boiling point (reduce the vapor pressure). At the same time, cp could still be not much lower than for pure water. By adding salt to water, its applicable temperature range as a reference material for cp might thus be extended.

Leys et al. [20] measured eutectic and diluted mixtures of NaCl in water. Figure 10 shows the reduction of the freezing point, down to about − 21.1 °C where the eutectic point is, such that the liquid phase is extended more than 20 °C below zero.

Fig. 10
figure 10

Source: [20]

Overview on h(T) of NaCl-water mixtures up to the eutectic at 23.2 wt%

The overview in Fig. 10 also shows again some things discussed earlier, related to Fig. 5. First, the addition of some impurities (e.g. 0.139% compared to 0%) readily changes the temperature behavior at the onset of melting but not at its end, and second, the enthalpy difference between an extrapolated cp between liquid and solid phase is also initially not much affected. Thus these approaches can lead to the values of Tpc and Δpch of the pure substance, here water, in case of a practically isothermal sample. If the sample is not isothermal, as for DSC measurements with constant heating rate typical, the end of melting is shifted to higher temperatures, e.g. in Fig. 9 to about 29.5 °C instead of just about 27.9 °C.

For a better evaluation the h(T) graph in Fig. 10, from the original publication, is not very suitable. Thus, Fig. 11 shows the specific heat capacity cp(T), calculated from the data from Leys et al. [20]. The data are consistent with those of a more extensive data set found in [47], which show cp(T) measured on cooling even into the supercooled region, in contrast to the data from Leys et al. [20] that have been measured on heating from below the eutectic temperature (− 21 °C) starting from the solid phase. In the region of overlap with the Archer data the agreement is better than 3%.

Fig. 11
figure 11

Specific heat capacity cp(T) of NaCl-water mixtures of various NaCl wt%, calculated from the data from [20]. The open circles represent the end of the melting peak in heating runs.

The exact eutectic composition of water-salt mixtures is hard to prepare, because they are hygroscopic, and already the base salt for the mixture is rarely completely water free. Therefore, they do not seem to be suitable as reference materials for Tpc and Δpch. However, for cp of the liquid phase the composition does not seem to be too critical. For example, in Fig. 11 at about 0 °C the value of cp changes from about 4.2 J/(g·K) for the pure water (0%) to about 3.3 J/(g·K) for the 23.2% solution. The change of 0.9 J/(g·K) in cp per 23% change of salt concentration means roughly on average 0.038 J/(g·K) per % of salt, and with an average cp of the solutions of some 3.8 J/(g·K) it means roughly 1% change in cp per 1% change in salt concentration. Preparing mixtures with sufficiently accurate concentration should be possible, such that a near eutectic composition might be suitable as a reference material for cp in its liquid phase, at least to temperatures down to may be − 15 °C (Fig. 11). This would already be a significant improvement e.g. for T-history measurements, because currently, if using liquid water as reference for cp, temperatures below 0 °C cannot be reached.

Besides NaCl, of course also other salts could be suitable. However, water-salt mixtures can be corrosive depending on where they are used. Thus it is worth looking at two more options for reference materials for cp, besides water-salt mixtures: water-alcohol mixtures, and the n-alkanes.

4.3.2 Water-alcohol mixtures

Water-salt mixtures, as with NaCl, are corrosive to many metals, thus a second, similar option could be interesting: water-alcohol mixtures. Several of them are known as heat transfer fluids like water-salt mixtures, e.g. mixtures of water with methanol, ethanol, ethylene glycol, or propylene glycol.

For example, according to the Engineering ToolBox [48], a mixture of 60 vol.% ethylene glycol in water does not freeze until − 52 °C, has at 0 °C a density of 1.11 g/cm3 and a heat capacity of 3.027 J/gK, and at − 40°C a density of 1.12 g/cm3 and heat capacity of 2.847 J/gK. The cp at 0 °C is just 3.266 J/gK for 50 vol.% and 2.931 J/gK at 65 vol.%, thus varies just about 0.022 J/gK per vol.%.

4.3.3 n-alkanes

Figure 4 shows that liquid heptane C7 is a certified reference material for cp [5], however it has a flashpoint of only 4 °C, which makes it unsafe if large amounts of vapor can form. Nevertheless, it is straightforward to look for higher temperatures at n-alkanes with longer chain length. For example, hexadecane has a boiling point of 287 °C and a flash point of 135 °C, and octadecane has a boiling point of 316 °C and a flash point of 166 °C. They are thus much safer to handle.

Vélez et al. [49] studied n-hexadecane C16, n-octadecane C18 and n-eicosane C20 by hf-DSC, up to about 70 °C, with good agreement to values of other references, and determined a polynomial expression for the cp in the solid and liquid phase for each material. Vélez et al. [50] repeated the same for n-pentadecane C15, n-heptadecane C17 and n-nonadecane C19 by hf-DSC. While the data shown in Fig. 7 give the impression as if the value of cp (inclination of h) in the liquid phase is the same for all materials, the data of [49] and [50] show very similar cp for all six n-alkanes in the liquid phase at the same temperature, but with a slight, continuous increase of cp with the temperature. It could thus be speculated that cp in the liquid phase of n-alcanes is practically independent of the chain length; in literature, usually the same value is found for all paraffins. And maybe this then even applies to mixtures of them, which would reduce the demand of purity when taking a sample as reference. One of the references that were used by Vélez et al. [49] and [50] to compare their measurement results with those of others is van Miltenburg [51]. He gives data for the molar cp of liquid n-alkanes from pentane C5 to hexacosane C26. More important, he gives a function of temperature and number of carbon atoms in the chain that describes the experimental data to within the experimental error (the average percentage deviation is 0.18%):

$$ c_{p} = \left\{ {43.9 + 13.990 \cdot \left( {n - 1} \right) + 0.05430 \cdot \left( {n - 1} \right) \cdot T} \right\} \cdot JK^{ - 1} mol^{ - 1} $$
(5)

This allows a comparison of the molar cp values as a function of temperature and number of C-atoms, and using the molar mass then the mass specific cp. The result for the range of n covered by [51] shows Table 6. It shows that for n ≥ 10 the difference between alcanes at the same temperature is within 2%.

Table 6 Mass specific cp in J/gK of n-alkanes calculated using the function from van Miltenburg [51]

It could thus be interesting to study the n-alkanes further, also regarding the sensitivity of their cp in the liquid phase to impurities from other n-alkanes. A significant drawback of the n-alkanes, compared to water-salt solutions, is however the lower specific heat capacity per mass, and the lower density, which in combination lead to much lower specific heat capacity per volume; thus also per sample.

5 Summary and conclusions

Calorimetric measurements to determine the ability of PCM to store heat as a function of temperature are performed with a variety of methods and instruments. Some of them have been developed only in the last decade, especially to allow characterization of larger samples, even encapsulated PCM with volume in the order of 100 ml and larger. In this paper, we gave an introduction, review, and recommendations for new reference materials for calibration and testing, needed to cover the full range of temperatures and sample sizes. Specifically, the calorimetric methods used for PCM were discussed, the different uses of reference materials, available certified reference materials, and their limitations when making measurements on PCM. The requirements on reference materials were discussed, specifically for calorimetric measurements on PCM, and some potential new reference materials were suggested in addition to those shown in Fig. 4.

For Tpc and Δpch, water at 0 °C is useful as already measurements with the required uncertainty etc. are available. Another option are the n-alkanes, specifically dodecane C12 at about − 10 °C, tetradecane C14 at about 6 °C, hexadecane C16 at about 18 °C, and octadecane C18 at about 28 °C. These would cover the range from − 10 to about 30 °C well, especially when large sample masses are needed. Despite the fact that they are not certified reference materials, water, C14, and C18, where detailed information was found, are a big step forward compared to using materials and values just from some reference, with unknown purity, uncertainty, and information regarding how the values were determined from the measured data. However, specifically for C12 and C16 values with sufficiently low uncertainty are yet not available. While C12 would close the gap at low temperatures, several materials are needed to close the gap to indium (biphenyl might be a useful candidate), and especially beyond indium at higher temperatures. For cp, water-salt mixtures, water-alcohol mixtures, as well as n-alcanes in the liquid phase might be an option, each having specific advantages and disadvantages. In any case, for all options some more research seems to be needed. And especially for temperatures below − 10 °C and above 30 °C other material classes like n-alkanoles should also be investigated. They have not been considered in this first attempt here. At the end, a sufficient number of reference materials should be available to have the option to cover a temperature range of interest with at least 3 reference materials, one at the lower temperature end, one at the upper, and one somewhere in between.