1 Introduction

In recent years, damage has increasingly occurred in the case of bituminous surface layers, such as the loss of aggregates and decreased fatigue life of the base layers in various parts of Central and Western Europe. These damages cannot be attributed either to a severe weather phenomena or a dramatically increased traffic load. A possible cause of damage is a lack of quality of the asphalt mastic (bitumen + mineral fine fraction), like poor adhesion to the aggregates or overall lack in strength. This can be due to poor quality of one of the two components, bitumen and added filler or fines, respectively. Preliminary work [1,2,3] on this subject suggests that the mineralogical composition of the added filler or fines affects the quality of the mastic and thus the adhesion to the coarse aggregates and strength. It can be assumed that a lack of quality of the added filler or fines is responsible for premature damage. Possibly, the use of non-quality-assured fines from the fine and coarse aggregates instead of added filler or mixed filler can be accounted to these damages. This study aims to develop a suitable test to assess the quality of asphalt mastic with regard to durability (fatigue life).

The European Standard EN 12697-24 [4] defines fatigue as “…the reduction of strength of material under repeated loading when compared to the strength of a single load”. Fatigue is the progressive and localized structural damage that occurs when a material is subjected to cyclic loading below the stress limit until the state of serviceability limit or total failure is reached.

This effect also occurs in asphalt pavements and affects all asphalt layers. Due to continuous heavy goods vehicle (HGV) traffic tensile stress occurs in the base layer and thus micro-cracks accumulate. In the case of repeated loads, these cracks are propagated upwards and are linked to macro-cracks, which are visible. If these cracks finally pass through to the top layer, a cracking pattern typical of fatigue damage occurs. Another form of fatigue damage affects the asphalt surface. The coarse aggregates are held together by asphalt mastic and under repeated (shear) stresses, poor mastic leads to loss of aggregates on the surface. While there are standardized testing methods available for bitumen, aggregates and hot-mix asphalt (HMA), there is no testing method available to assess the durability of asphalt mastic. Thus, several researchers already published their work to account the fatigue performance of asphalt mastic or mortar (bitumen + filler + sand) in different testing setups and machines, like DSR [2], Annular Shear Rheometer (ASR) [5,6,7,8], Dynamic Mechanical Analysis (DMA) [9,10,11] and Tension Compression Tests [12, 13]. All of these tests are based on applying dynamic loads (shear or tension–compression) onto bitumen, mastic or mortar specimens with different shape at different temperatures, frequencies and loading modes. Table 1 gives an overview of published work, applied setups and testing parameters for DSR and DMA. DMA consists of a DSR with special clamping system instead of parallel-plate or plate and cone geometries.

Table 1 Publications of dynamic tests for bitumen, mastic and mortar

2 Approach and objectives

The 4-Point Bending-Beam Test (4PBB) according to EN 12697-24 [4] is one of the standardized fatigue tests for HMA. Since the aim of this paper is the development of a fatigue test for asphalt mastic, the 4PBB test is described in more detail for a better understanding of fatigue. In the 4PBB test, a prismatic beam is supported at 4 points and is dynamically loaded at the two inner bearings. The sample is pulled upwards and pushed downwards until a defined deformation/strain is reached in the centre of the beam. The force required for the predetermined strain is measured and the dynamic modulus is calculated continuously. Therefore, the 4PBB test is a so called strain-controlled test. Due to the increasing number of load cycles during the test progress the beam becomes weaker and the dynamic modulus is getting lower. As soon as the dynamic modulus reaches half of its initial value at the beginning of the 4PBB test the fatigue criterion is met. Tests have to be carried out at 3 different strain levels with at least 6 samples each level to maintain a limited statistic certainty. This results in at least 18 samples that have to be tested. Including the mixing of HMA, sample fabrication and preparation for testing, it takes about 3 weeks to complete all necessary steps for one single HMA. Thus, the 4PBB test is a time consuming test and it is also not applicable to the testing of asphalt mastic or mortar.

This paper aims to find a fatigue testing method suitable for asphalt mastic applicable on a device that is already available in most of the commercial laboratories of pavement engineering. The dynamic shear rheometer (DSR) was found as a suitable device with good availability due to the fact that it is used for binder grading. The objectives of this paper are:

  • Development of a fatigue test with new specimen geometry applicable to all standard DSR setups used for binder grading.

  • Study into test repeatability.

  • Assess the fatigue performance of asphalt mastic with particle size < 125 µm.

3 Development of the fatigue test

3.1 DSR loading mode and fatigue failure criterion

DSR are capable of applying oscillatory stress- and strain-controlled loads on small specimens. Deciding between stress- and strain-controlled loading modes, the stress-controlled mode is in favour because it is more comparable to what happens on site. Taking a HGV passing an asphalt pavement into account, the induced stress is independent, while the strain is a function of the stiffness of the material. Under strain-controlled loading mode stiffer materials are subjected to higher stress levels than materials with lower stiffness. Changing the loading mode also requires changing the fatigue criterion. While 50% of the initial modulus is used as a criterion for strain-controlled tests, the true failure of the specimen is applied to stress-controlled tests. Figure 1 shows a typical fatigue curve under stress-controlled loading mode obtained by DSR within this paper.

Fig. 1
figure 1

Fatigue curve under stress-controlled loading mode

A fatigue curve obtained by stress-controlled testing consists of four typical phases: Phase (1) is the adaption phase dominated by thixotropy [25, 29] and limited heating of the specimen caused by energy dissipation (both causes a reversible loss of stiffness). Phase (2) consist of a continuous propagation of micro-cracks. In Phase (3) these micro-cracks are linked to macro-cracks until in Phase (4) these macro-cracks are linked to a yield line. Finally, the specimen breaks due to fatigue failure. The failure is reflected in both, dynamic modulus and phase lag. In this study, the point of failure is determined by the sudden drop of the phase lag (δmax) as shown in Fig. 1. However, there are several other fatigue failure criterions such as 50% reduction in stiffness or Rowe’s criteria based on changes in dissipated energy. The fatigue tests are carried out in one single go without rest periods. Thus, healing effects are not accounted in this study. Healing effects are well known in literature and are depending on binder grading, temperature and applied stress or strain level [20, 28, 29, 32].

3.2 Issues and challenges with the DSR

During the development of the fatigue test three different DSR were in use for different reasons. These reasons can be accounted to the testing geometry and cooling capacity/efficiency and have to be described in detail because of the huge impact to the fatigue test:

  1. 1.

    Thermo Scientific HAAKE MARSII Modular Rheometer.

  2. 2.

    Anton Paar MCR300 Smartpave series with active Peltier-temperature-controlled sample hood (H-PTD120).

  3. 3.

    Anton Paar MCR302 with active Peltier-temperature-controlled sample hood and optional air flushing (H-PTD200).

Asphalt mastic or mortar is about 3–6 times stiffer than the neat binder. The stiffness is mainly dependent on the amount of aggregates (filler, fines) mixed with the binder but also the mineralogy, grading curve, grain shape and surface characteristics play an important role. The first DSR, a HAAKE MARSII has been employed for preliminary tests and was further proposed for all tests within this study. But this specific DSR is not suitable for fatigue tests for two reasons: First, the testing geometries are mounted with a thread and they get loose during long-term tests with stiff materials at high frequencies (Fig. 2a). Second, the drive engine is capable of enough torque but it is not durable and fails with an engine overheat error after about 10 to 30 min depending on the pre-set stress. The second DSR, a MCR300 is both, durable without any overheating issues and testing geometries are mounted with a patented mechanical coupling called QuickConnect. A MCR300 Smartpave series is equipped with a Peltier-temperature-controlled hood and testing geometries with a hollow plastic cylinder and a disposable measuring plate (Fig. 2b). While the plastic cylinder serves as a thermal barrier to the upper steel shaft, the disposable plate is used to measure curing materials like epoxy glue. The disposable plate is equipped with a thread used to fix the plate on the plastic cylinder. The adhesive used to fix the plate onto the cylinder was in focus because of its compliance at testing conditions. The disposable plate got even loose after some fatigue tests. Thus, it was necessary to apply a 2-component adhesive to fix the plate onto the plastic cylinder. Most of the fatigue tests have been carried out on this device until it became defective. Because of a discontinued support of this 15 year old DSR, there was no opportunity to get it back to a working state. During the fatigue tests on the old MCR300 a new MCR302 became available to the lab. In comparison to the MCR300 Smartpave series, the MCR302 is equipped with a stronger Peltier-temperature-controlled hood with optional air flushing. Preliminary tests on the new MCR302 have been carried out without additional air flushing. Due to the fact that the new testing geometries have no thermal barrier included (Fig. 2c) it is necessary to switch on the air flushing of the hood (200 Ln/h). The perfusing air is efficiently cooled down to target temperature of + 10 °C. Without air flushing there is a thermal gradient from the bottom plate to the testing geometry causing a lower initial dynamic modulus at the beginning of the fatigue test in comparison to the dynamic modulus obtained by the MCR300. This thermal gradient has also been confirmed by measuring the temperature with tiny thermocouples (Type K) inside a loaded sample showing about + 15 °C at the top interface from sample to testing geometry.

Fig. 2
figure 2

Testing geometries from a HAAKE MARSII, b MCR300 and c MCR302

Fatigue testing was continued after a homogeneous temperature distribution was ensured within the whole sample. These resulted in the same initial dynamic modulus for every mastic sample as obtained by the old MCR300. Expecting the same fatigue life (number of load cycles until failure), samples tested in the new MCR302 show a 1.5–2 times higher fatigue life and a better repeatability. While at least six fatigue tests were necessary to obtain a limited statistical certainty for the MCR300, three tests are sufficient for the MCR302. These phenomenons are related to the dissipated energy into the sample causing local heating due to friction and the different cooling efficiency of the sample hoods of both devices. The cooling efficiency of the sample hood with air flushing of the MCR302 is higher than of the MCR300. During controlled-stress testing mode, heating leads to lower stiffness causing higher strain rates and shorter fatigue life. The improved repeatability can be accounted to a more homogeneous temperature distribution within the sample and complete removal of any side effects from the outside (e.g. room temperature, airflow caused by room ventilation) due to the more efficient sample hood of the MCR300. It must be noted that fatigue tests in DSR have a very high sensitivity to the sample cooling, and fatigue response is highly depending on the test temperature. Standardized binder grading is not affected of this phenomenon because of the test requirements: First, because of the low frequency (1.592 vs. 30 Hz) and second, there is no need for continuous oscillation during stepwise temperature sweeps.

3.3 Preparation of asphalt mastic samples

Depending on aggregate gradation, air void and binder content, HMA is categorized into asphalt concrete (AC), stone mastic asphalt (SMA), mastic asphalt (MA) and porous asphalt (PA). The ratio of binder to filler or fines is different within these mix designs. A mixing ratio of 1 part binder to 1.5 parts of filler or fines by weight (m/m) is chosen to obtain a ratio almost similar to the ratio used in AC. In preliminary tests mixing ratios of up to 1:2.5 have been tested where rations from 1:2.0 show a decreasing repeatability. This can either be accounted to bad mixing quality or the grain shape. The higher the content of filler or fines the higher is the interaction level between each single grain. The higher the interaction level the more prone is a poured DSR specimen to the exact position of each grain. Both, the filler or fines and the binder are heated up to + 180 °C in a thermal chamber. The next step is to pour the binder into the metal can with the filler/fines inside and it is then stirred with a metal or glass rod until a homogeneous asphalt mastic sample is obtained. A temperature controlled heat gun is used to maintain the temperature during mixing. Stirring is continued during the cooling down period as long as it is possible to mix. The mastic samples are then immediately stored in a refrigerator at + 5 °C to avoid settling of particles after mixing. Two different fillers (limestone and quartzite) have been tested in this study, both of them with a maximum grain size of 125 µm. It is important to keep a certain maximum grain size when comparing fillers or fines because of the sensitivity to fatigue tests. As a binder a 70/100 paving grade bitumen with a PG 64-22 (continuous PG 65-25) is used. The performance grade (PG) was obtained by DSR (EN 14770 [34]) and BBR (EN 14771 [35]) and the binder is graded according to AASHTO M 320 [36]. Table 2 shows the binder properties at three different ageing levels (virgin, RTFOT and RTFOT + PAV) in terms of needle penetration (EN 1426 [37]), softening point ring and ball (EN 1427 [38]) and breaking point Fraass (EN 12593 [39]), respectively. It is known from literature that the stiffness of a binder has an impact to the fatigue life of the asphalt mastic [19, 20, 23, 30,31,32,33]. Due to the fact that only one binder is used in this paper this behaviour is not further accounted. However, different binders are part of the ongoing project. For the comparison of different binder grades it is necessary to perform the fatigue tests either at equi-viscous temperature or at several temperatures. The latter could make it possible to calculate life cycle costs based on fatigue life for a specific climate on site.

Table 2 Binder properties

3.4 Test parameters and sample geometry

While a temperature sweep is applied during binder grading the fatigue test is carried out at one single temperature only. The temperature is selected according to the creep stiffness of the mastic. The higher the creep stiffness is, the higher the applicable temperature can be chosen for the fatigue test. It is necessary to maintain a certain minimum stiffness at test temperature to avoid creep deformation [32]. In this study and in accordance to other researchers a test temperature of + 10 °C is selected (see Table 1). Fatigue performance testing requires a high number of load cycles until fatigue failure. It is recommended to select a high frequency to conduct these tests in a limited time to be economical. For that reason a testing frequency of 30 Hz is selected in this study. With 30 Hz it is possible to perform 108,000 load cycles within 1 h. It is possible to choose even higher frequencies as far as an appropriate DSR is available. However, higher frequencies cause higher dissipated energy and thus, more friction heating to the mastic sample.

There are two different parallel-plate testing geometries used for binder grading, PP08 and PP25. The numbers determine the diameter of the specimen and are applied to the upper (PP25) and lower temperature testing range (PP08) according to AASHTO M 320 [36]. Due to the fact that bitumen is a highly temperature-dependent material and DSR are limited in its applicable torque, PP08 has to be used for fatigue testing. In preliminary tests the standard specimen geometry shape (diameter 8 mm, height 2 mm) used for binder grading have also been used for fatigue tests. After extensive pre-tests, it can be concluded that a cylindrical specimen shape (even with a height of 3 mm) is not suitable for fatigue testing due to the fracture behaviour. In all these tests performed, the specimens failed either as pure interfacial/adhesion failure (Fig. 3a) or in a combination of adhesion and cohesion failure (Fig. 3b). However, several researchers published their work employing a cylindrical specimen shape for fatigue testing of pure binder or mastic [14, 16, 18, 20, 22,23,24,25, 28,29,30,31,32]. Some researchers came to the same conclusion regarding the disadvantageous use of a cylindrical specimen shape for fatigue testing [15, 21].

Fig. 3
figure 3

a Pure adhesion/interface failure, b partial adhesion/cohesion failure

A solution has been found in a redesigned specimen geometry (Fig. 4). This geometry is based on the PP08 testing geometry with a specimen height of 3 mm and a predetermined point of failure in the middle of the height. Higher specimens are not recommended due to the limited cooling capacity of the DSR (thermal gradient). The predetermination is realised by circular necking of the original diameter of 8 mm down to 6 mm. A small platform of 0.3 mm at both ends of the hyperboloid avoids unfavourable stress concentrations in the edges. In literature, researchers solved that issue with other specimen shapes like dog-bone or even larger cylinders with clamps or terminals at both ends [15, 21, 27]. These specimen shapes requires a DSR with a temperature-controlled chamber instead of a temperature-controlled hood. This leads to expensive, additional equipment not usual or even not available for standard DSR setups employed for binder grading.

Fig. 4
figure 4

Hyperbolic specimen shape

Figure 5 shows the shear stress distribution in the hyperboloid where the stress goes from higher (red) to lower levels (blue), calculated by finite element analysis software Abaqus. With this geometry it is possible to obtain true cohesion failure within the mastic specimen.

Fig. 5
figure 5

FE model of the hyperbolic specimen shape

3.5 Specimen preparation in the DSR

The specimen preparation has to be carried out directly in the DSR because it is not possible to trim a hyperbolic shape by hand. A reusable silicone mould is employed to ensure an accurate specimen. The silicone mould is made out of two-component silicone and is temperature-resistant of up to + 180 °C. Thus, it is possible to pour the molten mastic sample directly into the silicone mould loaded in the DSR. The loading of molten samples guarantees an ideal bonding between mastic and the smooth stainless steel surfaces of the DSR. It is optional to remove excess mastic at the top of the mould because it has no impact to the fatigue test. The silicone mould is removed after a cooling period of at least 10 min resulting in an accurate specimen shape (Fig. 6a). The fatigue test starts after another cooling period of 15 min to achieve thermal equilibrium and lasts until the total failure of the specimen (Fig. 6b).

Fig. 6
figure 6

a Loaded DSR specimen, b specimen after fatigue test

4 Results

4.1 Hyperboloid: a challenging specimen shape

It must be noted that the dynamic modulus calculated by the DSR software (Anton Paar RheoCompass) is not the true dynamic modulus of the mastic sample being measured. This issue is related to the hyperbolic specimen shape. Parallel-plate tests are usually run with cylindrical specimen shapes and therefore, all the calculations within the DSR software are based on a cylindrical shape with a diameter of 8 mm and a sample height of 3 mm. Equalling Ø6 and Ø8 mm for shear stress (Eq. 1) and shear strain (Eq. 2) gives a correction factor for the dynamic modulus of 3.2 (Eq. 3). Hence, a cylindrical specimen with = Ø8 mm is expected to be 3.2 times higher in dynamic modulus than a cylindrical specimen with Ø6 mm.

$$\tau :\quad \frac{{2 \cdot M_{{8{\text{mm}}}} }}{{\pi \cdot r_{{8{\text{mm}}}}^{3} }} \cdot f_{{{\text{korr}}.\tau }} = \frac{{2 \cdot M_{{6{\text{mm}}}} }}{{\pi \cdot r_{{6{\text{mm}}}}^{3} }}$$
(1)
$$\gamma :\quad \frac{{\alpha_{{8{\text{mm}}}} \cdot r_{{8{\text{mm}}}} }}{{h_{{8{\text{mm}}}} }} \cdot f_{{{\text{korr}}.\gamma }} = \frac{{\alpha_{{6{\text{mm}}}} \cdot r_{{6{\text{mm}}}} }}{{h_{{6{\text{mm}}}} }}$$
(2)
$$\left| {G^{*} } \right|:\quad f_{{{\text{korr}}.|G^{*} |}} = \frac{\tau }{\gamma } = \frac{{f_{{{\text{korr}}.\tau }} }}{{f_{{{\text{korr}}.\gamma }} }}$$
(3)

where |G*| dynamic modulus [Pa], α deflection angle [rad], τ shears stress [Pa], h specimen height [m], γ shear strain [–], fkorr.τ correction factor τ, M maximum torque fkorr.γ correction factor γ, r radius testing geometry [m], fkorr.|G*| correction factor |G*|.

Oscillatory shear tests on a bitumen 70/100 with the same conditions as applied for the fatigue tests (+ 10 °C, 30 Hz sample height of 3 mm) on both, Ø8 mm cylindrical and hyperbolic shape resulted in a correction factor of 2.4. This means that the dynamic modulus of the Ø8 mm cylindrical specimen is not as high as expected from calculations or vice versa, the hyperboloid is stiffer than expected. It is highly likely that the contributing diameter or volume of the hyperboloid is higher than 6 mm because of the circular necking. Probably, this correction factor is varying depending on the grading curve and grain shape of the filler/fines as well as the grading or (polymer-) modification of the binder. This issue is shown in Fig. 7 and will be looked into in future work.

Fig. 7
figure 7

Contributing diameter or volume of cylindrical and hyperbolic specimen shape

4.2 Repeatability of the developed fatigue test

Tests are required to have a good repeatability and comparability to guarantee a wide spread application. Figure 8 shows the repeatability of 10 fatigue tests at a shear stress level of 400 kPa of both, dynamic modulus at the beginning of each fatigue test and the fatigue life expressed as the number of load cycles until failure. The dynamic modulus is obtained after 10 s, which is equal to 300 load cycles. While EN 14770 [34] for standard DSR tests pre-sets the comparability of the dynamic modulus to 10%, there is no value given for the repeatability. It has to be accounted that this comparability is based on a round-robin test with a cylindrical specimen shape. Because there is no repeatability given, the mean ± 5% is shown in Fig. 8. The 50% (median) and 95% confidence interval of the standard error (SEM) is calculated to look into the quality of the obtained fatigue life. With a probability of 95% the true fatigue life is between 188,000 and 212,000 load cycles. This is about ± 6% of the mean. It can be seen that both, dynamic modulus and fatigue life show a good repeatability. This proves that the hyperboloid specimen shape can be prepared very accurately and the entire DSR setup is suitable for fatigue testing.

Fig. 8
figure 8

Repeatability test with hyperbolical shape at + 10 °C and 400 kPa

4.3 Comparison of mastic samples

For the determination of the fatigue life of a mastic sample and for the comparison to other samples it is necessary to fit a logarithmic stress-cycle (SN) curve (Wöhler curve) into the fatigue test results obtained of at least three different shear stress levels. Figure 9 shows SN curves of two different mastic samples, both mixed with the same 70/100 paving grade bitumen and mixing ratio 1 part bitumen to 1.5 parts filler by mass. Both curves are fitted to the mean values of three single fatigue tests at four different shear stress levels, respectively. Both SN curves show an excellent coefficient of determination of R2 = 0.99. The performance of limestone seems to be far superior that for quartzite, but it has to be noted that the grading curve [1] and the density (filler volume concentration [22]) have an high impact on the fatigue performance and is not accounted in this study.

Fig. 9
figure 9

SN curves of two different asphalt mastic samples

5 Conclusions and outlook

5.1 Conclusions

This study shows that a standard DSR equipped with components used for binder grading (temperature-controlled hood, PP08 geometry) is capable of fatigue testing of asphalt mastic. Therefore, no costly changes are necessary. The developed hyperbolic specimen shape with predetermined point of failure gives a good repeatability. This makes it possible to compare the fatigue durability of different mastic samples with SN curves at different shear stress levels. However, the hyperbolic shape is challenging in terms of the calculation of the true dynamic modulus compared to a cylindrical specimen shape.

5.2 Outlook

This paper is limited to one binder and two different fillers. Further tests with different binders and filler/fines are still ongoing. Another aim of this project is to determine the impact of different temperatures on the fatigue life of mastic. As soon as the fatigue life of several fillers/fines is found, a correlation analysis with 4PBB results will be carried out. The actual state of the developed fatigue test does not allow any kind of weathering. Because fillers/fines can have high water susceptibility, it is important to find a suitable solution for taking the water susceptibility in the fatigue test into account in the future.