1 Introduction

Alkali-activated concretes have shown sufficient durability to apply them for full-scale application, as demonstrated for alkali-activated concretes based on ground granulated blast furnace slags [1,2,3]. However, there are still open questions regarding the long-term durability of alkali-activated materials (AAMs) [4, 5], which are related to the wide range of precursors and activators that can be used. This, in turn, leads to a wide variety in hydrate phase chemistries and microstructural characteristics of these materials. The work of RILEM technical committee (TC) 247-DTA ‘Durability Testing of Alkali-Activated Materials’ has been targeted to perform a round robin testing programme to assess the validity of different durability testing methods concerning alkali-activated concretes. The two previous papers derived from the activities of RILEM TC 247-DTA presented the outcomes of round robin tests on compressive strength [6] and on carbonation and chloride penetration testing [7]. The TC evaluation of testing methodologies for the determination of sulfate resistance, alkali-silica reaction and freeze–thaw resistance is highlighted in this contribution.

The underlying mechanisms related to sulfate attack on Portland cement-and blended Portland cement-based concretes are largely well understood; however, there is still a lack of agreement on suitable accelerated laboratory tests of sulfate resistance. For further details the reader is referred to the state-of-the-art report of RILEM TC 211-PAE (Performance of cement-based materials in aggressive aqueous environments) [8]. The sulfate resistance of AAMs was reviewed in the state-of-the-art report of RILEM TC 224-AAM (Alkali-activated materials) [1], and more recently e.g. by Zhang et al. [9], Arbi et al. [10], Bernal and Provis [4], and Provis [5]. The interaction of AAMs with sulfates is often tackled in the context of sulfuric acid attack, but a reasonable number of published studies deal also with the interaction of AAMs with Na2SO4 or MgSO4 solutions. It is generally agreed that according to accelerated laboratory tests the resistance of AAMs to sulfate attack is quite high; however, experience from field tests is very scarce. Fly ash-based AAMs show high sulfate resistance when tested using sodium and magnesium sulfate solutions. Slag-based AAMs generally exhibit a high resistance to Na2SO4 solutions, while MgSO4 solutions are often deleterious, especially at high concentrations, as decalcification of calcium silicate hydrate (C–S–H) and gypsum formation occur, which lead to a loss of dimensional and structural integrity. Information on sulfate resistance of pure metakaolin-based AAMs is scarce in the open literature. According to Hawa et al. [11] these materials exhibit a high volume stability when exposed to 5% Na2SO4 and 5% MgSO4 solutions, respectively, except for a very early expansion during the first week of storage, which was not further explained by the authors.

The question of alkali-silica (or more broadly, alkali-aggregate) reactions in AAMs is contentious in many parts, particularly because AAMs often contain alkalis at levels that are an order of magnitude higher than the levels that are conventionally considered “safe” for Portland cement-based concretes. The science and engineering of alkali-silica reactions in AAMs has been reviewed in detail by Shi et al. [12]. Published results from tests conducted across a variety of AAM binder-aggregate combinations do not indicate that alkali-silica reactions are unduly problematic in AAM concretes or mortars produced using aggregates of ‘normal’ reactivity [12]. Expansion attributed to alkali-silica reactions can certainly be induced under accelerated conditions, particularly when using an aggregate type known to be reactive [13, 14]. However, in most reported cases of this type, AAM mortars formulated with reactive aggregates have shown less expansion than compared to Portland cement-based mortars produced with the same aggregates, despite the difference in alkali content [13, 15,16,17]. The binder chemistry of AAMs generally leads to a high Al concentration in the pore solution and a limited availability of reactive calcium, which is believed to restrict alkali-silica expansion by a mechanism analogous to the action of aluminous pozzolans in concrete with Portland cement blends [13, 16, 18].

Depending on the application, the freeze–thaw resistance, with or without the presence of deicing salts, can be another important aspect of the durability of concrete. Though early publications claimed an “excellent” freeze–thaw resistance for alkali-activated fly ash [19] as well as for alkali-activated slag concrete [20], the parameters influencing freeze–thaw damage of alkali-activated concretes have received little attention—with few exceptions [21,22,23]—and remain poorly understood to date [5]. To establish such an understanding via comparison of microstructural alterations and concrete performance, it is important to first evaluate the applicability of existing freeze–thaw resistance test methods to alkali-activated concretes.

2 Materials and methods

2.1 Starting materials, concrete mix designs and curing

Hard coal fly ash (FA), supplied by BauMineral (Germany), ground granulated blast furnace slag (BFS), supplied by ECOCEM® (France), and flash-calcined metakaolin (MK), supplied by Argeco (France) were used as solid precursors. Sodium silicate solutions were used as the activators in each mix, adjusted to the targeted compositions (Table 1) by adding appropriate amounts of water and sodium hydroxide. Each laboratory used locally available aggregates, and thus, their mineralogical compositions and their grading curves differed between laboratories. For more details the reader is referred to the first paper of the TC work [6].

Table 1 Mix designs (kg/m3) and approximate design strengths of the alkali-activated concretes, reproduced from [6, 7]

Two concretes based on BFS, two concretes based on FA, and one concrete based on MK were produced and tested. For the concretes based on BFS and FA, one concrete with a ‘moderate’ performance and one concrete with a ‘high’ performance, respectively, were designed. The mix designs and the targeted compressive strengths after 28 days are shown in Table 1, and more details are provided in [6].

The concretes were cured for the first 2 days (or until demouldable) in covered or sealed moulds at 20–23 °C, and subsequently until the testing age in tightly closed plastic bags at 20–23 °C. For some tests, differing mix designs and curing regimes were applied as described in Sects. 2.22.4.

2.2 Sulfate resistance testing

As testing methods ASTM C1012 [24],Footnote 1 the SVA test [25], and the accelerated test according to Swiss Standard SIA 262/1 Appendix D [26, 27] were selected.

ASTM C1012 [24] is based on expansion measurements of 25 mm × 25 mm × 285 mm mortar bars stored in a 5 wt% Na2SO4 solution at 23 °C for 12–18 months. The mortar bars are specified to be cured at 35 °C for the first 24 h, demoulded, and then cured in lime water until a compressive strength of 20 MPa has been reached. This means that quite immature samples are tested when mixes that are intended to develop moderate to high strength at 28 days are being tested. The use of other solutions such as MgSO4 is also possible, and this was tested by one participating laboratory.

Pre-curing in saturated lime water is prescribed in ASTM C1012, specifically targeted at improving the maturity of Portland cement-based materials. Nevertheless, it was carried out for the AAMs tested. Although secondary effects such as leaching of the activator, formation of calcium-containing phases in the pore space, and/or formation of gypsum during the later storage in sulfate solution may occur, it was not possible to directly replace this curing condition within the specified procedure of this standard, and it does not appear that any of the potential interfering effects were significantly detrimental to the samples.

The SVA test [25] uses mortar prisms 10 mm × 40 mm × 160 mm, which are cured at 20 °C for 2 days in the mould and subsequently 12 days in saturated lime water. Afterwards a set of samples is stored at 5 °C and at 20 °C, respectively, in 4.4 wt% Na2SO4 solution, while a set of control samples remains in saturated lime water at the same temperatures. Length change and dynamic modulus of elasticity are followed over time.

Mortar mix designs for the above test methods were developed based on the compositions of the concretes FA2, FA8, S1b and S3a, see Table S1 in the Electronic Supplementary Material, which deviate from the specifications of ASTM C1012 and SVA. For all mortars, the composition of the paste (i.e. solid binder plus activator solution) was identical to the paste composition of the respective concretes. CEN standard sand (EN 196–1 [28]) with a nominal maximum grain size of 2.0 mm was specified as the aggregate.

While ASTM C1012 and the SVA test are designated to assess the binder, SIA 262/1 Appendix D [26, 27] is a method to test the concrete directly. Concrete cores with a diameter of 28 mm and a length of 150 mm are drilled out of larger samples cured for 28 days. Afterwards the samples are in contact with sulfate ions during 4 cycles of immersion in 5 wt% Na2SO4 solution (5 days at 20 °C) and drying (2 days at 50 °C). After the 4th cycle the specimens are immersed again in the sulfate solution, and length and mass changes are measured up to 56 days storage time in 5 wt% sulfate solution at 20 °C.

2.3 Alkali-silica reaction testing

The alkali-silica reaction testing was conducted according to the RILEM AAR-3 method [29], as well as an adapted version of ASTM C1293 [30], which are both based on prismatic concrete samples within the range of dimensions 75(± 5) mm × 75(± 5) mm × 250(± 50) mm.Footnote 2 In applying both of these tests, the specified exposure conditions (38 °C in a sealed storage container; provisions which are near-identical in the two test methods) were followed, but the mix designs described in Table 1 were used unmodified, rather than following the mix designs specified by the standard. Each laboratory used a different alkali-reactive aggregate combination, as described in Sect. 4.2. The alkali contents of the mix designs in Table 1 are all significantly higher than the 5.25 kg/m3 specified in ASTM C1293-08b [30], or the 5.5 kg/m3 specified in RILEM AAR-3.1 [29]. By testing without adjustment of the concrete alkali content, direct application of this test method to alkali-activated concretes may therefore be expected to give a somewhat conservative determination of the alkali-reaction potential of the aggregate used.

ASTM C1293 specifies that for measurements of length changes, the samples are to be cooled down to 23 °C and measured by reference to a zero point taken at 23.5 ± 0.5 h. The test specifies a criterion of 0.04% expansion at 12 months to indicate problematic behaviour, whereas if the expansion has not reached 0.04% at 24 months, the combination of cementitious and aggregate constituents should be able to “prevent excessive expansion in field concretes”.

The RILEM AAR-3.1 test method was applied according to the 2011 revision of the testing methodology, as published in the State of the Art Report of RILEM TC 219-ACS [29]. This test specifies a measurement temperature of 20 °C, referenced to a zero point that is defined at 24 ± 1 h.

The test specification document for the AAR-3.1 method does not set explicit pass/fail criteria, but the results of the EU PARTNER project [31] indicate that a 12-month expansion of less than 0.05% in this test is “likely to indicate non-expansive materials”, while an expansion of 0.1% or more indicates an expansive combination, and results between 0.05 and 0.1% show a potential for alkali-reactivity.

2.4 Freeze–thaw resistance testing

Freeze–thaw resistance testing methods included the CDF test [32] as well as the tests according to ASTM C666 (Procedure A) [33] and ASTM C672 [34]. Neither air-entraining agents nor any other compounds or measures were used to adapt the concretes to freeze–thaw attack. The CDF test and the test according to ASTM C672 involve the use of deicing salt, while the test according to ASTM C666 uses water only. For conventional concretes, the former two test methods are thought to cause deterioration mainly in the near-surface regions, while attack without deicing salts affects the inner regions as well [35].

The CDF test consists of exposing a minimum of five concrete specimens with dimensions 150 mm × 150 mm × (50–150) mm to freeze–thaw cycles while partially immersed in a 3% NaCl aqueous solution. Curing and pre-treatment of the specimens is prescribed as follows: Curing in the moulds at 20 ± 2 °C for 24 h; demoulding and curing in tap water at 20 ± 2 °C for six days; dry storage at 20 °C/65% relative humidity (RH) for 21 days, during which the lateral surfaces of the specimens are sealed; pre-saturation by partially immersing the specimens in the test solution (3 wt% NaCl) at 20 °C for seven days. Each freeze–thaw cycle of the CDF test has a duration of 12 h, during which the temperature is lowered from 20 ± 1 °C to− 20 ± 1 °C, held at that temperature for 3 h, increased back to 20 ± 1 °C and then held at this temperature for 1 h. At specified intervals, the surface scaling of the immersed surface is determined through ultrasonic treatment to remove loosely adhering material, which is then dried and weighed. Assessment of the resistance of a concrete is usually based on the cumulative surface scaling after 28 freeze–thaw cycles; a common acceptance criterion is that the surface scaling is ≤ 1500 g/m2 after 28 cycles [36].

In the present testing campaign, curing conditions differed between laboratories: Laboratory B followed the instructions of the CDF test entirely; in laboratory A, the initial curing up to seven days was done in sealed conditions at 23 °C to avoid leaching of activator components, after which dry storage was conducted as prescribed; and curing in laboratory C was done at ~ 20 °C in a closed box over an open water surface for 28 days, and immediately thereafter pre-saturation by immersion in the test solution was started (i.e. no dry storage was conducted). Laboratory A reported that, for some of the tested concretes, the specimen surfaces to be immersed were ground to flatness within the dry storage period, because their visual appearance in terms of smoothness and efflorescence was unsatisfactory. The same laboratory tested a second set of concrete specimens which were cured in sealed conditions at 23 °C for 56 days (instead of seven days) before dry storage; all other pre-treatment steps were completed as prescribed.

Laboratory D performed the tests according to ASTM C666 (Procedure A) and ASTM C672. For the test according to ASTM C672, concrete slabs with dimensions 220 mm × 220 mm × 76.2 mm were produced, with the flat surfaces of the slabs covered by 6 mm of either 4 wt% CaCl2 aqueous solution or 4 wt% NaCl aqueous solution and then exposed to 90 freeze–thaw cycles (16 h at − 20 °C then 8 h at 25 °C); the test solution was replaced each five cycles. At specified intervals, the surfaces were visually examined and qualitatively rated according to the scale given in ASTM C672; in addition, the mass change of the specimens was determined and reported.

The ASTM C666 test procedure was modified for the application to AAM concretes through a change in curing conditions. Instead of limewater saturation, concrete specimens (76.2 mm × 101.6 mm × 406.4 mm) were removed from the moulds after 24 h, then air cured until the age of 28 days, and subsequently submerged in water for 14 days. Test procedure A consists of exposing the specimens, while submerged in water, to up to 300 freeze–thaw cycles (4 °C to− 18 °C within two to five hours). At regular intervals, the absolute and relative dynamic moduli of elasticity were recorded.

3 Reporting of results

Pre-formatted templates were used by the round robin participants for reporting of the test results and to register deviations from the standard procedures as described in Sects. 2 and 4.

Five laboratories reported sulfate resistance testing data, three laboratories reported alkali-silica reaction testing data, and four laboratories reported freeze–thaw resistance data.

All results are given, summarised as the laboratories’ mean values with their standard deviations, in the Electronic Supplementary Material (Tables S1–S12) accompanying this article. In Sect. 4, these results are graphically presented and discussed.

4 Results and discussion

4.1 Sulfate resistance testing

Due to the limited number of participants in the sulfate resistance part of the round robin test, a full interlaboratory assessment of the results is not possible. Thus, the discussion of the results is focussed on the comparison between the different methods.

4.1.1 ASTM C1012

The mix design of the AAM tested was different from the mix design specified in ASTM C1012, as described in Sect. 2.2 and shown in Table S1 in the Electronic Supplementary Material.

Figure 1a shows the compressive strength development of the fly ash-based mortars during storage in saturated lime water, as reported by laboratory E. Both mixes show a steady development of compressive strength. Mix FA2 reaches the 20 MPa threshold specified in ASTM C1012 for the start of exposure testing after 7 d, and mix FA8 after 9 d of curing.

Fig. 1
figure 1

Sulfate resistance of fly ash-based AAM mortar mixes tested according to ASTM C1012; a Compressive strength development for maturity assessment, b Length and mass changes in 5 wt% Na2SO4. Results of Laboratory E

Figure 1b gives length and mass changes of the fly ash-based AAM mixes stored in 5 wt% Na2SO4. Neither mix shows any expansion, but a slight shrinkage instead. During the first 3–4 weeks of storage, a mass loss from the samples occurs, probably due to the leaching of the activator. Afterwards a continuous mass increase occurs, which does not seem to have stabilized after 15 months of exposure.

Figure 2 compares the ASTM C1012 length changes for all five mixes based on slag, fly ash or metakaolin, measured by one of the participating laboratories. The results for the fly ash mixes compare well with the data shown in Fig. 1b, although a different initial curing regime was used (2 days at 20 °C in the mould, instead of 2 days at 35 °C). The slag-based AAM mixes show a negligible change (S1b) or slight expansion (S3a) after one year of storage in Na2SO4. It can be speculated whether the differences between both slag samples are significant, and if the higher expansion of the higher performance slag mix S3a compared to the moderate one (S1b) is due to its higher binder content. Another participating laboratory found the reverse rank order for S1b and S3a under Na2SO4 exposure (Fig. 3), with small expansion in S1b and actually a minor contraction in S3a. However, there were no significant dimensional stability issues in any of these cases; expansions and contractions measured for the fly ash-based and slag-based AAMs were all well within the specified “safe” limits (generally 0.1%).

Fig. 2
figure 2

Length changes of all AAM mortars tested in 5 wt% Na2SO4 according to ASTM C1012. Note that pre-curing in the mould was done at 20 °C for 48 h (instead of 24 h at 35 °C). Compressive strengths after pre-curing were: FA8: 19.5 MPa (13 d), FA2: 22.0 MPa (16 d), S3a: 20.6 MPa (3 d), S1b: 22.0 MPa (6 d), MK: 54.1 MPa (2 d). Results of Laboratory H

Fig. 3
figure 3

Length changes of slag-based AAM mortars tested in 5 wt% Na2SO4 and in 5 wt% MgSO4 according to ASTM C1012. Samples were submerged in sulfate solution after 24 h of pre-curing in the mould at 35 °C. Compressive strengths after pre-curing were: S3a: 16.1 MPa (1 d), S1b: 22.7 MPa (1 d). S3a was tested also after 1 additional day of curing in saturated lime water, where a compressive strength of 22.0 MPa was reached, and the results (not shown) did not differ significantly from the results with 24 h of pre-curing. Results of Laboratory F

Surprisingly, the MK mix in Fig. 2 shows a fast and early expansion after 7 days of storage, while afterwards no significant length changes occur. This behaviour is in agreement with the findings of Hawa et al. [11], however due to its rapid onset and short-term nature it is unlikely to be linked to any conventional sulfate attack mechanisms, and it is beyond the scope of the present paper to provide further detailed discussion proposing an underlying expansion mechanism. However, this observation highlights the importance of taking multiple length measurements (not solely a final measurement after several weeks or months) when seeking information about the nature of expansive processes in AAMs. This finding also raises discussion about the appropriate time to determine and define the “zero point” for measurements such as this, and also the alkali-silica reaction tests described in Sect. 4.2. It has to be further noted that the MK mix already showed a very high compressive strength of over 50 MPa before starting the immersion, which significantly exceeds the minimum strength of 20 MPa as required by ASTM C1012.

As calcium-containing binder systems (e.g. blended with Portland cement or alkali-activated slag) are often attacked by MgSO4 solutions more severely than by Na2SO4 solutions [37], one participant carried out the C1012 test using both solutions (Fig. 3). With Na2SO4 no expansion is measured, while the rank order between S1b and S3a is reversed compared to the data shown in Fig. 2. Conversely, the samples stored in MgSO4 solution show a slight expansion. The moderate-strength mix S1b expands slightly more than the higher-strength mix S3a. While the samples stored in Na2SO4 did not show any visually recognisable damage on the surfaces, the specimens stored in the MgSO4 solution were evidently corroded at the corners and the edges.

The expansion limit of 0.1% as specified in ASTM C595 for blended Portland cements [37] at 12 months in Na2SO4 for high sulfate resistance is not exceeded by any of the samples tested here according to ASTM C1012, neither in Na2SO4 nor in MgSO4.

4.1.2 SVA test

For the SVA test [25], mortars were used as well, and their mix design is shown in Table S1 in the Electronic Supplementary Material. The testing was performed by one participating laboratory, and on the slag-based mixes (S1b and S3a) only. Length changes in 4.4 wt% Na2SO4 and in saturated lime solution (reference) were measured at 20 °C up to 91 days. Measurements at 5 °C were not carried out. It is noted that length change measurements in saturated lime solutions might not be an appropriate zero-point reference for AAMs, but this curing method is specified by the test method, and was therefore implemented.

While the samples stored in saturated lime solution showed neither significant shrinkage nor expansion, the samples stored in Na2SO4 solution exhibited very limited shrinkage, in the order of 0.003% (Table S4). The maximum expansion limit for high sulfate resistance is specified in this test to be a difference between length changes in Na2SO4 and saturated lime solution of 0.05% after 90 d, so the observed expansion falls well within this limit. Mass changes were negligible for both slag mixes under both exposure conditions (Table S4).

4.1.3 SIA 262/1 Appendix D

One participant used a modified version of SIA 262/1 Appendix D [26, 27] to test the resistance of the slag- and fly ash-based mixtures to MgSO4 attack. Four 50 mm × 50 mm × 285 mm concrete prisms were used (instead of six ø 20 mm × 150 mm drilled cores as specified in the testing method documents). Instead of 5 wt% Na2SO4, 5 wt% MgSO4 solution was used. The samples were stored for up to two years after the initial four loading sulfate cycles. In parallel, a separate set of replicate samples was stored according to the conditions described in ASTM C1012, but after pre-curing for 28 days. In this case, the MgSO4 solution was exchanged after each measurement of length and mass.

The length changes determined according to the modified SIA 262/1 Appendix D method are shown in Fig. 4. While the fly ash-based mixes do not show a significant expansion, the slag-based mixes reach values of 0.061% (S1b) and 0.068% (S3a), respectively. Those values are below the limit of 0.1% specified for a sulfate resistant cement. The slag-based concrete prisms showed some minor cracking at the edges, while the fly ash-based mixtures showed no signs of physical damage. When using the modified ASTM C1012 (Table S5), severe scaling of the surface of the slag-based concretes was observed, while the expansion was less (0.018% for S1b and 0.028% for S3a) than for the SIA method. For the fly ash-based concretes only a negligible expansion was measured, and no signs of physical damage occurred.

Fig. 4
figure 4

Length changes of fly ash- and slag-based AAM concrete prisms in 5 wt% MgSO4 using the modified SIA 262/1 Appendix D method. Results of Laboratory G

Both the modified versions of the SIA and the ASTM methods used larger specimens than specified in the respective standards, and thus it could be possible that expansion was restricted by a limited penetration depth of sulfate compared to the diameter of the specimens. By micro-X-ray fluorescence analyses (µ-XRF) it has been shown that significant ingress of sulfate occurred for a slag-based AAM concrete tested according to the ASTM method, while this was not the case in the SIA setup [38]. For the fly ash mixes, both testing methods produced similar concentration gradients over the depths of the specimens.

4.2 Alkali-silica reaction testing

Laboratories G, H, and I reported the results of alkali-silica reaction testing: Laboratory G used the ASTM C1293 [30] method, adapted as described in Sect. 2.3, while Laboratories H and I applied the RILEM AAR-3.1 [29] test. All of these tests are applied to concrete prism specimens; the C1293 and AAR-3.1 tests are conducted at 38 °C, and Laboratory I also conducted a parallel test with exposure at 20 °C but otherwise following the RILEM AAR-3.1 methodology.

The RILEM AAR-3.1 test also specifies that mass change measurements should be taken at each interval to ensure that there has not been drying of the samples; any mass loss from conventional cements is taken to indicate insufficient availability of water and leads to results being discarded [29]. Laboratory I (which tested only mix FA8 with different aggregates) observed an initial mass gain of 3–5% for all specimens at the first measurement point then little change (< 0.3% variation in any specimen) thereafter, while Laboratory H recorded mass decreases of ~ 1.5% for the FA8 and MK samples, 0.3% for FA2, and negligible change for S1b. Although the ASTM C1293 method applied in Laboratory G does not specifically require the measurement of mass changes, this information was nonetheless recorded; all samples gained up to 0.9% mass during the 24 months of testing, without a clear trend regarding whether the mass change occurred early or late for each particular specimen. The mass change data recorded in each laboratory are provided in the Electronic Supplementary Material. Due to the uncertainty around whether this provision of the test method is relevant for testing of AAMs, data corresponding to samples with mass losses were not discarded from this study.

Figure 5 shows the data recorded in Laboratory G using ASTM C1293, where concretes were tested both with conventional aggregates, and also with replacement of the coarse aggregates by highly reactive Spratt limestone (sourced from Ontario, Canada), which is known to cause expansive reactions in conventional cement concretes [39]. The results obtained using the Spratt aggregates in slag-based AAM binders (Fig. 5a) show that the test method is suitable for the detection of cases where excessive expansion would be expected, as both slag-based AAMs exceeded the 0.04% expansion limit after 2 years with this type of aggregate. However, when using aggregates that were not expected to be problematically reactive when used with Portland cements, the expansions observed at 2 years for S1b and S3a were approximately half the specified limit, even with a concrete alkali content vastly exceeding the design alkali content for concretes specified to be used in this testing method.

Fig. 5
figure 5

ASTM C1293 results from Laboratory G. a slag-based samples, b fly ash-based samples. Error bars represent one standard deviation among 4–6 replicate samples, and where not visible are smaller than the data symbols

None of the fly ash-based AAMs (Fig. 5b) exceeded the specified expansion limit after 2 years, even when the Spratt aggregate was used. Approximately half of the total observed expansion took place very early in the testing, within the first week. It is very unlikely that such early expansion is related to alkali-aggregate reaction processes, as these are generally much slower to develop, and this raises potentially important questions about the setting of the length reference datum (i.e., the point taken as zero expansion) for this test when applied to alkali-activated materials. This will be revisited below in the discussion of further results for fly ash-based AAM concretes.

A detailed mechanistic analysis of the reasons for the alkali-aggregate reaction behaviour of AAMs is beyond the scope of the current paper, which is instead focused on understanding whether the given test methods are suitable to be applied to AAMs. It is hypothesised that the high Al content and often low Ca content of alkali-activated binders are important in controlling alkali-related expansive behaviour, as was discussed in Sect. 1.

The results from Laboratory G indicate that: (a) problematic aggregate-binder combinations (e.g. Spratt in slag-based AAM) can be identified; and (b) that the role of the binder chemistry in influencing aggregate-derived expansion can also be determined (as the Spratt aggregates gave a passing result when tested with a fly ash-based AAM, but failed with a slag-based AAM). These two observations indicate that the ASTM C1293 method can be considered broadly suitable for testing of AAM concretes, although it is evident that comparison between laboratory and long-term field data is still needed to enable a firm conclusion to be drawn regarding the precision of the test method in predicting field performance. The within-laboratory reproducibility of this test appears to be very good; the coefficients of variation among 4–6 replicate prisms at 720 days were less than 12.7% in all cases (shown as error bars in Fig. 5), and between 10.4–12.7% in 5 of the 8 data sets collected. This compares remarkably well with the 12% within-laboratory coefficient of variation quoted in the ASTM C1293 method specification for expansions of 0.02% or more [30]. ASTM C1293 quotes a standard deviation of 0.0025% for length changes less than 0.02% [30], compared to 0.0019% for the results presented here (calculated as the mean of the standard deviation values for all measured length changes not exceeding 0.02% at any age for any sample). This appears to reflect the intrinsic variability in the manual determination of small length changes, and the standard deviation is not material-specific. The reproducibility of ASTM C1293 therefore appears to be very similar whether it is applied to AAMs or to conventional cement concretes.

The testing programme of RILEM TC 247-DTA did not yield sufficiently large data sets to enable an evaluation of the inter-laboratory reproducibility beyond the statements present in the ASTM C1293 test method documentation.

Figure 6 presents the results from the RILEM AAR-3.1 test (Fig. 6a, b), along with the outcomes of a parallel test conducted in one laboratory where the same test protocol was followed but with exposure at 20 °C rather than 38 °C (Fig. 6c). In this laboratory, Laboratory I, tests were conducted using the fly ash-and slag-based AAM concretes with a conventional siliceous (river gravel) aggregate and similar petrography for both fine and coarse fractions. In each concrete, a part of one of the aggregate size fractions was replaced by a potentially reactive component: either 58.9% replacement of fine aggregates by a suspected reactive sand, or 35.9% replacement of coarse aggregates by Dry Rigg greywacke (a known reactive coarse aggregate [40]).

Fig. 6
figure 6

RILEM AAR-3.1 results from a Laboratory H, b Laboratory I (38 °C results). Parallel 20 °C test results from Laboratory I are shown in (c). In a, error bars indicate the standard deviation among 2–3 replicate samples. In b and c, replicates of the same test are shown separately, but with the same line and marker styles

Figure 6a shows that shrinkage of both fly ash-based AAM mixes took place during the RILEM AAR-3.1 test, while S1b and MK1 expanded less than 0.04%. For FA2, only two of the three prisms shrank, while the third remained dimensionally stable, and this explains the relatively larger error bars shown for this data set. Such variation was not reflected in differences in mass loss between the specimens during the test, so is unlikely to be a drying shrinkage effect. As was observed for the slag-based mixes in the ASTM C1293 results, around half of the total observed expansion of S1b took place during the earliest part (first four weeks) of the test. This supports the discussion presented above, regarding whether this is actually likely to be an ASR-related expansion. It is unlikely that an ASR reaction would be so rapid, and this behaviour is more realistically attributed to other chemical and/or physical changes in the concrete during the first few weeks of its hardening.

Figure 6b shows the data from three replicate tests of FA8 in Laboratory I, with either part of the fine aggregates or part of the coarse aggregates replaced by potentially reactive components as noted above. The RILEM AAR-3.1 test specification identifies that the within-laboratory reproducibility should be approximately half of the measured expansion value [29], and for each set of prisms here, the measured results do fall within a range of approximately 50% above and below the mean for the triplicate samples. Similarly, for the duplicate samples tested at 20 °C (Fig. 6c), the within-laboratory variation is approximately 50% above or below the mean. The measured expansions are all less than 0.03%, and therefore are not considered problematic under the indications of this test method. This indicates that the FA8 concrete mix does not cause undue expansion when using either of these potentially reactive aggregates. It is not possible to use Fig. 6 to conduct a meaningful inter-laboratory comparison of the test results, because Fig. 6a shows only results for unreactive aggregates, while Fig. 6b, c have reactive aggregates included in the mixes.

However, it is striking that in Fig. 6b, c, almost all of the observed length changes take place within the first week of the test. Re-normalising the test results shown in Fig. 6b to use the 1-week test result as the effective zero length change datum (i.e. subtracting the 1-week expansion reading from all measured data) yields a very different view of the test outcomes, as seen in Fig. 7. In this re-evaluation of the test data, it is seen that all samples are highly dimensionally stable from the 7th day onwards, without evidence of undue expansion. Nevertheless, all samples do show a slight contraction from 14 to 91 days, then an expansion back to reach the 1-week length again at 360 days. The reproducibility of the re-normalised results is very high, with all of the 360-day data points for the triplicate specimens containing the potentially reactive fine aggregates falling within a range of 0.003% in length change (range − 0.001 to + 0.002%), while the triplicate specimens containing the reactive coarse aggregate are all within 0.001% of each other (range − 0.002 to − 0.001%).

Fig. 7
figure 7

RILEM AAR-3.1 data from Laboratory I (as in Fig. 6b), re-normalised to use the 1-week measurement as the zero length change datum

The RILEM AAR-3.1 test method specification states that “the lowest result for a reactive aggregate should exceed 0.075%” [29]. On this basis, it can be concluded that none of the combinations tested here with this method can be classified as ASR-reactive, even without the re-normalisation shown in Fig. 7 and regardless of whether potentially reactive aggregates are used. It is not possible to conclusively say from these results whether the RILEM AAR-3.1 test method can actually identify a problematically reactive aggregate-binder combination in AAM concretes, because with no combinations the failure criterion for the test was approached. However, considering the very close similarities between this test and the ASTM C1293 methodology (which is stated in [29] to have been taken into account in the development of the RILEM method), it is likely that the conclusions drawn above for the ASTM method are also applicable to the RILEM method. This method therefore seems to be broadly suitable for the testing of AAM concretes. However, it is recommended that the 1-week measurement be used as the zero length datum point when this type of expansion testing method is applied to AAM concretes. This appears to be the age at which some degree of early-age dimensional stability is achieved, and from which it is possible to identify actual ASR-related processes (which take place at a later age).

4.3 Freeze–thaw resistance testing

4.3.1 CDF test

The CDF test results of Laboratories A, B and C are shown in Fig. 8. In all cases, the ranking of the performance of the concretes in each laboratory was in the expected order, i.e. for both the slag-based concretes and the fly ash-based concretes, the high-strength mix always exhibited less surface scaling than the moderate-strength mix. Also in line with expectations, MK1, which had the highest water/binder ratio of the low-Ca AAM-based concretes, already exhibited considerably more surface scaling than FA8 after four freeze–thaw cycles. However, while both slag-based concretes exhibited considerably less surface scaling than both of the fly ash-based concretes when tested in Laboratory B, there was no clear distinction between these two groups of concretes in Laboratory A, at least at early test times up to 14 freeze–thaw cycles.

Fig. 8
figure 8

Results of CDF tests obtained in a) Laboratory A, 28-day cured concretes; b) Laboratory A, 56-day cured concretes; c) Laboratory B, 28-day cured concretes; and d) Laboratory C, 28-day cured concretes. Details of all curing conditions are given in Sect. 2.4. Error bars represent one standard deviation in each direction from the mean, within the results recorded by each laboratory

The results of the CDF test differed significantly between laboratories in two further respects. First, the absolute values of surface scaling were very different for the same concrete tested in different laboratories; e.g., after four freeze–thaw cycles FA8 exhibited 3152 g/m2 of scaled material in Laboratory A, 1973 g/m2 of scaled material in laboratory B, and 3871 g/m2 of scaled material in Laboratory C. It can also be noted that concrete S3a conformed to the acceptance criterion of ≤ 1500 g/m2 surface scaling after 28 freeze–thaw cycles in laboratory B, while the same concrete exceeded this limit already after eight freeze–thaw cycles in Laboratory A.

Secondly, and perhaps more important, the shape of the cumulative surface scaling curves of the slag-based concretes differed between laboratories: The results from Laboratory A exhibited a steep increase of the scaled material from zero to four freeze–thaw cycles and a much more moderate increase of cumulative surface scaling at later times, almost plateauing towards the end of the test for S3a (Fig. 8a, b). Conversely, the results from Laboratory B showed an approximately constant slope throughout the test duration for both slag-based concretes (Fig. 8c).

Behaviour corresponding to the results for the slag-based concretes tested in Laboratory A has previously been reported for concretes based on blended Portland cements with ≥ 50% blast furnace slag. It was found that for these concretes, the depth at which the slope of the cumulative surface scaling curve decreased, coincided with the depth of carbonation (due to exposure to CO2 during curing and dry storage) [35]. The authors of that study assigned this behaviour partly to coarsening of the microstructure of slag-rich cements during carbonation, and in addition contended that transformation of well-crystallized aragonite and vaterite to poorly crystalline CaCO3 during freeze–thaw attack plays a role as well. For the results of the present testing campaign it is possible that 7-days underwater curing in Laboratory B had led to a more refined microstructure and much less carbonation of the slag-based concrete specimens before freeze–thaw attack than the 7-days sealed-curing in Laboratory A, and that this is the reason for the different results in these laboratories. However, this hypothesis remains to be proven, and it is possible that alkali leaching during immersed curing is also influential on some aspects of the mechanisms for the AAMs described here.

As concretes, particularly with SCM-rich binders, are generally more susceptible to carbonation at an early age [41], an extended curing period before dry storage would be expected to increase the freeze–thaw resistance by decreasing the depth of carbonation before freeze–thaw attack, if the above mechanism is operative. Comparing the results for concretes cured in sealed conditions either for 7 days or for 56 days before dry storage and tested under otherwise identical conditions, it is seen that extended sealed-curing did indeed decrease surface scaling of all tested concretes. However, the improvement in resistance caused by extended curing was only minor to moderate for the slag-based concretes (particularly for S3a; Fig. 8a, b). Thus, it remains uncertain whether carbonation is the main factor influencing freeze–thaw resistance of these concretes, and why the shape of their cumulative surface scaling curves differed so strikingly between Laboratory A and Laboratory B. Further work is evidently needed to fully understand the influence of curing on the freeze–thaw test results obtained for AAMs.

4.3.2 ASTM C666 and ASTM C672

Laboratory D tested the slag-based concretes S1b and S3a according to the method described in ASTM C672, using either 4 wt% CaCl2 solution or 4 wt% NaCl solution as the test solution. With CaCl2, S3a performed slightly better than S1b, in line with the results of the CDF test (Sect. 4.3.1), while testing with NaCl yielded very similar results for S1b and S3a in terms of visual appearance of the surfaces and mass loss (Tables S10 and S11 in the Electronic Supplementary Material). In most cases, “moderate to severe scaling” was observed for both concretes after 90 freeze–thaw cycles.

The test according to ASTM C666 showed a better performance for S3a than for S1b, with the relative dynamic modulus of elasticity after 300 freeze–thaw cycles being 96% for S3a and 87% for S1b (Table S12 in the Electronic Supplementary Material), again in accord with the results of the CDF test as well as testing according to ASTM C672 with CaCl2. These results are, however, not necessarily a confirmation of the CDF and ASTM C672 test results, as testing in the absence of deicing salts causes different damage mechanisms in the concrete compared to testing with NaCl or CaCl2 solution, potentially including differences in crystallisation pressure during freezing [35], although this remains to be proven for AAMs which can have intrinsically high ionic strengths in their pore fluids even without additional salt application.

5 Conclusions

The extent of sulfate damage incurred by AAMs based on slag or fly ash, whether measured by expansion, mass change or visual inspection, seems to be limited. Due to the low expansion values it is difficult to distinguish between the high and the moderate performing mixtures. All three methods used gave similar information and thus seem to be useful to assess sulfate resistance of AAMs based on slag or fly ash. The mixture based on metakaolin showed an excessive, very early expansion, followed by a dimensionally stable period, when tested according to ASTM C1012 exposing the specimens to Na2SO4. This behaviour was previously described in literature [11] and might need to be assessed in the future. In the slag-based concretes, exposure to MgSO4 caused more expansion and visual damage than Na2SO4, however the expansion limits defined in the respective standards were not exceeded. The strength limit of 20 MPa set in ASTM C1012 appears to be rather low as quite immature samples are tested.

Both test methods applied for determination of ASR expansion—the ASTM C1293 and RILEM AAR-3.1 methods—appear to provide essentially reliable identification of expansion due to highly reactive aggregates in AAM. The combination of an AAM binder (regardless of precursor) with a non-reactive or potentially-expansive aggregate did not cause problematic expansion in any case. Only the aggregates of known very high reactivity proved to be problematic when combined with AAM binders. It is necessary to carefully consider the selection of the zero expansion datum point for ASR testing of AAM concretes, as there is some initial expansion in these concretes that is unlikely to be linked to a conventional ASR mechanism. There is also a need to validate test results by comparison with field data, as the acceleration of the testing by heating to 38 °C has not yet been fully justified by comparison with field data obtained over decades or longer, and it may be necessary to more carefully specify pass/fail criteria for this specific type of concrete. Testing at 20 °C does not seem to bring any particular advantages over a 38 °C test, and the results obtained at these two temperatures do appear to be correlated closely.

The CDF (freeze–thaw resistance in the presence of NaCl) test yielded the same relative performance of the high-strength mixes and the moderate-strength mixes in the three participating laboratories. Apart from that, the results from these laboratories differed considerably in terms of absolute values of surface scaling and the relative performance of two classes of concretes (i.e. fly ash-based vs. slag-based), and indicated that curing conditions significantly affect the test results. Testing of the two slag-based concretes according to ASTM C672 was more equivocal, yielding a relative performance that depended on the deicing salt. When tested according to ASTM C666–i.e. without deicing salt—these concretes performed in the expected rank order. These results highlight the important influence of the experimental conditions—including curing of the specimens—on the outcomes of freeze–thaw resistance tests of alkali-activated concretes. These influences need to be better understood before the tests can be applied with confidence.