1 Introduction

In most concrete creep testing, it has been customary to report either the creep coefficient \(\varphi\), representing the creep-to-elastic strain ratio, or the compliance function determined from the measured creep part of strain. Unfortunately, though, information is missing on (1) the rate and evolution of the load as the load is raised to the sustained load level, (2) the moment of zero time for the reported creep data, (3) the magnitude of initial total deformation, and (4) the time corresponding to this deformation. As it turns out, the initial uncertainties can lead to large errors for the entire load duration. Such errors, unfortunately, contaminate virtually all the existing data, including the latest worldwide Northwestern University (NU) database containing over 4000 creep and shrinkage tests [1], which was used to calibrate creep and shrinkage prediction model B4 [2] (the database is downloadable from http://www.civil.northwestern.edu/people/bazant or http://www.baunat.boku.ac.at/creep.html).

Proposed here is a statistical optimization method to filter out these errors and thus decontaminate the database. The method is based on the hypothesis that the initial creep curve ought to be a power law. This hypothesis is here justified physically.

1.1 Uncertainties in initial deformation

The widespread habit of plotting the creep curves in a linear, rather than logarithmic, time scale (Fig. 1) has caused significant errors. The linear scale plot creates an illusion that concrete creep begins only after about several hours of loading. In reality, it begins in much less than a microsecond after instantaneous load application, which cannot be discerned in a linear scale plot. The hydraulic creep test frames (as well as standard testing machines) can measure the deformation for 0.1 s duration. For durations less than a microsecond the evidence comes from measurements of the viscoelastic complex modulus in high frequency vibrations, and of the velocity of elastic shock waves or ultrasound.

Fig. 1
figure 1

Compliance curve (for \(n=0.2\)) plotted in a transformed time scale, and b linear time scale; b, c uncertainties in creep values shown b schematically and c in a plot of Kommendant et al.’s [3] data (these data were found to be perfect, needing no correction)

The compliance function \(J(t,t')\) is defined as the total strain at concrete age t produced by a unit sustained stress applied at age \(t'\), and thus includes the initial instantaneous elastic strain, \(1/E_1\). In theory the load application should be sudden, in the form of Heaviside step function, but this is never followed in practice. Anyway, to ramp up the load without inertia effects in \(<0.1\) s is next to impossible. The creep actually begins already within less than \(<10^{-9}\) s after sudden load application and, typically, the strains obtained within 0.1 s and 1 hour after a sudden load application differ by 30–40%. The practical testing has deviated from the ideal step function in four different ways:

  1. 1.

    The load is always ramped up gradually—at a uniform rate, or a variable rate, or in steps (Fig. 2a). Concrete creep occurs already during the rise of the load. There exists a certain equivalent time for which a sudden step function loading would give about the same creep once the sustained load level is reached, but this equivalent time is not known, and is likely to be shifted compared to the reported times. This time shift, s, is, of course, negligible compared to the load durations of many years, but the problem is that it causes an error in evaluating the initial deformation, which, in turn, leads to a vertical shift \(\Delta\) of the entire compliance curve.

  2. 2.

    Another error in the reported compliance function has often been introduced when the initial, supposedly elastic, deformation was measured by a separate test, e.g., the standardized test of elastic modulus E, and not in the creep test itself. Such initial errors cause the entire compliance-time curve to shift, even by up to 50% of the initial instantaneous strain. This causes a similar error in the reported creep coefficient \(\varphi\).

  3. 3.

    Sometimes the E-value at the time \(t_1\) of loading was not measured at all but was inferred from its relation to the compression strength, \(f_c\), of the concrete, using the empirical ACI equation, given as Eq. A-39 in ACI-R209 [4]). This estimate, though, involves significant error. A further error in the E-value at age \(t_0\) at load application arises when the standard value at age 28 days is used. In that case, another empirical equation, Eq. A-34 in ACI-R209, describing the concrete age effect on E, is used. This equation is also approximate and introduces an additional error.

  4. 4.

    Many reports presented the creep test results only in the form of a plot in a linear time scale. Then, if the time scale ends at 1 h, 1 day, 1 month, or 1 year, it is impossible to distinguish visually loading durations less than 10 s, 10 min, 2 h or 1 day, respectively. This introduces a big uncertainty regarding the actual time of loading as well as the initial deformation.

Fig. 2
figure 2

a Possible histories of initial load application and b their effect on creep curve

Do these errors justify discarding most of the existing big data on creep? Of course not. Over the last eight decades, enormous funds and human effort were invested into worldwide creep testing. The worldwide creep test data, which are now collected in the NU database, comprise over 4000 long-time tests [1]. So, what is needed is to filter out the initial errors contaminating the database. The filtered database should allow a significant improvement in the predictive capability of a multi-decade creep prediction model such as B4 [5] (which became a RILEM recommendation).

The short-time evolution of \(J(t,t')\) is, of course, unimportant for predicting multi-year and multi-decade creep effects in structures. But the problem is that the error in the initial value shifts vertically the entire compliance curve. The shift can be up to 50% of the initial compliance.

It is now clear [6,7,8,9] that the initial creep curve follows approximately the power law \((t - t')^n\), where \(t-t'\) = load duration. So, after subtraction of the correct instantaneous strain, the logarithm of creep strain plotted vs. log\((t-t')\) must give a straight line of slope n. The slope can vary greatly but is typically between 0.05 and 0.40. It is instructive to look at numbers. Assume slope 0.2 and take the classical wisdom that the creep coefficient from 1 day to 1 year is around 2 (which gives \(\lambda \approx\) 1 day). Then, for load durations of 0.1 s, 1 s, 1 min. and 1 h, the total strains, i.e., \(J(t,t')\), typically increases by about 6.5%, 10%, 23% and 53%, respectively, compared to the truly instantaneous strain, which occurs at load duration 0, virtually at \(<10^{-9}\) s.

Compared to the total strain at 0.1 s (which the duration load rises when a valve is opened in a hydraulic system), the strains at 1 min. and 1 h are, respectively, about 16% and 44% larger (Fig. 1a). Compared to the total strain at 1 min., the total strain at 1 h is approximately 24% larger. The 7-day creep coefficient \(\varphi\) is about 1.24 if related to the total strain at 1 s, and 0.81 if related to the total strain at 10 min. Relating the measured creep strain to the elastic deformation calculated on the basis of the standardized test of elastic modulus E, or estimated on the basis of compression strength \(f_c\), gives again rather different values.

These discrepancies are quite significant. They have plagued creep analysis of structures for decades. Although the creep coefficient is convenient for approximate calculation of the structural creep effects in the traditional way, without a computer, it should always be evaluated from the directly measured compliance function \(J(t,t')\). The short-time values of \(J(t,t')\), for less than a day, are unimportant for long-term creep effects in structures, but if they are measured in a different way, they can lead to a big vertical shift of the compliance curve, which matters for all times. Consequently, the testing standards for creep should explicitly state that reporting the test results in terms of the creep coefficient, as well as determining the initial deformation from tests other than the creep test itself, is an erroneous and misleading practice.

The Northwestern University (NU) world-wide database [1, 2] contains data from 1433 creep tests, most of which suffer from this kind of errors. In previous studies, the errors in the initial strain definition were simply ignored. However, regardless of these errors, the NU database contains a wealth of information, which has been obtained at enormous cumulative cost, and cannot be discarded.

So the problem at hand is how to filter out the error from the reported creep data, at least approximately, and generate in a systematic way a filtered database of concrete creep. This filtering is not trivial, which is probably why it has not yet been done. But it is not difficult either, if the recent understanding of early creep is exploited and modern computer optimization techniques are utilized. Therefore, formulating a method to filter the creep database is adopted as the objective of this paper.

1.2 Basic hypothesis: physical justification of power-law

Let us denote \(\theta = t-t'\) = duration of sustained load. We restrict consideration to sufficiently short load durations \(\Delta \theta _{ini}\) after initial loading, during which the material changes due to hydration (or aging) or other phenomena are negligible. In practice, \(\Delta \theta _{ini}\) can be up to about 5% of the age at loading, and in the case of simultaneous drying up to 5% of the prior exposure to constant drying environment. During \(\Delta \theta _{ini}\), the creep process (caused by breakage of interatomic bonds at highly stressed creep sites and their restorations at adjacent sites) must be self-similar. Mathematically this requires the creep strains \(f(\theta )\) and \(f(\theta ^*)\) at any two times \(\theta\) and \(\theta ^*\) under constant load must satisfy the relation

$$\frac{f(\theta )}{f(\theta ^*)} = f \left( \frac{\theta }{\theta ^*} \right)$$
(1)

which means that the response does not depend on the choice of reference time \(t^*\). This is a functional equation, which happens to be easily solved. It can be checked by substitution that the power law with a non-zero exponent n:

$$f(\theta ) = \theta ^n\,\,\,\,\,(n \ne 0)$$
(2)

is a solution. Further it can be shown that the power law is the only solution [10, Sec. 9.4, p. 417].

What is the proper choice of the short-time range, in which self-similarity applies and the creep follows the power law? Examination of test data indicates that it may be chosen as

$$t - t'\, \le 0.05\, t'\,\,\,\,\text{ for } t' \ge 1 \text{ week }$$
(3)

During this period, the advance of hydration as well as autogenous shrinkage is negligible and cannot significantly affect the creep. This is confirmed by the evidence of power law creep in several datasets with many readings during the aforementioned interval (especially the tests of Kommendant et al. [3]). In some data sets, in this range, there is only one data point, or none, and for those, of course, no filtering of errors is possible. The restriction \(t' \ge 1\) week is added to avoid \(t - t'\) being too short.

It may be noted that the power law character of initial creep appears applicable for a short time even when a specimen is exposed to drying simultaneously with loading. But then exponent n may be considerably different.

2 Statistical filtering of imperfect data via optimization

2.1 Imperfect creep coefficient

Based on the aforementioned, physically justified, hypothesis, we assume that, for some initial period, the total strain in creep tests grows as \(\epsilon \propto \tau\) + constant where

$$\tau (n, s) = \theta ^n, \,\,\,\, \theta = t - t' + s$$
(4)

Here s is a certain time shift, not necessarily positive, and exponent n is an empirical constant. While, in models B3 and B4, \(n = 0.10\) was approximately the overall optimum for the whole database, in individual data, the optimal n can vary widely. For many data sets it is not clear whether the beginning of time was taken as the moment at which the full sustained load value was reached, or as another moment during the load application process. Optimally, one should integrate the creep along the loading ramp and find for which time a sudden step-wise load application would give the same initial deformation under full load. An estimated creep law with no aging would suffice for this purpose, but the problem is that the ramp duration and history are unknown for all the data.

Physically, shift s yields the loading time for which a sudden load application with a step function load history would give the same creep curve as the gradual load application in the actual test; see the sketch in Fig. 2. Actually, this condition can be satisfied accurately only after a certain lapse of time, and this is a weakness of the assumption that the initial creep curve is a power law. It could be exactly a power law only for step function loading. In this regard, if the process of applying the load takes time \(\Delta t\), one should better discard data points, if any, during a time interval equal to \(\Delta t\) after the instant at which the load reached the full value of sustained load (Fig. 2). The problem, of course, is that time \(\Delta t\) is generally not reported. But this is a simplification that we have to accept.

It may be noted that recently some researchers [11] interpreted their micro-indentation creep tests during the initial few seconds as approximately logarithmic, which would correspond to \(n \rightarrow 0\); but their data were later found to be fitted slightly better by a power law with \(n = 0.08\) (see Fig. 5 in [9]), although visually the difference is small.

The compliance function of concrete may be written as

$$J(t,t')=\frac{ 1 + \psi (t,t')}{ E_0 }$$
(5)
$$\psi (t,t')= \frac{\tau (n, s)}{\lambda ^n}$$
(6)

where \(E_0\) is the truly instantaneous elastic modulus (while \(1 /E_0\) is the truly instantaneous elastic deformation) (Fig. 3); \(\psi (t,t')\) is the true creep coefficient; \(\lambda\) is a factor (of time dimension) that is to be determined by creep data fitting; \(\lambda\) makes \(\psi (t,t')\) dimensionless; and \(\lambda ^{-n}\) represents a scaling factor of the creep coefficient referred to a zero load duration, which cannot be determined if the value of \(E_0\) has not yet been identified.

Fig. 3
figure 3

True and apparent creep coefficient in the compliance function

In terms of the conventional (or apparent) creep coefficient \(\varphi\), the compliance function is

$$J(t,t') = \frac{ 1 + \varphi (t,t')}{ E_a }$$
(7)

where \(E_a\) it the reported apparent elastic modulus, defined in one way or another by the experimenter (see Fig. 3) in disregard of the short-time uncertainties discussed here.

Equating the right-hand sides of Eqs. (5) and (7), we get

$$\varphi (t,t') = \frac{E_a}{E_0}\, [1 + \lambda ^{-n} \tau (n,s)] - 1$$
(8)

Let us now set

$$x= 1 - \frac{E_a}{E_0}$$
(9)
$$y= \frac{E_a}{\lambda ^n E_0}$$
(10)

where \(E_a\) is the value retrieved from the experimenter’s report. With this notation, Eq. (8) becomes:

$$- x + \tau (n,s) y = \varphi (t,t')$$
(11)

If n and s are specified, this a linear regression equation, in which y is the regression slope and \(-x\) is the regression line intercept (Fig. 4a). The \(\varphi (t,t')\) values expressed as a function of the transformed measurement times \(\tau _i\) (\(i = 1, 2, 3,\ldots N\)) represent the data.

Fig. 4
figure 4

Parameters of creep curve regressions for a creep coefficient and b creep compliance; c misleading schematic of concrete creep curves found in literature

Once the optimum x and y values are obtained, the inverses of Eqs. (9) and (10) give

$$E_0= \frac{E_a}{1-x}$$
(12)
$$\lambda ^n= \frac{E_a}{E_0 y} = \frac{1-x}{y}$$
(13)

where the regression parameters x and y depend of the chosen n and s. According to Eq. (5), \(1/E_a = (1 + \lambda ^{-n}\tau _a)/E_0\) or \(\tau _a = \lambda ^n [ (E_0 /E_a) - 1 ]\). This yields

$$\tau _a(x,y,s) = \frac{x}{y}$$
(14)

If n and s are known, this is the optimum estimate of the unreported loading time (or duration) that corresponds to the reported \(E_a\) value.

The regression based on Eq. (11) minimizes the sum

$$F(x,y,n,s) = \sum _{i=1}^{N_i} [\tau _i(n,s) y - x - {\varphi _a}_i]^2$$
(15)

where \({\varphi _a}_i\) are the apparent creep coefficient values corresponding times \((t-t')_i\) as reported by the experimenter. Because \(\tau _i(n)\) depends on the unknown exponent n and time shift s, the minimization of F(xyns) is a nonlinear optimization problem. This problem is complicated by the need to impose restrictions on the realistic ranges of ns and \(\tau\).

Because of a large possible mismatch in the reported \(E_a\) and \(\varphi\) values, as well as scarcity of sufficiently short times among the creep data, the linear regression can unfortunately produce values of n and \(\tau\) that fall outside reasonable bounds. If, for instance, the n value optimized for some data set could yield a \(\tau _a\) value lying outside the interval (1 s, 4 h) for a spring-loaded test frame, or outside (0.001 s, 5 min.) for a hydraulic test setup, it would be an optimization error attributable to the random scatter of data or to scarcity of short-time readings.

To prevent such optimization outcomes, limits on both n and \(\tau _a\) need to be imposed. In particular, n values exceeding 0.40 are unreasonable, and so are the \(\tau _a\) values outside the range

$${\tau _a}_{\,min} \le \tau \le {\tau _a}_{\,max}$$
(16)

So, a restricted optimization is necessary. A robust and effective way to implement the restrictions is a discretized optimization:

  1. 1.

    Choose a set of discrete, closely spaced, n and s-values such as

    $$n = \, 0.01, 0.02, 0.03,\ldots , n_{max},\,\,\,\,n_{max} = 0.40$$
    (17)
    $$s = \, 0, \pm \, 0.1, \pm \,0.3, \pm \,1, \pm \,3, \pm \,10, \pm \,30, \pm \, 100, \pm \,300, \pm \,1000, \pm \,3000, \pm \,10000\,s$$
    (18)

    Even though the database indicates no exponent n to be less than 0.05, the optimization covers n values down to 0.01. The reason is that recent nano-indentation tests [11] of about 10 s duration indicated exponent n close to 0, and some researchers think that this exponent applies for multi-decade creep, too. The fact that the database optimization yielded the average n to be about 0.3 shows that the ten-second nano-indentation test cannot be extended to hourly durations. Also, choose a set of discrete, closely spaced, \(\tau _a\)-values within the interval

    $${\tau _a}_{\,min} \le \tau _q \le {\tau _a}_{\,max}$$
    (19)

    where one may typically assume

    $${\tau _a}_{\,min} = 1\,\text{ s },\,\,\,{\tau _a}_{\,max} = 4\,\text{ hours }$$
    (20)
  2. 2.

    For each ns and \(\tau _a\), a linear regression subroutine based on Eq. (11) furnishes x.

  3. 3.

    Calculating for each case \(E_a = (1-x)E_0\), one may now exclude the cases for which \(E_0\) lies outside some chosen reasonable interval \(({E_0}_{\,min}, {E_0}_{\,max})\), although this step is needed only if one desires more stringent limits on \(E_0\) than those imposed indirectly by the chosen limits on \(\tau _0\) according to equations \(x = y \tau _a\) and \(E_0 = E_a /(1-x)\).

  4. 4.

    For each n and s and the corresponding \(\tau _a\), evaluate the sum of squares F(xyn).

  5. 5.

    Finally, search for the (ns) combination that yields the smallest F(xyns).

2.2 Imperfect compliance function

Properly, the creep should be characterized in terms of the compliance function \(J(t,t')\) which includes the initial instantaneous (or elastic) deformation. However, the time shift, s, may have led to some vertically shifted compliance function \(\hat{J}(t,t')\). Besides, what was sometimes reported may have been a compliance function constructed either from \(\varphi (t,t')\) or from a measured strain increase after the initial strain reading, using an imprecisely defined elastic modulus \(\tilde{E}\). E.g., the value of \(\tilde{E}\) may have been obtained by a separate short-time test. It may have been determined by the standardized test of elastic modulus, or estimated from the strength of concrete. Such imperfections may be revealed by a significant deviation of the initial period of the curve of \(J(t,t')\) versus t from the power law, \((t-t')^n\) + constant.

From Eqs. (5) and (6), defining creep coefficient \(\psi (t,t')\) in terms of instantaneous modulus \(E_0\), we have

$$\psi (t,t') = \tau (n,s) \lambda ^{-n}\,\,\,\text{ and }\,\,\, \psi (t,t') = E_0 \hat{J}(t,t') - 1$$
(21)

This may be rearranged as

$$X + \tau (n,s) Y = \hat{J}(\tau )$$
(22)

For fixed n and s, this is a linear regression relation (Fig. 4b) in which the intercept and slope of the regression line are

$$X = \frac{1}{E_0}, \,\,\,\, Y = \frac{1}{\lambda ^n E_0}$$
(23)

where X and Y depend on n and s. Once the values of X and Y are solved, one can invert these relations to evaluate:

$$E_0 = \frac{1}{X}, \,\,\,\, \lambda ^n = \frac{X}{Y}$$
(24)

For any fixed n, the linear regression minimizes the sum

$$F(X,Y,n,s) = \sum _{i=1}^{N_i} [\tau _i(n,s) Y + X - \hat{J}_i ]^2$$
(25)

Similar as before, the scatter or scarcity of the short-time data may cause the \(E_0\) value delivered by linear regression to be outside a reasonable range, defined as

$${E_0}_{\,min} \le E_0 \le {E_0}_{\,max}$$
(26)

where it might be suitable to assume

$${E_0}_{\,min} = 1.1 E_a,\,\,\, {E_0}_{\,max} = 5 E_a.$$
(27)

The reason for setting the lower bound on \(E_0/E\) as 1.1 is that the asymptotic initial strain is, and must always be, appreciably less than the measured Young’s modulus. The reason for the upper bound of 5 is that the sub-millisecond creep needed to explain a larger value would be unreasonably large.

Fig. 5
figure 5

Schematic of constrained optimization showing various possible locations of coordinate origin for various horizontal shifts s and vertical shifts \(\Delta\) of the compliance curve

To satisfy these limits, a discrete optimization procedure is pursued as follows (Fig. 5):

  1. 1.

    Choose a set of discrete closely spaced n-values and s values such as

    $$n= 0.01, 0.02, 0.03,\ldots , n_{max},\,\,\,\,n_{max} = 0.40$$
    (28)
    $$\begin{aligned} s= & {} \, 0, \pm \,0.1, \pm \,0.3, \pm \,1, \pm \,3, \pm \,10, \pm \,30, \pm \,100, \pm \,300, \pm \,1000, \\&\pm \,3000, \pm \,10000\,s \end{aligned}$$
    (29)
  2. 2.

    Choose a set of discrete closely spaced \(E_0\)-values within reasonable limits, such as

    $$\frac{E_0}{E_a} = 1.1, 1.2, 1.3, 1.4,\ldots , 5$$
    (30)
  3. 3.

    For each ns and \(E_0\), a linear regression subroutine based on Eq. (25) furnishes X and Y.

  4. 4.

    For each (ns) combination, evaluate

    $$\tau _a(n,s) = \left( \frac{1}{E_a} - \frac{1}{E_0} \right) \frac{1}{Y}$$
    (31)

    and exclude those cases for which \(\tau _a(n,s)\) exceeds reasonable limits, such as those in Eq. (20).

  5. 5.

    For each n and the corresponding \(E_0\), evaluate the sum of squares F(XYns).

  6. 6.

    Finally, search for the (ns) combination that yields the smallest F(XYns).

The vertical shift of the compliance curve is, for both cases, given by

$$\Delta = J(t_1 - t' + s) - \hat{J}(t_1 - t')$$
(32)

in which the \(t_1\) is time of initial reporting (or any time as the vertical shift passes through whole curve).

Note Instead of the foregoing discrete optimization, one can formulate a continuous quadratic objective function of variables xyns, and impose the limits on \(\tau _a\) or E(a) either by the penalty method or by transformation of variables. The resulting optimization problem is highly nonlinear. It has been found that the convergence of such optimization is far slower than that of the present discrete optimization and often does not converge to the overall minimum.

The great advantage of the present discrete optimization is that it can be reduced to linear regression, which is fast, needs no initial estimates, and always gives unique result.

3 Verification of filtering algorithm and examples

The NU database lists, for each set of experiments, the values of Young’s modulus E and of the mean compressive strength \(\bar{f}_c\) of concrete at the age of 28 days. For those sets in which the experimenter did not report E, the database entry gives the E-value at age 28 days that was estimated from concrete strength \(\bar{f}_c\) according to the ACI empirical equation (Eq. A-39 in ACI-R209 [4]). This estimate has a significant error. To convert the 28-day E value to the reported age, \(t_0\), at load application, the empirical equation, Eq. A-34 in ACI-R209, for the age effect on E was used. This introduces an additional error.

For most of tests, the optimized values n and \(E_0/E_a\) are quite scattered. Using raw data in calibration would introduce a systematic error. Therefore, filtering of the data is necessary. Although the data in the NU database are all in terms of compliance, some or many of them may have been calculated from creep coefficient data, and it is not known which ones. So we filter all the data.

3.1 Example of algorithm verification for creep coefficient

Since all the data in NU database are reported in terms of creep compliance, to demonstrate a case study for creep coefficient, we converted one set test using reported initial strain based on the following formula.

$$\varphi (t,t') = EJ(t,t')-1$$
(33)

Kommendant et al. [3] conducted about 20 creep tests none of which is found to require any significant correction. So they are a good choice for sensitivity analysis, and particularly for verifying whether the filtering recovers the original data if they are perturbed by changing the value \(E_a\). To study the algorithm for creep coefficient, the data had to be first converted to the creep coefficient. The specimens were cylinders of diameter 152 mm, tested at various stresses, ages \(t'\) and temperatures. Running the present algorithm showed that no corrections were needed for these data, and even if the constraints in the algorithm were omitted. Nearly the same \(\tau _a\) value was returned by the algorithm for each of the tests; see Fig. 6a, which shows a plot of \(\tau _a^{opt}\) versus \(\tau _a^{test}\) for \(E/E_a = 1\). Slope 1 of the straight line in this plot indicates a perfect agreement, and the data are close to this line, with no systematic deviation to one side or the other (these were probably the best, and best reported, creep tests ever conducted).

The agreement in Fig. 6a confirms the correctness of the algorithm, and the perfect quality of data by Kommendant et al., which makes them suitable for sensitivity analysis. For this purpose, some erroneous E-values were introduced, the present optimization algorithm was run for each of them, and their effect on each of these tests was calculated. The results are shown in Fig. 6b–d in terms of the optimized values of \(\tau _a\) as delivered by Eq. (14) for each of the tests. It is now noteworthy that even changing \(E/E_a\) to 1.01 or 0.99 disrupts agreement with the line of slope 1 markedly, and for 1.05 totally (ratios \(E/E_a < 1\) are, of course, physically impossible, but some testers reported data that give such ratios, and that is another reason why filtering is necessary).

Fig. 6
figure 6

Sensitivity analysis of the \(\tau _a\) with respect to E, based on Kommendant et al.’s data [3]

Most importantly, the filtering algorithm is run for the same data set (Kommendant et al.), shown in Fig. 7a after it is disrupted by setting \(E/E_a\) = 0.95. It is remarkable that, after running the filtering algorithm for the compliance function, the original values for \(\tau _a\) is recovered, as shown in Fig. 7b. This conclusion is a powerful experimental verification which shows effectiveness of the proposed algorithm.

Fig. 7
figure 7

a Unconstrained and b constrained optimization results for deliberately altered Young’s modulus E

This example demonstrates that a measurement error in Young’s modulus E can generate highly distorted values for the creep coefficient and for the optimized \(\tau _a\). It also shows that \(\tau _a\) is very sensitive to the creep coefficient and hence is an important variable to optimize.

3.2 Examples of filtering error in compliance function

Figure 8 shows the original and filtered representation of the creep data reported by Wittmann [12]. The specimens were cylinders of diameter 18 mm and height of 60 mm, made of pure cement paste with w/c = 0.40 and loaded at 80 days after casting. The raw data on creep compliance \(J-J_1\) as reported are shown in Fig. 8a, in the log-time scale, in which the power law is a straight line.

Fig. 8
figure 8

a Original (line fitted through solid points only) and b filtered representation of test data, reported by Wittmann [12]

It is seen that only the last four data points (solid circles) conform to the power law but the earlier ones (empty circles) deviate downwards significantly. The optimization algorithm for compliance was run for these data, yielding the revised plot in Fig. 8b. As seen, the filtered data are in excellent agreement with the power law and, after filtering, one finds from Eq. (32) the need for a significant vertical shift of the entire compliance curve, \(\Delta = 54.2 \,10^{-6}/\hbox {MPa}\).

Müller et al.’s data [13] for concrete, shown in Fig. 9a for one of their 18 tests, reveal a similar behavior as Wittmann’s. Again, the initial points (empty circles) deviate significantly downwards from the power law line fitted to the last five points (solid circles). This deviation for short-times is even more noticeable than for the Wittmann’s data which the error in the ill-reported initial elastic strain, associated with the required time shift, is here comparable to the compliance value itself. The other specimens tested by Müller et al. exhibit similar behavior. These specimens were loaded after 365 days, and so the power law validity should extend up to about a week. This makes these data excellent candidates to demonstrate the benefit of filtering. Indeed, the filtered data in Fig. 9b show a major benefit.

Fig. 9
figure 9

a Original (line fitted through solid points) and b filtered representation of the Müller et al. [13] test data

The power-law for creep is best represented by a straight line in the plot of \(\log (J-J_0)\) versus \(\log (t-t')\). But if the time of the first data point is specified as zero, then this point cannot be plotted (as \(\log 0 \rightarrow -\infty\)). If that point came from a linear time scale plot, then the time assigned to this point was the shortest time discernible visually, e.g., 1 s. For the data of Wittmann, and also of Müller et al., the ACI-R209 formula for estimating the elastic modulus from the strength gave a higher value than the first measured data point. In that case, the initial data point (\(J_1\)) was taken as the reported initial strain for the time corresponding to the first data point.

Further consider the tests by Anders [14]. They were performed for different ages \(t'\) but only two tests performed at age 28 days will be considered (for the others, \(t'\) was so low that hydration intervened in the initial period). The specimens were cylinders of diameter 150 mm and height of 450 mm. The time for the first point in the database was listed as 0, which is not feasible in practice. The original report could not be retrieved.

So it was assumed that the time corresponding to the first point was 0.1 s, which may be the fastest loading attainable by sudden opening of a valve in a hydraulic creep frame. With this interpretation, the data are plotted in the transformed time scale in Fig. 10.

Fig. 10
figure 10

Optimized fit of Anders test data [14] when time shift is a disallowed or b allowed

If no time shift is included in the optimization, a reasonably constrained optimization cannot handle these data (Fig. 10a). But when the full optimization algorithm with time shift s is applied to these data, the plot in Fig. 10b is obtained. As seen, an excellent power-law fit is achieved, and the likely loading time for the first point is also recovered.

3.3 Power-law exponent and its optimum for all short-time data

An unconstrained optimal fitting of the test data in the NU database showed that the n value was in most tests between 0.2 and 0.3 (Fig. 11a). Only in a tiny portion of the database, n was less than 0.05 and more than 0.4.

Fig. 11
figure 11

a Distribution of power law exponent n in NU database, and b overall C.o.V. of deviations from test data as a function of n

Let us now determine the optimum exponent n for all the tests in the database with sufficient data during the first day of sustained load, and the sensitivity of error to the value of n. The optimum minimizes the square of the combined coefficient of variation (C.o.V.) of regression errors, \(\omega _{all}\), which is calculated as

$$\omega _{all} = \sqrt{\frac{1}{N} \sum _{i=1}^{N} \omega _i^2}$$
(34)

where \(\omega _i\) (\(i=1,2,3\ldots N\)) is the individual coefficient of variation of the \(i{\mathrm{th}}\) test data set in the NU database, and N is total number of tests in the database. To find how \(\omega _{all}\) changes when n deviates from the optimum, discrete values n = 0.05, 0.1, 0.15, ..., 0.4 were prescribed for all the creep test series in the database, and then \(\omega _{all}\) was calculated for each prescribed n, and plotted in Fig. 11b. The optimum exponent is n = 0.3, for which \(\omega _{all}\) = 11.1%.

Varying n does not increase the combined C.o.V of errors dramatically. Decreasing n from 0.3 to 0.2 would change \(\omega _{all}\) by less than 2%, and would still give almost equally good fits of the data and C.o.V increases under 2% (Fig. 11b). So, the error sensitivity to the overall n of the database is quite low. An n-value representing a sharp optimum can be obtained only for an individual creep test. The scatter among different concretes in the database dominates the errors and is what prevents a sharp optimum.

4 Five power-law regimes in the evolution of concrete creep

Combining the present conclusion about overall n with the previous knowledge, one can discern a sequence of five regimes of power laws \(\theta ^n\) (where \(\theta = t-t'\)):

$$\text{ for } t-t' < 10 \text{ s }\,\,\,\quad n \approx 0.08$$
(35)
$$\text{ for } \text{10 } \text{ s } \le t-t' < \text{1 } \text{ day }\,\,\,\quad n \approx 0.30$$
(36)
$$\text{ for } \text{1 } \text{ day } \le t-t' < \text{1 } \text{ year }\,\,\, \quad n \approx 0.10$$
(37)
$$\text{ for } \text{1 } \text{ year } \le t-t' < 10^3 \text{ years }\,\,\, \quad n \rightarrow 0$$
(38)
$$\text{ for } t-t' \ge 10^3 \text{ years }\,\,\, \quad n = 1$$
(39)

The transition between the subsequent regimes are very gradual, spread-out, and the above-mentioned times are those at which the transitions are centered. The initial regime for \(t-t'\) less than about 10 s was studied by means of creep micro-indentation tests of hardened cement paste by Vandamme et al. [11]. They proposed to represent their data by a logarithmic curve, which corresponds to \(n \rightarrow 0\) because

$$\lim _{n \rightarrow 0} \frac{\text{ d }}{\text{d}t} \left( \frac{t^n}{n} \right) = \lim _{n \rightarrow 0} t^{n-1} = t^{-1} = \frac{\text{ d }}{\text{d}t}\ (\ln \, t)$$
(40)

So the creep rate for \(n \rightarrow 0\) is still a power law, which again satisfies the functional equation (1). However, reanalysis of Vandamme et al.’s [11] data in [9] revealed that the best fit of these data is obtained by a power law with n = 0.08, as stated above, although visually the difference from the logarithmic curve is quite small (later it was proposed that the near-logarithmic creep in the ten-second nanoindentation test explains why multidecade creep is logarithmic; but this argument involved some hypotheses open to doubt).

The second regime is the present case. The third regime with \(n \approx 0.10\) was identified by statistical optimization of large databases for the purpose calibration of RILEM creep prediction models B3 [6] and B4 [5] and was incorporated into a general expression of \(\text{ d }J(t,t')/\text{d}t\), which also covers the transitions between the regimes with n = 0.10 and 0.

The fourth regimes, \(n \rightarrow\) 0, is the long-time creep evolution, which is logarithmic, as reflected in Eq. (40). For all practical purposes, it is the terminal regime of long-time concrete creep, as proposed in 1975 in [15] and first incorporated into engineering society recommendations in 1995 (RILEM model B3). As a debatable point, it may be mentioned that some researchers [11] suggested the logarithmic multidecade creep to be a necessary consequence of a supposedly logarithmic creep seen in the aforementioned 10-second nanoindentation experiments (which itself is more likely a power law, as mentioned above).

The fifth and last regime is purely speculative and, for the service stress range of concrete, experimentally unverifiable. It is irrelevant for structural engineering practice. But it is interesting theoretically, e.g., because of analogy with the creep shale [16] and other rocks over geologic times. In materials science, it is generally accepted that the primary creep (which is all that applies to concrete practice) eventually transits to a secondary creep, which is constant-rate viscous flow and depends on the sustained stress nonlinearly. The necessity of this transition may be explained by the fact that (except in perfect crystals) every deformation increment must activate new creep sites (i.e., sites of nanoscale slip at overstressed locations in the microstructure; see [16, Fig. 6]). The gradual transition between the primary and secondary creep is centered at the so-called Maxwell time [16], which has been roughly estimated from a certain geological evidence to be between \(10^4\) and \(10^6\) years for rocks such as shale, and is probably shorter for concrete. The secondary creep of concrete should be observable at normal times only under very high shear stress combined with hydrostatic pressure high enough to prevent all microcracking.

The physical reasons for the changes in power-law exponent through subsequent time periods deserve further study.

5 Conclusions

  1. 1.

    The correct time scale for graphical presentation of the test data on creep (as well as shrinkage) is the logarithmic scale. Reporting in the linear scale of time (or test duration) is grossly misleading.

  2. 2.

    The best way of reporting concrete creep data is a numerical table (nowadays on a computer), in which the measurement times should be refined in an approximately geometric progression as the test duration decreases; e.g., \(t-t'\) = 0.1, 0.316, 1, 3.16, 10, ..., 1000, 3160, 10,000 s.

  3. 3.

    For a short initial period during which the changes in the effects of hydration and selfdesiccation (and possibly also of drying) are negligible, a scaling analysis shows that the creep strain must initially evolve as a power law.

  4. 4.

    Most reported test data are imperfect and need corrections. The time entering the compliance function needs to be corrected by a certain time shift due to the fact that the process of load application takes a finite time, ranging from 0.1 s to even several hours, which is a circumstance usually unreported. The time shift is optimized such that the reported initial strains would follow a power function of time. The time shift produces a vertical shift of the entire compliance function curve.

  5. 5.

    The creep coefficient values as well as the compliance function values are rather sensitive to Young’s modulus E which, in turn, varies greatly with the time, \(\tau _a\) (needed to apply the load and the load application history). Mixing data with very different \(\tau _a\)-values introduces large errors into the creep database. Their correction leads to a vertical shift of the compliance function curve.

  6. 6.

    Based on the foregoing conclusion, the way to filter out the errors from the database is to use statistical optimization so as to shift the time and strain scales with the objective of making the initial creep data fit a power function of time as closely as possible.

  7. 7.

    The sum-of-least-squares optimization problem is reducible to a linear regression that is subjected to several nonlinear constraints, which are best tackled in a discrete form.

  8. 8.

    Enforcing a power law form of the initial creep curve generally leads to closer fits of test data.

  9. 9.

    The optimum exponent n of the power law for initial creep varies from one data set to another, but is mostly between 0.10 and 0.35. The optimum for the whole database, minimizing the coefficient of variation of errors, is \(n \approx 0.3\).

  10. 10.

    After filtering out the errors, the corrected database will allow better calibration of the creep prediction model such as B3 or B4.