1 Introduction

For materials operating at high temperatures, the understanding of creep and its interaction with other damage mechanism such as fatigue and oxidation is of great importance. Indeed, creep is the dominant failure mechanism for pipework that is used to transport steam from boilers to turbines in power plants. 2.25Cr–1Mo is a main stay steel used for such structural components within the UK’s aging power plants—where the usual service conditions for heater tubes is around 840 K and 35 MPa. Under these static conditions, this material is designed to resist creep failure up to 250,000 h of operation. The development of new materials that can operate at higher temperatures, to improve efficiency levels, is however considerably hampered by the fact that currently there are few creep models that have proved capable of accurately extrapolating accelerated test data (out only to around 5000–10,000 h) to such operating conditions.[1] Studies on low alloyed steels typically reveal substantial downward revisions in estimated creep life as longer-term test results become available over time.

Many of the traditional parametric creep models such as the Larson–Miller,[2] Manson–Haferd,[3] and the Dorn and Shepherd[4] models have as their basis the simple Arrhenius expression for relating minimum creep rates to temperature. The Dorn-Shepherd model is usually written in terms of the minimum creep rate \({\dot{\varepsilon }}_{\text{m}}\), but using the Monkman–Grant[5] relation with an exponent of unity, it can also be written in terms of time to failure tF

$${t_{\rm{F}}} = \frac{M}{{{{\dot \varepsilon }_{\rm{m}}}}} = D{\sigma ^{( - n)}}{\rm{exp}}\left( {\frac{{{Q_c}}}{{{\rm{RT}}}}} \right),$$
(1a)

where σ is stress, T is absolute temperature, and M, n (sometimes termed Norton’s n), and D are unknown model constants. Qc is the activation energy for creep. This can be linearized using natural logs

$${\rm{ln}}\left( {{t_{\rm{F}}}} \right) = \left\{ {{\rm{ln}}\left( D \right) + {Q_{\rm{c}}}\left( {\frac{1}{{{\rm{RT}}}}} \right)} \right\} - {\rm{nln}}\left( \sigma \right),$$
(1b)

so that at constant temperature the model predicts a linear (and negative) relationship between ln(tF) and ln(σ) with the effect of temperature then being to shift this linear relationship in a parallel fashion (i.e., change only the intercept D in the model). When this model is applied to minimum creep rates obtained by the Japanese National Institute for Materials Science (NIMS)[6,7] on one batch (termed MAF) of 2.25Cr–1Mo steel, the results seen in Figure1(a) are obtained. Figure 1(a) plots various isothermal lines and it can be seen that Norton’s n [the power of x in Figure 1(a)] is not a model constant. Clearly, n tends to diminish with increasing temperature.

Fig. 1
figure 1

The variation in minimum creep rates shown in NIMS creep data sheets 3B & 50A[6,7] with (a) stress at constant temperature and (b) reciprocal of temperature at constant stress

Figure 1(b) plots various iso-stress lines, and the activation energy for creep is also far from a constant—tending to diminish with increasing temperature. But again, is not clear if this is just a temperature dependency as the lower Qc value [of 325 kJ mol−1—the power of e in Figure 1(b)] also occurs at the lowest stress. It is this variation in n and Qc that leads to the OSD model producing unreliable long-term predictions based on shorter-term data.

There is a tendency in the literature to attribute these changes in n and Qc values to different creep mechanisms applying over different stress and temperature ranges. Some solutions to this problem of changing creep mechanisms are present in the literature. A recent approach taken by Ding et al.[8] is to specify a separate equation to explain creep rates due to grain boundary sliding, dislocation creep, and dislocation glide. The overall creep rate is then simply the sum of the creep rates predicted by these mechanisms. Integration using true stress and true strain then yields an expression for predicting the observed creep curve at any point in time based on these evolving creep mechanisms. The approach is readily extended to encompass other mechanisms—for example to include a role for high-temperature oxidation. By modeling the whole creep curve, this approach can predict the role of both primary creep and microstructure evolution on time to failure.

This paper makes use of varying parameter models common in the statistical literature to deal with the non-constancy of n and Qc. Recently, Evans[9] made a first attempt at this by proposing a semi-parametric approach based on the LOESS technique. Here, the parametric form of a particular creep model corresponding to a fixed temperature is estimated by least squares or weighted least squares. For example, the OSD model at constant temperature has the form

$${\rm{ln}}\left( {{t_{\rm{F}}}} \right) = a - {\rm{nln}}\left( \sigma \right),$$
(1c)

where \(\text{ln}\left(D\right)+{Q}_{\text{c}}\left(\frac{1}{\text{RT}}\right)\) is a constant if T is held fixed. LOESS then involves selecting one of the test stresses (σ0) in the test matrix and applying least squares or weighted least squares to estimate a and b, but instead of using all test results, only the nearest ones to σ0 are used. This is then repeated for all values of σ0 in the test matrix and so what emerges from such repetition are values for a and b that can be associated with each stress. By applying this OSD model over such a local range of stresses, the creep mechanism remains approximately constant, so allowing for more accurate creep life predictions. This LOESS model was shown by Evans[9] to produce better extrapolations of creep life than a fully parametric OSD model.

However, this approach comes with several limitations and complications. First, the functional relationships between the parameters a and b and stress can only be identified by plotting a and n against stress, and then as a second stage, fitting trend lines to such plots. From a statistical perspective, such a two-stage estimation procedure produces less efficient estimates of the parameters of this trend line than any potential one-step procedure. By less efficient it is meant the parameters are not estimated with the smallest possible variance or level of sample variation uncertainty. Furthermore, the approach says nothing about the correct functional form of this trend line, which is currently unknown.

Secondly, LOESS only works for models containing one explanatory variable, but time to failure also depends on temperature. Evans[9] incorporated temperature into this LOESS model by introducing sub repetitions within the above-described repetitive process. But this came at cost. A relatively minor issue is that it results in a rather computer intensive estimation procedure that some readers and practitioner’s dislike. For example, Evans[9] incorporated T into the OSD model using a temperature compensated failure time

$${\rm{ln}}\left( {{t_{\rm{F}}}} \right) - \frac{{{Q_{\rm{c}}}}}{{RT}} = a - {\rm{nln}}\left( \sigma \right),$$
(1d)

where now a = ln(D). LOESS then involves guessing a value for the activation energy Qc, selecting one of the test stresses (σ0) in the test matrix and applying least squares or weighted least squares to estimate a and b, using nearest test results to σ0. This is then repeated for all values of σ0 in the test matrix and so what emerges from such repetition are values for a and b that can be associated with each stress. Finally, Qc is varied in a structured way until the fit to the temperature-adjusted failure times is maximized,

While Evans[9] showed that temperature adjusting stress as well as times opens up the LOESS procedure to other well-known creep models such as the Soviet model[10] and the Minimum Commitment Methods[11], it forces the activation energy to be a constant that does not vary with stress. Yet, changing creep mechanisms will often be associated with a change in activation energy, and Evans[9] stated that this limitation warranted future research to relax the constraint of a constant Qc. This paper presents the functional coefficient model as a solution to this identified shortcoming of LOESS. Closely related to the above point is the fact that each local regression in the above LOESS procedure is over a local range of stresses and not temperatures. But creep mechanisms also vary with temperature and the local regressions may therefore not be over constant creep mechanism conditions. So, it may be possible to improve life predictions by defining local in terms of stress and temperature, i.e., by local test conditions and not just local stress.

The first issue mentioned above (an unknown functional relationship between n and Qc with both stress and temperature) can be dealt with by approximating these unknown functions with a Taylor series expansion. This can be of any order, but this paper restricts itself to first and second-order Taylor series approximations. The second problem can be dealt by (i) including T as an additional explanatory variable on the right-hand side of Eq. [1c] and (ii) defining close in a local regression as similar stress-temperature combinations to the one under investigation. This allows Qc to vary with stress and temperature and allows estimation of all the models unknown parameters to be done in a single stage or step. The result is a functional coefficients model[12].

Therefore, the aim of this paper is to implement the suggestion made by Evans[9] of developing a fully variable parameter model (allowing Qc to vary as well) to predict creep failure times. This is done by introducing readers to the functional coefficients model that overcomes the limitations of the LOESS procedure proposed by Evans. The second aim is to assess whether this model is still capable of producing predictions of time to failure that are i. at least comparable with that produced by LOESS and ii. superior to a fully parametric creep model. The third aim of the paper is to try and explain the pattern of the varying creep model parameters that the functional coefficients model produces, in terms of changing creep mechanisms. To achieve these aims, experimental failure time data on 2.25Cr–1Mo steel produced by NIMS [6,7] will be used. The next section therefore describes these NIMS data. This is followed by a method section that outlines the structure of a functional coefficient model to reveal its inherent flexibility in creep model specification compared to the LOESS technique. A section on how the unknown parameters of such a model are estimated then follows. The penultimate results section then compares the functional coefficients models to a fully parametric OSD model and a LOESS version of this model. Finally, conclusions are drawn.

2 The Data

This paper makes use of the information in Creep Data Sheets 3B, 50A, published by the Japanese National Institute for Materials Science (NIMS).[6,7] These have extensive data on twelve batches of 2.25Cr–1Mo (according to ASTM A 387, Grade 22) steel where each batch has a different chemical composition that underwent one of four different heat treatments–details of which are given in Reference 6. This paper makes use of just one of these batches, the MAF batch, which was in tube form that had an outside diameter of 50.8 mm, a wall thickness of 8 mm, and a length of 5000 mm with a chemical composition of Fe–2.46 Cr–0.94Mo–0.1C–0.23Si–0.43Mn–0.011P–0.009S–0.008Ni–0.07Cu–0.005Al. Specimens for creep testing were taken longitudinally from this material. Each test specimen had a diameter of 6 mm with a gauge length of 30 mm. The creep tests were obtained over a wide range of conditions: 400−22 MPa and 723–923 K. For the MAF batch (and only this batch), both minimum creep rates and time to failure measurements were recorded, together with the times to attain various strains − 0.005, 0.01, 0.02 ,and 0.05. Figure 2 plots the creep failure times obtained for this MAF batch at the different stresses and temperatures used.

Fig. 2
figure 2

Relationship between stress, temperature, and time to failure for the MAF batch of 2.25Cr–1Mo steel contained within NIMS creep data sheets 3B &50A[6,7]

3 Method

3.1 Fully Parametric Creep Models

The Orr–Sherby–Dorn (OSD) and Larson–Miller (LM) models have as their basis the Arrhenius relation, that when applied to creep explains the relationship between absolute temperature and the minimum creep rate at a constant load or stress. The Arrhenius expression is typically used to relate the minimum creep rate to temperature but combining it with the Monkman–Grant relation also allows it to also be written in terms of the time to failure

$${\dot \varepsilon _{\rm{m}}} = {\rm{Aexp}}\left( { - \frac{{{Q_{\rm{c}}}}}{{{\rm{RT}}}}} \right) = \frac{M}{{{t_{\rm{F}}}}},$$
(2a)
$${t_{\rm{F}}} = \frac{M}{A}{\rm{\;exp}}\left( {\frac{{{Q_{\rm{c}}}}}{{{\rm{RT}}}}} \right).$$
(2b)

The OSD model has as its foundation this Arrhenius relation and introduces the effect of stress on the time to failure by making the parameter M/A a function of stress. More specifically, the OSD model postulates that

$${\rm{ln}}\left( {{t_{\rm{F}}}} \right) = \left\{ {{\rm{ln}}\left( {\rm{D}} \right) + {Q_{\rm{c}}}\left( {\frac{1}{{{\rm{RT}}}}} \right)} \right\} - {\rm{nln}}\left( \sigma \right),$$
(2c)

\(M/\text{A}={f}_{\text{OSD}}\left(\sigma \right)=D{\sigma }^{-n}\), so that

Consequently, in the OSD model, the effect of temperature is to shift in a parallel fashion the isothermal lines when plotted in ln(tF)–ln(σ) space. As temperature changes by ΔT, the slope of the isothermal line remains unchanged at the value given by n in Eq. [2c], but its intercept changes by \({Q}_{\text{c}}\left(\frac{1}{\text{R}}\Delta {\text{T}}\right)\). If σ is replaced with ln(ln(σTS/σ)) in Eq. [2c], the Wilshire[13] creep model emerges (where σTS is the materials tensile strength).

The LM model also has as its foundation the Arrhenius relation, but in this model it is parameter Qc that is made a function of stress \({Q}_{\text{c}}={\text{f}}_{\text{LM}}(\sigma )\). This functional relationship is postulated differently in different applications of the LM model, but a typical specification is Qc = B0 + B1ln(σ), where the B are further model parameters. Substituting this into Eq. [2c] gives one representation of the LM model

$${\rm{ln}}\left( {{t_{\rm{F}}}} \right) = {\rm{ln(}}M/A) + {B_{\rm{o}}}\left( {\frac{1}{{{\rm{RT}}}}} \right) + {B_1}\left( {\frac{{{\rm{ln}}\left( \sigma \right)}}{{{\rm{RT}}}}} \right).$$
(2d)

Consequently, in the LM model, the effect of temperature is to shift and rotate the isothermal lines when plotted in ln(tF)–ln(σ) space. The LM model predicts that the slope of the isothermal lines should be given by \(\left(\frac{{\text{B}}_{1}}{\text{RT}}\right)\) and so changes with temperature. The LM model also predicts that the intercept of the isothermal lines is given by ln(\(M/A)+{B}_{\text{o}}\left(\frac{1}{\text{RT}}\right)\) and so also changes with temperature.

The changing intercept of the isothermal lines shown in Figure 1(a) with changes in temperature is consistent with the OSD model, but the reduction in Norton’s n (exponent on x) seen in Figure 1(a) is not. This suggests that a sensible way to progress is to allow n to vary with tests conditions in the OSD model—i.e., to generalize the OSD model. But should this be with respect to stress or temperature? Because decreasing stresses are associated with increasing temperatures in Figure 1(a), it is not clear whether the observed fall in the value of n seen in Figure 1(a) is the result of a fall in stress or a rise in temperature. In the absence of any additional information, it is decided that the intercept of the isothermal lines be made temperature dependent, as in the original OSD model, but instead of treating the slope as a constant, it is made dependent on stress. Call this the generalized OSD model (GOSD).

3.2 Functional Coefficients Representation of the Generalized OSD Model

A functional coefficients model starts with a specification of the relationships between a dependent variable and one or more explanatory variable. It then introduces other explanatory variables by using these additional variables to explain how the parameters of that relationship change. The functional coefficients representation of the GOSD parametric creep model starts with the isothermal representation in the OSD model

$${\rm{ln}}\left( {{t_{\rm{F}}}} \right) = a + b{\rm{ln}}\left( \sigma \right),$$
(3)

where b < 0 (and b = − n). Then, parameter a is made a function of temperature a = f(1/RT) and b a function of stress b = g(ln(σ)). Other specifications are possible and lead to other well-known creep models. The difference between these well-known models and the functional coefficients model is that the functions f() and g() are left unspecified. Instead, they are approximated using a Taylor series. If this variation is modeled using a 1st order Taylor series expansion, then

$$a = {a_{\rm{o}}} + {a_1}{\left[ {\left( {\frac{1}{{{\rm{RT}}}}} \right) - \left( {\frac{1}{{{\rm{R}}{{\rm{T}}_{\rm{o}}}}}} \right)} \right]},$$
(4a)
$$b = {b_{\rm{o}}} + {b_1}\left( {\ln (\sigma ) - \ln ({\sigma _{\rm{o}}})} \right),$$
(4b)

where bo = g(ln(σo)), ao = f(1/RTo), b1 \(=\text{dg}(\text{ln}\left({\upsigma }_{0}\right))/\text{dln}({\sigma }_{0}),\) a1 \(=\text{df}(1/\text{R}{T}_{0})/d(\frac{1}{{\text{RT}}_{0}})\)—i.e., each derivative is evaluated at specific values for stress and temperature, σ=σo and T = To. This approximation can be done for all values of stress and temperature in a test matrix and so there are values for bo, b1, ao, and a1 at each test condition making up the creep data set.

If a second-order Taylor series expansion is used to approximate g(ln(σ)) and f(1/RT), then Eqs. [4] become

$$b = {b_{\rm{o}}} + {b_1}(\ln (\sigma ) - \ln ({\sigma _{\rm{o}}})) + {b_2}{(\ln (\sigma ) - \ln ({\sigma _{\rm{o}}}))^2},$$
(5a)
$$a = {a_{\rm{o}}} + {a_1}\left[ {\left( {\frac{1}{{{\rm{RT}}}}} \right) - \left( {\frac{1}{{{\rm{R}}{{\rm{T}}_{\rm{o}}}}}} \right)} \right] + {a_2}{\left[ {\left( {\frac{1}{{{\rm{RT}}}}} \right) - \left( {\frac{1}{{{\rm{R}}{{\rm{T}}_{\rm{o}}}}}} \right)} \right]^2},$$
(5b)

where now bo = g(ln(σo)), ao = f(1/RTo), b1 \(=\text{dg}(\text{ln}\left({\sigma }_{0}\right))/d\text{ln}({\sigma }_{0}),\) a1 \(=\text{df}(1/{\text{RT}}_{0})/d(\frac{1}{{\text{RT}}_{0}}),\) b2 \(=0.5{\text{d}}^{2}\text{g}(\text{ln}\left({\sigma }_{0}\right))/d{\text{ln}({\sigma }_{\text{o}})}^{2},\) a2 = \(0.5{\text{d}}^{2}\text{f}(1/{\text{RT}}_{0})/d{(1/{\text{RT}}_{0})}^{2}\)—with each derivative evaluated at σ= σo and T = To. So, the first model to be used in this paper will be termed a first-order functional coefficient version of the generalized OSD model (or GOSD1 for short) and the second model used in this paper is the second-order functional coefficient version of the generalized OSD model (or GOSD2 for short).

3.3 Estimating the Functional Coefficients

The technique of least squares can be used to estimate values for all the above-mentioned derivatives by including a random error term ε to pick up the stochastic nature of creep failure times. Then inserting Eqs. [4] into Eq. [3] gives the single equation representation of the GOSD1 model

$$\ln ({t_{\rm{F}}}) = \left[ {{a_0} - {\rm{ }}{a_1}\frac{1}{{{\rm{R}}{{\rm{T}}_{\rm{o}}}}}} \right] + \left[ {{b_0} - {b_1}\ln ({\sigma _{\rm{o}}})} \right]{\rm{ln}}(\sigma ) + {a_1}\frac{1}{{{\rm{RT}}}} + {b_1}{[{\rm{ln}}(\sigma )]^{2\;}} + \varepsilon$$
(6a)

Values for a1, b1, [a0a1 \(\frac{1}{{\text{RT}}_{\text{o}}}\)], and [b0b1 \(\text{ln}({\sigma }_{o})\)] are then chosen to minimize Σε2, where the sum is over all test conditions, of which there are say N, in the test matrix. Such a minimization has a well-known solution. This least square procedure therefore yields direct estimates for a1 and b1. Parameters a0 and b0 are obtained indirectly by substituting in each existing stress-temperature test combination for σo and To in Eq. [6a]. For example, the least squares procedure will yield a value for [a0a1 \(\frac{1}{{\text{RT}}_{\text{o}}}\)] and a1. If the least squares estimate for [a0a1 \(\frac{1}{{\text{RT}}_{\text{o}}}\)] is say θ, then the value for a0 at a temperature of 823 K would be given by ao = θ + a1 \(\frac{1}{(8.314)823}\)].

Inserting Eqs. [5] into Eq. [3] gives the single equation representation of the GOSD2 model

$$\begin{array}{ccccc}\ln ({t_{\rm{F}}}) = & \left[ {{a_0} - {a_1}\frac{1}{{{\rm{R}}{{\rm{T}}_{\rm{o}}}}} - {a_2}{{\left( {\frac{1}{{{\rm{R}}{{\rm{T}}_{\rm{o}}}}}} \right)}^2}} \right] + \left[ {{b_0} - {b_1}\ln ({\sigma _{\rm{o}}}) + {b_2}{{({\rm{ln}}(\sigma ))}^2}} \right]\ln (\sigma )\\ & \quad+ \left[ {{a_1} - {\rm{2}}{a_2}\frac{1}{{{\rm{R}}{{\rm{T}}_{\rm{o}}}}}} \right]\frac{1}{{{\rm{RT}}}} + [{b_1} - 2{b_2}\ln ({\sigma _{\rm{o}}})]{[\ln (\sigma )]^2} + {a_2}{\left( {\frac{1}{{{\rm{RT}}}}} \right)^2} + {b_2}{[\ln (\sigma )]^3} + \varepsilon \end{array}$$
(6b)

So at each test condition a2, b2 are estimated directly and the others indirectly. Higher order approximations can also be used. As Taylor series approximations are more accurate the closer the test condition is to (To,σo), an additional way of improving the predictive accuracy is to accept that any specified functional coefficients model is only a realistic descriptor of creep over a restricted range of test conditions, for example, over conditions where the creep mechanism remains unchanged around (To,σo). This can be achieved using the technique of weighted least squares. As stress and temperature are measured in different units, they must first be standardized using their means and standard deviations

$${x_1} = [{\rm{ln}}(\sigma ) - {\rm{mean}}\;{\rm{of}}\;{\rm{ln}}(\sigma )]/{\rm{standard}}\;{\rm{deviation}}\;{\rm{of}}\;{\rm{ln(}}\sigma ).$$
(7a)
$${x_2} = [1/{\rm{RT}} - {\rm{mean}}\;{\rm{of}}\;1/{\rm{RT}}]/{\rm{standard}}\;{\rm{deviation}}\;{\rm{of}}\;1/{\rm{RT}}.$$
(7b)

The closeness of each test condition to test condition (To, σo) can then be measured as

$$d = \left| {{x_1}-{x_{1,{\rm{o}}}}] + } \right|{x_2}-{x_{2,{\rm{o}}}}|,$$
(7c)

where x1,o is the standardized value for ln(σo) and x2,o is the standardized value for 1/RTo. Other distance measures, such as the Euclidean distance, can also be used, but using absolute values is more likely to make the distance measure robust to the presence of any unusually small or large failure times, i.e., to any outlying data points. Kernels can be used to determine the values for the weights, w. There are many such Kernels in common use, but the one used by Cleveland [14] for LOESS procedures is the tri-cubic Kernel

$${w^2} = {K_{\rm{h}}}(d/h) = \left\{ {\begin{array}{*{20}{c}}{0.864{{\left( {1 - {{\left( {\frac{d}{{\rm{h}}}} \right)}^3}} \right)}^3}}&{{\rm{when\;}}\left| {\frac{{\rm{d}}}{{\rm{h}}}} \right| < 1}\\0&{{\rm{otherwise}}}\end{array}} \right.,$$
(7d)

and where h is a selected band width in the same “units” as d. While other kernels can be used (the Epanechnikov Kernel is also popular), the different Kernels tend to give similar weight values, but some Kernels may give better lifetime predictions than others. This could form a possible topic for future research but is not the main scope of this paper. This paper uses the tri-cubic Kernel to enable broad comparisons to the results of Evans[9] that were obtained using this Kernel within his LOESS model.

For a selected target test condition (To, σo), \(w\text{ln}({t}_{\text{F}})\) is regressed on the variables w, ln(σ)w, (ln(σ))2w, and w/RT (by minimizing Σe2 in Eq. [8a]))

$$w\ln ({t_{\rm{F}}}) = \left[ {{a_0} - {a_1}\frac{1}{{{\rm{R}}{{\rm{T}}_{\rm{o}}}}}} \right]w + \left[ {{b_0} - {b_1}\ln ({\sigma _{\rm{o}}})} \right]{\rm{ln}}(\sigma )w + {a_1}\frac{w}{{{\rm{RT}}}} + {b_1}w{[{\rm{ln}}(\sigma )]^{2\;}} + e$$
(8a)

when using the GOSD1 model. Note that these weighted least squares estimates of a1 and b1 provide values for the required derivates around the target test condition. When using GOSD2 model, the parameters in Eq. [8b] are chosen to minimize Σe2

$$\begin{array}{ccccc}w\ln ({t_{\rm{F}}}) = & \left[ {{a_0} - {a_1}\frac{1}{{{\rm{R}}{{\rm{T}}_{\rm{o}}}}} - {a_2}{{\left( {\frac{1}{{{\rm{R}}{{\rm{T}}_{\rm{o}}}}}} \right)}^2}} \right] + \left[ {{b_0} - {b_1}\ln ({\sigma _{\rm{o}}}) + {b_2}{{({\rm{ln}}(\sigma ))}^2}} \right]\ln (\sigma )w\\ & + \left[ {{a_1} - {\rm{2}}{a_2}\frac{1}{{{\rm{R}}{{\rm{T}}_{\rm{o}}}}}} \right]\frac{w}{{{\rm{RT}}}} + [{b_1} - 2{b_2}\ln ({\sigma _{\rm{o}}})]w{(\ln (\sigma ))^2} + {a_2}w{\left( {\frac{1}{{{\rm{RT}}}}} \right)^2} + {b_2}w{(\ln (\sigma ))^3} + e\end{array},$$
(8b)

where e is a random disturbance term. A prediction of the time to failure at test condition (To, σo) is then given by Eq. [3] or equivalently Eqs. [8]. So, a weighted regression (often termed a local regressions) defined by Eqs. [8] is carried out for each test condition in the test matrix to yield an estimate for bo, b1, ao, a1 (and a2, b2 if using the GOSD2 model) at all such conditions. Notice that the weighted least squares procedure also depends on the bandwidth h that now needs further explanation.

3.4 Determining the Band Width, h

The band width h is on the same scale as d and so must be a value between the largest and smallest observed value for d. If h is set too large, the resulting creep predictions will be over generalizations leading to a poor prediction for any individual failure time. On the other hand, if h is set to low, the predictions may be so specific to the sample of failure times in the local regression that any extrapolation to other failure times may be unreliable—over fitting to the specific sample of data. As the value of d will depend on the test condition (To, σo), it follows that the number of data points in each local regression will also be different. One approach to preventing over fitting is to use a “leave one data point out cross validation” procedure (see for example Fan et al.).[15] In this approach, Eq. [8a], when using the GOSD1 model, is replaced with

$$w\ln ({t_F}) = \left[ {{a_0} - {\rm{ }}{a_1}\frac{1}{{{\rm{R}}{{\rm{T}}_{\rm{o}}}}}} \right]w\; + [{b_0} - {b_1}{\rm{ln}}(\sigma )]\ln (\sigma )\;w\; + {a_1}\frac{{\rm{w}}}{{{\rm{RT}}}} + {b_1}w\;{({\rm{ln}}(\sigma ))^{2\;}} + \lambda D + e,$$
(9a)

where D is a dummy variable that equals 1 when (T, σ) = (To, σo) and zero otherwise. The parameter λ then measures the error made in predicting that value for ln(tF) at test condition (To, σo), when this failure time recorded at (To, σo) is not used in the estimation of the other parameters in Eq. [9a]. Such a value for λ can be computed for all test conditions, then squared, and finally summed to obtain the sum of squared cross validation errors (SSCV)

$${\rm{SSCV}} = \sum\nolimits_{i = 1}^N \lambda _i^2,$$
(9b)

The band width h is then taken to be that value which minimize SSCV. The variable D can also be added to Eq. [8b] when using the GOSD2 model

$$\begin{array}{ccccc}w\ln ({t_{\rm{F}}}) = & \left[ {{a_0} - {a_1}\frac{1}{{{\rm{R}}{{\rm{T}}_{\rm{o}}}}} - {a_2}{{\left( {\frac{1}{{{\rm{R}}{{\rm{T}}_{\rm{o}}}}}} \right)}^2}} \right]w + \left[ {{b_0} - {b_1}\ln ({\sigma _{\rm{o}}}) + {b_2}{{({\rm{ln}}(\sigma ))}^2}} \right]\ln (\sigma )w\\ & \quad+ \left[ {{a_1} - {\rm{2}}{a_2}\frac{1}{{{\rm{R}}{{\rm{T}}_{\rm{o}}}}}} \right]\frac{w}{{{\rm{RT}}}} + [{b_1} - 2{b_2}\ln ({\sigma _{\rm{o}}})]w{(\ln (\sigma ))^2} + {a_2}w{\left( {\frac{1}{{{\rm{RT}}}}} \right)^2} + {b_2}w{(\ln (\sigma ))^3} + \lambda D + e\end{array},$$
(9c)

4 Results

One of the aims of this paper is to use the functional coefficients representation of a generalized OSD model to predict failure times beyond an accelerated test data set. To do this, the accelerated creep data set is defined as all those test conditions in Figure 2 that have a failure time of 10,000 h or less. This accelerated data set is thus made up of N = 33 failure times. The band width h is then calculated using just this accelerated test data set, using Eq. [9] to measure the value for λi at each of these i = 1 to 33 test conditions (some repeated). Then suppose a prediction at test condition (To, σo) is required. If (To, σo) = (873 K, 29 MPa), this prediction would be an example of an extrapolation, while if (To, σo) = (873 K, 115 MPa), this prediction would be an example of an interpolation.

4.1 The Fully Parametric OSD Model

Applying least squares to Eq. [1b] using all 33 test results in the accelerated set of test data yielded the following result

$${\rm{ln}}({t_{\rm{F}}}) = - 1.179 - 5.063{\rm{ln}}\left( {\rm{\sigma }} \right) + 279.416\left( {\frac{{1000}}{{{\rm{RT}}}}} \right),$$
(10)

implying a Norton n value of 5.063 and an activation energy of 279 kJ mol−1. However, the R2 value of 78.23 pct suggests a very poor fit to the short-term data shown as closed circles in Figure 3. The vertical axis of Figure 3 is the temperature-adjusted failure time given by \({\text{ln}(t}_{\text{F}})-279.416\left(\frac{1000}{\text{RT}}\right)\). Estimating the parameters of this fully parametric OSD model in this way results in predictions that have substantial systematic errors associated with them—both when predicting the short-term and long-term data. At moderate stresses, the predictions shown by the fitted line substantially underestimate the time to failure, but as stress falls toward values associated with typical operating conditions for this material, the model dramatically overestimates the time to failure. Unsurprisingly then, the model produces predictions for the longer-term data with an average absolute error of some 150 pct. These results can be used as a benchmark for comparisons with the GOSD1 and GOSD2 models.

Fig. 3
figure 3

Relationship between temperature compensated time to failure and stress using an activation energy estimated using least squares applied to the fully parametric OSD model

4.2 A First-Order Functional Coefficients GOSD1 Model

Now consider the functional coefficients GOSD1 model that uses a 1st order Taylor series expansion to approximate coefficients a and b in Eq. [3]. The optimal band width estimated only from the accelerated tests data was found to be 2.2 using cross validation with SSCV = 0.00231. Figure 4 shows the estimates made for the parameters a0, b0, a1, and b1 of this model at each of the 33 test conditions in the accelerated test data. The arrows highlight the parameter values obtained at test condition (To, σo) = (823 K, 196 MPa).

Fig. 4
figure 4

Variations in (a) Norton’s n, (b) the activation energy for creep, and (c, d) the structural coefficients ao and bo, with stress and temperature based on the GOSD1 model

Before discussing the trends seen in Figure 4, it is informative to explain how these highlighted values were obtained. There were 22 such data points out of the 33 in the accelerated test matrix that had a Kh(d/h) value of less than 1 using h = 2.2. Figure 5 shows these 22 test conditions and the actual temperature-adjusted failure times associated with these conditions. These were calculated using an activation energy of creep of 375 kJ mol−1\({\text{ln}(t}_{\text{F}})-375\left(\frac{1000}{\text{RT}}\right)\). This value for the activation energy was obtained by minimizing Σe2 in Eq. [8a] using just the 22 data points shown in Figure 5. The procedure yielded a1 = Qc = 374.876 kJ mol−1, b1 = 1.448, [b0b1 \(\text{ln}({\sigma }_{\text{o}})\)] = − 22.808, and [a0a1 \(\frac{1}{{\text{RT}}_{\text{o}}}\)] = 37.337. These values for the parameters in Eq. [8a] define the dashed curve in Figure 5—which is the weighted regression curve. The arrow in Figure 4(b) shows this value for Qc. Consequently, values for ao and bo can be derived from these estimates: bo = − 22.808 + 1.448ln(196) = − 15.1634 and ao = 37.337 + 375,876/(8.314 × 723) = 92.124. The arrows in Figures 6(c) and (d) show these values for a0 and b0. When σ = σo = 196 MPa, Eq. [6a] collapses to

Fig. 5
figure 5

Local test conditions used to interpolate the failure time at 196 MPa and 823 K, together with the weighted regression curve through the nearest 22 data points to this test condition

Fig. 6
figure 6

Plot of actual against predicted failure times beyond 10,000 h using the GOSD1 model. The U statistics are as calculated by Evans[17]

$${\rm{ln}}({t_{\rm{F}}}) = {a_{\rm{o}}} - {b_{\rm{o}}}{\rm{ln}}\left( {{\rm{196}}} \right) = 92.124 - 15.1635\ln (196) = 12.09,$$
(11)

which defines the arrow highlighted point of the dashed regression curve in Figure 5. This is an example of an interpolated failure time. This interpolation has a prediction error of some 7.28 pct. The above steps are repeated for each test condition, so that a separate weighted regression is computed yielding values for a0, b0, a1, and b1 and an interpolated tF value for all test conditions in the accelerated test data. The resulting parameter estimates are shown in Figure 4.

Clearly, higher order approximations will give better fits to the local data, but may not necessarily give a better failure time prediction. It does so at the test condition shown in Figure 5 because the point it is trying to predict is within the range of the accelerated data points used for the local fit. In some instances, especially in extrapolation, this may not be the case, and this is more likely the higher the order of approximation used (due to the skipping rope effect of using polynomials). So a balance needs to be struct between parsimony and accuracy of prediction.

Figure 4(a) shows the variation in Norton’s n with test conditions, where n is the predicted change in ln(tF) following a small change in ln(σ), which from Eq. [6a] is found from the functional coefficients as [b0b1 \(\text{ln}({\sigma }_{\text{o}})\)] + 2b1ln(σ). Figure 4(b) shows the variation in the activation energy for creep with test conditions. These two together are suggestive of specific changes in creep mechanism with test conditions. The high values for n (ignoring the sign) and Qc at the highest stresses indicate creep takes place through the generation and movement of new dislocations, formed at appropriate sources, since the yield stress of the material is exceeded. These new dislocations that are continuously generated due to the high stress leads to large net movement of atoms, and so contributes to high creep rates making failure times very sensitive to changes in stress (hence the very negative n values). This creep occurs largely because of these dislocations moving within the grains and under these circumstances Qc is expected to be high and equivalent to that for self-diffusion in bainitic matrices. Figure 4(b) suggests this activation energy is Qc ≅ 420 kJ mol−1. Notice this value is much higher than the constant value of 279 kJ mol−1 obtained from the fully parametric OSD model.

At intermediates stresses, where the stress is below the yield stress, creep takes place within the grain boundary zones rather than in the grains, and this reflects itself in a lower activation energy for creep—Qc ≅ 370 kJ mol−1 in Figure 4(b)—representing diffusion along dislocations and grain boundaries. With fewer atoms moving the creep rate slows down and this reflects itself in a reduction in n to around − 8 in Figure 4(a). The change in n from around − 18 to − 8 (and Qc from 420 to 370 kJ mol−1) then reflects the growing dominance of grain boundary creep as the stress falls. At the lowest stress, the activation energy increases again to approximately 420 kJ mol−1 in Figure 4(a). Wilshire and Whittaker [16] attributed this to the bainitic regions transforming to ferrite and coarse molybdenum carbide particles in long-term tests at 923 K and very low stresses. This they argued would result in creep once again taking place within the grains where the activation energy is higher. But this is inconsistent with the continual reduction in the value of n seen in Figure 4(a)—the expectation would be for n to creep back to the values seen at the higher stresses if there was microstructural degradation. Instead, there is a tendency for n to diminish to a value of 1 at the very lowest stress and the highest temperature—and such conditions are usually associated with Nabarro-Herring diffusional creep. Such creep also leads to an n value of around 1 and an activation energy equal to that for self-diffusion as diffusion of atoms takes place only within the crystal lattice during Nabarro-Herring creep. This is consistent with the values seen in Figures 4(a) and (b) where n tends to 1 with decreasing stress and Qc back to a value approaching 420 kJ mol−1—that was taken above to be that associated with self-diffusion. Figures 4(c) and (d) shows the variation in the structural coefficients ao and bo with test conditions. Their changing values also reflect the above-mentioned changes in creep mechanism. While n, ao, and bo show very little dependency on temperature, Qc appears to also show some temperature dependency.

The same procedure as above is used to make extrapolative predictions into the longer-term data—local weighted regressions were carried out using accelerated test conditions closest to the test conditions in the longer-term data set. For example, to get a prediction for the time to failure of a specimen tested at 216 MPa and 723 K, 19 data points out of the 33 in the accelerated test matrix had a Kh(d/h) less than 1 using h = 2.2. A weighted regression through these 19 data points is used to predict the failure time at 216 MPa and 723 K following the same steps as demonstrated above for an interpolative prediction. Repeating this for all test conditions in the longer-term data sets produced the predictions shown in Figure 6, where they are plotted against the actual failure times beyond 10,000 h. The trend line through all the data points is quite close to the 1:1 line, and these predictions have an average absolute error |U| associated with them of 31.81 pct—which is a massive improvement on the error associated with the fully parametric OSD model of the previous sub section. The predictions are differing from this 1:1 line for three reasons. The first is because the slope of the best fit line of 0.83 differs from 1 and this accounts for some 24.28 pct of |U| (as revealed by |UR|—see Evans[17] for details on how to calculate these U values). Secondly, the average of the predictions differs from the average of the actual failure times by 0.0095 and this accounts for some 2.04 pct of |U| (as revealed by |UM|). In part this reflects itself in the intercept of the trend line in Figure 6 (23.894) differing from zero. The rest of the error is random in nature, and this accounts for some 73.76 pct of |U| (as revealed by |UD| and the R2 value of less than 100 pct). Figure 6 also reveals that most of the prediction errors occur at the lowest stresses at temperature of 873 K. The Z value shown in Figure 6 is given by Z = \({e}^{{2.58s}_{e}}\), where se is the standard deviation in the percentage prediction errors. Ideally, for single-cast assessment, Z should be less than or equal to 2, whereas Z ≥ 4 is unacceptable according to Holdsworth et al.[18,19] While this value is slightly above 2, its value is acceptable according to Holdsworth and substantially below the Z value for the fully parametric OSD model where Z = 13.18.

Evans[9] found that for the LOESS version of the OSD model, the average absolute error |U| was 55.68 pct and 46.84 pct of this error was random in nature. On both measures, the LOESS model produces inferior extrapolations.

4.3 A Second-Order Functional Coefficients GOSD Model

Figure 7 shows the variation of the GOSD2 model parameters with stress and temperature estimated using just the accelerated test data. Figure 7(a) shows the variation in Norton’s n with test conditions, where n is the predicted change in ln(tF) following a small change in ln(σ),which from Eq. [6b] is found from the functional coefficients as [b0b1 \(\text{ln}\left({\sigma }_{\text{o}}\right)+{b}_{2}{\{\text{ln}\left({\sigma }_{\text{o}}\right)\}}^{2}\)] + [b1−2b2ln(σo)] + 3b2ln(σo). Figure 7(b) shows the variation in the activation energy for creep with test conditions. The trends with respect to stress seen in these two figures are broadly consistent with those obtained using the first-order approximation [Figures 4(a) and (b)] and so support the conclusions drawn in the previous sub section with regard to changing creep mechanisms.

Fig. 7
figure 7

Variations in (a) Nortons n, (b) the activation energy for creep, and (cd) the structural coefficients a and b, with stress and temperature using the GOSD2 model

The GOSD2 model was then used to extrapolate into the longer-term data. The results are shown in Figure 8 where these predictions are plotted against the actual failure times beyond 10,000 h. The trend line through all the data points is quite close to the 1:1 line, and these predictions have an average absolute error |U| associated with them of 25.3 pct which is some 6 percentage points lower compared to the GOSD1 model.

Fig. 8
figure 8

Plot of actual against predicted failure times beyond 10,000 h using the GOSD2 model. The U statistics are as calculated by Evans[17]

But again, the predictions are differing from this 1:1 line for three reasons. The first is because the slope of the best fit line of 0.89 differs from 1 and this accounts for some 17.87 pct of |U| (as revealed by |UR|, and which is some 6 percentage points lower than when using the first-order approximation). Secondly, the average of the predictions differs from the average of the actual failure times by 0.022 and this accounts for some 5.38 pct of |U| (as revealed by |UM|, which is about 3 percentage points higher than when using the first-order approximation). The rest of the error is random in nature, and this accounts for some 76.75 pct of |U| (as revealed by |UM|, which is about 3 percentage points higher than when using the GOSD1 model). Figure 9 also reveals that most of the prediction errors occur at the lowest stresses at temperature of 873 K. In summary then, using the GOSD2 model over the GOSD1 model results in a set of predictions with a lower overall mean absolute percentage error, and with the proportion of this error that is random in nature being slightly higher—and so a slightly lower systematic error. The Z value for this model is also closer to 2 compared to the GOSD1, but like the first-order model most of the prediction errors are associated with a temperature of 873 K.

Fig. 9
figure 9

Plot of time to failure against stress together with the predictions from the GOSD1 and GOSD2 models

Finally, Figure 9 plots the long-term predictions from the first- and second-order functional coefficients version of the OSD model in the more familiar stress v failure time space. This reinforces the points made above where the predictions are good from both models at all temperature but 873 K. But at this temperature, the GOSD2 model performs best.

5 Conclusion

This paper has demonstrated the inadequacy of using the fully parametric OSD creep model for predicting long-term failure times for 2.25Cr–1Mo from accelerated test data. This failure was explained by non-constant model parameters—the result of changing creep mechanisms. The paper then introduces a semi-parametric estimation procedure for a generalized OSD model (a structural coefficients version of the GOSD model) that can be used to deal with changing creep mechanisms while maintaining the structure of the parametric model and consequently produce more reliable long-term predictions compared to the fully parametric version. When this technique was applied to 2.25Cr–1Mo steel, it was found that the model parameters varied in line with changing creep mechanisms but in a partially different way compared to that already suggested in the literature for this material. With diminishing stress and increasing temperature, dislocation creep within the crystal structure morphs into grain boundary dislocation motion and finally Nabarro-Herring creep. The functional coefficients models produced better long-term extrapolations compared to both the fully parametric OSD model and the LOESS method introduced by Evans. Areas for future work include applying this technique to other steels and high-temperature materials and extending the approach by specifying different forms for parameters a and b—perhaps by allowing a and b to vary with both stress and temperature.