Applications of survival functions to continuous semi-Markov processes for measuring reliability of power transformers

WANG, Yifei; SHAHIDEHPOUR, Mohammad; GUO, Chuangxin

doi:10.1007/s40565-017-0322-z

Open access
Published: 22 September 2017

Volume 5, pages 959–969, (2017)

Journal of Modern Power Systems and Clean Energy

Abstract

The reliability of power transformers is subject to service age and health condition. This paper proposes a practical model for the evaluation of two reliability indices: survival function (SF) and mean residual life (MRL). In the proposed model, the periodic monitoring of power transformers is considered for collecting information on health conditions. The corresponding health condition is assumed to follow a continuous semi-Markov process for representing state transitions. The proportional hazard model (PHM) is introduced to incorporate service age and health condition into the hazard rate. In addition, the proposed model derives analytical formulas for SF and MRL and thus allows their analytical evaluation. SF and MRL are calculated for new components and old components, respectively. In both cases, the proposed model offers rational results, which are compared with those obtained from comparative models. The results of the proposed analytical method are also contrasted with those of the Monte Carlo method. The impact of different model parameters and of the coefficient of variation (CV) on the reliability indices is discussed in the case studies.

1 Introduction

Equipment reliability is subject to degradation and to influencing factors, which are referred to as covariates. The evaluation of system reliability has gained additional interest for quantifying the risk of degradation and failure, predicting performance, and making economic decisions in large-scale energy, transportation, and telecommunication infrastructures. Considering the example of a 750 kV electric power transformer with a typical capital cost of $2 million [1], the failure of such equipment could cause extensive outages and blackouts and raise customer interruption costs. It is therefore imperative to monitor the equipment health condition and evaluate its reliability to avoid catastrophic circumstances.

We consider two reliability indices: survival function (SF) and mean residual life (MRL). SF is the probability that the equipment will survive beyond a specified time. MRL renders an overall estimate and summarizes the residual life distribution of the equipment. Several failure rate models are considered for calculating the two reliability indices. However, a constant failure rate model is often used in reliability analyses, which can produce erroneous values of the reliability indices [2,3,4].

The failure rate calculation should take both service age and covariates into consideration. The proportional hazard model (PHM) was introduced by Cox in 1972 to shape the hazard rate and has been used in engineering and medicine [5, 6]. In PHM, the hazard rate consists of a baseline function and a link function. The baseline function offers the basis for the hazard rate and the link function quantifies the covariate effect. Sample covariates, such as those pertaining to lifecycle data [7], operation mode [8], vibration [9] and dissolved gas [10], which could be time-dependent, are considered according to the actual system situation. PHM offers advantages when applied to explanatory techniques [11]. Accordingly, the proposed model adopts the information from dissolved gas analysis (DGA) as a covariate which affects the failure rates of power transformers.

The MRL and conditional/unconditional SFs are calculated in [12] by a discrete Markov process and PHM for obtaining the additional insight on interactions between time-varying failure rates and reliability indices. Makis and Jardine [13] utilize a full parametric PHM and a time-homogeneous Markov chain to describe failure rates. Accordingly, the optimal expected average cost and replacement time are obtained. Reference [14] evaluates the equipment reliability with imperfect observations. The observations are collected periodically. The failure rate is modeled based on PHM which takes both the age and the health condition into consideration. The same model is adopted to identify the optimal inspection period and the replacement policy [15]. The parameter estimation problem is studied in [16]. A control-limit policy and parameter estimation are proposed in [17] where an optimal replacement policy is obtained to minimize the average replacement costs per unit time. The above models are based on the assumption that the condition information of covariates is inspected at discrete points and every state transition happens only at the end of inspection interval, exactly before the next inspection instant, to make the calculation tractable within every interval. These models are called discrete monitoring and discrete transition (DMDT) models in this paper.

In fact, the state transition can happen at any time and this assumption may not conform to the reality. Reference [18] evaluates the SF policy by applying PHM and the presumption that the state transition is continuous. Reference [19] also assumes the condition monitoring is continuous if the inspection interval is small. These models are called continuous monitoring and continuous transition (CMCT) models in this paper. However, the condition monitoring may not be continuous in practice and the assumption would not be in line with the actual operation.

In practice, the online condition monitoring of power transformers, such as dissolved gas analysis, is discrete (periodical) while the state transition could happen at any time [20, 21]. In this paper we propose the discrete monitoring and continuous transition (DMCT) model for our analyses. Parameter estimation is treated in [22] and is not addressed in this paper. The main contributions of this paper are summarized below:

1)
The proposed model is based on more practical assumptions in which the condition is discretely inspected but the state transition is continuous.
2)
Service age and DGA information are introduced to customize the failure rate by applying PHM. The state transition of DGA is described by a semi-Markov process.
3)
Analytical formulas are derived to evaluate SF and MRL using the given situations. The effectiveness of the proposed formulas is shown in our numerical studies.

2 Model description

2.1 Determine health condition with DGA information

All power transformers generate gases to some extent when they are operating normally. However, an incipient fault or degradation, such as overheating, partial discharge and arcing faults, will lead to abnormal gas generation. A four-level criterion has been developed to classify the health condition of transformers [23] according to the gas concentrations. The gases include H₂, CH₄, C₂H₄, C₂H₆, CO, and CO₂. The total concentration of H₂, CH₄, C₂H₄, C₂H₆ and CO is known as the total dissolved combustible gases concentration (TDGC). Table 1 shows the classification of gas concentration conditions. Condition 1 is the best condition, and the condition gets worse as the number increases. The condition of the power transformer is determined by the worst condition among all the dissolved gases, irrespective of which gas is in that condition.

Table 1 Dissolved gas concentrations for condition classification based on IEEE C57.104

2.2 Failure rate model based on DGA information

The failure rate of power transformer is modelled by PHM and semi-Markov process. The failure rate is expressed as:

$$h(s,Z(s)) = h_{0} (s)\psi (Z(s)) = \frac{{\beta s^{\beta - 1} }}{{\alpha^{\beta } }}e^{\gamma Z(s)}$$

(1)

where $h_{0} (s)$ is the baseline function to describe basic hazard rate; $\psi (Z(s))$ is the link function to quantify the effect of covariates. The covariate $Z(s)$ represents the condition of dissolved gas concentrations at time s. The degradation process is irreversible, which is the most common case that the degradation state cannot improve by itself. Without loss of generality, the gas condition $Z(s)$ is assumed to fall into finite state space $\{ 1,2, \ldots ,n\}$ where the condition deteriorates as the state number increases. The analytical formulas are also derived with n conditions. The state n is the worst and absorbing state. Upon a failure or scheduled maintenance, the component is maintained and restored to state 1 and the process is renewed. The model is shown in Fig. 1 which is described as follows:

1)
$T_{i} ,\;X_{i} ,\;i = 1,2, \ldots ,n$, denote the ith state transition moment and the sojourn time of state i, respectively.
2)
$t_{0}$ and $t$ are the current time and a future time point, respectively.
3)
The transition of health condition is assumed to follow a semi-Markov process. The transition is irreversible and increases by one whenever a transition occurs. That is, a pure birth process is considered.
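The hazard rate in (1) can be sketched in a few lines of Python. The function name `hazard` and the default values (α = 10, β = 2, γ = 1) are assumptions chosen for illustration; they are not prescribed at this point of the paper.

```python
import math

def hazard(s, z, alpha=10.0, beta=2.0, gamma=1.0):
    """PHM hazard of Eq. (1): Weibull baseline times exponential link.

    s     : service age
    z     : gas-condition state Z(s), an integer in {1, ..., n}
    alpha : Weibull scale of the baseline (illustrative default)
    beta  : Weibull shape of the baseline (illustrative default)
    gamma : covariate coefficient of the link function (illustrative default)
    """
    baseline = beta * s ** (beta - 1) / alpha ** beta  # h0(s)
    link = math.exp(gamma * z)                         # psi(Z(s))
    return baseline * link

# Moving one state up multiplies the hazard by exp(gamma) at any age.
ratio = hazard(5.0, 2) / hazard(5.0, 1)
```

With γ = 1, each deterioration step scales the hazard by e, independently of the service age.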

The Markov process is memoryless, and Markov models can lead to serious errors under certain conditions. The health condition transition of a power transformer, however, is affected by the operation history and is therefore not memoryless. In our study, a semi-Markov process is introduced to describe the history-dependent degradation process of power transformers and to evaluate reliability indices. In our case studies, the results of the Markov and semi-Markov processes are contrasted and some reasonable conclusions are drawn. Other stochastic processes could also be introduced to model the condition transition. Based on our proposed model, we plan to perform more work in the future to compare the performance of different stochastic processes when evaluating the reliability of power transformers.

Let $Z_{i}$ denote the degradation state between $T_{i - 1}$ and $T_{i}$. In a pure birth process $Z_{i} = i$. Since the state n is an absorbing state, we define $X_{n} = \infty$ and $T_{n} = \infty$. Clearly $X_{i} = T_{i} - T_{i - 1}$ and its distribution is denoted as:

$$G(x_{i} ) = P(T_{i} - T_{i - 1} \le x_{i} |Z_{i} = i,Z_{i + 1} = i + 1)$$

(2)

where $x_{i}$ is the independent variable in the distribution function of $X_{i}$. The state sojourn times $X_{1} ,X_{2} , \ldots ,X_{n}$ are conditionally independent in a semi-Markov process. That is, the Markovian property is satisfied at the transition points rather than throughout the entire process.

The joint probability density function (PDF) of $X_{1} ,X_{2} , \ldots ,X_{n - 1}$ is denoted by $g_{x}$, which equals $g(x_{1} ,x_{2} , \ldots ,x_{n - 1} ) = g_{1} g_{2} \ldots g_{n - 1}$, where $g_{i} = g(x_{i} )$ is the probability density function of $X_{i}$.

It should be noted that:

$$T_{k} = \sum\limits_{i = 1}^{k} {X_{i} } = X_{1} + X_{2} + \cdots + X_{k}$$

(3)

and

$$Z(t) = k \Leftrightarrow T_{k - 1} \le t < T_{k}$$

(4)

where $Z(t)$ is the gas concentration state at time t. The conditional survival function $R(t|t_{0} )$ is given by:

$$P(T \ge t|T > t_{0} ,Z(s),0 \le s \le t_{0} ) = \exp \left( { - \int_{{t_{0} }}^{t} {h(s,Z(s)){\text{d}}s} } \right)$$

(5)

where T is the failure time. Given t ₀ and $Z(t_{0} ) = j$, the component or the system may stay at an arbitrary state from j to n at any future time t. For $Z(t_{0} ) = j,\;Z(t) = k$, $1 \le j \le k \le n$, we have:

$$\begin{aligned} R_{jk} (t|t_{0} ) = P(T > t|T > t_{0} ,Z(t_{0} ) = j,Z(t) = k) = \hfill \\ \exp \left( { - \int_{{t_{0} }}^{{T_{j} }} h (s,Z_{j} ){\text{d}}s - \int_{{T_{j} }}^{{T_{j + 1} }} h (s,Z_{j + 1} ){\text{d}}s - \cdots - \int_{{T_{k - 1} }}^{t} h (s,Z_{k} ){\text{d}}s} \right) \hfill \\ \end{aligned}$$

(6)
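For the Weibull baseline in (1), each integral in (6) has a closed form: with the state fixed at z on an interval [a, b], the integral of $h_{0}(s)\psi(z)$ equals $\psi(z)[(b/\alpha)^{\beta} - (a/\alpha)^{\beta}]$. The sketch below evaluates (6) for one realised path of transition times. The function names are hypothetical, and the link $\psi(z) = e^{z - 1}$ and the values α = 10, β = 2 are borrowed from the numerical example in (19).

```python
import math

def link(z):
    # Link function of the numerical example, Eq. (19): psi(z) = exp(z - 1)
    return math.exp(z - 1)

def conditional_survival(t0, t, transition_times, alpha=10.0, beta=2.0):
    """Eq. (6) for one realised path.

    transition_times = [T_1, ..., T_{n-1}] in increasing order;
    the state equals k on the interval [T_{k-1}, T_k), with T_0 = 0
    and T_n = infinity (absorbing state).
    """
    bounds = [0.0] + list(transition_times) + [math.inf]
    exponent = 0.0
    for state in range(1, len(bounds)):
        lo = max(t0, bounds[state - 1])   # part of [t0, t] spent in this state
        hi = min(t, bounds[state])
        if hi > lo:
            exponent += link(state) * ((hi / alpha) ** beta - (lo / alpha) ** beta)
    return math.exp(-exponent)

# Path with T_1 = 7 and T_2 = 20, evaluated from t0 = 4 to t = 9:
# state 1 on [4, 7), state 2 on [7, 9].
r_12 = conditional_survival(4.0, 9.0, [7.0, 20.0])
```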

If we view $(X_{1} ,X_{2} , \ldots ,X_{n - 1} )$ as an (n-1)-dimensional space, (6) would be satisfied only in the sub-region $D_{jk}$.

$$D_{jk} = \left\{ {(X_{1} ,X_{2} , \ldots ,X_{n - 1} )\left| {\begin{array}{*{20}c} {x_{1} + \ldots + x_{j - 1} \le t_{0} } \\ {x_{1} + \cdots + x_{j} > t_{0} } \\ {x_{1} + \ldots + x_{k - 1} \le t} \\ {x_{1} + \cdots + x_{k} > t} \\ {x_{i} \ge 0,1 \le i \le n - 1} \\ \end{array} } \right. \, } \right\}$$

(7)

In this paper, $x_{i}$ is assumed to be larger than zero. For instance, when $j = 1,\;k = 2$ in (7), $D_{12} = \{ (X_{1} ,X_{2} , \ldots ,X_{n - 1} )|x_{1} > t_{0} ,\;x_{1} \le t < x_{1} + x_{2} \}$, which means the component is in state 1 at $t_{0}$ and in state 2 at $t$. In the region $D_{12}$, $R_{12} (t|t_{0} ) = \exp ( - \int_{{t_{0} }}^{{T_{1} }} h (s,Z_{1} ){\text{d}}s - \int_{{T_{1} }}^{t} h (s,Z_{2} ){\text{d}}s)$. Thus, as the state at a future time $t$ varies from 1 to n, $R(t|t_{0} )$ can be viewed as a piecewise function in the n-dimensional space $\{ R(t|t_{0} ),X_{1} , \ldots ,X_{n - 1} \}$. Accordingly, $R_{jk} (t|t_{0} )$ represents $R(t|t_{0} )$ in the sub-region $D_{jk}$. The boundary of each sub-region is determined by $t_{0}$ and $t$, and by the corresponding degradation states $Z(t_{0} )$ and $Z(t)$.

Generally, MRL is calculated by $M(t_{0} ) = \int_{{t_{0} }}^{\infty } R (t|t_{0} ){\text{d}}t$. Since $R(t|t_{0} )$ is a piecewise function from $t_{0}$ to infinity, the conditional MRL, given $t_{0}$ and $Z(t_{0} ) = j$, can be expressed as:

$$\begin{aligned} m_{j} (t_{0} ) &= \int_{{t_{0} }}^{\infty } R (t|t_{0} ){\text{d}}t \\ &= \int_{{t_{0} }}^{{T_{j} }} {R_{jj} } (t|t_{0} ){\text{d}}t + \int_{{T_{j} }}^{{T_{j + 1} }} R_{j,j + 1} (t|t_{0} ){\text{d}}t + \ldots + \int_{{T_{n - 1} }}^{\infty } R_{jn} (t|t_{0} ){\text{d}}t \\ &= \sum\limits_{i = j}^{n} {M_{ji} } (t_{0} ) = M_{jj} (t_{0} ) + M_{j,j + 1} (t_{0} ) + \ldots + M_{jn} (t_{0} ) \end{aligned}$$

(8)

where $M_{jj} (t_{0} ) = \int_{{t_{0} }}^{{T_{j} }} {R_{jj} } (t|t_{0} ){\text{d}}t$ and $M_{ji} (t_{0} ) = \int_{{T_{i - 1} }}^{{T_{i} }} {R_{ji} } (t|t_{0} ){\text{d}}t$ for $j < i \le n$.
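For a given path, the integral $M(t_{0} ) = \int_{{t_{0} }}^{\infty } R (t|t_{0} ){\text{d}}t$ can be approximated numerically. The sketch below uses a plain trapezoidal rule truncated at a finite horizon; the function name, the horizon and the step count are assumptions of this illustration. As a check it uses a component that never leaves state 1, with the baseline of (19) (α = 10, β = 2), for which the integral is $\alpha \Gamma (1 + 1/\beta ) = 5\sqrt \pi$.

```python
import math

def mean_residual_life(cond_survival, t0, horizon=100.0, steps=20000):
    """Trapezoidal approximation of the integral of R(t|t0) over [t0, horizon].

    cond_survival : callable t -> R(t|t0)
    horizon       : truncation point; R(t|t0) must be negligible beyond it
    """
    width = (horizon - t0) / steps
    total = 0.5 * (cond_survival(t0) + cond_survival(horizon))
    for i in range(1, steps):
        total += cond_survival(t0 + i * width)
    return total * width

# Component frozen in state 1 from age 0: R(t) = exp(-(t / alpha) ** beta).
alpha, beta = 10.0, 2.0
frozen_state_1 = lambda t: math.exp(-((t / alpha) ** beta))
mrl_frozen = mean_residual_life(frozen_state_1, 0.0)
```

Any path-dependent survival function, such as the one in (6), can be passed in place of `frozen_state_1`.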

With (5), (6) and (8) in place, one barrier remains in evaluating SF and MRL effectively: explicit analytical expressions are still needed.

3 Evaluating SF and MRL

The DGA is inspected at discrete points, and the inspection instants are equally spaced. Figure 2 shows the inspection points, state transition points and time points, where $\Delta_{l}$ denotes the lth inspection point and S1 and S2 represent two different situations.

The formulas are presented for two situations: in S1, $t_{0}$ is exactly an inspection point; in S2, $t_{0}$ lies between two inspection points. The location of $t_{0}$ has a large influence on the expression and the shape of SF and MRL. In fact, whether $t_{0}$ is an inspection point has practical significance. S1 corresponds to the situation where the DGA condition information has just been collected from an on-line or off-line test, while S2 corresponds to the situation where the last condition inspection was done earlier and the next inspection point has not yet been reached. Both S1 and S2 could occur in practice.

Note that SF and MRL are functions of the random variables $X_{1} ,X_{2} , \ldots ,X_{n - 1}$ and are therefore evaluated as multiple integrals over the variable space. The known conditions constitute the composite constraint surface of the integral region. In this section, each multiple integral is transformed into a repeated integral to derive the formulas of SF and MRL.

3.1 Survival function

The survival function is a piecewise function of the random sojourn times, and it is calculated as an expectation over them.

1)
A new component

For a new component, we have $t_{0} = 0,\;Z(t_{0} ) = 1$. According to the Law of Total Probability, $R(t)$ can be expressed as:

$$\begin{aligned} R(t) = \sum\limits_{k = 1}^{n} P (T > t,Z(t) = k|Z(t_{0} ) = 1) \\ = \sum\limits_{k = 1}^{n} {\int_{{D_{1k} }} {R_{1k} } } (t)g(x_{1} , \ldots ,x_{k} ){\text{d}}x_{1} \ldots {\text{d}}x_{k} \\ \quad= r_{11} + r_{12} + \ldots + r_{1n} \\ \end{aligned}$$

(9)

where $r_{1k} = \int_{{D_{1k} }} {R_{1k} } (t)g(x_{1} , \ldots ,x_{n - 1} ){\text{d}}x_{k} \ldots {\text{d}}x_{1} , k=1, 2,\ldots,n-1$ can be calculated as follows:

$$r_{1k} = \left\{ {\begin{array}{*{20}l} {\int_{0}^{t} {\int_{0}^{{t - x_{1} }} } \ldots \int_{0}^{{t - T_{k - 2} }} \int_{{t - T_{k - 1} }}^{\infty } {R_{1k} } (t)g_{x} {\text{d}}x_{k} \ldots {\text{d}}x_{2} {\text{d}}x_{1} } \hfill \\ {\qquad \qquad \qquad \qquad \qquad \;\;k = 1,2, \ldots ,n - 1} \hfill \\ {\int_{0}^{t} {\int_{0}^{{t - x_{1} }} } \ldots \int_{0}^{{t - T_{n - 2} }} \int_{0}^{{t - T_{n - 1} }} {R_{1n} } (t)g_{x} {\text{d}}x_{n - 1} \ldots {\text{d}}x_{1} } \hfill \\ {\qquad \qquad \qquad \qquad \;\;\;\;\;\;\;\;\;k = n} \hfill \\ \end{array} } \right.$$

(10)

The proof of (10) is presented in Appendix A.
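To make (9) and (10) concrete, the following sketch evaluates $R(t)$ for a new component with $n = 3$ by repeated numerical integration. It borrows the parameter values of the numerical example in (19) and (20). The helper names, the composite Simpson rule, and the reading of the innermost limit for $k = n$ as the region $x_{1} + x_{2} \le t$ (taken from the definition of $D_{1n}$ in (7)) are assumptions of this illustration, not the authors' implementation.

```python
import math

ALPHA, BETA = 10.0, 2.0   # baseline Weibull parameters, Eq. (19)
A, B = 11.2838, 2.0       # sojourn-time Weibull parameters, Eq. (20)

def g(x):
    """Sojourn-time density of Eq. (20)."""
    return (B * x ** (B - 1) / A ** B) * math.exp(-((x / A) ** B))

def g_bar(x):
    """P(X_i > x) for the Weibull sojourn time."""
    return math.exp(-((x / A) ** B))

def cum_hazard(lo, hi, state):
    """Integral of h0(s) * exp(state - 1) over [lo, hi]."""
    return math.exp(state - 1) * ((hi / ALPHA) ** BETA - (lo / ALPHA) ** BETA)

def simpson(f, lo, hi, n=200):
    """Composite Simpson rule with an even number n of sub-intervals."""
    if hi <= lo:
        return 0.0
    h = (hi - lo) / n
    total = f(lo) + f(hi)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * f(lo + i * h)
    return total * h / 3

def survival_new(t):
    """R(t) = r_11 + r_12 + r_13 for a new component, n = 3."""
    # k = 1: no transition before t, i.e. x1 > t
    r11 = math.exp(-cum_hazard(0, t, 1)) * g_bar(t)
    # k = 2: x1 <= t < x1 + x2; R_12 does not depend on x2
    r12 = simpson(lambda x1: math.exp(-cum_hazard(0, x1, 1) - cum_hazard(x1, t, 2))
                  * g(x1) * g_bar(t - x1), 0, t)
    # k = 3: x1 + x2 <= t (absorbing state reached before t)
    def inner(x1):
        return simpson(lambda x2: math.exp(-cum_hazard(0, x1, 1)
                                           - cum_hazard(x1, x1 + x2, 2)
                                           - cum_hazard(x1 + x2, t, 3)) * g(x2),
                       0, t - x1)
    r13 = simpson(lambda x1: g(x1) * inner(x1), 0, t)
    return r11 + r12 + r13

sf_at_5 = survival_new(5.0)
```

The value is bounded above by the survival probability of a component that stays in state 1 and below by that of a component that is in state 3 from the start, which gives a quick sanity check on any implementation.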

2)
An old component

An old component has survived and suffered from degradation by the time $t_{0}$. Assuming the last inspection instant is $\Delta_{m}$ and $Z(\Delta_{m} ) = j$, we have $T_{j} > \Delta_{m}$. In fact, state transition points are renewal points and have the Markovian property [3]; thus the state transition after $T_{j - 1}$ is independent of the earlier history and can happen at any time.

The calculation of the survival function falls into two situations, S1 and S2, whose results are designated as old1 and old2, respectively.

Situation S1: $t_{0}$ is exactly the inspection instant $\Delta_{m}$. In this case, the known conditions are: ① $T > t_{0}$; ② $Z(t_{0} ) = j$; ③ $X_{1} ,X_{2} , \ldots ,X_{j - 1}$. According to the Law of Total Probability, $R(t|t_{0} )$ is equal to:

$$\begin{aligned} R(t|t_{0} ) &= \sum\limits_{k = j}^{n} P (T > t,Z(t) = k|T > t_{0} ,Z(t_{0} ) = j) \hfill \\ \;\;\;\;\;\;\;\;\;\;\; &= \sum\limits_{k = j}^{n} {\int_{{D_{jk} }} {R_{jk} } } (t|t_{0} )g_{{x|x_{j} }} {\text{d}}x_{j} \ldots {\text{d}}x_{n - 1} \hfill \\ \end{aligned}$$

(11)

Let $r_{jk}^{\text{old1}}$ denote $\int_{{D_{jk} }} {R_{jk} } (t|t_{0} )g_{{x|x_{j} }} {\text{d}}x_{j} \ldots {\text{d}}x_{n - 1}$ and $r_{jk}^{\text{old1}}$ can be calculated by (12). The proof is shown in Appendix B.

$$r_{jk}^{\text{old1}} = \left\{ {\begin{array}{ll}\int_{{t_{0} - T_{j - 1} }}^{{t - T_{j - 1} }} \int_{0}^{{t - T_{j -1} - x_{j} }} \ldots \int_{0}^{{t - x_{1} - \ldots - x_{k - 2} }}\int_{{t - x_{1} - \ldots - x_{k - 1} }}^{\infty } {R_{jk} } \left({t|t_{0} } \right)g_{{x|x_{j} }} {\text{d}}x_{k} \ldots{\text{d}}x_{j}\hfill & \quad {j \le k < n} \hfill \\\int_{{t_{0} - T_{j - 1} }}^{{t - T_{j - 1} }} \int_{0}^{{t - T_{j -1} - x_{j} }} \ldots \int_{0}^{{t - x_{1} - \ldots - x_{n - 2} }}\int_{0}^{{t - x_{1} - \ldots - x_{n - 1} }} {R_{jn} \left( {t|t_{0}} \right)} g_{{x|x_{j} }} {\text{d}}x_{n - 1} \ldots{\text{d}}x_{j} \hfill & \quad {k = n} \hfill \\ \end{array}}\right.$$

(12)

Situation S2: $t_{0}$ is between two successive inspection instants $\Delta_{m}$ and $\Delta_{m + 1}$.

In this case, $Z(t_{0} )$ can take any value from $Z(\Delta_{m} )$ to n since $t_{0}$ is not an inspection point. The known conditions are: ① $T > t_{0}$; ② $Z(\Delta_{m} ) = j$; ③ $X_{1} ,X_{2} , \ldots ,X_{j - 1}$. $R(t|t_{0} )$ is given by:

$$\begin{aligned} R(t|t_{0} ) &\,= P(T > t|T > t_{0} ,Z(\Delta_{m} ) = j) \hfill \\ &= \sum\limits_{i = j}^{n} {\sum\limits_{k = i}^{n} P } (T > t,Z(t) = k,Z(t_{0} ) = i|T > t_{0} ,\quad Z(\Delta_{m} ) = j) \hfill \\ &= \sum\limits_{i = j}^{n} {\sum\limits_{k = i}^{n} {\int_{{D_{ik} }} {R_{ik} } } } (t|t_{0} )g_{{x|x_{j} }} {\text{d}}x_{j} \ldots {\text{d}}x_{n - 1} \hfill \\ \end{aligned}$$

(13)

Let $r_{j,ik}^{\text{old2}}$ be $\int_{{D_{ik} }} {R_{ik} } (t|t_{0} )g_{{x|x_{j} }} {\text{d}}x_{j} \ldots {\text{d}}x_{n - 1}$ which represents the survival probability in the sub-region $Z(\Delta_{m} ) = j$, $Z\left( {t_{0} } \right) = i$ and $Z\left( t \right) = k$. $r_{j,ik}^{\text{old2}}$ can be evaluated by (14). The proof is shown in Appendix C.

$$r_{j,ik}^{\text{old2}} = \left\{ \begin{aligned} \int_{{\Delta_{m} - T_{j - 1} }}^{{t_{0} - T_{j - 1} }} \int_{0}^{{t_{0} - T_{j - 1} - x_{j} }} \ldots \int_{{t_{0} - x_{1} - \ldots - x_{i - 1} }}^{{t - x_{1} - \ldots - x_{i - 1} }} \ldots \int_{0}^{{t - x_{1} - \ldots - x_{k - 2} }} \int_{{t - x_{1} - \ldots - x_{k - 1} }}^{\infty } \;\;R_{ik} \left( {t|t_{0} } \right)g_{{x|x_{j} }} {\text{d}}x_{k} \ldots {\text{d}}x_{j} \quad i \le k < n \hfill \\ \int_{{\Delta_{m} - T_{j - 1} }}^{{t_{0} - T_{j - 1} }} \int_{0}^{{t_{0} - T_{j - 1} - x_{j} }} \ldots \int_{{t_{0} - x_{1} - \ldots - x_{i - 1} }}^{{t - x_{1} - \ldots - x_{i - 1} }} \ldots \int_{0}^{{t - x_{1} - \ldots - x_{n - 2} }} \int_{0}^{{t - x_{1} - \ldots - x_{n - 1} }} \;R_{in} \left( {t|t_{0} } \right)g_{{x|x_{j} }} {\text{d}}x_{n - 1} \ldots {\text{d}}x_{j} \quad k = n \hfill \\ \end{aligned} \right.$$

(14)

3.2 Mean residual life

We evaluate MRL for a new component and an old component, respectively. For an old component, MRL is also calculated in two cases according to whether $t_{0}$ is an inspection instant.

1)
A new component

For a new component, $t_{0} = 0,\;Z(t_{0} ) = 1$, according to (8) MRL can be expressed as:

$$\begin{aligned} M(t_{0} ) &= E[M(t_{0} |Z(t_{0} ) = 1)] = E[M_{11} + M_{12} + \ldots + M_{1n} ] \\ &= \int_{{t_{0} }}^{\infty } \int_{0}^{\infty } \ldots \int_{0}^{\infty } {\left(\sum\limits_{i = 1}^{n} {M_{1i} } (t_{0} )\right)} g_{x} {\text{d}}x_{n - 1} \ldots {\text{d}}x_{1} \\ &= \int_{0}^{\infty } {M_{11} } (t_{0} )g_{1} {\text{d}}x_{1} + \int_{0}^{\infty } {\int_{0}^{\infty } {M_{12} } } (t_{0} )g_{1} g_{2} {\text{d}}x_{2} {\text{d}}x_{1} + \ldots \\ &\quad+ \int_{0}^{\infty } \int_{0}^{\infty } \ldots \int_{0}^{\infty } {M_{1n} } (t_{0} )g_{x} {\text{d}}x_{n - 1} \ldots {\text{d}}x_{2} {\text{d}}x_{1} \\ \end{aligned}$$

(15)

2)
An old component

For an old component, $T > t_{0}$ and $Z(\Delta_{m} ) = j$. Whether the health condition $Z(t_{0} )$ is known depends on whether $t_{0}$ is an inspection point.

Situation S1: $t_{0}$ is exactly the inspection instant.

In this case, $t_{0} = \Delta_{m}$ and $Z(t_{0} ) = j$, MRL is denoted by:

$$\begin{aligned} M(t_{0} ) &= E[M(t_{0} |Z(t_{0} ) = j)] = E[M_{jj} + M_{j,j + 1} + \ldots + M_{jn} ] \hfill \\ &= \int_{{t_{0} }}^{\infty } M_{jj} (t_{0} )g_{{j|x_{j} }} {\text{d}}x_{j} + \int_{{t_{0} }}^{\infty } \int_{0}^{\infty } M_{j,j + 1} (t_{0} )g_{{j|x_{j} }} g_{j + 1} {\text{d}}x_{j + 1} {\text{d}}x_{j} + \ldots \hfill \\ &\quad+ \int_{{t_{0} }}^{\infty } \int_{0}^{\infty } \ldots \int_{0}^{\infty } {M_{jn} } (t_{0} )g_{{x|x_{j} }} {\text{d}}x_{n - 1} \ldots {\text{d}}x_{j + 1} {\text{d}}x_{j} \hfill \\ \end{aligned}$$

(16)

Situation S2: $t_{0}$ is between two successive inspection instants $\Delta_{m}$ and $\Delta_{m + 1}$.

Given $t_{0}$ and $Z(\Delta_{m} ) = j$, MRL is calculated by:

$$M(t_{0} ) = \sum\limits_{i = j}^{n} \int_{{\begin{array}{*{20}l} {Z(\Delta_{m} ) = j} \hfill \\ {Z(t_{0} ) = i} \hfill \\ \end{array} }} {M(t_{0} |Z(t_{0} ) = i)g_{{x|x_{j} }} {\text{d}}x_{n - 1} \ldots {\text{d}}x_{j + 1} {\text{d}}x_{j} }$$

(17)

The integral region of $m_{i} (t_{0} )$ is equal to:

$$\left\{ {(X_{1} ,X_{2} , \ldots ,X_{j} )\left| {\begin{array}{*{20}c} {x_{j} > \Delta_{m} - T_{j - 1} } \\ {x_{j} + x_{j + 1} + \ldots + x_{i - 1} < t_{0} - T_{j - 1} } \\ {x_{j} + x_{j + 1} + \ldots + x_{i} > t_{0} - T_{j - 1} } \\ \end{array} } \right.} \right\}$$

(18)

Similar to the proof of (14), project the region onto the lower dimension space repeatedly so that the multiple integral is transformed into repeated integral. The upper and lower limits can be obtained as in (14).

The steps for evaluating SF and MRL are summarized as: ① obtain $t_{0} ,\;\Delta_{m} ,\;Z(\Delta_{m} ),\;T_{i} ,\;i < Z(\Delta_{m} )$ and determine $g_{{x|x_{j} }}$ according to historical inspection data; ② evaluate the survival function by (9), (11) and (13); ③ evaluate the mean residual life by (15), (16) and (17).

4 Numerical examples

The parameter estimation is not addressed in this paper. The DGA information and failure rate parameters in [24] are adopted. The first numerical example is to compare the results obtained by developed formulas with those of the Monte Carlo technique. In [25], the same reliability indices, SF and MRL, are evaluated by the Monte Carlo technique. The minimal errors indicate the accuracy of developed formulas.

Assume that the baseline function has a Weibull distribution and the link function follows exponential form:

$$\left\{ \begin{aligned} h_{0} (s) = \tfrac{{\beta s^{\beta - 1} }}{{\alpha^{\beta } }}\;\;\;s \ge 0,\alpha = 10,\beta = 2 \hfill \\ \psi (Z(s)) = e^{{1 \times \left( {Z(s) - 1} \right)}} \hfill \\ \end{aligned} \right.$$

(19)

For simplicity, we assume $n = 3$. In other words, the health condition is divided into 3 stages $\{ 1,2,3\}$ and the sojourn times $X_{1}$ and $X_{2}$ are s-independent and identically distributed Weibull random variables. The PDF of $X_{i}$ is given by:

$$\begin{aligned} g_{i} = g(x_{i} ) = \frac{{bx_{i}^{b - 1} }}{{a^{b} }}\exp \left[ { - \left( {\frac{{x_{i} }}{a}} \right)^{b} } \right]\;\;\;x_{i} > 0,i = 1,2, \ldots ,n \hfill \\ \hfill \\ \end{aligned}$$

(20)

where $a = 11.2838$ and $b = 2$. It is easy to verify that $E[X_{i} ] = 10$.
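The expectation follows from the standard formula for the mean of a Weibull distribution with scale a and shape b, using $\Gamma (1.5) = \sqrt \pi /2$:

$$E[X_{i} ] = a\,\Gamma \left( {1 + \frac{1}{b}} \right) = 11.2838 \times \Gamma (1.5) = 11.2838 \times \frac{\sqrt \pi }{2} \approx 11.2838 \times 0.8862 \approx 10$$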

We sample $X_{1} ,\;X_{2}$ by the Monte Carlo technique and calculate the SF and MRL by (6) and (8). The convergence condition is that the coefficient of variation is less than 0.05. For a new component $t_{0} = 0,t = t_{0} + 5$; for an old component, we presume $\Delta_{m} = t_{0} = 4$, $Z(\Delta_{m} ) = 1$, $t = t_{0} + 5$ in case 1 and $\Delta_{m} = 4$, $t_{0} = 5$, $Z(\Delta_{m} ) = 1$, $t = t_{0} + 1$ in case 2. The results given by the proposed analytical method and the Monte Carlo method are contrasted in Table 2 where old1 and old2 represent the two different cases for an old component, respectively.
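A minimal sketch of the Monte Carlo procedure for the new-component case is given below. It samples $X_{1}$ and $X_{2}$ from the Weibull distribution in (20), evaluates (6) on each sampled path, and averages. The function names, the seed, the minimum sample count, and the reading of the convergence condition as "standard error of the mean divided by the mean is below 0.05" are assumptions of this illustration; the paper does not specify these details.

```python
import math
import random

ALPHA, BETA = 10.0, 2.0   # baseline Weibull parameters, Eq. (19)
A, B = 11.2838, 2.0       # sojourn-time Weibull parameters, Eq. (20)

def path_survival(t0, t, x1, x2):
    """Eq. (6) for n = 3, given sampled sojourn times x1 and x2."""
    bounds = [0.0, x1, x1 + x2, math.inf]
    exponent = 0.0
    for state in (1, 2, 3):
        lo = max(t0, bounds[state - 1])
        hi = min(t, bounds[state])
        if hi > lo:
            exponent += math.exp(state - 1) * ((hi / ALPHA) ** BETA - (lo / ALPHA) ** BETA)
    return math.exp(-exponent)

def monte_carlo_sf_new(t, tol=0.05, min_samples=1000, max_samples=200000, seed=0):
    """Estimate R(t) for a new component (t0 = 0, Z(0) = 1)."""
    rng = random.Random(seed)
    total = total_sq = 0.0
    count = 0
    while count < max_samples:
        # random.weibullvariate(alpha, beta): alpha is the scale, beta the shape
        x1 = rng.weibullvariate(A, B)
        x2 = rng.weibullvariate(A, B)
        value = path_survival(0.0, t, x1, x2)
        total += value
        total_sq += value * value
        count += 1
        if count >= min_samples:
            mean = total / count
            variance = max(total_sq / count - mean * mean, 0.0)
            # assumed stopping rule: relative standard error below tol
            if mean > 0 and math.sqrt(variance / count) / mean < tol:
                break
    return total / count, count

estimate, n_samples = monte_carlo_sf_new(5.0)
```

Changing `seed` changes the estimate slightly even though the stopping rule is still met, which is the sample-to-sample variation discussed for the Monte Carlo method.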

Table 2 SF and MRL given by RIBAM and Monte Carlo method

From Table 2 we can see that the results obtained by the two methods are very close. Moreover, the proposed analytical formulas offer additional advantages. Theoretically, the analytical formulas always provide a single, reproducible value, whereas the value offered by the Monte Carlo method varies with the designated simulation parameters. The Monte Carlo method is based on the process of “sampling-evaluation-convergence” and the accuracy of its results depends on the convergence condition and the number of samples. Although the uncertainty can be controlled within a given range, the evaluation results vary with the samples. In other words, two sets of samples may produce different results even if they both satisfy the convergence condition. On the other hand, the error in the proposed analytical method depends only on rounding, as long as the equations are stated correctly.

In the proposed model, the analytical method has a shorter calculation time and higher accuracy than the Monte Carlo method when $n \le 4$. When $n \ge 5$, the computation of the repeated integrals could be a heavy burden, and the proposed analytical method no longer has a computation time advantage. However, the analytical formulas offer a higher accuracy for both $n \le 4$ and $n \ge 5$. In this numerical example, the calculation time of SF is 0.18 s using the analytical method on a 2.6 GHz computer, while the time used by the Monte Carlo method varies between 0.5 s and 5 s.

In most practical cases, human operators would like to observe precise power transformer conditions. A large number of health condition stages will make the problem more complex and reduce the decision making efficiency. Moreover, the evaluation accuracy depends on the established model, monitoring data and parameters estimation. A large number of stages can reduce the evaluation efficiency and accuracy. Therefore, four stages of aging condition are deemed enough for the reliability evaluation of power transformers.

The DMDT models always assume that the equipment condition stays the same between two successive inspection instants to make the SF calculation tractable. Furthermore, the CMCT model assumes, rather unrealistically, that the equipment condition is always available. Hence, both models can produce errors. To illustrate the advantages of the proposed model, we concentrate on the equipment reliability in a single inspection interval. The survival functions $P\left( {T > t_{0} + \Delta |T > t_{0} ,\;Z(t_{0} ) = 1} \right)$ under $\Delta = 0.4$ and $\Delta = 0.8$ are shown in Table 3 and Figs. 3–5.

Table 3 SF and MRL given by analytical formulas and Monte Carlo method

From Table 3 and Figs. 3 and 4, we can see that the SF obtained from DMDT is always larger than that from DMCT, which means that we will overestimate the reliability if we ignore the state transition between inspection points. At the inspection points the results obtained by the CMCT and DMCT models are the same; these are shown in bold in Table 3. The difference between the two models is that the DMCT model is under discrete monitoring while the CMCT model is under continuous monitoring. However, at inspection points the state is known and the uncertainty of $Z\left( {t_{0} } \right)$ is eliminated. Thus the difference between DMCT and CMCT disappears and the DMCT model reduces to the CMCT model.
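The overestimation by DMDT can be reproduced with a short sketch for $n = 3$ and an inspection point $t_{0} = \Delta_{m}$ with $Z(t_{0} ) = 1$. The DMDT value freezes the state at 1 over $[t_{0} ,t_{0} + \Delta ]$, while the DMCT value integrates over a possible transition inside the interval. The helper names and the Simpson rule are choices made here, and $g_{{x|x_{j} }}$ is assumed to be the density of $X_{1}$ conditioned on $X_{1} > t_{0}$ (its definition is in the appendices, which are not reproduced here). Parameters follow (19) and (20).

```python
import math

ALPHA, BETA = 10.0, 2.0   # baseline Weibull parameters, Eq. (19)
A, B = 11.2838, 2.0       # sojourn-time Weibull parameters, Eq. (20)

def g(x):
    return (B * x ** (B - 1) / A ** B) * math.exp(-((x / A) ** B))

def g_bar(x):
    return math.exp(-((x / A) ** B))

def cum_hazard(lo, hi, state):
    return math.exp(state - 1) * ((hi / ALPHA) ** BETA - (lo / ALPHA) ** BETA)

def simpson(f, lo, hi, n=100):
    if hi <= lo:
        return 0.0
    h = (hi - lo) / n
    total = f(lo) + f(hi)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * f(lo + i * h)
    return total * h / 3

def sf_dmdt(t0, delta):
    """State assumed frozen at 1 between inspections."""
    return math.exp(-cum_hazard(t0, t0 + delta, 1))

def sf_dmct(t0, delta):
    """State 1 observed at t0; transitions allowed anywhere in (t0, t0 + delta]."""
    t = t0 + delta
    norm = g_bar(t0)  # P(X1 > t0): assumed normalisation of the conditional density
    r11 = math.exp(-cum_hazard(t0, t, 1)) * g_bar(t)
    r12 = simpson(lambda x1: math.exp(-cum_hazard(t0, x1, 1) - cum_hazard(x1, t, 2))
                  * g(x1) * g_bar(t - x1), t0, t)
    r13 = simpson(lambda x1: g(x1) * simpson(
        lambda x2: math.exp(-cum_hazard(t0, x1, 1) - cum_hazard(x1, x1 + x2, 2)
                            - cum_hazard(x1 + x2, t, 3)) * g(x2), 0, t - x1), t0, t)
    return (r11 + r12 + r13) / norm

gap_04 = sf_dmdt(4.0, 0.4) - sf_dmct(4.0, 0.4)
gap_08 = sf_dmdt(4.0, 0.8) - sf_dmct(4.0, 0.8)
```

Both gaps are positive because every path that leaves state 1 inside the interval accumulates more hazard than the frozen-state path.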

For non-inspection points, the SF calculated in the DMCT model is smaller than that calculated in the CMCT model, which is due to the possible state transition between the last inspection instant and the current time. The component reliability decreases with time in the long run. Besides, the DMCT curve shows that the reliability decreases more steeply when $t_{0} - \Delta_{m}$ is larger. It is reasonable to expect that the longer the time between the last inspection instant $\Delta_{m}$ and the current time $t_{0}$, the bigger the error if we regard $Z(t_{0} )$ as $Z(\Delta_{m} )$. Besides, an older component tends to produce a larger error. This confirms the intuitive notion that the longer a component stays in a state (except the absorbing state), the higher is the likelihood that it will transit to another (worse) state, and the larger is the error unless we consider the state transition within the next inspection interval.

Figure 5 shows how the health condition strongly affects the shape of SFs. The component condition transits from state 1 to state 2 at time 10. The worse health condition offers a sharper decrease in SF. The diversity of component health condition is not considered in the traditional exponential or the Weibull distribution which will lead to serious errors.

To illustrate the influence of Weibull parameters of sojourn time distribution (STD), we vary the shape parameter b from 0.5 to 5 and change the scale parameter a to make sure the expectation of sojourn time is 10. We include the coefficient of variation of the sojourn time distribution in Table 4 to gain more insight. CV is usually introduced to describe the dispersion degree of distribution. In this case, we would like to observe how the distribution of condition sojourn time changes the reliability indices even though two distribution functions have the same expected condition sojourn time.

Table 4 SF and MRL in different Weibull distribution parameters

The five groups of parameters $(a,b,{\text{CV}})$ are (5, 0.5, 2.2361), (8.8261, 0.8, 1.2605), (10, 1, 1), (11.2838, 2, 0.5227) and (10.8912, 5, 0.2290). In Table 4, even though the five groups of transformers have the same expected condition sojourn time, they have different reliability indices. A bigger CV always leads to a lower survival function and a shorter mean residual life.
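The scale parameter and CV of each group follow from the standard Weibull moment formulas: the mean is $a\Gamma (1 + 1/b)$ and the CV is $\sqrt {\Gamma (1 + 2/b)/\Gamma (1 + 1/b)^{2} - 1}$. The short sketch below (the function name is hypothetical) recomputes a and CV for each shape parameter b with the mean fixed at 10.

```python
import math

def weibull_scale_and_cv(shape, mean=10.0):
    """Return (scale a, coefficient of variation) of a Weibull distribution
    with the given shape b and the prescribed mean."""
    g1 = math.gamma(1 + 1 / shape)
    g2 = math.gamma(1 + 2 / shape)
    scale = mean / g1                  # from mean = a * Gamma(1 + 1/b)
    cv = math.sqrt(g2 / g1 ** 2 - 1)   # std / mean, independent of the scale
    return scale, cv

groups = {b: weibull_scale_and_cv(b) for b in (0.5, 0.8, 1.0, 2.0, 5.0)}
for b, (a, cv) in groups.items():
    print(f"b = {b}: a = {a:.4f}, CV = {cv:.4f}")
```

The CV depends only on the shape parameter, so varying b while rescaling a isolates the effect of dispersion from the effect of the mean sojourn time.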

The survival functions are calculated for a new component. The five SFs are shown in Fig. 6. The semi-Markov process degenerates to a Markov process when $b = 1$. This observation implies that errors are inevitable if we always assume that the state transition conforms to a Markov process. Another notable observation is that variation in the sojourn time distribution parameters can lead to different SF curves, even though the distributions are all Weibull and have the same mean value. The equipment reliability declines more sharply as the CV increases. It is reasonable to expect that a larger variability leads to a lower reliability and a higher maintenance cost. The MRLs shown in the last column of Table 4 indicate that a larger CV also means a shorter mean equipment life. The presented results are for $n = 4$. Similar conclusions hold for $n > 4$.
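A Monte Carlo check of this kind can be sketched as follows. This is only an illustration under stated assumptions, not the procedure used for Fig. 6: the hazard is taken as a Weibull baseline multiplied by $\exp(\gamma Z(t))$, the states are visited in the order $1, 2, \ldots, n$ with state $n$ absorbing, all sojourn times share the same Weibull parameters, and the values of `beta`, `eta` and `gamma` are hypothetical.

```python
import math
import random


def survival_mc(t, a, b, n_states=4, beta=2.0, eta=40.0, gamma=0.5,
                n_paths=20000, seed=1):
    """Estimate SF(t) = E[exp(-cumulative hazard up to t)] for a new component.

    Assumed hazard: h(s, z) = (beta/eta) * (s/eta)**(beta-1) * exp(gamma * z),
    with z = 1, 2, ..., n_states and Weibull(a, b) sojourn times.
    """
    rng = random.Random(seed)
    total = 0.0
    for _ in range(n_paths):
        start, cum_hazard = 0.0, 0.0
        for state in range(1, n_states + 1):
            if state < n_states:
                # weibullvariate(alpha, beta): alpha is the scale, beta the shape
                end = min(start + rng.weibullvariate(a, b), t)
            else:
                end = t  # absorbing state lasts until the evaluation time
            baseline = (end / eta) ** beta - (start / eta) ** beta
            cum_hazard += math.exp(gamma * state) * baseline
            start = end
            if start >= t:
                break
        total += math.exp(-cum_hazard)
    return total / n_paths


# Same mean sojourn time (10), different shape parameters.
for a, b in ((5.0, 0.5), (8.8261, 0.8), (10.0, 1.0), (11.2838, 2.0), (10.8912, 5.0)):
    print(f"a={a}, b={b}: SF(20) ~ {survival_mc(20.0, a, b):.4f}")
```

With $b = 1$ the sojourn times are exponential, so the simulated state process is Markov; the other shape values give the semi-Markov cases. The printed values depend on the hypothetical hazard parameters and should not be compared with Table 4.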

5 Conclusion

We develop analytical formulas based on a more realistic DMCT model for evaluating the equipment reliability of deteriorating systems. The DMCT model assumption ensures that the results agree with practice. The small errors between the analytical formulas and the Monte Carlo results indicate the accuracy of the proposed method. Furthermore, the proposed method offers more realistic results in a shorter calculation time.

By comparing SF and MRL in the three models, we learn that the reliability will be overestimated if we apply a DMDT model between inspection points. That is, assuming $Z(t_{0} ) = Z(\Delta_{m} )$ or $Z(t) = Z(t_{0} )$ brings inevitable errors. A longer inspection interval leaves more time for a transition between the last observed state and the current state, and therefore results in a larger error. For non-monitoring points, the DMCT results differ from those of CMCT. However, at a monitoring point, the DMCT and CMCT models give the same results, i.e., the DMCT model reduces to the CMCT model. This also indicates that the CMCT model cannot conform to practical cases, since it is unrealistic to obtain the health condition at all points. Another observation is that a larger CV always corresponds to a lower reliability despite the same expected state sojourn time. We conclude that two sets of products can have different reliability results although they have the same expected state sojourn time. Steady quality (i.e., a smaller CV) is essential to achieve a higher reliability. A greater variation in quality tends to shorten the MRL and increase the cost.

Author information

Authors and Affiliations

School of Automation, Guangdong University of Technology, Guangzhou, 510006, China
Yifei WANG
Galvin Center for Electricity Innovation, Illinois Institute of Technology, Chicago, IL, 60616, USA
Mohammad SHAHIDEHPOUR
College of Electrical Engineering, Zhejiang University, Hangzhou, 310058, China
Chuangxin GUO

Corresponding author

Correspondence to Mohammad SHAHIDEHPOUR.

Additional information

CrossCheck date: 10 July 2017

Appendices

Appendix A

1.1 Proof of (10)

To obtain (10), we need to transform the multiple integral $r_{1k}$ into a repeated integral, which involves the following steps:

Step 1: determine the integral area.

In this case, the integral area is denoted by:

$$D_{1k} = \left\{ {\left( {X_{1} ,X_{2} , \ldots ,X_{k} } \right)\left| \begin{aligned} x_{1} + \ldots + x_{k - 1} \le t \hfill \\ x_{1} + \ldots + x_{k} > t \hfill \\ \end{aligned} \right.} \right\}\;\;\;\;s \le k < n$$

$$D_{1n} = \left\{ {(X_{1} ,X_{2} , \ldots ,X_{n - 1} )\;|\;x_{1} + \ldots + x_{n - 1} \le t} \right\}$$

Step 2: project the integral area $D_{1k}$ ($D_{1n}$) onto a lower-dimensional space.

In this step, we obtain the projected area $d_{k - 1} = \{ (X_{1} , \ldots ,X_{k - 1} )\;|\;x_{1} + \ldots + x_{k - 1} \le t\}$ and the integral

$$r_{1k} = \int_{{d_{k - 1} }} {\int_{{t - x_{1} - \ldots - x_{k - 1} }}^{\infty } {R_{1k} (t)g_{x} } } {\text{d}}x_{k} \ldots {\text{d}}x_{2} {\text{d}}x_{1}$$

and, for $k = n$,

$$r_{1n} = \int_{{d_{n - 2} }} {\int_{0}^{{t - x_{1} - \ldots - x_{n - 2} }} {R_{1n} (t)g_{x} } } {\text{d}}x_{n - 1} \ldots {\text{d}}x_{2} {\text{d}}x_{1}$$

Step 3: repeat step 2 and decrease the dimension of $d_{i}$ in succession until the dimension of the projected integral region is equal to 1, i.e., $i = 1$. This finally yields (10).
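The region decomposition in steps 1 to 3 can be checked numerically for $k = 2$. The sketch below is illustrative only: it sets $R_{12}(t) \equiv 1$, so that the repeated integral reduces to the probability that $x_{1} \le t < x_{1} + x_{2}$, and it assumes independent Weibull sojourn times with a common scale `a` and shape `b`. All function names are hypothetical.

```python
import math
import random


def weibull_pdf(x, a, b):
    """Density of a Weibull sojourn time with scale a and shape b."""
    if x <= 0.0:
        return 0.0
    return (b / a) * (x / a) ** (b - 1) * math.exp(-((x / a) ** b))


def weibull_sf(x, a, b):
    """P(X > x) for a Weibull sojourn time."""
    return math.exp(-((max(x, 0.0) / a) ** b))


def prob_state2_repeated(t, a, b, steps=2000):
    """Repeated integral over x1 in [0, t] and x2 in (t - x1, inf).

    The inner integral of the density over (t - x1, inf) is the survival
    function at t - x1; the outer integral uses the midpoint rule.
    """
    h = t / steps
    total = 0.0
    for i in range(steps):
        x1 = (i + 0.5) * h
        total += weibull_pdf(x1, a, b) * weibull_sf(t - x1, a, b)
    return total * h


def prob_state2_monte_carlo(t, a, b, n=200_000, seed=0):
    """Fraction of sampled paths with x1 <= t < x1 + x2."""
    rng = random.Random(seed)
    hits = 0
    for _ in range(n):
        x1 = rng.weibullvariate(a, b)  # (scale, shape)
        x2 = rng.weibullvariate(a, b)
        if x1 <= t < x1 + x2:
            hits += 1
    return hits / n


print(prob_state2_repeated(5.0, 11.2838, 2.0))
print(prob_state2_monte_carlo(5.0, 11.2838, 2.0))
```

For $b = 1$ the sojourn times are exponential and the same probability has the closed form $(t/a)\,e^{-t/a}$, which gives an independent check of both functions.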

Appendix B

2.1 Proof of (12)

The proof can be obtained as follows:

Step 1: determine the multiple integral area.

Since the state transition points have the Markov property and the known condition is $Z(T_{j - 1} ) = j$, the integral region $D_{jk}$ is:

$$\left\{ {(X_{j} ,X_{j + 1} , \ldots ,X_{k} )\left| \begin{array}{l} x_{j} \ge t_{0} - T_{j - 1} \hfill \\ x_{j} + \ldots + x_{k - 1} < t - T_{j - 1} \hfill \\ x_{j} + \ldots + x_{k} > t - T_{j - 1} \hfill \\ \end{array} \right.} \right\}\;\;\;\;1 \le k < n$$

and $D_{jn}$ is:

$$\left\{ {(X_{j} ,X_{j + 1} , \ldots ,X_{n - 1} )\;\left| \begin{array}{l} x_{j} \ge t_{0} - T_{j - 1} \hfill \\ x_{j} + \ldots + x_{n - 1} < t - T_{j - 1} \hfill \\ \end{array} \right.} \right\}$$

Step 2: project the integral area $D_{jk}$ onto lower-dimensional space $(X_{j} , \ldots ,X_{k - 1} )$.

In this step, the projected area $d_{j,k - 1}$ is:

$$\left\{ {(X_{j} ,X_{j + 1} , \ldots ,X_{k - 1} )\;\left| \begin{array}{l} x_{j} \ge t_{0} - T_{j - 1} \hfill \\ x_{j} + \ldots + x_{k - 1} < t - T_{j - 1} \hfill \\ \end{array} \right.} \right\}$$

and the integral is:

$$\int_{{d_{j,k - 1} }} {\int_{{t - x_{1} - \ldots - x_{k - 1} }}^{\infty } {R_{jk} } } (t|t_{0} )g_{{x|x_{j} }} {\text{d}}x_{k} \ldots {\text{d}}x_{j} .$$

For $k = n$, similar results can be obtained; they are omitted for brevity.

Step 3: repeat step 2 and decrease the dimension of integral region projection in succession until the dimension equals 1.

Finally, we have $d_{j,j} = \left\{ {X_{j} |t_{0} - T_{j - 1} \le x_{j} < t - T_{j - 1} } \right\}$. Steps 1 to 3 then yield (12).
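As an illustration of the pattern only, taking $k = j + 1$ in steps 1 to 3 leaves a two-dimensional region, and the repeated integral would read as below. This sketch assumes that $T_{j - 1} = x_{1} + \ldots + x_{j - 1}$, so that the inner lower limit $t - x_{1} - \ldots - x_{j}$ can be written as $t - T_{j - 1} - x_{j}$; it is not a restatement of (12).

$$\int_{{t_{0} - T_{j - 1} }}^{{t - T_{j - 1} }} {\int_{{t - T_{j - 1} - x_{j} }}^{\infty } {R_{j,j + 1} (t|t_{0} )g_{{x|x_{j} }} } } {\text{d}}x_{j + 1} {\text{d}}x_{j}$$

The outer limits are those of $d_{j,j}$, and the inner lower limit follows from the condition $x_{j} + x_{j + 1} > t - T_{j - 1}$ in $D_{jk}$.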

Appendix C

3.1 Proof of (14)

With $Z(\Delta_{m} ) = j,\;Z(t_{0} ) = i,\;Z(t) = k$, (14) can be obtained by the following steps.

Step 1: the integral region $D_{j,ik}$ equals

$$\left\{ {(X_{j} ,X_{j + 1} , \ldots ,X_{k} )\;\left| \begin{array}{l} \Delta_{m} < x_{1} + \ldots + x_{j} \hfill \\ x_{1} + \ldots + x_{i - 1} < t_{0} < x_{1} + \ldots + x_{i} \hfill \\ x_{1} + \ldots + x_{k - 1} < t < x_{1} + \ldots + x_{k} \hfill \\ \end{array} \right.} \right\}$$

Since the state transition points have the Markov property and the known condition is $Z(T_{j - 1} ) = j$, the integral region $D_{jk}$ is:

$$\left\{ {(X_{j} ,X_{j + 1} , \ldots ,X_{k} )\left| \begin{array}{l} x_{j} \ge t_{0} - T_{j - 1} \hfill \\ x_{j} + \ldots + x_{k - 1} < t - T_{j - 1} \hfill \\ x_{j} + \ldots + x_{k} > t - T_{j - 1} \hfill \\ \end{array} \right.} \right\}$$

for $1 \le k < n$, and $D_{jn}$ is:

$$\left\{ {(X_{j} ,X_{j + 1} , \ldots ,X_{n - 1} )\;\left| \begin{array}{l} x_{j} \ge t_{0} - T_{j - 1} \hfill \\ x_{j} + \ldots + x_{n - 1} < t - T_{j - 1} \hfill \\ \end{array} \right.} \right\}$$

Step 2: project $D_{j,ik}$ on lower-dimensional space.

Step 3: repeat step 2 until the dimension of integral region is reduced to 1.

Denote the projection on space $\{ X_{j} , \ldots ,X_{h} \}$ as $v_{h}$ and then we get:

$$v_{i} = \left\{ {\left( {X_{j} ,X_{j + 1} , \ldots ,X_{i} } \right)\;\left| \begin{array}{l} \Delta_{m} - T_{j - 1} < x_{j} \hfill \\ x_{j} + \ldots + x_{i - 1} < t_{0} - T_{j - 1} \hfill \\ x_{j} + \ldots + x_{i} > t_{0} - T_{j - 1} \hfill \\ x_{j} + \ldots + x_{i} < t - T_{j - 1} \hfill \\ \end{array} \right.} \right\}$$

$$v_{j} = \, \left\{ {X_{j} \left| {\Delta_{m} - T_{j - 1} < x_{j} < t_{0} - T_{j - 1} } \right.} \right\}$$

Steps 1 to 3 finally yield (14).

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.

About this article

Cite this article

WANG, Y., SHAHIDEHPOUR, M. & GUO, C. Applications of survival functions to continuous semi-Markov processes for measuring reliability of power transformers. J. Mod. Power Syst. Clean Energy 5, 959–969 (2017). https://doi.org/10.1007/s40565-017-0322-z

Received: 07 January 2016
Accepted: 10 July 2017
Published: 22 September 2017
Issue Date: November 2017
DOI: https://doi.org/10.1007/s40565-017-0322-z
