1 Introduction

With the rapid development of Internet of Things, information technology and Artificial Intelligence, Predictive maintenance (PdM) has gradually attracted great attention of scholars [14]. As a core of operation management in the industrial system, PdM plays a vital role in reducing the frequency of failures, improving operating efficiency, and ensuring product quality [5, 6], and has become a research hotspot of Prognostics and Health Management (PHM) in aerospace and manufacturing fields [79]. At present, there are many methods to solve the problem of PdM of equipment. These methods can be roughly divided into two categories [10]: PdM methods based on fault mechanism [1116] and data-driven. The data-driven methods are divided into mathematical statistics [1721] and artificial intelligence [2226]. During the implementation of PdM, through the accurate prediction of product failure time, the most cost-effective maintenance window can be determined before its failure. Furthermore, it can prevent the operating deterioration of equipment and minimize the shutdown time and maintenance costs through appropriate maintenance activities [27]. Therefore, improving the prediction accuracy of TBF is of great significance and can bring tangible benefits to the industry.

As an important reliability index, fault time can show the dynamic evolution process of faults, and have been predicted by a variety of data-driven methods, such as autoregressive moving average (ARMA) [28], singular spectrum analysis (SSA) [29], support vector regression (SVR) [30], artificial neural network (ANN) [31], etc. Due to the difficulty of the reliability test for complex system, the reliability information (such as TBF) is usually difficult to obtain [32]. Most of the above methods are for component, and are not applicable to the fault time prediction of complex mechanical products.

To fill this gap, the paper proposed a machine learning-based TBF prediction model with reliability information conversion. Mainly through the information conversion method based on Fisher algorithm to deeply mine the multi-source reliability information, convert the multi-source reliability information (such as fault data of similar products, unit, subsystem, and reliability simulation test data, etc. [33]) into the reliability information of the target product, expand the sample size of the small sample target product. For data samples with certain errors after capacity expansion, BP neural network with good prediction effect on time series data is selected to realize the final prediction of TBF, and PSO algorithm is used to optimize the initial weight and threshold of BP neural network to avoid falling into local extreme value and improve the convergence speed. Thus, more accurate TBF prediction results can be obtained. The rest of this paper is organized as follows: Sect. 2 provides an overview of existing methods for PdM. Section 3 introduces the methodology used in this paper, including reliability information conversion, compatibility test and TBF prediction based on PSO-BP. Section 4 is a case study, and the prediction results of TBF are also analyzed in this section, followed by a brief summary in Sect. 5.

2 Literature review

TBF prediction of complex mechanical products is a typical small sample problem. There are two methods to solve the problem: one is mathematical methods suitable for data analysis of small samples, such as Bayesian and poor information method; The former can improve the evaluation accuracy of the field test with small sample by fusing the simulation prior information [3437], but there is problem of how to scientifically determine the prior distribution before information fusion. The evaluation accuracy will even drop when the prior information is distorted. In recent years, scholars have carried out relevant research on fault prediction, process evaluation and event prediction in poor information situation, and achieved good results [3842]. But the core prediction model of this method is the use of Grey Model alone or in combination with the Bootstrap. This type of prediction method does not essentially avoid the problem of averaging the weights after superposition calculation. The mean weight will cause the instability and large prediction errors of the prediction model. The second is expansion of small samples, such as Bootstrap [4345]. The Bootstrap relies too much on the subsample, which is not conducive to the robust of parameter evaluation. While expanding the capacity of small sample, the information conversion method can overcome the problems of the above two methods. This method was first proposed by Qian Xuesen in his research on two bombs and one satellite, that is, in order to evaluate reliability, it is often necessary to convert and synthesize experimental data in different environments [46]. At present, there are mainly two information conversion methods for converting reliability by environmental factors and regression conversion of reliability information. Because the latter has certain requirements on the distribution of reliability information in use, that is, it is required to obey a specific distribution. Therefore, it is necessary to find a method that is not affected by the distribution of reliability information to solve the problem of insufficient fault data in the TBF prediction problem of mechanical products with small samples. As the Fisher algorithm is a linear discriminant method based on the idea of variance analysis, which can better distinguish each population, and have no requirements for the distribution of the population [47]. Therefore, this paper uses the information conversion method based on Fisher algorithm to expand the capacity of small and medium-sized sample in TBF prediction of complex mechanical products.

TBF belongs to time series data. Due to the excellent performance in solving the problem of time series prediction, machine learning is very consistent with the needs of practical engineering problems [48]. To solve time series analysis problems, researchers have applied linear regression model, SVM [49], decision tree and other machine learning models. The time series prediction based on machine learning is a typical supervised learning task, in which the input attribute is the time series data of historical data, and the output label is the future data (on the training set). Machine learning model is a data-driven way to learn sequence features and establish the mapping relationship between historical data and future data. In practical application, the time series prediction method based on machine learning performs better than the method based on statistics [50]. In order to improve the accuracy of time series prediction, researchers try to make a breakthrough in model complexity and sequence feature expression [51]. As a classical feedforward neural network, BP neural network has the advantages of good self-learning, self-organization and adaptability, and strong fault tolerance for its structural characteristics, which allows certain errors of input samples. Good simulation effect can be achieved for the expanded data samples [52], which is conducive to reduce the error of the method proposed in this paper. However, the traditional BP neural network has the defects that it is easy to fall into local extremum and the convergence speed is slow or even non convergent [53]. In view of this defect, scholars use genetic algorithm (GA) to optimize BP neural network [54, 55] and improve the convergence speed. Compared with GA algorithm, particle swarm optimization (PSO) algorithm is simple in calculation and requires less parameters adjustment. Using PSO algorithm to optimize the initial weight and threshold of BP neural network cannot only avoid falling into local extreme value but also improve the convergence speed [56]. Therefore, BP neural network with good prediction effect on time series data is selected to realize the final prediction of TBF in this paper, and PSO algorithm is also adapted to optimize BP neural network to verify the feasibility of information conversion method based on Fisher algorithm.

3 Methodology

3.1 Reliability information conversion

The reliability information of a mechanical product (such as: fault data of similar product, unit, subsystems, and data of reliability simulation, etc. [52]) is various and have multidimensional characteristics. Specifically, according to the performance reliability analysis of the products, the reliability information can be divided into three dimensions: time, hierarchy, and information source. In terms of time dimension, it includes design phase information, manufacturing phase information, simulation operation phase information, etc. In the hierarchy dimension, it includes part level information, unit level information, subsystem level information and system level information. In this dimension, the amount of information presents an inverted pyramid; In the dimension of information source, it includes expert judgment information, performance degradation information, similar product information, etc. In order to use the information effectively and fully reflect its value, it needs to be processed deeply in the process of using reliability information, that is, the reliability information needs to be effectively converted before they can be used. Since the Fisher algorithm is a linear discriminant method established based on the idea of variance analysis, it can better distinguish each population, and this discriminant method does not make any requirements on the distribution of the population [53]. Therefore, the paper proposed an information conversion method based on Fisher algorithm to convert the reliability information. The following paragraph describes the detail.

The central idea of the conversion method is to apply the Fisher algorithm to divide the information samples, pair the sample points in two different environments, and then use the geometric mean of the ratio to calculate the environmental factor. Assume that the reliability information samples in environments A and B are represented by X and Y, respectively, \(X = \{ x_{1},x_{2},\ldots,x_{n} \}\), \(Y = \{ y_{1},y_{2},\ldots,y_{m} \}\) (\(n \ge m\)). As shown in Fig. 1, to convert the sample information in environment A into environment B, the specific process of information conversion based on Fisher algorithm is as follows:

Figure 1
figure 1

Information conversion process based on Fisher algorithm

1) Rearrange the order of sample information. Rearrange the sample information in the samples X and Y in ascending order:

$$\begin{aligned} & X:x_{(1)} < x_{(2)} < \cdots < x_{(n)}, \\ & Y:y_{(1)} < y_{(2)} < \cdots < y_{(m)}. \end{aligned}$$

2) Find center points of sample data. Calculate the center points of sample X and Y respectively according to Eq. (1).

$$\begin{aligned} \begin{aligned} &\vert x_{(r)} - \overline{X} \vert = \min_{1 \le i \le n} \vert x_{(i)} - \overline{X} \vert ,\\ & \vert y_{(s)} - \overline{Y} \vert = \min_{1 \le j \le m} \vert y_{(s)} - \overline{Y} \vert . \end{aligned} \end{aligned}$$
(1)

\(x_{(r)}\) divides the sample X into three parts: \(\{ x_{(1)},x_{(2)},\ldots, x_{(r - 1)} \}, \{ x_{(r)} \}, \{ x_{(r + 1)},x_{(r + 2)},\ldots,x_{(n)} \}\);

\(y_{(s)}\) divides the sample Y into three parts: \(\{ y_{(1)},y_{(2)},\ldots, y_{(s - 1)} \}, \{ y_{(s)} \}, \{ y_{(s + 1)},y_{(s + 2)},\ldots,y_{(m)} \}\).

3) Construct sample pairings. If: \(n = m\), and \(r = s\), construct sample pairing: \((x_{(i)},y_{(j)}),i = j = 1,2,\ldots,n\); otherwise, keep sample pairing: (\(x_{(r)},y_{(s)}\)), then, the samples are paired in two cases:

(i) For the left sample points of \(x_{(r)}\) and \(y_{(s)}\), if \(r - 1 \ge s - 1\), cluster the sample X based on Fisher, the cluster class is \(s - 1\), and the mean value of each class is \(y_{j}^{*}\), construct sample pairings: \(\{ y_{j}^{*},y_{j} \}\), \(1 \le j \le s - 1\); Otherwise, cluster the sample Y based on Fisher, the cluster class is \(r - 1\), and the mean value of each class is \(y_{j}^{*}\), construct sample pairings: \(\{ y_{j}^{*},y_{j} \}\), \(1 \le i \le r - 1\).

(ii) For the right sample points of \(x_{(r)}\) and \(y_{(s)}\), if \(n - r \ge m - s\), cluster the sample X based on Fisher, the cluster class is \(m - s\), and the mean value of each class is \(y_{j}^{*}\), construct sample pairings: \(\{ y_{j}^{*},y_{j} \}\), \(s + 1 \le j \le m\); Otherwise, cluster the sample Y based on Fisher, the cluster class is \(n - r\), and the mean value of each class is \(y_{j}^{*}\), construct sample pairings: \(\{ y_{j}^{*},y_{j} \}\), \(r + 1 \le i \le n\).

4) Calculate the conversion factor. After pairing the samples with the method in step 3), calculate the conversion factor according to Eq. (2).

$$\begin{aligned} K = \sqrt[m]{\frac{y_{1}^{*}}{y_{1}} \cdot \frac{y_{2}^{*}}{y_{2}} \cdots \frac{y_{m}^{*}}{y_{m}}}. \end{aligned}$$
(2)

5) Get the conversion function. The following Eq. (3) can convert the information of the sample X under environment A into the sample Y under environment B, so as to achieve the purpose of expanding the capacity of the sample Y.

$$\begin{aligned} y_{i}^{\prime} = x_{i} \cdot K^{ - 1}. \end{aligned}$$
(3)

3.2 Compatibility test

After the information conversion through the method in Sect. 3.1, it is necessary to use a certain method to judge whether the conversion result is valid, that is, the reliability information can truly reflect the characteristics of statistical parameters, which requires that the reliability information and the small sample field information of the target mechanical product approximately obey the same distribution, and this goal is achieved through the compatibility test. Wilcoxon rank sum test, Smirno test, Mood test, etc. can all be used to test the compatibility directly using sample data [57]. The rank sum test is adopted in this paper, which can effectively test whether the data from different samples obey the same overall distribution when the overall distribution is not clear. Assume that the reliability information sample is: \(X = \{ x_{1},x_{2},\ldots,x_{n} \}\). The on-site reliability information sample of the target mechanical product is: \(Y = \{ y_{1},y_{2},\ldots,y_{m} \}\), consider a trade-off hypothesis:

Null hypothesis \(H_{0}\): X and Y belong to the same population;

Alternative hypothesis \(H_{1} : X\) and Y do not belong to the same population.

Combine the sample information of the two samples into a new sample population, and arrange the new sample information in order from small to large to obtain a new sample: Z, \(Z = \{ z_{1},z_{2},\ldots,z_{n + m} \}\), \(z_{1} \le z_{2} \le \cdots z_{n + m}\). The subscript i is called the rank of the new sample information, \(i = 1,2, \ldots ,n + m\). If \(x_{k} = z_{j}\), \(x_{k} \in \{ x_{1},x_{2},\ldots,x_{n} \}\), then it is considered that the rank of elements \(x_{k}\) in sample X in the new sample is j, and note that: \(r(x_{k}) = j\). The rank sum of sample X is \(T = \sum_{k = 1}^{n} r(x_{k})\).

Based on the rank sum test table, under significance level α, if T meets: \(T_{1}(\alpha ) < T < T_{2}(\alpha )\), the null hypothesis should be accepted, otherwise, the null hypothesis should be rejected and the alternative hypothesis need to be accepted. When the values of n and m are both large (\(n,m > 10\)), the rank sum T approximately obeys normal distribution, that is:

$$\begin{aligned} T = \sum_{k = 1}^{n} r(x_{k}) \sim N\biggl(\frac{n(n + m + 1)}{2},\frac{nm(n + m + 1)}{2}\biggr). \end{aligned}$$
(4)

At this time, the Z test method of normal distribution is used to test the compatibility. Under the significance level α, the test rule is as shown in Eq. (2):

$$\begin{aligned} P \biggl\{ \biggl\vert \frac{T - \frac{n(n + m + 1)}{2}}{\sqrt{\frac{nm(n + m + 1)}{12}}} \biggr\vert < \mu _{\alpha} |{\mathrm{H}}_{0} \biggr\} = 1 - \alpha . \end{aligned}$$
(5)

Here \(\mu _{\alpha} \) represents the α quantile of the standard normal distribution. When Eq. (5) is established, under the confidence: \(1 - \alpha \), the null hypothesis will be accepted; otherwise, the two samples are considered not to meet the requirements of the compatibility test.

3.3 TBF prediction based on PSO-BP

3.3.1 Prediction model

After the reliability information is converted to the target mechanical product through the above steps, an enlarged data sample is obtained, which is composed of reliability information of different dimensions and source. Since the reliability information of each different source (such as the TBF to be discussed in this paper) generally has the characteristics of time series, the BP Neural Network with better prediction effect on time series data is selected to realize the final prediction of TBF in this paper, to verify the feasibility of the information conversion method based on Fisher algorithm. The prediction model is shown in Eq. (6).

$$\begin{aligned} y(t) = f(y(t - n),y(t - n + 1),\ldots,y(t - 1). \end{aligned}$$
(6)

Here \(y(t)\) represents the TBF value at time t, n is the number of input neuron nodes of the BP neural network, that is, the TBF at time t is affected by the previous nth times, the specific value of n is determined according to the number of faults. The predicted network structure is shown in Fig. 2.

Figure 2
figure 2

BP neural network prediction structure

Here f is the hidden layer, and the number of nodes in the hidden layer of the prediction model is determined by the empirical equation and the trial-and-error method. The empirical equation is defined as Eq. (7).

$$\begin{aligned} m = \sqrt{n + l} + \alpha. \end{aligned}$$
(7)

Here m is the number of hidden layer nodes, n is the number of input nodes, l is the number of output nodes, and α is a constant between 1 and 10. It can be seen from the above analysis that the value of m is between 3 and 12. Determining the optimal number of nodes by trial and error method.

BP Neural Network is a multi-layer feedforward neural network. Its weights are adjusted by Back-Propagation learning algorithm, which has good nonlinear mapping ability, generalization ability and fault tolerance ability, and is easy to implement in engineering. But the initial value is randomly assigned, and it is easy to fall into a local minimum [58]. PSO is an algorithm with good global searching ability. It only needs optimized function and does not need other auxiliary information. PSO algorithm is adapted in this paper to find the global optimal particle by updating particle speed and position, so as to optimize the weight and threshold of BP neural network, and then realize the establishment of TBF prediction model based on PSO-BP. The process of PSO-BP prediction model is shown in Fig. 3.

Figure 3
figure 3

PSO-BP network modeling

3.3.2 Performance evaluation

The ultimate goal of this paper is to use the above method to predict the TBF of the target mechanical product. Firstly, the converted reliability information is used as the training set to train the prediction model, and then the TBF in the actual operation stage of the target mechanical product is selected as the test set to test the prediction effect of the model, and two evaluation metrics as shown in Eq. (8): the mean-absolute-percentage-error (MAPE) and the coefficient of determination (\(R^{2}\)) are selected to evaluate the performance of the model and verify the feasibility of information conversion method based on Fisher algorithm.

$$\begin{aligned} \textstyle\begin{cases} MAPE = \frac{\sum_{i} \vert \frac{y_{i} - \hat{y}_{i}}{y_{i}} \vert }{N} ,\\ R^{2} = 1 - \frac{\sum_{i} ( y_{i} - \hat{y}_{i} )^{2}}{\sum_{i} ( y_{i} - \bar{y}_{i} )^{2}}. \end{cases}\displaystyle \end{aligned}$$
(8)

Here \(y_{i}\) is the true value, \(\hat{y}_{i}\) is the predicted value, \(\bar{y}_{i}\) is the mean value of the true value, and N is the sample size.

4 Case study

4.1 Case description

Taking the remanufactured CNC milling machine tool (XK6032-01) as an example, due to its single piece production mode and part blank value, it is impossible for remanufacturing product entities to carry out huge destructive on-site tests. In addition, it is extremely difficult to build the system level simulation test model and platform of personalized remanufacturing products, which is costly and difficult to accurately simulate all working conditions in the process of re-service. Therefore, the target product of this case belongs to a typical small sample. The TBF data (as shown in Table 1) and its related reliability information (TBF) were collected respectively. Perform TBF prediction for the remanufactured CNC milling machine according to the process shown in Fig. 4.

Figure 4
figure 4

TBF Prediction of the remanufactured CNC milling machine tools

Table 1 Fault data during actual operation stage

4.2 Results

4.2.1 Conversion results of reliability information

The machine tool is used as the target environment, and the reliability information of each source is converted to the working environment of the remanufactured machine tool according to the information conversion method in Sect. 3.1. The compatibility test is carried out respectively with the fault data of the actual operation stage, and finally the expanded fault information table of the CNC machine tool as shown in Table 2 is obtained.

Table 2 Fault data after converted (TBF)

According to the number of faults of Table 2, the number of input neurons in the TBF prediction model based on PSO-BP neural network is determined to be \(n=3\), that is, the TBF at the fourth moment is affected by the TBF at the first three moments. In the process of training the model, the fault data of the first three moments is used as input, and the fault data of the fourth moment is used as output. For information sources with more than 4 failures, the sliding time window is used to obtain the value, the window size is set to 4, and the step size is set to 1. Based on this, according to the order of Table 2, the data from different sources of each dimension is input into the neural network training model in turn. A total of 34 groups of data can be used for model training. Take the data in Table 1 as the test set (5 groups in total) to test the trained model, and the prediction performance of this model is measured by the metrics in Eq. (8).

4.2.2 Prediction results

In this prediction, the neural network toolbox in Matlab is used for network training. The prediction model reaches convergence after 378 iterations. The prediction performance of the training set is shown in Fig. 5, and the prediction performance of the test set is shown in Fig. 6.

Figure 5
figure 5

Comparison of the predicted value of the training sample with the true value

Figure 6
figure 6

Comparison of the predicted value of the test sample with the true value

As can be seen from Fig. 5, the predicted value of each point in the training set almost completely coincides with the true value, and the error is almost zero, indicating that the TBF prediction model based on PSO-BP has a very good prediction effect on the training set.

It can be seen from Fig. 6 that the change trend of the true value of the test set and the predicted value are basically the same, indicating that the prediction model has a good prediction effect on the change trend of TBF. The specific prediction error is shown in Table 3.

Table 3 The relative error of each point in the test set

It can be seen from Table 3 that, except for test point 3, the absolute value of the relative error of the other points is less than 10%, which is within the allowable error range. At this time, the mean-absolute-percentage-error of the model is 0.1061, in general, if the MAPE is less than 10%, the model prediction accuracy is high; the coefficient of determination of the model is 0.9902, in general, \(R^{2}\) is within \([0,1]\), and the value is closer to 1, the higher the prediction accuracy of the model will be. It can be seen that the prediction accuracy of the model is high, which proves that the proposed method has a good prediction effect on TBF.

4.3 Discussion

In order to verify whether PSO-BP neural network model is superior to other intelligent models in predicting the time series data after sample expansion, standard BP neural network and GA-BP neural network are selected for comparative analysis. The experimental results are shown in Table 4.

Table 4 Performance comparison of prediction models

To express the prediction accuracy and convergence speed of each model more intuitively, the above results are displayed in the form of bar graph, as shown in Fig. 7. Because the iteration times are dimensional parameters, they are processed to calculate the iteration rate, that is, the ratio of iteration times to total times.

Figure 7
figure 7

Performance comparison of prediction models

Generally, the smaller the MAPE, the closer to 1 of \(R^{2}\), the smaller the iteration rate, the higher the prediction accuracy of the model, and the faster the convergence speed. It can be seen from Fig. 7 that the performance of PSO-BP is better than the other two intelligent models, indicating that this model is more suitable for the prediction of time series data with small sample after the expansion of sample size.

Through the above analysis, some advantages of the proposed approach can be summarized as follows. 1) The information conversion method based on Fisher algorithm converts the multi-source reliability information into the target product itself, overcomes the influence of the specific distribution of the reliability information, expands the sample size of complex mechanical products. 2) The TBF prediction model based on PSO-BP neural network avoids the problem of data samples affected by certain errors after capacity expansion, has high accuracy, and provides method support for PdM of complex mechanical products.

However, the proposed reliability information conversion method based on the Fisher algorithm has some problems and deficiencies in the process of converting experimental information. The basic idea of the Fisher algorithm is similar to dichotomy, that is, it needs to divide all data into two categories firstly, and then divide one of them into two categories too, recursive in turn. Therefore, the classification result obtained by this method is a local optimal solution, which has a certain impact on the accuracy of environmental factors.

5 Conclusions

Aiming at the problems of reliability information collection difficulty, high collection cost and small data samples in the process of TBF prediction of complex mechanical products, a machine learning-based TBF prediction model with reliability information conversion is proposed in this paper. The information conversion method based on the Fisher algorithm is used to deeply mine the reliability information of multiple-source reliability information, convert the multi-source reliability information into the target product itself, and expand the sample size of small sample target products. For data samples with certain errors after capacity expansion, BP neural network with good prediction effect on time series data is selected to realize the final prediction of TBF, and PSO algorithm is used to optimize the initial weight and threshold of BP neural network to avoid local extreme value and improve the convergence speed. Taking a remanufactured machine tools as an example to verify the proposed approach. The results show that the TBF prediction model based on PSO-BP proposed in this paper has a good prediction effect on the sample data after the reliability information conversion and expansion using the Fisher algorithm, and provides a strong basis for the PdM and reliability growth of remanufactured machine tools. In the future work, we should consider an optimization method to get the global optimal solution of data segmentation and pairing, improve the sample pairing effect of data, and improve the accuracy of conversion.