1 Introduction

Due to their advantages of high operating voltage, high power density, and long lifespan, lithium-ion batteries have become staple energy storage devices for many applications, including electric vehicles, smart grids, and energy storage systems [1, 2]. However, as the frequency of charging and discharging increases, the battery will gradually degrade or even become deactivated through different mechanisms [3]. Therefore, the study of the degradation of batteries is highly important, and reliable degradation models need to be established to accurately predict their remaining useful life (P).

In recent years, data-driven methods have drawn much attention in the field of lithium-ion battery RUL prediction due to the ability to directly predict battery degradation trends using historical monitoring data without relying on complex physical models [4]. Most current studies define capacity or internal resistance as a health indicator (HI) [5, 6]. However, the detection of capacity or internal resistance requires complex experimental conditions and instrumentation [7], and realizing online collection is difficult. Therefore, the extraction of HIs, which can be easily calculated and can quantify the degradation state of a battery, has great practical value. Khaleghi et al. [8] set the upper and lower voltage boundaries and specified a fixed sampling rate to extract HIs from partial charging voltage profiles and used them to estimate the health state of a battery. In addition, current variations during short discharge intervals [9] and time differences between fixed voltage intervals [10] can also be used as features to reflect the health state of Li-ion batteries. Although these methods show excellent performance in extracting HIs, they require the setting of parameters such as fixed time or voltage, which not only relies on human experience, but also has the potential to complicate feature extraction [8,9,10]. Liu et al. [11] set different parameters to extract the same HIs and perform RUL prediction on battery datasets from the National Aeronautics and Space Administration (NASA) and CALCE, respectively, and the results indicated that the requirements of parameters for feature extraction could be different for different Li-ion batteries. Pang et al. [12] extracted the peak values of more sensitive IC curves and the under-peak area as the HIs and predicted RUL on the lithium-ion battery dataset provided by NASA. Pan et al. [13] extracted IC curves with two recognizable peaks on the battery dataset provided by CALCE. This feature extraction method was clearly not applicable to different Li-ion battery datasets. Although the feature extraction methods proposed in references [11,12,13] can be used to extract HIs, they may not be applicable to other lithium-ion battery datasets due to differences in battery parameters or experimental environments. As mentioned above, traditional feature extraction methods usually require parameter setting and are poorly adaptable; therefore, the first objective of this work is to find a feature extraction method that reduces or avoids parameter setting and is widely applicable to different lithium-ion battery datasets.

Based on the above analysis, a method for predicting the RUL interval of lithium-ion batteries is proposed based on adaptive health indicator extraction and support vector quantile regression. The main improvements of our proposed method are as follows:

First, singular value decomposition (SVD) [14] is used to extract the most important singular vector from the original measurement data as a health indicator of lithium-ion batteries. The original measurement data from the various types of lithium-ion batteries are distributed differently and may contain noise. The raw data are directly used as a health indicator of lithium-ion batteries; this process affects the prediction accuracy of the remaining useful life and leads to a lack of generality of the prediction model. Since SVD does not involve parameter setting in the decomposition process, the calculation process is simple, and the extracted features are very efficient. Therefore, in our proposed model, SVD is used to extract features from the original measurement data of lithium-ion batteries, and the extracted maximum singular vector is used as the degradation feature of the batteries.

Next, Spearman correlation analysis was used to screen the HIs extracted by SVD, and the screened HIs were fused by a stacked denoising autoencoder (SDAE) [15] to construct a fusion health indicator with stronger representation ability. A certain degree of redundancy exists between the HI extracted by SVD, and when the HI dimension is too high, the model can be more complex, affecting the accuracy of the RUL prediction. Therefore, Spearman correlation analysis was used to analyze each HI, and the HI that was highly correlated with the battery capacity sequence was retained. The retained HIs were fused with the SDAE to obtain a fusion health indicator with a stronger correlation.

Finally, the support vector quantile regression model (SVQR)[16] was used to perform interval prediction on the RUL of lithium-ion batteries. In the process of actual use, due to the complex working environment and large load fluctuations of lithium-ion batteries, the degradation mode of lithium batteries is not fixed; thus, the results from the RUL point estimation are uncertain. In this study, a support vector quantile regression model is used to predict the RUL interval of lithium batteries, and this model produces more generalizable prediction results. The experimental results show that our proposed SVQR model has high interval estimation accuracy and can be applied to different types of data.

The effectiveness of our proposed model was validated using lithium-ion battery datasets provided by the National Aeronautics and Space Administration research center [17] and the Massachusetts Institute of Technology (MIT) [18]. Our proposed method was compared with four existing classical prediction methods. The experimental results showed that the prediction accuracy of our proposed method was better than that of the existing four classical prediction methods. The remainder of this paper is arranged as follows. Section 1 briefly introduces the basic theory of stack denoising autoencoders. In Sect. 2, an adaptive health indicator extraction method based on SVD, Spearman correlation analysis and the SDAE is introduced. Section 3 introduces the RUL interval prediction method based on support vector quantile regression. Section 4 describes the experiment and analysis, and Sect. 5 provides the conclusion and future prospects.

2 Stack denoising autoencoder

A stacked denoising autoencoder, as the name suggests, is constructed by stacking multiple denoising autoencoders. Based on the autoencoder, the denoising autoencoder forces the network model to learn denoising and restore the input data by randomly shielding some training input data.

The traditional denoising autoencoder consists of a layer of encoder and a layer of decoder. If unsupervised pretraining is performed on each single hidden layer denoising autoencoder unit and then stacked, the output of the previous encoder is used as the input of the next autoencoder; finally, reverse tuning training is performed, and an SDAE with a multilayer structure is obtained. Compared with traditional denoising autoencoders, a stacked denoising autoencoder has a deeper network structure and stronger feature learning ability and generalization ability [19].

The training of the stacked denoising autoencoder involves making the output signal \(Z\) of each denoising autoencoder infinitely close to the input signal \(X\) and defining the error function as shown in Eq. (1):

$$L_{D} = ||{\mathbf{X}} - g(f({\mathbf{X}}_{1} ))||$$
(1)

Unsupervised learning for pretraining is used to determine the parameters \({{\varvec{\uptheta}}} = ({\mathbf{W,b}})\) of each denoising autoencoder. Then, supervised learning training is carried out on the entire stacked denoising autoencoder by the back propagation (BP) algorithm to fine-tune the global parameters \({{\varvec{\uptheta}}} = ({\mathbf{W,b}})\). Finally, the model is ensured to have good discriminant performance and effectively extract multidimensional features of the input signals.

3 Adaptive extraction of the health indicators via SVD and the SDAE

3.1 Experimental data

The RUL of a lithium-ion battery is defined as the number of charge and discharge cycles the battery goes through when its capacity drops to 70–80% of its initial value. Therefore, predicting the RUL of lithium-ion batteries essentially involves predicting the capacity value of the battery after each round of charge and discharge.

The selected battery degradation dataset 1 is from the NASA Research Center [18]. The dataset contains a collection of degradation state data for four 18,650-type Li-ion batteries in the same experimental environment. The nominal capacity of the four batteries is 2.2 Ah, and the nominal voltage is 3.7 V. In the charging stage, the batteries are charged in constant current (CC) mode at 1.5 A until the battery voltage reaches 4.2 V and then continue to be charged in constant voltage (CV) mode until the charging current drops to 20 mA. In the discharging stage, the battery is discharged with 2A CC until the voltages of the B0005, B0006, B0007 and B0018 batteries dropped to 2.7 V, 2.5 V, 2.2 V and 2.5 V, respectively. Figure 1 shows the actual decay curve of the battery capacity.

Fig. 1
figure 1

Capacity degradation curves of four batteries

3.2 HI extraction based on SVD

In this section, the singular value decomposition [20] method is initially introduced and employed for feature extraction. SVD is a matrix decomposition method that allows complex matrices to be represented as multiplications of smaller and simpler submatrices.

Assuming that matrix \(A\) is a matrix of size \(p\times q\), the SVD of matrix \(A\) is defined as shown in Eq. (2):

$$A=U\sum {V}^{T}$$
(2)

The matrix \(U\) has a rank r. \(U\) is an orthogonal matrix of size \(p\times q\). The column vectors of \(U\) are called left singular vectors. \(\sum\) is a diagonal matrix of size \(p\times q\) with all elements being 0 except those on the main diagonal. Each element on the main diagonal of \(\sum\) is a singular value of \(A\), which is denoted as \({\varvec{\sigma}}i\) (i = 1,…,r). \(V\) is an orthogonal matrix of size \(q\times q\). The column vectors of \(V\) are right singular vectors. Multiplying \(A\) by its transposed matrix \({A}^{T}\) yields Eq. (3), and similarly, Eq. (4) is associated with \(A{A}^{T}\),

$$A{A}^{T}=V\sum {U}^{T}U\sum {V}^{T}=V{\sum }^{2}{V}^{T}$$
(3)
$$A{A}^{T}=U\sum V{V}^{T}\sum {U}^{T}=U{\sum }^{2}{U}^{T}$$
(4)

The eigenvectors of \(A{A}^{T}\) and \(A{A}^{T}\) form the matrices \(V\) and \(U\) in the SVD, respectively. Because the eigenvalues of \(A{A}^{T}\) and \(A{A}^{T}\) are \({\lambda }_{1}\),…,\({\lambda }_{r}\), as shown in Eq. (5), the singular values of matrix \(A\) are denoted as \({\sigma }_{i}=\sqrt{{\lambda }_{i}}\) (i = 1,…,r), as follows:

$$A{A}^{T}={A}^{T}A=\left[\begin{array}{ccc}{\lambda }_{1}& \cdots & 0\\ \vdots & \ddots & \vdots \\ 0& \cdots & {\lambda }_{r}\end{array}\right]$$
(5)

The matrix singular value is closely related to the vital information hidden in the matrix, where the significance of the information is positively correlated with the singular value. SVD does not involve parameter setting in the matrix decomposition process, which allows simple calculations and simplifies the feature extraction process such that it can avoid dependence on human experience. Therefore, SVD is selected to extract HIs that characterize the degradation state of the battery.

Normally, the parameters, including the voltage, current and temperature, of Li-ion batteries are measurable; thus, the respective singular values are first extracted as 12 HIs from 12 sets of measurable data by SVD (see Table 1 for a detailed description). Moreover, since the degradation information contained in the extracted objects has an important impact on the accuracy of the extracted HIs in quantifying the degradation state of the battery, three additional sets of feature extraction objects are calculated with the matrices \(dQ/dV\), \(dV/dQ\), and \(dT/dV\). A lithium-ion battery is degraded in a dynamic and nonlinear process with various degradation modes, such as mechanical degradation of electrode materials, loss of lithium inventory, and loss of active materials. All of these degradation modes can further interact with each other. \(dQ/dV\), \(dV/dQ\), and \(dT/dV\) [21, 22] are often used to analyze the above related degradation modes and thus can be used to extract the features reflecting the degradation state of the batteries, from which HI13, HI14, and HI15 are derived, respectively. \(dQ/dV\), \(dV/dQ\) and \(dT/dV\) are calculated as follows:

Table 1 Spearman correlation coefficients of the 15 potential HIs and capacity of batteries
$$\frac{dQ}{dV}=\frac{I\times dt}{dV}=I\times \frac{dt}{dV}$$
(6)
$$\frac{dV}{dQ}=\frac{dV}{I\times dt}=\frac{1}{I}\times \frac{dV}{dt}$$
(7)
$$\frac{dT}{dV}=\frac{dT}{dt}/\frac{dV}{dt}$$
(8)

Here, \(I\), \(V\), \(t\) and \(T\) refer to the current, voltage, time and temperature during discharge, respectively.

For example, in the derivation of HI1 from the discharging voltage of the B0005 battery, the process of using SVD to derive HI from the battery data is as follows: first, the voltage data of all discharging cycles are regarded as matrices and constructed into the set of matrices \({\left\{A\left(i\right)\right\}}_{i=1}^{c}\), where c is the number of total discharging cycles. Second, the voltage data of all cycle periods, i.e., matrix \(A\left(i\right)\), are placed into the SVD one by one. Since each SVD is processed for a single cycle, according to the principle of the matrix for SVD, a singular value \({\sigma }_{i}\), which represents its characteristic information, can be obtained in every discharging cycle. Finally, the singular value \({\sigma }_{i}\), which is extracted from all discharging cycles, can constitute HI1, denoted as \({\left\{{\sigma }_{i}\right\}}_{i=1}^{c}\).

3.3 Feature evaluation and selection by Spearman correlation analysis

Feature integration is a key step in constructing HIs, and quantitative evaluation and feature selection on the potential HIs extracted need to be conducted before integration to obtain those with stronger correlation to the capacity. Considering that the data used in this work do not conform to a normal distribution, Spearman's correlation coefficient is selected to measure the correlation of HIs. The correlation analysis between the fifteen potential HIs and capacity is shown in Table 1. Based on the absolute values of the correlation coefficients in Table 1, the HIs with very strong correlations (1 to 0.9) are selected for the subsequent feature integration. The nine His of HI2, HI3, HI4, HI5, HI6, HI10, HI11, HI13 and HI15 are selected for integration with B0005, B0006, B0007 and B0018.

3.4 HI integration based on the SDAE

SDAE is an unsupervised neural network that can be used for dimensionality reduction and feature extraction. SDAE is a stack of multiple denoising autoencoders and overcomes the disadvantage that a single layer of encoders cannot efficiently extract complex features. The highly correlated HIs selected in Sect. 2.3 are processed using SDAE to reduce dimensionality and noise. The specific structure of the SDAE is shown in Fig. 2. The two SDAEs are stacked into a stacked autoencoder for two-order parameter fusion, and the structure is shown in Fig. 3. In the constructed SDAE, the number of nodes in the output layer is set to 1 such that the fusion result is a series, and the output result is the second-order fusion HI.

Fig. 2
figure 2

Second-order feature fusion based on SDAE

Fig. 3
figure 3

Flowchart of the RUL prediction for Li-ion batteries based on SVD-SDAE-SVQR

In the Fig. 2, \({\varvec{X}}\) and \(\widehat{{\varvec{X}}}\) are the inputs and outputs of the network, respectively; \({\mathbf{W}}_{1} ,{\mathbf{W}}_{2}\) and \({\mathbf{W}}_{3}\) are the connection weights; and \({\mathbf{b}}_{1} ,{\mathbf{b}}_{2}\) and \({\mathbf{b}}_{3}\) are the bias values. The hidden layer node output \({\mathbf{h}}_{1} ,{\mathbf{h}}_{2}\) and the output layer node \({\hat{\mathbf{X}}}\) are represented by the following equations [15]:

$${\mathbf{h}}_{1} = f_{1} ({\mathbf{W}}_{1} {\mathbf{X}} + {\mathbf{b}}_{1} )$$
(9)
$${\mathbf{h}}_{2} = f_{2} ({\mathbf{W}}_{2} {\mathbf{h}}_{1} + {\mathbf{b}}_{2} )$$
(10)
$${\hat{\mathbf{X}}} = f_{3} ({\mathbf{W}}_{3} {\mathbf{h}}_{2} + {\mathbf{b}}_{3} )$$
(11)

where \(f_{i} ( \cdot ),i = 1,2,3\) represents the node excitation functions and SDAE mainly maps the n-dimensional input sample \({\mathbf{X}}\) to 1-dimensional \({\hat{\mathbf{X}}}\) through encoding excitation functions.

To further analyze the accuracy of the integrated HI in quantifying the degradation state of the battery, quantitative and qualitative analyses are performed on it. The results of the quantitative analysis are given in Table 2, and to test the significance of the results, a test is conducted using the p value method. The table shows that the Spearman correlation coefficients of the integrated HI for all four batteries are greater than 0.99, and the p value is also much less than 0.01, indicating that a significant correlation exists between the integrated HI and capacity. The results of the qualitative analysis are shown in Fig. 4. The degradation trends from the four batteries' integrated HI curves and the original capacity curves are basically the same. Taken together, the results from the quantitative and qualitative analyses indicate that the integrated HI can effectively represent the degradation state of the batteries.

Table 2 Spearman correlation analysis of the fused HI and capacity of four batteries
Fig. 4
figure 4

Fusion HI qualitative analysis: a B0005; b B0006; c B0007; d B0018

4 RUL interval prediction based on support vector quantile regression

4.1 Quantile regression

Quantile regression (QR) [22] extends the traditional mean regression to quantile regression, can be used to describe the global characteristics of response variables in more detail and is not affected by outliers; moreover, the results are more robust. Importantly, QR does not require regression residuals to satisfy the basic assumptions of a normal distribution, random independence, and uniform mean regression methods. This approach is highly valuable for practical engineering problems such as RUL estimation of lithium batteries because the actual operation of lithium batteries is affected by variable working conditions, operating environments and other factors; thus, the monitoring data has difficulty meeting the normal distribution hypothesis. Given training set \(T = (x_{i} ,y_{i} ),i = 1,2, \cdots ,n\), the QR model can be constructed according to the following process:

$$q_{\tau } ({\mathbf{x}}_{i} ) = {\mathbf{x^{\prime}}}_{i} {{\varvec{\upbeta}}}(\tau ),\;\;i = 1,2, \cdots ,n;\tau = \tau_{1} ,\tau_{2} , \cdots ,\tau_{M}$$
(12)

where \({\mathbf{x}}_{i} = [x_{i1} ,x_{i2} , \cdots ,x_{id} ]\) is an independent variable with \(d\) components, \(x_{ij}\) is the \(j\) th component of \({\mathbf{x}}_{{\mathbf{i}}}\), \({\mathbf{y}}_{{\mathbf{i}}}\) is the dependent variable corresponding to \({\mathbf{x}}_{{\mathbf{i}}}\), and \(q_{\tau } ({\mathbf{x}}_{{\mathbf{i}}} )\) is the \(\tau\)-quantile of the dependent variable \({\mathbf{y}}_{{\mathbf{i}}}\) under the independent variable \({\mathbf{x}}_{{\mathbf{i}}}\). For any quantile \(0 < \tau < 1\), \({{\varvec{\upbeta}}}(\tau ) = [\beta_{1} (\tau ), \cdots ,\beta_{d} (\tau )]^{\prime}\) is the regression coefficient under the quantile \(\tau\). The estimate of \({{\varvec{\upbeta}}}\) is obtained by minimizing the test function \(\rho_{\tau } (\sigma )\), as follows:

$${\hat{\mathbf{\beta }}}(\tau ) = \arg \min \sum\limits_{i = 1}^{n} {\rho (\tau )(y_{i} - x^{\prime}_{i} {{\varvec{\upbeta}}}(\tau ))}$$
(13)

where \(n\) is the number of samples and the test function \(\rho_{\tau } (\sigma )\) is a piecewise linear function. The definition of \(\rho_{\tau } (\sigma )\) is as follows:

$$\rho_{\tau } (\sigma ) = \sigma (\tau - I(\sigma ))$$
(14)

where \(I(\sigma )\) is an indicator function and \(I(\sigma )\) is defined as follows:

$$I(\sigma ) = \left\{ {\begin{array}{*{20}c} {0,\sigma \ge 0} \\ {1,\sigma < 0} \\ \end{array} } \right.$$
(15)

4.2 Support vector quantile regression

Although the QR method can describe the global characteristics of response variables in more detail without the need for the described object to follow a specific distribution and based on the definition of QR, QR is essentially a regression analysis for linear problems. In practical engineering problems, the relationship between the independent variable and dependent variable is often nonlinear, and determining the explicit function form is difficult.

SVR [23, 24] can implicitly map low-dimensional nonlinear data to linear regression problems in high-dimensional feature spaces through kernel function techniques. Based on the principle of structural risk minimization, SVR has unique advantages in solving small sample sets and nonlinear problems; thus, it is desirable to combine SVR and QR to fully exploit their advantages.

$$\min \;\frac{1}{2}|{\mathbf{w}}|^{2} + \lambda \sum\limits_{i = 1}^{n} {{{\varvec{\uprho}}}_{\tau } ({\mathbf{y}}_{i} - {\mathbf{w}}^{T} \phi ({\mathbf{x}}_{i} ) - {\mathbf{b}})}$$
(16)

where \(\lambda\) is the penalty parameter, \(n\) is the number of samples, and \(\phi ({\mathbf{x}})\) is a nonlinear mapping function that can be solved by implicitly mapping the \(d\)-dimensional input vector \({\mathbf{x}}\) to a high-dimensional feature space. Similar to the process of solving for SVR, by introducing relaxation variables \(\xi_{i} ,\xi_{i}^{*}\), Eq. (19) can be converted to solve the following optimization problem:

$$\mathop {\min }\limits_{{{\mathbf{w}},{\mathbf{b}},\xi ,\xi^{*} }} \;\frac{1}{2}|{\mathbf{w}}|^{2} + \lambda \sum\limits_{i = 1}^{n} {(\tau \xi_{i} + (1 - \tau )\xi_{i}^{*} )}$$
(17)
$$s.t\;\left\{ {\begin{array}{*{20}c} {y_{i} - {\mathbf{w}}^{T} \phi ({\mathbf{x}}_{i} ) - b \le \xi_{i} } \\ {{\mathbf{w}}^{T} \phi ({\mathbf{x}}_{i} ){ + }b - y_{i} \le \xi_{i}^{*} } \\ {\xi_{i}^{{}} ,\xi_{i}^{*} \ge 0} \\ \end{array} } \right.$$
(18)

The semiparametric method [25] can be adopted to solve the optimization problem of Eq. (18) by constructing the Lagrange function, as shown in Eq. (19):

$$\;\left\{ {\begin{array}{*{20}c} {{\mathbf{w}}_{\tau } { = }\sum\limits_{{\tau { = }1}}^{T} {(a_{t} - a_{t}^{*} )\phi ({\mathbf{x}}_{t} )} } \\ {(b_{\tau } ,{{\varvec{\upbeta}}}_{\tau } )^{T} = ({\mathbf{U}}^{T} {\mathbf{U}})^{ - 1} {\mathbf{U}}^{T} ({\mathbf{y}} - {\mathbf{K}}({\mathbf{a}} - {\mathbf{a}}^{*} ))} \\ {Q_{yt} (\tau |{\mathbf{u}}_{t} ,{\mathbf{x}}_{t} ;{{\varvec{\uptheta}}}) = b_{\tau } + {{\varvec{\upbeta}}}_{\tau }^{T} {\mathbf{u}}_{t} + {\mathbf{K}}_{t} ({\mathbf{a}} - {\mathbf{a}}^{*} )} \\ \end{array} } \right.$$
(19)

where \(\;a_{t} ,a_{t}^{*}\) is the optimal Lagrange multiplier obtained by optimization, \({\mathbf{U}} = (1,u_{t}^{T} )\) is a design matrix, \({\mathbf{y}}\) is a vector composed of support vectors \(y_{t}\), \(t \in I_{SV}\), \(I_{SV}\) is the set of subscripts of support vectors, \(I_{SV} = \{ t = 1,2, \cdots ,T|0 < a_{t} < \tau \lambda ,0 < a_{t}^{*} < (1 - \tau )\lambda \}\), \(K({\mathbf{x}}_{s} ,{\mathbf{x}}_{t} ) = \phi ({\mathbf{x}}_{s} )^{T} \phi ({\mathbf{x}}_{t} )\) is the kernel function, \({\mathbf{K}}\) is the kernel matrix, and \(K_{t}\) is the \(t\)-th row of \({\mathbf{K}}\). The kernel function and penalty parameter \(\lambda\) have a great impact on the performance of SVQR. Many studies have shown that the Gaussian kernel function effectively performs in most cases [26]; therefore, the Gaussian kernel function is adopted, and its form is as follows:

$$K({\mathbf{x}}_{s} ,{\mathbf{x}}_{t} ) = \exp \left( { - \frac{{|{\mathbf{x}}_{s} - {\mathbf{x}}_{t} |^{2} }}{{2\sigma_{K}^{2} }}} \right)$$
(20)

\(\theta = (\lambda ,\sigma_{K}^{2} )^{T}\) is the hyperparameter that needs to be determined before training. In this study, the generalized approximate cross-validation (GACV) criterion [24] of Eq. (21) is adopted to obtain the optimal hyperparameter combination:

$${\text{GACV}}({{\varvec{\uptheta}}}) = \frac{{\sum\limits_{t = 1}^{n} {\rho_{\tau } (y_{t} - Q_{{y_{t} }} (\tau |{\mathbf{u}}_{t} ,{\mathbf{x}}_{t} ;\theta ))} }}{{n - trace({\mathbf{H}})}}$$
(21)

where θ is the hyperparameter set and \({\mathbf{H}}\) can be obtained by

$$\left( {Q_{{y_{1} }} (\tau |u_{1} ,x_{1} ), \cdots ,Q_{{y_{n} }} (\tau |u_{n} ,x_{n} )} \right)^{T} = {\mathbf{Hy}}$$
(22)

4.3 Modeling process based on the SVD-SDAE-SVQR

In this section, a new SVD-SDAE-SVQR lithium-ion battery RUL prediction method is proposed by combining SVD, SDAE and SVQR. The specific process, as shown in Fig. 5, can be divided into four steps.

Fig. 5
figure 5

Capacity degradation curves of CH2 and CH5

Step 1: SVD feature extraction. First, SVD is performed on the measurement data of the battery, and their singular values are taken as 12 potential HIs. Second, the dQ/dV, dV/dQ and dT/dV matrices are calculated based on parameters such as electricity, time, voltage and temperature, from which HI13, HI14 and HI15 are extracted using SVD.

Step 2: Feature selection and feature fusion. Spearman correlation analysis is used to select highly correlated HIs from the 15 potential HIs in Step 1 for feature fusion. The selected HIs are fused into an HI by SDAE processing.

Step 3: SVQR capacity prediction. With the fusion feature \({\left\{{\text{HI}}\left(i\right)\right\}}_{i=1}^{c}\) as the input and the battery capacity \({\left\{{\text{C}}\left(i\right)\right\}}_{i=1 }^{c}\) as the output, a GPR-based capacity prediction model is constructed, where c is the total cycle period. The training set is denoted as {x, y}, where x = [HI(1 ∶ k)Τ] and y = [C(1 ∶ k)| and the number of cycles corresponding to the prediction starting point is denoted as k + 1. The test set is expressed as {x*, y*}, where x* = [HI(k + 1 ∶ c)Τ] and y* = [C(k + 1 ∶ c)Τ]. The prediction part uses x* as input, and the predicted capacity value y* is obtained.

Step 4: Calculation of the RUL prediction value. The remaining life \(\left({N}_{RUL}\right)\) is the difference between the total number of charge and discharge cycles \(\left({N}_{EOL}\right)\) when the actual battery capacity drops to the threshold and the current number of charge and discharge cycles \(\left({N}_{ECL}\right)\). The calculation formula is shown in Eq. (23):

$${N}_{RUL}={N}_{EOL}-{N}_{ECL}$$
(23)

4.4 Evaluation indices of prediction performance

Three evaluation indices, the root mean square error (RMSE), coefficient of determination (R2) and RUL absolute error (AE), were used to evaluate the prediction performance of our proposed method. The calculation formulas for the RMSE, R2 and AE are calculated as follows:

$$RMSE = \sqrt {\frac{1}{N}\sum\limits_{i = 1}^{N} {(y_{i} - \hat{y}_{i} )^{2} ,\,\,} } R^{2} = \frac{{\sum\limits_{i = 1}^{N} {(\hat{y}_{i} - \overline{y})^{2} } }}{{\sum\limits_{i = 1}^{N} {(y_{i} - \overline{y})^{2} } }},\,\,AE = |RUL^{T} - RUL^{P} |$$
(24)

where \(N\) is the number of cycles of the battery; \(y_{i}\) and \(\hat{y}_{i}\) represent the actual capacity and predicted capacity of the lithium-ion battery, respectively, during the \(i\) th cycle; and \(\overline{y}\) is the mean of the actual capacity. \(RUL^{T}\) and \(RUL^{P}\) represent the actual RUL and predicted RuL, respectively, when the battery capacity reaches the failure threshold.

5 Experimental results and analysis

5.1 Li-ion battery RUL prediction results

To analyze the RUL prediction performance of the model, experiments with different prediction starting points are designed; the starting points of prediction are 60 and 80. Assuming that the capacity data length of the lithium-ion battery is \(N\), in the experiment with a starting point of 60, the first 59 capacity data points are selected as the training set, and the remaining capacity data are selected as the testing set. Specifically, the training set is \([Q(1), \cdots ,Q(59)]\), and the testing set is \([Q(60), \cdots ,Q(N)]\). Similarly, in the experiment with a starting point of 80, the first 79 capacity data points (\([Q(1), \cdots ,Q(79)]\)) are selected as the training set, and the remaining capacity data point (\([Q(80), \cdots ,Q(N)]\)) are selected as the testing set. The Gaussian function is used as the kernel function of support vector regression, and the optimal hyperparameter combination for support vector regression is determined by tenfold cross-validation.

In the experiments, the capacity fault threshold of batteries B0005, B0006, and B0018 is 1.38 Ah. Because the B0007 battery does not reach the fault threshold, no fault threshold line is provided.

Figure 6 shows the prediction results and their uncertainty expression for the four batteries at prediction starting points 60 and 80. Overall, the predicted curve is close to the real capacity degradation curve, the prediction result is less affected by the prediction starting point, and the prediction curves of the four batteries at prediction starting point 80 are closer to the real capacity degradation curve than those at prediction starting point 60.

Fig. 6
figure 6

Prediction results for the four batteries at different prediction starting points

In addition, Table 3 shows the evaluation results from the prediction performance of the four batteries at different prediction starting points. As shown in the table, the prediction error at prediction starting point 80 is lower than that at prediction starting point 60, and the RMSE of the four batteries is less than 0.0301; thus, these results indicate that the prediction performance of our proposed model is good. The R2 values of the four batteries are all greater than 0.93, indicating a high degree of fitting between the predicted curve and the actual curve. The AE values of batteries B0005, B0006 and B0018 gradually decrease as the prediction starting point is reduced. When the prediction starting point is 60, the AE values of the three batteries are all less than 6, and the error values are low. In addition, as the prediction starting point is set to 80, the AE value gradually decreases and is less than 2.

Table 3 Prediction performance of four batteries at different prediction starting points

5.2 Adaptability of SVD to different battery datasets

To verify the adaptability of the SVD feature extraction method to different lithium-ion battery datasets, the battery datasets provided by MIT [27] are selected for the experiments in this section. The battery parameters and experimental environment are different from those of the battery datasets provided by NASA. The battery datasets used for the test are first described in detail. Then, the fusion HI are still extracted by combining SVD and the SDAE. Finally, the SVQR is used for capacity prediction.

5.2.1 MIT datasets

The test battery is a lithium iron phosphate (LFP)/graphite battery manufactured by an A123 system (APR18650 M1A), and it is placed in a forced convection temperature chamber at 30 ℃. During the charging stage, 72 charging modes are formed by a constant current fast charging combination with different rates, and the discharge process adopts a unified 4 C rate. The battery has a nominal capacity of 1.1 Ah and a nominal voltage of 3.3 V. Two sets of data, Channel 2 (CH2) and Channel 5 (CH5), in the second batch of data labelled as “2017-06-30,” are selected for the experiment. The original capacity degradation trajectories of CH2 and CH5 are shown in Fig. 5.

5.2.2 Feature extraction and fusion

For CH2 and CH5, the same SVD is used to extract their respective singular values from the measurement data, dQ/dV, dV/dQ, and dT/dV, respectively, as HIs. dQ/dV is given in the dataset, and dV/dQ and dT/dV need to be calculated. When calculating dT/dV, because the same voltage value collected at some adjacent time points in the same cycle causes the dT/dV value to be invalid, the repeated voltage data and the temperature data corresponding to the corresponding measurement time are removed. Spearman correlation analysis is performed on the extracted potential HIs, and the analysis results are shown in Table 4. Except for HI8, the correlation coefficients between the remaining 7 potential HIs and capacity are all greater than 0.9 in absolute value, showing strong correlations. Therefore, according to the correlation coefficient, 7 HIs, excluding HI8, are selected and fused into one HI using the SDAE. To analyze the ability of the fusion HI to characterize the battery degradation state, the Spearman correlation coefficient and p value are calculated, and the specific results are provided in Table 5. As shown in Table 5, the Spearman correlation coefficients between the fused HI and tumor volume are greater than 0.99, and the p value is close to 0, indicating that the fused HI is significantly correlated with volume.

Table 4 Spearman correlation analysis between capacity and potential HIs of CH2 and CH5
Table 5 Spearman correlation analysis of the capacity and the fused HI

5.2.3 Capacity prediction

SVQR is used to model the prediction portion. By fusing the HI data as training data and capacity data as output data, a capacity prediction model based on the SVQR is constructed. Based on the data size of CH2 and CH5, the data from the first 500 and first 100 cycles for training were selected, respectively. The prediction results are shown in Fig. 7; the prediction curve approaches the real capacity degradation curve relatively accurately. The evaluation results of the prediction performance are provided in Table 6. As shown in the table, the RMSEs of CH2 and CH5 are both less than 0.008, and R2 is greater than 0.94; thus, our proposed model has good prediction performance and the predicted curve is in good agreement with the actual curve.

Fig. 7
figure 7

Capacity prediction results for CH2 and CH5 a CH2 b CH5

Table 6 Prediction performances of CH2 and CH5

In summary, through battery dataset experiments provided by NASA and MIT, SVD can be used to extract HIs and has good adaptability to different battery datasets while avoiding the artificial parameter setting. In addition, the fused HI has a strong correlation and better prediction performance and accuracy.

5.3 Comparative test analysis of different HIs

To more fully illustrate the effectiveness and superiority of using the SVD method to extract potential HIs from measurement data, dQ/dV, dV/dQ and dT/dV are used; highly correlated HIs are selected for feature fusion by using the SDAE and combined with the SVQR prediction model (SVD-SDAE-SVQR); and two comparison models, SVD-PCA-SVQR and SHI-SVQR, are designed. In the SVD-PCA-SVQR method, HIs with strong correlations are selected from the 15 potential HIs by Spearman correlation analysis, the selected HIs are fused into an HI indicator using PCA, and the fused HI indicator and SVQR model are used for RUL prediction. In the SHI-SVQR method, the discharge time of an equal voltage drop from 3.8 V to 3.5 V during the discharge phase is directly used as the HI, and the single HI and SVQR models (SHI-SVQR) are used for RUL prediction.

The prediction performances of the three models for the B0005, B0006, B0007, and B0018 batteries are shown in Table 7. For the SVD-SDAE-SVQR model, the RMSEs of batteries B0005, B0006, B0007 and B0018 are 0.0055, 0.0284, 0.0082 and 0.0068, respectively. The RMSEs of the four batteries are lower than those of the SVD-PCA-SVQR model and SHI-SVQR. The RMSE of the SVD-PCA-SVQR model is also lower than that of the SHI-SVQR model. For the SVD-SDAE-SVQR model, the AE values of the three batteries except B0007 are 1, 2 and 1, which are also lower than the AE values of SVD-PCA-SVQR and SHI-SVQR. Moreover, SVD-PCA-SVQR has a better prediction effect than the SHI-SVQR model. From the RMSE and AE results, the SVD-SDAE-SVQR model has better prediction accuracy than those of the SVD-PCA-SVQR and SHI-SVQR models, and the prediction performance of the HIs extracted in this study after PCA fusion is also better than that of a single HI.

Table 7 Prediction performance of the three methods

The R2 values of the four batteries of the SVD-SDAE-SVQR model are 0.9961, 0.9409, 0.9903 and 0.9586, which are significantly greater than the R2 values of the SVD-PCA-SVQR and HI-SVQR models. These findings fully indicate that the prediction results of the SVD-SDAE-SVQR method can better fit the original degradation curve than those of the SVD-PCA-SVQR and SHI-SVQR models. In addition, except for B0018, the R2 values of the other three batteries of the SVD-PCA-SVQR model are also significantly greater than those of the SHI-SVQR model, indicating that the prediction results of HI after PCA fusion better fit the original curve than those of the SHI-SVQR model. The comparative experimental results show that the same HIs after SDAE fusion have better HI prediction performance than those after PCA fusion; thus, the SDAE is better than PCA in feature fusion. In addition, the prediction performance of the potential HIs extracted in this study after fusion is better than that of other HIs, indicating the effectiveness and superiority of HIs extracted by our proposed model.

5.4 Comparison with existing methods

To further validate the performance of our method, we compared it to four existing prediction methods on NASA datasets. The GPR model [28], GCO-LSTM model [29], ARRVM model [30], and ADE-MESN model [31] were selected and compared with our proposed prediction model. The 80th cycle was taken as the starting point of prediction, and RMSE, MAE and AE were selected as evaluation indices. The experimental results on the MIT dataset (CH2, CH5, CH9 and CH10) are shown in Table 8.

Table 8 Prediction results from the different models

As shown in Table 8, compared with the GPR, GGO-LSTM, AR-RVM and ADE-MESN models, our proposed model has a lower RMSE and higher R2 values. Compared with those of the other four methods, the RMSEs of our proposed method are reduced by 56.72%, 61.17%, 17.57% and 30.94%, and the R2 are increased by 1.5012%, 1.9893%, 0.7806% and 0.8617%, respectively. Compared with the other four methods, our proposed prediction method has higher prediction accuracy.

6 Conclusion

In this paper, a RUL prediction method for lithium-ion batteries based on SVD-SDAE-SVQR is proposed. The main contributions are as follows:

First of all, in order to solve the shortcomings of traditional feature extraction methods, which depend on parameter setting and have poor adaptability to different lithium-ion battery data sets, SVD is used for feature extraction to solve the above problems.

Then, 15 potential HIs are extracted from the raw measurement data and calculated feature extraction objects containing more degradation information. Spearman correlation analysis and SDAE are used to deal with potential HIs to obtain a fused HI, considering that redundancy and insufficiency of HIs can affect RUL prediction.

Finally, the fusion HI and SVQR were combined to predict RUL, and the uncertainty expression of the result was given. In the validation phase, experimental results on battery datasets provided by NASA and MIT demonstrate the effectiveness and adaptability of this feature extraction strategy. The superiority of the proposed HI was verified by comparing the HI after PCA fusion with other HI. In summary, the SVD feature extraction method has good adaptability, and the fusion HI obtained by Spearman correlation analysis and SDAE processing has good prediction performance and the proposed RUL prediction framework has the advantages of high accuracy and output probability.

The proposed method also has some limitations. For example, laboratory data of lithium-ion batteries may vary according to different experimental conditions and measuring instruments. Will these differences affect the feature extraction effect of SVD-SDAE? Can the proposed method be directly applied to different types battery data without changing the model parameters? Therefore, the proposed method needs to be further explored in combination with transfer learning, so that it can be more widely applied to the battery measurement data under relatively complex working conditions.