1 Introduction

Underwater discharge explosions are widely utilized in industrial applications, such as underwater acoustic sources [1], water sterilization [2], marine exploration [3], and lithotripsy [4]. The time–frequency characteristics of shockwaves induced by the pulsed discharge in liquids are of the most concern. Accurate analysis of the signal characteristics is helpful in improving utilization efficiency.

The mechanism of underwater discharge is as follows [5]: When the underwater discharge starts, due to the enormous energy instantly released, the wire or the water between the discharge electrodes will quickly be heated, vaporize, plasma, and cause an explosion; the impact pressure can be up to 1000–10,000 atm. As the explosion’s duration is short, this high pressure can compress the water medium around, creating a sudden change of pressure and density, thus creating a mighty shock wave and the outward propagation of supersonic speed. Then the plasma cools down and oscillates as a bubble several times. The bubble pulsations lag behind shock waves by milliseconds and overlap in the time domain.

The underwater discharge acoustic signal has a short time duration, high pressure, and wide frequency band. So the traditional Fourier transform method is not suitable. HHT (Hilbert–Huang Transform) method can be used to analyze such transient signals. Still, modal aliasing may occur [6] so that the signal components’ reconstruction and analysis at a specific frequency band are intractable. The wavelet analysis method is also suitable for studying this kind of signal and is convenient for local analysis [7, 8]. Wavelet transforms (WTs) use wavelets instead of sinusoids to convert a signal from the time to the time–frequency domain, revealing the time-support of frequencies, and intelligently exploring different frequency bands based on adaptive window lengths and positions [9]. The main types of wavelet transform are the Continuous Wavelet Transform (CWT), the Discrete Wavelet Transform (DWT), the Discrete-Time Wavelet Transform (DTWT), and the Stationary Discrete-Time Wavelet Transform (SDTWT). Each of them has a different way of working and is suitable for specific situations [10]. Chebyshev wavelet bases use orthogonal bases to transform the studied problems into linear or nonlinear Algebraic equation to simplify the calculation process and have been widely used to deal with various boundary conditions and solve integrodifferential equation equations. [11]. Guido et al. [12] introduced the Discrete Path Transform (DPT) to improve the regular Energy based workload analysis and the traditional zero-crossing rate-based spectral description. Yang et al. [13] proposed a novel and effective method called wavelet transform-based smooth ordering (WTSO) for HSI classification, which consists of three main components: wavelet transform for feature extraction, spectral–spatial based similarity measurement, smooth ordering-based 1D embedding, and construction of final classifier using interpolation scheme. Zheng et al. [14] established an adaptive multiscale graph wavelet decomposition (AMGWD) framework by taking the downsampling unbalance and also the high-pass componentsinto account. Guariglia et al. [15] provided the Weierstrass-Mandelbrot function(WMF) on the Cantor set, the wavelet spectrum and scalogram of the WMF could open new and interesting prospects. Guariglia and Silvestrov [16] described a wavelet expansion theory for positive definite distributions over the real line and defined a fractional derivative operator for complex functions in the distribution sense, which had already given some results in analytic number theory and could be able to describe different physical phenomena.

However, the main problem of this method is to choose the appropriate wavelet basis and the appropriate decomposition level when de-noising. Wavelet selection has a remarkable effect on de-noising performance and reconstruction accuracy. A specific wavelet filter has a specific frequency and phase response to the signal [17, 18]. The selection of the appropriate wavelet basis function is the premise of wavelet analysis. Using different wavelets to process the same signal will get different results, sometimes even very different. The required wavelet should be selected according to the specific situation to obtain accurate results. The improper wavelet will lead to serious deviation of signal analysis results from the real situation.

There have been some research results on this issue. Ma et al. [19] proposed a method for optimal wavelet selection based on prior knowledge of the partial discharge (PD) pulse shape: Correlation Based Wavelet Selection( CBWS). Suppose the correlation between the wavelet and the target signal is the largest among all the selectable wavelets and the target signal. In that case, the base wavelet is selected as the optimal signal de-noising. Li et al. [20] presented an automatic and scale-dependent base wavelet selection scheme for PD signal de-noising: Energy Based Wavelet Selection(EBWS). The energy criterion of the scale-dependent wavelet selection scheme is proposed based on the energy percentage of PD signal and white noise decomposed at different scales. Liu et al. [21]proposed a Wavelet Entropy-Based Wavelet Selection Scheme (WEBWSS) to provide an alternative to CBWS and EBWS for partial discharge signal de-noising. Garg [22] introduced an automatic wavelet function selection algorithm: Mother Wavelet Selection (MWS) algorithm. The problem is formulated as an optimization criterion, and variance and genetic algorithm analysis are used to determine the optimal wavelets. Omari et al. [23] presented an automatic wavelet selection scheme, which was applied in PCG de-noising operation. The proposed method is based on multiplying the detail coefficients by the exponential approximation coefficients, or EXP for short, searching for the mother wavelet that provides the minimum value at each level and then selecting the wavelet and decomposition level regarding the highest EXP value. Peng et al. [24] analyzed the frequency characteristics of wavelets by mathematical modeling and selected the optimal wavelet base function hierarchically according to the amplitude-frequency characteristics of ECG signal. Du et al. [25] proposed the concept of information loss assessing coefficient on the foundation of some contents of wavelet threshold de-noising for the first time. To choose a suitable combination of scales, Wang et al. [26] proposed a rule to combine wavelet scales based on the sensitivity of each scale and selected the appropriate combination of wavelet scales based on sequence combination analysis (SCA). Instead of using correlation-based wavelet base selection for de-noising PD data, Altay et al. [27] introduced a wavelet selection method based on the most informative sub-band energy and entropy for separating noise from PD pulses. Wijaya et al. [28] proposed Information Quality Ratio (IQR) as a new metric for mother wavelet selection in real-world applications. IQR emphasizes that the reconstructed signal must maintain the essential information of the original signal. Wu et al. [29] presented a wavelet choice criterion to choose an appropriate mother wavelet that can make the energy-to-entropy of the wavelet coefficients maximum. The typical evaluation indexes to evaluate the performance of wavelets are the correlation coefficient, root mean square error, signal-to-noise ratio, and smoothness [30, 31]. There are also some effective index fusion methods in statistics [32].

Most of these previous research results are level by level and detailed analysis of a single specific signal, achieving very accurate results. Still, the efficiency is not high enough. A selection scheme that can select a wavelet with acceptable analysis accuracy and certain generality is necessary when a large number of signals need to be processed.

This study focuses on the selection of wavelets for the underwater discharge signal analysis. A scheme of wavelet selection based on typical discharge signal characteristics is given. According to the waveform characteristics of the signal to be analyzed, a waveform is generated by simulation as a reference target signal, and the actual noise is added to form a noisy signal. The reference signal and noise are known and consistent with the actual situation, ensuring the accuracy and credibility of performance evaluation. The optional wavelets are used to denoise the noisy signal, and the performance of each wavelet is evaluated according to the evaluation indexes to select the most suitable wavelet for analyzing this kind of signal. Then underwater discharge signals produced by the electric explosion of different wires were processed by the chosen optimal wavelet. The results show that the scheme is feasible and practical.

2 Method and experimental setup

2.1 Optional wavelets and decomposition levels

The wavelets have attributes such as orthogonality, biorthogonality, compact support, and vanishing moment [7]. Orthogonality and biorthogonality are conducive to accurate signal reconstruction. Compact support can ensure excellent local time–frequency characteristics. Wavelet basis with certain vanishing moment order can effectively highlight the singularity of signal, which is very beneficial to signal de-noising. The attributes of common wavelets are shown in Table 1.

Table 1 The attributes of common wavelets

As mentioned earlier, the underwater discharge signal has the characteristics of short time duration, high pressure, and wide frequency band, which requires that the wavelets used to process such signals have the properties of orthogonality or biorthogonality, compact support, symmetry, and particular vanishing moment. The Discrete-Time Wavelet Transform (DTWT) is used to process the signal. Then the Daubechies, Symlets, Coiflets, Biorthogonal, and Revers Biorthogonal wavelet families are selected as optional wavelets to process the underwater discharge signal. Waveforms of some wavelets are shown in Fig. 1.

Fig. 1
figure 1

Waveforms of some wavelets

There is a maximum decomposition level for a certain length of data and a specific wavelet. So the optional decomposition levels should not be more than the maximum level. The larger the decomposition level, the more obvious the different characteristics of noise and signal, which is more conducive to their separation. On the other hand, the larger the decomposition level, the greater the distortion of the reconstructed signal, which will affect the final de-noising effect to a certain extent.

2.2 The evaluation indexes of wavelet selection

The key to indexes selection is identifying quantitative expressions that can characterize the de-noised signal from different perspectives. Existing metrics for characterizing wavelet de-noised signals include root mean square error, signal to noise ratio, number of correlations, and smoothness [30, 31].

2.2.1 Cross-correlation coefficient (CC)

$$\begin{aligned} \text{CC}=\frac{\sum _{n=1}^{N} (s(n)-{\overline{s}}(n))(s_{r}(n)-{\overline{s}}_{r}(n))}{\sqrt{\sum _{n=1}^{N} (s(n)-{\overline{s}}(n) )^{2}\sum _{n=1}^{N} (s_{r}(n)-{\overline{s}}_{r}(n) )^{2}} } \end{aligned}$$
(1)

where N is the data length, s(n) and \(s(n)_{r}\) are the pure signal and the wavelet-processed and re-constructed signal; \({\overline{s}}(n)\) and \({\overline{s}}_{r}(n)\) are the mean values of s(n) and \(s_{r}(n)\).

2.2.2 Root mean square error (RMSE)

$$\begin{aligned} \text{RMSE}=\sqrt{\frac{1}{N}\sum _{n=1}^{N} \left[ s(n)-s_{r}(n) \right] ^2} \end{aligned}$$
(2)

This index reflects the degree of difference between s and \(s_{r}\). Smaller RMSE values indicate better results.

2.2.3 Signal-to-noise ratio (SNR)

$$\begin{aligned} \nonumber \text{power}_{\text{signal}}&=\frac{1}{N}\sum _{n=1}^{N}[s(n)]^2\\ \nonumber \text{power}_{\text{noise}}&=\frac{1}{N}\sum _{n=1}^{N}[\text{noise}(n)]^2\\ \text{SNR}&=10\text{lg}\left( \frac{\text{power}_{\text{signal}}}{\text{power}_{\text{noise}}}\right) \end{aligned}$$
(3)

where \(\text{noise}(n)\) is the noise. \(\text{power}_{\text{signal}}\) and \(\text{power}_{\text{noise}}\) are the power of pure signal and noise. It is generally accepted that a higher SNR means a better result.

2.2.4 Smoothness(r)

$$\begin{aligned} r=\frac{\sum _{n=1}^{N-1}[s_{r}(n+1)- s_{r}(n)]^2}{\sum _{n=1}^{N-1}[ s (n+1)- s(n)]^2} \end{aligned}$$
(4)

Smaller smoothness values indicate better de-noising results.

Accordingly, it can be inferred that [32]: the CC reflects the fitting information of the reconstructed signal to the pure signal; the RMSE actually reflects the overall deviation information of the signal; the SNR ratio reflects the effect of the noise information on the overall signal; and the r reflects the local variation information of the signal, i.e., whether there are more local mutations. These four indexes are independent of each other. The pursuit of the high performance of a single index will often lead to the decline of the performance of other indicators. For example, only the pursuit of high SNR will often lead to the deterioration of CC and RMSE.

3 The method of index fusion

One typical problem is the fusion of various evaluation indexes. Some previous studies have chosen only one measure, usually RMSE, so there is no index fusion problem. But there is no doubt that multi-index evaluation can obtain a better result. The usual method is to assign a certain weight to each indicator and then perform a weighted fusion. Some literature uses manual subjective assignment of weights, but the reasonableness of the weights determined by this method remains controversial. There are now several methods for objectively determining weights in statistics. Shen et al. [30] used the coefficient of variation method to assign weights and fuse indicators automatically. Tao et al. [32] used the entropy value method to weight the indicators for fusion. Both of the two methods assume that if the variation degree of an index value is greater, the dispersion degree of the index will be more significant, the influence of the index on the comprehensive evaluation will be more powerful, and the weight of the index should be greater; vice versa. This study used the coefficient of variation method [33] to fuse the indexes. Analysis using the entropy method was also carried out later, and the results were the same as the coefficient of variation method, so they are not listed in the text. The main steps of the coefficient of variation method are:

  • Step1 Use the min–max algorithm to normalize the data sets of the four evaluation parameters, as follows:

$$\begin{aligned} \text{CC}_{m}(i)&= \frac{\text{CC}(i)-\text{min}(\text{CC})}{ \text{max}(\text{CC})-\text{min}(\text{CC})} \end{aligned}$$
(5)
$$\begin{aligned} \text{RMSE}_{m}(i)&= \frac{\text{RMSE}(i)-\text{min}(\text{RMSE})}{ \text{max}(\text{RMSE})-\text{min}(\text{RMSE})}\end{aligned}$$
(6)
$$\begin{aligned} \text{SNR}_{m}(i)&= \frac{ \text{SNR} (i)-\text{min}( \text{SNR})}{ \text{max}( \text{SNR})-\text{min}( \text{SNR})}\end{aligned}$$
(7)
$$\begin{aligned} r_{m}(i)&=\frac{r(i)-\text{min}(r)}{\text{max}(r)-\text{min}(r)}\nonumber \\ (i&=1,2,\ldots ,M) \end{aligned}$$
(8)

where M is the length of the index dataset. \(\text{CC}(i)\), \(\text{RMSE}(i)\), \(\text{SNR}(i)\) and r(i) are the cross-correlation coefficient, root mean square error, signal-to-noise ratio and smoothness before normalize. \(\text{CC}_{m}(i)\), \(\text{RMSE}_{m}(i)\), \(\text{SNR}_{m}(i)\) and \(r_{m}(i)\) are the CC, RMSE, SNR and r after normalize. \(\text{min}()\) is the minimum value of the evaluation index data set before normalization, \(\text{max}()\) is the maximum value of the evaluation index data set before normalization.

  • Step2 Calculate the coefficient of variation of each normalized evaluation index data set:

$$\begin{aligned} \text{CV}_{{\text{EI}}_{k}}= \frac{\sigma _{{\text{EI}}_{k}}}{\mu _{{\text{EI}}_{k}}} \qquad (k=1,2,3,4) \end{aligned}$$
(9)

where \(\text{CV}_{{\text{EI}}_{k}}\) is the coefficient of variation of each normalized evaluation index data set, \(\sigma _{{\text{EI}}_{k}}\) and \(\mu _{{\text{EI}}_{k}}\) are the standard deviation and mean value of each normalized evaluation index data set. \(\text{EI}_{1}\), \(\text{EI}_{2}\), \(\text{EI}_{3}\), \(\text{EI}_{4}\) represent \(\text{CC}_{m}\), \(\text{RMSE}_{m}\), \(\text{SNR}_{m}\) and \(r_{m}\), respectively.

  • Step3:Obtain the weight of each normalized evaluation parameter data set.

$$\begin{aligned} W_{{\text{EI}}_{k}}=\frac{ \text{CV}_{{\text{EI}}_{k}}}{\sum _{k=1}^{4} \text{CV}_{{\text{EI}}_{k}}} \end{aligned}$$
(10)

where \(W_{{\text{EI}}_{k}}\) is the weight of each evaluation index data set.

  • Step4 Calculate the comprehensive evaluation index (CEI) with different combinations of optional wavelets and decomposition levels by weight:

$$\begin{aligned} \text{CEI}(i)=\sum _{k=1}^{4} W_{{\text{EI}}_{k}}\times {{\text{EI}}_{k}}(i) \end{aligned}$$
(11)

where \(\text{CEI}(i)\) is the comprehensive evaluation index of the ith combination of optional wavelets and decomposition levels. The combination with the highest CEI shows the best result.

3.1 The scheme of wavelet selection

To select the optimal wavelet accurately, the evaluation index must be accurate and effective. Evaluation indexes are mainly calculated from the pure signal, noise, and processed noisy signal. The accurate acquisition of pure signal and noise is the key to ensure the indexes’ accuracy and credibility. But in practical application, the pure signal is usually undetectable. It is obviously inappropriate to use the measured signal as the noisy signal and the wavelet processed signal as the reference signal.

Fortunately, typical waveform characteristics of many signals are known based on the mechanisms of signal generation. The typical reference waveform of underwater discharge signal is shown in Fig. 2. It consists of shock wave and subsequent bubble pulsations.

Fig. 2
figure 2

Ideal reference waveform of underwater discharge

Whether in the anechoic pool or in the outdoor lake or ocean, the noise of underwater discharge signals can be measured. The noise characteristics may be different in different environments, but they are basically stable in the short time of the underwater discharge. Blanks at the beginning or the end of the signal can be considered pure noise signals (shown in Fig. 3) and superimposed with the reference waveform to generate a simulated noise signal. The pure signal and noise in the simulated signal can be considered known. The instrument measurement error is considered as stable during the very short duration time of underwater discharge. By processing this signal, we can accurately calculate the evaluation index after the processing of each combination of wavelets and decomposition levels. The CEI can be calculated by fusing the indexes through the coefficient of variation method. Then the optimal wavelet can be selected. The scheme is shown in Fig. 4.

Fig. 3
figure 3

Measured underwater discharge signal and noise

Fig. 4
figure 4

Scheme of wavelet selection

3.2 Experiment setup

An experiment is implemented to compare the underwater wire-guided discharge discharge signals with different materials. A pair of 304 stainless steel cylindrical electrodes with 5 mm cross-section diameter and 150 mm length were fixed on a self-made plastic base to discharge. Gold, silver, and copper wires with a diameter of 0.05 mm and a length of 6 mm were fixed between the two electrodes and then placed at a depth of 1 m under the water of the anechoic pool. A high voltage pulse power drove the underwater plasma discharge system with a fixed output voltage of 10kV and an energy storage capacitor of 0.11mf. The explosive acoustic signals were collected by a hydrophone produced by the Hangzhou Institute of Applied Acoustics with a sensitivity of -205dB re 1V/mPa in the range of 5Hz-15MHz. A RIGOL MSO5354 digital storage oscilloscope recorded the collected sound signals. The water conductivity was 0.37mS/cm, and the water temperature was 17.5\(^{\circ }\)C. The experiment setup is shown in Fig. 5.

Fig. 5
figure 5

Setup of the underwater wire-guided discharge experiment

4 Results

4.1 The selection of optimal wavelet

After computation, the EIs and CEI of each combination of wavelets and decomposition levels are obtained; some of them are shown in Table 2. From the CEI, the optimal wavelet is rbio3.7, and the optimal decomposition level for de-noising is 9. The worst performing wavelet is ribo 3.1, which is mainly poor in signal-to-noise ratio and smoothness. The de-noising results are shown in Fig. 6.

Table 2 EIs and CEIs of some combination of wavelets and decomposition levels
Fig. 6
figure 6

The effect of denoising using optimal and poor wavelets

4.2 Wavelet-based processing of underwater wire-guided discharge discharge signals

Using the selected optimal wavelet, the measured underwater discharge signal is de-noised, and then wavelet packet decomposition is used to analyze the energy distribution in each frequency band. It can be seen in Fig. 7 that the noise can be eliminated effectively after processing. Then the de-noised signals were decomposed by rbio 3.7 wavelet-packet at level 4. The reconstructed waveforms of wavelet packet nodes are shown in Fig. 8. The sample rate is \(10^5\)Hz, so the frequency band of each node is shown in Table 3.

Fig. 7
figure 7

The underwater gold wire discharge signal and the de-noised signal

Fig. 8
figure 8

Reconstructing wavelet packe node waveforms of discharge signal

Table 3 The frequency band of each wavelet packet node

It can be concluded from Fig. 8 that the waveforms of the first six nodes have greater amplitude. So the underwater wire-guided discharge signal has more low-frequency components.

4.3 The comparison of underwater wire-guided discharge discharge signals with different materials

The energy–frequency distributions of underwater wire-guided discharge discharge signals are obtained by using rbio 3.7 wavelet packet decomposition at level 5 and calculating the energy proportion of each node in the 5th layer. We can conclude from Fig. 9 that the gold wire explosive signal has more high-frequency components than copper and silver.

Fig. 9
figure 9

Frequency–energy distributions of signals with different materials

5 Conclusions

In this study, a scheme of wavelet selection is provided. And the processing of underwater explosive signals using selected wavelets proved the effectiveness of the wavelet selection scheme. Based on prior knowledge, the reference waveform of the target signal can be fitted, and real noise is added to the fitted target signal to form a fitted noisy signal. Then the fitted noisy signal is processed by optional wavelets. The evaluation indexes after the processing of each combination of wavelets and decomposition levels are calculated, and the comprehensive evaluation indexes are obtained through the coefficient of variation method. Then the optimal wavelet can be selected. The underwater wire-guided explosive signals were de-noised by the optimal wavelet and decomposition level. Then wavelet packet decomposition of optimal wavelet was used to analyze the energy–frequency characteristics. The results proved the effectiveness of the wavelet selection scheme. The scheme is a general solution that can be applied to signals that have prior knowledge of approximate waveform characteristics, and the results of the selection can be used for a class of signals with similar signal characteristics. The selection of threshold in de-noising involves noise characteristics, which is not within the scope of this paper.