Denoising Method for Microseismic Signals with Convolutional Neural Network Based on Transfer Learning

Li, Xuegui; Feng, Shuo; Guo, Yuantao; Li, Hanyang; Zhou, Yingjie

doi:10.1007/s44196-023-00275-w

Denoising Method for Microseismic Signals with Convolutional Neural Network Based on Transfer Learning

Research Article
Open access
Published: 24 May 2023

Volume 16, article number 91, (2023)
Cite this article

Download PDF

You have full access to this open access article

International Journal of Computational Intelligence Systems Aims and scope Submit manuscript

Denoising Method for Microseismic Signals with Convolutional Neural Network Based on Transfer Learning

Download PDF

Xuegui Li ORCID: orcid.org/0000-0001-9249-7509^1,2,3,4,
Shuo Feng^1,2,3,
Yuantao Guo ORCID: orcid.org/0000-0003-1248-7680⁵,
Hanyang Li^1,3 &
…
Yingjie Zhou^1,2,3

1372 Accesses
2 Citations
Explore all metrics

Abstract

Microseismic signals contain various information for oil and gas developing. Increasing the signal-to-noise ratio of microseismic signals can successfully improve the effectiveness of oil and gas resource exploration. The lack of sufficient labeled microseismic signals makes it difficult to train neural network model. Transfer learning can solve this problem using image data sets to pre-train the denoising model and the learned knowledge can be transferred into microseismic signals denoising. In addition, a convolutional neural network (CNN) model with 16 layers is designed for noise reduction. Considering the strong similarity between noisy signals and denoising signals, residual learning is utilized to optimize the denoising model. The simulation experiment results show that the proposed denoising model eliminates the noise in the microseismic signals effectively and quickly, restores the amplitude of the microseismic signals with high accuracy, and has excellent effect in denoising on the information at the edge.

A two-stage seismic data denoising network based on deep learning

Article 08 June 2024

Application of residual learning to microseismic random noise attenuation

Article 28 April 2021

Deep Convolutional Neural Network for Microseismic Signal Detection and Classification

Article 11 November 2020

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

Hydraulic fracturing technology is the core supporting technology for the development of unconventional oil and gas reservoirs. By injecting quantities of high-pressure liquid into the formation, the rock pore pressure rises. When the pressure exceeds the elastic critical value of rock, tension or shear rupture will form a fracture network. Rock rupture will produce microseismic events, and the microseismic wave signals are collected by geophones and analyzed to obtain information such as the location, magnitude and energy of the seismic source [1]. The microseismic monitoring technology is employed to monitor the microseismic signals in the hydraulic fracturing process. However, microseismic signals have the characteristics of low energy, complex noise and weak signals. The collected microseismic signals are affected by the surrounding environment and is often accompanied by many kinds of noises (e.g., drilling interference, acoustic interference and strong pulse interference). The microseismic signals are submerged in noises and cannot be used effectively. As a result, the microseismic signals denoising is introduced to improve the recognition rate of microseismic events, which is significant to microseismic monitoring technology and production increase of unconventional oil and gas reservoirs. Microseismic signals denoising has become a recurring topic of research since the pioneering work in [2] and numerous research results have been reported in the literature [3,4,5,6,7,8,9,10]. In [8], a joint method of CEEMD and wavelet packet threshold has been utilized in experimental analysis and engineering applications. The noise suppression effect of this method is better than the single CEEMD method and the wavelet packet threshold method. In [9], an automated platform has been built for microseismic signals analysis. The system can quickly process large data sets of continuous seismic records, and realize the original seismic signals denoising, the detection of seismic events, then the construction and selection of the best characteristics of each event type. Finally, the event is divided into a specific category.

In recent years, neural networks have stirred a great deal of research attention [11,12,13,14,15,16,17,18,19]. Meanwhile, with its rapid development, convolutional neural network (CNN) has been widely used in image recognition [20, 21], speech signal disposal [22], image denoising [23] and other fields, which have attracted the attention of researchers, see e.g., [24,25,26,27,28,29,30,31,32,33,34,35,36,37,38,39]. In [32], an image denoising framework based on residual learning CNN has been designed for alleviating network degradation as well as improving the accuracy of verification without learning identity mapping. The hierarchical residual learning network is capable of handling multiple general image denoising tasks. Nevertheless, it increases network complexity and relies heavily on batch normalization. In [36], a CNN image denoising method has been proposed for adapting to different image restoration tasks, however, it may trap in a local minimum. In [37], a complex valued deep CNN has been designed for image denoising, which achieves good accuracy with huge calculations.

Classical supervised learning usually uses specific data to train a single task in a given domain. When the domain changes, the model is often no longer accurate or even invalid. Transfer learning refers to applying the knowledge learned in one field to another target field, which has attracted some initial research interest, see [40,41,42,43,44,45,46,47] and the references therein. For instance, in [45], the transfer learning method has been introduced to classify breast cancer in ultrasound images, which has got higher AUC value than CNN method.

High-quality microseismic signals not only guide the implementation of hydraulic fracturing engineering but also play a crucial role in oil and gas extraction, lithology discrimination, and geological exploration. To utilize microseismic signals more effectively, we build a transfer learning based microseismic denoising model of CNN (T-MCNN). The proposed model can successfully address the issue of small samples and raise the microseismic signal-to-noise ratio. The major contributions of this paper are as follows:

(1)
Transfer learning is introduced to learn the complex features of images noises and apply the knowledge to microseismic signals domain. The utilize of transfer learning can solve the problem of insufficient labeled training data and small samples.
(2)
A deep learning model, which based on CNN, is designed for denoising and improving the signal noise ratio. The 16-layer CNN based on VGG network and residual learning can produce excellent noise reduction results.
(3)
Experiments on microseismic denoising are carried out to verify the performance of the designed method. Peak signal-to-noise ratio, mean square error and signal-to-noise ratio are adopted to evaluate the noise reduction effect of various methods. In addition, the timefrequency diagram is also introduced to analyze the denoising effect of the proposed method.

The rest of this article is organized as follows. In Sect. 2, the transfer learning model for microseismic signals denoising is described in detail. Section 3 introduces the designed CNN model. In Sect. 4, the simulation experiments are conducted and analyzed. Finally, Sect. 5 concluded this paper.

2 A Transfer Learning Model for Microseismic Denoising

Most of the traditional deep learning algorithms are supervised learning, which requires abundant data to train network models. CNN is able to learn the potential rules existing in the data, so there are strict requirements for the training set. First of all, the samples in a data set need to meet the condition of independent identical distribution, that is, each sample data is sampled from a feature space with a fixed probability distribution. In reality, it is hard to get large amounts of data that all conform to the same distribution. Secondly, the data of the training network needs to be labeled. The labeled data needs plenty of manpower and material resources, and the distribution of the data will also change with time and environment. Then the labeled data cannot be used again and need to be relabeled. In this case, transfer learning is an idea of problem solving. The knowledge learned in a certain field can be applied to solve problems in a new field. Figure 1 shows the learning process of transfer learning and that of traditional deep learning algorithms.

In the actual working environment, there are complex noises in the collected microseismic signals by the geophone, and it is very complicated to separate the clean original microseismic signals and the actual noise signals. While the same problem exists in the construction of image data sets which are seriously disturbed by environmental noises. Besides, the microseismic data sets do not have enough samples for training of deep learning, while the image data sets contain plentiful samples for training [48]. Therefore, the knowledge of image domain is used to solve the denoising task of microseismic signals by transfer learning. Transfer learning mainly includes two concepts: domain D and task T. The domain D is described as

$$\begin{aligned} D=\left\{ \chi ,P\left( X \right) \right\} \end{aligned}$$

(1)

where $\chi $ is the feature space or sample space, sample data $x=\left\{ x_{1},x_{2},\cdots ,x_{n} \right\} \in \chi $. $P\left( X \right) $ is the probability distribution of the feature space, and the data x of the sample space obeys the probability distribution $P\left( X \right) $. That is to say, the samples in a domain obey the same probability distribution. When the two domains are different, their sample space and probability distribution are different. When a domain D is defined, task T is represented by

$$\begin{aligned} T=\left\{ \gamma ,f\left( \cdot \right) \right\} \end{aligned}$$

(2)

where $\gamma $ is the tag space, $f\left( \cdot \right) $ is the objective function, and T is a task in the domain. Domain D can contain multiple different tasks. The purpose of deep learning model is to learn the objective function from a substantial amount of data pairs $\left\{ x_{i},y_{i} \right\} $

$$\begin{aligned} y=f\left( \cdot \right) =f\left( x \right) ,x\in \chi ,y\in \gamma . \end{aligned}$$

(3)

For given new sample data x, the corresponding prediction value $f\left( x \right) $ can be obtained. In the denoising problem, the image domain is used as the source domain, and the data of the source domain is expressed as

$$\begin{aligned} D_{S}=\left\{ \left( x_{S_{1}},y_{S_{1}} \right) ,\left( x_{S_{2}},y_{S_{2}} \right) ,\cdots ,\left( x_{S_{n}},y_{S_{n}} \right) \right\} \end{aligned}$$

(4)

where $x_{S_{i}}\in x_{S}$ is the sample of image data of the source domain, and $y_{S_{i}}\in y_{S}$ is the corresponding noise sample label. The denoising problem of microseismic signals is taken as the target domain $D_{0}$. Similarly, the sample data of the target domain is denoted by

$$\begin{aligned} D_{0}=\left\{ \left( x_{O_{1}},y_{O_{1}} \right) ,\left( x_{O_{2}},y_{O_{2}} \right) ,\cdots ,\left( x_{O_{n}},y_{O_{n}} \right) \right\} \end{aligned}$$

(5)

where $x_{O_{i}} \in x_{O}$ is the sample of microseismic signals in the target domain, and $y_{O_{i}} \in y_{O}$ is the noise label data corresponding to the sample. Next, the knowledge learned by the model in the image field is transferred to solve the denoising problem of microseismic signals. The process is illustrated in Fig. 2.

Image domain and microseismic signal domain are two different domain problems, so $D_{S} \ne D_{O}$. At the same time, it also means that the data probability distribution of source domain and target domain is not the same, that is $P_{S}\left( X \right) \ne P_{O}\left( X \right) $. For the task $T_{S} = T_{O}$, the source task and the target task are denoising problems, and the problems to be solved are the same. Therefore, the knowledge learned in the image field is utilized to solve the denoising problem in the field of microseismic signal.

3 CNN Denoising Method Based on Transfer Learning

3.1 Noise Model

Image signal and microseismic signal models are defined respectively as follows:

$$\begin{aligned}{} & {} Y_{m}\triangleq X_{m}+ N_{m} \end{aligned}$$

(6)

$$\begin{aligned}{} & {} Y_{e}\triangleq X_{e}+ N_{e}. \end{aligned}$$

(7)

Among them, $Y_{m}$ ($Y_{e}$, respectively) is image signal (microseismic signal, respectively) with noise, $X_{m}$ ($X_{e}$, respectively) is clean image signal (microseismic signal, respectively), and $N_{m}$ ($N_{e}$, respectively) is noise in image signal (microseismic signal, respectively). Due to the strong similarity between noisy signals and denoising signals, it is easier to optimize the mapping of noisy signals to noise through residual learning and CNNs than to directly map clean data. Therefore, the construction of a CNN model is characterized as follows to map the noisy signal $Y_{m}$ and $Y_{e}$ to the noise $N_{m}$ and $N_{e}$:

$$\begin{aligned} {\hat{n}}=Net\left( y;\theta \right) \end{aligned}$$

(8)

where $Net\left( \cdot \right) $ is the constructed CNN model, $\theta $ contains the weight parameter w and the bias parameter b. Define the loss function as follows:

$$\begin{aligned} J\left( \theta \right) \triangleq \frac{1}{2N}\sum _{i=1}^{N}\left\| Net\left( y,\theta \right) -\left( y-x\right) \right\| ^{2}. \end{aligned}$$

(9)

In (9), N is the number of samples. Next, the image data set is used for pre-training the model. The image data pair is $\left\{ y_{m},n_{m} \right\} $ and input into the network model ${\hat{n}}_{m}=Net\left( y_{m};\theta \right) $. By minimizing the loss function, the parameter $\theta _{1}$ is obtained, which is indicated as follows:

$$\begin{aligned} \theta _{1}=\min _{\theta _{1} }\frac{1}{2N}\sum _{i=1}^{N}\left\| Net\left( y_{m},\theta _{1} \right) -\left( y_{m}-x _{m}\right) \right\| ^{2}. \end{aligned}$$

(10)

Through the pre-training of image data set, the pre-training model $Net\left( \theta _{1} \right) $ is acquired.

The task of this paper is to microseismic signals denoising. By taking advantage of transfer learning, the knowledge learned from image denoising is adopted in the task of microseismic signals denoising. The denoising model got from image denoising is fine-tuned by loss function and microseismic data sets. T-MCNN is described as follows:

$$\begin{aligned} {\hat{n}}_{e}=Net\left( y_{e},\theta _{1},\theta _{2} \right) . \end{aligned}$$

(11)

The flow chart of T-MCNN model in this paper is shown in Fig. 3.

3.2 Network Model

Comparison of $3\times 3$ convolution and $5\times 5$ convolution is illustrated in Fig. 4, which reveals that the convolution of $5\times 5$ is performed on the receptive field of $5\times 5$, and an output is obtained. In this process, a $3\times 3$ convolution can be used to process the receptive field of $5\times 5$ first, and a $3\times 3$ output can be obtained. In the second $3\times 3$ convolution kernel processing, the same effect can be obtained as that of the $5\times 5$ convolution kernel processing. When the receptive field is fixed, small convolution kernels are piled up to replace large convolution kernels, which increases the nonlinear layer and thus increases the expression ability of the network with fewer parameters. As displayed in Fig. 5, the output dimensions obtained by two $3\times 3$ convolutions and a $5\times 5$ convolution are the same, but the $3\times 3$ convolution block adds an activation layer and increases the nonlinear expression ability of the network. The structure of VGG network is simple. By stacking $3\times 3$ small convolution kernel and $2\times 2$ maximum pooling layer, the network achieves the depth increase and the performance improvement.

CNN microseismic denoising network T-MCNN based on transfer learning is constructed based on VGG network, and the network module is constructed by combining convolution operation, activation operation and batch normalization operation. T-MCNN is divided into three modules: $\left( 1\right) $ Conv + Leaky ReLU; $\left( 2\right) $ Conv + BN + Leaky ReLU; and $\left( 3\right) $ Conv. The selection of layers of the CNN determines the performance of the model. Although the performance of the network would be improved if too many layers are set, it also brings an increase in the amount of computation. If too few network layers are set, the denoising effect of the network cannot reach the ideal state. Therefore, choosing the appropriate number of network layers is the key to build the denoising model of CNN. To assign an appropriate number of network layers, experimental comparison method is used to set the network layers as 10, 12, 14, 16 and 18 to train 20 epochs, respectively. PSNR, SSIM and SNR are used to evaluate the denoising effect of different depth models. The experimental results are revealed in Table 1.

Table 1 Comparison of denoising effects of models with different layers at 20 epochs

Full size table

It is seen from Table 1 that, among the five different network layers, when the number of network layers is 16, the PSNR value of the network model is 30.72 (the highest), the SSIM value is 0.65, and the MSE value is 54.98 (the lowest). Considering comprehensively, the number of the network layers is set as 16, and the network structure is displayed in Fig. 5. The goal is to map from a noisy signal to a noise signal, so the input and output dimensions of the network are the same size. The convolution kernel size of the first-layer network is $3\times 3\times 1\times 64$, $3\times 3$ is the length and width dimensions of the convolution kernel, 1 is the number of channels of the convolution kernel, 64 is the number of convolution kernels, mainly including Conv and Leaky ReLu components. The convolution kernel size of layer 2–14 network is $3\times 3\times 64\times 64$, the number of channels of the convolution kernels is 64, the number of convolution kernels is 64, and the middle layer contains the Conv, BN, and Leaky ReLu components. The convolution kernel size of network of the last layer is $3\times 3\times 64\times 1$, the number of channels of the convolution kernels is 64, the number of convolution kernels is 1. This is due to the fact that the final output channel is consistent with the data channel.

Operation Batch Normalization is the batch normalization layer, which can be nested in the network layer to normalize the data, improve the generalization ability of the network and accelerate the convergence speed of the CNN. Leaky ReLU is a variant of the ReLU function, which is created to prevent too many neurons from falling into the “dead” state by entering the part of Leaky ReLU that is less than 0 and setting it to a small gradient, solving the problem of too many “dead” ReLU neurons not being able to update.

The number of layers of the network model is determined as 16, and the components contained in each layer of the network model are determined. Next, we proceed to the network training phase. The flowchart of network training is drawn in Fig. 6. First image data is adopted to pre-train the network model to obtain the pre-trained model. Then the pre-trained parameters are used as the initial parameters of the network, the noise containing microseismic data is applied as the input of the network, and the noise is utilized as the output of the network. T-MCNN model is obtained by fine-tuning the neural network. Finally, the test data set is utilized to test the denoising performance of the T-MCNN network.

4 Simulation Experiment and Analysis

4.1 Build the Training Data Set

To train the T-MCNN model, it is necessary to construct an image data set for pre-training the model and a microseismic data set for fine-tuning the model. According to the mapping relationship of T-MCNN model, the input samples are the microseismic signals data containing noise, and the output labels are the noise signal data. 400 grayscale image data are collected as the data set of the pre-training model, and Gaussian noise ($\sigma =50$) is applied. The data set for CNN training are organized as follow: noise (the corresponding noise data, respectively) are regarded as input data (label data, respectively) when synthesizing image data. Patch processing is conducted on the image, and the selection of patch size depends on the level of noise. If the noise is complex, a larger patch may be selected to obtain more information for signal recovery. According to the settings of [49], the patch window size is selected as $40\times 40$ and the sliding step size is 10. Patch data is intercepted from the original image data as the input of T-MCNN network. Figure 7 presents some of the pre-training data.

To meet the mapping principle requirements of T-MCNN model, Ricker wavelet forward modeling is used to synthesize microseismic simulated signals. Gaussian noise is also a commonly used simulation noise for microseismic signals denoising. In the case of unknown noise type, Gaussian noise is used as the simulation of actual noise, which is simple and close to the actual approximate simulation. The expression of Ricker wavelet is described as follows:

$$\begin{aligned} s\left( t \right) =\left[ 1-2\left( \pi f_{m}t \right) ^{2} \right] \exp \left[ -\left( \pi f_{m}t \right) ^{2} \right] \end{aligned}$$

(12)

where $f_{m}$ is the dominant frequency of the Ricker wavelet, and T is the time. Figure 8 is the Ricker wavelet graph.

A total of 8000 microseismic signals are generated, each of which is with 400 sampling points. Select 80 channels of signals to synthesize a microseismic image data. There is a total of 100 microseismic data, and the shape and size are $400\times 80$. The sliding window is chosen as $40\times 40$, with a step size of 10 to slide on the microseismic image data, and a total of 600 blocks of data are obtained as the training data set of the fine-tuned T-MCNN model. The microseismic signals data is characterized in Fig. 9.

4.2 Experimental Training Process

The computer used for this experiment consists of an Intel^® Core™ i7-4510U, CPU, running at 2.60 GHz, an 8 GB RAM, the Win10 64-bit operating system, and a NVIDIA GeForce 840M with 4 GB of memory. The software environment is Matlab R2018a. Figure 10 shows the convergence of training network loss value of SGD and Adam optimization algorithm. It is known that the convergence speed of Adam is relatively better than the SGD algorithm, so the Adam algorithm is chosen as the optimization algorithm in this experiment. As recommended by [50], set the values $\beta _{1}=0.9$, $\beta _{2}=0.99$, $\alpha =0.01$, and $\varepsilon =10^{-8}$. The number of iterations is set as 20, 12 grayscale images are used to add $\delta =50$ Gaussian noise as the image test set, and the value changes of PSNR and SSIM of each generation are counted. As drawn in Figs. 11 and 12, according to the changes of curves, the values of PSNR and SSIM firstly increased and then decreased with the increase of training algebra (epoch). When the algebra is 3, the values of PSNR and SSIM reach the highest value, which are 26.2396 and 0.7123, respectively. Therefore, the trained model with epoch = 3 is selected as the pre-training model.

Microseismic data sets are employed to fine-tune the model, the change curve of loss value is exhibited in Fig. 13. In the pre-training stage, each epoch is trained 3313 times, and a total of three epochs are trained. In the fine-tuning stage of the model, each epoch is trained for 600 times, and three epochs are trained. As is observed from the change curve of loss value in Fig. 13, in the pre-training stage, the model could converge quickly. When the iteration reaches 9939 times, the training of the pre-training model finishes, the fine-tuning of the model starts, and the loss function value continues to decline, finally reaches the convergence state.

4.3 Experimental Results and Analysis

To verify the denoising effect of our proposed CNN model, Ricker is used to generate 10 pairs $400\times 80$ microseismic synthetic data as a test set. As plotted in Fig. 14a, the microseismic signals contain two in-phase axes superimposed together, and the width of the center frequency wavelet ($f_{m}=30$) is 3. The generated Gaussian noise ($\sigma =50$) is applied, as reflected in Fig. 14b. To demonstrate the effect of our proposed CNN model based on transfer learning, we compare the effect of the transfer learning based microseismic denoising model (MCNN) and the non-transfer learning based microseismic denoising model. 400 pieces of microseismic data are utilized to train the model in the MCNN network model without transfer learning, while only 100 pieces of microseismic data are adopted to fine-tune the pre-training model in the T-MCNN network model based on transfer learning. To objectively evaluate the denoising effect of the two models, the PSNR, MSE, SNR and other indicators are calculated of the data after denoising with two models. PSNR measures the similarity between the denoised signal and the original clean signal. A higher PSNR indicates a better denoising effect. SNR represents the ratio of signal to noise. The larger the SNR, the better the denoising effect. MSE measures the error between the denoised signal and the original signal, and a smaller MSE suggests a smaller error between the denoised signal and the original signal, thus indicating a better denoising effect. The results are given in Table 2, which indicate that the three indexes of T-MCNN are all higher than those of MCNN.

Table 2 Comparison of evaluation indexes of denoising effect between MCNN and T-MCNN

Full size table

To further analyze the denoising effect of the T-MCNN model proposed in this paper, the traditional wavelet threshold denoising algorithm (Wavelet), the MCNN algorithm and the T-MCNN algorithm are selected to make a comparison, the results are shown in Fig. 14a–e. As is reflected in Fig. 14c, there is noise residue in both the signal part and the no-signal part, which leads to poor clarity of microseismic signals, and there are still quantities of noise to disturb the microseismic signals. For the CNN model MCNN that is not pre-trained, the same algebra training as T-MCNN is better than the wavelet threshold denoising algorithm, but there is still a small amount of noise residue, and the processing effect of edge position is not ideal. Figure 14e represents the denoising effect of the CNN model based on transfer learning. It is shown clearly that the denoising effect is obvious with almost no noise residue and clear microseismic signals. From the perspective of denoising effect, both MCNN and T-MCNN can effectively complete the denoising work of microseismic data. It is illustrated that CNN is a powerful algorithm, which is capable of completing the denoising of microseismic data validly.

Next, spectrum analysis before and after the denoising of microseismic signals are carried out. Spectrum analysis is to implement Fourier transform on the signals and expand the signal strength in frequency order as a function of frequency change. Figure 14a shows the spectrum of the original microseismic signals, which are concentrated in the low-frequency region. There are two wave peaks in the microseismic signals, corresponding to the part with brighter color, while there is no microseismic signal in other parts. In Fig. 15b, noise information is added, and a considerable number of signals with different frequencies appear, but their intensity is low. The signal part is also affected, and the signal intensity changes. Figure 15c shows the result of denoising by wavelet threshold algorithm. Most of the noise perception with high frequency is processed, while much noise remains in the part close to the signal frequency. Figure 15d shows the effect after denoising of MCNN model. Compared with the wavelet threshold denoising algorithm, it has better denoising effect for low-frequency noise signals. However, there is still a small part of noise, and the signal of the first wave peak does not recover its main intensity information. Figure 15e shows the time–frequency diagram of the denoising results of the T-MCNN model proposed in this paper, which almost eliminates all noise signals and obviously restores the amplitude information of microseismic signals. Note that T-MCNN is the method with the best denoising effect, which successfully eliminates the noise of microseismic signals and retains the main information of signals.

According to Figs. 14, 15, 16 (denoising effect diagram, spectrum analysis diagram, waveform analysis diagram, respectively), the microseismic denoising algorithm proposed in this paper is able to remove the noise in the signal adequately and better protect the details of the signal. To assess the denoising results further, the peak signal-to-noise ratio (PSNR), mean square error (MSE) and signal-to-noise ratio (SNR) are adopted to quantitatively evaluate the denoising results of the model. The microseismic data is one-dimensional microseismic signal, and we transform the microseismic signal into image through the superposition of 80 channel data. In particular, PSNR and MSE are commonly used evaluation indexes for image quality, which are taken as the main reference indexes. Noises with levels $\sigma =50$, $\sigma =30$ and $\sigma =15$ are added to the microseismic signals, respectively. The original microseismic signals are used as references. The statistical results of the evaluation indicators after denoising are given in Table 3. Taking the noise of $\sigma =15$ as an example, the PSNR of the noisy microseismic signals is 17.3090. After carrying on the wavelet threshold denoising algorithm (the T-MCNN algorithm developed in this paper, respectively), the PSNR value increases by 3.5319. Compared to the wavelet threshold denoising method, the T-MCNN method has significantly improved the PSNR of processed microseismic signals. Specifically, the PSNR after T-MCNN processing increased by 53.04$\%$ compared to the ratio after wavelet threshold algorithm processing. Although the wavelet threshold algorithm improves the PSNR of the microseismic signals and reduces the MSE of the microseismic signals, it does not effectively improve the SNR of the microseismic signals. In contrast, T-MCNN method has demonstrated effective denoising ability for high-level noisy signals by improving both the PSNR and SNR of the microseismic signals, while reducing MSE, unlike the wavelet threshold algorithm, which only improves the PSNR and reduces MSE without effectively improving the SNR of the signal. For low level noise, the T-MCNN model proposed in this paper still has better denoising effect. Specifically, compared to the signal processed by wavelet threshold algorithm, the signal processed by T-MCNN exhibits an increase of 27.92$\%$ in PSNR, 271.80$\%$ in SNR, and a decrease of 92.78$\%$ in MSE. Therefore, no matter from the subjective visual analysis or objective quantitative index evaluation, the denoising algorithm designed in this paper has greater advantages, including eliminating the noise signal in the microseismic signals to the maximum extent, restoring the amplitude of the original signal, and protecting the edge detail information well.

Table 3 Comparison of the wavelet threshold algorithm and T-MCNN method

Full size table

Finally, we select randomly a 80-channel microseismic signal and a random signal for waveform comparison and analysis. As plotted in Fig. 16, there are two waveforms in the signal. After adding noise, the waveforms appear distortion, and their maximum value exceeds the amplitude of the original signal, but most of the original waveforms remain. After denoising by wavelet threshold denoising algorithm, the amplitude of waveform decreases, but it is still higher than the amplitude of original signal. For no microseismic signal, the wavelet threshold denoising algorithm can not eliminate the noise, and there is still a large amount of noise. Both MCNN algorithm without transfer learning and T-MCNN based on transfer learning can achieve effective denoising. MCNN model cannot restore the amplitude to the original form, and its amplitude is lower than that of the original signal. In addition, the signal processing of edge position is not good, and the initial amplitude is different from the original amplitude. After denoising with T-MCNN algorithm in this paper, not only is the waveform of microseismic signal well protected with its amplitude being basically the same as that of the original signal, but also is the denoising effect quite obvious. In recent years, fuzzy learning has undergone rapid development, yielding fruitful results in methods such as fuzzy superior Mandelbrot sets [51], complex T-spherical fuzzy sets [52], and complex q-rung orthopair linguistic fuzzy sets [53]. To cope with uncertain and fuzzy data, we plan to enhance the capability of our model by combining transfer learning with fuzzy learning in the future, thus improving the data processing ability and robustness of the model.

5 Conclusion

In this paper, a transfer learning based CNN model has been proposed for microseismic signal denoising. The proposed method offers a novel and efficient method for noise reduction of microseismic signal. It is difficult to separate the microseismic data and noise data in the actual working environment and the synthetic data is not sufficient for network training. To address the problem, transfer learning has been introduced by utilizing the image data sets to pre-train the CNN denoising model and using the learned knowledge for microseismic signal denoising. The small sample problem can be successfully solved by the proposed T-MCNN which can also improve the microseismic signal-to-noise ratio. Experiments have indicated that the proposed method can increase the signal-to-noise ratio with different noise levels. Experiments have indicated that the proposed method can increase the signal-to-noise ratio with different noise levels. However, the proposed method has certain drawbacks. Due to the huge amount of calculation and the high demand on computational resources, it is challenging to apply the designed model on constrained devices, such as embedded microcontroller. In the future, we plan to compress the network, reduce the computation cost, and apply it on embedded microcontroller.

Data Availability

The data presented in this study are available from the corresponding author upon request from interested readers.

Abbreviations

SGD:: Stochastic gradient descent
Adam:: Adaptive moment estimation
CNN:: Convolutional neural network
T-MCNN:: Transfer learning based microseismic denoising model of convolutional neural network
PSNR:: Peak signal to noise ratio
SSIM:: Structural similarity
BN:: Batch normalization
Wavelet:: Wavelet threshold denoising algorithm
MCNN:: Transfer learning based microseismic denoising model
MSE:: Mean square error
SNR:: Signal-to-noise ratio
VGG:: Visual Geometry Group

References

Zhang, E., Zhu, Q., Miu, H., Gao, L., Chao, H., Zhang, Z.: Study on monitoring and predicting of mine ground pressure activities based on microseismic technology. Met. Mine 49(8), 172–181 (2020)
Google Scholar
Du, Z., Foulger, G., Mao, W.: Noise reduction for broad-band, three-component seismograms using data-adaptive polarization filters. Geophys. J. Int. 141(3), 820–828 (2000)
Article Google Scholar
Chen, H., Yang, Z.: Arrival picking of acoustic emission signals using a hybrid algorithm based on aic and histogram distance. IEEE Trans. Instrum. Measurement 70, 3505808 (2021)
Google Scholar
Chen, Y., Chen, W., Wang, Y., Bai, M.: Expression of concern: least-squares decomposition with time–space constraint for denoising microseismic data. Geophys. J. Int. 222(3), 1864–1880 (2020)
Article Google Scholar
Yi, Q., Cheng, T., Wu, Y., Zhang, Z.: Feature extraction and classification method of mine microseismic signals based on CEEMDAN-SE. In: 2020 IEEE 3rd International Conference on Electronics Technology (ICET), Chengdu, China, pp. 602–606, 08–12 May (2020)
Zhang, C., Baan, M.V.D.: Microseismic denoising and reconstruction by unsupervised machine learning. IEEE Geosci. Remote Sens. Lett. 17(7), 1114–1118 (2019)
Article Google Scholar
Zhu, W., Mousavi, S.M., Beroza, G.C.: Seismic signal denoising and decomposition using deep neural networks. IEEE Trans. Geosci. Remote Sens. 57(11), 9476–9488 (2019)
Article Google Scholar
Zuo, L., Sun, H., Mao, Q., Liu, X., Jia, R.: Noise suppression method of microseismic signal based on complementary ensemble empirical mode decomposition and wavelet packet threshold. IEEE Access 7, 176504–176513 (2019)
Article Google Scholar
Li, J., Stankovic, L., Pytharouli, S., Stankovic, V.: Automated platform for microseismic signal analysis: denoising, detection and classification in slope stability studies. IEEE Trans. Geosci. Remote Sens. 59(9), 7996–8006 (2021)
Article Google Scholar
Li, X., Feng, S., Hou, N., Wang, R., Li, H., Gao, M., Li, S.: Surface microseismic data denoising based on sparse autoencoder and Kalman filter. Syst. Sci. Control Eng. 10(1), 616–628 (2022)
Article Google Scholar
Yang, F., Li, J., Dong, H., Shen, Y.: Proportional-integral-type estimator design for delayed recurrent neural networks under encoding-decoding mechanism. Int. J. Syst. Sci. (2022). https://doi.org/10.1080/00207721.2022.2063968
Article MathSciNet MATH Google Scholar
Li, J., Wang, Z., Dong, H., Ghinea, G.: Outlier-resistant remote state estimation for recurrent neural networks with mixed time delays. IEEE Trans. Neural Netw. Learn. Syst. 32(5), 2266–2273 (2021)
Article MathSciNet Google Scholar
Gao, H., Dong, H., Wang, Z., Han, F.: An event-triggering approach to recursive filtering for complex networks with state saturations and random coupling strengths. IEEE Trans. Neural Netw. Learn. Syst. 31(10), 4279–4289 (2020)
Article MathSciNet Google Scholar
Yang, J., Ma, L., Chen, Y., Yi, X.: $L_2$-$L_\infty $ state estimation for continuous stochastic delayed neural networks via memory event-triggering strategy. Int. J. Syst. Sci. (2022). https://doi.org/10.1080/00207721.2022.2055192
Article MATH Google Scholar
Wang, L., Liu, S., Zhang, Y., Ding, D., Yi, X.: Non-fragile $l_2$-$l_\infty $ state estimation for time-delayed artificial neural networks: an adaptive event-triggered approach. Int. J. Syst. Sci. 53(10), 2247–2259 (2022)
Article MATH Google Scholar
Suo, J., Li, N., Li, Q.: Event-triggered $H_{\infty }$ state estimation for discrete-time delayed switched stochastic neural networks with persistent dwell-time switching regularities and sensor saturations. Neurocomputing 455, 297–307 (2021)
Article Google Scholar
Zou, L., Wang, Z., Hu, J., Dong, H.: Partial-nodes-based state estimation for delayed complex networks under intermittent measurement outliers: a multiple-order-holder approach. IEEE Trans. Neural Netw. Learn. Syst. (2022). https://doi.org/10.1109/TNNLS.2021.3138979
Article Google Scholar
Zou, C., Li, B., Liu, F., Xu, B.: Event-triggered $\mu $-state estimation for Markovian jumping neural networks with mixed time-delays. Appl. Math. Comput. 425, 127056 (2022)
MathSciNet MATH Google Scholar
Liu, Y., Wang, Z., Yuan, Y., Alsaadi, F.E.: Partial-nodes-based state estimation for complex networks with unbounded distributed delays. IEEE Trans. Neural Netw. Learn. Syst. 29(8), 3906–3912 (2018)
Article MathSciNet Google Scholar
Paoletti, M.E., Haut, J.M., Pereira, N.S., Plaza, J., Plaza, A.: Ghostnet for hyperspectral image classification. IEEE Trans. Geosci. Remote Sens. 59(12), 10378–10393 (2021)
Article Google Scholar
Yin, J., Zhou, Z., Xu, S., Yang, R., Liu, K.: A 3D grouped convolutional network fused with conditional random field and its application in image multi-target fine segmentation. Int. J. Comput. Intell. Syst. 15, 11 (2022)
Article Google Scholar
Pal, A., Selvakumar, M., Sankarasubbu, M.: Magnet: multi-label text classification using attention-based graph neural network. In: 12th International Conference on Agents and Artificial Intelligence, Valletta, MALTA, pp. 494–505, 22-24 February (2020)
Tian, C., Fei, L., Zheng, W., Xu, Y., Zuo, W., Lin, C.W.: Deep learning on image denoising: an overview. Neural Netw. 131, 251–275 (2019)
Article MATH Google Scholar
Hesamian, M., Jia, W., He, X., Kennedy, P.: Deep learning techniques for medical image segmentation: achievements and challenges. J. Digit. Imaging 32(4), 582–596 (2019)
Article Google Scholar
Hoeser, T., Kuenzer, C.: Object detection and image segmentation with deep learning on earth observation data: a review part I: evolution and recent trends. Remote Sens. 12(10), 1667 (2020)
Article Google Scholar
Hu, J., Shen, L., Albanie, S., Sun, G., Wu, E.: Squeeze and excitation networks. IEEE Trans. Pattern Anal. Mach. Intell. 42(8), 2011–2023 (2020)
Article Google Scholar
Ilesanmi, A.E., Ilesanmi, T.O.: Methods for image denoising using convolutional neural network: a review. Complex Intell. Syst. 7(5), 2179–2198 (2021)
Article Google Scholar
Li, S., Song, W., Fang, L., Chen, Y., Ghamisi, P., Benediktsson, J.A.: Deep learning for hyperspectral image classification: an overview. IEEE Trans. Geosci. Remote Sens. 57(8), 6690–6709 (2019)
Article Google Scholar
Li, J., Dong, H., Wang, Z., Bu, X.: Partial neurons-based passivity-guaranteed state estimation for neural networks with randomly occurring time-delays. IEEE Trans. Neural Netw. Learn. Syst. 31(9), 3747–3753 (2020)
Article MathSciNet Google Scholar
Ranjan, R., Patel, V.M., Chellappa, R.: Hyperface: a deep multi-task learning framework for face detection, landmark localization, pose estimation, and gender recognition. IEEE Trans. Pattern Anal. Mach. Intell. 41(1), 121–135 (2019)
Article Google Scholar
Schlemper, J., Oktay, O., Schaap, M., Heinrich, M., Kainz, B., Glocker, B., Rueckert, D.: Attention gated networks: learning to leverage salient regions in medical images. Med. Image Anal. 53, 197–207 (2019)
Article Google Scholar
Shi, W., Jiang, F., Zhang, S., Wang, R., Zhao, D., Zhou, H.: Hierarchical residual learning for image denoising. Signal Process. Image Commun. 76, 243–251 (2019)
Article Google Scholar
Sony, S., Dunphy, K., Sadhu, A., Capretz, M.: A systematic review of convolutional neural network-based structural condition assessment techniques. Eng. Struct. 226, 111347 (2021)
Article Google Scholar
Wang, Y., Sun, Y., Liu, Z., Sarma, S., Bronstein, M., Solomon, J.: Dynamic graph CNN for learning on point clouds. ACM Trans. Graph. 38(5), 1–12 (2019)
Article Google Scholar
Wang, Z., Chen, J., Hoi, S.: Deep learning for image super-resolution: a survey. IEEE Trans. Pattern Anal. Mach. Intell. 43(10), 3365–3387 (2020)
Article Google Scholar
Li, X., Xiao, J., Zhou, Y., Ye, Y., Lv, N., Wang, N., Wang, S., Gao, S.: Detail retaining convolutional neural network for image denoising. J. Vis. Commun. Image Represent. 71, 102774 (2020)
Article Google Scholar
Quan, Y., Chen, Y., Shao, Y., Teng, H., Xu, Y., Ji, H.: Image denoising using complex-valued deep CNN. Pattern Recogn. 111, 107639 (2021)
Article Google Scholar
Zeng, N., Li, H., Wang, Z., Liu, W., Liu, S., Alsaadi, F.E., Liu, X.: Deep-reinforcement-learning-based images segmentation for quantitative analysis of gold immunochromatographic strip. Neurocomputing 425, 173–180 (2021)
Article Google Scholar
Ke, L., Zhang, Y., Yang, B., Luo, Z., Liu, Z.: Fault diagnosis with synchrosqueezing transform and optimized deep convolutional neural network: an application in modular multilevel converters. Neurocomputing 430, 24–33 (2021)
Article Google Scholar
Cheng, P., Malhi, H.: Transfer learning with convolutional neural networks for classification of abdominal ultrasound images. J. Digit. Imaging 30(2), 234–243 (2017)
Article Google Scholar
Kandel, I., Castelli, M.: Transfer learning with convolutional neural networks for diabetic retinopathy image classification: a review. Appl. Sci. 10(6), 2021 (2020)
Article Google Scholar
Pan, S.J., Yang, Q.: A survey on transfer learning. IEEE Trans. Knowl. Data Eng. 22(10), 1345–1359 (2009)
Article Google Scholar
Ranaweera, M., Mahmoud, Q.H.: Virtual to real-world transfer learning: a systematic review. Electronics 10(12), 1491 (2021)
Article Google Scholar
Zhuang, F., Qi, Z., Duan, K., Xi, D., Zhu, Y., Zhu, H., Xiong, H., He, Q.: A comprehensive survey on transfer learning. Proc. IEEE 109(1), 43–76 (2021)
Article Google Scholar
Hijab, A., Rushdi, M.A., Gomaa, M.M., Eldeib, A.: Breast cancer classification in ultrasound images using transfer learning. In; 2019 Fifth International Conference on Advances in Biomedical Engineering (ICABME), Tripoli, Lebanon, p. 4, 17–19 October (2019)
Ji, D., Wang, C., Li, J., Dong, H.: A review: data driven-based fault diagnosis and RUL prediction of petroleum machinery and equipment. Syst. Sci. Control Eng. 9(1), 724–747 (2021)
Article Google Scholar
Li, H., Jiang, B., Li, Y., Cao, L.: A combined method of crater detection and recognition based on deep learning. Syst. Sci. Control Eng. 9(sup2), 132–140 (2021)
Article Google Scholar
Zhang, Y., Li, X., Wang, B., Li, J., Dong, H.: Random noise suppression of seismic data based on joint deep learning. Oil Geophys. Prospect. 56(1), 9–25 (2021)
Google Scholar
Kai, Z., Zuo, W., Chen, Y., Meng, D., Lei, Z.: Beyond a Gaussian denoiser: residual learning of deep CNN for image denoising. IEEE Trans. Image Process. 26(7), 3142–3155 (2016)
MathSciNet MATH Google Scholar
Kingma, D.P., Ba, J.L.: Adam: A Method for Stochastic Optimization. arXiv preprint. arXiv:1412.6980 (2014)
Tahir, M., Zeeshan, A.: Fuzzy superior mandelbrot sets. Soft. Comput. 26(18), 9011–9020 (2022)
Article Google Scholar
Zeeshan, A., Tahir, M., Miin, S.: Complex T-spherical fuzzy aggregation operators with application to multi-attribute decision making. Symmetry 12(8), 1311 (2020)
Article Google Scholar
Zeeshan, A., Tahir, M.: Maclaurin symmetric mean operators and their applications in the environment of complex q-rung orthopair fuzzy sets. Comput. Appl. Math. 39, 1–27 (2020)
MathSciNet MATH Google Scholar

Download references

Acknowledgements

The authors would like to thank to all partners for knowledge sharing and supports for this research.

Funding

This work was supported in part by the National Natural Science Foundation of China under Grants U21A2019 and 61873058, the Natural Science Foundation of Heilongjiang Province of China under Grant LH2022F008, the Heilongjiang Postdoctoral Foundation of China under Grant LBH-Z18045, the Hainan Province Science and Technology Special Fund of China under Grant ZDYF2022SHFZ105, and the Fundamental Research Funds for Provincial Undergraduate Universities of Heilongjiang Province of China under Grant 2018QNL-56.

Author information

Authors and Affiliations

Artificial Intelligence Energy Research Institute, Northeast Petroleum University, 163318, Daqing, China
Xuegui Li, Shuo Feng, Hanyang Li & Yingjie Zhou
School of Computer and Information Technology, Northeast Petroleum University, 163318, Daqing, China
Xuegui Li, Shuo Feng & Yingjie Zhou
Heilongjiang Provincial Key Laboratory of Networking and Intelligent Control, Northeast Petroleum University, 163318, Daqing, China
Xuegui Li, Shuo Feng, Hanyang Li & Yingjie Zhou
Sanya Offshore Oil and Gas Research Institute, Northeast Petroleum University, 572025, Sanya, China
Xuegui Li
Information Archive Center, AECC Harbin Dongan Engine Co., Ltd., 150060, Harbin, China
Yuantao Guo

Authors

Xuegui Li
View author publications
You can also search for this author in PubMed Google Scholar
Shuo Feng
View author publications
You can also search for this author in PubMed Google Scholar
Yuantao Guo
View author publications
You can also search for this author in PubMed Google Scholar
Hanyang Li
View author publications
You can also search for this author in PubMed Google Scholar
Yingjie Zhou
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

In this study, XL, SF, YG prepared the initial draft of the manuscript. HL and YZ revised and reviewed the manuscript. Authors read and approved the final manuscript.

Corresponding author

Correspondence to Yuantao Guo.

Ethics declarations

Conflict of Interest

The authors declare that they have no conflict of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Li, X., Feng, S., Guo, Y. et al. Denoising Method for Microseismic Signals with Convolutional Neural Network Based on Transfer Learning. Int J Comput Intell Syst 16, 91 (2023). https://doi.org/10.1007/s44196-023-00275-w

Download citation

Received: 08 November 2022
Accepted: 15 May 2023
Published: 24 May 2023
DOI: https://doi.org/10.1007/s44196-023-00275-w

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Denoising Method for Microseismic Signals with Convolutional Neural Network Based on Transfer Learning

Abstract

Similar content being viewed by others

A two-stage seismic data denoising network based on deep learning

Application of residual learning to microseismic random noise attenuation

Deep Convolutional Neural Network for Microseismic Signal Detection and Classification

1 Introduction

2 A Transfer Learning Model for Microseismic Denoising