Introduction

The seismic signals describing the underlying geological structure are usually polluted by random noise during the data collection. The more complex geological structure, the lower SNR in seismic record will be. However, poor performance in a nonstationary circumstance restricts the application in some seismic records, especially in the low signal-to-noise ratio (SNR) case. Thus, the improvement of SNR is the critical step of seismic signal processing. How to effectively attenuate random noise, particularly for the records with strong random noise, is the central issue.

Various noise attenuation methods have been proposed and applied to suppress seismic random noise, such as adaptive filtering (Lin et al. 2015), independent component analysis (Wang et al. 2010), fx-deconvolution (Chen and Ma 2014), and tx-prediction filtering (Abma and claerbout 1995). In recent years, noise attenuation in transform domain has become a popular researching trend, such as Fourier transform (Liu and Sacchi 2004), wavelet transform (Liu et al. 2014a, b), Radon transform (Trad et al. 2002), curvelet transform (Górszczyk et al. 2014), and shearlet transform (Liu et al. 2014a, b).

After being proposed, there has been an increasing awareness of time–frequency peak filtering (TFPF) due to its outstanding performance in suppressing nonstationary and strong seismic random noise (Lin et al. 2011). It can effectively recover seismic signal even when it is under the circumstances when the SNR is as low as − 9 dB. However, the fixed window length (WL) leads to a pair of contradictions between random noise reduction and desired signal reservation. The remarkable filtering effectiveness is based on the foundation of linearity unbiased estimation, which results in time-windowing processing in the time domain (Lin et al. 2011).

In order to improve the degree of linearity, some improved TFPF algorithms have been already proposed. In the varying WL TFPF (Lin et al. 2008), the long WL and the short WL are applied to signal segments and noise segments, respectively. But these two kinds of segments are difficult to be distinguished. In the radial-trace TFPF (Wu et al. 2011), linearity of the effective signals is enhanced by radial-trace transform. However, radial filtering traces with fixed 45° angle limit its advantages. Parabolic-trace TFPF (PT-TFPF) (Tian and Li 2014) and variable eccentricity hyperbolic-trace TFPF (VH-TFPF) (Tian et al. 2014) have been proposed. They attempt to resample seismic data through the parabolic-trace and hyperbolic-trace. In case that resampling trace fits reflection event perfectly, the events become more clear and continuous by these two methods. However, filtering trace is difficult to match the mixed seismic events perfectly, particularly in field data.

In this article, we propose a novel method—adaptive linear TFPF (AL-TFPF). Firstly, the noisy seismic record is divided into some overlapping fragments. Similar blocks measured by \(l^{p}\)-norm are stacked together to structure a new matrix that we term it “group.” In this processing, temporal continuity and spatial correlation of seismic events are taken into account. Because of matching operation, similarity in grouped fragments enhances linearity greatly. Then, applying TFPF along inter-block traces will improve the results of de-noising. Grouping is an adaptive processing rather than a changeless operation, such as definite length window or some fixed filtering traces. As a conclusion, the method proposed in this article is a perfect fit for the seismic records.

The outline of this paper is as follows. In “Basic principle of the conventional TFPF” section, we present the basic principle and discuss bias of the conventional TFPF. In “Adaptive linear TFPF” section, we describe the algorithm details and implementation of AL-TFPF. In “Application to the seismic records” section, we show the performance of AL-TFPF on both synthetic models and field data and compare with the other methods. Finally, we draw conclusions at the end of the article.

Basic principle of the conventional TFPF

Principle of TFPF

The conventional TFPF is an effective method to reconstruct the useful signals from the additive random noise. Assume that noisy signal model is \(s(t) = x(t) + n(t)\), where \(x(t)\) is pure seismic record and \(n(t)\) is additive random noise. In this approach, noisy signal s(t) is encoded into an instantaneous frequency of an analytic signal \(Z_{S} (t)\). The peak of the Wigner–Ville spectrum of the analytic signal in the time–frequency domain is the useful components of seismic signal.

Firstly, signal can be modulated, according to the equation:

$$Z_{S} (t) = {\text{e}}^{{j2\pi \mu \int_{0}^{t} {s(\lambda ){\text{d}}\lambda } }}$$
(1)

where \(\mu\) is the scaling parameter, which represents the frequency modulation index.

And then, the peak of the time–frequency distribution of \(Z_{S} (t)\) is extracted as the filtered signal \(\hat{s}(t)\) (Boashash and Mesbah 2004):

$$\hat{s}(t) = \frac{{\arg_{f} \hbox{max} [W_{Z} (t,f)]}}{\mu }$$
(2)

where \(W_{Z} (t,f)\) denotes the pseudo Wigner–Ville distribution (WVD) \(Z_{S} (t)\) of and is given by

$$W_{Z} (t,f) = \int_{ - \infty }^{\infty } {Z\left( {t + \frac{\tau }{2}} \right)Z^{*} \left( {t - \frac{\tau }{2}} \right){\text{e}}^{ - j2\pi f\tau } {\text{d}}\tau }$$
(3)

Bias analysis of TFPF

For special cases that the signal is linear in time and signals have been polluted by stationary white Gaussian noise, TFPF could give an absolutely unbiased estimation (Boashash and Mesbah 2004). However, this condition is usually not satisfied. In that way, the bias B(t) of the TFPF algorithm is evaluated as

$$\begin{aligned} B(t) = & \arg_{f} \hbox{max} \left( {W_{Z} (t,f) \times \frac{{4\pi^{2} k_{{n_{2} }} \mu^{2} }}{{\left( {2\pi^{2} k_{{n_{2} }} \mu^{2} } \right)^{2} + (2\pi f)^{2} }}} \right) \cdot \frac{1}{\mu } \\ \quad \quad & - x(t) \\ \end{aligned}$$
(4)

where \(k_{{n_{2} }}\) is the second cumulant of \(n(t)\) (Boashash and Mesbah 2004).

If the TFPF can provide an unbiased estimation of a linear signal \(x(t) = \alpha t + \beta\), where \(\alpha\) and β are constants, WZ(t, f) is

$$W_{Z} (t,f) = \delta (f - \mu x(t))$$
(5)

Derived from this, the bias \(B(t)\) becomes

$$\begin{aligned} & B(t) = \arg_{f} \hbox{max} \left( {\frac{{4\pi^{2} k_{{n_{2} }} \mu^{2} }}{{(2\pi^{2} k_{{n_{2} }} \mu^{2} )^{2} + (2\pi f - 2\pi \mu x(t))^{2} }}} \right) \cdot \frac{1}{\mu } \\ & - \,x(t) = 0 \\ \end{aligned}$$
(6)

In practice, the conventional TFPF often adopts the pseudo Wigner–Ville distribution (PWVD) (Boashash and Mesbah 2004). It is the windowed version of WVD to satisfy the local linearity and reduce the bias. \(h(\tau )\) is a window function sliding with time. The PWVD is defined as

$${\text{PW}}_{Z} (t,f) = \int_{ - \infty }^{\infty } {h(\tau )Z\left( {t + \frac{\tau }{2}} \right)Z^{*} \left( {t - \frac{\tau }{2}} \right){\text{e}}^{ - \,j2\pi f\tau } {\text{d}}\tau }$$
(7)

So, the recovery result using the conventional TFPF is sensitive to window length. An empirical calculation method of window length is presented (Lin et al. 2007).

$$1 \le {\text{WL}} \le \frac{{0.384f_{s} }}{{f_{d} }}$$
(8)

where \(f_{s}\) is the sampling frequency and \(f_{d}\) is the dominant frequency of seismic signal.

However, a fixed WL is not suitable for all frequency components of the signal. The noisy signal contains two events, whose dominant frequencies are 25 and 45 Hz, respectively. The sampling frequency is 1000 Hz and the offset is 30 m. According to Eq. 8, the maximum optimal WL of 25 Hz is 15 points and the maximum optimal WL of 45 Hz is 7 points. The simulation results in Fig. 1 validate the effects of inappropriate window lengths on both signal amplitude and noise suppression. The longer the window, the stronger the noise suppression capability, the more severe the effective signal attenuation; otherwise, the effective signal amplitude is better, but with more random noise residue.

Fig. 1
figure 1

a Result of WL of 7 points. b Result of WL of 15 points

Clearly, different frequency signals correspond to different optimal WLs. Furthermore, dominant frequency of seismic signal as a priori information is unknown in field seismic data.

Adaptive linear TFPF

To resolve the problems in the conventional TFPF and the PT-TFPF, we propose an AL-TFPF for seismic random noise attenuation in this article. The novel method contains parts of the grouping by matching, inter-block trace TFPF and aggregation.

Grouping by matching

According to the bias analysis, we know that the linearity of the desired signal has an important influence on the unbiased estimation. Therefore, the conventional TFPF and several improved algorithms have done a lot of research in the field of linearity enhancement. The PT-TFPF and the VH-TFPF apply resampling along some filtering traces with fixed shape. Thus, similarity degree between resampling trace and bended reflection events directly influences the noise suppressing effect of improved TFPF algorithm. However, the filtering traces based on some fixed form cannot be applicable for properties (i.e., curvature and inclination) of arbitrary seismic event, in particular for field seismic data containing scattered events.

As previously described, the seismic events are temporal continuous in single channel and spatial correlation between adjacent channels. These properties lead to similarity on waveform of reflected event. However, in the seismic record there is often random noise whose energy is concentrated in certain directions; the noise in these directions is correlated. Based on this, we propose grouping by matching—an adaptive resampling method. We divide each channel into blocks with fixed length N1 and the whole data are divided into N blocks. Blocks are overlapped with step Np. Ideally, block of noise and block of signal can be effectively divided according to some similarity criteria.

Select a block as the reference block \(Y_{R}\) and find candidate blocks \(Y_{x} (x \in [1,N])\) which are similar to the given reference one. Figure 2 gives the detailed procedure of the block matching process. Similar blocks we desired are marked by dashed frame. The similarity measure is calculated by weighted Euclidean distance (\(l^{p}\)-norm, p = 2), such as

Fig. 2
figure 2

Diagram of block matching

$$d(Y_{R} ,Y_{x} ) = \frac{{\left\| {T(Y_{R} ) - T(Y_{x} )} \right\|_{2}^{2} }}{{N_{1} }}$$
(9)

In order to avoid erroneous grouping caused by large noise variance σ, we apply discrete wavelet transform \(T( \cdot )\) to each block. The wavelet transform divides the signals into low scale and high scale, and the effective signals are mainly distributed on the low scale. So we extract approximate coefficients to calculate distance d(YR, Yx). Similar block matching is performed for the simulated seismic record with real noise, in different domains, and the results are shown in Fig. 3. Comparing with matching operation in time domain, we can clearly see that the matching result after wavelet transform is clustered in a smaller area.

Fig. 3
figure 3

Grouping results. a Matching in time domain. b Matching in wavelet domain

The blocks whose distance d(YR, Yx) are smaller than a given threshold \(\tau\) are considered mutually similar and stacked together as a group ZR belong to \(Y_{R}\).

$$Z_{R} = \left \{ Y_{x} ,x \in [1,N]|d(Y_{R} ,Y_{x} ) \le \tau \right \}$$
(10)

Apparently, size of \(Z_{R}\) is \(N_{1} \times n\), where n is quantity of block similar to \(Y_{R}\). Groups belong to noise and noisy signal is shown as Fig. 4. Apparently, if the reference block contains the useful waveform, the blocks in \(Z_{R}\) should all contain reflected events.

Fig. 4
figure 4

Diagram of group

Inter-block trace TFPF

In the above approach, to achieve an unbiased result, the desired signal needs to have high linearity. By Eq. 6, we have proved this conclusion that bias of filtering is close to 0 if seismic signal is linear. Obviously, the conventional TFPF that filters along the time direction does not sufficiently take this issue into account.

In our method, grouping by matching is a procedure that transforms seismic record blocks into some new data matrixes whose size is \(N_{1} \times n\). Because of the similarity, reflection event and noise are divided into different groups. We term horizontal direction as inter-block trace. There is no doubt that the desired signal is linear for noise group. Moreover, the signal amplitude of corresponding position along inter-block trace should be approximate. As a result, we can infer that the linearity is enhanced, which is compared with time direction. This conclusion is verified in Fig. 5.

Fig. 5
figure 5

a A group of pure signal. b Channel along inter-block direction. c Channel along time direction

Figure 5a exhibits a group divided from a 50-channel noisy synthetic seismic record with one reflection event. Figure 5b shows the curve along inter-block trace. Figure 5c shows a block in this group. That is to say, the line depicted in Fig. 5c is along the time direction. Through observation and comparison, we can see that the curve extracted along inter-block trace is smoother than time trace. This describes higher degree of linearity and lower instantaneous frequency which realize lower deviation.

Figure 6 shows de-noising results using two methods, including time trace TFPF (WL = 13) and inter-block trace TFPF in this group. It is easy to deduce that waveform de-noised by inter-block trace matches the reflection event in shape better than the one de-noised by time trace. In addition, proposed method does not need any prior knowledge to estimate the window length. Thus, it is more suitable for the application of the field seismic records. The proposed method is no signal attenuation and distortion caused by inappropriate WL.

Fig. 6
figure 6

Result of two filtering methods for block

We use block matching before de-noising firstly. This greatly increases the computational complexity of the algorithm. For block matching, assuming that the size of noisy signal is \(N \times N\), the size of block is \(1 \times 1\), the step between two blocks is Ns and the search window is \(N_{h} \times N_{h}\). The time complexity is calculated as

$$\frac{{N^{2} \times N_{h}^{2} }}{{N_{s}^{2} }}$$
(11)

Our method is able to improve the de-noising performance by sacrificing computational time.

Aggregation

Through inter-block trace TFPF and block-wise estimates, \(\hat{Y}_{x}\) have been obtained. In general, each block is used several times, because it belongs to the different reference block. And, blocks are also overlapping in the original location. This situation leads to overlapping block-wise estimates. To achieve the global estimate \(\hat{Y}\), we place block-wise estimates back to original location and propose a computing method:

$$\hat{Y} = \frac{{\sum\limits_{{Y_{R} }} {\sum\limits_{{Z_{R} }} {\hat{Y}_{x} (a,b)} } }}{c}$$
(12)

where a and b are coordinates of point in the obtained block-wise estimates. And c is how many times the \(\hat{Y}_{x} (a,b)\) has been used.

Above all, implementation and details of AL-TFPF are described at length. Since the AL-TFPF achieves linear enhancement by matching, it does not require dominant frequency of reflection event to estimate the maximum optimal WL. In addition, AL-TFPF is able to avoid the signal loss usually caused by unsatisfactory WLs.

Application of AL-TFPF in broad band signals

To verify the advantage of AL-TFPF using broad band seismic signals, we simulate seismic signals containing three events, whose dominant frequencies are 25, 55 and 85 Hz, respectively. Actual noise is added to the noise-free data and the SNR is − 5.5977 dB. The frequency of noisy signals ranges from 0 to 200 Hz. The results are shown in Figs. 7 and 8. First, we can find that, after the conventional TFPF (WL = 13), the result of 85 Hz has serious distortion of nearly 50%. Then, we can see that there is almost no distortion in the result of conventional TFPF (WL = 5). However, the noise still exists. Finally, after the AL-TFPF, most random noise has been reduced and the reflection events are revealed clearly.

Fig. 7
figure 7

Results of TFPF with different WLs and AL-TFPFs

Fig. 8
figure 8

Enlarged section of 85 Hz

When a signal contains multiple dominant frequencies, the traditional TFPF cannot consider the different dominant frequencies at the same time, so the de-noising effect is poor. However, the AL-TFPF does not need to calculate the window length, so the de-noising effect is better.

Application to the seismic records

Synthetic seismic record

To verify the performance of the proposed algorithm, we test our method on a 50-channel synthetic seismic record. The record contains three reflected events with dominant frequencies 45, 35 and 25 Hz, respectively. The sampling frequency is 1000 Hz and the offset is 30 m. The desired signals are mixed with a series of stochastic shifting in the traces and embedded in real noise with an SNR of − 4.4474 dB. The real noise is intercepted from real seismic record and the actual noise is a kind of colored noise. The noisy signal is shown in Fig. 9a. We apply the conventional TFPF (WL = 13), the fx-deconvolution (fx-decon), the PT-TFPF (k = 0.0001, WL = 25) and the AL-TFPF \((N_{1} = 50,\) \(N_{p} = 4,\tau = 150)\) to it, respectively. Figure 9b shows the de-noised result by the conventional TFPF. Figure 9c shows the de-noised result by the fx-decon. Figure 9d shows the de-noised result by the PT-TFPF. Figure 9e shows the de-noised result by the AL-TFPF. It is obvious that the noisy data are recovered as the desired one, and the random noise is also adequately suppressed through AL-TFPF.

Fig. 9
figure 9

a Noisy seismic record. b Result by the conventional TFPF. c Result by fx-decon. d Result by PT-TFPF. e Result by AL-TFPF

Figure 10 gives the difference between the input data and the de-noised result of each method. Obviously, the AL-TFPF has the least signal leakage. Figure 11 gives the detailed comparison of each dominant frequency signal with four methods. The proposed method in this article retains the signal amplitude better for seismic events with different dominant frequencies. Figure 12 gives corresponding spectrum diagram. It can be seen that the energy of the improved method is more stable. For quantitative analysis, the amplitude loss corresponds to these four methods, as shown in Table 1. Bold characters in Table 1 indicate the lowest rate of amplitude loss among the three methods at different frequencies of noisy signals. In terms of amplitude preserving, the fx-decon looses more signals in seismic events with 45 Hz, and the PT-TFPF has been improvement comparing to TFPF, but AL-TFPF has achieved a better effect. Figure 13 shows spectrums diagram before and after filtering. By comparing with experimental results, a spectrum diagram of AL-TFPF more close to of pure signal at below 200 Hz and over 200 Hz. These results prove that AL-TFPF attenuates the random noise with less loss of the useful components, in particular for high dominant frequency signals.

Fig. 10
figure 10

a Difference of the conventional TFPF. b Difference of the fx-decon. c Difference of the PT-TFPF. d Difference of the AL-TFPF

Fig. 11
figure 11

a Result of 25-Hz synthetic seismic signal. b Result of 35-Hz synthetic seismic signal. c Result of 45-Hz synthetic seismic signal

Fig. 12
figure 12

a Amplitude spectrum of 25-Hz synthetic signal. b Amplitude spectrum of 35-Hz synthetic signal. c Amplitude spectrum of 45-Hz synthetic signal

Table 1 Amplitude loss with different methods
Fig. 13
figure 13

Amplitude spectrums of single trace with different methods

In order to verify a variety of situations, we adjust the amplitude of noise to obtain different SNRs. The de-noised SNRs of these four methods are shown in Table 2. Bold characters in the Table 2 indicate the highest SNR of the three methods when the SNR of the noisy signal is different. Comparing with the conventional TFPF and the PT-TFPF, the AL-TFPF gets a higher SNR in a variety of situations. Comparing with the fx-decon, the AL-TFPF suppresses noise better, when SNR is greater than − 9 dB. However, the AL-TFPF’s SNR is slightly smaller than the fx-decon’s SNR when SNR is less than − 9 dB.

Table 2 SNRs with different methods

Field data processing

To test the practicality of the proposed method, we apply the AL-TFPF to a field seismic data acquired from a certain zone in China. Figure 14a shows a common shot point (CSP) record with 168-channel. There are 6000 sample points in each channel. The sampling frequency is 1000 Hz and the geophone interval is 30 m. In this seismic record, the reflection events are buried in strong random. And there are some areas whose reflection events are too disordered to identify. Moreover, there are some other types of noise, which may be brought in by the collection device.

Fig. 14
figure 14

a Field seismic record. b Result by the conventional TFPF. c Result by the PT-TFPF. d Result by the AL-TFPF

We apply the conventional TFPF (WL = 13), the PT-TFPF (k = 0.2, WL = 25) and the AL-TFPF \((N_{1} = 100,N_{p} = 10,\) \(\tau = 1 \times 10^{ - 3} )\) to it, respectively. The amplitude of the actual data is within 10−3. In order to facilitate later data processing, we set amplitude of the simulation data to 1. Because the amplitudes are different between the synthetic and the real data, the threshold values are different. The de-noising results after these three methods are given in Fig. 14b–d. We can see that the conventional TFPF performs better in suppressing noise than others. However, background noise still exists, especially in the circled area. Furthermore, the special noise in the middle part of the filed seismic record is not removed. After the PT-TFPF, some originally hidden events are revealed. But background noise and special noise also exist. On the contrary, background noise between the reflection events is attenuated more effectively by the AL-TFPF, shown as in Fig. 14d. Special noise is also removed. From Fig. 15, we can see that each algorithm has some signal loss. The signal loss of the conventional TFPF is less, but the noise suppression is also less. Moreover, the PT-TFPF algorithm looses the most signal. The AL-TFPF has distinct advantages in both eliminating directional random noise and preserving reflected signals in comparison with many other methods.

Fig. 15
figure 15

a Difference section of the TFPF. b Difference section of the PT-TFPF. c Difference section of the AL-TFPF

Conclusion

The AL-TFPF seismic de-noising method is proposed in this article. This novel method resamples the noisy seismic record through matching based on spatial correlation between adjacent channels. Through this step, the AL-TFPF searches the events adaptively to resolve the problem caused by fixed form filtering trace. Therefore, the AL-TFPF is more suitable for the field record with complex events. In addition, the number of the resampling points is similar along inter-block trace in a group. That is to say, linearity is implemented and enhanced by inter-block filtering trace instead of window. This method resolves not only the problem of window length selection, but also signal attenuation even distortion.

By the comparison with the conventional TFPF, fx-decon and the PT-TFPF, we deduced that AL-TFPF performs better in effective components preservation, in particular for high dominant frequency reflection events. Moreover, the recovered events became clearer and more continuous.

Finally, due to the influence of noise, the linearity of the effective signals cannot be improved completely by weighted Euclidean distance in noisy seismic record. The AL-TFPF also needs further research.