1 Introduction

Automatic of seismic event detection and location can significantly improve efficiency of routine seismic network processing. This is especially important for seismic monitoring of CTBT verification. As magnitude of a 1 kt fully decoupled explosion may be as small as mb 2.0+ (Murphy 1996), events routinely need to be timely detected and located may amount to such a large number that are hard to be manually picked up. For example, it is estimated that annually there are over 1 million earthquakes with magnitude above mb 2.0 occurring in the world. In addition to this, nowadays a station-network operator may find that it is relatively easy to install more seismic stations to improve the network’s coverage. However, due to the deficiency of reliable automatic data processing system, often it is the case that the increasing amount of data is hard to be fully utilized for it is too large to be analyzed by a limited number of human analysts.

Although it may not be always necessary by some scholars (e.g., Ryzhikov et al. 1996; Withers et al. 1998), most of the practical automatic processing systems used by seismic station-network operators follow the procedure of signal detection, phase-type identification, signal association, and location, which is abbreviated as the DIAL in this paper. By this approach, possible seismic signals are automatically picked up by some kind of detectors [e.g., STA/LTA (Allen 1978; 1982; Evans and Allen 1983; Baer and Kradolfer 1987; Earle and Shearer 1994; Withers et al. 1998); PMCC (Cansi 1995); artificial neural network (Dai and Macbeth 1995, 1997; Zhao and Takano 1999); F-detector (Arrowsmith et al. 2009; Selby 2008, 2011)]. As a detection is triggered, its onset time is further estimated using some more sophisticated procedures [e.g., AR-AIC (GSE/JAPAN/40 1992; Sleeman and van Eck 1999); CUMSUM (Inclán and Tiao 1994; Der and Shumway 1999); HOS (Saragiotis et al. 2002; Küperkoch et al. 2010b); Wavelet Transform (Tibuleac et al. 2003; Zhang et al. 2003); MPX (Stefano et al. 2006); Tpd (Hildyard et al. 2008)], as well as its characterization parameters, including amplitude, dominant frequency, backazimuth, slowness, polarization pattern, etc., are measured. Based on these characterization parameters, specific algorithm such as artificial neural network usually is used to identify the phase type of the triggered detections (e.g., Roberts et al. 1989; Suteau-Henson 1991; Tong 1995; Wang and Teng 1995, 1997; Tong and Kennett 1996; Anant and Dowla 1997; Bai and Kennett 2000, 2001; Wang 2002). Search algorithms are then used to associate detections possibly from same events together and hypocenter of these events are then located (e.g., Anonymous (IDC-5.2.1). 1999; Küperkoch 2010).

The seismic data processing system that is currently used at the Vienna International Data Center is a typical example of applying DIAL approach. This system was designed to automatically detect and locate seismic events recorded by the seismic component of the International Monitoring System before they can be presented to human analysts for review. According to the IDC Documentation IDC-5.2.1, the processing of seismic data at IDC is divided into two stages, i.e., the station processing stage and the network processing stage. During the station processing stage, a running STA/LTA detector is applied to detect potential seismic signals after raw waveform data from individual stations have passed through sequential measures for data quality checking and signal-to-noise ratio improvement. Then, characteristics include arrival time, amplitude, period, azimuth, slowness, and polarization feature for declared detections are extracted. Based on these characteristics, two different methods, artificial neural network for three-component seismic stations and simple rule for seismic arrays, are used to categorize the detected signals as one of the four wave type, i.e., noise, teleseismic P, regional P or regional S. Then, detections from the same events are then grouped into station association groups. For each group, the first arrival is called as the generator, and its signal type is a local P, regional P, or teleseismic phases, and subsequent phases in each group are identified based on their compatibility with the generator, as determined by the arrival times and compatibility tests, which details are expatiated in IDC-5.2.1. Finally the phases identified during the station processing is feed to the network processing stage to associate with events using a grid-based event search procedure and to locate and estimate the magnitude of the events using arrival time and amplitude of the associated signals. Detail explanation about the grid-based event search procedure can also be found in IDC-5.2.1.

Though the automatic processing of seismic data at IDC is very complex, its performance hardly can be said satisfying by producing too many false events, while missing a fair portion of real (e.g., Procopio et al. 2009; Pearce 2009; Le Bras et al. 2011; Draelos et al. 2012), which has already attracted extensive attention among the CTBT monitoring community and some new techniques have been proposed to deal with this problem (e.g., Le Bras et al. 2011; Draelos 2012; Arora et al. 2012; Moore et al. 2012). However, most of the new techniques (e.g., Draelos 2012; Arora 2012) are still based on the existed signal characterization parameters and performance improvement by these techniques remains to be validated.

In fact, the poor performance of conventional automatic processing method is largely because the existing parameters fail to sufficiently characterize the signals’ feature. The characterization parameters exploited by conventional automatic processing systems (as that of IDC) are nearly all extracted from short waveform segments of seismograms. Though they may reflect signals’ transient behaviors well, they can neither reflect the whole shape of seismograms, nor fully reflect relationships between different seismic detections, both of which are important for seismogram interpretation.

Here, in this paper, we present a novel technique for automatic seismic data processing which utilizes both integral and local feature of seismograms. Here, the term integral feature of seismograms largely refer to feature about the envelope shape of seismograms. However, unlike some previous efforts which completely abandon the DIAL approach and seek to use envelope cross-correlation to detect seismic events directly (e.g., Ryzhikov et al. 1996; Withers et al. 1998; Moore et al. 2012), we keep to follow the DIAL approach, but include integral feature of the whole seismograms for phase identification and association. Results of preliminary application of this technique show that the inclusion of integral feature in data processing can greatly reduce false and missed events simultaneously and have the potential to realize accurate automatic seismic data processing.

2 Definition and extraction of signals’ integral feature

Integral feature about the shape of the whole seismograms play important roles in routine analyses by human analysts. Usually an experienced analyst can tell whether a seismogram is local, regional or teleseismic simply by glimpsing at it, picking up major phases such as P, S, and Lg by their position in the seismogram. However, because most feature utilized by human analysts are not clearly defined and parameterized, it is difficult to directly employ these feature for automatic seismic data processing.

Nevertheless, it is beneficial to think about how human analysts pick up and identify seismic signals. Unlike conventional automatic processing systems, human analysts usually pick the whole wave-train of a seismogram first, then determine its epicentral distance type by its envelope shape and frequency characteristics, and finally pick individual phases by relative amplitude variations and identify their phase types by their relative position in the seismogram along with their polarization and frequency feature.

Enlightened by this process, here, we propose a strategy to parameterize signals’ integral feature. The basic idea of this strategy is illustrated in Fig. 1. This strategy uses a modified STA/LTA detector (see Appendix for detail) to detect individual signal detections and extracts local feature such as amplitudes, dominant frequency, and polarization characteristics of them as well. However, in the process of picking up individual signals, we simultaneously identify continuous wave-trains and associate them with various detections simultaneously. To do so, we first define that a wave-train is a segment of waveform that starts from its onset time to the time when STA/LTA0 drops back to be less than a given threshold ε (it is set to be 1.1 in practice) where LTA0 is the LTA value just before the first triggered detection of it, i.e.

Fig. 1
figure 1

Illustration of the concept of wave-train and its integral feature. The top panel is the waveform of a regional seismogram of epicentral distance about 3.4°. The practical time separation between D0 and D3, which actually corresponding to Pn and Lg, respectively, in this case, is about 50 s. The waveform has passed a 1.5–2.5 Hz bandpass filter to make major regional phases, e.g., Pn, Pg, Sn, and Lg, clear. The bottom panel is its envelope represented by the STA curve

LTA 0 = 1 L i = t 0 - L t 0 x ( i ) ,
(1)

where L is the length of LTA window. For convenience, here we denote the onset time of the first detection as t 0 , and the time when STA/LTA0 drops back to below ε as t e , and the time difference

T D = t e - t 0
(2)

as the wave-train’s duration time. Second, we categorize all declared detections into the beginning type and the following type, i.e., detections at the beginning of a continuous wave-train (e.g., D0 in Fig. 1) and detections during its duration (e.g., D1, D2, and D3 in Fig. 1). In practice, we use a parameter D flag , which may be called as triggered state, to flag these two kinds of detections, i.e., D flag = 0 corresponds to beginning detections and D flag = 1 for following ones. Third, we extract characterization parameters, including amplitude, signal-to-noise ratio, and time delay of envelope peaks of wave-trains and assign them to various detections. For example in Fig. 1, assume that a wave-train starts at t0 and ends at te, and that there are n + 1 triggered detections, i.e.,

D0, D1,…, D n between this time period, where D0 (with arrival time t0) is the beginning detection and D i (with arrival time t i where i = 1,…, n) is the following detections. Denote the envelope peak of the whole seismogram as P0 (with arrival time t P 0 ) and the envelope peak of the waveform segment between detections Di−1 and D i as P i (with arrival time t P i , i = 1,…, n), then characterization parameters for P0 are

A P 0 = max t 0 < t < t e STA ( t ) = STA ( t P 0 ) , SNR P 0 = A P 0 / LTA 0 , T P 0 = t P 0 - t 0 ,
(3)

while characterization parameters for P1 are

A P 1 = max t 0 < t < t 1 STA ( t ) = STA ( t P 1 ) , SNR P 0 = A P 1 / LTA 0 , T P 1 = t P 1 - t 0
(4)

and so on. With these parameters, a wave-train’s general envelope shape can be reflected. For example, for the seismogram in Fig. 1, a general shape of its envelope can be inferred to be similar to that in Fig. 2 by assuming that envelope amplitudes rise or decay exponentially. In practice, these parameters, as well as wave-trains’ duration time can be estimated within a series of frequency bands.

Fig. 2
figure 2

Inferred envelope shape of the seismogram in Fig. 1 using arrival times and the integral characterization parameters defined in the paper

Integral feature defined as above can be easily extracted in the process of signal detection and can be easily embedded into conventional DIAL approach to improve its performance as well. First, each separate continuous wave-train can be represented by its beginning detection. And in cases of local or regional seismograms, beginning detections normally should be the first P arrivals (e.g., Pg or Pn). It is impossible that a following detection is the first P arrival of an event except for rare situations of overlapping signals of multiple events. For this reason, beginning detections naturally are good candidates of direct P phases. Second, following detections during the duration of a wave-train can be readily associated to respective beginning detections, considering their compatibility in amplitude, frequency, and azimuth. Further more, with characterization parameters of envelope peaks, we may determine a following detection’s position in its related wave-train, which is of importance for phase-type identification. One simple way to characterize a detection’s relative position within the seismogram is to use the ratio

r = t - t 0 T P 0 ,
(5)

where t is the detection’s arrival time, t0 is the onset time of respective beginning detection, and T P 0 the time delay of the envelope peak of the whole seismogram. In case of local or regional seismograms, the arrival time of regional S phases (e.g., Sn and Lg) normally should be a little before the envelope peak of the whole seismogram (e.g., Fig. 3), especially within low-frequency band such as 0.5–1.5 Hz and on horizontal components. In contrast, detections of secondary P phases, e.g., Pg of regional seismograms, normally are far before the envelope peak of the whole seismograms.

Fig. 3
figure 3

Illustration of typical three-component seismograms recorded at regional distances. The waveforms are from an earthquake of ML 4.2 occurring in Southwest Xinjiang, China on 2010/06/02. The waveform columns from left to right are 0.5–5 Hz filtered waveforms on vertical, south-to-north and west-to-east component, respectively

3 Application of the seismograms’ integral feature in seismic data processing

Based on the strategy described above, we have developed a prototype software, which is called as DRSN (i.e., Detector for Regional Seismic Networks) by us, for automatic seismic data processing for regional seismic station networks. As illustrated in Fig. 4, the processing flow of this prototype software can be split into 5 stages. The task of the first stage is to detect possible signals and extract their feature, both local and integral, simultaneously. The task of the second stage is to categorize detections into P waves, S waves and noise types using neural networks similar to that of Wang (2002). The task of the third stage is to search for P/S pairs of local or regional events, and the tasks of the fourth and fifth stage are to complete network association and localization for regional and global events, respectively. Currently integral feature are mainly applied in the third stage, though their application in the second stage has also been investigated by us. We will be back to discuss this point later.

Fig. 4
figure 4

DRSN data processing flow

Finding major P and S arrivals and grouping them together is of key importance for detecting and locating regional events. Figure 5 is a diagram to explain how integral feature defined in Sect. 2 can be used for this objective. The searching process for P/S pairs is divided into two steps. The first step is to search out candidate direct P arrivals from beginning detections; and the second step is to search out major S arrivals from respective following detections. During the searching process, candidate direct P arrivals and respective P/S combinations have to go through a series of tests, including: (1) compatibility between the beginning detection’s signal-to-noise ratio (SNR) and the duration time of respective wave-train (T1); (2) compatibility between amplitude of assumed P and S phases from same events (T2); (3) compatibility between assumed phase type and measured polarization characteristics (T3); (4) compatibility between amplitude ratios among different frequency bands and the assumed S-P time difference (T4); and (5) compatibility between assumed S-P time difference and position of respective seismogram’s envelope peaks (T5).

Fig. 5
figure 5

Illustration of the method to search for regional P/S pairs by integral and local feature together

Among these tests, T2 and T3 are conventional and reasons of them are easy to understand. For test T1, the reason behind it is that if a signal is of very low SNR, but with very long-duration time, especially when it occurs within high-frequency bands such as 8–16 Hz, then corresponding signal usually is a noise burst, such as that in Fig. 6a. On the other hand that if a signal is of great SNR, but very small duration time also within high-frequency band, then it may be signals emitted by microseismic events very close to the recording station and normally cannot be associated to other regional stations (e.g., Fig. 6b). The reason for test T4 is that, if an assumed P/S pair has a time difference corresponding to local distances, then respective signals, especially the P waves, should have rich high-frequency compositions. In practice this can be used to verify a local P/S pair by checking comprising signals’ spectral ratios against the assumed S–P time difference (Fig. 7). The reason for test T5 has already been mentioned in Sect. 2, i.e., in case of local or regional seismograms, the arrival time of major S arrivals normally should be a little before the envelope peak of the whole seismogram, at least within some low-frequency bands such as 0.5–1.5 Hz and at horizontal components. This is true even for underground explosions which can generate P waves more efficiently than earthquakes. Based on this phenomenon, we apply the following condition, i.e.

Fig. 6
figure 6

Testing the compatibility between the beginning detections’ high-frequency SNR and wave-train’s duration to screen out noise bursts (a) and microseismic signals (b), marked by the red arrow in both cases. Detections due to both cases can be viewed as disturbances to meaningful processing results. Waveforms illustrated in both panels are 8–16 Hz bandpass filtered, and readers are reminded to note the difference of the timing scale between them

Fig. 7
figure 7

Testing the compatibility between signals’ spectral ratio and assumed S–P time difference to search for regional P/S pairs. The waveforms in a are the 3C seismogram of a local earthquake, while the waveforms in b are the seismogram of a teleseismic event. Plotc compares the displacement spectra of the beginning detections of the two seismograms, both spectra were computed using data within the 10 s long window beginning from the onset time of D0. It is clear that the signal in a has rich high-frequency content which is compatible with the assumption that D0 and D1 consist a P/S pair with S–P time difference a little more than 10 s. In contrast, the signal in b is deficient of high-frequency content which would contradict the assumption that D0 and D1 consist a P/S pair as the arrival time difference between them is about 15 s

- ε 1 < T P 0 - ( t S - t P ) < ε 2
(6)

to test whether the assumed P/S pair is compatible with the envelope shape of respective seismogram, where T P 0 is the time delay of the peak of the whole wave-train, ɛ1 and ɛ2 are empirically specified parameters, and in practice ɛ1 can be set to be 1 s; and ɛ2 may take different values between 5 and 30 s, depending upon different t S - t P range for the assumed P/S pair. The reason to set ɛ1 to be 1 s instead of zero is for the consideration of that possible error may exist for S arrival time estimation in practice and in some cases the time interval T P 0 - ( t S - t P ) may be very small (e.g., S waves with very sharp onset). An example of applying this condition is illustrated in Fig. 8. The envelope peaks on the seismograms in thefigure is about 90 s after its beginning detection D0. By in Eq. (6) we can avoid mistaking the following detection D1 as a S phase, which often occurs when only local feature are applied for phase association, especially in the case of 3-component stations.

Fig. 8
figure 8

Testing the compatibility between the S–P time difference and envelope peaks for phase-type identification for detections on regional seismograms. Waveforms displayed have been 0.5–1.5 Hz filtered, and the arrow indicates the general location of envelope peaks of the seismograms

4 Test results using data from Xinjiang seismic station network

The approach presented here has been applied to routine data processing by us for regional seismic station networks of various scales for years. Promising results have been achieved with both false and missed events rate normally less than 10 %, in comparison prior to its application the false events rate had accounted about 50 %; and missed events rate for regional events having signals recorded at least at three stations accounted more than 15 %. As an example, here we present the test processing results of using two weeks’ continuous data of the full-scale Xinjiang Seismic Station Network (XJSSN) which is illustrated in Fig. 9. The time period of data that we processed is from 06/01/2010 to 06/16/2010. Automatically detected events were manually checked, and the results of regional events were compared with XJEB, the event bulletin produced by the Seismological Bureau of Xinjiang Province, China, while results of teleseismic or far regional seismic events outside the network are compared with that of the Reviewed Event Bulletin of the Vienna International Data Centre.

Fig. 9
figure 9

Locations of seismic stations of the Xinjiang seismic station network (XJSSN)

For the time period above mentioned, there are 3,654 events detected by our prototype software, only 284 of them are false or split events, approximately accounting for 7.8 %. To evaluate the detection rate for regional and local events, events listed in XJEB were compared to events automatically detected. Within the time period from 06/01/2010 to 06/16/2010, there are 1,025 events detected in XJEB, among which 136 events are of magnitude above ML 2.0, 325 events between ML 1.0–2.0, 48 events below ML 1.0, and 516 events are underdetermined, i.e., events defined by P and S waves at only one station. For an underdetermined event, its epicenter estimation is not given in the XJEB if S–P time difference at the defining station is greater than 3 s and it is assigned to be the station location otherwise. Examples of automatically detected regional events are given in Fig. 10, and detection rate for these four event groups is summarized in Fig. 11. The detection rate is 100 % for events of ML > 2.0, 98.5 % for ML between 1.0–2.0, 95.8 % for ML < 1.0, and 52.1 % for events underdetermined. The major cause why detection rate forunderdetermined events is much lower than that for other three event groups is that most of these events are micro-earthquakes very close to the recording station with S–P time difference only a few seconds, and our software especially its signal detection program is not designed to detect such events. For example, the signal detection program of our software requires that a detection to be triggered should be at least 3 s later than the last detection before it. However, among the 516 underdetermined events, 174 of them are of S–P time difference less than that. Besides nearly half the underdetermined events, there are another 7 events listed in XJEB were not detected by DRSN. These events typically did not have good signal-to-noise ratio and had both clear P and S waves at stations less than three. However, for the undetected events, P and S waves at these key stations had not been both detected or though detected, but not grouped together due to causes include: (1) emergent P wave; (2) very short S–P time interval (<5 s); (3) data dropout occurring just ahead of signal’s arrival time; (4) error in estimating arrival time or polarization characteristics; (5) the signal were affected by coda wave of other big events and the detection corresponding to the first P wave was identified as the following type; and (6) the only stations with both P and S waves detected just had a small amplitude S. The first three causes may lead to P or S wave fail to be detected, and the later three may lead to P and S waves fail to be grouped together. And for the 7 undetected events, generally there were more than one causes happened together and so they failed to be detected.

Fig. 10
figure 10

Examples of automatic processing results for two regional events. a Seismograms of an ML 2.6 Southern Xinjiang earthquake occurred at 2010/06/09 10:58:59. The epicenter of this earthquake is (42.33ºN, 84.82ºE) in XJEB, our automatically determined epicenter is 11 km away from it. b Seismograms of an mb 3.4 earthquake occurred at 2010/06/09 16:01:08 in the Hindu Kush Region, Afghanistan. For the epicenter of this earthquake is outside of XJSSN, no information is given in the XJEB. However, the IDC REB determined its epicenter and depth is (35.89ºN, 69.57ºE) and 82.5 km, respectively, and our automatically determined result is epicenter (36.1143ºN, 69.771ºE), depth 107.8 km. For each seismogram in this figure, respective recording station and epicentral distance are given at its left side, and all detections triggered within the displayed waveform time window are added, where red ones are automatically associated defining phases for corresponding event, black ones are unassociated, and N stands for noise. Readers should be aware that phase label for only associated detections is determined by integral feature, and that for unassociated phases are determined by a neural network with only local feature as the input

Fig. 11
figure 11

Comparison of automatically detected events with events listed in the XJEB

Comparison between automatically estimated epicenters and epicenters in XJEB for events of ML >2.0 are given in Fig. 12. The average deviation between them is 15.6 km. Considering thatthe Xinjiang seismic station network is quite sparse and average minimum epicentral distance for the earthquakes is close to 100 km; this is a fairly good result in practice and suggests that the application of integral feature helps improve the performance of not only events detection but also localization.

Fig. 12
figure 12

a Comparison of automatically determined epicenters (red dot) with that in XJEB (open blue circle) for events of ML >2.0 and b the distribution of relative location error between them

It is worth to point out that the number of local or regional events detected by our prototype software is much larger than that listed in XJEB (almost triple of it). Local or regional events detected by us, but not listed in XJEB include events outside Xinjiang, events corresponding to quarry blasts and events that are too weak to be efficiently picked up by human analysts or cannot meet the quality requirement of XJEB. Typical examples of the unmatchable events are given in Fig. 13.

Fig. 13
figure 13

Examples of local/regional events detected by DRSN but not listed in XJEB. Waveforms displayed have already been filtered using appropriate filters. Phases displayed on the trace of BHE components in green are manual picks, and meaning of picks displayed on BHZ component is the same as that in Fig. 10. a small earthquake recorded by only one station (2010/06/01 07:12:42, 36.4°, 81.5°, ML–0.5); b small quarry blasts recorded also by only one station with P, S and Rg discernible (2010/06/01 06:57:39, 44.2°, 88.5°, ML ~0.5); c small earthquake near Xinjiang-Tajik border region (2010/06/01 06:50:47, 37.7°, 72.8°, ML ~2.5); d small earthquake near Xinjiang-Xizang border region (2010/06/01 06:58:03, 34.5°, 80.9°, ML ~2.3); e quarry blast recorded at multiple stations (2010/06/01 07:10:23, 43.3, 87.7, ML ~1.5); note that signals of another detected event (2010/06/01 07:09:54, 44.0, 87.7, ML ~0.6) can also be seen at two of the stations; f another quarry blast with signals at multiple stations (2010/06/01 07:19:43, 44.8, 89.3, ML ~2.0), S phases for the event cannot be reliably determined. Note that all these events occurred in half an hour (2010/06/01 06:50-2010/06/01 07:20). Actually DRSN detected 11 events during this time period, among them only two are listed in XJEB

Besides the good performance to detect local and regional events, the prototype software also performs well to detect teleseismic or far regional events outside of the network’s coverage. The method about how to utilize a regional seismic station network to effectively and reliably detect teleseismic events is beyond the scope of this paper and will be discussed in another one. Here, we just mention that the integral feature introduced in this paper are also important for the identification and association of teleseismic signals for it can help to greatly reduce the number of detections need to be considered in the signal association process by screening out the following detections, as well as obvious noises and local signals.

5 Discussion and conclusions

Integral feature such as overall envelope shape are important for seismogram interpretation. It is surprising that to-date they are not utilized in main-stream automatic seismic data processing possibly due to the fact that they have not been clearly defined and parameterized before. Here in this paper we have shown that the overall envelope shape of the whole seismograms can be depicted by only a few parameters. On the base of this progress, we have developed a novel technique to incorporate seismograms’ integral feature into the conventional DIAL approach for seismic data processing. In addition to picking up individual signal detections, the new technique picks up separate wave-trains and extracts their characterizing parameters at the same time. In comparison with conventional local feature for individual signals, the newly introduced characterizing parameters for the whole wave-trains can better reflect the relationship among different detections, as well as their relative position within respective seismograms, and its combination with signals’ local feature including amplitude, frequency, azimuth, and polarization can lend help to resolve problems about signal association and phase-type identification, especially in the case of 3-component seismic stations, and consequently reduce the rate of both false and missing event rate a lot.

Nevertheless, some may argue that in case of very weak signals or signals of overlapping wave-trains of multiple events, the effectiveness of the integral feature would be discounted. Indeed, these are two situations that should be cautious to deal with and why local feature are still needed. For signals of overlapping wave-trains of multiple events, the direct P wave of the second events may have Dflag characterization of 1 other than 0. Fortunately, the frequency of this kind of situation is low, and often it can be identified by testing the compatibility between its features (e.g., amplitude, frequency, azimuth, etc.) against that of the signals of the first one. For example, if we have a detection D n that is really a direct P wave, however, its Dflag is mislabeled as 1 for the interference of another wave-train arriving at the station just a little while before it. For convenience, here denote detections of the interference wave-train with arrival times previous to D n to be D0, D1, …, Dn−1, where D0 is the beginning detection of Dflag = 0. In practice, D n can be accepted as a possible direct P wave to test if one of the following situations emerges. (1) The amplitude of D n is significantly larger than that of from D0 to Dn−1; (2) D n has richer high-frequency content than D0 does; (3) The azimuth and slowness indicates that D n cannot be coda P or S waves of D0 in case of array stations, and in case of 3-component stations this may be surrogated by polarization characteristics, however, it is much less reliable for the later situation and one should be cautious about applying it; (4) An S wave has already been associated to D0 and D n has characteristics inconsistent with a S coda wave. For very small events, the problem that should be dealt with is different. In this case, it is possible that P and S waves even at regional stations beyond some distance cannot constitute a continuous wave-train due to amplitude of P coda waves have dropped to below that of background noises before the arrival of S waves. Under this situation, we abandon the search for regional P and S pairs and will deal with them later in the network processing stage in case of three component stations, or they may be still grouped together taking consideration of their compatibility in azimuth, slowness, amplitude, and dominant frequency in case of array stations. Anyway, the application of integral feature would have no negative effect upon processing result in comparison with conventional approaches even for the two kinds of special situations.

As we have mentioned earlier that currently we have not apply the integral feature in the artificial neural network for phase-type identification, though we believe that these feature may improve its performance. The neural network that we currently used in DRSN for phase-type identification is similar to that described by Wang (2002). The later used three cascade neural network to categorize detections into four wave types, i.e., noise, teleseismic, regional P or regional S, where the first neural network distinguishes between noise and signal, the second distinguish between teleseismic or regional, and the third distinguish between regional P and regional S. There are 15 attributes used together for the input of each of the neural networks, including dominant period, polarization attributes, contextual attributes, and a spectral representation of the horizontal-to –vertical power ratio whose exact definition was explained in Wang (2002). The neural networks that we practically used differ from that in Wang (2002) in three aspects: (1) two instead of three cascade neural networks are used by us and signals are categorize into only three wave types, i.e., noise, P and S; (2) there are only 10 attributes all together for the input of each of the two neural networks, include signal-to-noise ratio (SNR) within 5 different frequency bands in the range of 0.5–16 Hz, rectilinearity, planarity determined by polarization analyses (e.g., Jurkevics,1988) within the frequency band of the best SNR, horizontal-to-vertical amplitude ratio also measured within the frequency band of the best SNR, and pseudo-duration time within 0.75–1.5 Hz and 8–16 Hz, respectively. Unlike the duration time defined in Eq. (2) which is measured for the whole wave-train, the pseudo-duration is measured for individual detections and it is cut off if it is longer than 20 s. The performance of the cascade neural networks is generally ok, however, still with a large portion of detections was mis-identified. For this reason, in the subsequent association processing stages, the output of the cascade neural networks actually was completely ignored. For example, in the process of searching P/S pairs, the major requirement for a detection to be accepted as the generator for searching P/S pairs is that its Dflag should be 0 or it has been identified as the direct P arrival buried in a previous wave-train as we just discussed in the last paragraph, and the requirement for the S phase is to pass tests which has been described in Sect. 3. So output of the neural networks actually has no impact on the final associated events. However, we still keep this processing stage for at least it can give a label to detections that are not associated to any event.

Our experience suggests that this novel technique presented in this paper have promising application prospect for reliable automatic seismic data processing. Actually we have applied this technique to regional seismic station networks of various scales with results generally as good as we have shown for the XJSSN. Nevertheless, it is worth to point out that so far the application of seismograms’ integral features by us is far to be optimized for many decision conditions are set by experience. It is necessary to optimize its application through statistical studies about the joint distribution behavior of both integral and local feature for different types of seismograms, as well as different kinds of seismic phases in the future to achieve more satisfying result.