## Abstract

An experimental method for the spatial resolution analysis of the earthquake frequency-magnitude distribution is introduced in order to identify the intrinsic spatial scale of the detected seismicity phenomenon. We consider the unbounded magnitude range *m* ∈ (−∞, +∞), which includes incomplete data below the completeness magnitude *m*
_{c}. By analyzing a relocated earthquake catalog of Taiwan, we find that the detected seismicity phenomenon is scale-variant for *m* ∈ (−∞, +∞) with its spatial grain a function of the configuration of the seismic network, while seismicity is known to be scale invariant for *m* ∈ [*m*
_{c}, +∞). Correction for data incompleteness for *m* < *m*
_{c} based on the knowledge of the spatial scale of the process allows extending the analysis of the Gutenberg–Richter law and of the fractal dimension to lower magnitudes. This shall allow verifying the continuity of universality of these parameters over a wider magnitude range. Our results also suggest that the commonly accepted Gaussian model of earthquake detection might be an artifact of observation.

## Introduction

Earthquakes represent a complicated geographical phenomenon, which is shown to be self-similar in a wide range of scales (Bak
*et al.*
2002). The universal Gutenberg–Richter scaling relation (Gutenberg and Richter
1944) states that earthquake magnitudes *m* are distributed according to the exponential law

where *λ*
_{0} is the number of expected earthquakes, *a* the earthquake productivity and *b* the magnitude scaling parameter. Assuming self-similarity in space, Eq. (1) becomes

with *L* the size of the spatial cell and *D* the spatial scaling parameter or fractal dimension of earthquake epicenters (Kosobokov and Mazhkenov
1994; Molchan and Kronrod
2005). Equations (1) and (2) hold only for *m* ≥ *m*
_{c} with *m*
_{c} the completeness magnitude. It has been shown that this parameter is ambiguous with the \(\overset{\lower0.5em\hbox{$\smash{\scriptscriptstyle\frown}$}}{m}_{\text{c}}\) estimate depending at a same location on the computation method and on the spatial scale considered (Mignan and Chouliaras
2014). The behavior of seismicity at *m* < *m*
_{c} is also ambiguous due to the fact that the intrinsic spatial scale of the detected seismicity phenomenon has not been considered so far. In other words, there is an implicit assumption that the detection process is also self-similar, which has recently been shown to be incorrect (i.e., the frequency-magnitude distribution does not display the same statistical properties or same shape at different scales; see Fig. 16 in Mignan
2012).

In this article, we investigate the shape of the earthquake frequency-magnitude distribution over the range *m* ∈ (−∞, +∞) to determine the fundamental spatial unit of the geographical phenomenon that is detected seismicity (Sect. 2; Pereira
2001). Knowing the scale of the detected seismicity phenomenon (Sect. 3) allows us (i) to study the behavior of the scaling parameters *b* and *D* on a wider magnitude range by including smaller events, and (ii) to gain more insight into the earthquake detection process (Sect. 4).

For this analysis, we use the earthquake catalog of Taiwan relocated by Wu
*et al.* (2008). We consider the period 1994–2005 and volume 119°E ≤ *x* ≤ 123°E, 21°N ≤ *y* ≤ 26°N and *z* ≤ 35 km. Year 2005 corresponds to the last year analyzed in Wu
*et al.* (2008).

## Definition of the Spatial Scale of Detected Seismicity

We show in this section that the spatial scale of detected seismicity can be inferred from the analysis of the shape of the earthquake frequency-magnitude distribution (FMD). This was first suggested by Mignan (2012) who observed that the FMD shape differs between local and regional datasets. This is conceptualized below and verified in an earthquake catalog in the next section.

The FMD is described over the range *m* ∈ (−∞, +∞) by

where *q*(*m*) = Pr(Detected | *m*) is the earthquake detection function. Mignan (2012) coined the term *elemental FMD* to describe the fundamental FMD, whose shape does not depend on *m*
_{c} variations. This elemental FMD takes the form

with *k* > *b* a detection parameter. The elemental FMD has an angular shape with *q* (*m* < *m*
_{c}) an exponential law and *q* (*m* ≥ *m*
_{c}) = 1. Since *m*
_{c} = *f*(*x*,*y*), the elemental FMD only holds in an elemental spatial cell of width *L*
_{e}. The FMD observed in larger cells is the sum of elemental FMDs, leading to a rounded shape of that composite FMD (Mignan
2012). This shape can be approximated by defining *q* (*m*) by the cumulative normal distribution *ϕ* (Ringdal
1975; Ogata and Katsura
1993; 2006; Mignan
2012). Figure 1 shows an FMD at the regional scale (*L* = 5°, full catalog) and an FMD at a local scale (*L* = 0.1°, *x* = 120.95°, *y* = 23.15°). They are rounded and angular, respectively, indicating that the regional FMD is a composite FMD (trivial) and that the local FMD is an elemental FMD, which means *L* ≤ *L*
_{e} at (*x*, *y*).

The concept of elemental cell brings us to the scale of the process. Scale is defined by two components: the grain (~grid cell) and the extent (~entire grid). Those components can be artifacts of observation or attributes of phenomena (Pereira
2001). While this distinction is common practice in ecological studies, it is not the case in seismology where all processes are assumed scale invariant (Bak
*et al.*
2002; Ogata and Katsura
1993). Figure 2 represents maps with a same observational extent (the Taiwanese region). Figure 2a shows the spatial distribution of seismicity and seismic stations used in this study. Figure 2b, c shows maps of the estimator \(\overset{\lower0.5em\hbox{$\smash{\scriptscriptstyle\frown}$}}{m}_{c} = m({ \hbox{max} }(\lambda ))\) using a same display resolution (0.1° pixel) but a different observational resolution with a constant coarse grain in Fig. 2b (with radius *r* = *L*/2 = 50 km) and a variable grain in Fig. 2c (with *r* = *L*/2 = *f*(*d*
_{
n
}), *d*
_{
n
} being the distance to the *n*th seismic station). Using a variable grain function of the seismic network geometry was proposed by Mignan
*et al.* (2011) to minimize spatial heterogeneities in *m*
_{c}, *m*
_{c} varying faster in the denser parts of a seismic network. We develop upon this idea by defining the fundamental spatial unit (or intrinsic grain) of the detected seismicity by the area within which the elemental FMD is observed. Visually, the elemental FMD represents the highest possible resolution (sharp angular shape, Fig. 1b) while the composite FMD represents a lower resolution (blurred rounded shape, Fig. 1a). Note that the intrinsic extent of the earthquake detection process corresponds to the area comprising all the seismicity declared from a given seismic network (here the same as the observational extent, Fig. 2a).

## Spatial Resolution Analysis

We investigate the shape of FMDs located in a partitioned space using Voronoi tessellation (Voronoi
1908). We test *N*
_{V} = 1,000 realizations with the number of generators *n*
_{V} randomly drawn from the log_{10} space in the range [10, 10^{3}]. Each generator, or point, is defined from random geographical coordinates (*x*
_{V}, *y*
_{V}) located in the region of interest. For each point, there is a corresponding region, or Voronoi cell, consisting of all points closer to that point than to any other. We choose this type of tessellation to objectively produce areas of various shapes and sizes (e.g., Fig. 2a) with no a priori knowledge on the spatial scale of the detected seismicity phenomenon (see also Kamer and Hiemer
2015). Each Voronoi cell is characterized by an area *A*, a distance \(d_{n} = \sqrt {(x_{\text{V}} - x_{n} )^{2} + (y_{\text{V}} - y_{n} )^{2} }\) between the generator and the *n*th nearest seismic station of coordinates (*x*
_{
n
}, *y*
_{
n
}), and an FMD defined from all earthquakes located in the cell. Let us note that the distance *d*
_{
n
} represents a proxy to the spatial density of seismic stations—and therefore to the degree of *m*
_{c} variations—and that the choice of *n* (commonly between 3 and 5) is not critical (Mignan
*et al.*
2011, 2013; Kraft
*et al.*
2013; Vorobieva
*et al.*
2013; Mignan and Chouliaras
2014; Tormann
*et al.*
2014).

Two FMD models are tested and compared using the Bayesian Information Criterion \({\text{BIC}} = - 2 {\text{ log}}\overset{\lower0.5em\hbox{$\smash{\scriptscriptstyle\frown}$}}{L} + K{ \log }N\) with \(\overset{\lower0.5em\hbox{$\smash{\scriptscriptstyle\frown}$}}{L} \;\) the maximized value of the likelihood function *f* of the model, *K* the number of free parameters and *N* the number of data points (Schwarz
1978). The elemental FMD model (Fig. 1b) is described by the following likelihood function

where \(\kappa = k{ \log }\left( {10} \right)\) and \(\beta = b{ \log }\left( {10} \right)\) (Mignan
2012). In practice, we use a faster method introduced by Kamer (2014), which takes advantage of the piecewise structure of Eq. (4), uses the Aki maximum likelihood method (Aki
1965) and shows that *K* = 2 instead of 3 (i.e., *b*/(*k* − *b*) = *N*
_{
i
}/*N*
_{c} with *N*
_{c} and *N*
_{
i
} the number of events in the complete and incomplete parts of the FMD, respectively). The composite FMD model (Fig. 1a) is described by

where *Φ*(*m* | μ, *σ*) is the cumulative normal distribution and *K* = 3 (Ogata and Katsura
1993; 2006). The two models represent homogeneous data (elemental area with constant *m*
_{c}) and heterogeneous data (composite area with variable *m*
_{c}), respectively, which allows us to differentiate intrinsic grains and extrinsic grains relative to the spatial scale of detected seismicity.

For each Voronoi cell of all *N*
_{
V
} realizations (yielding a total of ~60,000 cells), we calculate the distance *d*
_{5} to the fifth nearest seismic station, the width of the cell \(L = 2\sqrt {A/\pi }\) and the FMD model choice *C*

Event location in the Taiwanese national catalog requires at least three stations and at least five phases *P* or *S* and it is therefore equally reasonable to use *n* = 3, 4 or 5 in *d*
_{
n
}. Mignan
*et al.* (2011) showed that the models *m*
_{c} = *f*(*d*
_{
n
}) with *n* = {3, 4, 5} give similar results in Taiwan. We then compute the ratio ∑*C*
_{b}/n_{b} with *n*
_{b} the number of choices *C*
_{b}, which are all the choices *C* located in Δ*d*
_{5} × Δ*L* bins defined in the (*d*
_{5}, *L*) space. We fix Δ*d*
_{5} = Δ*L* = 10 km ~0.1° (Fig. 3). This ratio links the observational spatial resolution (or grain) *L* to the intrinsic spatial resolution *L*
_{e}. Figure 3 shows that a constant observational spatial grain *L* (i.e., any horizontal line) is a poor estimator of the detected seismicity phenomenon since it includes heterogeneities in the FMD shape potentially leading to artifacts of observation at *m* ≤ *m*
_{c}. Considering 10 km < *L* < 150 km only (removal of earthquake location errors and of boundary effects linked to the spatial extent of the region), we find

the boundary (black line) between observational grains *L* which contain heterogeneities (in purple, \(L > \overset{\lower0.5em\hbox{$\smash{\scriptscriptstyle\frown}$}}{L}_{e}\)) and which are homogeneous (in red, \(L \le \overset{\lower0.5em\hbox{$\smash{\scriptscriptstyle\frown}$}}{L}_{e} \;\)). This relationship verifies that *m*
_{c} varies faster in the denser parts of a seismic network, as independently observed by Mignan
*et al.* (2011), and describes the intrinsic spatial scale of the detected seismicity phenomenon. This phenomenon is thus shown to be scale-variant for *m* ∈ (−∞, +∞) with its spatial grain a function of the configuration of the seismic network (i.e., \(\overset{\lower0.5em\hbox{$\smash{\scriptscriptstyle\frown}$}}{L}_{e} \; = f\left( {d_{n} } \right))\), which is opposed to the scale invariance of seismicity known for *m* ∈ [*m*
_{c}, +∞) for which the Gutenberg–Richter law holds and the network configuration is irrelevant.

## Discussion

Does a better understanding of the spatial scale of the detected seismicity phenomenon allow in turn a better understanding of the behavior of seismicity below completeness and of the process of earthquake detection itself? In the aim of initiating a debate on such challenging issues, we develop upon these two important aspects with some basic illustrations.

### Implications for the Universal Scaling Parameters b and D

We investigate if our results may help extending the analysis of the Gutenberg–Richter law and of the fractal dimension to lower magnitudes *m* < *m*
_{c}. We first test Eq. (2) in Taiwan by measuring the number of events

with *N*
_{
i
} the number of events of magnitude *m* in cell *i* of size *L* = *L*
_{0}/2^{h}, *L*
_{0} = 4° and *h* = {0, …, 5} the level of hierarchy (Kosobokov and Mazhkenov
1994). Figure 4a shows that a data collapse is obtained on the Gutenberg–Richter law of slope *b* = 0.9 (solid line) when normalizing to *N* (*m*, *L*)/*L*
^{D} with *D* = 1.2, which is in agreement with results in other regions for earthquake epicenters (Bak
*et al.*
2002). For illustration purposes, we here assume that Eq. (2) is verified if *N* (*m*, *L*)/*L*
^{D} is within a factor 2 of the theoretical curve (dotted lines). The law is verified in the range 2.1 ≤ *m* ≤ 5.0 for *h* ∈ [0, 5] while the data deviate from the law at *m* < 2.1 for all levels of hierarchy *h* (i.e. *m*
_{c} = 2.1).

We then consider the corrected case *N*
_{ic}(*m*) = *N*
_{
i
}(*m*)/*q*
_{
i
}(*m*) with *q*
_{
i
} the detection function (see Eq. 3). Since this approach assumes that the Gutenberg–Richter law holds over the unbounded magnitude range, any deviation from the law observed after correction for incompleteness would suggest that a different physical process is in play at low magnitudes. Using a fixed *L* in Eq. (9) means that the *q* model depends on cell *i* (Fig. 3; Eq. 8). We use the cumulative normal distribution model for \(L > \overset{\lower0.5em\hbox{$\smash{\scriptscriptstyle\frown}$}}{L}_{e}\) (heterogeneous composite model; Fig. 1a) and the exponential model for \(L \le \overset{\lower0.5em\hbox{$\smash{\scriptscriptstyle\frown}$}}{L}_{e}\) (homogeneous elemental model; Fig. 1b). Results are shown in Fig. 4b and indicate that the data collapse continues in the range 0.4 ≤ *m* < 2.1 for at least some *h* when the seismicity data are corrected for incompleteness. Scattering remains high and relates to under-sampling of the original data at low *m* values, similarly to the under-sampling observed at high *m* values. The scattering at low magnitudes is, however, lower here than in the case where only one detection model *q* would be used. These results suggest that the behavior of events at *m* < *m*
_{c} might be informative about *b* and *D* once the scale variance of the detected seismicity phenomenon is considered. This has yet to be confirmed by robust sensitivity analyses in various applications (e.g., in *b*-value mapping; Tormann
*et al.*
2014; Kamer and Hiemer
2015).

### Implications for the Earthquake Detection Process

The Gaussian detection model *q* (*m*) = Φ(*m* | μ, σ) was proposed long ago based on the assumptions that both the true magnitude *m*
_{A} and the threshold magnitude *m*
_{T} are normal variables—an event being detected if *m*
_{A} > *m*
_{T} (Ringdal
1975). This would reflect a lognormal distribution of the seismic noise amplitude (Richter
1935). Noise amplitude, defined as

with *A* (*t*) the raw amplitude time series, has been approximated as lognormal in studies of station detection capability (Freedman
1967; Zheng
*et al.*
2012).

An exponential detection model of magnitudes (Eq. 4) means a linear detection model of amplitudes and therefore suggests a triangular distribution of the noise amplitude *A* (*t*). This is illustrated in Fig. 5a where 100 s of waveform data from the WSF station (east–west direction) of the Taiwanese seismic network is shown for comparison (use of other stations, directions and time periods does not significantly change the results). The model *q*(*A*) = Pr(Detected | *A*) ∝ *A* for *A* < *A*
_{c}, with *A*
_{c} the completeness amplitude, is sketched in gray. Figure 5b shows the same empirical distribution in terms of *A*
_{norm}, which shows some skewness in the log_{10} space. Here, we simulate 1000 *A* (*t*) samples from the triangular distribution shown in Fig. 5a and compute the *A*
_{norm} distribution (0.05 and 0.95 quantiles as dotted gray curves in Fig. 5b). While a systematic analysis of waveform data would be needed to prove or disprove a triangular distribution of the seismic noise, our aim is not here to validate the empirical model of Mignan (2012) but to very briefly consider the earthquake detection process in a novel way, by looking at the possible relationship between noise amplitude distribution and FMD shape (Fig. 5). Such a link has never been made, to the best of our knowledge.

Our results, however, emphasize the fallacy of conclusions on the earthquake detection process inferred from regional FMD analyses. Use of the cumulative normal distribution Φ(*m* | μ, σ) is often justified based on the curved shape of regional FMDs (Ringdal
1975; Ogata and Katsura
1993) although its extrapolation to higher resolutions has already been shown to be incorrect (Mignan
2012). Therefore, the parameters μ and σ of Φ have no physical meaning and only represent an artifact of observation due to the difference between observational scale and the intrinsic scale of the detected seismicity phenomenon previously described. In this view, the physical origin of parameter *k* of Eq. (4) has yet to be understood and interpreted in terms of amplitude measurement uncertainty.

## Conclusions

Earthquake detectability issues have always hampered the study of seismicity at the lower magnitudes. From rule of thumb, using only *m* ≥ *m*
_{c} means that up to about half of all the data are systematically discarded. In this article, we have identified the intrinsic spatial scale of the detected seismicity over *m* ∈ (−∞, +∞) (Fig. 3; Eq. 8), which allows us to better understand changes in the shape of the FMD, and therefore to reduce ambiguity on *m*
_{c} and to correct for data incompleteness.

As noted by Pereira (2001), not considering the intrinsic scale of the process leads to artifacts of observation, which may explain why data below *m*
_{c} had previously been considered unstable (Kagan
2002). While the usefulness of the data below *m*
_{c} has yet to be confirmed in future studies, there would be a clear advantage in being able to exploit this information. Statistical analyses would gain using larger datasets, for instance in the investigation of the prognostic value of earthquake precursors (see meta-analysis by Mignan
2014), which remains one of the main challenges in the field of statistical seismology.

## References

Aki, K. (1965),

*Maximum Likelihood Estimate of b in the Formula log N**=**a – bM and its Confidence Limits*, Bull. Earthquake Res. Inst. Univ. Tokyo,*43*, 237–239.Bak, P., K. Christensen, L. Danon and T. Scanlon (2002),

*Unified Scaling Law for Earthquakes*, Phys. Rev. Lett.,*88*, 178501.Freedman, H. W. (1967),

*Estimating earthquake magnitude*, Bull. Seismol. Soc. Am.,*57*, 747–760.Gutenberg, B. and C. F. Richter (1944),

*Frequency of earthquakes in California*, Bull. Seismol. Soc. Am.,*34*, 185–188.Kagan, Y. Y. (2002),

*Seismic moment distribution revisited: I. Statistical results*, Geophys. J. Int.,*148*, 520–541.Kamer, Y. (2014),

*Minimum sample size for detection of Gutenberg-Richter’s b-value*, arXiv:1410.1815v1.Kamer, Y. and S. Hiemer (2015),

*Data-driven spatial b-value estimation with applications to California seismicity: To b or not to b*, J. Geophys. Res. Solid Earth, in press, doi:10.1002/2014JB011510.Kosobokov, V. G. and S. A. Mazhkenov (1994),

*On similarity in the spatial distribution of seismicity*, Computational Seismol. Geodyn.,*1*, 6–15.Kraft, T., A. Mignan and D. Giardini (2013),

*Optimization of a large-scale microseismic monitoring network in northern Switzerland*, Geophys. J. Int.,*195*, 474–490, doi:10.1093/gji/ggt225.Mignan, A., M. J. Werner, W. Wiemer, C.-C. Chen and Y.-M. Wu (2011),

*Bayesian Estimation of the Spatially Varying Completeness Magnitude of Earthquake Catalogs*, Bull. Seismol. Soc. Am.,*101*, 1371–1385, doi:10.1785/0120100223.Mignan, A. (2012),

*Functional shape of the earthquake frequency-magnitude distribution and completeness magnitude*, J. Geophys. Res.,*117*, B08302, doi:10.1029/2012JB009347.Mignan, A., C. Jiang, J. D. Zechar, S. Wiemer, Z. Wu and Z. Huang (2013),

*Completeness of the Mainland China Earthquake Catalog and Implications for the Setup of the China Earthquake Forecast Testing Center*, Bull. Seismol. Soc. Am.,*103*, 845–859, doi:10.1785/0120120052.Mignan, A. (2014),

*The debate on the prognostic value of earthquake foreshocks: A meta-analysis*, Sci. Rep.,*4*, 4099, doi:10.1038/srep04099.Mignan, A. and G. Chouliaras (2014), Fifty Years of Seismic Network Performance in Greece (1964–2013):

*Spatiotemporal Evolution of the Completeness Magnitude*, Seismol. Res. Lett.,*85*, 657–667, doi:10.1785/0220130209.Molchan, G. and T. Kronrod (2005),

*On the spatial scaling of seismicity rate*, Geophys. J. Int.,*162*, 899–909, doi:10.1111/j.1365-246X.2005.02693.x.Ogata, Y. and K. Katsura (1993),

*Analysis of temporal and spatial heterogeneity of magnitude frequency distribution inferred from earthquake catalogues*, Geophys. J. Int.,*113*, 727–738.Ogata, Y. and K. Katsura (2006),

*Immediate and updated forecasting of aftershock hazard*, Geophys. Res. Lett.,*33*, L10305, doi:10.1029/2006GL025888.Pereira, G. M. (2001),

*A Typology of Spatial and Temporal Scale Relations*, Geographical Analysis,*34*, 21–33.Richter, C. F (1935),

*An instrumental earthquake magnitude scale*, Bull. Seismol. Soc. Am.,*25*, 1–32.Ringdal, F. (1975),

*On the estimation of seismic detection thresholds*, Bull. Seismol. Soc. Am.,*65*, 1631–1642.Schwarz, G. (1978),

*Estimating the dimension of a model*, Ann. Stat.,*6*, 461–464.Tormann, T., S. Wiemer and A. Mignan (2014),

*Systematic survey of high-resolution b value imaging along Californian faults: inference on asperities*, J. Geophys. Res. Solid Earth,*119*, 2029–2054, doi:10.1002/2013JB010867.Vorobieva, I. C. Narteau, P. Shebalin, F. Beauducel, A. Nercessian, V. Clouard and M.-P. Bouin (2013),

*Multiscale Mapping of Completeness Magnitude of Earthquake Catalogs*, Bull. Seismol. Soc. Am.,*103*, 2188–2202, doi:10.1785/0120120132.Voronoi, G. F. (1908),

*Nouvelles applications des paramètres continus à la théorie de forms quadratiques*, J. für die Reine und Angewandte Mathematik,*134*, 198–287.Wu, Y.-M., C.-H. Chang, L. Zhao, T.-L. Teng and M. Nakamura (2008),

*A Comprehensive Relocation of Earthquakes in Taiwan from 1991 to 2005*, Bull. Seismol. Soc. Am.,*98*, 1471–1481, doi:10.1785/0120070166.Zheng, Z., T. Takaishi, N. Sakurai, X. Zhang and K. Yamasaki (2012),

*Statistical Regularities of Seismic Noise*, Progr. Theoretical Phys. Suppl.,*194*, 193–201.

## Acknowledgements

We thank three anonymous reviewers for their useful comments.

## Author information

### Affiliations

### Corresponding author

## Rights and permissions

**Open Access** This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.

## About this article

### Cite this article

Mignan, A., Chen, CC. The Spatial Scale of Detected Seismicity.
*Pure Appl. Geophys.* **173, **117–124 (2016). https://doi.org/10.1007/s00024-015-1133-7

Received:

Revised:

Accepted:

Published:

Issue Date:

### Keywords

- Spatial scale
- Earthquake magnitude
- Earthquake detection
- Completeness magnitude