1 Introduction

The Gutenberg–Richter magnitude distribution (Gutenberg and Richter 1944; Richter 1958) is possibly the most widely used statistical relationship in seismology, and its slope b, widely known as the b-value, characterizes the ratio of small to large magnitudes. The b-value, hereafter referred to as b, is considered to be the characteristic for a given region, and many authors (Scholz 1968; Wyss 1973; Smith 1981; Wiemer and Benoit 1996; Enescu and Ito 2001; Nuannin et al. 2005; De Santis et al. 2011; Mallika et al. 2013) maintain that it decreases slightly before large earthquakes; De Santis et al. (2011) suggest that the knowledge of b is useful to characterize the information entropy of the system. Thus, knowing the correct value, within uncertainty bounds, of b is important for seismic hazard studies, because it allows the estimation of the occurrence rate of earthquakes for any given magnitude range, because of its possible role as a precursor and because other information it contains.

Because of the many imponderables affecting the transmission of energy from source to seismograph, including travel path, radiation pattern, site effects, variations in instrument response, and reading errors, in current practice, the magnitudes are usually rounded up to the first decimal place, so that the methods used to evaluate b experimentally usually work with data binned in classes ΔM = 0.1 wide.

In the present work, we discuss the effects of rounding and of noise in the determination of b and propose a way to correct them. The effects of rounding have been previously dealt with by, among others, Utsu (1965), Bender (1983), Tinti and Mulargia (1987), Kijko and Sellevoll (1992), Zúñiga and Wyss (1995), Rhoades (1996), Marzocchi and Sandri (2003), and Bengoubou-Valerius and Gibert (2012); here, we present our estimate of these effects and a new way of correcting the b measurements. The difference between our version and those of the aforementioned authors is that the correction method we propose is more simple and straightforward and does not involve complicated calculations.

2 Correcting the effects of rounding

In what follows, we present our own version of the effects of rounding, in order to introduce the notation that will be used in proposing a correction for them.

The Gutenberg–Richter (G–R) distribution

$$ \log_{10} N(M) = a - b(M - M_{1} );\quad M \ge M_{1} , $$
(1)

where N(M) is the number of earthquakes with magnitude ≥M and a is the log10 of the total number of events with M ≥ M 1 and depends on the overall seismicity rate and on the sampling time; an important issue in the G–R distribution is the magnitude of completeness, M 1 (Wiemer and Wyss 2000); for the magnitude population, we used several values of M 1 with the same results, and it was finally fixed at M 1 = 2; b has been discussed above.

The G–R is a reverse cumulative distribution that implies that magnitudes are distributed exponentially, with probability density function:

$$ f(M) = \beta \;{\rm{e}}^{{ - \beta \,(M - M_{1} )}} \,;\quad M \ge M_{1}, $$
(2)

where

$$ \beta = \frac{b}{{\log_{10} (\text{e} )}}\;. $$
(3)

From (2) and (3), Aki (1965) determined the maximum-likelihood b estimate to be

$$ b = \frac{{\log_{10} ({\text{e}})}}{{\overline{M} - M_{1} }}. $$
(4)

Equation (4) refers to “exact” magnitudes, but when the M population, distributed according to (2), is rounded to some ΔM, then the class containing magnitude M in fact contains all magnitudes in the interval [M − ΔM/2, M + ΔM/2), so that if M 1 is the minimum rounded magnitude, the effective minimum magnitude is M min = M 1 − ΔM/2, and the correct distribution is

$$ p(M) = \beta \text{e}^{{ - \beta \,[M - (M_{1} - \Delta M/2)]}} . $$
(5)

Hence, Utsu’s (1965) method of estimating the maximum-likelihood b is as follows:

$$ b = \frac{{\log_{10} \text{e} }}{{\overline{M} - (M_{1} - \Delta M/2)}} . $$
(6)

Thus, Aki’s formula is not applicable to practical cases where rounded magnitudes are used, unless M 1 in (4) actually refers to the minimum unrounded magnitude, in which case Aki’s formula coincides with (5). The mistaken application of (4) with the minimum rounded magnitude as M 1 will yield too high estimations of b.

Figure 1 illustrates the application of (4) and (6) to populations of different sizes and clearly shows the overestimation of b by Aki’s formula; the horizontal line indicates the b-value used to generate the magnitude population. Also shown in Fig. 1, the sample sizes larger than 450 are necessary for estimations with standard deviations within ±0.05 of the true value (dotted horizontal lines), and the sample sizes proposed as sufficient by Aki (1965) and Shi and Bolt (1982), ~50 and ~100 events, respectively, may be too optimistic; we have seen studies with b estimations based on ~50 event samples (Sahu and Saikia 1994; Monterroso and Kulhánek 2003; Spada et al. 2013; Sharma et al. 2013) and can only wonder about the reliability of their results.

Fig. 1
figure 1

Estimated \( \hat{b} \), from Aki’s and Utsu’s formulae, as a function of sample size. The magnitude population was randomly generated with b = 0.8, b = 1.0, and b = 1.2. Each symbol represents the mean of 100 realizations, and the vertical bars indicate plus/minus one standard deviation

Furthermore, working with rounded magnitudes has other consequences that are not completely described by (6). The probability distribution of rounded magnitudes is actually a discrete one, so that the probability of events in the class centered on rounded magnitude M (class M) is

$$ \begin{aligned} \Pr (M)& = \int\limits_{M - \Delta M/2}^{M + \Delta M/2} {\beta \;\text{e}^{{ - \beta \,[m - (M_{1} - \Delta M/2)]}} } \text{d} m \\ & = \;\text{e}^{{ - \beta [M - (M_{1} - \Delta M/2)]}} \left( {\text{e}^{ + \beta \,\Delta M/2} - \text{e}^{ - \beta \,\Delta M/2} } \right)\;, \hfill \\ \end{aligned} $$

and, expanding the exponentials within the parentheses as

$$ e^{x} \, = \,1\, + \,x\, + \,\frac{{x^{2} }}{2!}\, + \,\frac{{x^{3} }}{3!}\, + \,\frac{{x^{n} }}{n!}\, + \, \cdots $$

(e.g., Dwight (1961), the resulting equation is

$$ \Pr \left( M \right) = \beta {\text{e}}^{{ - \beta [M - \left( {M_{1} - \Delta M/2} \right]}} \Delta M\left[ {1 + \beta ^{2} \Delta M^{2} /\left( {2^{2} 3!} \right) + \beta ^{4} \Delta M^{4} /\left( {2^{4} 5!} \right) + \cdots } \right]. $$
(7)

The first term in brackets, multiplied by the common factor, is the probability density of magnitude M, multiplied by ΔM; for ΔM → dM, all other terms tend rapidly to zero and (7) gives the correct expression for the probability of M.

However, for finite ΔM, all but the first terms in brackets account for the fact that \( \beta \exp \left\{ { - \beta \left[ {M - \left( {M_{1} - \Delta M/2} \right)} \right]} \right\} \) is not an unbiased estimator of the probability in class M, and Eq. (7) tells us that, for finite ΔM, the rounded magnitudes are not distributed strictly exponentially, but rather as a sum of scaled copies of the exponential distribution.

Consequently, the correct mean of the rounded magnitudes will be

$$ \begin{aligned} \bar{M} &=\, \hat{\bar{M}}\left[ {1 + \beta^{2} \;\Delta M^{2} /(2^{2} \,3!) + \beta^{4} \;\Delta M^{4} /(2^{4} \,5!) + \cdots } \right] \\ \; &\doteq \, \hat{\bar{M}}\left[ {1 + b^{2} \;\Delta M^{2} \;\ln (10)^{2} /(2^{2} \,3!) + \cdots } \right],{\kern 1pt} \hfill \\ \end{aligned} $$
(8)

where \( \hat{\bar{M}} \) is the measured mean, so that b will be overestimated when using the measured mean from (8) in (6), because of the contribution of all but the first terms in brackets. In the bottom line, where the measured mean is expressed in terms of b, we have kept only the first two terms in the expansion, because, for typical b and ΔM values, contributions from the other terms are of the order of 10−6 or less.

Other authors have obtained expressions similar to (8), but they do not specify how to obtain the corrected \( \bar{M} \) estimation, since the correction depends on the still unknown value of b. The problem is, however, easily solved by equating (8) with the expression for \( \bar{M} \) as a function of b obtained from (6), which results in the simple polynomial

$$ 0.220912\;\Delta M^{2} \;b^{3} + (\hat{\bar{M}} - M_{\hbox{min} } )\;b\; - 0.434294 = 0 , $$
(9)

which can be easily solved for a real root not far from the overestimated value obtained from the measured mean.

Figure 2 illustrates the effects of rounding and of the proposed correction for Monte-Carlo simulation of 50 synthetic random samples of 10,000 magnitudes each, all generated using b = 0.8, 1.0, and 1.2 and rounded to ΔM = 0.1; for ease of view, they have been sorted according to the “true” observed \( \hat{b} \), i.e., the value actually estimated from the samples before rounding.

Fig. 2
figure 2

Comparison of b-values for Monte-Carlo simulations; squares are estimates from “exact” (unrounded) magnitudes, circles are the corresponding estimates after rounding, and rhombs are corrected rounded estimates

From (8), it is clear that the b overestimation error increases with b and with ΔM; for the example illustrated in Fig. 2, the mean error \( \hat{b}_{\text{exact}} \, - \,\hat{b}_{\text{rounded}} \) is −0.00222, while the mean error \( \hat{b}_{\text{exact}} \, - \,\hat{b}_{\text{corrected}} \) is −0.000011.

3 The effect of noise

Until now, we have considered rounded magnitudes derived from “exact” magnitudes, i.e., magnitudes generated according to (5) with no error (ignoring computer round-off errors). Actually, as mentioned before, magnitudes are rounded because of uncertainties inherent in their estimation, but the rounding is generally done actually considering the estimates as being exact. Thus, the effect of the uncertainties, which can be considered as noise superimposed on the exact magnitudes, on the estimate rounding should be considered.

For a total population of N magnitudes distributed exponentially with parameter β and classes of width ΔM, the number of events in the ith class may be approximated by

$$ n_{i} = \beta {\rm{e}}^{{ - \beta M_{i} }} \Delta M\,N, $$
(10)

corrected if desired by (7), and

$$ \bar{M} = \frac{1}{N}\sum\limits_{i}^{{}} {n_{i} M_{i} }. $$
(11)

The number of events in class i − k is

$$ n_{i - k} = \beta {\rm{e}}^{{ - \beta (M_{i} - k\Delta M)}} \Delta MN = n_{i} {\rm{e}}^{\beta k\Delta M} , $$
(12)

and that in class i − k

$$ n_{i + k} = \beta {\rm{e}}^{{ - \beta (M_{i} + k\Delta M)}} \Delta MN = n_{i} {\rm{e}}^{ - \beta k\Delta M} . $$
(13)

Consider now the effect of adding variations (noise) ν to the rounded exact magnitudes in (10); let \( p_{0} = \Pr \left( {\left| v \right| > \Delta M/2} \right) \) be the probability that a given modified magnitude from a given class i will be assigned to some other class (after rounding to ΔM). The probabilities that the variation of a magnitude belonging to class i − k or class i + k will place it in class i are \( p_{k}^{ - } = \Pr \left( {\Delta M\left[ {k - 1/2} \right] < v > \Delta M\left[ {k + 1/2} \right]} \right) \) and \( p_{k}^{ + } = \Pr \left( { - \Delta M\left[ {k - 1/2} \right] < v > - \Delta M\left[ {k + 1/2} \right]} \right) \), respectively.

Thus, the new number of events in class i, \( \hat{n}_{i} \), will be

$$ \hat{n}_{i} = n_{i} - p_{0} n_{i} + \sum\limits_{k = 1}^{\infty } {(p_{k}^{ - } n_{i - k} + p_{k}^{ + } n_{i + k} )} , $$

from (6) and (7),

$$ \hat{n}_{i} = n_{i} - p_{0} n_{i} + \sum\limits_{k = 1}^{\infty } {(p_{k}^{ - } n_{i} {\rm{e}}^{\beta k\Delta M} + p_{k}^{ + } n_{i} {\rm{e}}^{ - \beta k\Delta M} )} , $$
$$ \hat{n}_{i} = n_{i} \left[ {1 - p_{0} + \sum\limits_{k = 1}^{\infty } {(p_{k}^{ - } {\rm{e}}^{\beta k\Delta M} + p_{k}^{ + } {\rm{e}}^{ - \beta k\Delta M} )} } \right] \equiv n_{i} \xi . $$
(14)

The summation in (14) is done over permissible values of k while contributions are significant. Note that the ξ factor does not depend on i and is the same for all classes; thus, the new distribution will be a scaled version of the original one. The new numbers in each class come from a new exponential distribution

$$ \hat{n}_{i} = \hat{\beta }{\rm{e}}^{{ - \hat{\beta }M_{i} }} \Delta M\,N, $$
$$ \hat{\bar{M}} = \frac{1}{N}\sum\limits_{i}^{{}} {\hat{n}_{i} M_{i} } = \frac{\xi }{N}\sum\limits_{i}^{{}} {n_{i} M_{i} } = \xi \bar{M}, $$
$$ \therefore \quad \hat{\beta } = \frac{1}{{\hat{\bar{M}}}} = \frac{1}{{\xi \bar{M}}} = \frac{\beta }{\xi }, $$
$$ \hat{b} = \frac{b}{\xi } . $$
(15)

We have set no constraints to the noise distribution, so that it may have any shape, symmetrical or not, and may even be a function of the magnitude.

Thus, for any estimate about the distribution of noise, the measured estimate \( \hat{b} \) can be corrected by the factor ξ, evaluated according to (14), to obtain a better estimate of b.

On the practical side, since magnitudes are usually rounded up to the first decimal place, data are typically binned in classes ΔM = 0.1 wide. On the other hand, since magnitudes are customarily considered to have ±0.1 uncertainty and since usually the stated uncertainties correspond to 1 or 2 standard deviations, the customary uncertainty could be interpreted as saying that the variations in magnitude have σ = 0.1 or σ = 0.05 standard deviations, respectively.

As an example, let ΔM = 0.1 and consider variations distributed as N(0, σ), so that p k  = p + k , with σ = 0.1; if the original b = 1.0, then from (14) and (15) ξ = 1.029134 and \( \hat{b} = 0. 9 7 1 6 9 1 \), so that the variations are expected to cause a change \( \updelta b\,\sim {-}0.028 \) in the estimation of b. This estimation of the effect of noise on rounding is corroborated through Monte-Carlo simulation as shown in Fig. 3, where each point represents the mean of 100 realizations of b estimation for different population sizes; magnitudes were generated according to (5) with b = 0.8, 1.0, and 1.2, and noise distributed according to N (0, σ = 0.1) was added; then magnitudes were rounded, and b estimates were obtained using Utsu’s formula. Only multiplying the b estimates by ξ, we reach the actual value.

Fig. 3
figure 3

Estimated \( \hat{b} \), from Utsu’s formula, as a function of sample size N. The magnitude population was randomly generated with b = 0.8, b = 1.0, and b = 1.2, and noise distributed according to N (0, σ = 0.1) was added. The straight horizontal continuous line represents the true b-value, and dotted horizontal lines indicate ±0.05. Each symbol represents the mean of 100 realizations, and the vertical bars indicate plus/minus one standard deviation

4 Discussion and conclusions

The effect of rounding on b estimates made using Utsu’s formula is, for typical values of b and ΔM, ∼10−3; this effect may not be significant for most applications (including those using b estimates based on small samples), but could be significant when realistic uncertainties in magnitude determination demand using a larger ΔM. On the other hand, it is quite easy to bring the error caused by rounding to truly insignificant levels by the correction proposed in (9).

The effect of noise on b estimates from rounded magnitudes can be, for typical values, ∼10−2 or larger, so that it could be significant; fortunately, even a rough estimate of the noise level and distribution can be used to approximately correct the estimates through (14) and (15).