Skip to main content
Log in

A Bayesian multiple changepoint model for marked poisson processes with applications to deep earthquakes

  • Original Paper
  • Published:
Stochastic Environmental Research and Risk Assessment Aims and scope Submit manuscript

Abstract

A multiple changepoint model for marked Poisson process is formulated as a continuous time hidden Markov model, which is an extension of Chib’s multiple changepoint models (J Econ 86:221–241, 1998). The inference on the locations of changepoints and other model parameters is based on a two-block Gibbs sampling scheme. We suggest a continuous time version of forward-filtering backward-sampling algorithm for simulating the full trajectories of the latent Markov chain without utilizing the uniformization method. To retrieve the optimal posterior path of the latent Markov chain, i.e. the maximum a posteriori estimation of changepoint locations, a continuous-time version of Viterbi algorithm (CT-Viterbi) is proposed. The set of changepoint locations is obtainable either from the CT-Viterbi algorithm or the posterior samples of the latent Markov chain. The number of changepoints is determined according to a modified BIC criterion tailored particularly for the multiple changepoint problems of a marked Poisson process. We then perform a simulation study to demonstrate the methods. The methods are applied to investigate the temporal variabilities of seismicity rates and the magnitude-frequency distributions of medium size deep earthquakes in New Zealand.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5

Similar content being viewed by others

References

  • Barry D, Hartigan JA (1992) Product partition models for change point problems. Ann Stat 20:260–279

    Article  Google Scholar 

  • Bebbington MS (2007) Identifying volcanic regimes using Hidden Markov Models. Geophys J Int 171:921–942

    Article  Google Scholar 

  • Chib S (1998) Estimation and comparison of multiple change-point models. J Econ 86:221–241

    Article  Google Scholar 

  • Daley DJ, Vere-Jones D (2003) An introduction to the theory of point processes. Elementary theory and Methods, vol 1. Springer, New York

    Google Scholar 

  • Elliott RJ, Siu TK, Yang H (2007) Insurance claims modulated by a hidden marked point process. Am Control Confer 2007:390–395

    Google Scholar 

  • Fearnhead P (2006) Exact and efficient bayesian inference for multiple changepoint problems. Stat Comput 16:203–213

    Article  Google Scholar 

  • Fearnhead P, Sherlock C (2006) An exact Gibbs sampler for the Markov-modulated Poisson process. J R Stat Soc B 68(5):767–784

    Article  Google Scholar 

  • Frick K, Munk A, Sieling H (2014) Multiscale change point inference. J R Stat Soc B 76(3):495–580

    Article  Google Scholar 

  • Frohlich C (2006) Deep earthquakes. Cambridge University Press, Cambridge

    Book  Google Scholar 

  • Fryzlewicz P (2014) Wild binary segmentation for multiple change-point detection. Ann Stat 42(6):2243–2281

    Article  Google Scholar 

  • Galeano P (2007) The use of cumulative sums for detection of changepoints in the rate parameter of a Poisson process. Comput Stat Data Anal 51:6151–6165

    Article  Google Scholar 

  • Green PJ (1995) Reversible Jump Markov Chain Monte Carlo computation and Bayesian model determination. Biometrika 82:711–732

    Article  Google Scholar 

  • Harchaoui Z, Lévy-Leduc C (2010) Multiple change-point estimation with a total variation penalty. J Am Stat Assoc 105:1480–1493

    Article  CAS  Google Scholar 

  • Kagan YY (2017) Worldwide earthquake forecasts. Stoch Environ Res Risk Assess 31:1273. https://doi.org/10.1007/s00477-016-1268-9

    Article  Google Scholar 

  • Kehagias A (2004) A hidden markov model segmentation procedure for hydrological and environmental time series. Stoch Environ Res Risk Assess 18(2):117–130

    Article  Google Scholar 

  • Lavielle M, Lebarbier E (2001) An application of MCMC methods for the multiple change-points problem. Signal Process 81:39–53

    Article  Google Scholar 

  • Lu S (2012) Markov modulated Poisson process associated with state-dependent marks and its applications to the deep earthquakes. Ann Inst Stat Math 64(1):87–106

    Article  Google Scholar 

  • Lu S (2017) Long-term b value variations of shallow earthquakes in New Zealand: a HMM-based analysis. Pure Appl Geophys 174:1629–1641

    Article  Google Scholar 

  • Lu S, Vere-Jones D (2011) Large occurrence patterns of New Zealand deep earthquakes: characterization by use of a switching poissn process. Pure Appl Geophys 168:1567–1585

    Article  Google Scholar 

  • Nanjo KZ, Hirata N, Obara K, Kasahara K (2012) Decade-scale decrease in b value prior to the M9-class 2011 Tohoku and 2004 Sumatra quakes. Geophys Res Lett 39:L20304. https://doi.org/10.1029/2012GL052997

    Article  Google Scholar 

  • Nuannin P, Kulhanek O, Persson L (2005) Spatial and temporal b value anomalies preceding the devastating off coast of NW Sumatra earthquake of December 26, 2004. Geophys Res Lett 32(11):L11307

    Article  Google Scholar 

  • Ogata Y (1988) Statistical models for earthquake occurrences and residual analysis for point processes. J Am Stat Assoc 83(401):9–27

    Article  Google Scholar 

  • Ramesh NI, Thayakaran R, Onof C (2013) Multi-site doubly stochastic poisson process models for fine-scale rainfall. Stoch Environ Res Risk Assess 27(6):1383–1396

    Article  Google Scholar 

  • Rao V, Teh YW (2013) Fast MCMC sampling for Markov jump processes and extensions. J Mach Learn Res 14:3295–3320

    Google Scholar 

  • Schorlemmer D, Wiemer S (2005) Microseismicity data forecast rupture area. Nature 434:1086. https://doi.org/10.1038/4341086a

    Article  CAS  Google Scholar 

  • Scott SL (2002) Bayesian methods for hidden Markov models: recursive computing in the 21st century. J Am Stat Assoc 97:337–351

    Article  Google Scholar 

  • Shen JJ, Zhang NR (2012) Change-point model on nonhomogeneous Poisson processes with application in copy number profiling by next-generation DNA sequencing. Ann Appl Stat 6(2):476–496

    Article  Google Scholar 

  • Stephens DA (1994) Bayesian retrospective multiple-changepoint identification. Appl Stat 43:159–178

    Article  Google Scholar 

  • Wiemer S, McNutt S, Wyss M (1998) Temporal and three-dimensional spatial analyses of the frequency magnitude distribution near Long Valley Caldera, California. Geophys J Int 134(2):409–421

    Article  Google Scholar 

  • Yang TY, Kuo L (2001) Bayesian binary segmentation procedure for a Poisson Process with multiple changepoints. J Comput Graph Stat 10:772–785

    Article  Google Scholar 

  • Yip CF, Ng WL, Yau CY (2018) A hidden Markov model for earthquake prediction. Stoch Environ Res Risk Assess 32:1415. https://doi.org/10.1007/s00477-017-1457-1

    Article  Google Scholar 

  • Zhang NR, Siegmund DO (2007) A modified Bayes information criterion with applications to the analysis of comparative genomic hybridization data. Biometrics 63(1):22–32

    Article  CAS  Google Scholar 

Download references

Acknowledgements

Two referees’ suggestions are acknowledged.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Lu Shaochuan.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Appendix

Appendix

Poisson process attached by exponential marks belongs to a two-parameter exponential family with canonical form:

$$\begin{aligned} \frac{d^2F_j}{dudv}=e^{\theta _ju-\Psi (\theta _j)+\eta _jv-\Phi (\eta )}. \end{aligned}$$

After fixing changepoints \(\tau\), the log-likelihood can be written in a second order Taylor series around the maximum likelihood estimate \(\hat{\varvec{\Theta }}(\tau )=arg\max \limits _{\varvec{\Theta }}l(\varvec{\Theta }, \tau )\), such that:

$$\begin{aligned} l(\varvec{\Theta }, \tau )\approx l(\hat{\varvec{\Theta }}(\tau ), \tau ) + (\varvec{\Theta }-\hat{\varvec{\Theta }}(\tau ))'H(\hat{\varvec{\Theta }}(\tau ), \tau )(\varvec{\Theta }-\hat{\varvec{\Theta }}(\tau ))/2, \end{aligned}$$
(16)

where H(.) is the Hessian matrix of the log-likelihood and \(\varvec{\Theta }=(\theta , \eta )\in {\mathcal {R}}^2.\) Therefore, after exponentiating \(l(\varvec{\Theta }, \tau )\) and treating it as a Gaussian kernel, under the Uniform priors for the changepoint locations, the marginal likelihood of the model \({\mathcal {M}}_m\) is given by

$$\begin{aligned} P({\mathcal {M}}_m|Y[0,T])=\int _{{\mathcal {D}}_m}\int _{{\mathcal {R}}^{2m}}e^{l(\varvec{\Theta }, \tau )}\frac{m!}{T^m}\,d\varvec{\Theta }d\tau =C\int _{{\mathcal {D}}_m}e^{l( {\hat{\varvec{\Theta }}}(\tau ), \tau )}|H({\hat{\varvec{\Theta }}}(\tau ), \tau )|^{-\frac{1}{2}}\frac{m!}{T^m}\,d\tau , \end{aligned}$$
(17)

where \({\mathcal {D}}_m=\{(t_0,t_1,\ldots ,t_{m+1}):0=t_0<t_1<\cdots <t_{m+1}=T\}\) and C is a normalizing constant. The data points in the sample are independent,

$$\begin{aligned} l(\varvec{\Theta }, \tau )=\sum _{i=1}^{m+1}\left\{ \theta _i(S_{\tau _i}^u-S_{\tau _{i-1}}^u)-(\tau _i-\tau _{i-1})\Psi (\theta _i) + \eta _i(S_{\tau _i}^v-S_{\tau _{i-1}}^v)-(\tau _i-\tau _{i-1})\Phi (\eta _i)\right\} , \end{aligned}$$

where \(S_{\tau _i}^u\) denotes the sum of u from time 0 to \(\tau _i\).

Obviously, the Hessian matrix is a diagonal matrix \(H(\hat{\Theta }(\tau ), \tau )=diag(\tau _1\ddot{\Psi }(\hat{\theta }_1(\tau )), (\tau _2-\tau _1)\ddot{\Psi }(\hat{\theta }_2(\tau )), \cdots , (T-\tau _m)\ddot{\Psi }(\hat{\theta }_m(\tau )), \tau _1\ddot{\Phi }(\hat{\eta }_1(\tau )), (\tau _2-\tau _1)\ddot{\Phi }(\hat{\eta }_2(\tau )), \cdots , (T-\tau _m)\ddot{\Phi }(\hat{\eta }_m(\tau )))\) and

$$\begin{aligned} |H(\hat{\Theta }(\tau ), \tau )|=\prod \limits _{i=1}^{m+1}(\tau _i-\tau _{i-1})^2\prod \limits _{i=1}^{m+1} \ddot{\Psi }(\hat{\theta }_i(\tau ))\ddot{\Phi }(\hat{\eta }_i(\tau )). \end{aligned}$$
(18)

In the denominator of Bayes factor, the marginal likelihood of the model \({\mathcal {M}}_0\) is simply given by

$$\begin{aligned} P({\mathcal {M}}_0|Y[0,T])=\int e^{l_0(\varvec{\Theta })}\,d\varvec{\Theta }=\frac{1}{2\pi } e^{l_0\left( \hat{\varvec{\Theta }}_0\right) }|\ddot{l}_0\left( \hat{\varvec{\Theta }}_0\right) |^{-\frac{1}{2}}, \end{aligned}$$
(19)

where \(\hat{\varvec{\Theta }}_0=arg\max \limits _{\varvec{\Theta }}l_0(\varvec{\Theta })\) and \(|\ddot{l}_0(\hat{\varvec{\Theta }}_0)|=T^2\ddot{\Psi }(\hat{\theta }_0)\ddot{\Phi }(\hat{\eta }_0)\).

Ignoring constant factors, the Bayes factor has approximation:

$$\begin{aligned} \int _{{\mathcal {D}}_m}e^{(l(\hat{\Theta }(\tau ), \tau )-l_0(\hat{\varvec{\Theta }}_0))}\frac{1}{T^{m-1}}\prod \limits _{i=1}^{m}(\tau _i-\tau _{i-1})^{-1}\,d\tau . \end{aligned}$$
(20)

In above equation, the log of the integrant at the maximum likelihood value \(\hat{\tau }\) is the Modified BIC. To prove the Modified BIC given in (11), it is necessary to show that the remainder term

$$\begin{aligned} \int _{{\mathcal {D}}_m}e^{l\left( \hat{\Theta }(\tau ), \tau \right) -l\left( \hat{\Theta }(\hat{\tau }), \hat{\tau }\right) }\prod \limits _{i=1}^{m}\left( \hat{\tau }_i-\hat{\tau }_{i-1}\right) /\prod \limits _{i=1}^{m}\left( \tau _i-\tau _{i-1}\right) \,d\tau . \end{aligned}$$
(21)

is uniformly bounded in T, see the proof in Zhang and Siegmund (2007).

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Shaochuan, L. A Bayesian multiple changepoint model for marked poisson processes with applications to deep earthquakes. Stoch Environ Res Risk Assess 33, 59–72 (2019). https://doi.org/10.1007/s00477-018-1632-z

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00477-018-1632-z

Keywords

Navigation