Bayesian parameter inference for partially observed stopped processes

Jasra, Ajay; Kantas, Nikolas; Persing, Adam

doi:10.1007/s11222-012-9348-2

Bayesian parameter inference for partially observed stopped processes

Published: 30 August 2012

Volume 24, pages 1–20, (2014)
Cite this article

Statistics and Computing Aims and scope Submit manuscript

Ajay Jasra¹,
Nikolas Kantas² &
Adam Persing³

403 Accesses
4 Citations
Explore all metrics

Abstract

We consider Bayesian parameter inference associated to partially-observed stochastic processes that start from a set B ₀ and are stopped or killed at the first hitting time of a known set A. Such processes occur naturally within the context of a wide variety of applications. The associated posterior distributions are highly complex and posterior parameter inference requires the use of advanced Markov chain Monte Carlo (MCMC) techniques. Our approach uses a recently introduced simulation methodology, particle Markov chain Monte Carlo (PMCMC) (Andrieu et al. 2010), where sequential Monte Carlo (SMC) (Doucet et al. 2001; Liu 2001) approximations are embedded within MCMC. However, when the parameter of interest is fixed, standard SMC algorithms are not always appropriate for many stopped processes. In Chen et al. (2005), Del Moral (2004), the authors introduce SMC approximations of multi-level Feynman-Kac formulae, which can lead to more efficient algorithms. This is achieved by devising a sequence of sets from B ₀ to A and then performing the resampling step only when the samples of the process reach intermediate sets in the sequence. The choice of the intermediate sets is critical to the performance of such a scheme. In this paper, we demonstrate that multi-level SMC algorithms can be used as a proposal in PMCMC. In addition, we introduce a flexible strategy that adapts the sets for different parameter proposals. Our methodology is illustrated on the coalescent model with migration.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Sequential Monte Carlo with transformations

Article Open access 17 November 2019

Markov Chain Monte Carlo Algorithms for Bayesian Computation, a Survey and Some Generalisation

Efficient $$\hbox {SMC}^2$$ schemes for stochastic kinetic models

Article Open access 10 November 2017

References

Andrieu, C., Doucet, A., Holenstein, R.: Particle Markov chain Monte Carlo methods (with discussion). J. R. Stat. Soc. B 72, 269–342 (2010)
Article MathSciNet Google Scholar
Andrieu, C., Doucet, A., Tadic, V.: On-line parameter estimation in general state-space models using pseudo-likelihood. Technical report, University of Bristol (2009)
Andrieu, C., Roberts, G.O.: The pseudo-marginal approach for efficient Monte Carlo computations. Ann. Stat. 37, 697–725 (2009)
Article MATH MathSciNet Google Scholar
Bahlo, M., Griffiths, R.C.: Coalescence time for two genes from a subdivided population. J. Math. Biol. 43, 397–410 (2000)
Article MathSciNet Google Scholar
Bibbona, E., Ditlevsen, S.: Estimation in discretely observed Markov processes killed at a threshold. Scand. J. Stat. (2012, to appear). arXiv:1011.1356v1
Blom, H.A.P., Bakker, G.J., Krystul, J.: Probabilistic reachability analysis for large scale stochastic hybrid systems. In: Proc. 46th IEEE Conf. Dec. Contr., New Orleans, USA (2007)
Google Scholar
Casella, B., Roberts, G.O.: Exact Monte Carlo simulation of killed diffusions. Adv. Appl. Probab. 40, 273–291 (2008)
Article MATH MathSciNet Google Scholar
Cerou, F., Guyader, A.: Adaptive multilevel splitting for rare-events analysis. Stoch. Anal. Appl. 25, 417–433 (2007)
Article MATH MathSciNet Google Scholar
Cerou, F., Del Moral, P., Furon, T., Guyader, A.: Sequential Monte Carlo for rare event estimation. Stat. Comput. 22, 795–808 (2012)
Article MATH MathSciNet Google Scholar
Cerou, F., Del Moral, P., Guyader, A.: A non asymptotic variance theorem for unnormalized Feynman-Kac particle models. Ann. Inst. Henri Poincaré Probab. Stat. 47, 629–649 (2011)
Article MATH Google Scholar
Chen, Y., Xie, J., Liu, J.S.: Stopping-time resampling for sequential Monte Carlo methods. J. R. Stat. Soc. B 67, 199–217 (2005)
Article MATH MathSciNet Google Scholar
Chopin, N., Jacob, P., Papaspiliopoulos, O.: SMC²: A sequential Monte Carlo algorithm with particle Markov chain Monte Carlo updates. J. R. Stat. Soc. B (2012, to appear) arXiv:1101.1528v2
Dean, T., Dupuis, P.: Splitting for rare event simulation: a large deviations approach to design and analysis. Stoch. Process. Appl. 119, 562–587 (2009)
Article MATH MathSciNet Google Scholar
De Iorio, M., Griffiths, R.C.: Importance sampling on coalescent histories. II: Subdivided population models. Adv. Appl. Probab. 36, 434–454 (2004)
Article MATH Google Scholar
Del Moral, P.: Feynman-Kac Formulae: Genealogical and Interacting Particle Systems with Applications. Springer, New York (2004)
Book Google Scholar
Del Moral, P., Garnier, J.: Genealogical particle analysis of rare events. Ann. Appl. Probab. 15, 2496–2534 (2005)
Article MATH MathSciNet Google Scholar
Del Moral, P., Doucet, A., Jasra, A.: Sequential Monte Carlo samplers. J. R. Stat. Soc. B 68, 411–436 (2006)
Article MATH Google Scholar
Doucet, A., De Freitas, J.F.G., Gordon, N.J.: Sequential Monte Carlo Methods in Practice. Springer, New York (2001)
Book MATH Google Scholar
Glasserman, P., Heidelberger, P., Shahabuddin, P., Zajic, T.: Multi-level splitting for estimating rare event probabilities. Oper. Res. 47, 585–600 (1999)
Article MATH MathSciNet Google Scholar
Gorur, D., Teh, Y.W.: An efficient sequential Monte Carlo algorithm for coalescent clustering. Adv. Neural Inf. Process. Syst. 521–528 (2008)
Griffiths, R.C., Tavare, S.: Simulating probability distributions in the coalescent. Theor. Popul. Biol. 46, 131–159 (1994)
Article MATH Google Scholar
Johansen, A.M., Del Moral, P., Doucet, A.: Sequential Monte Carlo samplers for rare events. In: Proc. 6th Interl. Works. Rare Event Simul., Bamberg, Germany (2006)
Google Scholar
Kantas, N., Doucet, A., Singh, S.S., Maciejowski, J.M., Chopin, N.: On particle methods for parameter estimation in general state-space models. Technical report, Imperial College London (2011)
Kingman, J.F.C.: On the genealogy of large populations. J. Appl. Probab. 19, 27–43 (1982)
Article MathSciNet Google Scholar
Lee, A., Yau, C., Giles, M., Doucet, A., Holmes, C.C.: On the utility of graphics cards to perform massively parallel implementation of advanced Monte Carlo methods. J. Comput. Graph. Stat. 19, 769–789 (2010)
Article Google Scholar
Lezaud, P., Krystul, J., Le Gland, F.: Sampling per mode simulation for switching diffusions. In: Proc. 8th Internl. Works. Rare-Event Simul.. RESIM, Cambridge (2010)
Google Scholar
Liu, J.S.: Monte Carlo Strategies in Scientific Computing. Springer, New York (2001)
MATH Google Scholar
Roberts, G.O., Rosenthal, J.S.: Coupling and ergodicity of adaptive MCMC. J. Appl. Probab. 44, 458–475 (2007)
Article MATH MathSciNet Google Scholar
Roberts, G.O., Rosenthal, J.S.: Quantitative non-geometric convergence bounds for independence samplers. Methodol. Comput. Appl. Probab. 13, 391–403 (2011)
Article MATH MathSciNet Google Scholar
Sadowsky, J.S., Bucklew, J.A.: On large deviation theory and asymptotically efficient Monte Carlo estimation. IEEE Trans. Inf. Theory 36, 579–588 (1990)
Article MATH MathSciNet Google Scholar
Stephens, M., Donelly, P.: Inference in molecular population genetics (with discussion). J. R. Stat. Soc. B 62, 605–655 (2000)
Article MATH Google Scholar
Whiteley, N.: Discussion of particle Markov chain Monte Carlo methods. J. R. Stat. Soc. B 72, 306–307 (2010)
Google Scholar

Download references

Acknowledgements

We thank Arnaud Doucet and Maria De Iorio for many conversations on this work. The first author was supported by a Ministry of Education grant. The second author was kindly supported by the EPSRC programme grant on Control For Energy and Sustainability EP/G066477/1. We thank two referees and an associate editor for comments that vastly improved the paper.

Author information

Authors and Affiliations

Department of Statistics & Applied Probability, National University of Singapore, Singapore, 117546, Singapore
Ajay Jasra
Department of Statistical Science, University College London, London, W1CE 6BT, UK
Nikolas Kantas
Department of Mathematics, Imperial College London, London, SW7 2AZ, UK
Adam Persing

Authors

Ajay Jasra
View author publications
You can also search for this author in PubMed Google Scholar
Nikolas Kantas
View author publications
You can also search for this author in PubMed Google Scholar
Adam Persing
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Ajay Jasra.

Appendix

Proof of Proposition 4.1

The result is a straight forward application of Theorem 6 of Roberts and Rosenthal (2011), which adapted to our notation states:

where the conditional expectation is the expectation w.r.t. the SMC algorithm (i.e. Ξ∼ψ _θ) and the outer expectation is w.r.t. the PIMH target (i.e. $\xi\sim\pi_{p}^{N}$). We also denote the estimate of the normalising constant as $\hat{Z}_{p}(\cdot )$ with ⋅ denoting which random variables generate the estimate.

Now, clearly via (A1),

with the convention that τ ₀=0. Thus, it follows that

and we obtain:

Note that by assumption, $Z_{p}(\varXi) (\rho\varphi )^{2\sum_{n=1}^{p}\bar{\tau}_{n}}\leq1$, and thus we have

Given Del Moral (2004, Theorem 7.4.2, Eq. (7.17), p. 239) and the fact that γ _θ is defined to be strictly positive in (A1), we have that the SMC approximation $\hat {Z}_{p}(\cdot)$ is an unbiased estimate of the normalising constant Z _p:

(13)

and we can easily conclude. □

Proof of Proposition 4.2

The proof of parts 1 and 2 follows the line of arguments used in Theorem 4 of Andrieu et al. (2010), which we will adapt to our set-up. The main difference lies in the multi-level construction and second statement regarding the marginal of $\bar{\pi}^{N}$. For the validity of the multi-level set-up, we will rely on Proposition 3.1.

Suppose we design a Metropolis-Hastings kernel with invariant density $\bar{\pi}^{N}$ and use a proposal $q^{N}(\theta,k,\bar {\mathcal{X}}_{1:p}, \bar{\mathbf{a}}_{1:p-1})=\psi_{\theta}(\bar {\mathcal{X}}_{1:p},\bar{\mathbf{a}}_{1:p-1})f(k\mid W_{p})\bar {q}(\theta(i-1)\mid \theta^{\prime})=\psi_{\theta}(\bar{\mathcal {X}}_{1:p},\bar{\mathbf{a}}_{1:p-1})\bar{W}_{p}^{(k)}\bar{q}(\theta \mid \theta^{\prime})$. Then

where we denote the normalising constant of the posterior in (1) as:

Therefore, the Metropolis-Hastings procedure to sample from $\bar{\pi }_{p}^{N}$ will be as in Algorithm 4.

Alternatively using similar arguments, one may write

Summing over k and using the unbiased property of the SMC algorithm in Eq. (13), it follows that $\bar{\pi }_{p}^{N}(\cdot)$ admits $\bar{\pi}(\theta)$ as a marginal, so the proof of part 1 is complete.

Part 2 is a direct consequence of Theorem 1 in Andrieu and Roberts (2009) and Assumption (A2). □

Proof of Proposition 4.3

The proof is similar to that of Proposition 4.2. For the proof of the first statement of part 1, one repeats the same arguments as for Proposition 4.2, with the difference being in the inclusion of $\bar{\varLambda}_{\theta}(v)$ for $\bar {\pi}^{N}$ and $\bar{q}^{N}$. For the second statement, to get the marginal of $\bar{\pi}^{N}$, one re-writes the target as

Let $\bar{\pi}_{p}^{N}(\theta)$ denote the marginal of $\bar {\pi}_{p}^{N}(\cdot)$ obtained in Proposition 4.2. Using (11) and the conditional independence of v and $\bar{\mathcal{X}}_{1:p(v)},\bar{\mathbf {a}}_{1:p(v)-1}$, then for the marginal of $\bar{\pi}^{N}(\cdot)$ w.r.t. v, $\bar{\mathcal{X}}_{1:p(v)},\bar{\mathbf {a}}_{1:p(v)-1}$, k we have that

where the summing over k and integrating w.r.t. $\bar{\mathcal {X}}_{1:p(v)}, \bar{\mathbf{a}}_{1:p(v)-1}$ is as in Proposition 4.2.

For part 2 note that the conditional density given k and v and θ of $\mathcal{X}_{1:p(v)}^{(k)}$ is

Hence the sequence $(\theta(i),\mathcal {X}_{1:p(v)}^{(k)}(i) )_{i\geq0}$ satisfies the required property as a direct consequence of Theorem 1 in Andrieu and Roberts (2009) and Assumption (A2). □

Rights and permissions

Reprints and permissions

About this article

Cite this article

Jasra, A., Kantas, N. & Persing, A. Bayesian parameter inference for partially observed stopped processes. Stat Comput 24, 1–20 (2014). https://doi.org/10.1007/s11222-012-9348-2

Download citation

Received: 17 January 2012
Accepted: 31 July 2012
Published: 30 August 2012
Issue Date: January 2014
DOI: https://doi.org/10.1007/s11222-012-9348-2

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Bayesian parameter inference for partially observed stopped processes

Abstract

Access this article

Similar content being viewed by others

Sequential Monte Carlo with transformations

Markov Chain Monte Carlo Algorithms for Bayesian Computation, a Survey and Some Generalisation

Efficient $$\hbox {SMC}^2$$ schemes for stochastic kinetic models

References

Acknowledgements