
Avoiding both the Garbage-In/Garbage-Out and the Borel Paradox in updating probabilities given experimental information


Abstract

Bayes' rule specifies how probabilities over parameters should be updated given any kind of information. But in some cases, the information provided by both simulation and physical experiments concerns how certain output parameters change when other input parameters are changed. There are three different approaches to this problem: the first leads to the Garbage-In/Garbage-Out Paradox, the second (Bayesian synthesis) falls prey to the Borel Paradox, and the third (Bayesian melding) is a supra-Bayesian heuristic. This paper shows how to derive a fully Bayesian formula which avoids both the Garbage-In/Garbage-Out and Borel Paradoxes. We also compare a Laplacian approximation of this formula with Bayesian synthesis and Bayesian melding, and find that the Bayesian formula sometimes coincides with the Bayesian melding solution.



Author information

Correspondence to Robert F. Bordley.

Appendices

Appendix I

First consider the case in which x, instead of being prespecified, is, like y, observed in the application of interest. Let \(E^*\) denote this information. Then the decision maker’s posterior probability over \(\mu \) and \(\nu \) is:

$$\begin{aligned} f(\mu ,\nu |E^*) \propto f(x,y|\mu ,\nu ) f(\mu ,\nu )= f(y|x,\mu ,\nu )f(x|\mu ,\nu ) f(\mu ,\nu ) \end{aligned}$$
(2)

Now suppose the decision maker is given new information \(E^{**}\) indicating that x, instead of being observed from the complex system, was prespecified, so that x on its own provides no new information about the application of interest. Given this information \(E^{**}\), the decision maker would update the probabilities in Eq. (2) to

$$\begin{aligned} f(\mu ,\nu |x,y,E^*,E^{**}) \propto f(y|x,\mu ,\nu ,E^{**}) f(x|\mu ,\nu ,E^{**}) f(\mu ,\nu |E^{**}) \end{aligned}$$
(3)

Note that the event E is the event of learning both \(E^*\) and \(E^{**}\). Note also that

$$\begin{aligned} f(\mu ,\nu |E^{**})=f(\mu ,\nu ) \end{aligned}$$

i.e., the decision maker’s prior beliefs about \(\mu \) and \(\nu \)—which were unaffected by the information \(E^{*}\)—will be equally unaffected by \(E^{**}\). Also note that

$$\begin{aligned} f(y|x,\mu ,\nu ,E^{**})=f(y|x,\mu ,\nu ) \end{aligned}$$

i.e., the probability conditioned on x being the true value of X (as well as \(\mu \) and \(\nu \)) will be unaffected by the information that x was chosen arbitrarily. However, the decision maker’s assessment of \(f(x|\mu ,\nu ,E^{**})\) will definitely be affected upon learning that x was chosen arbitrarily. To understand how it is affected, note that learning the value of x after learning that x was chosen arbitrarily should not change our beliefs about \(\mu \) and \(\nu \). Thus

$$\begin{aligned} f(\mu ,\nu |x, E^{**})=f(\mu ,\nu |E^{**}) \end{aligned}$$

This implies

$$\begin{aligned} f(\mu ,\nu ,x|E^{**})=f(\mu ,\nu |E^{**})f(x|E^{**}) \hbox { or } f(x|\mu ,\nu ,E^{**})=f(x|E^{**}) \end{aligned}$$

Making all these substitutions into Eq. (3) gives the proposition.
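The proposition admits a quick numerical sanity check. The sketch below (Python; the discrete distributions and the observed pair (x, y) are made-up illustrations, not from the paper) builds a joint distribution in which x is drawn independently of \((\mu ,\nu )\), conditions on (x, y), and confirms that the posterior coincides with the shortcut \(f(\mu ,\nu |x,y) \propto f(y|x,\mu ,\nu ) f(\mu ,\nu )\):

```python
import numpy as np

rng = np.random.default_rng(0)
n = 2  # mu, nu, x, y each take values in {0, 1} (hypothetical discrete model)

prior = rng.random((n, n)); prior /= prior.sum()   # f(mu, nu)
f_x = rng.random(n); f_x /= f_x.sum()              # f(x): prespecified, independent of (mu, nu)
f_y = rng.random((n, n, n, n))                     # f(y | x, mu, nu), axes (y, x, mu, nu)
f_y /= f_y.sum(axis=0, keepdims=True)              # normalize over y

x_obs, y_obs = 1, 0

# Full joint f(mu, nu, x, y) = f(y|x,mu,nu) f(x) f(mu,nu), conditioned on (x_obs, y_obs).
joint = f_y[y_obs, x_obs] * f_x[x_obs] * prior
posterior = joint / joint.sum()

# Shortcut from the proposition: f(x|E**) drops out as a constant.
shortcut = f_y[y_obs, x_obs] * prior
shortcut /= shortcut.sum()

assert np.allclose(posterior, shortcut)
```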

Appendix II

Define \(z_i\) as the \((n+m)\)-dimensional vector whose first n elements are the inputs to the experiment, \(x_i\), in trial i and whose last m elements are the observed outputs, \(y_i\). For simplicity, we will temporarily suppress the subscript i where it is not needed. Define Z as the \((n+m)\)-dimensional vector whose first n elements are \(\mu \) and whose last m elements are \(\nu \). Conditioned on \(\mu \) and \(\nu \), the mean of z is Z. We can write the variance-covariance matrix of z as

$$\begin{aligned} \left( \begin{array}{ll} V_{XX} &{} V_{XY}\\ V^T_{XY} &{} V_{YY} \end{array}\right) \end{aligned}$$

where \(V_{XX}\) is an n by n matrix, \(V_{XY}\) is an n by m matrix, and \(V_{YY}\) is an m by m matrix.

It can be shown that the inverse variance-covariance matrix has the form

$$\begin{aligned} \left( \begin{array}{ll} V_{XX}^{-1} + RR^T &{} R S^T\\ SR^T &{} SS^T \end{array}\right) \end{aligned}$$

where \(R=-V_{XX}^{-1} V_{XY} S\) and \(SS^T=(V_{YY} - V_{XY}^T V_{XX}^{-1} V_{XY})^{-1}\)
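This block form is easy to verify numerically. A minimal sketch (Python/numpy; the covariance matrix is randomly generated and the dimensions are arbitrary):

```python
import numpy as np

rng = np.random.default_rng(1)
n, m = 3, 2

# Random positive-definite (n+m) x (n+m) covariance matrix, partitioned into blocks.
A = rng.standard_normal((n + m, n + m))
V = A @ A.T + (n + m) * np.eye(n + m)
Vxx, Vxy, Vyy = V[:n, :n], V[:n, n:], V[n:, n:]

# S S^T = (Vyy - Vxy^T Vxx^{-1} Vxy)^{-1}; any square root works, e.g. a Cholesky factor.
SSt = np.linalg.inv(Vyy - Vxy.T @ np.linalg.solve(Vxx, Vxy))
S = np.linalg.cholesky(SSt)
R = -np.linalg.solve(Vxx, Vxy) @ S

inverse_from_blocks = np.block([
    [np.linalg.inv(Vxx) + R @ R.T, R @ S.T],
    [S @ R.T,                      S @ S.T],
])
assert np.allclose(inverse_from_blocks, np.linalg.inv(V))
```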

Let \(\Omega ^{-1}=SS^T\) and, writing \(\Sigma =V_{XX}\) and \(\rho =V_{XY}\), define \(L=-2\ln \left[ \frac{f(x_i,y_i|\mu ,\nu )}{f(x_i|\mu )} \right] \), dropping additive constants. Then

$$\begin{aligned} L= & {} ([x_i-\mu ]^T \left[ V_{XX}^{-1} + RR^T\right] [x_i-\mu ]+2 [x_i-\mu ]^T RS^T[y_i-\nu ] + [y_i-\nu ]^T SS^T[y_i-\nu ] ) \\&-\,\, ([x_i-\mu ]^T V_{XX}^{-1}[x_i-\mu ] ) \\= & {} ([x_i-\mu ]^T RR^T[x_i-\mu ] +2[x_i-\mu ]^T RS^T[y_i-\nu ] + [y_i-\nu ]^T SS^T[y_i-\nu ]) \\= & {} ([x_i-\mu ]^T \Sigma ^{-1} \rho SS^T \rho ^T \Sigma ^{-1}[x_i-\mu ]- 2[x_i-\mu ]^T \Sigma ^{-1} \rho SS^T[y_i-\nu ]\\&+\,\,[y_i-\nu ]^T SS^T[y_i-\nu ]) \\= & {} ([x_i-\mu ]^T \Sigma ^{-1} \rho \Omega ^{-1} \rho ^T \Sigma ^{-1}[x_i-\mu ] - 2[x_i-\mu ]^T \Sigma ^{-1} \rho \Omega ^{-1}[y_i-\nu ]\\&+\,\, [y_i-\nu ]^T \Omega ^{-1} [y_i-\nu ]) \end{aligned}$$

Define B to be the \((n+m)\) by m matrix whose first n rows are \(-\Sigma ^{-1} \rho \) and whose last m rows are the identity matrix. Then

$$\begin{aligned} L=[z_i-Z]^T B \Omega ^{-1} B^T[z_i-Z] \end{aligned}$$

and

$$\begin{aligned} f(y|x,\mu ,\nu ) \propto \exp \left( -\frac{1}{2} [z_i-Z]^T B \Omega ^{-1} B^T[z_i-Z]\right) \end{aligned}$$
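As a check on this identity, the following sketch (Python/numpy; the covariance and the points \(x, y, \mu , \nu \) are randomly generated for illustration) confirms that the difference of the joint and marginal quadratic forms equals the compact form \([z_i-Z]^T B \Omega ^{-1} B^T[z_i-Z]\):

```python
import numpy as np

rng = np.random.default_rng(2)
n, m = 3, 2
A = rng.standard_normal((n + m, n + m))
V = A @ A.T + (n + m) * np.eye(n + m)               # random partitioned covariance
Sigma, rho, Vyy = V[:n, :n], V[:n, n:], V[n:, n:]

Omega = Vyy - rho.T @ np.linalg.solve(Sigma, rho)   # Omega = (S S^T)^{-1}
B = np.vstack([-np.linalg.solve(Sigma, rho),        # first n rows: -Sigma^{-1} rho
               np.eye(m)])                          # last m rows: identity

mu, nu = rng.standard_normal(n), rng.standard_normal(m)
x, y = rng.standard_normal(n), rng.standard_normal(m)
z, Z = np.concatenate([x, y]), np.concatenate([mu, nu])

# Difference of the joint and marginal quadratic forms (i.e., L) ...
L_direct = (z - Z) @ np.linalg.solve(V, z - Z) - (x - mu) @ np.linalg.solve(Sigma, x - mu)
# ... equals the compact quadratic form [z-Z]^T B Omega^{-1} B^T [z-Z].
L_compact = (z - Z) @ B @ np.linalg.solve(Omega, B.T @ (z - Z))
assert np.allclose(L_direct, L_compact)
```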

Given K independent experiments, with \(\hat{z}\) being the mean of \(z_1, \ldots , z_K\), the joint likelihood satisfies

$$\begin{aligned}&f(y|x,\mu ,\nu ) \propto \exp \left( -\frac{1}{2} \sum _{i=1}^K [z_i-Z]^T B \Omega ^{-1} B^T[z_i-Z]\right) \\&\quad \quad \propto \exp \left( -\frac{K}{2} [\hat{z}-Z]^T B \Omega ^{-1} B^T[\hat{z}-Z]\right) \end{aligned}$$
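The reduction to \(\hat{z}\) uses the decomposition \(\sum _{i}[z_i-Z]^T M[z_i-Z] = K[\hat{z}-Z]^T M[\hat{z}-Z] + \sum _i [z_i-\hat{z}]^T M [z_i-\hat{z}]\) with \(M=B\Omega ^{-1}B^T\); the second sum does not involve Z and is absorbed into the proportionality constant. A quick numerical confirmation (Python/numpy, arbitrary hypothetical data):

```python
import numpy as np

rng = np.random.default_rng(3)
d, K = 4, 6
M = rng.standard_normal((d, d)); M = M @ M.T   # stands in for B Omega^{-1} B^T
zs = rng.standard_normal((K, d))               # hypothetical trials z_1, ..., z_K
Z = rng.standard_normal(d)                     # stacked (mu, nu)
zhat = zs.mean(axis=0)

total = sum((z - Z) @ M @ (z - Z) for z in zs)
decomposed = (K * (zhat - Z) @ M @ (zhat - Z)
              + sum((z - zhat) @ M @ (z - zhat) for z in zs))  # 2nd term free of Z
assert np.allclose(total, decomposed)
```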

Let \(\beta =\rho ^T \Sigma ^{-1}\), \(\nu _0=\hat{y}-\beta \hat{x}\) and \(u=\beta \mu \), so that \(B^T[\hat{z}-Z]=-(\nu -\beta \mu -\nu _0)\) and

$$\begin{aligned} \int \limits _{\mu } f(y|x,\mu ,\nu ) f(\mu |\nu )\,d\mu= & {} \int \limits _{\mu } \exp (-\frac{K}{2} (\nu -\beta \mu -\nu _0)^T \Omega ^{-1} (\nu -\beta \mu -\nu _0)) f(\mu |\nu )\,d\mu \\= & {} \int \limits _u \exp (-\frac{K}{2} (\nu -u-\nu _0)^T \Omega ^{-1} (\nu -u-\nu _0)) \int \limits _{\{\mu :\,\beta \mu =u\}} f(\mu |\nu )\,d\mu \,du \end{aligned}$$

Defining a density \(h(u|\nu )\) with \(h(u|\nu ) \propto \int \limits _{\{\mu :\,\beta \mu =u\}} f(\mu |\nu )\,d\mu \) implies

$$\begin{aligned} f(E|\nu ) \propto \int \limits _u \exp (-\frac{K}{2}(\nu -u-\nu _0)^T \Omega ^{-1}(\nu -u-\nu _0)) h(u|\nu )\,du \end{aligned}$$
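Here \(h(u|\nu )\) is just the density that \(f(\mu |\nu )\) induces on \(u=\beta \mu \). A Monte Carlo sketch of this pushforward for the case \(n=2\), \(m=1\) (Python; the Gaussian \(f(\mu |\nu )\) and the row vector \(\beta \) are hypothetical choices, for which the induced density is also available in closed form):

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(4)
beta = np.array([0.7, -1.2])                  # hypothetical 1 x 2 matrix beta
mean = np.array([1.0, 2.0])                   # f(mu|nu) taken as N(mean, cov)
cov = np.array([[1.0, 0.3], [0.3, 0.5]])

# h(u|nu): density induced on u = beta mu by drawing mu ~ f(mu|nu).
draws = rng.multivariate_normal(mean, cov, size=200_000) @ beta

# For a linear map of a Gaussian, the pushforward is N(beta mean, beta cov beta^T).
u_mean, u_sd = beta @ mean, np.sqrt(beta @ cov @ beta)
edges = np.linspace(draws.min(), draws.max(), 50)
hist, _ = np.histogram(draws, bins=edges, density=True)
centers = (edges[:-1] + edges[1:]) / 2
assert np.allclose(hist, stats.norm.pdf(centers, u_mean, u_sd), atol=0.02)
```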

Appendix III

Let \(L(u|\nu )=-\ln (h(u|\nu ))\) and define

$$\begin{aligned} g(u|\nu )=\frac{K}{2}(\nu -u-\nu _0)^T \Omega ^{-1} (\nu -u-\nu _0)+L(u|\nu ) \end{aligned}$$

so that \(f(E|\nu ) \propto \int _{u} \exp (-g(u|\nu ))\,du \). Suppose \(g(u|\nu )\) is convex in u, and let \(g'\) and \(g''\) denote the vector and matrix of its first- and second-order derivatives. If \(u^*\) is the minimizer of \(g(u|\nu )\) (the mode of \(\exp (-g(u|\nu ))\)), then

$$\begin{aligned} g'(u^*|\nu )= & {} 0 \rightarrow K \Omega ^{-1}(u^*-\nu +\nu _0)+ L'(u^*|\nu ) = 0 \rightarrow u^*\\= & {} \nu -\nu _0 - \frac{1}{K} \Omega L'(u^*|\nu ) \end{aligned}$$

Laplace’s approximation then allows us to approximate \(f(E|\nu )\), up to the constant factor \((2\pi )^{m/2}\), by

$$\begin{aligned} |g''(u^*|\nu )|^{-1/2} \exp (- g(u^*|\nu )) \end{aligned}$$
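A one-dimensional illustration (Python/scipy; the convex function g below is a hypothetical stand-in, not derived from the model), showing that the Laplace value tracks the quadrature result to within a few percent:

```python
import numpy as np
from scipy.integrate import quad
from scipy.optimize import minimize_scalar

# Hypothetical convex g(u): a quadratic term plus a smooth non-quadratic part.
g = lambda u: 3.0 * (u - 0.5) ** 2 + np.log(1 + u ** 2)
g2 = lambda u: 6.0 + (2 - 2 * u ** 2) / (1 + u ** 2) ** 2   # g''(u) > 0 everywhere

u_star = minimize_scalar(g).x                               # mode of exp(-g)
laplace = np.sqrt(2 * np.pi / g2(u_star)) * np.exp(-g(u_star))
exact, _ = quad(lambda u: np.exp(-g(u)), -np.inf, np.inf)

assert np.isclose(laplace, exact, rtol=0.05)                # about 2% off for this g
```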

Since the Gaussian term in the integral heavily discounts values of u significantly different from \(\nu -\nu _0\), we now approximate \(L(u|\nu )\) by a Taylor series about \(u=\nu -\nu _0\). If \(L'\) and \(L''\) are the first- and second-order derivatives of \(L(u|\nu )\) evaluated at the point \(u=\nu -\nu _0\), then the Taylor series approximation is

$$\begin{aligned} L(u|\nu ) \approx L(\nu -\nu _0|\nu ) + (u-\nu +\nu _0)^T L' + \frac{1}{2} (u-\nu +\nu _0)^T L''(u-\nu +\nu _0) \end{aligned}$$

Define \(v=(u-\nu +\nu _0)\) and \(L_0=L(\nu -\nu _0|\nu )\), so that \(L(v|\nu )=L_0+v^TL'+\frac{1}{2}v^TL''v\) and

$$\begin{aligned} g(v|\nu )= & {} \frac{K}{2} v^T \Omega ^{-1}v + L(\nu -\nu _0|\nu )+v^T L'+\frac{1}{2}v^T L''v\\= & {} \frac{1}{2}v^T [K \Omega ^{-1} +L'']v +v^T L' +L(\nu -\nu _0|\nu ) \end{aligned}$$

Defining \(A=K \Omega ^{-1}+L''\) yields \(g(v)=\frac{1}{2} v^T Av+v^T L' +L(\nu -\nu _0|\nu ) \). Defining \(v^*=u^*-\nu +\nu _0\) implies that the mode satisfies

$$\begin{aligned} g'=0 \rightarrow Av^* + L'=0 \rightarrow v^*= -A^{-1} L' \end{aligned}$$

with \(g''=A\). Substituting into \(g(u|\nu )\) gives

$$\begin{aligned} g(u^*|\nu )= & {} \frac{1}{2} (L')^T A^{-1} L' - (L')^T A^{-1}L'+L(\nu -\nu _0|\nu )\\= & {} -\frac{1}{2} (L')^T A^{-1} L' + L(\nu -\nu _0|\nu ) \end{aligned}$$

Treating \(g''(u^*|\nu )=A\) as a constant, we thus have

$$\begin{aligned}&f(E|\nu ) \propto \exp \left( \frac{1}{2}(L')^T A^{-1}L' - L(\nu -\nu _0|\nu )\right) \\&\quad \quad \propto h(\nu -\nu _0|\nu ) \exp \left( \frac{1}{2} [L']^T[K \Omega ^{-1}+L'']^{-1}L'\right) \end{aligned}$$
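When h is itself Gaussian, \(L''\) is constant, the Taylor expansion is exact, and the final expression can be checked end to end against direct numerical integration. A one-dimensional sketch (Python/scipy; all constants are hypothetical):

```python
import numpy as np
from scipy import stats
from scipy.integrate import quad

K, omega, nu0 = 5, 0.8, 0.3          # hypothetical scalar K, Omega, nu_0
m0, tau = 1.0, 0.6                   # h(u) = N(m0, tau^2), so L(u) = -ln h(u)

def f_exact(nu):
    # Direct integration of exp(-K/2 (nu - u - nu0)^2 / omega) h(u) over u.
    integrand = lambda u: np.exp(-K / 2 * (nu - u - nu0) ** 2 / omega) \
        * stats.norm.pdf(u, m0, tau)
    return quad(integrand, -np.inf, np.inf)[0]

def f_formula(nu):
    # h(nu - nu0) exp( (1/2) L'^T [K Omega^{-1} + L'']^{-1} L' ) in one dimension.
    Lp = (nu - nu0 - m0) / tau ** 2      # L'  evaluated at u = nu - nu0
    Lpp = 1 / tau ** 2                   # L'' (constant, since h is Gaussian)
    return stats.norm.pdf(nu - nu0, m0, tau) * np.exp(0.5 * Lp ** 2 / (K / omega + Lpp))

nus = np.linspace(-2.0, 4.0, 7)
ratios = np.array([f_exact(v) / f_formula(v) for v in nus])
assert np.allclose(ratios, ratios[0])    # equal up to a constant in nu, as claimed
```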


About this article


Cite this article

Bordley, R.F. Avoiding both the Garbage-In/Garbage-Out and the Borel Paradox in updating probabilities given experimental information. Theory Decis 79, 95–105 (2015). https://doi.org/10.1007/s11238-013-9369-0
