Abstract
A longstanding open problem in statistics is finding an explicit expression for the probability measure that maximizes entropy subject to given constraints. In this paper we solve this problem using perturbative Feynman calculus; the explicit expression is given as a sum over weighted trees.
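For orientation, the following sketch solves a toy discrete instance of the constrained maximum-entropy problem numerically. The setup (the support `xs`, the moment `target`, and the Newton loop) is entirely illustrative and not from the paper; the paper's contribution is a closed-form tree expansion, not this numerical route.

```python
import numpy as np

# Toy instance: over distributions p on {0,...,5}, maximize the entropy
# H(p) = -sum_x p(x) log p(x) subject to the moment constraint E_p[x] = 3.
xs = np.arange(6.0)
target = 3.0

# The classical maximizer is the Gibbs measure p_lam(x) ∝ exp(lam * x),
# where the multiplier lam solves E_{p_lam}[x] = target. Newton's method
# on the convex dual: first derivative = mean - target, second = variance.
lam = 0.0
for _ in range(50):
    w = np.exp(lam * xs)
    w /= w.sum()
    mean = w @ xs
    var = w @ (xs - mean) ** 2
    lam -= (mean - target) / var

p = np.exp(lam * xs)
p /= p.sum()
print(p @ xs)  # ≈ 3.0: the Gibbs solution meets the moment constraint
```

The multiplier is found by a one-dimensional Newton iteration because the dual is smooth and strictly convex here; any root-finder would do.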
Notes
See for example the proof in Joel Feldman’s lecture notes, http://www.math.ubc.ca/~feldman/m425/impFnThm.pdf. To apply the argument we need that \((\partial f)(x,y) \in \textit{Hom}(V,V^*)\) is invertible for all \((x,y) \in U \times W\). Since \(\partial f = B({\text {id}} - \partial g)\) this follows from our assumption that \((\partial g)(x,y) \in \textit{Hom}(V,V)\) is contracting.
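The step from \(\partial g\) contracting to \(\partial f\) invertible can be made explicit via the Neumann series (a standard argument; we write \(\Vert \partial g\Vert \le q < 1\) for the contraction constant and assume, as the factorization \(\partial f = B({\text {id}} - \partial g)\) suggests, that \(B \in \textit{Hom}(V,V^*)\) is an isomorphism):

\[({\text {id}} - \partial g)^{-1} = \sum _{n=0}^{\infty } (\partial g)^{n}, \qquad \Bigl \Vert \sum _{n=0}^{\infty } (\partial g)^{n}\Bigr \Vert \le \sum _{n=0}^{\infty } q^{n} = \frac{1}{1-q} < \infty ,\]

so \((\partial f)^{-1} = ({\text {id}} - \partial g)^{-1} B^{-1}\) exists at every point of \(U \times W\).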
References
Avellaneda, M., Friedman, C., Holmes, R., Samperi, D.: Calibrating volatility surfaces via relative-entropy minimization. Appl. Math. Finance 4(1), 37–64 (1997)
Cover, T.M., Thomas, J.A.: Elements of Information Theory. Wiley, New York (2012)
Etingof, P.: Geometry and quantum field theory. MIT OpenCourseWare 18.238 (2002)
Frisch, H.L., Lebowitz, J.L.: The Equilibrium Theory of Classical Fluids. Benjamin, New York (1964)
Jaynes, E.T.: Information theory and statistical mechanics. Phys. Rev. 106(4), 620–630 (1957)
Jaynes, E.T.: Information theory and statistical mechanics. II. Phys. Rev. 108(2), 171–190 (1957)
Lin, H.W., Tegmark, M., Rolnick, D.: Why does deep and cheap learning work so well? arXiv preprint arXiv:1608.08225 (2016)
Shell, M.S.: The relative entropy is fundamental to multiscale and inverse thermodynamic problems. J. Chem. Phys. 129(14), 144108 (2008)
Acknowledgements
We thank O. Bozo, B. Gomberg, R.S. Melzer, A. Moscovitch-Eiger, R. Schweiger, A. Solomon and D. Zernik for discussions related to the work presented here. R.T. was partially supported by Dr. Max Rössler, the Walter Haefner Foundation and the ETH Zurich Foundation.
Ethics declarations
Conflict of interest
The authors declare no conflict of interest.
Appendix A: Proof of Lemma 3
Recall that by Lemma 24
Hence \(\lambda _{0}\) is an analytic function of \(\lambda _{1},\ldots ,\lambda _{k}\) around \(\lambda _{1}=\dots =\lambda _{k}=0\). Now,
so that \(\rho _{i}(\lambda _{1},\ldots ,\lambda _{k})\) is an analytic function of \(\lambda _{1},\ldots ,\lambda _{k}\).
The proof that \(\lambda _{i}=\lambda _{i}(\rho _{1},\ldots ,\rho _{k})\) is an analytic function of \(\rho _{1},\ldots ,\rho _{k}\) around
uses the analytic inverse function theorem. It is enough to show that the Jacobian \(\frac{\partial (\rho _{1},\ldots ,\rho _{k})}{\partial (\lambda _{1},\ldots ,\lambda _{k})}\) is invertible for \(\lambda _{1}=\dots =\lambda _{k}=0\).
But
Evaluation at \(\lambda _{1}=\dots =\lambda _{k}=0\) gives
By assumption the KL constraint problem is normalized, hence
The Jacobian \(\frac{\partial (\rho _{1},\ldots ,\rho _{k})}{\partial (\lambda _{1},\ldots ,\lambda _{k})}=I_{k}\) is thus invertible. \(\square \)
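The identity-Jacobian conclusion can be checked numerically in a toy setting. The sketch below makes illustrative assumptions not spelled out in this excerpt: a finite sample space with base measure `pi`, and features centred and orthonormalized with respect to `pi`, which is one concrete reading of the "normalized" condition. All names are ours.

```python
import numpy as np

rng = np.random.default_rng(0)

# Finite sample space with base measure pi (illustrative setting).
n, k = 50, 3
pi = rng.random(n)
pi /= pi.sum()

# Random features, centred and orthonormalized w.r.t. pi so that the
# constraint problem is "normalized": E_pi[f_i] = 0, E_pi[f_i f_j] = delta_ij.
F = rng.standard_normal((k, n))
F -= (F * pi).sum(axis=1, keepdims=True)           # centre: E_pi[f_i] = 0
for i in range(k):                                  # pi-weighted Gram-Schmidt
    for j in range(i):
        F[i] -= (F[i] * F[j] * pi).sum() * F[j]
    F[i] /= np.sqrt((F[i] ** 2 * pi).sum())

def rho(lam):
    """Moments E_{p_lam}[f_i] under the Gibbs measure p_lam ∝ pi * exp(lam·f)."""
    w = pi * np.exp(lam @ F)
    w /= w.sum()
    return F @ w

# Central finite-difference Jacobian d(rho)/d(lam) at lam = 0.
eps = 1e-6
J = np.column_stack([(rho(eps * e) - rho(-eps * e)) / (2 * eps)
                     for e in np.eye(k)])
print(np.round(J, 4))  # ≈ the identity matrix I_k
```

At \(\lambda = 0\) the Jacobian is the Gram matrix \(E_{\pi }[f_{i}f_{j}]\) of the features, which the orthonormalization above forces to be \(I_{k}\), matching the lemma's conclusion.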
Cite this article
Netser Zernik, A., Schlank, T.M. & Tessler, R.J. Exact Maximum-Entropy Estimation with Feynman Diagrams. J Stat Phys 170, 731–747 (2018). https://doi.org/10.1007/s10955-018-1960-x