Skip to main content

A Computational Theory of Surprise

  • Chapter
Information, Coding and Mathematics

Part of the book series: The Springer International Series in Engineering and Computer Science ((SECS,volume 687))

Abstract

While eminently successful for the transmission of data, Shannon’s theory of information does not address semantic and subjective dimensions of data, such as relevance and surprise. We propose an observer-dependent computational theory of surprise where surprise is defined by the relative entropy between the prior and the posterior distributions of an observer. Surprise requires integration over the space of models in contrast with Shannon’s entropy, which requires integration over the space of data. We show how surprise can be computed exactly in a number of discrete and continuous cases using distributions from the exponential family with conjugate priors. We show that during sequential Bayesian learning, surprise decreases like 1/N and study how surprise differs and complements Shannon’s definition of information.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 169.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Similar content being viewed by others

References

  1. J. Aczel and Z. Daroczy. On measures of information and their characterizations. Academic Press, New York, 1975.

    MATH  Google Scholar 

  2. P. Baldi and S. Brunak. Bioinformatics: the machine learning approach. MIT Press, Cambridge, MA, 2001. Second edition.

    Google Scholar 

  3. J. O. Berger. Statistical decision theory and Bayesian analysis. Springer-Verlag, New York, 1985.

    MATH  Google Scholar 

  4. R. E. Blahut. Principles and practice of information theory. Addison-Wesley, Reading, MA, 1987.

    MATH  Google Scholar 

  5. Bela Bollobas. Random Graphs. Academic Press, London, 1985.

    MATH  Google Scholar 

  6. G. E. P. Box and G. C. Tiao. Bayesian inference in statistical analysis. John Wiley and Sons, New York, 1992. (First Edition in 1973 ).

    Google Scholar 

  7. L. D. Brown. Fundamentals of Statistical Exponential Families. Institute of Mathematical Statistics, Hayward, CA, 1986.

    MATH  Google Scholar 

  8. T. M. Cover and J. A. Thomas. Elements of Information Theory. John Wiley, New York, 1991.

    Book  MATH  Google Scholar 

  9. R. T. Cox. Probability, frequency and reasonable expectation. Am. J. Phys., 14: 1–13, 1964.

    Google Scholar 

  10. A. Gelman, J. B. Carlin, H. S. Stern, and D. B. Rubin. Bayesian Data Analysis. Chapman and Hall, London, 1995.

    Google Scholar 

  11. I. S. Gradshteyn and I. M. Ryzhik. Table of integrals, series, and products. Academic Press, New York, 1980.

    Google Scholar 

  12. S.F. Gull. Developments in maximum entropy data analysis. In J. Skilling, editor, Maximum entropy and Bayesian methods, pages 53–71. Kluwer, Dordrecht, 1989.

    Google Scholar 

  13. G. E. Hinton, P. Dayan, B. J. Frey, and R. M. Neal. The wake-sleep algorithm for unsupervised neural networks. Science, 268 (5214): 1158–1161, 1995.

    Article  Google Scholar 

  14. E. T. Jaynes. Probability Theory: The Logic of Science. 1996. Unpublished.

    Google Scholar 

  15. E.T. Jaynes. Bayesian methods: General background. In J.H. Justice, editor, Maximum Entropy and Bayesian Methods in Statistics, pages 1–25. Cambridge University Press, Cambridge, 1986.

    Google Scholar 

  16. G. Jumarie. Relative information. Springer Verlag, New York, 1990.

    Book  MATH  Google Scholar 

  17. S. Kullback. Information theory and statistics. Dover, New York, 1968. (First Edition in 1959 ).

    Google Scholar 

  18. R. J. McEliece. The Theory of Information and Coding. Addison-Wesley Publishing Company, Reading, MA, 1977.

    MATH  Google Scholar 

  19. L. J. Savage. The foundations of statistics. Dover, New York, 1972. (First Edition in 1954 ).

    Google Scholar 

  20. C. E. Shannon. A mathematical theory of communication. Bell System Technical Journal, 27: 379–423, 623–656, 1948.

    Google Scholar 

  21. N. Tishby, F. Pereira, and W. Bialek. The information bottleneck method. In B. Hajek and R. S. Sreenivas, editors, Proceedings of the 37th Annual Allerton Conference on Communication, Control, and Computing, pages 368–377. University of Illinois, 1999.

    Google Scholar 

  22. V. Vapnik. The Nature of Statistical Learning Theory. Springer Verlag, New York, 1995.

    MATH  Google Scholar 

  23. A. Zellner. Optimal information processing and Bayes’s theorem. The American Statistician, 42 (4): 278–284, 1998.

    MathSciNet  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2002 Springer Science+Business Media New York

About this chapter

Cite this chapter

Baldi, P. (2002). A Computational Theory of Surprise. In: Blaum, M., Farrell, P.G., van Tilborg, H.C.A. (eds) Information, Coding and Mathematics. The Springer International Series in Engineering and Computer Science, vol 687. Springer, Boston, MA. https://doi.org/10.1007/978-1-4757-3585-7_1

Download citation

  • DOI: https://doi.org/10.1007/978-1-4757-3585-7_1

  • Publisher Name: Springer, Boston, MA

  • Print ISBN: 978-1-4419-5289-9

  • Online ISBN: 978-1-4757-3585-7

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics