Skip to main content

Geometry of Goodness-of-Fit Testing in High Dimensional Low Sample Size Modelling

  • Conference paper
  • First Online:

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 9389))

Abstract

We introduce a new approach to goodness-of-fit testing in the high dimensional, sparse extended multinomial context. The paper takes a computational information geometric approach, extending classical higher order asymptotic theory. We show why the Wald – equivalently, the Pearson \(\chi ^2\) and score statistics – are unworkable in this context, but that the deviance has a simple, accurate and tractable sampling distribution even for moderate sample sizes. Issues of uniformity of asymptotic approximations across model space are discussed. A variety of important applications and extensions are noted.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

  1. Agresti, A.: Categorical Data Analysis. Wiley, Hoboken (2002)

    Book  MATH  Google Scholar 

  2. Amari, S.-I.: Differential-Geometrical Methods in Statistics. Springer, New York (1985)

    Book  MATH  Google Scholar 

  3. Anaya-Izquierdo, K., Critchley, F., Marriott, P.: When are first order asymptotics adequate? a diagnostic. STAT 3, 17–22 (2014)

    Article  Google Scholar 

  4. Anaya-Izquierdo, K., Critchley, F., Marriott, P., Vos, P.: Computational information geometry in statistics: foundations. In: Nielsen, F., Barbaresco, F. (eds.) GSI 2013. LNCS, vol. 8085, pp. 311–318. Springer, Heidelberg (2013)

    Chapter  Google Scholar 

  5. Barndorff-Nielsen, O.E., Cox, D.R.: Inference and Asymptotics. Chapman & Hall, London (1994)

    Book  MATH  Google Scholar 

  6. Critchley, F., Marriott, P.: Computational information geometry in statistics. Entropy 16, 2454–2471 (2014)

    Article  MathSciNet  Google Scholar 

  7. Fienberg, S.E., Rinaldo, A.: Maximum likelihood estimation in log-linear models. Ann. Stat. 40, 996–1023 (2012)

    Article  MathSciNet  MATH  Google Scholar 

  8. Geyer, C.J.: Likelihood inference in exponential families and directions of recession. Electron. J. Stat. 3, 259–289 (2009)

    Article  MathSciNet  MATH  Google Scholar 

  9. Holst, L.: Asymptotic normality and efficiency for certain goodness-of-fit tests. Biometrika 59, 137–145 (1972)

    Article  MathSciNet  MATH  Google Scholar 

  10. Lauritzen, S.L.: Graphical Models. Clarendon Press, Oxford (1996)

    MATH  Google Scholar 

  11. Liu, M., Vemuri, B.C., Amari, S.-I., Nielsen, F.: Shape retrieval using heirarchical total Bregman soft clustering. IEEE Trans. Pattern Anal. Mach. Intell. 34, 2407–2419 (2012)

    Article  Google Scholar 

  12. Morris, C.: Central limit theorems for multinomial sums. Ann. Stat. 3, 165–188 (1975)

    Article  MathSciNet  MATH  Google Scholar 

  13. Nielsen, F., Nock, N.: Optimal interval clustering: application to Bregman clustering and statistical mixture learning. IEEE Trans. Pattern Anal. Mach. Intell. 21(10), 1289–1292 (2014)

    Google Scholar 

  14. Steck, G.P.: Limit Theorems for Conditional Distributions. University of California Press, Berkeley (1957)

    MATH  Google Scholar 

Download references

Acknowledgements

The authors acknowledge with gratitude the support of EPSRC grant EP/L010429/1.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Radka Sabolova .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2015 Springer International Publishing Switzerland

About this paper

Cite this paper

Marriott, P., Sabolova, R., Van Bever, G., Critchley, F. (2015). Geometry of Goodness-of-Fit Testing in High Dimensional Low Sample Size Modelling . In: Nielsen, F., Barbaresco, F. (eds) Geometric Science of Information. GSI 2015. Lecture Notes in Computer Science(), vol 9389. Springer, Cham. https://doi.org/10.1007/978-3-319-25040-3_61

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-25040-3_61

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-25039-7

  • Online ISBN: 978-3-319-25040-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics