Skip to main content
Log in

Severe testing of Benford’s law

  • Original paper
  • Published:
TEST Aims and scope Submit manuscript

Abstract

Benford’s law is often used to support critical decisions related to data quality or the presence of data manipulations or even fraud in large datasets. However, many authors argue that conventional statistical tests will reject the null of data “Benford-ness” if applied in samples of the typical size in this kind of applications, even in the presence of tiny and practically unimportant deviations from Benford’s law. Therefore, they suggest using alternative criteria that, however, lack solid statistical foundations. This paper contributes to the debate on the “large n” (or “excess power”) problem in the context of Benford’s law testing. This issue is discussed in relation with the notion of severity testing for goodness-of-fit tests, with a specific focus on tests for conformity with Benford’s law. To do so, we also derive the asymptotic distribution of the mean absolute deviation (MAD) statistic as well as an asymptotic standard normal test. Finally, the severity testing principle is applied to six controversial large datasets to assess their “Benford-ness”.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7

Similar content being viewed by others

Availability of data and materials

The data used in the paper are publicly available at https://web.williams.edu/Mathematics/sjmiller/public_html/benfordresources/.

Code availability

R scripts are available upon request.

References

Download references

Acknowledgements

We would like to express our gratitude to Aris Spanos for his comments and suggestions on an early draft of this paper. Comments from Marcel Ausloos and two anonymous referees are gratefully acknowledged. We owe a special thank to Alex Kossovsky for having made public his data. All computations have been carried out using R 4.1.2 (R Development Core Team 2021): graphs greatly benefited from package “ggplot2” (Wickham 2016).

Funding

No funds, grants, or other support was received.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Claudio Lupi.

Ethics declarations

Conflict of interest

All authors certify that they have no affiliations with or involvement in any organization or entity with any financial interest or non-financial interest in the subject matter or materials discussed in this manuscript.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Cerqueti, R., Lupi, C. Severe testing of Benford’s law. TEST 32, 677–694 (2023). https://doi.org/10.1007/s11749-023-00848-z

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11749-023-00848-z

Keywords

Mathematics Subject Classification

Navigation