Usability of the Benford’s law for the results of least square estimation

Abstract

Benford’s law (BL), also known as the first-digit or significant-digit law, is an intriguing pattern in data sets, considers the frequency of occurrence of the first digits, which are not uniformly distributed as might be expected, conversely follow a specified theoretical distribution. According to BL, the occurrence of first non-zero digit in a numerical data, which is generated or found in nature, depends on a logarithmic distribution. Least square estimation (LSE) method is mostly preferred for the estimation of the unknown parameters from different types of geodetic data. The residuals and the normalized residuals of the LSE method, which follow normal distribution and expected values of them are zero are used in outlier detection problem. In this study, BL is investigated for residuals and the normalized residuals estimated from LSE method. Three types of geodetic data are used: (1) simulated regression models, (2) global positioning system (GPS) data, (3) leveling network. The first group data sets are simulated based on linear regression and univariate models and each simulated group is generated for a number of 100, 1000, and 10,000 samples. To generate second group, an international global navigation satellite system (GNSS) service (IGS) station data (ISTA) is processed by kinematic PPP approach using GIPSY OASIS II v6.4 software. Here, the observation duration of GPS data is 4 days. For the last data, a leveling network with 55 points involving 110 observations of height differences is simulated. BL has been applied to the residuals (v) and normalized residuals (w) estimated from LSE method. Goodness-of-fit test has been implemented to determine whether a population has a specified BL distribution or not. This test is based on how good a fit we have between the frequency of occurrence of residuals and normalized residuals in an observed sample and the expected frequencies obtained from the hypothesized distribution. The results depending on the statistical test show that each data set (residuals and normalized residuals) used in this study follows BL.

This is a preview of subscription content, log in to check access.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5

References

  1. Ausloos M, Herteliu C, Ileanu B (2015) Breakdown of Benford’s law for birth data. Physica A 419:736–745. https://doi.org/10.1016/j.physa.2014.10.041

    Article  Google Scholar 

  2. Becker PW (1982) Patterns in listings of failure-rate & MTTF values and listings of other data. IEEE Trans Reliab 31:132–134

    Article  Google Scholar 

  3. Benford F (1938) The law of anomalous numbers. Proc Am Philos Soc 78(4):551–573

    Google Scholar 

  4. Berber M (2008) Error analysis of geodetic networks. Shaker Publishing, Maastricht

    Google Scholar 

  5. Berger A, Hill TP (2015) An introduction to Benford’s law. Princeton University Press, Princeton

    Google Scholar 

  6. Berger A, Hill TP, Rogers E (2009) Benford online bibliography. http://www.benfordonline.net. Last accessed 25 Mar 2019

  7. Brown RJC (2005) Benford’s Law and the screening of analytical data: the case of pollutant concentrations in ambient air. Analyst 130:1280–1285. https://doi.org/10.1039/b504462f

    Article  Google Scholar 

  8. Chandra Das R, Sekhar Mishra C, Rajib P (2017) Detection of anomalies in accounting data using Benford’s law: evidence from India. J Soc Sci Stud 4(1):123–139

    Google Scholar 

  9. Formann AK (2010) The Newcomb–Benford law in its relation to some common distributions. PLoS ONE 5(5):e10541. https://doi.org/10.1371/journal.pone.0010541

    Article  Google Scholar 

  10. Hill TP (1995) A statistical derivation of the significant-digit law. Stat Sci 10(4):354–363

    Article  Google Scholar 

  11. Hofmann-Wellenhof B, Lichtenegger H, Collins J (2001) Global positioning system: theory and practice, 5th edn. Springer, New York

    Google Scholar 

  12. Jamain A (2001) Benford’s law. Master’s thesis, Department of Mathematics, Imperial College of London and ENSIMAG, London, UK

  13. Koch KR (1999) Parameter estimation and hypothesis testing in linear models, 2nd edn. Springer, Berlin

    Google Scholar 

  14. Kossovsky AE (2014) Benford’s law theory, the general law of relative quantities and forensic fraud detection applications. World Scientific Publishing Co.Pte.Ltd, Singapore

    Google Scholar 

  15. Mir TA (2012) The law of the leading digits and the world religions. Physica A 391:792–798. https://doi.org/10.1016/j.physa.2011.09.001

    Article  Google Scholar 

  16. Mir TA (2014) The Benford law behavior of the religious activity data. Physica A 408:1–9. https://doi.org/10.1016/j.physa.2014.03.074

    Article  Google Scholar 

  17. Nagasaka K (1984) On Benford’s law. Ann Inst Stat Math 36(Part A):337–352

    Article  Google Scholar 

  18. Newcomb S (1881) Note on the frequency of use of the different digits in natural numbers. Am J Math 4(1):39–40

    Article  Google Scholar 

  19. Sarkar BP (1973) An observation on the significant digits of binomial coefficients and factorials. Sankhyā Indian J Stat Ser B (1960–2002) 35(3):363–364

    Google Scholar 

  20. Teunissen PJG (2003) Adjustment theory an introduction. Delft University Press, Delft

    Google Scholar 

  21. Walpole RE, Myers RH, Myers SL (1998) Probability and statistics for engineers and scientists, 6th edn. Prentice Hall, Upper Saddle River

    Google Scholar 

  22. Whyman G, Ohtori N, Shulzinger E, Bormashenko E (2016) Revisiting the Benford law: when the Benford-like distribution of leading digits in sets of numerical data is expectable? Physica A 461:595–601

    Article  Google Scholar 

  23. Wlodarski J (1971) Fibonacci and Lucas numbers tend to obey Benford’s law. Fibonacci Q 9:87–88

    Google Scholar 

Download references

Author information

Affiliations

Authors

Corresponding author

Correspondence to Nursu Tunalioglu.

Rights and permissions

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Cite this article

Tunalioglu, N., Erdogan, B. Usability of the Benford’s law for the results of least square estimation. Acta Geod Geophys 54, 315–331 (2019). https://doi.org/10.1007/s40328-019-00259-3

Download citation

Keywords

  • Benford’s law
  • Frequency of occurrence
  • Residuals and normalized residuals
  • Goodness-of-fit test
  • Least square estimation