Skip to main content
Log in

Misuse of null hypothesis significance testing: would estimation of positive and negative predictive values improve certainty of chemical risk assessment?

  • Research Article
  • Published:
Environmental Science and Pollution Research Aims and scope Submit manuscript

Abstract

Although generally misunderstood, the p value is the probability of the test results or more extreme results given H0 is true: it is not the probability of H0 being true given the results. To obtain directly useful insight about H0, the positive predictive value (PPV) and the negative predictive value (NPV) may be useful extensions of null hypothesis significance testing (NHST). They provide information about the probability of statistically significant and non-significant test outcomes being true based on an a priori defined biologically meaningful effect size. The present study explores the utility of PPV and NPV in an ecotoxicological context by using the frequently applied Daphnia magna reproduction test (OECD guideline 211) and the chemical stressor lindane as a model system. The results indicate that especially the NPV deviates meaningfully between a test design strictly following the guideline and an experimental procedure controlling for α and β at the level of 0.05. Consequently, PPV and NPV may be useful supplements to NHST that inform the researcher about the level of confidence warranted by both statistically significant and non-significant test results. This approach also reinforces the value of considering α, β, and a biologically meaningful effect size a priori.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 1
Fig. 2
Fig. 3

Similar content being viewed by others

References

  • Altman DG, Bland JM (1994) Diagnostic tests 2: predictive values. British Med J 309:102

    Article  CAS  Google Scholar 

  • Altman DG, Machin D, Bryant TN, Gardner MJ (2000) Statistics with confidence, 2nd edn. BMJ Books, Bristol

    Google Scholar 

  • Antunes SC, Castro BB, Goncalves F (2004) Effect of food level on the acute and chronic responses of daphnids to lindane. Environ Pollut 127:367–375

    Article  CAS  Google Scholar 

  • ASTM (2007) ASTM E729-96: Standard guide for conducting acute toxicity tests on test materials with fishes, macroinvertebrates, and amphibians. ASTM International, West Conshohocken, PA, 2007, doi:10.1520/E0729-96R07

  • Bundschuh M, Zubrod JP, Seitz F, Newman MC, Schulz R (2011) Mercury-contaminated sediments affect amphipod feeding. Arch Environ Contam Toxicol 60:437–443

    Article  CAS  Google Scholar 

  • Crane M, Newman MC (2000) What level of effect is a no observed effect? Environ Toxicol Chem 19:516–519

    Article  CAS  Google Scholar 

  • DeCoen WM, Janssen CR (1997) The use of biomarkers in Daphnia magna toxicity testing. IV. cellular energy allocation: a new methodology to assess the energy budget of toxicant-stressed Daphnia populations. J Aquat Ecosyst Stress Recovery 6:43–55

    Article  CAS  Google Scholar 

  • Dunnett CW (1955) A multiple comparison procedure for comparing several treatments with a control. J Am Stat Assoc 50:1096–1121

    Article  Google Scholar 

  • Ferrando MD, Sancho E, Andreumoliner E (1995) Effects of lindane on Daphnia magna during chronic exposure. J Environ Sci Health Part B 30:815–825

    Article  Google Scholar 

  • Fox DR (2009) Is the ECx a legitimate surrogate for a NOEC? Integr Environ Assess Manag 5:351–353

    Article  Google Scholar 

  • Fox DR (2010) A Bayesian approach for determining the no effect concentration and hazardous concentration in ecotoxicology. Ecotoxicol Environ Saf 73:123–131

    Article  CAS  Google Scholar 

  • Gigerenzer G (2004) Mindless statistics. J Socio-Econom 33:587–606

    Article  Google Scholar 

  • Ioannidis JPA (2005) Why most published research findings are false. Plos Med 2:696–701

    Google Scholar 

  • Jager T (2012) Bad habits die hard: the NOEC's persistence reflects poorly on ecotoxicology. Environ Toxicol Chem 31:228–229

    Article  CAS  Google Scholar 

  • Kline RB (2004) Beyond significance testing: reforming data analysis methods in behavioral research. American Psychological Association, Washington

    Book  Google Scholar 

  • Landis WG, Chapman PM (2011). Well past time to stop using NOELs and LOELs. Integr Environ Assess Manag 7:vi-viii

  • Mudge JF, Baker LF, Edge CB, Houlahan JE (2012) Setting an optimal alpha that minimizes errors in null hypothesis significance tests. PlosOne 7:e32734

    Article  CAS  Google Scholar 

  • Nakagawa S, Cuthill IC (2007) Effect size, confidence interval and statistical significance: a practical guide for biologists. Biol Rev 82:591–605

    Article  Google Scholar 

  • Nakagawa S, Forster TM (2004) The case against retrospective statisical power analyses with an introduction to power analysis. Acta etholog 7:103–108

    Article  Google Scholar 

  • Newman MC (2008) "What exactly are you inferring?" A closer look at hypothesis testing. Environ Toxicol Chem 27:1013–1019

    Article  CAS  Google Scholar 

  • Newman MC (2013) Quantitative ecotoxicology. CRC/Taylor & Francis, Boca Raton

    Google Scholar 

  • OECD (2008) OECD 211: Daphnia magna reproduction test. OECD Publishing, Paris

    Book  Google Scholar 

  • Preuss TG, Hammers-Wirtz M, Ratte HT (2010) The potential of individual based population models to extrapolate effects measured at standardized test conditions to relevant environmental conditions-an example for 3,4-dichloroaniline on Daphnia magna. J Environ Monit 12:2070–2079

    Article  CAS  Google Scholar 

  • van der Hoeven N (1998) Power analysis for the NOEC: what is the probability of detecting small toxic effects on three different species using the appropriate standardized test protocols? Ecotoxicology 7:355–361

    Article  Google Scholar 

  • Wacholder S, Chanock S, Garcia-Closas M, El Ghormli L, Rothman N (2004) Assessing the probability that a positive report is false: an approach for molecular epidemiology studies. J Nation Cancer Inst 96:434–442

    Article  Google Scholar 

  • Zubrod JP, Bundschuh M, Feckler A, Englert D, Schulz R (2011) Ecotoxicological impact of the fungicide tebuconazole on an aquatic decomposer-detritivore system. Environ Toxicol Chem 30:2718–2724

    Article  CAS  Google Scholar 

Download references

Acknowledgments

The authors acknowledge S. Hartmann, T. Schell, and S. Schneider for actually performing the experiments as well as the Fix-Stiftung Landau for the financial support regarding the research infrastructure. J.P. Zubrod received funding through a scholarship of the German Federal Environmental Foundation (Deutsche Bundesstiftung Umwelt).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Mirco Bundschuh.

Additional information

Responsible editor: Philippe Garrigues

Rights and permissions

Reprints and permissions

About this article

Cite this article

Bundschuh, M., Newman, M.C., Zubrod, J.P. et al. Misuse of null hypothesis significance testing: would estimation of positive and negative predictive values improve certainty of chemical risk assessment?. Environ Sci Pollut Res 20, 7341–7347 (2013). https://doi.org/10.1007/s11356-013-1749-z

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11356-013-1749-z

Keywords

Navigation