Skip to main content
Log in

Accurate data-driven prediction does not mean high reproducibility

  • Comment
  • Published:

From Nature Machine Intelligence

View current issue Submit your manuscript

A valid machine model is predictive, but a predictive model may not be valid. The gap between these two can be larger than many practitioners may expect.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1: Accuracy versus validity.

References

  1. Nat. Genet. 51, 1 (2019).

  2. Runge, J. et al. Nat. Commun. 10, 2553 (2019).

    Article  Google Scholar 

  3. Hussein, A. A. et al. Br. J. Cancer 119, 724–736 (2018).

    Article  Google Scholar 

  4. Tam, V. et al. Nat. Rev. Genet. 20, 467–484 (2019).

    Article  Google Scholar 

  5. Lewis, R. A., Rao, J. M. & Reiley, D. H. in Proc. 20th International Conference on World Wide Web 157–166 (ACM, 2011).

  6. Pearl, J. Causality: Models, Reasoning, and Inference (Cambridge Univ. Press, 2009).

  7. Imbens, G. W. & Rubin, D. B. Causal Inference for Statistics, Social, and Biomedical Sciences: An Introduction (Cambridge Univ. Press, 2015).

  8. Reichstein, M. et al. Nature 566, 195–204 (2019).

    Article  Google Scholar 

  9. Hill, A. B. Proc. R. Soc. Med. 58, 295–300 (1965).

    Google Scholar 

  10. Pearl, J. Commun. ACM 62, 54–60 (2019).

    Article  Google Scholar 

Download references

Acknowledgements

This work has been supported by ARC Discovery Project grant DP170101306 and NHMRC grant 1123042.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Jiuyong Li.

Ethics declarations

Competing interests

The authors declare no competing interests.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Li, J., Liu, L., Le, T.D. et al. Accurate data-driven prediction does not mean high reproducibility. Nat Mach Intell 2, 13–15 (2020). https://doi.org/10.1038/s42256-019-0140-2

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1038/s42256-019-0140-2

  • Springer Nature Limited

This article is cited by

Navigation