Skip to main content
Log in

POINTS OF SIGNIFICANCE

The class imbalance problem

  • This Month
  • Published:

From Nature Methods

View current issue Submit your manuscript

The exception proves the rule.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1: Rebalancing methods applied to a dataset with M = 50 normal and m = 5 diseased cells (IR = 10).
Fig. 2: Benefits of rebalancing classes in the training set by undersampling with increasing imbalance ratio on the performance of a CART decision tree classifier.
Fig. 3: Training with AUROC can provide acceptable performance at IR = 10 without rebalancing.
Fig. 4: Comparison of rebalancing methods on the performance of a CART decision tree classifier.

References

  1. Lever, J., Krzywinski, M. & Altman, N. Nat. Methods 13, 541–542 (2016).

    Article  CAS  Google Scholar 

  2. Krzywinski, M. & Altman, N. Nat. Methods 14, 757–758 (2017).

    Article  CAS  Google Scholar 

  3. Lever, J., Krzywinski, M. & Altman, N. Nat. Methods 13, 603–604 (2016).

    Article  CAS  Google Scholar 

  4. King, G. & Zeng, L. Polit. Anal. 9, 137–163 (2001).

    Article  Google Scholar 

  5. Altman, N. & Krzywinski, M. Nat. Methods 18, 840–842 (2021).

    Article  CAS  Google Scholar 

  6. Chawla, N. V., Bowyer, K. W., Hall, L. O. & Kegelmeyer, W. P. J. Artif. Intell. Res. 16, 321–357 (2002).

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Martin Krzywinski.

Ethics declarations

Competing interests

The authors declare no competing interests.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Megahed, F.M., Chen, YJ., Megahed, A. et al. The class imbalance problem. Nat Methods 18, 1270–1272 (2021). https://doi.org/10.1038/s41592-021-01302-4

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1038/s41592-021-01302-4

  • Springer Nature America, Inc.

This article is cited by

Navigation