
Imbalanced single-cell data integration leads to loss of biological information

  • Research Briefing

From Nature Biotechnology


The Iniquitate pipeline assessed the impacts of cell-type imbalance on single-cell RNA sequencing integration by perturbing dataset balance. The results indicated that cell-type imbalance not only leads to a loss of biological signal in the integrated space but can also change the interpretation of downstream analyses after integration.
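To make the idea of perturbing dataset balance concrete, the following is a minimal sketch and not the Iniquitate implementation. It assumes each cell is described only by a batch label and a cell-type label, and it downsamples (or, with a keep fraction of 0, removes) one cell type in one batch before any integration method is run. The function name `perturb_balance`, its parameters, and the toy labels are hypothetical and chosen for illustration only.

```python
import random
from collections import Counter


def perturb_balance(cells, batch, cell_type, keep_fraction, seed=0):
    """Return indices of cells kept after downsampling one cell type in one batch.

    cells: list of (batch_label, cell_type_label) tuples, one per cell.
    keep_fraction: 0.0 removes the cell type from the batch entirely;
    values between 0 and 1 downsample it. (Hypothetical helper, for illustration.)
    """
    if not 0.0 <= keep_fraction <= 1.0:
        raise ValueError("keep_fraction must be in [0, 1]")
    rng = random.Random(seed)
    # Indices of the cells targeted by the perturbation
    target = [i for i, (b, t) in enumerate(cells) if b == batch and t == cell_type]
    n_keep = round(len(target) * keep_fraction)
    kept_target = set(rng.sample(target, n_keep))
    target_set = set(target)
    # Keep every untargeted cell, plus the sampled subset of targeted cells
    return [i for i in range(len(cells)) if i not in target_set or i in kept_target]


# Toy data (assumed): two batches with identical cell-type composition
cells = (
    [("batch_a", "T cell")] * 100
    + [("batch_a", "B cell")] * 100
    + [("batch_b", "T cell")] * 100
    + [("batch_b", "B cell")] * 100
)

# Downsample B cells in batch_b to 10% to create an imbalanced pair of batches
kept = perturb_balance(cells, batch="batch_b", cell_type="B cell", keep_fraction=0.1)
counts = Counter(cells[i] for i in kept)

# Remove B cells from batch_b entirely
ablated = perturb_balance(cells, batch="batch_b", cell_type="B cell", keep_fraction=0.0)
ablated_counts = Counter(cells[i] for i in ablated)
```

Comparing integration results on the original and perturbed inputs is one way to examine how imbalance affects the integrated space; the choice of keep fraction and of which cell type to perturb here is arbitrary.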


Fig. 1: The Iniquitate pipeline, experiments and analyses.

References

  1. Svensson, V., da Veiga Beltrame, E. & Pachter, L. A curated database reveals trends in single-cell transcriptomics. Database 2020, baaa073 (2020). A database that curates papers that use single-cell RNA sequencing technology and tracks key factors such as tissue type, techniques and number of cells sequenced.


  2. Argelaguet, R. et al. Computational principles and challenges in single-cell data integration. Nat. Biotechnol. 39, 1202–1215 (2021). This review paper presents current paradigms in single-cell data integration, outstanding challenges and future directions.


  3. Tran, H. T. N. et al. A benchmark of batch-effect correction methods for single-cell RNA sequencing data. Genome Biol. 21, 12 (2020). This paper is a comprehensive benchmarking analysis of single-cell RNA sequencing integration methods across various data scenarios.


  4. Luecken, M. D. et al. Benchmarking atlas-level data integration in single-cell genomics. Nat. Methods 19, 41–50 (2022). A comprehensive benchmark that expands on the work done by Tran et al. by incorporating more methods, modalities and preprocessing parameters, and larger atlas-level datasets.



Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

This is a summary of: Maan, H. et al. Characterizing the impacts of dataset imbalance on single-cell data integration. Nat. Biotechnol. https://doi.org/10.1038/s41587-023-02097-9 (2024).


About this article


Cite this article

Imbalanced single-cell data integration leads to loss of biological information. Nat Biotechnol (2024). https://doi.org/10.1038/s41587-023-02114-x


  • DOI: https://doi.org/10.1038/s41587-023-02114-x

  • Springer Nature America, Inc.
