Immune cell identifier and classifier (ImmunIC) for single cell transcriptomic readouts

Park, Sung Yong; Ter-Saakyan, Sonia; Faraci, Gina; Lee, Ha Youn

doi:10.1038/s41598-023-39282-4

Immune cell identifier and classifier (ImmunIC) for single cell transcriptomic readouts

Article
Open access
Published: 26 July 2023

Volume 13, article number 12093, (2023)
Cite this article

Download PDF

You have full access to this open access article

Scientific Reports

Immune cell identifier and classifier (ImmunIC) for single cell transcriptomic readouts

Download PDF

Sung Yong Park¹,
Sonia Ter-Saakyan¹,
Gina Faraci¹ &
…
Ha Youn Lee¹

3825 Accesses
1 Citation
1 Altmetric
Explore all metrics

Abstract

Single cell RNA sequencing has a central role in immune profiling, identifying specific immune cells as disease markers and suggesting therapeutic target genes of immune cells. Immune cell-type annotation from single cell transcriptomics is in high demand for dissecting complex immune signatures from multicellular blood and organ samples. However, accurate cell type assignment from single-cell RNA sequencing data alone is complicated by a high level of gene expression heterogeneity. Many computational methods have been developed to respond to this challenge, but immune cell annotation accuracy is not highly desirable. We present ImmunIC, a simple and robust tool for immune cell identification and classification by combining marker genes with a machine learning method. With over two million immune cells and half-million non-immune cells from 66 single cell RNA sequencing studies, ImmunIC shows 98% accuracy in the identification of immune cells. ImmunIC outperforms existing immune cell classifiers, categorizing into ten immune cell types with 92% accuracy. We determine peripheral blood mononuclear cell compositions of severe COVID-19 cases and healthy controls using previously published single cell transcriptomic data, permitting the identification of immune cell-type specific differential pathways. Our publicly available tool can maximize the utility of single cell RNA profiling by functioning as a stand-alone bioinformatic cell sorter, advancing cell-type specific immune profiling for the discovery of disease-specific immune signatures and therapeutic targets.

Systematic comparison of high-throughput single-cell RNA-seq methods for immune cell profiling

Article Open access 20 January 2021

Combined Measurement of RNA and Protein Expression on a Single-Cell Level

Protocol for Classification Single-Cell PBMC Types from Pathological Samples Using Supervised Machine Learning

Introduction

Single cell RNA sequencing is an invaluable tool for immune profiling, providing the transcriptomic landscape of thousands of individual cells. This method has identified specific immune cells as disease markers^1,2 and suggested potential therapeutic target genes of disease-associated immune cells³. The utility of single cell RNA sequencing has also been demonstrated by investigating immune and non-immune cell communication in the tumor microenvironment⁴. Furthermore, this single cell approach has designated immune signatures of disease severity, as reported in recent COVID-19 studies^5,6. Taken together, single cell RNA sequencing has advanced our understanding of the immune system and aided in drug discovery for diverse diseases.

High-resolution immune profiling requires reliable cell-type classification. However, gene expression patterns within a single immune cell type can be heterogeneous with different study conditions. This heterogeneity hinders consistent cell type annotation from single cell RNA sequencing data. The current approach of unsupervised clustering⁷ is sensitive to these study-specific gene expression profiles. Furthermore, ad hoc annotation for each cluster is a non-standardized step and thus cell assignment outcomes may not be reproducible. Alternative approaches for immune cell-type assignment include Garnett⁸, CellAssign⁹, Cell BLAST¹⁰, CellTypist¹¹, scGate¹², and scType¹³. However, these methods have not been validated by a large-scale data of diverse immune cells collected from many different studies.

We here present ImmunIC (Immune cell Identifier and Classifier) as an accurate and automated tool for human immune cell classification. To properly address diverse study-specific gene expression signatures, we first compiled 66 independent single cell RNA sequencing studies and collect over two million immune cells and a half million non-immune cells. We took advantage of the predetermined leukocyte gene signature matrix¹⁴ (LM22) to filter immune cells from a mixture of immune and non-immune cells. Using the maximum correlation approach similar to SingleR¹⁵, we subsequently categorized the identified immune cells into B cells, plasma cells, T cells, NK cells, monocytes, dendritic cells, macrophages, neutrophils and other myeloid cells. We then added a machine learning method called Xgboost¹⁶ to further enhance the classification resolution between CD4+ and CD8+ T cells. We extensively benchmarked the accuracy of ImmunIC against the currently available methods. We also demonstrated ImmunIC’s utility by identifying immune cell-type specific differential pathways from previously published peripheral blood mononuclear cell (PBMC) data of severe COVID-19 cases and healthy controls⁵.

Results

Single cell RNA sequencing datasets

We compiled publicly available single cell RNA sequence datasets from 66 different studies, as presented in Tables 1 and 2. The total datasets consisted of 2,078,671 immune cells and 509,300 non-immune cells. These data were obtained using different platforms: (i) 10 × Genomics¹⁷, (ii) Smart-seq2¹⁸, (iii) MARS-seq¹⁹, (iv) Seq-Well²⁰, (v) Drop-seq²¹ and BD Rhapsody Single-Cell Analysis System²². The immune cell group included 999,462 PBMCs from 173 individuals⁵ and 81,713 PBMCs from 20 individuals²³. The lymphocyte group consists of 332,336 B cells from four different studies^5,17,24,25, 42,777 plasma cells²⁶, 100,411 T cells^5,27, 173,996 CD4+ T cells^{17,28,29,30,31,32,33}, 156,960 CD8+ T cells^{17,30,34,35,36}, and 28,570 NK cells^17,37,38. In the immune cell group, there were 23,705 myeloid cells^39,40,41, 49,687 monocytes^{17,42,43,44,45,46}, 7,739 dendritic cells^43,47,48, 999 macrophages⁴⁹, and 80,316 neutrophils^46,50. Immune cell types were determined polychromatic flow cytometry-based immunophenotyping measures in the original publications. Table 1 presents each study’s accession number, source publication and number of cells. The non-immune cell group consists of 22 different cell types from 26 studies, including 14,537 intestine cells⁵¹, 4,524 kidney cells⁵², 2,249 neuroblastoma cells⁵³, and 5,680 breast cancer cells⁵⁴ (Table 2).

Table 1 ImmunIC’s accuracy on single cell RNA sequencing datasets of immune cells.

Full size table

Table 2 Single cell RNA sequencing datasets of non-immune cells.

Full size table

Immune cell identification by ImmunIC

We identified immune cells using the Leukocyte signature matrix (LM22) which consists of 547 marker genes¹⁴, as summarized in ImmunIC’s workflow (see Supplementary Fig. S1). Each immune cell type has a unique gene marker combination in this signature matrix¹⁴. We observed that many genes in LM22 were uniquely expressed in each cell type at single cell resolution and these genes’ expressions were fairly low in non-immune cells (Fig. 1a and Supplementary Fig. S2). Therefore, we measured the correlation coefficient between each cell’s gene expression and each LM22 profile. B cells showed an average Pearson’s correlation coefficient of 0.31 with LM22’s B cell profile, but had less than 0.1 to T cell, NK cell, and myeloid cell profiles (Fig. 1b). Likewise, T cells, NK cells and myeloid cells showed the greatest correlation coefficient to their own cell types’ profiles (Fig. 1b). However, non-immune cells such as breast cancer cells, neuroblastoma cells, intestine epithelial cells and retina cells showed comparable correlations (< 0.2) with all immune cell types (Fig. 1b). By defining the maximum correlation coefficient as the highest coefficient to LM22, we compared its distribution of around two million immune cells with that of a half-million non-immune cells (Fig. 1c). We observed a clear difference between the immune and non-immune cells, suggesting that the maximum correlation coefficient is a robust metric not only for annotating immune cell type but also for differentiating immune and non-immune cells with single-cell transcriptomic data.

To further increase immune cell identification power, we introduced an additional metric of total immune gene expression, defined as the sum of each cell’s LM22 gene expressions. As expected, immune cells’ total immune gene expressions were much greater than non-immune cells (Fig. 1d). The immune and non-immune cells were clearly divided in the plane of the maximum correlation coefficient and total immune gene expression (Fig. 1e). Using a line as the boundary (dotted line in Fig. 1e), we were able to achieve 97.7% [97.67–97.71%] of sensitivity and 98.3% [98.23–98.30%] of specificity (see the Receiver Operating Characteristic (ROC) curve in Fig. 1f).

ImmunIC’s identification power surpassed the conventional clustering method⁷. While immune cells were separated from non-immune cells in the reduced dimensional space by clustering, around 75% of pancreatic cells from one study⁵⁵ clustered with myeloid cells and PBMCs (Fig. 1g). Around 32% of brain tumor cells and 27% of breast cancer cells were also grouped with immune cells. Additionally, plasma cells formed a separate cluster from other immune cells (dotted circle in Fig. 1g). These limitations clearly suggest that the conventional clustering approach is not desirable for the identification of immune cells from multicellular specimens including non-immune cells. Usage of the leukocyte gene signature matrix¹⁴ provided systematic identification of immune cells by capturing transcriptomic signatures of immune cells that are conserved across different datasets.

Immune cell type annotation by ImmunIC

ImmunIC assigns an immune cell as either B cell, plasma cell, T cell, NK cell, monocyte, dendritic cell, macrophage, neutrophil, or other myeloid cell group based on the maximum correlation coefficient to LM22. When a cell is assigned to T cell, ImmunIC further classifies into either CD4+ or CD8+ T cell by an Xgboost classifier¹⁶. Our simple and automated workflow showed 91.6% [91.6–91.7%] of immune cell-type classification accuracy for around one million cells. As presented in Fig. 2a and Table 1, the classification accuracy ranged from 70.1% to 99.8% for 42 datasets collected from 29 different studies. Over 300,000 B cells from two independent studies (BC-1 and BC-3) were labeled as B cells with around 99% accuracy. However, the accuracy for plasma cells was lower since around 20% of these were annotated as non-immune cells. The classification accuracy of CD4+ and CD8+ T cells was in the range of 82% and 98%. Notably, around 30,000 NK cells showed more than 96% accuracy across three independent studies. On contrary, around 8% of 4,434 myeloid cells (Myeloid-1) were misclassified as non-immune cells and 11% and 8% of those were misclassified as B cells and T cells, respectively. The confusion matrix of ImmunIC was presented in Supplementary Table S1. Although there is variation in the classification accuracy, the overall accuracy of ImmunIC was over 90% across one million single cells.

The addition of an Xgboost classifier enhanced CD4+ and CD8+ T cell classification accuracy from 78 to 93%, on average (p < 0.001, Fig. 2b). To address gene expression profile heterogeneity across different studies, we trained a classifier with 10,000 CD4+ T cells that were randomly selected from ten different studies (from CD4-1 to CD4-10 in Table 1) and 5,000 CD8+ T cells from five studies (from CD8-1 to CD8-5). We further examined the Xgboost classifier’s accuracy by five-fold cross-validation. The average classification accuracy over five independent tests was highly desirable, ranging from 93.6% to 99.9% (Supplementary Table S2). We also tested two additional datasets, 1,886 CD4+ T cells (CD4-11) and 69,457 CD8+ T cells (CD8-6), which were not included in any of our training sets. Our trained models showed 99.8% accuracy for CD4-11 and 99.7% accuracy for CD8-6.

A total of 334 genes were identified as important features that include CD4+ T cell markers (CD4, CD40LG, IL2 and TNFRSF4) and CD8+ T cell markers (CD8A, CD8B, CTSW, and NKG7), as presented in Fig. 2c. Important features also included genes that have not been linked to CD4+ and CD8+ T cell markers (Fig. 2c). We found that the designated genes were associated with differential signaling pathways of CD4+ and CD8+ T cells, including Th1 and Th2 Activation pathway with 9.3% coverage of DEGs (CD4, CD40LG, CD86, CD8A, GRB2, HLA-B, HLA-DRB5, IKZF1, IL2, IL2RA, IL2RG, JUN, KLRD1, S1PR1, TGFB1, and TNFRSF4), IL-2 Signaling pathway with 11.5% coverage (FOS, GRB2, IL2,I L2RA, IL2RG, JUN, and NRAS), and Granzyme A Signaling pathway with 10.5% coverage (H1-3 and H1-4).

Comparison of ImmunIC with other immune cell classifiers

We directly compared ImmunIC’s performance with the conventional clustering method⁷. We conducted unsupervised clustering on the 42 immune cell datasets collected from 29 different studies (from BC-1 to Neutro-2 in Table 1). We observed that CD4+ and CD8+ T cells were mixed together in the reduced dimension. As shown in Fig. 3a, four groups of CD4+ T cells (CD4-3, CD4-5, CD4-6, and CD5-7 in Table 1) and two groups of CD8+ T cells (CD8-2 and CD8-5 in Table 1) were clustered together. Note that these six groups of cells are from a single study¹⁷ and these cells were grouped together by this study’s gene markers, rather than by CD4+ and CD8+ T cell differential markers. Our observation was consistent with a recent study reporting that CD4+ and CD8+ T cells were not clearly separated at a single-cell transcriptome level⁷. Similarly, one group of dendritic cells (DC-2) was clustered with monocytes (MONO-3) from the same study⁴³, not with dendritic cells from other studies (Fig. 3a). Taken together, study-specific gene markers are important factors for determining clustering outcomes and we were not able to obtain cell-type specific clusters with the conventional clustering method.

Study-specific gene expression signatures dictated clustering outcomes even when only two datasets were analyzed together. Clustering of 2,700 PBMCs from an individual previously identified 9 immune cell clusters including B cells, CD8+ T cells and NK cells⁵⁶. When we added B cells from another study⁵, these B cells did not belong to the PBMCs’ B cell cluster (C4-BC-1 in Fig. 3b) but rather formed a separate cluster (BC-2 in Fig. 3b). This was because the added B cells had markers that defined a separate cluster, as shown in a heatmap of differentially expressed genes (DEGs) among clusters (Fig. 3c). The added B cells showed high expressions for B cell markers, CD79A and CD79B. However, other genes such as RACK1 and RPL39 were uniquely expressed, creating a separate cluster (Fig. 3c). Figure 3d compared expressions of six genes between B cells of the PBMC cluster and the added B cells. Likewise, NK cells from other study³⁸ did not cluster with the PBMCs’ NK cells (Supplementary Fig. S3). This clustering-based two-dimensional representation failed to group together the same type of immune cells from different studies, limiting its annotation capacity.

We next compared ImmunIC’s performance with three other immune cell classifiers: Garnett⁸, Cell BLAST¹⁰, and CellAssign⁹. Figure 3e presented each method’s minimum and maximum classification accuracy across different studies in each of seven cell groups (see Table 1 and Supplementary Table S3). ImmunIC outperformed other methods for both T cell and CD8+ T cell assignments (Fig. 3e). ImmunIC showed 92% classification accuracy for CD8+ T cells on average while the accuracy of Garnett (53%), Cell BLAST (49%), and CellAssign (75%) was significantly lower for the same cell population. Notably, the other methods showed a large variability in accuracy across different single cell RNA sequencing datasets. The minimum classification accuracy for CD8+ T cells was 39% for Garnett, 0% for Cell BLAST and 6% for CellAssign. These values were considerably lower than ImmunIC’s minimum accuracy of 82%. ImmunIC also performed better in designating monocytes and dendritic cells than Garnett and Cell BLAST (Fig. 3e and Supplementary Table S3). ImmunIC’s accuracy for six groups of monocytes ranged from 70 to 100%, which was higher than that of Garnett (from 66 to 92%) and Cell BLAST (from 19 to 99%). The minimum accuracy for three groups of dendritic cells was 85% for ImmunIC, 57% for Garnett and 0.1% for Cell BLAST. Note that CellAssign does not designate monocytes and dendritic cells as a separate population. Overall, ImmunIC showed the best performance in immune cell classification with its average accuracy of 93%, compared to the existing algorithms of Garnett (84%), Cell BLAST (74%), and CellAssign (69%).

PBMC classification by ImmunIC

ImmunIC analyzed previously published single cell RNA sequencing data of 48 individuals who were at the severe progression state of COVID-19 infection and 20 healthy controls⁵ and determined the PBMC composition of these individuals. Figure 4a shows the PBMC profiles measured from around 6,000 PBMCs from each of 68 individuals. As in Fig. 4b, there was a greater than tenfold increase of plasma cells in the severe group (0.13% vs. 2.2%, p < 0.001), as reported in the original publication⁵. While the proportion of B cells was elevated in severe patients (7.5% vs. 17.4%, p = 0.0015), the percentage of lymphocytes was significantly decreased (78.6% vs. 65.8%, p = 0.039). The observed decrease is in line with lymphopenia, a hallmark of severe COVID-19⁵⁷. In particular, both CD8+ T cells and NK cells were significantly reduced within the PBMC population of severe COVID-19 patients (40.3% vs. 19.5%, p < 0.001 and 18.1% vs. 10.6%, p < 0.001). The percentage of dendritic cells was also lower in the severe group than the control (0.6% vs. 0.16%, p < 0.001). Severe COVID-19 cases’ PBMC profiles determined by ImmunIC agreed with polychromatic flow cytometry-based immunophenotyping measures⁵⁸.

We then conducted functional pathway analysis on each immune cell population using the Ingenuity Pathway Analysis (IPA) program. Figure 4c shows the top 30 upregulated DEGs of macrophages in the severe group versus the control. Inflammatory genes including IFI27, FOS, JUN and NKFBIA were significantly upregulated in the severe group. Signaling pathway analysis highlighted that pro-inflammatory pathways are upregulated in the macrophages of the severe group, including IF-17A pathways (S100A8, S100A9, FOS, JUN, NKFBIA), TNFR2 Signaling pathway (FOS, JUN, NFKBIA), and Interferon Signaling pathway (IFI6, IFITM1, and IFITM3). Figure 4d plotted the severe group’s differential pathways with greater than 5% coverage of macrophages’ DEGs (see Supplementary Table S4 for more details). In addition, EIF2 Signaling pathway was enriched in severe cases with 14.3% coverage of differential genes such as RPL10, RPL11, and RPL12.

Monocytes identified by ImmunIC showed both similar and dissimilar transcriptomic signatures of COVID-19 severity compared to macrophages. Role of Hypercytokinemia/hyperchemokinemia in the Pathogenesis of Influenza pathway was found to be significantly upregulated in the severe group’s monocytes with DEGs of AREG, CCL3, CCL4, CXCL8, IL1B, and ISG15 (coverage = 7% and p < 0.001). In addition, Airway Inflammation in Asthma pathway (CXCL8 and RNASE2) was enhanced in the monocyte population. Similar to macrophages, monocytes in the severe group showed enhanced Interferon Signaling pathway with DEGs of IFI6, IFITM1, IFITM3, and ISG15 (coverage = 11%, p < 0.001). Supplementary Table S5 lists monocyte-specific differential pathways of COVID-19 severe cases.

Discussion

ImmunIC (Immune cell Transcriptome Classifier) is a simple and automated tool for immune cell identification and classification from a mixture of immune and non-immune cells. ImmunIC showed significant consistency in immune cell identification and classification accuracy across 66 independent single cell RNA sequencing studies. ImmunIC’s two metrics, maximum correlation coefficient and total immune gene expression, provided a high immune cell identification accuracy of 98%. We then used the maximum correlation coefficient to assign immune cell type and further increased the accuracy using an Xgboost classifier. The accuracy of ImmunIC’s immune cell type classification was around 92% for over one million immune cells.

ImmunIC outperformed existing immune cell classifiers^7,8,9,10. ImmunIC was robust in assigning a massive number of diverse immune cells with the minimum of 70% accuracy. When the existing methods were benchmarked with the same datasets, we observed a large variability in accuracy, marking the minimum accuracy of 40% (Garnett⁸), 0% (Cell BLAST¹⁰), and 1% (CellAssign⁹). In order to properly handle highly variable study-specific single cell transcriptomic signatures, we trained our classification model with datasets collected from multiple studies. Therefore, our model can be readily used without requiring further trainings. Indeed, ImmunIC showed a high level of classification accuracy for datasets which were not included in our training.

Our classifier clearly showed a capacity to sort immune cells directly from single cell transcriptomic readouts. Healthy individuals’ PBMC compositions determined by our immune cell classifier agreed with previous phenotypic measurements. We estimated the frequency of lymphocytes (B cells, T cells and NK cells) as 78.6%, which is consistent with that measured by polychromatic flow cytometry⁵⁹. Among lymphocytes, B cells, T cells and NK cells were estimated to be 9.9%, 67.2% and 23%, respectively. The average frequency of monocytes (19.9%) was also consistent with previous reports⁵⁹. The proportion of dendritic cells was estimated to range from 0% to 1.1% among 20 healthy individuals, which overlaps with the immunophenotypic measure of 1–2%⁵⁹.

We demonstrated the utility of ImmunIC by performing signaling pathway analysis of a specific immune cell population. We identified functional pathways in macrophages that are upregulated among patients with severe COVID-19, compared to those of healthy controls. Most of the identified pathways, including TNFR2 Signaling, B cell Activating Factor Signaling, and IL-17A Signaling, showed upregulation of three key inflammatory genes: JUN, FOS, and NFKB1A. These pathways are critical to the activation of macrophages and induction of inflammatory responses to COVID-19 infection^60,61. In addition, Interferon Signaling pathway was enhanced in the macrophage population, as previously reported⁶¹. We also observed that EIF2 Signaling (associated with mRNA translation) is significantly upregulated in the macrophages of the severe group. Upregulation of EIF2 Signaling was previously reported in severe patients’ lung specimens where macrophage infiltration was confirmed⁶². We found that Role of Hypercytokinemia/hyperchemokinemia in the Pathogenesis of Influenza pathway was enhanced in severe cases’ monocytes, corroborating a study of the monocyte-driven cytokine storm in SARS-CoV-2 infections⁶³. Taken together, our high-resolution classifier allows immune cell-type specific functional pathway analysis.

ImmunIC achieved accurate sorting of immune cells into ten distinct categories, including a clear separation between CD4+ and CD8+ T cells. However, further annotation into more specific immune cell types poses challenges. For instance, only 60% of memory B cells (BC-2) were assigned as memory B cells by the maximum correlation to LM22, while ImmunIC labeled 92% of these cells as B cells. Similarly, only three percent of regulatory CD4+ T cells (CD4-5) were correctly classified as regulatory cells. Therefore, we limited the number of cell types as 10 based on LM22’s 22 immune cell type profiles. As more data becomes available, the Xgboost classifier can be expanded to annotate subcategories of each immune cell. Alternatively, novel marker genes could be proposed to enhance the resolution for identifying more diverse immune cell types.

Our immune cell classification method is a time-efficient and accurate approach to designate ten immune cell populations from multicellular blood and tissue specimens. ImmunIC takes a single input of a gene count matrix, obtained from diverse single cell RNA sequencing platforms, to identify immune cells and determine immune cell composition by a single command line. In this way, ImmunIC is highly standardized and reproducible, not requiring any batch-specific modification on input data. ImmunIC can therefore serve as a reliable cell sorter for cell-type specific immune profiling.

Methods

Immune cells and non-immune cells’ single cell RNA sequencing data

Human single cell RNA sequencing data of 2,078,671 immune and 509,300 non-immune cells were collected from 66 previous publications. B cells analyzed in this study include a group of memory B cells (BC-2). CD4+ T cells include CD4+ Naïve T cells (CD4-6), CD4+ Memory T cells (CD4-7 and CD4-8) and regulatory CD4+ T cells (CD4-5, CD4-9 and CD4-10). The dataset also includes sorted CD8+ Naïve T cells (CD8-5). Monocytes include both classical and non-classical monocytes. Table 1 summarizes each study’s accession number, number of cells, and ImmunIC’s classification accuracy for immune cells. We analyzed 22 different cell types of non-immune cells: intestine cells, kidney cells, airway epithelial cells, prostate cells, oligodendrocytes, pancreatic islets cells, keratinocytes, neuroblastoma cells, retina cells, melanoma cells, entorhinal cortex cells, liver progenitor-like cells, mesothelial cells, breast epithelial cells, neuroendocrine tumor cells, oocytes, fibroblasts, oligodendroglioma cells, medulloblastoma cells, brain tumor cells, breast cancer cells and substantia nigra cells (Table 2).

Maximum correlation coefficient and total immune gene expression of ImmunIC

Each study’s raw gene count matrix from single cell RNA sequencing data was used as input. Cells were filtered out when the total count was less than 500. Subsequently, each cell’s total count was normalized to 10,000. We first measured Pearson’s correlation coefficients of each cell’s gene expression (normalized count) to LM22’s 22 cell-type gene expression profiles. The maximum correlation coefficient is the maximum of these 22 values. For instance, 49 genes including ABCB9, MANEA, and MZB1 are marked as 1 and other 498 genes are marked as 0 in LM22’s plasma cell profile¹⁴. The total immune gene expression of each cell is defined as the sum of each cell’s 547 gene expressions of LM22. Here the total immune gene expression denotes the relative proportion of LM22’s immune gene expression since each cell’s total count is normalized to 10,000.

The maximum correlation coefficient and total immune gene expression were used as two metrics for immune cell identification. An input cell was designated as an immune cell when a linear sum of the maximum correlation coefficient (${\rho }_{\mathrm{max}})$ and total immune gene expression ($T$) is greater than a threshold ($\theta )$,

$$A\times {\rho }_{\mathrm{max}}+T> \theta .$$

(1)

By maximizing the sensitivity (proportion of immune cells that are classified as immune cells) and specificity (proportion of non-immune cells that are classified as non-immune cells) of 2,078,671 immune and 509,300 non-immune cells, $A$ was determined as 1580.7 and $\theta$ as 413 with the area under the ROC curve of 0.987.

When an input cell is classified as an immune cell by Eq. (1), it is further categorized into B cell, plasma cell, T cell, NK cell, monocyte, macrophage, dendritic cell, neutrophil and other myeloid cell based on the maximum correlation coefficient to LM22 profiles. When the ainput cell is designated as a T cell, it is inputted into an Xgboost classifier to further classify it as either CD4+ or CD8+ T cell. ImmunIC’s entire process is integrated as a single command line with a single input of gene-cell count matrix. ImmunIC demonstrates a rapid cell annotation capacity while requiring minimal computational power. It took only 4 min 56 s to annotate 5,300 cells (CD4-4), and 12 min 3 s to annotate 10,047 cells (BC-3) using a single thread on an Intel Xeon CPU E5-2620.

Xgboost classifier of ImmunIC

One thousand cells were randomly selected from each of 10 different CD4+ T cell datasets (from CD4-1 to CD4-10 in Table 1) and 5 different CD8+ T cell datasets (from CD8-1 to CD8-5 in Table 1). The combined count matrix of 15,000 cells was log-normalized and used as an input for an Xgboost classifier. The Xgboost python module, xgbclassifier.fit, was used to fit a gradient boosting classifier with max_depth = 3, eta = 0.01, lambda = 0, gamma = 0.1, alpha = 0.5, nthread = 16, subsample = 0.5, and importance_type = gain. The trained model was then saved using pickle.dump. Important features were obtained using the module, xgbclassifier.feature_importances_. A total of 334 genes with a feature importance score greater than 10^–7 were used for the classification step. Lastly, the module predict was used to return the classification outcome of either CD4+ or CD8+ T cell.

Five-fold cross validation test

Five-fold cross-validation of the Xgboost classifier was conducted by dividing CD4+ and CD8+ T cells into training and test sets. A total of 1000 cells were randomly sampled from each of ten CD4+ T cell datasets (from CD4-1 to CD4-10) and these 10,000 cells were combined with 5,000 cells randomly picked from five CD8+ T cell datasets (from CD8-1 to CD8-5). The model was then trained with these 15,000 cells and tested by cells that were not included in the training set. This process was repeated five times such that each training data set was randomly sampled while the remaining cells were used as the test data. Two additional datasets, CD4-11 and CD8-6, were not included in any of our training procedures but tested by five independently trained models.

Other immune cell classification algorithms

Using the R library garnett⁸, each cell’s size factor was estimated with the function, estimateSizeFactors. Cells were then assigned by cluster_ext_type from the function, classify_cells, using Garnett’s pre-trained classifier for PBMC, hsPBMC_20191017.RDS (downloaded from https://cole-trapnell-lab.github.io/garnett/classifiers/).

The python package, Cell_BLAST¹⁰, was used along with the reference panel, Zheng.h5 (downloaded from https://cblast.gao-lab.org/download). Each study’s raw gene count matrix was loaded by cb.data.ExprDataSet.read_table and cells were annotated by cell_ontology_class.

The R library, cellassign⁹ was used with an input of cell-by-gene matrix of raw counts. Each cell’s size factor was estimated by sizeFactors in the R library, SingleCellExperiment. CellAssign’s marker gene set, example_TME_markers, was used to assign cell types using the function, cellassign.

Signaling pathway analysis

A total of 6,001 cells were labeled as macrophages by ImmunIC from 276,605 PBMCs of 48 individuals at COVID-19 severe progression state while a total of 615 macrophages were identified from 126,799 PBMCs of 20 uninfected individuals. With the FindAllMarkers module of Seurat 4.0⁷, a total of 82 genes were obtained as macrophages’ upregulated DEGs in the severe progression group, compared to the control (log-fold-change > 1 and p_adj < 0.001). These genes were used as input for Ingenuity Pathway Analysis (IPA, Qiagen). Identified differential pathways with greater than 5% coverage of DEGs were listed in Supplementary Table S4.

A total of 74,645 cells were annotated as monocytes by ImmunIC from the 48 severe cases and a total of 24,152 monocytes were identified from the 20 healthy individuals. These two groups’ monocytes were compared using the FindAllMarkers module of Seurat 4.0⁷ and a total of 26 genes were found to be upregulated in the severe progression group compared to the control (log-fold-change > 1 and p_adj < 0.001). Monocytes’ differential pathways were then obtained by IPA, as listed in Supplementary Table S5.

Data availability

All single cell RNA sequencing datasets analyzed in the current study were downloaded from public repositories. Accession numbers along with source publications are provided in Tables 1 and 2.

Code availability

ImmunIC is available at https://github.com/hayounlee-lab/immunic.

References

Sade-Feldman, M. et al. Defining T cell states associated with response to checkpoint immunotherapy in melanoma. Cell 175, 998–1013. https://doi.org/10.1016/j.cell.2018.10.038 (2018).
Article CAS PubMed PubMed Central Google Scholar
Fernandez, D. M. et al. Single-cell immune landscape of human atherosclerotic plaques. Nat. Med. 25, 1576–1588. https://doi.org/10.1038/s41591-019-0590-4 (2019).
Article CAS PubMed PubMed Central Google Scholar
Deng, W. et al. Single-cell RNA-sequencing analyses identify heterogeneity of CD8(+) T cell subpopulations and novel therapy targets in melanoma. Mol. Ther. Oncolyt. 20, 105–118. https://doi.org/10.1016/j.omto.2020.12.003 (2021).
Article CAS Google Scholar
Kurten, C. H. L. et al. Investigating immune and non-immune cell interactions in head and neck tumors by single-cell RNA sequencing. Nat. Commun. 12, 7338. https://doi.org/10.1038/s41467-021-27619-4 (2021).
Article ADS CAS PubMed PubMed Central Google Scholar
Ren, X. et al. COVID-19 immune features revealed by a large-scale single-cell transcriptome atlas. Cell 184, 5838. https://doi.org/10.1016/j.cell.2021.10.023 (2021).
Article CAS PubMed PubMed Central Google Scholar
Melms, J. C. et al. A molecular single-cell lung atlas of lethal COVID-19. Nature 595, 114–119. https://doi.org/10.1038/s41586-021-03569-1 (2021).
Article ADS CAS PubMed PubMed Central Google Scholar
Hao, Y. et al. Integrated analysis of multimodal single-cell data. Cell 184, 3573–3587. https://doi.org/10.1016/j.cell.2021.04.048 (2021).
Article CAS PubMed PubMed Central Google Scholar
Pliner, H. A., Shendure, J. & Trapnell, C. Supervised classification enables rapid annotation of cell atlases. Nat. Methods 16, 983–986. https://doi.org/10.1038/s41592-019-0535-3 (2019).
Article CAS PubMed PubMed Central Google Scholar
Zhang, A. W. et al. Probabilistic cell-type assignment of single-cell RNA-seq for tumor microenvironment profiling. Nat. Methods 16, 1007–1015. https://doi.org/10.1038/s41592-019-0529-1 (2019).
Article CAS PubMed PubMed Central Google Scholar
Cao, Z. J., Wei, L., Lu, S., Yang, D. C. & Gao, G. Searching large-scale scRNA-seq databases via unbiased cell embedding with Cell BLAST. Nat. Commun. 11, 3458. https://doi.org/10.1038/s41467-020-17281-7 (2020).
Article ADS CAS PubMed PubMed Central Google Scholar
Dominguez Conde, C. et al. Cross-tissue immune cell analysis reveals tissue-specific features in humans. Science 376, eabl5197. https://doi.org/10.1126/science.abl5197 (2022).
Article CAS PubMed PubMed Central Google Scholar
Andreatta, M., Berenstein, A. J. & Carmona, S. J. scGate: marker-based purification of cell types from heterogeneous single-cell RNA-seq datasets. Bioinformatics 38, 2642–2644. https://doi.org/10.1093/bioinformatics/btac141 (2022).
Article CAS PubMed PubMed Central Google Scholar
Ianevski, A., Giri, A. K. & Aittokallio, T. Fully-automated and ultra-fast cell-type identification using specific marker combinations from single-cell transcriptomic data. Nat. Commun. 13, 1246. https://doi.org/10.1038/s41467-022-28803-w (2022).
Article ADS CAS PubMed PubMed Central Google Scholar
Newman, A. M. et al. Robust enumeration of cell subsets from tissue expression profiles. Nat. Methods 12, 453–457. https://doi.org/10.1038/nmeth.3337 (2015).
Article CAS PubMed PubMed Central Google Scholar
Aran, D. et al. Reference-based analysis of lung single-cell sequencing reveals a transitional profibrotic macrophage. Nat. Immunol. 20, 163–172. https://doi.org/10.1038/s41590-018-0276-y (2019).
Article CAS PubMed PubMed Central Google Scholar
Chen, T. Q. & Guestrin, C. XGBoost: A Scalable Tree Boosting System. Kdd'16: Proceedings of the 22nd Acm Sigkdd International Conference on Knowledge Discovery and Data Mining, 785–794. https://doi.org/10.1145/2939672.2939785 (2016).
Zheng, G. X. et al. Massively parallel digital transcriptional profiling of single cells. Nat. Commun. 8, 14049. https://doi.org/10.1038/ncomms14049 (2017).
Article ADS CAS PubMed PubMed Central Google Scholar
Picelli, S. et al. Full-length RNA-seq from single cells using Smart-seq2. Nat. Protoc. 9, 171–181. https://doi.org/10.1038/nprot.2014.006 (2014).
Article CAS PubMed Google Scholar
Jaitin, D. A. et al. Massively parallel single-cell RNA-seq for marker-free decomposition of tissues into cell types. Science 343, 776–779. https://doi.org/10.1126/science.1247651 (2014).
Article ADS CAS PubMed PubMed Central Google Scholar
Gierahn, T. M. et al. Seq-Well: Portable, low-cost RNA sequencing of single cells at high throughput. Nat. Methods 14, 395–398. https://doi.org/10.1038/nmeth.4179 (2017).
Article CAS PubMed PubMed Central Google Scholar
Macosko, E. Z. et al. Highly parallel genome-wide expression profiling of individual cells using nanoliter droplets. Cell 161, 1202–1214. https://doi.org/10.1016/j.cell.2015.05.002 (2015).
Article CAS PubMed PubMed Central Google Scholar
Fan, H. C., Fu, G. K. & Fodor, S. P. Expression profiling. Combinatorial labeling of single cells for gene expression cytometry. Science 347, 1258367. https://doi.org/10.1126/science.1258367 (2015).
Lee, J. S. et al. Immunophenotyping of COVID-19 and influenza highlights the role of type I interferons in development of severe COVID-19. Sci. Immunol. 5, 1. https://doi.org/10.1126/sciimmunol.abd1554 (2020).
Article CAS Google Scholar
King, H. W. et al. Single-cell analysis of human B cell maturation predicts how antibody class switching shapes selection dynamics. Sci. Immunol. 6, 1. https://doi.org/10.1126/sciimmunol.abe6291 (2021).
Article CAS Google Scholar
Lu, Y. et al. Complement Signals determine opposite effects of b cells in chemotherapy-induced immunity. Cell 180, 1081–1097. https://doi.org/10.1016/j.cell.2020.02.015 (2020).
Article CAS PubMed Google Scholar
Ledergor, G. et al. Single cell dissection of plasma cell heterogeneity in symptomatic and asymptomatic myeloma. Nat Med 24, 1867–1876. https://doi.org/10.1038/s41591-018-0269-2 (2018).
Article CAS PubMed Google Scholar
Szabo, P. A. et al. Single-cell transcriptomics of human T cells reveals tissue and activation signatures in health and disease. Nat. Commun. 10, 4706. https://doi.org/10.1038/s41467-019-12464-3 (2019).
Article ADS CAS PubMed PubMed Central Google Scholar
Cano-Gamez, E. et al. Single-cell transcriptomics identifies an effectorness gradient shaping the response of CD4(+) T cells to cytokines. Nat. Commun. 11, 1801. https://doi.org/10.1038/s41467-020-15543-y (2020).
Article ADS CAS PubMed PubMed Central Google Scholar
Brockmann, L. et al. Molecular and functional heterogeneity of IL-10-producing CD4(+) T cells. Nat. Commun. 9, 5457. https://doi.org/10.1038/s41467-018-07581-4 (2018).
Article ADS CAS PubMed PubMed Central Google Scholar
Guo, X. et al. Global characterization of T cells in non-small-cell lung cancer by single-cell sequencing. Nat Med 24, 978–985. https://doi.org/10.1038/s41591-018-0045-3 (2018).
Article CAS PubMed Google Scholar
Rasouli, J. et al. A distinct GM-CSF(+) T helper cell subset requires T-bet to adopt a TH1 phenotype and promote neuroinflammation. Sci. Immunol. https://doi.org/10.1126/sciimmunol.aba9953 (2020).
Article PubMed Google Scholar
Povoleri, G. A. M. et al. Human retinoic acid-regulated CD161(+) regulatory T cells support wound repair in intestinal mucosa. Nat. Immunol. 19, 1403–1414. https://doi.org/10.1038/s41590-018-0230-z (2018).
Article CAS PubMed PubMed Central Google Scholar
Li, N. et al. Memory CD4(+) T cells are generated in the human fetal intestine. Nat. Immunol. 20, 301–312. https://doi.org/10.1038/s41590-018-0294-9 (2019).
Article CAS PubMed PubMed Central Google Scholar
Eberhardt, C. S. et al. Functional HPV-specific PD-1(+) stem-like CD8 T cells in head and neck cancer. Nature 597, 279–284. https://doi.org/10.1038/s41586-021-03862-z (2021).
Article ADS CAS PubMed PubMed Central Google Scholar
Gangaev, A. et al. Identification and characterization of a SARS-CoV-2 specific CD8(+) T cell response with immunodominant features. Nat. Commun. 12, 2593. https://doi.org/10.1038/s41467-021-22811-y (2021).
Article ADS CAS PubMed PubMed Central Google Scholar
Pauken, K. E. et al. Single-cell analyses identify circulating anti-tumor CD8 T cells and markers for their enrichment. J. Exp. Med. 218, 1. https://doi.org/10.1084/jem.20200920 (2021).
Article CAS Google Scholar
Yang, C. et al. Heterogeneity of human bone marrow and blood natural killer cells defined by single-cell transcriptome. Nat. Commun. 10, 3931. https://doi.org/10.1038/s41467-019-11947-7 (2019).
Article ADS CAS PubMed PubMed Central Google Scholar
Smith, S. L. et al. Diversity of peripheral blood human NK cells identified by single-cell RNA sequencing. Blood Adv. 4, 1388–1406. https://doi.org/10.1182/bloodadvances.2019000699 (2020).
Article CAS PubMed PubMed Central Google Scholar
Binnewies, M. et al. Unleashing type-2 dendritic cells to drive protective antitumor CD4(+) T cell immunity. Cell 177, 556–571. https://doi.org/10.1016/j.cell.2019.02.005 (2019).
Article CAS PubMed PubMed Central Google Scholar
Wimmers, F. et al. The single-cell epigenomic and transcriptional landscape of immunity to influenza vaccination. Cell 184, 3915–3935. https://doi.org/10.1016/j.cell.2021.05.039 (2021).
Tang-Huau, T. L. et al. Human in vivo-generated monocyte-derived dendritic cells and macrophages cross-present antigens through a vacuolar pathway. Nat. Commun. 9, 2570. https://doi.org/10.1038/s41467-018-04985-0 (2018).
Article ADS CAS PubMed PubMed Central Google Scholar
Goudot, C. et al. Aryl hydrocarbon receptor controls monocyte differentiation into dendritic cells versus macrophages. Immunity 47, 582–596. https://doi.org/10.1016/j.immuni.2017.08.016 (2017).
Article CAS PubMed Google Scholar
Villani, A. C. et al. Single-cell RNA-seq reveals new types of human blood dendritic cells, monocytes, and progenitors. Science 356, 1. https://doi.org/10.1126/science.aah4573 (2017).
Article CAS Google Scholar
Hie, B., Bryson, B. & Berger, B. Efficient integration of heterogeneous single-cell transcriptomes using Scanorama. Nat. Biotechnol. 37, 685–691. https://doi.org/10.1038/s41587-019-0113-3 (2019).
Article CAS PubMed PubMed Central Google Scholar
Li, X. et al. Deep learning enables accurate clustering with batch effect removal in single-cell RNA-seq analysis. Nat. Commun. 11, 2338. https://doi.org/10.1038/s41467-020-15851-3 (2020).
Article ADS CAS PubMed PubMed Central Google Scholar
Schulte-Schrepping, J. et al. Severe COVID-19 is marked by a dysregulated myeloid cell compartment. Cell 182, 1419–1440. https://doi.org/10.1016/j.cell.2020.08.001 (2020).
Article CAS PubMed PubMed Central Google Scholar
Doring, M. et al. Single-cell analysis reveals divergent responses of human dendritic cells to the MVA vaccine. Sci. Signal 14, 1. https://doi.org/10.1126/scisignal.abd9720 (2021).
Article CAS Google Scholar
Breton, G. et al. Human dendritic cells (DCs) are derived from distinct circulating precursors that are precommitted to become CD1c+ or CD141+ DCs. J. Exp. Med. 213, 2861–2870. https://doi.org/10.1084/jem.20161135 (2016).
Article CAS PubMed PubMed Central Google Scholar
Wohnhaas, C. T. et al. DMSO cryopreservation is the method of choice to preserve cells for droplet-based single-cell RNA sequencing. Sci. Rep. 9, 10699. https://doi.org/10.1038/s41598-019-46932-z (2019).
Article ADS CAS PubMed PubMed Central Google Scholar
Xie, X. et al. Single-cell transcriptome profiling reveals neutrophil heterogeneity in homeostasis and infection. Nat. Immunol. 21, 1119–1133. https://doi.org/10.1038/s41590-020-0736-z (2020).
Article CAS PubMed PubMed Central Google Scholar
Wang, Y. et al. Single-cell transcriptome analysis reveals differential nutrient absorption functions in human intestine. J. Exp. Med. 217, 1. https://doi.org/10.1084/jem.20191130 (2020).
Article CAS Google Scholar
Wu, H. et al. Comparative analysis and refinement of human PSC-derived kidney organoid differentiation with single-cell transcriptomics. Cell Stem Cell 23, 869–881. https://doi.org/10.1016/j.stem.2018.10.010 (2018).
Article CAS PubMed PubMed Central Google Scholar
Jansky, S. et al. Single-cell transcriptomic analyses provide insights into the developmental origins of neuroblastoma. Nat. Genet. 53, 683–693. https://doi.org/10.1038/s41588-021-00806-1 (2021).
Article CAS PubMed Google Scholar
https://www.10xgenomics.com/resources/datasets/7-5-k-sorted-cells-from-human-invasive-ductal-carcinoma-3-v-3-1-3-1-standard-6-0-0.
Fang, Z. et al. Single-cell heterogeneity analysis and CRISPR screen identify key beta-cell-specific disease genes. Cell Rep 26, 3132–3144. https://doi.org/10.1016/j.celrep.2019.02.043 (2019).
Article CAS PubMed PubMed Central Google Scholar
https://satijalab.org/seurat/articles/pbmc3k_tutorial.html.
Tan, L. et al. Lymphopenia predicts disease severity of COVID-19: a descriptive and predictive study. Signal Transduct. Target Ther. 5, 33. https://doi.org/10.1038/s41392-020-0148-4 (2020).
Article CAS PubMed PubMed Central Google Scholar
Zhou, R. et al. Acute SARS-CoV-2 infection impairs dendritic cell and T cell responses. Immunity 53, 864–877. https://doi.org/10.1016/j.immuni.2020.07.0264 (2020).
Article CAS PubMed PubMed Central Google Scholar
Verhoeckx, K. et al. XVII, 338 p. 357 illus., 335 illus. in color (Springer International Publishing : Imprint: Springer,, Cham, 2015).
Shibabaw, T. Inflammatory cytokine: IL-17A signaling pathway in patients present with COVID-19 and current treatment strategy. J. Inflamm. Res. 13, 673–680. https://doi.org/10.2147/JIR.S278335 (2020).
Article CAS PubMed PubMed Central Google Scholar
Merad, M. & Martin, J. C. Pathological inflammation in patients with COVID-19: a key role for monocytes and macrophages. Nat. Rev. Immunol. 20, 355–362. https://doi.org/10.1038/s41577-020-0331-4 (2020).
Article CAS PubMed PubMed Central Google Scholar
Bass, A., Liu, Y. & Dakshanamurthy, S. Single-cell and bulk RNASeq profiling of COVID-19 patients reveal immune and inflammatory mechanisms of infection-induced organ damage. Viruses 13. https://doi.org/10.3390/v13122418 (2021).
Vanderbeke, L. et al. Monocyte-driven atypical cytokine storm and aberrant neutrophil activation as key mediators of COVID-19 disease severity. Nat. Commun. 12, 4117. https://doi.org/10.1038/s41467-021-24360-w (2021).
Article ADS CAS PubMed PubMed Central Google Scholar
Goldfarbmuren, K. C. et al. Dissecting the cellular specificity of smoking effects and reconstructing lineages in the human airway epithelium. Nat. Commun. 11, 2485. https://doi.org/10.1038/s41467-020-16239-z (2020).
Article ADS CAS PubMed PubMed Central Google Scholar
Henry, G. H. et al. A cellular anatomy of the normal adult human prostate and prostatic urethra. Cell Rep. 25, 3530–3542. https://doi.org/10.1016/j.celrep.2018.11.086 (2018).
Article CAS PubMed PubMed Central Google Scholar
Jakel, S. et al. Altered human oligodendrocyte heterogeneity in multiple sclerosis. Nature 566, 543–547. https://doi.org/10.1038/s41586-019-0903-2 (2019).
Article ADS CAS PubMed PubMed Central Google Scholar
Xin, Y. et al. Pseudotime ordering of single human beta-cells reveals states of insulin production and unfolded protein response. Diabetes 67, 1783–1794. https://doi.org/10.2337/db18-0365 (2018).
Article CAS PubMed Google Scholar
Dominguez Gutierrez, G. et al. Gene signature of the human pancreatic epsilon cell. Endocrinology 159, 4023–4032. https://doi.org/10.1210/en.2018-00833 (2018).
Sui, L. et al. Reduced replication fork speed promotes pancreatic endocrine differentiation and controls graft size. JCI Insight 6, 1. https://doi.org/10.1172/jci.insight.141553 (2021).
Article Google Scholar
Camunas-Soler, J. et al. Patch-seq links single-cell transcriptomes to human islet dysfunction in diabetes. Cell Metab 31, 1017–1031. https://doi.org/10.1016/j.cmet.2020.04.005 (2020).
Article CAS PubMed PubMed Central Google Scholar
Enzo, E. et al. Single-keratinocyte transcriptomic analyses identify different clonal types and proliferative potential mediated by FOXM1 in human epidermal stem cells. Nat. Commun. 12, 2505. https://doi.org/10.1038/s41467-021-22779-9 (2021).
Article ADS CAS PubMed PubMed Central Google Scholar
Menon, M. et al. Single-cell transcriptomic atlas of the human retina identifies cell types associated with age-related macular degeneration. Nat. Commun. 10, 4902. https://doi.org/10.1038/s41467-019-12780-8 (2019).
Article ADS CAS PubMed PubMed Central Google Scholar
Voigt, A. P. et al. Molecular characterization of foveal versus peripheral human retina by single-cell RNA sequencing. Exp. Eye Res. 184, 234–242. https://doi.org/10.1016/j.exer.2019.05.001 (2019).
Article CAS PubMed PubMed Central Google Scholar
Yan, W. et al. Cell atlas of the human fovea and peripheral retina. Sci. Rep. 10, 9802. https://doi.org/10.1038/s41598-020-66092-9 (2020).
Article ADS CAS PubMed PubMed Central Google Scholar
Wouters, J. et al. Robust gene expression programs underlie recurrent cell states and phenotype switching in melanoma. Nat. Cell Biol. 22, 986–998. https://doi.org/10.1038/s41556-020-0547-3 (2020).
Article CAS PubMed Google Scholar
Grubman, A. et al. A single-cell atlas of entorhinal cortex from individuals with Alzheimer’s disease reveals cell-type-specific gene expression regulation. Nat. Neurosci. 22, 2087–2097. https://doi.org/10.1038/s41593-019-0539-4 (2019).
Article CAS PubMed Google Scholar
Fu, G. B. et al. Expansion and differentiation of human hepatocyte-derived liver progenitor-like cells and their use for the study of hepatotropic pathogens. Cell Res. 29, 8–22. https://doi.org/10.1038/s41422-018-0103-x (2019).
Article CAS PubMed Google Scholar
Fischer, A. et al. Post-surgical adhesions are triggered by calcium-dependent membrane bridges between mesothelial surfaces. Nat. Commun. 11, 3068. https://doi.org/10.1038/s41467-020-16893-3 (2020).
Article ADS CAS PubMed PubMed Central Google Scholar
Nguyen, Q. H. et al. Profiling human breast epithelial cells using single cell RNA sequencing identifies cell diversity. Nat. Commun. 9, 2028. https://doi.org/10.1038/s41467-018-04334-1 (2018).
Article ADS CAS PubMed PubMed Central Google Scholar
Rao, M. et al. Comparative single-cell RNA sequencing (scRNA-seq) reveals liver metastasis-specific targets in a patient with small intestinal neuroendocrine cancer. Cold Spring. Harb. Mol. Case Stud. 6, 1. https://doi.org/10.1101/mcs.a004978 (2020).
Article CAS Google Scholar
Zhang, Y. et al. Transcriptome landscape of human folliculogenesis reveals oocyte and granulosa cell interactions. Mol. Cell 72, 1021–1034. https://doi.org/10.1016/j.molcel.2018.10.029 (2018).
Article CAS PubMed Google Scholar
Liu, X. et al. Reprogramming roadmap reveals route to human induced trophoblast stem cells. Nature 586, 101–107. https://doi.org/10.1038/s41586-020-2734-6 (2020).
Article ADS CAS PubMed Google Scholar
Tirosh, I. et al. Single-cell RNA-seq supports a developmental hierarchy in human oligodendroglioma. Nature 539, 309–313. https://doi.org/10.1038/nature20123 (2016).
Article ADS CAS PubMed PubMed Central Google Scholar
Hovestadt, V. et al. Resolving medulloblastoma cellular architecture by single-cell genomics. Nature 572, 74–79. https://doi.org/10.1038/s41586-019-1434-6 (2019).
Article ADS CAS PubMed PubMed Central Google Scholar
https://www.10xgenomics.com/resources/datasets/2-k-sorted-cells-from-human-glioblastoma-multiforme-3-v-3-1-3-1-standard-6-0-0.
Agarwal, D. et al. A single-cell atlas of the human substantia nigra reveals cell-specific pathways associated with neurological disorders. Nat. Commun. 11, 4183. https://doi.org/10.1038/s41467-020-17876-0 (2020).
Article ADS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

We thank Youpeng Zou and Sayan Nanda for their helpful discussions. This study was supported by National Institutes of Health, National Institute of Allergy and Infectious Diseases (R01-AI095066).

Author information

Authors and Affiliations

Department of Molecular Microbiology and Immunology, Keck School of Medicine, University of Southern California, Los Angeles, USA
Sung Yong Park, Sonia Ter-Saakyan, Gina Faraci & Ha Youn Lee

Authors

Sung Yong Park
View author publications
You can also search for this author in PubMed Google Scholar
Sonia Ter-Saakyan
View author publications
You can also search for this author in PubMed Google Scholar
Gina Faraci
View author publications
You can also search for this author in PubMed Google Scholar
Ha Youn Lee
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

S.P. and H.L. conceived this study and performed the analysis. S.T. and G.F. contributed to the data curation and analysis. S.P., S.T., G.F., and H.L. wrote the paper.

Corresponding author

Correspondence to Ha Youn Lee.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Park, S.Y., Ter-Saakyan, S., Faraci, G. et al. Immune cell identifier and classifier (ImmunIC) for single cell transcriptomic readouts. Sci Rep 13, 12093 (2023). https://doi.org/10.1038/s41598-023-39282-4

Download citation

Received: 29 November 2022
Accepted: 22 July 2023
Published: 26 July 2023
DOI: https://doi.org/10.1038/s41598-023-39282-4
Springer Nature Limited

Immune cell identifier and classifier (ImmunIC) for single cell transcriptomic readouts

Abstract

Similar content being viewed by others

Systematic comparison of high-throughput single-cell RNA-seq methods for immune cell profiling

Combined Measurement of RNA and Protein Expression on a Single-Cell Level

Protocol for Classification Single-Cell PBMC Types from Pathological Samples Using Supervised Machine Learning

Introduction