Abstract
The shotgun proteomic analysis is currently the most promising single-cell protein sequencing technology, however its identification level of ~1000 proteins per cell is still insufficient for practical applications. Here, we develop a pick-up single-cell proteomic analysis (PiSPA) workflow to achieve a deep identification capable of quantifying up to 3000 protein groups in a mammalian cell using the label-free quantitative method. The PiSPA workflow is specially established for single-cell samples mainly based on a nanoliter-scale microfluidic liquid handling robot, capable of achieving single-cell capture, pretreatment and injection under the pick-up operation strategy. Using this customized workflow with remarkable improvement in protein identification, 2449–3500, 2278–3257 and 1621–2904 protein groups are quantified in single A549 cells (n = 37), HeLa cells (n = 44) and U2OS cells (n = 27) under the DIA (MBR) mode, respectively. Benefiting from the flexible cell picking-up ability, we study HeLa cell migration at the single cell proteome level, demonstrating the potential in practical biological research from single-cell insight.
Similar content being viewed by others
Introduction
Nowadays, single-cell genomic1 and transcriptomic technologies2 have been well developed. However, the development of the single-cell proteomic technology faces major technical challenges, because the protein content in a single cell is extremely low and proteins are difficult to be amplified as nucleic acids. So far, a variety of protein analysis techniques at the single-cell level have been developed, including those based on specific antibody labeling such as flow cytometry3, fluorescence imaging4, Western blotting5 and mass cytometry6, as well as those based on non-labeling proteome analysis technique with high-resolution mass spectrometry (MS). At present, among the proteomic analysis techniques based on MS, the shotgun technique using the bottom-up strategy usually demonstrates the maximum depth and breadth of protein identification7, with which usually 5000–7000 proteins8,9,10 can be identified in a single measurement with a liquid chromatography-mass spectrometry (LC-MS) system for a sample containing a large number of cells with a protein amount in the range of 200–1000 ng. Therefore, in recent years, a variety of single-cell proteome analysis approaches based on the shotgun technique have been developed11,12,13,14,15,16,17,18,19,20,21,22,23,24.
A typical single-cell shotgun proteomic analysis process includes the sorting of target cells from a large number of sample cells, the pretreatment of single-cell samples, the LC injection and separation, and ESI-MS/MS detection of the digested peptides. The sample pretreatment process includes multi-steps of cell lysis, protein reduction, alkylation, enzymatic digestion and termination of the digestion. For such single-cell samples with extremely small protein amounts (ca. 100–500 pg per cell25), if the above-mentioned series of sample pretreatment operations are carried out with a microliter-scale reactor such as a centrifuge tube11 or a multi-well plate26, obvious sample loss will occur during the sample pretreatment and transferring process, which will significantly reduce the number of protein identification and thus severely limit the identification depth of single-cell proteomics.
To break through the barriers of the identification depth of single-cell proteomics, one strategy to address the above challenge is to perform the sample pretreatment in nanoliter-scale in-situ microreactors, such as the oil-air-droplet (OAD) chip13, the integrated proteome analysis device (iPAD)14 and the nanodroplet sample preparation (nanoPOTS) platform12,16,27,28,29,30. Compared with the conventional microliter-scale reactors used in routine laboratories, these microreactors have a volume reduction of hundreds of times, which can avoid the excessive dilution of the trace amounts of single-cell samples, effectively improve the reaction efficiency and reduce the sample loss caused by the adsorption of the sample components on the reactor surface during the pretreatment process. Based on these methods, up to 300–1100 proteins were able to be identified from single cells with label-free approach12,14,27,29. With the tandem mass tag (TMT) approach, the maximum up to 1500 proteins were quantified from single acute myeloid leukemia (AML) cells31. However, most of the reported nanoliter-scale microreactors required to use microfabricated microchips or devices as the microreactors, and needed special sample injection devices and additional operations to complete the injections of the nanoliter-volume samples. Currently, an important emerging trend is to develop single-cell proteomic platforms capable of integrating whole-process operations for promoting the practicality and popularization of single-cell proteomic analysis. Some integrated platforms were developed by using the combination of commercial instruments and self-developed systems (such as the autoPOTS platform32, T-SCP platform19, SCeptre33 and One-Pot34 workflow) or an integrated microfluidic chip17,18,35,36, to complete the whole process of single-cell proteomic analysis, with 300–2000 proteins identified from single cells.
In spite of the significant progresses obtained in single-cell proteomics, how to further improve the protein identification depth and simultaneously simplify the device and operation to achieve practical whole-process proteomic analysis at the single-cell level still presents great challenges.
Here, we develop a total workflow solution for single-cell proteomic analysis (Fig. 1) capable of achieving deep identification quantifying up to 3000 protein groups in a single mammalian cell. More automatic and convenient operation is performed by using a probe-based microfluidic liquid handling robot coupled with a commercial liquid chromatograph (LC) and a trapped ion mobility spectrometry (TIMS) QTOF mass spectrometer. The automated pick-up operation mode based on capillary probes is adopted throughout the pick-up single-cell proteomic analysis (PiSPA) workflow, including the sorting of single cells and multi-step single-cell pretreatment to digest cellular proteins into peptides in nanoliter reactors, as well as the injection of the peptide samples to the LC column. In addition, we utilize a single-cell customized strategy that fully considered the effects of the unique properties of single cells vs. bulk cells on sample pretreatment, separation, and detection to establish the series of measures throughout the PiSPA workflow. This strategy enables a much deeper depth of protein identification in single-cell analysis than those previously reported in the literatures. We apply this platform in the single-cell proteomic analysis of three kinds of mammalian cells, HeLa, A549 and U2OS cells, as well as the single-cell proteomic study of migrating HeLa cells.
Results
Establishment of the PiSPA workflow
We first integrated an improved microfluidic liquid handling robot37 previously developed by the authors’ group based on the sequential operation droplet array (SODA) technique38,39 into the PiSPA platform to achieve nanoliter-scale single-cell sorting and multistep pretreatment under the probe pick-up mode. The PiSPA platform performed the cell sorting with three-step operations of single-cell identification, picking-up and dispensing. Under the pick-up mode, it achieved the automated bright-field or fluorescence imaging and identification of the target cells in the cell suspension samples based on their bright-field apparent property or labeled fluorescence signals, picking-up of the single target cell by a tapered capillary probe connected with a high-precision syringe pump, and dispensing the target cell into an insert tube. We evaluated the cell sorting performance of the PiSPA platform by using it to pick-up 20 single HeLa cells separately and dispensing them individually into different insert tubes. Under the optimized conditions (i.e. capillary tip diameter of 35 μm, aspirating flow rate of 500 nL/s, aspirating volume of 25 nL, and injection volume of 400 nL), at least 19 single HeLa cells were successfully dispensed into the insert tubes, corresponding to a single-cell assignment rate of 95% (n = 20). To ensure the accuracy of cellular protein analysis, we used freshly isolated or cultured cell samples. The entire procedure of identification, picking-up and dispensing for each target cell usually took 20 s. Therefore, sorting of dozens of single cells could be achieved in 10–20 min. We observed that after the picked HeLa cells were dispensed into PBS droplets, the morphology of these cells was intact and they could be attached to the bottom surface of the vessel. A fluorescent dye for detecting cellular activity were also used to stain the cells, showing that they could retain their viability. Typical images of a picked HeLa cell in a droplet before and after staining by the fluorescent dye for cell viability assay are shown in Supplementary Fig. 1.
After the probe captured the target cell, we controlled the platform to dispense the cell into a commercial insert tube instead of using a microchip as the microreactor as previously reported13. The insert tubes can be seamless compatible with the autosampler of commercial LC-MS systems, without the need for microfabricated microchips and troublesome sample injection operations13,26,29. However, even for the smallest insert tubes available on the market, their volumes are still hundreds of microliters, which are far from a nanoliter-scale microreactor required for single-cell sample pretreatment. We solved this difficulty using the conical bottom tip (2 mm inner diameter, 0.4 mm height) of the insert tubes to load the nanoliter reaction solution. In addition, the evident evaporation effect of the nanoliter reaction solutions during the heating reaction process was suppressed by adding 100 μL of water and a sealing cap to the sample vial outside the insert tube to form an internal high-humidity environment. With the precise manipulation ability for nanoliter-scale liquids, the single-cell capture platform was continuously adopted to complete the multi-step nanoliter-scale reagent addition operation to sequentially achieve cell lysis, protein reduction, alkylation, digestion and reaction termination, under the pick-up operation mode. We performed a comparison experiment to conduct sample pretreatment of single HeLa cells with reaction volumes of microliters (5 μL) and nanoliters (~400 nL). The protein identification number of the nanoliter reactors was double that of the microliter reactors (Supplementary Fig. 2). The advantage of nanoliter-scale reactors in single cell proteomic analysis had also been demonstrated in many previous literatures14,25,30.
In the protein digestion experiments, we studied the effect of the enzyme/protein ratio on the protein identification number using single HeLa cells as samples. In routine proteomic experiments, enzyme/protein ratios between 1:100 and 1:10 are usually adopted as the optimized digestion conditions. Most of the current single-cell proteomic workflows also followed this enzyme/protein ratio range13,16,18,19,30,31. However, our experimental results (Fig. 2a) showed that the single-cell protein identification number showed an increasing trend with the increase of the enzyme/protein ratio. This increasing trend showed a rapid change in the enzyme/protein ratio range of 0-15:1, and significantly slowed down after 15:1. When the enzyme/protein ratio was between 40:1 and 60:1, the protein identification number reached the highest level (~1100 protein groups) and the fluctuations (i.e. CV) in the protein numbers identified in different single cells were also at the lowest level. Comprehensively considering the protein identification number and the working stability of the platform, we finally chose an enzyme/protein ratio of 50:1 in the subsequent single-cell proteomic analysis experiments. Such an enzyme/protein ratio largely exceeding those used in most conventional and single-cell proteomic experiments, was favorable to maintain a relatively high enzyme concentration40,41 (e.g. ~12 μg/mL trypsin in the present platform) in the nanoliter-scale reaction solutions to increase digestion efficiency of proteins. Another possible benefit of using larger amounts of enzymes may be related with the scale characteristics of single-cell analysis, that is, the largely excessive enzyme molecules in the reaction solutions could serve as an adsorption substitute, which substantially reduced the adsorption loss of trace cellular proteins and digested peptides on the surface of the microreactor and transmission conduit during the sample pretreatment and injection process.
To test the possible negative effects caused by the large enzyme/protein ratios, we first performed proteomic analysis for a control blank sample (containing all pretreatment reagents except for the single-cell sample), and no proteins from the two heterologous enzymes (trypsin and Lys-C) were identified and the protein identification number for the control samples was also at a very low level, indicating that the enzymes and their self-cleaved peptides did not produce misidentification of proteins. We also examined the peak positions and intensity signals of the self-cleaved peptides of trypsin in a proteomic analysis chromatogram of a typical single HeLa cell (Supplementary Fig. 3). It can be observed that the signals of the self-cleaved peptides of these enzymes appeared at only a few specific retention time and m/z positions and had no evident effect on the identification of the sample peptides.
The choice of gradient procedure in LC separations has a critical impact on chromatographic separation performance and proteome identification results. We evaluated the influence of different gradient times on single-cell proteomics analysis using gradients of 14, 18, 21, 28, 48, and 68 min with a sample of the 200 pg of standard HeLa cell digestion (Fig. 2b). The identification number reached the highest level of ca. 600 when the 18–28 min gradients were used, and then slowly decreased as the gradient time increased. Differing from the gradients of tens of minutes for routine proteomic analysis, the results showed that relatively short gradients are more beneficial for obtaining high protein identification numbers for single-cell samples. This is because short gradients are favorable for increasing the signal peak intensities of peptides, thereby improving the identification of some low-abundance peptide components in the sample (Supplementary Fig. 4). Another benefit of using short gradients is that it can increase the throughput of proteomic analysis. However, an excessively short gradient (such as 14 min gradient) will worsen the separation performance and result in a decrease in the number of protein identification. Comprehensively considering the analysis throughput and identification stability, we finally chose the 21-min LC gradient time in the subsequent experiments.
Before performing single-cell proteomic analysis of mammalian cells, we first tested the repeatability of the LC-MS system under the optimized conditions by running 10 consecutive analysis of the 200 pg of commercial HeLa cell digest serving as a quality control (QC) sample in data independent acquisition (DIA) and data dependent acquisition (DDA) modes. DDA and DIA are two widely-used MS data acquisition modes in proteomics analysis. DDA mode chooses a few most intense precursor ions based on the MS1 spectrum for acquisition of MS2 spectra and identification of peptides. In contrast, DIA mode allows the fragmentation of all precursor ions within a certain range of m/z and retention time to acquire complete record of theoretically all peptides in a sample42,43. On average, 1469 and 618 protein groups (Fig. 2c) with number variation coefficients of 4.7% and 2.3% were stably quantified under the DIA and DDA mode, respectively. The pairwise correlation analysis (Python package Pandas, version 1.4.4) between the protein quantification data of the 10 QC samples showed that all of the Pearson correlation coefficients were over 0.99 (DIA mode) and 0.85 (DDA mode) (Fig. 2d), demonstrating the good stability (especially in DIA mode) of the system in the proteomic analysis at the single-cell level.
Single-cell proteomic analysis of mammalian cells
Using the present workflow and platform, we performed single-cell proteomic analysis of three types of tumor cell lines, A549, HeLa, and U2OS cells, using the DIA and DDA modes, with the optimal enzyme/protein ratio of 50:1 and gradient time of 21 min. Under the DIA mode, an average of 2467 (1804–3349), 2421 (1778–3049) and 1705 (1074–2487) protein groups were quantified in single A549 cells (n = 37), HeLa cells (n = 44) and U2OS cells (n = 27), respectively (Fig. 3a). Using the match between runs (MBR) algorithm, an average of 3008 (2449–3500), 2926 (2278–3257) and 2259 (1621–2904) protein groups were quantified in the same samples of single A549, HeLa and U2OS cells, respectively. Under the DDA mode at the same analysis conditions, an average of 1328 (712–2129), 1290 (664–2198) and 1005 (536–1519) protein groups were quantified in single A549 cells (n = 56), HeLa cells (n = 68), and U2OS cells (n = 24) (Fig. 3a). In the DIA (MBR) mode, 2869, 2772, and 1889 protein groups were reproducibly quantified in 80% (i.e. recurrence percentage) of the single A549, HeLa, and U2OS cells, respectively, while only 638, 722, and 1103 protein groups were quantified with less than 20% recurrence percentage (Fig. 3b). These results indicated that the majority of the quantitative identification results of single cells by the PiSPA platform had high levels of reproducibility. In the above single-cell proteomic experiments, a total of 143 and 172 single cells were captured for analysis in the DIA and DDA modes, respectively, and we finally obtained 108 and 148 valid single-cell data, corresponding to average single-cell capture/analysis efficiencies of 76% and 86%.
We implemented a uniform manifold approximation and projection (UMAP) dimensionality reduction on the data of single A549, HeLa and U2OS cells, three types of single-cell samples could be automatically and clearly clustered (Figs. 3d, e), and all of the single cell samples were correctly clustered.
Under the DDA mode, the label-free single-cell protein identification numbers obtained in this work (Fig. 3a) are higher than most of the literature reported results for the same type of cells (e.g. HeLa cells)12,14,27,28,29,32, which shows the advantage of the present workflow and platform in the identification depth of single-cell proteome analysis. Compared with the single-cell proteomic data set obtained under the DIA mode, the single-cell protein identification numbers based on the DDA mode are relatively low. From the unions of the data sets of 37 single A549 cells, 44 single HeLa cells, and 27 single U2OS cells, total numbers of 5093, 5048 and 4286 protein groups could be cumulatively quantified under the DIA mode, respectively (Supplementary Fig. 5). However, only 3286, 3319 and 2357 protein groups were cumulatively quantified from the unions of the data sets of 56 single A549 cells, 68 single HeLa cells and 24 single U2OS cells under the DDA mode, respectively (Supplementary Fig. 5). Under the DIA and DDA modes, the ranges of protein abundance spanned nearly 5 and 4 orders of magnitude, respectively. Some important but low abundant proteins could only be quantified under the DIA mode (Fig. 3c), such as mismatch repair endonuclease PMS244 and E3 ubiquitin-protein ligase NEDD445.
Evaluation of quantitative accuracy and precision of the LC-MS system
To assess the quantitative accuracy and precision of the present LC/MS system, we compared the quantification results of 0.2 ng, 0.4 ng, 0.6 ng, 0.8 ng, and 1 ng HeLa digests (n = 6). Using the 0.2 ng HeLa digest as the reference to calculate the protein abundance fold change (FC) values obtained for each quantified protein, a proportionally linear increase in the protein fold change with the increase of the protein amount of the HeLa digests could be observed with R2 of 0.9995 (Fig. 4a). As the protein amount increased, the quantitative precision improved from a coefficient of variation (CV) of 25.6% with the 200 pg HeLa digest to 14.9% with the1 ng HeLa digest (Fig. 4b). These results demonstrate the favorable quantitative accuracy and precision of the present LC-MS system in the level of single cell and small number of cells.
In addition, we further tested the quantitative accuracy of the system using single cell samples spiked with different ratios of yeast and Escherichia coli (E. coli) peptides. The QC samples at the single-cell level were prepared by mixing eight digested single HeLa cell samples and then aliquoting it into eight portions. The single-cell QC samples were divided into two groups (S1 and S2) with four samples in each group, and were spiked with yeast peptides and E. coli peptides with different ratios to each sample. The spiked QC samples were analyzed with the LC-MS system, and the results showed that the median fold change values of the proteins from the three species in the samples closely matched the theoretical values, with a relative difference, i.e., (median FC - theoretical value)/theoretical value, ranging from –2% to +15% (Fig. 4c). This further demonstrated the high level of quantitative accuracy of the present system. Furthermore, the CV values of the protein quantification results from the three species in both QC sample groups were below 20% (Fig. 4d), also demonstrating the good quantitative precision in the single-cell level.
Benefiting from the ability of the present platform to flexibly and accurately control the cell number in samples, we tested samples containing 0, 1, 2, 3, 5, and 10 HeLa cells with the same procedures and conditions as in single-cell proteomic analysis, to evaluate the background level in the blank control sample as well as examine the effect of cell number on the protein identification number. Under the DIA (MBR) mode, on average, 0, 3000, 3121, 3411, 3718, and 3864 protein groups were quantified in 0, 1, 2, 3, 5 and 10 HeLa cells (n = 4), respectively (Fig. 4e). The number of protein identifications did not increase linearly with the cell number (Fig. 4e). However, the protein abundance, as reflected by the median fold changes relative to the single-cell sample, showed a linear relationship with the cell number (Fig. 4f), with FC values of 1.64, 2.69, 3.26 and 6.02 and R2 of 0.9848 (Fig. 4f). As the “Proteomic Ruler” method46, we searched the description of all the quantified proteins in the 1-10 cells HeLa data using the keyword “histone”, and 40 histone-related proteins were obtained. Using the abundances of these proteins in single-cell samples as a reference, the quantitative fold changes (medians) of these proteins in 2, 3, 5, and 10 cells (n = 4) was 1.66, 2.30, 3.32, and 6.83, respectively, which was linearly correlated with the number of cells (Supplementary Fig. 6). This further demonstrates the qualified performance of the PiSPA system in terms of cell capture specificity and protein quantification accuracy.
Single-cell proteomic analysis of migrating cells
Cell migration is a common biological process, which has important significance for the study of tumor migration, wound healing, embryonic development, immune response, etc47. At present, the scratch assay is frequently used to test the invasion and metastasis ability of adherent tumor cells48, while so far there is no report on the study of individual tumor cells with different migration behaviors at the deep-coverage proteome level. The PiSPA platform has the ability to observe the behavior of individual migrated cells, reliably pick the target cells up, and achieve deep-coverage proteomic analysis to these cells at the single-cell level to highlight the individual protein differences between cells with different apparent migratory properties.
We used HeLa cells to perform the scratch assay, and 46 single cells with significant migratory behaviors (Fig. 5a, Supplementary Data 1) and 43 single (control) cells without obvious migratory behaviors were captured and analyzed with the PiSPA platform. Under the DIA (MBR) mode, an average of 2544 (2058–3308) and 2893 (1896–3710) protein groups were quantified from single migrated cells (n = 46) and control cells (n = 43), respectively (Supplementary Data 2, Supplementary Data 3). Among them, over 78% of the proteins were quantified by two or more feature peptides (Supplementary Fig. 7).
After batch correction and normalization, the data were clustered into three distinct clusters by UMAP (Fig. 5b). Cluster 1 (n = 42) predominantly consisted of migrated cells (n = 35, 83%), while cluster 2 (n = 31) and cluster 3 (n = 16) consisted mainly of control cells (n = 26 and n = 10, 83% and 63%, respectively). By individually comparing the three clusters, 226 significantly different proteins were screened out (Wilcoxon test, p.adj < 0.05, fold change >2). Hierarchical cluster analysis (HCA) was utilized to quantitatively assess the differences in protein compositions among the three clusters (Fig. 5c) to feature the heterogeneity among the different clusters.
Compared to cluster 2, there were 33 proteins significantly upregulated and 13 proteins significantly downregulated in cluster 1 (Supplementary Fig. 8, Wilcoxon test, p.adj < 0.05, fold change > 2). Among them, the upregulated proteins such as Orc149, TGFBI50, MSLN51, HtrA152,53 were reported to be associated with the migration and invasion of tumor cells. The expression levels of these proteins in the different clusters were quantified (Supplementary Fig. 9), and it was observed that these proteins exhibited significantly downregulated in both cluster 2 and cluster 3, which corresponded to the relatively weak apparent migration behavior of most of the cells in the two clusters.
We further compared the protein expression differences between cluster 1 and cluster 3. In cluster 1, there were 135 proteins exhibiting upregulation and 36 proteins showing downregulation (Fig. 5d, Wilcoxon test, p.adj < 0.05, fold change >2). Among them, 11 upregulated proteins, including Cdc42, Rac1, RhoA, EZR, THBS1, Itgb6, Rap1A, MYL12A, Myl9, MSN, and Arpc1a, were involved in signal pathways of focal adhesion and regulation of actin cytoskeleton (Fig. 5e, Kyoto Encyclopedia of Genes and Genomes (KEGG)54 enrichment analysis, p.adj < 0.05), which were reported to be closely related to cell migration55,56. In the previously-reported cell migration models57,58,59,60, the function of Cdc42 is usually recognized as inducing filopodia formation and regulating the direction of cell migration, the function of Rac1 is to induce lamellipodia formation and establish new adhesion sites at the leading edge to pull the cell forward, and the function of RhoA is to promote the contraction of the posterior actin to detach from adhesions, eventually leading to the migration of cells. We clearly observed a significant upregulation of these above-mentioned proteins in cluster 1 compared to cluster 3, implying the activation of the relevant migration pathways, which was in agreement with the fact that the cluster 1 cells exhibited significant apparent migration property in the migration assay. Interestingly, we compared the expression levels of the three proteins at the single-cell level in the three clusters (Fig. 5f) and the results showed that the expressions of the three proteins in cluster 2 cells were also higher than those in cluster 3, showing obvious heterogeneity among the control cells. This result implied that although most cells in both cluster 2 and cluster 3 did not show significant migration behaviors, probably the reasons for their lack of significant migration behavior were different. The upregulations of the migration-associated proteins in cluster 2 cells implied the potential strong migration activity of these cells, while they did not show actual migration behaviors presumably due to the lack of space around these cells for migration. Such a cell protein heterogeneity that exists within a cell group with similar phenotypic behaviors is difficult to observe and explore with the conventional bulk cell experiments except using the single-cell proteomic analysis technique.
The use of the “pick-up” approach enabled the observation of evident cellular heterogeneity in cell migration, such as the presence of a small number of mixed cells within the same clusters. However, limited by the relatively rough operation mode of the scratch experiment in the condition control and monitoring of cell migration, the current experimental data could not provide a definitive explanation for these types of cellular heterogeneity, implying the possible existence of more complex relationships between cell migration behaviors and protein expression. For further validation and in-depth study on these relationships in the future, microfluidic chips could be used to precisely control the position and state of the initial cells and a live cell workstation could be used to monitor the whole migration process for tracking the migration paths of each tested cells.
The present application of single-cell proteomics in cell migration experiment is just a preliminary proof-of-principle attempt, while it could demonstrate that the PiSPA platform has the potential to reveal the intrinsic control factors behind the apparent behaviors that were previously difficult to detect using the conventional omics approaches for large amounts of cell samples, and could provide an effective tool for the cell migration studies as well as the development of anticancer approaches and drugs.
Discussion
In the present work, we adopted the single-cell customization strategy throughout the whole process of the platform building, methodology development and performance improvement for single-cell proteomic analysis, covering workflows of single-cell sorting, sample pretreatment, chromatographic injection and separation, mass spectrometry detection and data processing. A total solution was developed to achieve a leap improvement in protein identification depth and reliable and convenient operation of single-cell analysis. The PiSPA platform achieved single-cell sorting using the pick-up mode, which is different from the flow cytometry3, CellenONE61, or limiting dilution based13,62 single-cell sorting approaches. Under the pick-up mode, arbitrary cells of interest can be selected as the target cells in the microscopic field of view based on the bright-field or fluorescence information or other properties of the cells, and the target cells are picked individually by the capillary probe under microscopic monitoring. Therefore, this approach has strong controllability, definiteness and reliability in the selection and picking of the target cells, while retaining the phenotypic and spatial information of these target cells. This feature has been demonstrated in the picking of migrated cells in the cell migration experiment, which is difficult to achieve using the flow cytometry-based single-cell sorting techniques. Although the throughput (~20 s per cell) of the PiSPA platform in single-cell sorting is far lower than those of the flow cytometry-based systems, it can still match the current throughput of tens of samples per day in most of single-cell proteomic analysis systems. If needed, the cell sorting throughput could be further increased multiplicatively by using an array of capillary probes.
The PiSPA platform integrated the microfluidic liquid handling robot with the insert tube array to achieve the loading of single-cell droplets and subsequent multi-step nanoliter-scale sample pretreatment, using the robot’s pick-up operation and the conical bottom tips of the insert tubes as the nanoliter reaction vessels. Compared with the microchip-based reactors fabricated with special equipment and complex procedures13,16,17,18,25,30,31,35, the insert tubes have the merits of low-cost, easily available, and convenient to use. In addition, such an arrangement also enabled the seamless integration of single-cell sorting, nanoliter-scale sample pretreatment and subsequent automated liquid chromatography injection of the samples, which significantly improved the operability, reliability and success rate of the whole workflow of single cell proteomic analysis. This is of significance for promoting the practical popularization of single-cell proteomic analysis.
Considering that the conventional proteomic analysis of samples containing thousands of cells can typically identify about 5000–7000 protein groups8,18, the PiSPA platform has the ability to quantify nearly half this level for a single mammalian cell with a significant improvement over many current single-cell proteomic analysis systems. This may imply that the single-cell proteomics research has entered a stage of practical application in a wide range of biomedical research fields.
Looking back at the breakthrough in identification depth of single-cell transcriptome sequencing technology ten years ago63,64,65,66, the Smart-seq-based single-cell RNA-seq technique could identify around 30,000 transcripts from one human mammalian cell63, accounting for 13% of the total human transcripts (ca. 230,00067). In this study, up to 3,000 proteins quantified in one human mammalian cell account for 15% of the total human proteins (ca. 20,00067), reaching the similar level of single-cell RNA-seq technique at that time. Considering the explosive development of the single-cell transcriptome sequencing technology over the past decade68,69,70, we may now be on the eve of or even in the outbreak of the single-cell protein sequencing technology based on the shotgun proteomics strategy.
In the near future, the identification depth and throughput of single-cell proteomic analysis will be further improved, reaching the level of practical and popular application. In addition, by adopting the microfluidic sample pretreatment technique, it will be possible to combine it with single-cell genome, transcriptome, and metabolome analysis technologies to form a “true” single-cell multi-omics analysis technology for single-cell individuals. These will undoubtedly bring unprecedented powerful tools for people to understand the variations of cellular heterogeneity in life activities.
Methods
Cell culture
HeLa (SCSP-504), A549 (SCSP-503) and U2OS (SCSP-5030) cells were purchased from the Cell Bank of the Chinese Academy of Sciences. The HeLa and U2OS cells were maintained in Dulbecco’s modified eagle medium (DMEM) with high glucose/pyruvate (Invitrogen) supplemented with 10% fetal bovine serum (FBS) (Gibco). The A549 cells were maintained in F-12K supplemented with 10% FBS. All cell lines were maintained in a 5% CO2 incubator at 37 °C. For single-cell analysis, cells in a 6 cm dish were collected at 60–80% confluency using 0.25% trypsin with 0.02% EDTA, and washed five times with phosphate buffer solution (PBS). All cell lines were authenticated by STR profiling (BIOWING, Shanghai). No positive sign of mycoplasma contamination for all cell lines.
Single cell capture
The microfluidic liquid handling robot integrated in the PiSPA platform was developed based on a SODA system for single-cell soring37. The SODA technique was first developed by the authors’ group in 2013 for achieving automated picoliter to nanoliter liquid manipulation38,39 and has been applied in high-throughput screening, single cell analysis13,37,39, microscale cell assays, and micro-sample analysis. The liquid handling robot consisted of a microscopic imaging module for cell observation and identification, a capillary probe (10 cm length, 250 μm o.d., 150 μm i.d., tip size, 35 μm i.d.) connected with a high-precision syringe pump (1701N, Hamilton) for nanoliter-scale liquid metering/handling, an array of insert tubes (4 mm inner diameter, 31 mm height, 300 μL volume, ALWSCI Co.) for loading nanoliter-scale single-cell droplets and reaction solutions, an automated x-y-z translation stage for controlling the movement of other modules, and a system control module for the whole robot under the control of a computer program. In addition to cell sorting, the function of the robot was extended to complete nanoliter-scale 5-step single-cell sample pretreatment reactions. Instead of droplet array chips13 frequently used in the previous SODA systems, we used the array of insert tubes with their conical bottom tips as nanoliter reactors to improve the operability and reliability of the system as well as to facilitate the subsequent automated liquid chromatography injection. Based on this improved robotic system, we proposed the pick-up mode-based automated cell sorting and sample pretreatment approach, whereby the whole workflow of deep single-cell proteomic analysis was established.
For picking up target cells, 2 mL of cell PBS suspension was added in a petri dish (35 mm diameter) fixed on the translational stage of the robot. Usually, the cell dispersion density at the bottom of the culture dish was controlled to be less than 10,000 cells/cm2 (corresponding to a cell suspension density of <100,000 cells/mL), for ensuring the sufficient distances between the target cells and the surrounding adjacent cells (usually >30 μm) to avoid the adjacent non-target cells to be sucked into the capillary probe. The microscopic imaging module first took bright-field or fluorescence image of the cells in the cell suspension settled on the bottom of the petri dish. The target cells in the image were selected based on their bright-field apparent property or labeled fluorescence signals, and their location coordinates were automatically calculated by the control module. Then the tapered tip of the capillary probe was controlled to automatically align the target cell by moving the translational stage and suck it into the probe by aspirating 15 nL of cell suspension into the probe by the syringe pump. Before the cell picking-up operation, the capillary probe was prefilled with 50 mM NH4HCO3 solution. After the target cell was picked up by the capillary probe, it was then deposited to the bottom of an insert tube by dispensing 400 nL of the solution in the probe. Since the present pick-up mode employed the procedure of picking one target cell at one time and immediately dispensing it into the insert tube, and the capillary tip had an inner diameter of 35 µm which was larger than the cell size, capillary blockage was rarely encountered during the cell picking and dispensing process.
Single-cell sample pretreatment
After picking up the single target cell into the insert tube, we used the robot to perform the subsequent sample pretreatment operations, including cell lysis, protein reduction, alkylation, enzymatic digestion and termination of the digestion reaction. First, 100 nL of 0.3% (w/v) RapiGest SF (Waters) solution was added to the insert tube, and the insert tube was inserted into a sample vial with a sealing cap and 100 μL of water prefilled in the vial. Multiple sample vials were formed an array of vials, which was heated in an oven for 20 min at 60 °C to lyse the cells. After the sample vials were cooled to room temperature, 100 nL of 20 mM tris (2-carboxyethyl) phosphine (TCEP) was added to each insert tube, for performing the protein reduction reaction for 20 min at room temperature, and 100 nL of 125 mM iodoacetamide (IAA) was added to conduct the alkylation reaction to the protein cysteine residues at room temperature for 15 min in dark. Then, 100 nL of a mixed enzyme (0.05 μg/μL Lys-C and 0.05 μg/μL trypsin) solution was added to the insert tube to digest the proteins for 2 h at 37 °C. Finally, 100 nL of 40% formic acid (FA) solution was added to terminate the enzymatic reaction with an incubation time of 30 min at room temperature. To avoid cross-contamination between different reagents and samples, the capillary probe was washed with water three times before aspirating new reagents or samples. The sample vials with the insert tubes were placed in the sample tray of the LC autosampler for LC-MS/MS analysis.
During the single-cell dispensing and reagent addition processes, the caps of the sample vials were lifted and the sample vials were in open state to facilitate the capillary probe inserting into the insert tube for performing the relevant operations. Since these single-cell dispensing and reagent addition operations took only 5-10 s per tube and the sample solutions in the insert tubes were protected by the large volume of water added to the sample vials, the evaporation of hundreds of nanoliters of sample solutions during these operations was at a negligible level. In the reaction incubation and chromatographic injection processes, the sample vial caps were in a sealed closed position to avoid the significant evaporation losses of the sample solutions during prolonged processes of heating incubation or waiting in line for LC injection. In addition, the sample vial caps were installed with a seal membrane in each cap that could be penetrated by the sampling probe in the autosampler to allow it to insert into the single-cell sample solutions for sampling and LC injection.
For the condition optimizing experiments (Fig. 2), 24 HeLa cells were analyzed for the optimization of the enzyme/protein ratio (3 cells for each of the 8 conditions), and 18 HeLa cells for the optimization of the LC gradient time (3 cells for each of the 6 conditions). For the single-cell analysis (Fig. 3), 256 cells were analyzed (37 A549, 44 HeLa and 27 U2OS cells in DIA mode; 56 A549, 68 HeLa and 24 U2OS cells in DDA mode).
Preparation of quality control samples
Commercial HeLa protein digest powders (Thermo Scientific) were used to prepare standard HeLa digests at concentrations of 0.2 ng/μL, 0.4 ng/μL, 0.6 ng/μL, 0.8 ng/μL, and 1.0 ng/μL, which served as quality control (QC) samples for evaluating the performance of the LC-MS system. For the condition optimizing experiments (Fig. 2), 20 QC samples (0.2 ng/μL) were analyzed for LC-MS repeatability evaluation (10 samples for DDA and 10 samples for DIA). For the quantitative accuracy and precision evaluation (Fig. 4), 30 QC samples were analyzed (6 samples for each of the 5 concentrations). Two QC samples were used to evaluate the effect of RapiGest SF on single cell proteomic analysis (Supplementary Fig. 10).
The QC samples at the single-cell level were prepared by mixing eight digested single HeLa cell samples and then aliquoting it into eight portions. The single-cell QC samples for testing the quantitative accuracy of the system were divided into two groups (S1 and S2) with four samples in each group. For the S1 group, 50 pg of yeast peptides and 150 pg of E. coli peptides were added to each sample; for the S2 group, 100 pg of yeast peptide and 100 pg of E. coli peptide were added to each sample. For the quantitative accuracy and precision evaluation (Fig. 4), 8 mixed-species samples (4 samples for each of the 2 conditions) and 24 single/multi-cell samples (4 samples for each of the 6 conditions) were analyzed (Supplementary Fig. 11).
Cell migration experiment
The cell migration experiment was performed using the scratch assay mode. A scratch (blank area) was first drawn by a 1-mL pipette tip on a densely growing monolayer of HeLa cells in a petri dish. After washing with PBS, a new culture medium was added to the petri dish and no cells could be observed in the blank area. After 24-h culture at 37 °C, a small number of cells migrated into the blank area. Utilizing the ability of the liquid handling robot in picking up each target cell accurately and reliably, we employed it to pick up 46 cells in the blank area and 43 cells (control cells) outside the blank area for single-cell proteomics analysis.
LC-MS/MS analysis
A capillary LC column (10 cm length, 360 o.d., 50 μm i.d.) packed with 1.7 μm C18 particles (120 Å pore size, Nanomicro Co.) was used in the LC separation of the digested peptide samples. The single-cell samples in the insert tubes were injected by an autosampler coupled to an EASY-nLC 1200 LC (ThermoFisher Scientific). Mobile phases (A, 0.1% formic acid in water; B, 0.1% formic acid, 80% ACN in water) with a 21 min gradient (0–13 min, 3–40% B; 13–14 min, 40–100% B; 14–21 min, 100% B) at a flow rate of 150 nL/min was used for the separation of single-cell samples.
The separated peptide components of single cell samples were detected by a trapped ion mobility-time of flight mass spectrometer (timsTOF Pro, Bruker). Data dependent acquisition (DDA) mode and data independent acquisition (DIA) mode were used in the MS data acquisition of both the QC sample and single-cell samples to compare their performance in single cell analysis, since it was reported previously that the DIA mode generally has an advantage in protein identification depth over the DDA mode under the same LC conditions19. For DDA mode, the range of the ion mobility 1/k0 was 0.75–1.3, and the parallel accumulation serial fragmentation (PASEF) acquisition mode was used. The m/z acquisition range was 300–1500, the spray voltage was 1750 V, and the ion accumulation time was 166 ms. The precursor ions obtained by the primary mass spectrum underwent secondary fragmentation through collision induced dissociation (CID). The size of the precursor ion isolation window was related to the ion m/z. For ions with m/z below 700, the separation window was 2, for ions with m/z above 800, the separation window was 3. The dynamic rejection time was 0.4 min, and the CID collision energy was in the range of 20–59 eV, which varied with ion mobility. To obtain the optimal MS detection under the DIA mode, we tested the acquisition range of m/z = 384-1171 and the isolation window of m/z = 20-40 with both samples of 200 pg HeLa digest and single HeLa cells, and finally selected the optimized m/z acquisition range of 399-1124 and isolation window of m/z = 25. In addition, we optimized the ramp time to be 166 ms, to accommodate the low protein input at the single cell level.
MS data analysis
Mass spectrometry data was collected using Compass Hystar software (version 5.1). The DDA raw files were analyzed with SpectroMine software (version3.2, Biognosys AG, Schlieren, Switzerland) using default settings. Protein label-free quantitative identification from the DDA dataset was searched against the UniProt Proteomes Homo sapiens database (accession: UP000005640, taxon ID: 9606, 20,626 entries, access date 2021-01). Peptides with a length of 7 to 52 amino acids were considered for the search. Enzyme specificity was set to trypsin cleaving C-terminal to arginine and lysine. A maximum of one missed cleavage were allowed. Cysteine carbamidomethylation was set as a static modification, acetylation on protein N-terminus and oxidation on methionine set as variable modifications. The peptide-spectrum matches (PSM), peptides and protein groups were filtered at 1% false discovery rates (FDR). The DIA raw files were analyzed with DIA-NN71 software (version 1.8) in library-free search mode using default settings. The DIA raw data files for single cells were searched against UniProt Proteomes Homo sapiens database. The DIA raw data files of single-cell QC spiked with two species were searched against UniProt Proteomes Homo sapiens, Swiss-Prot Escherichia coli (strain K12) (taxon ID: 83333, 4530 entries, access date 2023-07) and Saccharomyces cerevisiae (strain ATCC 204508 / S288c) (taxon ID: 559292, 6727 entries, access date 2023-07) database. Peptides with lengths of 7 to 52 amino acids were considered for the search, and the Q-values were controlled at 1% on precursor and protein group level.
The pairwise correlation analysis between the protein quantification data of the QC samples were performed by Python (version 3.9.15) and the package Pandas (version 1.4.4). The fold change in protein abundance and density distribution of coefficient of variation were analyzed by R (version 4.1.3). Proteins with missing values ≥ 50% were excluded from the CV calculation. Forty histone-related proteins were searched out among the quantified proteins from multiple-cell HeLa data with the keyword “histone”.
Bioinformatics analysis
Single-cell proteomics data were analyzed by R (version 4.1.3) and the R package Seurat72 (version 4.3.0). Proteins quantified in <3 cells were excluded and missing values were imputed with zeros. The protein quantities were normalized to sum to 10,000 for each cell, and then transformed to natural logarithm. Batch effects of samples were corrected by the R package harmony73 (version 0.1.1). Dimensional reduction was performed by principal components analysis (PCA) and the top 25 principal components were used for the uniform manifold approximation and projection (UMAP) analysis. The cells were divided into 3 clusters using the Louvain binning algorithm (Supplementary Fig. 12, Supplementary Note 1). The protein quantities were compared between clusters pairwisely, where only proteins with no more than 75% missing values across all the compared cells were taken into consideration. Differential proteins were determined by the Wilcoxon test (p.adj <0.05 by the Benjamini-Hochberg method, fold change > 2). Kyoto Encyclopedia of Genes and Genomes (KEGG)45 enrichment was carried out by the R package clusterProfiler74 (version 4.8.1).
The 2D-UMAP of the tumor cells was visualized by Python and the package umap-learn (version 0.5.3). The 3D-UMAP in the cell migration study was visualized by the R package plot3D (version 1.4), and the heatmap was visualized by the R package ComplexHeatmap75 (version 2.16.0). Other data were visualized by the R package ggplot2 (version 3.3.5).
Reporting summary
Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.
Data availability
The mass spectrometry proteomics data generated in this study have been deposited to the ProteomeXchange Consortium via the iProX partner repository76,77 with the dataset identifier PXD041966 or IPX0006351000. Homo sapiens (accession: UP000005640, taxon ID: 9606, 20,626 entries, access date 2021-01), Escherichia coli (strain K12) (taxon ID: 83333, 4530 entries, access date 2023-07) and Saccharomyces cerevisiae (strain ATCC 204508 / S288c) (taxon ID: 559292, 6727 entries, access date 2023-07) protein databases were downloaded from UniProt [https://www.uniprot.org], and have also been deposited to the ProteomeXchange/iProX repository. Source data are provided with this paper.
References
Macosko, E. Z. et al. Highly parallel genome-wide expression profiling of individual cells using nanoliter droplets. Cell 161, 1202–1214 (2015).
Klein, A. M. et al. Droplet barcoding for single-cell transcriptomics applied to embryonic stem cells. Cell 161, 1187–1201 (2015).
Krutzik, P. O., Crane, J. M., Clutter, M. R. & Nolan, G. P. High-content single-cell drug screening with phosphospecific flow cytometry. Nat. Chem. Biol. 4, 132–142 (2008).
Thul, P. J. et al. A subcellular map of the human proteome. Science 356, eaal3321 (2017).
Kang, C. et al. Single cell-resolution western blotting. Nat. Protoc. 11, 1508–1530 (2016).
Bendall, S. C. et al. Single-cell mass cytometry of differential immune and drug responses across a human hematopoietic continuum. Science 332, 687–696 (2011).
Meyer, J. G., Niemi, N. M., Pagliarini, D. J. & Coon, J. J. Quantitative shotgun proteome analysis by direct infusion. Nat. Methods 17, 1222–1228 (2020).
Angelidis, I. et al. An atlas of the aging lung mapped by single cell transcriptomics and deep tissue proteomics. Nat. Commun. 10, 963 (2019).
Meier, F., Park, M. A. & Mann, M. Trapped ion mobility spectrometry and parallel accumulation-serial fragmentation in proteomics. Mol. Cell. Proteom. 20, 100138 (2021).
Meier, F. et al. Online parallel accumulation-serial fragmentation (PASEF) with a novel trapped ion mobility mass spectrometer. Mol. Cell. Proteom. 17, 2534–2545 (2018).
Budnik, B., Levy, E., Harmange, G. & Slavov, N. SCoPE-MS: mass spectrometry of single mammalian cells quantifies proteome heterogeneity during cell differentiation. Genome Biol. 19, 161 (2018).
Cong, Y. et al. Improved single-cell proteome coverage using narrow-bore packed nanoLC columns and ultrasensitive mass spectrometry. Anal. Chem. 92, 2665–2671 (2020).
Li, Z. et al. Nanoliter-scale oil-air-droplet chip-based single cell proteomic analysis. Anal. Chem. 90, 5430–5438 (2018).
Shao, X. et al. Integrated proteome analysis device for fast single-cell protein profiling. Anal. Chem. 90, 14003–14010 (2018).
Petelski, A. A. et al. Multiplexed single-cell proteomics using SCoPE2. Nat. Protoc. 16, 5398–5425 (2021).
Woo, J. et al. High-throughput and high-efficiency sample preparation for single-cell proteomics using a nested nanowell chip. Nat. Commun. 12, 6246 (2021).
Leduc, A., Huffman, R. G., Cantlon, J., Khan, S. & Slavov, N. Exploring functional protein covariation across single cells using nPOP. Genome Biol. 23, 261 (2022).
Gebreyesus, S. T. et al. Streamlined single-cell proteomics by an integrated microfluidic chip and data-independent acquisition mass spectrometry. Nat. Commun. 13, 37 (2022).
Brunner, A. D. et al. Ultra-high sensitivity mass spectrometry quantifies single-cell proteome changes upon perturbation. Mol. Syst. Biol. 18, e10798 (2022).
Derks, J. et al. Increasing the throughput of sensitive proteomics by plexDIA. Nat. Biotechnol. 41, 50–59 (2023).
Ctortecka, C., Stejskal, K., Krssakova, G., Mendjan, S. & Mechtler, K. Quantitative accuracy and precision in multiplexed single-cell proteomics. Anal. Chem. 94, 2434–2443 (2022).
Petrosius, V. et al. Exploration of cell state heterogeneity using single-cell proteomics through sensitivity-tailored data-independent acquisition. Nat. Commun. 14, 5910 (2023).
Furtwängler, B. et al. Real-time search-assisted acquisition on a Tribrid mass spectrometer improves coverage in multiplexed single-cell proteomics. Mol. Cell Proteom. 21, 100219 (2022).
Orsburn, B. C., Yuan, Y. & Bumpus, N. N. Insights into protein post-translational modification landscapes of individual human cells by trapped ion mobility time-of-flight mass spectrometry. Nat. Commun. 13, 7246 (2022).
Dou, M. et al. High-throughput single cell proteomics enabled by multiplex isobaric labeling in a nanodroplet sample preparation platform. Anal. Chem. 91, 13119–13127 (2019).
Specht, H. et al. Single-cell proteomic and transcriptomic analysis of macrophage heterogeneity using SCoPE2. Genome Biol. 22, 50 (2021).
Zhu, Y. et al. Proteomic analysis of single mammalian cells enabled by microfluidic nanodroplet sample preparation and ultrasensitive nanoLC-MS. Angew. Chem. Int. Ed. 57, 12370–12374 (2018).
Zhu, Y. et al. Proteome profiling of 1 to 5 spiked circulating tumor cells isolated from whole blood using immunodensity enrichment, laser capture microdissection, nanodroplet sample processing, and ultrasensitive nanoLC-MS. Anal. Chem. 90, 11756–11759 (2018).
Cong, Y. et al. Ultrasensitive single-cell proteomics workflow identifies >1000 protein groups per mammalian cell. Chem. Sci. 12, 1001–1006 (2020).
Williams, S. M. et al. Automated coupling of nanodroplet sample preparation with liquid chromatography-mass spectrometry for high-throughput single-cell proteomics. Anal. Chem. 92, 10588–10596 (2020).
Tsai, C. F. et al. An improved boosting to amplify signal with isobaric labeling (iBASIL) strategy for precise quantitative single-cell proteomics. Mol. Cell. Proteom. 19, 828–838 (2020).
Liang, Y. et al. Fully automated sample processing and analysis workflow for low-input proteome profiling. Anal. Chem. 93, 1658–1666 (2021).
Schoof, E. M. et al. Quantitative single-cell proteomics as a tool to characterize cellular hierarchies. Nat. Commun. 12, 3341 (2021).
Matzinger, M., Müller, E., Dürnberger, G., Pichler, P. & Mechtler, K. Robust and easy-to-use one-pot workflow for label-free single-cell proteomics. Anal. Chem. 95, 4435–4445 (2023).
Li, Y. et al. An integrated strategy for mass spectrometry-based multiomics analysis of single cells. Anal. Chem. 93, 14059–14067 (2021).
Ctortecka, C. et al. An automated workflow for multiplexed single-cell proteomics sample preparation at unprecedented sensitivity. Preprint at bioRxiv https://doi.org/10.1101/2021.04.14.439828 (2022).
Wang, Y. et al. A microfluidic robot for rare cell sorting based on machine vision identification and multi-step sorting strategy. Talanta 226, 122136 (2021).
Dong, Z. & Fang, Q. Automated, flexible and versatile manipulation of nanoliter-to-picoliter droplets based on sequential operation droplet array technique. TrAC, Trends Anal. Chem. 124, 115812 (2020).
Zhu, Y., Zhang, Y. X., Cai, L. F. & Fang, Q. Sequential operation droplet array: an automated microfluidic platform for picoliter-scale liquid handling, analysis, and screening. Anal. Chem. 85, 6723–6731 (2013).
Hildonen, S., Halvorsen, T. G. & Reubsaet, L. Why less is more when generating tryptic peptides in bottom-up proteomics. Proteomics 14, 2031–2041 (2014).
Norrgran, J. et al. Optimization of digestion parameters for protein quantification. Anal. Biochem. 393, 48–55 (2009).
Xin, L. et al. A streamlined platform for analyzing tera-scale DDA and DIA mass spectrometry data enables highly sensitive immunopeptidomics. Nat. Commun. 13, 3108 (2022).
Pappireddi, N., Martin, L. & Wühr, M. A Review on quantitative multiplexed proteomics. Chembiochem 20, 1210–1224 (2019).
Nakagawa, H. et al. Mismatch repair gene PMS2: disease-causing germline mutations are frequent in patients whose tumors stain negative for PMS2 protein, but paralogous genes obscure mutation detection and interpretation. Cancer Res. 64, 4721–4727 (2004).
Persaud, A. et al. Comparison of substrate specificity of the ubiquitin ligases Nedd4 and Nedd4-2 using proteome arrays. Mol. Syst. Biol. 5, 333 (2009).
Wisniewski, J. R., Hein, M. Y., Cox, J. & Mann, M. A “Proteomic Ruler” for protein copy number and concentration estimation without spike-in standards. Mol. Cel. Proteom. 13, 3497–3506 (2014).
De Pascalis, C. & Etienne-Manneville, S. Single and collective cell migration: the mechanics of adhesions. Mol. Biol. Cell 28, 1833–1846 (2017).
Pijuan, J. et al. In vitro cell migration, invasion, and adhesion assays: from cell imaging to data analysis. Front. Cell. Dev. Biol. 7, 107 (2019).
Jian, Y., Qiao, Q., Tang, J. & Qin, X. Origin recognition complex 1 regulates phospholipase Cdelta1 to inhibit cell proliferation, migration and epithelial-mesenchymal transition in lung adenocarcinoma. Oncol. Lett. 24, 252 (2022).
Shang, D., Liu, Y., Yang, P., Chen, Y. & Tian, Y. TGFBI-promoted adhesion, migration and invasion of human renal cell carcinoma depends on inactivation of von Hippel-Lindau tumor suppressor. Urology 79, 961–966 (2012).
Marin-Muller, C. et al. A tumorigenic factor interactome connected through tumor suppressor microRNA-198 in human pancreatic cancer. Clin. Cancer Res. 19, 5901–5913 (2013).
Pei, X., Ma, K., Xu, J., Wang, N. & Liu, N. Inhibition of cell proliferation and migration after HTRA1 knockdown in retinal pigment epithelial cells. Graefes Arch. Clin. Exp. Ophthalmol. 253, 565–572 (2015).
Chien, J. et al. Serine protease HtrA1 associates with microtubules and inhibits cell migration. Mol. Cell. Biol. 29, 4177–4187 (2009).
Kanehisa, M. & Goto, S. KEGG: Kyoto encyclopedia of genes and genomes. Nucleic Acids Res. 28, 27–30 (2000).
Paluch, E. K., Aspalter, I. M. & Sixt, M. Focal adhesion-independent cell migration. Annu. Rev. Cell Dev. Biol. 32, 469–490 (2016).
Yamaguchi, H. & Condeelis, J. Regulation of the actin cytoskeleton in cancer cell migration and invasion. Biochim. Biophys. Acta 1773, 642–652 (2007).
Raftopoulou, M. & Hall, A. Cell migration: Rho GTPases lead the way. Dev. Biol. 265, 23–32 (2004).
Friedl, P. & Wolf, K. Tumour-cell invasion and migration: Diversity and escape mechanisms. Nat. Rev. Cancer 3, 362–374 (2003).
Yamao, M. et al. Distinct predictive performance of Rac1 and Cdc42 in cell migration. Sci. Rep. 5, 17527 (2015).
Marston, D. J. et al. Multiplexed GTPase and GEF biosensor imaging enables network connectivity analysis. Nat. Chem. Biol. 16, 826–833 (2020).
Vallone, V. F. et al. Methods for automated single cell isolation and sub-cloning of human pluripotent stem cells. Curr. Protoc. Stem. Cell Biol. 55, e123 (2020).
Eshghi, A. et al. Sample preparation methods for targeted single-cell proteomics. J. Proteome Res. 22, 1589–1602 (2023).
Ramsköld, D. et al. Full-length mRNA-Seq from single-cell levels of RNA and individual circulating tumor cells. Nat. Biotechnol. 30, 777–782 (2012).
Islam, S. et al. Highly multiplexed and strand-specific single-cell RNA 5’ end sequencing. Nat. Protoc. 7, 813–828 (2012).
Hashimshony, T., Wagner, F., Sher, N. & Yanai, I. CEL-Seq: single-cell RNA-Seq by multiplexed linear amplification. Cell Rep. 2, 666–673 (2012).
Picelli, S. et al. Smart-seq2 for sensitive full-length transcriptome profiling in single cells. Nat. Methods 10, 1096–1098 (2013).
Nurk, S. et al. The complete sequence of a human genome. Science 376, 44–53 (2022).
Lebrigand, K., Magnone, V., Barbry, P. & Waldmann, R. High throughput error corrected nanopore single cell transcriptome sequencing. Nat. Commun. 11, 4025 (2020).
Hagemann-Jensen, M. et al. Single-cell RNA counting at allele and isoform resolution using Smart-seq3. Nat. Biotechnol. 38, 708–714 (2020).
Philpott, M. et al. Nanopore sequencing of single-cell transcriptomes with scCOLOR-seq. Nat. Biotechnol. 39, 1517–1520 (2021).
Demichev, V., Messner, C. B., Vernardis, S. I., Lilley, K. S. & Ralser, M. DIA-NN: neural networks and interference correction enable deep proteome coverage in high throughput. Nat. Methods 17, 41–44 (2020).
Hao, Y. et al. Integrated analysis of multimodal single-cell data. Cell 184, 3573–3587 (2021).
Korsunsky, I. et al. Fast, sensitive and accurate integration of single-cell data with Harmony. Nat. Methods 16, 1289–1296 (2019).
Wu, T. et al. clusterProfiler 4.0: A universal enrichment tool for interpreting omics data. Innov.-Amst. 2, 100141 (2021).
Gu, Z. & Hubschmann, D. Make interactive complex heatmaps in R. Bioinformatics 38, 1460–1462 (2022).
Ma, J. et al. iProX: an integrated proteome resource. Nucleic Acids Res. 47, D1211–D1217 (2019).
Chen, T. et al. iProX in 2021: connecting proteomics data sharing with big data. Nucleic Acids Res. 50, D1522–D1527 (2022).
Acknowledgements
We are grateful to Professor Peng-Yuan Yang (Fudan University) for his initial suggestion in single-cell proteomic analysis, Professor Catherine C. L. Wong (Peking Union Medical College Hospital) and Dr. Zi-Yi Li (Zhejiang University) for their technical assistance in LC-MS/MS systems, and Dr. Cheng-Pin Shen (Shanghai Omicsolution Co.) for his help in data analysis. The work was funded by grants from National Ministry of Science and Technology of China (2021YFA1301601 to Q.F.), National Natural Science Foundation of China (21827806 and 22234007 to Q.F.), and Fundamental Research Funds for the Central Universities of China (226-2023-00138 to Q.F.).
Author information
Authors and Affiliations
Contributions
Q.F., Y.W. and Z.Y.G. conceived the idea. Y.W. and H.F.W. built the instrument. Y.W. and H.F.W. wrote the software. Z.Y.G., J.Z. and Q.W. cultured the cells. S.W.S., Z.Y.G., Y.R.J. and Q.W. explored and optimized the LC-MS conditions. Y.W. and Z.Y.G. planned and performed experiments. Y.R.J., Q.W., S.W.S. and W.X.Y. provided nano-LC columns. Y.W., Z.Y.G., Y.Y., L.Z., Y.R.J., J.W., Q.W., J.B.C., W.X.Y., Q.Q.X., Q.X.F., L.W., J.F. and Q.F. contributed to data analysis and interpretation, as well as figure preparation. Q.F., Y.W., Z.Y.G. and J.B.C. wrote the manuscript. Y.W., Y.Y., J.Z., Y.R.J, Q.Q.X., L.Z. and Q.F. contributed to the supplementary experiments and manuscript revision. Q.F. and J.Z.P. supervised the studies and acquired funding.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interest.
Peer review
Peer review information
Nature Communications thanks Jeroen Krijgsveld and the other, anonymous, reviewers for their contribution to the peer review of this work.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Source data
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Wang, Y., Guan, ZY., Shi, SW. et al. Pick-up single-cell proteomic analysis for quantifying up to 3000 proteins in a Mammalian cell. Nat Commun 15, 1279 (2024). https://doi.org/10.1038/s41467-024-45659-4
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41467-024-45659-4
- Springer Nature Limited