SCOPE-Seq: a scalable technology for linking live cell imaging and single-cell RNA sequencing
Optically decodable beads link the identity of a sample to a measurement through an optical barcode, enabling libraries of biomolecules to be captured on beads in solution and decoded by fluorescence. This approach has been foundational to microarray, sequencing, and flow-based expression profiling technologies. We combine microfluidics with optically decodable beads and show that phenotypic analysis of living cells can be linked to single-cell sequencing. As a proof-of-concept, we demonstrate the accuracy and scalability of our tool called Single Cell Optical Phenotyping and Expression sequencing (SCOPE-Seq) to combine live cell imaging with single-cell RNA sequencing.
KeywordsSingle-cell RNA-Seq Live cell imaging Microfluidics
Recent advances in microfluidics and cDNA barcoding have led to a dramatic increase in the throughput of single-cell RNA-Seq (scRNA-Seq) [1, 2, 3, 4, 5]. However, unlike earlier or less scalable techniques [6, 7, 8], these new tools do not offer a straightforward way to directly link phenotypic information obtained from individual, live cells to their expression profiles. Nonetheless, microwell-based implementations of scRNA-Seq are compatible with a wide variety of phenotypic measurements including live cell imaging, immunofluorescence, and protein secretion assays [3, 9, 10, 11, 12]. These methods involve co-encapsulation of individual cells and barcoded mRNA capture beads in arrays of microfabricated chambers. Because the barcoded beads are randomly distributed into microwells, one cannot directly link phenotypes measured in the microwells to their corresponding expression profiles. In Single Cell Optical Phenotyping and Expression Sequencing or SCOPE-Seq, we use optically barcoded beads  and identify the sequencing barcode associated with each single-cell cDNA library on the sequencer by fluorescence microscopy. Thus, we can obtain images, movies, or other phenotypic data from individual cells by microscopy and directly link this information to genome-wide expression profiles.
In SCOPE-Seq, we load cells into a microwell array as described previously [3, 9], collect live cell imaging data, and co-encapsulate the cells with dual-barcoded beads (Fig. 1d). The microwell array device used here has a total of 30,500 microwells (see Additional files 2 and 3 for design of the microwell array device). We then use a computer-controlled system to perform on-chip cell lysis, mRNA capture, reverse transcription, and exonuclease digestion. This process generates PCR-amplifiable, sequence barcoded cDNA that is covalently attached to each bead. Next, we perform “optical demultiplexing”—12 cycles of reversible fluorescence hybridization that determine the combination of 12 optical barcode oligonucleotides (OBOs) on each bead (Additional file 1: Supplementary Methods, Fig. 1d, e). We then cut the microwell array into multiple pieces and extract the beads from each piece for scRNA-Seq library construction. Beads from each piece are processed and indexed separately, thereby increasing our multiplexing capacity to 212× N where N = 10 is the number of pieces, giving us an effective barcode library size of 40,960. The resulting scRNA-Seq libraries from different pieces are then pooled and sequenced to obtain the RNA-Seq profile of the individual cells and their sequencing barcodes. Finally, we process the images to identify the optical barcode on each bead and identify the corresponding sequencing barcode using the look-up table described above to link microscopy data from a cell in a particular microwell to its RNA-Seq profile (Fig. 1e).
The microwell array contains a small number of multi-species multiplets (Fig. 2a). As expected from the high linking accuracy of SCOPE-Seq, the species purity of RNA-Seq expression profiles are consistent with their fluorescent labels from live cell imaging data (Fig. 2a). Interestingly, we were able to remove most of the multi-species multiplets using monochrome live cell imaging data (Additional file 1: Supplementary Methods, Fig. 2b). We used scRNA-Seq profiles with purity below 70% as a threshold for calling human-mouse multiplets. We then blinded ourselves to the two-color fluorescence information that distinguishes human-mouse multiplets in our imaging data and attempted to identify multiplets by manually examining the monochrome images. Imaging- and sequencing-based doublet identifications were conducted by different researchers in a blinded fashion. This resulted in a sensitivity of 66% and a specificity of 99.1% for multiplet detection, and a concordance of 87.5% between one- and two-color imaging. Some false negatives likely arise from imperfections in our scRNA-Seq “ground truth” and relatively low-resolution imaging. We anticipate that more sophisticated image processing and better microscopy and cell/nucleus stains will lead to further improvements in sensitivity to make SCOPE-Seq highly effective for detecting multiplets.
Additional imaging features obtained from cells may carry information related to gene expression. We measured 19 imaging features from the fluorescence images of cells reflecting various aspects of cell size, shape, and intensity distribution (Additional file 1: Supplementary Methods). Principal component (PC) analysis on these features suggests significant heterogeneity among the cells (Fig. 2c). The first PC is primarily correlated with cell size-related features (Fig. 2d) and the second with shape-related features (Fig. 2e). We then ranked protein coding genes by their correlation with these PCs and performed gene set enrichment analysis for each PC (Fig. 2f). Perhaps not surprisingly, we found that cell division-related genes are most correlated with the first PC, which is related to cell size, likely because actively dividing cells and mitotic figures tend to be larger. We found enrichment of extracellular matrix vacuolar genes for the second PC, which represents a measure of cell shape and may result from cells in the process of adhering to the microwells and the impact of vacuoles on overall cell morphology. We made very similar observations of associations between gene ontology and imaging features using partial least squares regression analysis on the same data (Additional file 1: Supplementary Methods, Additional file 4: Table S1).
Many cellular phenotypes are difficult to infer directly from static measurements of the transcriptome, such as protein expression and localization dynamics, organelle dynamics and distribution, morphological features, uptake of foreign objects, and biomolecular secretion. However, there are myriad live cell imaging and microwell-based assays for characterizing these phenotypes in individual cells. We expect that SCOPE-Seq will serve as a highly scalable, accurate, and economical approach to linking live cell microscopy assays to scRNA-Seq, enabling investigation of the transcriptional underpinnings of the resulting phenotypes. In addition, with the multiplet detection capability, one can potentially achieve approximately fivefold higher throughput by more aggressive cell loading, although this is subject to the caveat that background from molecular cross-talk could increase.
The current implementation of SCOPE-Seq has important limitations, motivating future improvements. The use of separate optical and sequencing barcodes requires an extra step-sequencing of bead-free libraries. Future implementations will use oligonucleotides in which the optical and sequencing barcodes are the same DNA sequence. OBO ligation to a subset of mRNA capture sites on the beads also leads to reduced molecular capture efficiency for the corresponding scRNA-Seq libraries relative to beads without optical barcodes (Additional file 1: Figure S1). Finally, the multiplexing capacity of the beads in our proof-of-concept experiment is relatively modest, which precludes an error-correcting code. While our current linking accuracy is high, both the yield of optically demultiplexed cells and accuracy could be improved with such a code.
The scheme presented here for linking cellular phenotypes to sequencing could also be generalized beyond single-cell analysis. Microscopy experiments on small multi-cellular organisms, organoids, or colonies could be linked with sequencing. Additionally, the optically decodable bead array could be used for spatial transcriptomic analysis as demonstrated previously with printed microarrays . One advantage of optically decodable beads over printed microarrays [15, 16] is that beads can be prepared in a large batch for use in many experiments starting from commercially available “Drop-seq” beads with relatively simple tube-based reactions. We hope that this economical approach will serve as a powerful tool for connecting high-throughput microscopy and sequencing on multiple scales.
We thank the Sulzberger Columbia Genome Center for assistance and resources for high-throughput sequencing.
P.A.S. was supported by NIH/NCI grant R33CA202827, NIH/NIBIB grant K01 EB016071, and a Human Cell Atlas Pilot Project grant from the Chan Zuckerberg Initiative.
Availability of data and materials
The single-cell RNA-Seq data and bead-free optical barcode/sequencing barcode linkage data have been deposited in the Gene Expression Omnibus under accession GSE116011 .
JY and PAS conceived the study and designed the experiments. JY constructed the automated instrumentation for SCOPE-Seq and conducted all of the experiments. JY, JS, and PAS analyzed the data. JY and PAS wrote the paper. All authors read and approved the final manuscript.
Ethics approval and consent to participate
Consent for publication
J.Y. and P.A.S. are listed as inventors on patent applications filed by Columbia University related to the work described here.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
- 16.Vickovic S, Stahl PL, Salmen F, Giatrellis S, Westholm JO, Mollbrink A, Navarro JF, Custodio J, Bienko M, Sutton LA, et al. Massive and parallel expression profiling using microarrayed single-cell sequencing. Nat Commun. 2016;7:13182.Google Scholar
- 17.Yuan J, Sims PA. An optically decodable bead array for linking imaging and sequencing with single-cell resolution. Gene Expression Omnibus. 2018. https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE116011.
Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.