Mapping human tissues with highly multiplexed RNA in situ hybridization

Kalhor, Kian; Chen, Chien-Ju; Lee, Ho Suk; Cai, Matthew; Nafisi, Mahsa; Que, Richard; Palmer, Carter R.; Yuan, Yixu; Zhang, Yida; Li, Xuwen; Song, Jinghui; Knoten, Amanda; Lake, Blue B.; Gaut, Joseph P.; Keene, C. Dirk; Lein, Ed; Kharchenko, Peter V.; Chun, Jerold; Jain, Sanjay; Fan, Jian-Bing; Zhang, Kun

doi:10.1038/s41467-024-46437-y

Mapping human tissues with highly multiplexed RNA in situ hybridization

Article
Open access
Published: 20 March 2024

Volume 15, article number 2511, (2024)
Cite this article

Download PDF

You have full access to this open access article

From

View current issue

Mapping human tissues with highly multiplexed RNA in situ hybridization

Download PDF

7552 Accesses
3 Citations
13 Altmetric
Explore all metrics

Abstract

In situ transcriptomic techniques promise a holistic view of tissue organization and cell-cell interactions. There has been a surge of multiplexed RNA in situ mapping techniques but their application to human tissues has been limited due to their large size, general lower tissue quality and high autofluorescence. Here we report DART-FISH, a padlock probe-based technology capable of profiling hundreds to thousands of genes in centimeter-sized human tissue sections. We introduce an omni-cell type cytoplasmic stain that substantially improves the segmentation of cell bodies. Our enzyme-free isothermal decoding procedure allows us to image 121 genes in large sections from the human neocortex in <10 h. We successfully recapitulated the cytoarchitecture of 20 neuronal and non-neuronal subclasses. We further performed in situ mapping of 300 genes on a diseased human kidney, profiled >20 healthy and pathological cell states, and identified diseased niches enriched in transcriptionally altered epithelial cells and myofibroblasts.

Direct RNA targeted in situ sequencing for transcriptomic profiling in tissue

Article Open access 13 May 2022

Highly sensitive spatial transcriptomics using FISHnCHIPs of multiple co-expressed genes

Article Open access 15 March 2024

High-Throughput In Situ Hybridization: Systematical Production of Gene Expression Data and Beyond

Introduction

Analyzing single-cell expression of genes in their spatial context plays a critical role in deciphering the complex cellular organization in multicellular organisms^1,2. Gene expression in its spatial context is especially important in fields such as embryo development³, neuroscience⁴, and in histopathology⁵. The emergence of single-molecule fluorescence in situ hybridization (smFISH, Supplementary Table 1 for all acronyms in the manuscript) methods allowed us to simultaneously measure several RNA species in single cells^6,7 by imaging fluorophore-tagged DNA oligos, or probes, that tile the RNA molecules. Because of its high sensitivity, smFISH has become the gold standard assay to measure RNA expression in situ and has been used to show the importance of RNA localization in cell migration, neuron connectivity, and local protein synthesis^8,9. However, since smFISH is limited by spectral overlap of the fluorophores, it has limited multiplexing capacity¹⁰, and does not scale well for tasks such as resolving cellular heterogeneity in complex tissues, which require profiling hundreds of RNA species.

Recently, in situ hybridization techniques with combinatorial encoding have emerged in which the identity of hundreds or thousands of RNA species can be decoded with tens of FISH cycles^11,12. Although these methods have increased the multiplexity by 2-3 orders of magnitude compared to smFISH, they typically require longer target RNA transcripts (>1.5kb), restricting the analysis of important molecules such as neuropeptides and interferons^11,13. Furthermore, because of the low signal-to-noise ratio (SNR) from detected transcripts, these methods need high magnification objectives with high numerical aperture (NA), making it difficult and time-consuming to image large regions of interest (ROIs). The low SNR also makes it challenging to apply these methods to human tissues which may have a high autofluorescence background caused by lipofuscin granules^14,15, proteins such as collagen and elastin¹⁶, or mitochondria^17,18. Methods that ligate padlock probes annealing to mRNA derivatives, followed by rolling circle amplification (RCA) have been employed to boost SNR from individual transcripts. However, these methods are associated with high probe set expenses and complex decoding procedures. They further lack an efficient approach to stain the cell bodies for segmentation^19,20,21 (Supplementary Table 2).

With the advent of sequencing-based spatial transcriptomics methods^{22,23,24,25,26,27}, transcriptome-wide profiling of RNA molecules in tissue sections was made possible by transferring the RNA molecules to a slide coated with spatially-barcoded oligos. In this way, the spatial information of each RNA molecule can be registered through next-generation sequencing. Nevertheless, when compared to in situ methods, sequencing-based spatial transcriptomic tools in general have lower capture efficiency, complex slide preparation procedures, higher sequencing costs, and limited spatial resolution due to feature size and lateral diffusion²⁸.

Here, we developed Decoding Amplified taRgeted Transcripts with Fluorescence in situ Hybridization (DART-FISH) to overcome some of these limitations. The key technical features include a robust barcoding scheme, a set of molecular protocols for padlock probe production in large pools, in situ padlock capture and amplification, a cytoplasmic stain called RiboSoma, isothermal and enzyme-free decoding, and a computational method for decoding features at the pixel level from dense fluorescent images based on sparse deconvolution. We benchmarked DART-FISH by measuring 121 genes in a large section (~30 mm²) of the human primary motor cortex (M1C). We validated its sensitivity and specificity by comparing it to RNAscope, a commercially available smFISH method (Methods). Moreover, we successfully recapitulated the spatial organization of major neuronal and non-neuronal cell types, detected short neuropeptide genes (e.g., SST and NPY), and validated a deep layer neuron marker (TMSB10). Finally, we applied DART-FISH to measure 300 genes in a diseased human kidney section and characterized the spatial distribution of normal and disease-altered cell types and pathological niches. Overall, the DART-FISH workflow provides solutions to several foundational problems in the field while remains easy to implement and requires no specialized or custom-made equipment.

Results

DART-FISH framework

DART-FISH involves in situ feature generation by padlock probe capture of targeted transcripts and rolling circle amplification (RCA), followed by a highly robust decoding process of sequential isothermal hybridization. (Fig. 1a, Methods). Specifically, RNA molecules in fresh-frozen tissue sections are fixed with paraformaldehyde (PFA), permeabilized, and then reverse-transcribed with a mixture of random and poly-deoxythymidine (dT) primers. To assess the RNA content in human tissues as well as the retention of the cDNA molecules in situ, we added a 5’ handle to the reverse-transcription primers to enable the collective visualization of all cDNA molecules with fluorescent oligos (Fig. 1b). We call this labeling method RiboSoma because the resulting signal labels the cell bodies. During protocol optimization, we noticed that crosslinking the cDNA molecules immediately after reverse-transcription to a polyacrylamide (PA) gel enhances the RiboSoma signal (Supplementary Fig. 1a) suggesting better retention of cDNA in situ throughout the DART-FISH protocol. This cDNA embedding strategy also led to 1.5-fold median increase of the feature count per gene (Supplementary Fig. 1b, c), compared to when the polyacrylamide gel is cast after RCA. Thus, RiboSoma serves as a marker for cDNA content of the tissue and provides a quality control for in situ reactions.

Following gel embedding and RNA digestion, cDNA molecules are hybridized with a library of padlock probes and circularized at a high temperature to ensure specificity^29,30. On their backbone, padlock probes carry a universal sequence used for amplification and gene-specific barcodes. The circularized padlock probes are then rolling-circle-amplified, generating RCA colonies in situ (rolonies) with hundreds of copies of barcode sequences concatenated in the form of a DNA nanoball. The rolonies are then covalently attached to the polyacrylamide gel to secure their positions during decoding. The result of the experiment is then assessed in the “anchor round” imaging, where fluorescent probes are hybridized to the universal sequences and the 5’ handles on cDNA molecules to visualize the spatial distribution of all rolonies and cells (i.e., RiboSoma, Fig. 1b).

To achieve high multiplexity within only a few rounds of imaging, combinatorial labeling was used to generate gene-specific barcodes³¹. In this barcoding scheme, $n$ rounds of imaging are performed where every barcode is “on” in exactly $k$ rounds and “off” in other rounds (Fig. 1c). When “on”, the barcode signals in one of the three fluorescent channels; it emits no fluorescence when “off”. With $n$ rounds of imaging, a total of $\left(\genfrac{}{}{0ex}{}{n}{k}\right){3}^{k}$ unique barcodes can be generated, allowing us to measure hundreds of RNA species with limited rounds of decoding ($n=6$ and $k=3$ in Fig. 1 with 540 valid barcodes). This can be extended to 7 rounds of decoding for up to 945 genes ($k=3$), 8 rounds of decoding for 5670 genes ($k=4$), and so on. This barcoding scheme has a proven robustness evident by its wide adoption by Illumina’s gene expression, SNP genotyping and DNA methylation arrays^31,32,33. Hence, DART-FISH uses a barcoding strategy that can theoretically generate enough diversity to encode hundreds to thousands of genes within less than 10 rounds of imaging.

To implement this barcoding system such that the decoding process is fast and robust, gene-specific barcodes are created by the concatenation of $k$ 20-nucleotide-long decoder sequences placed on the backbone of padlock probes³⁴. The decoder sequences are derived from Illumina BeadArray technology and have limited cross-hybridization³¹ (Supplementary Data 1). In each round of imaging, three unique fluorescent decoding probes are hybridized and imaged. Rolonies will be "on" only if a decoding probe that corresponds to one of their decoder sequences is present. After imaging, the decoding probes are stripped and washed away at room temperature to prepare for the next round (Fig. 1b and e). During this procedure, the rolonies are stable with minimal movement, degradation and background buildup (Supplementary Fig. 1d). Note that this process enables rapid and reliable decoding since it depends solely on the hybridization of short oligonucleotides at room temperature, eliminating the need for sophisticated temperature control setups and avoiding the complications of performing enzymatic reactions on a microscope. Thus, DART-FISH uses an enzyme-free and isothermal method to decode the rolonies which allows short between-cycle preparation times.

It has been shown that increasing the number of padlock probes per gene leads to a higher detection sensitivity in situ³⁵. For such applications, it is common to pool individually synthesized padlock probes^35,36,37. This strategy, while manageable for small-scale studies, would be prohibitively expensive when probing hundreds of genes is desired. To overcome this limitation, we adapted an enzymatic protocol to produce thousands of padlock probes in-house starting from an oligo pool synthesized on microarrays³⁸ (Methods, Supplementary Fig. 3c). We were able to target 121 genes each with up to 50 padlock probes for less than 25% the cost of the direct synthesis option. Note in our strategy the cost per probe decreases further by including more probes in the pool, whereas for direct synthesis the cost per probe remains constant. Consequently, individually synthesizing 20,000 probes to target 400 genes is almost 10 times as expensive as array synthesis. To fully utilize this feature, multiple probe sets that, for instance, target different organs or organisms can be pooled together and amplified separately for a fraction of the upfront cost of the direct synthesis approach. This strategy opens up the possibility of using different probe sets in any regular research lab.

Targeting more genes with high sensitivity can result in optical overcrowding, which may hinder rolony decoding. Physical expansion of the tissues^37,39,40 has been used as an effective strategy to distance rolonies and reduce overcrowding but it leads to larger imaging areas, longer imaging time and thus lower throughput³⁷. A computational solution to the overcrowding problem can vastly increase the throughput. We reasoned that given the size of the rolonies (<1µm)⁴¹ and our pixel size (~0.3µm with 20x objective), each pixel will at most overlap a few rolonies. On the other hand, given that a small fraction of all possible barcodes are used, it may be possible to deconvolve mixtures of barcodes from fluorescent intensity values at the pixel level. To this end we developed the SparseDeconvolution (SpD) decoding algorithm: we formalized this deconvolution as a regularized linear regression problem, where barcodes can combine linearly to form the observed pixel intensities and optimized the combinations under a condition that promotes sparsity (Methods, Fig. 1d). We solve this problem for every pixel and obtain initial weight maps for every single barcode (Fig. 1f). This is followed by filtering and aggregating the neighboring pixels to form spots (Supplementary Fig. 2a,b). To control the quality of the deconvolution procedure, we add empty barcodes that are not used in the probe set to the codebook. While the fraction of empty barcodes is 5-8% of used barcodes, the fraction of spots decoded as empty is below 0.25% (empty rate, Supplementary Fig. 2c–e). We compared SpD with existing methods, including a naive algorithm that directly matches pixels to individual barcodes⁴² and more sophisticated deconvolution algorithms^42,43,44. The results on synthetic data show a complementary performance of SpD to the other deconvolution algorithms while a superior performance to the direct matching algorithm (Supplementary Fig. 2f). The simulations also show that specificity, which is unobserved on real data, is related to empty rate and one can keep specificity high by keeping the empty rate low. With this computational framework, we could mitigate optical overcrowding and increase our throughput by imaging with a 20x objective lens.

Benchmarking and validation of DART-FISH

To assess the performance of DART-FISH for profiling more than one hundred RNA species in large human tissue sections with fast image acquisition, we applied it to a 10μm-thick, 6.9-by-4.3-mm² fresh-frozen post-mortem human M1C brain section⁴⁵. The anatomy, function, and gene expression of M1C have been widely investigated at the single-cell level^{46,47,48,49,50}, giving us a well-defined standard to compare across different studies. Note that archived human brain samples represent one of the most challenging sample types for spatial RNA mapping, due to the presence of high autofluorescence⁴⁵ and in general, lower RNA quality⁵¹.

We designed 5097 padlock probes to target a selected panel of 121 genes containing known marker genes to resolve the spatial organization of excitatory and inhibitory neurons, as well as non-neuronal cells (Supplementary Data 2). The corresponding codebook followed a 3-on-3-off barcoding scheme. Imaging 6 rounds of decoding, the anchor round and the nuclear stain of this ~30 mm² section of human M1C took about 10 h. After image preprocessing and spot decoding by SpD, we obtained 2,008,260 transcripts (0.2% empty calls with 8 empty barcodes). The expression level of these 121 genes was highly consistent between two replicates (correlation coefficient r² = 0.988, Fig. 2b), demonstrating a high reproducibility of DART-FISH.

**Fig. 2: Benchmarking DART-FISH on the human M1C.**

We segmented the cells using RiboSoma, which revealed cell body morphology better than nuclear staining (Supplementary Fig. 4a, b), and assigned the transcripts to the closest cell if the distance to the cell boundary was less than 3μm (Methods, Supplementary Fig. 4c). Other transcripts were discarded from downstream analyses. Among the target genes, we noticed a higher fraction of MBP transcripts were found to be outside the cell bodies (93% outside, Supplementary Fig. 4d) while co-localizing with RiboSoma in the extrasomatic space of the cortex (Supplementary Fig. 4e). This observation reflects the local translation of MBP transcripts at the axon-glia contact sites⁵². Overall, we detected 26,646 cells with 802,361 transcripts that were assigned to a segmented cell with an average of 30 transcripts and 11 unique genes per cell (Fig. 2c).

To assess spatial specificity of transcript localization, we first inspected the marker genes SLC17A7 and SATB2 in excitatory neurons and GAD1 and GAD2 in inhibitory neurons. As expected, the SLC17A7 and SATB2 transcripts were mainly aggregated in the soma of excitatory neurons with mutual exclusivity to GAD1 and GAD2 transcripts in inhibitory neurons (Fig. 2d, e). We then compared the expression of 10 marker genes with the results of RNAscope generated on a parallel M1C tissue section (Methods). As shown in Fig. 2f and Supplementary Fig. 4f, the spatial distribution of these marker genes in the same region demonstrates high concordance between RNAscope and DART-FISH. Specifically, the pan-excitatory neuron marker, SLC17A7, showed pronounced enrichment in the L2-L6 cortical layers. CUX2, RORB, and FEZF2 were enriched in supragranular, granular, and infragranular layers of the neocortex, respectively, which is consistent with previous studies^{53,54,55,56,57}. The observed localization of CBLN2 in neocortical layers 2/3 and 5/6 neocortex also agrees with a previous report⁵⁸. Collectively, these results indicate that DART-FISH can specifically map the spatial localization of these marker genes in human M1C.

To estimate the sensitivity of DART-FISH, we selected a similar region of interest (ROI) with equal area between RNAscope and DART-FISH samples and compared the number of transcripts of each gene. We found that the estimated sensitivity ranged from 3.9% to 67.7%, depending on the transcript (Fig. 2g). We correlated our data to the publicly available MERFISH⁵⁹ and EEL FISH⁶⁰ datasets from the human brain (Pearson’s r = 0.755 and 0.750, respectively, Fig. 2h and i), which we consider a high concordance given the differential probing efficiencies between different technologies, and the fact that samples from different regions were used for each technology. In summary, DART-FISH is a reproducible spatial transcriptomic method with the sensitivity and specificity to detect hundreds of RNA species in their spatial context, with the potential for providing biologically meaningful insights to the human brain despite the high natural background autofluorescence.

Organization of cell types in the human primary motor cortex

To assess whether DART-FISH is able to resolve the organization of various cell types of human M1C, we set out to perform cell annotation by performing clustering on DART-FISH cells and matching them to the highest correlated subclass from a recent single-nucleus RNA sequencing (snRNA-seq) reference of M1C⁶¹ (Methods, Fig. 3a and b, Supplementary Fig. 5a, b). We resolved 20 subclasses from the major excitatory, inhibitory, and non-neuronal cell classes, which constituted 24.3%, 10.6%, and 65.1%, respectively, in the M1C (Fig. 3c–g). For excitatory neuronal subclasses, we successfully detected their laminar distribution, with L2/3 IT neurons localized at the superficial layer of the cortex and L6b/CT neurons deep in the cortex and close to the white matter (Fig. 3b–d), in line with the evolutionarily conserved organization of excitatory neurons in the mammalian M1C⁴⁶. Of note, L6 IT Car3 cells seem to be positioned more superficially than the L6 IT population, consistent with recent observations in human visual cortex and middle temporal gyrus^61,62 (Fig. 3d). In contrast, inhibitory neuronal subtypes generally showed wider spatial gradients along the cortical axis; for instance the Vip population was enriched in layer 2-4 as suggested by previous studies in the mouse^49,63 (Fig. 3b and e). Moreover, we observed some cells belonging to the excitatory neurons and inhibitory neurons localized in the white matter region, which likely are the adult remnants of early generated subplate neurons discovered in previous studies^64,65. For non-neuronal cells, we observed oligodendrocytes appearing at layer 4 and peaking in the white matter⁶⁶ in spite of the uniform distribution of the oligodendrocyte progenitors across the tissue section (OPC, Fig. 3f)⁶⁷.

**Fig. 3: DART-FISH mapping of cell types in the human M1C.**

We further assessed whether we could detect short genes (<1.5kb) with DART-FISH. smFISH-based methods rely on tiling sufficiently long RNA molecules with probes to generate detectable fluorescent signals. In contrast, DART-FISH requires only one padlock probe to bind successfully to the target to detect it. To boost our chances for detecting shorter genes, we allowed overlapping targets in our design strategy to obtain more probes for short RNA species⁶⁸ (Supplementary Fig. 3b, NPY as an example). We compiled a list of 33 differentially expressed genes shorter than 1.5kb comprising well-studied genes as well as less well-known computationally derived marker genes in the brain (Supplementary Data 2). For example, by targeting SST (607 nt) and NPY (893 nt), we could uncover a rare subclass of inhibitory neurons, Sst Chodl (0.1% abundance, Fig. 3g), specified by the expression of these short neuropeptides (Fig. 3b and h). Sst Chodl cells were found to be enriched in deeper layers, consistent with previous reports⁶⁹. In addition to these short neuropeptides, DART-FISH also detected other short RNA species, including PCP4 (534nt) and TMSB10 (461nt) with pronounced localization (Fig. 3h). PCP4 is reported to be a layer 5-6 marker in the mouse cerebral cortex⁷⁰ while TMSB10 seems to be a deep layer marker gene. To quantify how well the targeted genes performed, we correlated their average expression at the subclass level between DART-FISH and snRNA-seq (Methods, Supplementary Fig. 5c). We found 25 of 33 (75%) of the genes shorter than 1.5kb and 81 of 88 (92%) of the longer genes had higher correlations than 0.5 (Supplementary Data 2). This is similar to a MERFISH data set targeting another region of the human cortex with 250 genes (88% with >0.5 Pearson’s correlation, Supplementary Fig. 5c). Taken together, we showed that DART-FISH can accurately map the distribution of all the main neuronal and non-neuronal subclasses in the human brain and can uncover rare cell populations by detecting short genes.

Mapping cellular neighborhoods in histopathologically abnormal human kidney

To demonstrate the applicability of DART-FISH to a clinically relevant tissue context, we next applied it to the human kidney. The kidney is composed of repetitive functional tissue units, called nephrons, with various closely organized cell types, including endothelial, stromal, immune and epithelial cells that regulate the filtration of the blood as well as other homeostatic functions such as maintaining electrolyte and fluid balance⁷¹ (Fig. 4a, Supplementary Fig. 6a). The homeostatic interactions between these cell types are perturbed in kidney disease and can lead to fibrosis and decline in kidney function⁷². We recently reported an atlas of cell types in healthy and diseased patients, and identified multiple mal-adaptive cell states that are associated with kidney disease^73,74. In the same study, we used sequencing-based spatial transcriptomics methods with 10um and 55um resolution to map cellular neighborhoods in healthy and diseased samples, respectively, which lacked the resolution needed to delineate the exact cellular composition, the boundaries and the positioning of cells within the neighborhoods. We reasoned that the high spatial resolution provided by DART-FISH is complementary to the sequencing-based methods and can help define cellular niches more accurately.

**Fig. 4: DART-FISH mapping of a diseased human kidney.**

Guided by the published single-nucleus reference atlas, we designed a panel of 300 genes with 6299 padlock probes following the 3-on-4-off barcoding scheme, focusing on the major healthy cell types of the kidney, immune cells and cell states implicated in kidney disease (Supplementary Data 3). We then performed DART-FISH on tissue sections from the kidney cortex of a patient with various clinical features, including glomerulosclerosis, interstitial fibrosis, tubular atrophy, and chronic inflammation identified by a pathologist. Our gene panel correctly mapped the spatial organization of cells in different regions of the nephron, including glomeruli and cortical tubules (Fig. 4b). For instance, the transcripts NPHS2 and EMCN, which mark podocytes and glomerular capillary endothelial cells, respectively, are mainly found in the glomerular tuft of the round appearing renal corpuscles. We then compared our data with a Slide-seq dataset from a healthy individual. At the bulk level, the DART-FISH data is correlated with slide-seq (Pearson’s r = 0.609) with cells in DART-FISH demonstrating more copies of the targeted genes than Slide-seq beads⁷³ (median fold-change per gene=2.2 for the top 150 genes in slide-seq, Supplementary Fig. 6b). The comparison also showed upregulation of markers of inflammation in the DART-FISH dataset, consistent with the underlying pathology in our sample (Supplementary Fig. 6b). Hence, the spatial distribution of known kidney marker genes and their overall counts are consistent with kidney biology and prior data.

To find the molecular identity of the cells in the human kidney, cell segmentation was performed using both RiboSoma and nuclear stains. We found RiboSoma to be superior to the nuclear stain in revealing tubular morphology and distinguishing the interstitial cells (Supplementary Fig. 6c). Subsequently, with 30,000 segmented cells with an average of 30 detected transcripts and 20 unique genes per cell (Supplementary Fig. 6d, e, empty rate <0.25% with 15 empty barcodes), the kidney DART-FISH data was annotated to cortical and altered cell types as identified in the single-cell kidney atlas⁷³ (Fig. 4c, Supplementary Fig. 6f, Supplementary Fig. 7, Methods). These annotated cell types were of the expected relative proportions and showed strong and specific differential expression of corresponding marker genes (Fig. 4d, Supplementary Fig. 6f, Supplementary Fig. 8a). Thus, DART-FISH could confidently resolve >20 cell types and states in the human kidney.

Next, we investigated the neighborhoods formed by the healthy cell types. The complex archetypical structure of the renal corpuscle was successfully recapitulated, with podocytes (POD), glomerular capillary endothelial cells (EC-GC) and glomerular mesangial cells (MC) confined within the glomerular tuft, surrounded by parietal epithelial cells (PEC) or the outer layer of the Bowman’s capsule and juxtaposed with the renin-secreting cells (REN) in the wall of the arterioles (Fig. 4e, Supplementary Fig. 6a, Supplementary Fig. 7). We also detected medullary rays with the characteristic bundling of the tubules of cortical thick ascending limb (C-TAL), the S3 segment of proximal tubules (PT-S3) and collecting ducts (Fig. 4f). Further, collecting ducts comprising intermixed principal cells (PC) and alpha- and beta-intercalated cells (C-IC-A and IC-B) could be clearly resolved. These results show that our cell type annotations closely match the known structures within the human kidney.

To compare the tissue morphology obtained from DART-FISH with a clinically relevant histological stain, we performed Hematoxylin and Eosin (H&E) staining on a parallel section from the same tissue block. In an area with putative inflammation on the H&E slide, we observed an abundance of immune cells of both lymphoid and myeloid origin on the DART-FISH section (Fig. 4g). These immune cells surround a sclerotic glomerulus, which in contrast to a more normal glomerulus, is depleted from cells and is instead fibrotic (shown by an arrow in Fig. 4g). In DART-FISH, this phenomenon can be clearly detected by contrasting the low cell numbers revealed by RiboSoma and the physically occupied space through the accompanying transmitted light image (Supplementary Fig. 6h). Thus, by paired H&E staining we showed that DART-FISH can capture different pathological phenomena with a molecular resolution beyond that of the traditional histology.

In addition to healthy cell types, DART-FISH was also able to reveal distinct pathological cell states. This includes a population of myofibroblasts (MYOF) expressing matrisome genes, including COL1A1, TNC, DCN and POSTN, suggestive of their ECM-producing role in kidney fibrosis (Supplementary Fig. 8b)^73,75. Furthermore, we detected altered PT (aPT) and TAL (aTAL1) populations, both of which expressed PROM1, in line with recent findings^73,76. To determine whether these pathological cell states form distinctive niches, computational methods were applied to find pairs of cell types that show enrichment in their spatial colocalization⁷⁷. Interestingly, in neighborhoods around MYOFs, there was an increased presence of aTAL1 cells compared to C-TAL and aPT (Fig. 4h, Supplementary Fig. 6i). This observation indicates a possible interplay between the maladaptive repair of TALs and fibrosis. We speculate that there are a variety of cellular neighborhoods associated with adaptive repair and fibrosis that could be defined through further studies. All in all, these results demonstrate how DART-FISH as a single-cell resolution spatial transcriptomic technique can be used to interrogate neighborhoods of cell types and states defined by single-cell RNA sequencing studies in diseased human tissues.

Discussion

In this study we introduced DART-FISH, a high throughput RNA in situ mapping technique, and demonstrated its application to human tissues, even with high native autofluorescence background. In the human brain, we recovered the spatial distribution of 20 cell types from the 3 main cell classes. This included the laminar organization of the excitatory neurons in the cortex and the broader layer-specificity of inhibitory neurons, and the ubiquity of the non-neuronal cells across the brain cortex. We also profiled a sample from a histopathologically abnormal human kidney and demonstrated identification of rare cells such as REN-producing cells, the intricate functional niches, and quantified the interactions between pathological cell states.

DART-FISH is a cost-effective technology capable of fast decoding on relatively large tissue sections. Using our protocol for padlock probe production from oligo pools, the cost of synthesis per gene scales sublinearly with the number of genes. Hence, oligo pricing will not hinder scaling the probe set to tens of thousands of transcripts. Moreover, DART-FISH does not need any specialized equipment for neither rolony generation nor decoding. The decoding process is relatively fast because it depends on the diffusion and hybridization of very short oligos and a strong signal can be obtained by 5-10 min of incubation with the fluorescent decoding probes at room temperature. Likewise, stripping and washing away the unbound decoding probes is straightforward and fast at room temperature. This process can be performed on a stationary glass-bottom petri dish or a coverslip mounted on a microscope and does not require reaction chambers or flowcells with sophisticated temperature control. The large size and the bright signal of the rolonies permit the use of 20x objective lenses for decoding, which makes it possible to image centimeter-sized samples in a manageable time with an ordinary confocal microscope.

What distinguishes DART-FISH from other techniques of a similar class is how the cDNA molecules are treated^35,36. We demonstrated here that embedding the cDNA molecules in a polyacrylamide gel significantly enhances the retention of the cDNA throughout the rolony generation procedure and increases the sensitivity, a point not taken into account in previously published methods. Additionally, we introduced RiboSoma, a cDNA labeling technique, as a cell morphology marker which reveals more information about cell bodies than nuclear stains. We anticipate that this tool can be highly useful for cell body segmentation, particularly in thicker samples.

RCA-based in situ detection systems are prone to optical and physical overcrowding as more and more genes are detected with higher efficiency. To mitigate this issue, we developed a computational method (SpD) that used the redundancy in the barcode space to deconvolve mixed barcodes from single pixels. This strategy improved our decoding efficiency compared to naive decoding methods⁴². The utility of this method increases with higher redundancy in the barcode space by creating longer barcodes with more “on” cycles, and careful assignment of barcodes to genes such that genes that tend to co-express in the same cell types have unique barcode combinations. In addition, more sophisticated deconvolution methods that share information between neighboring pixels can potentially improve decoding efficiency^43,44,78. As the field is moving towards detecting more genes in parallel, pixel-based deconvolution methods like SpD could become increasingly relevant.

Although we have only tested DART-FISH on fresh-frozen tissue sections, we think it should be compatible with other tissue preservation methods as long as the RNA integrity is well-preserved. We have found tissue quality to be a critical source of variability across experiments and hence should be controlled by meticulous preparation and handling of tissue blocks. Future studies that systematically evaluate various preservation methods for post-mortem human tissues will be key to advancing the field. Note that different fixation methods, as well as different tissue types, may require optimization of the tissue processing steps (e.g., permeabilization) before reverse-transcription. RiboSoma can be a helpful guide through this optimization, as the overall intensity of the signal and the morphological patterns can be used to compare different treatment conditions.

Due to its streamlined nature and simplicity, the basic DART-FISH chassis described here can be effectively extended in multiple ways. The workflow can be combined with antibody staining, for instance, to target extracellular factors such as matrix proteins and cell-cell communication molecules to enhance the definition of cell-cell interactions in pathological niches⁷⁹. The thickness of tissue sections could be increased for higher resolution mapping of neighborhoods and cell connectivities; while increasing section thickness to 20-30μm should be readily achievable, other strategies in sample mounting and handling may be necessary to increase the diffusion into even thicker sections (>100μm)⁸⁰. Padlock probes could also be designed to anneal directly to mRNA followed by circularization using an RNA-mediated DNA ligase, which would skip the cDNA synthesis and can improve the detection sensitivity.

Methods

Human tissue samples

Human brain

Human Brain tissue was obtained from the University of Washington Biorepository and Integrated Neuropathology (BRaIN) Laboratory under UW School of Medicine and HIPAA compliance. Informed consent was obtained for the use of data and samples. One donor brain with postmortem interval ≤12 h and RIN score ≥7 was selected for DART-FISH assay. Regions were identified and isolated utilizing architectural landmarks, aided by the Allen Brain Human Brain Atlas⁸¹. Multiple parallel 10-μm-thick cryosections were taken from the tissue block and mounted onto vectabond-coated 24 x 60 mm No.1.5 coverslips (Azer Scientific, 1152460). Brain cryosections were stored at −80 °C until use.

Human kidney

Kidney tissue was obtained from the Kidney Translational Research Center (KTRC) biorepository under a protocol approved by the Washington University Institutional Review Board (IRB 201102312). Informed consent was obtained for the use of data and samples. The kidney tissue was dissected from the whole kidney and freshly frozen in Optimal Cutting Temperature embedding media in cryomolds on a liquid nitrogen chilled metal block and stored at −80 ^oC until ready for experimental use⁷⁴. 10-μm-thick sections were cut from the frozen blocks for DART-FISH and flanking sections were used for histopathological assessment by a renal pathologist.

Reagents and enzymes

All reagents were listed as in Supplementary Data 1.

Gene selection

A list of genes was selected based on differential expression analysis of snRNA-seq data from human primary motor cortex^46,48,50 and a few curated marker genes were added manually to target 121 genes in the human M1C. Human kidney gene selection was performed by gpsFISH^82,83 to distinguish subclass level 2 annotation in our kidney reference atlas⁷³. snRNA-seq data from the kidney reference atlas with cell type annotation at subclass level 2 was used as input of gpsFISH. Curated marker genes from prior knowledge were also included as input. The size of the gene panel was set to 300. We ran the optimization for 100 iterations to ensure convergence although the optimization converged around iteration 50.

Probe design and production

DART-FISH probe design

For short genes (length < 1.5kb), we defined the constitutive exon as the union of all isoforms in GencodeV41. For other genes, the constitutive exons were defined as regions in RefSeq where at least (33% for the brain, 50% for the kidney) of isoforms overlap. We used a modified version of ppDesigner³⁸ (https://github.com/Kiiaan/sppDesigner) to find padlock target sequences along the constitutive exons. ppDesigner was run on two settings: 1) no overlap between probes allowed, 2) overlap of up to 20nt allowed. Individual arms were constrained between 17nt and 22nt long with the total target sequences no longer than 40nt. The resulting target sequences were aligned to GRCh38/hg38 with BWA-MEM⁸⁴ and sequences with MAPQ < 40 or secondary alignment were removed. We further removed probes that have GATC (DpnII recognition site). For the brain, a maximum of 50 probes per gene were selected prioritizing the non-overlapping set. For the kidney, a maximum of 40 probes per gene were selected with no overlap. Finally, the target sequences were concatenated with amplification primer sequences, universal sequence, and gene-specific decoder sequences to produce final padlock probe sequences (Supplementary Fig. 3c) and were ordered as an oligo pool from Twist Bioscience (South San Francisco, CA). Amplification primer pairs pAP1V41U and AP2V4 were used for the kidney probe set, while the brain probe set was amplified with AP1V7U and AP2V7 primer pair (Supplementary Data 1).

To select a set of barcodes, we computationally created all possible barcodes in the compact format: an $n$ digit barcode with “1”, “2” and “3” representing each of the three fluorescent channels and “0” indicating off cycles. For example, the barcode for RORB in Fig. 1c is “132000” in the 6-digit format. This amounted to 480 and 840 multi-color barcodes for brain and kidney, respectively. We then used a brute force algorithm to find the largest subset of barcodes, $Q$, in which every pair had a Hamming distance > 2. Followed by this, we created a graph, $G$, in which every possible barcode is a node, and pairs of nodes are connected with edges if their Hamming distance is 1. We then found a maximal independent set (MIS, networkx v2.6.2) that included the nodes in $Q$. This method ensures that every pair of barcodes in the MIS have Hamming distance >1. Because the algorithm for finding MIS is random, we ran it 20,000 times and selected the largest MIS across the runs. For the brain, the MIS consisted of 159 barcodes, 121 of which were randomly assigned to the genes. For the kidney, the MIS had 269 barcodes. We randomly added 31 additional barcodes and counted the number of edges of the induced subgraph of $G$ with the selected nodes. We repeated this selection 20,000 times and proceeded with the run with the lowest edge count. 300 genes were randomly assigned to these barcodes.

Large-scale padlock probe production

A step-by-step protocol can be found on protocols.io (dx.doi.org/10.17504/protocols.io.n92ldm3pxl5b/v1) and is illustrated in Supplementary Fig. 3c. Briefly, oligo pools were PCR amplified on a 96-well plate (10pM per reaction) using KAPA SYBR fast and 0.4μM of each amplification primer (pAP1V41U and AP2V4 for kidney, AP1V7U and AP2V7 for brain, Supplementary Data 1, Supplementary Fig. 3c) until plateau. The PCR products were pooled and concentrated with ethanol precipitation and further purified using QIAquick PCR purification kit (Qiagen 28106).

For the brain probe set, the purified amplicons were divided into parallel reactions (about 5ug each) and were digested with Lambda Exonuclease (0.5U/ul) in 1x buffer (NEB M0262L) at 37 °C for 2 h and purified using Zymo ssDNA/RNA clean & concentrator kit following manufacturer’s instructions (Zymo D7011). Next, the single-stranded probes were further digested with 5 units of USER enzyme (NEB M5505L) in 1x DpnII buffer at 37 °C for 3 h. Subsequently, for each reaction we added DpnII guide oligo (Supplementary Data 1) to final concentration of 5uM in 1x DpnII buffer, heated the mix to 94 °C for 2 min, cooled to 37 °C and added 50 units of DpnII in 1x DpnII buffer and incubated for 5 h. Finally, probes were size-selected using a TBE-Urea gel.

For the kidney probe set, DpnII digestion was performed after PCR. In detail, the purified amplicons were divided into parallel reactions (about 5ug each) and were digested with DpnII (1U/ul) in 1x NEBuffer DpnII (NEB R0543L) at 37 °C for 3 h and purified with QIAquick PCR purification kit. The purified products were digested with Lambda Exonuclease (0.5U/ul) in 1x buffer (NEB M0262L) for 2 h and purified with Zymo ssDNA/RNA clean & concentrator kit. Finally, the library was digested with USER (0.0625U/ul, M5505L) in 1x NEBuffer DpnII in parallel reactions (about 2.5ug each) for 6 h at 37 °C followed by 3 h at room temperature and purified with Zymo ssDNA/RNA clean & concentrator kit.

DART-FISH

The overall workflow, including reverse transcription, cDNA crosslinking, padlock probe capture, RCA, rolony crosslinking and image acquisition, is illustrated in Fig. 1. A step-by-step protocol can be found at protocols.io (dx.doi.org/10.17504/protocols.io.e6nvwjxnzlmk/v1).

Reverse transcription and cDNA crosslinking

Tissue sections were fixed in 4% PFA in 1x PBS at 4 °C for 1 h, followed by two 3-minute washes with PBST (1x PBS and 0.1% Tween-20). Then, a series of 50%, 70%, 100%, and 100% ethanol were used to dehydrate the tissue sections at room temperature for 5 min each. Next, tissues were air dried for 5 min and in the meantime silicone isolators (Grace Bio-Labs, 664304) were attached around the tissue sections. Then, the tissue sections were permeabilized with 0.25% Triton X-100 in PBSR (1x PBS, 0.05U/μl Superase In, 0.2U/μl Enzymatics RNase Inhibitor) at room temperature for 10 min, followed by two chilled PBSTR (1x PBS, 0.1% Tween-20, 0.05U/μl Superase In, 0.2U/μl Enzymatics RNase Inhibitor) washes and a water wash. Next, the sections were digested with 0.01% pepsin in 0.1 N HCl (pre-warmed 37 °C for 5 min) at 37 °C for 90 s and washed with chilled PBSTR twice. Afterwards, acrydite-modified dT and N9 primers (Acr_dc7-AF488_dT20 and Acr_dc10-Cy5_N9, Supplementary Data 1) were mixed to a final concentration of 2.5 μM with the reverse-transcription mix (10U/μL SuperScript IV (SSIV) reverse transcriptase, 1x SSIV buffer, 250 μM dNTP, 40 μM aminoallyl-dUTP, 5 mM DTT, 0.05U/ul Superase In and 1U/μL Enzymatics RNase inhibitor). The sections with the mix were incubated at 4 °C for 10 min and then transferred to a humidified 37 °C oven for overnight incubation. After reverse transcription, tissue sections were washed with chilled PBSTR twice and incubated in 0.2 mg/mL Acryloyl-X, SE in 1x PBS at room temperature for 30 min. Then, the tissue sections were washed once with PBSTR, followed by incubation with 4% acrylamide solution (4% acrylamide/bis 37:1, 0.05U/μL Superase-In, and 0.2U/μL RNase inhibitor) at room temperature for 30 min. Subsequently, the acrylamide solution was aspirated and gel polymerization solution (0.16% Ammonium persulfate and 0.2% TEMED in the 4% acrylamide solution) was added. Immediately, the tissues were covered with Gel Slick (Lonza #50640)-treated circular coverslips of 18 mm diameter (Ted Pella, 260369), transferred to an argon-filled chamber at room temperature and incubated for 30 min. After gel formation, the tissue sections were washed with 1x PBS twice and the coverslip was gently removed with a needle. At this point, the cDNA is crosslinked to the polyacrylamide gel.

Padlock probe capture

After cDNA crosslinking in gel, remaining RNA was digested with RNase mix (0.25U/μL RNase H, 2.5% Invitrogen RNase cocktail mix, 1x RNase H buffer) at 37 °C for 1 h followed by two PBST washes, 3 min each. The padlock probe library was mixed with Ampligase buffer. Then, the mixture was heated to 85 °C for 3 min and cooled on ice. Subsequently, the mixture was supplemented with 33.3U/μL Ampligase enzyme such that the final concentration of padlock probe library was 180 nM and 100 nM for the kidney and brain probe set, respectively, in 1x Ampligase buffer. Finally, the samples were incubated with probes at 37 °C for 30 min, and then moved to a 55 °C humidified oven for overnight incubation.

RCA and rolony crosslinking

After padlock probe capture, the tissue sections were washed with 1x PBS three times, 3 min each and hybridized with RCA primer solution (0.5 μM rca_primer, 2x SSC, and 30% formamide) at 37 °C for 1 h. Then, the tissue sections were washed with 2x SSC twice and incubated with Phi29 polymerase solution (0.2 U/μL Phi29 polymerase, 1x Phi29 polymerase buffer, 0.02 mM aminoallyl-dUTP, 1 mg/mL BSA, and 0.25 mM dNTP) at 30 °C in a humidified chamber for 7 h. Afterwards, the tissue sections were washed with 1x PBS twice, 3 min each and the rolonies were crosslinked with 5 mM BS(PEG)9 in 1x PBS at room temperature for 1 h. The crosslinking reaction was terminated with 1M Tris, pH 8.0 solution at room temperature for 30 min. Finally, samples were washed with 1x PBS twice and stored in a 4 °C fridge until image acquisition.

Image acquisition

Human Brain

Human brain tissue sample was stained with 1x TrueBlack in 70% ethanol at room temperature for 2 min to reduce the lipofuscin autofluorescence and washed with 1x PBS three times for 3 min each before imaging. For the anchor round imaging, a mixture of anchor round probes, including DARTFISH_anchor_Cy3, dcProbe10_ATTO647N, and dcProbe7_AF488 probes, were diluted to 500nM in 2x SSC and 30% formamide. Then, the samples were stained with anchor round probes at room temperature for 5 min and washed with 1 mL washing buffer (2x SSC, 10% formamide and 0.1% Tween-20) twice for 2 min each prior to imaging. The samples were immersed in 1 mL imaging buffer (2x SSC and 10% formamide) during imaging. For decoding imaging, each imaging cycle started with incubating samples with stripping buffer (2x SSC, 80% formamide, and 0.1% Tween-20) at room temperature for 5 min, washed with washing buffer twice for 2 min each, stained with a mixture of AlexaFluor488, Cy3, and ATTO647 fluorophore-labeled decoding probes (dcProbe0-AF488, dcProbe0-Cy3, and dcProbe0-ATTO647N as an example for round 1) in 2x SSC and 30% formamide for 10 min, and washed with washing buffer three times for 2 min each. Then, the samples were immersed in 1 mL of imaging buffer while imaging. After the last cycle of decoding imaging, DRAQ5 staining (5 μM, room temperature, 10 min) was performed for nuclei segmentation. Z-stack images were acquired by a resonant-scanning Leica TCS SP8 confocal microscope with 20x oil-immersion objective (NA = 0.75), pinhole size of 1 airy unit, pixel size of 284 nm x 284 nm (zoom=2) with 1024 x 1024 pixels per image, and 2 line averaging with 26 z-stacks (step size 1μm).

Human Kidney

The same fluorescent probes were used as in the human brain imaging in this order: anchor round, decoding rounds 1 to 7, DRAQ5 nuclear staining. All hybridizations were performed with 500nM of each of the fluorescent oligos in 2x SSC and 30% formamide for 15 min. Following hybridization, the unbound probes were washed with 4–5 washes with PBST each 2–3 min. Imaging was performed in PBST on a resonant-scanning Leica SP8 with a 20x oil-immersion objective (NA = 0.75), pinhole size of 2 airy units, pixel size of 366 nm x 366 nm (zoom=1.55) with 1024 x 1024 pixels per image, 3 line averaging, with 24 z-stacks (step size 2.5um). After each imaging round, stripping was performed with 80% formamide in 2x SSC and 0.1% Tween-20, 3 times each 3-5 min, followed by 2 quick washes with PBST to prepare for the next hybridization.

RNAscope

Sample preparation

RNAscope HiPlex 50x probe stocks of human SLC17A7, RELN, CUX2, RORB, CBLN2, FEZF2, GAD2, PVALB, LAMP5, PLP1, AQP4,and APBB1IP with HiPlex12 Reagent Kit v2 (488, 550, 650) Assay (ACD, 324419) were purchased from Advanced Cell Diagnostics (ACD). The 50x probe stocks and RNAscope HiPlex diluent were warmed at 40 °C for 10 min. The pre-warmed 50x probe stocks were pooled and diluted to 1x with pre-warmed RNAscope HiPlex diluent before use. RNAscope experiments were carried out according to the manufacturer’s protocol (document number UM324419) with slight modifications for post-mortem human brain tissue. Briefly, the human brain tissue sections were fixed with 4% PFA in 1x PBS at 4 °C for 1 h and dehydrated with a series of 50%, 70%, 100%, and 100% ethanol at room temperature for 5 min each. Then, silicone isolators of 20 mm in diameter (Grace Bio-Labs, 664304) were applied around the tissue sections and the tissue sections were slightly digested with 5 drops of Protease IV at room temperature for 30 min and washed with 1x PBS for 2 min twice. Subsequently, enough volume of 1x pooled probes was added to cover the tissue sections entirely and the probe hybridization was performed in the 40 °C HybEZ Hybridization System for 2 h. Then, the tissue sections were washed with 1 mL 1x wash buffer at room temperature for 2 min twice. Later, the tissue sections were hybridized with RNAscope HiPlex Amp1, incubated in the 40 °C HybEZ Hybridization System for 30 min, and washed with 1x wash buffer at room temperature for 2 min twice. Afterwards, we followed the same process to hybridize the tissue sections with RNAscope HiPlex Amp2 and RNAscope Hiplex Amp3. Finally, we incubated the tissue section with freshly prepared 5% HiPlex FFPE reagent at room temperature for 30 min and washed the tissue sections with 1 mL 1x wash buffer at room temperature for 2 min twice prior to image acquisition.

Image acquisition

The tissue sections with silicone isolators were mounted on the stage of a Leica SP8 confocal microscope and 4 cycles of imaging were performed to image 12 RNA species. In the first imaging cycle, RNAscope HiPlex Fluoro T1-T3 probes were prewarmed at 40 °C, added to cover the tissue sections entirely, and hybridized with the tissue sections for 5 min thrice. After probe hybridization, the tissue sections were washed with 1 mL 1x wash buffer at room temperature for 2 min twice and immersed in 1 mL 4x SSC buffer. Z-stack images were acquired by Leica TCS SP8 confocal microscope with 63x oil-immersed objective (NA 1.4) and pixel size of 113 nm x 113 nm. Then, the fluorophores were cleaved with freshly prepared 10% cleaving solution (100 μL cleaving solution diluted with 900 μL 4x SSC buffer) at room temperature for 15 min and the tissue sections were washed with 0.5% PBST (1x PBS with 0.5% Tween-20) at room temperature for 2 min twice. The fluorophore cleaving process was repeated once to ensure the fluorophores were removed entirely. This process was repeated 3 more rounds to image RNAscope HiPlex Fluoro T4-T12. An additional "Empty" cycle was performed to image the autofluorescence of the human brain tissue without any probes. After the last imaging cycle, we added 80% formamide in 2x SSC buffer to remove RNAscope probes completely and stained the nuclei with 5 μM DRAQ5 at room temperature for 10 min.

RNAscope data processing

RNAscope data was processed with the DART-FISH pipeline with one modification. The images from the “Empty” cycle were subtracted from all RNAscope images to remove the autofluorescence.

DART-FISH data processing (DF3D)

The DART-FISH datasets were processed by our custom pipeline. The source codes of the pipeline can be found in this Github page (https://github.com/Kiiaan/DF3D). Raw z-stack images with 4 channels (3 fluorescent channels and brightfield) from the microscope were registered to a reference round by affine transformation implemented in SimpleElastix⁸⁵ using the brightfield channel as the anchor. Then, each field of view (FOV) underwent decoding to obtain a list of candidate spots. Spots from all FOVs were pooled and filtered (See Sparse deconvolution (SpD) decoder for more details). To obtain the global position of the rolonies, the FOVs were stitched by applying FIJI’s⁸⁶ Grid/Collection Stitching plugin⁸⁷ (in headless mode) to the registered and maximum-projected brightfield images. Note that the theoretical positions of the FOVs, defined by the microscope, were used as initial positions for stitching.

Cell boundaries were segmented with Cellpose (v2.1.1)^88,89. The “cyto” model in Cellpose was fine tuned on each tissue by manually segmenting a handful of composite images of DRAQ5 (nuclei channel) and N9 cDNA stain (cyto channel) using the package’s graphical user interface.

Sparse deconvolution (SpD) decoding

In DART-FISH, each gene is represented by a barcode that can be read out in $n$ rounds of $3$-channel imaging. Each barcode is designed to emit fluorescence (be “on”) in exactly $k$ rounds, each time in a single fluorescent channel and stay “off” in other rounds. We concatenate the rounds and channels and represent the barcodes as $3n$-dimensional vectors. In other words, barcode $i$ is represented by vector ${x}_{i}$ in which $1$’s are placed where “on” signal is expected, and 0’s everywhere else. The codebook matrix $X$ (3nx$N$) is then defined as $X=[{{{{{{\bf{x}}}}}}}_{1},\, {{{{{{\bf{x}}}}}}}_{2},...,\,{{{{{{\bf{x}}}}}}}_{N}]$, where $N$ is the total number of barcodes. In the same way, for every pixel we concatenate the fluorescent intensity values (scaled between 0 and 1) to create a $3n$-dimensional vector ${{{{{\bf{y}}}}}}$.

The fluorescence signal at each pixel can be sourced from more than one rolony if the distance between neighboring rolonies is smaller than the optical resolution of the imaging system, or if 3-dimensional stacks are analyzed as maximum-projected 2D images. Nevertheless, because of physical constraints, only a handful of rolonies are expected to be the source of signal to each pixel. In this regard, because of the redundancy in the barcode space, combinations of barcodes in one pixel can be decomposed into their original composing barcodes. We formulated this problem as a regularized linear regression problem where a weighted sum of a few barcodes creates the observed signal intensity, where the vector ${{{{{\bf{w}}}}}}={[{w}_{1},\,{w}_{2},...,\,{w}_{N}]}^{T}$ shows the contribution of each barcode (Fig. 1d) with most ${w}_{s}(1\le s\le N)$ elements equal to 0. We initially used lasso to solve this problem ($\alpha {^{\prime}}=0$ in Fig. 1d) to promote the sparsity of ${{{{{\bf{w}}}}}}$, but later decided to use elastic net with a non-zero value for $\alpha {^{\prime}}$ that is much smaller than $\alpha$ ($\alpha {^{\prime}}=\alpha /100$) to increase stability. We call the solution to this problem ${\widehat{{{{{{\bf{w}}}}}}}}_{{{{lasso}}}}$. Note that, we constrain the problem to positive weight values (${\widehat{{{{{{\bf{w}}}}}}}}_{{{{{lasso}}}}_{{{{s}}}}}{{{{\ge }}}}{{{0}}}$ for every $s$). The regression problems are solved for all the foreground pixels (${||}{{{{{\bf{y}}}}}}{{{{||}}}}_{{{{2}}}}\,{{{ > }}}\,{{{0}}}{{{.}}}{{{25}}}$) individually. For every barcode $i$, we can construct an image with the estimated weight values as pixels: 0 for background and rejected pixels, and non-zero values from $\widehat{{{{{{\bf{w}}}}}}}$. We call these images weight maps. Figure 1f shows weight maps constructed with ${\widehat{{{{{{\bf{w}}}}}}}}_{{{{lasso}}}}$ which have not been filtered.

With our current barcode space, we can only confidently decompose bi-combinations. Hence, for every instance of the elastic net problem, we applied an elbow filter and accepted the solution only when the top one or two weights were significantly larger than other weights.

In more detail, for every pixel, the weights in ${\widehat{{{{{{\bf{w}}}}}}}}_{{{{lasso}}}}$ are sorted in decreasing order. If the second largest weight is smaller than half of the top weight, then the top weight passes the elbow filter. Otherwise, if the third largest weight is smaller than 30% of the largest weight, the top two weights pass the elbow filter. All the values that do not pass the filter are set to zero. For accepted solutions, we performed an ordinary least square (OLS) regression using the top one or two weights to obtain unbiased weights (${\widehat{{{{{{\bf{w}}}}}}}}_{{{{OLS}}}}$). Supplementary Fig. 2a shows weight maps constructed with ${\widehat{{{{{{\bf{w}}}}}}}}_{{{{{OLS}}}}}$ (OLS maps) after applying a Gaussian smoothing.

Estimating channel-specific coefficients

So far, we have assumed that pixel intensities from different rounds and fluorescent channels all have the same scale and distribution. However, there is usually a variation among rounds and fluorescent channels, with some channel-rounds being brighter than others. To account for this effect, we model the channel-specific variations as a multiplicative factor that connects the weights at each pixel to intensities: ${{{{{\bf{y}}}}}}{{{{=}}}}{{{{{\bf{c}}}}}}\,{{\odot }}\,X{{{{{\bf{w}}}}}}$ where ${{{{{\bf{c}}}}}}={[{c}_{1},\,{c}_{2},...,\,{c}_{3n}]}^{T}$ is the channel coefficient vector and ${{{{{\boldsymbol{\odot }}}}}}$ denotes element-wise multiplication. Suppose for a set of pixels ${{{{{{\bf{y}}}}}}}^{{{{{(}}}}{{{{1}}}}{{{{)}}}}}\,{{{{,}}}}\,{{{{{{\bf{y}}}}}}}^{{{{{(}}}}{{{{2}}}}{{{{)}}}}}{{{{,}}}}{{{{.}}}}{{{{.}}}}{{{{.}}}}{{{{,}}}}\,{{{{{{\bf{y}}}}}}}^{{{{{(}}}}{{{{P}}}}{{{{)}}}}}$ the true barcode weights ${{{{{{\bf{w}}}}}}}^{{{{{{\boldsymbol{(}}}}}}{{{{{\boldsymbol{1}}}}}}{{{{{\boldsymbol{)}}}}}}}{{{{{\boldsymbol{,}}}}}}\,{{{{{{\bf{w}}}}}}}^{{{{{{\boldsymbol{(}}}}}}{{{{{\boldsymbol{2}}}}}}{{{{{\boldsymbol{)}}}}}}}{{{{{\boldsymbol{,}}}}}}{{{{{\boldsymbol{.}}}}}}{{{{{\boldsymbol{.}}}}}}{{{{{\boldsymbol{.}}}}}}{{{{{\boldsymbol{,}}}}}}\,{{{{{{\bf{w}}}}}}}^{{{{{{\boldsymbol{(}}}}}}{{{{{\boldsymbol{P}}}}}}{{{{{\boldsymbol{)}}}}}}}\,$ are given. For pixel $i$ and channel $j$, we could write: ${y}_{j}^{(i)}={c}_{j}{\sum }_{b=1}^{N}{X}_{{jb}}{w}_{b}^{(i)}={c}_{j}{\sum }_{b=1}^{N}{({{{{{{\bf{x}}}}}}}_{{{{{j}}}}})}_{b}{w}_{b}^{(i)}$ where ${({{{{{{\bf{x}}}}}}}_{{{{{j}}}}})}_{b}$ shows the $b$’s element of the $j$’s barcode. In this case, each ${c}_{j}$ can be estimated by solving an OLS problem between ${y}_{j}^{(.)}$ and ${\sum }_{b=1}^{N}{({{{{{{\bf{x}}}}}}}_{{{{{j}}}}})}_{b}{w}_{b}^{(.)}$. Conversely, if the channel coefficients are given, we can set up the decoding problem with normalized intensities: $\bar{{{{{{\bf{y}}}}}}}={{{{{\bf{y}}}}}}{{{{{\boldsymbol{/}}}}}}{{{{{\bf{c}}}}}}=X{{{{{\bf{w}}}}}}$ with $/$ being element-wise division. We estimate the channel coefficients in an iterative manner following the algorithm below:

1.
Initialize ${{{{{\bf{c}}}}}}{{{{=}}}}{{{{{\bf{1}}}}}}$ (no channel variation)
2.
Take a random sample of foreground pixels
3.
Normalize the pixel intensities in the sample with ${{{{{\bf{c}}}}}}$
4.
Run SpD on the normalized pixels
5.
Keep pixels with one dominant unsaturated weight (weight in range 0.1 and 0.5) and obtain unbiased weights through OLS
6.
Update the values of ${{{{{\bf{c}}}}}}$ by solving $3n$ OLS problems
7.
Repeat steps 3–6 ${n}_{{iter}}$ times

We do this procedure for 2 iterations and apply the obtained values when decoding all fields of view.

Setting the elastic net regularization parameter

Because of physical constraints, the solution to the deconvolution problem must be sparse, i.e., only a few non-zero weights should explain the observed intensities. The sparsity of the solution is directly controlled by the L1 regularization term, $\alpha$ (Fig. 1d). For a given pixel ${{{{{\bf{y}}}}}}$, higher values of $\alpha$ shrink the estimated weights (${{{{{||}}}}\widehat{{{{{{\bf{w}}}}}}}}_{{{{{lasso}}}}}{{{{|}}}}{{{{{|}}}}}_{{{{{1}}}}}{{{{\to }}}}{{{{0}}}}$). Conversely, lower values of $\alpha$ allow more weights to be non-zero and ${{{{{||}}}}\widehat{{{{{{\bf{w}}}}}}}}_{{{{{lasso}}}}}{{{{|}}}}{{{{{|}}}}}_{{{{{1}}}}}$ to grow larger. In fact, one can show if the L2 regularization term, $\alpha {^{\prime}}=0$, the largest weight to be undetected for a pixel made purely from one barcode is ${w}_{\max }=\frac{3n}{k}\alpha$ ⁹⁰. For instance, given $\alpha=0.05$ and codebook parameters $n=6,\,{k}=3$, then ${w}_{\max }=0.3$. This means that a pixel composed of one barcode needs to have an underlying intensity $ > 0.3$ to get a non-zero ${\widehat{{{{{{\bf{w}}}}}}}}_{{{{{lasso}}}}}$. In other words, setting $\alpha$ too strictly will result in dimmer pixels to have ${\widehat{{{{{{\bf{w}}}}}}}}_{{{{{lasso}}}}}{{{{=}}}}{{{{{\boldsymbol{0}}}}}}$, while setting $\alpha$ too loosely will result in spurious non-zero values in ${\widehat{{{{{{\bf{w}}}}}}}}_{{{{{lasso}}}}}$ for brighter more complex pixels, potentially not passing the elbow filter and thus ${\widehat{{{{{{\bf{w}}}}}}}}_{{{{{OLS}}}}}{{{{=}}}}{{{{{\boldsymbol{0}}}}}}$. To accommodate a wide range of rolony intensities, we choose $\alpha$ adaptively based on the pixel norm ${||}{{{{{\bf{y}}}}}}{{{{{||}}}}}_{{{{{2}}}}}$. First, we form a training data from a random subset of foreground pixels indexed by $i$. For a given pixel norm $u$, we find the alpha that maximizes a weighted sum of ${||}{{\widehat{{{{{{\bf{w}}}}}}}}^{(i)}}_{{{{{OLS}}}}}|{|}_{1}$ giving more weights to training pixels with closer norms to $u$ (equation ∗):

$$\alpha (u)={argma}{x}_{\alpha }\mathop{\sum}\limits_{i}g\left(\frac{u-{||}{{{{{{\bf{y}}}}}}}^{\left(i\right)}|{|}_{2}}{\sigma }\right){||}{{\widehat{{{{{{\bf{w}}}}}}}}^{(i)}}_{{{{{OLS}}}}}{{{{(}}}}\alpha {{{{)}}}}|{|}_{1}$$

where $g(.)$ is the Gaussian function. In practice, for the training pixels we solve the sparse decoding problem for every value of $\alpha$ on a grid from 0.01 to 0.1 with a step size of 0.005, ${{{{{{\boldsymbol{\alpha }}}}}}}_{{{{{{\rm{train}}}}}}}$, to obtain estimated weights ${{\widehat{{{{{{\bf{w}}}}}}}}^{(i)}}_{{{{{OLS}}}}}{{{{{\boldsymbol{(}}}}}}\alpha {{{{{\boldsymbol{)}}}}}}$. Then we create a grid of norms ${{{{{{\boldsymbol{u}}}}}}}_{{{{{{\rm{train}}}}}}}$, spanning 0 and 2.8 with 50 steps. For every value of $u$ in ${{{{{{\boldsymbol{u}}}}}}}_{{{{{{\rm{train}}}}}}}$, we solve equation ∗ on the ${{{{{{\boldsymbol{\alpha }}}}}}}_{{{{{{\rm{train}}}}}}}$ grid. In other words, we create a lookup table connecting values of ${{{{{{\boldsymbol{u}}}}}}}_{{{{{{\rm{train}}}}}}}$ to the best $\alpha$ in ${{{{{{\boldsymbol{\alpha }}}}}}}_{{{{{{\rm{train}}}}}}}$. For new pixels, $\alpha$ is determined by the closest norm in the lookup table.

Spot calling

To call spots, Gaussian smoothing is applied to individual OLS maps, followed by peak_local_max filter (scikit-image 0.19.3⁹¹) which returns a binary image with 1’s at the local maxima of the smoothed OLS maps. These peaks are then used as markers for watershed segmentation. From each segmented region, the following features are retained to be used in downstream steps: area, centroid, maximum and average intensity. This formed a list of candidate spots from each FOV.

Spot filtering

To control the specificity of the decoding procedure, we augmented the codebook with a number of barcodes (5-10% of the used barcodes) not used in the probe set (empty barcodes). After spot calling, we record the properties (e.g., area, maximum and average intensity) of spots with an empty barcode. Indeed, we see that empty spots tend to be smaller with lower average/maximum weight (Supplementary Fig. 2c and d). On a small fraction of spots from all fields of view, we train a random forest classifier (scikit-learn v1.1.3) with area, maximum and average weights as features to predict empty/non-empty labels (Supplementary Fig. 2e). We applied the classifier to all spots and obtained emptiness probabilities and set a threshold on these probabilities (0.3–0.35).

Spot assignment to cells

The cell boundaries were computed by applying find_boundaries (scikit-image 0.19.3⁹¹) to the segmentation mask. The distances of all spots were calculated to the closest boundary pixel. The distance was set to 0 if a spot was inside a boundary. A spot was assigned to its closest cell if the distance was less than or equal to 11μm in the kidney, 3μm for non-MBP and 0μm for MBP spots in the brain.

Cell annotation

We used anndata^92,93,94 (v0.8.0) and scanpy⁹²(v1.9.1) to handle and analyze the data. The data normalization was performed using analytic Pearson residuals⁹⁵ (clipped at 40) with a lower bound placed on gene-level standard deviations⁹⁶. Clustering was done with the Leiden algorithm⁹⁷ implemented in scanpy.

Annotating the Brain data set

Cells with counts less than 5 and more than 300 were removed (2980 out of 26348). The top 100 highly variable genes (scanpy.experimental.pp.highly_variable_gene(., flavor=’pearson_residuals’)) were used for normalization, embedding and annotations. PCA was performed on pearson residuals, and the neighborhood graph was created with this command scanpy.pp.neighbors(., n_neighbors = 20, n_pcs = 15, metric=’cosine’). Single-nucleus RNA-seq reference from Jorstad et al.⁶¹ was subsetted to M1C cells and normalized in the same way as DART-FISH. Pax6 and Scng subclasses were removed since we did not design our probe set to target those. Average normalized counts (centroids) were computed for every other subclass in the “within_area_subclass” slot and all clusters of DART-FISH. To annotate the DART-FISH clusters at the class level (excitatory, inhibitory, non-neuronal), we first correlated each cluster to all single-nucleus subclasses, and assigned that cluster to the class of the most highly correlated subclass. Annotation of each class was done separately.

For excitatory neurons, all DART-FISH cells that had a class label of “excitatory” and had at least 20 transcripts were kept (5957 cells). We realized that the Leiden clustering was unstable and by mere shuffling of the order of cells, we would obtain very different clusters. We reasoned that by removing some cells that tend to move between clusters, we could get more stable clusters and have more confidence in their annotation. To find cells that don’t stably cluster, we ran clustering 20 times, every time shuffling the order of the cells. For every cell, we calculated the number of times it was co-clustered with every other cell and took the average of the non-zero values as the co-clustering index (CCI). A perfect CCI of 20 means that the cell is clustered with the same partners in every clustering instance, while lower values show deviations from this limit. We removed the cells with a CCI smaller than 6 and repeated this filtering procedure for three more iterations. The final results show a more stable clustering of the remaining 5101 cells. We then constructed a new neighborhood graph using newly computed principal components (n_neighbors=10, n_pcs=15), followed by Leiden clustering. The cluster centroids were calculated and correlated to the reference subclass centroids. We assigned clusters to their maximally correlated reference subclass if we could also see differential expression of their marker genes (scanpy’s rank_genes_groups), otherwise we labeled them as NA. Of note, the DART-FISH population labeled as L6b/CT was highly correlated with reference subclasses L6b and L6 CT (Supplementary Fig. 5b) and showed expression of marker genes from both subclasses.

For inhibitory neurons and non-neuronal cells, the clustering was more stable to begin with, and we started by constructing the neighborhood matrix (For inhibitory neurons: n_neighbors=20, n_pcs=10. For non-neuronal cells: n_neighbors=25, n_pcs=15) and clustering. Then clusters were assigned to the reference subclass with maximum Pearson’s correlation if the marker genes matched, or otherwise were labeled as NA.

Drawing cortical layer boundaries

Cortical layer boundaries were automatically drawn via Support Vector Machine (SVM) decision boundaries. The Scikit-learn python package (v1.1.3) was used to train a SVM on the following excitatory neuron subtype labels: “L2/3 IT”, “L4 IT”, “L5 IT”, “L6 IT”, “L6b/CT”. First, cells with fewer than 10 total gene counts were filtered out. The x and y coordinates of the cells are standardized via the StandardScaler() function, and the data was fed into a SVM with a radial basis function (RBF) kernel with balanced class weights and one vs. one decision function. The RBF SVM model is then applied to a meshgrid with a fine step size with the same geometric size as the original tissue image. The trained SVM classified the cell type label of each point on the meshgrid to define borders between the cortical layers specified by the excitatory neuron subclasses. We drew contours based on the borders between the various subclasses, and manually superimposed them onto Fig. 3c.

Gene concordance analysis

The RNA portion of the SNARE-seq2 (snare) dataset from Bakken et al.⁴⁶ and Plongthongkum et al.⁵⁰ was used in this section. First, the snare data was subsetted to the DART-FISH genes. Then, DART-FISH and snare data were both normalized (scanpy.pp.normalize_total(., target_sum = 1000)) followed by log-normalization (scanpy.pp.log1p(.)). The average normalized gene expression was calculated for all subclasses. For each gene, the concordance was defined as the Pearson’s correlation between the average expressions across the subclasses between the DART-FISH and snare data (top panel of Supplementary Fig. 5c). The same analysis was performed for a MERFISH data set from Fang et al.⁵⁹ (sample H18.06.006.MTG.250.expand.rep1) with the following details: the subclass labels from metadata column “cluster_L2” were renamed to be consistent with DART-FISH annotations. In particular, subclasses L6b and L6 CT were merged, and subclass L5 ET was removed. Note that subclasses Sst Chodl, Chandelier and Lamp5 Lhx6 were not annotated in the MERFISH dataset and were removed from the DART-FISH analysis for consistency. The rest of the analysis was carried out with 242 shared genes between the datasets (bottom panel of Supplementary Fig. 5c).

Annotating the kidney data set

Cells with less than 5 and more than 100 transcripts were filtered (2024 out of 65565). The top 250 highly variable genes were kept for downstream analyses (scanpy.experimental.pp.highly_variable_gene(., flavor=’pearson_residuals’)). PCA was performed on pearson residuals, and the neighborhood graph was constructed using the command scanpy.pp.neighbors(., n_neighbors = 20, n_pcs = 20, metric=’cosine’) followed by Leiden clustering (l1 clustering). From the kidney reference atlas⁷³, degenerative, cycling, transitioning and medullary cell types were removed. The counts were transformed to pearson residuals and the remaining subclass level 1 and level 2 centroids were calculated. We then calculated the Pearson correlations between subclass level 1 centroids and cluster centroids and assigned each l1 cluster to the subclass level 1 with maximum correlation. We then subclustered each of the l1 clusters and assigned those to subclass level 2 identities with maximum correlation, only if the relevant marker genes were expressed. Through this procedure we could not resolve PT-S1 and PT-S2 subtypes separately; thus, we labeled the clusters that were highly correlated with these populations as PT-S1/S2. Similarly, for immune cells, this procedure could confidently resolve MAC-M2 cells and the general myeloid (IMM_Myl) and lymphoid (IMM_Lym) populations. To annotate the immune cells at higher level of granularity, we updated their subclass level 2 labels with the following strategy: Each DART-FISH cell with subclass level 1 label “IMM” was separately correlated with the following immune subtypes in the reference atlas: B, PL, T, MAC-M2, MDC, cDC. The immune subtypes with highest and 2nd highest correlation were kept. If the highest correlation was larger than 0.4 and the ratio of the highest to the 2nd highest correlation was larger than 1.25, the label was updated to that of the highest correlated subtype, otherwise it remained unchanged.

Cell-cell interaction analysis

We used squidpy.gr.co_occurrence function (v1.2.4.dev27+gb644428) with n_splits = 1 and an interval between 7μm and 110μm⁷⁷.

Comparison of decoding methods

Datasets of varying levels of complexity were simulated to compare SpD with StarFish⁴² (pixel-based naive matching), BarDensr⁴⁴ and ISTDECO⁴³ (deconvolution-based methods). The synthetic datasets were constructed using the human brain codebook (3-on-3-off, 121 genes with 10 empty barcodes) with equal abundance of all genes and uniform spatial distribution of spots. The rolonies were modeled as gaussian spots with peak intensity randomly chosen to be between 0.25 and 0.7 and sigma between 2 and 2.5 pixels. To model channel-specific intensity variation, we randomly drew 18 channel-specific coefficients from a uniform distribution between 0.75 and 1.25 to scale their respective images, while clipping the intensity values above 1. We simulated multiple datasets varying the number of spots between ${5 * 10}^{3}$ to $4 * {10}^{5}$ spots in a field of view of size 1024 x 1024 pixels. Different decoding methods were applied to the synthetic datasets with default settings to the extent possible, with no post-hoc filtering of the spots. The only exception was StarFish for which the distance threshold was set to 0.7 as a fair balance between specificity and sensitivity. Then, the groundtruth spots were matched one-to-one to the decoded spots if the barcodes were identical and the centroids were closer than 6 pixels. Sensitivity is defined as the fraction of groundtruth spots matched with a decoded spot. Specificity is defined as the fraction of matched decoded spots over all decoded spots. Empty rate is the fraction of empty barcodes among all decoded barcodes and is inversely related to specificity.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability

The spot tables, RiboSoma images and segmentation masks are available on figshare for human brain (https://doi.org/10.6084/m9.figshare.23932863.v1)⁹⁸ and for human kidney (https://doi.org/10.6084/m9.figshare.23937057.v1)⁹⁹. All registered DART-FISH images, codes and intermediate outputs of the processing pipeline are available on Zenodo (https://doi.org/10.5281/ZENODO.8253771)¹⁰⁰. Source data are provided with this paper. The single-nucleus RNA sequencing reference atlas of human kidney⁷³ is available on GEO (GSE183277). SNARE-seq data for human M1C^46,50 is available at Brain Cell Data Center (https://biccn.org/data) under U01 ZhangKun grant ID (U01MH114828). The M1C data from Jorstad et al.⁶¹ is available for download from the Neuroscience Multi-omics Archive (https://data.nemoarchive.org/publication_release/Human_Cross_Areal_Analysis/). Source data are provided with this paper.

Code availability

The python code for the DART-FISH processing pipeline and SpD are available on this Github repository: https://github.com/Kiiaan/DF3D.

References

Crosetto, N., Bienko, M. & van Oudenaarden, A. Spatially resolved transcriptomics and beyond. Nat. Rev. Genet. 16, 57–66 (2015).
Article CAS PubMed Google Scholar
Rao, A., Barkley, D., França, G. S. & Yanai, I. Exploring tissue architecture using spatial transcriptomics. Nature 596, 211–220 (2021).
Article ADS CAS PubMed PubMed Central Google Scholar
Luengo-Oroz, M. A., Ledesma-Carbayo, M. J., Peyriéras, N. & Santos, A. Image analysis for understanding embryo development: a bridge from microscopy to biological insights. Curr. Opin. Genet. Dev. 21, 630–637 (2011).
Article CAS PubMed Google Scholar
Close, J. L., Long, B. R. & Zeng, H. Spatially resolved transcriptomics in neuroscience. Nat. Methods 18, 23–25 (2021).
Article CAS PubMed Google Scholar
Saviano, A., Henderson, N. C. & Baumert, T. F. Single-cell genomics and spatial transcriptomics: discovery of novel cell states and cellular interactions in liver physiology and disease biology. J. Hepatol. 73, 1219–1230 (2020).
Femino, A. M., Fay, F. S., Fogarty, K. & Singer, R. H. Visualization of single RNA transcripts in situ. Science 280, 585–590 (1998).
Raj, A., van den Bogaard, P., Rifkin, S. A., van Oudenaarden, A. & Tyagi, S. Imaging individual mRNA molecules using multiple singly labeled probes. Nat. Methods 5, 877–879 (2008).
Article CAS PubMed PubMed Central Google Scholar
Rodriguez, A. J., Czaplinski, K., Condeelis, J. S. & Singer, R. H. Mechanisms and cellular roles of local protein synthesis in mammalian cells. Curr. Opin. Cell Biol. 20, 144–149 (2008).
Article CAS PubMed PubMed Central Google Scholar
Buxbaum, A. R., Haimovich, G. & Singer, R. H. In the right place at the right time: visualizing and understanding mRNA localization. Nat. Rev. Mol. Cell Biol. 16, 95–109 (2015).
Article CAS PubMed Google Scholar
Codeluppi, S. et al. Spatial organization of the somatosensory cortex revealed by osmFISH. Nat. Methods 15, 932–935 (2018).
Article CAS PubMed Google Scholar
Chen, K. H., Boettiger, A. N., Moffitt, J. R., Wang, S. & Zhuang, X. RNA imaging. Spatially resolved, highly multiplexed RNA profiling in single cells. Science 348, aaa6090 (2015).
Article PubMed PubMed Central Google Scholar
Eng, C.-H. L. et al. Transcriptome-scale super-resolved imaging in tissues by RNA seqFISH+. Nature 568, 235–239 (2019).
Article ADS CAS PubMed PubMed Central Google Scholar
Moffitt, J. R. et al. Molecular, spatial, and functional single-cell profiling of the hypothalamic preoptic region. Science 362, eaau5324 (2018).
Benavides, S. H., Monserrat, A. J., Fariña, S. & Porta, E. A. Sequential histochemical studies of neuronal lipofuscin in human cerebral cortex from the first to the ninth decade of life. Arch. Gerontol. Geriatr. 34, 219–231 (2002).
Article CAS PubMed Google Scholar
Di Guardo, G. Lipofuscin, lipofuscin-like pigments and autofluorescence. Eur. J. Histochem. 59, 2485 (2015).
Article PubMed PubMed Central Google Scholar
Banerjee, B., Miedema, B. E. & Chandrasekhar, H. R. Role of basement membrane collagen and elastin in the autofluorescence spectra of the colon. J. Investig. Med. 47, 326–332 (1999).
CAS PubMed Google Scholar
Autofluorescence microscopy. A non-destructive tool to monitor mitochondrial toxicity. Toxicol. Lett. 206, 281–288 (2011).
Article Google Scholar
Bhargava, P. & Schnellmann, R. G. Mitochondrial energetics in the kidney. Nat. Rev. Nephrol. 13, 629–646 (2017).
Article CAS PubMed PubMed Central Google Scholar
Chen, X., Sun, Y.-C., Church, G. M., Lee, J. H. & Zador, A. M. Efficient in situ barcode sequencing using padlock probe-based BaristaSeq. Nucleic Acids Res 46, e22–e22 (2017).
Article PubMed Central Google Scholar
Qian, X. et al. Probabilistic cell typing enables fine mapping of closely related cell types in situ. Nat. Methods 17, 101–106 (2019).
Article PubMed PubMed Central Google Scholar
Ke, R. et al. In situ sequencing for RNA analysis in preserved tissue and cells. Nat. Methods 10, 857–860 (2013).
Article CAS PubMed Google Scholar
Ståhl, P. L. et al. Visualization and analysis of gene expression in tissue sections by spatial transcriptomics. Science 353, 78–82 (2016).
Article ADS PubMed Google Scholar
Rodriques, S. G. et al. Slide-seq: A scalable technology for measuring genome-wide expression at high spatial resolution. Science 363, 1463–1467 (2019).
Article ADS CAS PubMed PubMed Central Google Scholar
Vickovic, S. et al. High-definition spatial transcriptomics for in situ tissue profiling. Nat. Methods 16, 987–990 (2019).
Article CAS PubMed PubMed Central Google Scholar
Liu, Y. et al. High-Spatial-Resolution Multi-Omics Sequencing via Deterministic Barcoding in Tissue. Cell 183, 1665–1681.e18 (2020).
Article CAS PubMed PubMed Central Google Scholar
Chen, A. et al. Spatiotemporal transcriptomic atlas of mouse organogenesis using DNA nanoball-patterned arrays. Cell 185, 1777–1792.e21 (2022).
Article CAS PubMed Google Scholar
Polony gels enable amplifiable DNA stamping and spatial transcriptomics of chronic pain. Cell 185, 4621–4633.e17 (2022).
Moses, L. & Pachter, L. Museum of spatial transcriptomics. Nat. Methods 19, 534–546 (2022).
Article CAS PubMed Google Scholar
Lizardi, P. M. et al. Mutation detection and single-molecule counting using isothermal rolling-circle amplification. Nat. Genet. 19, 225–232 (1998).
Article CAS PubMed Google Scholar
Hardenbol, P. et al. Multiplexed genotyping with sequence-tagged molecular inversion probes. Nat. Biotechnol. 21, 673–678 (2003).
Article CAS PubMed Google Scholar
Gunderson, K. L. et al. Decoding randomly ordered DNA arrays. Genome Res 14, 870–877 (2004).
Article CAS PubMed PubMed Central Google Scholar
Gunderson, K. L., Steemers, F. J., Lee, G., Mendoza, L. G. & Chee, M. S. A genome-wide scalable SNP genotyping assay using microarray technology. Nat. Genet. 37, 549–554 (2005).
Article CAS PubMed Google Scholar
[3] Illumina Universal Bead Arrays. in Methods in Enzymology vol. 410 57–73 (Academic Press, 2006).
Fan, J.-B. & Zhang, K. Methods and compositions for single cell genomics. US Patent US14/742,027 (2021).
Sun, Y.-C. et al. Integrating barcoded neuroanatomy with spatial transcriptional profiling enables identification of gene correlates of projections. Nat. Neurosci. 24, 873–885 (2021).
Article CAS PubMed PubMed Central Google Scholar
Gyllborg, D. et al. Hybridization-based in situ sequencing (HybISS) for spatially resolved transcriptomics in human and mouse brain tissue. Nucleic Acids Res 48, e112 (2020).
Article CAS PubMed PubMed Central Google Scholar
Alon, S. et al. Expansion sequencing: Spatially precise in situ transcriptomics in intact biological systems. Science 371, eaax2656 (2021).
Diep, D. et al. Library-free methylation sequencing with bisulfite padlock probes. Nat. Methods 9, 270–272 (2012).
Article CAS PubMed PubMed Central Google Scholar
Chen, F., Tillberg, P. W. & Boyden, E. S. Optical imaging. Expansion microscopy. Science 347, 543–548 (2015).
Article ADS CAS PubMed PubMed Central Google Scholar
Chen, F. et al. Nanoscale imaging of RNA with expansion microscopy. Nat. Methods 13, 679–684 (2016).
Article CAS PubMed PubMed Central Google Scholar
Lee, J. H. et al. Highly multiplexed subcellular RNA sequencing in situ. Science 343, 1360–1363 (2014).
Article ADS CAS PubMed PubMed Central Google Scholar
Axelrod, S. et al. starfish: scalable pipelines for image-based transcriptomics. J. Open Source Softw. 6, 2440 (2021).
Article ADS Google Scholar
Andersson, A., Diego, F., Hamprecht, F. A. & Wählby, C. Istdeco: In situ transcriptomics decoding by deconvolution. bioRxiv https://doi.org/10.1101/2021.03.01.433040 (2021).
Chen, S. et al. BARcode DEmixing through Non-negative Spatial Regression (BarDensr). PLoS Comput. Biol. 17, e1008256 (2021).
Article CAS PubMed PubMed Central Google Scholar
Gray, D. A. & Woulfe, J. Lipofuscin and aging: a matter of toxic waste. Sci. Aging Knowl. Environ. 2005, re1 (2005).
Bakken, T. E. et al. Comparative cellular analysis of motor cortex in human, marmoset and mouse. Nature 598, 111–119 (2021).
Article ADS CAS PubMed PubMed Central Google Scholar
Yao, Z. et al. A transcriptomic and epigenomic cell atlas of the mouse primary motor cortex. Nature 598, 103–110 (2021).
Article ADS CAS PubMed PubMed Central Google Scholar
BRAIN Initiative Cell Census Network (BICCN). A multimodal cell census and atlas of the mammalian primary motor cortex. Nature 598, 86–102 (2021).
Article Google Scholar
Zhang, M. et al. Spatially resolved cell atlas of the mouse primary motor cortex by MERFISH. Nature 598, 137–143 (2021).
Article ADS CAS PubMed PubMed Central Google Scholar
Plongthongkum, N., Diep, D., Chen, S., Lake, B. B. & Zhang, K. Scalable dual-omics profiling with single-nucleus chromatin accessibility and mRNA expression sequencing 2 (SNARE-seq2). Nat. Protoc. 16, 4992–5029 (2021).
Article CAS PubMed Google Scholar
White, K. et al. Effect of Postmortem Interval and Years in Storage on RNA Quality of Tissue at a Repository of the NIH NeuroBioBank. Biopreserv. Biobank. 16, 148–157 (2018).
Article CAS PubMed PubMed Central Google Scholar
Müller, C., Bauer, N. M., Schäfer, I. & White, R. Making myelin basic protein -from mRNA transport to localized translation. Front. Cell. Neurosci. 7, 169 (2013).
Article PubMed PubMed Central Google Scholar
Costa, M. R. & Müller, U. Specification of excitatory neurons in the developing cerebral cortex: progenitor diversity and environmental influences. Front. Cell. Neurosci. 8, 449 (2014).
PubMed Google Scholar
Nieto, M. et al. Expression of Cux-1 and Cux-2 in the subventricular zone and upper layers II-IV of the cerebral cortex. J. Comp. Neurol. 479, 168–180 (2004).
Article CAS PubMed Google Scholar
Schaeren-Wierners, N., André, E., Kapfhammer, J. P. & Becker-André, M. The ExDression pattern of the orphan nuclear receptor RORβ in the developing and adult rat nervous system suggests a role in the processing of sensory information and in circadian rhythm. Eur. J. Neurosci. 9, 2687–2701 (1997).
Article Google Scholar
Arlotta, P. et al. Neuronal subtype-specific genes that control corticospinal motor neuron development in vivo. Neuron 45, 207–221 (2005).
Article CAS PubMed Google Scholar
Zeng, H. et al. Large-scale cellular-resolution gene profiling in human neocortex reveals species-specific molecular signatures. Cell 149, 483–496 (2012).
Article CAS PubMed PubMed Central Google Scholar
Reiner, A., Yang, M., Cagle, M. C. & Honig, M. G. Localization of cerebellin-2 in late embryonic chicken brain: implications for a role in synapse formation and for brain evolution. J. Comp. Neurol. 519, 2225–2251 (2011).
Article CAS PubMed PubMed Central Google Scholar
Fang, R. et al. Conservation and divergence of cortical cell organization in human and mouse revealed by MERFISH. Science 377, 56–62 (2022).
Article ADS CAS PubMed PubMed Central Google Scholar
Borm, L. E. et al. Scalable in situ single-cell profiling by electrophoretic capture of mRNA using EEL FISH. Nat. Biotechnol. 41, 222–231 (2022).
PubMed PubMed Central Google Scholar
Jorstad, N. L. et al. Transcriptomic cytoarchitecture reveals principles of human neocortex organization. Science 382, eadf6812 (2023).
Article CAS PubMed Google Scholar
Jorstad, N. L. et al. Comparative transcriptomics reveals human-specific cortical features. Science 382, eade9516 (2023).
Article CAS PubMed PubMed Central Google Scholar
Kim, Y. et al. Brain-wide Maps Reveal Stereotyped Cell-Type-Based Cortical Architecture and Subcortical Sexual Dimorphism. Cell 171, 456–469.e22 (2017).
Article CAS PubMed PubMed Central Google Scholar
Chun, J. J. & Shatz, C. J. Interstitial cells of the adult neocortical white matter are the remnant of the early generated subplate neuron population. J. Comp. Neurol. 282, 555–569 (1989).
Article CAS PubMed Google Scholar
Chun, J. J. & Shatz, C. J. The earliest-generated neurons of the cat cerebral cortex: characterization by MAP2 and neurotransmitter immunohistochemistry during fetal life. J. Neurosci. 9, 1648–1667 (1989).
Article CAS PubMed PubMed Central Google Scholar
Tan, S.-S. et al. Oligodendrocyte positioning in cerebral cortex is independent of projection neuron layering. Glia 57, 1024–1030 (2009).
Article PubMed Google Scholar
Ohtomo, R., Iwata, A. & Arai, K. Molecular Mechanisms of Oligodendrocyte Regeneration in White Matter-Related Diseases. Int. J. Mol. Sci. 19, 1743 (2018).
Article PubMed PubMed Central Google Scholar
Wang, G., Moffitt, J. R. & Zhuang, X. Multiplexed imaging of high-density libraries of RNAs with MERFISH and expansion microscopy. Sci. Rep. 8, 1–13 (2018).
Google Scholar
Tasic, B. et al. Adult mouse cortical cell taxonomy revealed by single cell transcriptomics. Nat. Neurosci. 19, 335–346 (2016).
Article CAS PubMed PubMed Central Google Scholar
Bulfone, A. et al. Pcp4l1, a novel gene encoding a Pcp4-like polypeptide, is expressed in specific domains of the developing brain. Gene Expr. Patterns 4, 297–301 (2004).
Article CAS PubMed Google Scholar
Schreibing, F. & Kramann, R. Mapping the human kidney using single-cell genomics. Nat. Rev. Nephrol. 18, 347–360 (2022).
Article PubMed Google Scholar
Stewart, B. J., Ferdinand, J. R. & Clatworthy, M. R. Using single-cell technologies to map the human immune system — implications for nephrology. Nat. Rev. Nephrol. 16, 112–128 (2019).
Article PubMed Google Scholar
Lake, B. B. et al. An atlas of healthy and injured cell states and niches in the human kidney. Nature 619, 585–594 (2023).
Article ADS CAS PubMed PubMed Central Google Scholar
Lake, B. B. et al. A single-nucleus RNA-sequencing pipeline to decipher the molecular anatomy and pathophysiology of human kidneys. Nat. Commun. 10, 1–15 (2019).
Article ADS CAS Google Scholar
Kuppe, C. et al. Decoding myofibroblast origins in human kidney fibrosis. Nature 589, 281–286 (2020).
Article ADS PubMed PubMed Central Google Scholar
Muto, Y. et al. Single cell transcriptional and chromatin accessibility profiling redefine cellular heterogeneity in the adult human kidney. Nat. Commun. 12, 1–17 (2021).
Article ADS Google Scholar
Palla, G. et al. Squidpy: a scalable framework for spatial omics analysis. Nat. Methods 19, 171–178 (2022).
Article CAS PubMed PubMed Central Google Scholar
Bryan, J. P., Cleary, B., Farhi, S. L. & Eldar, Y. C. Sparse recovery of imaging transcriptomics data. In 2021 IEEE 18th International Symposium on Biomedical Imaging (ISBI). 802–806 (Nice, France 2021). https://doi.org/10.1109/ISBI48211.2021.9433927.
Extracellular matrix-cell interactions. Focus on therapeutic applications. Cell. Signal. 66, 109487 (2020).
Article Google Scholar
Wang, Y. et al. EASI-FISH for thick tissue defines lateral hypothalamus spatio-molecular organization. Cell 184, 6361–6377.e24 (2021).
Article CAS PubMed Google Scholar
Ding, S.-L. et al. Comprehensive cellular-resolution atlas of the adult human brain. J. Comp. Neurol. 524, 3127–3481 (2016).
Article PubMed PubMed Central Google Scholar
GitHub - kharchenkolab/gpsFISH: Optimization of gene panels for targeted spatial transcriptomics. GitHub. https://github.com/kharchenkolab/gpsFISH.
Zhang, Y. et al. Gene panel selection for targeted spatial transcriptomics. Genome Biol. 25, 35 (2024).
Li, H. Aligning sequence reads, clone sequences and assembly contigs with BWA-MEM. arXiv Preprint at https://doi.org/10.48550/arXiv.1303.3997 (2013).
Marstal, K., Berendsen, F., Staring, M. & Klein, S. SimpleElastix: A user-friendly, multi-lingual library for medical image registration. In Proceedings of the IEEE conference on computer vision and pattern recognition workshops. 134–142 (2016).
Schindelin, J. et al. Fiji: an open-source platform for biological-image analysis. Nat. Methods 9, 676–682 (2012).
Article CAS PubMed Google Scholar
Preibisch, S., Saalfeld, S. & Tomancak, P. Globally optimal stitching of tiled 3D microscopic image acquisitions. Bioinformatics 25, 1463–1465 (2009).
Article CAS PubMed PubMed Central Google Scholar
Stringer, C., Wang, T., Michaelos, M. & Pachitariu, M. Cellpose: a generalist algorithm for cellular segmentation. Nat. Methods 18, 100–106 (2021).
Article CAS PubMed Google Scholar
Pachitariu, M. & Stringer, C. Cellpose 2.0: how to train your own model. Nat. Methods 19, 1634–1641 (2022).
Article CAS PubMed PubMed Central Google Scholar
Efron, B. & Hastie, T. Computer Age Statistical Inference, Student Edition: Algorithms, Evidence, and Data Science. 298–321 (Cambridge University Press, 2021).
van der Walt, S. et al. scikit-image: image processing in Python. PeerJ 2, e453 (2014).
Article PubMed PubMed Central Google Scholar
Wolf, F. A., Angerer, P. & Theis, F. J. SCANPY: large-scale single-cell gene expression data analysis. Genome Biol 19, 15 (2018).
Article PubMed PubMed Central Google Scholar
Virshup, I. et al. The scverse project provides a computational ecosystem for single-cell omics data analysis. Nat. Biotechnol. 41, 604–606 (2023).
Article CAS PubMed Google Scholar
Virshup, I., Rybakov, S., Theis, F. J., Angerer, P. & Wolf, F. A. anndata: Annotated data. BioRxiv 2021-12 https://doi.org/10.1101/2021.12.16.473007 (2021).
Lause, J., Berens, P. & Kobak, D. Analytic Pearson residuals for normalization of single-cell RNA-seq UMI data. Genome Biol 22, 1–20 (2021).
Article Google Scholar
Choudhary, S. & Satija, R. Comparison and evaluation of statistical error models for scRNA-seq. Genome Biol 23, 1–20 (2022).
Article Google Scholar
Traag, V. A., Waltman, L. & van Eck, N. J. From Louvain to Leiden: guaranteeing well-connected communities. Sci. Rep. 9, 1–12 (2019).
Article CAS Google Scholar
Kalhor, K. & Chen, C.-J. DART-FISH (Kalhor, Chen et al.) human brain motor cortex. https://doi.org/10.6084/m9.figshare.23932863.v1 (2023).
Kalhor, K. & Chen, C.-J. DART-FISH (Kalhor, Chen et al.) human kidney cortex. https://doi.org/10.6084/m9.figshare.23937057.v1 (2023).
DART-FISH. https://doi.org/10.5281/zenodo.8253772.
KPMP Schematics of the Nephron and Renal Corpuscle. https://doi.org/10.48698/DEM4-0Q93.

Download references

Acknowledgements

The authors would like to thank Drs. Prashant Mali, Reza Kalhor, Van Ninh, Eric Griffis, Xiaohua Huang and Bing Ren for useful discussions and comments on this work. The authors would like to acknowledge Kimberly Conklin, Huy Lam and other members of the Zhang lab for their support. This work was supported by NIH grants U01MH098977 (to K.Z., J.C., J.F.), U01MH114828 (to K.Z., J.C., P.V.K.), UG3/UH3DK114933 (to K.Z., S.J.), U54HL145608 (to K.Z., S.J., P.V.K.), R01AG065541 (to J.C.).

Author information

These authors contributed equally: Kian Kalhor, Chien-Ju Chen.

Authors and Affiliations

Department of Bioengineering, University of California San Diego, La Jolla, CA, USA
Kian Kalhor, Chien-Ju Chen, Ho Suk Lee, Matthew Cai, Mahsa Nafisi, Richard Que, Yixu Yuan, Jinghui Song, Blue B. Lake & Kun Zhang
Program in Bioinformatics and Systems Biology, University of California San Diego, La Jolla, CA, USA
Chien-Ju Chen
Department of Electrical Engineering, University of California San Diego, La Jolla, CA, USA
Ho Suk Lee
Sanford Burnham Prebys Medical Discovery Institute, La Jolla, CA, USA
Carter R. Palmer & Jerold Chun
Program in Biomedical Sciences, University of California San Diego, La Jolla, CA, USA
Carter R. Palmer
Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
Yida Zhang & Peter V. Kharchenko
Altos Labs, San Diego, CA, USA
Xuwen Li, Blue B. Lake, Peter V. Kharchenko & Kun Zhang
Department of Medicine, Washington University School of Medicine, St. Louis, MO, USA
Amanda Knoten & Sanjay Jain
Department of Pathology and Immunology, Washington University School of Medicine, St.Louis, MO, USA
Joseph P. Gaut & Sanjay Jain
Department of Laboratory Medicine and Pathology, University of Washington School of Medicine, Seattle, WA, USA
C. Dirk Keene
Allen Institute for Brain Science, Seattle, WA, 98103, USA
Ed Lein
Illumina, San Diego, CA, USA
Jian-Bing Fan

Authors

Kian Kalhor
View author publications
You can also search for this author in PubMed Google Scholar
Chien-Ju Chen
View author publications
You can also search for this author in PubMed Google Scholar
Ho Suk Lee
View author publications
You can also search for this author in PubMed Google Scholar
Matthew Cai
View author publications
You can also search for this author in PubMed Google Scholar
Mahsa Nafisi
View author publications
You can also search for this author in PubMed Google Scholar
Richard Que
View author publications
You can also search for this author in PubMed Google Scholar
Carter R. Palmer
View author publications
You can also search for this author in PubMed Google Scholar
Yixu Yuan
View author publications
You can also search for this author in PubMed Google Scholar
Yida Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Xuwen Li
View author publications
You can also search for this author in PubMed Google Scholar
Jinghui Song
View author publications
You can also search for this author in PubMed Google Scholar
Amanda Knoten
View author publications
You can also search for this author in PubMed Google Scholar
Blue B. Lake
View author publications
You can also search for this author in PubMed Google Scholar
Joseph P. Gaut
View author publications
You can also search for this author in PubMed Google Scholar
C. Dirk Keene
View author publications
You can also search for this author in PubMed Google Scholar
Ed Lein
View author publications
You can also search for this author in PubMed Google Scholar
Peter V. Kharchenko
View author publications
You can also search for this author in PubMed Google Scholar
Jerold Chun
View author publications
You can also search for this author in PubMed Google Scholar
Sanjay Jain
View author publications
You can also search for this author in PubMed Google Scholar
Jian-Bing Fan
View author publications
You can also search for this author in PubMed Google Scholar
Kun Zhang
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

K.Z. and J.B.F. conceived the in situ decoding concept. C.J.C. performed human brain experiments. K.K. performed human kidney experiments, created the data processing pipeline and analyzed the data. K.K., C.J.C. and M.N. optimized the protocol on human tissues. H.S.L., M.C. and R.Q. performed early developments. C.D.K. and E.L. contributed human brain sections. C.R.P. and J.C. prepared human brain sections and helped with interpretation of the brain data. Y.Y. performed computational layer boundary detection in the human brain. J.S. provided key suggestions on protocol optimization and cell type annotation of the human brain. X.L. assisted in the mouse kidney experiment. Y.Z. and P.V.K. performed gene selection for human kidney with input from B.B.L. and S.J. A.K., J.P.G. and S.J. contributed human kidney sections, performed histology and review. S.J. and B.B.L. helped with interpretation of the kidney data. K.K., C.J.C. and K.Z. wrote the manuscript with suggestions from all authors. K.Z. supervised the project.

Corresponding author

Correspondence to Kun Zhang.

Ethics declarations

Competing interests

K.Z. and J.B.F. are listed as inventors in a patent related to the method described in this manuscript. All remaining authors declare no competing interests.

Peer review

Peer review information

Nature Communications thanks Carolina Wählby, Jialiang Yang and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. A peer review file is available

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Reporting Summary

Peer Review File

Description of Additional Supplementary Files

Supplementary Data 1

Supplementary Data 2

Supplementary Data 3

Source data

Source Data

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Kalhor, K., Chen, CJ., Lee, H.S. et al. Mapping human tissues with highly multiplexed RNA in situ hybridization. Nat Commun 15, 2511 (2024). https://doi.org/10.1038/s41467-024-46437-y

Download citation

Received: 16 August 2023
Accepted: 23 February 2024
Published: 20 March 2024
DOI: https://doi.org/10.1038/s41467-024-46437-y
Springer Nature Limited

Mapping human tissues with highly multiplexed RNA in situ hybridization

Abstract

Similar content being viewed by others

Introduction

Results

DART-FISH framework

Benchmarking and validation of DART-FISH

Organization of cell types in the human primary motor cortex

Mapping cellular neighborhoods in histopathologically abnormal human kidney

Discussion

Methods

Human tissue samples

Human brain

Human kidney

Reagents and enzymes

Gene selection

Probe design and production

DART-FISH probe design

Large-scale padlock probe production

DART-FISH

Reverse transcription and cDNA crosslinking

Padlock probe capture

RCA and rolony crosslinking

Image acquisition

Human Brain

Human Kidney

RNAscope

Sample preparation

Image acquisition

RNAscope data processing

DART-FISH data processing (DF3D)

Sparse deconvolution (SpD) decoding

Estimating channel-specific coefficients

Setting the elastic net regularization parameter

Spot calling

Spot filtering

Spot assignment to cells

Cell annotation

Annotating the Brain data set

Drawing cortical layer boundaries

Gene concordance analysis

Annotating the kidney data set

Cell-cell interaction analysis

Comparison of decoding methods

Reporting summary

Data availability

Code availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Peer review

Peer review information

Additional information

Supplementary information

Source data

Rights and permissions

About this article

Cite this article

Share this article

Search

Navigation