Introduction

It is now clear that the cell nucleus is spatially organized. For example, specific genes are found more frequently in certain radial positions or at certain nuclear hallmarks, and specific chromosomes have a tendency to be found near to each other and to be positioned more towards either the nuclear interior or periphery. In all cases, this organization varies among different cell types or tissues and appears to be probabilistic: the structures of interest are located near to each other at rates well above chance but far from 100 % of the time. How this probabilistic organization arises remains unclear.

One particularly intriguing form of nuclear organization is the tendency for co-expressed genes to be located in a nuclear neighborhood. This can be accomplished either by arranging genes on the linear sequence of a chromosome or by positioning otherwise-separated loci near each other in the nuclear space. In the latter case, this gene positioning is not absolute, but as found in single cell analyses it arises at frequencies in the range of about 1 % to more than 30 % and occurs at distances <1 μm [14].

Positioning co-expressed genes next to each other on the linear sequence of the chromosome is often seen in Prokaryotes where co-regulated genes are arranged in “operons”. A similar organization was observed in Saccharomyces cerevisiae [5], Caenorhabditis elegans [6], mouse [7], and other higher Eukaryotes [811]. Moreover, spatial partitioning of chromosomes into (~100-kb—megabase-sized) topological domains is emerging as a basic feature of the regulatory architecture of eukaryotic genomes [1214].

There is also evidence for non-random positioning of distant genes, located Mbps apart or on different chromosomes. Single-cell analyses revealed spatial positioning at close nuclear locations in the sub-micrometer range for active [14, 15] as well as for inactive genes [16, 17]. Moreover, the use of genome-wide chromosome conformation capture (3C)-based assays, such as 4C, e4C, or ChIA-PET, interrogating large populations of cells, revealed many potential interactions of active and inactive genes in different cell types [4, 1821]. However, even within cells of the same type, i.e., erythroid cells, there are different conclusions about how close spatial positioning of otherwise distant genes might occur [14].

One model is based on SC35 splicing speckles [2, 3, 22, 23]. Frequent sharing of the same splicing speckle has been demonstrated for α- and β-globin and other erythroid-specific genes in human erythroblast cells [2, 3] as well as for key adipogenic genes in porcine adipocytes [23].

An alternative model for spatial organization is based on shared RNA polymerase II organized in “factories”. These so-called “transcription factories” are discrete sites in the nucleus composed of ∼4–30 RNA polymerase molecules, and are associated with many other molecules involved in transcriptional activation and mRNA processing. Several hundreds to thousands of these sites exist in eukaryotic nuclei and at each of them multiple different genes may be transcribed at the same time [2428]. Examples supporting this model are the Myc and Igh genes in mouse B lymphocytes, which are located on different chromosomes [15] or the Hba or Hbb genes in mouse erythroid cells, which are positioned together with other erythroid genes at shared polymerase factories despite the fact that the loci are on different chromosomes [4].

A third model that has been considered is that spatial positioning of genes could arise from a more global organization of the genome within the nucleus. It is known that some chromosomes tend to be near each other [7, 29, 30], and that specific genes may generally be positioned at specific locations within the nucleus [7, 31, 32].

These three different models operate on potentially different spatial scales. Genes associated with the same nuclear body should be in shared neighborhoods comparable to the size of the body, namely ~40–198 nm for a transcription factory and ~0.5–2 μm for a speckle. Higher-order levels of organization based on chromosome folding and relative chromosome positioning are expected to operate on a spatial scale of at least several microns.

In many previous studies, only one of the preceding mechanisms for positioning of genes into nuclear neighborhoods has been investigated, making it uncertain to what extent other mechanisms might also contribute. In this study, we systematically explore the spatial organization of seven co-expressed genes in human multipotent adipose-derived stem (hMADS) cells [33] and by considering all of the three preceding models we ask how the positioning of genes might arise. We predict that merely adjacent positioning of chromosome territories cannot conclusively explain our observations on gene positioning in spatial neighborhoods. Rather, we find that close positioning of active genes can be explained by random association with nearby SC35-enriched splicing speckles and RNA polymerase II factories, but within restricted nuclear domains to which genes must be prepositioned based on a conserved higher spatial organization of the genome. The frequency of association with factories or speckles is determined by the relative number of these two types of nuclear bodies present within the restricted sub-volume. Thus, we propose a model that integrates all three alternative explanations for gene positioning described above, and that can also potentially account for observations made on different genes in different systems.

Results

Non-random spatial topology prepositions gene in the nucleus

A bioinformatic analysis of gene expression microarray data of adipogenesis in hMDAS cells [34] revealed 12 groups of coordinately expressed genes (supplemental information Fig. S1). We chose seven genes (C4orf18, GLUL, SAMHD1, AGTR1, ANGPTL1, FKBP5, SPON2; see also Table 1) from one of these groups and asked if they are spatially organized by specific positioning. We used 3D mFISH to simultaneously localize all seven genes in ~400 nuclei of preconfluent (80–90 % confluence) and induced hMADS cells (Fig. 1a). We reasoned that the topology of these genes in the nucleus is reflected by their relative distances to each other. Therefore it should be possible to compare different nuclei by comparing these distances. We measured the center-to-center distance between each pair of genes and since the two alleles of a gene are indistinguishable by DNA-FISH, we considered only the shorter distance between the two alleles of a given gene pair. These distances were normalized by the nuclear volume and for each nucleus a vector of normalized distances was created. We then calculated the pairwise Pearson correlation coefficients between these vectors and visualized them as heatmaps.

Table 1 Genomic locations and BAC clones
Fig. 1
figure 1

Spatial positioning of coordinately expressed genes (a). Left maximum intensity projections (xy, xz, yz) of DNA mFISH in hMADS cells are shown. Seven different genes were labeled using all possible combinations of three fluorophores, namely Alexa Fluor 488 (green), Rhodamine (orange) and Alexa Fluor 647 (purple)—scale bar = 5 μm. Middle signals of the individual fluorophores as detected by a spot detection algorithm. Right identification of the seven single genes by decoding the color code. be Correlation analysis of nuclear organization. Distance vectors were used to describe the spatial organization of seven coordinately expressed genes in each nucleus. Each pixel of the shown heatmaps represents a color-coded −log10(p) value (p < 0.05) of the correlation between two single nuclei. To the right of each heatmap, the color-coded scale for the −log10(p) values is shown. The upper triangles of the heatmaps display significant correlations between each pair of nuclei in 200 preconfluent cells (b), 110 induced cells (c), and in a comparison of both (d). The lower triangles show the results of a random permutation test. e Simulation of a random positioning model could not reproduce the observed topology correlations in hMADS cells (see text for details)

The upper triangles in Fig. 1b–e illustrate such heatmaps of significant (p < 0.05) correlations. Each square represents the color-coded −log10 p value of the corresponding pairwise correlation. Surprisingly, the spatial organization of the seven genes was significantly correlated in a high number of nuclei. We observed this in preconfluent (Fig. 1b) as well as in induced cells (Fig. 1c). Moreover, it appeared that this organization is preserved not only among nuclei in the same physiological condition but also between nuclei in the induced and uninduced states (Fig. 1d).

To eliminate the possibility that this high correlation could be obtained by chance, we randomly permuted the measured nuclear gene–gene distances within each vector and recalculated the correlation coefficients and p values. The lower triangles in Fig. 1 demonstrate that the correlation of vectors with the correct distance assignment could not be produced by an arbitrary distance assignment. Moreover, entirely random gene–gene distances (0 ≤ d ≤ mean diameter of hMADS nuclei) for seven simulated gene pairs, could also not reproduce the observed correlation (Fig. 1e).

Two of the studied genes, GLUL and ANGPTL1, are only separated by 3.5 Mbps on chromosome 1 (see Table 1), and therefore they are likely to be located at similar spatial distance in any cell under any condition. Thus, this will naturally increase the correlation and therefore we repeated the analysis by excluding one of the partners (GLUL). However, the spatial organization of the remaining six genes, was again significantly correlated in remarkable number of nuclei and was clearly different from random organization (supplemental information Fig. S2 a–c). In addition to GLUL, we also excluded C4orf18, which is located >158 Mbps away from SPON2 on chromosome 4 (see Table 1), but still the correlation was not strongly affected and indicated a non-random nuclear topology (supplemental information Fig. S2 d–f).

These results suggest that a higher-order and non-random spatial gene organization prepositions genes relative to each other into certain nuclear sub-volumes. This relative positioning is preserved between different nuclei and does not undergo dramatic changes upon induction of differentiation.

Coordinately expressed genes are frequently located in shared neighborhoods and this occurs more often with transcriptional upregulation

To further explore this nuclear organization, we next asked if there were any pairs of genes which tended to be in a shared sub-volume or “neighborhood”. Although five of the seven genes we examined were coordinately expressed, they were not near each other on the same chromosomes, so we would not expect to find these genes in spatial proximity within the nucleus based solely on their linear positions within the chromosomes. We scored genes as occupying a shared neighborhood if they were within a distance of 1 μm. This threshold should account for organization mediated by most nuclear bodies, which have a maximal diameter of ~1 μm, like transcription factories [28], splicing speckles [35], paraspeckles [36], PML bodies [37], or Cajal bodies [38].

We first analyzed induced cells, and found several gene pairs positioned in shared neighborhoods at unexpectedly high frequencies, namely: FKBP5 and SPON2 in 17.2 %, C4orf18 and SPON2 in 13 %, C4orf18 and FKBP5 in 12.8 %, and finally GLUL and ANGPTL1 in 68 % (Fig. 2a, supplemental information Table 2, Fig. S6 a–h). Typically, only one of the two alleles showed association with its partner, but all of these frequencies are much more than would be expected by pure chance (~0.55 % see below). Moreover, many of them are higher than determined by actual measurements of a pair of functionally unrelated housekeeping genes (ANXA6/COL6A1: 3.2 % n = 125) or as measured for the two housekeeping genes and a genomic control region that has no annotated protein coding genes (ANXA6/CR: 2.8 % n = 106, COL6A1/CR: 1.9 % n = 106) (see also Table 1, supplemental information Tables 5–7). We then repeated the gene-pair analyses using uninduced cells and compared the results (Fig. 2b, supplemental information Table 3) with data obtained from induced cells (Fig. 2a, supplemental information Table 2). Interestingly, there was a significant difference in frequency of close positioning between the two developmental states for two gene pairs. When cells changed from the uninduced to the induced state, we observed an increase from 1.8 to 12.8 % (Fisher’s exact test, p < 10−5) for C4orf18/FKBP5 and an increase from 8.2 to 17.2 % (Fisher’s exact test, p < 6 × 10−3) for FKBP5/SPON2 (supplemental information Table 4). This increase was not due to a reduction in nuclear size upon induction since we could not detect a significant difference in nuclear volume between induced and uninduced cells. Rather, this increase is more likely connected to the significant increase in the transcriptional activity of the three genes as measured by quantitative RT-PCR (C4orf18 and SPON2 ~2–8 fold, FKBP5 ~40 fold, Fig. 2c). In general, our quantification of absolute levels of expression revealed that six out of seven genes showed a low basal expression in preconfluent hMADS and were then strongly upregulated by hormonal induction, whereas SPON2 was already highly active in the uninduced state (Fig. 2d).

Fig. 2
figure 2

Coordinately expressed genes in spatial neighborhood. The center-to-center distance between each gene pair of these seven genes was measured in ~200 nuclei of induced (a) and ~200 preconfluent (b) cells. The heatmaps represent color-coded percentages of gene pairs found to be separated by a spatial distance of <1 μm. These pairs were then scored as positioned in spatial neighborhood. c, d RT-qPCR analysis of AGTR1, ANGPTL1, C4orf18, FKBP5, GLUL, SAMHD1, and SPON2 in hMADS cells (three biological replicates, error bars SE of means). c Fold changes between preconfluent and induced cells (24-h induction), y-axis in log2 scale. d Absolute expression levels in amol per μg total RNA as measured in preconfluent and induced cells (24-h induction). e Distribution of spatially neighboring gene pairs counted in nuclei of induced (black) and uninduced (white) cells. In induced cells, the number of gene pairs in spatial neighborhood per nucleus is significantly (p < 0.006) higher as compared to the number in uninduced cells. f Number of neighboring gene triplets in induced and preconfluent cells. A gene triplet is defined as a group of three genes in close proximity where the distance between any pair is <1 μm

Table 2 Measured association rates of SPON2 and FKBP5 at nuclear bodies
Table 3 Co-association rates of SPON2 and FKBP5 at nuclear bodies

We further verified the impact of transcriptional activity on neighborhood sharing by blocking transcription with 5,6-dichlorobenzimidazole 1-β-D-ribofuranoside (DRB), which is known to inhibit transcriptional elongation [3941]. When we treated induced hMADS cells for 1 h with 50 μg/ml DRB and performed dual-color DNA-FISH using probes for FKBP5 and SPON2, we detected a significant reduction in neighborhood sharing, which dropped from 17.2 to 7.6 % (Fisher’s exact test p < 0.005, n = 185). The neighborhood sharing frequency of FKBP5/SPON2 in DRB-treated preconfluent hMADS cells dropped only moderately from 8.3 to 5.1 % (Fisher’s exact test p > 0.23, n = 177), which we assume is due to the already very low expression of FKBP5 in uninduced hMADS cells (see Fig. 2d). It is important to note that DRB treatment did not irreversibly damage gene transcription in hMADS cells, as nascent transcription recovered in DRB-treated cells after a 45-min incubation time in normal growth medium (see supplemental information Fig. S7).

To test if there is a general tendency for closer gene positioning in induced cells, we counted all pairs of genes within 1 μm—diameter neighborhoods in both states and compared the two distributions (Fig. 2e). We found that there was a significant (t test, p < 0.006) shift towards a higher frequency of close gene–gene positioning in the induced state. The same could be observed when we extended the analysis from gene pairs to gene triplets (Fig. 2f). Finally, we verified that the genes within shared neighborhoods were transcriptionally active, by application of dual-color RNA-FISH experiments using intron probes for FKBP5 and SPON2 (Fig. 3).

Fig. 3
figure 3

RNA-FISH showing FKBP5 and SPON2 in spatial neighborhood. Representative images obtained from dual-color RNA-FISH experiments in induced hMADS cells show that transcriptionally active FKBP5 (yellow) and SPON2 (green, red in a, e, f) are frequently localized in the spatial neighborhood. In such cases, the transcripts sometimes partially overlap (a, b), or show no overlap but directly contact each other (c, d), or are slightly separated (ei)—scale bar 5 μm

Taken together, these results show that the seven coordinately expressed genes—chosen only based on their expression profile—can share the same neighborhoods and that the frequency at which this positioning occurs can increase upon induction of differentiation and concomitant transcriptional up-regulation. We observed this for pairs which either lie on different chromosomes or in the case of C4orf18 and SPON2 reside on the same chromosome but are widely separated (>158 Mbps). As expected, we also observed neighborhood sharing for two of the genes, GLUL and ANGPTL1, which are closely located on chromosome 1. Thus, we next investigated how spatial positioning of widely spaced genes on different chromosomes might arise.

Frequent close chromosome territory location is not sufficient to explain positioning of specific gene pairs in shared neighborhoods

Chromosome territories (CTs) can be preferentially positioned near each other [7, 30], and furthermore intermingling of neighboring territories can occur more [42] or less [43, 44] extensively. Thus, it is possible that the observed frequency at which specific gene pairs were located in shared neighborhoods could simply reflect a high frequency of close territory positioning and intermingling.

Therefore we examined the chromosome territory locations of three gene pairs: #1—SPON2 and FKBP5; #2—GLUL and SPON2; #3—GLUL and FKBP5. Gene pair #1 showed relatively high spatial association (17.2 %), while gene pairs #2 and #3 showed relatively low association (1.8 and 2.4 %) (Fig. 2a, b). We co-hybridized specific whole chromosome painting probes for chromosome 1, 4, and 6 to preserved nuclei and recorded >80 3D-stacks of uninduced and induced hMADS cells (Fig. 4a).

Fig. 4
figure 4

Spatial organization of chromosome territories (a). Whole chromosome painting and 3D reconstruction of chromosome 1 (green), 4 (blue), and 6 (red) in induced hMADS cells—scale bar = 5 μm. The representative examples demonstrate that the chromosome territories tightly associate and intermingle. b Upper panel distribution of 3D-distances measured between each CT pair in 80 nuclei of induced (white bars) and preconfluent (black bars) hMADS cells. Shown are the percentages of chromosome territory pairs starting from either touching or overlapping (CT distance 0) up to a distance of 4 μm (binning 0.5 μm). The distributions of the individual pairs did not significantly differ from each other (p > 0.05). The red triangles indicate bins with statistically significant (p < 0.05) differences between preconfluent and induced cells. Lower panel degree of intermingling between individual chromosome territory pairs in induced (white) and preconfluent (black) cells. % combined CT volume overlap is the co-occupied fraction of the cumulative volume of two CTs

We found that a considerable fraction (27.9 %) of the inspected CT pairs (CT 4/6) of gene pair #1 were located directly adjacent and touched or partially overlapped each other (CT distance ≤0 μm, Fig. 4b). However, we obtained very similar numbers when we measured the distances of the CT pairs of gene pair #2 and #3 (CT 4/1—29.1 %, CT 1/6—31.3 %). These three spatial distributions did not significantly differ (t test, p > 0.05). This suggests that close CT positioning alone is not sufficient to explain the more frequent close positioning of the genes in pair #1 in comparison to the genes in pairs #2 and #3. Rather, it indicates that there exists an additional higher-order organization and/or additional factors which specifically situate genes in a more confined sub-volume.

Genes in shared neighborhoods frequently associate with the same SC35 splicing speckles

An alternative explanation for close positioning of coordinately expressed genes has come from a study in erythroid cells, which showed that active erythroid genes are frequently localized at SC35-enriched speckles in human intermediate erythroblasts [2, 3]. To test if speckle association was present in the hMADS cells, we performed triple-color RNA immuno-FISH and used an antibody against SC35 and intron probes for FKBP5 and SPON2. We found that 68 % of the FKBP5 and SPON2 RNA-FISH signals that were separated by less than 1 μm were also contacting the same SC35 speckle (Fig. 5b) (n = 121). This means that of the 17.2 % of nuclei that showed close positioning of these two genes, 68 % of these spatial associations arose from association of the two genes with a shared splicing speckle. Thus, splicing speckle association can account for submicron association of these two genes in 0.68 × 17.2 = ~12 % of total nuclei.

Fig. 5
figure 5

Immuno-FISH in induced hMADS cells. a An optical section where primary transcripts of FKBP5 (yellow) and SPON2 (green) are located at the same RNA polymerase II focus (purple) which was detected by the H5 antibody. b RNA immuno-FISH performed in induced hMADS cells shows FKBP5 (yellow) and SPON2 (green) contacting the same SC35-enriched splicing speckle (purple). c DNA immuno-FISH performed in induced hMADS-CLK2GFP (−DOX) cells shows FKBP5 (red) and SPON2 (yellow) contacting the same SC35-enriched splicing speckle (blue)—scale bar 5 μm, the insets show zoomed details of single channels and corresponding overlays. d Frequencies at which FKBP5 and SPON2 DNA-FISH signals were spatially neighboring in induced hMADS-CLK2GFP (−DOX) before (n = 141), and in induced hMADS-CLK2GFP (+DOX) after complete dispersal of speckles (n = 142)

A model with completely random gene-pair/SC35 contacts is not sufficient to explain the observed speckle co-associations

We first considered a random model to explain the SC35 speckle co-association frequencies that we observed for the gene pair FKBP5/SPON2. Speckles are quite large in size (0.5–2 μm) and therefore they occupy a large volume in the nucleus, increasing the odds that two genes would encounter the same speckle by chance. We tested this possibility in a simulation by randomly placing two genes (diameter 320 nm) into 3D-reconstructed nuclei derived from images obtained by immunofluorescence staining of SC35. We scored the genes as neighboring if their center-to-center distance was <1 μm. Each pair of neighboring genes was then scored as a positive hit if they both shared at least one voxel of the same speckle. We ran our algorithm for as many iterations (~180,000 iterations/nucleus) as needed to obtain 1,000 neighboring gene pairs in a single nucleus (~0.55 % chance). Out of these 1,000 simulated neighboring pairs, only 5.2 ± 1.2 % were positive hits (co-associating with the same speckle), which is only a ~0.03 % chance in total (0.55 × 5.2 %).

Our simulation results indicate that although speckles occupy a large volume in the nucleus, merely chance co-association of genes with these large bodies can only account for ~0.03 % of genes in shared neighborhoods, whereas we observed a 68 % rate of shared speckles for neighboring FKBP5 and SPON2. This suggests that the genes must reposition to speckles and be retained there for some amount of time in order to explain our measured association rate.

Dispersal of SC35 splicing speckles results in a significantly decreased frequency of shared neighborhoods

If SC35 splicing speckles indeed promote specific gene–gene positioning in induced hMADS cells, then their removal should change the spatial relation of FKBP5 and SPON2. It was previously shown that overexpression of serine-arginine (SR) protein kinase cdc2-like kinase Clk 1, 2, and 3 leads to the disassembly of splicing speckles and redistribution of SC35 proteins [45, 46]. To use the SR protein kinase as a tool to disperse the SC35 speckles in hMADS, we generated doxycycline-inducible hMDAS-CLK2GFP cells for conditional overexpression of a GFP-tagged version of human Clk2. Addition of doxycycline to these cells led to disassembly of the splicing speckles as a complete redistribution of SC35 splicing factors could be observed by immunofluorescence (supplemental information Fig. S4). We performed 3D DNA immuno-FISH in this system to study the impact of splicing speckle loss on the spatial positioning of FKBP5 and SPON2. Analogous to normal hMADS, these cells were fixed and hybridized 24 h after hormonal stimulation. We recorded 3D DNA immuno-FISH image stacks of both hMADS-CLK2GFP (−DOX), as shown in Fig. 5c, and CLK2 overexpressing hMADS-CLK2GFP (+DOX). From the doxycycline-treated group, we considered only cells lacking speckles and we measured the center-to-center distance between FKBP5 and SPON2 DNA-FISH signals in both groups (−DOX n = 141, +DOX n = 142, supplemental information Fig. S3f). Interestingly, we found that there was a significant (Fisher’s exact test p < 0.004) reduction in the occurrence of close spatial positioning—dropping from 9.2 % down to 1.4 %—when CLK2 was overexpressed and the SC35 speckles were absent (Fig. 5d), which corresponds to a ~85 % reduction. It should be noted that the frequency of close positioning in −DOX hMDAS-CLK2GFP cells was lower than the one we measured in normal hMADS cells, which we reason is mostly due to the higher passage number that was reached during the generation and cultivation of the transduced cells. In earlier differentiation experiments we have seen that higher passage number hMADS cells did not respond to hormonal stimulation to the same extent as cells with low passage number.

Splicing often occurs co-transcriptionally [47], so we asked if dispersal of SC35 splicing speckles might also be affecting gene expression. We performed quantitative RT-PCR with hMADS-CLK2GFP +DOX/−DOX and found that expression of the seven tested genes was decreased in the doxycycline-treated cells (supplemental information Fig. S5). Three of these genes, namely AGTR1, FKBP5, and SPON2, showed a significant reduction in the amount of transcripts (SPON2: −40 %, FKBP5: −50 %, AGTR1: −68 %). We then used immuno RNA FISH (SC35/FKBP5/SPON2) to identify cells expressing FKBP5 or SPON2 and found that <8 % of the cells expressing these genes were lacking intact speckles (FKBP5: 7.9 % n = 101, SPON2: 5.8 % n = 104). Thus, the two tested genes are mostly expressed in cells that have intact speckles (the subpopulation of cells where co-transduction did not succeed, compare also supplemental information Fig. S4), suggesting that dispersal of speckles also affects their transcription.

In summary, these results indicate that the presence of SC35-enriched splicing speckles correlates with the establishment of spatial gene–gene relationships in the sub-micrometer range and disruption of speckles decreases the transcription rate of the tested genes.

Spatially neighboring genes can share the same RNA polymerase II factories

Our analysis of speckles showed that 68 % of genes in shared neighborhoods could be accounted for by association with these bodies. Their disruption led to a decreased frequency (~85 % reduction) of neighborhood sharing and negatively affected gene expression. Thus, some of this loss in neighborhood sharing could also be caused by a loss of sharing RNA polymerase II transcription factories [1, 4], which are known to mediate the clustering of active genes in erythroid cells. To determine the fraction of neighborhood sharing that is based on factory sharing of FKBP5/SPON2, we repeated RNA immuno-FISH in induced hMADS cells using the H5 antibody against the Ser2-phosphorylated elongating form of RNA polymerase II. We found that in 17.5 % (n = 80) of the cells in which both genes were in shared neighborhoods, they were also associated with the same RNA polymerase II focus (Fig. 5a). From these results, we conclude that juxtapositioning of FKBP5 and SPON2, which we observed in 17.2 % of the cells (Fig. 2b), can be explained by association with the same polymerase II focus in 0.175 × 17.2 = ~3 % of the nuclei in total. Note that this number is in the same order of magnitude as found for other co-expressed genes on different chromosomes [1, 3, 4]. Thus, when we assume that also 17.5 % of the 9.2 % FKBP5/SPON2 neighborhood sharings in hMDAS-CLK2GFP (−DOX) cells are due to factory sharing, we calculate a 0.175 × 9.2 = 1.6 % association with the same RNA polymerase II factory in total in these cells, which almost exactly corresponds to the association frequency of 1.4 % that remains in hMDAS-CLK2GFP (+DOX) cells after dispersal of speckles (see Fig. 5d). This suggests that loss of factory association in such cells is not the main cause of the loss in neighborhood sharing.

In sum, we find that speckles and factories together can account for most of the active genes in shared neighborhoods, namely ~68 % (speckles) plus ~17 % (factories) or ~85 % of the cases of shared neighborhoods.

Gene pre-positioning is required to account quantitatively for the observed frequencies of association with SC35 speckles and transcription factories

The simplest model for gene–gene association at a nuclear body presumes that each of the two genes must by chance be recruited to one such body in the nucleus (for example to accomplish transcription or splicing). Thus, the odds that two different genes associate with a particular body in the nucleus are inversely proportional to the number of such bodies present in that nucleus. This probability can be calculated explicitly using the formula derived by Shopland et al. [22]. To apply this formula for the case of speckle association, we first counted the average number of speckles per nucleus (32 ± 6) and also measured the frequency of speckle association individually for SPON2 (99 %) and for FKBP5 (51 %). Using these measured values (see also Table 2) and the formula from Shopland et al. [22], we obtained a probability of ~6.4 % ± 1.2 % at which FKBP5 and SPON2 should be at the same speckle (Table 3). However, this is well below the observed frequency of 12 %, suggesting that a purely random recruitment to any one of the speckles in the nucleus cannot explain the association of these two genes.

One way to achieve a higher predicted frequency of gene–gene association at the same speckle would be to reduce the number of speckles available for association. This is a plausible possibility since we found that all seven genes were already non-randomly positioned in the nucleus before induction. Thus, if FKBP5 and SPON2 were relatively close before induction, then upon transcriptional upregulation they would be expected to sample predominantly this smaller sub-volume for an available speckle.

As a simple means to define sub-volumes for FKBP5 and SPON2, we used their radial position within the nucleus, which has been frequently used before as a measure to define preferred positioning of genes in the nucleus [17, 48, 49]. We found that 80 % of the FKBP5 and SPON2 DNA FISH signals were located in the four innermost nuclear shells (Fig. 6a, b). We then assessed how often the two genes were positioned in shared neighborhoods within each shell and observed that most of the neighborhood sharing (91.9 % which corresponds to ~16 % of the nuclei) also occurred in the inner four shells of the nuclear volume.

Fig. 6
figure 6

Radial gene positioning. The frequency with which a gene was located in a shell was recorded and the distribution was plotted against the shell number (a). The radial position of genes was determined by assigning the corresponding DNA-FISH signals to ten radial shells of equal volume (b). FKBP5 and SPON2 were frequently found in the nuclear interior and 80 % of the signals were assigned to the inner four shells where we found 92 % of the spatial associations of the two genes. In contrast, the location frequency of ANGPTL1 and GLUL was shifted towards to nuclear periphery and these two genes showed a very low frequency of association with either FKBP5 or SPON2 (see also Fig. 2 a, b)

We then calculated the probability of random recruitment to the same speckle in this confined sub-volume and found that this probability increased significantly from 6.4 ± 1.2 % (whole nucleus) to 9.8 ± 1.9 % (sub-volume). This increase occurred because the average number of speckles within the sub-volume decreased to 20 ± 4 compared to 32 ± 6 in the whole nucleus (Table 2), thereby increasing the odds that the two genes would be randomly recruited to the same speckle. Now the resultant estimate of ~10 % for the frequency of gene associations is close to the measured frequency of ~12 % obtained in the preceding section by tabulating when the two loci were found at the same speckle (Table 3). Since ~92 % of close positioned FKBP5/SPON2 was located in the inner four nuclear shells, the predicted numbers are consistent with our observations (0.92 × 12 % = 11 %). Thus, the data suggest that the close positioning of the genes can be explained by stochastic association with nearby speckles located within a restricted nuclear sub-volume, rather than a specific or biased recruitment to a unique speckle.

We next applied the preceding model to transcription factories. As for the calculation with speckles, we measured the individual frequency of factory association for SPON2 (90.5 %) and for FKBP5 (93.5 %), plus the average number of factories in the nuclear sub-volume (Table 2) and used the values in the same formula from Shopland et al. According to this calculation, the estimated frequency of random association with a unique factory was 2.0 ± 0.2 %, which is close to our measured frequency of 3 % for association with the same factory (Table 3). The predicted lower frequency of association with the same factory compared to the same speckle is related to the increased number of factories per cell (~403 ± 61) compared to the number of speckles per cell (~32 ± 6). It should be noted that the H5 antibody that we used in immuno-FISH experiments detects most of the actively elongating transcription factories, since nearly all RNA-FISH signals (>90 %, see Table 2) were located at polymerase II foci and we counted a similar number of factories (432 ± 94, n = 7) in dedicated H5 immunofluorescence stainings.

The preceding results show that we can quantitatively account for the association of two genes in a shared neighborhood based on random associations with speckles and factories within a sub-volume defined by preferred radial positioning. These preferred radial positions presumably arise because FKBP5 and SPON2 are both located in gene-rich domains (89 genes within ten megabase pairs). As we have seen for FKBP5 and SPON2, such domains tend to bias genes to the nuclear interior [49, 50]. Two other genes, GLUL and ANGPTL1, which we analyzed in our cohort, are located in gene-poor domains (59 genes within ten megabase pairs). We found that these two genes were more often positioned in the nuclear periphery (Fig. 6a). Importantly, we also found that the peripherally located genes GLUL and ANGPTL1 rarely shared the same neighborhood with the more interiorly located genes FBKB5 and SPON2 (Fig. 2a, b). These results are consistent with our model in which shared neighborhoods arise because transcriptionally active genes tend to associate with nuclear bodies in their immediate vicinity.

Discussion

Overview

We studied the spatial organization of active genes in a human stem cell model. After analyzing gene expression profiles during the transition from uncommitted cells to mature adipocytes, we identified co-expressed genes and then asked if they showed spatial organization within the nucleus. We examined seven co-expressed genes and found that they were pre-positioned relative to each other according to a non-random topology. This organization was preserved between different nuclei and did not change dramatically after hormonal stimulation.

Within this global organization, we then investigated pair-wise relationships between the co-expressed genes. We found that upon hormonal stimulation and concomitant transcriptional upregulation, five of the seven genes shared 1-μm-diameter neighborhoods with another co-transcribed gene at frequencies higher than expected by chance. Except for GLUL and ANGPTL1, the studied genes were not near each other on the same chromosomes, so these shared gene–gene neighborhoods must have arisen from some set of organizational principles.

Such gene–gene associations have been studied before and different models have been proposed to explain them (e.g., [14, 15]). Our results suggest a new hybrid model that combines features from several previously proposed and apparently contradictory models. We find that prior to induction the genes are prepositioned such that upon transcriptional upregulation they are more likely to encounter nearby speckles and transcription factories. This initial pre-positioning thereby increases the odds of sharing the same SC35 splicing speckle or RNA polymerase II transcription factory to a level that agrees quantitatively with the frequency of shared neighborhoods that we have measured.

Factors influencing shared gene–gene neighborhoods

We defined a shared neighborhood as two genes that were within a distance of <1 μm. This is approximately the diameter of an average nuclear body so it allowed us to investigate how association with nuclear bodies can influence gene positioning within the nucleus. Note however that genes positioned within 1 μm of each other in a fixed cell will occasionally be much closer in live cells due to the diffusion of chromatin, which is known to occur in the sub-micron to micron range [51]. Indeed, it has been shown that interaction signals measured by 4C robustly correlated with distances of up to 1 μm as measured by 3D DNA-FISH [52].

We first considered the naive explanation that intermingling of CTs [42, 44] could explain the frequencies at which gene pairs were found in the same spatial neighborhood. However, we found that the spatial relations between CTs bearing a pair of genes (FKBP5/SPON2) that frequently shared the same neighborhood did not differ significantly from the spatial relation between CTs carrying genes that did not share the same neighborhood. This suggests that CT positioning alone is not sufficient to explain the observed differences in neighborhood sharing and indicates that additional factors might specifically situate genes in shared neighborhoods.

The frequency at which genes were located in shared neighborhoods rose upon increased expression levels and decreased upon inhibition of transcription, hence we tested if genes in a spatial neighborhood were associated with nuclear bodies involved in transcription. We focused on two different nuclear bodies that were previously considered to mediate spatial gene organization, namely SC35 splicing speckles [2, 3] and RNA polymerase II transcription factories [1, 4, 15].

We found that in 68 % of the cases in which FKBP5 and SPON2 were in a spatial shared neighborhood they also associated with the same splicing speckle. It is important to note that these bodies are large in size (~0.5 to ~2 μm). We therefore considered the possibility that pure-chance co-association of genes with a large volume of speckles is enough to explain the measured association frequencies. A computer simulation of this model, however, could not reproduce observations we made in hMADS cells. Thus, it seems unlikely that the two genes share the same neighborhood only by chance; rather it suggests that it is mediated by the presence of speckles.

In fact, we showed that the loss of speckles significantly reduces the frequency at which FKBP5 and SPON2 are found in shared neighborhoods, suggesting that the presence of speckles promotes spatial gene–gene association. It is known that most splicing is occurring co-transcriptionally [47] and that the splicing factor SC35 has an important role in transcriptional elongation [53]. Association of highly active genes with the periphery of speckles brings them to an environment with highly increased concentrations of splicing factors and therefore enables more efficient transcription and processing. Accordingly, in CLK2 overexpressing cells, we measured decreased expression of the seven tested genes. Moreover, FKBP5 and SPON2 were almost exclusively expressed in cells that had intact speckles (= subpopulation of cells where co-transduction did not succeed, see Fig. S3), while <8 % of cells expressing theses genes were lacking intact speckles. In addition to the reduction in the local concentration of splicing factors, the constant overexpression of CLK2 (48 h in our experiments) may lead to imbalanced phosphorylation states of SR proteins with negative effects on splicing processes and in further consequence also on transcription.

Besides speckles, which could account for mediating a substantial fraction of shared neighborhoods, we found a second fraction (17.5 %) of nuclear neighborhoods in which FKBP5 and SPON2 shared the same transcription factory. In fixed cells, factory sharing accounted for the most closely positioned pairs, separated by no more than the diameter of a factory (40–198 nm [54, 55]). It is possible that our measurement of this 17.5 % fraction (3 % of all inspected cells) might be an underestimate, since a recent study found that elongating RNA polymerase II phosphorylated at Ser2 may move out of factories following initiation or early elongation [56]. In addition, it was suggested that two types of transcription factories exist: “poised factories”, containing RNA polymerase II phosphorylated on Ser5, but not Ser2, and “active factories”, which are phosphorylated on both residues [57]. Here we used the H5 antibody, which is directed against the Ser2 phosphorylated form of RNA polymerase II and so we exclusively detected the active type of factories. Moreover, it was previously shown that genes may be localized at polymerase II foci even in the absence of transcriptional elongation [58], suggesting association with poised factories. In line with this observation, it was also reported that the DNA:DNA interactions of different loci as measured by 3C and 4C were still present after DRB treatment [19]. When we treated induced hMADS cells with DRB, we observed a significant drop in the frequency at which FKBP and SPON2 shared the same 1 μm neighborhood, however, there was still a fraction (~2.7 %) that was closely associated (center to center distance <0.5 μm). It is possible that this fraction of FKBP5/SPON2 pairs in our DRB-treated hMADS cells corresponds to the genes that had been sharing the same active—but now poised—transcription factory.

In sum, we found that both speckles and factories could contribute to co-association of the FKBP5 and SPON2 genes in shared neighborhoods. Note that because these genes are sometimes transcribed at the same speckle and other times shared the same factory, their coordinate expression is not likely caused by their association with a particular body. Rather, it seems more likely that their association with these bodies is caused by their coordinate expression since genes that are transcriptionally active at the same time are more likely to be randomly recruited to the same body. Hence, it seems unlikely that sharing the same nuclear body is essential for their coordinate expression or is of significant advantage for the cell.

Quantitative analysis of gene–gene co-association supports a hybrid model

To quantitatively predict the gene–gene co-association at a specific body (speckle or factory), we first presumed a model in which two genes are recruited to any of the bodies present in a nucleus. The probability for sharing a body was calculated based on a formula derived from Shopland et al. [22]. We found that this simple model could not account for the observed frequencies at which two neighboring genes shared a nuclear body. Our analysis on the global gene topology though, suggested a non-random positioning, which does not change dramatically upon hormonal stimulation. In addition, we observed gene density related to radial positioning [49, 50], e.g., FKBP5 and SPON2, which are both located in gene-rich domains, were often positioned in the nuclear interior, whereas GLUL and ANGPTL1, which are located in gene-poor domains showed peripheral nuclear positioning (Fig. 6a). When we considered the information about radial positioning in our calculation, the number of bodies in the radial reach of a gene was reduced. Consequently the probability that two neighboring genes are recruited to a shared body increased, which resulted in a fairly good prediction of the measured values (Table 3). Specifically, we estimate that the model can account for ~75 % of the observations in which FKBP5 and SPON2 were positioned in spatial neighborhoods. This estimate was obtained by accounting for both speckle and factory sharing, minus the overlap between these two populations (see supplemental information for details of this calculation).

Taken together, we propose that a non-random spatial organization prepositions genes into preferred nuclear sub-volumes and association with factories or SC35 speckles occurs stochastically within the radial range of a gene. In this model, the frequency of co-association with a shared body is determined by the limiting number of bodies within the nuclear sub-volume.

Application of the proposed model

Our proposed model for shared gene neighborhoods may help account for the disparate results reported in several previous studies where either speckles [2, 3] or transcription factories [1, 4, 15] were identified as the predominant body mediating specific gene positioning. The Osborne and Schoenfelder studies examined mouse erythroid cells where there are relatively few polymerase factories (~100–300) and no well-defined speckles. Their observation that gene–gene associations occur at shared factories could be expected, given the limited number of factories. Further restricting the number of factories is the fact that “specialized” transcription factories—containing a specific transcription factor—are preferentially shared, as Schoenfelder et al. have suggested. Conversely, in the erythroid cells examined by Brown et al., the fibroblast cells examined by Shopland et al., and the porcine adipocytes examined by Szczerbal and Bridger, well-defined speckles exist in relatively small numbers (~15–40). Although Shopland et al. [22] as well as Szczerbal and Bridger [23] did not report the number of factories in their cells, Brown et al. [3] estimated a large number (~1,100–1,900). Thus, at least in the erythroid cells examined by Brown et al., co-associations with speckles would be expected to predominate since there are far fewer speckles than factories.

Conclusions and outlook

In conclusion, we provide a straightforward explanation for our general observations of spatial organization of active genes in human stem cells with a model that may also be able to account for observations in other eukaryotic cells. Thus, it will be interesting to apply the procedures we have developed here to a larger set of genes in different co-regulatory clusters as well as to genes in other cell systems, in order to test whether the model can capture the basic principles of spatial organization of active genes in a wide variety of genes and cell types.

It is important to note that the approach of 3D FISH that we have used here allows for the analysis of gene–gene associations within single cells. This is in contrast to a number of recent genome-wide studies using 3C-based high-throughput methods like HiC, 4C, e4C, and ChIA-PET, which have revealed a large cohort of potential chromatin interactions and spatial topologies [4, 1821, 43, 52, 59]. Many of these interactions were cell type-specific and are thought to present the basis for cell-specific regulation of transcription. While these are powerful approaches, it will still be important in the future to continue to examine relationships within individual cells. Here high-throughput imaging of 3D FISH preparations could reveal the diversity across the cell population. Combined with the quantitative analyses that we have developed here, this should allow for a more thorough dissection of the rules governing nuclear organization.

Materials and methods

Gene selection by microarray data analysis

We examined microarray expression profiles [34] to select seven genes. Briefly: co-expressed genes were determined by the k-means clustering (k = 12). Cluster 10 (supplemental information Fig. S1) was further subdivided (k = 7) to obtain seven genes with different genomic locations: C4orf18—Chr 4 159.3 Mbps, GLUL—Chr 1 180.6 Mbps, SAMHD1—Chr 20 35.0 Mbps, AGTR1—Chr 3 149.9 Mbps, ANGPTL1—Chr 1 177.1 Mbps, FKBP5—Chr 6 35.7 Mbps, SPON2—Chr 4 1.1 Mbps.

Cell culture

hMADS cells were grown on coverslips as described in [34]. Preconfluent cells were fixed at 80–90 % confluence. For induction, the medium was exchanged as previously described [34] and the cells were fixed after 24 h. For Clk2 overexpression, hMADS-CLK2GFP were treated likewise and ~24 h before induction of adipogenesis 1 μg/ml doxycycline was added to the medium.

Generation of hMDAS-CLK2GFP cells for conditional overexpression of CLK2

The lentiviral plasmid for conditional overexpression of human CLK2 was generated by LR-recombination (GATEWAY, Life Technologies, Carlsbad, CA/USA) of pDONR223-CLK2 (Addgene, Cambridge, MA/USA) with pHR-TetCMV-DEST-IRES-mGFP5 as previously described [60]. For the production of lentiviral particles, HEK293T cells were transfected either with 5 μg pHR-TetCMV-hCLK2-IRES-mGFP5, 5 μg pHR-SFFV-TetRKRAB-IRES-Puro [60], or pHR-SFFV-rtTAM2-T2A-Puro (Ploner et al., in prep) together with 3 μg pSPAX2 and 3 μg pMD-G (kindly provided by D. Trono). hMADS cells were first transduced with lentiviral particles encoding TetR-KRAB and rtTAM2 sequences to enable tetracycline (doxycycline) controlled expression of hCLK2 in a tight system (co-transduction with tetR-KRAB). Puromycin-resistant hMADS cells were further superinfected with lentiviral particles encoding hCLK2-cDNA under the control of the doxycycline-sensitive TetCMV promoter (pHR-TetCMV-hCLK2-IRES-mGFP5).

Cell fixation and permeabilization

The nuclei preservation procedure used in our study is based on fixation with PBS buffered paraformaldehyde (PFA), which was shown to sufficiently preserve large-scale chromatin structures at the level of conventional [61, 62] and super-resolution [63] light microscopy. We avoided the frequently used hypotonic methanol/acetic fixation since it is known that this method strongly affects the nuclear and chromatin morphology, instead we used a long (25 min) fixation in 3.5 % PFA following a short (1 min) permeabilization in CSK buffer (100 mM NaCl, 200 mM sucrose, 10 mM PIPES pH 6.8, 3 mM MgCl2) on ice supplemented with 0.5 % Triton X-100. The cells were then washed 2 × 5 min in PBS.

3D combinatorial M-FISH

Probe generation

BAC-DNA (ImaGenes GmbH, Berlin, Germany: RP11-89C4 for C4orf18, RP1-223H12 for GLUL, RP1-132F21 SAMHD1, RP11-505J9 for AGTR1, RP4-595C2 for ANGPTL1, RP3-368C2 for FKBP5 and RP11-20I20 for SPON2) was isolated and labeled by nick translation with BIO-16-dUTP (biotin), DIG-11-dUTP, (digoxigenin) or DNP-11-dUTP (dinitrophenyl). We used the three labels in all possible combinations (n = 3, 2n − 1 = 7 genes).

Hybridization mix

The hybridization mix was composed of 100 ng of each nick-translated probe, 56 μg salmon sperm DNA, and 40 μg human cot1 DNA (Life Technologies, Carlsbad, CA/USA) in 50 % formamide/2 × SSC 5 % dextran sulfate. Prior to hybridization, the DNA was kept for 10 min at 85 °C and pre-hybridized for 20 min at 37 °C.

In situ hybridization

In situ hybridization of was carried out as described in [64] but with the following modifications: cells were treated with RNase for 20 min, chromatin was denaturated for 4.5 min at 85 °C in 70 % formamide/2 × SSC directly followed by the ethanol series and hybridized over two nights at 37 °C. After the hybridization, a series of washes was performed: 2 × 15 min 50 % formamide/2 × SSC at 45 °C, 10 min 0.1 × SSC, 10 min 2 × SSC, and 5 min 4 × SSC.

Detection of the probes

Biotin labels were detected with streptavidin Alexa Fluor 647 (Life Technologies, Carlsbad, CA/USA) diluted 1:200 in 4 × SSC/0.1 % BSA/0.01 % Tween 20. Washes in 4 × SSC/0.1 % Tween 20 for 10 min, 4 × SSC for 2 × 10 min, 2 × SSC for 10 min, and PBS for 5 min followed the incubation step. The DIG and DNP labels with anti-digoxigenin-rhodamine (Roche Applied Science, Penzberg, Germany) and anti-dinitrophenyl-KLH Alexa Fluor 488 antibodies, diluted 1:200 in 4 % BSA/0.1 % Tween 20/PBS. After washing 3 × 5 min with PBS, the nuclei were counterstained with 0.05 μg/ml DAPI in PBS for 5 min.

RNA-FISH

The intron probes for FKBP5 and SPON2 were generated by PCR using specific primers (supplemental information Table 1). The PCR products were amine-modified by nick-translation using amino-allyl-dUTP, which was then coupled to an amine-reactive fluorescent dye (Cy3 for FKBP5, Alexa Fluor 488 for SPON2) following the ARES DNA Labeling Kit (Life Technologies, Carlsbad, CA/USA) instructions.

In situ hybridization

Fixed and permeabilized nuclei were dehydrated by a series of washes in 70, 90 %, and absolute EtOH on ice for 5 min each. The cells were air-dried and hybridized overnight at 37 °C with 5 μl of the hybridization-cocktail (FKBP5, SPON2 20 ng each, 2 × SSC, 5 % dextran sulfate, 1 mg/ml yeast t-RNA, 50 % formamide). After hybridization, the coverslip was washed 3 × 5 min in 50 % formamide/2 × SSC at 37 °C, 15 min 2 × SSC, and 5 min in PBS. The nuclei were counterstained with 0.05 μg/ml DAPI in PBS for 5 min.

RNA immuno-FISH

The mouse H5 antibody (Convance, Princeton, NJ/USA; Abcam, Cambridge, UK) and a mouse anti-SC35 (Abcam, Cambridge, UK) was used to detect active RNA polymerase II and splicing speckles. The antibodies were diluted 1:200 in 4 % BSA/0.1 % Tween 20/PBS containing 2 μl RNAse OUT (Life Technologies, Carlsbad, CA/USA) and incubated overnight at 4 °C. After three washes with PBS, an anti-mouse Cy5 secondary antibody (Abcam, Cambridge, UK) was diluted as described above, applied for 1 h at room temperature, and washed three times. The signal was fixed for 10 min with 3.5 % PFA in PBS. The RNA hybridization was performed as described above but the dehydration steps were omitted.

DNA immuno-FISH (SC35/FKBP5/SPON2)

Probe generation

BAC-DNA (ImaGenes GmbH, Berlin, Germany: RP3-368C2 for FKBP5 and RP11-20I20 for SPON2) was isolated and sonicated to a size of 200–500 base pairs using the Covaris S2 sonicator. The 500 ng of DNA fragments were directly labeled with AlexaFlour 3 (SPON2) and AlexaFluor 5 (FKBP5) using the BioPrime labeling system (Life Technologies, Carlsbad, CA/USA).

In situ hybridization and immunostaining

Cells grown on coverslips were fixed and permeabilized as described above followed by 5-min incubation with 0.1 N HCl. After washing with PBS, the cells were treated for 30 min with RNAse (20 μg/ml), washed again, and incubated in 2 × SSC for 10 min and then in 2 × SSC/70 % deionized formamide for 45 min. Then the coverslips were directly applied upside down onto the hybridization mix (120 ng probe for both, FKBP5 and SPON2, 6 μg human cot1 DNA, 20 μg salmon sperm DNA in 50 % formamide/2 × SSC 5 % dextran sulfate). The coverslip was sealed with rubber cement before denaturating for 4.5 min at 85 °C. The cells were hybridized at 37 °C overnight and washed 1 × 15 min 50 % formamide/2 × SSC 42 °C, 10 min 0.1 × SSC, 10 min 2 × SSC and 5 min in PBS. SC35 splicing speckles were detected as described in the RNA immuno-FISH section but with the following modifications, addition of RNAse OUT was omitted and an anti-mouse AlexaFluor 488 secondary antibody was used. Primary as well as secondary antibodies were incubated for 1 h at 37 °C or overnight at 4 °C. The nuclei were counterstained with 0.025 μg/ml DAPI in PBS for 5 min.

Chromosome painting

DOP-PCR amplified flow-sorted painting probes [65] for chromosomes 1, 4, and 6 (Applied Spectral Imaging Carlsbad, CA/USA) were hybridized following the procedure described in [66] with modifications: fixation: 15 min in 3.5 % PFA, 15-min permeabilization, over-night equilibration in 2 × SSC/70 % deionized formamide, chromatin denaturation 3 min at 72 °C. After an ethanol dehydration series, the coverslips were air-dried for 5 min and 2 μl of each painting probe—which were heat denaturated for 10 min at 80 °C and pre-hybridized for 20 min at 37 °C—was applied, and hybridized over two nights at 37 °C. Two stringency washes were performed: 4 min in 0.4 × SSC at 74 °C, 2 min 4 × SSC/0.1 % Tween 20.

Immunofluorescence

SC35 splicing speckles: Cells were fixed in 3.5 % PFA/PBS for 10 min and permeabilized for 15 min with 0.5 % Triton X-100 in PBS. Immunostaining of SC35 was performed as described in the DNA immuno-FISH section, but with the following modifications, an anti-mouse AlexaFluor 546 secondary antibody was used. RNA polymerase II (Ser2 phosphorylated): Cells were fixed for 20 min in 3.5 % PFA/PBS following a 1-min permeabilization in CSK buffer. Immunostaining was performed as described in the RNA immuno-FISH section.

BrUTP incorporation

The protocol followed that in [67], with the following modifications. The permeabilization buffer contained 10 μg/ml instead of 25 μg/ml digitonin, dexamethasone was omitted. The transcription reaction was run for 15 min at 37 °C. RNaseOUT (Invitrogen) was used instead of RNasin. Cells were fixed in 2 % PFA/PBS for 10 min after a 30-s permeabilization in CSK buffer + 0.5 % Triton X-100. Incorporated BrUTP was detected using the mouse anti-BrdU (sc-32323, Santa Cruz Biotechnology, Inc., Santa Cruz, CA/USA) and an anti-mouse AlexaFluor 488 secondary antibody.

Transcription inhibition experiments

Transcriptional elongation was inhibited by incubation in growth medium supplemented with 50 μg/ml DRB (Sigma-Aldrich, St. Louis, MO/USA). After 1-h incubation, the cells were prepared for DNA-FISH or BrUTP incorporation. Inhibited cells could be rescued by 45 min. Incubation in normal growth medium.

Quantitative real-time reverse transcriptase PCR (qPCR)

For quantitation of mRNA transcripts in hMADS cultures, the qRT-PCR standard curve method was applied. Standards were prepared by amplicon-selective PCR performed on total cDNA generated from hMADS total RNA. Obtained amplicons were purified and quantitated with Nanodrop (Peqlab, Erlangen, Germany). Standard concentrations in the range 100–0.0001 pM amplicon were applied for qRT-PCR together with the samples to be assayed, applying Platinum SYBR Green, the Viia7 real-time PCR technology and software from Life Technologies (Carlsbad, CA/USA). Primers for generating the standards and for the real-time PCR are found in supplemental information Table 1.

Microscopy, image, and data analysis

Microscopic image acquisition

Images were recorded on a Zeiss AxioImager Z1 epifluorescence microscope equipped with an AxioCam MRm CCD camera and a Heidenhain closed-loop system for highly precise and reproducible z-stacks of equidistant step sizes. The 3D-image stacks were collected using a 100 × Plan-Apochromat 1.4 NA oil immersion lens with an axial sampling distance of 200 nm producing a voxel size of 64.3 × 64.3 × 200 nm. In case of DNA mFISH, nuclei were chosen based on the DAPI stain and the presence of 8 Alexa Flour 488 signals. This minimized a potential bias since the combinatorial labeling of the seven genes was not revealed by looking at only one fluorochrome. In case of RNA-FISH and RNA immuno-FISH, nuclei were chosen by the presence of two hybridization signals.

Image restoration and 3D reconstruction

All recorded image stacks were processed by 3D-deconvolution running 100 iterations of the maximum likelihood estimation algorithm available with the Huygens Deconvolution Software (Scientific Volume Imaging—SVI http://www.svi.nl). The deconvolved image stacks were then corrected for color-shift errors.

For the detection and measurement of the FISH-signals, the processed image stacks were reconstructed in 3D. The reconstruction method was realized with Imaris (Bitplane Inc. South Windsor, CT/USA) using the ImarisXT interface and the Matlab (MathWorks Inc. Natick, MA/USA) programming language, which also included the image processing and statistics toolboxes. The procedure involved the following five steps:

(1) Removal of nuclei touching the lateral image borders, (2) generation of an isosurface in the DAPI channel that delineates the nucleus, (3) removing any extra-nuclear background by setting voxel intensities outside of the nuclear surface to zero in all channels, (4) detection of FISH signals by using the “DetectSpot” function of ImarisXT. By dynamically adjusting the intensity threshold, the expected number of Spots (eight per channel) could be detected, (5) identification of the combinatorial labeled genes by resolving the combinatorial labeling. This was achieved by k-means clustering of all detected spot coordinates from each channel. The k-value was thereby set to the number of genes detected by our approach, usually 14 (7 genes × 2 alleles). The corresponding gene names were then assigned to the centroids of the resulting cluster.

The reconstructed 3D image was used to generate the following data: the 3D coordinates of the detected genes, the center-to-center distances between the single genes, the volumes of the nuclei, and the axes’ lengths including the rotation matrix of the ellipsoid that approximates and describes a nucleus’ surface.

Radial positioning

Each nucleus was approximated by an ellipsoid, which was subdivided into ten shells of equal volume and shape. The frequency of gene localization in each shell was determined over all nuclei.

Frequency of random gene co-association with an SC35 speckle or transcription factory

We calculated the expected frequency of random association for one copy of a gene “a” with an SC35 speckle marked by another gene “b” using the following formula as described in [22]: \({\text{ef = }}\left( {\frac{ 1}{\text{nS}}} \right) \times f_{\text{a}} \times f_{\text{b}} \times 4\), where nS = the average number of speckles per cell and f a, f b = the independent frequencies of domain association for each gene. Based on the rules for propagation of error [68], we calculated the standard deviation for ef as\(\frac{{f_{\text{a}} \times f_{\text{b}} \times 4}}{{{\text{nS}}^{ 2} }} \times \sigma \left( {\text{nS}} \right)\), where \(\sigma \left( {\text{nS}} \right)\) was the measured standard deviation in the number of speckles per cell. The same formulae were used to determine the expected co-association frequencies with RNA polymerase II transcription factories. The average number of speckles or factories was determined by automatic image segmentation, and for the restricted nuclear domain calculations only the number of bodies in the inner four shells was counted.

Correlation of nuclear gene organization

We measured the center-to-center distance between FISH signals of each of the seven genes and used the shortest distance between all alleles of a given pair (2 + 2 alleles) for the analysis. For the general comparison of spatial gene–gene relations, the distances were normalized for the volume of the nucleus in which they were measured [69]. To determine whether the spatial organization of different nuclei correlates, we calculated the pairwise Pearson correlation coefficients of vectors containing the volume-normalized gene–gene distances. Only nuclei with no missing values were included in this analysis. Correlations were considered as significant if the p value of the correlation coefficient was smaller than 0.05. To test for the possibility that the observed correlations between nuclei could be obtained by a random assignment of the measured gene–gene distances, we randomly permuted the values of the inter-gene distance within the vectors representing a nucleus and recalculated the correlation coefficients. In addition, we repeated this analysis using random gene–gene distances (0 ≤ d ≤ mean diameter of hMADS nuclei).