Bits and pieces of history

The lively history of (hetero)chromatin dates back from the nineteenth century to today’s vivid debate about its relation and influence on gene regulation (Fig. 1). Steady progress in microscopy including fixation and staining techniques led to the discovery of cells and, finally, the observation of the nucleus described first by Robert Brown in 1831 (Brown 1831). Brown’s observation immediately raised the questions: “what is the nucleus made of?” and “what is its purpose?”

Fig. 1
figure 1

Timeline compilation of landmark discoveries and concepts (gray) on molecular (red) and cellular (blue) aspects of chromatin. Image sources are (1) Wikipedia; (2) MPI for the History of Science, Berlin, Germany; (3) nobelprize.org; (4) www.dipbot.unict.it/erbario_fr/Baccarini.html; (5) portrait by Esko Suomalainen in 1948; (6) http://www.che.ac.uk/what-we-do/conrad-waddington/; (7) Rockefeller University, NY, USA; (8) Canadian Medical Hall of Fame, Canada; (9) Cold Spring Harbor Laboratory Archives, NY, USA; (10) http://dnaandsocialresponsibility.blogspot.de/2010/09/dna-story-at-kings-hidden-dna-workers.html; (11) http://projects.exeter.ac.uk/lampbrush/people.htm; (12) http://hermes.mbl.edu/events/2007_events_friday/events_friday_07_27_07.html; (13) http:/wbworkshop.com/wb.html; (14) Courtesy of Prof. Niels Ringertz; (15) Deutsche Gesellschaft für Humangenetik, Germany; (16) Bowdoin College, USA; (17) UMass Amherst, USA; (18) Stanford University, Stanford Report (Photo, Linda Cicero), USA; (19) Department of Human Genetics, LMU Munich, Germany; (20) Kirchhoff Institute of Physics, University of Heidelberg, Germany; (21) ETH, Zürich, Swiss; (22) Universität Göttingen, Germany; (23) Birmingham University, UK; and (24) http://epigenome.eu/en/4,14,0

Answers were found to the first question by means of chemical analysis between the 1870s and the early twentieth century. In the search for the components of the nucleus, a first success was Friedrich Miescher’s discovery of a “highly phosphorous organic acid” initially introduced as “nuclein” in 1871 (Miescher 1871) and subsequently renamed “chromatin” by Walther Flemming in 1879 (Flemming 1879), due to its affinity to aniline dyes. Further progress on the major building bricks of chromatin was made with Albrecht Kossel’s discovery of histones in 1884 (Kossel 1911). Finally, with Phoebus Levene’s work (Levene 1903; Levene and La Forge 1915) on the chemical composition of nucleotides and the identification of deoxyribose in 1928, all major chromatin components were identified (reviewed in Choudhuri 2003). Each discovery resulted in intense debates on the question: what is its purpose?

Fueled by the aftermath of Charles Darwin and Alfred R. Wallace’s evolution theory (Darwin and Wallace 1859; Darwin 1859), Wilhelm Roux predicted, already in 1883, chromatin’s complexity and importance (reviewed in Cremer and Cremer 2006). The observation of chromosomes (a term suggested by Waldeyer 1888), chromosome configuration (“Rabl configuration”), and their nonrandom distribution during cell division finally culminated in Walter Sutton (Sutton 1903) and Theodor Boveri’s (Boveri 1904) theory called as “Sutton–Boveri chromosome theory of inheritance” (reviewed in Cremer and Cremer 2006).

While chemical analysis deciphered the composition of chromatin, its organization within the cell nucleus was limited by the microscopic techniques available. However, in 1908, the Italian botanist Pasquale Baccarini reported dark stained bodies in the interphase nucleus of plants, which could be distinguished from nucleoli. Baccarini named those structures “chromocentri” or chromocenters (Baccarini 1908). Although little was known about the nature of chromocenters, they where postulated to be condensed parts of chromosomes by Rosenberg (1903). Finally, in 1928, Emil Heitz introduced the term heterochromatin and euchromatin (Heitz 1928) to describe chromosomal regions that behave differently in terms of staining and compaction throughout the cell cycle. Heitz defined heterochromatin as staining intensely and remaining compacted during the cell cycle, while euchromatin would stain lightly and disappear during telophase. He further observed a spatial relationship between heterochromatin and nucleoli. As chromocenters always stained strongly, it was rightly assumed that they consist of heterochromatin. Despite more than a century of efforts to define the structure and function of chromatin, many of the basic mechanisms that govern the formation of differentiated chromatin types remain poorly understood, and their biological role is largely unknown.

Heterochromatin: cytological versus molecular definition

As described above from light microscopy observations, the cytological definition of heterochromatin is rather straightforward but also very general. This can be best appreciated in the electron micrograph image shown in Fig. 2a, where electron dense heterochromatin is visualized inside the nucleus, mostly around the nucleolus and lining the inner surface of the nuclear envelope.

Fig. 2
figure 2

Heterochromatin: in need of definition? Historically and from a cytological point of view, Emil Heitz (see Fig. 1) distinguished hetero and euchromatin. a Within an exemplary electron microscopy (EM) picture (left) of a mouse liver cell nucleus (N nucleus, Nu nucleolus, NE nuclear envelope), heterochromatin appears as electron dense in contrast to the more open state of euchromatin. b With the recent advent of high-throughput epigenomics, molecular features (histone and DNA modifications) have been assigned to particular chromatin states and are shown in the simplified graphic enlarged in the center. c The cell cycle dynamics and cytological organization of the very condensed chromatin structures around the centromeres can be appreciated in the fluorescence light microscopy (LM) pictures (right) of M phase and interphase cells. FISH-stained mouse metaphase chromosomes and interphase cell with probes against pericentric heterochromatin (black) and DNA counterstaining (gray) are shown. Condensed pericentric heterochromatin regions from multiple chromosomes cluster together in the interphase cell nucleus forming the so-called “chromocenters.” Cytological and molecular definitions have not yet been conclusively linked together. Scale bars EM 0.5 μm and LM 2 μm

From a molecular viewpoint, chromatin contains DNA and also proteins; the major components of which are the basic histone proteins (reviewed in Woodcock 2006). Histones form repeating octameric complexes called nucleosomes consisting of two of each of the core histone proteins H2A, H2B, H3, and H4 (Kornberg 1974). This constitutes the first level of chromatin compaction. Several higher order organization levels (reviewed in Woodcock and Ghosh 2010) have been proposed and give rise to increasing chromatin compaction, with the highest compaction being achieved in metaphase chromosomes (Fig. 2c). Even though the core histones are mainly the same throughout the chromatin, different histone modifications have been correlated with and, henceforth, serve as markers for different types of chromatin (reviewed in Kouzarides 2007). Of special importance are acetylation and methylation marks (Fig. 2b). The positive charge of lysine residues is neutralized through acetylation, which has been proposed to weaken the interactions between histones and DNA, leading to a more open chromatin structure, mostly correlated with the euchromatin state and gene expression. In contrast to acetylation, methylation of lysine 9 in histone H3 has been correlated with constitutive heterochromatin and gene silencing. Histone modification cannot only help to distinguish between hetero and euchromatin but also to distinguish facultative from constitutive heterochromatin, as the former is enriched in histone H3 methylated at lysine 27 (reviewed in Trojer and Reinberg 2007). With the identification of a growing number of histone modifications influencing chromatin structure, a “histone code” (Fig. 1) was hypothesized by which these posttranslational modifications could be combinatorially read by other factors and translated into different gene expression states (Strahl and Allis 2000). Potential readers of this histone code are proteins such as heterochromatin protein 1, polycomb group proteins, and many others (reviewed in Jenuwein and Allis 2001).

Methylation can, in addition, be found as a DNA modification at the position 5 of the cytosine base and is enriched in heterochromatin. How DNA methylation controls gene expression is still an open question, and direct as well as indirect mechanisms are discussed (reviewed in Leonhardt and Cardoso 2000). Methylation on the DNA level is interpreted by a family of methyl-CpG binding proteins (reviewed in Brero et al. 2006) that recruit chromatin regulators (e.g., histone deacetylases), inducing conformational chromatin changes leading to repression of gene expression (Nan et al. 1998; Zhang et al. 1999).

With the advent of high-throughput epigenomics, a catalog of molecular marks has been assigned to particular chromatin states. In this context, regions of increased gene density (RIDGEs) and opposing anti-RIDGEs (gene poor regions) were defined also, taking into consideration CpG islands, CG content, and number of short and long interspaced elements (SINEs and LINEs) (Caron et al. 2001). Recently, an association study of a set of molecular marks lead to the further discrimination of chromatin into five main types (Filion et al. 2010) (Fig. 1, “colorful chromatin”).

In spite of all the recent progress in this area, the cytological and molecular definitions of (hetero)chromatin have not yet been conclusively and comprehensively linked together. Furthermore, our understanding of the higher order architecture of chromatin and its functional consequences is far from satisfactory.

Heterochromatin: a transcriptional silencing compartment?

One of the most important epigenetic roles of heterochromatin was recognized very early on. In 1930, Muller (1930) discovered that Drosophila flies treated with X-rays developed random color patterns of white and brown patches in the eyes. He could show that by random mutation, the white gene locus was translocated adjacent to heterochromatic regions and, thereafter, silenced. This effect was named position effect variegation (PEV). Further studies (Demerec and Slizynska 1937) broadened the knowledge about PEV, showing that genes in direct heterochromatic neighborhood were silenced before more distal genes. Altogether, these experiments showed that usually active genes get silenced just by being in the vicinity of heterochromatin and lead to the development of the concept of heterochromatin spreading. A similar effect was reported in different organisms for genes translocated to telomeric chromosomal regions and referred to as telomeric position effect variegation (TPEV) (Gehring et al. 1984; Horn and Cross 1995; Gottschling et al. 1990). (T)PEV is based on cis chromosomal effects, i.e., genes are affected by heterochromatin proximity within the same chromosome. Interestingly, recent work in Caenorhabditis indicated that large transgenic repeated arrays of tissue-specific gene promoters become heterochromatinized and gene activation within these repeats lead to looping away from the heterochromatic subnuclear domain (Meister et al. 2010). A similar looping out of heterochromatin effect upon transcription factor expression of a transgene integrated within satellite repeat-rich heterochromatin was also observed in mice (Lundgren et al. 2000). In both studies though, looping away from the heterochromatin was not always accompanied by gene activation.

In Drosophila, mouse, and plant cells, constitutive heterochromatin is clustered into chromocenters during interphase as depicted exemplarily in a mouse interphase cell in Fig. 2c. Chromocenters contain pericentric heterochromatin, which resides close to the centromeres and is enriched in satellite repeat sequences. Whether heterochromatin can also influence gene expression in trans has been more recently investigated by several groups (see below and Table 1) but remains controversial likely due to the lack of normalization standards to account for morphological and technical variability.

Table 1 Studies correlating gene expression and heterochromatin proximity

Initial studies in hematopoietic cells showed a correlation between gene activity and colocalization with chromocenters (Fig. 3a). In this study, B lymphocytes were investigated, and a relocation of active genes away from chromocenters was observed, whereas silenced genes were found to colocalize with chromocenters (Brown et al. 1997). However, the number of genes studied was relatively limited, and only genes relevant to the hematopoietic system were evaluated. Follow-up studies also mainly concentrated on the hematopoietic cell system, but soon, others organisms and cell types were examined as well (Table 1). All these experiments were performed in fixed cells, thus capturing only a snapshot in time and without taking chromatin mobility into account (>2 μm in about 4 h in G1 phase cells; Walter et al. 2003). The first study that took into consideration the role of chromatin mobility was performed on Drosophila eye disks, comparing distances from three genes to heterochromatin in cells with active versus inactive gene loci (Harmon and Sedat 2005) (Fig. 3c). They could show a clear relation of gene silencing and proximity to heterochromatin of the same or different chromosomes. Although these studies established a correlation between gene (re)position to pericentric heterochromatin and repression, others found no support for this correlation (Sabbattini et al. 2001; Takizawa et al. 2008), suggesting that the effects might be different for different loci.

Fig. 3
figure 3

Strategies to investigate the role of heterochromatin (proximity) in gene silencing. The schemes (left) present alternative approaches to determine gene (red and blue) to heterochromatin (green) proximity in the nucleus and their respective data outcome (right). a Classic evaluation is based on simple colocalization of genes and heterochromatin yielding a clear yes or no outcome. b Radial distribution analysis within the cell nucleus can be performed in shells of the same distance (equidistance) or of the same volume (equivolume). Both methods yield a fluorescence intensity (FI) based distribution. Depending on which shell distribution is chosen, either interior or exterior genes are underestimated. c Comparing gene to heterochromatin absolute distances in 3D is a very unbiased method but does not take into account morphological variability. d The quantile-based approach combines absolute distance measurements (in 3D) with single-cell normalization (software and audiovisual tutorial available at http://www.cardoso-lab.org/publications/Randomizer.zip). Simulation and measurement of random (virtual) spots (gray) results in a random background distribution that normalizes for morphological differences. For each procedure and outcome, references are indicated below. An annotated list of studies is given in Table 1

Since heterochromatin does not only accumulate in form of chromocenters but also at the periphery of mammalian nuclei, recent studies have concentrated on the nuclear periphery as a silencing compartment. This is of special interest as human cells do not exhibit clear heterochromatic clusters comparable to those found in mouse cells.

The first indication of the periphery being a transcriptional regulator was found in yeast and later in mouse myogenesis (Lee et al. 2006). In yeast, telomeres are enriched at the nuclear periphery, leading to an increased concentration of silent information regulator (Sir) proteins which cause silencing of genes (Maillet et al. 1996). Sir2 deacetylates histone tails recruiting other Sir proteins, leading to spreading of silencing (reviewed in Gasser and Cockell 2001). Silencing at the periphery through tethering of genes could also be observed for some genes albeit not for others in mouse fibroblasts (Reddy et al. 2008), mouse astrocytes (Takizawa et al. 2008), and human cancer cells (Meaburn and Misteli 2008) (see also Table 1).

Using a molecular tagging approach based on expression of DNA adenine methyltransferase (DamID) fusion to lamina components, genome-wide association studies with the nuclear lamina have been performed. Genes within lamina-associated domains (LADs) were generally found to have a lower expression level than genes outside of LADs (Peric-Hupkes et al. 2010; Guelen et al. 2008). As the DNA modification by the Dam fusion protein requires only a transient interaction with the nuclear periphery, it would be important to validate in situ the outcome of these large-scale studies.

Heterochromatin and gene positioning: methods and challenges

The in situ heterochromatin association studies, described above, have limitations regarding their analysis (Fig. 3b). They either score association with the lamina at the nuclear periphery by simple colocalization (Lee et al. 2006) or by using radial positioning within the cell nucleus (Takizawa et al. 2008; Szczerbal et al. 2009; Meaburn and Misteli 2008). Scoring colocalization is problematic because it does not take gene mobility into account. Using a shell-based analysis (reviewed in Ronneberger et al. 2008) poses the additional problem of how the shells should be placed and whether their dimensions have biological meaning. In general, two possibilities of shell placement can be found in the literature: equidistant (Szczerbal et al. 2009) or equivolume shells (de Nooijer et al. 2009). Using one or the other can change the outcome substantially as shown on Fig. 3b. A further experimental challenge posed by the different data sets is the variability of nuclear morphology. Different cells show a great variety of shapes and sizes which is often altered upon differentiation. This challenge is acknowledged by many researchers in the field (Meaburn and Misteli 2008). Furthermore, when correlating gene positioning and heterochromatin, the distribution of heterochromatin within it also plays a role.

To circumvent this conundrum and be able to compare morphologically variable cells, we recently developed a single-cell-based normalization protocol, which compares, in each single cell, the random background distribution of distances from several thousands of simulated locations and the actual position of the gene of interest (Jost et al. 2011). The fraction of random distances smaller than the distance of the actual gene to heterochromatin is taken as the quantile value (Fig. 3d). This value is not only normalized for the specific cell size and chromatin distribution but also robust to variations of image acquisition and threshold settings. To test and validate the approach, the location of pluripotency genes in mouse stem cells, induced pluripotent stem cells, and fibroblasts was compared. The outcome did not corroborate the hypothesis that heterochromatin acts as a gene silencing compartment (Jost et al. 2011). With these new computational tools at hand, genome-wide unbiased studies should be able to settle this question. Furthermore, extending the investigations to several different in vitro and in vivo differentiation systems and organisms will be important to ascertain their general biological validity.

In summary, gene position inside or outside of heterochromatin and gene expression is not yet convincingly linked. Thus, the question persists “does a gene’s location matter?” In support of this, the evidence so far shows that genes are not randomly positioned within the nucleus, and furthermore, their location is influenced by the respective chromosomal neighborhood.

Outlook

Even though heterochromatin is found in many species including mouse, human, and flies, and to a lower extent, even in yeast, it varies extensively in composition and architecture. The same is true for nuclear size, shape, and morphology. Both parameters (molecular composition and in situ organization) need to be integrated in future studies. Heroic attempts to perform high-throughput molecular analysis in single cells are ongoing, and the link to 3D chromatin organization might be just around the corner.