Introduction

How to understand biological function has long been a subject of both philosophical and scientific debate focused on evolutionary biology. Philosophical assessments of biological function have often focused on whether function needs to be understood as a past product of natural selection (Wimsatt 1972; Millikan 1984; Neander 1991) or is aptly described by just what biological causes presently do (Cummins 1975; Hinde 1975; Boorse 1976). The former style of “selected effects” approach is historical—or “etiological,” in philosopher-speak. The latter style of “causal role” approach is often ahistorical but—at least in current biological discourse—nevertheless typically assumes that natural selection should be operating on these entities in the present (Allen and Neal 2020).Footnote 1

In the philosophy of biology, there is also a notion of “actual difference makers” as special targets of biological explanation and investigation (Waters 2007). According to this approach, functional genomic entities like certain stretches of DNA are actual difference makers—since different actual “values” for these causal “variables” (i.e., different sequences of the “same” stretch of DNA among various organisms within a population) can and do generate different actual “values” for intervened-on “variables” (such as RNA and protein) in that population, which can sometimes lead to different actual phenotypic and even fitness “values” for the different organisms in that population.

Here we demonstrate that there is another class of actual difference makers in biology: one which has often been overlooked in discussions of function, role, and import (likely due to the commonly conjoined emphasis on fitness and selection). These are genomic entities whose presence or absence alters the character of the cell and are thus influential—despite that these entities may not have been selected for, and that mutations in these regions do not necessarily affect the fitness of the organism. The presence of these entities generates cellular work, and their complete deletion would likely decrease the fitness of the organism; but their alteration by the typical mutation does not often affect fitness—at least not to the extent that such mutations are detected, eliminated, and/or preserved by organism-level natural selection. One main difference between these genomic regions and typically coding DNAFootnote 2 is not their causal specificity but rather their reach (or lesser extent thereof) and hence their efficacy (both terms described in more detail later in the text). Our aim is to provide a characterization of these overlooked causal entities and relationships, using junk DNA as an illustrative case.

Junk DNA

It is widely agreed (Ponting and Hardison 2011; Rands et al. 2014; Ponting 2017) that approximately 90% of the human genome is not subject to purifying selection and hence—at least according to typical accounts of function in molecular biology—is not functional. Despite this, we argue that paradigmatically “non-functional” DNA, commonly referred to as junk DNA, is still a difference maker (in Waters’ [2007] sense). The fact that there are entities which matter (as they affect cell physiology), yet which are not necessarily sculpted by organism-level natural selection, has profound implications for the evolution of humans, and multicellular life in general.

Junk DNA represents a large fraction of all eukaryotic genomes. Its existence is due to the fact that eukaryotic genomes contain selfish DNA entities (also known as transposable elements), which have the ability to replicate themselves (Doolittle and Sapienza 1980; Orgel and Crick 1980). It is believed that transposable element activity fueled the expansion of the eukaryotic genome. In all eukaryotic genomes examined thus far most transposable elements have accumulated inactivating mutations, with only a minority still having the ability to replicate. These dead transposable elements are a vast graveyard that constitutes much junk DNA.

One critical factor that permitted the expansion of transposable elements, and thus junk DNA, is the nucleus (Martin and Koonin 2006; López-García and Moreira 2006; Palazzo and Gregory 2014). The nucleus effectively divides the cytoplasm into two distinct compartments: the nucleoplasm, where DNA is transcribed into premature messenger RNAs (pre-mRNAs), which are then spliced to form mature messenger RNAs (mRNAs); and the cytoplasm, where these fully processed mRNAs are translated into proteins.Footnote 3 The segregation of mRNA processing machinery away from mRNA translation machinery has two main effects. First, this segregation ensures that for any given mRNA, its splicing is completed before it is allowed to be exported from the nucleus to the cytoplasm (Martin and Koonin 2006). This, in-and-of-itself, has several advantages. It ensures that pre-mRNA is free of translating ribosomes, which would likely collide into—and interfere with—the splicing machinery, and it prevents unprocessed pre-mRNAs from being translated into proteins (many of which are toxic). Hence, this segregation of RNA processing and translation increases the efficiency and fidelity of these two processes. Second, the nucleus/cytoplasmic divide acts as a quality control device. In most eukaryotic systems studied, it has been observed that unprocessed pre-mRNAs and misprocessed mRNAs are not efficiently exported to the cytoplasm, but instead retained in the nucleus and degraded by dedicated RNA decay machinery (Palazzo and Lee 2018). This severely reduces the translation of misprocessed mRNAs, whose protein products are potentially toxic.

So, what does this have to do with junk DNA? Well, DNA in general is a non-specific substrate for RNA polymerase. As a result, intergenic DNA is transcribed at a low level into junk RNA, also known as ‘transcriptional noise’ (Struhl 2007; Palazzo and Gregory 2014; Palazzo and Lee 2015). Like misprocessed mRNAs, most of this junk RNA is retained in the nucleus and eliminated by RNA decay machinery. This makes sense, since if this transcriptional noise was not eliminated and instead allowed to be translated into junk protein, then it would become a significant burden on the organism (Palazzo and Lee 2015). The exposure of the ribosome to misprocessed mRNAs and intergenic RNAs is very toxic in tissue culture cells (Ogami et al. 2017). By retaining junk RNA in the nucleus, and promoting its decay, junk DNA becomes less of a liability. In a sense, the nucleus and all of its associated quality control machinery acts as a global mechanism to suppress the deleterious effects of rampant mRNA mis-processing and transcriptional noise that is inherently present in cells with large junky genomes (Warnecke and Hurst 2011; Rajon and Masel 2011, 2013; Koonin 2016).

Eukaryotic cells have additional mechanisms to reduce the deleteriousness of junk DNA. For example, excess DNA is usually packaged into heterochromatin, which prevents it from being transcribed robustly, thus further limiting the amount of junk RNA that is produced.

Overall, these many processes in eukaryotic cells buffer against the deleteriousness of junk DNA. But these buffering systems are a double-edged sword: by relieving the selection pressure on organisms to get rid of this excess DNA, they also permit junk DNA to accumulate. It is thus possible that the amount of junk and the efficiency of systems that render it less harmful could have arisen in parallel as a sort of evolutionary arms race.

In addition, it is likely that many of these buffering systems are somewhat tuned to the amount of junk DNA they must contend with. Cells with large amounts of junk DNA will likely acquire secondary adaptive changes to help their quality control systems to better deal with this increased load. In addition, these quality control systems have also been co-opted to regulate the functional portion of the genome. For example, RNA decay machinery is also required to properly trim precursor ribosomal RNA to its mature form (Zinder and Lima 2017). Thus, a certain level of junk DNA becomes necessary for the entire system to work properly. In the absence of a certain level of junk RNA, RNA decay machinery will likely start to degrade functional RNAs (Doma and Parker 2007; Garland and Jensen 2020; Wang and Cheng 2020). Similarly, in the absence of junk DNA, proteins that silence this junk by packing it into heterochromatin will likely start to silence functional DNA.

Note that, within this context, junk DNA neither historically evolved for a specific function; nor is it currently being selected for or against for organisms.Footnote 4 According to the predominant accounts of “function” in biology, junk DNA has no function (for the organism). Yet its presence matters for an historical and comparative understanding of evolution. For instance, this type of evolution—one not necessarily directed by natural selection—leads to real differences between diverse branches of the evolutionary tree. Over long stretches of evolutionary time, different organism types have accumulated diverse sets and amounts of these non-functional (yet influential) entities (Gregory et al. 2007). These different entities contribute to features which permit different cellular “life” styles, which can contribute to multi-level selection (Vinogradov 2004; Gregory 2005b). In sum, the existence of junk DNA changes many basic features of the cell. Since these features affect cell physiology, junk DNA is a difference maker. We elaborate on what this means in the next section.

Difference makers

In 2007, the philosopher of science C. Kenneth Waters identified the class of “actual difference makers” as the causes that actually make a difference (Waters 2007). Waters offered this contribution to the extensive literature on causality in order to resolve a stubborn puzzle from the historical study of causation. The puzzle is traditionally framed by asking the following question: when someone strikes a match (say, to light a fire), why do we attribute the cause of the fire-lighting-event to the striking of the match, as opposed to the availability of oxygen (or some other, equally crucial element of the causal situation)?

Prior to Waters’ (2007) contribution, the received philosophical view of this type of situation had been—since the 1843 publication of John Stuart Mill’s A System of Logic—one of causal parity. According to Mill, all causes are ontologically equal, or on a par (hence the term ‘parity’ for this sort of view). On Mill’s view, there is no substantial difference between the striking of the match and the availability of oxygen. This position is at odds with our common, intuitive sense that the actual cause of the lit fire is the match-striking-event, and that other elements such as the presence of oxygen are mere background conditions (or some such). Post-Millian philosophical attempts to explain our tendency to pick out one cause among many have tended, as Waters (2007) details, to point to our interests as the cause of our commonsense beliefs in causal priority rather than parity (Hart and Honoré 1959; Mackie 1974). Believing something like “sure, oxygen is in some sense a cause of the fire lighting, but striking the match is the real cause” is a view which attributes causal priority to one cause (striking the match) over others (such as the presence of oxygen). Here, one cause is being thought of as more significant than others (hence the term ‘priority’ for this sort of view).

Waters develops an account designed to pick out commonly emphasized biological causes, such as differences in the structure of DNA (when it varies in ways which generate variation in other variables such as RNA and other molecules). He then explains the perceived significance, or priority, of this DNA relative to other causal elements, such as the structure of accessory proteins (when these do not tend to vary in ways which generate corresponding variation in other variables). Waters accomplishes all this with the help of a trio of distinctions:

1 – potential difference makers (causes whose variables could vary in a population to make a difference in effect) versus actual difference makers (causes whose variables do vary in a population to make a difference in effect). For example: if there was variation within a population among the structures of one of its protein-folding accessory proteins, and that variation generated corresponding variation in protein conformation, then that protein-folding accessory protein would be an actual difference maker in that population. But in the absence of actual variation among the structures of that protein-folding accessory protein generating actual variation in protein expression within a population, that molecule is merely a potential rather than an actual difference maker in that population.

2 – an actual difference maker (working to generate population-based differences in effect, together with other actual difference-making causes) versus the actual difference maker (working to generate population-based differences in effect, without any other actual difference-making causes). Waters points to RNA synthesis in bacterial cells as an example of the latter kind of case—one in which only activated DNA segments act as the actual difference maker in this process, since only that “variable” takes on different “values” for different cells within the relevant population. Although accessory proteins like RNA polymerase are (of course) required for RNA synthesis in bacterial cells, RNA polymerase is not in this case a “variable” which takes on different “values” in this population, in the sense of individual bacterial cells expressing structurally distinct versions of RNA polymerase (or other protein-folding accessory proteins) which then correspondingly generate differences in synthesized RNA. (RNA polymerase is still a potential difference maker, though, since different bacterial cells within a population could presumably contain structurally distinct versions of RNA polymerase which could then presumably affect instances and rate of RNA synthesis.) Alternatively, Waters points to RNA synthesis in eukaryotic cells as an example of the former kind of case—one in which activated DNA segments act as an actual difference maker along with RNA polymerases, since in this more complicated context there are different kinds of RNA polymerases and “presumably, different accessory molecules are also associated with the synthesis of different RNA molecules” (Waters 2007, p. 574).

3 – causally specific difference makers (causal variables whose values can be distinctly many, variation among which corresponds to distinctly many values in effect-variable) versus causally non-specific difference makers (causal variables whose values are not distinctly many, or whose distinctly many values do not correspond to distinctly many values in effect-variable). Woodward (2010) explicates these notions with a contrast between fine-grained (or dial-like) and coarse-grained (or switch-like) control. To listen to a particular station on your radio, you need to both flip the on/off switch and turn the dial to the appropriate frequency. But only turning the dial gives you fine-grained control over which station you might receive, and there are distinctly many positions on that dial which each correspond to distinctly many stations to which you might listen. Obviously, moving the on/off switch to the correct position is required in order to listen, but “the switch is not causally specific with respect to which program is received” (Woodward 2010).

The upshot of all this conceptual machinery—for biology as it is being practiced, and even more specifically for genetics—is that it provides a potential justification for judgments of the causal priority of coding DNA. Therefore, according to Waters, “it makes sense for biologists to say that DNA is not on a causal par with many of the other molecules that play causally necessary roles in the synthesis of RNA and polypeptides” (2007, p. 579). This stance—that of assigning causal significance, or priority, to DNA over other molecules—is indeed a familiar one in biology. And it is explained, according to Waters (2007), by the fact that DNA is an actual as opposed to a merely potential difference maker. Additionally, although DNA is an actual difference maker, among several others (it is not the only actual difference maker), of those assorted actual difference makers, DNA is the causally specific one (its difference-making is not merely causally non-specific.Footnote 5

Junk DNA as an actual difference maker

A surprising thing about this account of causal priority is that it leaves all “activated” DNA on a par with one another—including junk DNA. Waters writes: “An important actual difference in cells is the difference in the nucleotide sequences of RNA molecules synthesized in a cell” (2007, p. 573). As discussed in the “Junk DNA” section, both coding and junk DNA are transcribed in cells; both make for actual differences in the nucleotide sequences of RNA molecules synthesized in cells; both generate causally specific differences in the sequences of RNA molecules synthesized in cells. Any stretch of transcribed DNA which varies among individuals within a population—transcription of which generates corresponding variation in the RNA synthesized among individuals in that population—is going to look, on Waters’ account, like an important actual difference maker in that population. Given what has recently been learned about widespread transcription activity, this includes junk DNA.

Ironically, since junk DNA is likely subject to less (or even zero) purifying selection relative to coding DNA, junk DNA will often tend to vary more among individuals in a population than does coding DNA, and thus junk DNA might even turn out to be more causally specific than coding DNA on Waters’ view. To apply Woodward’s toy example from above, if we think of a stretch of DNA like a radio dial that can be tuned to different stations (where each station is an instance of phenotypic variation existing in the population), then there are likely going to be more stations available for tuning to, on a dial corresponding to a stretch of junk DNA, than on one corresponding to a stretch of coding DNA. A stretch of junk DNA is a causal variable which is likely to take on more distinct values among individuals in a population than is a stretch of coding DNA, and to thereby correspond with more distinct values of the synthesized RNA effect-variable.

But—as discussed above, and contingently so—biologists and philosophers of biology do not typically view causally specific differences generated by actual differences in junk DNA as on a causal par with causally specific differences generated by actual differences in coding DNA. Although Waters’ account of causal specificity goes some way towards explaining why DNA has been causally elevated relative to other actual difference makers (such as RNA polymerase), it does not explain why coding DNA has been causally elevated relative to junk DNA. Junk DNA has the same two features which Waters has picked out as the distinguishing features of coding DNA: actual difference-making and causal specificity. Yet junk DNA is not on a causal par with coding DNA in biology.

Something is missing from Waters’ account; something related to causal specificity but involving additional concepts we hereby term ‘causal reach’ and ‘causal efficacy’.Footnote 6 One obvious, crucial difference between coding and junk DNA is that, although each of these causal variables generate both proximate and distal effects, not just the immediately proximate but also some of the rather more distal effects of coding DNA can be nonetheless quite causally specific. Although junk DNA has both proximate and distal effects, too—and junk DNA often has casually quite specific proximate effects—the more distal effects of junk DNA tend to be significantly reduced in causal specificity, compared with some of those of coding DNA. So, even taking on board much of Waters’ framework for comprehensively understanding the causal priority typically afforded coding DNA, we need to track not just the specificity of actual difference-making causes in biology, but also the extended (or not) reach of these causes, as well as the maintenance (or not) of that specificity throughout the reach of their effects—i.e., the complex combinatorial feature of causal efficacy.Footnote 7

See Fig. 1 for a conceptual depiction of various causes with distinct combinations of specificity, reach, and corresponding efficacy. Causal specificity has already been explained; causal reach has to do with how many effects a single, initiating cause generates.Footnote 8 These effects can: extend straightforwardly out in a linear chain from the initiating cause; scatter out in a burst from the initiating cause, or both scatter and extend in a cascade from the initiating cause. Explained in engineering terms, causal reach can occur via parallel processing (multiple effects radiating out from one causal node), in sequence (a series of proximate and distal effects), or by combination of these two modes.

Fig. 1
figure 1

Various causes with distinct combinations of specificity, reach, and corresponding efficacy. Top left, an utterly inefficacious cause with no effects. Bottom right, an extremely efficacious cause—one that has both reach and specificity in terms of its effects. Reach is indicated by number of nodes—where each cause can have either unitary or multiple effects, both proximate and ultimate. Causal specificity is indicated by effect opacity—where each node can either transmit its full pigmentation or produced a faded node with increased transparency. Within rows, causes increase in specificity from left to right; within columns, causes increase in reach from top to bottom. Hence, top right has more specificity though less reach relative to bottom left, which has more reach though less specificity. Both are less efficacious than bottom right, but more efficacious than top left; neither are necessarily insignificant causes. (Prepared by Adam Streed using the matplotlib library for Python 3; all the code is his own)

Causal efficacy tracks interaction of the two prior notions of specificity and reach. This concept is required to monitor the effects a difference maker has—as the effects of that difference maker extend and cascade (or not) and given the causal specificity (or not) of those resulting effects. A causally abrupt, non-specific difference maker is by its nature not very efficacious. A difference maker with extensive reach can nonetheless be rather causally inefficacious if its many “reach” effects are mostly non-specific. Likewise, a highly causally specific difference maker can nonetheless be rather causally inefficacious if it does not have much reach in terms of its effects. A difference maker with both extensive reach and specificity in the effects it causes will be extremely efficacious.

See Fig. 2 for a conceptual depiction of various genetic causes with distinct combinations of specificity, reach, and corresponding efficacy. So, one likely reason coding DNA is considered causally so significant by biologists because it can have quite the causal reach, and not just its proximate but often also its more distal effects can be quite causally specific. Note, however, that not all causally specific changes to coding DNA are on a par with one another—in terms of their causal reach, corresponding efficacy and resulting effect on judgments of causal priority.Footnote 9

Fig. 2
figure 2

Various genetic causes with distinct combinations of specificity, reach, and corresponding efficacy. a A synonymous, conservative mutation in any transcribed DNA—coding or junk. b A synonymous, semi-conservative mutation in coding DNA. c A non-synonymous, semi-conservative mutation in coding DNA—one that produces a change in protein amino acid sequence but no change in conformation. d A non-synonymous, non-conservative mutation in coding DNA—a genotypic change with phenotypic and other ultimate effects. e Insertion or deletion of a small “repeat” segment of junk DNA. f Insertion or deletion of a large “repeat” segment of junk DNA—one which ultimately produces an uptick in RNA decay machinery. g Insertion or deletion of a large, unique (not a repeat) segment of junk DNA—the size rather than uniqueness of which also produces an uptick in RNA decay machinery. h Insertion or deletion of another large, unique segment of junk DNA—one that produces in virtue of its size and uniqueness not just an uptick in RNA decay machinery but also other downstream, causally non-specific distal effects. Explication of these cases continues in the main text below (Prepared by Adam Streed using the matplotlib library for Python 3; all the code is his own)

A synonymous, conservative mutation in transcribed DNA (e.g., a change from a GGC codon to GGG), although it will produce a causally specific change in synthesized RNA, will not necessarily produce any further changes of note—even if protein synthesis occurs (since GGC and GGG both code for glycine; Fig. 2a).Footnote 10 However, even synonymous mutations in coding DNA can turn out to be not perfectly conservative (i.e., they can be productive of further, causally non-specific effects; Fig. 2b).Footnote 11 Non-synonymous mutations in coding DNA can also vary in terms of the (semi-) conservativeness of their effects. A replacement event which introduces one hydrophobic amino acid for another during protein expression, or one positively charged amino acid for another (Fig. 2c), is likely to be more conservative in its changes than those events which substitute a hydrophobic amino acid for a charged one, or a negatively charged amino acid for a positively charged one. A non-synonymous, non-conservative mutation in coding DNA which introduces a stop codon in a new place will likely affect more than just one amino acid of a protein’s primary sequence (it may cut off several) and could affect not just genotype but also phenotype (Fig. 2d). All these changes (Fig. 2a–d) are proximately causally specific, but they vary in reach, as well as in the specificity of their more distal effects (i.e., their causal efficacy). The fact that these changes also vary in perceived importance (ascending from a to d) shows that not just causal specificity but also causal reach and efficacy are features relevant to judgments of biological importance.Footnote 12

The relevance of causal reach and efficacy to judgments of biological importance becomes even more apparent when considering potential changes to junk DNA. Of course, many changes to junk DNA are unimportant from this point of view. Insertion or deletion of a small, “repeat” segment of junk DNA can fail—even when transcription occurs—to produce any causally specific effects. Such insertions or deletions are likely—if they alter anything at all—to change only the amount or proportion of various segments of junk RNA already synthesized in the cell (since they will neither introduce an entirely new kind of segment nor entirely remove any kind of previously-existing segment; Fig. 2e). To re-apply and extend the radio metaphor from Woodward (2010), this is something like turning up or down the volume on a station to which the cell is already tuned. Insertion or deletion of a larger but still “repeat” segment of junk DNA might produce additional causally non-specific effects further downstream, e.g., an uptick in RNA decay machinery (described in “Junk DNA” section), via sizable alteration of the amount of junk DNA transcribed in the cell (Fig. 2f). Alternatively, insertion or deletion of a large, unique (not a repeat) segment of junk DNA might have causally specific effects on the types of junk RNA that are synthesized in the cell (since its uniqueness means that it will, if that stretch of genome is transcribed, either introduce or eliminate expression of a segmentFootnote 13), but could also have causally non-specific distal effects on, e.g., the amount of RNA decay machinery, via its sizable alteration of the amount of junk RNA being synthesized (Fig. 2g). Finally, insertion or deletion of a large collection of junk DNA segments could have other cascading, causally non-specific distal effects—such as altered patterns of interaction with RNA decay machinery, changes in cellular size, or degradation of functional RNA (introduced in “Junk DNA” section, additional discussion in “Detecting the hidden significance of junk DNA” section; Fig. 2h). As before, these changes tend to vary in perceived biological significance (ascending from e to h).

Our presentation of these cases reflects the fact that the relative significance of junk DNA, when it is significant, does not usually stem from its occasional uniqueness (and corresponding causal specificity). The diagram also helps us to demonstrate, given the reduced causal specificity of this bottom set of cases (Fig. 2e–h) relative to the prior top set (Fig. 2a–d), why coding DNA is considered causally so significant by biologists—because it can have quite the causal reach, and not just its proximate but often also its more distal effects can be quite causally specific. Coding DNA can be a highly efficacious cause, and this reliably leads to judgements of elevated causal importance. But we can further see, given the significant causal reach of some of these cases (Fig. 2d and h) relative to others (Fig. 2a–c and e–g), how not just causal specificity but also reach might play a role in judgements of causal importance. Although junk DNA may rarely be a highly efficacious cause (due to its diminished distal causal specificity), it can still have reach. The conceptual machinery of causal specificity, reach, and corresponding efficacy gives us the tools to understand the utmost causal importance of certain kinds of changes in coding DNA, the relatively reduced in importance but still non-negligible role of certain other kinds of changes in junk DNA, and the common occurrence of trivial sorts of changes in either—even from a relatively restricted, emphasis-on-selection-at-the-level-of-the-organism style of view.

Obviously, we have not exhausted all the diverse ways that either coding or junk DNA can causally affect change (or fail to do so).Footnote 14 We posit that junk DNA, and other non-functional (in the selectionist sense) components of the cell, are a significant part of the evolutionary story of eukaryotes; low-efficacy causes can still be significant. We suspect that the diminished causal efficacy of these components has often made it difficult for either biologists or philosophers to appreciate and characterize their significance, despite their reach.

From the point of view of organism-level selection, junk DNA may have arrived in cells in a non-adaptive manner (at the organismal level), and typical mutations (such as small nucleotide polymorphisms) in a nucleotide sequence of junk DNA may not generally be selected for or against (at the organismal level), but mechanisms within the cell have generally adapted to its presence. In the context of this evolutionary interplay, the presence and potency of the total sum of junk DNA in the cell dictates the need for various cellular machineries of “quality control,” changing the resources required and expended by the cell, and committing the cell to particular strategies. Significant changes to either party (the junk DNA or the cellular clean-up crew) can produce causally significant though non-specific effects. We detail these possibilities, and how to detect them, in the next section.

Detecting the hidden significance of junk DNA

One way to illustrate the difference between significant but not functional (in the selections sense) and functional biological entities is to explore how these entities as a whole might affect the fitness of an organism: to explore their hypothetical, collective effect. In theory, this could be done. In practice, the amount of resources that would be required to carry out the full experiment may not be feasible. First, we would need to pick a genome. Next, we would need to unambiguously identify non-functional DNA by deleting small segments of the genome. If we chose to investigate the human genome, the deletion size that we would use would be similar to mutations that naturally occur in human populations, since it is precisely these types of mutations that are seen by natural selection. Any deletions that did not affect the number of offspring could be tagged for further investigation. Then, we would combine these deletions in increasing amounts and evaluate how these affect the number of offspring of the resulting organism.

Cataloguing functional versus non-functional roles

If given unlimited resources and time, we could in principle test whether each segment of the chosen genome is functional. Any region of DNA can play a causally specific, sequence-dependent role or a causally non-specific, sequence-independent role.

To test whether any given region of DNA has a sequence-dependent role, we could introduce nucleotide substitutions. However, this would not test whether the region in question has more non-specific roles that contribute to its reach. For example, it is known that 5′ untranslated regions have to be a minimal length—if they are any shorter than this minimum, the translational start site will not be reliably recognized and the mRNA will not be properly translated (Kozak 1991). Thus the 5′ untranslated region acts as a spacer, one that can be quite important—since even from a perspective that emphasizes organismal-level selection, it is a physical entity that contributes to the cellular environment.

To test whether any given region DNA has a sequence-independent role, we could systematically delete these regions.Footnote 15 Assessing the rate of insertions or deletions (indels) between organisms has been used to determine that most annotated non-coding transcripts produced in mammalian cells are non-functional, although these analyses are statistical in nature (Ponting 2017). To precisely assess each individual region, we would have to empirically test it. We would begin by deleting as much DNA as what occurs in natural mutations, since these alterations are the ones on which selection operates. The size of the average deletion can vary quite considerably, typically because those regions of the genome which contain repeat elements are quite prone to large alterations. In non-repeat regions it has been estimated that 96% of all deletions are smaller than 16 base pairs in length (Mullaney et al. 2010). In repeat regions, spontaneous deletions are more common and can be larger. Although very long deletions are rarer, they can accumulate in lineages over generations, and as a result the size of any given repeat region varies considerably in the human population, with many of these deletions being tens of thousands of base pairs in length (Jeffreys et al. 1985; Redon et al. 2006). With this in mind, we would begin our experiment by deleting ~ 100 base pair segments in parts of the genome that show no sequence conservation between humans and other mammals, which represents about 95% of the human genome. It is likely that some of this non-conserved DNA plays a lineage specific-role in humans (i.e., has been under selection in the human lineage but is not conserved between humans and related species, such as chimpanzees), although most estimates place this at less than 5% of the genome (Ward and Kellis 2012; Rands et al. 2014; Gulko et al. 2015; Ponting 2017).

To perform this experiment in an informative manner we would need to track the reproductive success of thousands of individuals having the same mutation under very controlled conditions to infer whether a given deletion was slightly deleterious. Thus, in practice, the effect of slight changes to fitness would be almost impossible to assess. Nevertheless, in theory, this experiment could be done.

Testing the boundary between non-functional and functional roles

With all the functional and non-functional DNA cataloged (though only from the organism-level-selection point of view), we could then move on, in a rather interesting way, to determining how consequential each region of so-called junk DNA might be. We could combine previously tested deletions of junk DNA together into unnatural mutations (i.e., mutations not typically seen in nature, and thus not subjected to selection pressures) and determine their effect on reproductive success.

To reiterate, this experiment would not test whether these sequences are functional (in the purifying selectionist sense), as selection would never encounter these types of combinatorial mutations. Instead, this would be one means of testing whether given regions of junk DNA are influential (i.e., whether they are difference makers). The degree to which deletion mutations would have to be combined before they affect fitness (i.e., reproductive success) might indicate how (in)efficacious these regions are as difference makers.

We expect that quite a large number of deletions must be combined before we would be able to detect any effect on fitness. Indeed, it has been seen that deleting 1.5 Megabases of non-coding DNA from the mouse genome (approximately 0.06% of the entire genome) had no measurable effect on the fitness of the animals (Nóbrega et al. 2004). How much of the junk DNA genome would need to be eliminated before measurable effects can be detected is hard to estimate at this time.

Nevertheless, there would likely come a point where the diffuse effects of deleting many junk DNA regions, when combined, would become detectable and significant—and that this would show the cumulative “difference making” effect of the presence of junk DNA (relative to its absence), along with the corresponding cellular adjustment to its usual presence (relative to its absence). We might expect that quite a sizable amount of junk DNA would need to be removed before we would be able to measure any effects on fitness.

Of course, in practice there would be additional complications. Different genomic regions may play redundant roles, thus elimination of one or the other may have little to no fitness effect, while deletion of both will have a large impact on fitness. This combination would not test the diffuse effects of junk DNA, but rather uncover much more specific effects that were previously hidden. Untangling these two possibilities may be almost impossible.

Non-functional yet significant roles for junk DNA

So, what would the massive elimination of junk DNA change? Alternatively, what non-functional roles does it normally play? Below we list some of the causally non-specific yet nonetheless significant effects which can make junk DNA consequential for the cell.

Cell size

It is well known that DNA content correlates with the size of the cell nucleus, which in turn correlates with the total cell size (Cavalier-Smith, 1978; Gregory, 2001). It has thus been inferred that DNA content affects cell size. Thus, massive reductions in the amount of junk DNA would lead to a reduction in cell size. This would fundamentally change many aspects of cell physiology. First, as cell size changes, surface areas and volumes do not scale with one another in a linear fashion. As cells become smaller, the ratio of surface area/volume increases. An increase in cell membrane/volume ratio would allow molecules to diffuse more rapidly into and out of cells, and thus allow for greater rates of metabolism. It is widely believed that the small genome size of both birds and bats, and the accompanying increase in metabolic rate, was a necessary prerequisite for the evolution of flight (Hughes and Hughes 1995; Gregory 2002). Indeed, even within birds, those which have the smallest genomes and have the most rapid metabolism are the hummingbirds and those with the largest genomes are flightless birds (Gregory et al. 2009). Interestingly, in the avian lineage it appears that genome size reduction happened before the evolution of flight as cell size estimates for closely related non-avian dinosaurs were found to be quite small (Organ 2007).This suggests that there was no initial selection pressure to reduce genomes due to metabolic constraints associated with flight, but rather that the organisms that happened to have small genomes had the metabolic prerequisites that were necessary to evolve flight. Subsequently, it appears that bird genomes have stayed small due to occasional large genomic deletions, which are possibly driven by positive selection (Kapusta et al. 2017).

The increase of surface area/volume as the genome contracts is true not only for the cell (cell plasma membrane/cell volume) but also for some of the organelles, for example the nucleus. Other organelles, such as mitochondria, would most probably remain the same. These alterations will have their own consequences. For example, in the nucleus, newly made mRNAs are known to randomly diffuse from their site of production to the nuclear pore (Shav-Tal et al. 2004), thus the larger the nucleus, the more time it will take for newly made mRNAs to reach the cytoplasm where they are translated into proteins. If we compare the nuclear export rate (t1/2; time it takes for half of the pool of mRNA to be exported from the nucleus to the cytosol) for the exact same mRNA (in this case the ftz reporter mRNA), it is 2 h in frog oocytes (Luo and Reed 1999), whose nuclei have a radius of 200 µm; but only 15 min in mammalian cells (Palazzo et al. 2007), whose nuclei have a radius of about 4 µm. Nuclear export rates (t1/2) are 3–5 times faster in yeast, whose nuclei have a radius that is smaller than a micrometer (Oeffinger and Zenklusen 2012). The nuclear export rate of mRNA is one of the main determinants of the lag time between gene activation and protein production, and can affect biological processes such as the timing of circadian rhythm as well as how cells respond to their environment (Hoyle and Ish-Horowicz 2013).

Besides gene expression timing, it is likely that other cellular processes would also be affected by cell size (Cavalier-Smith 1978). It is not entirely clear how drastic changes in cell size would affect the development and the proper function of tissues and organs, but it is reasonable to expect that many of these may become dysregulated. Note that we are not claiming that cell size could never undergo a drastic change due to these constraints. It is likely that if an organism gradually lost DNA over long periods of time, this would be accompanied by other adaptive changes which would compensate for these alterations in cell physiology. But it is likely that a drastic reduction of cell size, without all the accompanying adaptive changes, would lead to a fair number of problems.

As is the case with birds and bats, it is likely that drastic changes in cell size—and the accompanying adaptive changes to compensate for changes in cell physiology—could alter the organism in such a way that it could open up or limit the ability of the organism to occupy certain niches. In insects and vertebrates it has been documented that certain aspects of development and cell physiology are restricted to animals with a limited range of genome sizes (Gregory 2005b). It is also worth noting that in comparison to animals, plants—whose development and organ function appears to be more plastic—more often experience whole genome duplication events which effectively double their cell size. In animals, whole genome duplication events are rarer and seen most in amphibians (frogs, axolotls)—a class of animals that tend to have large genomes, and thus may have extensive buffering capacities. Again, this may indicate that drastic changes in cell size may only be compatible with certain types of cell physiology. The idea that genome size is consequential due to its effects on cell size has been advanced previously; however, unlike previous commentators (e.g. Cavalier-Smith 1978), we believe that the typical indel mutations that alter cell size are not subject to selection, as these indels are so small (see “Cataloguing functional versus non-functional roles” section) that each contributes negligibly to the overall dimensions of the cell. There may be some species that have enough variation in genome size for it to be under selection, but these appear to be the exceptions rather than the rule (Blommaert 2020).

Timing of cell division

It has been noted that cells with larger genomes take longer to divide. Superficially this makes sense as one would expect it would take more time to replicate the increased amount of DNA. However, this can easily be remedied by increasing the number of DNA polymerases in the cell, and also increasing the origins of replication which would allow all the extra polymerases to have access to the DNA. It thus remains unclear how genome size impacts cell division timing.

Regardless of the reasons, the timing of cell division will have drastic effects on the organism’s ability to grow, generate new tissue, and heal after injuries. It is not hard to see that changing this timing would have a major effect on how the organism develops and its overall fitness.

Providing excess substrates for cellular machineries

As we discussed in “Junk DNA” section, junk DNA also provides additional, non-specific substrates for enzymes in the cell to engage with. All enzymes have a preferred substrate, but they will act on non-optimal substrates as well, especially when subjected to non-adaptive evolutionary pressures (Khersonsky and Tawfik 2010; Bar-Even et al. 2011; Tawfik 2020; Copley 2020). Normally the amount of activity associated with these non-optimal substrates is negligible; however, when the amount of non-optimal substrate is in vast excess of the preferred substrate, non-optimal reactions can become substantial. This is true for RNA polymerase enzymes (Struhl 2007), which transcribe DNA into RNA. It is also true of DNA binding proteins (Villar et al. 2014), both those that recognize certain motifs to regulate gene activation, and those that package DNA non-specifically. The easiest solution to deal with an excess of non-optimal substrates is to increase the number of proteins so the excess substrate binding does not interfere with its normal selected activity. In other words, the amounts and activities of all these enzymes are under selection to be able to accomplish their job in a cellular environment filled with excess non-specific substrates: junk DNA and junk RNA.

For other enzymes, their preferred substrates are junk DNA and junk RNA. This is especially true for RNA decay enzymes, since these generally operate with a certain load of junk RNA that they must constantly eliminate. A drastic drop in junk RNA would mean that these enzymes would work more-and-more on their non-optimal substrates, in this case functional RNAs. This “kinetic competition” in RNA quality control systems has been well documented (Doma and Parker 2007; Garland and Jensen 2020; Wang and Cheng 2020). Other proteins that fall in this class are DNA binding proteins that package and silence junk DNA into heterochromatin.

It is possible that many of these enzymes are regulated by feedback loops that “sense” how much excess substrates there are. A large decrease in the number of excess substrates could lead to a decrease in the levels or activities of these enzymes. This, however, may severely impair the proper functioning of enzymes that have both junk RNA, and functional RNAs, as their optimal substrates. For example many RNA decay enzymes also help to trim ribosomal RNAs and other functional non-coding RNAs (Zinder and Lima 2017). Thus, a drastic reduction in junk RNA, if it activated a feedback loop to decrease total RNA decay capacity, may inadvertently diminish the cell’s ability to properly process functional RNAs.

Other (even) more speculative effects

There are possibly other general aspects of the cellular environment that would be altered if a substantial amount of junk DNA were removed.

In eukaryotic cells, long stretches of DNA that contain numerous genes adopt complicated loops, commonly referred to as “topological associated domains” (TADs). There has been much speculation about the importance of these structures (Szabo et al. 2019; Ghavi-Helm et al. 2019; Beagan and Phillips-Cremins 2020). In some cases, these are thought to bring together distant DNA regions, which allows one DNA regulatory element to influence how a distant gene is either turned on or off. In other cases, it is believed that the genes found in one of these loops are co-regulated. It is possible that elimination of large fractions of junk DNA could perturb these local architectural arrangements of the genome. It should be noted that the relative importance of TADs remains controversial. Moreover, it is unclear whether any given TAD would be disrupted if all of the junk within the TAD were to be removed. We understand even less about larger scale architectural features of the genome, how these are affected by junk DNA, and how these structures ultimately contribute to fitness. Nevertheless, effects on DNA architecture could arise, both at the local and global levels, if a genome were massively compacted.

Another aspect of cell physiology that could change is some of its biophysical properties. Recently, there has been a renewed interest in how certain biopolymers tend to form molecular condensates which phase separate from the bulk solution (Banani et al. 2017). Several recent reports suggest that large portions of the genome which are silenced—and normally referred to as heterochromatin—form these condensates (Larson and Narlikar 2018). Similarly, excess amounts of RNA can also form different condensates and these may constitute the matrix of many membrane-less organelles (Jain and Vale 2017; Treeck et al. 2018; Quinodoz et al. 2021). Removal of junk DNA and RNA may alter the amount of these condensates in the nucleus, and this could ultimately impact the biophysical properties of various subcellular regions. Again, it is unclear whether the presence or absence of these condensates would alter the cellular environment enough to impact how functional parts of the genome operate, but it remains possible.

It is likely that many other cell biological processes would be disrupted after large fractions of the junk DNA were to be eliminated. Our goal is not to provide a complete list, but simply to point out that there are many general aspects of the cellular environment that are likely affected by junk DNA.

Summary

Even from a selection-at-the-level-of-the-organism point of view, junk DNA impacts the environment of the cell in which the more traditionally “functional” parts of the genome operate. Normally these impacts are distal, non-specific, and difficult to detect. We can help ourselves conceive of and understand these impacts by imagining how the cell would react to or be changed by the sudden and drastic removal of large swaths of junk DNA. Note that we are not suggesting that these bits of junk DNA are necessarily functional in the traditional sense. Rather, we are suggesting that junk DNA has causal reach; that its presence more than its specificity affects the cell and by extension the organism. There would be significant effects on cell size, timing of cell division, cellular enzymatic activity—to say the least—if it were drastically altered. These are all features and activities of the cell for which junk DNA is, in a normal context, quite important—even though said junk is not subject to the normal processes of natural selection as paradigmatically understood.

Our analysis of junk DNA thus differs from some notable others: those who solely focus on how junk DNA lacks function at the level of the organism (Palazzo and Gregory 2014) as well as those who argue that junk DNA is functional after all (Cavalier-Smith 1978; ENCODE Project Consortium et al. 2012; Mattick and Dinger 2013; Freeling et al. 2015). Similarly, a more nuanced view could also extend to other biological difference makers which do not typically act as organism-level targets of natural selection. Carving out an intermediary role—not necessarily a functional one, but still significant—allows for the recognition of more complex and diffuse elements of cause and effect in biology.

Potential objections and replies

Here we briefly consider two potential objections to the view we have articulated, and offer a pair of pre-emptive replies.

Objection: junk DNA can be eliminated by natural selection—as evidenced in eukaryotic lineages that have undergone genome reduction—and thus its presence must have adaptive or maladaptive effects on cell physiology

Reply: We do not argue that the long-term elimination of junk DNA cannot happen. In the case of some eukaryotes, such as in certain species of Arabidopsis mustard greens (Hu et al. 2011) and Arachis peanuts (Ren et al. 2018), this has occurred within a relatively very short time span. As stated above, flying birds and bats have undergone numerous large deletions that have prevented their genomes from growing due to the periodic invasion of transposable elements in their genomes (Kapusta et al. 2017). Having said that, it is not clear that these genomic reductions were due to positive selection for smaller genomes. It is also possible that genomes could get smaller by non-adaptive mechanisms; for example, when the rate of DNA deletion surpasses the rate of insertion (Petrov 2002), as is likely the case in Fugu pufferfishes (Neafsey and Palumbi 2003). If these non-adaptive processes were ultimately responsible for reducing genome size, we would expect that said organisms also acquired secondary adaptive mutations—so that their cells could operate once their basic properties (such as cell size, timing of cell division, altered rate of molecule diffusion) have significantly changed. It remains possible that in certain organisms, size reductions (or increases) in the genome could be adaptive for a particular reason (Blommaert 2020). For example, it has been suggested that as mammalian lineages expanded after the extinction of the non-avian dinosaurs, mammalian genomes may have undergone a reduction in size due to an increase in the effective population sizes and an increased selection against transposable elements (Lynch et al. 2011). We would expect this to result in the streamlining of the genome only if the advantage that it conferred outweighed all of the negative side effects associated with smaller genomes. Moreover, these effects would not be expected to be linear. We anticipate that quite a sizeable fraction of the genome would have to be eliminated before any adverse effects would be felt. Once that level of depletion is reached, we would expect that these organisms would acquire secondary adaptive changes to counter any negative effects.

Objection: a lack of negative selection against junk DNA shows that it is a reservoir for future function

Reply: This teleological argument is commonly asserted by biologists (Makalowski 2000, 2003; Muotri et al. 2007)—despite the fact that it has been thoroughly rejected by philosophers of biology and evolutionary theorists (e.g., Hull, 1965). The increase or elimination of junk DNA by positive selection within a lineage is strictly subject to how this alteration impacts an organism’s immediate fitness level (not some future potential fitness level). Lineages that do gain properties due to a steady increase or decrease in junk after many generations (after all junk DNA is a difference maker, albeit a relatively inefficacious one) may outcompete other lineages. So, there may be selection on a higher level between lineages, as suggested elsewhere (Jablonski 2008; Brunet and Doolittle 2015). But this type of selection will not change the amount of junk DNA within a lineage; it will only impact whether one lineage is selected over another that differs in this difference maker. Junk DNA elements may also be co-opted by natural selection to produce new functional units. In some cases, new functions may also arise by constructive neutral evolution (i.e., non-adaptive processes). For example: the conversion of junk RNA to functional lncRNAs (Palazzo and Lee 2018; Palazzo and Koonin 2020). While the efficacy of a particular difference maker remains low, it will be subject to non-adaptive evolution.

From this standpoint, a fair question to ask is whether the future exaptation of junk DNA can ever make it (more or less) influential. In line with previous commentaries (Linquist et al. 2020), we think it is important to describe in what time frame we are speaking. Our arguments evaluate whether non-functional (traditionally understood) elements such as junk DNA are influential even from an organismic-level view and in the present. This entails an examination of their current effects, and how efficacious they are in affecting the organism’s phenotype. Of course, this could change in the future. Certain elements of junk could become more efficacious, especially if they acquire new properties, to the point that they transition from non-functional to functional, even on typically selectionist accounts of function. Other transitions are likely (see Graur et al., 2015).

Conclusions

Throughout molecular biology, as well as in the philosophy of biology, function has often been seen as binary in terms of both type and influence: entities are either functional, or not; and only the functional ones are significant. But there are non-functional (traditionally understood) yet significant entities in biology—entities such as junk DNA. According to the dominant accounts of function in biology, most junk DNA is a non-functional entity—since it is and never was a target of selection at the level of the organism. Nonetheless, junk DNA is often a significant entity in the cellular context, and we have adapted Waters’ (2007) account of DNA as an actual difference maker to account for the significance of junk DNA. Namely, junk DNA typically has causal specificity but only for a limited extent of its reach, and thereby it is a relatively inefficacious actual difference maker in cells. Our identification of this option within the conceptual landscape—that of causal reach with proximate, but not distal, causal specificity—allows us to characterize the importance of junk DNA, while maintaining and explaining why it might be assessed as of lesser significance when compared to (for instance) enduringly causally specific stretches of coding DNA.

On both our and Waters’ (2007) accounts, junk DNA is an actual difference maker, and a causally specific one (at least proximally), precisely because it tends to be involved in actual biological process (such as that of making defined RNAs), and because specific changes in it as a cause correspond with specific changes in its effects (e.g., changes in the nucleotide sequences of junk DNA generate specific changes in resultingly transcribed RNA). However, junk DNA is often quite causally inefficacious: despite that it can produce causally specific proximate effects, its cascading ultimate effects tend towards the non-specific. The constrained significance (for the organism) of junk DNA is not due to a total absence of causal specificity, but rather a limited reach for its causal specificity, and resulting limitations in its causal efficacy.

Junk DNA nonetheless contributes to the environment of the cell in which other, more traditionally functional entities exist. Total DNA levels control cell size, the amount of transcriptional noise, the number of transcription factor binding sites, the timing of cell division and likely many other aspects of the cellular environment. More paradigmatically functional entities of the cell have been tuned by natural selection to operate within this environment. This is especially true for quality control processes, which specifically evolve to deal with junk DNA and all its associated biochemical activity. On the one hand, and despite being an actual difference maker, junk DNA is not typically subject to organism-level natural selection. When non-functional DNA is mutated by natural processes, this does not alter the fitness of the organism sufficiently for natural selection to eliminate such mutations. For these reasons and given the way ‘function’ is predominantly understood in biology and philosophy of biology, junk DNA is typically conceived of as a non-functional entity. On the other hand, various components of cellular machinery exist precisely in order to deal with the effects of junk DNA, and the cellular workload is significantly affected by the need to (for instance) perform routine clean-up of the products of junk DNA transcription. If large swaths of junk DNA were eliminated, this would likely affect the fitness of the organism. Although such instances of widespread elimination of junk DNA are highly unlikely to naturally occur, such alterations could in principle could be performed in the lab. For these alternative reasons, and in our proposed terms, junk DNA is significant for the cell—despite its paradigmatically non-functional role.

Ultimately—since there are actual, causally-specific difference makers which shape and impact the cellular environment, but which are not typically subject to organism-level natural selection—our discussion demonstrates that for certain biological organisms, there can be influential features which nonetheless are non-functional in the selectionist sense, i.e., they are not shaped by natural selection. Indeed, in organisms for which drift dominates, it is possible for non-functional-yet-influential biological components to proliferate—especially when the production of these components is fueled by additional processes. The activity of transposable elements appears, in eukaryotes at least, to have driven the proliferation of junk DNA in just such a manner.