Science Without Species: Doing Science with Tree-Thinking

Matzke, Nicholas J.

doi:10.1007/978-3-030-99031-2_3

Nicholas J. Matzke³

4133 Accesses
4 Altmetric

Abstract

The focus of this volume is speciesism. While the concepts of species and speciation remain the focus of a great deal of research, it is worth exploring how in recent decades evolutionary biology has, in several ways, moved away from species as the key unit of analysis of biological questions. I begin by outlining how phylogenetic comparative methods have become essential methodological tools in statistical analyses of relationships between traits. Species are not statistically independent observations, because the reality is that they are related, genetically and statistically, on a phylogenetic tree. Phylogeny also plays a key role in modern analyses of spatial patterns in biodiversity, and in fact relying on phylogenetic biodiversity measures can avoid a number of problems created by attempting to impose a uniform species rank across different continents and clades. Similarly, a major challenge in modern studies of diversification and extinction concerns the units of analysis and how they are defined and recognized. Both “genus” and “species” are human-defined ranks imposed on the phylogenetic tree. The phylogenetic tree is the more fundamental reality that is produced by the macroevolutionary process, and it could include every level of gradation of genetic and morphological divergence. Once ranks are imposed upon it, a variety of methodological problems are created as scientists attempt to make these ranks standardized and comparable across different datasets and timescales. I outline how phylogenetic thinking might help provide a solution. I conclude with other examples where cutting-edge science is done with phylogenies without much need of the “species” rank—for example, in the battle against Covid-19.

You have full access to this open access chapter, Download chapter PDF

Integrative taxonomy and the operationalization of evolutionary independence

Article Open access 13 April 2018

Unpacking the species conundrum: philosophy, practice and a way forward

Article 19 July 2017

Working with the Tree of Life in Comparative Studies: How to Build and Tailor Phylogenies to Interspecific Datasets

1 From Comparative Methods to Phylogenetic Comparative Methods

Common research questions in ecology and evolution are about the relationship between different traits, observed across many extant taxa. Is brain size related to body size in vertebrates? Is height related to seed dispersal distance in plants? Is genome size related to organism size in eukaryotes? Such questions are always aimed at eventually trying to tease out cause and effect, but a crucial preliminary scientific step is to determine whether or not two traits are even correlated.

In the “olden days” of biology—by which I mean before the 1980s, which is of course not that long ago—the standard procedure in testing for correlation would simply be to measure the trait values across many species, plot the data, fit a linear regression, and check if the slope of the line was significantly different from 0.

A flaw in this work was exposed in a famous paper by Felsenstein (1985). He noted that, if species shared common ancestry, and thus had shared phylogenetic history, then different species will not be statistically independent samples. Instead, species that are more closely related will tend to have more similar traits. He illustrated this with an imaginary “worst-case scenario” (Fig. 3.1).

Two scatter plots illustrate the value of trait hashtag 2 over the value of trait hashtag 1, and the root node at the bottom diversifies into nodes A, which is Clade A, and node B, which is Clade B. — **Fig. 3.1**

In Fig. 3.1 (middle), 20 species in Clade A are equally related to each other, as are another 20 species in Clade B. However, the two sister clades are separated by a long period of independent evolution, indicated by long branches. Imagine we have measured two continuous traits, for example, tree height and seed mass. If we imagine a simple (if no doubt unrealistic) model for evolutionary change in these traits, wherein trait values randomly wander without any directional trend, but with the expected amount of wandering increasing with time, then a lot of evolutionary change can accumulate along those long branches. Even under this simple model, it is likely that the common ancestor of Clade A will have values for Traits #1 and #2 substantially different than the common ancestor of Clade B.

As Node A diversifies into Clade A, and Node B diversifies into Clade B, the traits of each descendant species randomly wander away from their common ancestral values. When the resulting data are plotted (Fig. 3.1, left), it appears that Trait #1 and #2 are strongly correlated, with a positive slope of 0.44, an R² suggesting that Trait #1 explains 25% of the variation in Trait #2, and P-value of 0.001155, reassuringly well below the standard P < 0.05 cutoff for statistical significance. Excited biologists could publish this correlation and speculate on causal relationships.

Unfortunately, as Felsenstein pointed out, the whole analysis is spurious, because the entire signal of relationship is coming from the different mean values of Traits #1 and #2 in the two clades, and because Clades A and B inherited different ancestral starting values from their respective ancestors at Node A and Node B. Instead of having a sample size of n = 40 independent species supporting a relationship between the two traits, in this imaginary study, the real sample size is really just n = 2 sister clades with different means. Of course, a relationship can be postulated based on two points, but n = 2 is not an acceptable sample size—a straight line can always be drawn between two points!

The problem that Felsenstein identified is just an evolutionary special case of the common problem of statistical non-independence in data. Classic statistical methods and their results (such as estimates of the slope of a relationship, or the P-value) assume that data within a group are “IID”—independent and identically distributed. In other words, they assume that each of the data points is an independent sample from the same underlying population or process. However, if the underlying process created correlational structure in the data, and this structure is ignored in the statistical models used, then the statistical results can be badly misleading.

The evolutionary process of descent with modification is just such a process, and the “groups within groups” structure produced by a phylogenetic tree (see the concept “clade” in Table 1.1 of Chap. 1 in this volume) means that lineages’ traits will often share a pattern of similarity that mirrors the closeness of the species on the phylogeny. While this pattern of correlations can itself be useful—notably, it can provide evidence of the phylogenetic tree—it is a major hindrance when attempting to determine if traits are correlated for some other, non-phylogenetic reason.

Fortunately, Felsenstein’s paper did more than just point out the problem: it also pointed out a theoretical solution. Felsenstein noticed that, while the raw measurements of traits across species will be strongly correlated because of shared evolutionary history, the differences between sister clades will be statistically independent because they represent the changes in traits that have accumulated since the clades diverged from a common ancestor. This applies whether the clades are “small” (equivalent to what were traditionally termed species), or “large” (where a clade includes multiple populations/“species”).

Felsenstein’s method is known as Phylogenetic Independent Contrasts (PICs). While the details of the method involve some math and assumptions (Huey et al. 2019) about the evolutionary process, the method is now implemented in open-source programs, for example in the free computer language R (Paradis 2012). It can be run in seconds, given a data table and an estimate of the phylogeny.

An empirical example is given below. A common, but flawed, intuition is that genome size is tied in some way to organismal complexity, with the “better” or bigger organisms having bigger genomes. After all, bacteria have small genomes (often about 1 million DNA “letters,” or bases), and complex organisms like us have large genomes (humans have over 6 billion bases of DNA in our diploid genomes). What happens when we test this hypothesis?

Data to address these questions are now available in various published databases (Gregory et al. 2007; Jones et al. 2009; Myhrvold et al. 2015; Gregory 2021). Birds are an excellent instructive example. Figure 3.2 was created by plotting the logarithm of body mass (x-axis) against genome size (y-axis), represented by the “C-value,” a term for a physical measurement of the mass of the genome in picograms (genomic size has been measurable since the 1950s using techniques from cell biology long before genome sequencing was available).

Two scatter plots of C-values, in picograms versus body mass, log 10 of grams, where R = 0.266, m =0.107+/-0.0139 and P I Cs of C values versus P I Cs of body mass,log 10 of grams, where R = 0.00149,m =0.189+/-0.0381. — **Fig. 3.2**

If we look at the regression in Fig. 3.2 (left), it appears that despite substantial scatter, there is good evidence of a positive relationship. The slope is positive, the R² is 0.27, and the P-value is an impressive P = 1.5 × 10⁻⁴⁴. Conventionally, any P less than 0.05 would be considered a statistically significant relationship. However, when we take the same data and use the bird phylogeny to calculate the body mass contrasts and C-value contrasts, the regression of contrasts against each other (Fig. 3.2, right) has a slope indistinguishable from 0, with a nonsignificant P-value of 0.33.

It turns out that a lot of what is driving the apparent relationship in Fig. 3.2a is the fact that hummingbirds, indicated by the blue circles, have both tiny body masses and slightly smaller genomes than other birds. However, all hummingbirds share these two traits, which were presumably already present in a common ancestor, so in actuality, this only provides one instance where body size and genome size appear to have shifted together. When this contrast is included with all of the other contrasts across the bird tree of life, no overall relationship is supported.

It is important to note that the PICs method does not always destroy apparent relationships. While relationships between genome size and complexity or body size are often not supported in analyses of multicellular clades, some clades show a strong relationship between genome size and the size of their red blood cells (Fig. 3.3). In this case, the apparent relationship in the raw traits is confirmed in the regression of PICs.

Two scatter graphs of C value in picograms versus cell size in microns 3 and P I Cs of C value in picograms versus P I Cs of cell size in microns 3 for amphibians.The points indicate relationship between genome size and the size of their red blood cells. — **Fig. 3.3**

Using our random-wandering model for continuous traits evolution, we can estimate, very approximately, the most probable ancestral values of cell size and genome size. These ancestral state estimates can be mapped onto the amphibian phylogeny (Fig. 3.4). Doing so shows why the correlation between the two traits is not merely a product of phylogenetic structure: across the amphibian tree, increases in cell size co-occur with increases in genome size, and these coincident events occur many times across the tree.

Two phylogenetic trees depict ancestral state estimation for cell volume and genomic size of amphibians. It indicates increases in cell size co-occur with increases in genome size across the tree several times. — **Fig. 3.4**

When Felsenstein proposed PICs in 1985, the peer-reviewers were skeptical that the technique would be useful because good estimates of phylogeny were usually not available (Huey et al. 2019). However, with advances in DNA sequencing technology, and computer programs to estimate phylogenies from DNA (several of them developed by Felsenstein), phylogenies became ubiquitous in the 1990s. Nowadays, it is virtually impossible to publish a paper in comparative biology without using Felsenstein’s Phylogenetic Independent Contrasts, or some more advanced method.

The PICs method has now been around long enough that it is considered conventional. However, it is already clear that more sophisticated methods will come to dominate the science of comparative biology in the future (Uyeda et al. 2018). These more complex methods also rely on phylogeny to account for similarities due to shared evolutionary history.

This vignette is relevant to speciesism because it shows that the shift from species-based thinking to phylogeny-based thinking impacts our general scientific and philosophical understanding of the world and the basic statistical methods that scientists use to test hypotheses. It is worth noting that PICs and related tools can be used to test hypotheses whether or not the “Operational Taxonomic Units” at the tips of the tree are considered species, or some other level of lineage. Populations, virus strains, and languages can all be analyzed phylogenetically since they originate through “descent with modification” involving the copying of information and stepwise changes that are themselves copied. Even anti-evolution legislative proposals have been analyzed phylogenetically! (Matzke 2016).

2 Biodiversity in Space: Spatial Phylogenetics

Phylogeny-based thinking has also had a major impact in another subfield of ecology and evolution: measuring biodiversity and mapping how it changes in time and space. Traditionally, the dominant measure of “biodiversity” has been simple species diversity: the number of species counted in any particular sampling unit, such as a geographical region or a geological time-bin.

However, knowledge of evolutionary trees means we know that not all “species” are equally distinct. Some species represent very young lineages that have diverged from their still-extant relatives in an evolutionarily short period of time, perhaps hundreds of thousands or even just thousands of years. Other species represent the last surviving members of lineages that have existed for hundreds of millions of years.

Both cases are important and worth preserving, but many people have long felt there is something significant about the last surviving members of ancient lineages. Although the term “living fossil” has various problems (Casane and Laurenti 2013), it does capture the astounding fact that some lineages (e.g., the living tuatara, the platypus, southern conifers like the kauri and Wollemi Pine, and the ginkgo), would only be known from fossils except for the survival of these species, often in relatively isolated geographies or habitats.

While this “feeling” of significance is worthwhile, scientists like to quantify, and the advent of widespread phylogenies has enabled scientists to apply numbers that capture the uniqueness of extant lineages (Rosauer et al. 2009; Mishler et al. 2014). The measurable quantity is simply: how much of the living Tree of Life would be lost if a particular species (or some other clade) went extinct? In the case of New Zealand’s tuatara, something like 250 million years of evolutionary history would be lost. The extinction of humans, in contrast, would result in only about 6 million years lost from the Tree of Life, assuming our closest relatives, the chimpanzees, survived. The extinction of the domestic horse might only cause the loss of about 1 million years of history, due to the existence of close relatives (donkey and zebra; Lippold et al. 2011).

Merely having this measurement does not itself resolve moral or policy questions about where to put conservation efforts and money; these are questions for conservation biology and policymakers. However, measuring the “phylogenetic diversity” or phylodiversity (Kling et al. 2018) represented in a region may be a more stable and useful measurement of the biodiversity of a unit than the species count, which can shift as taxonomic conventions and technology change. Crucially, such measures can be mapped, forming the basis of the nascent science of spatial phylogenetics (see Allen and Mishler, Chap. 8, this volume).

Maps of phylodiversity are useful scientifically because they provide a historical dimension for studying the processes that produced and maintained extant biodiversity and its distribution. Interestingly, some regions of the globe have relatively more neoendemics (endemic lineages that originated recently) and other regions have relatively more paleoendemics (endemic lineages that are the last representatives of ancient groups and persisted or accumulated in a particular region). While the mapping of phylogenetic diversity and neo/paleoendemism is well advanced, attempting to fit models to the observed patterns is still at an early stage, albeit with a promising start.

3 Studying Biodiversity in Time: The Challenges Imposed by Ranks

In one sense, paleontology has long done science at something other than the species level. Due to the incompleteness of the fossil record, studying diversification and extinction with only species counts is often statistically dubious. The number of fossil species known in particular regions and time-bins can depend heavily on the amount of fossil-bearing rocks of the right location and age, the quality of preservation, taxonomic opinion, and even paleontological interest—the number of paleontologists available for a particular clade, region, and time, and the amount of effort they spend discovering and describing fossils (Valentine 1985).

For these reasons, it is common place for paleontologists to study macroevolution using coarser measures of biodiversity, typically higher Linnaean ranks, such as the number of genera or families. While the practice of counting Linnaean taxa—which are fairly subjective groupings—has various plausible critiques (de Queiroz and Gauthier 1994; Hendricks et al. 2014), it remains true that major features of the history of life (e.g., the five great mass extinctions [Barnosky et al. 2011]), were discovered by counting Linnaean ranks. Such studies would probably have been difficult or impossible if species-level data had been insisted upon. “Species-level” macroevolutionary studies of the fossil record remain challenging today, and species numbers are often back-calculated from higher level ranks.

Perhaps stemming from these successes of a rank-based approach, paleobiology has, ironically, been one of the slowest areas to adopt the phylogenetic approach that has swept through other fields of biology. There are several major challenges here.

First, fossils do not retain DNA. Subfossils (younger organic remains of organisms, such as Neandertal teeth or frozen mammoths) are an exception, but they are a minor one since DNA degrades over time. Even under the best conditions (frozen), a recoverable DNA sequence seems to have a limit of about 1 million years. Without DNA, phylogenies must be estimated from morphological characteristics. While DNA phylogenies are routinely built from thousands or millions of DNA characters, vertebrate fossils typically only have a few hundred characteristics that can be used (scored) as phylogenetic data. Other fossils, such as clams, may have only a few dozen scoreable characteristics.

A second challenge is the phylogenetic methods available for fossil data. Scientists want a time-scaled phylogeny that includes fossils, but this requires probabilistic models of the data. Modeling DNA datasets is aided by the large amount of DNA data available and the simple manner in which DNA is assumed to evolve. Thus, DNA models have been developed continuously since the 1980s. In contrast, morphological data have only received serious modeling attention in the last half-decade (Matzke and Irmis 2018), and the models available are essentially modifications of the DNA models, which is suspect given the greater complexity of morphological evolution. These models seem to work well enough, at least with reasonably large and semi-complete datasets (Matzke and Wright 2016). However, many fossil datasets are small and incomplete. Models are imaginable where fossil sampling completeness varies strongly among regions, ecologies, and time-bins, but such models will require the combined expertise of computational biologists, paleontologists, biogeographers, and experts in the evolution of development (“evo-devo”) to develop successfully.

A third challenge concerns the different nature of Operational Taxonomic Units (OTUs, the tips of the branches) in living and fossil datasets. Due to the limitations in the fossil record and in the information that fossils retain, it is quite likely that biologists will be able to distinguish OTUs much more finely for living organisms than for fossil ones. Cryptic “species,” for example, are typically only recognized based on molecular data and would be indistinguishable in the fossil record. Similarly, species that are very similar skeletally, but distinguished by different coloration patterns, vocalizations, breeding times, etc., will be indistinguishable in the fossil record. For these and other reasons, there can be a major mismatch between the OTUs available in the recent and those identifiable even with a very complete fossil record.

Finally, apart from the above challenges, a fourth difficulty remains: are the OTUs that are measured in the present “the same kind” of entity as the lineages when a phylogeny goes back in time? It is entirely possible, even likely, they are not. The former are synchronic while the latter are diachronic (Mishler and Wilkens 2018, Fig. 3.1; see also Table 1 in Swartz and Mishler, Chap. 1 in this volume). What I am proposing is something like the process of coalescence that is well-known to population geneticists. Coalescence refers to how gene variants within populations mutate and replace each other rather rapidly on a timescale of millions of years, even with natural selection occurring. For example, all extant human mitochondrial DNA traces to a common ancestor only 250,000 years ago, meaning that in 0.25 million years, the mtDNA of “mitochondrial Eve” took over the entire human population. This process is continually ongoing throughout the genome of sexual species.

With OTUs that we can distinguish in the present (e.g., populations, subspecies, or closely related species), there may be a somewhat analogous process operating. It could be that recently diverged populations are evolutionarily ephemeral, with most of them getting wiped out rapidly by climate shifts, stochastic disasters (major floods or fires, diseases, etc.), competition from nearby relatives (Gutiérrez et al. 2007), etc. Methods that try to learn about the rates of diversification from dated phylogenies typically assume that all the branches of the tree, from the root to the tips, represent the same kind of entity such that the phylogeny could be sampled at any time-point. But what if the tips of the phylogeny consist of ephemeral clades with short evolutionary lifespans, and the deeper branches of the tree represent more stable entities, for example, a group of populations that occupy a distinct region or ecological niche? In this situation, it would be difficult to make accurate inferences about the rates of diversification, let alone extinction (Marshall 2017). The divergence events detectable deep in the tree would represent relatively rare events (e.g., successful colonization of new areas or niches), but the divergence events near the tips would mostly represent a more common and trivial process of partial genetic differentiation of populations on a landscape.

Biologists are hard at work attempting to address these issues. One avenue to try and address the issue of alleged “incomplete speciation” (we should note in passing the almost teleological speciesism assumption here) is to develop models that include heterogeneity in the diversification process. For example, so-called “extended speciation” models postulate not only rates of lineage splitting and extinction, but also a process of “speciation completion,” which assigns a rate at which an already-separated population becomes a “fully distinct species” (Sukumaran et al. 2021). Given this postulate, the speciation-completion process can be incorporated into a large joint model that delineates species and estimates the number of distinct species.

3.1 A Rank-Free Suggestion for Studying Biodiversity in Time

Even assuming terminal clades have been successfully identified in the present, the major challenge of a major mismatch between living OTUs and fossil OTUs remains. One “solution” is to essentially declare them incommensurate. In this case, inferences based on phylogenies of living species would provide information about speciation rates at the tips of a tree, fairly reliable signatures of processes just below the tips (Upham et al. 2021), and the fossil record would be used for information about larger clades in the deeper evolutionary past.

I suggest that the “science without species” paradigm might provide an alternative that is worth exploring when studying biodiversity over time. DNA-based phylogenetics can provide a more finely-resolved tree than can the fossil record, but the fossil record provides information about extinct lineages that is (at best) very difficult to recover from DNA phylogenies. How can these two data sources be combined, rather than merely analyzed separately and compared?

There is room for many approaches here, but one that seems underexplored is a multiscale analysis of phylogenetic diversity. By this I mean that any phylogenetic timetree can be queried not only for the number of lineages, N, recorded at any timepoint t—let’s call this N_t+0—but also for the number of lineages of a given age recorded at each timepoint. For example, the number of lineages recorded 1 million years before any given timepoint t would be N_t+1, and the number of lineages recorded 5 million years before any given timepoint t would be N_t+5. The advantage of this approach is that a metric like N_t+5 is likely to be much more robust to the above-identified vagaries of fossil sampling, such as fossil species and genus diagnosis, and species delimitation identified above. A combined DNA + fossils phylogeny might well serve as the basis for providing good inference of the rates of processes affecting N_t+5 through time, however difficult it might be to study N_t+0 through time.

But there is no need to pick any single time-resolution to study diversification. This would just impose another rank as the one true unit of analysis. Given a phylogeny that includes thorough fossil sampling, the metric is easily calculated at various time-depths. Instead of just relying on a Lineages-through-time (LTT) plot to represent diversification history, LTT curves calculated at a series of time-depths, for example, N_t+0 through N_t+20, could be calculated. Evolutionary models appropriate to shallow-time or deeper-time processes could then be fit, with the expectation that the latter are likely to be easier for time periods before the Recent.

4 More Science Without Species

I have outlined above where the phylogenetics revolution in biology can deemphasize, and perhaps totally dispense with the rank of “species” as the key unit of analysis in comparative biology, biogeography, and macroevolutionary diversification. However, there are other major research topics where the phylogenetics revolution has already thoroughly supplanted the species paradigm.

4.1 Gene Trees

In “first-generation” phylogenetics, roughly into the early 2000s, it was common for biologists to assume that if they sequenced three genes from each of 20 species, they should expect that all of those gene trees would be congruent—meaning that the phylogeny of each gene should show the same relationships between species. Disagreement between gene trees was annoying and was termed “incongruence.” Generally, the “solution” was to combine all the gene sequences into one large alignment and estimate the phylogeny from the combined dataset, providing a best-estimate phylogeny supported by the majority of data. Unresolved parts of the tree could, hopefully, be resolved by sequencing more genes and adding them into the analysis.

While this “concatenation” procedure was not crazy (and yielded reasonably accurate phylogenies), it left out the fact that genes evolve in populations, and recombine in each generation as chromosome segments are shuffled around. As a result, population genetics predicts that many individual genes will have phylogenies slightly different from the overall phylogenetic divergence history of the population, particularly if population sizes are large and the time between population-divergence events is small (Ashman et al. 2018). New phylogenetic methods jointly estimate individual gene phylogenies along with the population phylogeny.

4.2 Virus Trees

Viruses have never fit well into the “species” paradigm—it is even debated whether viruses are technically “alive.” Whatever the case, it is clear that viruses evolve through descent with modification, and thus phylogenies can be good models to represent their evolutionary history. Viral phylogenetics is a fascinating example showing the benefits of “pure science,” basic research that is done to advance scientific understanding rather than practical application, yet which nonetheless turns out to have tremendous application. The field of phylogenetics was largely developed by morphological taxonomists and molecular biologists who wanted to better understand the evolutionary history of their study organisms. One particularly important figure, Willi Hennig, is considered the father of phylogenetic systematics, but his taxonomic expertise was on Micropezidae, a family of flies. However, from humble roots, phylogenetics has grown to be a crucial tool in acute life-and-death situations, namely, viral outbreaks. I used to make this point at meetings by citing a Science paper tracing the 2014 Ebola Virus Disease (EVD) outbreak in West Africa (Gire et al. 2014), which concludes with: “Tragically, five co-authors, who contributed greatly to public health and research efforts in Sierra Leone, contracted EVD and lost their battle with the disease before this manuscript could be published.”

In the 2020s, of course, what was previously shocking now seems almost quaint, with the entire globe battling SARS-CoV-2, currently credited with over 6 million deaths, which is certainly a vast undercount. Every day, scientists are sequencing new variants of our opponent and adding them to an international phylogenetic database so we can watch when and where new strains emerge and spread (https://nextstrain.org/ncov/gisaid/global). Phylogenetics supplies the structure upon which we base the namings of new strains (“Alpha,” “Delta,” etc.). In the case of New Zealand, where I reside, for the first several waves of infection, every new COVID case was sequenced and added to a comprehensive phylogeny used to track viral incursions in real-time (Douglas et al. 2021; Geoghegan et al. 2021; see also https://nextstrain.org/community/narratives/ESR-NZ/ GenomicsNarrativeSARSCoV2/aotearoa-border-incursions). It was common to hear the nation’s Director-General of Health, Ashley Bloomfield, discuss phylogenies and clades before the national media in the daily 1 pm COVID briefing.

5 Conclusion

In summary, we have seen that a major challenge in modern studies of diversification and extinction concerns the units of analysis and how they are defined and recognized. Both “genus” and “species” are human-defined ranks imposed on the phylogenetic tree. The phylogenetic tree is the more fundamental reality that is produced by the macroevolutionary process, and it could include every level of gradation of genetic and morphological divergence. Once ranks are imposed upon it, a variety of methodological problems are created as scientists attempt to make these ranks standardized and comparable across different datasets and timescales.

The only figure that Darwin put in his 1859 Origin of Species was a (hypothetical) phylogenetic tree. While his title put focus on how the entities people call species originated, and this concern dominated evolutionary science for many decades, a careful reading of the Origin indicates that the whole point was to undermine the permanency of species in favor of a phylogenetic view. We can trace a clear path from his phylogenetic speculations in 1859 to the phylogenetics revolution of recent decades (Fig. 3.5) and the real-time global tracking of deadly pathogens today. While showing the impermanence of species was one of Darwin’s major revolutionary advances, it may be that phylogenetic thinking, which in time may allow “science without species,” could, in the end be the bigger of the two revolutions.

A line graph illustrates the number of scientific publications versus 1980 to 2020 that used the word phylogen. The line increases gradually at first and then increases exponentially with a minor dip near 2015. — **Fig. 3.5**

References

Ashman L, Bragg J, Doughty P, Hutchinson M, Bank S, Matzke N, Oliver P, Moritz C (2018) Diversification across biomes in a continental lizard radiation. Evolution 72(8):1553–1569
Article Google Scholar
Barnosky AD, Matzke N, Tomiya S, Wogan GOU, Swartz B, Quental TB, Marshall C, McGuire JL, Lindsey EL, Maguire KC, Mersey B, Ferrer EA (2011) Has the earth’s sixth mass extinction already arrived? Nature 471(7336):51–57
Article CAS Google Scholar
Casane D, Laurenti P (2013) Why coelacanths are not ‘living fossils.’ BioEssays 35(4):332–338
Article Google Scholar
de Queiroz K, Gauthier J (1994) Toward a phylogenetic system of biological nomenclature. Trends Ecol Evol 9(1):27–31
Article CAS Google Scholar
Douglas J, Geoghegan JL, Hadfield J, Bouckaert R, Storey M, Ren X, de Ligt J, French N, Welch D (2021) Real-time genomics to track COVID-19 post-elimination border incursions in Aotearoa New Zealand. medRxiv 2021.05.13.21257194
Google Scholar
Felsenstein J (1985) Phylogenies and the comparative method. Am Nat 125(1):1–15
Article Google Scholar
Geoghegan JL, Douglas J, Ren X, Storey M, Hadfield J, Silander OK, Freed NE, Jelley L, Jefferies S, Sherwood J, Paine S, Huang S, Sporle A, Baker MG, Murdoch DR, Drummond AJ, Welch D, Simpson CR, French N, Holmes EC, de Ligt J (2021) Use of genomics to track coronavirus disease outbreaks, New Zealand. Emerg Infect Dis 27(5):1317–1322
Article CAS Google Scholar
Gire SK, Goba A, Andersen KG, Sealfon RSG, Park DJ, Kanneh L, Jalloh S, Momoh M, Fullah M, Dudas G, Wohl S, Moses LM, Yozwiak NL, Winnicki S, Matranga CB, Malboeuf CM, Qu J, Gladden AD, Schaffner SF, Yang X, Jiang P-P, Nekoui M, Colubri A, Coomber MR, Fonnie M, Moigboi A, Gbakie M, Kamara FK, Tucker V, Konuwa E, Saffa S, Sellu J, Jalloh AA, Kovoma A, Koninga J, Mustapha I, Kargbo K, Foday M, Yillah M, Kanneh F, Robert W, Massally JLB, Chapman SB, Bochicchio J, Murphy C, Nusbaum C, Young S, Birren BW, Grant DS, Scheiffelin JS, Lander ES, Happi C, Gevao SM, Gnirke A, Rambaut A, Garry RF, Khan SH, Sabeti PC (2014) Genomic surveillance elucidates Ebola virus origin and transmission during the 2014 outbreak. Science 345(6202):1369
Article CAS Google Scholar
Gregory TR, Nicol JA, Tamm H, Kullman B, Kullman K, Leitch IJ, Murray BG, Kapraun DF, Greilhuber J, Bennett MD (2007) Eukaryotic genome size databases. Nucleic Acids Res 35(Database issue):D332–D338
Google Scholar
Gregory TR (2021) Animal genome size database, from http://www.genomesize.com
Gutiérrez RJ, Cody M, Courtney S, Franklin AB (2007) The invasion of barred owls and its potential effect on the spotted owl: a conservation conundrum. Biol Invasions 9(2):181–196
Article Google Scholar
Hendricks J, Saupe E, Myers C, Hermsen E, Allmon W (2014) The generification of the fossil record. Paleobiology 40(4):511–528
Article Google Scholar
Huey RB, Garland T, Turelli M (2019) Revisiting a key innovation in evolutionary biology: felsenstein’s “Phylogenies and the comparative method.” Am Nat 193(6):755–772
Article Google Scholar
Jones KE, Bielby J, Cardillo M, Fritz SA, O’Dell J, Orme CDL, Safi K, Sechrest W, Boakes EH, Carbone C, Connolly C, Cutts MJ, Foster JK, Grenyer R, Habib M, Plaster CA, Price SA, Rigby EA, Rist J, Teacher A, Bininda-Emonds ORP, Gittleman JL, Mace GM, Purvis A (2009) PanTHERIA: a species-level database of life history, ecology, and geography of extant and recently extinct mammals. Ecology 90(9):2648–2648
Article Google Scholar
Kling MM, Mishler BD, Thornhill AH, Baldwin BG, Ackerly DD (2018) Facets of phylodiversity: evolutionary diversification, divergence and survival as conservation targets. Philos Trans R Soc Lond. Ser B, Biol Sci 374(1763):20170397
Google Scholar
Lippold S, Matzke NJ, Reissmann M, Hofreiter M (2011) Whole mitochondrial genome sequencing of domestic horses reveals incorporation of extensive wild horse diversity during domestication. BMC Evol Biol 11(1):328
Article CAS Google Scholar
Marshall CR (2017) Five palaeobiological laws needed to understand the evolution of the living biota. Nat Ecol Evol 1:0165
Article Google Scholar
Matzke NJ (2016) The evolution of antievolution policies after Kitzmiller versus Dover. Science 351(6268):28–30
Article CAS Google Scholar
Matzke NJ, Irmis RB (2018) Including autapomorphies is important for paleontological tip-dating with clocklike data, but not with non-clock data. PeerJ 6:e4553
Article Google Scholar
Matzke NJ, Wright AM (2016) Inferring node dates from tip dates in fossil Canidae: the importance of tree priors. Biol Let 12(8):20160328
Article Google Scholar
Mishler BD, Knerr N, González-Orozco CE, Thornhill AH, Laffan SW, Miller JT (2014) Phylogenetic measures of biodiversity and neo- and paleo-endemism in Australian Acacia. Nature Commun 5(1):4473
Article CAS Google Scholar
Mishler BD, Wilkins JS (2018) The hunting of the SNaRC: A snarky solution to the species problem. Philos Theory Pract Biol 10(20220112):1–18
Google Scholar
Myhrvold NP, Baldridge E, Chan B, Sivam D, Freeman DL, Ernest SKM (2015) An amniote life-history database to perform comparative analyses with birds, mammals, and reptiles. Ecology 96(11):3109–3109
Article Google Scholar
Paradis E (2012) Analysis of phylogenetics and evolution with R. Springer, New York
Book Google Scholar
Rosauer D, Laffan SW, Crisp MD, Donnellan SC, Cook LG (2009) Phylogenetic endemism: a new approach for identifying geographical concentrations of evolutionary history. Molecular Ecology 18(19):4061–4072
Article Google Scholar
Sukumaran J, Holder MT, Knowles LL (2021) Incorporating the speciation process into species delimitation. PLoS Comput Biol 17(5):e1008924
Article CAS Google Scholar
Upham NS, Esselstyn JA, Jetz W (2021) Molecules and fossils tell distinct yet complementary stories of mammal diversification. Curr Biol
Google Scholar
Uyeda JC, Zenil-Ferguson R, Pennell MW (2018) Rethinking phylogenetic comparative methods. Syst Biol 67(6):1091–1109
Article Google Scholar
Valentine JW, (ed) (1985) Phanerozoic diversity patterns; Profiles in macroevolution. Princeton University Press
Google Scholar

Download references

Author information

Authors and Affiliations

School of Biological Sciences, University of Auckland, Auckland, 1142, New Zealand
Nicholas J. Matzke

Authors

Nicholas J. Matzke
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Nicholas J. Matzke .

Editor information

Editors and Affiliations

Department of Integrative Biology, University and Jepson Herbaria; University of California, Berkeley, BERKELEY, CA, USA
Brian Swartz
Department of Integrative Biology, University and Jepson Herbaria; University of California, Berkeley, BERKELEY, CA, USA
Brent D. Mishler

Rights and permissions

Open Access This chapter is licensed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license and indicate if changes were made.

The images or other third party material in this chapter are included in the chapter's Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the chapter's Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder.

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Matzke, N.J. (2022). Science Without Species: Doing Science with Tree-Thinking. In: Swartz, B., Mishler, B.D. (eds) Speciesism in Biology and Culture. Springer, Cham. https://doi.org/10.1007/978-3-030-99031-2_3

Download citation

DOI: https://doi.org/10.1007/978-3-030-99031-2_3
Published: 01 January 2023
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-99029-9
Online ISBN: 978-3-030-99031-2
eBook Packages: Biomedical and Life SciencesBiomedical and Life Sciences (R0)

Publish with us

Policies and ethics

Science Without Species: Doing Science with Tree-Thinking

Abstract

Similar content being viewed by others

Integrative taxonomy and the operationalization of evolutionary independence

Unpacking the species conundrum: philosophy, practice and a way forward

Working with the Tree of Life in Comparative Studies: How to Build and Tailor Phylogenies to Interspecific Datasets

1 From Comparative Methods to Phylogenetic Comparative Methods

2 Biodiversity in Space: Spatial Phylogenetics

3 Studying Biodiversity in Time: The Challenges Imposed by Ranks

3.1 A Rank-Free Suggestion for Studying Biodiversity in Time

4 More Science Without Species

4.1 Gene Trees

4.2 Virus Trees

5 Conclusion

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this chapter

Cite this chapter

Download citation

Publish with us

Navigation

Science Without Species: Doing Science with Tree-Thinking

Abstract

Similar content being viewed by others

Integrative taxonomy and the operationalization of evolutionary independence

Unpacking the species conundrum: philosophy, practice and a way forward

Working with the Tree of Life in Comparative Studies: How to Build and Tailor Phylogenies to Interspecific Datasets

1 From Comparative Methods to Phylogenetic Comparative Methods

2 Biodiversity in Space: Spatial Phylogenetics

3 Studying Biodiversity in Time: The Challenges Imposed by Ranks

3.1 A Rank-Free Suggestion for Studying Biodiversity in Time

4 More Science Without Species

4.1 Gene Trees

4.2 Virus Trees

5 Conclusion

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this chapter

Cite this chapter

Download citation

Share this chapter

Publish with us

Search

Navigation