How the GFA handles decisions on species concepts
The two conditions in the GFA can serve as a tool to analyze specific cases. We now deepen our analysis of the examples discussed earlier (Section 2), where we highlighted the role of non-epistemic values in decisions on the acceptance of the PSC. We show how the GFA handles the acceptance of classifications according to non-epistemic values and, by comparisons with other accounts of kinds and classification, how the GFA’s machinery works.
First, the conservation case. Three very straightforward criteria in conservation contexts are diagnosability, applicability and practicability. Species must be diagnosable to apply conservation measures to them – that is, we must be able to decide to which species a local population belongs in order to decide whether protective measures should be applied to it. This is why cryptic species have long been a problem for species conservation – a problem that is increasingly being solved by improved methods for fine-grained genetic analysis (Bickford et al., 2007; Chenuil et al., 2019). Applicability is the requirement that a species concept can be applied throughout as many domains of biodiversity as possible and does not only apply to, for example, animals. In this respect, the BSC does not meet this criterion well, as it only applies to sexually reproducing organisms and thus only to a minority of biodiversity. Wide applicability of a species concept is important to enable comparisons between different species with respect to their conservation status: if a species of plant and a species of animal are individuated using different species concepts, it is difficult to decide which should be prioritized in conservation contexts because the comparison is between two different kinds of things. Practicability is a criterion that follows from the fact that the actual implementation of conservation measures costs resources. If a species concept is used that recognizes too many species that are threatened with extinction, for example, there may well be insufficient funds and human power available to protect them all. Thus, a situation may occur in which a species concept that allows a fine-grained diagnosing of species yields many more species than we have capabilities to conserve – i.e., a conflict between diagnosability and practicability.
Participants in the debate generally agree that the PSC is favored over the BSC and other species concepts if only diagnosability and wide applicability were considered. The PSC generally yields very fine-grained diagnosable groupings, because it uses unique traits (character states that originated in one branch of the Tree of Life) in combination with common descent to group organisms (Reydon & Kunz, 2019).Footnote 12 The uniqueness of these traits guarantees diagnosability of the ensuing groups. The PSC often meets the diagnosability requirement better than the BSC, as groups defined by reproductive isolation are very difficult to diagnose in the wild or the laboratory (Agapow et al., 2004: 163). Also, it is applicable throughout the whole of biodiversity, setting it apart from other species concepts (ibid.), whereas the BSC is applicable only to sexually reproducing organisms. Thus, the PSC performs better than competitor concepts on two important epistemic values – diagnosability and applicability. This notwithstanding, conservation biologists criticize the PSC for non-epistemic reasons. As shown in Section 2, using the PSC tends to yield too many species, and it yields species that are too small and thus become threatened too easily, thus increasing the burden on limited conservation resources. Crucially, in the debate we find a trade-off between diagnosability and applicability on the one hand, and practicability on the other. While both sides are clearly important for conservation purposes, in this trade-off between epistemic and non-epistemic values the latter are taken to override the former.
Philosophical accounts of kinds and classification thus should not just make room for non-epistemic values to play some role in classifications (e.g., as factors operating in the background), but should explicitly allow the possibility that non-epistemic values override epistemic values. Therefore, an adequate account of kinds and classification cannot a priori prioritize epistemic values over non-epistemic values (or, for that matter, non-epistemic values over epistemic values), but must be open to either way of prioritizing and follow how researchers actually prioritize values in practice. While often epistemic values will be most prominent, in many other cases non-epistemic values override epistemic values (and we discuss examples in which this is the case). Given their strong (and sometimes even exclusive) focus on epistemic values, available accounts of natural kinds fail to meet this condition. In Section 4.2, we show how the GFA performs better in this respect in comparison to some other accounts.
Before getting to that issue, we want to address another aspect of the GFA, namely that functionality and grounding can come apart. Consider as an example the question whether gene-edited organisms (GEOs) should ontologically be counted as genetically modified organisms (GMOs) or as a separate category.Footnote 13 This is an important question in relation to the legal regulation of agriculture and food production (Wasmer, 2019; Lohse et al., 2020). While many consumer groups and environmental organizations argue that GEOs should be seen as GMOs, many plant scientists argue that GEOs are not GMOs, as GEOs do not include transgenes and could in principle have come into being due to natural spontaneous mutations. Behind the latter arguments often are non-epistemic aims: excluding GEOs from the category of GMOs makes research less complicated, among other things, because no special permissions are required and industry will have a greater incentive to fund research on GEOs when there will not be any obstacles to marketing GEO-derived foodstuffs.
While a classification that separates GEOs and GMOs as distinct kinds thus meets the non-epistemic aims of the involved scientists, this is insufficient to count the groups ‘GEO’ and ‘GMO’ as natural kinds under the GFA. This is because the achievement of the specific aims in question (making research easier, having less restrictions on research, generating incentives for industry funding) is not due to any specific way in which the two groups would be grounded in the world. While GEOs differ from GMOs in the lack of transgenes, whether or not this difference is relevant for making research easier fully depends on how we legally treat this difference. For one, transgenic organisms can also come into being in natural ways (by way of horizontal gene transfer), such that the aspect of natural vs. artificial origins does not ground a distinction between GEOs and GMOs as kinds. Furthermore, GEOs do have artificial origins, even though similar organisms could have also occurred naturally. Research on GEOs thus is not made easier because of any fact of the matter that GEOs are, or are not, GMOs – the entire effect of the classification is due to how we legally treat GEOs and GMOs, and not to any difference in nature between GEOs and GMOs that is relevant from the perspective of scientific theory. Both GEOs and GMOs result from technological interventions in organisms that change their genetic makeup – the difference is that for GMOs these interventions are perceived as involving genetic material that is “foreign” to the species, leading to differences in legal treatment. In both cases, the sequence of As, Cs, Ts and Gs of the organisms involved is changed – whether or not GEOs should be seen as GMOs (and thus whether or not research would be made less complicated) is purely a matter of how we legally treat these changes. That is, while the Functionality Condition is met in this case, from the perspective of those plant scientists that want to distinguish GEOs and GMOs, the Grounding Condition is not.Footnote 14 The groupings ‘GEO’ and ‘GMO’ satisfy the Functionality Condition, but the distinction in their functions is not grounded in the world (but only legally defined) and hence they should not be thought of as distinct natural kinds. This example shows how the GFA does not simply defer to science but enables a critical and normative attitude towards scientific classifications: its normativity derives from the Functionality and Grounding Conditions working in tandem. When functionality and grounding come apart, the GFA tells us that such functional groups – notwithstanding their functionality – should not be counted as natural kinds.
This shows how the GFA’s thorough naturalism explicitly does not involve a view that any aim is as good as any other, such that scientists should construct classifications to further any sort of aim (e.g., political aims, aims related to obtaining funding or improving prestige, etc.) and as philosophers we would have to accept all these classifications as involving natural kinds. The GFA does defer to science with respect to the aims for which classifications are constructed, with the caveat that such aims should be informed by values that are democratically endorsed, especially by relevant stakeholders (see Section 3). But at the same time the GFA poses the requirement that a classification actually succeeds in achieving these aims and that the relevant field of science can explain why this is the case.Footnote 15 The former requirement is embodied in the Functionality Condition, the latter in the Grounding Condition. In the first step of the analysis, the Functionality Condition is used to identify successful kinds in science that are candidates for being attributed natural kind status: success in achieving the aims for which a kind was posed thus is a necessary but not sufficient condition for accepting it as a natural kind. In the second step this group of candidates is narrowed down, using the Grounding Condition to identify those kinds of which the successful use can be explained by their being grounded in relevant aspects of the world. Thus, the GFA works to identify natural kinds and explain their successful use in scientific research.
This can also be seen in our example of the classification of organisms into species. The GFA reconstructs the debate on species concepts in conservation contexts as follows. As it meets the applicable epistemic requirements, the PSC seems to meet the GFA’s Functionality Condition, such that the groups based on it seem to be putative natural kinds (and we would have to invoke the Grounding Condition to decide whether they actually should be given natural kind status). Adding non-epistemic values into the picture, we see why the Functionality Condition is in fact not met by the PSC. While diagnosability furthers the aims of conservation to some extent (as a minimal level of diagnosability is required for conservation efforts to be effective), it yields too many diagnosable groups. In terms of the GFA, the opponents of the PSC (Frankham et al., 2012; Zachos et al., 2013) argue that it fails to meet the Functionality Condition, as it does not yield groupings that can feature in successful conservation efforts.
Had we only considered epistemic aims in our analysis, the conclusion would have been that the PSC meets the Functionality Condition and criticism of the PSC was misguided. Considering both epistemic and non-epistemic aims allows us to give an appropriate reconstruction of the debate and fundamentally changes the conclusion regarding criticism of the PSC. Note that in this analysis, the Functionality Condition exerts some normative force, as it forces us to ask what the actual aims of a particular research context are. In the case considered here, the main aim is the conservation of biodiversity, while the subordinate epistemic aims are diagnosability and broad applicability. This allows us to see that authors rightly criticize the PSC (if their empirical claims are correct), because they emphasize the actual aims of the program for which the classification is intended.Footnote 16 As the Functionality Condition is not met by the PSC in this case, the Grounding Condition doesn’t come into play, as the GFA tells us to first test whether the Functionality Condition is met and second whether the Grounding Condition is met.
We recognize, however, that talking about “the actual aims” of a research context is a vexing issue.Footnote 17 This is not a specific problem for our account, but a more general problem in the literature on science and values. We cannot solve it here, but we want to highlight how we deal with it. First, by examining cases from uncontroversial areas of science (rather than projects that promote controversial aims), we can show how our account works in several actual cases of scientific practice. We do this in the present paper as well as in other work (Ereshefsky & Reydon, 2022; Reydon, 2021). Second, we acknowledge that it is possible for disagreement to exist within one research context on the aims of research and relevant values, such that it is impossible to identify the aims of research for this particular context. But there is a way to treat such cases, we suggest, namely by thinking of a context in which disagreement on aims exists as encompassing multiple distinct classificatory programs, each pursuing its own aims and coexisting with the other classificatory programs within the larger research context. Our notion of classificatory programs makes this possible, as a classificatory program is defined in part by its aims (i.e., its motivating principles), such that disagreement on aims exists between programs. For example, the various competing approaches in biological systematics (see Hull, 1988) can be considered different classificatory programs operating within the same larger research context but with different aims in view and, as Hull described in detail, heated disagreements between programs. Classificatory programs as we understand them thus are very localized and because the GFA assesses the kinds that are posed in one classificatory program at a time and does not perform comparative assessments between different classificatory programs, it is able to avoid disagreements about aims.
Now consider the context of research on human health. As this is a different context of research, and aims and values are context-dependent, we might reach a different conclusion here. Note first that (as discussed above) Attenborough (2015) argues that when it comes to the prevention and eradication of malaria the capability of diagnosing cryptic species favors the PSC. Using the PSC, we can achieve a more fine-grained taxonomy of species in the genus Anopheles, enabling more differentiated associations between the local presence of a particular type of mosquito and connected risks for human health than on the basis of traditionally recognized species. As Attenborough shows (2015: 144; Attenborough’s Table 7.2), using the PSC allows us to split two recognized species of Anopheles into seven species differing in their ranges of occurrence, population densities, and the extents to which their organisms can carry malaria parasites and to which they prey on humans or rather on non-human animals.Footnote 18 Here, too, diagnosability is what counts. But because of the aforementioned differences between the species recognized using the PSC, the epistemic aim of diagnosability here aligns with the non-epistemic aim of the promotion of human health in regions where malaria occurs.
Here, too, the Functionality Condition exerts normative force, telling us to prioritize the actual aims of the research program under consideration (malaria epidemiology, aiming to promote human health). Following this guideline, we see that the PSC fits the actual aims of this specific context better than other species concepts (assuming the scientific claims in Attenborough’s paper are correct). The Grounding Condition adds to this by requiring that the groups identified by the PSC represent aspects of the world relevant to the aims in malaria research, i.e., shared traits due to common descent that pertain to whether or not mosquitoes can carry malaria parasites and whether or not they prey on humans. The PSC rests on common descent as well as unique traits, and as such highlights those traits that explain why different groups of mosquitoes have different relevance when it comes to malaria research and prevention. The Grounding Condition thus is met and the available body of biomedical knowledge explains how it is met. The analysis allows us to see that Attenborough is right in endorsing the PSC (if his empirical claims are correct), because he highlights the actual aims of the program for which the classification is intended. Here, too, non-epistemic values are decisive and play a crucial role in the choice made by the researchers.
A legitimate worry (voiced by one of the anonymous reviewers of this paper) would be whether the assumption is correct that there is general agreement about the aims of research among the researchers in a particular context of research. In the case discussed above, for example, a question would be whether the local aims of research on malaria in Pacific regions and of research on malaria in Africa are sufficiently aligned for the same classification to serve both sets of aims. While we agree that this may be an issue, it is not an issue for the GFA to deal with. As explained above, the GFA works by taking the aims as explicated by researchers working in a particular classificatory program and assessing whether the classification that is proposed in that program successfully meets these aims and the program is able to explain how it does so. When researchers disagree on research aims, the GFA treats this situation as a disagreement between classificatory programs that either compete within the same context of research or apply to different subareas of research in the same context. In the malaria case, for example, researchers might find that the promotion of human health with respect to malaria in Pacific regions requires a different classification than in Africa. As a naturalistic account of kinds, the GFA would then follow the researcher’s findings and assess both classifications separately, asking whether each meets the specific aims it was set for. In the case of competing programs, for example one group of researchers advocating one classification for all malaria research and another group advocating different classifications for different geographic regions, the GFA would be able to exert some normative force. The GFA could be used to assess the competing programs for whether they group organisms into natural kinds. If one of them does not, this would indicate a problem for that program with respect to the grounding of its classification in the world. If all do, this would indicate a need for the different programs to examine how their classifications could be related to each other, for example in a hierarchical manner.
The GFA in comparison
In both examples involving species we see how the GFA can make sense of the scientists’ reasoning. It can do this because it does not assume that all good scientific classifications further the same epistemic aim or aims, but instead examines classifications at a local level, taking the aims of local classificatory programs as the basis for analysis and being fundamentally open to the possibility that non-epistemic aims override epistemic aims. In this respect the GFA contrasts strongly with other prominent accounts. To clarify the contrast, we briefly look at Khalidi’s (2013, 2018) “causal nodes” account, Slater’s (2013, 2015) Stable Property Cluster account, and Boyd’s (1991, 1999) HPC account as examples.
On Khalidi’s account, legitimate natural kinds are groupings of entities that represent nodes in causal networks (i.e., in the causal structure of the world). Khalidi (2013, Section 4.7) holds that classifications should represent such nodes and excludes non-epistemic values from playing a role in determining natural kind status. On Slater’s account, legitimate natural kinds are groupings of entities that represent stable patterns that we find in the world – i.e., stably recurring patterns of similarity between entities. Because patterns can be stable to higher and lesser degrees, on Slater’s account groups can be attributed kind status to higher and lesser degrees. Slater expresses this with his notion of “natural kindness” – being a natural kind (natural kindness) is not a yes-or-no matter but comes in degrees. Khalidi and Slater use global criteria that all kinds in all contexts must meet, and that do not differentiate between kinds that meet them. On Khalidi’s account any classification that represents nodes in the world’s causal nexus is as good as any other – if a grouping represents a node in the world’s causal nexus, it should be given natural kind status. Similarly, on Slater’s account any stable pattern is as good as any other – if a grouping of entities represents a stable pattern of property co-occurrence and the degree of stability satisfies the norms of a discipline, it should be given natural kind status. What the two accounts lack are filters that would allow us to distinguish more from less significant causal nodes and more from less significant stable patterns, respectively.
On Slater’s account, for instance, non-epistemic interests do not play a role in determining whether a stable pattern is a natural kind or not – rather, epistemic interests determine which of the many available stable patterns (or: kinds) researchers focus on and which they disregard. A fundamental difference between Slater’s account and the GFA is that non-epistemic values and interests are internal to the GFA but external to Slater’s account, as for him they do not play a role in determining whether or not a group should be attributed natural kind status. Natural kind status is attributed solely on the basis of being a stable cluster of properties for the epistemic purposes of a discipline (Slater, 2015: 396). Non-epistemic values only play a role in selecting which kinds to focus on. On the GFA, in contrast, interests are co-constitutive of what it is to be a natural kind, as kinds must meet the Functionality Condition. In this sense, non-epistemic values are internal to the GFA. The GFA thus can reject patterns that fail non-epistemic interests and, conversely, allow groups of interest to researchers without being connected to any stable property cluster. By internalizing non-epistemic values, the GFA (in contrast to Slater’s account) can highlight factors that make the difference for scientists in the examples we discussed.
Note that on both Khalidi’s and Slater’s accounts we would have to conclude that Frankham and co-authors, and Zachos and co-authors are wrong when arguing against the use of the PSC in conservation contexts. After all, the PSC works perfectly well when it comes to identifying causal nodes (here inheritance from a common ancestor can be interpreted as a causal node – see Ludwig, 2018: 44-45) or stable patterns (shared traits that uniquely define a group, even if not all members actually exhibit them). Also, we would have to conclude that while Attenborough is right to prefer the PSC in the context of malaria epidemiology, he is right for the wrong reasons: he should have emphasized diagnosability of groups based on causally sustained (genetic, morphological and behavioral) similarities, or on the basis of the fact that the PSC is more powerful than competing concepts when it comes to identifying stable patterns (as it recognizes more finely grained patterns). On Khalidi’s and Slater’s accounts of classification, Attenborough should not have preferred the PSC on the basis of whether it is able to individuate groups of mosquitos that are important for promoting human health.
The crucial problem with both Khalidi’s and Slater’s accounts, then, is that they miss the aims that in actual research contexts are set by the non-epistemic values endorsed by the community of researchers. As we have seen, these non-epistemic aims do not necessarily distinguish between different aspects of the causal structure of the world or between more and less relevant stable patterns – the aims highlighted in Khalidi’s and Slater’s account, respectively.Footnote 19 But in the examples we discussed, the non-epistemic aims matter the most and are taken by researchers to override epistemic values. One important thing these examples of actual scientific practice show is that the GFA’s Functionality Condition is not met overarchingly by one epistemic aim in all scientific contexts. Whether it is met depends on the research context and the goals set in that context. The assessment whether a particular classificatory theory (in the cases considered, a particular species concept) yields natural kinds has to be carried out locally, as kinds are strongly context-dependent – which is an aspect of kinds that other accounts miss.
To be sure, the context-dependency of classifications and the kinds featuring in them is acknowledged in some of the available accounts. Boyd’s account, for example, explicitly conceives of kinds as relative to disciplinary matrices (Boyd, 1999: 148): natural kinds are groupings that accommodate the inferential practices within a particular disciplinary matrix to the causal-mechanical structure of the world. Boyd does not think of disciplinary matrices as corresponding to scientific disciplines as these are commonly understood, but explains that a disciplinary matrix is “a family of inductive and inferential practices united by common conceptual resources, whether or not these correspond to academic or practical disciplines otherwise understood” (Boyd, 1999: 148). This, however, is not a thoroughly local context-dependency, as the disciplinary matrices that Boyd refers to are typically located at comparatively high levels of organization, ranging over one or multiple disciplines. The GFA is much more fine-grained in this respect and aims to be thoroughly local (in line with Reydon, 2016; see also Conix, 2019: 33). It evaluates the status of kinds and classifications according to the aims of local research contexts and programs, which may be a specific school of thought or a concrete practical project within a small subdiscipline.Footnote 20
Furthermore, while on Boyd’s account kinds depend on nature as well as on human classificatory activities (his “bicameralism thesis”), there are important dissimilarities to the GFA. On Boyd’s account, only homeostatically supported property clusters are recognized as the aspects of nature on which kinds depend. The GFA is much less restrictive in this respect, as grounding can be realized in many ways and the GFA does not presuppose that there is only one way of depending on the world that determines kindhood. Also, Boyd’s account exclusively takes induction and inference as the aims for which kinds and classifications are constructed, whereas the GFA is completely open with respect to the aims of a classificatory program (including non-epistemic aims). So, on both counts the GFA is more naturalistic than Boyd’s account.Footnote 21
While we have discussed several accounts that strongly contrast with the GFA, we acknowledge a general trend in the literature on natural kinds with which the GFA aligns. As mentioned at the beginning of this paper, the role of non-epistemic values in classificatory context has been widely acknowledged and one could say that there is a growing movement among authors on this topic to allow non-epistemic values to contribute to the setting of classificatory aims (e.g., Dupré, 1993; Magnus, 2012; Ludwig, 2014, 2016; Brigandt, 2015; Conix, 2018). Our aim here was to contribute to that movement by proposing a general approach to assessing under which conditions epistemic as well as non-epistemic values can function to individuate natural kinds in scientific contexts. Furthermore, we have attempted to show how the GFA allows us to accommodate important aspects of scientific practice that available accounts of kinds and classification do not accommodate, enabling us to reconstruct the decisions that scientists make better than other accounts. These aspects are: that non-epistemic values play important roles in classifications, and that non-epistemic values may override epistemic values because the ultimate aims of some research contexts are non-epistemic.