O’Hara (1997:327) remarked that “beginning students in biology should be taught how to read trees [...] just as beginning students in geography need to be taught how to read maps”. My findings drastically support this opinion. None of the undergraduate or graduate students in my (admittedly small) sample was able to correctly interpret a simple cladogram (Fig. 1). The cladogram was certainly not the first evolutionary tree that the students had met, since most had studied biology for several terms, including introductory courses in evolutionary biology.
It does not require many sentences to explain how a cladogram should be read. The concept of evolutionary relatedness is not hard to grasp once it is explained. If it is not, however, erroneous conceptions seem to be able to survive for several years during prospective biologists’ educational careers. At the end of my course all university students were able to give correct interpretations of cladograms. Matters differed for the students of the regional university college. These students were asked which of the small cladograms in Fig. 3b–i were compatible with the large cladogram (Fig. 3a). This latter question was posed after the meaning of cladograms had been explained and after a considerable amount of time had been spent practicing the reading of cladograms. Still, only one answer out of fourteen correctly identified all of the three compatible cladograms. The remainder failed to identify at least one compatible cladogram and/or indicated at least one incompatible tree as compatible. This sobering result might be related to the fact that many of the college students simply needed some biology courses in order to study, e.g. fishery. They might thus not have been genuinely interested in biology per se, so that these findings would need to be replicated with university students. Still, it is independent evidence that reading cladograms is very much an acquired ability rather than one that can be presupposed.
In their answers, the students proved to pay more attention to how the taxon names at the tree tips were ordered along the left–right axis, than to the topology of the cladogram. Given that the information of cladograms is conveyed in the branching order of taxa, while the left–right ordering is arbitrary, this wrong focus necessarily leads to wrong conclusions.
The main problem is presumably that students simply have not been taught how to read cladograms. Teachers may often assume that cladograms are self-explicatory graphic devices to illustrate phylogenetic relationships, when in fact their interpretation is not trivial. As Halstead (1978:760) put it: “Cladograms are difficult enough for experts in the field to comprehend fully”. Ironically, what he was advocating as the obvious alternative was phylograms—which were meant to not only express phylogenetic relationships, but, in addition, phenotypic similarity, distribution through geologic time and species number. As such, phylograms are not only harder to interpret for untrained readers than cladograms. They may also convey incorrect information on any one of the aspects because they represent a two-dimensional compromise of several multidimensional measures.
Regrettably, the presentation of systematics in text books has been lagging behind the cutting edge of systematic research by several decades. It is thus only quite recently that most biological text books have replaced phylograms with cladograms. A less appreciated fact is that even displaying cladograms does not guarantee that the information provided is unbiased, as many cladograms are drawn in an anthropocentric fashion (Sandvik 2008). It might thus well be that one source of confusion for students is the ambiguity of evolutionary tree drawings in text books. Drawings of evolutionary trees cannot be clearer than their authors’ thoughts about evolutionary processes. And the latter have long been rather muddled, as can be exemplified by the occurrence of “stem groups” in many text book trees (e.g. Villee et al. 1984: Fig. 19.6; Willmer 1990: Fig. 14.2, which has even been reprinted by a number of other text books). However, stem groups have never existed in nature. A taxon above the species level cannot possibly give rise to other taxa—nor to anything else (Ghiselin 1997). One might say, therefore, that stem groups are just another way of expressing (or, worse still, trying to hide) ignorance. That some taxonomists have chosen to even give scientific names to their ignorance (e.g. “Procoelomata”; Bergström 1989), does not really make the matter more transparent for biologists under education. No wonder, then, that stem groups occurred in 54% of the student drawings.
One problem is thus that many tree drawings which students meet during their studies are biased and thereby distort the evolutionary understanding of their readers. Another is that this distorted understanding even impinges on the interpretation of correctly drawn phylogenetic trees. In other words, both problems re-enforce each other. I have demonstrated elsewhere that even cladograms in phylogenetic text books are biased in an anthropocentric way (Sandvik 2008). In terms of ordering of taxa and differential resolution of branches, otherwise correct cladograms may distort the understanding of evolution. These findings make the evidence presented here even more alarming: If taxa are ordered in an anthropocentric manner in most cladograms, and if students rely more on ordering than topology, this affects the interpretation of relationships. The taxon containing our own species is most often placed in the rightmost position in cladograms (Sandvik 2008). This can be illustrated with Fig. 4, which displays the phylogeny of Osteognathostomata in an anthropocentric manner. The topology of the cladogram is the same as in Fig. 1. Accordingly, many untrained student of biology can be expected to conclude that coelacanths are more closely related to ray-finned fishes than to mammals.
The students’ inability to answer question 1 (Fig. 1) might be argued to have other reasons. An obvious alternative explanation would be that the error was caused by a simple misunderstanding of the word “relationship”, and that it is sufficient to tell students that relationship, as used in evolutionary biology, is a technical term meaning “the relative recency of common ancestry” (Mayr 1974). However, the question to the students was posed in Norwegian, were the word “slektskap” only has the latter meaning. In other words, while knowledge of the precise meaning of “relationship” is necessary (especially in English), it is not sufficient for students to understand cladograms.
In passing I would like to mention a speculation on the reason why phylogenetic systematics and cladistic methodology was rather quickly accepted in Germany (Ax 1977; Remane 1956; Schlee 1969), but provoked intense debates in English-language journals (verifiable with almost any issue of Systematic Zoology from the 1970s). I suspect that part of the problem was semantic. The German word for “relationship” is “Verwandtschaft”, but while the English word has all kind of abstract and symbolic connotations, including overall similarity, the German term is reserved for true, genealogical bonds (as is the Norwegian “slekt”, see footnote 1). The statement that for instance the lungfish is more closely “verwandt” to the cow than to the salmon is quite uncontroversial in German. On the other hand, the statement that the lungfish is more closely related to the cow than to the salmon, was able to create a heated discussion—which was only peripherally concerned with the actual phylogeny of the groups concerned (Gardiner et al. 1979; Halstead 1978; Halstead et al. 1979).
A final observation concerns the importance attributed to Linnean categories (i.e. labels such as “family”, “order”, etc.) by students. It is well-established that Linnean categories above the species level do not carry information, that they are not comparable across taxa, and that it is entirely arbitrary to which taxa they are assigned in the first place (de Queiroz and Gauthier 1992; Donoghue 2001; Ereshefsky 1994, 2001, 2002). Many taxonomists have abandoned categories altogether, simply referring to taxa by their names. However, few students are aware of the arbitrariness of Linnean categories. In many undergraduate text books, the enumeration (!) of animal “phyla” or insect “orders” seems to be more important than the discussion of interrelationships between the taxa concerned (e.g. Barnes et al. 1998). This leaves students with the impression that categories must express something. Unless told otherwise, they tend to make up their own explanations. As my results indicate, only a tiny proportion of students seems to question the existence and reality of Linnean categories. Of course, blank answers to question 2 do not necessarily indicate the deliberate rejection of categories—the two students may also have wished to give a number, but were unable to “recall the correct one”. On the other hand, question 2 was clearly a leading (not to say, loaded) question, which may have biased the respondents into thinking that a number was the only acceptable answer. Still, findings from question 3 show that at least 27% of the students perceive the distinction between “orders” and “classes” to be a reflection of real differences between natural levels of organisation.
The findings presented here re-enforce earlier reports that reading cladograms is an ability that has to be practiced (O’Hara 1992, 1997). Even though the transition from developmental thinking to tree thinking is more or less completed in the science of systematics at the collective level, it has to be accomplished anew by every generation of biology students on the individual level. This must not be forgotten when teaching phylogenetics or writing or illustrating text books. Other topics that need to be addressed explicitly are the artificiality of Linnean categories, the non-existence of stem taxa, and the arbitrariness of paraphyla. It does not seem to be sufficient to “get the facts right” in teaching and text books. Students tend to fall back to group and developmental thinking unless explicitly told otherwise.