Keywords

10.1 Introduction

Phylogeography, hybridization, and speciation are inextricably intertwined. To help explain this and to chart a course for this chapter, some definitions and discussion are needed at the outset.

Phylogeography was originally conceived as a bridge between population genetics and systematics formed by analyses of mitochondrial DNA (mtDNA) (Avise et al. 1987). Its goals have always been to describe patterns of genetic diversity within a species across its geographical distribution and to understand the processes driving those patterns. Its molecular scope grew to include multilocus data, a term that in practice describes inclusion of up to 30 single regions of the nuclear genome. Currently, the tools of genomics enable genetic diversity to be studied across the entire genome, often including thousands of single sites that vary within and between species—single nucleotide polymorphisms or SNPs. Phylogeographic data are usually characterized by extensive sampling of individuals across the range of a species. This contrasts with the smaller number of individuals and higher number of loci that can suffice when estimating phylogenetic relationships among species and higher taxa (Nei and Roychoudhury 1974). Phylogeography and phylogeny are increasingly less discrete in scope (Smith et al. 2014a) but are reasonably seen as different parts of a continuum (Edwards et al. 2016).

Phylogeographic data enable tests of biogeographic hypotheses, description of the evolution of reproductive isolation, inference of processes underlying the origin, distribution and maintenance of biodiversity, and inferences about temporal changes in a population’s physical and biotic environments (Beheregaray 2008). Revisions of species boundaries and taxonomic changes are prompted by phylogeographic data (especially when integrated with other data sets). Examples in birds abound [e.g., from Australia (Dolman and Joseph 2015, 2016; McLean et al. 2017a, b); from Eurasia (Olsson et al. 2013); from the Americas (Cadena and Cuervo 2010; Harvey and Brumfield 2015; Oswald et al. 2016); from Africa (Bowie et al. 2009; Voelker et al. 2013)]. Often, phylogeographic studies reveal much previously unknown (cryptic) but strongly geographically structured genetic diversity within a species. When researchers understandably try to understand the drivers of that diversity (Smith et al. 2014b; Harvey et al. 2017), one often has a sense of unfinished taxonomic business. Alternatively, cryptic population structure revealed by phylogeography can open very different, non-taxonomic windows of research into how natural selection has operated at the molecular level (Ribeiro et al. 2011; Pavlova et al. 2013). I will examine some cases in this chapter.

Hybridization has been defined many ways: reproduction between different species (Rheindt and Edwards 2011) or between genetically distinct populations producing offspring of mixed ancestry (Abbott et al. 2013), for example. In molecular terms, hybridization can be seen as an invasion of one genome by another (Mallet 2005), and genomic approaches to hybridization are now commonplace (Payseur and Rieseberg 2016). Perhaps not so often appreciated, among ornithologists at least, is that hybridization can promote rather than impede differentiation and speciation. Hybridization can select for reinforcement of reproductive isolation and cause hybrid speciation (Barton 2013; see Italian Sparrow Passer italiae example below). Also important is what hybridization is not. It is not something that “goes wrong” long after speciation is complete nor is it some “weakness” in how two species maintain separation or reproductive isolation (see Mallet 2005).

Genomic data in phylogeography reveal even higher frequencies of movements of genes between species than had been appreciated (Edwards et al. 2016; Mallet et al. 2016): species boundaries are far more porous than supposed (Chan and Levin 2005). Introgression, the one-way movement of genes from one population into another after hybridization, and two-way gene flow explain this. The result is known as reticulation and is a rearrangement of genetic diversity within and among species as they diverge from their common ancestor. Reticulation at different taxonomic levels will recur in this chapter.

Ornithologists have long studied hybridization and the regions where it is thought to occur—hybrid zones (Sibley 1957; Short 1969; Ford 1974). Descriptions of species hybridizing are common in the literature. Books and review papers have been written about it in birds (Grant and Grant 1992; McCarthy 2006).

Speciation is the study of how two or more species evolve from a common ancestor. In birds, it too has been the subject of books (Price 2008; Newton 2004) and reviews (Friesen 2015 for marine birds; Toews et al. 2016a). Reproductive isolation between two populations is perhaps the most unequivocal criterion by which researchers will accept that two species are involved. As with hybridization, we need to stress what speciation is not. It is not the same as species delimitation, but the process by which populations diverge from their common ancestor is a component of speciation that may or may not lead to its completion. Different lenses of data, say from phylogeography, plumage variation, or ecology and behavior, coupled with different ways of defining species, can mean that different workers consider speciation complete (and so worthy of species delimitation) at different stages of the process of diverging from a common ancestor. This concept has been captured well in de Queiroz’s (2007) well-known figure (Fig. 10.1). The “gray zone” described by de Queiroz (2007) is very familiar to ornithologists faced with taxonomically interpreting cases where genetic distances between two populations under study and calculated from DNA sequences are between 0.5% and 2% and few phenotypic differences are discernible (see Roux et al. 2016).

Fig. 10.1
figure 1

Well-known and extraordinarily helpful figure from de Queiroz (2007) to describe how conflicts arise among different researchers when recognizing that speciation is or is not complete and when viewed through different lenses of data and species concepts (SCs)

How reproductive isolation evolves is a contentious and active area of study. What it offers to species delimitation in contrast to details of how it evolves likely have no simple demarcation as this book will show. Similarly, different evolutionary and environmental histories of birds within and among different regions mean that there is no single level of genetic divergence between two populations that indicates they have completed speciation. Against that, empirical and theoretical work is showing that the process of divergence from a common ancestor can reach a rapid transition to a “tipping point” resulting in reproductive isolation (Gompert et al. 2017; Nosil et al. 2017; Yamaguchi and Iwasa 2017).

Speciation research can only benefit from case-by-case phylogeographic study of how genetic diversity is geographically distributed within and among species whose origins we seek to understand along with consequences of hybridization (cf. Price 2008). The concepts by which we try to recognize the limits between species and so apply names to them are covered more explicitly in other chapters. One point will recur: It is often hard, if not foolhardy, to compartmentalize ongoing evolution by placing names too rigidly on many populations. Inevitably, we need to try and do so, because we need to communicate about species and their component populations, many of which are plainly and unquestioningly distinct. Yet in more and more cases to which we apply modern techniques, we learn that populations are evolutionarily more ephemeral than we perceive (Harvey et al. 2017) and nomenclature captures one phase of this change.

Tying these somewhat anticipatory threads together to form a foundation for the rest of this chapter, I note that hybridization revealed in phylogeographic studies is increasingly being stressed as a process that simply will occur almost inevitably during speciation. Gene flow happens during divergence (Smadja and Butlin 2011). Divergence and speciation in birds are not instantaneous events but can happen more easily and quickly than often thought (Mallet 2008; see Friis et al. 2016 for juncos). The footprint of that gene flow is detectable by analytical methods now in use, but opinions differ about its effect on and potential role in the process of speciation (Butlin and Ritchie 2013). Barton (2013), for example, considers it negligible noting that gene flow of favorable alleles across a broad, two-dimensional habitat may not slow divergence; speciation between populations in geographic contact is then almost as easy as between those completely isolated from each other.

I now move to closer examination of just a few current topics in phylogeography, hybridization, and speciation in birds. The choice of topics discussed, while perhaps eclectic, is designed as an entrée to the explosively burgeoning literature. Cited examples will illustrate research already done or how different findings from across the globe often make similar or even sometimes contrasting points and I hope they will stimulate further research.

10.2 Some General Observations from Avian Phylogeography: Historical Population Size Changes and Introgression

A recurrent finding in many phylogeographic studies has been the genetic footprint of major reductions in population size, i.e., to refugia, at the Last Glacial Maximum (LGM) ~20,000 years ago followed by population (and presumably geographical range expansions) since then. This has been particularly common in European (Hewitt 1999) and North American studies (Boulet and Gibbs 2006; Friis et al. 2016) where widespread glaciation was the driver and even in Antarctica (Younger et al. 2015). Similar patterns in Australia are commonly found, but due to widespread cold, windy aridity at the LGM, not glaciation (Joseph and Omland 2009; Toon et al. 2010; Kearns et al. 2011). It is not a universal result, however. The Australian Grey Butcherbird Cracticus torquatus evidently expanded at the LGM and hybridized with a non-sister species, the Silver-backed Butcherbird C. argenteus (Kearns et al. 2014; see also Dong et al. 2017). Still other studies show important changes in population size having occurred before the LGM (in Brazil, Batalha-Filho et al. 2012; in Africa, Bowie et al. 2006; in Australia, Chan et al. 2014). A distinction to make here is that the species involved usually evolved earlier, in the Pliocene at least, whereas the Pleistocene molded their present-day diversity. This has been stressed in Africa (Voelker et al. 2010) and in the lowland Neotropical rainforests (Smith et al. 2014a, b) where dispersal and differentiation superimposed on earlier, large-scale landscape events such as the uplift of the Andes has been a major driver of avian speciation (see also Ribas et al. 2012). To complicate things, very recent speciation (middle Pleistocene onward) appears to have occurred too in North America, Europe, and Australia (Toews and Irwin 2008; Dolman and Joseph 2016; Friis et al. 2016). The point here is that the interplay between speciation before and sometimes during the Pleistocene relative to the role of the Pleistocene in shaping present-day genetic diversity within species is complex and idiosyncratic.

Introgression repeatedly emerges from phylogeographic data. Introgression during divergence of populations and species from their common ancestor is clearly a pervasive phenomenon (reviews in Rheindt and Edwards 2011; Edwards et al. 2016; Ottenburghs et al. 2017). The application of coalescent theory in population genetics has been instrumental here. By this approach, genetic diversity within a species is traced back in time to where it coalesces in a common ancestor. This is contrasted with the gene frequency-based approach of classical population genetics. Peters et al. (2007) were perhaps the first to use coalescence in detecting ancient introgression (∼14,000 years ago) in birds. They detected the widespread occurrence and high frequency of mtDNA more typical of one species, the Asian Falcated Duck Anas falcata, in another more widespread species, the Gadwall A. strepera of the broader Holarctic. Peters et al. (2007) were perhaps the first to use coalescence theory in birds to carefully distinguish between the two reasons why two or more present-day species can share genetic diversity. One reason is that the different species have hybridized. The other reason, which they rejected, but which is often the favored explanation of such data, is that they still share much of their common ancestor’s genetic variation. That is, unlike say plumage that we can readily see as diagnostic between two species and which can evolve rapidly under natural selection, the genetic marker being studied may evolve at a slower and probably more clock-like rate and so has not yet had time to “catch up” with the more visible species-specific markers of plumage. This phenomenon is termed incomplete lineage sorting (Joseph et al. 2006, e.g., in Artamus woodswallows). In a further study of ducks (Peters et al. 2014) as well as in many other papers in avian phylogeography (Degnan 1993), it has become clear that while mtDNA will always be a useful marker to include in a phylogeographic study, it can mislead, if not carefully interpreted. We will see further examples of this elsewhere in the chapter.

Phylogeographic studies also can detect the difference between recent and current hybridization and past gene flow. Recent hybridization will be characterized by first-generation (F1) hybrids and backcrosses between them and the parental types. Examples come from Manthey and Robbins (2016) showing hybridization likely due to recent anthropogenic disturbance. Older hybridization will lead to more complicated patterns of admixture (e.g., Manthey et al. 2012) as we will see elsewhere in the chapter. Warranting mention here is the “ghost of gene flow past.” An example comes from Australia’s Western Grasswren Amytornis textilis. Most of this species’ populations in the west of the Australian continent are extinct but known from museum specimens. Austin et al. (2013) detected a divergent outlier in the mtDNA diversity of the eastern Australian subspecies A. t. myall. This result would have remained enigmatic were it not for the availability of museum specimens of the extinct western populations, which shared this form of mtDNA. Coupled with subfossil evidence of the species having been more continuously distributed in the past and indeed ecological data, Austin et al. (2013) concluded that they had detected evidence of past gene flow prior to the extinction of the western populations.

10.3 Phylogeography, Sex Chromosomes, and Speciation

Phylogeographic studies of two closely related Australian ducks, the sexually monomorphic Grey Teal Anas gracilis and sexually dimorphic Chestnut Teal A. castanea, lead us neatly, if surprisingly, to a consideration of sex chromosomes in speciation. Initial phylogeographic and systematic studies of genetic diversity within and between these two species found no differences between them, whether studied with allozymes (Sraml et al. 1996), mtDNA (Joseph et al. 2009), or multilocus DNA (Dhami et al. 2013). Geographic structure in one species, A. castanea, was apparent between eastern and western Australia. Insights into the birds’ evolution and speciation were gained nonetheless. For example, the species had only recently diverged from their common ancestor, divergent selection best explaining their obvious differences in ecology and life history. Incomplete lineage sorting was the favored explanation of their shared diversity. But where in the genome were the genetic differences between the two species of teal?

An answer came when loci on the Z sex chromosome of the two teal were sequenced (in birds, males are ZZ and females ZW; Dhami et al. 2016; Fig. 10.2). Differences between the two species were immediately apparent: All Chestnut Teal Z chromosome sequences were more closely related to each other than any were to the Grey Teal Z chromosome sequences and vice versa. The Z chromosome has repeatedly emerged as the site of many and sometimes key important genetic differences especially those controlling reproductive isolation between closely related species. We briefly address why this is so while acknowledging that the Z chromosome will not necessarily reveal all in every case.

Fig. 10.2
figure 2

Networks of DNA sequence diversity from Dhami et al. (2016) showing contrasting patterns in Grey Teal Anas gracilis (each yellow circle is one individual) and Chestnut Teal A. castanea (blue circles). The networks are derived from DNA sequences obtained from (a) the mtDNA control region, (b) 17 autosomal loci, and (c) 7 Z loci. Note how only in the Z chromosome sequences do the two species “pull apart”

Novel mutations on sex chromosomes are directly exposed to selection in the sex having only one of each sex chromosome—XY males in mammals such as humans and ZW females in birds. Selection is therefore expected to act faster on sex chromosomes and to fix some types of beneficial mutations (Charlesworth et al. 1987). This is the basis of fast-X evolution or fast-Z evolution in birds (see Ellegren 2009). It helps explain Haldane’s rule (Haldane 1922; Carling and Brumfield 2008), which states that the heterogametic sex (females in birds) suffers adverse fitness consequences in hybridization. Population genetics theory also predicts more rapid accumulation of genetic divergence on the Z relative to autosomes because of its smaller population size promoting genetic drift.

Examples of Z-linked loci being important in tracking and driving speciation and evolution in birds include European nightingales Luscinia spp. (Storchová et al. 2009) and Ficedula flycatchers (Ellegren et al. 2012), North American Anas ducks (Lavretsky et al. 2015), Passerina buntings (Carling and Brumfield 2008, 2009), Certhia treecreepers (Manthey and Spellman 2014; Manthey et al. 2016), and in a comparative analysis of meliphagoid birds across hybrid zones in northeastern Australia (Peñalba et al. 2017) to name just a few. We should also consider physical rearrangements of entire blocks of chromosomes called chromosomal inversions. These involve an entire section of a chromosome being physically inverted. The genes in these inversions are generally inherited intact. This is because inversions hinder efficient recombination or crossing over between members of a pair of chromosomes during meiosis, which forms gametes. Inversions therefore lock up favorable combinations of genes. Chromosomal rearrangements have been more extensive on the Z chromosome relative to the rest of the genome (with notable exceptions such as one part of Chromosome 1A). This has been elegantly shown by Kawakami et al. (2014) in a comparison between two species with reasonable genomic data, the Australian Zebra Finch Taeniopygia guttata and the European-breeding Collared Flycatcher Ficedula albicollis (Fig. 10.3; see also van Doren et al. 2017).

Fig. 10.3
figure 3

From Kawakami et al. (2014) showing comparative circular visualization of the organization of blocks of loci on homologous chromosomes in the Collared Flycatcher Ficedula albicollis and Zebra Finch Taeniopygia guttata. Each circle represents a chromosome as numbered. Each circle’s left half represents the Collared Flycatcher and the right half represents the Zebra Finch. Very little structural rearrangement has happened on some chromosomes since these two species diverged from their common ancestor, e.g., chromosomes 4 and 15, whereas far greater rearrangements have occurred in one region of chromosome 1A and along the entire Z chromosome. Scale is indicated on the Zebra Finch side of plots, in megabases

Hooper and Price (2015, 2017) found that the number of inversion differences between closely related species is consistently predicted by whether their geographical ranges overlap and the extent of that overlap and illustrated this neatly in estrildid finches. Geographical range overlap in and of itself is not an absolute predictor of inversion differences, however. For example, there are no such differences between Collared and Pied Flycatchers Ficedula albicollis and F. hypoleuca, respectively (Backström et al. 2010a). Nonetheless, those two species have diverged at seven Z-linked loci significantly more than expected under neutrality; two of the detected candidate regions contain genes that are associated with plumage coloration in birds (Backström et al. 2010b). Indeed, the four black-and-white Ficedula flycatcher species (the other two being F. speculigera and F. semitorquata) show greater genetic divergence on the Z chromosome than on the autosomes. This has been attributed most simply to a fast-Z phenomenon (Sætre and Sæther 2010; Hogner et al. 2012).

Hooper and Price (2015, 2017) also reported variable rates of fixation of inversions across the autosomes, but inversions are more likely to be fixed on the Z chromosome than the average autosome. A role for gene flow in divergence is seen here. Gene flow on secondary contact between partially reproductively isolated forms may promote the spread of an inversion and its favorable combination of genes. Hooper and Price (2015) note that, because Z-linked genes diverge in function more rapidly than autosomal genes, so at any time before reproductive isolation is complete, an inversion on the Z chromosome should be more likely to capture two or more alleles locally adapted—either to that population’s habitat or genomic background—than an inversion on an autosome. They reason that inversions on the Z may be more strongly selected for, if gene flow is an important mechanism driving their selective advantage. Ornithologists should heed a most informative example of the role of hybridization spreading an inversion and promoting an adaptive radiation in Heliconius butterflies of South America (Dasmahapatra et al. 2012).

This brings us back to the two Australian teal and why the Z chromosome and its genes warrant attention in speciation genomics. At least for Ficedula flycatchers, the reasons have been neatly summarized by Sæther et al. (2007), Qvarnström and Bailey (2009), and Nater et al. (2015). The Z chromosome is a known location of species-specific male plumage traits and genes causing low hybrid fitness (indicated by an intronic marker in Passerina buntings; Carling and Brumfield 2008, 2009). Limited introgression of genes on it and limited recombination along its length keep blocks of genes together despite hybridization, suggesting that it is a hotspot for sexual traits in adaptive speciation where male signals and female preferences to those signals remain linked despite gene flow. These are points one can easily imagine applying in the case of the Australian teal where one member of the pair, A. castanea, is sexually dimorphic and the other, A. gracilis, is not.

10.4 Bird Species with No Known or Very Few Genetic Differences

Other pairs of closely related birds stand out either for still unknown location of genetic differences or for the small number of loci that appear to be involved (see also Nosil and Feder 2012; Faust-Stryjewski and Sorenson 2017 in Australo-Papuan Lonchura finches). Key examples follow. Hybridization between Blue-winged and Golden-winged Warblers Vermivora cyanoptera and V. chrysoptera, respectively, of North America, has been extensively studied with a view to clarifying their taxonomic status as species or subspecies. They seem to differ genetically at just six regions in their genome, each of which contains genes likely to be involved in feather patterning and pigmentation (Toews et al. 2016b). Hybridization appears to be a long-term feature of their evolutionary history, as also reported in an endangered subspecies of the Australian Yellow-tufted Honeyeater Lichenostomus melanops cassidix and the more common parapatric subspecies L. melanops gippslandicus (Pavlova et al. 2014). In another thoroughly studied case of hybridizing European birds, the Carrion and Hooded Crows Corvus corone and C. cornix, respectively, genomic differences appear to be concentrated in one relatively small region of one chromosome. This is likely a chromosomal inversion and involves genes controlling plumage differences (Poelstra et al. 2014), the details of which in terms of gene expression and biochemical pathways have been elucidated (Poelstra et al. 2015). Finally, the genomes of three Holarctic redpoll finches Acanthis spp. appear largely undifferentiated (Mason and Taylor 2015), and speciation in isolation followed by secondary contact can confidently be eliminated. Differences between the apparent species may be due to differences in gene expression related to the phenotypic differences in what is still one variable lineage.

10.5 Hybrid Zones: A Closer Look

10.5.1 Suture Zones and Multiple Hybrid Zones

Hybrid zones are regions where two well-differentiated taxa interbreed or hybridize (Barton and Hewitt 1981). Having had a similarly long history of research in ornithology (Ford 1974), they have been argued to indicate, at a minimum, that the speciation process is incomplete or may never be complete (Coyne and Orr 2004). Increasingly, porous, “leaky” genetic boundaries between clearly well-differentiated taxa, which nonetheless remain well-differentiated despite such porosity, appear to be the norm (Chan and Levin 2005; Bay and Ruegg 2017; Chattopadhyay et al. 2017; Griffith and Hooper 2017; McLean et al. 2017a, b; Slender et al. 2017). Worth stressing here is that hybrid zones are not black holes from which genes never emerge; they are evolutionary conduits through which adaptive and neutral markers can differentially move (Brumfield et al. 2001; Gompert et al. 2017).

A special opportunity to study speciation is provided by suture zones (Remington 1968, but see Swenson and Howard 2004), regions where species pairs variously either have hybrid zones, or come into close contact, or replace each other on either side of the zone while not occurring within it. Typically, there will be a range of taxonomic divergence associated with these zones from undifferentiated populations on either side through to clearly differentiated allopatric species. These zones are ideal natural experiments in which to apply genomic tools in the study of how divergence and reproductive isolation evolve. Essentially a maturing of comparative rather than single species phylogeography (Bermingham and Moritz 1998), genomic study of these zones informs how gene flow affects divergence of populations that have evolved under shared environmental history (Winger 2017; Winger and Bates 2015; Peñalba et al. 2017; Fuchs and Bowie 2015). Although much has been learned from studies of birds in several such zones using earlier mtDNA and multilocus approaches (e.g., Carpentarian, Nullarbor and Eyrean Barrier regions in Australia – Balakrishnan et al. 2010; Dolman and Joseph 2012, 2016), I acknowledge the surge of this kind of genomic work in birds and in many nonavian groups and regions (e.g., Edwards et al. 2016; Gompert et al. 2017). They show how modern genomic tools are bringing entirely new levels of resolution to our understanding of the details of speciation in allopatry, parapatry, and sympatry. By studying different parts of the genome (autosomes vs. sex chromosomes vs. mtDNA), they also build on some seminal earlier work showing differential patterns of introgression among loci in North American Passerina buntings (Carling et al. 2010) and identifying genomic regions contributing to speciation or adaptive introgression (Borge et al. 2005; Gompert et al. 2017).

At the time of writing, foundational results from two such studies are emerging. One is from the Marañon Valley in Peru (Winger 2017; Winger and Bates 2015) and the other from tropical northeastern Australia where an especially complex set of three zones occurs (Peñalba et al. 2017; Ford 1986, 1987). I briefly review them and then examine the finer detail emerging from single species studies.

Both studies found that stronger plumage divergence (measured spectrophotometrically in Peru or through a proxy of taxonomic rank from Schodde and Mason’s 1999 review in Australia) is associated with deeper genetic divergence. In the Peruvian case, this was further argued to be likely due to geographic isolation for up to two million years (Winger and Bates 2015), consistent with conventional allopatric speciation theory. Z chromosome loci were most divergent in the Australian case at least. This further supports the role of sex chromosomes in the study of speciation and the utility of using different marker sources for reconstructing evolutionary history. Conversely, lineages lacking plumage divergence across the same geographic barriers have been more recently isolated or exhibit a signature of genetic introgression after formerly isolated populations came into contact. This indicates that gene flow will impede divergence in phenotypic traits important to speciation and that evolutionary outcomes of cycles of isolation and divergence are indeed sensitive to how long gene flow is stopped. Morphometric trait evolution showed greater idiosyncrasy in both studies and so was a poor predictor of genetic divergence.

These comparative studies within regions both indicate which phenotypic traits can be predicted to diverge. In the Australian study, an additional variable of habitat (rainforest vs. mangrove vs. woodland) was included. After taxonomic rank, habitat was the next strongest predictor of genetic divergence (Peñalba et al. 2017). Together these studies undoubtedly affirm some long held if informally stated convictions among ornithologists that plumage divergence predicts genetic divergence. The full analyses of the data sets reveal that this is far from absolute, both studies finding several major divergences between essentially undifferentiated populations regarded as consubspecific. Peñalba et al.’s (2017) case of the Dusky Myzomela Myzomela obscura especially warrants further study.

10.5.2 Detail Emerging from Single Species and Hybrid Zones: Three Case Studies

Work on birds has been instrumental in showing how genomic tools can vastly refine our knowledge of how hybridization, hybrid zones, and phylogeography inform our understanding of speciation. The hybrid zone between two Central American manakins, White-collared Manacus candei and Golden-collared Manakin M. vitellinus, has long been a landmark example. Earlier work established discordance among geographical clines in molecular markers and plumage traits. Differential introgression of the M. vitellinus yellow collar across the hybrid zone was driven by sexual selection (Parsons et al. 1993; Brumfield et al. 2001; McDonald et al. 2001). Parchman et al. (2013) took the study further. They tested alternative models concerning whether adaptively important genes involved in reproductive isolation cluster in discrete “islands” in the genome or are scattered through the genome (see Cruickshank and Hahn 2014). Their data comprised 59,100 SNPs, single sites in the genome where there is variation in the DNA sequence between two populations under study. These sites were then mapped to a draft of the Manacus genome to determine in which genes they fell. Finding that genetic regions involved in adaptive divergence and reproductive isolation are scattered throughout the genome, they also concluded that many relevant loci had signatures of strong genetic differentiation and introgression. This is consistent with the hypothesis that loci involved in isolation are often characterized by a history of divergent selection and have not just drifted to differentiation.

A further case is especially valuable, because it integrates so much biological and genetic data to link a species’ deeper history to biological drivers of more recent patterns in gene flow. It epitomizes how phylogeography, hybridization, speciation, and ongoing divergence are intertwined as claimed at this chapter’s outset. The Red-backed Fairywren Malurus melanocephalus occurs in subtropical and tropical Australia. Two subspecies are diagnosed essentially by whether their dorsal color is red or orange (Schodde and Mason 1999; Fig. 10.4). The red-backed subspecies M. m. cruentatus occurs across Australia’s tropical north, the orange-backed M. m. melanocephalus being in more southeastern parts of the range south of Cape York Peninsula (Fig. 10.4). Morphometric traits have likely been subject to ecological selection and simply track local environmental variation regardless of subspecies identity; ecological selection appears to have also influenced the evolution of tail length as an intrasexual signal used primarily among competing males (Baldassarre et al. 2013). The species’ phylogeography was first assayed from 29 anonymous nuclear loci, six introns, and one mtDNA locus, amounting to over 15,000 base pairs per individual. The major phylogeographic disjunction in the species is within the red-backed subspecies’ range west of Cape York Peninsula at the Carpentarian Barrier (Lee and Edwards 2008; CB in Fig. 10.4), and this was confirmed by later work with 2702 SNPs (Baldassarre et al. 2013, 2014; Fig. 10.5). The same later study showed that alleles for red plumage color have introgressed east across the hybrid zone following secondary contact and into the genomic background of the orange subspecies. That is, the plumage cline is displaced significantly east of the major phylogeographic break and the vast majority of the individual SNP clines (Fig. 10.5). Previous experimental work demonstrated an extra-pair mating advantage for red males (Baldassarre and Webster 2013). So it appears that, as in the manakins, sexual selection is driving asymmetrical introgression of red plumage alleles across the hybrid zone. This integration of phylogeography and superb field biology shows how sexual selection can complicate definition of taxonomic boundaries and promote gene flow, particularly at an intermediate stage of divergence.

Fig. 10.4
figure 4

Map modified from Baldassarre et al. (2014), showing the currently recognized distribution of the red-backed M. m. cruentatus subspecies in the west and the orange-backed M. m. melanocephalus subspecies in the east. The area of overlap in the northeast represents the hypothesized region of plumage overlap (Schodde and Mason 1999). The solid vertical line represents the Carpentarian Barrier (CB), and the dashed line represents the Burdekin Gap (BG). Numbers refer to sampling locations

Fig. 10.5
figure 5

Example from Baldassarre et al. (2014) of population genomic data increasingly being seen in studies of genomic clines across hybrid zones. Numbers along the top refer to sampling locations from west to east across northern Australia as shown in Fig. 10.4. The solid black and dashed black lines are summary geographic clines for all 2702 loci or from all 102 diagnostic loci, respectively. Clines for each of the 102 diagnostic SNP loci are shown in gray and that for plumage hue is shown in red. Higher values for hue correspond to redder dorsal plumage color in more western populations. All SNP loci have allele frequency greater than 80% on the western end of the cline and less than 20% on the eastern end, even those with very wide-fitted clines that appear to deviate from this criterion

The third hybrid zone to discuss here will lead us to an entirely new approach to speciation. It concerns three Australian rosella parrots, the Northern, Pale-headed and Eastern Rosellas Platycercus venustus, P. adscitus, and P. eximius, respectively. A hybrid zone around the middle latitudes of Australia’s east coast had long been recognized between P. adscitus and P. eximius. Few data and specimens were available concerning it, however, the original basis having been field observations (review in Schodde 1997). Aside from such scant information forming a basis for considering these two taxa conspecific, it was a surprise and contrary to earlier mtDNA work (Ovenden et al. 1987) when multilocus data showed P. adscitus and P. venustus to be sister species east and west, respectively, of the Carpentarian Barrier (Fig. 10.4) and that P. eximius was in turn their sister (Shipham et al. 2015). The latter study had a key twist, however. Phylogenetic analysis using mtDNA only from the Australian mainland populations of P. eximius leads to a finding of P. eximius and P. adscitus being sisters. Alternatively, when mtDNA from the geographically isolated Tasmanian populations of P. eximius is sampled, a result emerges consistent with multilocus data. Why did mtDNA within one species yield such different results?

Shipham et al. (2017) then used genomic methods to test alternative hypotheses to explain this observation. They favor the view that hybridization between P. eximius and P. adscitus has occurred and that mtDNA of P. adscitus has so completely introgressed southward into mainland southeast Australian P. eximius that it has completely “captured” the latter. The Tasmanian populations of P. eximius isolated from the mainland by Bass Strait retain that species’ original mtDNA. Kearns et al. (2014) found a similar case of mtDNA capture likely still progressing in Australian butcherbirds, and Dong et al. (2014) document a similar case in Taiwanese scimitar babblers Pomatorhinus spp. The problem to address next is this: Given that there are so many cases of hybridization leading to localized gene flow within and across hybrid zones, why is it that in some cases there is such an extreme outcome of the mtDNA of one form completely capturing that of another?

10.6 Mitonuclear Incompatibility, Hybridization, and Speciation

Mitochondria, so often called powerhouses of cells, are vital to an organism’s survival. Sloan et al. (2017) note that deleterious mutations in mtDNA adversely affect energy production and that evolution has devised two solutions to this. One amounts to the mitochondrial genome of one species or population “rescuing” that of another by completely replacing it (e.g., Drovetski et al. 2015), through hybridization, with more selectively fitter, foreign mtDNA. This is adaptive introgression (Borge et al. 2005; Bonnet et al. 2017) and recalls the idea in my Introduction that hybridization involves an invasion of one genome by another. It likely explains the rosella and butcherbird data described above, but experimental work is of course required to test that.

Evolution’s other solution to deleterious mtDNA mutations lies in the nuclear genome. Production of energy by a smoothly functioning mitochondrion requires proteins encoded by the nuclear as well as the mitochondrial genomes. Some 1500 proteins with mitochondrial function are encoded by nuclear genes and 180 of these nuclear-encoded proteins tightly cofunction with proteins encoded by mtDNA genes (Hill 2017; Sunnucks et al. 2017). Mutations in the nuclear genome can therefore lead to potentially fitter combinations of mitochondrial and nuclear genes. Different coadapted mitonuclear genotypes can evolve and lead to incompatibilities within a species; these can then evolve as isolating mechanisms (Hill 2017).

The second of these evolutionary solutions warrants more discussion. It has been developed into a mitonuclear species concept in ornithological literature by Hill (2017 and references therein). The process that generates energy, oxidative phosphorylation, cannot function, and an organism cannot survive, unless the cofunctioning mitochondrial and nuclear gene products needed for oxidative phosphorylation are in step. Tight cofunctioning of nuclear and mitochondrially encoded genes underpins the mitonuclear species concept. As mtDNA evolves faster than the nuclear genome, there must be perpetual coevolution of the two genomes. One can think of species having uniquely coadapted mitonuclear genotypes. Once populations diverge for this genotype, there will be reduced fitness of offspring due to mitonuclear incompatibilities. That mitochondrial genotype, Hill (2017) argues, becomes the best current method for diagnosing species. Does this mean that Eastern and Pale-headed Rosellas are the same species? Hill’s (2017) concept may be taken to argue “yes,” but recall that Northern and Pale-headed Rosellas are closest relatives. One would have to treat all three as conspecific. One could, but is it a useful taxonomy? This shows that different ways of diagnosing species usually always have some merit but can be in conflict with each other.

Whatever the mitonuclear concept’s strengths and weaknesses, and Hill’s (2017) discussion is well balanced with examples challenging the concept, it is a refreshingly compelling one. It prompts us to think differently about existing datasets. For example, the Eastern Yellow Robin Eopsaltria australis of Australia exhibits a geographically structured intraspecific mtDNA divergence of 6%, a magnitude usually seen between genera of birds, not within a species (see Dai et al. 2017 for another example in China). This mtDNA divergence is geographically organized east–west across the bird’s range (Pavlova et al. 2013). Nuclear DNA markers are organized north–south, perpendicular to the mtDNA structure. Conventional explanations of genetic drift and neutral evolution have been eliminated in explaining this pattern (Pavlova et al. 2013). A case has been built arguing that natural selection at the level of proteins encoded by the mtDNA and driven by adaptation to different climates has determined the mtDNA structure (Morales et al. 2015, 2017a). Further, adaptive mitochondrial introgression and selection against incompatible mitonuclear combinations are likely involved (Morales et al. 2017b). Finally, genomic scans (Morales et al. 2017c) show that genetic differentiation between the two adjacent but climatically divergent mtDNA lineages in this species is mostly limited, not only to the mitochondrial genome but also to a large part of chromosome 1A, very possibly in an inversion. This part of chromosome 1A contains tightly linked genes having mitochondrial functions. Notably, a second region of divergence is on the Z chromosome, suggesting that nuclear gene flow occurs primarily via male hybrids, in accordance with Haldane’s rule. These findings suggest not so much that there are two species within the Eastern Yellow Robin but that mitonuclear coevolution has been critical in climatic adaptation during population divergence within the species.

Before leaving mitonuclear incompatibility, I revisit the Z chromosome’s role in speciation. The mitonuclear concept predicts that at least some nuclear-encoded genes for proteins of mitochondrial function should be located on the Z chromosome. This is because in female birds the paternal Z chromosome must cofunction with the maternal mtDNA. The prediction is supported in the Eastern Yellow Robin and in other species such as Gouldian Finches Erythrura gouldiae (Pryke and Griffith 2009).

10.7 Ring Species as a Special Case of Divergence with Gene Flow: Are There Any Surviving Examples?

A theme of current speciation research emphasized in this chapter is that populations and species can diverge despite the reticulation caused by ongoing gene flow. A specific example is Mayr’s (1942) ring species concept (Fig. 10.6). This posits that an ancestral population spreads in a geographical ring away from its origin. Eventually, populations at the beginning and end of the ring are in contact and behave as separate species reproductively isolated from each other in sympatry. The hypothesis makes very specific predictions about patterns of genetic diversity that should be recoverable. In ornithology at least, these predictions have not survived such testing with genetic data or even with retrodictive modeling of paleodistributions. Consequently, more conventional alternative explanations of speciation in allopatry followed by range expansion leading to secondary contact cannot be rejected. They explain the data as well if not better than the ring species hypothesis. Examples are the Herring Gull Larus argentatus complex of Northern Hemisphere circumpolar regions (Liebers et al. 2004), the Crimson Rosella Platycercus elegans complex of southeastern Australia (Joseph et al. 2008; Fig. 10.6), and the Greenish Warbler Phylloscopus trochiloides complex encircling Tibet (Alcaide et al. 2014; Peterson and Anamza 2017). The point here is not that the ring species hypothesis should be abandoned but that it continues to engender stimulating evolutionary insights into the details of speciation in each case (e.g., Mihailova et al. 2014; Irwin et al. 2016).

Fig. 10.6
figure 6

Example of a putative ring species, not supported as such by genetic analysis from Joseph et al. (2008): (a) Distribution and plumage phenotype variation in parrots of the Crimson Rosella complex in eastern Australia (from Forshaw and Cooper (2002) in this and subsequent figures). Note the narrow zone of unsuitable habitat currently separating Adelaide and Yellow. (b, c) Sampling scheme of vouchered museum specimens for mtDNA analyses of the full mainland distribution of the complex

10.8 Hybrid Species

Can hybridization lead to the evolution of new species in birds? A now well-studied case is that of the Italian Sparrow Passer italiae. Long hypothesized to be of hybrid origin because of male plumage traits intermediate between House Passer domesticus and Spanish Sparrows P. hispaniolensis, its hybrid origin is confirmed by extensive nuclear and mitochondrial genetic work as well as ecological and behavioral studies (Elgvin et al. 2011; Hermansen et al. 2011; Trier et al. 2014; Sætre et al. 2017). Z-linked and mitonuclear reproductive barriers limit gene flow and maintain the integrity of the three populations (Trier et al. 2014). Hill (2017) notes it as a case in which a novel but adaptive combination of genes for mitochondrial and nuclear-encoded proteins arose through hybridization. Postzygotic selection against mitonuclear incompatibilities now limits gene flow between the three populations (Trier et al. 2014). Italian Sparrows have come into secondary contact with Spanish Sparrows from which they have diverged significantly across 81 protein-coding genes (Sætre et al. 2017). Six of these genes showing the greatest divergence are associated with learning and neural development in other bird species suggesting a role for behavioral isolating mechanisms. Prezygotic assortative mating may also contribute to the maintenance of coadapted mitochondrial and nuclear genes (Hermansen et al. 2011).

House and Spanish Sparrows have also hybridized across the Mediterranean in North Africa but with outcomes differing in terms of which mtDNA alleles are most frequent in hybrids (Belkacem et al. 2017). Clearly, much remains to be learned (see also Brelsford et al. 2011 for the case of Audubon’s Warbler Setophaga auduboni). Importantly, other cases where two species interact differently at different zones of contact offer rich potential for study. Prime examples have been studied in European populations of Barn Swallows Hirundo rustica (Scordato et al. 2017) and North American Mountain Chickadees Poecile gambeli (Manthey et al. 2012). Much remains to be learned in Australia in the Crimson Rosella group (Joseph et al. 2008) and Brown and Inland Thornbills Acanthiza pusilla and A. apicalis (Black et al. 2015). A special case warranting an entire chapter is the role of hybridization in driving the history of one of the most famous examples of adaptive evolution, Darwin’s finches (see Grant and Grant 2016 and references therein and for alternative views McKay and Zink 2014).

10.8.1 Hybrid Zones Sometimes Move

Although on evolutionarily short time scales of centuries and decades, data are now available to show that some hybrid zones have moved and some have not (Morales-Rozo et al. 2017). Insights gained from this phenomenon vary. Long-term study of North American birds provides the richest examples. In Sphyrapicus sapsuckers, Billerman et al. (2016) used climatic models to accurately recover the hybrid zone’s shift, which was itself documented with museum specimens collected over the last 100 years. They predicted future trends in the ranges of the respective parental taxa and that the climatic niche of the hybrids will disappear under climate change. Similarly in Passerina buntings, Carling and Zuckerberg (2011) suggest that continued spread of the hybrid zone threatens one of the parental species. In Poecile chickadees, Taylor et al. (2014) found a set of loci consistently linked to genomic regions likely under selection and linked to reproductive isolation. These loci showed patterns of elevated divergence and reduced introgression regardless of when samples were collected and despite multiple generations of admixture between the parental species. In contrast, Mettler and Spellman (2009) argued that a Pheucticus grosbeak hybrid zone had not moved in 40 years and constituted a “tension zone” maintained by a balance between dispersal into the hybrid zone and selection against hybrids. In Australia, it is unclear whether a well-studied hybrid zone between forms of the Australian Magpie Gymnorhina tibicen maintained by selection (Hughes et al. 2001) is stationary or moving (Burton and Martin 1976).

10.9 A View to the Future

This chapter has skimmed the surface of the vast body of published work accruing almost daily on the genetic and genomic details of phylogeography, hybridization, and speciation of the world’s birds. Genomic data are bringing much to our understanding of evolutionary history within and among species. As with any mature field, our understanding has reached a point where we appreciate well the theoretical problems still to be addressed and how data now obtainable from genomic methods may help. Disentangling effects of selection and drift in different parts of the genome as populations diverge is one such challenge (Southcott and Kronforst 2017). In contrast, our understanding of the genetics of adaptation at the molecular level, though developing, is still a frontier. Much has been learned from certain species such as climate-driven selection in Australia’s Eastern Yellow Robin (discussed above) and about the role individual genes play [MC1R in plumage coloration (San-Jose et al. 2017; Faust-Stryjewski and Sorenson 2017); ALX1 and HMGA2 in bill morphology (Lamichhaney et al. 2015, 2016); DRD3 in evolution of migration (Chap. 7; Delmore et al. 2015)]. Floodgates are likely to open in the coming years, however, as more and more genomes are sequenced. At present, regions of the genome showing interest for one aspect or another of a bird’s evolution can often be mapped to a reference genome in one of a few phylogenetically distant species for which reasonable genomic assemblies are available (e.g., the non-passerine chicken or the two oscine passerines Collared Flycatcher and Zebra Finch). This is like trying to determine one’s location 3 days into a drive across a continent using a map showing approximate location of one road and no provincial boundaries or towns other than the journey’s start and end points. As more annotated genomes (i.e., with genes mapped and named) come online, more and more genomic reference maps will be available across the avian tree of life. The mapping analogy would be that one knows one’s whereabouts, because all roads, towns, and geographical features are shown accurately to scale. Then we will better understand either the function of genes or their proximity to genes of known function in a given study species (consider the clines in Fig. 10.5 for the Red-backed Fairywren) and so develop an understanding of how speciation occurs by truly integrating history and selection. The importance of this is neatly illustrated in the comparison of Ruegg et al. (2014) and Delmore et al. (2015) who both tried to elucidate the genetic basis to the evolution of migration in Swainson’s Thrush Catharus ustulatus, the former mapping data to the Zebra Finch genome and the latter mapping to a draft assembly for Swainson’s Thrush itself. This explains their different conclusions about the genes involved in the evolution of migration and how they are organized across the genome.

Byers et al. (2017) have neatly summarized these challenges. Figure 10.7, here reproduced from their paper, says it all! At the time of writing, the floodgates of genomic resources with which to study birds are about to open. Research into the evolution of birds through understanding the interplay among phylogeography, hybridization, and speciation has a bright future.

Fig. 10.7
figure 7

Summary figure from Byers et al. (2017) arguing the need for integrating genomics and biology in speciation research. Current research foci and challenges in studying adaptation and speciation using molecular (magenta arrows) and genomic approaches (green arrows). Light red boxes highlight the challenges in these research areas, as follows: Challenge 1, selection on traits might be dynamic and dependent on time; Challenge 2, the fitness effects of individual genes might be dependent on genomic background, such as epistatic interactions among genes; Challenge 3, different genes associated with traits that affect fitness might have different fitness effects, and these effects might additionally vary with environmental conditions; Challenge 4, candidate loci identified using adaptation/speciation genomics might be spurious, with significant outliers resulting from demographic changes, false positives, or selective sweeps, rather than selection on the locus itself; and Challenge 5, even in model organisms (e.g., thale cress Arabidopsis thaliana, common fruit fly Drosophila melanogaster, house mouse Mus musculus), the function of a large proportion of genes remains unknown. Two remaining challenges of an integrative approach—Challenge 6, synthesizing results from multiple systems into a conceptual whole, and Challenge 7, combining evidence from molecular and genomic approaches—must also be addressed. The bottom circle depicts the integration of molecular and genomic approaches to studying adaptation and speciation and suggests some future integrative approaches (blue dashed arrows). See Byers et al. (2017) for further discussion