Phylogenomics of a rapid radiation: is chromosomal evolution linked to increased diversification in north american spiny lizards (Genus Sceloporus)?

Leaché, Adam D.; Banbury, Barbara L.; Linkem, Charles W.; de Oca, Adrián Nieto-Montes

doi:10.1186/s12862-016-0628-x

Phylogenomics of a rapid radiation: is chromosomal evolution linked to increased diversification in north american spiny lizards (Genus Sceloporus)?

Research Article
Open access
Published: 22 March 2016

Volume 16, article number 63, (2016)
Cite this article

Download PDF

You have full access to this open access article

BMC Evolutionary Biology Aims and scope Submit manuscript

Phylogenomics of a rapid radiation: is chromosomal evolution linked to increased diversification in north american spiny lizards (Genus Sceloporus)?

Download PDF

Adam D. Leaché ORCID: orcid.org/0000-0001-8929-6300^1,2,
Barbara L. Banbury^1,3,
Charles W. Linkem¹ &
…
Adrián Nieto-Montes de Oca⁴

5584 Accesses
3 Altmetric
Explore all metrics

Abstract

Background

Resolving the short phylogenetic branches that result from rapid evolutionary diversification often requires large numbers of loci. We collected targeted sequence capture data from 585 nuclear loci (541 ultraconserved elements and 44 protein-coding genes) to estimate the phylogenetic relationships among iguanian lizards in the North American genus Sceloporus. We tested for diversification rate shifts to determine if rapid radiation in the genus is correlated with chromosomal evolution.

Results

The phylogenomic trees that we obtained for Sceloporus using concatenation and coalescent-based species tree inference provide strong support for the monophyly and interrelationships among nearly all major groups. The diversification analysis supported one rate shift on the Sceloporus phylogeny approximately 20–25 million years ago that is associated with the doubling of the speciation rate from 0.06 species/million years (Ma) to 0.15 species/Ma. The posterior probability for this rate shift occurring on the branch leading to the Sceloporus species groups exhibiting increased chromosomal diversity is high (posterior probability = 0.997).

Conclusions

Despite high levels of gene tree discordance, we were able to estimate a phylogenomic tree for Sceloporus that solves some of the taxonomic problems caused by previous analyses of fewer loci. The taxonomic changes that we propose using this new phylogenomic tree help clarify the number and composition of the major species groups in the genus. Our study provides new evidence for a putative link between chromosomal evolution and the rapid divergence and radiation of Sceloporus across North America.

Extensive gene rearrangements in the mitogenomes of congeneric annelid species and insights on the evolutionary history of the genus Ophryotrocha

Article Open access 23 November 2020

Origins and biogeography of the Anolis crassulus subgroup (Squamata: Dactyloidae) in the highlands of Nuclear Central America

Article Open access 21 December 2017

Extensive Interspecific Gene Flow Shaped Complex Evolutionary History and Underestimated Species Diversity in Rapidly Radiated Dolphins

Article Open access 17 November 2021

Background

Rapid radiations represent some of the most intriguing and well-studied biological systems. They also present some of the most difficult phylogenetic problems. The short time intervals separating the speciation events that occur during a rapid radiation leave few opportunities for molecular evolutionary changes to become established in the genome. This lack of phylogenetic information typically leads to large-scale gene tree discordance and a lack of resolution for the phylogenetic relationships [1]. Species involved in rapid radiations are typically partitioned into major clades with clear support from multiple sources of data, yet the interrelationships among the major clades are often ambiguous. This basic conundrum repeats itself across the Tree of Life (e.g., the root of life [2, 3], major bird orders [4, 5], Mammals [6, 7], and Neobatrachian frogs [8]). Attempting to resolve rapid radiations using a combination of large numbers of loci together with coalescent-based species tree inference methods [9–14] represents an important new direction in systematic biology this is expected to help resolve difficult phylogenetic problems.

There are at least three fundamental challenges confronting the resolution of rapid radiations using molecular genetic data: 1) quick bursts of speciation limit the opportunities for character changes to accumulate across the genome [1], 2) long-branch attraction artifacts during phylogeny estimation [15], and 3) incomplete lineage sorting [16]. Increasing the number of loci used to estimate the phylogeny can sometimes help alleviate the first problem [17–19]. However, depending on the method and the model, increasing the amount of data can be positively misleading when faced with long branch attraction and/or incomplete lineage sorting [15, 20, 21]. Overcoming these collective challenges, which are not mutually exclusive and are difficult to distinguish, requires the acquisition of large datasets composed of many independent loci together with the implementation of coalescent models of phylogenetic inference; however, analyzing large datasets is computationally demanding, and this problem is amplified when utilizing complex coalescent-based models. Our ability to generate sequence data is quickly outpacing our capacity to analyze genetic data under complex models such as the multispecies coalescent [22]. Coalescent methods that utilize gene trees instead of sequence data can dramatically decrease computation times [23], but this comes at the cost of information loss as uncertainty in the sequence data is not taken into account.

The phrynosomatid lizard genus Sceloporus is a diverse clade containing 90 + species with a broad distribution across North America [24]. Developing a robust phylogenetic framework for comparative studies of Sceloporus has been of interest for decades (reviewed by [24–30]). Previous phylogenetic studies of Sceloporus based on a few nuclear genes suggest that the group has experienced a period of rapid evolutionary diversification [27]. These successive and rapid speciation events have resulted in bursts of speciation that have impeded the inference of a fully-resolved and strongly supported phylogeny [25, 28, 29]. Differentiation in the fundamental number of chromosomes among species and species groups is hypothesized to be a primary factor responsible for driving the rapid radiation of Sceloporus [27, 31]. The genus is comprised of 19 species groups containing anywhere from one species (two of the species groups are monotypic) to 15 species (Table 1). Most of the polytypic species groups have been the focus of detailed phylogeographic and phylogenetic study, including the formosus group [32], grammicus group [33], torquatus and poinsettii groups [34, 35], magister group [36], scalaris group [37], spinosus group [38], undulatus group [39, 40], and the variabilis group [41]. These systematic studies have advanced our knowledge of the interrelationships within many species groups; however, resolving the phylogenetic relationships among the species groups has remained difficult [28, 29].

Table 1 Specimens included in the study

Full size table

In order to try to resolve the Sceloporus phylogeny and understand the relationship between chromosome evolution and diversification we sought near complete taxon sampling and a broad sampling of loci from throughout the genome. We estimated a phylogenomic tree for Sceloporus using targeted sequence capture data that includes a combination of ultraconserved elements [42] and protein-coding genes used in previous studies of squamate phylogeny [43]. These new data are analyzed using concatenation and coalescent-based species tree inference methods. We conduct a diversification analysis to estimate the number of rate shifts and their locations on the phylogeny. These patterns of diversification are then discussed in relation to chromosomal diversity. The results suggest that differentiation in the fundamental number of chromosomes among species groups may be linked to Sceloporus diversification.

Results

Targeted sequence capture data

We obtained targeted sequence capture (TSC) data from 44 Squamate Tree of Life (ToL) loci and 541 ultraconserved elements (UCE’s; Table 2). Summaries of the sequence capture loci were generated using scripts available from https://github.com/dportik/Alignment_Assessment [44]. and frequency distributions summarizing the properties of the phylogenomic data on a per locus basis are shown in Fig. 1. Although we included 131 samples in our analysis (129 phrynosomatids and two outgroup species), the final sequence alignments for the Squamate ToL loci contained 118 individuals on average (46 min. – 129 max.), and the UCE alignments contained 121 individuals on average (15 min. – 131 max.). Some of the phylogenomic data were taken from previous studies, including 11 samples from a study of phrynosomatid lizards [13] and 17 samples from a study of the genus Phrynosoma [45]. Sequence capture inefficiency during the probe hybridization step and low sequencing effort are two likely reasons for the lack of data for some individuals across loci. A summary of the variation in the TSC data is provided in Table 2. On average, the Squamate ToL loci are longer compared to the UCEs (538 base pairs [bp] vs. 482 bp, respectively), contain more variation (31 % vs. 19 %), and contain more parsimony informative characters (104 vs. 47).

Table 2 Summary of the variation in the targeted sequence capture data

Full size table

Phylogenetic analysis

The phylogenetic trees that we estimated for Sceloporus using the 585 loci using concatenation (RAxML; [46]) and a coalescent-based species tree approach (SVDquartets; [47]) are shown in Fig. 2. The phylogenetic relationships inferred at the base of Sceloporus differ between the two approaches. Using concatenation, a clade containing the angustus and siniferus species groups is sister to the remaining members of Sceloporus, whereas in the coalescent tree the variabilis group is sister to the rest of the genus. This discrepancy has weak support in the concatenation and coalescent trees (68 and 26 % bootstraps, respectively). The phylogenetic relationships for the remaining species groups are consistent starting at the point in the phylogeny where S. merriami diverges. The major relationships include a clade containing the pyrocephalus, gadoviae, and jalapae groups, a clade containing the graciosus and magister groups, a 22-chromosome clade containing the undulatus, formosus, and spinosus groups (sister to the scalaris group), and a 32-chromosome clade containing the megalepidurus, torquatus, and poinsettii groups (sister to the grammicus group and the clarkii group). The support for these clades varies between the concatenation tree (these relationships all have high support) and the coalescent tree (only the 22 and 32 chromosome clades have significant support). One notable difference is that the concatenation tree fails to support the monophyly of the spinosus group, whereas the coalescent tree provides weak support (62 % bootstrap) for this group.

Our time-calibrated phylogeny estimated using the Squamate ToL loci in BEAST [48] (Fig. 3) indicates that the crown age for the family Phrynosomatidae is approximately 54 million years (mean = 54.12, highest posterior density [HPD] = 46.13–61.65 Ma). The age estimate for the genus Sceloporus is 37 million years (mean = 37.02, HPD = 30.71–43.71). Both estimates are consistent with previous estimates [30], but this might not be unexpected given that we used a similar prior. In addition, it is likely that the use of a concatenated data matrix in BEAST is causing divergence time overestimation, and that a species tree approach would provide more accurate estimates. The topology of the BEAST tree is largely similar to the concatenation and coalescent trees shown in Fig. 2, but there are several key differences. First, the BEAST tree places the scalaris group sister to the magister and graciosus groups instead of sister to the 22 chromosome clade. Second, the spinosus group is paraphyletic and S. edwardtaylori is and placed at the base of the 22-chromosome clade. Third, the grammicus group is paraphyletic as a result of moving S. asper to the base of a group containing the 32-chromosome clade and the grammicus group. These differences in topology are likely the result of excluding the ultraconserved elements from the phylogenetic analysis instead of modeling differences between the phylogenetic methods.

Gene tree congruence and rapid radiation

Rapid radiations are expected to produce increased gene tree discordance. We investigated congruence between the 585 gene trees (estimated using RAxML) and the estimated species tree by quantifying the number of gene trees that supported the major relationships obtained in the species tree analysis. This approach for measuring congruence does not distinguish between gene tree discordance resulting from a lack of genetic variability versus incomplete lineage sorting. Three Sceloporus species groups have gene tree congruence that exceeds 50 % (i.e., at least 50 % of the 585 gene trees support their monophyly): the angustus, siniferus, and graciosus groups (Fig. 4). The remaining species groups have higher levels of gene tree discordance, and some are supported by <10 % of the loci, including the undulatus group (40 loci), poinsettii group (36 loci), torquatus group (30 loci), grammicus group (24 loci), and spinosus group (9 loci). The 22-chromosome and 32-chromosome clades are supported by 28 and 18 loci, respectively.

There is a strong correlation between the amount of gene tree congruence for a taxon bipartition (e.g., a species group) and the branch length for a taxon bipartition (Fig. 4). We explored this relationship using the branch duration estimates (measured in millions of years) obtained from the BEAST analysis (Fig. 3). As expected, the branches with the shortest time intervals had low gene tree congruence, and branches with longer time intervals had high gene tree congruence.

Diversification analysis

Diversification analyses conducted using BAMM [49] recovered an average speciation rate (λ) of 0.09 species/Ma across the phrynosomatid tree. The analysis also found a positive extinction rate (μ) of 0.02 species/Ma that has been relatively consistent throughout the history of phrynosomatids (Fig. 5). We found strong evidence for heterogeneous diversification dynamics with a single acceleration in speciation rate at 20–25 million years ago (Fig. 5). The posterior probability for this rate shift occurring on the branch leading to Sceloporus species groups exhibiting increased diversity in the fundamental chromosome number is 0.997 (Fig. 3). The following species groups are included in this rapid radiation: graciosus and magister groups, a 22-chromosome clade containing the undulatus, formosus, and spinosus groups, the scalaris group, a 32-chromosome clade containing the megalepidurus, torquatus, and poinsettii groups, and the grammicus and clarkii groups. Furthermore, we calculated the Bayes factor (BF) for a shift on this branch by incorporating the probability of a rate shift at that branch under the prior alone, and found overwhelming evidence for a shift (BF >139,000). When examined separately, the increased speciation rate for the rapid radiation clade is 0.15 species/Ma, which is double that of the background rate (0.06 species/Ma).

Discussion

Chromosome evolution and diversification

The link between chromosomal evolution and diversification in Sceloporus has been recognized for decades (reviewed by [24, 31]. A previous study of Sceloporus diversification and chromosomal evolution using a Bayesian cross-validation predictive density approach found that species diversity was significantly higher in some parts of the phylogeny than predicted in comparison to background diversification rates [27]. Instead of using a local approach to test hypotheses about diversification rate shifts on pre-specified sections of the phylogeny where chromosomal changes occurred, the BAMM analyses presented here take a global approach with the goal of detecting significant speciation rate shifts anywhere on the phylogeny (Fig. 3; Table 2). The single significant rate shift is estimated to have occurred during the rapid radiation leading to a clade of Sceloporus species groups with high diversity in fundamental chromosome number (Fig. 3). The estimated background rates of diversification are similar between the two methods (approximately 0.06 species/Ma), and this rate doubles in the clade containing increased chromosomal diversity (Fig. 5).

Common methods for testing for trait-dependent diversification are the “state speciation and extinction” models (e.g., BiSSE, MuSSE, QuaSSE, etc.) [50]. This family of methods attempts to identify significant speciation or diversification rate differences between species in relation to a trait of interest. This approach sounds appealing for testing the link between chromosome evolution and diversification in Sceloporus. However it is important to note that detecting trait-dependent speciation is prone to errors from model violations and model inadequacies, and that these problems have led to an excess of trait-dependent speciation associations in the literature [51, 52]. New statistical tests aimed at distinguishing false associations are available, but these tests are currently limited to binary and continuous characters [53]. In Sceloporus, attempting to coerce the multistate karyotype data into a binary model results in few independent associations between the character state and diversification, and this type of problematic character state distribution is expected to return a false positive association [53, 54]. As expected, BiSSE provides strong support for karyotype-dependent diversification in Scelporus (results not shown).

Vertebrate radiations, including Sceloporus, tend to diversify following a semi-predictable trajectory of divergence [55] along axes of habitat [56], trophic morphology [57], and communication [34, 58–60]. Chromosomal variation is a prominent feature of Sceloporus diversity that is putatively linked to their rapid diversification. Disentangling these factors (i.e., ecology, morphology, diet, chromosomes, etc.) to determine their separate and joint contributions to diversification will be an interesting route to take in future studies (see [61] for an example).

Based on a cursory examination of the current geographic distributions of species in relation to their karyotypes, closely-related species of Sceloporus with the same karyotype formula are not typically found in sympatry [24]. Instead, communities with multiple species of Sceloporus tend to contain species with different karyotypes. The relationship between community assembly and chromosome number has not been formally tested, but we predict that communities of Sceloporus will be over-dispersed on the phylogeny and support the observation that species with similar karyotypes are typically not sympatric.

The ancestral karyotype for phrynosomatid lizards is 2n = 34 (12 macrochromosomes, 20 microchromosomes, and an XY sex chromosome pair), and only Sceloporus shows variation around this karyotype formula, which ranges from 2n = 22 to 2n = 46 [31]. The speciation rate shift that we detected on the phylogeny (Fig. 3) is located at the base of a clade containing high chromosome number diversity. There are changes in the karyotypes of Sceloporus that are not associated with this particular clade, including minor modifications such as inversions and/or secondary constrictions near the centromeres of the macrochromosomes [27]. The most dramatic example of a chromosomal change in a species that is outside of the rapid radiation is Sceloporus merriami, which has a karyotype formula of 2n = 46 resulting from the fission of 6 macrochromosomes. The chromosomal changes observed in the species/species groups falling outside of the rapid radiation do not appear to be correlated with any significant shift in speciation rate.

The evolutionary changes in autosomes and sex chromosomes that have produced karyotypic diversity that is distinctive from the ancestral 2n = 34 formula require a reevaluation on our new phylogeny (Figs. 2 and 3). Previous studies suggesting that the magister and graciosus groups were not sister taxa assumed that they must have independently evolved several unique karyotype features. These groups each have missing or indistinct sex chromosomes, and they each contain 2n = 30 chromosomes (although the magister group also contains species with other arrangements). The new phylogenetic trees presented here support these species groups as sister taxa, and therefore the presence of indistinct or missing sex chromosomes presumably evolved once in the common ancestor, and the ancestral karyotype is most likely 2n = 30. The sister group relationship between the 22-chromosome clade and the 2n = 24 scalaris group is unchanged (this clade received 100 % support from concatenation, but only 55 % support from coalescent analysis), and this further supports the notion that multiple fusion events are responsible for progressively reducing the number of chromosomes in these groups. The new phylogeny also supports a 32-chromosome clade composed of the torquatus group, poinsettii group, and megalepidurus group. The 32-chromosome clade is sister to the grammicus group (2n = 32 – 46), and this clade is sister to the clarkii group (2n = 40).

Resolving rapid radiations

Resolving rapid radiations using molecular phylogenetic techniques requires sequencing a very large number of nucleotides. However, there is an important distinction between obtaining enough nucleotides to resolve a gene tree versus sequencing enough loci to resolve a species tree. Resolving a gene tree should be feasible if enough nucleotides are available at the locus and as long as the rate of evolution is adequate for the scope of the investigation. The extreme of this approach is taken when full sequences are obtained for non-recombining animal mitochondrial DNA (mtDNA) genomes (e.g., amphibians [62], birds [63], mammals [64]) or plant chloroplast genomes [65]. The gene trees estimated from these studies typically provide strong support for phylogenetic relationships, even for species involved in rapid radiations. Despite the strong appeal of obtaining a robust tree from just a single locus, there are many reasons to be suspicious of the relationships in gene trees, including problems associated with incomplete lineage sorting, gene duplication and extinction, and horizontal gene transfer [66], as well as issues related to inaccurate phylogenetic model assumptions (reviewed by [67]). The advantage of sampling independent loci from across the genome, rather than focusing effort on obtaining long sequences from one or a few loci, is that some of these problems can be circumvented in an attempt to obtain a more accurate phylogeny.

In Sceloporus lizards, previous studies using mtDNA obtained a fairly well-resolved and strongly supported phylogeny [29], but large discrepancies in relationships were apparent in comparison to a species tree estimated from a few nuclear loci, presumably as a consequence of mtDNA introgression [28]. Instead of sequencing more mtDNA aimed at obtaining an even more robust mtDNA gene tree, we leveraged our resources towards obtaining a large number of independent loci from across the genome using a targeted sequence capture approach. Not all of these loci that we selected were particularly useful for resolving the rapid radiation in Sceloporus. Only 3 % of the 585 loci that we obtained supported the rapid radiation that corresponds to the period of increased chromosomal diversification in Sceloporus (Fig. 4). The lizard-specific probe set that we designed for this project appears to have been barely capable of resolving this rapid radiation, and it is likely that this same set of markers will be incapable of resolving more difficult phylogenetic problems. Aside from developing a new probe set that targets more loci, two ways to increase the percentage of loci that contribute useful phylogenetic information in a targeted sequence capture experiment are to invest in longer sequence reads and/or optimize the lab protocol to obtain longer loci. Overall, the new paradigm of sequencing 100s or 1000s of loci in order to obtain a few loci that resolve a rapid radiation seems highly inefficient. Developing more refined locus selection methods that can identify loci with optimal evolutionary rates for a specific question, and thereby increase the probability that a loci will contribute useful phylogenetic signal, is an important direction for the future of phylogenomic studies.

Systematics of Sceloporus

The phylogenomic estimates for Sceloporus obtained using 585 loci (Fig. 2) provide strong support for relationships that have been difficult to elucidate using smaller amounts of data. At a higher taxonomic level, we find strong support for relationships among genera in the Sceloporine (i.e., [[[Urosaurus + Sceloporus],Uta,],Petrosaurus]) that are consistent with a recent study using the same data [13], and restriction site associated DNA sequencing (RADseq) data [68], but conflict with previous estimates that combine mtDNA and nuclear genes [29]. Within Sceloporus, the relationships at the base of the phylogeny are weak and differ depending on the analysis type (e.g., concatenation vs. coalescent analysis). More data may be necessary to obtain a definitive placement for the initial divergences within the genus. The composition of the early diverging groups is clear [69], including the variabilits group and the close relationships between the siniferus, angustus, and utiformis groups (this group was not sampled in our study), and determining which was the first to diverge requires further study.

The addition of loci has helped provide strong support for some species groups relationships that were unresolved with smaller nuclear datasets. For example, the clade containing the pyrocephalus, gadoviae, and jalapae groups that we obtained with the TSC data is also supported by analyses of mtDNA [28]. However, previous analyses of smaller nuclear gene datasets did not support the monophyly of this group [28, 29]. Several species group relationships have been difficult to determine because of the influence of gene tree conflict between nuclear and mtDNA [28]. One example of this problem pertains to the relationship between S. clarkii and S. melanorhinus, which differs between mtDNA and nuclear genes [28]. The mtDNA gene tree separates these species across the phylogeny, whereas analyses of nuclear data support them as a clade. We find strong support for a clade containing S. clarkii and S. melanorhinus, and since species groups are intended to provide names for monophyletic groups, we recommend naming this clade the clarkii species group.

We find strong support for a clade containing the poinsettii and torquatus groups, and we recommend referring to all species included in these groups as members of the torquatus species group. The poinsettii group was erected to deal with non-monophyly of the torquatus group in relation to the megalepidurus group [29]. Given that there is no longer any evidence of paraphyly in the torquatus group it does not seem necessary to retain the poinsettii group.

Monophyly of the spinosus group is weak or missing depending on the type of analysis and source of molecular data. A recent phylogeographic study of this species group revealed mtDNA introgression and gene flow between species [38]. These processes are likely responsible for the discordant phylogenetic relationships that have been described for these species [28, 29]. Gene flow and introgression play a prominent role in the evolution of Sceloporus [28, 70–72], and future phylogenetic studies of the group will benefit from new analytical approaches that can identify gene flow during species tree estimation.

Methods

Specimen collection

The family Phrynosomatidae is a diverse group of lizards with a broad North American distribution from Canada to Panama. Much of their diversity is centered in the arid regions of the southwestern United States and Mexico. The group has approximately 148 species arranged into nine genera. We sampled 129 phrynosomatid individuals, including one sample of Cophosaurus, Holbrookia, Petrosaurus, Uma, and Uta, two specimens of Callisaurus, all 17 species of Phrynosoma, and 86 species of Sceloporus (see Table 1 for voucher details). We sampled all species groups within Sceloporus with the exception of S. utiformis. We used Gambelia wislizenii and Liolaemus darwinii to root the tree. Specimens collected for this project from Mexico and the United States are deposited at the Burke Museum of Natural History and Culture at the University of Washington and the Museo de Zoologia “Alfonso L. Herrera” at the Universidad Nacional Autónoma de México. Specimens were collected with approval from the University of Washington Institutional Animal Care and Use Committee (IACUC #4209-01). Scientific specimens were collected in México with permission from the Secretariat of Environment and Natural Resources (SEMARNAT Permit No. 05034/11 to ADL, and Permit No. FAUT 0093 to ANMO). We also obtained tissue and/or DNA loans from the following genetic resource collections and herpetology collections: Museum of Vertebrate Zoology (University of California, Berkeley), Burke Museum of Natural History and Culture (University of Washington), California Academy of Sciences, Ambrose Monell Cryo Collection (American Museum of Natural History), Los Angeles County Museum, Royal Ontario Museum, University of Texas at Arlington, and the Museo de Zoologia “Alfonso L. Herrera” (Universidad Nacional Autónoma de México).