Background

Observations of discordance between paleontological and molecular age estimates, or between ages estimated from different molecular datasets, are fairly common and have been observed in angiosperms [1]-[4], mammals [5]-[10], ray-finned fishes [11]-[15], and various other lineages across the Tree of Life [16],[17]. Multiple factors are invoked to explain conflicting molecular age estimates, including modeling of paleontological calibrations [18]-[24], nucleotide substitution rate heterogeneity [25],[26], and the appropriateness of the molecular clock model used [27],[28]. However, the impact of the phylogenetic informativeness of gene sequences on molecular divergence time estimates has received considerably less attention.

Sequences whose sites have experienced large numbers of substitutions, either as a function of a rapid rate of molecular evolution or the passage of long periods of evolutionary time, will likely exhibit a high frequency of homoplasious character states. This homoplasy is problematic because a high frequency of convergent nucleotide states can bias branch length estimation in phylogenetic analyses, as the rate of hidden substitutions will often be under- or overestimated [29]-[32]. However, distinguishing between homoplasy and other sources of discordance is challenging in empirical datasets. This is largely because commonly used metrics that address the influence of nucleotide saturation on phylogenetic branch length estimates, such as saturation plots [33]-[35], are often difficult to interpret in terms of where in a given clade’s history the branch length estimates or inferences of phylogenetic tree topology may be compromised [36].

Phylogenetic informativeness (PI) methods offer a strategy to directly relate nucleotide saturation in data sets to molecular divergence time studies [36]. Using the ratio of a rate of evolution to the optimal rate of evolution for phylogenetic inference at a particular time, PI profiles quantify informativeness. Briefly, this approach takes an inferred vector of site-specific rates and generates a normalized, asymptotic likelihood density for a true synapomorphy occurring at a historical time T under a given model of character evolution. It should be noted that this is a predictive likelihood based entirely on the site-rates inferred, not an assessment of the validity of any empirical result. Analysis of the full site-rate distribution provides a profile of phylogenetic informativeness that spans the entire temporal span of the focal phylogeny, illuminating the scope of phylogenetic signal attributable to the data through evolutionary time [36]. Although phylogenetic informativeness profiles make no direct statement regarding the degree of saturation present in a dataset, the decline of the informativeness profile that follows the peak has been dubbed a “rain shadow of noise,” where the likely quantity of homoplasy influencing a node is comparable to the drop from the peak of the PI profile to its height aligned to the node [37]. This decline at depth from the peak of the PI profile should not only provide insights into where in the temporal span of the phylogeny to expect an increase in phylogenetic noise for topological inference, but also should provide a predictive tool for assessing the potential for data sets to provide strongly supported yet discordant age estimates.

Determining whether estimated ages are the result of estimation biases, or substantial gaps in the fossil record, is especially critical in reconstructing the history of vertebrate diversification. Many analyses of mitogenomic datasets sampled from lineages that span deep evolutionary time scales have resulted in age estimates that are far older than the expectations from the fossil record [14],[38]-[41] and the rapid rate of nucleotide substitution that characterizes the vertebrate mitochondrial genome has prompted multiple investigators to question whether molecular age estimates obtained using mitogenomic data are the result of such biases [30],[42].

The discordance between molecular age estimates is striking within ray-finned fishes (Actinopterygii). While actinopterygians have a fossil record dating to the Devonian [43], limited paleobiological surveys of richness through time [44] and striking disagreement between competing molecular age estimates have impeded our understanding of the timescale underlying their diversification. Estimates made for actinopterygians on the basis of mitogenomic and nuclear datasets often indicate very different evolutionary timelines. For example, the origin of acanthomorphs (spiny-rayed fishes), which include nearly one third of all living vertebrates [45], is placed in the Triassic (252.2-201.3 Ma) based on mitogenomic age estimates [38],[40]. This suggests that the majority of living fish diversity originated during a period of recovery following the Permian-Triassic mass extinction event (252.2 Ma), with major acanthomorph lineages beginning to radiate throughout the Triassic and Jurassic (201.3-145.0 Ma). By contrast, recent molecular clock analyses using nDNA place the origin of acanthomorphs in the Early Cretaceous, followed by extensive diversification in the Late Cretaceous and early Paleocene [13],[46]-[49]. A comparable timescale has also been estimated by some mitogenomic analyses [50], and is more consistent with patterns in the fossil record [44],[51]-[53] than the early Mesozoic acanthomorph radiation implied by other studies. As actinopterygians comprise half of all living backboned animals, including several model organisms and species of great economic importance, understanding the timing of their diversification provides critical insight into the evolutionary history of vertebrates.

Disagreement between mtDNA and nDNA estimates is not directionally consistent, i.e., many mtDNA estimates are older than nDNA counterparts, but there are examples of the opposite pattern [13],[48],[50], and discordant timescales might also arise from other factors that differ between analyses: the selection and placement of calibrations, the clock models applied, and the taxa sampled. However, this study seeks to fix, as completely as possible, these other variables, and explore the impact that different sequence data have on estimation of evolutionary timescales. We evaluate phylogenetic informativeness (PI) profiles [36] for gene sequences sampled among major actinopterygian lineages and demonstrate that is not necessary to attribute divergent molecular age estimates to issues in modeling of paleontological data as calibration age priors as previsouly suggested [13]. Instead, the divergent estimates can be attributed to levels of homoplasy in the mtDNA and nuclear gene datasets that distort inferences at different time scales. By removing data partitions that are saturated, we reconcile divergent molecular age estimates for actinopterygians, bringing these more in line with ages implied by the fossil record. These results demonstrate that selection of sequence data appropriate for the time scale of inferences is as important as the selection of calibrations and molecular clock models for divergence time estimation.

Results

Information-theoretic based searches of partitioning strategies found 11 and 14 partitions as the best fit for the mtDNA (Table 1) and nuclear DNA (nDNA; Table 2) datasets respectively. Molecular age estimates between sets of analyses with differing calibration schemes were very similar (Figures 1 and 2; Additional file 1: Table S1 and S2) and substantially outside of the prior expectations (Figure 1). As manipulation of the calibration age priors to reflect potential uncertainties in the fossil record had a minimal influence on trends in the resulting molecular age estimates, we restrict discussion to the most calibration-rich analysis here. Regardless of calibration strategy, molecular dating analyses of the mtDNA and nDNA datasets including all data partitions resulted in very different posterior age estimates, with the mtDNA posterior age estimates generally being much older and exclusive of the 95% highest posterior density interval (HPD) for many nDNA estimates (Figures 1 and 2). Estimated ages were most consistent towards the root of the tree, with higher uncertainty in the mtDNA estimates. The HPD of the estimated age from the mtDNA analysis for the most recent common ancestor (MRCA) of actinopterygians ranged between 383 and 416 Ma (mean: 395 Ma [Middle Devonian]) similar to the HPD of ages for this node in the nDNA analysis, which ranged between 383 and 399 Ma (mean: 389 Ma [Middle Devonian]; Figure 1). In contrast, instances of discordance between the mtDNA and nDNA age estimates were more extreme within acanthomorphs. For example, the HPD for the MRCA of Tetraodontidae and Diodontidae (pufferfishes) ranged between 50 and 57 Ma (mean: 52 Ma [Eocene]) in the nDNA analyses, whereas the HPD ranged between 149 to 179 Ma (mean: 164 Ma [Middle Jurassic]) in the mtDNA analyses (Figure 1). Similarly, the HPD for the MRCA of Fundulus and Gambusia ranged between 124 and 160 Ma (mean: 141 Ma [Early Cretaceous]) in the mtDNA analyses, compared to an HPD between 29 and 46 Ma in the nDNA analyses (mean: 37 Ma [Eocene]; Figure 1).

Table 1 Best-fit nucleotide substitution models and partition strategies identified by PartitionFinder and peaks of phylogenetic informativeness for mtDNA
Table 2 Best-fit nucleotide substitution models and partition strategies identified by PartitionFinder and peaks of phylogenetic informativeness for nDNA
Figure 1
figure 1

Comparison of the estimated 95% HPD interval for five example contrasts of mtDNA and nDNA age estimates from all data, those from datasets pruned of the saturated partitions, and the prior age expectation based on the effective prior (*analysis run without sequence data): Acanthomorpha (spiny-rayed fishes); MRCA Fundulidae and Poeciliidae (topminnows and livebearers); MRCA African and American cichlids, MRCA Tetraodontidae and Diodontidae (smooth and spiny puffers), and Actinopterygii (ray-finned fishes). Lines indicate bounds of the 95% HPD interval, circles correspond to mean age estimates.

Figure 2
figure 2

Comparison of mtDNA and nDNA chronograms for actinopterygians. Ages were estimated by analyzing A) all data and B) analyzing data that excluded saturated partitions. Bars indicate 95% HPD intervals of age estimates. Light bars indicate posterior probabilities greater than 0.95. Gray bars indicate posterior probabilities below 0.95. Phylogenetic informativeness profiles for both datasets are shown adjacent to the associated chronograms. Colors identify individual partitions. C) Comparison of mtDNA mean age estimates for all nodes when all data is used and the mean ages from the pruned dataset. D) Comparison of nDNA mean age estimates for all nodes when all data is used and the mean ages from the pruned dataset.

The phylogenetic informativeness profile of several mtDNA partitions peaked prior to the majority of nodes present in the tree (Figure 2A; Table 1), and the shapes of the PI profiles for each individual codon position were similar within partitions (Figure 3). Higher profiles of PI indicate greater utility for phylogenetic inference. However, a decline of PI profile following the peak is indicative of a “rainshadow of noise”, reflecting an increased probability that numerous hidden substitutions have accumulated that can mislead phylogenetic inference [37]. While the informativeness profiles of the nuclear gene partitions also exhibited a signature of homoplasy (Table 2) and conservation of PI profile shape within partitions (Figure 4), these declines were much less severe than those observed for the mtDNA partitions (Figure 2A). Removal of nucleotide data partitions with PI profiles that exhibit greater than a five percent decay of informativeness from the PI profile peak prior to the Cretaceous-Jurassic boundary (145.0 Ma) from the subsequent relaxed molecular clock analyses removed 7354 individual sites (Table 1) and resulted in substantial changes to the posterior age estimates using mtDNA (Figures 2B and 5). Indeed, almost all estimated ages from the mtDNA dataset after pruning of saturated data partitions shifted between 50 and 100 million years towards the present (Figures 2C and 5B). In contrast, removal of the saturated nDNA data partitions resulted in a removal of 2710 individual sites (Table 2) that had less effect on divergence time estimates (Figures 2D and 5). This global shift in node age estimates undermines support for a more ancient timescale of ray-finned fish evolution, instead reconciling the divergence time estimates generated by the two datasets. While analysis of the complete mtDNA dataset estimated the bulk of extant ray-finned fish lineages to have originated in the Jurassic (Figures 2A and 5), analysis of datasets pruned of saturated partitions shifted the majority of these estimates into the Cretaceous (145.0-66.0 Ma), and Paleogene (Figures 2B and 5). For example the HPD estimated from the pruned mtDNA analysis for the MRCA of acanthomorphs shifted to a range between 112 and 179 Ma (mean: 145 Ma [Early Cretaceous]). This result is congruent with the HPD of 108–135 Ma based on nDNA (mean: 122 Ma; Figure 2B), but substantially deviates from the HPD of 223–254 Ma estimated using the full mtDNA dataset (mean: 238 Ma [Late Triassic]; Figure 2A). This shift in estimated ages was also observed in younger clades. For instance, the HPD of 34–89 Ma (mean: 50 Ma [Eocene]), estimated from the pruned mtDNA analysis for the MRCA of sticklebacks and eelpouts undermines the credibility for the original estimated HPD of 109–149 Ma (mean: 129 Ma [Early Cretaceous]) obtained from the full mtDNA dataset. Instead this result now largely overlaps with the nDNA-based age HPD of 33–58 Ma (Mean: 45 Ma [Eocene-Paleocene]; Figure 2B).

Figure 3
figure 3

Visualizations of the individual phylogenetic informativeness profiles for each codon position in the mtDNA dataset. Inset letters (A-N) correspond to data partitions (1–14) in Table 1.

Figure 4
figure 4

Visualizations of the individual phylogenetic informativeness profiles for each relative codon position in the nDNA dataset. The first frame of each relative codon position corresponds to starting position of each gene in the Near et al. [48] alignment. Inset letters ( A-L) correspond to data partitions (1–11) in Table 2.

Figure 5
figure 5

Comparison of means and 95% HPD intervals for mtDNA and nDNA based posterior age estimates, (A) when all data from each dataset is included, and (B) when partitions are excluded from each dataset based on their PI profile. Black circles indicate mean age estimates, gray and blue bars respectively indicate the 95% HPD interval of the mtDNA and nDNA based analyses.

Discussion

The timescale of ray-finned fish evolution

Discordance between the older mtDNA age estimates and the more recent nDNA based estimates have been attributed to a lack of shared calibrations between mtDNA and nDNA based studies [13]. However, despite using identical calibrations between datasets, our mtDNA based estimates still support an origin of most major lineages in the Jurassic (Figure 2A). These results are similar to some previous mtDNA based studies [38], and are in direct opposition with patterns of fossil richness that depict a radiation of living actinopterygian, particularly species-rich acanthomorph, lineages in the Cretaceous [44],[52],[54],[55]. These ages are also at odds with our nDNA based age estimates, which more closely match paleontological expectations within acanthomorphs (Figure 2A). We therefore find little support that calibration choice is the primary driver of discordant age estimates between these datasets.

Instead, PI profiles show that for deep divergences more than half of the identified mtDNA protein-coding gene partitions predict potentially misleading saturation (Figures 2A and 3). Saturation of mitogenomic data has been suggested to bias topological inferences in ray-finned fishes [56]. Removal of partitions exhibiting predicted saturation resulted in a topological rearrangement that greatly diminished support for relationships that differed from the nDNA based topology globally. Even when taking uncertainty in phylogenetic relationships into account, removal of saturated partitions resulted in a consistent shift in age estimates by as much as 100 My towards the present for almost all nodes in the tree (Figure 1), with 95% HPD intervals overlapping between nDNA and mtDNA based estimates (Figure 5). In contrast, removal of noisy partitions from the nDNA analysis had a neglible affect on the resulting divergence times, as would be predicted by the shallow decline in the PI profiles (Table 2; Figure 4).

This reconciliation of the ray-finned fish evolutionary timeline contributes to a strengthened historical framework that promises new insights into the evolutionary processes that generate and maintain aquatic biodiversity. Our finding suggests that at least some of the discordance surrounding actinopterygian divergence time estimates [12]-[14],[38],[40],[57],[58] can be explained by branch length estimation biases. However, this finding should not be viewed as a problem restricted to mtDNA or as a general phenomenon in which saturation always results in older age estimates. The latter is certainly not true, as Phillips [59] demonstrated that depending on the calibration placement and the character state patterns of saturated sites, nucleotide saturation can result in either tree extension or compression.

Deep time mitogenomic studies of fishes also do not always conflict with nDNA based studies. For example, in a mitogenomic study by Miya et al. [50], a large range of sampled acanthomorph lineages have divergence time estimates that overlap between nDNA based [46]-[49] and reject previous mtDNA based analyses [40]. This reversal of conflict is partially explained for two reasons. First, our informativeness profiles show partitions of the fish mitochondrial genome to be more informative at recent timescales (Figure 3), and Miya et al. [50] limited their taxon sampling to only 30% of the timescale sampled by Miya et al. [40]. Second, this study excluded the gene nd6, from which we also exclude two thirds of codon positions in this study (Table 2), and recoded transitional changes (changes within purines or pyrimidines) to a single state in an effort to exclude saturated sites [50]. Although Miya et al. [40] also attempted to mitigate the influence of saturation by excluding third codon positions, our results highlight rates of molecular evolution do not always conform to codon positions and that informativeness for some first and second positions also declines over deep timescales (Figure 3).

Our reconciliation of the timeline of ray-finned fish diversification adds support for the growing consensus that the patterns of species richness observed in living fishes are largely the product of diversification during the late Mesozoic and Cenozoic [13],[47],[48],[53],[58],[60]. Much of the diversity of living fishes can be attributed to the success of acanthomorphs, which comprise roughly one in every three vertebrate species. Understanding the timeline underlying this group is therefore not only critical to investigations of the evolution of fish biodiversity, but also to investigations of vertebrate evolution in general. Although there is still conflict between divergence times estimated for acanthomorph clades such as tetraodontiforms [11],[15],[48],[49],[61]-[63], cichlids [47],[48],[64],[65], and notothenioids [48],[66],[67], uncertainties surrounding competing mean age estimates typically show broad overlap between the majority of studies. This is encouraging and suggests convergence on a robust temporal framework from which to unlock the mode and tempo of diversification in this spectacular group of vertebrates.

Profiling phylogenetic informativeness

Recognition that convergence in nucleotide character states, or saturation, diminishes the utility of gene sequences for phylogenetic inference at deeper evolutionary time scales is not new [29],[31],[33],[35],[68]-[70], and multiple approaches exist to assess when homoplasy has or will critically influence phylogenetic inferences. These include saturation plots [31],[33], RY coding variable characters [71],[72], down weighting characters [72],[73], and site removal [70],[74]. However, the development of a predictive framework that facilitates careful scrutiny of the power of diverse datasets to resolve phylogenetic problems has only recently begun [36],[75]-[77]. Our finding that PI profiles predict the temporal optimality of markers for providing branch length estimates identifies a useful heuristic framework for assessing the credibility of existing age estimates.

While discordance in molecular age estimates is frequently attributed to factors such as differential application of fossil-based age constraints [78]-[81], suitable modeling of paleontological calibration data [18]-[20], nucleotide substitution rate heterogeneity [25],[26],[82], or differences between molecular clock models [27],[83]-[85], our results demonstrate that the choice of molecular markers can also drive discordance when identical analysis conditions are employed. Although our study focuses on largely on mitochondrial genes because these have been found to have a nucleotide substitution rate much higher than nuclear exons commonly used in phylogenetic studies of many vertebrate clades [86]-[88], this is not a universal pattern across the tree of life [89]-[92]. Nucleotide saturation is a feature often observed in next-generation phylogenomic datasets [73],[93]-[96].

By utilizing phylogenetic informativeness approaches to identify data partitions characterized by saturation and homoplasy, much of the discordance between mtDNA and nDNA datasets is reconciled (Figure 2). It should be noted that this reconciliation does not always guarantee a one-to-one mapping of divergence time estimates between analyses nor does it predict how changes in the analytical conditions will influence posterior age estimates. For example, our exclusion of a prior age calibration for the MRCA of African and New World cichlids or stem Tetraodontiformes resulted in a shift to slightly older ages when using the pruned mtDNA dataset while not having a pronounced affect on the distribution of the global node ages inferred in nDNA based analyses (Additional file 2: Figure S1 and Additional file 3: Figure S2). However, in this case differences between age estimates were minor, with the removal of saturated data partitions having a far greater impact on resulting age estimates.

Phylogenetic informativeness approaches facilitate a diagnosis of when to expect a rise in homoplasious site patterns, offering an objective criterion for screening data by its utility for molecular divergence time estimation. As phylogenetic datasets become “phylogenomic” in scale [97], development of these approaches becomes essential for the selection of loci that will not be misleading from potentially hundreds of markers [98]. Our results demonstrate that for age-estimation just as for phylogenetic inference [99], more data alone is not enough. While adding more data can provide novel insights into the evolutionary patterns that underlie the Tree of Life [100], more data alone does not render an analysis immune from methodological artifacts such as homoplasy [73],[94],[99],[101]-[103]. As divergence time estimates rely on calibrated substitution rate estimates our results underscore that it is necessary to diagnose the effects of saturation and homoplasy at different time scales, even as hundreds, if not thousands, of loci become applied to dating the Genomic Tree of Life.

Conclusions

We demonstrate that even with identical calibration regimes and molecular clock methods, the influence of homoplasy has a pronounced affect on divergence time estimates. We also demonstrate the utility of PI profiles for providing a much-needed non-arbitrary criterion for data exclusion. By extending the utility of PI profiles to this task, we highlight the ability of these methods to assess the robustness of age estimates relative to the frequency of homoplasious character states in the data. Applying this approach to the timescale of ray-finned fish evolution, we reconcile two fundamentally different views on the timescale of aquatic vertebrate diversification by removing partitions deemed saturated. The results obtained by pruning the saturated positions in this manner are more in line with paleontological expectations, suggesting that most major lineages of extant fishes today are Cretaceous in origin rather than emerging subsequent to the Permian-Triassic mass extinction event or the remnants of an ancient Jurassic radiation. Our results suggest that in addition to care in the selection of calibrations and molecular clock models, careful scrutiny of the potentially misleading impact of homoplasious data to be a fundamental component of divergence time estimation.

Methods

Dataset assembly

All of the DNA sequence data used in this study was obtained from Genbank. The ray-finned fish nuclear gene dataset contained 9 nuclear protein-coding genes (zic1, myh6, rag1, ptr, tbr1, Glyt, SH3PX3, plag12, sreb2), sampled from 44 species that included representatives from most of the major ray-finned fish lineages (Additional file 1: Table S1). We contrasted inferences based on the nuclear gene dataset with alignments from 10 protein-coding mtDNA genes (nd1, nd2, COI, CO2, atp8, atp6, COIII, nd5, nd6, and cytb) for representatives of the same major ray-finned fish lineages (Additional file 1: Table S2). Alignments for each mtDNA and nuclear gene were generated using MUSCLE v3.7 [104], then refined by eye using the translated amino acid sequences. Individual gene alignment files were concatenated using Phyutility [105]. For both datasets, we simultaneously assessed optimal partitioning strategies and the fit of potential nucleotide substitution models by comparing Bayesian Information Criterion (BIC) scores calculated using PartitionFinder [106]. Potential partitioning strategies that were compared included all possible gene and codon partition schemes, ranging from a single global partition to allowing each gene and codon position to have their own partition.

Divergence time estimation

We used BEAST v.1.7.5 [107] to infer the marginal posterior distribution of ultrametric trees under a model of uncorrelated rates that follow a lognormal distribution (UCLN) for all analyses. For each BEAST run, we assigned a birth-death prior to rates of cladogenesis [107] and ran four independent Markov Chain Monte Carlo (MCMC) runs between 100 million and 1 billion generations, sampling every 1000–10000 generations. Chains were deemed convergent by visual examination of the chain likelihoods (Additional file 4: Figure S3) in Tracer 1.5 [108]. To ensure adequate mixing of each chain, the effective sample sizes (ESS) for all model parameters were assessed with ESS values above 200 indicating appropriate sampling from the posterior distribution of each parameter.

We enforced the monophyly of several nodes in both sets of analyses, as this constraint greatly decreased the time to convergence in preliminary analysis. The monophyly of Actinopteri relative to Polypterus was enforced based on results of previously published sets of phylogenetic analyses of morphological and molecular data [43],[109]-[111]. Additionally the monophyly of teleosts, euteleosts, neopterygians, otocephalans, acanthopterygians, cyprinodontiforms, tetraodontiforms, cichlids, and acanthomorphs was enforced based on previous phylogenetic analyses [13],[46]-[49],[112],[113].

Divergence time estimates were calibrated in a series of three sets of analyses that used between five and seven calibrations previously utilized in investigations of actinopterygian divergence times [13],[48]-[50],[64]. Applying identical calibrations and analytical conditions to both datasets allowed us to directly investigate the potential for saturation to influence divergence time estimates. Multiple studies have demonstrated that the width of the calibration age prior distribution influences the posterior distribution of Bayesian age estimates [18],[19],[114], and upper bounds on priors were identical to those in Near et al. [13],[48] and Friedman et al. [64] for direct comparison. All analyses were run with and without nucleotide data to assess the influence of the prior on the posterior distribution of age estimates [27].

Paleontological data

Seven potential calibrations based on paleontogical data were taken from Near et al. [13], Near et al. [48], and Friedman et al. [64]. (1) The most recent common ancestor (MRCA) of all crown Actinopterygii was calibrated based on the occurrence of †Mimipiscis toombsi and †Moythomasia durgaringa from the Gogo Formation of Western Australia [111], 382.5 Ma [115]. The 95% prior age interval was set to 419 Ma based on the appearance of †Guiyu oneiros which represents the minimal age for the MRCA of Actinopterygii and Sarcopterygii [116]. (2) Crown-group Actinopteri was calibrated based on the Mississippian taxon, †Cosmoptychius striatus from the Wardie Shales, Lower Oil Shale Group, Scotland [117]. We assigned an absolute age estimate of 325.5 Ma [13],[118] with 95% of the prior interval set to 373 Ma with a soft upper bound, based on the maximum age bracketing approach of Marshall [20]. Although some analyses place †Cosmoptychius within the actinopteran crown [43],[119], others suggest it is a stem actinopteran [111],[120] . The minimum age estimate used by Near et al. [13] and applied here for †Cosmoptychius corresponds to the mid-Serpukhovian of the revised geological timescale [121]. The Serpukhovian †Discoserra is widely recognized as a crown actinopteran [12],[120] , so the age estimate applied here is appropriate regardless of specific placement of †Cosmoptychius. (3) The MRCA of Holostei (Amia and Atractosteus + Lepisosteus in this study) was calibrated based on the appearance of †Watsonulus eugnathoides from the Middle Sakamena Formation of Madagascar [122] with a minimal age of 245.9 Ma [123],[124] and 95% soft upper bound of 311 Ma based on the age of †Mesopoma planti[13],[125]. (4) The stem polymixiiform †Homonotichthys dorsalis from the Cenomanian Lower Chalk of Sussex and Kent, United Kingdom [126] was used to calibrate the MRCA of Polymixiiformes and Percopsiformes with a minimum age of 93.6 Ma and a 95% prior density interval that spanned 99.6 Ma based on the appearance of the putative stem acanthomorphs †Aulolepis, †Ctenothrissa and †Heterothrissa[13],[126],[127]. (5) The MRCA of extant spiny and smooth pufferfishes (Diodontidae and Tetraodontidae) was calibrated based on the appearance of several stem diodontids including †Prodiodon tenuispinus, †P. erinaceus, †Heptadiodon echinus, and †Zignodon fornasieroae from Bolca, Italy [128]. We do not accept the diodontid dentition described by Gallo et al. [129] as a reliable fossil calibration for the divergence between Diodontidae and Tetraodontidae. A Maastrichtian age is proposed for the fossil based on its color and general locality, but there is no record of the geological horizon from which it was collected and no matrix remains adhered to the specimen that might better constrain provenance. Following Near et al. [13] we set the minimum age of this calibration to 50 Ma with 95% of the prior age interval set to 57.3 Ma based on the maximum age bracketting approach of Marshall [20]. For all the above, we utilized lognormal prior age intervals with soft upper bounds allowing ages to be sampled outside the prior distribution of age estimates [130]. We adopted two calibration strategies for the divergence between Tetraodontiformes and Lophiiformes (6). Plectocretacicoidea contains a set of morphologically diverse Late Cretaceous acanthomorphs that have been interpreted as stem tetraodontiforms [131],[132]. The oldest plectocretacicoid, the early Cenomanian Plectocretacicus, has been nominated as a key fossil calibration for animal phylogenies [23],[24]. However, the interpretation of anatomically similar, coeval armoured acanthomorphs from Mexico as beryciforms [133] raises questions about the affinities of Plectocretacicus specifically and plectocretacicoids generally. The youngest plectocretacicoid is Cretatriacanthus, which is best known from the latest Campanian-earliest Maastrichtian of Nardò, Italy. Based on the argumentation given by Friedman et al. [64], we assign this fossil an age of 70.08 Ma. We applied a 95% prior age interval of 109.845 Ma based on the mean of the upper 95% credible intervals for fossil-based estimates of the age of Percomorpha [64]. Significantly, the age of the more commonly used calibration based on Plectocretacicus falls within this prior distribution. In a second set of analyses, we left the split between Lophiiformes and Tetraodontiformes uncalibrated based on perceived ambiguities in the interpretation of putative stem tetraodontiforms. We calibrated (7) the MRCA of African and neotropical cichlids using a minimum age of 46 Ma based on the stratigraphic information in Friedman et al. [64] with 95% of the prior age interval set at 85.625 Ma, which represents the mean of the upper 95% intervals for fossil-based estimates for Cichlidae [64]. This prior encompasses most [48] or all McMahan et al. [134] of the 95% HPD of ages estimates for the African and neotropical cichlid clade reported by recent molecular clock analyses that applied contrasting calibration strategies. Although this cichlid clade shows a geographic pattern congruent with Gondwanan vicariance, paleontological evidence based on both the distribution of cichlid bearing fossil horizons and the stratigraphic ages of closely related lineages reject an ancient origin for cichlids [64], as do fossil-calibrated timetrees that do not assume vicariance a priori [47],[48],[64],[134]. This finding reflects growing concerns that constraining ages based on present day distributions that reflect putative ancient vicariant events may bias our understanding of the evolutionary pathways that underlie the generation of modern biodiversity [135]. To assess the impact of the cichlid calibration on our divergence time estimates, we conducted a set of analyses leaving the divergence between African and neotropical cichlids uncalibrated.

Profiling informativeness

To quantify phylogenetic informativeness (PI) for each dataset, site-specific rates and informativeness profiles were quantified using the program HyPhy in the PhyDesign web interface [136]. For site-rate calculations, we provided the consensus of the posterior distribution of trees resulting from the BEAST analyses and the respective mtDNA or nDNA alignments as inputs. In comparison to the consensus tree, using a subsample of trees from the posterior distribution inferred from each dataset yielded nearly identical PI profiles. PI plots were generated for the data partitions identified by PartitionFinder [106] used in the BEAST analyses, with PI profiles visualized in comparison to the corresponding consensus ultrametric tree. While removing data partitions in which the apex of the PI profile occurrs prior to the root of the tree would be an optimal strategy to limit the increased probability of partitions containing homoplasious site patterns [37], preliminary analyses suggested that this strategy was not feasible for the mitogenomic dataset as this required removal of almost all data. As the primary contention in age estimates between mtDNA and nDNA based analyses concerns whether or not the bulk of living actinopterygian diversity radiated during and after the Cretaceous, partitions whose profiles exhibit a decline from the peak of informativeness of greater than 5% prior to Jurassic-Cretaceous boundary were removed for subsequent analyses.

Availability of supporting data

Xml files and associated tree files are available on Dryad: doi:10.5061/dryad.3rq51 and all sequence data is available on Genbank (http://www.ncbi.nlm.nih.gov/genbank).

Additional files