Background

Species today face many challenges that influence their genetic, phenotypic, and ecological diversity and divergence patterns across space and time. The genetic variation of their component populations comprises the raw material underlying overall and local adaptedness, providing resilience to anthropogenic stressors –such as climate change, habitat alteration and loss, invasive species, and exploitation [1, 2]. Notably, declines in genetic diversity may lead to decreased fitness, undermining ability to respond to present and future conditions [3, 4]. Evaluating comparative and hierarchical levels of genetic composition and differentiation of widespread taxa, as well as their unique or reservoir populations, thus is important for prioritizing conservation management strategies [5, 6].

Population genetic patterns of today’s temperate taxa resulted from historic and contemporary processes [7, 8], regulated by their physiological requirements, habitat connectivity, and dispersal capability [9, 10]. Broadly distributed taxa display heterogeneous patterns over a breadth of environmental and ecological conditions [11, 12]. Isolated populations experience drift and possibly evolve unique alleles [13, 14], whereas large inter-connected ones frequently have high gene flow and less genetic distinctiveness [15, 16]. Low migration may lead to higher divergences, whereas mobility fosters gene flow and homogeneity [17, 18].

During the Pleistocene Epoch, ~2.6–0.01 million years ago, the North American Laurentide Ice Sheet advanced south to the Ohio River system (Figure 1), altering population distributions and genetic compositions [19, 20]. Taxa became sequestered in glacial refugia and then moved northwards to colonize old and new habitats as the ice retreated [21, 22]. Recent climate warming is accelerating these historic dispersal patterns northward [23, 24].

Figure 1
figure 1

Maps of walleye sampling sites (lettered, corresponding to locations in Table 1and Additional file10). A Distribution across North America (modified from Billington et al. [25]), depicting genetic discontinuities among contemporary walleye spawning groups from Barrier v2.2 analysis [26], with black box denoting area of close-up study for B Lake Erie, with historic collections of walleye (labeled Q) and “blue pike” (R). Maps were created by us with ArcGIS® software by Esri (Redlands, CA).

Anthropogenic factors may further modify genetic patterns and force populations into sub-optimal habitats leading to reduced population size, genetic diversity, and/or local adaptation and resulting in possible extirpation [27]. For example, increasing temperatures may reduce population sizes and ranges of temperate cold-water fishes, e.g., lake trout Salvelinus namaycush (Walbaum 1792) and cutthroat trout Oncorhynchus clarkii (Richardson 1836) [23, 28]. Rising temperatures and decreasing population sizes have been linked to declining genetic diversity through drift and inbreeding in Atlantic salmon Salmo salar Linnaeus 1758 and brown trout S. trutta Linnaeus 1758 [29]. In contrast, warm-water species, such as the walleye Sander vitreus (Mitchill 1818) may broaden their ranges, shifting their distribution centers north [23]. Some outlying populations may fail to adapt, leading to declining genetic variability and extirpation [28], which may particularly affect southerly fringe areas such as the relict North River, Alabama population of walleye [30] (site Y; Figure 1). Exploitation and other anthropogenic stressors may accelerate diversity loss further [2, 31]. The present study analyzes the genetic patterns of walleye, in light of its past and potential future, to evaluate such patterns of variability, isolation, and continuity over space and time.

The walleye is an ecologically and economically valuable fish [32, 33] that supports large Great Lakes’ fisheries, peaking in Lake Erie [34]. It inhabits slow turbid lakes to fast flowing clear streams throughout much of North America (Figure 1). Historically, it ranged from the Mackenzie River in the Northwest Territories of Canada, south to the U.S. Gulf Coast, and northeastward to New Hampshire and Quebec; during the past century it also was transplanted into many other areas for fishing [25] (Figure 1A). After maturing at ~ age three, it migrates annually in spring to early summer to reproduce at natal spawning grounds [35], exhibiting site fidelity, homing, and philopatry [36]. Adults do not provide parental care or nest guarding, and range widely to feed after spawning, travelling 50–300 km [37].

Past studies by our laboratory examined fine-scale population patterns of walleye using nine nuclear DNA microsatellite (μsat) loci, showing that most spawning groups markedly differ even within lake basins and between proximate sites, as well as at broad scales supporting natal homing and spawning site philopatry [30, 3840]. Stepien et al. [38] found that the genetic structure of spawning groups remains similar from year to year, among age cohorts, and from generation to generation. An earlier study [41] addressed patterns across the Great Lakes using mitochondrial (mt) DNA control region sequences, whose geographic scope is extended here to accompany the μsat data set and includes new information from the Canadian Shield lakes for both data sets.

We additionally evaluate the historic genetic diversity of walleye from Lake Erie using preserved samples from 1923–1949. We address the taxonomic identity of a possible historic walleye variant, the “blue pike” S. v. “glaucus” (Hubbs [42]), whose distinction has been controversial [41, 43]. The “blue pike” was reported to be endemic in Lakes Erie and Ontario, where it co-occurred with the common walleye S. v. vitreus (hereafter referred to as walleye). The “blue pike” was believed to inhabit the deeper cooler waters of eastern Lake Erie, but also was caught in the shallow and warmer western basin along with walleye [44]. It reportedly spawned somewhat later and in deeper areas than walleye [43]. The “blue pike” was described to have a steel grey-blue color, larger eyes that were higher on the head, and a smaller interorbital distance than walleye [44, 45]. However, all characters overlapped extensively between the two [44]. Hubbs [42] originally described the “blue pike” as a separate species (“S. glaucus”), which then was demoted to a subspecies due to pronounced intergrades with walleye including color and all other morphological measures [44]. The present study thus addresses their identities using new morphological and genetic data.

Both “blue pike” and walleye shared a popular commercial fishery with the former collapsing in 1959 attributed to exploitation, pollution, and/or habitat alteration and the latter concurrently declining [44]. The last “blue pike” record was reported in 1965 [44] and the US Fish and Wildlife Service declared it officially extinct in 1983 [46]. The “blue pike’s” identity has been confusing since it did not belong to the pike family (Esocidae) and some walleye in northern waters (along the Canadian Shield) are colored bright turquoise-blue due to sandercyanin protein in the mucus [47], which does not match the darker grey-blue color described for “blue pike” [48, 49]. Early fishery biologists called these turquoise mucus walleye “mutants” and stated that they lacked the body color, shape, and other morphological characteristics of “blue pike” [49]; however, these variant colors have become confused in the literature [41]. Turquoise mucus and yellow walleye occur sympatrically in the same northern habitats, and the latter also have some turquoise mucus pigment [45]. The turquoise mucus additionally characterizes some yellow perch Perca flavescens (Mitchill 1814) in those habitats. Some of the turquoise color typically “rubs off” when the fish is collected [46 and CAS, pers. obs.].

This study aims to resolve population genetic patterns of walleye across its native range, providing a baseline for evaluating future anthropogenic pressures. We analyze contemporary walleye spawning groups and compare Lake Erie populations to historic samples, including the putative “blue pike” variant. Morphological characters also are analyzed for the historic samples. We expand the previous sampling coverage to 1181 walleye and combine information from mtDNA control region sequences and nine nuclear DNA μsat loci. Patterns are evaluated at multiple evolutionary and temporal scales, with mtDNA sequences revealing historical context (glacial refugium origins and taxonomic relationships) and μsat loci addressing contemporary microevolutionary processes (migration, gene flow, and genetic drift) [50, 51]. Biogeographic patterns are compared with those of other North American taxa. Specific questions are: (1) What is the genetic structure of walleye across its native distribution?, (2) How does genetic diversity vary across the range?, (3) What is the relationship of contemporary samples to historic Lake Erie (1923–1949) patterns?, and (4) Did the extinct “blue pike” significantly differ from walleye?

Results

Genetic diversity and phylogenetic relationships from mtDNA

Contemporary walleye populations contain 27 control region haplotypes (Figure 2; GenBank Accession numbers U90617, JX442946–56, KC819843–54, KF954732–35). Haplotypes 1–23 match those previously described by our laboratory [41, 52, 53] and five additional ones are identified here (four from contemporary samples, haplotypes 24–27; GenBank:KF954732–35, and one from historic Lake Erie walleye, haplotype 28; GenBank:KF954736) –totaling 28 haplotypes. Haplotype 19 from the Ohio/New Rivers (W–X, 1.00pp/98%) is located basally on the phylogenetic tree as the sister clade to all other walleye haplotypes, diverging by 19 steps (sites W and X; Additional file 1). The remaining 27 haplotypes share a common origin, with most differing by just single mutational steps, excepting haplotype 20 from the North River (site Y), which differs by eight (Figure 2A). Historic walleye haplotype 28 groups with the others (Additional file 1), varying by just a single mutational step from abundant haplotype 3 and found in a single individual.

Figure 2
figure 2

Relationships among mtDNA control region haplotypes of walleye, including A haplotype network from TCS v1.21 [[54]] and B relative frequencies per population sample. Haplotypes from contemporary spawning groups are numbered 1–27 and #28 was found in a historic walleye. Letters = geographic locations of haplotypes from Table 1. * = haplotypes occurring in the historic walleye and “blue pike” samples analyzed here. Circles in the network are sized according to total observed frequency of the haplotype. Lines = a single mutational step between the haplotypes; small unlabeled circles = those hypothesized/not sampled.

Five abundant haplotypes (1–4, 11) comprise 90% of walleye individuals, with haplotype 1 being the most frequent and widely distributed (Figure 2B, Additional file 2). Haplotype 3 is the next most numerous, and is distributed from the upper Great Lakes through the south. It is the sole haplotype shared between the contemporary and historic samples; it predominated in historic Lake Erie walleye and characterized all “blue pike” samples, in contrast to being represented in just 22% of today's Lake Erie walleye (Figure 2B, Additional file 2). The historic “blue pike” samples thus do not genetically differ from historic and contemporary walleye in mtDNA control region sequences. Haplotypes 2 and 4 are less abundant, yet have wide distributions and characterize a large number of individuals and locations. Haplotype 11 is broadly distributed and abundant in the northern samples (A–G, K, V), but mostly is absent from other areas (Figure 2B, Additional file 2). Three other haplotypes appear unique to separate drainages: haplotype 25 in Lake Winnipeg and the Upper Mississippi River (A, D–E), haplotype 19 from the Ohio/New Rivers (W–X), and haplotype 20 in the North River (Y).

Each spawning group contains multiple haplotypes (Figure 2, Table 1), with Lake Erie having the most (O–P) and Lake of the Woods (C), St. Louis River (F), Moon/Musquash Rivers (L), Lac Mistassini (V), and North River (Y) having the least. Historic walleye and “blue pike” appear to have possessed fewer haplotypes compared to populations in Lakes Erie and Ontario today. Modest numbers and proportions of private haplotypes characterize contemporary populations, with Oneida Lake (U) and North River (Y) containing the most. Lake Erie walleye have more private haplotypes than other Great Lakes' locations. A rare private haplotype (28) occurred in a single historic walleye individual, which appears to have been lost to today's populations. Historic “blue pike” samples lacked private haplotypes.

Table 1 Genetic variation of contemporary walleye populations (sampling areas labeled A–P, S–Y), historic walleye (Q), and “blue pike” (R) determined from mtDNA control region sequence data and nuclear DNA μsat loci

Haplotype diversity (HD) varies greatly among populations (Table 1), with the Moon/Musquash Rivers (L) and Mille Lacs (E) being lowest and Ohio/New Rivers (W–X), eastern Lake Erie (P), Cedar Lake (A), and western Lake Erie (O) highest. Historic walleye and “blue pike” samples had much lower diversity than contemporary walleye from Lakes Erie and Ontario, supported by analysis of variance (ANOVA; F = 47.24, df = 2, p = 0.005).

Genetic diversity patterns from nuclear DNA loci

All nine μsat loci conform to Hardy Weinberg Equilibrium (HWE) expectations and are unlinked. Locus Svi17 has a higher FST value, suggesting some possible positive selection, whereas SviL7 has the lowest, suggesting slight balancing selection based on results from Lositan [56] (Additional file 3). The remaining seven loci conform to neutrality, having intermediate FST values (mean = 0.090). Locus Svi18 has the fewest alleles and Svi6 the most. Seven loci are included here for historic walleye and “blue pike”, since Svi17 and L7 failed to amplify. Results from the seven versus the nine loci are identical among the 23 contemporary spawning groups, therefore, the former are presented here; the dataset based on nine loci was detailed by Stepien et al. [30, 40].

Most walleye populations (87%, 152/175 comparisons) are free of null alleles. Micro-Checker v2.2.3 [57] detects some slight homozygote excess at Svi33 (for populations C, L, V), Svi4 (C, P), Svi18 (O, Q, R, V), SviL6 (O, Q, T), Svi6 (F, Q, U, X), and Svi7 (L, M, Q). Since those loci have no such excess in other spawning groups, all populations are in HWE, and are free of heterozygote deficiency, scoring error, and stuttering [57], all loci are analyzed here. The historic walleye sample (Q) contains some possible null alleles (57%, Svi18, L6, 6, and 7), which appear more prevalent at loci having longer allelic lengths (Additional file 4). However, these allelic frequencies are similar among all samples: 0.00–0.20 for contemporary walleye spawning groups, 0.02–0.22 for historic walleye (site Q) and 0.00–0.27 for historic “blue pike” (R), negating their possible bias.

FreeNA analysis [58] results moreover discern no effect from possible null alleles on FST values. With FreeNA corrections, pairwise comparisons between the historic (Q–R) and contemporary samples (O–P, S–T) only slightly change (from 0.141 to 0.120), retaining a similarly large magnitude of difference between the sampling intervals. FST values between the contemporary populations change only at the thousandth decimal point after correction (e.g., 0.058 to 0.054). Thus, there was no need to adjust the allele frequencies against possible null alleles, as there was no apparent effect.

We recover 155 alleles among seven nuclear μsat loci, across 23 contemporary walleye spawning groups, historic walleye, and “blue pike” (Table 1, Additional file 3). Many alleles are shared, widely distributed, and have high frequency in contemporary and historic samples (Additional file 4). Great Lakes’ populations possess more alleles than the others, with Lake Erie housing the most, similar to control region data. Allelic richness is highest in the Ohio (W/X) and North Rivers (Y), moderate in Great Lakes’ populations, and lowest in the northern sites (A–E, V). ANOVA indicates that allelic richness significantly varies among populations (F = 2.50, df = 22, p < 0.001); Tukey’s post hoc tests, however, reveal no overall pattern.

Historic walleye (Q) and “blue pike” (R) possess fewer alleles and lower allelic richness (ANOVA F = 11.129, df = 5, p < 0.001) than characterize contemporary walleye from Lakes Erie (sites O–P) and Ontario (S–T; Table 1, Additional file 4). Allele lengths show some tendency to be smaller in historic samples and longer in the contemporary samples (Additional file 4). Historic walleye also display some inbreeding depression, whereas “blue pike” suggest some outbreeding (Table 1), but both were in HWE. Contemporary populations comparatively have moderate values. Colony v2.0.5.0 [55] analyses indicate that full siblings constitute 8% of contemporary walleye populations, being highest in McKim Lake (D) and Lake Nipigon (G). Higher values also characterize our historic walleye and “blue pike” samples (R).

Modest proportions of private μsat alleles typify the 23 contemporary samples, with the North River (Y) having the highest (Table 1). Historic walleye samples contained the most private alleles. However, all historic “blue pike” samples possessed common walleye alleles, with no unique ones, supporting lack of distinction.

Genetic diversity values from the μsat loci (HO) generally are somewhat higher than in the control region (Table 1). Diversities of historic walleye and “blue pike” were relatively low compared to contemporary samples. ANOVA indicates some differences in diversity among contemporary spawning groups (F = 1.79, df = 22, p = 0.02), and between contemporary versus historic samples (F = 3.83, df = 5, p = 0.007), however, Tukey’s tests describe no overall pattern. Thus, genetic diversity is similar among all samples.

Spatial genetic structure of walleye populations

Relationships among spawning groups show a pattern of genetic isolation with geographic distance (Additional file 5). Most are genetically distinctive (Table 2, Additional file 6) for both the control region and μsat data sets, indicating high genetic structure. The most pronounced divergences distinguish three geographic regions: northern (A–E), Great Lakes (F–U), and southeastern (X–Y). Notably, the Ohio/New (W–X) and North River (Y) populations differ by the greatest FST values. Geneclass2 [59] assignment tests denote high self-assignment among spawning groups and to the three geographic regions, similar to pairwise comparison results (Additional file 7). Hierarchical analysis of molecular variance (AMOVA) [60] also supports division into three geographic regions: northern (A–E, V), Great Lakes (F–U), and southeast (W–Y), which explains the most overall variation, has highest mean FST values, and best characterizes the data. A second well-supported scenario distinguishes walleye from each of the 11 drainage systems, with high mean FST values, but less support. The scenario for potential partitioning between river and lake population groups is unsupported.

Table 2 Genetic F ST divergences between pairs of walleye population samples, including contemporary spawning groups (labeled A–P, S–Y), historic Lake Erie walleye (Q) and “blue pike” (R), from mtDNA control region sequences (below diagonal) and nuclear μsat data (above diagonal)

Barrier v2.2 [26] and Structure v2.3.4 [61] analyses likewise identify significant genetic structuring across the range (Figures 1 and 3, Additional file 8). The first mtDNA barrier separates the southern (W–Y) from northern populations (A–P, S–V), the second isolates the northernmost sites (A–G, V), and the third distinguishes Lake Nipigon (G). In contrast, barriers for μsat loci denote finer-scale discontinuities, with the first separating Lake Nipigon (G, barrier I; 49% bootstrap support, 7/7 loci), then the North River (Y, II; 57%, 7/7), followed by Lac Mistassini (V, III; 43%, 6/7). Remaining barriers for both data sets depict fine-scale divergences, with most among northern spawning groups (Figure 1). Structure analyses indicate high assignment of the northernmost populations (A–G, V, purple color) from those to the south (I–U, W–Y, blue; Figure 3A). Populations in the upper (F–L; orange) and lower (M–U; light blue) Great Lakes assign separately, with St. Louis River (F) and Lake Nipigon (G) walleye clustering more closely to those in the northwest. Finer-scale demarcations by drainage and some spawning groups also are recovered, similar to results from the analyses using pairwise comparisons, Geneclass, and AMOVA.

Figure 3
figure 3

Bayesian Structure analysis [[61]] for A contemporary walleye ( K= 2, 3, 8) and B contemporary Lakes Erie and Ontario and historic Lake Erie walleye (with “blue pike”; K= 2, 3, 6) population groups, based on seven μsat loci. Individuals are represented as thin vertical lines, partitioned into K colored segments.

The combined population genetic distance tree (Additional file 9) also shows separation of the southern samples, with the North River (Y) walleye as basal, matching its high pairwise divergence (Table 2). The tree clusters northern walleye and those from Lake Superior together (sites A–G, V; 62%), similar to Barrier results. Lakes St. Clair (M–N) and Erie populations (O–P) display close genetic relationship, analogous with pairwise comparisons.

Historic vs. contemporary populations of Lake Erie walleye

Historic walleye and “blue pike” samples are indistinguishable in control region sequences, and vary from each other in μsat allele frequencies within the realm among contemporary walleye populations, on the low end among the 23 spawning groups (Table 2, Additional file 6). In contrast, greater values distinguish walleye populations spawning in the New (X) and North Rivers (Y). Thus, there is no genetic evidence that the “blue pike” comprised a distinct taxon from walleye.

Both sets of historic samples differed in haplotypic and allelic frequencies from contemporary Lakes Erie and Ontario walleye in the FST and χ2 analyses. The historic samples significantly varied in genetic composition from all but one contemporary spawning group (Table 2, Additional file 6). The control region frequencies of the Pigeon Lake spawning group (S) alone resemble the historic samples from Lake Erie due to shared high representation of haplotype 3, but differ in the μsat data. This likely is an artifact.

No genetic distinction occurs between turquoise-colored mucus versus yellow walleye from McKim Lake (D, control region: FST = 0.000, χ2 = 0.00, μsat: FST = 0.000, χ2 = 14.05). Together, McKim Lake represents a distinct spawning group, differing from other walleye populations (Table 2, Additional file 6). The historic walleye and “blue pike” samples from Lake Erie were very significantly different from the McKim Lake samples. Thus, both turquoise mucus walleye and “blue pike” are walleye.

AMOVA analyses show no significant difference between the historic (walleye and “blue pike” considered together as a single group) versus the contemporary samples for control region sequences. The historic and contemporary samples differ with μsat data (Table 3) by ~2–5x greater than the mean FST values among contemporary walleye spawning groups. Results thus indicate that μsat allelic frequencies have changed from the past to the present.

Table 3 Relative distribution of genetic variation among contemporary and historic walleye samples using AMOVA, calculated from mtDNA control region sequence and nuclear DNA μsat data

The genetic distance tree of the combined gene data (Additional file 9) and Structure analyses (Figure 3B, Additional file 8) both separate the historic samples into a distinct group. Geneclass assignment tests likewise depict their genetic difference, with 89% of the historic samples self-assigning. Just 10% mis-assign to the contemporary eastern Lake Erie walleye (group P), 6% to historic “blue pike” (R) and 6% to contemporary western Lake Erie walleye (O). Historic “blue pike” individuals self-assigned 48% of the time (N = 12), with 48% mis-assigning to historic walleye. A single “blue pike” individual assigned to contemporary eastern Lake Erie walleye (4%).

Morphological traits of historic samples

Morphological variations occur between walleye and “blue pike” in six characters; “blue pike” individuals tend to have the smallest head and interorbital widths, largest eye diameter, highest numbers of pelvic fin rays, a larger orbit: interorbital ratio, and fewer second dorsal fin rays (Table 4). However, the ranges of all characters extensively overlap and do not reliably distinguish a “blue pike” from a walleye specimen.

Table 4 Morphological data from historic walleye andblue pike” samples, with mean, standard deviation (SD), and range of values

The first four morphometric principal components (PC) explain 86% of the overall variation and the first three meristic PCs explain 79% between historic walleye and “blue pike”. Multivariate analysis of variance (MANOVA) shows significant difference in morphometric (Wilks’ Λ = 0.768, F = 7.244, df = 1, p < 0.001) and meristic principal components analysis (PCA) (Wilks’ Λ = 0.876, F = 4.522, df = 1, p = 0.005; Figure 4). Morphometric PC3 and 4 distinguish walleye and “blue pike” (ANOVA, PC3: F = 10.443, df = 1, p = 0.002, PC4: F = 9.004, df = 1, p = 0.003) and are correlated with body depth (r = 0.850) and head width (r = -0.546). Meristic PC1 and 2 (Figure 4) also differentiate them (ANOVA, PC1: F = 4.323, df = 1, p = 0.040, PC2: F = 5.0832, df = 1, p = 0.026) in number of second (r = -0.827) and anal fin rays (r = -0.578).

Figure 4
figure 4

Principal components analysis plots for historic walleye and “blue pike”, based on A morphometric and B meristic characters. Solid symbols = mean component scores. Circles = 95% confidence intervals for each group.

Quadratic discriminant function analysis (DFA) correctly assigns 72% of samples using the morphometric dataset and just 64% with meristic characters. Historic walleye (76% morphometric, 71% meristic data) and “blue pike” (68%, 57%) more often self-assign, but have high mis-assignment to each other, similar to the Geneclass results. Thus our findings do not support designation of “blue pike” as a subspecies or species, since it lacks reliable morphological characters and has no genetic differentiation, with no unique haplotypes or alleles. Its level of frequency difference in μsat allelic composition appears typical of a walleye population.

Discussion

Broadscale genetic divergence patterns

Walleye populations from the unglaciated southern portion of the range (W–Y) are the most divergent, possessing unique alleles and haplotypes. The earliest and most divergent haplotype (19) occurs in the Ohio and New Rivers (W–X), and is very genetically distinct ([30, 62] this study). Other fishes, including brook trout Salvelinus fontinalis (Mitchill 1814) [63], smallmouth bass Micropterus dolomieu Lacepède 1802 [64], greenside darter Etheostoma blennioides Rafinesque 1819 [65], rainbow darter Etheostoma caeruleum Storer 1845 [66], and yellow perch [67], likewise demonstrate pronounced population divergence of Atlantic coastal groups, indicating long-term isolation.

The North River (Y), which drains south into the Gulf of Mexico, is dominated by an endemic walleye haplotype (20). The population is small, very isolated, and relatively low in diversity, as previously documented [25, 30]. Boschung and Mayden [68] noted that this indigenous North River population is at risk of colonization from northerly walleye due to connections through the TombigbeeTennessee River waterway, which may threaten localized adaptations. Likewise, yellow perch from the Gulf Coastal drainage possessed high endemism, unique haplotypes, and extensive divergence [67]. Southerly populations of walleye and yellow perch thus represent unique genetic sources that may prove valuable for conservation.

Walleye populations exhibit moderate divergence levels across the range, as described by previous studies [30, 41, 69]. In contrast, yellow perch (control region mean FST = 0.469, μsat = 0.236) [67] and smallmouth bass (cytochrome b mean FST = 0.412, μsat = 0.232) [64, 70] possess much higher divergence among spawning groups. This may be due to their more limited migration [71, 72]. Walleye have been documented to disperse 50–300 km [73], followed by yellow perch to 48 km, with occasional individuals travelling 200 km [71], and smallmouth bass only to ~10 km [72].

The European perch P. fluviatilis Linnaeus 1758 discriminates kin from non-kin via olfactory cues and schools in family groups, which may reproduce together [74, 75]. This life history pattern remains to be tested for yellow perch, walleye, or smallmouth bass. Kinship tests by our laboratory reveal high proportions of full siblings in some spawning groups of yellow perch (mean = 0.20 ranging to 75%) [76] and smallmouth bass (0.15, to 67%) [70], which are greater than those identified here for walleye. Limited lifetime migration, and apparent close association among kin for yellow perch and smallmouth bass, may lead to their higher divergences among proximate populations. Walleye populations display intermediate divergence and higher diversity, attributable to more outbreeding. Thus, biogeographic patterns of these widely distributed freshwater fishes largely result from a combination of extrinsic (i.e., changes in climate and drainage patterns) and intrinsic factors (dispersal capabilities, degree of natal homing, population size, and inbreeding), which are regulated by behavior of the species.

Contemporary genetic diversity patterns

Our data indicate relatively high genetic diversity for most walleye spawning groups; these values mirror those reported across their range using nine μsat loci (HO= 0.68) [30]. Difference between the mtDNA and nuclear data is attributable to the former having a slower evolutionary rate [51, 77], ¼ effective the population size, and being more influenced by population bottlenecks [50].

Walleye reproducing in the lower Great Lakes (Lakes St. Clair and Erie) have the highest diversity values, reflecting admixture from glacial refugia and larger population sizes, similar to results from other studies [30, 40]. Some small upper Great Lakes populations display the lowest mtDNA diversities, but average nuclear DNA diversities. Notably, the population from the Moon/Musquash River (site L) has the least control region diversity but higher μsat value. Gatt et al. [78] also recovered low mtDNA diversity at this location, attributing this to stocking and overexploitation. Since this pattern is restricted to mtDNA alone in our study, it likely instead reflects past bottlenecking and small population size.

Yellow perch populations possess much lower mtDNA control region diversity levels across their range (mean HD= 0.31) than nuclear DNA variability (HO= 0.53, 15 μsat loci) [67]. This also is true for the related European perch [79] and Eurasian ruffe Gymnocephalus cernua (Linnaeus 1758) [80]. This difference among percid species may reflect their respective evolutionary history and behavior. Notably, strong association of European perch in kin groups [74, 75] and high proportions of yellow perch full siblings in spawning groups [76] may lead to lower diversity from inbreeding. Smallmouth bass spawning groups also have lower genetic diversity than walleye in mtDNA sequences (mean HD= 0.50) [70] and eight μsat loci (HO= 0.46) [64]. This may reflect higher association of kin groups and limited lifetime migration [70]. Thus, population genetic diversity and divergence of smallmouth bass [64, 70] and yellow perch [76] differ from walleye due to their respective reproductive behavior and life history characters.

Small isolated southern populations of walleye have lower genetic diversity values than those in once glaciated regions. The former contain the oldest and most unique genetic variants, reflecting long-term isolation, smaller population sizes, and likely genetic drift and population bottlenecks [30]. Gulf Coastal yellow perch populations also possess the lowest diversity values, along with unique haplotypes and alleles [67], similar to walleye. These southern populations comprise a historic source of diversity and an important genetic resource for both species. Their adaptations to warmer habitats may provide a critical genetic reservoir in the face of climate warming.

Genetic patterns shaped by anthropogenic factors in Lake Erie

Genetic diversity values for mtDNA and μsat loci are 94% and 37% lower for historic Lake Erie walleye and “blue pike” from 1923–49 compared with contemporary Lakes Erie and Ontario populations. Genetic diversity appears to have increased over the past 70+ years, which may reflect population recovery.

Europeans settled Lake Erie shores during the 1700–1800s, cutting down the forests and draining the marshlands, which disappeared by 1900 [44, 81]. As the region developed and industrialized, untreated wastes were released into the Lake from saw mills, slaughterhouses, and steel factories. By 1830, Lake Erie walleye comprised an abundant and important commercial fishery. In 1874, construction of a major shipping channel drastically modified the Lakes Huron–Erie Corridor connection (Figure 1). Lake Erie steadily lost much of its fish habitat from 1900–1970s due to draining of wetlands, armoring of shorelines, channelization, dredging, and increased industrialization [82, 83]. During the 1960s, high levels of phosphorus caused massive algal blooms, accompanied by oxygen depletion and marked fish die offs [82].

Many native Lake Erie fish populations experienced steady declines from 1900–1970s, including the lake trout, lake sturgeon Acipenser fulvescens Rafinesque 1817, and walleye [81, 84]. Industrial outputs resulted in heavy metal contamination and declining fish health, manifested with neoplasms, tumors, and lesions on walleye and other species [85]. In 1970, walleye fisheries from Lakes Huron through Erie were closed due to high mercury levels. By 1978, fisheries managers declared Lake Erie walleye as being in crisis from overfishing and pollution [82].

Lower allelic numbers and diversity for historic walleye and “blue pike” from 1923–49 likely reflect these population declines. The rare haplotypes and alleles we identify in historic walleye may have disappeared. Most haplotypes and almost all alleles in historic Lake Erie samples are common and widespread today. The historic samples appeared to possess more haplotypes that trace to the Atlantic Refugium, whereas those from the Mississippian Refugium dominate today’s walleye populations [21, 25, 41]. This change merits further testing, but may be a response to climate warming.

Genetic diversity of other Great Lakes fishes similarly was lower in 1927–59 and higher in 1995–2005, attributed to population declines from environmental conditions and overexploitation [86, 87]. Notably, lake whitefish Coregonus clupeaformis (Mitchill 1818) from Lakes Huron and Erie in 1927 had lower diversity at seven μsat loci (0.60) than in 1997–2005 (0.65) [87]. Lake trout from Lakes Superior, Michigan, and Huron were less variable in 1940–59 (0.47) than in 1995–99 (0.51) using five μsat loci [86]. Similarly, Atlantic cod Gadus morhua Linnaeus 1758 declined in diversity and number of alleles from 1954–80s, then increased from 1980–98 according to archived otolith samples and three μsat loci [88]. That study moreover documented that genetic composition changed, similar to the pattern here for Lake Erie walleye, hypothesizing recovered diversity via immigration from a nearby spawning group.

Contemporary Lake Erie walleye populations also may have been influenced by migration and recruitment. Walleye is described to natally home, with chemical cues presumably facilitating recognition of reproductive grounds [89, 90]. Olson and Scidmore [89] discerned lower homing in areas with high habitat degradation. Some contemporary western Lake Erie spawning group samples appear genetically similar to those from Lake St. Clair, suggesting possible genetic exchange. Walleye movement between these lakes during non-spawning times is well known from tagging [91, 92] and Lake Erie walleye may have migrated to spawn at a recently augmented reef in the Detroit River [53].

Lake Erie walleye appear to have recovered from declines in diversity and numbers (~10 million in 1978), after the fishery's closure from 1970–76 [93], implementation of the 1970 Canada Water Act, the 1972 U.S. Clean Water Act, and the 1972 Canada-U.S. Great Lakes Water Quality Agreement [82]. Increasing water temperature –especially in the shallow western basin– changed the Lake Erie fish community from cold water (e.g., lake trout) to warmer water species, favoring walleye and yellow perch. Declining numbers of colder water competitors presumably enhanced walleye abundance [84]. By 1988, Lake Erie had rebounded to ~80 million (8x 1978), decreasing to ~18 million walleye in 2013 [94]. Genetic analyses [38] revealed temporal consistency in genetic diversity from 1995–2008 for three primary Lake Erie walleye spawning groups (Maumee and Sandusky Rivers in the west, and Van Buren Bay in the east). Whether this continues remains to be discerned, as genetic variability may decline with population sizes.

Other walleye populations have shown a mixture of temporal stability and decline. Walleye spawning in Escanaba Lake, Wisconsin displayed consistent diversity levels from 1952–2002 (mean HO = 0.76) using eight μsat loci [95] (six of those here). Garner et al. [96] likewise described consistent diversity levels for walleye in Lake Superior’s Black Bay from 1966–2010 (mean HO = 0.62) employing nine μsat loci (six of those here). However, the Escanaba Lake and Black Bay populations were stocked, likely circumventing fluctuations. MtDNA restriction fragment length polymorphism diversity of walleye spawning in Lake Huron’s Georgian Bay declined over three decades (0.50 in the 1960s to 0.15 in the 1990s), attributed to exploitation and stocking [78]. Our study recovers similarly low mtDNA control region variability for walleye spawning in the Moon/Musquash Rivers of Georgian Bay today. However, this bottleneck effect is restricted to mtDNA, since we denote average levels of nuclear DNA variability.

Other alternatives may explain lower diversity levels in the historic samples. Our contemporary samples were adults collected from spring spawning runs at specific spawning sites. In contrast, historic samples were collected from July–November, when walleye intermix. Thus genetic diversity may have been lowered due to population admixture via a Wahlund effect [97]. For historically archived samples, such as ours, Nielsen and Hansen [98] recommended including positive and negative controls, having a separate laboratory space and separate chemicals, testing for null alleles with Micro-Checker, using samples with complete documentation of biological information, testing for HWE, and applying more than one statistical test to validate patterns. We followed all of these precautions to ensure reliability of data from our formalin fixed historic samples. Some studies have documented issues with historic samples having biased amplification of shorter length alleles [99, 100]. We found slight suggestion of null alleles in historic samples, with shorter allele lengths being more prevalent. Lower template quality may have resulted from DNA shearing with formalin fixation [101], leading to partial repeat amplification if primer sites were unavailable for binding [102]. However, our Micro-Checker tests and other analyses demonstrate lack of statistical support for such problems. Our mtDNA sequences reveal the same pattern as the μsat analyses. Additional analysis of historic walleye and “blue pike” samples from intermediate decades may help to further interpret temporal population genetic patterns.

Taxonomic status of historic “blue pike” and turquoise-mucus variants

We discern that the historic “blue pike” appears genetically indistinguishable from walleye populations. It has no unique genetic variation in our database, rendering S. vitreus “glaucus” invalid. It fails to meet the criteria of the Evolutionary Species Concept (ESC) [103] or the Phylogenetic Species Concept (PSC) [104]. The “blue pike” is not “an entity that kept its identity from others over time and space and that had its own independent evolutionary fate and historical tendencies” and possesses no bootstrap or posterior probability support, lacks reciprocal monophyly, diagnosable synapomorphies, and demarcation from the walleye as required by the ESC and PSC [103, 104]. It also does not show interspecific variation 10x greater than the mean intraspecific variation of walleye [105]. In fact, the “blue pike” has no mtDNA sequence differentiation and its μsat variation is identical to that among typical walleye spawning groups and populations. In contrast, many walleye populations across North America possess much more pronounced genetic variation, particularly from the New (X) and North Rivers (Y). Walleye spawning in those southerly locations meet more of the criteria of being distinct taxa. However, we regard those in the New and North Rivers as divergent populations of walleye, and not as separate taxa, and believe that most ichthyologists and systematists would concur. Our findings thus indicate that the “blue pike” does not constitute a separate genetic taxon from walleye, and does not merit species or subspecies recognition.

Slight morphological variations between historic walleye and “blue pike” samples suggest some possible population-level differences. However, single individuals of the historic “blue pike” and walleye cannot be identified morphologically or genetically as either “blue pike” or walleye. Their coloration also is not a reliable identification character as it was/is very variable among historic as well as contemporary walleye [44, 48]. Ichthyologists from the era of the “blue pike” reported a large numbers of intergrades in color, as well as among all morphological traits [44, 48, 49].

Some fishes, including lake trout, whitefish Coregonus spp., and Arctic char Salvelinus alpinus (Linnaeus 1758), have been regarded as multiple morphological races that developed through adaptation to northern proglacial lakes [106], but possess low genetic divergence [20]. The “blue pike” was reported to inhabit deeper waters, have slower growth [107], and a larger eye [42,this study] compared to walleye. Slower growth likewise characterizes walleye in eastern Lake Erie today [108]. We find that although “blue pike” and walleye display some slight morphological variation, this is rather negligible, and unaccompanied by population genetic distinction, rendering its subspecies status invalid. “Blue pike” were walleye, and fell within the normal range of walleye population variation.

The turquoise-colored mucus walleye from McKim Lake (site D) do not genetically differ from co-occurring yellow walleye. Stepien and Faber [41] likewise analyzed several assorted turquoise mucus walleye from a variety of Canadian Shield lakes using entire mtDNA control region sequences and found no genetic distinction from the normal variation of walleye. Paradis and Magnan [109] morphologically compared sympatric yellow and turquoise mucus walleye in five Canadian Shield lakes near Quebec, reporting longer head lengths and smaller interorbital distances in some of the latter. However, early fishery biologists found that these turquoise mucus walleye did not possess the morphological characteristics of “blue pike” [49]. Laporte et al. [110] alleged slight genetic difference between turquoise mucus and yellow walleye populations sampled within a lake using amplified fragment length polymorphism markers and assignment tests, but lacked diagnostic alleles and their genetic distance analyses showed no significant bootstrap support. It may be that there are some population-level variants within some lakes across the range of the Canadian Shield; many such distinctions among walleye spawning groups are found in the present study and others [30, 39], but these do not warrant taxonomic recognition.

Occasional steel-blue colored walleye regularly are reported from Lake Erie and other waters, including the Ohio River drainage [44, 48]. Yellow perch that are dark blue in color also co-occur [44]. We analyzed mtDNA control region from a steel-grey/blue walleye individual sampled in western Lake Erie near Sandusky OH and found it had mtDNA haplotype 1, the most common walleye haplotype. A skin scraping revealed no turquoise mucus. Wayne Schaeffer (pers. comm., University of Wisconsin, September 2013) also found no turquoise mucus or sandercyanin in Lake Erie walleye using the methods of Yu et al. [47]. Overall, no diagnosable genetic or morphological characters have been found that distinguish historic “blue pike” from walleye, rendering its subspecies status invalid.

Effects of climate change on walleye populations

Global temperatures are predicted to increase over the next 50 years, with the Great Lakes region rising by 5–5.5°C, becoming more like today’s Gulf Coast [111]. Today, Lake Erie houses the largest walleye abundance [45] and greatest genetic diversity. Increased temperatures are predicted to shift walleye distribution northward [23]. Fringe populations may experience declines and increased isolation, with bottlenecks and drift reducing genetic variation, accompanied by loss of unique haplotypes and alleles. Hence, valuable genetic resources may disappear as unconnected populations become sequestered.

High connectivity in Great Lakes’ watersheds allows ample dispersal opportunities, which may homogenize gene pools of distinctive spawning groups as they move northward and mix, producing a Wahlund effect. Thus, climate change may lead to decline of divergence patterns from today’s walleye spawning groups. Walleye likely will remain abundant and adapt, but unique variants may be lost. Common alleles may increase in frequency, raising concerns for retaining adaptive potential, which should become a management priority. It might be possible to utilize unique warm-adapted variants to the south (e.g., the North River, Y) and southeast (Ohio/New Rivers, W–X) to aid future walleye populations.

Conclusions

This study increases understanding of historic and contemporary walleye genetic diversity and divergence patterns via a genetic window approach. Population patterns have been shaped by climate change and drainage connections, with northern ones tracing to post-glacial recolonization. Genetic diversity and abundance of Great Lakes' walleye appear to have rebounded after drastic habitat changes and industrialization of the early 1900s through the 1970s. The next step is to identify adaptations that underlie these genetic diversity and divergence patterns, via genomics [112, 113]. Such applied knowledge will aid efforts to sustain natural populations in the face of ongoing climate change and new anthropogenic stressors. Our goal is that the present results will aid future walleye diversity and adaptedness.

Methods

Sample collection for genetic analysis

Fin clips of 1125 contemporary spawning adult walleye were sampled at 23 spawning locations (labeled A–P, S–Y) across the native range during the spring spawning runs (Figure 1, Table 1, Additional file 10), avoiding sites with documented anthropogenic introductions. Samples were collected directly by federal, state, and provincial agencies according to their regulations and permits, and by Stepien’s Great Lakes Genetics/Genomics Laboratory (University of Toledo’s Lake Erie Center, OH) under Ohio Division of Wildlife permit #140160 (issued to CAS). Samples were labeled, immediately placed in 95% ethanol, and archived at the Great Lakes Genetics/Genomics Laboratory. Genetic comparisons were made to historic formalin-preserved museum specimens of Lake Erie walleye (1923–49, site Q, N = 20 for mtDNA, N = 31 for μsat) and historic “blue pike” paratype individuals (1923, R, N = 20, 25). The latter had been identified by Hubbs [42] and are housed at University of Michigan’s Museum of Zoology, while some of the walleye came from Ohio State University's Museum of Biological Diversity (see Additional file 11 for list of individuals used). We also evaluated contemporary yellow- (N = 10) versus turquoise-colored mucus (N = 10) walleye sympatric in McKim Lake, Ontario, Canada (site D, Figure 1).

DNA extraction

DNA was extracted following standard protocols from previous studies [38, 53]. To circumvent contamination, museum sample extractions were conducted in a separate clean laboratory using separate autoclaved materials. Gloves were changed at all stages and between samples. Pre-extraction, formalin fixed tissues and controls were soaked in 10 mL phosphate buffered saline on an orbital shaker for 48 hours, changing buffer after 24 hours. Historic walleye and “blue pike” samples were extracted at separate times, with negative (no tissue) and positive controls (Lake Erie Maumee River walleye #AZE31); the latter were handled after all solutions were added and tubes were closed.

Mitochondrial DNA sequence data collection and analyses

MtDNA control region sequence data (733 bp) were collected from a representative subset from each of the 23 contemporary walleye spawning groups (N = 711, 11–25 per site; Table 1) used for the nuclear μsat loci. Methods for amplification and sequencing of the 733 bp control region segment followed Haponski and Stepien [53] using the primers LW1-F [78] and HN20 [114]. We additionally sequenced formalin fixed historic Lake Erie walleye (N = 20) and “blue pike” (N = 20). Since formalin fixation often shears DNA into smaller fragments [101], we designed additional internal primers to amplify the control region in overlapping segments: LW1-F [78] and SandercrIR (5’ - CATTCATACTATTTTCTTGC - 3’), SandercrIF (5’ - AGTACATACTCTGTTACC - 3’) with HN20 [114], SandercrIF with SvicrIR2 (5’ - GTGATTTCCACTATTTATGC - 3’), and SvicrIF (5’ - GCAAGAAAATAGTATGAATG - 3’) with HN20. Walleye haplotypes were referenced to Stepien and Faber [41], who sequenced the entire mtDNA control region (~1,086 base pairs (bp)). These were trimmed to 733 bp, omitting the 5’ repeat section, thereby retaining seven of their 14 haplotypes (our haplotypes 1–7 [GenBank:U90617, JX442946–52]).

Haplotype relationships were analyzed with evolutionary trees [50] via maximum likelihood in PhyML v3.0 [115] and Bayesian analyses in MrBayes v3.2.1 [116]. Corrected Akaike information criteria from jModeltest v2[117] selected the TPM3uf model [118], including invariant sites (I = 0.3910) and the gamma distribution (α = 0.2750). Trees were rooted to the sauger S. canadensis (Griffith and Smith 1834), which is the sister species of walleye [52]. A statistical parsimony haplotype network was constructed with TCS v1.21[54]. Analyses in common between the mtDNA and nuclear μsat data sets are below in “Population genetic data analyses”.

Nuclear microsatellite data collection and analyses

We evaluated population genetic structure among 23 contemporary walleye spawning groups (1125 individuals; Table 1, Additional file 10), adding 259 individuals and six spawning sites to the 866 individuals and 17 locations previously analyzed for nine μsat loci (Svi2, 4, 6, 7, 17, 18, 33, L6, and L7) in our laboratory [30, 38] (Additional file 11). Data were compared with museum specimens sampled from Lake Erie, including 31 walleye (1923–1949) and 25 “blue pike” (1923) (Additional file 11).

PCR amplifications followed previous studies [38, 53], including a positive control and a negative control. Loci were run individually for the formalin material. Svi17 and L7 failed to amplify in historic samples and thus were dropped, leaving seven for analyses. Allele scores were manually verified [38, 53].

Analyses followed Stepien et al. [30] and Haponski and Stepien [53]. Conformance to HWE expectations and linkage disequilibrium was tested in Genepop v4.0 [119] and significance levels adjusted with sequential Bonferroni correction [120]. Loci were tested for possible selection using FST outlier comparisons in Lositan [56] and examined for null alleles, scoring errors, or large allele dropout with Micro-Checker v2.2.3 [57]. If null alleles were indicated, their frequencies per population and locus were estimated and potential influence on FST values evaluated in FreeNA [58]. Number of alleles (NA), inbreeding (FIS), overall genetic deviation (FIT), and divergences (FST) were calculated across loci and samples with Fstat v2.9.3.2 [121, 122].

Structure v2.3.4 [61] was employed to evaluate hypotheses of differentiation among population groups, for contemporary spawning groups from K = 1 (null hypothesis of panmixia) to K = 29 (sites A–P, S–Y) and contemporary versus historic walleye from K = 1 to K = 11 (nine contemporary Lakes Erie (O–P) and Ontario (S–U) spawning groups, historic walleye (Q), and “blue pike” (R)), with 250000 burn-in replicates followed by 1000000 generations. Optimal K was selected according to the Evanno et al. [123] method. Geneclass2 [59] assigned individuals to putative populations of origin for contemporary walleye, historic walleye, and “blue pike”, using 1000000 simulations. Colony v2.0.5.0 [55] was employed to test for possible kin relationships (full siblings) in samples.

Population genetic data analyses with both data sets

Genetic diversity comparisons included haplotype diversity (HD) and number of haplotypes (NH) for sequence data in Arlequin v3.5.1.3 [124] and those for nuclear μsat loci were observed (HO) and expected (HE) heterozygosities in Genepop, and FIS, NA, and allelic richness (AR) in Fstat. Significant differences were determined with ANOVA in R v2.15.2 [125], followed by Tukey’s post hoc tests [126]. Convert v1.31 [127] calculated number and proportion of private haplotypes (NPH) and alleles (NPA), i.e., those unique to a sample.

Patterns of genetic divergence were evaluated using unbiased FST estimates [128] in Fstat and pairwise exact tests of differentiation (χ2) [129] in Genepop. Genetic divergence results were used to identify true populations or taxa, i.e., those with significantly divergent gene pools, to address the question of the identity of the “blue pike” and other possibly distinctive groups. We also tested genetic isolation by geographic distance (nearest waterways) in Genepop and identified genetic discontinuities across the range with Barrier v2.2 [26]. Population relationships additionally were analyzed with neighbor-joining trees [130] of Nei’s [131]D genetic distances in Phylip v3.68 [132] and 2000 bootstrap pseudo-replications [133]. The tree was rooted to sauger. Hierarchical partitioning of genetic variation was evaluated with AMOVA [60] in Arlequin, including: (a) between historic and modern samples, (b) among drainages, and (c) between lake and river spawners.

Morphological comparisons

Morphological and meristic data were collected and compared between preserved historic Lake Erie walleye (N = 51 individuals) and “blue pike” samples (N = 52, including 30 paratypes; all individuals are listed in Additional file 11). We compared nine morphometric measurements and five meristic characters from Hubbs and Lagler [45], using identical standard protocols taken by a single investigator (AEH). Measurements (to 0.1 mm) included: standard, head, cheek, upper and lower jaw lengths, body depth, head and interorbital widths, and orbit diameter, with Mitutoyo vernier calipers (Aurora, IL). Measurements were taken to the bone of individual fish, to rule out influence of preservation technique. Meristic characters included: numbers of anal, first and second dorsal, pectoral, and pelvic fin rays, with a Leica Microsystems dissecting microscope (Buffalo Grove, IL). We also analyzed orbit: interorbital and upper: lower jaw ratios, reported by Trautman [44] to vary between walleye and “blue pike”.

Morphometric measurements first were standardized by removing size-dependent variation per Elliott et al. [134], and then log-transformed. Meristic data were not transformed; therefore morphometric and meristic data sets were analyzed separately, encompassing univariate and multivariate analyses in R. Each character was evaluated for departure from univariate normality with Shapiro-Francia tests [135]. Standard length, body depth, and orbit: interorbital ratio were the sole normally distributed characters. Their means were compared with Student’s t-tests, whereas Mann–Whitney U tests evaluated those that were not normally-distributed [126]. Nonparametric MANOVA [136] compared the overall difference between historic walleye and “blue pike”. Their overall morphological variation also was explored with PCA, using the covariance matrix. MANOVA and Wilks' lambda [126] tests evaluated among PCs, followed by individual ANOVAs.

DFA [137] compared historic walleye and “blue pike” in SAS v9.2 (SAS Institute Inc., Cary, NC). χ2 evaluated use of linear or quadratic DFA for significant covariance matrix heterogeneity, rejecting linear DFA (morphometric: χ2 = 103.216, df = 45, p < 0.001; meristic: χ2 = 32.253, df = 15, p = 0.006). Thus, quadratic DFA was used, which based each character on its own variance-covariance matrix, with leave-one-out jackknife re-sampling examining taxon assignment [138].

Data set availability

The data sets supporting the results of this article are available in the Dryad digital repository [http://doi.org/10.5061/dryad.nk470] [139] and GenBank:U90617, JX442946–56, KC819843–54, and KF954732–35.

Authors’ information

This work formed part of the Ph.D. dissertation work of AEH in the laboratory of CAS, who now is a postdoctoral associate in the laboratory. CAS has worked on walleye genetic patterns for over two decades, and her Great Lakes Genetics/Genomics laboratory focuses on invasive species, as well as native fishes.