Introduction

Conservation of migratory salmonids (genera Oncorhynchus, Salmo, and Salvelinus) is complex due to the wide range of habitat types (e.g. marine, freshwater, fluvial, lacustrine) occupied by a diversity of life-history forms (Klemetsen et al., 2003; Schindler et al., 2003). A driving behavior shared by these taxa is homing, where adults produced in a stream tend to return there to spawn (Ueda et al., 1998; Keefer & Caudill, 2014), resulting in reproductive isolation of populations (metapopulations) with differential productivity (Horrall, 1981; Stewart et al., 2003). Variation in productivity may be caused by local genetic adaptation to different physical conditions (e.g. flows, temperatures or spawning substrates) or biotic factors (e.g. competitors or predators) at spawning streams distributed across large spatial areas (Quinn, 2018). Assessing the relative contributions of natal tributaries to overall recruitment is vital for prioritizing protection and restoration of critical spawning and rearing habitats (Brennan et al., 2015; Brennan & Schindler, 2017; Phillis et al., 2018). Cutthroat trout [Oncorhynchus clarkii (Richardson, 1836)] subspecies of the western United States are among the salmonids with migratory life histories (Behnke, 2002). The Yellowstone cutthroat trout [O. clarkii subsp. bouvieri (Jordan & Gilbert, 1883)] in Yellowstone Lake, northwestern Wyoming, is a large genetically-pure population (al-Chokhachy et al., 2018) that exhibits a lacustrine-adfluvial life-history strategy. These Yellowstone cutthroat trout, hereafter cutthroat trout, make spawning migrations during spring into > 60 small tributaries around the lake (Gresswell et al., 1994; Varley & Schullery, 1998), move downstream to spawn in the Yellowstone River outlet (Kaeding & Boltz, 2001), or make long-distance spawning migrations upstream into the remote headwaters of the upper Yellowstone River inlet and its numerous large tributaries in the Thorofare region of the Bridger-Teton Wilderness (Ertel et al., 2017). Like other salmonids they show high spawning fidelity to natal streams (Ball, 1955; Gresswell et al., 1994). Adults of this population are iteroparous and commonly skip-spawn (Cope, 1956, 1957; Jones et al., 1986; Ertel et al., 2017). After a brief spawning period (few weeks) in tributaries, adults return to the relatively stable environment of Yellowstone Lake where they spend most of the year. After emergence, the fry produced in small tributaries generally emigrate to the lake prior to winter (Varley & Gresswell, 1988). In large tributaries (e.g., Pelican Creek) or streams in the remote upper Yellowstone River watershed (Fig. 1), fry may overwinter and return to the lake later as juveniles (Varley & Gresswell, 1988).

Fig. 1
figure 1

Map of study area and underlying geology in Yellowstone National Park, Wyoming. Symbols are the types of samples collected and the stream sample locations numbered: 1. Arnica Creek, 2. Beaverdam Creek, 3. Big Thumb Creek, 4. Bridge Creek, 5. Cabin Creek, 6. Chipmunk Creek, 7. Clear Creek. 8. Columbine Creek, 9. Cub Creek, 10. Flat Mountain Creek, 11. Grouse Creek, 12. Little Arnica Creek, 13. Little Thumb Creek, 14, 15. Thorofare Creek, 16. Mountain Creek, 17. Pelican Creek, 18. Sewer Creek, 19, 25. Upper Yellowstone River, 20. Solution Creek, 21. Trail Creek, 22. Creek 1138, 26, 27. Lower Yellowstone River

Within Yellowstone National Park, cutthroat trout are considered a keystone species with at least 4 mammal and 16 bird species depending upon them as prey (Bergum et al., 2017). Cutthroat trout abundance in Yellowstone Lake has varied among years but declined over several decades (1980s–2000s; Koel et al., 2005). The decline was largely caused by the unintentional introduction of nonnative lake trout [Salvelinus namaycush (Walbaum, 1792)] (Kaeding et al., 1996; Ruzycki et al., 2003; Koel et al., 2020a) which added a novel 4th piscivorous trophic level to the aquatic food web (Tronstad et al., 2010). Cutthroat trout loss was also caused by the introduction of [Myxobolus cerebralis (Hofer, 1903)] the causative agent of whirling disease (Koel et al., 2006), and persistent drought conditions which isolated some streams (Koel et al., 2005; Kaeding, 2020). Cascading interactions throughout the aquatic and associated terrestrial ecosystems of Yellowstone Lake were driven by the cutthroat trout decline (Tronstad et al., 2010; Koel et al., 2019a). Plankton assemblages within the lake were altered, and nutrient transport to tributary streams was reduced (Tronstad et al., 2015). Cutthroat trout once composed a large portion of the spring diet of many grizzly bears [Ursus arctos (Linnaeus, 1758)] and black bears [U. americanus (Pallas, 1780)], however, in recent years fewer bears have been observed feeding on spawning fish (Middleton et al., 2013). Similarly, once-abundant osprey [Pandion haliaetus (Linnaeus, 1758)] nesting near the lake were displaced, and bald eagles [Haliaeetus leucocephalus (Linnaeus 1766)] shifted their diet to compensate for the cutthroat trout loss (Koel et al., 2019a).

To mitigate for threats to cutthroat trout, Yellowstone National Park implemented a large-scale lake trout suppression program that over 25 years drove the invasive population into decline (Koel et al., 2020b). Assessment of fish populations and monitoring for persistent effects of drought and whirling disease in spawning streams also routinely occurs. Prevalence and severity of whirling disease has stabilized and remains high only in Pelican Creek and the Yellowstone River downstream of Yellowstone Lake, two of the largest spawning streams (Koel et al., 2019b). Previously, counts of adult spawning cutthroat trout in tributaries have been used as a metric for assessing the status of the population each year to determine which spawning streams are most important to recruitment, and how recruitment may change over time (Koel et al., 2020b). However, these counts were largely confined to small tributaries on the western side of Yellowstone Lake where fish could be easily observed and counted, while many important spawning streams in remote locations (Reinhart & Mattson, 1990; Ertel et al., 2017) were not routinely monitored due to inaccessibility. Thus, although juvenile cutthroat trout recruitment to Yellowstone Lake has improved in recent years (Koel et al., 2019a, 2020b), little is known about which tributaries are most strongly contributing to this recruitment or how these recruitment patterns may have changed over time.

We hypothesized that spawning tributaries located in lake regions with the highest lake trout densities (streams flowing into West Thumb, Flat Mountain Arm, and South Arm; Fig. 1; Bigelow et al., 2017), streams with the highest whirling disease severity (Koel et al., 2019b), and the streams that are most susceptible to drought (Koel et al., 2005), may have reduced recruitment of juvenile cutthroat trout. The majority of lake trout netted from Yellowstone Lake have been netted in the West Thumb and western side of the main basin of the lake (Bigelow et al., 2017), thus it appears that lake trout densities tend to be higher in some lake regions than others. As a result of the uneven distribution of lake trout throughout the lake, the effects of their predation on cutthroat trout may be more pronounced in relation to spawning streams flowing into portions of the lake with higher lake trout densities. Additionally, the effects of drought and whirling disease is not uniform among spawning tributaries or within Yellowstone Lake (Koel et al., 2006, 2019b), therefore, we suspect that recruitment patterns among tributaries may have shifted considerably during the past 25 years as a result of large and cumulative stressors on this population.

One method for reconstructing the lives of migratory salmonids is by using natural isotopic and trace elemental variation (Martin et al., 2013; Zimmerman et al., 2013; Phillis et al., 2018). The varying chemical composition of spawning tributaries, reflective of differences in underlying geology (Kennedy et al., 2000; Wells et al., 2003; Brennan & Schindler, 2017), impart unique chemical signatures to otoliths during early rearing, allowing for retrospective identification of natal origins and recruitment by matching water and otolith composition (Campana, 1999; Muhlfeld et al., 2012; Brennan et al., 2015; Brennan & Schindler, 2017).

However, low variation in water chemistry in some environments can limit the potential success of this methodology for assessing natal origin and movement patterns (Pangle et al., 2010; Stewart et al., 2021). In this study, our aim was to use a combination of Sr isotopes (87Sr:86Sr) and elemental ratios (Sr:Ca, Ba:Ca, Mg:Ca, Mn:Ca) in otoliths to assess potential shifts in cutthroat trout recruitment among spawning streams in the Yellowstone Lake watershed in response to significant environmental and biotic changes. The spawning stream contributions were compared for two time periods: (1) a ‘pre’ period (in 1997), prior to the significant negative effects of drought, whirling disease, and the full manifestation of lake trout population expansion and high predation pressure on cutthroat trout, and (2) and a ‘post’ period, 16 years later (in 2013), which reflected very low cutthroat trout abundance following the invasion of lake trout, drought, and whirling disease (Koel et al., 2020b). The analysis of microchemical records in otoliths offers a unique approach for assessing fish recruitment patterns over time in response to harvest, climate change, and other environmental stressors (Miller et al., 2011; Brennan et al., 2015, 2016, 2019a, 2019b; Walsworth et al. 2020).

Methods

Study area

The study area encompassed the Yellowstone Lake drainage upstream of the Upper Falls on the Yellowstone River (Figure 1). Yellowstone Lake is the largest high elevation lake (2376 m) in North America (352 km2; Kaplinski, 1991). The drainage has an area of 2616 km2 (Benson, 1961) with 124 tributaries, 68 of which have documented spawning by cutthroat trout (Jones et al. 1986). Underlying bedrock in the drainage is primarily Tertiary andesite and Quaternary rhyolite, and Tertiary sedimentary deposits are present in several tributary drainages in the southern portion of the basin (USGS 1972). At lower elevations, post-glacial deltaic deposits are common, likely as a result of historically higher lake levels (Richmond, 1976).

Two species of fish are native in the Yellowstone Lake drainage, the longnose dace [Rhinichthys cataractae (Valenciennes, 1842)] and cutthroat trout. Besides lake trout, there were four other non-native fishes introduced to Yellowstone Lake, presumably by anglers as baitfish, including redside shiner [Richardsonius balteatus (Richardson, 1836)], lake chub [Couesius plumbeus (Agassiz, 1850)], longnose sucker [Catostomus catostomus (Forster, 1773)] (Varley and Schullery 1998), and cisco [Coregonus artedi (Lesueur, 1818)] (Koel et al. 2020b). Although rarely studied, there is no evidence these fishes influenced recruitment or otherwise negatively affected the native cutthroat trout (Brown & Graham, 1954; Gresswell & Varley, 1988; Furey et al., 2020).

Water chemistry

Water chemistry used to identify natal origins was determined for 22 different tributaries. Study streams were selected by two criteria. First, we included streams known to be important for spawning based on observations of greater than 100 spawning trout observed (Reinhart, 1990; Haroldson et al., 2005; Ertel et al., 2017). Second, to ensure adequate spatial representation of tributaries likely differing in chemical composition, we also added tributaries that represented the range of underlying geology present, which has been shown to be important predictor of isotopic and elemental composition of surface waters (Hegg et al., 2013; Garcez et al., 2014; Humston et al., 2017).

Water samples were collected during summer base flow conditions in 2013 (August and September), which coincided with the post-emergence period of juvenile rearing prior to outmigration to the lake. Due to the large size of three tributaries, two water samples were collected at different locations at these sites (Fig. 1). To test for annual variation in water chemistry, a subset of 5 tributary streams were resampled in 2014. At each stream, 50 ml water samples were collected in the thalweg by inserting a sterile 50 ml syringe vertically into the upstream water column. Samples were filtered with a sterile 0.2 µm Whatman filter and then stored in acid-washed polyethylene bottles containing two drops of nitric acid.

Water samples were analyzed at Woods Hole Oceanographic Institution (WHOI) using solution-based inductively coupled plasma mass spectrometry (ICPMS) for the following isotopic and elemental ratios: 87Sr:86Sr, Sr:Ca, Ba:Ca, Mg:Ca, and Mn:Ca following the methods of Muhlfeld et al. (2012). Isotopic and elemental ratios were measured separately. For 87Sr:86Sr ratios, a portion of each water sample was evaporated and dissolved in 50% HNO3 then eluted through a Sr-specific cation exchange resin to separate out Sr. The remaining filtrate was evaporated and re-dissolved in 1 ml of 5% HNO3 and then analyzed with a Thermo Finnigan Neptune multiple collector ICPMS (Muhlfeld et al., 2012). Strontium isotope ratios were calculated by correcting for interferences of 87Rb on 87Sr and 86Kr on 86Sr using methods described by Jackson & Hart (2006). Lastly, all Sr isotope ratios were normalized to the NIST SRM987 standard. For elemental ratios, water samples were first diluted 10-fold using a 2% HNO3 solution prior to ICPMS measurement of 48Ca, 88Sr, 138Ba, 25Mg and 55Mn. Liquid standards and instrument blanks of 2% HNO3 were run every six samples. Instrument mass bias was corrected using certified values from a river water standard (SLRS-4, NRC), and an internal laboratory river water standard was used to assess measurement precision. External precision (relative standard deviation) of element:Ca ratios for the laboratory standard (n = 20) ranged between 1 and 6%.

Otolith sampling

We used otoliths from known-origin fish to correlate natal stream water chemistry to chemical composition of cutthroat trout otoliths following the methods of Muhlfeld et al. (2012) and Olley et al. (2011). Cutthroat trout fry (~1–2 months post emergence, n = 5–12 per site) were collected by dipnet from a subset of 7 natal streams (Beaverdam Creek, Columbine Creek, Cub Creek, Grouse Creek, Little Thumb Creek, Sewer Creek, and Creek 1138 during base flow conditions in August and September 2013 coincident with water sampling (Table 2). Sample sites were selected based on the underlying geology to represent the range of variation in chemical signatures among streams. Otoliths were removed under a dissecting microscope using nonmetallic forceps and processed and stored using the methods described below.

Natal recruitment during the two study periods was assessed using cutthroat trout captured during annual lakewide gillnetting assessments conducted in 1997 and 2013 (Kaeding & Koel 2011; Syslo et al., 2014). During both assessments, gillnets were distributed in the littoral zone around Yellowstone Lake to ensure sampling effort occurred throughout shallow waters of the lake. Gill nets were multi- mesh (19–51 mm bar measure), and were 38 m long by 1.5 m deep in 1997 and 90-m long by 2.4 m deep in 2013. Although the number of sampling sites differed, 11 sites in 1997 compared to 24 sites in 2013, many of the sites sampled in 2013 were in the same locations as those sampled in 1997. In addition, we know from telemetry studies that adult cutthroat trout move extensively throughout the lake (Ertel et al., 2017), ensuring that fish sampled during both years were from the same sampling population.

Sagittal otoliths from fish collected in 2013 gillnet sampling were removed using non-metallic forceps, and scrubbed, triple-rinsed, and soaked in Milli-Q water overnight prior to air-drying and storage in acid washed microcentrifuge vials. Otoliths from the 1997 sampling had been stored dry in envelopes but the instruments used in their removal were unknown; however, the processing we performed likely removed possible contaminants. Because there was a limited selection of otoliths collected in 1997 (n = 107), all were used for microchemical analysis. For the 2013 sampling, a subset of otoliths was selected for microchemical analysis using a stratified random design. One hundred otoliths were randomly selected for each of three length classes (< 200, 200–400, and > 400 mm) to ensure sampling fish across all age and size classes. For both sampling years, one otolith of each pair was randomly selected for further analysis. Some otoliths were damaged during processing and were removed from analysis; final sample sizes were n = 89 for the 1997 sample and n = 236 for the 2013 sample.

Prior to analysis, stored otoliths were rinsed with Milli-Q water and scrubbed with a nylon brush, dried under a laminar flow hood for 24 h, mounted on petrographic slides (sulcus side up) using cyanoacrylate glue, and then sanded using 600- and 1500-grit sandpaper and polished using 0.5- and 0.1-μm diamond lapping film until the plane of the nucleus became visible. After a final scrubbing and soaking with Milli-Q water, otoliths were remounted on a new petrographic slide. Finally, otoliths were aged independently by two experienced readers to assess whether the proportion of age classes among both past and recent samples were similar.

Trout age composition was similar between the two samples, ranging from age 1 to age 9, representing recruitment years 1988–1996 for the 1997 sample, and recruitment years 2004–2012 for the 2013 sample. Ages 3–6 comprised most of both samples (1997, 74.2%; 2013, 66.5%), with age 2 comprising the next predominant age group (0.1% and 13.1%, respectively).

Remaining age groups (1, 7, 8, and 9) collectively comprised 15.7% and 20.3% of the past and recent trout samples, respectively.

Otolith microchemistry

Otolith microchemistry was performed using laser ablation ICPMS equipped with a 213-mm laser. Otolith 87Sr:86Sr, Sr:Ca, and Ba:Ca ratios were obtained using a Thermo Finnigan Neptune multiple collector ICPMS, whereas elemental Mg:Ca and Mn:Ca ratios were obtained in a separate assay using a Thermo Finnigan Element single collector ICPMS, both of which were coupled to the same laser ablation system (Walther et al., 2008). Both juvenile and adult otoliths were ablated along a 450-μm, curved transect for both assays using a beam diameter of 75 μm, a repetition rate 20 Hz, and a scan speed 5 μm s−1. The length of the transect was chosen to insure an adequate sample was ablated to reduce potential variation from spot measurements.

Transects were positioned adjacent to the core to minimize interference from a maternal signal and within the narrow region corresponding to the early growth period right after emergence but prior to outmigration to the lake. This region, ~ 100 μm from the core, was determined from preliminary transects run across otoliths from the core to the edge, showing a different chemical signature from the maternal core and the lake entry and rearing phase. Sample processing was randomized to minimize potential systematic bias from instrument drift. For quality assurance, a certified reference material (MACS-3; Wilson et al., 2008) was run every 10 samples (Muhlfeld et al., 2012) to assess instrument drift and changes in mass bias. The mean 87Sr:86Sr ratio (± 1 SD) for the certified reference material run throughout the analysis was 0.707601 ± 0.00006, which is within 1 standard deviation of the accepted value of MACS-3 (0.70759). External precision (relative SDs) for Sr:Ca and Ba:Ca ratios based on repeated measurements (n = 34) of a certified reference material (FEBS-1; Sturgeon et al., 2005) was 4.0% for Sr:Ca, 0.1% for Ba:Ca, 1.5% for Mg:Ca, and 4.1% for Mn:Ca. Isotopic interference was corrected following methods described above for water analyses. Lastly, similar to Blair & Hicks (2012), all element:Ca readings that that fell below the detection limits of the LA-ICPMS and outliers greater than 5 standard deviations from the mean were excluded from further analysis.

Statistical analyses

Statistical analyses were conducted using R version 3.1.1 (R Core Team, 2014). We first examined relations between water and otolith chemistry for each isotope (87Sr:86Sr ) and elemental ratio (Sr:Ca, Ba:Ca, Mg:Ca, and Mn:Ca) with known-origin fry using simple linear regression. Initial examination of diagnostic quantile-quantile plots, box plots, and scatter plots found no violations of the parametric assumptions of normality, homoscedasticity, linearity, and equal variances. There was a near 1:1 relation between otolith and water Sr isotopes [(87Sr:86Sr)otolith = 0.98(87Sr:86Sr)water + 0.017; r2 = 0.99; p < 0.001] (Fig 2); otolith and water Sr:Ca ratios were also strongly related [(Sr:Ca)otolith = 0.21(Sr:Ca)water + 0.20; r2 = 0.74; p = 0.029) (Fig. 2). In contrast, there were no clear linear relations between otolith and water Ba:Ca (r2 = 0.21; p = 0.36), Mg:Ca (r2 = − 0.21; p = 0.76), and Mn:Ca r2 = − 0.22; p = 0.78) ratios. Thus, only 87Sr:86Sr and Sr:Ca ratios were retained for further analyses.

Fig. 2
figure 2

Mean (n = 6) otolith 87Sr:86Sr (top) and Sr:Ca (bottom) ratios for known-origin cutthroat trout fry from six streams. Solid line represents least squares regression line; shaded region represents 95% confidence intervals for the regression. Linear regression equations listed in text

Studies that seek to identify the natal origin of fish using otolith microchemistry typically collect known-origin fish from a variety of recruitment sources to develop a model for classifying the natal source of unknown-origin fish (e.g. Munro et al., 2005; Muhlfeld et al., 2012; Sousa et al., 2016). In the case of Yellowstone Lake, where many potential recruitment tributaries exist, the collection of known-origin fish over many potential sources was impractical.

We therefore used associations between otolith and water chemistries to create a model to predict otolith Sr isotopic and elemental signatures from water chemistry data for all 22 study streams. A 3-step process was used to create a range of expected otolith 87Sr:86Sr and Sr:Ca ratios for each stream. First, otolith Sr:Ca ratios were predicted by applying a partition coefficient based on the relationship between water and otolith Sr:Ca derived from known-origin cutthroat trout fry.

Partition coefficients reflect the degree that elemental uptake in otoliths reflects that found in source waters; the relationship observed for Yellowstone cutthroat trout in our study based on the slope of the linear regression for Sr:Ca (DSr:Ca = 0.21) is similar to that found for cutthroat trout in other studies (Wells et al., 2003; Muhlfeld et al., 2012). In contrast, predicted otolith 87Sr:86Sr were estimated directly from the measured water 87Sr:86Sr because of the near 1:1 relationship.

Uncertainty in predicted otolith 87Sr:86Sr and Sr:Ca values was estimated by averaging the standard deviation of known-origin otolith 87Sr:86Sr and Sr:Ca calculated for each stream (n = 6; Beaverdam Creek was excluded because only one otolith remained after preparation). Second, similar to the bootstrapping appproach used by Mikheev et al. (2020) to increase sample size, we then generated 50 expected otolith 87Sr:86Sr and Sr:Ca values for each stream from a normal distribution via the “rnorm” function in R, applying the predicted mean otolith 87Sr:86Sr and Sr:Ca value and standard deviation of otolith isotopic and elemental signatures of known-origin fish as above. Due to the higher variation in incorporation of elemental Sr into otoliths from stream water, two standard deviations were used when generating expected values for Sr:Ca. Comparisons of the expected and observed otolith signatures from known-origin cutthroat trout fry were used to assess the accuracy of the simulated otolith chemistry values (see below).

Bivariate plots of stream water 87Sr:86Sr and Sr:Ca revealed considerable overlap in chemical signatures among some spawning tributaries, making it difficult to classify fish to individual streams with high confidence. Therefore, for the third step, we grouped study streams according to their isotopic and elemental signatures and then assigned fish to their putative natal “stream group” using a cluster-then-predict approach (Booker, 2016). The general approach to this process is to first apply clustering methods to assign streams into groups; visual assessment of cluster separation is used to select the number of clusters for initial grouping. Next, a classification method is applied to assign known-origin fish to a stream group (cluster) based on their otolith 87Sr:86Sr and Sr:Ca. This process is repeated with different numbers of clusters, and classification accuracy is used as the criteria for determining the maximum number of clusters that provides acceptable classification error.

Streams were grouped according to similarity of water chemistry using nearest-neighbor cluster analysis (Mächler et al., 2014). Multiple samples of water chemistry from large streams were pooled by averaging, and stream 87Sr:86Sr and Sr:Ca ratios were standardized to a mean of zero and a standard deviation of 1.0 to accommodate for the different scales of isotopic and elemental values. Variables in the cluster analysis were also weighted to reflect the higher precision of isotopic Sr as a predictor of natal origins than elemental Sr (Kennedy et al., 2000).

Standardized stream Sr:Ca ratios were multiplied by 0.50 to reduce the variance by 50% and to reflect additional weight given to Sr isotopes (Kaufman & Rousseeuw, 2005).

A random forest model was then constructed for the classification of cutthroat trout to their natal ‘cluster’ based on otolith signatures. Mercier et al., (2011) found that random forest modeling had the highest classification accuracy and least violation of statistical assumptions among discrimination methods used for otolith microchemistry studies. Because we were not able to collect fish from each stream in the study area, the ‘simulated’ otolith signatures generated from water-otolith Sr:86Sr and Sr:Ca regressions were used for the training data set.

Random forest classification trees were constructed using a random subset of 75% of the simulated otolith signatures using bootstrap resampling with replacement. For each tree, a random predictor variable was searched at each node to find the one that maximized the within group homogeneity until further splitting resulted in no gain in within-group homogeneity (Mercier et al., 2011). Our sample of known-origin cutthroat trout otoliths and the remaining 25% of the simulated otolith signatures were then used to classify the prediction accuracy of the model using a ‘confusion’ or error matrix (Tharwat, 2018). This process was applied iteratively to determine the optimum number of stream clusters. The final combination of clusters and classification model was used to classify unknown-origin fish to a putative natal stream cluster based on ensemble predictions of 500 bootstrapped random forests each with 2000 trees (see also Mikheev et al., 2020).

To assess whether there were differences in the proportional contributions of stream clusters to recruitment represented in 1997 and 2013 fish samples, differences in the assigned natal origins for each group were quantified using a Pearson’s chi-squared test with Monte-Carlo simulation of P values. Due to insufficient sample sizes (< 5) in some clusters, 2000 random p-value replicates were simulated to create a reference distribution for comparison to satisfy the assumptions of the analysis (Hope, 1968).

Results

Water chemistry

Yellowstone Lake tributaries showed some distinctive differences in 87Sr:86Sr and Sr:Ca ratios among streams but also displayed a substantial amount of overlap. Sr isotope ratios ranged from 0.705824 to 0.709242, and Sr:Ca ratios ranged from 0.92 to 6.22 mmol/mol (Table 1).

Table 1 Stream 87Sr:86Sr, Sr:Ca, Adjusted Sr:Ca ratios, and cluster assignment. Adjusted Sr:Ca ratios were adjusted using the partition coefficient (0.21)

Streams in close proximity generally exhibited similar microchemical signatures, but in some cases nearby streams exhibited very different signatures (Fig. 3). For example, Chipmunk Creek and Grouse Creek, two streams that are within 5 km of one another, exhibited similar Sr:Ca (3.40 and 3.54 mmol/mol, respectively), but much different 87Sr:86Sr (0.706757 and 0.709242, respectively, Table 1). In some instances, streams that are relatively distant from one another exhibited similar microchemical signatures, e.g., Trail Creek and Little Arnica Creek had very similar 87Sr:86Sr (0.707903 and 0.707732, respectively) despite a spatial separation of nearly 47 km (Fig. 1).

Fig. 3
figure 3

Nearest neighbor cluster analysis dendrogram of streams (n = 22) and spatial distribution of 9 distinct clusters based upon stream 87Sr:86Sr and adjusted Sr:Ca ratios. The y-axis ‘height’ is a measure of the closeness of clusters based on where they split. Clear, Cub, Columbine, and Cabin creeks had replicate samples. Colored text and numbers in the left column correspond to the cluster (and corresponding colored line) in which each stream was assigned

Annual variation in water 87Sr:86Sr and Sr:Ca among streams was low, with a mean difference between replicate samples of 0.04‰ (0.01‰–0.12‰) and 6.77% (2.03%–11.90%), respectively. In contrast, differences among streams in 87Sr:86Sr and Sr:Ca were ~ 15 and 3.5 times greater, respectively, than differences among replicate water samples.

Cluster analysis resulted in all 22 sampled streams being grouped into 9 clusters (Table 1; Fig. 3) (n = 22), with the number of streams in each cluster ranging from 1 to 7. Two clusters contained multiple streams that were all located in relatively close proximity to one another, cluster 6 (Cub and Pelican Creeks) and cluster 9 (Mountain and Thorofare Creeks). However, the remaining clusters had no clear spatial patterns; for example, streams in the largest cluster (cluster 2) were distributed throughout the West Thumb, main basin, and the South and Southeast Arms of Yellowstone Lake. Bivariate plots of 87Sr:86Sr and Sr:Ca showed that addition of Sr:Ca in the analysis improved separation of the clusters (e.g. clusters 4 and 9; see Table 1 and Fig. 4).

Fig. 4
figure 4

Bivariate plots of 87Sr:86Sr and Sr:Ca stream clusters for historic (top) and recent (bottom) cutthroat trout otolith signatures

Classification of Natal origins

Mean classification accuracy of simulated otolith signatures for the 9-cluster random forest model across all 500 random forests was 79.0% (range, 72.3–84.5%). Classification accuracy of simulated otolith signatures varied across clusters, ranging from 31.6% to 100.0% (Table 2). Cluster 1 and cluster 4 had the lowest classification accuracy of simulated otolith signatures with 49.3% and 31.6% correct classification, respectively. Cluster 7 and cluster 8 had the highest classification accuracy with 100% correct classification. Known-origin trout fry (representing 4 of the 9 clusters) were classified with 84.4% classification accuracy (range, 77.8–100.0%; Table 3).

Table 2 Confusion matrix for the 9-cluster random forest classification model
Table 3 Classification accuracy of known-origin juvenile cutthroat trout (n = number sampled) for the 9-cluster random forest model

The proportion of both past and recent fish otolith signatures assigned to each cluster ranged from 0 to 0.33 (Table 4). The largest recruitment for both groups, totaling 84% of the 1997 sample and 77% of the 2013 sample, originated from just 3 clusters (2, 5, 6), representing some of the largest tributaries, including the Upper Yellowstone River (cluster 2), the Lower Yellowstone River (cluster 5), and Pelican Creek (cluster 6). Little or no recruitment was observed from clusters 1 (Arnica Creek and Bridge Creek), 3 (Big Thumb Creek) and 8 (Grouse Creek).

Table 4 Proportion of recruitment contribution from each stream cluster based on random forest model (past, n = 89; recent, n = 236)

Proportional trout recruitment differed significantly among the stream clusters (X2 = 15.40; P = 0.03). Recruitment remained similar over time in 4 of the 9 clusters, with proportional differences of less than 0.1 between the two sample periods. However, three clusters, clusters 5, 6, and 7, showed a 15, 33, and 66% respective decrease in recruitment contribution, whereas cluster 2 showed an increase in recruitment of 22%. The largest change occurred in cluster 9, where recruitment increased by a factor of 4.5, from 0.02 in 1997 to 0.11 in 2013.

Discussion

Analysis of otolith chemistry shows promise for assessing changes in life history variation and fish recruitment sources over time that otherwise would be difficult using traditional recruitment assessment methods (Miller et al., 2011; Brennan & Schindler, 2017, Brennan et al., 2015,, 2019a, b; Walsworth et al., 2020). In this study, we demonstrated that cutthroat trout otolith samples collected in a large lake system between a 16-year-interval, could, with some limitations, show differences in sources of natal recruitment over time in response to significant environmental stressors. We also developed a novel method for creating simulated otoliths from natal tributary water samples when otoliths from known-origin juvenile fish used to assign natal origins of adult fish are limited.

The relatively low variation in stream microchemical signatures among the 22 Yellowstone Lake spawning tributaries we sampled limited the power of our recruitment analysis. Because the underlying bedrock geology in the northern and western portion of the drainage is primarily rhyolite, and the underlying geology in the eastern and southern portion of the drainage is primarily mixed clastic/volcanic rock, streams within these respective regions often had similarities in their microchemical signatures. Accordingly, some areas with unique bedrock types within the drainage also exhibited unique microchemical signatures. For instance, Grouse Creek was the only stream that had an abundance of conglomerate bedrock within its watershed and consequently its Sr isotopic signature was much higher than other streams. There were also some anomalies in water chemistry throughout the drainage. For example, Little Thumb Creek and Solution Creek, two streams flowing into the West Thumb region of the lake, exhibited similar microchemical signatures to Cabin Creek, Beaverdam Creek, Upper Yellowstone River, and other streams originating from areas with a different dominant bedrock type. We suspect that these similarities in microchemical signatures despite very different geologies may be due to the presence of alluvial deposits at lower elevations that may mask the signature effects from higher elevation bedrock types.

The low range in variation in 87Sr:86Sr ratios we observed also contributed to the relatively low number of unique stream microchemical signatures. For example, Muhlfeld et al. (2012) found that 87Sr:86Sr ratios varied from 0.71131 to 0.74679 among 41 sites in the Flathead River drainage, MT, an isotopic range of ~0.0355. In contrast, the 87Sr:86Sr ratios in the Yellowstone Lake drainage varied from 0.705824 to 0.709242, an isotopic range of ~0.0034, an order of magnitude lower. The stream Sr:Ca ratios in this study exhibited a wider range of variation (0.92–6.22 mmol/mol) than isotopic ratios and was similar in magnitude to that observed in other studies (0.23–3.10 mmol/mol; Muhlfeld et al., 2012); however, most of the streams (59%) in our study exhibited a much smaller range of variation (3.00–5.00 mmol/mol). Additionally, Sr:Ca ratios of otoliths are heteroscedastic (Wells et al., 2003), with fish from streams with high Sr:Ca ratios, like those found in the Yellowstone Lake drainage, showing a wider range of variability in their otolith Sr:Ca signatures, thus further limiting their discriminatory power.

The low variation in stream and otolith chemistry necessitated the clustering of streams. Clustering can improve classification accuracy for estimating natal origin (Wells et a1., 2003; Carleton, 2011; Brennan & Schindler, 2017). Carleton (2011) found that classification accuracy improved by approximately 20% when streams with similar microchemical signatures were grouped together. A limitation of our study was the relatively low number of clusters identified (9) given the number of streams sampled (22). Additionally, the number of sampled streams represented only 32% of the number of streams where spawning has been documented. Though our analysis included many of the streams with the largest known concentrations of spawners, inclusion of a larger number of streams would likely have increased the number of unique clusters and provided for a finer level of discrimination among sites. However, given the low variation in water chemistry in the watershed, even more extensive water sampling may not have increased discriminatory power.

Our analysis revealed that three stream clusters were most influential for recruitment both before (combined proportion of recruitment contribution = 0.84) and after (0.77) the expansion of lake trout, emergence of whirling disease, and persistent drought. These stream clusters (2, 5, and 6) contain some of the largest tributaries within the watershed, including the upper and lower Yellowstone rivers, and Pelican Creek, all streams that previous studies reported as having large numbers of spawning fish and multiple spawning sites (Reinhart, 1990; Kelly, 1993; Ertel et al., 2017). However, our analyses also revealed some important differences in recruitment patterns over time, reflecting likely changes in response to large-scale stressors. In support of our initial hypothesis, we found 15% and 33% declines in recruitment contribution from clusters 5 and 6, respectively, which may be attributed to high juvenile mortality resulting from very high infection risk for whirling disease in Pelican Creek, the lower Yellowstone River, and the northern region of Yellowstone Lake (Koel et al., 2006, 2019b). We hypothesize that the notable decline (66%) in recruitment contribution from cluster 7, which includes Flat Mountain and Little Arnica creeks, may be due to extremely high lake trout densities in the Flat Mountain Arm and West Thumb regions of the lake. In contrast, we speculate that the increased recruitment contribution from clusters 2 (+ 22%) and 9 (+ 4500%), wherein whirling disease has not been detected, may be due to their higher watershed elevations, relatively large stream sizes, and drainage into the southern sections of Yellowstone Lake where lake trout densities are low. These clusters include Cabin, Mountain, and Thorofare creeks and the upper Yellowstone River mainstem, located in the remote Bridger-Teton Wilderness south of Yellowstone National Park.

We speculate that these larger, higher elevation streams would also be more resilient to drought, which ostensibly reduced recruitment to many of the smaller tributaries around Yellowstone Lake in the early-2000s by reducing access for spawners migrating from the lake (Koel et al., 2005; Kaeding, 2020). The South Arm of Yellowstone Lake, where the inlet of the upper Yellowstone River is located, has the lowest densities of predatory lake trout, and the fry emerging from tributaries long distances from the lake would have a greater ability to escape their predation due to increased size from a more extended growth and rearing phase before entering the lake. An important caveat in interpreting these results is that we had only 1 year each of pre- and post- data. Thus, we were unable to identify the extent of natural variation in recruitment among sites which, in Pacific salmon populations, has been shown to shift considerably even over relatively short time periods (Schindler et al. 2010).

Previous studies have emphasized the importance of sampling water and known-origin reference fish from all areas of potential recruitment when using otolith chemistry to assess natal origins (Campana, 2005; Brennan & Schindler, 2017; Mikheev et al., 2020). However, common to ours and other studies (e.g., Brennan et al., 2015), fish sampling does not occur at all water sampling stations, necessitating the development of models for estimating the degree of spatial resolution of microchemical data. Here, we developed a method for simulating otolith signatures from water sampling alone using a combination of Sr isotope and Sr:Ca data. Our results suggest that using water samples to generate simulated otolith data in combination with clustering analyses could be a useful approach to guide sampling efforts and evaluate the feasibility and discriminatory power in otolith studies. Finally, our findings also underscore the importance of extensive water sampling prior to study initiation to estimate the likely discriminatory power available. Extensive water sampling coupled with development of dendritic network models offers a promising approach for generating isoscape maps that potentially provide a greater level of discriminatory power for apportioning natal origin and recruitment of fish in a mixed stock fishery into specific locations within a watershed (Brennan et al., 2015; Brennan & Schindler, 2017).

Our study demonstrated both the potential strengths as well as the limitations of otolith microchemistry for determining spawning stock contributions and changes in fish recruitment patterns. The analyses used in this study for estimating recruitment changes over time should be useful to other researchers aiming to optimize use of otolith analyses over large drainage basins.