DNA barcoding and qPCR assay design
In 2011, 29 right pelvic fins of brook trout were collected and placed loosely in an envelope and allowed to dry before flattening (Oregon Scientific Taking Permit #16297, Supplemental Table 2). Filter paper was not available to dry out the fin clips, so fins were placed loosely on paper partially within an envelope to air dry prior to closing. Using Qiagen DNeasy blood and tissue kit (Qiagen Inc.) total DNA, genomic and mitochondrial, was extracted from six fin clips following manufacturers guidelines.
To design a species-specific 5′ exonuclease assay, a 485 bp segment of the mitochondrial cytochrome b gene (Cytb) was sequenced for six brook trout using conserved animal primers, H15149 and L14724 (Kocher et al. 1989; Irwin et al. 1991). Resulting Cytb sequences were aligned using Sequencher software version 4.8 (Gene Codes). Using the 485 bp fragment from the brook trout Cytb region as a template, a BLAST search was conducted against the NCBI nucleotide database to search for a sequence of conserved DNA specific to brook trout while having the greatest amount of heterogeneity compared to closely related co-existing species. From this conserved region of the brook Cytb gene, a forward primer, reverse primer, and a species-specific probe were designed using Primer Express® Oligo Design software (Applied Biosystems) to perform a 5′ exonuclease (TaqMan™ MGB Probe) assay (Table 1). An additional BLAST search was conducted on the assay amplicon to ensure in silico specificity.
Table 1 Cytb assay primer and probe sequences used for Brook Trout eDNA surveys qPCR assay validation
The assay was optimized following Thermo Fisher Life Technologies protocol for the optimization of primers and probe. For samples used to validate the species-specific assay, PCR was performed in a 10 μL total volume containing: 4 μL DNA template, 2X TaqMan® Master Mix (Thermo Fisher Life Technologies) with optimized concentrations of both forward and reverse primers and for the probe. Thermal cycling occurred with Bio-Rad’s CFX 96 real-time detector under the following conditions: initial enzyme activation of 10 min at 95 °C, 40 cycles of 15 s denaturation at 95 °C, 1 min annealing/extension at 60 °C. The number of PCR cycles required for reaction fluorescence (C(q)) values were quantified using CFX Manager™ (version 3.1; Bio-Rad). The assay was also validated in vitro for specificity by testing for cross reactivity with a panel of common fish species including: Salvelinus confluentus (bull trout), Oncorhynchus mykiss (rainbow trout), Alosa sapidissima (American Shad), Percina macrolepida (bigscale logperch), Pomoxis nigromaculatus (black crappie), Lepomis macrochirus (bluegill), Cyprinus carpio (common carp), Ictalurus punctatus (channel catfish), Micropterus salmoides (largemouth bass), Spirinchus thaleichthys (longfin smelt), Menidia beryllina (Mississippi silverside), Gambusia affinis (mosquito fish), Clupea pallasii (Pacific herring), Cottus asper (prickly sculpin), Lepomis gibbosus (pumpkinseed), Catostomus occidentalis (Sacramento sucker), Tridentiger bifasciatus (shimofuri goby), Pogonichthys macrolepidotus (Sacramento splittail), Morone saxatilis (striped bass), Dorosoma petenense (threadfin shad), Hypomesus nipponensis (wakasagi smelt), Ameiurus catus (white catfish), Pomoxis annularis (white crappie), and Acanthogobius flavimanus (yellowfin goby). Eight no-template controls were included per plate and the threshold was set above background fluorescence. No contamination was observed in the no-template control samples. A cross amplification panel was tested in triplicate. Sensitivity of the assay was determined by tenfold serial dilution of a positive control derived from brook trout fin clips with a concentration ranging from 20 to 0.00002 ng/μL. A brook trout DNA standard curve was constructed to determine the slope (qPCR efficiency), Y-intercept, and R2 values.
eDNA field sampling and sample analysis
Transect associated with electrofishing survey
Approximately 10 km of Lake Creek up to and including High Lake was surveyed by Burns Paiute Tribal staff at 84 sites in the summer and fall of 2014 for the presence of bull trout and brook trout. Thirty sites were randomly chosen from the 84-site regime to collect eDNA (water) samples, with all sites surveyed for brook trout using standard visual survey methods. The intent of this sampling was to corroborate brook trout eDNA detections using a designed brook trout qPCR assay with electrofishing confirmation of brook trout presence. During the summer sampling event, eDNA (water) samples were obtained from 18 (of the proscribed 30) sites in Lake Creek and nine eDNA (water) samples were obtained from High Lake (Table 2). During the fall sampling event, eDNA samples were taken from the 30 prescribed sites and nine eDNA samples were taken from High Lake (Table 2).
Table 2 Summary of events, filtered volumes, and distances from eDNA sources sampled across years Environmental DNA samples were obtained from field sites using water filtration procedures following Bergman et al. (2016) and Blankenship and Schumer (2017). Water was pumped from the stream using a battery operated peristaltic pump, with organic material sequestered in a Millipore Sterivex™ Filter Column (MilliporeSigma). Each eDNA sample was collected using sterile single-use materials and filters were kept on ice while in the field. Filters were frozen prior to shipping, and all filters were shipped overnight on blue ice to Cramer Fish Sciences in California for processing and analysis.
Field experiments: volume and distance sampling from known source
Field tests were conducted to examine relationships between filtered volume, number of replicate samples, distance from source, and the strength of eDNA detection. Due to the ubiquitous presence of brook trout in the Upper Malheur Watershed, rainbow trout and smallmouth bass were used as proxies for experimental distance tests (Supplemental Table 3). Previously validated species-specific qPCR assays were used for rainbow trout and smallmouth bass (Brandl et al. 2015). Cages were used to hold experimental fish at fixed stream locations and eDNA samples were taken 16–24 h after installation of a single test fish subject at prescribed distances downstream of cage location (100, 250, 500 and 1000 m). Cages were installed at sites within a system where proxy fish species were known to be absent. Background or negative control samples were collected within test sites to ensure that neither naturally introduced nor exogenous DNA from either surrogate species was present before or during the experiment.
Field sampling events occurred two or three times a year during different flows and temperatures from 2015 to 2017 (Supplemental Table 4). Sampling events yielded filtered water samples of varying volumes, representing multiple distances and numbers of replicates (Table 2). Flow, temperature, and turbidity were measured in situ according to modified Oregon Department of Environmental Quality Protocols (DEQ). Rosgen classifications for each sampled reach were used to facilitate information transferability and to assist in data interpretation, as these were readily available for proposed reaches and are a widely utilized stream habitat classification system (Rosgen 1994).
Experimental stream reaches were selected according to a perceived carrying ability of streams based on hydrologic settings including gradient, flow, and temperature in order to obtain a sampling distance that can be applied watershed-wide. Three river sites were selected where smallmouth bass were absent: Meadow Fork of Big Creek (high gradient), Summit Creek (moderate gradient), and Lower Lake Creek (low gradient). For the 2016 experiments, smallmouth bass test subjects were killed prior to installation into cages to avoid accidental release. Permitting restrictions required that any smallmouth bass used as part of this study must be dead prior to use in the field. While evidence suggests the probability of detecting eDNA of live fish differs from that of dead fish (Kamorof and Goldberg 2018), we did not calculate the difference in probability as part of this study. All 2017 sampling events occurred in Lake Creek above Lake Creek Falls. Live rainbow trout were the experimental subjects installed in cages, as rainbow trout are not present upstream from Lake Creek Falls.
Laboratory analysis
DNA from all samples and controls were extracted using PowerWater Sterivex™ DNA Isolation Kit (Qiagen, Inc.) following the manufacturer’s recommended guidelines. A DNA extraction negative control was processed in parallel to ensure sample integrity throughout the extraction procedure. The DNA extraction control consisted of Sterivex™ filtered ultrapure water only. DNA extraction controls were processed using the same equipment used to extract DNA from all samples. Each sample and all controls were analyzed in triplicate for the presence of the target species Cytb mitochondrial gene using species-specific brook trout qPCR primer and probe set and laboratory methods described in Bergman et al. (2016). Each 10 μL qPCR reaction was conducted as described previously. Thermocycling was performed using a Bio-Rad CFX 96 Real time System (Bio-Rad Laboratories, Inc.) with the following profile: 10 min at 95 °C, 40 cycles of 15 s denaturation at 95 °C and 1 min extension at 60 °C. Field collection controls (i.e., blanks) and a positive DNA template were also used. Positive control reactions consisting of target species genomic DNA template were tested in parallel to ensure consistent PCR performance. All PCR master mixes were made inside a UV-sterilized PCR enclosed workstation. The DNA template was added to the master mix outside of the UV PCR workstation on a dedicated PCR set up workbench. All PCR reactions were conducted on instruments located outside of the main lab in a separate portion of the building. Results of the qPCR reactions were analyzed using BioRad CFX manager v3.1 (Bio-Rad Laboratories, Inc.), with the magnitude of the qPCR signal (“detection strength”) reported as the number of quantification cycles (C(q)). C(q) is defined as the number of PCR cycles required for reaction fluorescence to exceed background fluorescence. The results of the qPCR reactions were analyzed using BioRad CFX manager v3.1 (Bio-Rad Laboratories, Inc.). Target species DNA was considered to have been detected in a sample if any of three replicates had C(q) < 40.
Statistical analysis
The goal was to determine a sampling protocol for reliably obtaining positive detection of the target species, given its known presence and location. We initially considered sampling location (a proxy for stream gradient), distance from source, filtered volume, and number of replicate samples as primary predictors of the strength and probability of detection. During sampling events in 2016, poor water quality conditions resulted in the combination of multiple Sterivex filters into the same eDNA sample in order to obtain the target filtered volume of water. It was unclear how (or if) this combination process affected the probability of eDNA detection in an individual sample, and how (or if) this was confounded by distance and sampling site. To reduce bias associated with this uncertainty, these multi-filter samples were removed from the data presented in Fig. 2 and Table 2, which summarizes the remaining 53 samples collected from 2015 through 2017. Because this left an uneven distribution of replicate samples across distances and gradients, distance from source and stream gradient were held constant for statistical modeling. For modeling, only samples taken at 500 m using a single filter per sample from Lake Creek were used. This included 37 samples from 2017, and two samples from 2015, for a total of 39 samples in the model dataset.
We chose a Bayesian statistical approach because it allowed us to model detection strength (a continuous variable with an upper bound, preferable to modeling detection as a binary variable) and simultaneously examine whether high-volume samples or high-replicate samples resulted in more positive (binary) detections than low-volume or low-replicate samples. The employed Bayesian framework made this distinction possible through the derivation of a distribution of binary detection probabilities across volumes and number of replicates from the posterior probability distribution of modeled detection strength. The multivariable linear model was written in the Stan programming language and fit using the rstan package (Version 2.17.3) in the statistical computing software R (Version 3.5.1). Model fit diagnostics included posterior predictive checks, inspection of the potential scale reduction statistic \( \left( {\hat{R}} \right) \) and effective sample size \( \left( {n_{eff} } \right) \), and verification that Markov Chain Monte Carlo chains were well-mixed and stationary. Model code is available from www.github.com/fishsciences/BPT_eDNA.
Equally important to estimating the probability of a positive detection is estimating that of a false negative detection (i.e., failure to detect eDNA when the target species is in fact present). We estimated a detection rate from the observed frequencies of detection across volumes and number of samples and extrapolated that linearly across a potential sampling regime in order to guide future sampling efforts.