Ecological correlates of blue whale movement behavior and its predictability in the California Current Ecosystem during the summer-fall feeding season

Palacios, Daniel M.; Bailey, Helen; Becker, Elizabeth A.; Bograd, Steven J.; DeAngelis, Monica L.; Forney, Karin A.; Hazen, Elliott L.; Irvine, Ladd M.; Mate, Bruce R.

doi:10.1186/s40462-019-0164-6

Ecological correlates of blue whale movement behavior and its predictability in the California Current Ecosystem during the summer-fall feeding season

Research
Open access
Published: 18 July 2019

Volume 7, article number 26, (2019)
Cite this article

Download PDF

You have full access to this open access article

Movement Ecology Aims and scope Submit manuscript

Ecological correlates of blue whale movement behavior and its predictability in the California Current Ecosystem during the summer-fall feeding season

Download PDF

7241 Accesses
24 Citations
41 Altmetric
4 Mentions
Explore all metrics

Abstract

Background

Species distribution models have shown that blue whales (Balaenoptera musculus) occur seasonally in high densities in the most biologically productive regions of the California Current Ecosystem (CCE). Satellite telemetry studies have additionally shown that blue whales in the CCE regularly switch between behavioral states consistent with area-restricted searching (ARS) and transiting, indicative of foraging in and moving among prey patches, respectively. However, the relationship between the environmental correlates that serve as a proxy of prey relative to blue whale movement behavior has not been quantitatively assessed.

Methods

We investigated the association between blue whale behavioral state and environmental predictors in the coastal environments of the CCE using a long-term satellite tracking data set (72 tagged whales; summer-fall months 1998–2008), and predicted the likelihood of ARS behavior at tracked locations using nonparametric multiplicative regression models. The models were built using data from years of cool, productive conditions and validated against years of warm, low-productivity conditions.

Results

The best model contained four predictors: chlorophyll-a, sea surface temperature, and seafloor aspect and depth. This model estimated highest ARS likelihood (> 0.8) in areas with high chlorophyll-a levels (> 0.65 mg/m³), intermediate sea surface temperatures (11.6-17.5 °C), and shallow depths (< 850 m). Overall, the model correctly predicted behavioral state throughout the coastal environments of the CCE, while the validation indicated an ecosystem-wide reduction in ARS likelihood during warm years, especially in the southern portion. For comparison, a spatial coordinates model (longitude × latitude) performed slightly better than the environmental model during warm years, providing further evidence that blue whales exhibit strong foraging site fidelity, even when conditions are not conducive to successful foraging.

Conclusions

We showed that blue whale behavioral state in the CCE was predictable from environmental correlates and that ARS behavior was most prevalent in regions of known high whale density, likely reflecting where large prey aggregations consistently develop in summer-fall. Our models of whale movement behavior enhanced our understanding of species distribution by further indicating where foraging was more likely, which could be of value in the identification of key regions of importance for endangered species in management considerations. The models also provided evidence that decadal-scale environmental fluctuations can drive shifts in the distribution and foraging success of this blue whale population.

Three decades of nearshore surveys reveal long-term patterns in gray whale habitat use, distribution, and abundance in the Northern California Current

Article Open access 23 April 2024

Predicting suitable habitats for foraging and migration in Eastern Indian Ocean pygmy blue whales from satellite tracking data

Article Open access 07 June 2024

Movements and behaviour of blue whales satellite tagged in an Australian upwelling system

Article Open access 03 December 2020

Background

Located off the western coast of North America, the California Current Ecosystem (CCE) boasts a rich biological productivity [1,2,3] and supports important populations of marine megafauna, including blue whales (Balaenoptera musculus) [4,5,6,7]. The Eastern North Pacific (ENP) blue whale population uses the CCE in late summer and early fall months [8,9,10], tracking the seasonal development of dense aggregations of euphausiid crustaceans (“krill”), their primary prey [11,12,13]. Krill are the key trophic link in this relatively simple food chain, consisting of phytoplankton, krill, and their predators [14, 15]. This trophic pathway was invoked by [13] to describe the sequence of events leading to the arrival of blue whales in Monterey Bay off central California in late summer, starting with springtime wind-driven coastal upwelling and followed by increased primary productivity and krill populations in spring and summer, which they termed “wind-to-whales”. This paradigm predicts that whale aggregations can be expected along this coast in areas situated downstream from upwelling centers [13], where a steep sea-floor topography, submarine canyons, and retentive circulation processes [15,16,17,18,19] act in concert with enhanced primary productivity and krill behavior to generate persistently high prey densities [11, 12, 20,21,22,23]. Indeed, satellite telemetry data have shown that in the CCE, blue whales aggregate in large numbers at these coastal “hotspots” [24], while ecosystem-wide studies describing blue whale distribution in relation to bathymetric and oceanographic variables have shown a clear association with the most productive habitats along the coast [9, 25,26,27].

Two of the most important blue whale aggregation hotspots in the CCE are found in areas of intense commercial ship traffic leading in and out of the ports of Los Angeles and San Francisco [24]. Blue whale mortality due to ship strikes has been a growing concern for the recovery of the ENP population [28,29,30,31], which is currently estimated at 1647 animals (CV = 0.07) [32], and is listed as Endangered under both the USA’s Endangered Species Act and the International Union for Conservation of Nature’s Red List [33]. Recent studies have shown that while actively engaged in feeding, whales may be less responsive to anthropogenic threats such as approaching ships [34] and military sonar operation [35], while the persistent proximity of vessels and shipping noise can result in a reduction in feeding opportunities [36, 37]. Therefore, an ecosystem-wide understanding of whale foraging behavior and its drivers would fill a critical information gap toward mitigating ship strikes and other impacts from interactions with human activities [29, 31, 38, 39].

Through the use of state-space modeling techniques, satellite telemetry data can be used to classify animal movement into simple behavioral states along with the estimation of regularly spaced locations in time along a track [40, 41]. Rapid, directed movement between locations is typically classified as “transiting mode”, while slower, localized movement is classified as “area-restricted searching (ARS) mode”. Researchers generally assume that, while in feeding areas, these ARS and transiting modes are indicative of foraging in and moving among prey patches, respectively, although these inferences have only been validated with direct measurement of behavior in a few species [42,43,44,45,46]. Nevertheless, behavioral state can be examined in relation to extrinsic conditions to determine whether patterns or correlations suggest an environmental basis [47,48,49]. Given the tight trophic coupling between blue whales and their prey during the summer-fall feeding season in the CCE, we would expect that regions of high whale density are also where ARS behavior is most prevalent.

Great strides have been made in recent years in the development and validation of habitat models (or species distribution models, SDMs) capable of predicting blue whale population abundance and distribution in the CCE from ship survey sighting data collected concurrently with bathymetric and oceanographic variables [25, 27, 50, 51]. Recently, we also applied SDMs to predict blue whale density in the CCE from satellite tracking data [26]. In this study we used this tracking data set to further investigate whether blue whale behavioral state could be similarly predicted through relationships with habitat variables as proxies for bottom-up ecological processes that favor krill aggregation. Given the relevance of behavioral context to anthropogenic threat response, an ecosystem-wide capability to predict where foraging is more likely would enhance the value of information from density models toward a more comprehensive characterization of key regions of importance for this endangered species in management considerations. Finally, an improved understanding of these species-environment relationships would also aid in better predicting the effects of climate change on the CCE ecosystem and the animal populations it supports [52, 53].

Methods

Blue whale tagging

The tags and tagging methodology have been documented in detail in previous papers [24, 54,55,56]. Briefly, ultra-high-frequency radio transmitters monitored by the Argos satellite system were attached to blue whales in three areas of the ENP (California, USA; Gulf of the California, Mexico; and in international waters of the eastern tropical Pacific) during the period 1993–2008. Tagging was conducted every year except for 1996, 1997, and 2003. Of the 182 tags deployed, 128 transmitted successfully for at least a day (d), and 104 transmitted for more than 7 d [56].

State-space modeling of satellite telemetry data

We fitted the Bayesian switching state-space model (SSSM) developed by [40] to the unfiltered Argos locations for each whale track longer than 7 d (n = 104), using the software packages WinBUGS v. 1.4.3 and R v. 2.12.1 [57], as detailed in [56]. Behavioral state was specified as one of two modes, transiting (mode 1) and ARS (mode 2), based on mean turning angle and autocorrelation in speed and direction. The SSSMs ran two Markov chain Monte Carlo simulations, each for 20,000 iterations, with the first 15,000 iterations being discarded as a burn-in, and the remaining iterations being thinned, removing every fifth one to reduce autocorrelation. The result was a regularized track with one estimated location per day, along with a measure of uncertainty expressed by the posterior 95% credible limits in longitude and latitude, which accounted for the Argos location error and the movement dynamics of the animals [56]. Although only two behavioral modes were modeled, the means of the Markov chain Monte Carlo samples provided a continuous value from 1 to 2 for each location. We chose values greater than 1.75 to represent ARS mode and values lower than 1.25 to represent transiting mode, while values in between were considered “uncertain,” as has been the practice in other studies [24, 43, 56].

The tracking data set covered the entire migratory range of the ENP blue whale population, from the eastern tropical Pacific in the south to the Gulf of Alaska in the north [56]. However, since most of the tagging was conducted off California, the majority of the tracks were concentrated along the western coast of North America [24, 56]. Therefore, for the purpose of this study we delineated our study area using the Exclusive Economic Zone (EEZ) jurisdictional boundary of the USA on the Pacific Ocean, obtained as a polygon shapefile from the Maritime Boundaries Geodatabase [58, 59] available at the Marine Regions web site [60]. We extracted SSSM tracking data within this EEZ region (longitudinal extent: 129–117°W, latitudinal extent: 30–49°N; see Fig. 1a), with the following constraints. First, we limited the tracking data to the period after 1998, when a complete suite of environmental variables from remote sensing was available, and we further restricted the temporal scope of the study to the months from July to November to focus on the summer and fall seasons, when the whales are known to forage intensively in the coastal environments of the CCE [8, 24, 56]. To avoid bias introduced by short-lived tags toward the vicinity of the tagging locations, we only used SSSM tracks with durations longer than 14 d or a distance traveled of at least 888 km (i.e., the average transit time and separation, respectively, between ARS patches reported by [56]). We also excluded locations with SSSM estimation uncertainty exceeding 100 km in radius (based on the posterior 95% credible limits generated by the SSSM), locations with uncertain behavioral mode classification, and the last location of each track (which did not receive a behavioral classification). Lastly, to focus our study on the coastal upwelling environments where blue whales concentrate their foraging activity, we excluded portions of tracks occurring over seafloor depths greater than 2000 m (although we note that some foraging also occurs in offshore waters; see Fig. 1b). The final tracking data set contained 1808 SSSM locations belonging to 72 tagged whales, with observations in all years except 2002 and 2003 (no tag deployments took place during 2003), all occurring within a distance of 113 km from shore (Fig. 2).

Environmental predictor data

We obtained relevant environmental variables for each SSSM location from digital elevation models and remotely sensed observations collected by oceanographic satellites available from various sources (Table 1). Variables describing the seafloor relief were depth (DEPTH), slope (SLOPE; expressed as a depth gradient), slope aspect (expressed as EASTNESS and NORTHNESS components), and distance to the 200-m isobath (or distance to the shelf break, DISTSHELF; with positive values indexing locations shoreward of the 200-m isobath and negative values indexing offshore locations). The oceanographic variables extracted included: vertical upwelling velocity (or Ekman pumping, WEKM), sea surface height (SSH), sea surface temperature (SST), and phytoplankton chlorophyll-a concentration (CHL). When available, these data were obtained through the web service Environmental Research Division Data Access Program (ERDDAP) [61], using the R package xtractomatic v. 3.4.1 [62], a collection of functions that permit client-side access to the data sets served by ERDDAP, to automate the process. Otherwise the data were obtained directly from the source indicated in Table 1.

Table 1 Details of the environmental variables obtained from digital elevation models and from oceanographic remote sensing satellites for this study, including measurement unit, abbreviation used in the text, range, spatial and temporal resolution, product name, and source. A reference to the literature is provided for product documentation or variable derivation. The original sources of the products are also given

Full size table

In order to account for the uncertainty in location estimation by the SSSM, for each environmental variable we obtained the weighted average of the observations around each location occurring within a box defined by the SSSM posterior 95% credible limits. The observations inside this box were weighted based on distance to the SSSM location using a Bézier spatial kernel [63]. The number of observations used in this computation was dependent not only on the extent of the SSSM credible limits around each location, but also on the spatial resolution of the environmental products used, which varied from 1 km for DEPTH to 37 km for SSH (Table 2). In addition to reflecting the uncertainty in location estimation, this approach had the benefit of minimizing the number of locations with missing environmental values due to cloud cover in some of the products, had we simply obtained the single pixel value nearest to a location.

Table 2 Description of the NPMR models reported in this study for the building and validation sets, using locations with complete cases (no missing observations in any of the predictors). For each model, SU is the number of sample units in populated neighborhoods, N_ave is the realized average neighborhood size, n_min is the realized minimum neighborhood size (0.25 × N_ave), logB is the log likelihood ratio, B_ave is the average contribution of a sample unit to logB. Additional measures of fit reported by HyperNiche include the cross-validated pseudo-R² (xR²), the Pearson correlation (r) between presence/absence response data and continuous estimate of probability, and the chi-square value (χ²) representing the deviance comparing the model to a naive model

Full size table

Given the large latitudinal extent of our study area (~ 20 degrees), oceanographic variables susceptible to the effect of the global north-south gradient in solar heating [64] were spatially detrended by fitting a local polynomial regression (loess) smoother on latitude (degree = 1, span = 0.75) in R, such that the latitudinally detrended variables represented a “spatial anomaly” above or below the mean at a given latitude (i.e., the local deviations solely caused by dynamic oceanographic processes). The detrended variables were SSH (dtSSH) and SST (dtSST). The set of environmental variables that were initially considered as candidate predictors in the models included the original nine variables as well as the detrended versions, for a total of 11 predictors. Collinearity among the predictors was assessed with the pairwise Pearson correlation coefficient (r) and graphically with scatterplot matrices. Redundant predictor pairs (i.e., those exceeding the threshold |r| ≥ 0.7; see [80]) were considered for exclusion from multiple-predictor models based on an informal exploration of their performance in single-predictor models.

NPMR modeling

Formal assessment of the relationship between whale behavioral mode (BMODE) and environmental predictors was conducted through nonparametric multiplicative regression (NPMR) modeling in the HyperNiche software v. 2.30 [81]. The HyperNiche implementation of NPMR is specifically designed for habitat modeling and has been shown to outperform other popular statistical techniques used in species distribution modeling like generalized additive models and random forests [82,83,84]. NPMR has been recently applied to animal movement data [85]. HyperNiche uses established standard practices in species distribution modeling, including leave-one-out cross-validation during fitting, overfitting controls, metrics for model selection, generation of bootstrap variability bands around fits, and randomization (Monte Carlo) tests for comparing to a null model [86,87,88]. A methodological overview of NPMR estimation as it applies to the context of this study is provided in the Appendix.

BMODE was encoded as a categorical binary variable indicating the absence (i.e., transiting) or presence of ARS, and the models estimated the likelihood of ARS at each SSSM location (locations with uncertain behavioral mode classification were not used in the analyses). NPMR was formulated using local mean models with Gaussian kernel weighting functions. With this configuration, the Gaussian function (which serves as a smoothing parameter) determines the weight at each location in the predictor data, while its standard deviation determines the size of the “environmental neighborhood” (n*; in units of number of locations around a target point). Model estimates are then computed at each location as the weighted average of the values of the response variable for the observations in the environmental neighborhood (with the univariate weights being combined multiplicatively), while being penalized with leave-one-out cross-validation to minimize overfitting. The corresponding formulations for these steps are shown in Eqs. 1–4 in the Appendix.

The statistics “tolerance” (the standard deviation of the Gaussian function) and “sensitivity” (the average effect size measuring the change in the level of the response for a given change in the predictors; Eq. 5 in the Appendix) indicate the relative scope and influence of the predictors, respectively. The primary metric for model selection for presence/absence response data in HyperNiche is logB, a log likelihood ratio expressing the improvement over a naive model (logB = 0). We also used B_ave, the average contribution of a sample unit to logB, which is independent of sample size (see the Appendix for details).

Models were fitted using the method of free search. All models used the default overfitting controls (medium automatic settings) available in HyperNiche, and an attempt to improve the final model was made with the tuning option (see the Appendix for details). The final model was assessed with a randomization that tested the null hypothesis that the observed fit (logB) was no better than could be obtained by chance. Randomization was carried out by shuffling the response variable and attempting to fit the best model possible using a free search. Confidence intervals for the logB statistic were obtained through bootstrapping, carried out by resampling the data and fitting the model to each sample. Both the randomization and bootstrapping procedures were repeated 100 times on the data set used for model building.

Model validation: predicting under different climatic conditions

Decadal-scale environmental variability has been hypothesized to influence long-term distributional shifts in ENP blue whales via changes in trophic linkages [89]. Given the long-term nature of our data set, we examined the possibility that blue whale movement behavior in the CCE might be in part driven by environmental fluctuations persisting across multiple years. For this purpose we used the North Pacific Gyre Oscillation (NPGO), a climate index that that closely predicts ecosystem-level changes in the ENP, with alternating phases of positive sign characterized by cool and highly productive conditions, and of negative sign characterized by warm conditions and reduced biological productivity, each lasting for several years [90]. Specifically, we built the final NPMR model on data from years dominated by a positive NPGO phase (1998–2004 and 2007–2008) and cross-validated it with data from years dominated by negative NPGO values (2005 and 2006; see Additional file 1: Figure S1) set aside for this purpose and not used for model fitting. The building data set contained 1444 SSSM locations belonging to 55 tagged whales for the positive phase of the NPGO, and the validation set contained 364 SSSM locations belonging to 18 tagged whales for the negative phase of the NPGO. This approach allowed us to test the model’s ability to predict blue whale behavioral state during different climatic regimes.

Model assessment with binary classification

Further assessment of model performance included metrics for the success of binary classification by converting the estimated ARS likelihood into estimated presence or absence of ARS behavior under different cutoffs. To achieve the optimal binary classification, we followed the approach described by [91]. For each cutoff level of the estimates, we computed the true positive rate (TPR; i.e., the proportion of correctly identified presences, also known as sensitivity) and the true negative rate (TNR; i.e., the proportion of correctly identified absences, also known as specificity) assisted by the R package ROCR v. 1.0-7 [92], and calculated the true skill statistic (TSS) as TPR + TNR - 1 [91]. We then used the cutoff level that maximized TSS to implement the binary conversion and to compute classification statistics [prevalence, accuracy, and precision]. We also report the area under the receiver operating characteristic curve (AUC), the root-mean square error (RMSE), and the Brier score [93] as metrics of model performance.

Accounting for spatial autocorrelation and other sources of variability

Unaccounted for spatial autocorrelation in models poses a problem for hypothesis testing because it violates the assumption of independence [94,95,96,97]. The optimized weighted averaging approach implemented by NPMR to derive model estimates (Eq. 4 in the Appendix) automatically accounts for spatial autocorrelation by estimating how much the response variable at a target location reflects response values at surrounding locations (i.e., within the environmental neighborhood), rather than treating them independently. This procedure is equivalent to the formulation implemented in generalized linear models with an autocovariate term to address spatial correlation, as reviewed by [94, 95].

Nevertheless, considering the strongly autocorrelated nature of our tracking and environmental data sets, we conducted a series of steps compatible with the NPMR framework to investigate autocorrelation. First, we fitted a purely spatial (longitude × latitude) NPMR model to capture the spatial structure in the tracking data and calculated the sample variogram [95, 98] on neighborhood sizes and predictions from both spatial and environmental models using the R package gstat v. 1.1-6 [99] to quantify the overall pattern of autocorrelation (a variogram of the Pearson residuals was of limited use given ours was a binary response). Further, we compared the estimated neighborhood sizes from the spatial and the environmental models to explore how NPMR accounted for autocorrelation in the presence of environmental predictors [94, 95].

Finally, even though NPMR modeling in HyperNiche does not have a formal way to incorporate random effects, we included individual as a categorical covariate to address potential differences in behavior driven by different animals, and offered it to the models with the rest of the environmental predictors.

Results

State-space modeled locations

Of the 1808 locations in the final tracking data set, 418 locations (23.1%) were classified as transiting and 1390 (76.9%) were classified as ARS mode (locations with uncertain behavioral mode classification were not used in the analyses). Blue whale locations occurred along the entire western coast of the USA (Figs. 1b and 2). ARS locations tended to cluster around three areas: Point Conception and the Santa Barbara Channel in southern California, around the Gulf of the Farallones in central California, and between Cape Mendocino and Cape Blanco in northern California and southern Oregon. Locations classified as transiting occurred between the clusters of ARS locations along the coast, as well as in the southern offshore part of the Southern California Bight (Fig. 1b).

The mean distance and speed between SSSM locations, providing an indication of the scales of blue whale daily movement within the CCE during July–November, were 39.9 km and 1.7 km/h (median = 25.7 km and 1.1 km/h, respectively). When computed by behavioral mode, the mean distance and speed for transiting locations was 81.5 km and 3.4 km/h, respectively (median = 78.4 km and 3.3 km/h, respectively), while the mean distance and speed for ARS locations was 27.3 km and 1.1 km/h (median = 19.9 km and 0.8 km/h, respectively).

NPMR modeling results

Consideration of collinearity indicated that SSH and dtSSH were highly correlated (r = 0.86), as were DEPTH and DISTSHELF (r = − 0.78) (Additional file 2: Figure S2). In exploratory single-predictor NPMR models, SSH and DISTSHELF had inferior performance (in terms of logB) than dtSSH and DEPTH, respectively, so we retained dtSSH and DEPTH as a candidate predictors in multiple-predictor models. NORTHNESS was also excluded from multiple-predictor models due to a lack of predictive ability in the single-predictor models. Thus, multiple-predictor models were based on an initial set of eight predictors: DEPTH, SLOPE, EASTNESS, WEKM, dtSSH, SST, dtSST, and CHL.

A free search among this set of predictors identified the four-predictor model CHL × SST × EASTNESS × DEPTH as the best model, while meeting the parsimony requirements. After tuning, this model maximized the log likelihood ratio at logB = 26.14, with each location contributing an average of B_ave = 4% to the likelihood ratio (Table 2). This model’s average neighborhood size was N_ave = 93.63 and the minimum neighborhood size allowed for an estimate was n_min = 23.41 (Table 2). A randomization test for logB based on 100 runs provided evidence that this model performed significantly better than a null model (p-value = 0.01, logB mean = 0.07, logB variance = 0.40). Evaluation of fit for the logB statistic through bootstrap resampling with 100 runs indicated that it was quite stable, 90% of the time falling within the range 16.32–28.48 (the 5th and 95th percentiles, respectively). The median bootstrapped fit was logB = 22.20.

The predictors in this model had tolerances ranging from 0.30 mg/m³ for CHL to 571.97 m for DEPTH, which when scaled by the range of the predictor and expressed as a percentage, varied from 4 to 29% (Table 3). The response variable was most sensitive to changes in CHL (sensitivity = 0.71; i.e., a 20% change in CHL resulted in a 14% change in the likelihood of ARS) and least sensitive to changes in DEPTH (sensitivity = 0.07; Table 3).

Table 3 Characteristics of the predictors in the NPMR models reported in this study for the building (n = 1444) and validation (n = 364) sets. Note that the validation step used the tolerance of the predictors from the building step, while the sensitivity was computed for each model

Full size table

The functional response of ARS likelihood with respect to CHL was described by an optimum shape, with increasing ARS likelihood at CHL levels from 0 to 1.1 mg/m³, followed by a broad peak from 1.1 to 2.5 mg/m³ and then by a slight decrease at higher CHL levels (Fig. 3a). The response to SST was also described by an optimum around 15.5 °C followed by a sharp decrease, although there was an indication of a secondary increase in ARS likelihood between 20 and 22.5 °C (Fig. 3b). The response to EASTNESS was characterized by a simple monotonic gradient, with a slight increase in ARS likelihood in the more eastward-facing slopes (although with a high variability; see Fig. 3c). Finally, the functional response of ARS likelihood to DEPTH was similarly characterized by a monotonic gradient, with highest ARS values at shallow depths (< 850 m), and decreasing values at greater depths (Fig. 3d).

The observed prevalence of ARS behavior in the building set (collected during the positive phase of the NPGO; 1998–2004 and 2007–2008; n = 1444) was 0.83 (Table 4). Model estimates showed widespread high ARS likelihood (above 0.8) from Point Conception in California to northern Washington (Fig. 4a). In southern California, elevated ARS likelihood was restricted to a narrow strip along the coastal margin, while the lowest ARS likelihood consistently occurred in the offshore parts of the Southern California Bight, except around offshore islands and shallow banks, where ARS likelihood was also high (Fig. 4a). In predictor space, this secondary ridge of elevated likelihood of ARS occurred in warmer temperature and shallower water (SST = 20–22.5 °C, DEPTH < 1000 m) than elsewhere in the CCE (Fig. 3b and Additional file 3: Figure S3e).

Table 4 Confusion matrix for the binary conversion of the likelihood of ARS estimated by the environmental NPMR model for the building (n = 1444) and validation sets (n = 364), using the cutoff value that maximized the true skill statistic (TSS_max). FPR is the false positive rate, FNR is the false negative rate, TPR is the true positive rate, and TNR is the true negative rate. The second part of the table reports a set of performance metrics for this binary conversion, including prevalence, accuracy, precision, the area under the receiver operating characteristic curve (AUC, range: 0 to 1 with larger numbers indicating a better fit), the root-mean square error (RMSE, range: 0 to infinity with smaller numbers indicating a better fit), and the Brier score (range: 0 to 1 with lower scores indicating a better calibration of the predictions)

Full size table

Model validation: predicting under different climatic conditions

Validation of the environmental NPMR model using data from years dominated by negative NPGO values (2005 and 2006; n = 364; Fig. 4c and Additional file 1: Figure S1) indicated a similar degree of fit to the building set (B_ave = 4.5%; Table 2) despite the lower log likelihood ratio (logB = 6.39), which reflected the smaller sample size. This model’s average neighborhood size was N_ave = 25.10 and the minimum neighborhood size estimate was n_min = 6.28 (Table 2), both also substantially lower than for the building set. The validation set had a lower observed prevalence than the building set (0.66; Table 4).

The response variable had a similar sensitivity to changes in the predictors in the building model except for SST, which more than doubled in the validation set (sensitivity = 0.58; Table 3). An increased sensitivity to SST suggests that while the animals generally occupied the same habitat during both periods, ARS behavior was influenced by the warmer conditions during the negative phase of the NPGO. This was evident in the generalized decrease in likelihood of ARS throughout the study area, particularly in the southern half (between Point Conception and Cape Mendocino; Fig. 4c).

Performance of binary conversion of predictions

The decrease in estimated ARS likelihood estimates for the validation set resulted in substantially different cutoffs for the optimal binary conversion into estimated presence or absence of ARS behavior between the building and validation sets (0.84 versus 0.61, respectively; Table 4 and Additional file 4: Figure S4a,). The predicted prevalence was lower than observed for the building set and higher than observed for the validation set (0.57 and 0.85, respectively; Table 4). The errors in classifying true presences and true absences were very similar (FNR = 0.38 and FPR = 0.34, respectively; Table 4) for the building set. In contrast, for the validation set the error in classifying true absences was high, while the error in classifying true presences was very low (FPR = 0.73 and FNR = 0.09, respectively; Table 4). Consequently, accuracy was slightly higher for the validation set than for the building set (0.69 and 0.63, respectively; Table 4), while precision was substantially higher for the building set than for the validation set (0.90 and 0.70, respectively; Table 4). AUC was higher for the building set than for the validation set (0.69 versus 0.57, respectively; Table 5 and Additional file 4: Figure S4b), while both RMSE and the Brier score were lower for the building set than for the validation set (RMSE 0.36 versus 0.47, respectively; Brier score = 0.13 versus 0.22, respectively; Table 4), in all cases indicating better classification performance for the building set.

Table 5 Confusion matrix for the binary conversion of the likelihood of ARS estimated by the spatial coordinates NPMR model for the building (n = 1444) and validation sets (n = 364), using the cutoff value that maximized the true skill statistic (TSS_max). FPR is the false positive rate, FNR is the false negative rate, TPR is the true positive rate, and TNR is the true negative rate. The second part of the table reports a set of performance metrics for this binary conversion, including prevalence, accuracy, precision, the area under the receiver operating characteristic curve (AUC, range: 0 to 1 with larger numbers indicating a better fit), the root-mean square error (RMSE, range: 0 to infinity with smaller numbers indicating a better fit), and the Brier score (range: 0 to 1 with lower scores indicating a better calibration of the predictions)

Full size table

The pattern of false negatives (i.e., ARS activity not captured by the model) in the building set was extensive throughout the study area, while the pattern of false positives (i.e., observed transiting locations that were classified as ARS by the model) was much sparser (Fig. 4b). In contrast, the proportion of false positives in the validation set was relatively high and widespread, while false negatives were very few (Fig. 4b).

Spatial model for the autocorrelation in the tracking data

The log likelihood ratio of the tuned spatial coordinates (longitude × latitude) NPMR model fitted to the building set was twice that of the environmental model (logB = 52.20), while both the average neighborhood size and minimum allowed neighborhood size were 88% larger (N_ave = 175.80 and n_min = 43.95, respectively; Table 2). Each location in the spatial model contributed an average of B_ave = 3% to the likelihood ratio (Table 2). A randomization test for logB based on 100 runs provided evidence that this model performed significantly better than a null model (p-value = 0.01, logB mean = − 0.24, variance = 0.19). Evaluation of fit for the logB statistic through bootstrap resampling with 100 runs indicated that it was quite stable, 90% of the time falling within the range 35.32–56.19 (the 5th and 95th percentiles, respectively). The median bootstrapped fit was logB = 44.09 (logB mean = 44.92, logB variance = 35.08; N_ave mean = 265.18, N_ave variance = 197.17). Despite strong differences between the results of the spatial and environmental models, the binary conversion returned similar metrics of predictive performance for the two models when fitted to the building set (Tables 4 and 5, Additional file 5: Figure S5).

The tolerance value for longitude was about one fourth that of latitude (0.26 and 1.02 degrees, respectively), which when scaled by their respective ranges and expressed as a percentage, tolerances were 3% for longitude and 6% for latitude (Table 3), indicating a 1:2 scale of spatial anisotropy in longitude:latitude for the likelihood of ARS as a reflection of the more rapid decrease in ARS activity in the cross-shore direction (i.e., longitude) with increasing depth and distance from shore than in the alongshore direction. The sensitivity of the response variable to changes in latitude was correspondingly lower than it was to changes in longitude (sensitivity = 0.32 and 1.21, respectively; Table 3).

The functional responses of ARS likelihood to longitude and latitude were driven by the density of observations (Additional file 6: Figure S6), which were highly concentrated in three regions of predictor space (longitude = 125–124.5°W, 123.8–123°W, 121–119.5°W; latitude = 33–35°N, 37.2–39.5°N, 41–44°N), corresponding to an estimated high ARS likelihood in the areas around Point Conception and the Santa Barbara Channel in southern California, the Gulf of the Farallones in central California, and between Cape Mendocino and Cape Blanco in northern California and southern Oregon (Fig. 5a). The pattern of true positives and true negatives in this model’s binary classification was similarly clumped around areas of high and low likelihood of ARS, respectively (Fig. 5b, Table 5).

Strong peaks in neighborhood size in these areas of high ARS likelihood (Additional file 7: Figure S7, Additional file 8: Figure S8a and S8d) further indicated that these observations were very similar in geographic space as a result of the clumped pattern of ARS behavior in the tracking data (Fig. 1b). Therefore, the spatial coordinates NPMR model captured the pattern and scale of autocorrelation inherent in the response. In contrast, the peaks in neighborhood size and the clumping in the predictions of ARS likelihood were largely absent in the environmental NPMR model (Additional file 7: Figure S7, Additional file 8: Figure S8b and S8e), and the differences in these variables between the two models highlighted the areas where the effects of autocorrelation in the spatial model were stronger (Additional file 8: Figure S8c and S8f).

The variogram of neighborhood size for the spatial coordinates NPMR model showed a cyclical pattern with strong autocorrelation at lags between 20 and 140 km, and secondary regions of autocorrelation between 230 and 330 km, and at 450 km, corresponding to the distances separating the three main areas of high likelihood of ARS (Fig. 6a and Additional file 8: Figure S8a and S8d). In contrast, the variogram of neighborhood size for the environmental NPMR model had much lower levels of semivariance and showed no evidence of autocorrelation at any scale (Fig. 6a). Not surprisingly, the variogram of the predicted response (ARS likelihood) for the spatial model contained the same scales of autocorrelation as neighborhood size (Fig. 6c), while the variogram of ARS likelihood for the environmental model indicated some remaining autocorrelation at scales below 100 km (attributable to measurement error or unexplained latent processes) and above 350 km (attributable to the distances separating predicted areas of high ARS likelihood by the environmental model), although the levels of semivariance were about half of those for the spatial model) (Fig. 6c).

Finally, the spatial coordinates model had a slightly better performance than the environmental model when both were fitted to the validation set (logB = 10.75 versus 6.39, B_ave = 8% versus 4%; Table 2). The performance metrics for the binary conversion were also slightly better for the spatial coordinates model than for the environmental model when both were fitted to the validation set (Tables 4 and 5). The variogram of neighborhood size for the spatial coordinates model showed a shift toward large-scale autocorrelation (lag distances of up to 300 km; Fig. 6b), while the levels of semivariance for the environmental model remained flat (Fig. 6b). The variograms of ARS likelihood showed that the patterns of cyclical autocorrelation observed in the building set were mostly absent from both the spatial coordinates and the environmental models when fitted to the validation set, with a slight indication of a shift toward large-scale autocorrelation (Fig. 6d).

Discussion

Existing SDMs of blue whale population density in the CCE have been built on a variety of environmental predictors. Survey-based SDMs have included various combinations of dynamic (mixed layer depth, wind speed, sea surface salinity, SST, CHL, SSH, standard deviation of SSH) and static predictors (seafloor depth, standard deviation of seafloor depth as a proxy for slope, distance to the shelf edge) [25, 27, 50, 51]. A telemetry-based SDM of blue whale density built on the same tracking data used in this study similarly included SST, CHL, SSH standard deviation, DEPTH, and DEPTH standard deviation as predictors [26]. Although with differences in product source, temporal coverage, or resolution, all these variables variously capture aspects of surface and subsurface dynamic processes relating to upwelling and enhanced primary productivity leading to krill, while the static variables describe geomorphic features that further favor krill aggregation. Indeed, our environmental NPMR model of ARS likelihood contained a similar (but smaller) set of predictors (CHL, SST, EASTNESS, and DEPTH), perhaps because the previous models have included offshore environments (where some foraging behavior does occur; see Fig. 1b), while our model focused on the coastal environment (where blue whales forage most intensively; see Fig. 1b). Nevertheless, it is thus clear that prediction of both blue whale population density and behavioral state in the CCE requires a combination of static and dynamic variables.

The pattern of high ARS likelihood estimated by our environmental model along the coast was consistent with known regions of high whale density predicted by survey-based SDMs [25, 27, 51], as well as with the recently designated Biologically Important Areas for feeding blue whales in the CCE based on independent observations, including those associated with islands and shallow banks in offshore part of the Southern California Bight [100]. A potential region of discrepancy occurred in nearshore waters off Oregon and Washington, where our model predicted high ARS likelihood (and the telemetry-based SDM of [26] predicted high density), while survey-based SDMs predicted low whale density [25, 27, 51]. However, there were relatively few SSSM locations in this region (partly because of tag attrition, as tags were deployed in southern and central California), suggesting that this northern sector of the CCE represents favorable foraging habitat, albeit for fewer whales than in the southern sector. Recently, [19] reported important submarine canyon habitat and krill aggregations north of 45°N off the Washington coast, so additional survey and tagging efforts are needed in this region to clarify blue whale habitat use in the northern CCE, particularly given known population heterogeneity [101] and foraging site fidelity [102] among ENP blue whales.

These patterns can arise because krill distribution is not uniform along the coast, as there are multiple factors that influence its abundance, including limited supply of macro- and micronutrients required to fuel the food chain in certain areas [103,104,105]. In addition, strong surface currents produce additional patchiness by concentrating or dispersing zooplankton [106], while the presence of submarine canyons further determines important krill hotspots in the CCE [19]. Additionally, while in coastal waters blue whales show high prey selectivity, favoring the larger species Thysanoessa spinifera even when other krill species are present or dominant [12, 107].

The generalized decrease in likelihood of ARS during the negative phase of the NPGO (2005–2006), particularly south of Cape Mendocino, was the result of a decrease in neighborhood size (N_ave = 93.63 versus 25.10; see Table 2). This, in turn, was driven by a shift toward large-scale autocorrelation in the tracking data during this period (Fig. 6b and d), likely reflecting increased transiting and reduced ARS behavior, as evidenced by the contrast between the predictions of the spatial model for the positive and negative phases of the NPGO (see Fig. 5a and c). The slightly better performance of the spatial coordinates model over the environmental model during the negative phase of the NPGO suggests that blue whales exhibit strong foraging site fidelity, even when conditions are not conducive to successful foraging, as has been demonstrated elsewhere [27, 102]. Given that decadal-scale environmental fluctuations have been hypothesized to drive observed large-scale distributional shifts in ENP blue whales [89], our results beg the question: how long does it take for blue whales to abandon formerly reliable foraging hotspots under different climatic regimes?

Although we found that the strong autocorrelation inherent in the tracking data was implicitly addressed by the environmental NPMR model, the variograms indicated that the predictions contained some level of autocorrelation at scales below 100 km, attributable to measurement error or unexplained latent processes [98]. The daily SSSM locations indicated that while in the CCE, blue whales cover typical distances of 20 km while engaged in ARS and 78 km while in transiting, so it is possible that Argos telemetry data do not adequately resolve whale movements and behavioral states around smaller-scale features. Therefore, future studies should also conduct direct validation of blue whale behavioral states and their ecological correlates through the use of electronic tags with onboard sensors that detect individual feeding events and environmental conditions [108,109,110].

Despite these advances, however, habitat models will remain a valuable tool for testing and elucidating species-environment relationships within and across ecosystems [51, 111]. Given the basin-scale movements of these animals and their changing behavioral context, our models addressed an ambiguity that often arises in density SDMs: if animals are sighted in an area, are they functionally present or is the habitat irrelevant? However, since neither of these modeling approaches can simultaneously estimate both density and foraging probability, further work is needed to develop modeling approaches that integrate distributional and behavioral data, while also incorporating the various sources of uncertainty [44, 111,112,113,114,115].

Conclusions

In this study, we have identified the most important large-scale environmental correlates of blue whale behavioral state in the coastal environments of the CCE. The predicted response indicated that ARS was generally consistent with foraging activities in the most biologically productive conditions and where blue whales can be expected to be most commonly found. We conclude that the environment, specifically phytoplankton chlorophyll-a levels, water temperature, and seafloor aspect and depth, have quantifiable effects on the movement behavior of blue whales, most likely through indirect effects on their prey. Our ecosystem-wide characterization of blue whale foraging and its drivers can be useful information in management considerations seeking to mitigate ship strikes and other anthropogenic interactions (such as entanglement in fishing gear [33]), for example through the identification of key regions of importance (and their variability in time) for this endangered species. An improved understanding of these species-environment relationships in the context of natural climatic oscillations and foraging site fidelity will also aid in better predicting the effects of climate change on the CCE ecosystem and the animal populations it supports [52, 53]. Finally, further work to integrate behavioral and distribution models for wide-ranging ocean predators such as blue whales will lead to a more complete quantification of their ecology and the risk from anthropogenic activities.

Availability of data and materials

The data used in this study are available on Movebank (movebank.org, study name “Blue whales Eastern North Pacific 1993-2008 - Argos Data”) and are published in the Movebank Data Repository under a Creative Commons Zero license [116].

Abbreviations

ARS:: Area-restricted searching
AUC:: Area under the receiver operating characteristic curve
BMODE:: Behavioral mode
CCE:: California Current Ecosystem
CHL:: Phytoplankton chlorophyll-a levels
CMEMS:: Copernicus Marine and Environment Monitoring Service
DEPTH:: Seafloor depth
DISTSHELF:: Distance to the 200-m isobath
dtSSH:: Spatially detrended sea surface height
dtSST:: Spatially detrended sea surface temperature
EASTNESS:: Eastness component of slope aspect
EEZ:: Exclusive Economic Zone
ENP:: Eastern North Pacific
ERDDAP:: Environmental Research Division Data Access Program
NASA/JPL:: National Atmospheric and Space Administration, Jet Propulsion Laboratory
NOAA/NCEI:: National Oceanic and Atmospheric Administration, National Centers for Environmental Information
NORTHNESS:: Northness component of slope aspect
NPGO:: North Pacific Gyre Oscillation
NPMR:: Nonparametric multiplicative regression
RMSE:: Root-mean square error
RSS:: Remote Sensing Systems
SDM:: Species distribution models
SLOPE:: Seafloor slope
SSH:: Sea surface height
SSSM:: Bayesian switching state-space model
SST:: Sea surface temperature
TNR:: True negative rate
TPR:: True positive rate
TSS:: True skill statistic
UCSD/SIO:: University of California San Diego, Scripps Institution of Oceanography
USA:: United States of America
WEKM:: Vertical upwelling velocity (or Ekman pumping)

References

Carr M-E, Kearns EJ. Production regimes in four eastern boundary current systems. Deep-Sea Res II. 2003;50:3199–221.
Article CAS Google Scholar
Chavez FP, Messié M. A comparison of eastern boundary upwelling ecosystems. Progr Oceanogr. 2009;83:80–96.
Article Google Scholar
Checkley DM, Barth JA. Patterns and processes in the California Current System. Progr Oceanogr. 2009;83:49–64.
Article Google Scholar
Benson SR, Forney KA, Harvey JT, Carretta JV, Dutton PH. Abundance, distribution, and habitat of leatherback turtles (Dermochelys coriacea) off California, 1990–2003. Fish Bull. 2007;105:337–47.
Google Scholar
Barlow J, Forney KA. Abundance and population density of cetaceans in the California Current Ecosystem. Fish Bull. 2002;105:509–26.
Google Scholar
Nur N, Jahncke J, Herzog M, Howar J, Hyrenbach KD, Zamon JE, et al. Where the wild things are: predicting hotspots of seabird aggregations in the California Current System. Ecol Apps. 2011;21:2241–57.
Article Google Scholar
Wingfield DK, Peckham SH, Foley DG, Palacios DM, Lavaniegos BE, Durazo R, et al. The making of a productivity hotspot in the coastal ocean. PLoS One. 2011;6(11):e27874.
Article CAS PubMed PubMed Central Google Scholar
Larkman VE, Veit RR. Seasonality and abundance of blue whales off southern California. CalCOFI Rep. 1998;39:236–9.
Google Scholar
Burtenshaw JC, Oleson EM, Hildebrand JA, McDonald MA, Andrew RK, Howe BM, et al. Acoustic and satellite remote sensing of blue whale seasonality and habitat in the Northeast Pacific. Deep-Sea Res II. 2004;51:967–86.
Article Google Scholar
Oleson EM, Calambokidis J, Barlow J, Hildebrand JA. Blue whale visual and acoustic encounter rates in the southern California bight. Mar Mam Sci. 2007;23:574–97.
Article Google Scholar
Schoenherr JR. Blue whales feeding on high concentrations of euphausiids around Monterey submarine canyon. Can J Zool. 1991;69:583–94.
Article Google Scholar
Fiedler P, Reilly S, Hewitt R, Demer D, Philbrick V, Smith S, et al. Blue whale habitat and prey in the California Channel Islands. Deep-Sea Res II. 1998;45:1781–801.
Article Google Scholar
Croll D, Marinovic B, Benson S, Chavez F, Black N, Ternullo R, et al. From wind to whales: trophic links in a coastal upwelling system. Mar Ecol Progr Ser. 2005;289:117–30.
Article Google Scholar
Mangel M, Marinovic B, Pomeroy C, Croll D. Requiem for ricker: unpacking MSY. Bull Mar Sci. 2002;70:763–81.
Google Scholar
Ish T, Dick EJ, Switzer PV, Mangel M. Environment, krill and squid in the Monterey Bay: from fisheries to life histories and back again. Deep-Sea Res II. 2004;51:849–62.
Article Google Scholar
Graham WM, Largier JL. Upwelling shadows as nearshore retention sites: the example of northern Monterey Bay. Cont Shelf Res. 1997;17:509–32.
Article Google Scholar
Genin A. Bio-physical coupling in the formation of zooplankton and fish aggregations over abrupt topographies. J Mar Sys. 2004;50:3–20.
Article Google Scholar
Allen SE, Hickey BM. Dynamics of advection-driven upwelling over a shelf break submarine canyon. J Geophys Res. 2010;115(C8):C08018–20.
Article Google Scholar
Santora JA, Zeno R, Dorman JG, Sydeman WJ. Submarine canyons represent an essential habitat network for krill hotspots in a large marine ecosystem. Sci Rep. 2018;8:7579.
Article PubMed PubMed Central CAS Google Scholar
Mackas DL, Kieser R, Saunders M, Yelland DR, Brown RM, Moore DF. Aggregation of euphausiids and Pacific hake (Merluccius productus) along the outer continental shelf off Vancouver Island. Can J Fish Aquat Sci. 1997;54:2080–96.
Article Google Scholar
Croll DA, Tershy BR, Hewitt RP, Demer DA, Fiedler PC, Smith SE, et al. An integrated approach to the foraging ecology of marine birds and mammals. Deep-Sea Res II. 1998;45:1353–71.
Article Google Scholar
Santora JA, Sydeman WJ, Schroeder ID, Wells BK, Field JC. Mesoscale structure and oceanographic determinants of krill hotspots in the California Current: implications for trophic transfer and conservation. Progr Oceanogr. 2011;91:397–409.
Article Google Scholar
Dorman JG, Sydeman WJ, García-Reyes M, Zeno RA, Santora JA. Modeling krill aggregations in the Central-Northern California Current. Mar Ecol Progr Ser. 2015;528:87–99.
Article Google Scholar
Irvine LM, Mate BR, Winsor MH, Palacios DM, Bograd SJ, Costa DP, et al. Spatial and temporal occurrence of blue whales off the U.S. West Coast, with implications for management. PLoS One. 2014;9:e102959.
Article PubMed PubMed Central Google Scholar
Becker E, Forney K, Fiedler P, Barlow J, Chivers S, Edwards C, et al. Moving towards dynamic ocean management: how well do modeled ocean products predict species distributions? Remote Sens. 2016;8:149–26.
Article Google Scholar
Hazen EL, Palacios DM, Forney KA, Howell EA, Becker E, Hoover AL, et al. WhaleWatch: a dynamic management tool for predicting blue whale density in the California Current. J Appl Ecol. 2016;54:1415–28.
Article Google Scholar
Becker EA, Forney KA, Redfern JV, Barlow J, Jacox M, Roberts JJ, et al. Predicting cetacean abundance and distribution in a changing climate. Diversity Distrib. 2018;43:459–18.
Google Scholar
Berman-Kowalewski M, Gulland FMD, Wilkin S, Calambokidis J, Mate B, Cordaro J, et al. Association between blue whale (Balaenoptera musculus) mortality and ship strikes along the California coast. Aquat Mamm. 2010;36:59–66.
Article Google Scholar
Redfern JV, McKenna MF, Moore TJ, Calambokidis J, DeAngelis ML, Becker EA, et al. Assessing the risk of ships striking large whales in marine spatial planning. Cons Biol. 2013;27:292–302.
Article CAS Google Scholar
McKenna MF, Calambokidis J, Oleson EM, Laist DW, Goldbogen JA. Simultaneous tracking of blue whales and large ships demonstrates limited behavioral responses for avoiding collision. Endang Species Res. 2015;27:219–32.
Article Google Scholar
Rockwood RC, Calambokidis J, Jahncke J. High mortality of blue, humpback and fin whales from modeling of vessel collisions on the U.S. West Coast suggests population impacts and insufficient protection. PLoS One. 2017;12(8):e0183052–24.
Article PubMed PubMed Central CAS Google Scholar
Calambokidis J, Barlow J. Updated abundance estimates of blue and humpback whales off the US West Coast incorporating photo-identifications from 2010 and 2011. Final Report for contract AB-133F-10-RP-0106. PSRG-2013-13R. 2013. http://www.cascadiaresearch.org/publications/updated-abundance-estimates-blue-and-humpback-whales-us-west-coast-incorporating-photo. Accessed 22 May 2018.
National Marine Fisheries Service. Draft recovery plan for the blue whale (Balaenoptera musculus) - revision. Silver Spring: National Marine Fisheries Service, Office of Protected Resources; 2018. https://www.fisheries.noaa.gov/resource/document/draft-recovery-plan-blue-whale-balaenoptera-musculus
Google Scholar
Parks SE, Warren JD, Stamieszkin K, Mayo CA, Wiley D. Dangerous dining: surface foraging of North Atlantic right whales increases risk of vessel collisions. Biol Lett. 2012;8:57–60.
Article PubMed Google Scholar
Goldbogen JA, Southall BL, DeRuiter SL, Calambokidis J, Friedlaender AS, Hazen EL, et al. Blue whales respond to simulated mid-frequency military sonar. Proc R Soc B. 2013;280:20130657.
Article PubMed Google Scholar
Blair HB, Merchant ND, Friedlaender AS, Wiley DN, Parks SE. Evidence for ship noise impacts on humpback whale foraging behaviour. Biol Lett. 2016;12:20160005.
Article PubMed PubMed Central Google Scholar
Lesage V, Omrane A, Doniol-Valcroze T, Mosnier A. Increased proximity of vessels reduces feeding opportunities of blue whales in the St. Lawrence estuary, Canada. Endang Species Res. 2017;32:351–61.
Article Google Scholar
Redfern JV, Hatch LT, Caldow C, DeAngelis ML, Gedamke J, Hastings S, et al. Assessing the risk of chronic shipping noise to baleen whales off Southern California, USA. Endang Species Res. 2017;32:153–67.
Article Google Scholar
Dransfield A, Hines E, McGowan J, Holzman B, Nur N, Elliott M, et al. Where the whales are: using habitat modeling to support changes in shipping regulations within National Marine Sanctuaries in Central California. Endang Species Res. 2014;26:39–57.
Article Google Scholar
Jonsen I, Flemming J, Myers R. Robust state-space modeling of animal movement data. Ecology. 2005;86:2874–80.
Article Google Scholar
Jonsen ID, Basson M, Bestley S, Bravington MV, Patterson TA, Pedersen MW, et al. State-space models for bio-loggers a methodological road map. Deep-Sea Res II. 2013;88–89:34–46.
Article Google Scholar
Bailey H, Thompson P. Quantitative analysis of bottlenose dolphin movement patterns and their relationship with foraging. J Anim Ecol. 2006;75:456–65.
Article PubMed Google Scholar
Jonsen I, Myers R, James M. Identifying leatherback turtle foraging behaviour from satellite telemetry using a switching state-space model. Mar Ecol Progr Ser. 2007;337:255–64.
Article Google Scholar
Bestley S, Jonsen ID, Hindell MA, Guinet C, Charrassin JB. Integrative modelling of animal movement: incorporating in situ habitat and behavioural information for a migratory marine predator. Proc R Soc B. 2012;280:20122262.
Article PubMed Google Scholar
Beatty WS, Jay CV, Fischbach AS. An evaluation of behavior inferences from Bayesian state-space models: a case study with the Pacific walrus. Mar Mam Sci. 2016;32:1299–318.
Article Google Scholar
Carter MID, Bennett KA, Embling CB, Hosegood PJ, Russell DJF. Navigating uncertain waters: a critical review of inferring foraging behaviour from location and dive data in pinnipeds. Mov Ecol. 2016;4:25.
Article PubMed PubMed Central Google Scholar
Trudelle L, Cerchio S, Zerbini AN, Geyer Y, Mayer F-X, Jung J-L, et al. Influence of environmental parameters on movements and habitat utilization of humpback whales (Megaptera novaeangliae) in the Madagascar breeding ground. R Soc Open Sci. 2016;3:160616–22.
Article PubMed PubMed Central Google Scholar
Thums M, Waayers D, Huang Z, Pattiaratchi C, Bernus J, Meekan M. Environmental predictors of foraging and transit behaviour in flatback turtles Natator depressus. Endang Species Res. 2017;32:333–49.
Article Google Scholar
Andrews-Goff V, Bestley S, Gales NJ, Laverick SM, Paton D, Polanowski AM, et al. Humpback whale migrations to Antarctic summer foraging grounds through the Southwest Pacific Ocean. Sci Rep. 2018;8:767.
Article CAS Google Scholar
Pardo MA, Gerrodette T, Beier E, Gendron D, Forney KA, Chivers SJ, et al. Inferring cetacean population densities from the absolute dynamic topography of the ocean in a hierarchical Bayesian framework. PLoS One. 2015;10:e0120727.
Article PubMed PubMed Central CAS Google Scholar
Redfern JV, Moore TJ, Fiedler PC, de Vos A, Brownell RL Jr, Forney KA, et al. Predicting cetacean distributions in data-poor marine ecosystems. Diversity Distrib. 2017;23:394–408.
Article Google Scholar
García-Reyes M, Sydeman WJ, Schoeman DS, Rykaczewski RR, Black BA, Smit AJ, et al. Under pressure: climate change, upwelling, and eastern boundary upwelling ecosystems. Front Mar Sci. 2015;2:109.
Article Google Scholar
Hazen EL, Jorgensen S, Rykaczewski RR, Bograd SJ, Foley DG, Jonsen ID, et al. Predicted habitat shifts of Pacific top predators in a changing climate. Nat Clim Change. 2013;3:234–8.
Article Google Scholar
Mate BR, Lagerquist BA, Calambokidis J. Movements of North Pacific blue whales during the feeding season off southern California and their southern fall migration. Mar Mam Sci. 1999;15:1246–57.
Article Google Scholar
Mate BR, Mesecar R, Lagerquist B. The evolution of satellite-monitored radio tags for large whales: one laboratory’s experience. Deep-Sea Res II. 2007;54:224–47.
Article Google Scholar
Bailey H, Mate BR, Palacios DM, Irvine L, Bograd SJ, Costa DP. Behavioural estimation of blue whale movements in the Northeast Pacific from state-space model analysis of satellite tracks. Endang Species Res. 2009;10:93–106.
Article Google Scholar
R Core Team. R: a language and environment for statistical computing. Vienna: R Foundation for Statistical Computing; 2016. https://www.R-project.org/
Google Scholar
Claus S, De Hauwere N, Vanhoorne B, Deckers P, Souza Dias F, Hernandez F, et al. Marine regions: towards a global standard for georeferenced marine names and boundaries. Mar Geod. 2014;37:99–125.
Article Google Scholar
Flanders Marine Institute. Maritime Boundaries Geodatabase: Maritime Boundaries and Exclusive Economic Zones (200NM), version 9. 2016. https://doi.org/10.14284/242. Accessed 25 November 2016.
Marine Regions. http://www.marineregions.org/eezdetails.php?mrgid=8456 . Accessed 25 Nov 2016.
Simons RA. ERDDAP. NOAA/NMFS/SWFSC/ERD. Monterey, CA. 2017. https://coastwatch.pfeg.noaa.gov/erddap . Accessed 20 Sep 2016.
Mendelssohn R. xtractomatic: Accessing Environmental Data from ERD’s ERDDAP Server. R package version 3.0.0. 2015. https://cran.r-project.org/package=xtractomatic. Accessed 10 May 2016.
Lemos RT, Sansó B. Conditionally linear models for non-homogeneous spatial random fields. Stat Met. 2012;9:275–84.
Article Google Scholar
Bakun A, Parrish RH. Turbulence, transport, and pelagic fish in the California and Peru current systems. CalCOFI Rep. 1982;23:99–112.
Google Scholar
Becker JJ, Sandwell DT, Smith W, Braud J, Binder B, Depner J, et al. Global bathymetry and elevation data at 30 arc seconds resolution: SRTM30_PLUS. Mar Geod. 2009;32:355–71.
Article Google Scholar
Satellite Geodesy. Global Topography. SRTM30_PLUS: SRTM30, coastal and ridge multibeam, estimated topography. https://topex.ucsd.edu/WWW_html/srtm30_plus.html. Accessed 25 Nov 2016.
Gonzalez RC, Woods RE. Digital image processing. 2nd ed. Upper Saddle River: Prentice Hall; 2002.
Google Scholar
Olaya V. Basic land-surface parameters. In: Hengl T, Reuter HI, editors. Geomorphometry: concepts, software, applications. Developments in soil science, volume 33. Amsterdam: Elsevier; 2009. p. 141–69.
Chapter Google Scholar
ETOPO2 v.2 g 2-minute Gridded Global Relief Data. https://doi.org/10.7289/V5J1012Q. Accessed 20 Sep 2016.
Ducet N, Le Traon PY, Reverdin G. Global high-resolution mapping of ocean circulation from TOPEX/Poseidon and ERS-1 and -2. J Geophys Res. 2000;105(C8):19477–98.
Article Google Scholar
Copernicus Marine Environment Monitoring Service. http://marine.copernicus.eu. Accessed 20 Sep 2016.
Pond S, Pickard GL. Introductory dynamical oceanography. 2nd ed. Oxford: Butterworth- Heinemann Ltd; 1983.
Google Scholar
Risien CM, Chelton DB. A global climatology of surface wind and wind stress fields from eight years of QuikSCAT scatterometer data. J Phys Oceanogr. 2008;38:2379–413.
Article Google Scholar
NASA Jet Propulsion Laboratory. Winds. http://winds.jpl.nasa.gov/. Accessed 20 Sep 2016.
Remote Sensing Systems. QuikScat / SeaWinds. http://www.remss.com/missions/qscat. Accessed 20 Sep 2016.
Casey KS, Brandon TB, Cornillon P, Evans R. The past, present and future of the AVHRR pathfinder SST program. In: Barale V, Gower JFR, Alberotanza L, editors. Oceanography from space: revisited. London: Springer; 2010. p. 273–87.
Chapter Google Scholar
AVHRR Pathfinder Project. https://www.nodc.noaa.gov/sog/pathfinder4km/. Accessed 20 Sep 2016.
Maritorena S, Siegel DA. Consistent merging of satellite ocean color data sets using a bio-optical model. Remote Sens Environ. 2005;94:429–40.
Article Google Scholar
GlobColour Project. http://www.globcolour.info. Accessed 20 Sep 2016.
Dormann CF, Elith J, Bacher S, Buchmann C, Carl G, Carré G, et al. Collinearity: a review of methods to deal with it and a simulation study evaluating their performance. Ecography. 2012;36:27–46.
Article Google Scholar
McCune B, Mefford. HyperNiche. Nonparametric multiplicative habitat modeling. Version 2.30. Gleneden Beach: MjM Software; 2009.
Google Scholar
McCune B. Non-parametric habitat models with automatic interactions. J Veg Sci. 2006;17:819–30.
Article Google Scholar
McCune B. Nonparametric multiplicative regression for habitat modeling; 2011. http://www.pcord.com/NPMRintro.pdf. Accessed 10 May 2016
Lintz HE, McCune B, Gray AN, McCulloh KA. Quantifying ecological thresholds from response surfaces. Ecol Model. 2011;222:427–36.
Article Google Scholar
Nelson PR, Joly K, Roland C, McCune B. Evaluating relocation extent versus covariate resolution in habitat selection models across spatiotemporal scales. Eco Inform. 2018;48:245–56.
Article Google Scholar
Oreskes N, Shrader-Frechette K, Belitz K. Verification, validation, and confirmation of numerical models in the earth sciences. Science. 1994;263:641–6.
Article CAS PubMed Google Scholar
Guisan A, Zimmermann NE. Predictive habitat distribution models in ecology. Ecol Model. 2000;135:147–86.
Article Google Scholar
Bennett ND, Croke BFW, Guariso G, Guillaume JHA, Hamilton SH, Jakeman AJ, et al. Characterising performance of environmental models. Env Model Software. 2013;40:1–20.
Article Google Scholar
Calambokidis J, Barlow J, Ford JKB, Chandler TE, Douglas AB. Insights into the population structure of blue whales in the eastern North Pacific from recent sightings and photographic identification. Mar Mamm Sci. 2009;25(4):816–32.
Article Google Scholar
Di Lorenzo E, Schneider N, Cobb KM, Chhak K, Franks PJS, Miller AJ, et al. North Pacific gyre oscillation links ocean climate and ecosystem change. Geophys Res Lett. 2008;35:L08607.
Article Google Scholar
Allouche O, Tsoar A, Kadmon R. Assessing the accuracy of species distribution models: prevalence, kappa and the true skill statistic (TSS). J Appl Ecol. 2006;43:1223–32.
Article Google Scholar
Sing T, Sander O, Beerenwinkel N, Lengauer T. ROCR: visualizing classifier performance in R. Bioinformatics. 2005;21:7881.
Article CAS Google Scholar
Brier G. Verification of forecasts expressed in terms of probabilities. Mon Weather Rev. 1950;78:1–3.
Article Google Scholar
Dormann CF. Effects of incorporating spatial autocorrelation into the analysis of species distribution data. Glob Ecol Biogeogr. 2007;16(2):129–38.
Article Google Scholar
Dormann CF, McPherson JM, Araújo MB, Bivand R, Bolliger J, Carl G, et al. Methods to account for spatial autocorrelation in the analysis of species distributional data: a review. Ecography. 2007;30(5):609–28.
Article Google Scholar
Aarts G, MacKenzie M, McConnell B, Fedak M, Matthiopoulos J. Estimating space-use and habitat preference from wildlife telemetry data. Ecography. 2008;31:140–60.
Article Google Scholar
Dray S, Royer-Carenzi M, Calenge C. The exploratory analysis of autocorrelation in animal-movement studies. Ecol Res. 2010;25:673–81.
Article Google Scholar
Beale CM, Lennon JJ, Yearsley JM, Brewer MJ, Elston DA. Regression analysis of spatial data. Ecol Lett. 2010;13(2):246–64.
Article PubMed Google Scholar
Pebesma EJ. Multivariable geostatistics in S: the gstat package. Comput Geosci. 2004;30:683–91.
Article Google Scholar
Calambokidis J, Steiger GH, Curtice C, Harrison J, Ferguson MC, Becker E, et al. 4. Biologically important areas for selected cetaceans within U.S. waters – West Coast region. Aquat Mamm. 2015;41:39–53.
Article Google Scholar
Calambokidis J, Barlow J. Abundance of blue and humpback whales in the eastern North Pacific estimated by capture‐recapture and line‐transect methods. Mar Mamm Sci. 2004;20:63–85.
Article Google Scholar
Abrahms B, Hazen EL, Aikens EO, Savoca MS, Goldbogen JA, Bograd SJ, et al. Memory and resource tracking drive blue whale migrations. Proc Natl Acad Sci U.S.A. 2019;20:201819031–6.
Google Scholar
Hutchins DA, DiTullio GR, Zhang Y, Bruland KW. An iron limitation mosaic in the California upwelling regime. Limnol Oceanogr. 1998;43:1037–54.
Article CAS Google Scholar
Chase Z, Strutton PG, Hales B. Iron links river runoff and shelf width to phytoplankton biomass along the U.S. west coast. Geophys Res Lett. 2007;34:L04607.
Article Google Scholar
Palacios DM, Hazen EL, Schroeder ID, Bograd SJ. Modeling the temperature-nitrate relationship in the coastal upwelling domain of the California current. J Geophys Res. 2013;118:3223–39.
Article CAS Google Scholar
Messié M, Chavez FP. Nutrient supply, surface currents, and plankton dynamics predict zooplankton hotspots in coastal upwelling systems. Geophys Res Lett. 2017;61:478.
Google Scholar
Nickels CF, Sala LM, Ohman MD. The morphology of euphausiid mandibles used to assess selective predation by blue whales in the southern sector of the California current system. J Crust Biol. 2018;38(5):563–73.
Article Google Scholar
Mate BR, Irvine LM, Palacios DM. The development of an intermediate-duration tag to characterize the diving behavior of large whales. Ecol Evol. 2016;7:585–95.
Article PubMed PubMed Central Google Scholar
DeRuiter SL, Langrock R, Skirbutas T, Goldbogen JA, Calambokidis J, Friedlaender AS, et al. A multivariate mixed hidden Markov model for blue whale behaviour and responses to sound exposure. Ann Appl Stat. 2017;11:362–92.
Article Google Scholar
Irvine LM, Palacios DM, Mate BR. Feeding behavior varies by sex and patch quality in blue and fin whales off southern California, USA. Front Ecol Evol. 2019:452396.
Palacios DM, Baumgartner MF, Laidre KL, Gregr EJ. Beyond correlation: integrating environmentally and behaviourally mediated processes in models of marine mammal distributions. Endang Species Res. 2013;22:191–203.
Article Google Scholar
Mueller T, Fagan WF, Grimm V. Integrating individual search and navigation behaviors in mechanistic movement models. Theoret Ecol. 2010;4:341–55.
Article Google Scholar
Webber QMR, Vander Wal E. An evolutionary framework outlining the integration of individual social and spatial ecology. J Anim Ecol. 2017;87:113–27.
Article PubMed Google Scholar
Jonsen I, McMahon C, Patterson T, Auger-Méthé M, Harcourt R, Hindell M, et al. Movement behaviour responses to environment: fast inference of individual variation with a mixed effects model. Ecology. 2019;100:e02566–8.
Article CAS PubMed Google Scholar
Smith A, Hofner B, Lamb JS, Osenkowski J, Allison T, Sadoti G, et al. Modeling spatiotemporal abundance of mobile wildlife in highly variable environments using boosted GAMLSS hurdle models. Ecol Evol. 2019;200:1–19.
Google Scholar
Mate BR, Palacios DM, Irvine LM, Bailey H, Follett TM (2019) Data from: Behavioural estimation of blue whale movements in the Northeast Pacific from state-space model analysis of satellite tracks. Movebank Data Repository. https://doi.org/10.5441/001/1.5ph88fk2

Download references

Acknowledgements

The support of field crews was essential to the success of whale tagging operations. We thank Tomas Follett, Craig Hayslip, Barbara Lagerquist, and Martha Winsor for fieldwork and analytical assistance, and Kathy Minta and Minda Stiles for logistical and administrative support. The Argos Data Collection and Location System used for this project (http://www.argos-system.org/) is operated by Collecte Localisation Satellites. Argos is an international program that relies on instruments provided by the French Space Agency flown on polar-orbiting satellites operated by the USA’s National Oceanic and Atmospheric Administration, the European Organisation for the Exploitation of Meteorological Satellites, and the Indian Space Research Organization. The Environmental Research Division Data Access Program (ERDDAP) is hosted by the National Marine Fisheries Service’s (NMFS) Southwest Fisheries Science Center. The environmental data sets are produced and distributed by: the NASA Jet Propulsion Laboratory (NASA/JPL), the NOAA National Centers for Environmental Information (NOAA/NCEI), the Scripps Institution of Oceanography (UCSD/SIO), the Copernicus Marine and Environment Monitoring Service, European Union Space Strategy (CMEMS), and the European Service for Ocean Color, European Space Agency (GlobColour Project). We thank Bruce McCune for his invaluable advice on the finer points of NPMR. Comments from Paul C. Fiedler, Jarrod A. Santora, an anonymous reviewer, and the Associate Editor were helpful in improving earlier drafts of this manuscript.

Funding

Funding support for tagging was provided by the Office of Naval Research (Grants 9610608, 0010085 and 0310861), the Tagging of Pacific Pelagics (TOPP) program of the Census of Marine Life, the National Science Foundation, the Alfred P. Sloan Foundation, the Moore Foundation, the Packard Foundation, the National Geographic Society, and contributions from private donors to the Oregon State University Endowed Marine Mammal Institute. Data analysis and writing were supported by the Interagency NASA Climate and Biological Response Program (NASA-ROSES Grant No. NNX11AP71G, Project No. 10-BIOCLIM10–0016), with additional funding provided by the USA Department of the Navy, Commander, US Pacific Fleet under the Marine Species Monitoring Program via contract with HDR, Inc. (Contract No. N62470–15-D-8006, Task Order No. 18F5370). We thank the representatives from US Pacific Fleet and NAVFAC Southwest for technical and contractual support, and Kristen Ampela at HDR, Inc. for project management.

Author information

Authors and Affiliations

Marine Mammal Institute and Department of Fisheries and Wildlife, Hatfield Marine Science Center, Oregon State University, Newport, OR, USA
Daniel M. Palacios
Chesapeake Biological Laboratory, University of Maryland Center for Environmental Science, Solomons, MD, USA
Helen Bailey
Marine Mammal and Turtle Division, Southwest Fisheries Science Center, National Marine Fisheries Service, National Oceanic and Atmospheric Administration, La Jolla, CA, USA
Elizabeth A. Becker
Environmental Research Division, Southwest Fisheries Science Center, National Marine Fisheries Service, National Oceanic and Atmospheric Administration, Monterey, CA, USA
Steven J. Bograd
NOAA West Coast Regional Office, Long Beach, CA, USA
Monica L. DeAngelis
Marine Mammal and Turtle Division, Southwest Fisheries Science Center, National Marine Fisheries Service, National Oceanic and Atmospheric Administration, Moss Landing, CA, USA
Karin A. Forney
Moss Landing Marine Laboratories, Moss Landing, CA, USA
Karin A. Forney
Environmental Research Division, Southwest Fisheries Science Center, National Marine Fisheries Service, National Oceanic and Atmospheric Administration, Monterey, CA, USA
Elliott L. Hazen
University of California Santa Cruz, Santa Cruz, CA, USA
Elliott L. Hazen
Marine Mammal Institute and Department of Fisheries and Wildlife, Hatfield Marine Science Center, Oregon State University, Newport, OR, USA
Ladd M. Irvine
Marine Mammal Institute and Department of Fisheries and Wildlife, Hatfield Marine Science Center, Oregon State University, Newport, OR, USA
Bruce R. Mate
Present Address: Naval Undersea Warfare Center, Newport, RI, USA
Monica L. DeAngelis

Authors

Daniel M. Palacios
View author publications
You can also search for this author in PubMed Google Scholar
Helen Bailey
View author publications
You can also search for this author in PubMed Google Scholar
Elizabeth A. Becker
View author publications
You can also search for this author in PubMed Google Scholar
Steven J. Bograd
View author publications
You can also search for this author in PubMed Google Scholar
Monica L. DeAngelis
View author publications
You can also search for this author in PubMed Google Scholar
Karin A. Forney
View author publications
You can also search for this author in PubMed Google Scholar
Elliott L. Hazen
View author publications
You can also search for this author in PubMed Google Scholar
Ladd M. Irvine
View author publications
You can also search for this author in PubMed Google Scholar
Bruce R. Mate
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

DMP conceived and designed the study. BRM oversaw data collection, with participation from LMI. DMP, HB, LMI, and ELH analyzed the data. EAB and KAF provided statistical support and ecological knowledge, and MLD and SJB assisted with interpretation of results. DMP took the lead in writing the manuscript, and all authors contributed to, and approved, the final manuscript.

Corresponding author

Correspondence to Daniel M. Palacios.

Ethics declarations

Ethics approval

This research was conducted under USA’s National Marine Fisheries Service (NMFS) Marine Mammal Protection Act and Endangered Species Act scientific research permits #841 (1993–1998), #369–1440 (1999–2004), and #369–1757 (2005–2013), authorizing the close approach and deployment of implantable satellite tags on large whales. All tagging procedures authorized by the permit, and used in this manuscript, were subjected to an internal NMFS review. We carried out this study in strict accordance with the policies and guidelines of the Oregon State University Institutional Animal Care and Use Committee (IACUC), composed of veterinarians and other university administrators, and authorized under IACUC permits #2284 (1999–2002), # 2715 (2002–2005), and #3158 (2005–2008). IACUC acceptance assures that the research follows its guidelines for the humane care and use of animals while meeting its objective to reduce, replace, and refine the use of animals in research.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Additional files

Additional file 1:

Figure S1. Monthly values of the North Pacific Gyre Oscillation (NPGO) index for the period 1997-2009. Positive NPGO values (cool and highly productive conditions) were prevalent in most years of the study period (1998-2004 and 2007-2008), while two years (2005 and 2006) were characterized by negative NPGO values (warm conditions with reduced biological productivity). (PDF 117 kb)

Additional file 2:

Figure S2. Pairwise scatterplot matrix of the 11 environmental variables used to build initial NPMR models. The data have been binned in the scatterplots in the lower triangle to avoid overplotting, with lighter color shading indicating higher density of observations. Represented along the diagonal is the univariate probability density for each variable. The upper triangle contains the Pearson correlation coefficient (r) between variable pairs. See Table 1 for the units of the variables. (PDF 1.91 mb)

Additional file 3:

Figure S3. The estimated likelihood (lkhd) of ARS on the bivariate response surfaces represented by combinations of the predictors in the environmental NPMR model. Gray areas correspond to regions of the predictor space with non-existent combinations or where there was insufficient data for the model to produce an estimate based on the required minimum neighborhood size (n_min < 23.41). (PDF 774 kb)

Additional file 4:

Figure S4. (a) Probability density of estimated ARS likelihood by the environmental NPMR model for the building (purple polygon) and the validation (orange polygon) sets, with the vertical lines indicating the cutoff value for binary conversion that maximized the true skill statistic for the respective sets. (b) The receiver operating characteristic curve for the binary classification of the predictions by the environmental NPMR model on the building set (purple curve) and the validation set (orange curve), compared to the 1:1 diagonal (black line) corresponding to a model that did no better than random. The AUC value is the area under the receiver operating characteristic curve for the respective curve. (PDF 243 kb)

Additional file 5:

Figure S5. (a) Probability density of estimated ARS likelihood for NPMR models based on spatial coordinates (red polygon) and environmental predictors (purple polygon) sets, with the vertical lines indicating the cutoff value for binary conversion that maximized the true skill statistic for the respective models. (b) The receiver operating characteristic curve for the binary classification of the predictions by the spatial model (red curve) and the environmental predictors model (purple curve), compared to the 1:1 diagonal (black line) corresponding to a model that did no better than random. The AUC value is the area under the receiver operating characteristic curve for the respective curve. (PDF 248 kb)

Additional file 6:

Figure S6. The functional responses of likelihood of ARS to (a) longitude and (b) latitude in the spatial NPMR model (fitted purple curves). Also shown are the model estimates at each location (red points), and the 5th and 95th percentile variability bands obtained through 100 bootstrap samples (gray points). The green portion of the fitted curve in (a) corresponds a region of the predictor where neighborhood size was below the minimum allowed as part of the parsimony controls (n_min < 43.95). (PDF 223 kb)

Additional file 7:

Figure S7. Scatterplots of (a and b) neighborhood size and (c and d) likelihood of ARS as a function of longitude and latitude at each SSSM location for NPMR models based on spatial coordinates (red circles) and environmental predictors (purple circles). (PDF 668 kb)

Additional file 8:

Figure S8. Maps of (a-c) neighborhood size and (d-f) likelihood of ARS at each SSSM location for NPMR models based on spatial coordinates (first column), environmental predictors (second column), and the difference between the two (third column). Polygon with thick black outline is the EEZ boundary. (PDF 524 kb)

Appendix

Nonparametric multiplicative habitat modeling (NPMR) in HyperNiche: NPMR models estimate the probability of occurrence – or in our case likelihood of ARS, at target point v as the weighted average of observations nearby (here referred to as sample units) in the multi-dimensional space defined by the quantitative values of the predictors (i.e., the environmental neighborhood) [82, 83]. The size of the environmental neighborhood (n*) is determined by the standard deviation of the Gaussian function, and sample units near the target point (in environmental space) are weighted more strongly than distant ones in the average. Following the notation of [82, 83], the weight applied to a sample unit within the environmental neighborhood, relative to the target point v, is given by the Gaussian probability density function:

$$ {w}_{ij}^{\ast }={e}^{-\frac{1}{2}{\left[\left({x}_{ij}-{v}_j\right)/{s}_j\right]}^2} $$

(1)

where:

w^*_ij = the univariate weight applied to sample unit i for predictor j,

W = n × n diagonal matrix with each diagonal element being a product of weights from each predictor variable. For sample unit i, the diagonal element is:

$$ {w}_{ii}=\prod \limits_{j=1}^m{w}_{ij}^{\ast}\kern0.5em $$

(2)

X = the matrix of environmental predictors with i = 1 to n rows (sample units) and j = 1 to m columns (predictors),

x_ij = the environmental predictor value at sample unit i for predictor j,

v = a row vector specifying the position of the target point v in the space defined by the environmental predictors, with j = 1 to m columns,

v_j = the target point for predictor j, and.

s_j = the standard deviation of the Gaussian weighting function for predictor j, applied such that the full range of observed values for that predictor falls over six standard deviations. (Essentially, s_j functions as a smoothing parameter, which in the context of NPMR is termed “bandwidth” or “tolerance”).

The size of the neighborhood, n^*_i, from which each estimate of the response variable ŷ_v is calculated, is the sum of weights for sample units:

$$ {n}_i^{\ast }=\sum \limits_{i=1,}^n\left(\prod \limits_{j=1}^m{w}_{ij}^{\ast}\right) $$

(3)

where 0 < n^*_i ≤ n. If n^*_i = 0, then no estimate is possible for that point.

The weights for individual environmental predictors are then combined multiplicatively (rather than additively, as is done, for example, by generalized additive models) into a single weight for a given sample unit. This weighted average gives an estimate of the probability of occurrence at target point v:

$$ {\hat{y}}_v=\frac{\sum \limits_{i=1,i\ne v}^n{y}_i\left(\prod \limits_{j=1}^m{w}_{ij}^{\ast}\right)}{\sum \limits_{i=1,i\ne v}^n\left(\prod \limits_{j=1}^m{w}_{ij}^{\ast}\right)}\kern0.75em $$

(4)

where:

ŷ_v = fitted value, i.e., the estimated probability of occurrence at target point v,

y = the response variable, a column vector of observed presences and absences in BMODE, from i = 1 to n rows, and.

y_i = the value of the response variable at sample unit i.

The notation i ≠ v indicates the leave-one-out cross-validation procedure implemented to help guard against overfitting the model.

Based on this weighted estimation approach, HyperNiche implements an iterative method of free search to identify the best model for a given number of predictors. Starting with the best model for a single predictor and seeking improvement at each step, the method allows both addition and deletion of predictors while simultaneously adjusting the tolerances (s_j) of the weighting function in ±5% increments of the range of the predictors. An exhaustive search is made through all combinations of tolerances of the quantitative predictors to rank the models. In addition to using leave-one-out cross-validation for optimization, HyperNiche guards against overfitting during the search by implementing a series of controls, either with automatic settings (pre-specified levels are: conservative, medium, or aggressive, with medium being a reasonable default for most data sets) or with user-supplied manual settings. These controls include: requiring a model-wide minimum average neighborhood size (N^*) that imposes parsimony on the width of the tolerances and in the number of predictors (default is 5% of the sample size), a minimum neighborhood size around a target point that protects against estimating a response in a region of the predictor space with insufficient data (default is 0.25 × N^*), a maximum allowable number of sample units with missing estimates that controls the completeness of the fitted surfaces (default is 10% of the sample size), and a minimum data:predictor ratio, which for binary responses is the number of sample units in the least represented category (presences or absences) divided by the number of predictors in the model (the default minimum data:predictor ratio is 10). Lastly, HyperNiche enforces an improvement criterion on all models, expressed as a percentage improvement in model fit when a new predictor is added (the default minimum improvement criterion is a 5% increase in logB, the evaluation statistic). The search for additional predictors stops as soon as any one of these criteria is not met. As a final step, selected models can be tuned by further adjusting the tolerances to achieve minor improvements, using an increment of ±1% of the range of the predictors while applying the overfitting controls [82, 83].

For binary responses, overall model quality in NPMR is evaluated with logB, a log likelihood ratio expressing the improvement in predictive ability of a fitted model over a standard naive model (logB = 0) that estimates the probability of occurrence at a particular target point as the overall frequency of presences and absences in the data (in this sense, model evaluation with logB is analogous to evaluation using drop in deviance in logistic regression). The value of logB tends to increase with sample size, all else being equal, so in comparing models it is the relative amount of improvement from model to model what matters, not the absolute scale [82, 83]. Because logB is dependent on sample size, HyperNiche also reports B_ave, the average contribution of a sample unit to logB, which allows for straightforward interpretation and comparison across data sets [82, 83].

For a given model, the sensitivity of the response variable to changes in the level of the individual predictors indicates their relative influence in driving the model (a high sensitivity to small adjustments indicates a greater importance). The most common formula used to estimate sensitivity is provided by [82, 83] as:

$$ Sensitivity=\frac{\left(\sum \limits_{i=1}^n\left|{\hat{y}}_{i+}-{\hat{y}}_i\right|+\sum \limits_{i=1}^n\left|{\hat{y}}_{i-}-{\hat{y}}_i\right|\right)}{2n\ \left|{y}_{max}-{y}_{min}\right|\ \Delta } $$

(5)

where:

$ {\hat{y}}_{i+} $ = estimate of the response variable for case i, having increased the predictor by an arbitrarily small value Δ (say 0.1 of the range of the predictor),

$ {\hat{y}}_{i-} $ = estimate of the response variable for case i, having decreased the predictor by an arbitrarily small value Δ (say 0.1 of the range of the predictor), and.

Δ = a small difference applied to a predictor, expressed as a constant proportion of the range of a predictor.

Finally, a predictor’s tolerance indicates the width of the environmental window at which it operates (a predictor with a smaller tolerance has a narrower window and is a stronger driver in the model) [82, 83].

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.

Reprints and permissions

About this article

Cite this article

Palacios, D.M., Bailey, H., Becker, E.A. et al. Ecological correlates of blue whale movement behavior and its predictability in the California Current Ecosystem during the summer-fall feeding season. Mov Ecol 7, 26 (2019). https://doi.org/10.1186/s40462-019-0164-6

Download citation

Received: 23 November 2018
Accepted: 26 May 2019
Published: 18 July 2019
DOI: https://doi.org/10.1186/s40462-019-0164-6

Ecological correlates of blue whale movement behavior and its predictability in the California Current Ecosystem during the summer-fall feeding season

Abstract

Background

Methods

Results

Conclusions

Similar content being viewed by others

Background

Methods

Blue whale tagging

State-space modeling of satellite telemetry data

Environmental predictor data

NPMR modeling

Model validation: predicting under different climatic conditions

Model assessment with binary classification

Accounting for spatial autocorrelation and other sources of variability

Results

State-space modeled locations

NPMR modeling results

Model validation: predicting under different climatic conditions

Performance of binary conversion of predictions

Spatial model for the autocorrelation in the tracking data

Discussion

Conclusions

Availability of data and materials

Abbreviations

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Ethics approval

Consent for publication

Competing interests

Additional information

Publisher’s Note

Additional files

Appendix

Appendix

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation