Abstract
Phytoplankton blooms create harmful toxins, scums, and taste and odor compounds and thus pose a major risk to drinking water safety. Climate and land use change are increasing the frequency and severity of blooms, motivating the development of new approaches for preemptive, rather than reactive, water management. While several real-time phytoplankton forecasts have been developed to date, none are both automated and quantify uncertainty in their predictions, which is critical for manager use. In response to this need, we outline a framework for developing the first automated, real-time lake phytoplankton forecasting system that quantifies uncertainty, thereby enabling managers to adapt operations and mitigate blooms. Implementation of this system calls for new, integrated ecosystem and statistical models; automated cyberinfrastructure; effective decision support tools; and training for forecasters and decision makers. We provide a research agenda for the creation of this system, as well as recommendations for developing real-time phytoplankton forecasts to support management.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
Introduction
Water quality in lakes and reservoirs around the globe is increasingly at risk due to climate and land use change (e.g., Jane et al. 2021; Caretta et al. 2022; Woolway et al. 2022). As a result of warming air temperatures, altered precipitation, and increased nutrient loads, freshwaters are exhibiting increased variability and degradation, exemplified by the changing occurrence of phytoplankton blooms and their associated negative effects on water quality (Ho et al. 2019; Smucker et al. 2021; Merder et al. 2023). Here, we define a bloom as an aggregation of phytoplankton biomass that can pose water quality risks (adapted from Smayda 1997; Isles and Pomati 2021). Phytoplankton blooms are ubiquitous in many regions (e.g., Coffer et al. 2021; Handler et al. 2023), and it is expected that their occurrence and duration may increase in response to global change (Paerl and Huisman 2009; Paerl and Paul 2012; Chapra et al. 2017; Burford et al. 2020). While some lakes have not exhibited major changes in blooms to date (Kraemer et al. 2017; Wilkinson et al. 2022), other lakes experiencing warming have demonstrated increases in peak summer bloom intensity (Ho et al. 2019), and both temperate and tropical waterbodies are exhibiting increased dominance of cyanobacteria, a major bloom-forming group of phytoplankton (Kosten et al. 2012; Taranu et al. 2015).
Changes in phytoplankton bloom incidence and magnitude due to climate change pose major challenges for drinking water managers because of the ecosystem and health risks caused by toxic blooms. Phytoplankton blooms can produce harmful toxins, odors, and thick surface scums that alter lake water temperature and light availability (Robarts and Zohary 1984; Watson et al. 2016a; Chorus and Welker 2021; Zhang et al. 2022). As phytoplankton biomass decomposes in the water column and at the sediments, oxygen concentrations can decrease (Watson et al. 2016b; Dugener et al. 2023). Low oxygen, in turn, can lead to fish kills, elevated concentrations of nutrients and contaminants, greenhouse gas concentrations, and other degradation (Mortimer 1941; Rysgaard et al. 1994; Bartosiewicz et al. 2019; Hounshell et al. 2021; Carey et al. 2022b). Importantly, drinking or swimming in bloom-contaminated water can cause gastrointestinal and neurological damage and even death for humans, livestock, and pets (Jochimsen et al. 1998; Carmichael et al. 2001; Chorus and Welker 2021). Consequently, given the severity of these threats, water management and decision-making would greatly benefit from knowing a priori when and where a bloom will occur.
Variability in freshwater ecosystem dynamics due to climate change makes it challenging to use historical conditions as a baseline to predict whether a bloom will occur in future. Fundamentally, phytoplankton blooms are emergent phenomena, governed by the complex interactions between individual phytoplankton cells and the surrounding environment (Reynolds 2001; Reynolds and Elliott 2012; Breier et al. 2018; Isles and Pomati 2021). Phytoplankton growth rates can exponentially increase in response to the onset of warm temperatures and high light conditions (as well as other environmental cues), resulting in rapid accumulation of biomass (Reynolds 2006). Similarly, bloom collapse, which can also be triggered by changing environmental conditions, cell senescence, viruses, and other factors, can result in a crash of the phytoplankton populations comprising a bloom (reviewed by Harris et al. 2024). Together, the complex abiotic and biotic interactions that control the development, maintenance, and collapse of blooms (Litchman 2023) motivate the need for new approaches to phytoplankton management.
Fundamentally, forecasts of phytoplankton blooms can help managers adapt and mitigate their risk (e.g., Wynne et al. 2013; Stumpf et al. 2016; Scavia et al. 2023). Here, we define forecasts as predictions of future conditions with specified uncertainty (Table 1; Lewis et al. 2022). While several real-time freshwater forecasting systems for phytoplankton have been developed to date, none are both automated and quantify uncertainty in their predictions (reviewed by Lofton et al. 2023; Table 1), which is critical for implementation and manager use. If managers have forecasts of phytoplankton blooms in advance, they can implement preemptive interventions to mitigate or prevent the bloom from occurring: e.g., apply algaecides, activate engineered water quality systems, or divert water from other lakes or reservoirs (Carey et al. 2022a). In addition, forecasts of blooms could help managers adapt to blooms: e.g., alter water quality treatment processes, install additional filters, initiate swimming/beach closures, or change staffing schedules. Together, these improvements could build more resilience in drinking water systems and would likely produce substantial economic benefits as blooms cost the U.S. economy > $2 billion per year (Dodds et al. 2009). Consequently, there is a pressing need to develop a scalable phytoplankton bloom forecasting system that successfully predicts future blooms in freshwaters.
To be most useful for managers seeking phytoplankton bloom adaptation and mitigation options in the face of climate change, forecasts need to be generated with sufficient advance warning, be updated in real-time as new phytoplankton observations become available, and be translated into useful decision support tools. However, five major challenges remain before such a near-term (e.g., one day to one season-ahead) forecasting system can be realized (Fig. 1). First, most phytoplankton models are currently unable to accurately predict historical or future bloom dynamics (e.g., Flynn 2005; Franks 2009; Fennoochi et al. 2019; Ralston and Moore 2020). Second, setting up and managing the cyberinfrastructure for automated, real-time forecasting requires substantial computing time and resources (Carey et al. 2022a). Third, by definition, forecasts need to have quantified uncertainty (Table 1), which can come from many different sources (e.g., model driver data, parameters), yet is not commonly included in forecasting systems (reviewed by Lewis et al. 2022; Lofton et al. 2023). Fourth, despite their promise, forecasts have yet to be broadly integrated into adaptive management workflows and be used for decision support (Lewis et al. 2022), necessitating new approaches for ensuring that forecasts are useful for managers. Finally, as ecological forecasting is an emerging field (Woelmer et al. 2021), there is a pressing need for forecasting training opportunities, both for researchers developing forecasting systems and for water managers who may be more familiar with reactive (vs. preemptive) decision-making.
Our research team identified these five challenges from our experiences co-developing the FLARE (Forecasting Lake And Reservoir Ecosystems; flare-forecast.org) water quality forecasting system during the past seven years (Thomas et al. 2020; Carey et al. 2022a). While FLARE has primarily been applied to generate lake and reservoir water temperature forecasts to date (e.g., Thomas et al. 2023; Wander et al. 2024), we describe some major hurdles that will need to be overcome to successfully develop phytoplankton forecasting systems that are broadly scalable and useful for water managers.
In this Perspective article, we propose a research agenda for advancing phytoplankton forecasting for freshwater management that addresses each of these five challenges. Our overarching goal is to provide recommendations for how these challenges could be potentially overcome to develop a real-time, automated water quality forecasting system. We note that our proposed approaches may not be appropriate for every forecasting application, and that a diversity of forecasting methods are needed to create an operational lake phytoplankton forecasting system for waterbodies across the globe (akin to the well-established meteorological forecasting systems that generate real-time weather forecasts in many countries). However, while such a global system likely remains several years to decades away, we forecast that the deployment of real-time, automated phytoplankton forecasts with specified uncertainty on select lakes and reservoirs is well within our grasp and possible on much shorter-term horizons.
Recommendations
Modeling needs for forecasting phytoplankton
Blooms remain notoriously hard to predict via a range of modeling approaches, despite a century of research by aquatic scientists aiming to improve water quality management (Fig. 1, Challenge 1; Flynn 2005, Ralston and Moore 2020, Rousso et al. 2020, Litchman 2023). Both simple and complex models struggle to accurately predict blooms (reviewed by Rousso et al. 2020). For example, many current trait-based approaches to phytoplankton modeling apply a small, static set of known functional relationships (e.g., nutrient uptake rates and temperature-dependent growth rates) to aggregated groups of phytoplankton in deterministic process-based models that range in complexity from simple nutrient uptake equations to coupled hydrodynamic-water quality models (Franks 2009; Page et al. 2018; Litchman 2023). While these models can reproduce median phytoplankton levels (e.g., Elliott et al. 2006; Page et al. 2018; Soares and Calijuri 2021), they often fall short of predicting emergent bloom events because aggregating heterogeneous species into one group neglects the inherent variability of individual cell responses to the functional relationships (Bruggeman and Kooijman 2007; Hellweger et al. 2008; Cropp and Norbury 2009; Cottingham et al. 2021). This shortcoming of modeling phytoplankton as aggregated functional groups motivates the need to model these complex dynamics at the individual cell level to predict blooms (Ranjbar et al. 2021). Similarly, while machine learning models are showing increasing promise at predicting phytoplankton blooms (Gupta et al. 2023; Lin et al. 2023; Silva et al. 2023), more work is needed to robustly quantify uncertainty and incorporate scientific knowledge into these models to maximize their utility for water management decision-making (Abdar et al. 2021; Varadharajan et al. 2022; Lapeyrolerie and Boettiger 2023). We refer readers to reviews by Rousso et al. (2020) and Lofton et al. (2023) on the diversity of models currently being used to predict and forecast phytoplankton blooms in lakes, as well as their relative predictive skill, advantages, and shortcomings.
In response to this challenge, we propose three key characteristics of potential forecast modeling approaches that show great promise for improving bloom prediction. First, individual-based models (IBMs) are a powerful simulation approach specifically designed to study collective phenomena (e.g., phytoplankton blooms) that emerge from functional relationships implemented to govern the behavior of individuals (Table 1; Bonabeau 2002). Each individual in an IBM is assigned a set of traits (e.g., age, size) by drawing from a distribution of possible trait values (DeAngelis and Grimm 2014), thereby explicitly accounting for heterogeneity among individuals in how they respond to the functional relationships (e.g., Engel et al. 2017). In addition, the behavior of individuals and the relative importance of different processes represented in the model can vary based on environmental conditions (Simon and Fortin 2020) or the number of individuals present at a site (Ellis and Petrovskaya 2020), permitting both abiotic variables and population/community interactions to drive individual dynamics over time and space. Finally, individuals can have a memory of past conditions, which can dynamically alter individual behavior. For example, alteration of the photosynthesis rate of a phytoplankton cell due to changes in light availability will alter the cell’s carbon content and, in turn, change the cell’s sinking rate in the lake (Feng et al. 2018). Together, these IBM attributes may help improve our ability to forecast emergent bloom phenomena (Ranjbar et al. 2021), though we note that IBMs have yet to be applied for real-time forecasting applications and may not be appropriate for all modeling contexts. To best evaluate IBM performance for bloom forecasting, a suite of evaluation metrics should be used that include estimates of forecast accuracy (e.g., bias, root mean square error; Joliffe and Stephenson 2012), forecast precision (e.g., reliability of forecast confidence intervals, Bröcker and Smith 2007), combined accuracy and precision (ignorance score, continuous ranked probability score; Gneiting et al. 2005; Smith et al. 2015), and forecast skill (e.g., skill scores calculated by comparing the forecast vs. null models used for forecast benchmarking; Pappenberger et al. 2015).
Second, it is likely that we need to use an ensemble modeling approach to improve the performance of phytoplankton forecasts. Ensemble modeling (see Table 1) involves generating a set of model simulations (ensemble members) that can differ in their starting conditions, model parameter values, process uncertainty (represented by adding random noise), model inputs (e.g., inputs from an ensemble weather model or hydrological model), and/or model structures (e.g., different types of models; Dietze 2017a). Ensemble modeling is particularly powerful for simulating the dynamics of non-linear systems (Smith 2001), such as phytoplankton blooms, where ensemble members can diverge widely in response to small differences in model initial conditions, driver data, parameters, and process/stochastic noise. When applied to IBMs, members of an ensemble can also represent different distributions of parameters describing traits (e.g., phytoplankton cell size, sinking rates; Litchman et al. 2010; Cocucci et al. 2022; Laraib et al. 2023). Thus, because IBMs inherently have multiple ensemble members with associated relative likelihoods (i.e., individuals), IBMs may be able to better quantify both the probabilities of a large range of phytoplankton bloom outcomes and their uncertainty (sensu Ralston and Moore 2020), relative to other model types.
Third, data assimilation, i.e., the updating of models with observations when they become available (Table 1), will also likely improve model performance (reviewed by Luo et al. 2011; Niu et al. 2014). Data assimilation has been found to greatly improve water quality model performance in a suite of other lake studies, including for methane emissions (McClure et al. 2021), organic carbon (Zwart et al. 2019), and water temperature (Thomas et al. 2020). Data assimilation is important for bloom forecasting because the uncertainty generated by the ensemble will likely be large and data assimilation can help filter out the ensemble members (and their associated parameters, states, and traits) that are inconsistent with the most recent observations (e.g., Woelmer et al. 2022). This filtering reduces the ensemble spread when forecasts are initiated, thus reducing forecasting uncertainty (Wander et al. 2024). As phytoplankton conditions can change rapidly in response to environmental variability, the integration of new data in near-real time (e.g., over the past day) to update model states, parameters, and initial conditions can provide an important ‘check’ on forecast model trajectories and allow the model to evolve in response to fluctuations in phytoplankton levels. While the application of data assimilation in freshwater modeling studies is increasing, its use for water quality forecasting is still limited (reviewed by Lofton et al. 2023), and we encourage interested readers to see reviews on data assimilation by Cho et al. (2020) and Lahoz and Schneider (2014).
Building the computational workflows for real-time forecasting
A second major challenge for phytoplankton forecasters is cyberinfrastructure and computational efficiency (Fig. 1, Challenge 2). Real-time daily forecasting entails downloading observations from phytoplankton sensors in the lake, using those data to update models (i.e., data assimilation) to run the many different simulations comprising an ensemble forecast into the future, and then disseminating the forecast to managers, all within 24 h before the next day’s forecast needs to be generated (Carey et al. 2022a). For forecasts to run automatically, data need to be transferred securely and wirelessly from sensors deployed in the field to models that are running in near-real time (Daneshmand et al. 2021). This automation requires the coordinated execution of a distributed workflow encompassing various hardware and software modules that are responsible for sensor data collection and staging, reliable data transfers, data quality assurance/quality control (QA/QC), ensemble model execution, and data summarization and visualization for use by managers (Thomas et al. 2020). In general, forecasting workflows require computation taking place at both well-resourced cloud computing data centers (which have massive servers and storage), as well as at the network’s edge, i.e., sensors deployed in lake sites that have limited network connectivity and power (Shi et al. 2016), which can be challenging to orchestrate. Moreover, the computational demands of the phytoplankton forecasting models need to be manageable within the overall end-to-end workflow: e.g., a reason why IBMs are not currently used for real-time, daily water quality forecasting is because of their high computational cost (Ranjbar et al. 2021), resulting in IBMs with computational times exceeding simulation durations.
A combination of edge and cloud computing using the Function-as-a-Service (FaaS; Eyk et al. 2017) paradigm may provide a new approach for facilitating data transfer and automation of end-to-end phytoplankton forecasting workflows (see Table 1). Edge computing is a distributed computing architecture in which data processing occurs physically at the site of data collection, thereby decreasing computing, storage, and connectivity needs (Shi et al. 2016), and FaaS is a serverless computing approach to run event-driven applications on-demand (Table 1; Eyk et al. 2017; Schleier-Smith et al. 2021). Two key benefits of FaaS are its scalability and its capacity to abstract server deployment and management without needing constant user input (hence the term “serverless”; Eismann et al. 2021; Schleier-Smith et al. 2021). FaaS is typically supported on cloud resources for scalable execution, but the platform is also extendable to edge resources for event-driven computation (Mampage et al. 2022).
To apply an event-driven and scalable FaaS platform to lake phytoplankton forecasting, an end-to-end ensemble forecasting workflow would first be divided into functions. At the waterbody being forecasted, FaaS functions could include accessing data from environmental sensors deployed at a lake and its inflowing streams; processing data for QA/QC; transferring data to cloud storage via edge gateways, or mini-computers deployed in the field (Daneshmand et al. 2021); and initiating IBM simulations. Each function would be invoked by an event (e.g., a daily timer, or another trigger, such as the completion of the preceding function in the workflow). To forecast rapidly changing phytoplankton conditions, a FaaS platform could be deployed at the edge gateway to both detect when phytoplankton are increasing in the lake and react by triggering functions that implement an increased frequency of sensor data collection and forecast updating. Importantly, these actions could be automatically triggered where the data are collected (i.e., by the sensors at the edge), without any human intervention. While FaaS platforms have primarily been tested in commercial cloud environments (Lynn et al. 2017) and scientific computing applications (Li et al. 2022), applying a FaaS system to adaptively run phytoplankton forecasting workflows spanning both edge and cloud computing could set the foundation for its deployment in other environments. Helpful resources for learning more about FaaS and its many potential applications include Schleier-Smith et al. (2021) and Shafiei et al. (2022).
To improve the computational efficiency of the end-to-end forecasting workflow, surrogate modeling, a statistical computing technique, could enable researchers to harness the power of IBMs and other highly parametrized models for real-time, daily phytoplankton forecasts (Tsattalios et al. 2023). Surrogate modeling, or emulation, is the enterprise of meta-modeling a simulation model, which has many advantages (Table 1; Gramacy 2020). The most well-known benefit of a surrogate model is computational speed (Sacks et al. 1989). By abstracting a simulation with statistical distributions, thereby “memorizing” the input–output relationship exhibited by the computationally intensive model, the surrogate can quickly mimic the essence of the simulation model without having to run the model. While surrogates have yet to be applied to emulate phytoplankton IBMs, they have been successfully used to emulate IBMs of wastewater microbial communities (Oyebamiji et al. 2017). For this application, the surrogate increased computational efficiency by 220-fold, decreasing simulation model run times from 6 h to < 2 min (Oyebamiji et al. 2017), suggesting that surrogates could enable phytoplankton IBMs to be integrated into real-time, daily forecasting workflows in the future.
Forecasts are inherently uncertain
Third, to the best of our knowledge, we are unaware of any automated, near-term freshwater phytoplankton forecasting systems that quantify uncertainty in predictions (Lofton et al. 2023), which may be because robust forecast uncertainty quantification is challenging (Fig. 1, Challenge 3; Dietze et al. 2018). Following our definition of a forecast (Lewis et al. 2022), “forecasts” without specified uncertainty are simply predictions (Lofton et al. 2023). Out of 83 papers published between 2017 and 2022 that presented future water quality predictions, the majority (64%) did not incorporate uncertainty (reviewed by Lofton et al. 2023). If uncertainty is not included in a “forecast,” managers will be unable to interpret the risk associated with a prediction of a phytoplankton bloom, potentially altering subsequent decisions (Berthet et al. 2016). For example, a forecast with a 95% probability that phytoplankton levels will exceed a drinking water threshold next week will likely trigger different management decisions than a forecast with a 5% probability of exceeding the threshold. Moreover, if uncertainty is not specified in a “forecast,” managers may assign their own uncertainty estimates to predictions, which could be incorrect or distorted by internal biases (Berthet et al. 2016). Users may also underestimate probabilities of adverse events, thereby increasing risks of human health or recreational impacts, or overestimate probabilities, leading to increased costs (Roberts et al. 2008).
There are many different sources of uncertainty in forecasts, including uncertainty in model parameters, initial conditions, processes, and driver data, among others (Dietze 2017a; Jakeman et al. 2019; Geary et al. 2020). Of “true” freshwater forecasts that incorporate uncertainty, most include only one or two sources of uncertainty (e.g., Olsson et al. 2024), and those forecasts that do include more than one source rarely partition the individual contributions of their uncertainty sources to total forecast uncertainty (Lofton et al. 2023). As a result, the relative importance of model parameters, initial conditions, processes, and driver data to total uncertainty in water quality forecasts is not well known (Lofton et al. 2023). Quantifying the relative contribution of individual uncertainty sources ensures that the most important sources of total uncertainty are accounted for in a forecast, resulting in well-calibrated uncertainty estimates (Olsson et al. 2024), and identifies how best to reduce total forecast uncertainty (Dietze 2017b; Lofton et al. 2022).
Consequently, researchers need to carefully consider how best to quantify uncertainty as they develop phytoplankton forecasts. Ensemble modeling is a common approach for quantifying forecast uncertainty, whereby multiple alternate simulations of the forecast model are run with different model driver datasets, parameter values, initial conditions, and/or process/stochastic noise (Table 1; Dietze 2017a). The total uncertainty in a forecast can then be determined by simultaneously representing each of these sources in the ensemble, with the individual uncertainty sources and their interactions summing to total forecast uncertainty (Dietze 2017a). When using this ensemble modeling approach, driver data uncertainty may be the most straightforward uncertainty source to incorporate because ensemble members from e.g., weather forecast or hydrological models can be directly used as inputs to different ensemble members of a forecasting model (e.g., Dietze 2017a; Thomas et al. 2020; Zwart et al. 2023; Olsson et al. 2024). Parameter and process uncertainty can also be straightforward to estimate when using simple statistical models (e.g., regression time series models; McClure et al. 2021; Lofton et al. 2022; Woelmer et al. 2022) because the probability distributions of the parameters and residual error (which represents process uncertainty) in these simple models can be easily sampled from and applied to forecast ensemble members (Dietze 2017a). Parameter and process uncertainty may be more challenging to quantify when using complex process-based models that include many equations, model states, and parameters (e.g., hydrodynamic-water quality models and IBMs), highlighting the value of Bayesian approaches for estimating probability distributions (Hobbs and Hooten 2015). While Bayesian approaches can be used to quantify parameter and process uncertainty, they are computationally intensive, requiring simplifications to generate robust estimates of uncertainty for real-time, daily forecasting. Thus, surrogate models (see above) make it more computationally efficient and feasible to quantify the dominant sources of forecast uncertainty in IBMs and other similar types of models (Gramacy 2020), thereby helping to both prioritize improvements to phytoplankton forecasting systems (Lofton et al. 2022) and help managers better interpret the risk of potential future blooms (Woelmer et al. 2023).
Co-producing forecasts with managers for decision support
Fourth, forecasts are extraordinarily challenging to communicate because there are many different ways to report forecast uncertainty, resulting in contradictory and frequently incorrect interpretations by users (Fig. 1, Challenge 4; Broad et al. 2007, Ramos et al. 2013, Cheong et al. 2016, Kinkeldey et al. 2017, Mulder et al. 2023). For example, multiple depictions of uncertainty in phytoplankton bloom forecasts are possible (e.g., time series of phytoplankton biomass with widening predictive intervals into the future, percent likelihood of future exceedance of a certain bloom threshold) and each visualization has different possible (but unknown) interpretations by water managers (Woelmer et al. 2023). While many studies have explored different approaches for communicating uncertainty in scientific data (e.g., Olston and Mackinlay 2002; Smith Mason et al. 2017; Wiggins et al. 2018), determining how best to translate uncertainty in forecasts into visualizations for decision-makers is unresolved, even for well-studied weather forecasts (Gerst et al. 2020; Kamal et al. 2021). Managers likely need a suite of different visualizations to guide different decisions, but the best practices for developing useful forecast output are unknown (Bodner et al. 2021). Moreover, it remains unclear how water managers interpret model outputs or operationalize data into management decisions, creating challenges for researchers trying to identify and develop the most useful forecasting models and visualizations.
Consequently, forecasters must work closely with managers when developing phytoplankton forecasting systems to ensure that forecast output is usable by decision-makers (Bodner et al. 2021; Carey et al. 2022a; Lofton et al. 2023). Many environmental models and visualizations have failed to be adopted by end users because they have not captured relevant trade-offs and have been perceived by practitioners or managers as inconsistent with their experience and expertise (Jakeman et al. 2006; McIntosh et al. 2008; Dubois et al. 2020). Ultimately, the co-production of forecasts using human-centered design (Table 1) takes time and requires iterative interaction of researchers and managers to evaluate the forecasting system, learn from its successes and failures, and update the system to improve decision-making based on forecasts (Chun and Conley 2011; Clark 2011; Boy 2012).
As part of forecasting system co-production, it would be useful for researchers to elicit managers’ (1) operational metrics of phytoplankton blooms; (2) learn managers’ perceptions of the ecological, human health, and technical hazards linked to phytoplankton blooms that drive decision-making in their water management; and (3) assess managers’ attitudes toward uncertainty in forecasts of phytoplankton blooms. For example, if a focal group of water managers make treatment decisions based on algal cell count data, then a forecast of chlorophyll-a may not be easily interpretable or useful. Likewise, managers of a lake with swimmers may quantify a bloom based on the areal extent of a surface scum, whereas managers of a lake where drinking water is extracted from deep intakes may quantify a bloom based on biomass concentrations throughout the water column. It is likely that managers’ perceptions of the hazards associated with phytoplankton blooms also influence how they respond to different representations of uncertainty in bloom forecasts, so knowing this information early in the design process can help researchers present uncertainty in forecast visualizations more effectively (e.g., via predictive intervals on a time series graph, a risk index, etc.). Overall, manager engagement throughout forecasting system development is essential for creating phytoplankton forecast models and decision support tools that are useful.
Forecasting training for both researchers and managers
Fifth, there is a pressing need for forecasting training opportunities, both for researchers developing forecasting systems and for water managers who may be more familiar with reactive (vs. preemptive) decision-making (Fig. 1, Challenge 5). For the researchers, developing phytoplankton forecasts requires skills in programming, statistics, ecosystem modeling, uncertainty analysis, data visualization, cyberinfrastructure, and decision science, which are rarely taught simultaneously in many undergraduate and graduate training programs (Woelmer et al. 2021; Willson et al. 2023). For the managers, water treatment curricula generally lack training on how to interpret high-frequency monitoring data and forecasts to guide predictive management interventions (e.g., VAC 2017). The lack of training in these key forecasting skills hinders the development of phytoplankton forecasting systems and their use for decision support.
Environmental models emerge from the natural sciences as tools to evaluate the possible role of alternative mechanisms and modes of action in environmental phenomena of interest. When adapted to a decision context, they frequently fail to capture the full range of physical, engineered, and social trade-offs and feedbacks that constrain decision-makers (Calder and Schartup 2023). Meanwhile, managers may seek high-level information about operational strategies to achieve environmental objectives but may have less knowledge about the network of phenomena that govern outcomes of interest (Calder et al. 2020). Therefore, iterative model development with both researchers and practitioners is crucial for creating models that increase the scientific basis of decision-making while addressing management realities and constraints.
Together, we need to create training opportunities for both researchers and managers. This training could take many forms, tailored for each community but developed with complementary learning objectives to facilitate collaboration between the two groups. For example, training for the managers could include short educational modules focused on key forecasting concepts embedded within manager certification programs, as well as workshops at management-focused conferences. These modules could then be adapted for undergraduate and graduate student courses and programs and include an additional focus on the data science skills underlying forecast development. Ultimately, managers need to have a high-level understanding of how forecasts are developed and their key assumptions (e.g., how forecast uncertainty is being quantified) and researchers need to understand managers’ needs and how decision-making occurs to ensure that forecasts are useful (e.g., which bloom adaptation and mitigation opportunities are available). Moreover, both groups need to be able to interpret probabilistic forecast visualizations and evaluate different representations of uncertainty. Overall, while manager training has not yet been implemented into ecological forecasting training to date (e.g., Moore et al. 2022; Willson et al. 2023; Woelmer et al. 2023), we encourage researchers to think carefully about how best to craft educational materials that can be used to train the next generation of both forecast developers and users together.
Conclusions
Here, we outline five major challenges that currently serve as bottlenecks for developing real-time phytoplankton bloom forecasting systems. We focused specifically on these five challenges because they inherently require interdisciplinary expertise to be overcome, and may be overlooked by researchers as they embark on forecasting system development. We note that there are many other thorny (i.e., exciting!) challenges that also need to be addressed: e.g., accessing lake data for assimilation from sites without in situ observations, potentially via satellite imagery (Schaeffer et al. 2024); developing accurate forecasted driver data needed to run phytoplankton models (e.g., hydrology, water chemistry); developing the cyberinfrastructure to support real-time ensemble forecasting workflows for many water quality variables simultaneously; extending phytoplankton forecast horizons to improve the usability of forecasts for decision-making (e.g., Jackson-Blake et al. 2022); and maintaining forecasting systems sustainably over the long-term (Carey et al. 2022a). We also note that our proposed solutions to the five challenges may not be applicable for all forecasting contexts, and that forecasting may not be appropriate for all management needs (e.g., Hobday et al. 2019). Rather, given the enormity of the task of setting up a real-time automated forecasting system, we hope that our recommendations serve as a catalyst for helping forecasters examine which of these approaches might be the right fit (or not) for their needs, and aim to lower the barrier to their implementation. Ultimately, phytoplankton bloom forecasts hold much promise for expanding the toolbox of options for water managers. Given the urgent need for new approaches to phytoplankton bloom management in the face of climate change, we encourage researchers and managers alike to consider forecasting as a solutions-focused approach for climate change adaptation and mitigation.
References
Abdar, M., F. Pourpanah, S. Hussain, D. Rezazadegan, L. Liu, M. Ghavamzadeh, P. Fieguth, X. Cao, et al. 2021. A review of uncertainty quantification in deep learning: Techniques, applications and challenges. Information Fusion 76: 243–297.
Bartosiewicz, M., A. Przytulska, J.-F. Lapierre, I. Laurion, M.F. Lehmann, and R. Maranger. 2019. Hot tops, cold bottoms: Synergistic climate warming and shielding effects increase carbon burial in lakes. Limnology and Oceanography Letters 4: 132–144.
Berthet, L., O. Piotte, É. Gaume, R. Marty, and C. Ardilouze. 2016. Operational forecast uncertainty assessment for better information to stakeholders and crisis managers. E3S Web Conference 7: 18005.
Bodner, K., C. Rauen Firkowski, J.R. Bennett, C. Brookson, M. Dietze, S. Green, J. Hughes, J. Kerr, et al. 2021. Bridging the divide between ecological forecasts and environmental decision making. Ecosphere 12: e03869.
Bonabeau, E. 2002. Agent-based modeling: Methods and techniques for simulating human systems. Proceedings of the National Academy of Sciences 99: 7280–7287.
Boy, G.A. 2012. The handbook of human–machine interaction: A human-centered design approach. Farnham: Ashgate Publishing Ltd.
Breier, R.E., C.C. Lalescu, D. Waas, M. Wilczek, and M.G. Mazza. 2018. Emergence of phytoplankton patchiness at small scales in mild turbulence. Proceedings of the National Academy of Sciences USA 115: 12112–12117.
Broad, K., A. Leiserowitz, J. Weinkle, and M. Steketee. 2007. Misinterpretations of the “cone of uncertainty” in Florida during the 2004 Hurricane Season. Bulletin of the American Meteorological Society 88: 651–668.
Bröcker, J., and L.A. Smith. 2007. Increasing the reliability of reliability diagrams. Weather and Forecasting 22: 651–661.
Bruggeman, J., and S.A.L.M. Kooijman. 2007. A biodiversity-inspired approach to aquatic ecosystem modeling. Limnology and Oceanography 52: 1533–1544.
Burford, M.A., C. Carey, D.P. Hamilton, J. Huisman, H.W. Paerl, S.A. Wood, and A. Wulff. 2020. Perspective: Advancing the research agenda for improving understanding of cyanobacteria in a future of global change. Harmful Algae 91: 101601.
Calder, R.S.D., A. Alatorre, R.S. Marx, V. Mallampalli, S.A. Mason, L.P. Olander, M. Jeuland, and M.E. Borsuk. 2020. Graphical models and the challenge of evidence-based practice in development and sustainability. Environmental Modelling and Software 130: 104734.
Calder, R.S.D., and A.T. Schartup. 2023. Geohealth policy benefits are mediated by interacting natural, engineered, and social processes. GeoHealth 7: e2023GH000858.
Caretta, M.A., A. Mukherji, M. Arfanuzzaman, R.A. Betts, A. Gelfan, Y. Hirabayashi, T.K. Lissner, J. Liu, et al. 2022. Chapter 4: Water. In Climate change 2022: Impacts, adaptation, and vulnerability, ed. H.-O. Pörtner, D.C. Roberts, M. Tignor, E.S. Poloczanska, K. Mintenbeck, A. Alegría, M. Craig, S. Langsdorf, S. Löschke, V. Möller, A. Okem, and B. Rama, 551–712. New York: Contribution of Working Group II to the Sixth Assessment Report of the Intergovernmental Panel on Climate Change. Cambridge University Press.
Carey, C.C., P.C. Hanson, R.Q. Thomas, A.B. Gerling, A.G. Hounshell, A.S.L. Lewis, M.E. Lofton, R.P. McClure, et al. 2022b. Anoxia decreases the magnitude of the carbon, nitrogen, and phosphorus sink in freshwaters. Global Change Biology 28: 4861–4881.
Carey, C.C., W.M. Woelmer, M.E. Lofton, R.J. Figueiredo, B.J. Bookout, R.S. Corrigan, V. Daneshmand, A.G. Hounshell, et al. 2022a. Advancing lake and reservoir water quality management with near-term, iterative ecological forecasting. Inland Waters 12: 107–120.
Carmichael, W.W., S.M.F.O. Azevedo, S.A. Ji, R.J.R. Molica, E.M. Jochimsen, S. Lau, K.L. Rinehart, G.R. Shaw, et al. 2001. Human fatalities from cyanobacteria: Chemical and biological evidence for cyanotoxins. Environmental Health Perspectives 109: 663–668.
Chapra, S.C., B. Boehlert, C. Fant, V.J. Bierman Jr., J. Henderson, D. Mills, D.M.L. Mas, L. Rennels, et al. 2017. Climate change impacts on harmful algal blooms in U.S. freshwaters: A screening-level assessment. Environmental Science and Technology 51: 8933–8943.
Cheong, L., S. Bleisch, A. Kealy, K. Tolhurst, T. Wilkening, and M. Duckham. 2016. Evaluating the impact of visualization of wildfire hazard upon decision-making under uncertainty. International Journal of Geographical Information Science 30: 1377–1404.
Cho, H.H., Y. Pachepski, M. Ligaray, Y. Kwon, and K.H. Kim. 2020. Data assimilation in surface water quality modeling: A review. Water Research 186: 116307.
Chorus, I., and M. Welker, eds. 2021. Toxic cyanobacteria in water: A guide to their public health consequences, monitoring and management. London: Taylor & Francis.
Chun, J., and C. Conley. 2011. Investigating users’ interaction with physical products applying qualitative and quantitative methods. In Human-centered design, ed. M. Kurosu, 3–12. Berlin: Springer.
Clark, K.A. 2011. Human interaction and collaborative innovation. In Human centered design, ed. M. Kurosu, 13–21. Berlin: Speringer.
Cocucci, T.J., M. Pulido, J.P. Aparicio, J. Ruíz, M.I. Simoy, and S. Rosa. 2022. Inference in epidemiological agent-based models using ensemble-based data assimilation. PLoS ONE 17: e0264892.
Coffer, M.M., B.A. Schaeffer, W.B. Salls, E. Urquhart, K.A. Loftin, R.P. Stumpf, P.J. Werdell, and J.A. Darling. 2021. Satellite remote sensing to assess cyanobacterial bloom frequency across the United States at multiple spatial scales. Ecological Indicators 128: 107822.
Cottingham, K.L., K.C. Weathers, H.A. Ewing, M.L. Greer, and C.C. Carey. 2021. Predicting the effects of climate change on freshwater cyanobacterial blooms requires consideration of the complete cyanobacterial life cycle. Journal of Plankton Research 43: 10–19.
Cropp, R., and J. Norbury. 2009. Parameterizing plankton functional type models: Insights from a dynamical systems perspective. Journal of Plankton Research 31: 939–963.
Daneshmand, V., A. Breef-Pilz, C.C. Carey, Y. Jin, Y.J. Ku, K.C. Subratie, R.Q. Thomas, and R.J. Figueiredo. 2021. Edge-to-cloud virtualized cyberinfrastructure for near real-time water quality forecasting in lakes and reservoirs. In 2021 IEEE 17th International Conference on eScience (eScience), 138–148.
DeAngelis, D.L., and V. Grimm. 2014. Individual-based models in ecology after four decades. F1000Prime Reports 6: 39.
Dietze, M.C. 2017a. Ecological forecasting. Princeton, NJ: Princeton University Press.
Dietze, M.C. 2017b. Prediction in ecology: A first-principles framework. Ecological Applications 112: 6252–6213.
Dietze, M.C., A. Fox, L.M. Beck-Johnson, J.L. Betancourt, M.B. Hooten, C.S. Jarnevich, T.H. Keitt, M.A. Kenney, et al. 2018. Iterative near-term ecological forecasting: Needs, opportunities, and challenges. Proceedings of the National Academy of Sciences USA 115: 1424–1432.
Dodds, W.K., W.W. Bouska, J.L. Eitzmann, T.J. Pilger, K.L. Pitts, A.J. Riley, J.T. Schloesser, and D.J. Thornbrugh. 2009. Eutrophication of US freshwaters: Analysis of potential economic damages. Environmental Science and Technology 43: 12–19.
Dubois, N.S., A. Gomez, S. Carlson, and D. Russell. 2020. Bridging the research-implementation gap requires engagement from practitioners. Conservation Science and Practice 2: e134.
Dugener, I.P., A.D. Stone, and Weinke, and B. A. Biddanda,. 2023. Out of oxygen: Stratification and loading drove hypoxia during a warm, wet, and productive year in a Great Lakes estuary. Journal of Great Lakes Research 49: 1015–1028.
Eismann, S., J. Scheuner, E.V. Eyk, M. Schwinger, J. Grohmann, N. Herbst, C.L. Abad, and A. Iosup. 2021. Serverless applications: Why, when, and how? IEEE Software 38: 32–39.
Elliott, J.A., I.D. Jones, and S.J. Thackeray. 2006. Testing the sensitivity of phytoplankton communities to changes in water temperature and nutrient load, in a temperate lake. Hydrobiologia 559: 401–411.
Ellis, J.R., and N.B. Petrovskaya. 2020. A computational study of density-dependent individual movement and the formation of population clusters in two-dimensional spatial domains. Journal of Theoretical Biology 505: 110421.
Engel, J., L. Hertzog, J. Tiede, C. Wagg, A. Ebeling, H. Briesen, and W.W. Weisser. 2017. Pitfall trap sampling bias depends on body mass, temperature, and trap number: Insights from an individual-based model. Ecosphere 8: e01790.
Eyk, E., A. Iosup, S. Seif, and M. Thömmes. 2017. The SPEC cloud group's research vision on FaaS and serverless architectures. In Proceedings of the 2nd international workshop on serverless computing. association for computing machinery, Las Vegas, Nevada, 1–4.
Feng, T., C. Wang, P. Wang, J. Qian, and X. Wang. 2018. How physiological and physical processes contribute to the phenology of cyanobacterial blooms in large shallow lakes: A new Euler–Lagrangian coupled model. Water Research 140: 34–43.
Fennoochi, A., M. Rogora, G. Morabito, A. Marchetto, S. Sibilla, and C. Dresti. 2019. Applicability of a one-dimensional coupled ecological-hydrodynamic numerical model to future projections in a very deep large lake (Lake Maggiore, Northern Italy/Southern Switzerland). Ecological Modelling 392: 38–51.
Flynn, K.J. 2005. Castles built on sand: Dysfunctionality in plankton models and the inadequacy of dialogue between biologists and modellers. Journal of Plankton Research 27: 1205–1210.
Franks, P.J.S. 2009. Planktonic ecosystem models: Perplexing parameterizations and a failure to fail. Journal of Plankton Research 31: 1299–1306.
Geary, W.L., M. Bode, T.S. Doherty, E.A. Fulton, D.G. Nimmo, A.I.T. Tulloch, V.J.D. Tulloch, and E.G. Ritchie. 2020. A guide to ecosystem models and their environmental applications. Nature Ecology and Evolution 4: 1459–1471.
Gerst, M.D., M.A. Kenney, A.E. Baer, A. Speciale, J.F. Wolfinger, J. Gottschalck, S. Handel, M. Rosencrans, et al. 2020. Using visualization science to improve expert and public understanding of probabilistic temperature and precipitation outlooks. Weather, Climate, and Society 12: 117–133.
Gneiting, T., A.E. Raftery, A.H. Westveld, and T. Goldman. 2005. Calibrated probabilistic forecasting using ensemble model output statistics and minimum CRPS estimation. Monthly Weather Review 133: 1098–1118.
Gramacy, R.B. 2020. Surrogates: Gaussian process modeling, design and optimization for the applied sciences. Boca Raton: Chapman Hall, CRC.
Gupta, A., M.M. Hantush, and R.S. Govindaraju. 2023. Sub-monthly time scale forecasting of harmful algal blooms intensity in Lake Erie using remote sensing and machine learning. Science of the Total Environment 900: 165781.
Handler, A.M., J.E. Compton, R.A. Hill, S.G. Leibowitz, and B.A. Schaeffer. 2023. Identifying lakes at risk of toxic cyanobacterial blooms using satellite imagery and field surveys across the United States. Science of the Total Environment 869: 161784.
Harris, T.D., K.L. Reinl, M. Azarderakhsh, S.A. Berger, M.C. Berman, M. Bizic, R. Bhattacharya, S.H. Burnet, et al. 2024. What makes a cyanobacterial bloom disappear? A review of the abiotic and biotic cyanobacterial bloom loss factors. Harmful Algae 133: 102599.
Hellweger, F.L., E.S. Kravchuk, V. Novotny, and M.I. Gladyshev. 2008. Agent-based modeling of the complex life cycle of a cyanobacterium (Anabaena) in a shallow reservoir. Limnology and Oceanography 53: 1227–1241.
Ho, J.C., A.M. Michalak, and N. Pahlevan. 2019. Widespread global increase in intense lake phytoplankton blooms since the 1980s. Nature 574: 667–670.
Hobbs, N.T., and M. Hooten. 2015. Bayesian models: A statistical primer for ecologists. Princeton: Princeton University Press.
Hobday, A.J., J.R. Hartog, J.P. Manderson, K.E. Mills, M.J. Oliver, A.J. Pershing, and S. Siedlecki. 2019. Ethical considerations and unanticipated consequences associated with ecological forecasting for marine resources. ICES Journal of Marine Science 76: 1244–1256.
Hounshell, A.G., R.P. McClure, M.E. Lofton, and C.C. Carey. 2021. Whole-ecosystem oxygenation experiments reveal substantially greater hypolimnetic methane concentrations in reservoirs during anoxia. Limnology and Oceanography Letters 6: 33–42.
Isles, P.D.F., and F. Pomati. 2021. An operational framework for defining and forecasting phytoplankton blooms. Frontiers in Ecology and the Environment 19: 443–450.
Jackson-Blake, L.A., F. Clayer, S. Haande, J.E. Sample, and S.J. Moe. 2022. Seasonal forecasting of lake water quality and algal bloom risk using a continuous Gaussian Bayesian network. Hydrology and Earth System Sciences 26: 3103–3124.
Jakeman, A., B. Croke, and B. Fu. 2019. Uncertainty in environmental water quality modelling: Where do we stand? In New trends in urban drainage modelling. UDM 2018. Green energy and technology, ed. G. Mannina, 557–565. Cham: Springer.
Jakeman, A.J., R.A. Letcher, and J.P. Norton. 2006. Ten iterative steps in development and evaluation of environmental models. Environmental Modelling and Software 21: 602–614.
Jane, S.F., G.J.A. Hansen, B.M. Kraemer, P.R. Leavitt, J.L. Mincer, R.L. North, R.M. Pilla, J.T. Stetler, et al. 2021. Widespread deoxygenation of temperate lakes. Nature 594: 66–70.
Jochimsen, E., W. Carmichael, J. An, D. Cardo, S. Cookson, C. Holmes, M. Antunes, D. de Melo, et al. 1998. Liver failure and death after exposure to microcystins at a hemodialysis center in Brazil. New England Journal of Medicine 338: 873–878.
Joliffe, I.T., and D.B. Stephenson. 2012. Forecast verification: A practitioner’s guide in atmospheric science, 2nd ed. Oxford: Wiley-Blackwell.
Kamal, A., P. Dhakal, A.Y. Javaid, V.K. Devabhaktuni, D. Kaur, J. Zaientz, and R. Marinier. 2021. Recent advances and challenges in uncertainty visualization: A survey. Journal of Visualization 24: 861–890.
Kinkeldey, C., A.M. MacEachren, M. Riveiro, and J. Schiewe. 2017. Evaluating the effect of visually represented geodata uncertainty on decision-making: Systematic review, lessons learned, and recommendations. Cartography and Geographic Information Science 44: 1–21.
Kosten, S., V.L.M. Huszar, E. Bécares, L.S. Costa, E. van Donk, L.-A. Hansson, E. Jeppesen, C. Kruk, et al. 2012. Warmer climates boost cyanobacterial dominance in shallow lakes. Global Change Biology 18: 118–126.
Kraemer, B.M., T. Mehner, and R. Adrian. 2017. Reconciling the opposing effects of warming on phytoplankton biomass in 188 large lakes. Scientific Reports 7: 10762.
Kurosu, M., ed. 2011. Human centered design. Second International Conference, HCD 2011. Springer-Verlag Berlin Heidelberg: New York.
Lahoz, W.A., and P. Schneider. 2014. Data assimilation: making sense of Earth observation. Frontiers of Environmental Science and Engineering 2: 16.
Lapeyrolerie, M., and C. Boettiger. 2023. Limits to ecological forecasting: Estimating uncertainty for critical transitions with deep learning. Methods in Ecology and Evolution 14: 785–798.
Laraib, M., J. Titocci, I. Rosati, and A. Basset. 2023. An integrated individual-level trait-based phytoplankton dataset from transitional waters. Scientific Data 10: 897.
Lewis, A.S.L., W.M. Woelmer, H.L. Wander, D.W. Howard, J.W. Smith, R.P. McClure, M.E. Lofton, N.W. Hammond, et al. 2022. Increased adoption of best practices in ecological forecasting enables comparisons of forecastability. Ecological Applications 32: e2500.
Li, Z., R. Chard, Y. Babuji, B. Galewsky, T.J. Skluzacek, K. Nagaitsev, A. Woodard, B. Blaiszik, et al. 2022. funcX: Federated function as a service for science. IEEE Transactions on Parallel and Distributed Systems 33: 4948–4963.
Lin, S., D.C. Pierson, and J.P. Mesman. 2023. Prediction of algal blooms via data-driven machine learning models: An evaluation using data from a well-monitored mesotrophic lake. Geoscientific Model Development 16: 35–46.
Litchman, E. 2023. Understanding and predicting harmful algal blooms in a changing climate: A trait-based framework. Limnology and Oceanography Letters 8: 229–246.
Litchman, E., P. de Tezanos Pinto, C.A. Klausmeier, M.K. Thomas, and K. Yoshiyama. 2010. Linking traits to species diversity and community structure in phytoplankton. In Fifty years after the ‘“Homage to Santa Rosalia”’: Old and new paradigms on biodiversity in aquatic ecosystems, ed. L. Naselli-Flores and G. Rossetti, 15–28. Dordrecht: Springer.
Lofton, M.E., J.A. Brentrup, W.S. Beck, J.A. Zwart, R. Bhattacharya, L.S. Brighenti, S.H. Burnet, I.M. McCullough, et al. 2022. Using near-term forecasts and uncertainty partitioning to inform prediction of oligotrophic lake cyanobacterial density. Ecological Applications 32: e2590.
Lofton, M.E., D.W. Howard, R.Q. Thomas, and C.C. Carey. 2023. Progress and opportunities in advancing near-term forecasting of freshwater quality. Global Change Biology 29: 1691–1714.
Luo, Y., K. Ogle, C. Tucker, S. Fei, C. Gao, S. LaDeau, J.S. Clark, and D.S. Schimel. 2011. Ecological forecasting and data assimilation in a data-rich era. Ecological Applications 21: 1429–1442.
Lynn, T., P. Rosati, A. Lejeune, and V. Emeakaroha. 2017. A preliminary review of enterprise serverless cloud computing (Function-as-a-Service) platforms. In 2017 IEEE international conference on cloud computing technology and science (CloudCom), 162–169
Mampage, A., S. Karunasekera, and R. Buyya. 2022. A holistic view on resource management in serverless computing environments: Taxonomy and future directions. ACM Computing Surveys 54: 222.
McClure, R.P., R.Q. Thomas, M.E. Lofton, W.M. Woelmer, and C.C. Carey. 2021. Iterative forecasting improves near-term predictions of methane ebullition rates. Frontiers in Environmental Science 9: 756603. https://doi.org/10.3389/fenvs.2021.756603.
McIntosh, B.S., C. Giupponi, A.A. Voinov, C. Smith, K.B. Matthews, M. Monticino, M.J. Kolkman, N. Crossman, et al. 2008. Chapter three: Bridging the gaps between design and use: Developing tools to support environmental management and policy. In Developments in integrated environmental assessment, ed. A.J. Jakeman, A.A. Voinov, A.E. Rizzoli, and S.H. Chen, 33–48. Amsterdam: Elsevier.
Merder, J., T. Harris, G. Zhao, D.M. Stasinopoulos, R.A. Rigby, and A.M. Michalak. 2023. Geographic redistribution of microcystin hotspots in response to climate warming. Nature Water 1: 844–854.
Moore, T.N., R.Q. Thomas, W.M. Woelmer, and C.C. Carey. 2022. Integrating ecological forecasting into undergraduate ecology curricula with an R Shiny application-based teaching module. Forecasting 4: 604–633.
Mortimer, C.H. 1941. The exchange of dissolved substances between mud and water in lakes. Journal of Ecology 29: 280–329.
Mulder, K.J., L. Williams, M. Lickiss, A. Black, A. Charlton-Perez, R. McCloy, and E. McSorley. 2023. Understanding representations of uncertainty, an eye-tracking study—Part 1: The effect of anchoring. Geoscience Communication 6: 97–110.
Niu, S., Y. Luo, M.C. Dietze, T.F. Keenan, Z. Shi, J. Li, and F.S. Chapin III. 2014. The role of data assimilation in predictive ecology. Ecosphere 5: 65–16.
Olsson, F., C.C. Carey, C. Boettiger, G. Harrison, R. Ladwig, M.F. Lapeyrolerie, A.S.L. Lewis, M.E. Lofton, et al. 2024. What can we learn from 100,000 freshwater forecasts? A synthesis from the NEON ecological forecasting challenge. ESS Open. https://doi.org/10.22541/essoar.171458144.44104603/v1.
Olston, C., and J.D. Mackinlay. 2002. Visualizing data with bounded uncertainty. In IEEE symposium on information visualization, 2002. INFOVIS 2002, 37–40.
Oyebamiji, O.K., D.J. Wilkinson, P.G. Jayathilake, T.P. Curtis, S.P. Rushton, B. Li, and P. Gupta. 2017. Gaussian process emulation of an individual-based model simulation of microbial communities. Journal of Computational Science 22: 69–84.
Paerl, H.W., and J. Huisman. 2009. Climate change: A catalyst for global expansion of harmful cyanobacterial blooms. Environmental Microbiology Reports 1: 27–37.
Paerl, H.W., and V.J. Paul. 2012. Climate change: Links to global expansion of harmful cyanobacteria. Water Research 46: 1349–1363.
Page, T., P.J. Smith, K.J. Beven, I.D. Jones, J.A. Elliott, S.C. Maberly, E.B. Mackay, M. De Ville, et al. 2018. Adaptive forecasting of phytoplankton communities. Water Research 134: 74–85.
Pappenberger, F., M.H. Ramos, H.L. Cloke, F. Wetterhall, L. Alfieri, K. Bogner, A. Mueller, and P. Salamon. 2015. How do I know if my forecasts are better? Using benchmarks in hydrological ensemble prediction. Journal of Hydrology 522: 697–713.
Ralston, D.K., and S.K. Moore. 2020. Modeling harmful algal blooms in a changing climate. Harmful Algae 91: 101729.
Ramos, M.H., S.J. van Andel, and F. Pappenberger. 2013. Do probabilistic forecasts lead to better decisions? Hydrology and Earth System Science 17: 2219–2232.
Ranjbar, M.H., D.P. Hamilton, A. Etemad-Shahidi, and F. Helfer. 2021. Individual-based modelling of cyanobacteria blooms: Physical and physiological processes. Science of the Total Environment 792: 148418.
Reynolds, C.S. 2001. Emergence in pelagic communities. Scientia Marina 65: 5–30.
Reynolds, C.S. 2006. Ecology of phytoplankton. New York: Cambridge University Press.
Reynolds, C.S., and J.A. Elliott. 2012. Complexity and emergent properties in aquatic ecosystems: Predictability of ecosystem responses. Freshwater Biology 57: 74–90.
Robarts, R.D., and T. Zohary. 1984. Microcystis aeruginosa and underwater light attenuation in a hypertrophic lake (Harbeespoort Dam, South Africa). Journal of Ecology 72: 1001–1017.
Roberts, D.C., T.A. Boyer, and J.L. Lusk. 2008. Preferences for environmental quality under uncertainty. Ecological Economics 66: 584–593.
Rousso, B.Z., E. Bertone, R. Stewart, and D.P. Hamilton. 2020. A systematic literature review of forecasting and predictive models for cyanobacteria blooms in freshwater lakes. Water Research 182: 115959.
Rysgaard, S., N. Risgaard-Petersen, N. Sloth, K.I.M. Jensen, and L.P. Nielsen. 1994. Oxygen regulation of nitrification and denitrification in sediments. Limnology and Oceanography 39: 1643–1652.
Sacks, J., W.J. Welch, T.J. Mitchell, and H.P. Wynn. 1989. Design and analysis of computer experiments. Statistical Science 4: 409–423.
Scavia, D., Y.-C. Wang, and D.R. Obenour. 2023. Advancing freshwater ecological forecasts: Harmful algal blooms in Lake Erie. Science of the Total Environment 856: 158959.
Schaeffer, B.A., N. Reynolds, H. Ferriby, W. Salls, D. Smith, J.M. Johnston, and M. Myer. 2024. Forecasting freshwater cyanobacterial harmful algal blooms for Sentinel-3 satellite resolved U.S. lakes and reservoirs. Journal of Environmental Management 349: 119518.
Schleier-Smith, J., V. Skreekanti, A. Khandelwal, J. Carreira, N.J. Yadwadkar, R.A. Popa, J.E. Gonzalez, I. Stoica, et al. 2021. What serverless computing is and should become: The next phase of cloud computing. Communications of the ACM 64: 76–84.
Shafiei, H., A. Khonsari, and P. Mousavi. 2022. Serverless computing: A survey of opportunities, challenges, and applications. ACM Computing Surveys 54: 1–32.
Shi, W., J. Cao, Y. Zhang, Y. Li, and L. Xu. 2016. Edge computing: Vision and challenges. IEEE Internet of Things Journal 3: 637–646.
Silva, E., F. Counillon, J. Brajard, L.H. Pettersson, and L. Naustvoll. 2023. Forecasting harmful algae blooms: Application to Dinophysis acuminata in northern Norway. Harmful Algae 126: 102442.
Simon, R.N., and D. Fortin. 2020. Crop raiders in an ecological trap: Optimal foraging individual-based modeling quantifies the effect of alternate crops. Ecological Applications 30: e02111.
Smayda, T.J. 1997. What is a bloom? A commentary. Limnology and Oceanography 42: 1132–1136.
Smith, L.A. 2001. Disentangling uncertainty and error: On the predictability of nonlinear systems. In Nonlinear dynamics and statistics, ed. A.I. Mees, 31–64. Boston, MA: Birkhäuser Boston.
Smith, L.A., E.B. Suckling, E.L. Thompson, T. Maynard, and H. Du. 2015. Towards improving the framework for probabilistic forecast evaluation. Climatic Change. 132: 31–45.
Smith Mason, J., D. Retchless, and A. Klippel. 2017. Domains of uncertainty visualization research: A visual summary approach. Cartography and Geographic Information Science 44: 296–309.
Smucker, N.J., J.J. Beaulieu, C.T. Nietch, and J.L. Young. 2021. Increasingly severe cyanobacterial blooms and deep water hypoxia coincide with warming water temperatures in reservoirs. Global Change Biology 27: 2507–2519.
Soares, L.M.V., and M.C. Calijuri. 2021. Sensitivity and identifiability analyses of parameters for water quality modeling of subtropical reservoirs. Ecological Modelling 458: 109720.
Stumpf, R.P., L.T. Johnson, T.T. Wynne, and D.B. Baker. 2016. Forecasting annual cyanobacterial bloom biomass to inform management decisions in Lake Erie. Journal of Great Lakes Research 42: 1174–1183.
Taranu, Z.E., I. Gregory-Eaves, P.R. Leavitt, L. Bunting, T. Buchaca, J. Catalan, I. Domaizon, P. Guilizzoni, et al. 2015. Acceleration of cyanobacterial dominance in north temperate-subarctic lakes during the Anthropocene. Ecology Letters 18: 375–384.
Thomas, R.Q., R.J. Figueiredo, V. Daneshmand, B.J. Bookout, L.K. Puckett, and C.C. Carey. 2020. A near-term iterative forecasting system successfully predicts reservoir hydrodynamics and partitions uncertainty in real time. Water Resources Research 56: e2019WR026138.
Thomas, R.Q., R.P. McClure, T.N. Moore, W.M. Woelmer, C. Boettiger, R.J. Figueiredo, R.T. Hensley, and C.C. Carey. 2023. Near-term forecasts of NEON lakes reveal gradients of environmental predictability across the U.S. Frontiers in Ecology and the Environment 21: 220–226.
Tsattalios, S., I. Tsoukalas, P. Dimas, P. Kossieris, A. Efstratiadis, and C. Makropoulos. 2023. Advancing surrogate-based optimization of time-expensive environmental problems through adaptive multi-model search. Environmental Modelling and Software 162: 105639.
VAC. 2017. Waterworks and wastewater works operators licensing regulations. In Virginia assembly code. Section 54.1-2300. Virginia Register, Richmond, Virginia.
Varadharajan, C., A.P. Appling, B. Arora, D.S. Christianson, V.C. Hendrix, V. Kumar, A.R. Lima, J. Müller, et al. 2022. Can machine learning accelerate process understanding and decision-relevant predictions of river water quality? Hydrological Processes 36: e14565.
Wander, H.L., R.Q. Thomas, T.N. Moore, M.E. Lofton, A. Breef-Pilz, and C.C. Carey. 2024. Data assimilation experiments inform monitoring needs for near-term ecological forecasts in a eutrophic reservoir. Ecosphere 15: e4752.
Watson, S.B., C. Miller, G. Arhonditsis, G.L. Boyer, W. Carmichael, M.N. Charlton, R. Confesor, D.C. Depew, et al. 2016b. The re-eutrophication of Lake Erie: Harmful algal blooms and hypoxia. Harmful Algae 56: 44–66.
Watson, S., P. Monis, P. Baker, and S. Giglio. 2016a. Biochemistry and genetics of taste- and odor-producing cyanobacteria. Harmful Algae 54: 112–127.
Wiggins, A., A. Young, and M.A. Kenney. 2018. Exploring visual representations to support data re-use for interdisciplinary science. Proceedings of the Association for Information Science and Technology 55: 554–563.
Wilkinson, G.M., J.A. Walter, C.D. Buelo, and M.L. Pace. 2022. No evidence of widespread algal bloom intensification in hundreds of lakes. Frontiers in Ecology and the Environment 20: 16–21.
Willson, A.M., H. Gallo, J.A. Peters, A. Abeyta, N. Bueno Watts, C.C. Carey, T.N. Moore, et al. 2023. Assessing opportunities and inequities in undergraduate ecological forecasting education. Ecology and Evolution 13: e10001.
Woelmer, W.M., L.M. Bradley, L.T. Haber, D.H. Klinges, A.S.L. Lewis, E.J. Mohr, C.L. Torrens, K.I. Wheeler, et al. 2021. Ten simple rules for training yourself in an emerging field. PLOS Computational Biology 17: e1009440.
Woelmer, W.M., T.N. Moore, M.E. Lofton, R.Q. Thomas, and C.C. Carey. 2023. Embedding communication concepts in forecasting training increases students’ understanding of ecological uncertainty. Ecosphere 14: e4628.
Woelmer, W.M., R.Q. Thomas, M.E. Lofton, R.P. McClure, H.L. Wander, and C.C. Carey. 2022. Near-term phytoplankton forecasts reveal the effects of model time step and forecast horizon on predictability. Ecological Applications 32: e2642.
Woolway, R.I., S. Sharma, and J.P. Smol. 2022. Lakes in hot water: The impacts of a changing climate on aquatic ecosystems. BioScience 72: 1050–1061.
Wynne, T.T., R.P. Stumpf, M.C. Tomlinson, G.L. Fahnenstiel, J. Dyble, D.J. Schwab, and S.J. Joshi. 2013. Evolution of a cyanobacterial bloom forecast system in western Lake Erie: Development and initial evaluation. Journal of Great Lakes Research 39: 90–99.
Zhang, M., Y. Zhang, Y. Zhou, Y. Zhang, K. Shi, and C. Jiang. 2022. Influence of cyanobacterial bloom accumulation and dissipation on underwater light attenuation in a large and shallow lake. Environmental Science and Pollution Research 29: 79082–79094.
Zwart, J.A., O. Hararuk, Y.T. Prairie, S.E. Jones, and C.T. Solomon. 2019. Improving estimates and forecasts of lake carbon dynamics using data assimilation. Limnology and Oceanography: Methods 17: 97–111.
Zwart, J.A., S.K. Oliver, W.D. Watkins, J.M. Sadler, A.P. Appling, H.R. Corson-Dosch, X. Jia, V. Kumar, et al. 2023. Near-term forecasts of stream temperature using deep learning and data assimilation in support of management decisions. JAWRA Journal of the American Water Resources Association 59: 317–337.
Acknowledgements
We thank the Virginia Tech Rules of Life research team, which is supported by the U.S. National Science Foundation (NSF) Grant EF-2318861. This perspective evolved from a decade of collaboration with the Western Virginia Water Authority and their amazing staff, including Jamie Morris, Sarah Baumgardner, Roger Blankenship, Mike McEvoy, Scott Shirley, and Lisa Workman. Karsten Rinke and two anonymous reviewers provided helpful feedback. The co-author team was additionally supported by NSF Grants DBI-1933016, DBI-1933102, OISE-2330211, OAC-2311123, OAC-2311124, DEB-1926050, and DEB-2327030.
Author information
Authors and Affiliations
Contributions
Carey led the writing of the manuscript; all co-authors provided feedback and revised the text.
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Carey, C.C., Calder, R.S.D., Figueiredo, R.J. et al. A framework for developing a real-time lake phytoplankton forecasting system to support water quality management in the face of global change. Ambio (2024). https://doi.org/10.1007/s13280-024-02076-7
Received:
Revised:
Accepted:
Published:
DOI: https://doi.org/10.1007/s13280-024-02076-7