Global average temperatures were relatively constant, with some annual variation, from the early 1800s to approximately the mid-twentieth century prior to a sharp increase after the mid-twentieth century (Intergovernmental Panel on Climate Change (IPCC), Contribution of Working Groups I, II and III to the Fifth Assessment Report of the Intergovernmental Panel on Climate Change, 2014 [19, 23]. The global average temperature has increased by 0.67 degrees Celsius since 1986, surpassing the recorded increase in the 59 year period prior to 1986 [17]. This pattern of global average temperature change was described by Mann et al. [23] using a hockey-stick graph. In this study, Mann et al. [23] depicted a recent changepoint, within the past century, in the increased occurrence of temperature anomalies. The hockey-stick model is characterized by a flat left line segment and a sloped right line segment that meet at a changepoint (Fig. 1).
Because temperature is a key feature of climate change, many phenological phenomena respond to climate change in a similar pattern that can also be represented by the hockey-stick model [11, 36]. For example, Qian [30] used the hockey-stick model to describe the temporal changes in the first bloom dates of North American lilacs, showing that lilacs have bloomed earlier in recent years based on multiple graphs with the day-of-the-year on the y-axis and year on the x-axis. Additionally, Dose and Menzel [11] applied three different models to the flowering dates of Cherry blossom, Snow drop blossom, and Lime tree blossoms. They concluded that the hockey-stick, or changepoint, model best represented the phenological data examined [11]. A similar analysis was conducted on common plant species located several parts of Europe by Schleip et al. [36]. This study found the comparable results to Does and Menzel [11], showing that a hockey-stick model explained the phenological phenomena more than any other model explored (Shleip et al. 2009). The research from Qian [30], Dose and Menzel [11], and Shleip et al. [36] shows that while the hockey-stick model is a simplification of complex processes, it resembles the pattern observed in data well and provides more useful information than a traditional linear model does. The hockey-stick trend observed in phenological phenomena is reasonable because the rise in global greenhouse gas concentrations, the mechanism that underlies the rise in global temperature, follows a pattern summarized by the hockey-stick model: a relatively constant trend rapidly transitioning to a steep rising trend (Supplemental Fig. 1 available at GitHub.com/StephAnnieNummer/Lilac_HockeyStick_BHM, [26]). This rise in global temperature in recent years can be attributed to the increase in greenhouse gasses from anthropogenic sources since the mid-1800s [26].
Other events also experience changes induced by the recent changes in temperature due to the increase in greenhouse gas concentrations in the atmosphere. Hayhoe et al. [17] reported changes in temperature extremes associated with the global temperature increase, including surges in heat and cold wave frequencies. Phenological events throughout the world are experiencing changes in association with the increase in global average temperature as well [1, 9, 43]. Ahas et al. [1] examined the phenological shifts of six different plant species in Europe in response to changes in climate and temperature and found that areas with snow coverages are presenting with the greatest change. Additionally, Walther et al. [43] compiled evidence of ecological response to climate change including shifts in bird breeding and migration, earlier butterfly appearances, changes in the spawning of amphibians, and earlier trends in plant phenology. Chen [9] examined and reviewed phenological events across different regions in China and found shifts including changes in green-up dates and first leaf unfolding dates.
Several examples of these phenological shifts and phenomena have been the subject of research. Bird populations are declining or are at risk of declining because food availability is peaking earlier than when the birds arrive due to the warming temperature [4]. This change in peak food availability is a potential driver for bird breeding season to shift earlier. Cherry blossom blooming is an iconic Washington D.C. event, and this city has an almost century-long record of cherry blossom peak bloom dates. This dataset illustrates a shift in the peak bloom dates of these cherry trees (Fig. 2). Furthermore, Mason et al. [24] found that the duration of Great Lakes ice coverage has decreased. Great Lakes ice cover duration, mismatch between bird migration and food availability, and the Washington D.C. cherry blossom peak bloom dates are just a few examples of a large body of evidence of phenological shifts [1, 5, 28, 35, 37, 43].
North American lilacs first bloom dates are another phenological event that responds to the change in global average temperature. Schwartz [38] found that phenological data, including North American lilac first bloom dates, can be used as a missing link for satellite observations and thus adds to the need for cooperative research efforts to understand ecological questions. Schwartz et al. [40] used the North American lilac first bloom dates as an example of this cooperative research efforts because it is one of the first extensive phenological datasets in the USA. Schwartz and Reiter [41] and Wolfe et al. [44] analyzed portions of the North American lilac first bloom data using a linear model and found a shift of 0.14 and 0.092 days earlier each year in first bloom dates, respectively. Earlier blooming of North American lilacs was identified by Cayan et al. [8] using a linear model while Brunsdon and Comber [5] used a multilevel modeling to find a similar trend. Recently, Gerst et al. [16] evaluated Spring Indices applied to North American lilac and honeysuckle phenological data by Schwartz et al. [39] and concluded that these indices are generally a good proxy of observed phenological phenomena in North American lilacs and honeysuckle.
The typically used linear model for phenological studies (e.g., [1, 17] almost always underestimates the magnitude of change when the data include observations made before the onset of climate change effect [30]. Instead, research shows that a hockey-stick model explains phenological data better than a traditional linear regression by avoiding potential overestimation prior to the onset of climate change and underestimation after the onset of climate change [11, 28, 35]. A hockey stick model (or piecewise linear model) can be computationally challenging because of the discontinuity in its first derivatives. An early solution to the difficulty is to introduce smooth connection between the two intersecting line segments [2]. This approach is implemented in Qian [30]. However, using nonlinear regression requires appropriate starting values for all coefficients, a tedious trial and error process. Although Bayesian method is often computationally more intensive, the use of Markov chain Monte Carlo simulation made these computations trackable [32]. Consequently, we propose the use of the hockey-stick model as a modeling framework to describe the general pattern of temperature-sensitive phenomena or events in response to climate change.
Using a hockey-stick model, we can better describe the basal mechanisms, or forcing function, that led to the changes occurring in the phenological variable of interest, as well as the underlying pattern of long-term global average temperature change. The model provides two key parameters that can help us retrospectively estimate the time when the effect of climate change initiated and the magnitude of the change. These two key parameters are the changepoint which represents when the impacts of climate change on the phenological response began and the slope of the line after the changepoint which shows the rate at which the response is changing. We used Bayesian computational method to expand the model to include data from multiple locations.
The two key parameters of the hockey-stick model can lend more information to researchers and better describe phenological data than a traditional linear model [11, 28, 35]. Does and Menzel [11] found that the one-changepoint model (i.e., hockey-stick model) was the optimal model for the application to phenological data. A further understanding of this was provided by, Schleip et al. [35] who compared a constant, linear, and hockey-stick model. The hockey-stick model represented the pattern of phenological data the best. In addition, Pope et al. [28] found the hockey-stick model ideal for modeling the spring response of phenological data. Qian [30] developed an R function that easily fit the hockey-stick model. When applied to North American lilac first bloom dates from four Pacific Northwest locations, the estimated changepoints range from 1974 to 1983 [30]. The same model showed that the climate change effect on Washington, D.C. cherry blossom likely started as early as mid-1960 s and the peak bloom date has since moved about 6–7 days earlier (Fig. 2).
Phenological data, such as first bloom dates of the North American lilac, are inevitably location-specific because weather and climate patterns vary geographically [5]. As such, combining phenological data from different locations are unadvisable and we typically analyze these data by location. However, phenological data from a single location are almost always noisy because of the natural variation in weather. Due to this natural variation in day-to-day and year-to-year weather, we may not be able to see the hockey-stick pattern clearly, just as the long-term mean temperature signal [3]. Thus, accurately depicting the underlying pattern requires a long-term record at a single location. Even with the century-long Washington D.C. cherry blossom data, fitting a hockey-stick model is still unconvincing compared to a simple linear model. Brunsdon and Comber [5] proposed using the multilevel modeling approach that applies the intended model to each group, often geographic location, and partially pool all groups together to improve statistical power [15, 30]. The application of a multilevel, or Bayesian hierarchical, modeling approach provides a practical means to reduce site-specific estimation uncertainty through partially pooling the data from multiple locations [5, 15].
While research has documented when changes in global temperature began to impact phenological responses, this research typically follows one of two approaches: (1) employing a linear regression while using a multilevel model [5, 10] or (2) applying a hockey-stick model to the data from all locations together as one [35]. The goal of our research is to combine these two methods by developing a Bayesian hierarchical hockey-stick model for partially pooling phenological data from multiple locations to better differentiate region-specific patterns of response, as well as the aggregated response pattern.
The general premise of a Bayesian hierarchical modeling approach can be traced back to Stein’s paradox and empirical Bayes methods, which reduces overall estimation uncertainty by partially pooling information from similar observations made in multiple locations (see [12] for an extensive review and [33] for recent references). Although the computational complexity of applying the hockey-stick model [32] to data from a single location is no longer an issue [30], a stable maximum likelihood estimator of a multilevel hockey-stick model is still unavailable. Consequently, we opt to use the Bayesian computational method via Markov chain Monte Carlo simulation. The North American lilac phenological dataset, compiled by Schwartz and Reiter [41], includes annual first bloom and leaf dates for the common lilac (Syringa vulgaris) from 1956–2003 and collected at 1126 locations across North America. This well-studied dataset is ideal for our study because of its long-term records, availability, and large areal coverage [5, 8, 41]. The Bayesian hierarchical hockey-stick model in this study will bring together both the suggested model for phenological data and the need for a hierarchical structure to organize the data from different locations. By using this model, we can quantify when biological systems began to respond to the change in global average temperature and the rate at which they are responding, both at individual locations and at a continental scale.