1 Introduction

The atmospheric circulation of Central America and Mexico plays an important role in the global climate. This includes the close connection between the region and large–scale climate variability, such as El Niño–Southern Oscillation (ENSO) and the North Atlantic Subtropical High (NASH) (Díaz et al. 1994; Mapes et al. 2005; Curtis and Gamble 2008; Gamble et al. 2008; Peralta-Hernández et al. 2008). The region is characterized by steep topography and is bounded to the west by the North Pacific Ocean and to the east by the North Atlantic Ocean (specifically the Gulf of Mexico and the Caribbean Sea). The strong easterly trade winds are modified by this topography across Central America and Mexico, dividing the whole area into two major climate regions identified here as the Pacific side and the Caribbean side.

On the Pacific side of the domain, the annual precipitation cycle is largely dominated by a bimodal distribution (Magaña et al. 1999; Taylor and Alfaro 2005; Amador et al. 2006), characterized by the first precipitation peak between May and June, a reduction in precipitation typically between July and August, and a second precipitation peak between late August and early October. During the first precipitation peak, the warming of the eastern Pacific enhances convection and is accompanied by a weakening of the easterlies and cloudier skies. Then, the decrease in precipitation, known as the “mid-summer drought” (Magaña et al. 1999), follows with the cooling of the ocean around the Pacific coast from July to August, characterized by suppressed convection and clearer skies. During the second precipitation peak, there are relatively weak trade winds and a warmer eastern Pacific, accompanied by enhanced convection and associated cloudiness. The mid-summer drought (MSD) can represent a precipitation reduction by up to 40% (Small et al. 2007), a change that amounts to a dominant proportion of the annual precipitation variability (Curtis 2002).

The annual rainfall over the Caribbean side is more complex in terms of the spatial variability. While some studies report that MSDs are generally absent along the Caribbean coast of Central America (Amador 1998, 2008; Magaña et al. 1999; Taylor and Alfaro 2005; Small et al. 2007), it has been observed that similar bimodal precipitation cycles exist over the Greater Antilles (Taylor et al. 2002; Spence et al. 2004; Ashby et al. 2005) and parts of the Caribbean side of Central America (Maldonado et al. 2016). Similar to MSDs on the Pacific side of Central America and Mexico, the rainfall reduction over the Caribbean side demonstrates a tight connection with the Caribbean Low-Level Jet (CLLJ) through a moisture transport mechanism that suppresses the convection system, inducing drier conditions during July to August (Muñoz et al. 2008; Amador 2008; Whyte et al. 2008). However, it should be noted that the MSD over the Caribbean side has been shown to be less frequent and weaker compared to that over the Pacific side (Zhao et al. 2020; Maurer et al. 2022).

Since the MSD was first identified in the 1960s (Portig 1961), various theories have been proposed to explain its origins and the development of its bimodal signature. Magaña et al. (1999) suggested that the MSD–sea surface temperature (SST)–radiation relationship, which uses the enhancement/suppression of convection induced by the warming/cooling of SST, explains the bimodal precipitation over the Pacific coast of Central America and Mexico. This theory was later revised recognizing the important role of the trade winds, and importantly the CLLJ, in the development of the MSD (Magaña and Caetano 2005; Herrera et al. 2015). However, a recent study found that neither reanalysis nor model simulations showed much evidence on this signature (García-Franco et al. 2022). The NASH and its westward intensification have been recognized as an important contributor to the development of the MSD in various studies, which induces an associated strengthening of the CLLJ moisture transport (Mestas-Nuñez et al. 2007; Small et al. 2007; Wang et al. 2007, 2008). Other factors, including mountain—gap winds over Central America (Romero-Centeno et al. 2003, 2007), the seasonality of tropical cyclones (Curtis 2002; Inoue et al. 2002; Small et al. 2007; Liebmann et al. 2008), vertical wind shear and atmospheric particles (Angeles et al. 2010), solar declination (Karnauskas et al. 2013), and the Madden–Julian Oscillation (MJO: Perdigón-Morales et al. 2019; Zhao et al. 2019), have also been shown to play a role in the generation or modulation of MSDs and their associated large- or local-scale climate patterns.

Although numerous studies have focused on the physical mechanisms that cause MSDs and the signatures and annual characteristics of the bimodal precipitation, including its detection and quantification of its climatology and variability, these studies may lack fine-scale detail of the temporal and spatial variability. Mosiño and García (1966) determined MSDs by finding consecutive months characterized by decreased precipitation relative to those exhibiting precipitation maxima that bounded them during the rainy season (May to October). After slight modification, this method was later applied to examine MSDs over Mexico using Climate Hazards Group InfraRed Precipitation with Station data and subsequently quantified their duration and intensity using a self–defined monthly index (Perdigón-Morales et al. 2018). Using second–order harmonic regression, Curtis (2002) demonstrated that the bimodal signature of the MSD accounted for most annual precipitation variability over Central America, and this tendency can be strengthened during positive ENSO phases. Karnauskas et al. (2013) provided the first global distribution of bimodal precipitation using several observations, demonstrating that the MSD is a climate signal that is detectable at the global scale, albeit that it is significant in specific regions, including Central America and Mexico. Most previous studies that have examined the signature of MSDs were conducted using monthly precipitation data from an average annual cycle, limiting the potential of the results to reveal the interannual variability of MSDs at fine resolution. Studies using daily precipitation to characterize fine–scale MSDs are relatively rare. Using a modified version of an alternative method for indexing timing, duration, and intensity of MSDs (Alfaro 2013, 2014; Alfaro and Hidalgo 2017), Maldonado et al. (2016) argued that MSDs can exist over both the Caribbean and Pacific sides of Central America, and the intensity and magnitude of those MSDs on the Pacific side are more significantly connected with the CLLJ and ENSO. Using a modified version of the method from Anderson et al. (2019), Maurer et al. (2022) showed that the selection of a minimum threshold had significant implication on the characterization of MSDs.

Here, we investigate the MSD across Central America and Mexico using daily precipitation observations, with a focus on identifying the: (1) mean states and variability, (2) connection with large–scale modes of climate variability (specifically, ENSO and the MJO), and (3) potential to skilfully statistically model the MSD. This study aims to provide a comprehensive analysis of MSD signatures over the targeted domain in a fine scale, contributing to the general understanding of this regionally—pronounced climate features.

2 Data and methods

2.1 Data

This study uses the Climate Prediction Center (CPC) global unified gauge-based analysis of daily precipitation (Xie et al. 2007; Chen et al. 2008) from 1979 to 2017 to quantify the bimodal signature of precipitation annually and interannually across Central America and Mexico; the domain is shown in Fig. 1. The CPC data were constructed on a 0.5° \(\times\) 0.5° spatial grid using observed data from various sources. In this dataset, the daily climatology was calculated by summing the first six harmonics of the annual climatological (average) cycle, with the adjustment from the Parameter–Elevation Regressions on Independent Slopes Model (PRISM) monthly climatology (Daly et al. 1994, 2002), which is a climate analysis system providing criteria for the evaluation of climate properties. The daily precipitation ratio with respect to the climatology was obtained by optimal interpolation methods. The CPC data have been widely used in previous studies associated with precipitation (e.g., Preethi et al. 2011; Hou et al. 2014) and their quality have been evaluated against both observations and model simulations using various statistical methods (e.g., Katiraie-Boroujerdy et al. 2013; Rana, et al. 2015).

Fig. 1
figure 1

Domain and bathymetry in this study, including southern Mexico, Central America, northern South America, eastern Pacific, and parts of North Atlantic

In this study, properties used to characterize climate states across the region, including 2 m (above the surface) temperature, 10 m horizontal winds, and surface pressure, were extracted from the ERA-Interim reanalysis data (Dee et al. 2011) for the domain (120° W–60° W, 0°–30° N) on a 0.5° \(\times\) 0.5° spatial grid, from 1979 to 2017. The use of reanalysed near-surface winds in this study is based on the assumption that the topography has been adequately described by reanalysis, which is not always the case. However, the reanalysis data provided by the European Centre for Medium-Range Weather Forecasts (ECMWF) have been used to characterize the MSD patterns in some previous studies (e.g., Zhao et al. 2020; García-Franco et al. 2022), showing that it has the capability to simulate the MSD–associated atmospheric variabilities. The gridded three-hourly data were averaged into daily values. Sea surface temperature (SST) data from 1982 to 2017 were obtained from the NOAA OI SST V2 dataset (Reynolds et al. 2007; Huang et al. 2021) over the same domain on a 0.25° \(\times\) 0.25° grid, while for the period from 1979 to 1981 the data were obtained by re-gridding ERA–Interim reanalysis data onto a 0.25° \(\times\) 0.25° grid since NOAA data do not cover this period.

The ENSO index used in this study is the Oceanic Niño Index (ONI) based on ERSSTv5 data (Huang et al. 2017), which is calculated as 3-month running mean SST anomalies in the Niño3.4 region (5° N–5° S, 120°–170° W). Warm and cold periods over the record are identified when the ± 0.5 °C threshold is met for five consecutive overlapping three-month periods, or longer. Subsequently, El Niño/La Niña years are determined by finding the years when ONI values, in at least five months during September to next year’s February, meet the threshold. The original ONI from September to February is averaged in each year to get the annual ONI, which is used to quantify the interannually-varying phase and magnitude of ENSO.

The MJO index used in this study was developed using a seasonally independent index based on paired empirical orthogonal functions of the combined fields of near-equatorially averaged 200 hPa and 850 hPa zonal winds, and satellite-observations of outgoing longwave radiation (OLR) data; the resultant first two principal component (PC) time series are termed the Real–time Multivariate MJO series 1 and 2 (RMM1 and RMM2: Wheeler and Hendon 2004). This index has been shown to effectively capture the interannual modulation of the global variance induced by the MJO, as well as having the potential to reconstruct a variety of weather patterns induced by the variance in MJO activity. Based on the RMM index, daily MJO responses are classified into eight phases, which demonstrate an approximate propagation from phase 1 to phase 8.

2.2 MSD detecting algorithms

The detection of annual MSD events is achieved using the method recently proposed by Zhao et al. (2020). In this method, the MSD signal in a precipitation time series is detected from the first peak maximum (onset of MSD) of the bimodal precipitation to the second peak maximum (end of MSD), covering the whole period of the precipitation reduction. This detection was achieved by two consecutive steps. First, the seasonally varying climatology of precipitation was smoothed using a 31–day Gaussian filter at each grid point, and the climatological MSD signal at each grid point was determined following three criteria: (1) two local precipitation peaks, P1 and P2, should exist separately within the periods from May 15th to July 15th and August 15th to October 15th, respectively, and their corresponding dates recorded as onset and end dates; (2) the annual maximum precipitation value should be on either the onset or end date; and (3) the linear trend fitted from the first day of the year to the onset date should be positive and statistically significant at the 95% confidence level, and the trend fitted from the MSD-end date to the last day of the year should be negative and also statistically significant at the 95% level; the locations where these precipitation criteria are met are termed the MSD areas. Second, the same criteria were applied to annual precipitation smoothed using a 31-day Gaussian filter at each grid point across the MSD area to show the distribution of the annual MSD signals. Compared to previous methods, most of which detect and quantify the MSD at the monthly climatological scale, this algorithm ensures that the MSD signals are resolved at the daily scale, enabling a more practical detection for studies focusing on MSD signatures (Zhao and Zhang, 2021). The code is publicly available via https://github.com/ZijieZhaoMMHW.

During the detection, some metrics are determined to describe or quantify the corresponding MSD signal, including onset, end and peak (corresponding to the minimum precipitation during MSD signals) dates, duration (period from onset to end date) and intensity of MSD (Imsd; García-Martínez 2015) calculated following \({I}_{msd}=\frac{{P}_{max}-{P}_{min}}{{P}_{max}}\), where Pmax is the larger one in precipitation on onset and end dates and Pmin is the average precipitation through the MSD signal. In this study, the detection of annual MSD signals was executed for CPC precipitation data over the period from 1979 to 2017 in the domain (120° W–60° W, 0°–30°).

2.3 Anomalies and composites

The anomaly for a particular time series was calculated by removing the climatological annual cycle in Julian days, where data on February 29th in each non-leap year was filled by the mean of that on February 28th and March 1st. This step may also be achieved by removing the first several harmonics (e.g., Oliver and Holbrook 2018), but we calculated this here based on the elimination of means in each corresponding Julian day to get a more robust signature of the long–term climatology. Here, anomalies were calculated for CPC precipitation, 2 m temperature, 10 m horizontal winds, surface pressure and SST.

The climate composite (normally termed as the ‘composite’ in this study), used to characterize the mean states of a climate property through a particular time-period, is calculated by temporally averaging the climate property across the corresponding time points. The advantage of this approach is to ensure the continuity of the calculated composite at both the spatial and temporal scales, which could generate more physically explainable patterns. Similar approaches have been used by Oliver et al. (2018).

3 Results

3.1 Mean states

The climatological signatures of detected MSDs over Central America and Mexico, including frequency (Fig. 2a), mean onset (Fig. 2b), peak (Fig. 2c) and end (Fig. 2d) dates, and calculated climatological MSD intensity (Imsd) (Fig. 2f), show significant spatial variabilities. The generally high frequencies of the detected MSD signals reveal robust bimodal characteristics of annual precipitation over most parts of the domain, including the Pacific coast of southern Mexico and Central America, Yucatán Peninsula, coasts around the Gulf of Mexico and Cuba. To the southeast, detected MSD signals tend to start earlier and end later, corresponding to relatively longer climatological durations (Fig. 2e). Compared to other signatures, the mean peak dates of the MSD signals show higher spatial variabilities, indicated by more noisy signals and irregular spatial patterns. Several areas exhibiting climatologically intense and long MSDs can be identified, including the Pacific coast of Central America, and coastal areas around the Gulf of Mexico and Cuba.

Fig. 2
figure 2

Metrics of MSD signatures over lands of Central America, Mexico and northern South America and Caribbean islands. Six panels separately indicate a annual frequency, b average Julian onset, c peak and d end dates, e average durations and f climatological Imsd. Regions exhibiting MSD signatures are shaded by colours. The figure is developed from Fig. 5 in Zhao and Zhang (2021)

The temporal variability of MSD signals is also examined. The proportion of areas exhibiting MSD events demonstrates a time series with high interannual frequency (Fig. 3a). The time series does not have significant autocorrelation (Fig. 3b), implying its predictability cannot be modelled by an autoregressive model, and which has been widely used in studies associated with predictions of climate indexes (e.g., Seo et al. 2009; Ubilava and Helmers 2013; Oliver and Thompson 2016). The time series is also shown to have no correlations with other large–scale climate modes, such as ENSO and the Pacific decadal oscillation (PDO) (not shown here), which is consistent with previous research (Fallas-López and Alfaro 2012). Additionally, there is no significant linear trend in the time series. Generally, the annual frequency of MSD signals over the domain can thus be interpreted as a complex time series with significant interannual variability, which may not be well modelled by its own autocorrelation characteristics.

Fig. 3
figure 3

Total number of MSD events in each year and its autocorrelation. a The time series of annual number of detected MSD signals summed over the whole domain. b The autocorrelation of time series in a in varying lags; the statistical significance in 95% level is indicated by blue line

By spatially averaging the major characteristics of MSD signals (onset, peak and end dates, duration, and Imsd) in each year, the temporal variabilities of MSD signatures are explored. The cross-correlation among the five signatures and annual ONI show a well-organized pattern (Fig. 4). The onset/end date has a significant negative/positive correlation with the duration, which is intuitively reasonable. Imsd is positively correlated with MSD duration, indicating that a MSD signal with stronger intensity is also generally part of a longer duration event. It is specifically notable that the annual ONI is positively correlated with both the duration and Imsd, indicating that stronger and longer MSD signals tend to exist in El Niño years instead of La Niña years.

Fig. 4
figure 4

Cross correlation among metrics of MSD signals, including onset, end, and peak dates, durations and Imsd, and ONI index. The strength of red (blue) indicates negative (positive) correlation, in 95% statistical significance level

Large-scale climate composites of key characteristics of MSD signals are also examined. The climate composites of six properties (anomalies of precipitation, 2 m temperature, 10 m horizontal winds, surface pressure and sea surface temperature) over the domain were calculated for three time points (onset, peak and end dates of the MSD signals) to illustrate the changes of climate patterns during the generation of the MSD (Fig. 5). The anomalously positive precipitation over the domain exists during the onset and end dates of MSD signals, while opposite patterns are determined for peak dates, corresponding to the bimodal characteristics of MSD signals. Significant wind shifting during the development of MSD signals is notable. When the MSD starts over Central America and Mexico, the Pacific coast of Central America and coastal areas around the Gulf of Mexico—where robust MSD signals are found—are dominated by onshore wind anomalies. These enhanced onshore winds together contribute to a low-pressure (cyclonic) system, correspondingly inducing anomalously negative near surface temperature anomalies over both land and ocean. Then, these onshore wind anomalies transition into enhanced offshore wind anomalies on the peak dates of the MSD signals, accompanied by a corresponding high pressure (anticyclonic) system and anomalously positive near surface temperature. The climate composites on the end dates of the MSD signals are remarkably similar to those on the onset dates, indicating the retrieval process from peak to end of MSDs. The precipitation reduction on the peak date of the MSD is accompanied by strong trade wind easterlies, which may be largely due to the enhancement of the CLLJ during July to August, transporting moisture (flux) from the Caribbean region and subsequently suppressing the convection. The influence of the NASH is not significant in the climatological composites.

Fig. 5
figure 5

Climate composites at characteristic time points of MSD. Each column indicates composites in a particular MSD period (onset, peak and end dates), while each row represents a particular atmospheric property (anomalies of precipitation, 2 m temperature, 10 m horizontal winds, surface pressure and sea surface temperature)

3.2 Connections with ENSO and the MJO

According to previous studies (Magaña et al. 1999, 2003; Chen and Taylor 2002; Curtis 2002; Peralta-Hernández et al. 2008; Hidalgo et al. 2017), MSDs demonstrate strong interannual variabilities. MSDs over this region can be modulated by ENSO and/or the MJO, as have been shown previously for the generation and development of the MSD in Costa Rica (Martin and Schumacher 2011; Perdigón-Morales et al. 2019; Zhao et al. 2019, 2020). In this section, we further explore the connection between MSDs and both ENSO (Sect. 3.1) and the MJO (Sect. 3.2) across Central America and Mexico over the period from 1979 to 2017.

3.2.1 ENSO

The climatological precipitation over Central America and Mexico in each ENSO phase is separately calculated for June, July, August and September. Choosing the three time-segments is undertaken to better illustrate the variabilities of precipitation during common MSD periods (e.g., Rauscher et al. 2008; Corrales‐Suastegui et al. 2020). In El Niño years, generally negative precipitation anomalies exist in most parts of the region, including the Pacific Coast of the domain and coastal areas around the Gulf of Mexico, where MSD signals with high frequency are detected from June to September (Fig. 6a–c). It is notable that areas exhibiting bimodal precipitations, such as the Pacific side of Central America, tend to be dominated by suppressed precipitation during July and August in positive ENSO phases (El Niño). Opposite patterns exist in La Niña years (Fig. 6d–f), indicating relatively wet rainy seasons in the July–August period over most parts of the domain.

Fig. 6
figure 6

Climatological precipitation across June to September in different ENSO phases. Each row indicates a particular period for climatological precipitation (June, July–August and September), while each column indicates a particular ENSO phase (El Niño, La Niña and neutral years). Dots here indicate statistical significance in 95% confidence intervals

The climate composites of surface pressure and 10 m winds in each ENSO phase at onset, peak and end dates are shown in Fig. 7. Compared to climatological composites shown in Fig. 5, some biases exist in these patterns. For ENSO-neutral years, the composites of near–surface winds and pressures are broadly similar to the climatological states (cf. Fig. 5g–i), except with relatively weak cyclonic/anticyclonic patterns and more significant influences of the NASH. For El Niño years, the anomalous cyclonic system during the onset of the MSD exists in the Caribbean Sea region; on the peak MSD dates, the climatological onshore wind anomalies at the Pacific coast of Central America (Fig. 5i) transition into anomalous easterly winds from the Caribbean Sea, which is due to the westward extension of the NASH. The influence of the NASH in El Niño years is more significant, shown by the westward extension of the high-pressure center through the entire MSD period. For La Niña years, the wind-pressure patterns at the MSD onset and end dates are similar to the climatological patterns, while anomalous westerly winds pass through Central America on the peak dates, making the climatological anticyclonic system (Fig. 5h) absent. For precipitation anomalies in El Niño years, the generally drier June and September reveal that the precipitation during the two months is lower than climatology. The two months may still be in potential MSD periods, and the onset/end dates of corresponding MSD events may thus be extended to some days in May/October. These features indicate relatively long MSD periods (shown by longer durations of MSD signals) during El Niño years, which is consistent with the positive correlation between MSD durations and ONI shown in Fig. 4. The anomalously high precipitation on the peak dates of the MSD in La Niña years indicates a relatively “shallow” trough of precipitation reduction, corresponding to relatively weak MSD signals (shown by the smaller Imsd).

Fig. 7
figure 7

Composites of 10 m horizontal winds and surface pressure across all detected MSD events in different ENSO phases. Each row indicates a particular time point in the MSD signature (onset, peak and end dates of MSD signals), while each column indicates a particular ENSO phase (El Niño, La Niña and neutral years). Dots here indicate statistical significance in 95% confidence intervals

Generally, ENSO’s modulation of the MSD over Central America is achieved by modifying the low–level wind–pressure system. In positive ENSO phases (El Niño years), the NASH strengthens during the MSD, especially from the peak to end dates. On the peak date of the MSD, the westward extension of the NASH brings stronger easterlies, inducing a more intense CLLJ. The intensified CLLJ can last to late boreal summer and remain through to the end date of the MSD, inducing a drier MSD throughout, since the moisture flux associated with the easterly flow suppresses the convection. This feature makes MSDs in El Niño years more intense (larger Imsd) and longer, resulting in a generally drier summer. In negative ENSO phases (La Niña years), the influence of the NASH tends to be insignificant and the peak of the CLLJ is suppressed. The easterlies on the peak date of the MSD are replaced by strong westerlies from the Pacific, inducing a wetter MSD period and a “shallower” MSD trough.

Another factor to induce the synchronicity between the change of wind–pressure patterns modulated by ENSO and precipitation during the MSD may be the interaction between the winds and topography—that is, the onshore/offshore winds and orographic forcing associated with steep mountainous terrains. Due to the orographic uplift, the interaction between the onshore winds and orography act to enhance the precipitation, while offshore winds tend to have the opposite effect. The shifting of wind patterns over the major MSD areas (the Pacific Coast of Central America and coasts around the Gulf of Mexico) can contribute to the bimodal shape of MSD signals. Therefore, it can be deduced that changes of wind patterns in ENSO years may influence the interannual variability of MSD signals. Similar features have been observed in Central America (e.g., Zhao et al. 2020).

3.2.2 MJO

To analyze the connection between the MSD signals across the domain and the MJO (RMM1 and RMM2), each detected MSD signal is categorized into four periods. To represent the typical signatures of the MSD signals in the different MJO phases, each period is separated based on a particular percentile. For each MSD signal, P1 is from the onset date to the 30th percentile between the onset date and the peak date, while P2 follows P1 and ends in the peak date. Similarly, P3 follows P2 and ends in the 70th percentile between peak dates and end dates, while P4 covers the remaining period. This is illustrated in Fig. 8. Each period of the MSD has a physical meaning. P1 and P4 represent relatively short periods during the onset and end of MSD signals, so they could be intuitively named as the “onset period” and “end period”. We refer to P2 as the “development period”, which spans between the onset and peak date, that is the period when precipitation tends to reduce. We refer to P3 as the “recovery (or decay) period”, when the summer precipitation recovers towards its second peak.

Fig. 8
figure 8

The synoptic illustration for the category of MSD signals. The blue solid line indicates the precipitation, the part of which between the two black vertical dashed line refers to an MSD signal. Four categorized periods, termed as P1–4 separately, are distinguished by different colors

Based on this definition, the temporal connection between MSD periods and MJO phases were analyzed by directly calculating the percentage fraction of each MSD period that occurs during each MJO phase (Fig. 9)—i.e., totals at each individual grid point across all MJO phases should sum to unity (1). While P2 and P3 do not exhibit clear signatures of specific correspondences to MJO phase, P1 and P4 however show strong correspondences with MJO phase 8–1 in areas exhibiting robust bimodal precipitation signatures, such as the Pacific coast of southern Mexico and Central America and Cuba. This is also notable in phases 2–3 but not quite as strong. This signature indicates that detected MSD signals tend to onset/end in MJO phase 8–1 over the domain.

Fig. 9
figure 9

Temporal cooccurrences between MSD periods and MJO phases. Each row indicates a particular MSD period (P1–4), while each column indicates a particular segment of MJO phases (8–1, 2–3, 4–5, 6–7). Colours here indicate the proportion of MSD periods in corresponding MJO phases

The MJO phases also modulate the near surface wind–pressure patterns. Figure 10 shows the wind–pressure composites corresponding to MJO phases during the period when MSD signals are detected. In MJO phases 8–1, westerly anomalies with approximate geostrophic balance approach the Pacific Coast of Central America and subsequently reach the Caribbean region to become southwesterly and form a low–pressure system centered in the Gulf of Mexico. The westerlies from the Pacific weaken in MJO phases 2–3, shown by their general geostrophic balance and weaker meridional pressure gradient. This result indicates the strength of the westerlies from the Pacific associated with the MJO phases is positively correlated with the onset/end of the MSD events along the Pacific coast of Central America and Mexico. Specifically, the westerlies and their associated convergence can induce strong convection and enhanced precipitation, corresponding to the precipitation maxima at the dates of MSD onset and end. This is consistent with previous MSD-MJO related-research for Costa Rica (Zhao et al. 2019), but here extending over the larger domain that includes most parts of the Pacific coast of Central America and southern Mexico. In MJO phases 4–5 and 6–7, easterly anomalies from the Caribbean Sea contribute to a divergence system centered over the Gulf of Mexico, which suppresses the precipitation and subsequently induces the drying through the MSD.

Fig. 10
figure 10

Composites of 10 m horizontal winds and surface pressure in varying MJO phases. Dots here indicate statistical significance in 95% confidence intervals

3.3 Statistical modelling of the MSD

In this section, we examine the statistical modelling potential of the bimodal precipitation signature over Central America and Mexico and infer possible physical mechanisms underpinning this. In MSD areas, the rainy season is characterized by two peaks and a trough in the summertime precipitation time series. Although only the trough of precipitation is referred to as the MSD signal in our definition, the increase/decrease of precipitation before/after the MSD also plays an important role in characterizing the variability of annual precipitation time series, since it is very different from other dry seasons. Further, we statistically model not only the MSD signals (from onset to end), but also the whole precipitation time series in the rainy season, which accounts for most of the annual variabilities.

We first confirm the definition of what is meant by the rainy season. Over Central America and Mexico, the rainy season is typically termed the period from May to October (Hastenrath 1967; Alfaro and Cid 1999; Magaña et al. 1999), with fluctuations of one or two months in some particular years (Alfaro et al. 1998; Alfaro and Enfield 1999; Maldonado et al. 2017). However, this definition is based on the climatological unimodal precipitation maximum in most areas of the northern hemisphere, which may not be adaptable to bimodal precipitation that characterizes MSD regions. Therefore, we specifically define here the rainy season for MSD areas in this study.

After calculating the seasonally varying climatology of precipitation at each grid point across the region characterized by MSD occurrences, and smoothing it using a Gaussian filter, an empirical orthogonal function (EOF) analysis (Lorenz 1956) is applied to the resultant spatiotemporal precipitation data. The first EOF (EOF1; Fig. 11), which explains 77.71% of the total annual climatological cycle precipitation variance, shows higher precipitation along the Pacific coast and Yucatán Peninsula, and relatively low precipitation in southern Mexico and Cuba. The pattern of EOF1 is similar to the climatological precipitation in traditionally defined rainy seasons (May–October) over Central America and Mexico (e.g., Zhao et al. 2020), implying that EOF1 is a useful measure of the characteristic climatological precipitation variability across the region. Characterized by an obvious bimodal shape, the first principal component (PC1) demonstrates the annual bimodal precipitation over Central America and Mexico, which is scaled regionally across the spatial domain by the EOF1 loadings. the rainy season in this study is hence determined as the period when PC1 > 0 (May 17th to October 27th). This determined rainy season is used throughout this section.

Fig. 11
figure 11

Resultant EOF1 spatial patterns (upper panel) of precipitation and associated PC1 time series (lower panel)

Smoothed using a 31-day moving-average window, the seasonal varying climatology of precipitation during the rainy season at each grid point across the domain is modelled using a fourth-order polynomial:

$$P\sim {b}_{0}+{b}_{1}\times t+{b}_{2}\times {t}^{2}+{b}_{3}\times {t}^{3}+{b}_{4}\times {t}^{4}$$

where P indicates the precipitation time series at each grid point during the rainy season, b0 – b4 correspond to the fitted coefficients for each polynomial term, and t indicates the corresponding time (day). The application of the fourth-order polynomial here is due to the fact that it gives the best modelling outputs, while polynomials of smaller order (e.g., third order) generate lower R2 and higher order polynomials (fifth-order or above) induce overfitting problems shown by a generally insignificant coefficient on the largest order term (not shown here). In this study, we focus on analyzing the coefficient of fourth-order polynomial term, b4, which is the key factor to determine the polynomial shape of the fitted model. The resultant R2 and coefficient b4 are shown in Fig. 12a – b, with determined MSD areas indicated by the stippling. In most areas of the domain, the polynomial generates satisfying modelling outputs for the rainy season precipitation, shown by generally high R2 (~ 0.8) across much of the domain. It is notable that R2 is still reasonable (R2 > 0.5) even in those regions where the performance is the weakest, such as the coasts around the Gulf of Mexico and Panama. Overall, we find that the performance of b4 varies with the strength of the bimodal precipitation. In areas exhibiting MSD signals, b4 is generally negative and statistically significant (at the 95% confidence level), whereas it tends to be insignificant or significantly positive in non–MSD characterized areas. The tendency is clearer after the climatological precipitation is spatially averaged based on the performance of b4 (Fig. 13). The bimodal annual cycle of precipitation is evident in areas characterized by significantly negative b4, while precipitation in other regions tends to be dominated by a unimodal precipitation annual cycle.

Fig. 12
figure 12

a – d Performance of 4th order polynomial models on the climatological (a, b) and daily (c, d) precipitation during rainy seasons. a The adjusted R2 and c yearly averaged adjusted R2 resulted from fitted model in each grid. b Coefficients and d yearly—averaged coefficients of the 4th order term in each grid. Regions exhibiting MSD signals are shaded by black dots. eh Performance of 4th order polynomial models with MJO covariates on the annual precipitation during rainy seasons. e Yearly averaged adjusted R2 resulted from fitted models in each grid. f Yearly averaged coefficients of the 4th order term in each grid. Yearly averaged coefficients of g RMM1 and h RMM2 in each grid

Fig. 13
figure 13

Spatially averaged annual climatology of precipitation with respect to different regions, including a domain dominated by negatively significant b4 according to Fig. 11, b the grid point dominated by the most negative b4, c domain dominated by positively significant b4 according, d the grid point dominated by the most positive b4, and e domain dominated by insignificant b4. A 31-days moving average is applied to smooth each generated climatology

The performance of the aforementioned method proves to be unsatisfactory when applied to the daily precipitation time series in the rainy season over the 1979–2017 period. The polynomial is applied to rainy season precipitation for every single year in each grid and the resultant b4 and R2 are temporally averaged (Fig. 12c, d). It is notable that, while the temporally averaged b4 generally follows the pattern shown in Fig. 12a, b, the spatial R2 drops significantly (< 0.5 in a large part of the domain). It indicates that the fourth-order polynomial model fails to reveal some interannual variations of the intraseasonal variability, which may be filtered during the calculation of seasonally varying climatology.

To reveal these unresolved intraseasonal variabilities, the polynomial model is modified by adding the MJO indexes:

$$P\sim {b}_{0}+{b}_{1}\times t+{b}_{2}\times {t}^{2}+{b}_{3}\times {t}^{3}+{b}_{4}\times {t}^{4}+{a}_{1}\times RMM1+{a}_{2}\times RMM2$$

The updated model, with MJO covariates, is applied to the precipitation time series during the rainy season for every year and grid point over the domain. After temporal averaging, the resultant R2, b4 and regression coefficients on the MJO covariates (a1 and a2) are shown in Fig. 12e–h. Although the R2 magnitudes (Fig. 12e) are not as large as the climatological values (cf. Fig. 12a), it clearly increases after adding the MJO indexes, implying that the inclusion of the MJO makes an important contribution to addressing the previously unresolved variability across the region. The temporally averaged b4 is similar to the previous one for the climatological (Fig. 12b) and annual—scale (Fig. 12d) analyses, indicating that the added covariates (MJO indexes) do not disturb the performance of the other polynomial terms. This implies that the added covariates are generally orthogonal to the polynomial terms. The widespread statistical significances of the a1 and a2 coefficients over the domain imply that the inclusion of the MJO indexes considerably enhance the model’s explanation of the overall variability, with relatively little effects of overfitting or overdispersion. It is notable that a1 and a2 are significantly negative in most characteristic MSD regions, while they tend to be positive or insignificant in characteristically non-MSD regions. Based on the RMM1–RMM2 phases (phases 1–8), the performance of a1 and a2 can be interpreted as the phase–shifting of the MJO. The increase of RMM1 and RMM2 together contributes to the phase shifting from MJO phase 2–5, corresponding to a period of precipitation reduction in the rainy season. Similarly, the decrease of RMM1 and RMM2 corresponds to the phase shifting from phase 6 to phase 1 and associated rainfall enhancement in the rainy season.

Based on the results shown above, the bimodal precipitation during the rainy seasons in the MSD characterized regions can be synoptically explained by a fourth-order bimodal signature and the modulation through two complete MJO cycles (Fig. 14). Over characteristic MSD regions, the rainy season starts with a rapid increase in precipitation during MJO phase 5/6–8/1. After reaching the first precipitation peak, the subsequent precipitation reduction (through the MSD period) typically exists from late June/early July through to late August/early September, characterized by a full MJO cycle. Following the end of the MSD period, precipitation tends to rapidly decrease again through the next month, transitioning into the dry season. This coincides with a shift of the MJO from phases 8–1 to 5–6.

Fig. 14
figure 14

Synoptic illustration of the MJO’s modulation on the MSD signals over Central America and Mexico. The increase (decrease) of precipitation during the change of MJO phases is indicated by orange (blue) arrow in left panel. The right panel is similar to Fig. 8, with orange (blue) arrows indicating the increase (decrease) of precipitation and corresponding MJO phase changes labelled in the bottom

4 Discussion and conclusions

4.1 Spatial and temporal variabilities

In this study, we detected MSD signals over Central America and Mexico using daily data over the period from 1979 to 2017. We found that large parts of southern Mexico and Central America display constant and frequent MSD signatures, characterized by significant spatial and temporal variability. The detected MSD signals tend to have longer durations and wide temporal ranges, characterized by earlier onset and later end date, towards the southeast. Several regions exhibit strong MSD signatures, including the Pacific Coast of Central America, Cuba and the Gulf of Mexico. The spatial patterns of detected MSD signals are generally similar to those shown by Zhao et al. (2020), using the same dataset from 1993 to 2017, except for some minor biases in the central region of the Yucatán Peninsula. The pattern similarity highlights the temporal length of the data set we used is sufficient to simulate the climatological signature of MSD mean states; it also supports the robustness of the method. The temporal variability of the MSD signals is characterized by complex, noisy and non-autocorrelated signatures. This implies that the frequency of the MSD signals over the domain does not have a significant trend over the period 1979–2017.

The large-scale composites of climate properties are used to dynamically illustrate the development of MSD signals, including onset, peak and end dates. The most important and interesting signature found is the change of the wind–pressure system relationship for different MSD periods. Specifically, we found that there are characteristically cyclonic anomalies for periods with higher precipitation (onset, end dates) and anticyclonic anomalies for periods with lower precipitation (peak date). During the onset/end of MSD signals, the near–surface temperature (2 m temperature and SST) anomalies over the domain is generally negative, while positive temperature anomalies exist during the peak date.

Magaña et al. (1999) pointed out the warming and cooling of the eastern Pacific Ocean could be a significant contributor to the bimodal signature of precipitation for Central America and Mexico. According to their theory, the warming eastern Pacific could induce active upward convection and corresponding intense precipitation (during the onset/end of the MSD), while oceanic cooling corresponds to the reduction of precipitation (during the peak of the MSD). However, this signature is not evident in near–surface temperature composites in the present study, which is characterized by anomalously low temperatures during the onset/end of MSD signals and anomalously warm temperatures during the peak of MSD. This bias could be due to the examination of anomalies instead of raw SST to calculate the composites in the present study. Magaña et al. (1999) used absolute SST to characterize the change of heat content during the development of the MSD, which highlights the contribution of SST warming and cooling, while the examination of anomalies in this study focuses on the influence from near–surface wind circulations. Mechanisms proposed in previous studies (e.g., Magaña et al. 1999; Karnauskas et al. 2013; Perdigón-Morales et al. 2021) indicate that the generation of the MSD is not induced by a single factor, but rather a combination of multiple physical factors including sea–air–coast interactions and large-scale climate modes. The application of realistic and idealized climate models, which enable the removal of some physical properties, may more clearly reveal the mechanisms behind the MSD.

4.2 Connections with climate modes and associated predictability

The relationship found between MSD metrics and the ONI suggests that longer MSDs tend to be stronger and may be enhanced by El Niño and suppressed by La Niña. The positive ENSO phase (El Niño) can suppress precipitation during boreal summer and strengthen the North Atlantic Subtropical High (NASH) and its associated easterlies from the Caribbean region, which can intensify the MSD and potentially extend the MSD duration. The easterlies, which are significantly contributed to by the peak of the Caribbean Low-Level Jet (CLLJ), can potentially modulate the MSD through two different processes: the strengthened easterlies suppress the convection, especially during the peak to the end dates of the MSD, inducing a drier and subsequently more intense MSD period; on the other hand, the interaction between the coastal wind and steep topography can also contribute to the change of precipitation, since positive ENSO phases can bring easterly anomalies, which are offshore winds on the Pacific side of Central America and Mexico. These offshore winds can suppress the orographic uplifting, and subsequently reduce the rainfall in these regions.

ENSO’s influence on MSDs is controversial. Many studies argue that El Niño years (positive ENSO phases) strengthen the MSD (Chen and Taylor 2002; Curtis 2002; Díaz‐Esteban and Raga 2018) while others suggest that MSD signals can be weaker during El Niño years (Magaña et al. 1999, 2003; Peralta-Hernández et al. 2008). Our results indicate that ENSO’s influence on the MSD is through modulation of the NASH and associated near–surface wind fields. Furthermore, this corresponds to observations of an intensified NASH (Giannini et al. 2000) and CLLJ (Wang 2007; Krishnamurthy et al. 2015), as well as their teleconnection during boreal summer, and highlights the role of the low-level circulation system in the generation of the MSD.

Based on our categorization of the MSD into four periods and their relationship with MJO phases, we have shown that a relatively high proportion of MSD periods tend to start/end in MJO phase 8–1 due to the anomalous onshore winds arising at the coasts of Central America. This result is consistent with a previous study of MSDs over Costa Rica (Poleo et al. 2014; Zhao et al. 2019) and Mexico (Perdigón-Morales et al. 2019). Additionally, the teleconnection between wind–pressure patterns in MSD P1 and P4 and MJO phases 8–1 have also been identified. The varying wind patterns associated with MJO phases imply that the MJO affects precipitation throughout the entirety of MSD signatures, rather than being limited in starting/ending periods.

An important MSD signature investigated in this study is through the fourth-order polynomial regression modelling of the intraseasonal variability, including the phase shifting by the MJO. Overall, we found that the mean states of the rainy season precipitation over the domain were captured well by the model. Specifically, we found that the polynomial coefficient b4 can be a practical index to quantify the strength of climatological MSDs over the domain. We found that a significantly negative b4 dominates MSD areas, while non-MSD areas generally exhibit significantly positive or insignificant b4. The model was found to perform much worse using annual precipitation data, while the rainy season could be optimized by adding the MJO indexes as covariates in the model, implying that the unresolved intraseasonal variability in each year could be largely covered by changes in MJO phases.

The areas exhibiting climatological MSD signals exhibit generally negative coefficients of MJO-associated covariates. We interpret this as an MJO phase-shifting effect here that significantly influences the characteristic intraseasonal variabilities of precipitation during the rainy season across the MSD area. Changes in MJO phases from phase 5/6 to 8/1 lead to increased precipitation, contributing to the first precipitation peak, corresponding to the onset of the MSD. A complete MJO cycle induces a precipitation reduction, corresponding to the trough of the MSD. Finally, the MJO transitions from phase 8/1 through to 5/6, contributing to a rapid decrease in precipitation, implying the end of the rainy season. The intraseasonal variability of precipitation through the rainy season in the MSD areas is clearly and significantly modulated by the MJO.

In the conceptual model proposed in this study, we used two complete MJO cycles to explain the interannual variability of MSD signals during the rainy seasons. The duration of two complete MJO cycles (~ 100 days; Zhang 2005) used in the hypothesis, however, does not critically match the period of rainy season used in this study, which is from May 17th to October 27th (164 days). This bias can be caused by several factors. Firstly, the MJO phases in Julian days of year vary annually, implying the MJO’s modulation on intraseasonal precipitation varies interannually. Secondly, while the addition of MJO indexes in the regression model provides considerable improvement, ~ 20% of the total variance remains unresolved, which may be due to other large-scale climate modes such as ENSO and/or the Interdecadal Pacific Oscillation, and regional forcing such as diabatic heating (Vincent and Lane 2018).

The implication of the MJO on regional circulation, shown by the changing wind–pressure system associated with changes in MJO phases, could largely contribute to the generation and maintenance of the bimodal precipitation signature (including the so-called ‘mid-summer drought’) through the rainy season. We have found that a fourth-order polynomial regression model captures a large proportion of the precipitation variability, indicating the potential predictability of the bimodal precipitation signature. Several MJO modelling approaches, including a damped harmonic oscillator model (Oliver and Thompson 2016), linear inverse models (Cavanaugh et al. 2015), ensemble general circulation models (Seo et al. 2009; Vitart 2014) and atmosphere–ocean coupled models, have been shown to capture MJO predictability and offer the potential to predict MJO dynamically, mathematically or statistically with lead times < 4 weeks (Waliser et al. 2006). Considering the wide use and constant development of MJO indexes (e.g., Wheeler and Hendon 2004; Liu et al. 2016; Mansouri et al. 2021), skillful prediction of MJO events and their transition phases are likely to play a key role in potentially improving the predictability of MSD events.

The limitation of this study are as follows. First, the MSD detection algorithm used in this study only ensure the existence of two precipitation peaks during boreal summer, whereas some trimodal distribution of precipitation may slip into collected events. Secondly, the modulations of ENSO and MJO on the MSD in this study are presented as composites, indicating their temporal variability induced by diversities of these two modes can be partially ignored. Former studies have suggested that diversities of ENSO (Capotondi et al. 2015; Infanti and Kirtman 2016) and MJO (Wang et al. 2019; Xiang et al. 2022) can demonstrate varying influences on climate properties. Therefore, it can be expected that some non-canonical ENSO/MJO events (e.g., the “jumping” MJO events in Wang et al. 2019) may have distinct implications on the MSD. Additionally, the study presented here are primarily constructed using statistical methods, the research described here was also mostly built using statistical techniques, with limited use of dynamical diagnostics of moisture sources contributing to the midsummer drought. Further works should be constructed based on these three limitations.