1 Introduction

The low frequency climate variability observed in various parts of the globe is the result of the interaction of various atmospheric systems (Barnett 1991). Intraseasonal fluctuations are one of the most important ones, because they influence the convective activity, precipitation patterns, the occurrence of hurricanes in the Gulf of Mexico (Maloney and Hartmann 2000), South American monsoon systems (Jones and Carvalho 2002), atmospheric blocking in maritime continent (Inness and Slingo 2006), among others. Moreover, the low frequency variability, usually defined as covering periods between 20–100 days, exhibit considerable complexity both in time and in space, and one has to resort to the use of complex techniques to detect these events in real-time (Alvarez et al. 2014; Alves et al. 2012; Vitorino et al. 2006). For the South American region, the Madden–Julian Oscillation (MJO) is an intraseasonal low frequency system of major relevance, due to its relationship to weather extremes and intraseasonal climate variability (Jones et al. 2004), and yet detailed knowledge of its influence in South American climate is still quite incomplete (Alvarez et al. 2015).

Thus, a more detailed description of the MJO influence on South American rainfall, temperature, and circulation of the atmosphere and their seasonal variations, would provide a fundamental basis for the monitoring of MJO impacts in this region (Alvarez et al. 2015). A number of regionalized studies on tropical South America were conducted, in which they: (a) identified the intraseasonal signal, especially in the rainy season (January to May) (Liebmann et al. 2004; Jones et al. 2004; De Souza and Ambrizzi 2006; Muza et al. 2009; Alves et al. 2012), b) found dynamic links between the South Atlantic Convergence Zone (SACZ) and the MJO (Grimm and Silva Dias 1995; Paegle et al. 2000; Todd et al. 2003; Grimm and Zilli 2009; Kodama et al. 2012; Gonzalez and Vera 2013), and c) found evidence of an increased frequency of precipitation extremes related to active MJO events (Jones et al. 2004; Carvalho et al. 2005). In spite of the large amount of studies on South American intraseasonal variability, an efficient intraseasonal prognosis is still a challenge due to the spatio-temporal characteristic of atmospheric interactions that modulate the variation in precipitation at the intraseasonal scale, especially related to extremes in the tropical region.

In short, the MJO is characterized by a global zonal wave with wave numbers between 1 and 3 and eastward propagation having periods between 30 and 90 days. This oscillation has a strong seasonal signal with greater intensity during the austral summer and fall seasons, which favours the development of the rainy season in tropical Brazil (Madden and Julian 1971, 1972; Weickmann 1983; Knutson and Weickmann 1987; Hendon and Salby 1994; Wu et al. 1999; Madden and Julian 1994; Zhang and Dong 2004; Maloney and Hartmann 2001). The modulations in the aforementioned rainfall season occur through the establishment of a deep quasi-stationary convective band, which is triggered by the simultaneous expression of the South Atlantic Convergence Zone (SACZ) and the Intertropical Convergence Zone (ITCZ). Such regional mechanisms are dynamically linked with the northward propagation of precipitation anomalies during the MJO displacement over South America and the tropical Atlantic Ocean (De Souza and Ambrizzi 2006; Muza et al. 2009; Valadão et al. 2015).

Moreover, there is an interaction between the intraseasonal signals (mainly the MJO signal), and other oceanic and atmospheric events that happen at different scales in the tropics. For example, at the interannual timescale, the intraseasonal signal may interact with the El Niño-Southern Oscillation and the Atlantic Dipole (Kayano and Kousky 2006; Robledo et al. 2013; Carvalho et al. 2004). At the interdecadal scale, the Pacific Decadal Oscillation and the Atlantic Multidecadal Oscillation may interact with the intraseasonal signal (Garreaud and Aceituno 2001; Wu et al. 1999). These interactions may modulate the precipitation patterns in distinct sectors of the tropical belt differently. So, the modulation of the intraseasonal signal with other scales illustrates the importance of studying the intraseasonal variability regionally. Thus, the main objective of this research work is to create an index that can capture the intraseasonal signal in the tropical region of Brazil.

Recently, several studies have been conducted to diagnose the intraseasonal variability. One of the first studies addressing the development of a multivariate index to characterize this variability, focusing mainly on the MJO, was carried out by Mathews (2000), who used the Empirical Orthogonal Function (EOF) applied to the gridded outgoing longwave radiation (OLR) data centered at the equator. The principal components (PC) resulting from the EOF analysis were considered representative in space and time; and through a vectorial projection, it was then possible to analyse the propagation of MJO-related convection in phase and space. Then, Wheeler and Hendon (2004) used the concept of vectorization, applying the EOF to a dataset composed of the equatorial average (15°S–15°N) of: OLR, and the zonal component of the horizontal wind at 850 (U850) and at 200 hPa (U200). They found that the PC index based on unfiltered data, which they called the RMM index (i.e.: Real-Time Multivariate MJO index), was able to capture fluctuations related to the MJO and the seasonal variability. Later, Ventrice et al. (2013) modified the index proposed by Wheeler and Hendon (2004), where they substituted OLR with potential velocity at 200 hPa. Overall, these (and other) studies evaluated the intraseasonal variability by looking at the global (Jones and Carvalho 2009; Maharaj and Wheeler 2005; Mathews 2000; Ventrice et al. 2013) or continental scale (Silva and Carvalho 2007), without considering specific regions, such as over tropical Brazil. Although EOF-based approaches capture the intraseasonal variability at a global scale, they may not capture extreme events at a regional scale. Also, these studies used indices that either considered the MJO or other processes separately, but that did not consider them together.

Hence, the use of a multivariate index has proven to be an efficient way to characterize the intraseasonal climate variability. Considering the potential value these multivariate indices have to capture and reproduce the intraseasonal variability dynamics, this study proposes to replace the multivariate EOF-type approach used in previous studies by the Maximum Covariance Analysis (MCA, hereafter) to characterize the intraseasonal signal. The MCA-based approach makes use of dynamic variables, (OLR, u component of the wind at 850 hPa and 200 hPa levels) with rainfall in tropical Brazil. Note that a combined EOF approach could have been used here to isolate the regional intraseasonal variability over Brazil. However, (Navarra and Simoncini 2010, p. 94) highlights the following: ”the main weakness of the combined EOF: by mixing the autocovariance of each field and the cross-covariance of one field with the other, combined EOF cannot separate the patterns for the different kind of variability and one cannot tell the respective amount due to the autocovariance or to the cross-covariance. The Combined EOF mode will bear the imprint of both sectors of variability of a particular variable.” In the present work, the aim is to capture the local changes in precipitation in TBr as a response to the dynamic variables (OLR, U850, and U200); so, one of these sets of variables could have an internal signal (autocovariance) that is more intense compared with the external signal (cross-covariance), such that the combined EOF would not distinguish between these signals and it would represent the stronger one.

Thus, we have opted for an alternative approach, which uses the Maximum Covariance Analysis to study the relationship between the local precipitation in TBr and the dynamic variables. In doing so, this research work will answer the following scientific questions with respect to the Multivariate Intraseasonal Index for Tropical Brazil (MITB) index proposed here:

  1. (a)

    Does the combination of dynamic variables with precipitation improve the representation of the intraseasonal signal?

  2. (b)

    Can the MITB index reasonably capture and characterize climate extremes, such as drought and heavy precipitation?

The structure of this paper is as follows. Section 2 describes the data and methods used to construct the MITB index. Then Sect. 3 is divided into three parts to: (a) study the basic characteristics of the MITB index; (b) conduct a sensitivity analysis of the MITB index to address the first scientific question; and (c) evaluate climate extremes to address the second scientific question. The summary and main conclusions are presented in Sect. 4.

2 Datasets and statistical model

2.1 Datasets

The intraseasonal variability patterns were identified using rainfall data based on daily totals from the Climate Prediction Center unified gauge (CPC) for the period from January 1979 to December 2013, obtained from their website at http://www.cpc.ncep.noaa.gov/. The CPC uses an optimized interpolation technique, which projects station-based precipitation data onto a horizontal grid at 0.5\(^\circ \times\) 0.5\(^\circ\) horizontal resolution (Chen et al. 2008). The evaluation of convective activity was performed using daily OLR data based on the Advanced Very High Resolution Radiometer (AVHRR) radiation detection imager, using a horizontal resolution of \(2.5^\circ \times 2.5^\circ\) , available from the National Oceanic and Atmospheric Administration (NOAA) (Liebmann and Smith 1996). Also, the 850 and 200 hPa u and v wind data were obtained from the National Center for Environmental Prediction (NCEP)/National Center for Atmospheric Research (NCAR) Reanalysis, with spatial resolution of \(2.5^\circ \times 2.5^\circ\) (Kalnay et al. 1996). These are available on their website at http://www.esrl.noaa.gov/psd/data/.

2.2 Defining the multivariate index

The intraseasonal signal was isolated in the raw data by applying a bandpass Lanczos filter in order to retain the low-frequency variations (Duchon 1979). The Lanczos filter uses a Fourier transform, so that the amplitudes of a time series are changed by weight functions, creating time series filtered to the desired frequency (De Souza and Ambrizzi 2006).

Here, the intraseasonal signal is defined using the Lanczos filter with 201 weights and cut-off frequencies at 20 and 100 days in the period from January 1979 to December 2013. Then, the resulting time series covered the period from April 1979 to September 2013.

2.2.1 Maximum covariance analysis

The MCA approach has been chosen because it is able to capture patterns of maximum covariance between two datasets; it has been found to reasonably capture atmospheric and oceanic processes (Wilks 2015). It is a robust method to investigate dominant modes of interaction, because it favors a better understanding of the relationship between groups of variables (Frankignoul et al. 2011). It consists of constructing a covariance matrix between two datasets and then performing the Singular Value Decomposition (SVD) of the resulting matrix (Levine et al. 2013). To define SVD, let \(\mathbf {X}\) be a \(p \times n\) matrix, where p represents a group of standardized variables and n time steps; \(\mathbf {Y}\) is a \(q \times n\) matrix, where q represents another set of standardized variables. n is the same for \(\mathbf {X}\) and \(\mathbf {Y}\), whereas p and q may have different sizes. Thus, the SVD covariance matrix (\(\mathbf {C}\)) between \(\mathbf {X}\) and \(\mathbf {Y}\) is represented by:

$$\mathbf {C}=\frac{1}{n}\mathbf {XY}^{T} = \mathbf {U}\mathbf {\Lambda }_c\mathbf {V}^{T}$$
(1)

where T represents the transpose of a matrix, \(\mathbf {\Lambda }\) is an \(r \times r\) diagonal matrix, where its non-negative singular values \(\lambda (i=1,\ldots ,r)\) are arranged in descending order, with \(r \le min(p,q,n-1)\). SVD outputs the \(p \times r\) matrix \(\mathbf {U}\) and the \(q \times r\) matrix \(\mathbf {V}\), which represent the singular vectors of the spatial pattern. The first pair of singular vectors, i.e. (\(\mathbf {U}_1,\mathbf {V}_1\)), describes the largest fraction of the quadratic covariance, while successive pairs describe a maximum fraction of quadratic covariance not explained by the previous pairs.

Next, let the \(n \times r\) matrices \(\mathbf {A}\) and \(\mathbf {B}\) represent the temporal expansion coefficients, which satisfy the following equations:

$$\mathbf {X}=\mathbf {U}\mathbf {A}^T \quad \mathbf {Y}=\mathbf {V}\mathbf {B}^T$$
(2)

Then from Eq. 2, one may obtain the following relationships:

$$\mathbf {A}=\mathbf {X}^T\mathbf {U} \quad \mathbf {B}=\mathbf {Y}^T\mathbf {V}$$
(3)

The columns of the matrices \(\mathbf {A}\) and \(\mathbf {B}\) are time series, which characterize each variability mode. Note that since \(\mathbf {X}\) and \(\mathbf {Y}\) are here defined as two standardized fields, the MCA response is not very different from the Canonical Correlation Analysis (CCA) approach. The only difference is that the MCA singular vectors are orthonormal to each other and the expansion coefficients are uncorrelated (Mo 2003).

Here, the MCA results are assessed through the square fraction of covariance (SFC), defined as:

$${ SCF}_k = \frac{\lambda ^2_k}{\sum \lambda ^2_i}$$
(4)

where k represents the MCA mode (i.e. \(k=1,\ldots ,r\)).

In this study, the matrices \(\mathbf {X}\) and \(\mathbf {Y}\) respectively refer to the following set of variables: a) the spatial precipitation over tropical Brazil (‘TBr’, hereafter), such that \(\mathbf {X}=p_{i,j}\) where \(\lbrace -85< i<= -34; -19< j <= 4 | i,j \in Z \rbrace\). i and j represent the longitude and latitude, respectively; and b) the meridional average from 15\(^\circ\)S–15\(^\circ\)N of OLR, U850 and U200, so that \(\mathbf {Y}=({ OLR}_{i}\quad U850_{i} \quad U200_{i})^T\), where \(\lbrace -180<= i <= 180 | i \in Z\rbrace\). Also, \(\mathbf {u}_k\) is the coefficient related to the precipitation for mode k; \(\mathbf {v}_k\) is the coefficient related the equatorial for mode k; \(\mathbf {a}_k\) and \(\mathbf {b}_k\) are the temporal expansion coefficients related to precipitation and equatorial average, respectively.

Simply put, \(\mathbf {Y}\) is a single matrix composed of three variables: OLR, U850, and U200, such that there are 432 time series for \(\mathbf {Y}\) (i.e.: 144 times three variables). \(\mathbf {X}\) corresponds to a matrix composed of the precipitation variable only. So, the MCA will return the relationship between the dynamic component (\(\mathbf {Y}\)) and the precipitation variable (\(\mathbf {X}\)). These two sets of variables, i.e. the (composite) forcing (\(\mathbf {Y}\)) and precipitation (\(\mathbf {X}\)), were filtered to the intraseasonal scale between 20 and 100 days without lag.

2.2.2 Amplitude and phase of the MITB index

The MITB index was constructed based on adapting the following methods: (a) Mathews (2000), who worked using an EOF-based approach applied to OLR; (b) Wheeler and Hendon (2004), who used a multivariate EOF approach applied to OLR, U850, and U200; and (c) Lee et al. (2012), who applied the method in (b) to Asia. The MITB index uses an MCA-based approach, where the two first modes are kept (i.e. MCA1 and MCA2).

In this study, the following MCA expansion coefficients are used, \(Z(t)=[{ MITB}_{1},{ MITB}_{2}]\), where:

$$\mathbf {MITB}_{1}=[\mathbf {a}_1,\mathbf {b}_1] \quad \mathbf {MITB}_{2}=[\mathbf {a}_2,\mathbf {b}_2]$$
(5)

Also, the amplitude (\(\Vert \mathbf {Z}\Vert\)) and phase (\(\hat{\alpha }\)) may be obtained from the \(a_1\) and \(b_1\) coefficients through the following equations:

$$\Vert \mathbf {Z}\Vert (t)=\sqrt{\left( \frac{a_{1} + b_{1}}{2}\right) ^2 + \left( \frac{a_{2} + b_{2}}{2}\right) ^2}$$
(6)
$$\hat{\alpha }(t)=\arctan \left( \frac{a_2(t)+b_2(t)}{a_1(t)+b_1(t)}\right)$$
(7)

2.3 Odds ratio

The odds ratio (OR) was used to assess whether there is a connection between extreme values of the MITB index and extreme precipitation at each grid point and to measure the magnitude of this association. The results are displayed in a contingency table, as shown in Table 1. Extreme precipitation (\({ PRP}_e\), hereafter) is determined as counts of values below (above) the 25th (75th) percentile that match intense MITB events with an amplitude larger than 3.5. This threshold refers to the amplitude of the MITB index selected through a two-sided Student’s t-test. A value of 3.50, or above, represents an amplitude that is statistically significant (at the 5 % significance level).

Table 1 Contingency table used to evaluate the association between \(PRP_e\) and strong MITB events

Note that in Table 1, \(\pi\) is the probability of occurrence of \({ PRP}_e\), and \(\pi _{h}\) is the conditional probability of \({ PRP}_e\) given the occurrence of strong MITB (\(P[\pi | { MITB} \ge 3.5]=\pi _{h}\)). Also, \(\pi _{l}\) is the conditional probability of \({ PRP}_e\) given the occurrence of weak MITB (\(P[\pi | { MITB} < 3.5]=\pi _{l}\)). The ratio of the probability that the event occurs (i.e.: \(\pi _{h}\)) with respect to the probability that the event does not occur (i.e.: \(1-\pi _{h}\)) is called risk or odds (Stephenson 2000; Correia Filho et al. 2014), expressed by \(\omega\) in the equations below:

$$\omega _{h} =\frac{\pi _{h}}{1-\pi _{h}} \quad \omega _{l} =\frac{\pi _{l}}{1-\pi _{l}}$$
(8)

And the ratio between them is given by:

$$OR=\frac{\omega _h}{\omega _{l}}$$
(9)

OR is interpreted as the ratio between the probabilities of an extreme precipitation event due to an extreme MITB event associated with its phase. If \({ OR} < 1\), then there is a reduced probability of MITB-related extreme precipitation events. \({ OR} > 1\) may be described as a risk factor, i.e., there is a higher chance of having extreme precipitation events related to strong MIBT cases. \({ OR} = 1\) indicates that the chance of extreme rainfall events is independent of the MITB signal. A Chi-squared test was used to verify the statistically significance of the odds ratio values. Note that OR was calculated for each grid point.

The OR approach was also used to verify the relationship between the MITB phases, the MITB intensity, and the lagged positive rainfall extremes (\({ PRP}^{+}_{e}\), hereafter). Table 2 shows the contingency table plan used for the OR analysis.

Table 2 Contingency table used to evaluate the association between \(PRP^{+}_e\) and strong MITB events

3 Results and discussion

Figure 1 shows the seasonal variance of the OLR pattern. The global intraseasonal pattern from December to May is represented in Fig. 1a, which shows enhanced intraseasonal variance over the Indian Ocean, Oceania, and the West Pacific during the summer and austral fall (LinHo and Wang 2002; Lee et al. 2012). Figure 1c shows the intraseasonal OLR pattern from June to November, showing a shift in the enhanced variance zones to a range between 10\(^\circ\)N–20\(^\circ\)N and is more concentrated in the north sector of the Indian Ocean and off the China coast. Also, there is a large contrast between the variance patterns in Latin America. Note that the variance is larger off the coast of Brazil (Fig. 1a) and then shifts towards Central America (Fig. 1d). These shifts may account for the changes in precipitation in TBr (Fig. 1d), and because of this, there is a major reduction in the precipitation pattern, which may impact the hydrological cycle in the Amazon and northeastern regions of Brazil (Marengo and Espinoza 2015; Marengo et al. 2013).

The results shown in Fig. 1 are also in agreement with previous studies (Souza and Cavalcanti 2009; Alvarez et al. 2014; Tomaziello et al. 2015), which show that the MJO is the main intraseasonal variability pattern at play in South America, during the austral summer and fall.

Fig. 1
figure 1

Seasonal variance of OLR and precipitation filtered to the range of 20–100 days. a OLR for the period from December to May; b Precipitation from December to May; c OLR from June to November; d precipitation from June to November

3.1 The main MITB features

Considering the strong relationship between the OLR pattern and precipitation in South America, these two variables are used with the zonal component of the wind at 200 and 850 hPa, i.e.: due to the baroclinic eastward propagation of the OLR pattern (Kessler et al. 1995; Gonzalez and Vera 2014; Kiladis et al. 2014), to construct the MITB index. The spatial structure of the MCA is shown in Fig. 2. The MCA modes 1 and 2 explain 85 % of the variance. This value is quite high considering the level of variability of the input data, the seasonality, and the daily time scale. Note that the second mode of variability found here (MCA2) cannot be directly compared with other studies, since most studies do not explore the second mode of variability of multivariate methods, such as the EOF approach (De Souza and Ambrizzi 2006; Muza et al. 2009; Silva and Carvalho 2007).

The spatial rainfall pattern related to the first MCA mode (MCA1), is consistent with the spatial configuration characterized by the EOF-based approach in De Souza and Ambrizzi (2006) and Jones and Carvalho (2013). Regardless of the precipitation anomaly (whether it is positive or negative), MCA1 reasonably captures the rainfall pattern of the largest part of the northeastern region in Brazil (NEB, hereafter), as well as the east and southeast sectors of the Amazon. There are also enhanced anomalies located on the northeast coast and the semi-arid regions in Brazil (Fig. 2a).

Furthermore, there is a marked dipole pattern between the northeastern (negative sign) and the western (positive sign) TBr sectors (Fig. 2b). The anomalous precipitation pattern over the northern and eastern part of NEB is heterogeneous, that is, the anomalies may not have the same sign throughout the region. This is different from the MCA1, where the pattern is homogeneous. This is also consistent with the composite analysis pattern of De Souza and Ambrizzi (2006), where the negative (positive) anomalies precede (succeed) the maximum intraseasonal extreme precipitation.

Fig. 2
figure 2

Spatial structure of the MCA1 and MCA2 modes. a and b represent the singular precipitation values, \(U_1\) and \(U_2\), respectively; c and d are the singular values, \(V_1\) and \(V_2\), related to the dynamic variables (i.e.: U850, U200, and ORL), respectively

When it comes to the dynamic components of MCA1 (Fig. 2c) and MCA2 (Fig. 2d), the OLR maximum is found between 80\(^\circ\)W and 20\(^\circ\)E, in the region that covers South America, the Atlantic Ocean and the west African coast. This maximum is linked with the strengthening of the easterly surface wind and of the high level westerlies. Note also an opposite OLR sign (of lower intensity) over the Indian Ocean, Oceania and east Pacific. So, the intensification (weakening) of the convection in South America and Africa is related to the weakening (intensification) over the Indian Ocean and Oceania (Fig. 2c).

Also, the negative OLR pattern shows negative and intense values in the western sector of South America, whereas the positive sign with lower intensity may be seen between 60\(^\circ\)W and 40\(^\circ\)W (Fig. 2d). These negative and positive patterns correspond to the influx of easterlies (westerlies) in lower (higher) levels over the Atlantic Ocean, Africa and eastern sector of the Indian Ocean. Note that Wheeler and Hendon (2004) and Ventrice et al. (2013) describe the overall propagation of the MJO signal through an EOF-based approach. Although their studies are not directly comparable here, a qualitative comparison of the peak of their convective core with this study could provide insights into the regional propagation of the intrasesonal signal. The localization of their convective core is between 60\(^\circ\)E and 120\(^\circ\)E (i.e. over the Indian Ocean, Oceania, and the eastern Pacific). Here, the MITB index shows a peak over South America because it uses precipitation data over that region. So, the MITB pattern reflects the propagation of the signal observed in Wheeler and Hendon (2004) and Ventrice et al. (2013) as perceived over South America. Hence, this highlights that MITB is able to capture the location and shift of the intraseasonal oscillation that generates the TBr precipitation.

Fig. 3
figure 3

Spectral analysis of the MCA expansion coefficients. The diagram for a represents the temporal expansion coefficients A1 and A2 related to rainfall patterns; and b shows the temporal expansion coefficients B1 and B2 related to the dynamic variables (i.e.: U850, U200, and ORL)

The MITB spatial pattern has been presented so far; but one still needs to verify whether the MCA1 and MCA2 time series are able to represent the intraseasonal scale appropriately. Figure 3a shows the variance of the power spectrum of the A1 and A2 MCA coefficients (related to the precipitation variable). The variance of the power spectrum is concentrated in the intraseasonal period of 20–100 days. Also, the A1 and A2 coefficients are more sensitive to the intraseasonal signal compared with the B coefficients. So, this indicates that the filtering process was arguably reasonable and that the MITB response is dominated by the intraseasonal signal of precipitation, especially taking into account standardized input variables (i.e. precipitation, OLR, U850, and U200) and that the temporal coefficients were standardized by the square root of their respective singular values (\(\lambda _1\) and \(\lambda _2\)). Furthermore, the B1 and B2 coefficients, related to the dynamic variables, have a lower magnitude compared with A1 and A2. However, the frequency captured by the A and B coefficients is still the same, i.e.: between 20 and 90 days (Fig. 3b).

Fig. 4
figure 4

a Spectral coherence analysis between the \({ MITB}_1\) and MIBT2. The panel in (a) represents the phase; b shows the squared coherence; and c displays the cross-correlation between \({ MITB}_1\) and \({ MITB}_2\)

Next, to understand the relationship between the intraseasonal and interannual frequencies and to identify the phase of this relationship, the cross spectrum of the \({ MITB}_1\) and \({ MITB}_2\) indices is evaluated in Fig. 4. It reveals peaks between 21 and 120 days (Fig. 4a), with mean squared coherence values of 0.44 (Fig. 4a lower panel). In Fig. 4a (upper panel), the phase analysis shows that \({ MITB}_1\) and \({ MITB}_2\) have a phase difference of 1/4 of the cycle. Figure 4b shows the cross correlation between \({ MITB}_1\) and \({ MITB}_2\). The phase relationship is negative, such that \({ MITB}_1\) is 10 days ahead of \({ MITB}_2\). The phase difference patterns are part of a propagating mode, and depending on the signal and the region of interest, MITB could potentially have a diagnostic use.

3.2 Sensitivity analysis of the MCA approach

Here, the sensitivity of the MCA to the following variables (and their combinations) is shown: (a) OLR; (b) OLR and U850; and (c) OLR, U850, and U200. Also, the sensitivity to the latitudinal bands is performed, to see whether a different choice of band would improve capturing the MJO signal. So, the bands 5\(^\circ\)S–5\(^\circ\)N and 25\(^\circ\)S–25\(^\circ\)N are tested, from which the meridional averages are calculated.

The meridional average results for the aforementioned variables and latitudinal bands are shown in the first five rows of Table 3. There, the proposed method of calculating the meridional averages of the latitudinal bands from 15\(^\circ\)S to 15\(^\circ\)N applied to the combined variables (i.e.: OLR, U850, and U200) gives the most appropriate result (shown in bold in Table 3).

Finally, the MCA method is applied to the spatial fields, as opposed to calculating meridional averages. The following bands of grid points are tested: (a) 5\(^\circ\)S–5\(^\circ\)N; and (b) 15\(^\circ\)S–15\(^\circ\)N. Note that the region from 25\(^\circ\)S to 25\(^\circ\)N is not tested due to the high computational cost of calculating a 4320 \(\times\) 4434 maximum covariance matrix (i.e. 19,154,888 values for the covariance matrix), which would counteract the objective of creating a method that captures the intraseasonal variability, which is computationally fast and feasible to calculate using a regular desktop computer. So, the results are presented in the last two rows of Table 3, showing that the average meridional values give a higher squared covariance fraction compared to working with grid points.

Table 3 Sensitivity analysis of different MCA inputs

Hence, Table 3 confirms that the proposed combination of OLR, U850, and U200 for the meridional average from 15\(^\circ\)S to 15\(^\circ\)N has higher values of the squared covariance fraction. However, when it comes to the fraction of covariance and the average squared coherence (\({ Coh}^2\)), taking into account the spatial grid structure, higher values of covariance fraction (0.70 and 0.76) and Coh2 are seen (0.34 and 0.42) for the last two rows, respectively, as Table 3 shows. So, a choice had to be made for the selection of an appropriate model, and the principle of parsimony was used. It takes into account that: the meridional average is a simpler method and its computational cost is minimal. So, the chosen method is to work with the combination of the meridional average of OLR, U850, and U200 for its high squared covariance fraction and efficient computational cost. Also, this approach is in agreement with results found in Wheeler and Hendon (2004) and Ventrice et al. (2013).

In order to understand the structure and pattern of the variability captured by the MITB index, phase-space composites were constructed, similar to Wheeler and Hendon (2004) and Lee et al. (2012). Due to the enhanced out-of-phase characteristic between the \({ MITB}_1\) and \({ MITB}_2\), it is useful to diagnose the MITB state in a specific time through a phase-space diagram. So, Eqs. 6 and 7 are used, where the first determines the MITB amplitude and the second gives the angle between \({ MITB}_2\) and \({ MITB}_1\). For the phase-space diagram, events with an amplitude higher than 3.5 are selected, which is the miminum threshold so that the mean of the events is higher than the total mean (obtained through the use of a one-tailed Student’s t-test with a 5 % significance level).

Figure 5 shows the phase-space diagram curves related to composites of strong MITB events. The same approach given in Lee et al. (2012) is used, in which the diagram is divided into eight phases. Here, lags of 31 days were calculated for each phase. Also, note that the composite analysis is temporally dependent. One starts at lag −30, which represents the first dot of each of the lines in Fig. 5. The first dot at phase 1 (blue line), for instance, is at lag −30 days; it starts in phase 1 and ends in phase 5. The second dot is at lag −29, the third is at lag −28, and so on until the last dot (ending up in phase 5) at lag 0. So, the phases have a sequential structure, which shows that MITB is able to represent the spatio-temporal evolution (i.e. propagation) of the intraseasonal signal. The phases with higher event frequency are phase numbers 1 and 8, where the enhanced convective anomaly is over South America and Africa (see Fig. 6). Also, the phases that are opposed (in sign) to those in 1 and 8, are phase numbers 4 and 5, where the enhanced convective anomaly is found over the Indian Ocean and Oceania (see Fig. 6).

Fig. 5
figure 5

\({ MITB}_1\) and \({ MITB}_2\) phase-space compositions. Odd phases are represented in blue and even phases are shown in red. Each phase represents where the MITB amplitude exceeds 3.5. The number of events are shown in parentheses

The spatial pattern of atmospheric variability explained by the MIBT index may be explored through the use of composite analysis. Fig. 6 provides the compositions for each of the eight stages shown in Fig. 5. The seasonal composition from December to May, depicts the structure and evolution of the MJO. In phase 1 (Fig. 6a, b), the core of weak convective activity is present in the central Pacific Ocean, while strengthening of convection is evident in Africa and the western Indian Ocean, related to the low level wind fields in western anomalies over the eastern Pacific, north of Oceania, and the Indian Ocean. In South America, convective activity is seen over the northeastern and southeastern regions, which enhances the positive signal of precipitation anomalies (Fig. 6b).

During subsequent phases, the convective core grows and moves eastward across the Indian Ocean, passing through northern Australia and leaning slightly to the southeast reaching approximately latitude 15\(^\circ\)S over the western Pacific (Fig. 6 panel left). In South America, the center of convection weakened and shifted to the northeastern region (Fig. 6 panel right). From phases 3 through 5, there is strong convective inhibition (i.e.: anticyclonic circulation), providing negative anomalies of intense rainfall over the largest part of TBr. These three propagation phases are in agreement with the negative rainfall anomaly response found in Muza et al. (2006, 2009).

In phases 2 and 6, the spatial precipitation pattern is related to the MCA2 pattern (Fig. 2b). In phase 2, there are enhanced positive anomalies in northern NEB and eastern Amazon, and negative anomalies in the western end of the Amazon region. An enhanced convective band over tropical Atlantic and the Indian Ocean is linked with this phase (Fig. 6). In phase 6, however, there is an opposite spatial precipitation pattern: enhanced negative anomalies in northern NEB and eastern Amazon, and positive in the western end of the Amazon (Fig. 6). This phase is related to an inhibited band of convection over the tropical Atlantic and the Indian Ocean regions. According to Ventrice et al. (2013) and Barrett and Leslie (2009), this large-scale configuration corresponds to a strong MJO modulation of the frequency of hurricanes and cyclogenesis events over the tropical Atlantic Ocean.

Fig. 6
figure 6

Composite analysis. The panel on the left shows the life cycle of OLR anomalies (shaded) and winds at 850 hPa (vectors); and the panel on the right shows the life cycle of rainfall (shaded) for the months from December to May. Only statistically significant values at the 5 % significance level are shown

3.3 Odds ratio

Here, the relationship between the occurrence of \({ PRP}_{e}\) related to the MITB amplitude is discussed. Note that the threshold for extreme precipitation is represented as (\({ PRPe} < 25p\) or \({ PRPe} > 75p\)) [25p represents the 25th percentile and 75p, the 75th percentile]. These values are compared with the MITB amplitude with threshold at 3.5, according to Table 1. So, Fig. 7 shows the odds ratio per grid point, which characterises this relationship.

Fig. 7
figure 7

Point map of the odds ratio values based on the contingency table used to test the association between the amplitude of the MITB index (threshold = 3.5) and the occurrence of extreme precipitation (\({ PRPe} < 25p\) or \({ PRPe} > 75p\)) [25p is 25th percentile and 75p is the 75th]. Only statistically significant results are shown (at the 5 % significance level)

In Fig. 7, a statistically significant \({ OR}>1\) spatial pattern is observed in the continental region of South America, within the tropical belt that extends in the direction of the region of influence of the ITCZ (north of NEB) and the SACZ (southeastern region of Brazil). There are no significant values over the east coast of NEB. Also, this region of \(OR>1\) is influenced by the dynamic pattern of MITB, where the values from 1.5 to 3.5 correspond to relatively high odds ratio values. So, for example, the odds of having extreme precipitation events is 1.5 to 3.5 higher when the MITB index is strong (i.e. MITB with amplitude higher than 3.5) compared to a weak MITB index. Note that the latter 3.5 value refers to the amplitude of the MITB index, which was used for the strong/weak threshold. This means that for (roughly) every three extreme precipitation events, two are related to strong MITB events and one to a weak MITB event. In contrast, there are significant \({ OR}<1\) values between 0.50 and 0.75 in the northern sector of the Amazon, covering the states of Roraima and Guyana, which indicate a higher chance of \({ PRP_e}\) occurrence when MITB is weak, i.e. when the intraseasonal system performance is neutral or weak.

Fig. 8
figure 8

Odds ratio between the extremes of precipitation and MITB considering the phase, amplitude and delay, were plotted only statistically significant at the level of 5 %

Next, the association between MITB and extreme precipitation events with respect to phases and lags are considered through a test of association between strong MITB events (i.e. amplitude above 3.5 for each phase, 1–8) and positive extreme precipitation events (\({ PRP}^{+}_{e}\) for different lags, from 0 to 35 days, as shown in Fig. 8. For the spatial “Lag 0” pattern (Fig. 8, first column on the left), OR shows spatial similarity to each phase of the precipitation composites shown in Fig. 6. For regions with positive precipitation anomalies (Fig. 6), there are statistically significant \({ OR}>1\) values (see Fig. 8). On the other hand, for regions where there are negative precipitation anomalies, there are statistically significant \({ OR}<1\) values.

Also, phases 1, 2, 7, and 8 show mostly \({ OR}>1\) values ranging from 3 to 9, particularly over eastern Amazon and NEB. Besides that, \({ OR}<1\) values ranging from 0.15 and 0.75 are found over the western and northwestern sectors of the Amazon basin. Note that for phases 3, 4, 5, and 6, there is an inverted spatial pattern, that is, for regions where \({ OR}<1\) (i.e.: NEB and east Amazon), there are \({ OR}>1\) values over the western and northwestern parts of the Amazon basin.

Simply put, one can say that for each \({ PRP}^{+}_{e}\) event in phase 1, where \({ OR}<1\), there is a maximum of nine \({ PRP}^{+}_{e}\) events taking place in other phases. However, for regions where \({ OR}>1\), for every nine events in phase 1, there is only one event in the other phases.

Moreover, two main features that should be mentioned in Fig 8 are: (a) there is a spatial feature of persistence of statistically significant OR values between lag 0 and lag +7 (Fig. 8, second panel to the left); b) there is an inversion of \({ OR}\) values between lags +7 and +14. Then, the lag +14 pattern persists until lag +35. This shows that MITB is able to capture the propagation of the intraseasonal signal, as assessed by the \({ OR}\) analysis. Thus, MITB could potentially be used as a diagnostic and prognosis tool to study intraseasonal rainfall variability, because it captures precipitation events between 7 and 35 days.

4 Conclusions

The impact of the intraseasonal variability of precipitation over TBr was analysed through the development of the MITB index. This index has provided a consistent assessment on the time and intensity of the intraseasonal variability signal in the region. Also, it has given a new perspective for its use in monitoring and prognosis of convective activity related to the intraseasonal variability.

The MCA-based index was able to capture about 86 % of the variability between the dynamic variables and the TBr precipitation. The first two modes MCA1 and MCA2 reasonably describe the intraseasonal signal propagation; this is true especially for MCA1, which shows a similar response to that found in other studies (De Souza and Ambrizzi 2006; Muza et al. 2009). Also, \({ MITB}_1\) and \({ MITB}_2\), which were based on the first two MCA modes, show the highest correlation with a 10-day lag.

Also, a vectorial projection of \({ MITB}_1\) and \({ MITB}_2\) shows that phases 1, 2, 7, and 8 are related to the ocurrence of positive precipitation anomalies, whereas phases 3–6 show the same, but for negative anomalies of precipitation for most of TBr. For the maximum and minimum phases, that is, for the pair of phases 8–1 and 5–6, respectively, there is an inverse relationship of the precipitation sign (i.e.: whether positive or negative) between the northwestern Amazon region and the region under the influence of ITCZ and SACZ.

The \({ OR}\) analysis has shown an association between extreme precipitation events in tropical Brazil with strong MITB events. It also captured the temporal dependence of precipitation extremes with respect to both the phase and intensity of MITB. Significant values of \({ OR}>1\) are observed mainly in the propagation of the convective signal from phases 7, 8, 1, and 2. This is true both for lags 0 and +7. In phase 2 at lag +7, however, the convective system starts dissipating and most of TBr has an \({ OR}<1\) pattern. Then, there is a change in OR sign from lags +14 to +35 days, which confirms that MITB has a prognostic potential with respect to \({ PRP}^{+}_{e}\), with at least 14 to 35 days before the occurrence of an extreme precipitation event. This is especially the case for phases 3–4 and 5–6.

In summary, to answer the scientific questions posed at the start, in this research work it has been found that: (a) the equatorial mean from 15\(^ \circ\)N to 15\(^ \circ\)S, and the use of OLR, U850 and U200, were chosen because they were able to represent both the intraseasonal signal and its propagation, in addition to having a lower computational demand to elaborate the MITB index; and (b) MITB was able to capture the intraseasonal features over TBr. One of its main advantages is its ability to identify and reproduce extreme events related to the intraseasonal signal.

It is important to emphasize that the results as found in this paper reinforced the presence of atmospheric oscillations and their impact on precipitation over South America, mainly associated with extreme rainfall over TBr. Further work is underway to explore the relationship between the intraseasonal variability with the seasonal signal and interannual climatic events.