1 Introduction

The onset of the summer monsoon usually signals the seasonal transition from winter to summer in the Northern Hemisphere, characterized by the onset of the rainy season, large-scale convection, and abrupt transitions of large-scale atmospheric circulation (Taniguchi and Koike 2006). An advanced or delayed start of the summer monsoon have severe impacts on agriculture (e.g., crop selection, planting, irrigation) and seriously affect a substantial fraction of the world’s population. As a key component of the Asian–Australian summer monsoon system, the South Asian summer monsoon (SASM), including the summer monsoon over the Bay of Bengal (BOB), Arabian Sea (AS), and India, is closely related to the occurrence of droughts and floods in the southern Indian Peninsula, Indochina Peninsula, and some regions in China (Liu and Ding 2008; Tao and Chen 1987; Xing et al. 2015).

In South Asia, there are two maximum summer rainfall areas: the central BOB and the southeastern AS (Wang and Ho 2002). The SASM first begins in the BOB in early May (Wang and Ho 2002; Wu and Zhang 1998; Xing et al. 2015; Yu et al. 2020; Zhang et al. 2017), and the BOB leads the monsoon onset over the AS and India by about one month because of the ‘monsoon onset barrier’ (Liu et al. 2014a). In late May, significant precipitation is observed over the southern AS near the equator, gradually spreading to most parts of the AS (Abe et al. 2013; Aneesh and Sijikumar 2020; Kajikawa et al. 2012). Around the beginning of June, the monsoon convection reaches the southernmost tip of the Indian Peninsula (in Kerala), and thus the Indian summer monsoon breaks out (Aneesh and Sijikumar 2020; Pai and Rajeevan 2009). Once the Indian summer monsoon has established in Kerala, this marks the full start of the SASM, whereupon abundant vapor is transported from the AS and BOB to eastern China (Liu 2013; Zhang 2001).

The onset of the SASM shows significant interannual variability, and it is modulated by many factors. For instance, following warm El Niño–Southern Oscillation (ENSO) events (El Niño), the breakout of the summer monsoon over the South Asian regions is usually postponed, while following cold ENSO events (La Niña), the SASM onset tends to advance (Liu et al. 2014b; Neena et al. 2020; Sun et al. 2017; Xavier et al. 2007). ENSO can efficiently modulate the onset dates of the SASM through regulating the sea surface temperature (SST) anomalies in tropical oceans (Liu et al. 2014b), the Walker circulation (Mao and Wu 2006; Meng and Wu 2000; Webster and Yang 1992), and the ENSO-induced ‘atmospheric-bridge’ (Sun et al. 2017). Neena et al. (2020) demonstrated that the El Niño in boreal spring can support two distinct eastward waveguides from the central and eastern Pacific to the SASM region, thus inducing Rossby waves to modulate the thermal structure over the SASM region in May and lead to a delayed monsoon onset. Besides, the seasonal evolution and variation of the South Asian High (SAH) play an important role in the development of the monsoon onset vortex, which is closely related to the onset of the SASM (Liu et al. 2013; Wu et al. 2010). The upper-level divergence and the easterly along the southern flank of the SAH, together with the lower-level southwesterly, form the summer circulation structure in SASM regions (Wei et al. 2019). The intensity and location of the SAH strongly depends on the diabatic heating over the Tibetan Plateau (TP) (Wu et al. 2015, 2007).

Generally, previous studies have mainly attributed the substantial interannual variation of the SASM onset to the following factors: (1) the arrival of a wet phase of a low-frequency intraseasonal oscillation and Madden–Julian Oscillation (Taraphdar et al. 2018); (2) the seasonal northward movement of the Intertropical Convergence Zone (Saha and Saha, 1980); (3) the generation of monsoon onset vortexes over the BOB and AS; (4) the role of SST anomalies and air–sea interactions, i.e., the AS warm pool, ENSO (Liu et al. 2014b; Mao and Wu 2006; Vinayachandran et al. 2007); (5) the establishment of the timing of the SAH over the Indochina Peninsula (Liu et al. 2013; Wei et al. 2014; Wu et al. 2015); and (6) the mechanical and thermal effects of the TP (Mao and Wu 2006; Wu and Zhang 1998). Among these factors, the influence of the TP has been consistently and continually emphasized by researchers (Abe et al. 2013; Boos and Kuang 2010; Duan et al. 2020; Liu et al. 2020; Wu et al. 2015; Yu et al. 2020).

The thermal effects of the TP on the SASM onset mainly include an increase of the land–sea thermal contrast by heating the mid-upper troposphere and the resultant strengthening of the surrounding atmospheric circulation. Strong spring surface sensible heating over the central-western TP contributes to an early onset of the Indian summer monsoon through increasing the land–sea thermal contrast (Zhang et al. 2015), while a positive (negative) surface sensible heating anomaly over the southern (northern) TP leads to a delay (advancement) of the SASM by weakening (enhances) the cross-equatorial flow to the AS and BOB (Yu et al. 2020). However, previous studies have focused mainly on the effect of the TP on the large-scale onset of the whole SASM region, and it is necessary to reveal the potential contribution from the TP thermal forcing on the monsoon onset in individual subregions such as the BOB, AS, and India. In this work, we aim to address the following three questions: (1) whether there is a significant linkage of monsoon onsets between the three subregions of the SASM; (2) whether there is a relationship between TP thermal forcing and the monsoon onset in the three subregions of the SASM; and (3) how the thermal forcing over the TP influences the onset of the SASM.

The remaining of the paper is arranged as follows. The data, methods and model employed in this study are briefly described in Sect. 2. Section 3 presents the climatological features of the onset of the SASM, and then in Sect. 4 the connections between the onset dates of each subregion of the SASM and the thermal forcing over the TP are investigated. Then, in Sect. 5, the physical mechanism is investigated using the results from an atmospheric general circulation model CAM4.0. And finally, a summary and discussion are presented in Sect. 6.

2 Data and methods

2.1 Data

The datasets employed in this study are as follows:

  1. (1)

    The daily atmospheric data (wind, temperature, specific humidity, and heating rates) during 1979–2016 were obtained from the Japan Meteorological Agency Japanese 55-year Reanalysis (JRA-55) dataset with a horizontal resolution of 1.25° × 1.25° (Kobayashi et al. 2015) at https://jra.kishou.go.jp/JRA-55/index_en.html#jra-55. The daily heating rate data include heating rates of large-scale condensation, convection, vertical diffusion, longwave radiation, and solar radiation.

  2. (2)

    The daily surface sensible heat flux and the latent heat release of condensation via precipitation over the TP at 80 routine weather stations for the period 1979–2016, and the total atmospheric diabatic heating (E) over the TP for the period 1984–2015, were obtained from Duan et al. (2018). Note that the diurnal variation in the bulk transfer heat coefficient was obtained by the sensible heat flux algorithm proposed by Yang et al. (2009). E comprises three components: surface sensible heat flux, latent heat release of condensation and radiation flux. The calculation of E is named the direct computational method (Ye and Gao 1979), which has been adopted in many studies (e.g., Chen et al. 1985; Duan et al. 2018; Wang et al. 2011). Comparing with the reverse computational method of estimating the atmospheric diabatic heating over the TP (Luo and Yanai 1984; Yanai et al. 1973), E can avoid the errors caused by the uncertainty of vertical velocity in different reanalysis data. The expression of E is as follows:

    $$ E = {\text{SH}} + {\text{LH}} + {\text{NR,}} $$
    (1)

where SH, and LH represent the local surface sensible heat and latent heat, respectively. NR represents the net radiation flux of the air column. NR can be obtained by the net radiation values measured at the top of atmosphere and at the surface. The formula is as the follows:

$$ {\varvec{NR}} = {\varvec{R}}_{\infty } - {\varvec{R}}_{0} = \left( {{\varvec{S}}_{\infty }^{ \downarrow } - {\varvec{S}}_{\infty }^{ \uparrow } } \right) - \left( {{\varvec{S}}_{0}^{ \downarrow } - {\varvec{S}}_{0}^{ \uparrow } } \right) - \left( {{\varvec{F}}_{0}^{ \downarrow } - {\varvec{F}}_{0}^{ \uparrow } } \right) - {\varvec{F}}_{\infty } , $$
(2)

Where \({{\varvec{R}}}_{\boldsymbol{\infty }}\) and \({{\varvec{R}}}_{0}\) are the net radiation values measured at the top of atmosphere and at the surface. S and F represent the shortwave and longwave radiation flux. The superscripts \(\downarrow \) and \(\uparrow \) represent downward and upward transport of the radiation flux, and the subscripts 0 and \(\boldsymbol{\infty }\) represent the radiation flux at the surface and at the top of atmosphere, respectively. More details about the calculation of SH, LH and NR are given by Duan et al. (2018) and Zhao et al. (2018).

The 80 routine weather stations over the TP are mainly distributed in the central and eastern TP, with only three stations in the western TP (Fig. 1a). The complex terrain and harsh living conditions limit the distribution of observation stations, especially over the western TP. Previous studies have shown that JRA-55 performs well in estimating the atmospheric heat source over the TP (Liu and He 2015; Wang et al. 2018; Zhu 2012). Therefore, JRA-55 reanalysis datasets are adopted to represent the heating effect over the TP. The expression of atmospheric diabatic heating based on JRA-55 datasets (E_JRA55) is as follows:

$$ E_{{{\text{JRA55}}}} = \mathop \int \nolimits_{{P_{s} }}^{Pt} \left( {{\text{VDFHR}} + {\text{LRGHR}} + {\text{CNVHR}} + {\text{LWHR}} + {\text{SWHR}}} \right)d_{P} , $$
(3)

where P is the air pressure, the limits of integration used here (\({P}_{s}=1000 \mathrm{hPa}\) and \({P}_{t}=100 \mathrm{hPa}\)) represent the approximate depth of the free troposphere, VDFHR denotes the sensible heat, the sum of CNVHR and LRGHR represents the latent heat release due to condensation, and the sum of LWHR and SWHR represents the net radiation of the air column.

Fig. 1
figure 1

Climatological spatial patterns based on a observed datasets and b JRA-55 datasets and time series c of the atmospheric heat source/sink (E), sensible heat (SH), latent heat (LH) and net radiation (NR) over the TP in May. The purple dots in a and b represent the distribution of the 80 routine weather stations and TP areas with an altitude > 2000 m, respectively. The solid black curves in c denote the 80-station averaged E, SH, LH and NR based on the observed datasets, and the blue dotted curve denotes the dot areas’ averaged E, SH, LH and NR based on JRA-55 in b, and the correlation coefficients between them are marked in the third panel. The black contours in a and b indicate the 2000 m topographic height. Statistical significance is shown by asterisks: *p < 0.1, **p < 0.05 and ***p < 0.01 (the same hereafter)

The atmospheric diabatic heating (E) based on JRA-55 datasets can represent well the spatial distribution and temporal variation of the observed datasets over the TP in May—that is, the atmospheric heating over the TP gradually decreases from the southeastern TP to the northwestern TP, and the diabatic heating over the western TP is relatively weak. Although the E over the TP based on JRA-55 datasets is generally stronger than that in the observation, the correlation coefficient between them is 0.65 (statistically significant at the 99% confidence level) (Fig. 1c). In addition, the temporal variations of E and latent heat over the TP in May exhibit close consistency and both show opposite variations to the sensible heat. This may indicate that the relative importance of sensible heat in the atmospheric diabatic heating decreases as the latent heat gradually becomes more dominant in May (Zhao et al. 2018).

  1. (3)

    Monthly SST data were obtained from the Extended Reconstructed SST dataset, version 5 (ERSST.v5), with a resolution of 2° × 2° over the period 1978–2016 (Huang et al. 2017) (https://psl.noaa.gov/data/gridded/data.noaa.ersst.v5.html).

  2. (4)

    Other fields used in this study include the National Oceanic and Atmospheric Administration (NOAA) daily interpolated outgoing longwave radiation (OLR), with a 2.5° × 2.5° resolution, obtained from the National Center for Atmospheric Research (NCAR) archives (1979–2016) (Liebmann and Smith 1996) (https://www.psl.noaa.gov/cgibin/db_search/DBSearch.pl?Dataset=NOAA+Interpolated+OLR&Variable=Outgoing+Longwave+Radiation), and pentad precipitation on 2.5° × 2.5° grids during 1979–2016 from the Climate Prediction Center Merged Analysis (CMAP) Xie and Arkin 2021) (https://www.psl.noaa.gov/data/gridded/data.cmap.html).

Statistical methods, such as linear regression, empirical orthogonal function (EOF) analysis, composite analysis, Pearson correlation, and partial correlation analysis, were utilized. EOF, linear regression and correlation analysis were performed with detrended data in this study. The Student’s t-test was adopted to evaluate the statistical significance of composite and linear regression analysis. The pentad dataset in this study consists of 73 pentads per year, with the 12th pentad from February 25 to March 1 regardless of whether it was a leap year.

2.2 Model

This study adopted the Community Atmosphere Model, version 4 (CAM4.0; Gent et al. 2011), which is the atmospheric component of the Community Earth System Model, version 1.2.0 (CESM1.2.0; Hurrell et al. 2013) developed by NCAR (http://www.cesm.ucar.edu/models/cesm1.2/). CAM4.0 includes moderate changes in model configurations compared to CAM3, and one of the most important ones is the change from the spectral core used in CAM3.0 to the finite-volume dynamical core used in CAM4.0 (Neale et al. 2010), which exhibits great improvements in simulating climate mean states (Hurrell et al. 2013). The f19_g16 configuration, with a horizontal resolution of approximately 2° (1.9° × 2.5°) for the atmospheric model with the “F_2000” component, was selected. The control run (CTRL) was forced by the climatological annual cycle of SST with the fixed external forcing (greenhouse gases, aerosols, volcanic activity and solar activity) at the year-2000 level. The CTRL run was integrated for 17 model years, and the first dates of the last 15 years were adopted to provide the initial conditions for the two subsequent sensitivity runs.

In order to investigate the effect of the seesaw pattern of diabatic heating over the TP, the sensitivity experiments are designed as follows. We mainly changed the vertical profiles of diabatic heating due to convection over the eastern TP region (20°–40° N, 85°–105° E) (Fig. 2a) in the two sensitivity runs because the significant abnormal signals of diabatic heating are over the eastern and southeastern TP (Fig. 2), and the strong diabatic heating over the eastern TP associated with precipitation dominates in May. Furthermore, the vertical profile of diabatic heating due to convection over the eastern TP region above 1500 m in May was set to 0.5 and 1.5 times the normal based on the CTRL run in the negative (EXP_NEG) and positive (EXP_POS) sensitivity experiments, respectively (Fig. 2c). This is because the main anomalies are concentrated in the middle troposphere (300–600 hPa), and the strongest and weakest vertical profiles of diabatic heating due to convection over the TP are approximately 0.5–1.5 times to the climate mean state in reanalysis data (Fig. 2b). Similar experimental designs were adopted in Sun et al. (2019) and Zhao et al. (2021). Each sensitivity experiment contained 15 members provided by 15 different initial conditions from CTRL to remove the interference by atmospheric noise, and each member was integrated from May 1st to August 1st.

Fig. 2
figure 2

a The heating area (green meshed shading; 20°–40° N, 85°–105° E) over the Tibetan Plateau (> 1500 m) in the sensitivity runs. b, c Vertical profile of temperature tendency due to convection (units: K day−1) averaged over the areas in a based on b JRA-55 datasets (the black, purple and green lines represent the climatological mean value, the strongest value, and the weakest value during 1979–2016), respectively, and c model simulation (the black, purple and green lines represent the CTRL value, the strong sensitivity experiment value (EXN_POS), and the weak sensitivity experiment value (EXN_NEG)). The red contours indicate the 2000 m topographic meters

3 Climatological features of the onset of the SASM

3.1 Definition and division of the SASM onset

Monsoons are complex multi-scale phenomena, and defining their onsets has always been a controversial topic. Previous studies have tended to adopt the precipitation (Fasullo and Webster 2003; Pai and Rajeevan 2009; Wang and Ho 2002), wind (Taniguchi and Koike 2006; Wang et al. 2009; Webster and Yang 1992; Xing et al. 2015) and land–sea thermal contrast (represented by the meridional temperature gradient) (Mao et al. 2004; Xavier et al. 2007) to define the onset of the SASM. Generally, these definitions used to detect the onset of monsoon can be divided into two categories: (1) local-scale definitions and (2) regional to large-scale definitions (Bombardi et al. 2019). The definitions of monsoon onsets based on the land–sea thermal contrast and wind were designed to represent the heating gradient associated with the large-scale monsoon circulation and the continental-scale land–sea breeze, respectively. These large-scale definitions can accurately represent the monsoon interannual variability, but fail to capture the spatial patterns of the monsoon onset. Instead, definitions based on grid-cell scale precipitation (e.g., Wang and Ho 2002) can provide the onset date at the spatial resolution of the original dataset, which can also be aggregated in space to present large-scale phenomenon (Bombardi et al. 2019). Therefore, we adopt the definition of summer monsoon onset from Wang and Ho (2002): the first Julian pentad in which the relative pentad mean rainfall rate (relative to the January mean) exceeds 5 mm day−1 is defined the summer monsoon onset.

The summer monsoon onset dates over the BOB (10°–20° N, 85°–95° E) and AS (8°–20° N, 60°–75° E) (Fig. 3a, green box) are obtained by using the spatial region mean relative pentad mean rainfall rates (Wang and Ho 2002), and the Indian summer monsoon onset is taken from the India Meteorological Department, which is defined by the onset of summer monsoon rainfall over the state of Kerala (Pai and Rajeevan 2009), circumventing uncertain effects caused by fewer grid points over the southern tip of the Indian Peninsula.

Fig. 3
figure 3

Climate-mean OLR (shading, units: W m−2) and 850-hPa wind (vectors, units: m s−1) in the antecedent evolution of the SASM onset from pentad 24 to 32 (ae). Magenta contours in ae represent the 14,280 and 14,300 geopotential meters. The green boxes indicate the AS and BOB regions. f Pentad of monsoon onsets determined by the relative pentad-mean rainfall rate. The red contours indicate the 2000 m topographic height

3.2 Climatology of the SASM

The SASM first begins over the eastern BOB in early May (Fig. 3a and f), characterized by deep convection and a sudden transition of westerly at 850 hPa (Fig. 3a and b) (Wu and Zhang 1998; Yu et al. 2015). With the gradually northward-moving deep convection, the summer monsoon over most regions in the BOB is fully established around mid-May (28th pentad), with the OLR below 230 W m−2 (Fig. 3c and f). Typically, OLR below 230 W m−2 indicates the outbreak of monsoon convection (Mao and Wu 2006; Mao et al. 2004; Tamura and Koike 2010; Wu and Zhang 1998). After that, the convection continues to advance eastwards rather than propagate westwards (Fig. 3f). Diabatic heating due to convection contributes to the warming of the central SAH (Qian et al. 2002), favoring a gradual development on the southern side of the SAH over the BOB, enhancing the upper-level divergence-pumping (Liu et al. 2013) and further strengthening the convection. This configuration of the diabatic heating center and the SAH is conducive to the maintenance and development of the convection system. Then in late May, the strengthening and westward extension of the SAH and the intensive low-level westerly flow facilitate the summer monsoon onset over the AS and India (Fig. 3d and f). The summer monsoon onset over the AS occurs in pentads 28–30, while the onset of the Indian summer monsoon in the 31st pentad. The summer monsoon onset over the northern Indian Peninsula is in late June, about 1 month later than that over the southern Indian Peninsula. These results are consistent with previous studies (Athira and Abhilash 2021).

It is noteworthy that the variability of the monsoon onset date over the three subregions of South Asia are closely related to each other (Fig. 4). The onset of the Indian summer monsoon closely follows the onset over the AS, with a correlation coefficient of 0.54 (statistically significant at the 99% confidence level), and the correlations between the onset over the BOB and that over both the AS and India are 0.28 (statistically significant at the 90% confidence level). The early onset of the BOB summer monsoon accelerates the early increase in monsoon convection, causing considerable latent heat release, which may contribute to the precipitation over the AS and Indian Peninsula through modulating the development of the SAH to a certain degree (Liu et al. 2013; Xing et al. 2015).

Fig. 4
figure 4

Time series of summer monsoon onset dates (presented by the pentad of the year). The magenta, blue and black curves represent the summer monsoon onset over the AS, BOB, and India, respectively. The onset dates of the summer monsoon over the AS and BOB are based on the method of Wang and Ho (2002), while the Indian summer monsoon onset is from the Indian Meteorological Department. The correlation coefficients between them during the period 1979–2016 are marked at the bottom of each panel

The BOB summer monsoon onset is in early May, about one month earlier than that in the AS and India, the onset dates of which are in late May or early June. During the period between early May and early June, the diabatic heating over the TP increase rapidly owing to considerable precipitation condensation (Liu et al. 2020). Therefore, the huge thermal forcing of the TP in May cannot be ignored, raising the question as to whether it plays an important role in the onset of the SASM. We address this key issue in the next section.

4 Influence of the thermal forcing over the TP on the SASM onset

The anomalous heating in the seasonal transition may play an important role in determining the anomalous state of the Northern Hemisphere summer monsoon (Yasunari 1991). Simultaneously, with the increasing precipitation over the southeastern TP, the effect of the overall thermal forcing heating over the TP is quite considerable in May (Ye and Gao 1979).

4.1 Connections between the thermal forcing of the TP and the onset date of the SASM

Correlations between the AS, Indian and BOB summer monsoon onsets and the diabatic heating over the TP are shown in Fig. 5a–c. Obviously, the onset dates of the SASM, including that in the BOB, AS and India, are significant negatively related to the diabatic heating over the southern and southeastern TP and positively related to the diabatic heating over the northwestern TP. This seesaw pattern of correlation coefficients with a southeastern–northwestern inverse distribution over the TP is highly consistent with the leading EOF mode of the diabatic heating over the TP in May (Fig. 5d). Figure 5e shows the time series of the leading EOF mode (PC1), which is significantly related to the diabatic heating averaged over the TP (r = 0.58, p < 0.01). To simplify the following analysis, the positive-southeastern–negative-northwestern seesaw pattern of diabatic heating over the TP is called the positive seesaw pattern, and verse versa.

Fig. 5
figure 5

Correlation pattern between the atmospheric heat source/sink (E) over the TP in May and summer monsoon onsets over a the AS, b India, and c the BOB. d First EOF mode of the atmospheric heat source/sink over the TP (> 2000 m) in May during 1979–2016 based on JRA-55 datasets, and e the time series of PC1, and E_JRA55 that is the atmospheric heat source/sink over the TP estimated through Eq. (3), with the correlation coefficient between them during the period 1979–2016 marked in e. The value in the upper right is the explained variance of the mode. Black and green dotted areas in ac are where the statistical significance passed the 90% and 95% confidence levels, respectively. The black contours indicate the 2000 m topographic height

It is noteworthy that the seesaw pattern of diabatic heating over the TP in May has a significant negative correlation with the monsoon onset date in the AS, India and BOB, with the correlation relationships between them being − 0.70, − 0.58 and − 0.35 (all statistically significant above the 95% confidence level), respectively (Table 1). These relationships are more significant on interannual timescales, with all of them being statistically significant above the 95% confidence level, indicating that the increasing (decreasing) diabatic heating over the southeastern (northwestern) TP is conducive to the early (late) onset of the summer monsoon over the AS and India. Meanwhile, the close relationship between the diabatic heating over the TP and the BOB summer monsoon onset exists mainly over the southern slopes of the TP (Fig. 5c). As documented, the early onset of the BOB monsoon in early May contributes to a supply of abundant moisture to the southern TP during May, causing huge diabatic heating (Liu et al. 2019; Prasad 2005; Xing et al. 2015; Zhang et al. 2017).

Table 1 Correlation coefficients (first row) and partial correlation coefficients (last three rows) during 1979–2016 between PC1 and summer monsoon onset dates over the AS, India and BOB, having removed the influences from IPO (interdecadal Pacific Oscillation) index (second row), Niño3.4 index (thrid row) and SID (subtropical Indian Ocean dipole) index (fourth row) from December of the previous year to May of the current year. Statistical significance is shown by superscript asterisks: *p < 0.1, **p < 0.05 and ***p < 0.01

SST anomalies and air–sea interactions are important factors that modulate the timing of the SASM (Deng et al. 2016; Liu et al. 2014b; Mao and Wu 2006; Wu and Hu 2015). Correlation coefficients between the onset of the Indian summer monsoon and the SST from the previous December to May in the current year shows that the most significant signals exhibit an interdecadal Pacific Oscillation (IPO)–like correlation pattern, an ENSO-related correlation pattern, and southern Indian Ocean dipole–related correlation pattern (Fig. S1). To explore whether the relationship between the seesaw pattern of diabatic heating over the TP (PC1) in May and the onset of the SASM is independent of the impacts from ocean signals, we adopted the partial correlation analysis shown in Table 1. Despite the influence from ocean signals having been controlled, the relationships between the diabatic heating over the TP and monsoon onset dates over the AS and India are still significant, passing the 99% confidence levels (Table 1). Therefore, the seesaw pattern of diabatic heating over the TP in May can be considered as an independent precursor signal of the monsoon onset over the AS and India. Conversely, the correlation coefficients between the monsoon onset dates over the AS and India and the ocean signals (including the IPO, ENSO and subtropical Indian Ocean dipole in May) dramatically decrease after controlling the influence of the seesaw pattern of diabatic heating over the TP (Table S1). These partial correlation analysis results indicate that, although the preceding IPO, ENSO and subtropical Indian Ocean dipole can influence the monsoon onset over the AS and India (Fig. S1 and Table S1), the effect of the diabatic heating over the TP in May is more dominant.

The onset of the BOB monsoon in early May contributes to the positive seesaw pattern of diabatic heating over the TP, especially the atmospheric heating over the southern slopes of the TP (Fig. 5c), but this impact of the onset of the BOB monsoon is no longer robust after removing the influence of ENSO and the IPO (Table 1). The above results confirm the statistical relationships between the seesaw pattern of diabatic heating over the TP in May and the SASM onset over the AS and India, so the physical mechanism underlying these relationships is worthy of exploration.

4.2 Impacts of the thermal forcing of the TP on the onset of the SASM

Composite and regression analysis are employed to exhibit the atmospheric circulation associated with the seesaw pattern of the diabatic heating over the TP in May. Nine years of positive seesaw patterns of diabatic heating over TP (i.e., 1981, 1984, 1990, 1999, 2000, 2001, 2002, 2004 and 2006) and ten years of negative seesaw patterns (i.e., 1979, 1982, 1986, 1987, 1997, 1998, 2005, 2012, 2014 and 2015) were selected for the composite analysis. The criterion was ± 0.75 standard deviations for the PC1 time series during 1979–2016 (black curve in Fig. 5e). Note that the results were unaffected by the choice of threshold (Fig. S2).

Figure 6 shows the regression of circulation including wind, moisture transport, temperature at the mid-upper troposphere, and OLR in May against PC1 of the TP diabatic heating. The positive seesaw pattern of diabatic heating over the TP in May induces strong upward motion over the southern TP and downward motion over the western TP and even over the Iranian Plateau (Fig. 6b)—results that are consistent with previous studies (Liu et al. 2017; Wu et al. 2012). The downward motion over the western TP and Iranian Plateau together with moisture divergence, suppresses the local precipitation and convection (Fig. 6c, d and f), increases the temperature over the western TP in the mid-upper troposphere (500–200 hPa) (Figs. 6e and 7a). The thermal structure in the upper troposphere can reflect the land–sea thermal gradient better than in the lower troposphere (Dai et al. 2013) and dominate the summer monsoon onset (Zhou et al. 2020). Thus, the warming temperature in the mid-upper troposphere over the western TP facilitates the early transition of the meridional temperature gradient (Mao et al. 2004), and further supports the early onset of summer monsoon rainfall over the AS and India (Fig. 6c, d). Meanwhile, because the center of the SAH tends to move towards the area with the larger heating rate (Cen et al. 2020; Qian et al. 2002), the strengthened SAH is located to the west of its climate-mean position, with its ridgeline located at near 20° N (Fig. 6a) (Liu et al. 2020; Yu et al. 2020). \(u\approx 0,\frac{\partial u}{\partial y}>0\) is defined as the ridgeline of the SAH (Liu et al. 2014a). The easterly along with the southern flank of the SAH superimposes well with the low-level westerly and moisture convergence, which supports the early onset of the monsoon rainfall over the AS and India (Fig. 6c, d).

Fig. 6
figure 6

source over the TP in May during 1979–2016: a 150-hPa geopotential height (shading, unit: gpm) and 150-hPa wind (vectors, units: m s−1); b 500-hPa vertical velocity (shading, units: Pa s−1) and 500-hPa wind (vectors, units: m s−1); c precipitation (shading, units: mm day−1) and 850-hPa wind (vectors, units: m s−1); d surface to 100‐hPa vertically integrated moisture divergence (shading, units: 10−5 kg m−2 s−1) and moisture transport anomalies (vectors, units: kg m−1 s−1); e temperature vertically averaged over 500–200 hPa (shading, unit: K); and f OLR (shading, units: W m−2). The green, magenta and blue contours (curves) in a indicate the composite 14,280 geopotential heights (the ridgelines of the SAH) in the climatology, in positive PC1 years and negative PC1 years. Black dotted areas and black vectors indicate statistical significance at or above the 99% confidence level. The red contours indicate the 2000 m topographic height

Regression of atmospheric variables in May quantities against the PC1 of the atmospheric heat

Fig. 7
figure 7

Composite differences in the meridionally averaged (25°–40° N) temperature budget between the positive and negative seesaw pattern of diabatic heating over the TP: a temperature tendency (∂T/∂t); b horizontal advection of temperature (− adv.T); c adiabatic heating (ω term); and d diabatic heating (\({{\varvec{Q}}}_{1}/{{\varvec{c}}}_{{\varvec{p}}}\) (contours, units: K day−1) and zonal circulation (vectors, units: \({\varvec{v}}\) in m s−1, − 3 \({\varvec{\omega}}\) in 10−2 Pa s−1). The red, blue shading and green vectors indicate statistical significance at or above the 95% confidence level. The black shading indicates the topography

The temperature budget equation can elucidate the physical mechanisms contributing to the warming air column over the western TP. The temperature tendency (\(\partial T/\partial t\)) at each pressure level can be estimated using the following equation:

$$ \frac{{\partial {\mathbf{T}}}}{{\partial {\mathbf{t}}}} = - \left( {{\mathbf{u}}\frac{{\partial {\mathbf{T}}}}{{\partial {\mathbf{x}}}} + {\mathbf{v}}\frac{{\partial {\mathbf{T}}}}{{\partial {\mathbf{y}}}}} \right) - {{\varvec{\upomega}}}\left( {\frac{{\partial {\mathbf{T}}}}{{\partial {\mathbf{p}}}} - \frac{{{\varvec{\upalpha}}}}{{{\mathbf{c}}_{{\mathbf{p}}} }}} \right) + \frac{{{\mathbf{Q}}_{1} }}{{{\mathbf{c}}_{{\mathbf{p}}} }}, $$
(4)

where T is the air temperature; t is time; u, v and \({\varvec{\omega}}\) represent the zonal, meridional and vertical velocity, respectively; p is pressure, \({{\varvec{c}}}_{{\varvec{p}}}\) is the specific heat at constant pressure, \(\boldsymbol{\alpha }\) represents the specific volume; and \({{\varvec{Q}}}_{1}\) is the atmospheric apparent heat source (Yanai et al. 1973). The temperature tendency (\(\partial {\varvec{T}}/\partial {\varvec{t}})\) consists of the horizontal advection of temperature term \(\left(-\left({\varvec{u}}\frac{\partial {\varvec{T}}}{\partial {\varvec{x}}}+{\varvec{v}}\frac{\partial {\varvec{T}}}{\partial {\varvec{y}}}\right)\right)\), adiabatic heating term \(\left(-{\varvec{\omega}}\left(\frac{\partial {\varvec{T}}}{\partial {\varvec{p}}}-\frac{\boldsymbol{\alpha }}{{{\varvec{c}}}_{{\varvec{p}}}}\right)\right)\), and diabatic heating term (\({{\varvec{Q}}}_{1}/{{\varvec{c}}}_{{\varvec{p}}}\)). In fact, the \({{\varvec{Q}}}_{1}\) is computed via the temperature budget equation, and the correlation coefficient between \({{\varvec{Q}}}_{1}\) and \({{\varvec{E}}}_{\mathbf{J}\mathbf{R}\mathbf{A}55}\) is 0.93, further verifying the consistency of the TP diabatic heating between the reverse and direct computational methods.

The diagnosis with the temperature budget equation (Fig. 7) shows that the warming air column over the western TP is significantly linked with the seesaw pattern of diabatic heating over the TP. The strong upward motion over the eastern TP and downward motion over the western TP (Fig. 7d) is also detected in Fig. 6b. Over the eastern TP, strong diabatic heating causes significant upward motion, and the increasing temperature due to diabatic heating to the east of 90° E is offset by the decreasing adiabatic heating due to upward motion, leading to cooling over the eastern TP. Between 80° E and 90° E, meanwhile, the diabatic heating dominantly contributes to the warming air column (Fig. 7a, d). Over the western TP, the warming air column is mainly attributed to the strong adiabatic heating due to the downward motion (Fig. 7c) and the subsequent horizontal advection of temperature caused by the conservation of mass in the mid-upper troposphere (Fig. 7b).

To further illustrate the influence of the seesaw pattern of diabatic heating over the TP on the summer monsoon onset dates of the AS and India, the temporal evolutions of the composite OLR, SAH and temperature in the mid-upper troposphere between positive and negative seesaw pattern years are shown in Fig. 8. The climate-mean monsoon convection (low OLR) firstly starts at around 90° E during the beginning of May, signifying the summer monsoon onset over the BOB, and the monsoon over the AS and India break out around the end of May along with the westward-extending SAH represented by the 14,280-gpm isoline (Fig. 8a, b). When the temperature ridge reaches to 20° N at around 1 June (Fig. 8c, d), this means the meridional temperature gradient within the latitudinal band of 10°–20° N is positive and signals the onset of the SASM (Mao et al. 2002). In the years of positive seesaw patterns of diabatic heating over the TP, the temperature ridge reaches to 20° N earlier, in mid-May, with strong monsoon convection and a westward-extending SAH appearing around the AS (India) in mid-May (late-May) (Fig. 8a). During the years with a negative seesaw pattern, meanwhile, the SASM onset is delayed to early June (Fig. 8d). The strong monsoon convection and westward-extending SAH over the AS and India are significant after early June (Fig. 8b).

Fig. 8
figure 8

Composite evolution of a, b meridionally averaged (5°–15° N) OLR (shading, units: W m−2) and c, d zonally averaged (50°–90° E) temperature, vertically averaged over 500–200 hPa (shading, unit: K) for a, c PC > 0.75 standard deviation years, b, d PC <  − 0.75 standard deviation years. The black contour represents 230 W m−2 (a, b) and 246 K (c, d) for climatological mean. The green (magenta) curves in a, b and c, d indicate the 14,280 geopotential height at 20° N and the maximum ridge line, respectively, for climate mean states (|PC|> 0.75 standard deviation years). The light pink denotes 20° N

The above results indicate that the diabatic heating over the TP plays an important role in the early/late onset of the SASM. On the one hand, the seesaw pattern of diabatic heating induces the anomalous warming in the mid-upper troposphere over the western TP, providing a favorable meridional temperature transition for the occurrence of the monsoon rainfall over the AS and India. On the other hand, the westward movement of the SAH associated with the diabatic heating over the TP leads to upper-level divergence-pumping over the AS and India, which superimposes over the low-level westerly and supports favorable vertical circulation over the two regions (Liu et al. 2013, 2014b).

5 Simulation of the impacts of the thermal forcing over the TP on the onset of the SASM

The data diagnosis shows that the seesaw pattern of diabatic heating over the TP is beneficial to the SASM onset, including the summer monsoon over the AS and India. Examining the onset of the SASM represented by the mid-upper tropospheric temperature in the sensitivity runs (Fig. 9) reveals that the mid-upper temperature ridge that responds to the strong (weak) diabatic heating over the eastern TP in May reaches 20° N earlier (later) than the climate-mean state, i.e., the SASM onset is around mid-May when strong diabatic heating over the eastern TP is set in EXP_POS, while it is delayed to around early June in EXP_NEG (Fig. 9). Hence, the response of the SASM onset represented by the meridional temperature gradient under strong or weak diabatic heating over the eastern TP in May is indeed advanced or delayed.

Fig. 9
figure 9

Zonally averaged (50°–90° E) temperature, vertically averaged over 500–200 hPa (shading, unit: K) for a EXP_POS, b EXP_NEG. The black contour represents 246 K for climatological mean. The magenta and green curves indicate the maximum ridge line for sensitive experiment (EXP_POS and EXP_NEG) and control experiment (CTRL). The light pink line denotes 20° N

Figure 10 shows the difference in circulation and precipitation in May between EXN_POS and EXN_NEG to detect the physical effect of the thermal forcing of the TP in May. As mentioned in Sect. 4.2, the circulation response to the diabatic heating over the eastern TP in May supports an early onset of the SASM, including the summer monsoon over the AS and India. In the upper troposphere, the diabatic heating over the TP leads to the westward movement of the SAH, enhancing upper-level divergence-pumping over the AS and India (Fig. 10a, b). The easterly, along with the southern flank of the SAH, coordinates well with the lower-level southwesterly, and forms a summer circulation structure favorable to precipitation in the SASM regions (Fig. 10a, c). In the lower troposphere, the westerly and abundant moisture transport over the AS and India result in significant above-normal precipitation over the eastern AS and western coast of the Indian Peninsula. In addition, significant upward motion appears over the southeastern TP, accompanied by downward motion over its surrounding regions, especially the western TP and Iranian Plateau, in the difference between EXN_POS and EXN_NEG (Fig. 10b), and the response of these vertical movements over the eastern and western TP is intensified with the expansion of the diabatic heating over the eastern TP (Fig. 11a–e). This adiabatic heating caused by the anomalous vertical motions over the TP, and the warm advection of temperature by the easterly anomaly in the mid-upper troposphere, gradually heats the mid-upper troposphere (Fig. 11f–i). As a result, the increasing mid-upper temperature over the TP regions enhances the land–sea thermal contrast and contributes to an early transition of the meridional temperature gradient (Fig. 9a), further supporting the advanced summer monsoon onset. All these results are generally consistent with the data analysis reported in Sect. 4.2.

Fig. 10
figure 10

Differences in May quantities between EXP_POS and EXP_NEG: a 150-hPa geopotential height (shading, unit: gpm) and 150-hPa wind (vectors, units: m s−1); b 500-hPa vertical velocity (shading; units: Pa s−1) and 500-hPa wind (vectors, units: m s−1); c precipitation (shading, units: mm day−1) and 850-hPa wind (vectors; units: m s−1); d surface to 100‐hPa vertically integrated moisture divergence (shading, units: 10−5 kg m−2 s−1) and moisture transport anomalies (vectors, units: kg m−1 s−1); e temperature, vertically averaged over 500–200 hPa (shading, unit: K). The green, magenta and blue contours (curves) in a indicate the composite 14,260 geopotential heights (the ridgelines of the SAH) in CTRL, EXP_POS and EXP_NEG, respectively. Black dotted areas and black vectors indicate statistical significance at or above the 90% confidence levels. The red contours indicate the 2000 m topographic height

Fig. 11
figure 11

Differences in ae meridionally averaged (25°–40° N) diabatic heating (contours, units: W m−2) and zonal circulation (vectors, units: \({\varvec{v}}\) in m s−1, − 3 \({\varvec{\omega}}\) in 10−2 Pa s−1) and fi air temperature (contours, unit: K) between EXP_POS and EXP_NEG from the 25th to 29th pentad. Black dotted areas and green vectors indicate statistical significance at or above the 90% confidence level

However, inevitably, there are some differences between the results of the numerical experiments and the data analysis, which may be related to the lack of atmosphere–ocean interaction in the AGCM and the intensity of the heating source set over the eastern TP. For instance, the rain belts and westerly over the AS and India shift northwards significantly. The anticyclone (Fig. 10a) and increasing mid-upper tropospheric temperature (Fig. 11) in the difference between EXN_POS and EXN_NEG are more significant over the eastern TP. There are significant downward motions and negative precipitation over the northern Indian Peninsula and Indo-China Peninsula associated with strong upward motion and positive precipitation over the eastern TP. The possible reasons for these differences are as follows, i.e., the positive heat source set in the sensitivity runs may trigger a weaker response of the circulation in other regions; while in contrast, the response of the circulation over the eastern TP seems more significant. Nevertheless, the simulated general circulation features associated with anomalous thermal forcing over TP in May agree with the data analysis, which further confirms the significant effect of the thermal forcing over the TP on the early/late onset of the SASM, including the summer monsoon over the AS and India.

6 Summary and discussion

6.1 Summary

Using station datasets from 80 meteorological stations over the TP, JRA-55 data, and OLR and SST from NOAA during 1979–2016, the connections between the thermal forcing of the TP and summer monsoon onset over South Asia, including the AS, India, and BOB, were investigated. Moreover, sensitivity experiments based on CAM4.0 were used to further confirm the physical mechanisms involved. The main conclusions can be summarized as follows:

  1. (1)

    The leading mode of the diabatic heating over the TP in May, established through EOF analysis, shows a seesaw pattern with a southeastern–northwestern dipole distribution. A positive diabatic heating pattern is linked to an early onset of the summer monsoon over the AS and India. This relationship is independent of ENSO, IPO, and the southern Indian Ocean dipole. The anomalous seesaw pattern of diabatic heating over the TP in May, therefore, can be considered as an independent precursor signal for the onset of monsoon over the AS and India. Meanwhile, the relationship between the diabatic heating over the TP and the onset of the BOB summer monsoon mainly appears over the southern slopes of the TP.

  2. (2)

    The seesaw pattern of the diabatic heating anomaly over the TP in May can trigger an anomalous response of vertical motion over the TP. The diabatic heating over the eastern TP, adiabatic heating due to downward motion over the western TP, and warm horizontal advection heating stimulated by the anomalous vertical motion, induce the warming in the mid-upper troposphere over the western TP, providing a favorable meridional temperature transition for the occurrence of monsoon rainfall over the AS and India, thus supporting an early monsoon onset. Furthermore, the westward movement of the SAH associated with the seesaw pattern of diabatic heating over the TP strengthens the upper-level divergence-pumping over the AS and India, and the easterly—along with the southern flank of the SAH—superimpose well with the low-level westerly flow, further stimulating the onset of the SASM, including the summer monsoon over the AS and India.

  3. (3)

    Numerical experiments based on the atmospheric component of CAM4.0 further confirm that the positive seesaw pattern of diabatic heating over the TP in May facilitates a westward-extended SAH, intensifying the anticyclone over the TP, warming the temperature in the mid-upper troposphere, and strengthening the low-level westerly and strong upward (downward) motion over the eastern (western) TP, thus supporting an earlier onset of the summer monsoon onset over the AS and India.

6.2 Discussion

The seesaw pattern of the diabatic heating over the TP exerts great impacts on the summer monsoon onset over South Asia, thus it is necessary to reveal the cause of the seesaw pattern of diabatic heating over the TP. Previous studies suggest that the Indian Ocean SST is a major forcing for the precipitation over the TP in May through affecting the summer monsoon over the BOB (Chen and You 2017; Zhao et al. 2018). This result is further verified in this work. The correlation coefficient between the diabatic heating seesaw pattern and the onset date of the BOB monsoon is of − 0.35, passing 95% confidence level. The precipitation anomaly over the southeastern TP, corresponding to the east center of diabatic heating, can be attributed to the transport of water vapor associated with anomalous BOB summer monsoon activities (Liu et al. 2019; 2017; Zhu et al. 2020). Besides, other factors including the SST in the North Atlantic (Cui et al. 2014; Yu et al. 2020) and the absorbing aerosol (Lau et al. 2006; Meehl et al. 2008) can also modulate the atmospheric heating over the TP in late spring, which is not addressed in this study.

In fact, the early onset of the BOB monsoon is often followed by the early monsoon onset over both the AS and India. The diabatic heating over the northeastern BOB due to deep convection after the BOB monsoon onset facilitates the SASM onset via modulating the SAH (Liu et al. 2014a). As mentioned in Sect. 4.2, the positive seesaw pattern of diabatic heating over the TP in May is in favor of the meridional temperature gradient reverse in the mid-upper troposphere over the AS and India, and such an influence is independent from ENSO, IPO and southern Indian Ocean dipole. Moreover, the difference in the diabatic heating seesaw pattern over the TP between the early and delayed SASM onset becomes obvious from the 5th pentad before the monsoon onset (Fig. S3). Therefore, the diabatic heating over the TP and the onset of the BOB monsoon may influence the onset of the SASM simultaneously.

The interannual variation of the SASM onset is complex and receives influences from many other factors such as Madden–Julian Oscillation (Taraphdar et al. 2018), ENSO (Liu et al. 2014b), the timing of the SAH (Liu et al. 2013), anthropogenic aerosols (Bollasina et al. 2013, 2014), and black carbon (Mahmood and Li 2012). Therefore, the combined effect of these factors should be addressed in future studies. Moreover, a fully coupled model that includes atmosphere–land–ocean interactions is needed to explore a comprehensive mechanism for the SASM onset.