1 Introduction

Aboveground biomass (AGB) is a critical component of global carbon cycle and nutrient cycling (Costanza et al. 1997; Zhu et al. 2019). Amount in grassland AGB widely reflects the productivity and health status of grassland, contributing to monitoring grassland ecosystem functions and dynamics (Harris et al. 2020; Huang et al. 2018). Comprehensive dynamics of grassland AGB are influenced by key factors, and evaluating them is pivotal for assessing grassland productivity and conducting sustainable management for grassland ecosystems (Pan et al. 2023). While it is common to separately discuss the influence of climate or human activities on grassland AGB (Zhang et al. 2023a, b), further studies are needed to comprehensively consider key factors over time (Lei et al. 2022; Li et al. 2023). Therefore, constructing an accurate estimation model for grassland AGB and analyzing long-term and short-term driving factors, such as temperature, precipitation, soil texture, and grazing intensity that influence AGB dynamics in vast regions remain a challenging endeavor. Accurately and quantitatively assessing the driving factors of grassland AGB is of great significance for comprehending changes in grassland vegetation, establishing suitable livestock carrying capacity (He et al. 2022), evaluating the status of regional ecological environment (Huang et al. 2015), and promoting sustainable development of grassland resources (Campana et al. 2021; Adam et al. 2014; Delgado-Baquerizo et al. 2020; Godde et al. 2019).

Grassland AGB presents a significant challenge for ecologists, due to its rapid variability and the influence of various factors (Li et al. 2020; Song et al. 2018). In addressing this challenge, researchers have focused on tackling statistical issues like regression to the mean (RtoM) in assessing AGB change (Muha et al. 2012; Wagg et al. 2014). Many studies have emphasized the unsuitability of directly regressing the change value of a variable between two periods on its control variables due to the high (negative) correlation propensity to its initial value (Stigler 1980; Tu et al. 2005). For instance, Stevens and Van Wesemael (2008) addressed this issue in the analysis of covariance (ANCOVA) for soil organic carbon (SOC) change by using the average of the initial and end SOC as the covariate, aiming to mitigate the impact of the initial SOC on the SOC change (Tang et al. 2019). Moreover, considering the relationship between change and the initial value, the impact of control variables on AGB change should be intertwined with the influence of the initial AGB, rather than directly regressing AGB change on its controls, given the existence of RtoM (Karambas et al. 2016; Muha et al. 2012). Another challenge is mathematical coupling (MC), where one variable directly or indirectly contains all or part of another, leading to their joint analysis through correlation or regression (Dirmeyer et al. 2013; Wu et al. 2022). Attempting to regress the change value on the initial value along with other control variables may render the t test of the coefficient in a regression model inappropriate (Calizza et al. 2018; Tu et al. 2005). However, problem of MC persists when regressing the change value on the mean AGB, as using mean AGB as a covariate involves first regressing it by AGB change. The initial value's influence significantly affects the change value, resulting in the RtoM problem. Successfully eliminating this influence from the change value enables a more effective exploration of the relationship between the adjusted change value and its relevant variables through regression.

To address this, the chosen approach and model validation strategies are crucial. Temporally, we classified the effects of driving factors into long-term and short-term categories, with a particular emphasis on separating short-term effects. This study was based on the mechanism of ecological disturbance and steady-state theory in time scale to distinguish the long-term and short-term effects from the perspective of statistics, the Co-regression method was employed to mitigate RtoM and MC problems and enhance AGB estimation performance in grasslands. The primary questions were: (1) how can the Co-regression algorithm be utilized to assess long-term and short-term effects on grassland AGB change, improving model performance by identifying and addressing RtoM and MC problems? (2) What factors drive the variation of grassland AGB, and how can optimal variable combinations be determined for modeling grassland AGB? (3) How can the contributions of key variables to grassland AGB be quantitatively analyzed? This study broadens our understanding of how driving factors affect grassland AGB change, paving the way for better assessments of grassland AGB in the present and future.

2 Materials and methods

2.1 Study area

This investigation took place in Northern China, spanning latitudes 34.30° to 49.49°N and longitudes 94.34° to 126.99°E, covering Hebei Province, Shanxi Province, Inner Mongolia Autonomous Region, Ningxia Hui Autonomous Region, Liaoning Province, Jilin Province, Heilongjiang Province, Shaanxi Province, Gansu Province, and Qinghai Province (Fig. 1). The study area's elevation ranged from 500 m to 2,000 m. Climatic types in the region primarily included temperate continental, temperate monsoon, and cold temperate climates. Climatic conditions transitioned from semi-humid and semi-arid to arid areas from east to west, experiencing simultaneous rain and heat, along with substantial temperature variations between day and night. Mean annual temperature (MAT) varied between -0.36 °C and 12.23 °C, while mean annual precipitation (MAP) ranged from 43.82 mm to 627.06 mm (statistics are based on meteorological data from the 2000s), displaying a decrease from east to west (Peng et al. 2019). Soil types were divided into sandy soil, clay soil and loam soil according to the influencing factors of the parent material, among which sandy soil and clay soil were the main soil types in the study area.

Fig. 1
figure 1

Geographical location and distribution of sampling sites in Northern China

2.2 Data sources

The grassland AGB data originated from routine sampling surveys conducted at 243 sample sites across Northern China during the peak grass growing period in the 1980s and 2000s (supported by the Grassland Branch Center of the National Agricultural Science Data Platform). For the investigation of grassland biomass, the most important grassland types and groups with the largest distribution area in the northern grassland were selected (at least relatively homogeneous within the surrounding 2 km × 2 km), for which the geographical location of the center in each sample plot was precisely determined by GPS, and the species composition, distribution pattern, topography and soil conditions in the sample plot were determined. Mean AGB was computed for the peak season in both the 1980s and 2000s, and AGB was expressed as carbon density (g C·m−2) using the internationally commonly used conversion rate of 0.45 (Fang et al. 2007; Piao et al. 2004).

Additionally, climate data, including mean annual temperature and mean annual precipitation for 243 sample sites across 56 counties in the study area, were obtained from the China Meteorological Administration (http://cdc.cma.gov.cn/home.do) through spatial analysis and statistics in the 1980s and 2000s. Livestock inventory data for the 1980s and 2000s were sourced from the Rural Socio-economic Survey Team database of the China Bureau of Statistics for each county in the study area. The number of livestock was used to characterize the influence of grazing intensity in the two reference periods, calculated in terms of DSE hm–2 (DSE representing the amount of feed required by a two-year-old, 50 kg Merino sheep to maintain its weight).

The two sets of AGBs, climate and soil factors, and grazing intensity were utilized to formulate the grassland AGB models. The denotation and description of dependent variables in this study are provided in Table 1. The key factors included temperature (MAT1, MAT2, ΔMAT), precipitation (MAP1, MAP2, ΔMAP), soil texture (SClay, SSilt), and grazing intensity (LSK1, LSK2, ΔLSK), which collectively referred to the explanatory variables of the model (Table 2).

Table 1 Denotation and description of dependent variables
Table 2 Denotation and description of explanatory variables (controls)

2.3 Segregating long-term and short-term effects

The impacts of long-term and short-term factors on grassland AGB were quantitatively investigated using the Co-regression model, constructed with IBM SPSS 22.0. The Pearson correlation test was employed for correlation analysis, and the model's fitting degree was enhanced by eliminating relationships between observed variables. Apart from the long-term effects, the variation in AGBL2 was also influenced by the short-term effects of key factors, along with random fluctuations.

To separating long-term effects from short-term effects (Fig. 2):

$${\mathrm{CHG}}_{\mathrm T}={\mathrm{AGB}}_{\mathrm L2}-{\mathrm{AGB}}_{\mathrm L1}$$
(1)
$${\mathrm{CHG}}_{\mathrm S}={\mathrm{AGB}}_{\mathrm L2}-{\mathrm{PRED}}_{{\mathrm{AGB}}_{\mathrm L2}}$$
(2)
$${\mathrm{CHG}}_{\mathrm T}={\mathrm{CHG}}_{\mathrm L}+{\mathrm{CHG}}_{\mathrm s}$$
(3)

where CHGT represents the total effect, which could be divided into change under long-term effect (CHGL), and change under short-term effect and random error (CHGS).

Fig. 2
figure 2

Equations schematic diagram of CHGT, CHGL and CHGS

2.4 Co-regression model

The initial AGB served as the covariate, and AGB2 was regressed on AGB1 to predict the end AGB, thereby eliminating its impact from the end AGB. Consequently, an adjusted change in AGB was generated, as opposed to the (direct) change in AGB resulting from subtracting the (observed) initial value from the (observed) end value. Moreover, the adjusted change in AGB was statistically independent of the initial AGB. The Co-regression analysis method, akin to variance analysis in ANCOVA, is replaced by regression analysis after removing the impact of the covariate.

Two consecutive linear models, Regression 1 and Regression 2, were set up for the Co-regression model. Logarithmic values of the initial AGB (AGBL1) and the end AGB (AGBL2) were employed in conformity with the normality assumption. The first regression (Regression 1) regressed AGBL2 on AGBL1, and generated the predicted value of AGBL2 (PREDAGBL2). Subsequently, the impact of AGBL1 (acting as the covariate) on AGBL2 was removed by subtracting the PREDAGBL2 from AGBL2, and the difference (CHGS) was taken as adjusted change between the two periods, in contrast to the direct difference between AGBL2 and AGBL1.

CHGS was totally independent of AGBL1, and the second regression (Regression 2) could be set up by regressing the CHGS on its relevant variables without problems about RtoM or MC.

The Co-regression model comprised following two linear regressions:

Regression 1: regress the AGBL2 on AGBL1, subtract the predicted AGBL2 from the (observed).

AGBL2, thus generated the CHGS.

Regression 2: regress the CHGS on variables such as temperature, precipitation, soil texture and grazing intensity.

$${\text{Combined}}\_{\text{R}}2 = {{\text{R}}}_{1}^{2} + {{\text{R}}}_{2}^{2} (1 - {{\text{R}}}_{1}^{2})$$
(4)

where \({R}_{1}^{2}\) is the R-square of Regression 1 and \({R}_{2}^{2}\) is the R-square of Regression 2.

Taking the effect of Regression 1 as the long-term effect, Regression 2 as the short-term effect of the model, the following equations were obtained.

$${\text{Long}}-\mathrm{term effect }= \frac{{{\text{SSR}}}_{1}}{{\text{SST}}} = {{\text{R}}}_{1}^{2}$$
(5)
$${\text{Short}}-\mathrm{term effect }=\frac{{{\text{SSR}}}_{2}}{{{\text{SSE}}}_{1}}\times \frac{{{\text{SSE}}}_{1}}{{\text{SST}}} = {{\text{R}}}_{2}^{2} (1 - {{\text{R}}}_{1}^{2})$$
(6)
$$\mathrm{Error }=\mathrm{ SSE}2/\mathrm{SST }= 1 -\mathrm{ Combined}\_{\text{R}}2$$
(7)

where SST is the total sum of square for AGBL2; SSR1 is the regression sum of square for Regression 1; SSE1 is the error (or residual) sum of square for Regression 1, and also the total sum of square for Regression 2; SSR2 is the regression sum of square for Regression 2, and SSE2 is the error sum of square for Regression 2. \(\frac{{{\text{SSR}}}_{2}}{{{\text{SSE}}}_{1}}\) represents coefficient of determination of short-term effect, which is SSR2 divided by SSE1 (the total sum of squares of the short-term effect). \(\frac{{{\text{SSE}}}_{1}}{{\text{SST}}}\) serves as adjustment factor, which equals to 1 minus \(\frac{{{\text{SSR}}}_{1}}{{\text{SST}}}\), namely (1—\({R}_{1}^{2}\)).

2.5 Generalized Linear Model (GLM) and sum of squares (type III)

The multiple regression models of target variables on key driving factors were established, and then GLM method and sum of squares (type III) were applied to assign relevant importance of each factor on target variables.

$${\text{f}}({{\text{y}}}_{{\text{i}}};{\uptheta }_{{\text{i}}},\Phi )={\text{exp}}\left\{\frac{{{\text{y}}}_{{\text{i}}}{\uptheta }_{{\text{i}}}-{\text{b}}({\uptheta }_{{\text{i}}})}{{\text{a}}(\Phi )}+{\text{c}}({{\text{y}}}_{{\text{i}}},\Phi )\right\}$$
(8)

where parameter \({\uptheta }_{{\text{i}}}\) is a regular parameter, also known as identity link function, and variation with the exponent i (i = 1, 2, …, n), the disturbance factor \(\Phi\) is a constant.

$${\upeta }_{{\text{i}}}={{\text{X}}}_{{\text{i}}}^{{\text{T}}}\upbeta ={\sum }_{{\text{j}}=1}^{{\text{P}}}{{\text{X}}}_{{\text{ij}}}{\upbeta }_{{\text{j}}},{\text{i}}=1, 2, \dots \mathrm{ n}$$
(9)

For the ith observation Yi, the linear predicted value of the system part was a linear combination of the variables under the study.

$${\text{g}}\left({\upmu }_{{\text{i}}}\right)={\upeta }_{{\text{i}}}={{\text{X}}}_{{\text{i}}}^{{\text{T}}}\upbeta ,{\text{i}}=1, 2, \dots \mathrm{ n}$$
(10)

where g() is the connection function, which connects the expectation of the random part to the system part, \({\upmu }_{{\text{i}}}={\text{E}}({\text{Y}{i})}\) was the expectation of Yi.

According to the correlation coefficient analysis results, different combinations of explanatory variables and CHGS were selected for regression analysis, and based on the selection criteria of explanatory variables (Table 2), the final model was determined. The explanatory variables under the short-term effect included the initial value, the end value, the difference between the end value and the initial value of all explanatory variables. According to the nature of target variables, MAP2, ΔMAP, MAT2, ΔMAT, LSK2, ΔLSK, SClay and SSilt were selected as alternative explanatory variables. Finally, the Co-regression model's overall goodness of fit was assessed by applying variance decomposition.

3 Results

3.1 Decomposition of grassland AGB

Regression to the mean (RtoM) may arise from measurement error or natural causes, including the long-term impact of key driving factors on AGB change (Tu et al. 2005). In this study, the two AGB variables were positively correlated (Fig. 3a), suggesting either a genuine correlation or non-identical errors in initial and final measurements. Furthermore, the effect of RtoM was evident in the change of AGBL1 (CHGT), which correlates negatively with AGBL1 (Fig. 3b).

Fig. 3
figure 3

The relation of AGBL1 and AGBL2 to CHGT. (a) The relation between AGBL1 and AGBL2; (b) the relation between AGBL1 and CHGT

Due to the presence of RtoM, it was inappropriate to directly regress the change of AGBL to its controls before the impact of AGBL1 had been removed. It was even challenging to discern how much of the change was influenced by relevant controls versus RtoM. To focus on the influence of control variables on AGBL change, AGBL2 was regressed to AGBL1. Subsequently, the predicted AGBL2 was subtracted from the observed AGB2 to eliminate the effect of RtoM by removing the impact of AGBL1 from AGBL2. As shown in Fig. 4d, the adjusted change (CHGS) was almost entirely uncorrelated with AGBL1, indicating successful removal of the effect of RtoM.

Fig. 4
figure 4

Spatiotemporal trends in 1980s and 2000s, the change and relation of grassland AGB in the Northern China. a Trend of grassland AGB in 1980s; (b) trend of grassland AGB in 2000s; (c) changes of grassland AGB in the Northern China between 1980 and 2000s; (d) the relation between AGBL1 and CHGS

The average grassland AGB in Northern China was 50.59 g·m−2 in the 1980s and 47.33 g·m−2 in the 2000s. Spatial distribution patterns showed a consistent decline in grassland AGB from east to west during both periods (Fig. 4a, b). While the influence of initial AGB on AGB variation was mainly attributed to the long-term impact of key factors, the adjusted AGB change (Fig. 4c) was correspondingly interpreted as influenced by the short-term impact of control variables, including climate, soil texture and grazing intensity. In simpler terms, AGB change was affected by long-term controls, short-term controls, and random factors.

3.2 Separating long-term effect from short-term impact on grassland AGB

The adjusted AGB value can be regressed against various controls without the bias of initial values, as the adjusted AGB change shows minimal correlation with the initial AGB value. In Regression 1, the adjusted AGB change, denoted as CHGS, was derived after removing the impact of AGBL1. The influence of AGB1 on AGB2 and subsequent AGBL change could stem from measurement errors or natural causes, which were accounted for in CHGS by considering the natural factors affecting AGBL1's impact on AGBL2 over time. CHGS primarily reflects short-term control impacts.

Regression 2 explored the short-term effects of controls on CHGS. Identifying these control factors posed a challenge, with temperature, precipitation, soil texture, and grazing intensity typically being primary influencers of grassland AGB. Short-term impacts on AGBL change considered both long-term mean values and changes in relevant variables.

The combined R-square is the most effective measure to assess the Co-regression model’s impact, revealing how much of AGBL2’s variation (relative to its mean) is explained by the model. As the Co-regression model comprised two regressions, the combined R-square integrates two separate R-squares: the R-square of Regression 1 and product of the R-square of Regression 2 and (1-R 2) of Regression 1. Figure 5 depicts the relationships between multiple sums of squares and R-squares. Therefore, the Co-regression model clarifies 79.5% of AGBL2’s total variation. Additionally, it distinctly distinguishes between short-term and long-term control effects, with long-term impacts accounting for 73.6% of AGBL2’s variation, and short-term impacts for only 5.9%.

Fig. 5
figure 5

The relation between multiple sums of squares and R squares. SST: sum of squares for total; SSR: sum of squares for regression; SSE: sum of squares for residual or error

3.3 Effects of key variables on grassland AGB

Figure 6a illustrates the relationship between grassland AGB change (CHGS) and corresponding driving variables. During the study period, grassland AGB showed a positive correlation with mean annual temperature (Fig. 6d, e) and a negative correlation with mean annual precipitation (Fig. 6f, g). The maximum correlation coefficients were 0.17 (p < 0.01) and -0.29 (p < 0.01), respectively. The association between livestock carrying capacity (Fig. 6b, c) and grassland AGB varied over time, with correlation coefficients of -0.13 (p < 0.05) and -0.18 (p < 0.01) in the 1980s and 2000s, respectively, and an overall negative correlation (Coef. = -0.13, p < 0.05). Additionally, the correlation coefficient between soil clay content (Fig. 6h) and grassland AGB was 0.01, while the correlation coefficient between soil sand content (Fig. 6i) and grassland AGB was -0.12.

Fig. 6
figure 6

The relation between grassland AGB change and key variables. a Pearson correlations between grassland AGB change and the relevant variables; (b, c) relationship between AGB change and LSK; (d, e) relationship between AGB change and MAT; (f, g) relationship between AGB change and MAP; (h, i) relationship between AGB change and soil texture. The shade of the color and the size of the circle represent the strength of the correlation. * indicates the significant correlation at the 0.05 level; ** indicates the significant correlation at the 0.01 level; *** indicates the significant correlation at the 0.001 level. DSE represents Dry sheep equivalent

The CHGS was regressed under various combinations of these variables, and the best regression model was chosen based on three criteria: a higher R-squared value, significant t-values for each coefficient, and ecologically sensible explanations. Despite a strong correlation between MAP1 and ΔMAP (Coef. = -0.462, p < 0.001), no serious issues of multicollinearity were found in the regression. Thus, both MAP1 and ΔMAP were kept in the model due to their statistical significance. Moreover, the interactions between MAP2 and MAT2 were found to be insignificant. The results of the optimal regression, which aimed to reduce overfitting and improve transferability, are summarized in Table 3.

Table 3 The coefficients of Regression 2

The short-term change within precipitation (ΔMAP) showed significant positive impacts on the short-term change of AGBL, with ΔMAP (Beta = 0.232) being the most influential factors on CHGS. Although MAT1 is less significant (p = 0.089) than MAP1, it still contributed to the short-term variation of AGBL. Despite its insignificance, MAT2 was negatively correlated with CHGS (B = -0.766), suggesting that increasing temperatures could lead to a decrease in AGB. Soil texture, specifically SSilt, did not significantly influence CHGS (Beta = -0.169). While LSK2 was statistically insignificant in the model (p = 0.136), it showed a negative correlation with CHGS (Beta = -0.143), aligning with the anticipated relationship between grazing intensity and CHGS. Although the model's R-squared value may seem relatively low (R 2 = 0.161) for explaining the CHGS variation, the Co-regression model provided a satisfactory overall explanation for the variation in AGBL2.

In Regression 2, the main controls for CHGS were ΔMAT, MAP2, ΔMAP, LSK2 and SClay. The GLM was used to assign relevant importance of these controls. GLM analysis outcome is showed in Table 4.

Table 4 The result of GLM analysis

Dependent variable was CHGS; R 2 = 0.224; Adjusted R 2 = 0.208; SS equaled to sum of squares (type III) / SST × 100%, which was proportion of variances explained by the variable.

The total sum of SS (0.178) for intersect and variables was less than the R-square or SSR/SST (0.224). This is because SS only reflects direct impacts of variables on the dependent variable. However, introduced into the model, they can indirectly affect the dependent variable by influencing other explanatory variables. The difference (R 2 – ΣSS = 0.046) represents total indirect impacts of all controls on CHGS. Notably, SClay, ΔMAP and MAP2 were the most crucial factors, explaining 6.5%, 5.2%, and 2.6% of CHGS variation. Short-term precipitation changes like ΔMAP directly affect grassland AGB (Welti et al. 2020), while long-term changes in precipitation patterns, including alterations in annual precipitation averages or shifts in the timing and intensity of precipitation events, also influence aboveground biomass (Dullinger et al. 2020; Feng et al. 2021). Conversely, LSK2 had the smallest impact (0.6%). Precipitation and soil properties were identified as primary drivers of AGB dynamics, with long-term factors outweighing short-term ones and mutually reinforcing impacts.

4 Discussion

4.1 The validation and effect of Co-regression model

Oldham’s (1962) method is utilized to examine the presence and characteristics of RtoM. It asserts that if the actual end AGB value (observed value minus measurement error) is unrelated to the true initial AGB value, and if the variances of the initial or end measured values are identical, then the change in AGB is independent of the mean of the initial and end values. The accuracy verification of the model demonstrated a strong alignment between the estimated grassland biomass data and the measured data (Fig. 7a). The target AGB variable, influenced by the long-term joint action of all explanatory variables, exhibited a stable trend of development and change. This stability resulted in a clear correlation between the observed AGB values in the 1980s and 2000s (Fig. 7b). Subsequently, a linear correlation between AGBL1 and AGBL2 was established in Regression 1, generating an exponential relationship between AGB1 and AGB2. The effect of RtoM was shown in Fig. 7b. Let P represent an AGB1 value at which PREDAGB2 equaled AGB1 (P = 40.33). If AGB1 < P, then PREDAGB2 > AGB1, indicating a positive change in AGB; if AGB1 > P, then PREDAGB2 < AGB1, indicating a negative change in AGB.

Fig. 7
figure 7

The validation and relation of grassland AGB. (a) Validation of AGB estimation; (b) the relation between AGB1 and AGB2

In the Co-regression model, the temporal variation of AGB between two periods was investigated. While it wasn't possible to generate a growth curve for each site due to the limited two-sample data for each site, spatially distributed data served as the foundation for analyzing the relationship between AGB and its controls for each specific site. Spatial differences were observed in the influence of climate drivers on the spatiotemporal dynamics of grassland AGB in the study area. For instance, AGBL2 on AGBL1 was regressed based on the data from 243 sample sites, and the resulting regression line illustrated the relationship between AGBL2 and AGBL1 for each sample site. It's important to note that although this method adhered to statistical rules and assumptions, there was a notable issue of spatial autocorrelation specific to this study. Fortunately, this challenge could potentially be addressed by employing a multilevel model that conducts regressions based on different data groups (e.g., forming groups for sample sites with correlated data) (Ishikawa et al. 2021).

4.2 Driving factors influencing grassland AGB

Climatic changes exert a significant influence on AGB, with grassland ecosystems typically maintaining a dynamic equilibrium where vegetation growth, death, and decomposition balance to sustain a relatively stable AGB level (Shabbir et al. 2019). Ongoing climate change is a long-term trend that affects the entire productivity level of a region, with a stable baseline over a long time scale. Empirical studies highlight precipitation as the primary climatic factor affecting grassland AGB, influencing various functional and structural aspects (Ghani et al. 2022; Ghimire et al. 2019). Temperature is also crucial, shaping AGB responses and driving environmental changes like desertification (Ghimire et al. 2019). Short-term climate fluctuations disrupt ecosystem balance, notably impacting grassland AGB (Hossain et al. 2023). Disturbances such as extreme weather events (e.g., rainstorms, droughts, high temperatures) can damage grassland vegetation, reducing ecosystem productivity and lowering AGB (Hoover et al. 2014).

Strong correlations between soil parameters and grassland AGB have been consistently observed in various studies (Graham et al. 2021). This study underscores the significant impact of clay content on grassland AGB, attributed to clay's unique properties. Clay particles possess a large specific surface area and adsorption capacity, enabling them to absorb and retain water. Additionally, their negatively charged surface facilitates the absorption and retention of vital nutrients like nitrogen, phosphorus, and potassium, essential for plant growth (Munira et al. 2018). The microporous structure between clay particles enhances water retention, mitigating water loss, improving soil resilience to drought, and ensuring a steady water supply. Moreover, the adsorption and release of mineral elements in clay are integral to soil nutrient cycling, while its microporous structure fosters an optimal habitat for microbial communities, enhancing soil biodiversity and microbial activity (Zia et al. 2021).

The concept of grazing intensity has emerged as a scientific approach for understanding grassland AGB dynamics, with increased livestock density often linked to grassland degradation (Rinot et al. 2019; Zhao et al. 2023), which is not consistent with our result. Sustained grazing pressure is a long-term effect, while year-to-year grazing fluctuations are a short-term effect. Grazing by herbivores is a significant form of land use, impacting plant diversity and ecosystem function through livestock trampling, selective foraging, and excrement deposition (Estes et al. 2011). The effects of grazing on grassland AGB can vary across different succession stages (Zhang et al. 2023a, b). Grazing typically alters plant resources availability, fostering a more diverse environment and influencing soil nutrient cycling, consequently affecting plant productivity (Eskelinen et al. 2022).

4.3 Implications and limitations

The Co-regression model offers distinct advantages in accurately capturing complex nonlinear relationships between biophysical parameters and intricate environmental factors (Scheller et al. 2005; Zeng et al. 2019). Compared to traditional statistical methods, this model proves to be an effective and robust algorithm, showcasing superior abilities in distinguishing long-term variables from short-term factors influencing grassland AGB while establishing intricate interactions with fewer parameters (Kibret et al. 2016; Lehnert et al. 2015). While the Co-regression model tended to underestimate high AGB values and overestimate low AGB values, a phenomenon likely arising from the algorithmic properties and the averaging of single-tree predictions in the Co-regression model (Zwicke et al. 2013). Approximately 2% of extreme values in the dataset contribute to this bias towards average values. While the model simplifies complex ecosystem processes and separates interactions between main factors affecting AGB change, this may affect the accuracy of assessing real ecological status and the reliability of research results.

Grassland AGB, a vital indicator of complex ecological systems, is influenced by a blend of long-term and short-term factors, including climate change, soil parameters, and grazing intensity (Epstein et al. 1997; Peng et al. 2020). The long-term effect is a trend and the instinct, and the short-term effect reflects random fluctuations in the environment, interannual fluctuations, pulses, or extreme events, disasters, and changes in human activity caused by disasters. This study delves into effects of key driving factors on grassland AGB, distinguishing between long-term and short-term impacts, and further categorizing short-term effects into direct effects and indirect effects to understand the mechanism driving AGB changes. However, the study has certain limitations. It primarily considers temperature, precipitation and their variations, while other climatic factors, including wind direction, relative humidity, sunshine duration and other environmental conditions, are crucial for quantitative and in-depth study of grassland AGB (Gui et al. 2021; Jiang et al. 2021). The impacts of climate change on vegetation or AGB vary in different regions and time periods. Gui et al. (2023a, b) studied the effects of drought and climate change on vegetation dynamics, highlighting the time-delay and time-cumulative effects on vegetation coverage, which pose challenges for data collection and analysis.

This study further highlights that the contribution of climate to grassland AGB outweighs that of grazing in Northern China, challenging previous assumptions regarding the predominant role of human activities driving grassland AGB changes (Zhang et al. 2023a, b). Non-climatic factors may include microbial activity, soil pH, soil texture, topographic relief, and various human activities (Hu et al. 2020). Gui et al. (2023a, b) revealed that the climatic changes caused by anthropogenic emissions through aerosols and clouds affected vegetation photosynthesis and carbon flux. They advocate further consideration for impacts of climate change on atmospheric circulation and water cycle to better understand the impacts of anthropogenic emissions on terrestrial ecosystems.

5 Conclusion

This study effectively differentiated long-term and short-term impacts on grassland AGB, quantifying contributions from climatic factors, soil texture, and grazing intensity. Trends in AGB were analyzed using Co-regression and Generalized Linear Model methods, highlighting long-term factors' predominant influence. Divergent impacts from key variables on AGB trends emerged. Soil parameters, notably clay content, explained 6.5% of AGB variation, while mean annual precipitation played a substantial role, accounting for 5.2% and 2.6% of AGB variation, respectively. Determining grazing intensity's influence on AGB change proved challenging, with only marginal contribution detected. The pivotal role of soil properties and climate change in driving grassland AGB dynamics in Northern China became evident. Co-regression significantly enhanced model performance, mitigating issues like regression to the mean. This research expands understanding of AGB dynamics, informing grassland ecosystem management and conservation efforts.