Introduction

In recent years, the relationship between forests and sustainable development has undergone substantial progress (Linser and Lier 2020). Wood serves as a fundamental and valuable forest resource due to its renewable nature, recyclability, and sustainable characteristics. It has consistently demonstrated that wood holds several advantages over alternative materials, primarily in terms of its lower environmental impact (Janiszewska-Latterini and Pizzi (2023); Keshvardoostchokami et al. (2023); Lukawski et al. (2023)).While the conservation and utilization of forest resources have long been crucial subjects in forest management, contemporary developments have brought forth novel dimensions to this discourse. The expansion of forest cover and the augmentation of forest productivity have assumed paramount importance as essential strategies for addressing climate change in the forthcoming 30–50 years. This recognition is shared by numerous countries and international organizations, who acknowledge the crucial role forests play in mitigating the adverse impacts of climate change. For instance, in the USA, forests play a crucial role in the overall carbon cycle and climate regulation. They act as carbon sinks, absorbing more CO2 than they emit. The preservation and expansion of forests are therefore important strategies for mitigating climate change and minimizing CO2 emissions (Walters et al. 2023). Hence, modeling undesirable output (CO2 emissions, etc.) for various forest production systems has attracted considerable scholarly attention and is viewed as pivotal in safeguarding our planet’s ecological balance and combating the adverse effects of climate change. To tackle this, one well-established managerial tool is data envelopment analysis (DEA) which was initially developed by Charnes et al. (1978) and has since been extended by other scholars in order to model undesirable output and measure environmental efficiency (Long et al. 2015). The analysis conducted by the DEA involves evaluating and comparing the environmental efficiency of a specific set of decision-making units (DMUs) based on their utilization of multiple inputs and corresponding outputs. For example, some studies have treated environmental factors as undesirable outputs, such as carbon emissions and waste generation (Seiford and Zhu 2002; Hua and Bian 2007; Zhou et al. 2014; Maghbouli et al. 2014), while others have considered them as inputs, such as energy consumption and water usage (Liu and Sharp 1999; Dyckhoff and Allen 2001; Hailu and Veeman 2001). However, the current methodology falls short in accurately representing the intricacies of the manufacturing process. To address this limitation, Färe et al. (1989) and Picazo-Tadeo et al. (2005) proposed an alternative approach that utilizes a directional distance function and incorporates a weak disposability assumption. In recent attempts, this methodology aims to expand the assessment of positive outputs while also contracting the evaluation of negative outputs (Fujii and Managi 2013; Huang et al. 2014; Tongying et al. 2017; Fan et al. 2017; Yu et al. 2018). In the field of forestry and its related activities, considerable research has been conducted to measure environmental efficiency (Obi and Visser (2018); Obi and Visser (2020); Obi et al. 2023). The predominant model used by scholars in this regard is the SBM-DEA model (Yang et al. 2011; Li et al. 2021; Zhang and Xu 2022). However, there is only one instance of applying a two-stage DEA model for measuring environmental efficiency. Specifically, Tan et al. (2023) employed the super-efficient DEA model to assess the forestry eco-efficiency (FECO) of 30 provinces and cities in China between 2008 and 2021. Additionally, the study utilized the Tobit model to examine the influencing factors on FECO, with the aim of gaining deeper insights into the level of sustainable development in forestry. However, DEA is traditionally recognized as a data-driven methodology that can give rise to issues related to homogeneity. For example, there might exist some contextual factors which impact on DMUs’ environmental efficiency and steer forest managers toward an unfair comparison. Banker and Natarajan (2008)’s definition of contextual variables includes those variables that may be exogenously fixed as well as others that may be under the control of the DMU managers. Managers should therefore include two-stage efficiency measurements in their evaluation process; firstly, they should calculate environmental efficiency to assess DMUs’ performance. Secondly, they need to separately adjust the effects of contextual factors by various regression models in order to obtain reliable results (Djordjević et al. 2023). In the context of DEA methodology, the inquiry recently revolved around ascertaining the maximum feasible augmentation in input allocation for a unit with the objective of augmenting its outputs by a specific magnitude, while simultaneously preserving its current efficiency levels should the unit persist with its operations (Zhang and Cui 2016). To close this theoretical gap, Inverse DEA (IDEA) is an analytical technique employed in post-DEA sensitivity analysis to address resource allocation problems. Its primary aim is to identify the optimal quantities of inputs and/or outputs for each DMU when subjected to perturbations in either inputs or outputs. Since Wei et al (2000) formulated the first instance of an inverse DEA model in 2000, the IDEA approach has garnered considerable attention and popularity in recent years, primarily due to its wide range of applications across various sectors. These sectors encompass business (Hosseininia and Saen 2020; Amin and Ibn-Boamah 2023), supply chain management (Kalantary and Saen 2019; Gharibi and Abdollahzadeh 2021; Moghaddas et al. 2022), education (Guijarro et al. 2020; Le et al. 2021), manufacturing, sustainable production (Hassanzadeh et al. 2018; Yousefi et al. 2021), energy, and environment (Ghiyasi 2019; Lim 2020; Orisaremi et al. 2022). In terms of environmental efficiency assessment, a groundbreaking inverse DEA model was developed with the objective of minimizing greenhouse gas (GHG) emissions across 23 oil companies situated in the USA and Canada (Wegener and Amin 2019). This innovative model provided valuable insights and strategies for effectively managing and reducing GHG emissions within the oil industry. In another recent investigation carried out by Emrouznejad et al. (2019), a unique approach so-called inverse DEA was utilized to allocate CO2 emissions among specific sectors within the Chinese manufacturing industries. The research findings revealed three distinct stages in the process: reduction of total CO2 emissions, allocation to two-digit industries, and subsequent allocation to various provinces. Nevertheless, there is only encompassing research conducted by He et al. (2022); the authors employed DEA to illustrate the efficiency of China's forest carbon sink. Additionally, a gray prediction model was utilized to estimate the alteration in the input indicator as China approaches peak carbon levels. Lastly, the inverse DEA model was applied to investigate the increase in forest carbon sink across various provinces within China.

In all the above-mentioned recent studies, the main goal is to determine the specific input and/or output adjustments necessary for the DMUs to attain a predefined efficiency target. Hence, the initial hypothesis of this practical research is to determine if the suggested methodology is capable of mitigating CO2 emissions and how much the contextual factors can influence the results of this study.

To the best of our awareness, no study has been conducted to assess the environmental efficiency of the US forest sector using the two-stage DEA and simultaneously IDEA approaches, even though this sector holds immense importance for the region. This presents a unique opportunity to explore and uncover new insights into the environmental performance of the sector. To address this void, the novel contribution of this study is threefold:

  • Evaluate environmental efficiency by incorporating the weak disposability assumption to effectively mitigate CO2 emissions in the initial stage.

  • Utilize a regression model as an intermediate analysis in the second stage to justify the impact of contextual factors.

  • Minimize the levels of undesirable outputs, specifically CO2 emissions, to the greatest extent possible within some predefined scenarios. Indeed, our developed IDEA model enables us to conduct a sensitivity analysis on the results of environmental efficiency and ultimately determine the optimal variations of the applied dataset.

The remaining sections of this practical research are outlined as follows: Sect. “Problem statement” describes the problem statement and research questions. Sect. “Methodology” provides a detailed description of the applied DEA, regression analysis, and IDEA modeling techniques. Sect. “An application to forest sector” applies the proposed procedure to a real dataset in the US forest sector. Sects. “Results”, “Discussion”, and “Concluding remarks” present the obtained results, discussion, and concluding remarks, respectively.

Problem statement

Suppose there are \(J\) forest plots with each one using \(I\) inputs to generate \(R\) desirable outputs and \(K\) undesirable outputs. Moreover, we assume that in addition to these inputs and outputs, there are a finite number of contextual and explanatory variables that have significant impact on the performance of the plots. The work process in a sample forest plot is depicted in Fig. 1.

Fig. 1
figure 1

A systemic view to a sample forest plot

How do the contextual variables affect the technical efficiency of forest plots?

Consider a specific \({DMU}_{o}:\) Suppose we are interested in reducing the level of undesirable outputs from \({w}_{o}\) to \({w}_{o}-{\varepsilon }_{o}\), while preserving the current efficiency level. How much the level of desirable outputs and inputs of \({DMU}_{o}\) should be reduced?

Methodology

Environmental efficiency assessment

Suppose there are \(J\) forest plots (each one as a DMU) to be evaluated and\({x}_{j}={\left({x}_{1j} , . . . , {x}_{Ij} \right)}^{T}\ge 0\), \({y}_{j}={\left({y}_{1j}, . . . , {y}_{Rj} \right)}^{T}\ge 0\) and \({z}_{j}={\left({w}_{1j}, . . . , {w}_{Kj} \right)}^{T}\ge 0\) are respectively, the input, desirable output, and undesirable output vectors of plot\(j\). We assume that in addition to the plot-specific inputs and outputs, there are a finite number of contextual variables that have significant impact on the process. Suppose that \({\left({z}_{1j}, . . . , {z}_{Lj} \right)}^{T}\ge 0\) denotes the vector of contextual variables. To achieve plot-specific efficiency, we use a two-stage procedure involving efficiency calculation in the first stage and removing the impact of contextual variables in the second stage. In this sense, in the first stage, we use the weak disposable model of Kuosmanen (2005) to calculate plot-wise environmental efficiency scores. Then, in the second stage, we use ordinary least squares (OLS) technique to remove the impact of contextual variables on the efficiency scores generated from the first stage.

To estimate the technical efficiency of plot \("o"\) in the first stage, we use the following output-orientation model of Kuosmanen (2005):

$$ \begin{gathered} \varphi_{o}^{*} = \min \varphi \hfill \\ {\text{s}}{\text{.t}}{.} \hfill \\ \mathop \sum \limits_{j = 1}^{J} \left( {\lambda_{j} + \mu_{j} } \right)x_{mj} \le x_{mo} , m = 1,...,M, \hfill \\ \mathop \sum \limits_{j = 1}^{J} \lambda_{j} y_{rj} \ge \varphi y_{ro} , r = 1, \ldots ,R, \hfill \\ \mathop \sum \limits_{j = 1}^{J} \lambda_{j} w_{kj} = w_{ko} , k = 1, \ldots ,K, \hfill \\ \mathop \sum \limits_{j = 1}^{J} \left( {\lambda_{j} + \mu_{j} } \right) = 1, \hfill \\ \lambda_{j} , \mu_{j} \ge 0, j = 1, \ldots , J. \hfill \\ \end{gathered} $$
(1)

Suppose \({\varphi }_{o}^{*}\) is relative environmental efficiency of \({{\text{DMU}}}_{o}\). In order to remove the impact of contextual variables on efficiency scores, we use ordinary least squares method. The production frontier in technology set of model 1 is monotone increasing, piecewise linear and concave. Hence, as Banker and Natarajan (2008) stated, the regression of the calculated efficiency scores by Model (2) on the contextual variables using ordinary least squares provides good estimation of the parameters of contextual variables. In this sense, in order to refine efficiency scores, we will apply the following regression model in the second stage:

$$\mathrm{Log }({\varphi }_{o}^{*})={\rho }_{0}+{\rho }_{1}{z}_{1o}+{\rho }_{2}{z}_{2o}+\dots +{\rho }_{L}{z}_{Lo}+\beta $$
(2)

in which \(\mathrm{Log }({\varphi }_{o}^{*})\) is the logarithm of the environmental efficiency score of \({DMU}_{o}\) obtained from model 2. In regression model (2), \({\rho }_{0}\) and \(\beta \) are, respectively, the intercept and error term. The coefficients \({\rho }_{l}:\text{\hspace{0.17em}\hspace{0.17em}}l=1,...,L\) can be positive or negative. The signs of \({\rho }_{l}\) indicate that the \(l-{\text{th}}\) contextual variable has a direct or inverse impact on the environmental performance of \({{\text{DMU}}}_{o}\). Upon estimating the regression parameters using the least squares method, it becomes feasible to compute the residuals. The accurate environmental efficiency is therefore estimated as:

$${\overline{\varphi } }_{o}^{*}={\varphi }_{o}^{*}-\left[\mathrm{Log }\left({\varphi }_{o}^{*}\right)-\left({\rho }_{0}+{\rho }_{1}{z}_{1o}+{\rho }_{2}{z}_{2o}+\dots +{\rho }_{L}{z}_{Lo}+\beta \right)\right]$$

An inverse DEA model

In this section, we discuss the problem of inverse DEA in two scenarios:

First, we examine the case that if we reduce undesirable outputs to a certain amount, how much should we reduce the inputs and the desirable outputs in order to maintain the level of environmental efficiency?

Second, we investigate that if we are interested in increasing the level of outputs to a certain amount, how much the inputs and undesirable outputs are increased while preserving the level of environmental efficiency?

In the first approach, the problem is: if \({{\text{DMU}}}_{o}\) decreases its current level of undesirable outputs to \({w}_{o}-\pi \), how much should the inputs and desirable outputs be reduced to maintain the current efficiency level. We believe that reducing undesirable outputs requires reducing inputs and desirable outputs. In order to determine the optimal values of the changes in inputs and desirable outputs, we solve the following multi-objective linear programming problem model (3):

$$ \begin{gathered} {\text{Max}} \delta_{m} : m = 1, \ldots , M \hfill \\ {\text{Min}} \gamma_{r} : r = 1, \ldots , R \hfill \\ {\text{Min}} \mathop \sum \limits_{m = 1}^{M} s_{m} \hfill \\ {\text{Min}} \mathop \sum \limits_{r = 1}^{R} d_{r} \hfill \\ {\text{s}}{\text{.t}}{.} \hfill \\ \mathop \sum \limits_{{\begin{array}{*{20}c} {j = 1} \\ \end{array} }}^{J} \left( {\lambda_{j} + \mu_{j} } \right)x_{mj} + s_{m} = \varphi_{o}^{*} (x_{mo} - \delta_{m} {), }m = 1, \ldots M, \hfill \\ \mathop \sum \limits_{{\begin{array}{*{20}c} {j = 1} \\ \end{array} }}^{J} \lambda_{j} y_{rj} - d_{r} = \left( {y_{ro} - \gamma_{r} } \right), r = 1, \ldots , R, \hfill \\ \mathop \sum \limits_{j = 1}^{J} \lambda_{j} w_{kj} = w_{ko} - \pi_{k} , k = 1, \ldots ,K, \hfill \\ \mathop \sum \limits_{{\begin{array}{*{20}c} {j = 1} \\ \end{array} }}^{J} \left( {\lambda_{j} + \mu_{j} } \right) = 1, \hfill \\ \lambda_{j} \ge 0, j = 1, \ldots , J, \hfill \\ s_{m} ,d_{r} , \delta_{m} {, }\gamma_{r} \ge 0, {\text{for all}} m {\text{and}} r{. } \hfill \\ \end{gathered} $$
(3)

Clearly, the vector \({\left({x}_{o}-\delta , {y}_{o}-\gamma , {w}_{o}-\pi \right)}^{t}\) belongs to \(Pos(A)\), in which \(A={\left[X, Y, W\right]}^{t}\) and is the set of all non-negative linear combinations of \(A\). (Note that \(X, Y\) and \(W\) are matrixes of all inputs, desirable outputs and undesirable outputs, respectively.) This guarantees the feasibility of model (3).

In model 3, the r-th undesirable output is reduced by \({\pi }_{r}\) and we are interested in determining the minimum values of reduction in desirable outputs and maximum values of reduction in inputs. It should be pointed out that \({\pi }_{r}\) are user-defined values. Model 3 is a multi-objective linear programming model, and it is not easy to calculate an optimal solution to satisfy all objects. Suppose.

\(\delta ={\text{Min}} \left\{{\delta }_{m}: m=1, \dots , M\right\}\) and \(\gamma ={\text{Max}} \left\{{\gamma }_{r}: r=1, \dots , R\right\}\). Clearly, \(\delta \le {\delta }_{m},\mathrm{ for all} m=1, \dots , M\) and \(\gamma \ge {\gamma }_{r}, \mathrm{for all} r=1, \dots , R\). In order to derive a non-dominated solution, we can easily solve the following single-objective model:

$$ \begin{gathered} {\text{Max }}\delta - \gamma - \sum\limits_{{m = 1}}^{M} {s_{m} } - \sum\limits_{{r = 1}}^{R} {d_{r} } \hfill \\ {\text{s}}{\text{.t}}. \hfill \\ \mathop \sum \limits_{{\begin{array}{*{20}c} {j = 1} \\ \end{array} }}^{J} \left( {\lambda _{j} + \mu _{j} } \right)x_{{mj}} + s_{m} = x_{{mo}} - \delta _{m} ,m = 1, \ldots M, \hfill \\ \mathop \sum \limits_{{\begin{array}{*{20}c} {j = 1} \\ \end{array} }}^{J} \lambda _{j} y_{{rj}} - d_{r} = \varphi _{o}^{*} \left( {y_{{ro}} - \gamma _{r} } \right),r = 1, \ldots ,R, \hfill \\ \sum\limits_{{j = 1}}^{J} {\lambda _{j} } w_{{kj}} = w_{{ko}} - \pi _{k} ,k = 1, \ldots ,K, \hfill \\ \mathop \sum \limits_{{\begin{array}{*{20}c} {j = 1} \\ \end{array} }}^{J} \left( {\lambda _{j} + \mu _{j} } \right) = 1, \hfill \\ \delta \le \delta _{m} ,{\text{for all}}\;m = 1, \ldots ,M, \hfill \\ \gamma \ge \gamma _{r} ,{\text{for all}}\;r = 1, \ldots ,R, \hfill \\ \lambda _{j} \ge 0,j = 1, \ldots ,J, \hfill \\ s_{m} ,d_{r} ,\delta _{m} ,\gamma _{r} \ge 0,{\text{for all }}m\;{\text{and }}r. \hfill \\ \end{gathered} $$
(4)

Now suppose we are interested in increasing the level of desirable outputs from \({y}_{ro}\) to \({y}_{ro}+{\gamma }_{r}\). The object is to determine the new optimal values for inputs and undesirable outputs, while preserving the level of environmental efficiency. To determine the optimal values of the changes in inputs and undesirable outputs, we solve the following linear programming problem:

$$ \begin{gathered} {\text{Min}} \pi + \delta + \mathop \sum \limits_{m = 1}^{M} s_{m} + \mathop \sum \limits_{r = 1}^{R} d_{r} \hfill \\ {\text{s}}{\text{.t}}{.} \hfill \\ \mathop \sum \limits_{{\begin{array}{*{20}c} {j = 1} \\ \end{array} }}^{J} \left( {\lambda_{j} + \mu_{j} } \right)x_{mj} + s_{m} = x_{mo} + \delta_{m} {, }m = 1, \ldots M, \hfill \\ \mathop \sum \limits_{{\begin{array}{*{20}c} {j = 1} \\ \end{array} }}^{J} \lambda_{j} y_{rj} - d_{r} = \varphi_{o}^{*} \left( {y_{ro} + \gamma_{r} } \right), r = 1, \ldots , R, \hfill \\ \mathop \sum \limits_{j = 1}^{J} \lambda_{j} w_{kj} = w_{ko} + \pi_{k} , k = 1, \ldots ,K, \hfill \\ \delta \le \delta_{m} , {\text{for all}} m = 1, \ldots , M, \hfill \\ \pi \le \pi_{k} , {\text{for all}} k = 1, \ldots , K, \hfill \\ \mathop \sum \limits_{{\begin{array}{*{20}c} {j = 1} \\ \end{array} }}^{J} \left( {\lambda_{j} + \mu_{j} } \right) = 1, \hfill \\ \lambda_{j} \ge 0, j = 1, \ldots , J, \hfill \\ s_{m} ,d_{r} , \delta_{m} {, }\pi_{k} \ge 0, {\text{for all }} m, k {\text{ and }}r{. } \hfill \\ \end{gathered} $$
(5)

In model 5, \({s}_{m},{d}_{r},\delta {, \delta }_{m}\text{,}\pi , {\pi }_{k}, {\lambda }_{j}\) and \({\mu }_{j}\) are decision variables and \({\gamma }_{r}\) is user-defined values. An important point to be noted is that model (5) may lead to infeasibility in some real cases. This is due to the fact that the user-defined values \({\gamma }_{r}\) may be infeasible in practice. In this case, an interaction may be useful to achieve a feasible plan.

An application to forest sector

We now proceed to illustrate the practical application of our proposed approach by employing a dataset comprising 89 forest plots situated in the state of Oklahoma, USA. It should be noted that forest plots serve as the unit of observation for the forest inventory analysis (FIA) program. A standard plot typically comprises about four subplots, each with a radius of 7.3 m (equivalent to 0.015 hectares). Within each standard plot, trees with a diameter greater than 13 cm are measured. Additionally, within each subplot, a nested microplot with a 2.1-m radius (equivalent to 0.001 hectare) is utilized to measure trees with a diameter less than 13 cm (Burrill et al. 2021). This dataset, obtained from the FIA (USDA Forest Service 2023), encompasses detailed information regarding the ecosystem services generated by these forest plots in 2018.

In this application, we seek to estimate the environmental efficiency of the plots with emphasis on their ability to generate desirable outputs and reduce undesirable outputs. In this sense, we chose four desirable outputs, timber productions (\({{\varvec{y}}}_{1}\)), carbon sequestration (\({{\varvec{y}}}_{2}\)), water production (\({{\varvec{y}}}_{3}\)) and tree richness (\({{\varvec{y}}}_{4}\)), and one undesirable output, carbon emitted in the case of harvest (\({{\varvec{w}}}_{1}\)). We also considered one input as site productivity (\({{\varvec{x}}}_{1}\)). In addition to plot-specific input and outputs, we have also considered the following five contextual variables ─ age, damage, ownership, precipitation, and temperature ─ which are denoted by z1-z5, respectively. With the exception of the climatic variables, all input, outputs, and contextual variables were obtained from the FIA program. We used historical records of precipitation and temperatures in each forest plot and obtained from the WASSI model (Caldwell et al. 2019). Tables 1 and 2 show the descriptive variable of the dataset set. All units were taken to the hectare level, when applicable.

Table 1 The statistical description of the inputs and outputs data
Table 2 The statistical description of the contextual variables

Results

Environmental efficiency analysis

First, the assessment of environmental efficiency in the US forest plots is initially conducted through the utilization of the conventional DEA model under weak disposability assumption (Model 1). The objective of this evaluation is to promote enhanced levels of desirable output. The outcomes of this approach are depicted in Table 3.

Table 3 The statistical description of the EE and optimal inputs and outputs from model 1

This analysis reveals that approximately 38% (34 out 0f 89) of forest plots are fully determined efficient (See Table 8 in Appendix A). This signifies that these fully efficient DMUs have optimized their desirable outputs while slightly reducing their input activity levels and keeping a constant undesirable output level, thereby attaining high efficiency and productivity (environmental efficiency = 1). Conversely, the remaining DMUs with inefficiency scores greater than one should improve their efficiency by following the improved input and output values determined by the efficient US forest sectors. In details, the statistical description of the projection points corresponding to inputs and outputs showed that the single input (Site productivity) needs to be reduced by 14%. Moreover, the outputs sawtimber and pulpwood productions, carbon sequestration, water production, and tree richness must be increased by 26, 79, 28, and 28%, respectively. However, as we should expect, the level of undesirable outputs, CO2 emissions, remained unchanged. As we can see, the main source of inefficiency is related to carbon sequestration.

Calculating the impact of contextual variables

In the second step of our analysis, we first calculated the Pearson’s correlation test to examine the relationship between the environmental efficiency of the forest plots and the contextual variables employed in this study. Specifically, we paired the logarithm of the environmental efficiency with each of the contextual variables to measure their correlation. The results are listed in Table 4. Prior to accounting for the influence of other contextual variables, we observed a positive correlation between the \(\mathrm{Log }(\varphi )\) (Logarithm of environmental efficiency) and the contextual variables, including age, ownership, precipitation, and temperature. However, it is worth noting that the correlation becomes negative when considering the variable damage. The analysis revealed that the highest correlation value for \(\mathrm{Log }(\varphi )\) is associated with precipitation. On the other hand, the lowest correlation is observed for the variable damage. This implies that while various types of damages (such as insect, disease, human, animal, fire, and weather-related damages) negatively impact plot efficiencies; however, none of the examined contextual variables showed statistically significant effects, according to the analysis conducted.

Table 4 Pearson correlation coefficients

Now, the impact of contextual factors on the efficiency state of a plot is being measured. Our goal is to identify and minimize their impact so that we can calculate more accurate efficiency scores that are specific to each plot. Toward this end, we apply the regression model (2) during second phase and present the corresponding findings in Table 5. As the results show, ownership and temperature are statistically significant. Moreover, the temperature has inverse relationship with environmental efficiency, while ownership has direct relationship with efficiency.

Table 5 Regression results

The findings presented in Table 5 indicate that while there seems to be a correlation between the logarithm of environmental efficiency scores and variables such as age, damage, and precipitation, these relationships are not statistically significance. However, there is a direct significant relationship between ownership and the logarithm of environmental efficiency, indicating a notable association. More specifically, this finding suggests that private ownership has a favorable positive influence on environmental efficiency. Through our observations, we noted a negative significant relationship between the logarithm of environmental efficiency scores and temperature. This discovery indicates that as the temperature increases, there is a minor decline in efficiency levels across various plots. It should be noted that the R square value stands at approximately 0.26, indicating that the regression model encompasses over 26% of the observed data. Finally, after adjusting for contextual variables, we found that the average environmental efficiency of the plots is calculated to be 1.2428.

An inverse DEA analysis

In the last step, we apply our proposed IDEA model (5) on forest plots data. We design two scenarios to reduce the studied undesirable output: we first assume that we are interested in reducing the level of undesirable output (CO2 emissions) by 5%. The optimal values of inputs and desirable outputs are calculated by model (5). The statistical description of the results is given in Table 6.

Table 6 The statistical description of the optimal inputs and outputs

Our results indicated that if we want to reduce the level of undesirable outputs by 5%, we should reduce the levels of sawtimber and pulpwood production, carbon sequestration, water production, and tree richness by 2, 5, 0.3, and 7%, respectively. In this case, the site productivity should be reduced by 21%.

In the second scenario, aiming to reduce undesirable outputs by 10%, the findings from model (5) in Table 7 indicate that the levels of sawtimber and pulpwood, carbon sequestration, water production, and tree richness must be reduced by 3, 8, 0.5, and 8%, respectively. Additionally, site productivity needs to be reduced by 22%.

Table 7 The statistical description of the optimal inputs and outputs

Discussion

This study explored the applicability of DEA-based approaches to estimate environmental efficiency for the US forest plots, incorporating both undesirable output (CO2 emissions) and contextual variables. Toward this end, an output-oriented DEA model was first implemented using weak disposability assumption to calculate plot-wise environmental efficiency scores. The results indicated that only 34 of forest plots were operating at high-efficiency levels while their total average environmental efficiency was quite high (0.75 out of 1) (Table 3 and Appendix A). However, the inherent characteristic of the implemented environmental DEA model results in the consistent preservation of the level of undesirable output (CO2 emissions) for all inefficient forest plots. This is attributable to the implementation of a weak disposability strategy in which the model aims to proportionally adjust both desirable and undesirable output levels simultaneously based on environmental regulations. (Färe et al. 2007; Long et al. 2015). In practice, the process of mitigating undesirable outcomes like CO2 emissions in forest logging activities involves incurring expenses related to proportional reduction or increased output. Consequently, operational costs emerge as a significant factor in this strategy (Palmer et al. 1995; Sueyoshi and Goto 2012). These costs impact the total operation costs or average operation cost by decreasing or increasing them, respectively, owing to reduced CO2 emissions (Zadmirzaei et al. 2023). The outcomes also indicate the necessity of a 79% increase in carbon sequestration to offset the damages caused by CO2 emissions, and there might be some exogenous/contextual factors which can easily impact on the levels of both desirable and undesirable outputs. Hence, the log of environmental efficiency (\(\mathrm{Log }({\varphi }_{o}^{*})\)) was calculated in the second step in order to mitigate the effect of contextual factors on the previous obtained results. The findings of the OLS regression test showed that the logarithm of environmental efficiency exhibited a direct significant relationship with ownership and a negative significant relationship with temperature (Table 5). These findings are in line with some related research in the forest sector (Gutiérrez and Lozano 2022; Amirteimoori et al. 2023). Moreover, the residuals of the Log (\({\varphi }_{o}^{*}\)) refer to the differences between the observed values and the predicted values based on the estimated regression model which provide a good measure of the studied DMUs’ managerial ability (Banker and Park 2020). For instance, the coefficient of 0.0360 for the ownership variable demonstrates the significant and direct impact of private forest ownership on enhancing the environmental efficiency within this particular production system. Indeed, when forest plots are managed by private ownership, the environmental efficiency of forest management practices can vary depending on the individual goals and strategies of the private owners, leading to potential trade-offs between economic profitability and environmental sustainability (Feliciano et al. 2017). The temperature variable, characterized by a negative coefficient of − 0.0060, holds significance in our analysis. This indicates that a unitary increase in temperature is associated with a \(100\left({e}^{-0.006}-1\right)=\%0.6\) decrease in the average environmental efficiency. Therefore, after adjusting these exogenous/contextual factors, the overall average environmental efficiency score significantly increases to 0.8 (on a scale of 1).

Lastly, to reduce/control the studied undesirable output (CO2 emissions), an inverse DEA model was applied. The model incorporates two distinct scenarios, namely a 5% and 10% reduction in CO2 emissions, to effectively analyze and determine optimal strategies. Although both scenarios exhibit a comparable pattern in reducing undesirable outputs, it is worth noting that the reduction of desired outputs and studied input was marginally more satisfactory in the scenario aiming to reduce CO2 emissions to a level of 10% (Tables 6 and 7). In details, to achieve a 10% reduction in undesirable outputs, the research findings indicated that certain measures need to be taken. The study recommends decreasing the levels of sawtimber and pulpwood by 3 and 8%, respectively. Additionally, carbon sequestration should be reduced by 0.5%, while water production and tree richness should be decreased by 8%. Moreover, site productivity needs to be lowered by 22% to meet the desired target. While these practices may seem counterintuitive in terms of generating economic revenues, actions such as tree harvesting and the use of fertilization (as a means to increase site productivity) are expected to elevate carbon emissions. In light of achieving specific environmental standards, forest landowners/managers might consider reducing tree harvesting levels and minimizing the use of fertilizers to lower their carbon footprint. The use of fertilizers can also contribute to increased tree richness, and by managing tree richness, carbon emissions may be reduced. Furthermore, the quantity of water is inversely associated with the level of tree density (number of trees per hectare). Keeping trees unharvested can decrease water quantity but simultaneously reduce carbon emissions. These findings align with the recommended forest management strategies across various silvicultural practices as discussed in a recent study (Ameray et al. 2021). Therefore, the latest research findings provide compelling evidence that IDEA has the capacity to conduct sensitivity analysis, addressing the significant limitations of DEA models. In addition, the IDEA approach offers enhanced flexibility when compared to traditional DEA models. Given its proven effectiveness as an optimization technique, it presents an intriguing opportunity for further exploration and application in various production and service organizations, particularly in the realm of environmental efficiency measurement (Orisaremi et al. 2021). To elaborate further, the utilization of IDEA allows for the examination of how changes in inputs and outputs impact the efficiency scores obtained from the model. This sensitivity analysis is vital for understanding the robustness and reliability of the IDEA results, especially in complex and dynamic environments where nonlinear relationships exist between inputs and outputs, as well as where sustainable management of forest operations involves embracing a broader perspective that integrates carbon emissions, tree biodiversity, and a multitude of other vital parameters. By doing so, we can strive toward a holistic and harmonious approach that supports the ecological, social, and economic dimensions of sustainable forest management (Cooper and MacFarlane (2023); Latterini et al. (2023a and b), and Bowditch et al. (2023)). By incorporating this capability, IDEA addresses a key limitation of conventional DEA methods that often assume linearity, enabling researchers and practitioners to gain deeper insights into the underlying factors affecting efficiency (Emrouznejad 2023).

Concluding remarks

Mechanized forest logging operations have emerged as a significant source of air pollution, contributing to increased emissions of CO2 and other greenhouse gases (GHGs) per unit of timber harvested. This poses challenges for implementing sustainable and economically viable practices, making it complex to strike a balance between efficient timber harvesting and minimizing carbon footprints in the pursuit of environmentally sustainable forest management. Consequently, incorporating considerations for environmental efficiency problems becomes essential as it encompasses enhancing efficiency and minimizing undesirable outputs. This serves as a valuable managerial tool in promoting environmentally sustainable harvesting practices that simultaneously enhance overall productivity and mitigate CO2 emissions. Hence, the primary focus of this research lies in the formulation of DEA-based methodologies to evaluate the environmental efficiency of forest plots in the USA. These methodologies incorporate not only the measurement of undesirable outputs such as CO2 emissions but also account for contextual variables, thereby providing a comprehensive assessment of environmental efficiency. To this end, the study implemented a two-stage DEA model to calculate plot-wise environmental efficiency scores. The average environmental efficiency was high (075), and the logarithm of environmental efficiency was used to account for contextual factors in the second stage. Ownership had a positive relationship with environmental efficiency, while temperature had a negative relationship. Adjusting for these factors increased the overall average environmental efficiency score (0.8). Finally, an Inverse DEA model was used to analyze strategies for controlling CO2 emissions. It was found that reducing undesirable outputs required reducing other inputs and outputs, because naturally, to reduce the level of undesirable outputs under the weak disposability assumption, some influencing factors of production capacity must inevitably be reduced. These recommendations aimed to balance minimizing undesirable outputs with ecological functions. The specific magnitudes of reductions should be determined based on local conditions, ecological considerations, and sustainability goals.

This aspect is particularly noteworthy in cases where certain outputs, once produced, come at the expense of others. For instance, management practices that lead to higher rates of carbon sequestration and timber production, such as increased tree planting or regeneration, may potentially lead to a decrease in water yield. Examples of such practices include afforestation with fast-growing species and moderately intensive mechanical soil preparation. However, it is essential to acknowledge that certain policies in the USA, such as the conservation reserve program (CRP), have sought ways to promote carbon sequestration while also improving water yield. One approach employed by the CRP is the planting of softwood species (USDA 2015). Balancing these trade-offs between various ecosystem services remains a significant challenge for sustainable land management. Policy initiatives like the CRP provide valuable insights into potential solutions to address these trade-offs and advance more sustainable practices that optimize multiple environmental benefits.

In conclusion, the emergence of IDEA as a powerful tool for sensitivity analysis and its flexible nature present exciting opportunities for research and practical applications. Exploring the potential of this approach in various production and service organizations, particularly within the context of environmental efficiency measurement, holds promise for advancing our understanding of efficiency dynamics and informing decision-making processes. Continued investigation into the capabilities and limitations of IDEA will undoubtedly contribute to the advancement of performance evaluation methodologies and enhance organizational sustainability in an ever-changing world.