1 Introduction

Agriculture remains the foundation of the Ethiopian economy which makes up more than 32.5% of the country’s GDP in 2020–2021 [1] and employs nearly 77.2% of the population, even though it is giving way to the industry and manufacturing sectors. A total of 65.1% of Ethiopian agricultural GDP is derived from crop production [1], and cereal crops are the main source of caloric food for people’s daily food consumption [2]. However, this sector in Ethiopia is highly subsistence due to its reliance on rain, low marketing facilities, poor infrastructural development, insufficient farm input, and climate change-induced natural disasters [3, 4]. Wheat is classified as a cereal crop, so its production contributes significantly to the country’s economic development [5]. Ethiopia has enormous wheat production potential, which makes it the world’s 18th and Africa’s 2nd largest wheat producer. It accounts for approximately 15.31% of the Ethiopian total grain farmland (1.87 million hectares) and 17.71% of the total grain production (58.08 million quintals) [6] in the 2021/2022 main production season. The demand for wheat in Ethiopia is rapidly expanding due to population growth and urbanization, which have shifted dietary preferences toward wheat-based convenience foods [7].

For a country such as Ethiopia to ensure food security and meet the nutritional needs of its rapidly expanding population, increasing wheat production efficiency is essential. In light of this, Ethiopia’s wheat sector is currently undergoing a massive revolution, intended to swiftly expand wheat output, significantly close the supply–demand imbalance, and achieve wheat self-sufficiency as a national goal [8]. However, Ethiopia’s wheat productivity in 2022–2023 was 3 tons/ha, which is below both the global average (3.55 tons/ha) and that of Egypt (6.4 tons/ha) [9] due to biotic and abiotic constraints and technical, socioeconomic, and institutional factors. This research was undertaken in the Arsi zone, Ethiopia’s largest wheat producer. Wheat production in the Arsi zone accounts for 11.03% of the total wheat land (approximately 0.209 million hectares) and 12.45% of the total wheat production (approximately 7.2 million quintals) of the country [6].

Ethiopian wheat production is predominantly carried out by subsistence farmers, which results in low productivity and inefficient wheat production [5, 10, 11], mostly associated with problems of inadequate supply and high costs of critical farm inputs [8, 12]. It is critical to improve Ethiopia’s wheat production efficiency to address the issues caused by an expanding population, climate change, and shifting demands. Understanding the current level of wheat production efficiency will result in better resource management and optimization that eventually increases the productivity and profitability of wheat production, which contributes to overall agricultural development and helps to achieve the national objective of ending hunger and poverty.

Numerous studies on the production efficiency of wheat in Ethiopia confirm the existing wheat production inefficiency, even though there is an opportunity to boost output from the current level of input and production technologies. However, the majority of these studies [11, 13, 14] focused on estimating technical efficiency, with only a few looking at all efficiency estimates simultaneously. Furthermore, Ethiopian wheat production is undergoing a tremendous revolution [8], and wheat production is dynamic similar to that in other agricultural sectors, and fluctuates with existing production technology and climate unpredictability. Therefore, recurrent studies on the production efficiency of wheat are needed. Accordingly, this study was undertaken in the Arsi zone of the Oromia regional state of Ethiopia to assess wheat farmers’ technical efficiency (TE), allocative efficiency (AE), and economic efficiency (EE) and to identify their drivers.

2 Research methodologies

2.1 Description of the study area

The Arsi zone is located in the Oromia region of Ethiopia in the southeastern part of the country. It has a total area of 19,825.22 km2 and is divided into 26 districts [6]. According to the Arsi Zone Office of Agriculture (2024), the rainy season begins in June and peaks in July and August, while the Belg season runs from February to April. The zone receives a yearly average rainfall of 1020 mm and a temperature that ranges between 20 and 25 °C. The zone has an altitude that ranges from 500 to 4377 m above sea level (m.a.s.l.) with four agroecological zones, namely, extreme highlands (above 3200 m.a.s.l.), the highlands (2300–3200 m.a.s.l.), midlands (1500–2300 m.a.s.l.), and the lowlands (below 1500 m.a.s.l.), which cover 6%, 34%, 40%, and 20%, respectively.

The population was officially estimated to be 3,980,967 people in mid-2022, with 49.92% males and 50.07% females [15]. The total population is composed of 88.14% rural residents, 11.59% urban residents, and 0.27% pastoralists. Wheat, barley, maize, tef, sorghum, and coffee are among the most important crops grown in the region. Wheat is the most important crop grown by 360,697 holders, accounting for 43.7% of the zone’s total cereal production from its 39.8% zonally cultivated lands. The study sites are the Digeluna–Tijo, Dodota, and Ludehetosa districts, which are indicated in Fig. 1 below.

Fig. 1
figure 1

Map of the study area

2.2 Data source and sampling techniques

This study was based on cross-sectional data collected by using a semi structured questionnaire and interviews with farm households that were selected randomly using a multistage sampling technique. In the 1st stage, the Arsi zone was purposively selected due to its wheat production potential. In the second stage, accessible major wheat-producing districts in the zone were identified for each agroecological zone with the help of key informants who knew the study area well. Then, the highland districts of Digeluna–Tijo, the lowland district of Dodota, and the midland district of Ludehetosa were chosen randomly by the lottery method. In the 3rd stage, the total number of kebelesFootnote 1 in each district that corresponded to the district’s agroecological class was listed. Then, based on the proportion of kebeles in each district, three kebeles from Ludehetosa, three from Digeluna-Tijo, and two from Dodota were randomly selected. Then, from the kebele-level administrative office, a list of households that produced wheat during the main agricultural production season of 2022/2023 was obtained. Finally, a total of 422 wheat producers were selected and distributed for each district (Table 1) based on the proportion of their total wheat producers using a simple random sampling technique.

Table 1 Distribution of respondents among sampled districts and kebeles

2.3 Method of data analysis

2.3.1 Estimation of wheat farmer efficiencies

The two methods employed to evaluate smallholder farm efficiency levels are the parametric stochastic frontier model (SFM) and nonparametric data envelope analysis (DEA). The parametric SFM is based on an econometric approach with a specific functional form, while the nonparametric DEA relies on mathematical techniques [16]. To estimate efficiency, both techniques use distinct approaches that consider random noise and flexibility in the structure of the production technology. SFM typically has only one output, whereas DEA can include multiple outputs from a producer [17].

Due to their intrinsic differences, the parametric SFM and nonparametric DEA approaches have been debated. Studies on measuring efficiency indicate that researchers can use either approach because there is little difference in the estimated results [18, 19]. According to [20], the SFM forecasts firm-specific inefficiencies and random errors that cause deviations from the frontier. Parametric SFM is still a commonly used approach for measuring farmers’ production efficiency, despite its limitations in explicitly assuming a functional form for production technology and inefficiency term distribution. Since the SFM accounts for stochastic noise and enables hypothesis testing about production technology structure, the existing level of inefficiency, and overall model performance, it is regarded as a more accurate method for measuring efficiency. The objective of the study, data type, and functional forms used to represent the technology of production determine the model used in efficiency analysis. When there is a significant level of disturbance in the data due to measurement error, SFM is better than DEA for calculating the total factor productivity growth rate [16].

In this study, the SFM, which captures statistical noise that the decision-making unit cannot control, such as errors in measurement and changes in the environment, was used to estimate the coefficients of the production/cost function and the levels of wheat farmers’ efficiencies (TE, AE, and EE) [20]. The functional forms that are most frequently utilized in SFM are translog and Cobb‒Douglas (CD). To select the best-fit functional form for the data at hand, null hypotheses were tested using the generalized likelihood ratio (GLR), and CD was selected, as discussed in Sect. 3.2, which assumes unitary substitution elasticity, continuous production elasticity, and constant factor demand [20]. Using the CD over translog production function for smallholder farmers is endorsed, as the changes in returns to scale are unlikely to have a significant impact on production technology [21].

Accordingly, the CD-Stochastic Frontier Production Function (CD-SFPF) is specified as:

$$lny_{i} = \beta_{0} + \sum_{j = 1}^{6} \beta_{i} lnX_{ij} + v_{i} + u_{i}$$
(1)

where ln represents the natural logarithm, \({y}_{i}\) is the wheat yield in quintals; \({X}_{1}\) is the land under wheat (hectares); \({X}_{2}\) is the labor (family/hired) (man/day); \({X}_{3}\) is the chemical fertilizer (kg); \({X}_{4}\) represents the seed (kg); \({X}_{5}\) is the oxen power (oxen-day); \({X}_{6}\) is the chemical (pesticide and herbicide) (liters); \({\beta }_{0}\) is the intercept; \({\beta }_{i}\) are the coefficients of the production function; and \({v}_{i}\) is a random error that is assumed to be independently and identically distributed N (0, \({\upsigma }^{2}\)). \({u}_{i}\) is an error term that is a one-sided nonnegative variable that indicates the household’s technical inefficiency, which reflects how far the observed output falls below the potential yield from a given level of technology and inputs.

By assuming that the production function in Eq. (1) is self-dual CD, the respective cost function of the CD production function can be stated as:

$$ln{TC}_{i}={\alpha }_{0}+{\alpha }_{1}{lnP}_{land}+{\alpha }_{2}{lnP}_{labor}+{\alpha }_{3}{lnP}_{seed}+{\alpha }_{4}{lnP}_{fertilizer}+{\alpha }_{5}{lnP}_{chemical}+{\alpha }_{6}{lnP}_{oxen}+{\alpha }_{7}{lnY}^{*}+({v}_{i}+{u}_{i})$$
(2)

where \({TC}_{i}\) is the minimum cost of wheat production for the \({i}{\text{th}}\) household; \({Y}_{i}^{*}\) is the respective wheat yield adjusted for noise; P represents the prices of inputs; and \({\alpha }_{\text{s}}\) are their coefficients. Then, the input price and adjusted level of the output were substituted by using Shephard’s Lemma technique into the subsequent input demand equations, and the economically efficient input vector of the \({i}{\text{th}}\) farmer, \({X}_{ie}\), can be expressed as

$$\frac{\partial {lnTC}_{i}}{\partial {P}_{n}}={X}_{ie}({P}_{i},{Y}_{i}^{*}, \alpha )$$
(3)

where n = 6 indicates the number of inputs used for wheat production.

The explained cost measure enables us to determine the level of AE and then EE, which can explain the technical efficiency of each wheat farmer in terms of observed output (\({Y}_{i}\)) and the corresponding frontier output (\({Y}_{i}^{*}\)) using the existing production technology [22].

$${TE}_{i}=\frac{{P}^{\prime}{X}_{it}}{{P}^{\prime}{X}_{i}}=\frac{{Y}_{i}}{{Y}^{*}}$$
(4)

Then, the economic efficiency of each farmer is defined as the ratio of the minimum observed total production cost (TC*) to the actual total production cost (TC).

$${EE}_{i}=\frac{{P}^{\prime}{X}_{ie}}{{P}^{\prime}{X}_{i}}=\frac{{TC}^{*}}{TC}$$
(5)

Following [23], the AE index can be derived from Eqs. (4) and (5) as indicated below:

$${AE}_{i}=\frac{{P}^{\prime}{X}_{ie}}{{P}^{\prime}{X}_{it}}=\frac{{EE}_{i}}{{TE}_{i}}$$
(6)

2.3.2 Determinants of wheat farmers’ efficiency

There are two methods for analyzing the source of efficiency of SFPF [24]. The 1st technique is a one-stage procedure in which the technical efficiency level and its driver are estimated simultaneously. The 2nd technique is a two-step procedure where in the 1st stage, the efficiency scores are computed; then, in the 2nd stage, the resulting efficiency scores are regressed using either the OLS method or Tobit regression. In this study, a two-stage estimation procedure with two-limit Tobit regression was employed to identify the determinants of efficiency since it allows the estimation of AE and EE in addition to TE [10, 25,26,27], unlike the one-stage approach, which permits the estimation of only the determinants of TE. Understanding the factors that influence wheat farmers’ efficiency is an important segment of research that aims at tackling the problems encountered by farmers, promoting sustainable agriculture, and increasing the agricultural sector’s competitiveness and resilience.

Accordingly, the two-limit Tobit model is formulated by following [28] as:

$${y}_{i}^{*}={\beta }_{0}+{\beta }_{j}{X}_{ij}+{\mu }_{i}$$
(7)

where i refers to the \({i}{\text{th}}\) farmer and \({y}_{i}^{*}\) is the latent variable that indicates the level of TE, AE, or EE of the \({i}{\text{th}}\) farmer. \(\beta_{0}\) is the intercept while \(\beta_{j}\) are coefficients and \({\mu }_{i}\) is a random error term that is independently and normally distributed. \({X}_{ij}\) are variables that affect the TE, AE, and EE of wheat producers and are presented in Table 2 below.

Table 2 Hypothesized variables with expected effects on wheat production efficiency.

After denoting the observed dependent variable as \({y}_{i}\), the Tobit model can be written as

$${y}_{i}=\left\{\begin{array}{*{20}{ll}} 1& if \; {y}_{i}^{*}\ge 1\\ {y}_{i}^{*}& if \; 0 < {y}_{i}^{*} < 1\\ 0& if \; {y}_{i}^{*}\le 0\end{array}\right.$$
(8)

where \({y}_{i}^{*}\) represents the efficiency score of the \({i}{\text{th}}\) household. The likelihood function of the Tobit model is specified as:

$$L\left(\beta ,\frac{\sigma }{{y}_{j}}, {X}_{j}, {L}_{1j}, {L}_{2j}\right)=\prod_{y={L}_{1j}}\omega \left(\frac{{L}_{1j}-{\beta }^{\prime}{X}_{j}}{\sigma }\right)\prod_{{y}_{j}={y}_{j}^{*}}\frac{1}{\sigma }\varnothing \left(\frac{{y}_{j}-{\beta }^{\prime}{X}_{j}}{\sigma }\right)\prod_{{y}_{j}={L}_{2j}}1-\omega \left(\frac{{L}_{2j}-{\beta }^{\prime}{X}_{j}}{\sigma }\right)$$
(9)

where \({L}_{1j}\) and \({L}_{2j}\) are the lower and upper bounds, respectively, while \(\omega\)(.) = the cumulative normal distribution, φ (.) = the normal density function.

As indicated by [29, 30], the overall marginal effect (ME) of the Tobit model in Eq. (9) is divided into the following three MEs.

  1. i.

    The unconditional expected value of wheat farmers’ efficiencies

    $$\frac{\vartheta E({y}^{*})}{\vartheta {X}_{j}}=[({Z}_{U})-({Z}_{L})]\frac{\vartheta E({y}^{*})}{\vartheta {X}_{j}}+\frac{\partial [({Z}_{U})-({Z}_{L})]}{\vartheta {X}_{j}}+\frac{\partial (1-({Z}_{U}))}{\vartheta {X}_{j}}$$
    (10)
  2. ii.

    The expected value of wheat farmers’ efficiencies, which is conditional upon being between the limits

    $$\frac{\vartheta E({y}^{*})}{\vartheta {X}_{j}}={\beta }_{m}\left[1+\frac{({Z}_{L}({Z}_{L}))-({Z}_{U}({Z}_{U}))}{(({Z}_{L})-{(Z}_{U}))}\right]-\left[\frac{{(\omega ({Z}_{L}))-(\omega ({Z}_{U}))}^{2}}{{(({Z}_{L})-{(Z}_{U}))}^{2}}\right]$$
    (11)
  3. iii.

    The probability of being between the mean value (A) and upper limit (U)

    $$\frac{\vartheta [({Z}_{U})-{(Z}_{A})]}{\vartheta {X}_{j}}=\frac{{\beta }_{m}}{\sigma }\left[\omega ({Z}_{A})-{(Z}_{U})\right]$$
    (12)

    where \({Z}_{L/A}=\frac{{-\beta }^{\prime}X}{\sigma }\) and \({Z}_{U}=\frac{(1-\beta X)}{\sigma }\) are standardized variables that come from the likelihood function given the limits of \({y}^{*}\), and \(\sigma\) is the standard deviation of the model.

3 Results and discussion

3.1 Descriptive statistics

This study comprised 88.9% male-headed households of the 422 randomly selected wheat-producing farmers, with an average age of 46.22 years (Table 3). The mean household size in the study area is 4.88 persons (4.38 adult equivalent), which is consistent with the national average family size (4.6 person) [35], with an average education level of 7.01 years. The mean livestock holdings were 6.47 TLU, while the average land holdings were 2.45 hectares, which is greater than the national average livestock holdings of 2.7 TLU [36] and land holdings of 1.1 hectares [37]. Nearly 59.4% of the respondents used mechanization for wheat production, while 31.3% utilized credit. The mean distance from the main market was 7.08 km, and the average frequency of contact with the extension agent was 4.44 days per year. Nearly 66% of the respondents participated in off/nonfarm activities. The farmers produced 40.12 quintals of wheat, using, on average, 188.1 kg of seed, 13.79 man-days of labor, 8.97 oxen days, 236.14 kg of fertilizer, and 3.76 litters of agrochemicals from an average of 1.41 hectares of wheat land.

Table 3 Descriptive summary of variables.

3.2 Efficiency of wheat producers

Before estimating the SFM, it is important to test the three-model specification-related hypotheses. The first test involved selecting either the CD or translog functional form, and a GLR test was performed. The results indicated that the coefficients of the translog production function were almost zero (\({{\varvec{H}}}_{{\varvec{o}}}\):\({{\varvec{\beta}}}_{{\varvec{i}}{\varvec{j}}}\) = 0), which confirmed that the CD functional form best fit the data at hand (\({{\varvec{H}}}_{1}\):\({{\varvec{\beta}}}_{{\varvec{i}}{\varvec{j}}}\) ≠ 0) (Table 4).

Table 4 GLR test of hypotheses for the stochastic production function.

Then, the null hypothesis argues that the inefficiency component of the total error term is equal to zero (\({H}_{o}:\gamma\) = 0), and an alternative hypothesis (\({H}_{1}:\gamma\) ≠ 0) was tested by calculating the GLR. The findings demonstrate that the null hypothesis that state wheat farmers in the study area are 100% efficient is not accepted. Another hypothesis to be tested was that all coefficients of the inefficiency model are simultaneously equal to zero (i.e., \({H}_{o}\):\({\delta }_{1}\) = \({\delta }_{2}\) = \({\delta }_{3} . . . . . {\delta }_{10}\) = 0) against the alternative hypothesis, and a GLR test was performed. The findings confirmed the rejection of the null hypothesis, favoring the alternative hypothesis, which argues that the explanatory variables that affect wheat farmers’ efficiency levels are not all equal to zero (Table 4).

The maximum likelihood of CD-SFPF was estimated after testing the above hypothesis. Here, the input variables were land under wheat (in hectares), the labor force in man-equivalent for wheat production (in man/days), oxen (oxen-days), the quantity of seed (kg), chemicals (herbicides and pesticides in liters), and fertilizer (both NPS and urea in kg), where the output variable was wheat yield (quintals) in the 2022/23 production year. For the estimate of the CD-SFPF, the model specification result is significant (wald \({chi}^{2}\) = 3056.26, prob > \({chi}^{2}\) = 0.0000), suggesting that at the 1% level of statistical error, the null hypothesis that states all slope coefficients are zero is rejected. Furthermore, sigma squared (\({\upsigma }^{2}\)), which is a diagnostic measure of the inefficiency component that confirms the quality of model estimation and the validity of the distributional form assumed for the composite error term, was statistically significant. The Gamma coefficient (γ = \(\sigma_u^2\)/\({\upsigma }^{2}\)) is 0.766, indicating that technical inefficiency accounts for 76.6% of the wheat yield disparity at the frontier, while 23.4% is attributed to factors outside farmers’ control (Table 5).

Table 5 Maximum likelihood estimates of the wheat production function.

Table 5 presents the maximum likelihood parameter estimates of the CD-SFPF, and the results confirm that all inputs except labor were found to positively and significantly affect wheat output at a 1% level of statistical error. The model output indicated that a 1% increase in land, seed, fertilizer, chemical, and oxen days resulted in increases in wheat yields of 0.477, 0.088, 0.311, 0.130, and 0.003%, respectively, while all other factors remained constant.

The coefficients of inputs indicate the elasticity of output, which is summed to be 1.022, but this value does not guarantee the existing return to scale unless the hypothesis is tested. Accordingly, the hypothesis that farmers follow a constant return-to-scale was tested, which states that the sum of parameter coefficients is equal to 1 (\({H}_{o}\):\({\beta }_{1}+{\beta }_{2}+{\beta }_{3}+{\beta }_{4}+{\beta }_{5}+{\beta }_{6}\) = 1) against the alternative hypothesis (\({H}_{1}\):\({\beta }_{1}+{\beta }_{2}+{\beta }_{3}+{\beta }_{4}+{\beta }_{5}+{\beta }_{6}\) ≠ 1). The test results confirm the acceptance of the null hypothesis, indicating the presence of constant returns to scale among wheat farmers in the research area. This means that a 1% increase in all inputs would boost the overall wheat yield by approximately 1%.

The associated dual cost frontier function parameters are subsequently determined using the CD production function parameter listed below:

$${\text{lnTC}}_{{\text{i}}} = {1}.{949} + 0.{118}\left( {{\text{lnP}}_{{{\text{Labor}}}} } \right) + 0.{355}\left( {{\text{lnP}}_{{{\text{Land}}}} } \right) + 0.{187}\left( {{\text{lnP}}_{{{\text{Seed}}}} } \right) + 0.0{92}\left( {{\text{lnP}}_{{{\text{Chemical}}}} } \right) + 0.{2}00\left( {{\text{lnP}}_{{{\text{Fertilizer}}}} } \right) + \, 0.00{5 }\left( {{\text{lnP}}_{{{\text{Oxen}}}} } \right) + \, 0.0{17 }\left( {{\text{lnY}}^{*}} \right)$$

where P stands for the price of inputs which is measured in BirrFootnote 2 PLand is the rental value of land per hectare for one year (Ludehetosa = 10,000 Birr, Digeluna Tijo = 15,000 Birr, and Dodota = 8000 Birr on average), and Poxen is the price of paired oxen power that is estimated at 400 Birr/day. PLabor is the wage rate of labor estimated at 200 Birr/day, PFertilizer is the price of chemical fertilizer per kg estimated at Birr (urea = 36 Birr/kg and NPS = 42 Birr/kg), PSeed is the price of seed (improved = 55 Birr/kg and local = 40 Birr/kg), and PChemical is estimated at 1800 Birr/liter.

3.3 Wheat production efficiency scores and distributions

Table 6 summarizes the SFM outputs and indicates that the mean TE, AE, and EE of the wheat farmers were approximately 80.8%, 88.1%, and 71.3%, respectively. Of the total number of respondents, approximately 41.47%, 36.26%, and 42.18% had TE, AE, and EE scores below the average level, respectively. The highest mean TE (83.8%) and AE (87.2%) were recorded in the Ludehetosa District, while the lowest mean TE (74.9%) and AE (88.3%) were recorded in Dodota. On the other hand, the highest mean EE (73.3%) was recorded in Digeluna Tijo District, where the lowest was recorded again in Dodota (66.5%). The disparity in wheat efficiency among districts could be attributed to restricted water availability and erratic or insufficient rainfall in lowland (Dodota) areas compared to those in midland (Ludehetosa) and highland areas (Digeluna Tijo), as wheat requires appropriate moisture for proper cultivation.

Table 6 Distribution of TE, AE, and EE.

The mean TE was 80.8%, with the lowest value of 31.8% and the highest value of 96.4%. This means that if farmers can acquire essential managerial and technical skills, on average, they can increase their output by 19.2% (100–80.8). This suggests that if the average wheat farmer maintains the same TE level as its most efficient farmer, it could reduce 16.18% (100–80.8/96.4) of the inputs required to produce the maximum possible output.

The average AE was 88.1%, with the lowest value of 44.2% and the highest value of 99.1%, suggesting that wheat farmers may save 11.9% (100–88.1) of their present input costs if resources are used efficiently. This demonstrates that wheat farmers with an average AE score may reduce production costs by 11.09% (100–88.1/99.1), which is necessary to achieve the level of the most allocatively efficient wheat farmers by cost-effectively reallocating resources.

The mean EE score is 71.3%, with the lowest value of 31.0% and the highest value of 94.1%. This implies that wheat producers with average EE can reduce the existing average costs by 29.4% (100–71.3) to obtain the lowest possible production cost while maintaining the same level of wheat yield. The results also suggested that wheat farmers with an average EE would reduce their average costs of production by 24.22% (100–71.3/94.1) to achieve the most economically efficient production.

3.4 Determinants of wheat farmers’ efficiency

In this study, to identify the factors that affect wheat farmers’ TE, AE, and EE, a two-limit Tobit model was estimated. Before estimation, diagnostic tests were carried out, and the findings verified that there were no serious econometric problems with the data at hand. Tobit regression was used (Table 7), and the results revealed that livestock ownership, household head age, education level, engagement in off/nonfarm activity, and contact with agricultural extension agents had a positive and significant effect on TE, whereas total farm size and distance from the main market had a negative impact. Similarly, wheat producers’ AE increased considerably with family size, frequency of extension contacts, and wheat mechanization, and decreased with distance from the main market. It has also been specified that education level, off/nonfarm activity engagement, extension contact, and wheat mechanization have a positive and substantial effect on EE, while total farm size and distance from the main market have a negative impact.

Table 7 Summary of two-limit Tobit regression results.

The model output indicates that farmers’ age, which is a good predictor of agricultural experience, enhances the TE of wheat producers at a 1% level of statistical error. The ME estimation indicates that a 1-year increase in the age of the household head results in a 0.10%, 0.10%, and 0.50% increase in the level of TE, the expected value of TE, and the likelihood of a household scoring above the mean TE, respectively. This result is in line with the findings of [10, 13], who identified the age of households as a key determinant of agricultural production efficiency. This may be because as the household head’s age increases, farm knowledge accumulated through experience also increases. This helps to make better agricultural decisions and enhances resource management practices in wheat production through better on-farm risk mitigation strategies, which improves wheat production efficiency.

The education level of the household head was found to have a positive and significant effect on the TE and EE of wheat farmers at the 10% level of statistical error. As education level increases by 1 year of schooling, the level, expected value, and likelihood of scoring above the mean value of TE increase by 0.20%, 0.19%, and 0.98%, respectively, while the overall level of EE, the expected value of EE, and the probability of scoring above the mean value of EE increase by 0.24%, 0.23%, and 0.98%, respectively. The same result was also reported by [10, 13, 14], who confirmed the positive role of education in enhancing agricultural production efficiency. This could be because well-educated farmers have greater access to information and can easily comprehend farm instructions since education increases farmers’ consciousness over the need to acquire, analyze, and apply knowledge that increases wheat production efficiency.

Livestock holding in TLU was among the positively significant variables that determined the TE of wheat farmers at a 5% level of statistical error. This implies that households with higher livestock ownership are expected to have better wheat TE. As livestock holdings increased by 1 TLU, the overall level of TE increased by 0.32%, while the expected value of TE increased by 0.29%, and the probability of a household scoring above the mean TE increased by 1.53%. The positive role of holding livestock was expected since it is a good proxy for the wealth of farm households. A similar result was reported by [11, 25], as livestock provide additional income, and traction power for crop cultivation, which is expected to enhance wheat production efficiency.

The model results also confirm that farmers’ residence distance from the major market was inversely correlated with all efficiency estimates (TE, AE, and EE) at the 1% level of statistical error. This implies that as farmers get closer to the main market, from where they purchase farm inputs and sell their products, they will be more efficient than other farmers. This result is consistent with the findings of [10, 25, 32], who reveal a negative role for farm residence distances from the main market. This may be because, as farmers are closer to the market, they are more likely to experience lower expenses associated with marketing transactions.

The coefficient of the total farm size was negative and significantly affected both TE and EE at a 5% level of statistical error. This indicates that as total land holdings increase, the level of wheat farmers’ TE and EE decreases. A study by [13, 25] also confirmed the negative association between landholding and production efficiency, even if the land is an important economic variable that enhances household income by increasing production volume. However, farm size is inversely related to the efficiency level of particular crops since it challenges farm management practices and resource distribution, and smaller landholdings tend to have better management practices and ease the administration of production inputs, which can result in a more effective wheat production system.

The model results confirm that farmers’ contact was positively associated with TE, AE, and EE at the 10%, 5%, and 5% levels of statistical error, respectively. This implies that wheat producers with a greater frequency of contact with agricultural extensions are expected to have better wheat production efficiency. A study by [10, 11] also confirmed the positive role of agricultural extension contact on the efficiency of agricultural production. This may be because agricultural extension centers are the primary source of agricultural information that helps farmers become more informed and proficient, which enhances their farming methods and productivity. In addition, extension agents facilitate and promote the adoption of improved agricultural practices that contribute to the efficiency of agricultural production.

Family size was found to have a positive and significant effect on wheat farmers’ AE at the 1% level of statistical error. This suggests that households with larger families are more likely to have better AE than others. As family size increased by one adult equivalent, the overall level of AE, the predicted value of AE, and the likelihood of scoring above the mean of AE increased by 0.56%, 0.45%, and 1.72%, respectively. This outcome was expected given that family size is a significant source of family labor, which is consistent with previous research [11, 25]. As labor availability increases, agricultural activities that require an intensive labor force will perform well, and the adoption of agricultural technology will also be enhanced [38, 39], which is expected to increase the production efficiency of wheat farmers.

Wheat mechanization is another important factor that positively and significantly affects both AE and EE at the 1% level of statistical error. This implies that households that utilize machinery for wheat production are expected to have greater wheat production efficiency than nonusers. The findings of [33, 34] also support the positive role of mechanization in agricultural development through enhancing production and productivity. This may be because mechanizing wheat production improves farmers’ efficiency by lowering labor costs, shortening the length of plowing and harvesting, and reducing postharvest losses.

Household head participation in off/nonfarm activities was also found to significantly and positively affect both TE and EE at the 1% level of statistical error. This suggests that households engaged in off/nonfarm activities are predicted to have greater TE and AE levels than their counterparts (those who have not participated). The same result was reported by [10, 25], which may be because off/nonfarm activity increases household income and secures diversified income sources that reduce risks. This contributes to the purchase of farm inputs such as improved seeds and fertilizer and enhances the adoption of improved agricultural technologies that increase production efficiencies.

4 Conclusion and policy implications

As Ethiopian wheat production is undergoing a revolution and agricultural production is dynamic, understanding the current level of wheat production efficiency and knowing its determinants are important. Accordingly, the SFM and a two-limit Tobit model were employed to identify wheat farmers’ efficiencies and their drivers, respectively. The results indicated that the average TE, AE, and EE values were 80.8%, 88.1%, and 71.3%, respectively, indicating that wheat farmers are working below the maximum attainable efficiency levels. These data imply that there is space for improving the TE, AE, and EE of wheat farmers in the Arsi zone at given levels of inputs and production technologies. The two-limit Tobit model results show that age, education, livestock holdings, contact with agricultural extension agents, family size, wheat mechanization, and non/off-farm participation enhance the efficiency of wheat producers, while the distance from the main market and total land holdings reduce their efficiency.

Finally, to improve wheat farmers’ efficiency and attain the national objective of wheat self-sufficiency, it is critical to encourage policies and strategies that increase access to education through skilled educators and an appropriate learning environment. It is also essential to promote livestock production, which may be achieved by encouraging consistent animal feed production and providing efficient veterinarian services. Furthermore, it is necessary to boost wheat mechanization through subsidies and financial assistance. In addition, to enhance wheat production efficiency, market development and an environment that encourages farmers’ diverse income streams must be promoted through improved infrastructure and credit provisions.