Study area
The study has been conducted in two areas of Ethiopia: North Wollo in the Amhara region, hereafter labeled Amhara-NW, and Central in Tigray region, hereafter labeled Tigray-C. Both sites are situated in the highland ecosystem, in a range between 2430 and 3240 m above the sea level. Respondents in Amhara lives between 2443 and 3240 m above sea level; their homesteads range between 11° 34′ 12.72″ N and 11° 49′ 59.7″ N of latitude and between 38° 1′ 43.82″ E and 39° 1′ 43.82″ E of longitude. The selected study area in Tigray spans from a latitude of 13° 33′ 33.08″ N to 13° 39′ 57.24″ N, while the longitude is between 39° 6′ 54″ E and 39° 11′ 59.57″ E. Farmers in the area live between 2430 and 2679 m above the sea level.
The regions are in the sub-tropic climatic zone, with one main rainy season (meher) in summer, between June and August, and a second shorter period of occasional rainfalls (belg) from February to May. The study sites are marginal areas in the highland zones of Amhara and Tigray and are representative of low-intensive cereal cultivation regions, especially focused on harvesting wheat, barley, and teff (Kidane et al. 2017). Agriculture and livestock are still the two main sources of living, characterized by low-input soil tillage technology and absence of irrigation; ox-plowing mechanism is ubiquitously used and cropping systems are mainly rainfed; furthermore, animal feeding is based on crop residues. In such remote areas, the complexity of soil fertility management calls for an active role of smallholder farmers (Mowo et al. 2006).
In these study sites, a wide variability in the regional drivers of agricultural systems has resulted in variegated land use and soil management practices (Table 1, Fig. 1).
Table 1 Biophysical, socio-economic, and agronomic description of the two study sites The investigated regions differ by several pedoclimatic aspects: in particular, total annual rainfall, annual mean temperature, and average altitude. The joint combination of these characteristics determines the existence of two different agroecosystems: in Amhara, the cropping system has two growing seasons in the same year, while Tigray presents just one. Regarding socio-economic characteristics, the two locations share religion (Christian Copti) but vary for language spoken: Amharic is spoken in Amhara-NW and Tigrinya in Tigray-C. In both sites, farmers’ livelihood is based on subsistence agriculture and livestock. The low-input rainfed type of agriculture practiced does not allow the cultivation of water-intensive crops; therefore, farmers’ diet is restricted to legumes and cereals. Nonetheless, agrobiodiversity is highly present and each household harvests a range between two and seven crops (among others, predominant are barley, wheat, teff, field pea, and chickpea).
The majority of the households surveyed are male headed, with the head of the household taking the agricultural decisions. In Amhara-NW, on average, a household has 5.3 components (min. 1 and max. 9) and the average SAU (Surface of Arable Unit) is 1.2 ha. In Tigray-C, numbers differ slightly: household size floats around 5.5 members (min. 1 and max. 10), while the average SAU of 0.28 ha is considerably lower than Amhara-NW. In both regions, smallholder farmers tend to settle permanently in the area: 62% household respondents in Amhara-NW declare living in the same homestead for more than 10 years, and of the remaining group, 24% has been living in the same homestead for more than 50 years. In Tigray-C, 60% is the percentage of households permanently living in the village for more than 10 years and 33% is the amount of those living in the same homestead for more than 50 years. Regarding agricultural management practices, virtually all households (98% in Amhara-NW and 97% in Tigray-C) use own labor for mastering the land. Shared labor is a used practice, but it is not endemic: indeed, it is practiced by 27% of the households in Amhara-NW and 34% in Tigray-C.
Among all animals, oxen are the most valuable given the persistent presence of ox-plowing technology in the area. Crop rotation is diffuse in both regions, but often it is merely restricted to cereal crop rotation. Cash crops are absent, with the exception of eucalyptus; this tree is a fast-growing crop, tolerant to low soil fertility and prolonged periods of moisture stress (Jagger and Pender, 2003). In northern Ethiopia, eucalyptus is the most commonly observed tree species and it serves manifold purposes. However, its ecological footprint, especially on the soil, is a source of concern: even though planting trees can return nutrients such as total nitrogen and exchangeable potassium to the soil, one of the most cited criticism associated with eucalyptus is depletion of soil nutrients (Jagger and Pender, 2003). Smallholder farmers involved in the study have often reported issues of soil fertility depletion in connection with eucalyptus plantations. For these reasons, eucalyptus is inserted in the regression analysis relating soil management ability and household’s farming knowledge as control on resource availability at the household level.
Agricultural decisions within the household are usually taken by the household head; participants in both study areas declare to rely extensively on the farming knowledge acquired inside the homestead, suggesting a mechanism of learning from past generations to new entrants in the family. In a complementary fashion, participation in the agricultural communitarian life is a milestone for the households in our study, and it involves attendance in formal meetings (agricultural trainings or seed bank activities) and informal agricultural gatherings. Formal trainings are usually provided according to a participatory extension system, which takes place within a complex bureaucratic structure that often involves top–down approaches. In both regions, agricultural extension workers and formal trainings are the backbone for the actuation of rural development policies; nonetheless, they frequently fail due to a combination of one or more of the following reasons: large number of potential training recipients, geographic dispersion, infrastructure constraints, and diversification of agroecological conditions (Berhane et al. 2018). In a complementary manner, informal gatherings represent an array of institutions traditionally active in both study areas: among them, worth mentioning are idir (insurance institution in case of extreme events), iqub (financial institution of mutual help), debo (labor-sharing institutions among neighbors), and dado (labor-sharing institution during peak harvesting times) (Negera et al. 2019).
Sampling and data acquisition
Twenty-four villages were selected in the two regions, using an approach designed to maximize the representation of the entire territorial extension (with the support of GIS). The resulting coverage includes two counties (or woreda), Amhara-NW and Tigray-C; 12 villages (or kebele) for each county, representing 40% of the villages present in each area; and 12 households in each village, representing 5% of the entire household population living permanently in each village. Combined, the study selected randomly 288 households (144 located per county). The list of villages surveyed in Amhara-NW is the following: Aymat, Agrit, Akat, Taja, Weketa, Timtimat, Gashena, Hamusit Silasie, Workaye, Hana mekuat, Yewotet, Amba Yedogit. In Tigray-C, the 12 villages involved are Hadnet, Adi Kuenti, Resetu, Endamariam Awleo, Guderbo, Hoseya, Melfa, Bowak, Adawro, May sedri, Meda golat, Golat.
Of the randomly chosen 144 households in each region, extension workers were able to locate precisely 141 households in Amhara-NW (97%) and 139 (96%) households in Tigray-C. Each household has been interviewed with an extensive survey, created to allow for quantitative and qualitative data collection; surveys have been performed with tablets running ODK program. The field work was iterated in the months of February and March 2019, before the ploughing season, so as to gain the full attention of the farmers and interfere the least in their work. For the data and soil collection, two teams of 12 enumerators each have been employed, selected, and trained. Enumerators needed to have English knowledge, as well as some basic agronomic experience, in an effort to collect correctly the soil samples and minimize the collector bias effect in the data. For Amhara-NW, the 12 enumerators were actively working at the Sirinka Centre of the Amhara Agricultural Research Institute while in Tigray-C, enumerators were researchers in the Agroecology Department at the Mek’ele University. Focus groups and key informant surveys (among others, with local chiefs and agricultural extension workers) performed during the year 2017 and 2018 are considered preparatory to enrich the narration of the mechanisms described.
Along with the survey data, the team collected one topsoil sample (0–15-cm depth) from each household’s main field, for a total of 280 soil samples in the two regions. GPS points have been recorded both at the household home and at the household main plot, where the soil sample has been taken. Physiochemical analyses have been conducted in Debre Zeit (Ethiopia) for the following 11 parameters: pH-H2O; texture, electric conductibility, available phosphorus, exchangeable potassium, total nitrogen, organic carbon, organic matter, CaCO3, and bulk density.
Data analysis
Analysis of questionnaire data
A household questionnaire was constructed in order to capture socio-economic as well as agronomic characteristics of the households, following the standard for survey development set by the Living Standard Measurement Survey of the World Bank and the Abdul Latif Jameel Poverty Action Lab. The survey contains questions about demographic characteristics of the farmer, agronomic practices, microclimatic managerial choices, farming beliefs, and knowledge. In the latter group, a subset of 5 questions out of 10 was meant to capture first-order belief (FOB) about farming knowledge of the household’s head (i.e., the decision-maker when it comes to agricultural affairs). The goal was to screen three different dimensions of farming knowledge previously identified by the literature: (i) accumulation of farming knowledge inside the household, (ii) acquisition of farming knowledge among peers and between households, and lastly (iii) farming knowledge obtained in formal trainings and education. The target was to capture the effects of each single dimension, in order to provide a comprehensive perspective on farming knowledge dynamics.
Following a theory-driven approach based on previously identified literature (as cited in Section 1), we formalized household’s farming knowledge as follows: (i) home learning, grouping those questions referring to farming practices learnt inside the household; (ii) social learning, gathering impressions on the mechanisms of social imitation in the village; and (iii) education, iconizing general knowledge acquired in school. Each of these three dimensions has been represented by a set of questions in the survey:
-
(i)
Home learning
-
(a)
q.1 “It has happened in my household to try new crops, new combinations and/or new techniques related to farming which were unknown to my friends and neighbors” (T/F answer)
-
(b)
q.2 “I would introduce/abandon a new farming technique if a member of my family suggests so”. (T/F answer)
-
(ii)
Social learning
-
(a)
q.1 “I am open to farming advices coming from outside my household” (T/F answer)
-
(b)
q.2 “I would introduce/abandon a new crop if a member of my village suggests so”. (T/F answer)
-
(iii)
Education
-
(a)
q.1 “What’s your education level?”.
The key variables measuring the home and social learning dimensions were merged utilizing a weighted sum: being two dummies, the final value of local and social learning assumed value 0 (low) if neither of the two dummies were positive, assumed value 1 (middle) if only one of the two was positive, and assumed value 2 (high) if both of them were positive. For the dimension of formal education, only the declaration of school attainment has been included in the analysis; this is represented by a categorical variable between 1 (illiterate) and 4 (attended more than 10 years of schooling), with 5 being literate but with no formal education. The set of knowledge-related questions was analyzed through a principal component analysis (PCA), so as to explore if the theory-based construction would find evidences in a data-driven approach; other information contained in the questionnaires (e.g., off-farm income, age, and gender) were used as controls in the subsequent statistical analysis.
Statistical analysis
The statistical analysis was conducted in two steps: firstly, we computed a new measure of the soil managerial capacity of farmers, called farmer’s soil management ability; secondly, the three knowledge dimensions previously identified were regressed against the soil management ability of each household, utilizing three different models.
Computation of the soil management ability at household level
The soil management ability (SMA) is calculated discretizing for the content of total nitrogen (TN), available phosphorus (P), and exchangeable potassium (K) detected from the soil analysis of each farmer’s main field, after controlling for region and for pedological characteristics of the terrain: i.e., soil texture, organic matter content (OM), pH, altitude, total carbonate (TC), electric conductibility (EC), cations exchange capacity (CEC), and carbon–nitrogen ratio (CN).
The content of TN, P, and K can suggest among the smallholder farmers in the villages those capable of managing the soil fertility more effectively (Öborn et al. 2005); this is especially true in subsistence, low-input rainfed farming systems as those analyzed in this work. Indeed, the information that these three macronutrients can convey are multiple: (i) TN, P, and K are considered major nutrient elements and are essential for plants and animals (Öborn et al. 2005); (ii) they are responsible for crucial physiological processes of the crops, such as photosynthesis, stimulation of early growth, transportation of water, and drought resilience (Tripathi et al. 2014); (iii) P availability, in particular, is extensively limited, and given the current rate of depletion of available phosphorus reserves, this nutrient will become a major limiting crop yield factor by 2050 (Balemi and Negisho 2012); (iv) they have been identified by the several sub-Saharan agricultural agencies (ATA 2013) as the main deficiencies in most agroecosystems of the continent.
Data were aggregated into a stepwise regression model with bidirectional elimination, where TN, P, and K constituted the three dependent variables. The pedological parameters previously listed were utilized as predictors, following the FAO Global Soil Partnership, which fosters the exploration of links between socio-economic and environmental variables utilizing regressions in the context of the Digital Soil Mapping Initiative (Meeting in Teheran, January 2018) (Lombardo et al. 2018).
In this basic model, each of the three outcome variables was regressed against a common set of pedological predictors. They were OM, pH–H2O, altitude, presence of clay, sum between the presence of clay and the presence of silt (inserted for investing the compaction level of the soil, which affects nutrients presence by decreasing the rate of decomposition of soil organic matter and subsequent release of nutrients (USDA-NRCS 1996)), CaCo3, EC, CEC, and CN.
The full model specification at the farmer level is the following:
$$ T{N}_{ik}\sim \alpha +{\beta}_{ijk}{X}_{ijk}+{\varepsilon}_i $$
$$ {P}_{ik}\sim \alpha +{\beta}_{ijk}{X}_{ijk}+{\varepsilon}_i $$
$$ {K}_{ik}\sim \alpha +{\beta}_{ijk}{X}_{ijk}+{\varepsilon}_i $$
With:
βijk = coefficient attached to each j regressors, in each region k, for each farmer i
Xijk = common set of pedological invariant regressors j for each farmer i in each region k
k = study regions (Amhara-NW and Tigray-C)
The residual is the portion of TN, P, and K not predicted by soil long-term conditions. Therefore, for each farmer i in each region k, the predicted value suggests what is the lower (or upper) bound value of that outcome variable (TN, P, or K), given the pedological conditions. The difference between this bound and the actual value measured by the laboratory analysis represents the residual, and it captures the management ability of each farmer i to master the considered macronutrient in the main field’s soil. In order to compute a unique farmer’s soil management ability score generated by the ability of mastering each macronutrient, we estimated the quantile position for each residual in the specific macronutrient distribution. To every quantile, an increasing value (from 1 for the lower 0–25% to 4 for the highest 75–100%) was assigned; given the fact that all three macronutrients matter for soil quality, the sum of the quantile ordering values was calculated.
Therefore, the ability of each farmer was framed into a value which goes from a minimum of 3 (if all the three parameters belong to the lower quantile) up to a maximum of 12 (if all the three parameters belong to the highest quantile). This value was called soil management ability of the farmer and it is the crucial outcome variable for the socio-economic analysis.
COM–Poisson model for regressing household farming knowledge on soil management ability
Isolating the value of the soil management ability for each household, the possible effects of the knowledge’s dimensions were investigated. Utilizing as outcome the soil management ability variable led to the use of the Poisson family of count data model; however, soil management ability data are under-dispersed (and by construction not inflated with zeros, since the minimum value equals 3); this was confirmed by the calculation of the index of dispersion (Selby 1965) for both regions, which was equal to 0.63 for Amhara-NW and 0.55 for Tigray-C. In the aim of performing an effective but clean and simple econometric exercise, we have opted for utilizing a COM (Conway–Maxwell)–Poisson approach (Winkelmann and Zimmermann 1994). The econometric exercise was performed in R software: the package utilized is {COMPoissonReg}. We included controls for resources possessed by the household (levels of on-farm income source converted in kcal/ha and prot/ha, presence of off-farm income activities and woodlot) and for characteristics of the household head (age and gender). If additional controls were inserted (household type, labor type, sex ratio, dependency ratio, farm size, amount of tropical livestock units, market distance), regressions’ results did not vary.
Subsequently, smallholder farmers belonging to the top (75–100%) and bottom (0–25%) quartile of the soil management ability distribution were isolated and introduced in a new logistic regression model as dependent variables. Households in the top quartile of the soil management ability distribution were defined as high soil management ability farmers (HSF), while households in the bottom quartile of the soil management ability distribution were defined as low soil management ability farmers (LSF). Therefore, the full logit model specification at farmer level was the following, for high and low soil farmers, respectively:
$$ {HSF}_{ik}\sim \alpha +{\beta}_{ik}X{1}_{ik}+{\gamma}_{ik}X{2}_{ik}+{\delta}_{ik}X{3}_{ik}+{\mu}_{ijk}{Z}_{ijk}+{\varepsilon}_i $$
$$ {LSF}_{ik}\sim \alpha +{\beta}_{ik}X{1}_{ik}+{\gamma}_{ik}X{2}_{ik}+{\delta}_{ik}X{3}_{ik}+{\mu}_{ijk}{Z}_{ijk}+{\varepsilon}_i $$
With:
βik = coefficient for home leaning, in each region k, for each farmer i
X1, X2, X3ik = categorical variable for home learning, social learning, and education respectively, for each farmer i in region k
μijk = coefficient attached to each j controls, in each region k, for each farmer i
Zijk = common set of resource and farmer’s level controls j for each farmer i in each region k
k = study regions (Amhara-NW and Tigray-C)
The econometric exercise was performed in R software: the package utilized is {miceadds}. Errors were clustered at the village level and controls inserted were the same as the COM–Poisson regression model.
OLS regression of household’s farming knowledge on soil fertility macronutrients
Lastly, using a simple OLS (ordinary least square) model, we tested whether the relation identified between household farming knowledge and soil management ability held for raw values of soil macronutrients as well. This regression highlighted an absolute measure of the knowledge effect on soil fertility of the main investigated fields; however, this measure was affected by locus-specific physio-chemical factors, which could encompass elements of soil macronutrients’ availability not driven by gradients in households’ farming knowledge. The development of a soil management ability indicator cleaned this issue, being a relative, rather than an absolute, measure of the contribution of farmers’ management ability on managing soil macronutrients’ content.
The full model specification at farmer level is the following:
$$ T{N}_{ik}\sim \alpha +{\beta}_{ik}X{1}_{ik}+{\gamma}_{ik}X{2}_{ik}+{\delta}_{ik}X{3}_{ik}+{\mu}_{ijk}{Z}_{ijk}+{\varepsilon}_i $$
$$ {P}_{ik}\sim \alpha +{\beta}_{ik}X{1}_{ik}+{\gamma}_{ik}X{2}_{ik}+{\delta}_{ik}X{3}_{ik}+{\mu}_{ijk}{Z}_{ijk}+{\varepsilon}_i $$
$$ {K}_{ik}\sim \alpha +{\beta}_{ik}X{1}_{ik}+{\gamma}_{ik}X{2}_{ik}+{\delta}_{ik}X{3}_{ik}+{\mu}_{ijk}{Z}_{ijk}+{\varepsilon}_i $$
With:
βik = coefficient for home leaning, in each region k, for each farmer i
X1, X2, X3ik = categorical variable for home learning, social learning, and education respectively, for each farmer i in region k
μijk = coefficient attached to each j controls, in each region k, for each farmer i
Zijk = common set of resource and farmer’s level controls j for each farmer i in each region k
k = study regions (Amhara-NW and Tigray-C)
The regression which involved the raw values of the three macronutrients is represented by an OLS model, performed in R with the package {olsrr}. Controls were inserted for soil texture, OM, and altitude, as the main pedological and environmental influencing factors (Hamiache et al. 2012). Organic matter content played a peculiar role: since many tropical soils are poor in inorganic nutrients, they relied on the recycling of nutrients from soil organic matter to maintain fertility; in this perspective, tests of soil nutrients (such as TN, P, and K) might generate unreliable results (Tiessen et al. 1997).