Background

Potato (Solanum tuberosum) is a major crop in tropical highland regions of Sub-Saharan Africa, where it is grown both as a horticultural crop (due to its high value) and as a food security crop. Globally, it ranks fourth after maize, rice, and wheat [1]. In Kenya, it is the second most important food crop after maize [2, 3]. It plays an important role as a food staple among producing households and also contributes to poverty alleviation through income generation [4]. An estimated 800,000 farmers grow potato in Kenya [5], while over 2.5 million Kenyans are employed along the potato value chain, either directly or indirectly [6]. As in the rest of the tropical highlands of Africa, potato production in Kenya is dominated by smallholder farmers.

Despite its importance, potato production continues to be undermined by lack of certified seed potato (CSP). The lack of CSP in turn leads to recycling of seed, causing the buildup of pests and diseases among smallholder farms [4, 6]. Consequently, the average potato yields in tropical highlands of Africa fluctuate between 8 and 10 tons per hectare, compared to 40 tons per hectare obtained in countries in North America and Western Europe [2, 7]. Low yields, on the other hand, result in low incomes and contribute to food insecurity and poverty among small farm households, and confine them in subsistence agriculture [8, 9].

Until recently, seed production in Kenya was largely in the hands of the public sector. Under this system, the government was responsible for production of seed. However, lack of capacity to increase production in tandem with an increase in demand led to shortage in supply. Further, production of seed was limited to two areas (i.e., Tigoni and Molo)—both of which are quite far from most farmers in the Mount Kenya region. The bulky nature of the seed further made it very costly to transport seed to the producing areas. The low supply and high cost of transportation lead to increase in price, making seed unaffordable to farmers.

Recent joint efforts by some of the public and private sectors to increase the supply of seed hence reverse the low yields, led to the development of a rapid seed multiplication technique, commonly known as “3G” technique. The technique produces CSP in three generations of field multiplication, as opposed to the conventional 5–7 generations [10]. The seed produced is free from most potato pests and diseases. Kenya, Tanzania, Ethiopia, Malawi, and Uganda are among countries that have adopted this technique. To improve access to CSP by smallholder farmers, a private sector partner establishes a seed multiplication farm within the potato producing areas. In Kenya, for instance, Kisima Farm Ltd (KFL), the private sector partner, established a farm in Nanyuki, close to leading producing potato areas of Meru and Nyandarua counties. The farm sells its CSP to smallholder farmers within the radius of about 30 km and has retail outlets in other locations that give smallholder farmers around such outlets easier access to CSP. It multiplies and sells four CSP potato varieties, namely Kenya Mpya, Sherekea, Asante, and Tigoni. The growing of CSP is usually promoted with the goal of reducing yield gap and improving the welfare of farm households by increasing yields, food supply and hence food security. Improvements in household food security status follow from the fact that potato is a staple among producers [4].

This study uses data collected from a random sample of 408 potato farmers in central highland region of Kenya to assess the effect of using CSP on small farm households. It specifically focused on yield, household food security, and input usage as the outcome variables. The rest of this paper is organized as follows: “Methods” section presents the conceptual as well as the analytical framework and the hypotheses tested. “Results and discussion” section presents the results, while “Summary conclusions and implications” section concludes and provides some policy implications of the findings.

Methods

Estimation of the effects of using certified seed potato on productivity and food security

An individual’s or household’s decision to adopt or use a new technology is affected by the net benefits of doing so [9, 11]. Following [12] and [9], this study modeled the effect of using CSP on farm households as a linear function of explanatory variables (X i ) and dummy variable (R i) representing the use of CSP. The linear regression model for assessing the effect of CSP can be specified as;

$$ Y = \beta X_{i} + AR_{i} + \mu_{i} $$
(1)

where Y is the mean outcome variable (yield, input use, and household food insecurity) being affected by the use of CSP, R i  = 1 if a farmer planted CSP, 0 otherwise, and \( \mu_{i} \) the error term.

Whether farmers use CSP or not is dependent on the characteristics of the farmers and farms, as well as asset endowments, among other factors. Assuming a risk-neutralFootnote 1 farmer, the index function to estimate adoption of CSP is expressed as:

$$ R_{i}^{*} = \gamma X_{i} + e_{i} $$
(2)

where \( R_{i}^{*} \) ia latent variable denoting the difference between utility from using CSP \( U_{iK} \) and the utility from using other kinds of seed (\( U_{iN} \)). The farmer will thus use CSP if:

$$ R_{i}^{*} = U_{iK} - U_{iN} > 0. $$
(3)

The term \( \gamma X_{i} \) in Eq. 2 provides an estimate of the difference in utility from using CSP (\( U_{iK} - U_{iN} \)), with household and farm-level characteristics, as explanatory variables, while \( e_{i} \) is an error term. Close inspection of Eqs. 1 and 2 will, however, show that X and Y are interrelated and that there is the presence of selection bias. Selection bias occurs if some unobservable factors influence both the error terms (\( \mu \)) and (e) of the outcome equation and the technology choice equation, respectively, thus resulting in correlation of the two error terms. Hence, estimating Eq. 1 using the ordinary least squares regression technique leads to biased estimates.

To control for the self-selection bias, this study used propensity score matching technique. This technique matches the two groups (i.e., users and non-users of CSP), so as to create a plausible counterfactual which will then address the problem of selection bias. Specifically, it matches a treated individual (i.e., the user) with a control individual (i.e., the non-user) that is similar in all observable characteristics except for the treatment, and computes the difference in outcome variable for the two matched individuals. That difference is the effect of treatment (i.e., the use of CSP), measured in terms of average treatment effect on the treated individuals.

The estimation procedure in PSM proceeds as follows: First, propensity scores (defined as the probability of being in a treatment group given the observable characteristics) are obtained by estimating a binary logit or probit regression model. Second, the matching algorithm is selected based on the data at hand after undertaking matching quality tests. The common matching algorithms are the nearest neighbor matching (NNM), radius matching (RM), and the kernel-based matching (KBM). A discussion of the differences among these matching strategies and how each works can be found in [12]. Third, the control (i.e., non-users of CSP) are matched with the treatment (users of CSP) using the selected matching algorithms. Fourth, tests to assess whether the assumption of common support is satisfied are conducted. Fifth, the treatment effect is estimated in the region of common support based on the matching estimator selected. Finally, sensitivity analysis is undertaken to check whether the influence of an unmeasured variable on the selection process is so strong as to undermine the treatment effect. This is achieved by using the Rosenbaum bounds (rbounds) test. A detailed discussion of the theory and implementation of the matching procedures is provided by [12, 14, 15].

As part of the first step of implementing the PSM, we estimated a binary response model. The functional form of the empirical model estimated was:

$$ useCSP = f(lnage,\;gender,\;occupation,\;education,\;hhdsize,\;lndistagric,\;lndistkisima,\;credit,\;lnland,\;lnasset,\;phone,\;memgroup + \varepsilon $$
(4)

The model was estimated using probit regression. The descriptions of the variables in the empirical model are presented in Table 1. In the empirical model, some variables have been transformed into their natural logarithms. These are age (lnage); distance to agriculture office (lndistagric); distance from the farmer’s home to Kisima Farm (lndistkisima); size of land owned by farmer (lnland); and value of physical asset (lnasset).

Table 1 Variables used in the models and their hypothesized effects on the use of CSP

The second step involved the estimation of the effect using the CSP by comparing the users and non-users on input use (fungicide, insecticide and fertilizer use, potato yield, and food insecurity). The outcome variables used are as defined in Table 1. Food insecurity was measured using the household food insecurity access prevalence (HFIAP) score.

Before comparing the CSP and non-CSP users using the propensity score matching technique, an independent sample t test was done to test whether the characteristics were different between the two groups of farmers. Other variables that were used to compare the CSP users and non-CSP users are defined in Table 2.

Table 2 Other variables used to compare CSP users and non- users, in terms of asset ownership, potato production, and utilization

Sampling procedure and data

The data used in this study were collected from 6 districts of the Mt Kenya region, namely Buuri, Igembe Central, Igembe South, Laikipia East, Meru Central, and Tigania East due to their proximity to the CSP source, i.e., Kisima Farm Ltd. Figure 1 shows the map of Kenya, highlighting the Mt Kenya region where the study was conducted. In each of the districts, the study focused on the villages with smallholder potato farmers who had used CSP from Kisima Farm during any of the four seasons of two years preceding the study (i.e., 2012 and 2013).

Fig. 1
figure 1

Map of Kenya showing the study area

The study respondents were selected as follows: First, a list of all the villages with farmers who planted CSP from Kisima Farm was obtained and the villages that had fewer than 12 CSP purchasers were dropped.Footnote 2 This procedure resulted in the selection of 21 villages in Buuri, one in Igembe Central, one in Igembe South, 2 in Laikipia East, 5 in Meru Central, and 4 in Tigania East. A total of 34 villages were therefore selected from the 6 study districts. Second, for each village, a list of all the farmers who had planted CSP from Kisima Farm Ltd during the 2 years prior to the study was drawn with the help of local administrators (village heads) and contact farmers. A second list of potato growers who used other types of seed potato was also drawn. Third, 12 respondents were selected from in each village, with the respondents drawn from of the two each list using probability proportionate to size sampling technique. That is, more farmers were sampled from the list with more names, and vice versa. This procedure resulted in the selection of 408 farmers: 167 CSP users and 241 non-users.

Eight trained enumerators collected the data from the selected farmers through personal interviews using a pre-tested questionnaire. The interviews ran from March to May 2014. The data were entered in SPSS and analyzed using Stata.

Results and discussion

Characterization of study respondents

Table 3 presents the demographic characteristics of users and non-users of CSP. The mean age of all the respondents was 49 years. On average, the respondents had eight years of education, implying that majority had only attained primary level of schooling.

Table 3 Demographic and households’ facility access statistics

Users of CSP had significantly higher average level of education than the non-users. This finding is in line with the adoption literature which suggests that education increases the likelihood of uptake of new technologies [16]. Users and non-users of CSP also differed in terms of ownership of mobile phones (an important communication tool/asset). However, the dependency ratios did not differ among the users and non-users of CSP. Results further showed that the study respondents had, on average, grown potato for 19 years and therefore had considerable amount of experience in potato farming. There was, however, no difference in potato growing experience between purchasers and non-purchasers and also between male and female farmers.

Table 4 presents the results of the analysis of the respondents’ endowments with different types of assets likely to influence capacity to invest in the costly technology such the purchase of CSP. The assets are in US dollar values. The table shows that the mean value of total physical and livestock assets owned by CSP users was USD1851 and USD1783, respectively, while for non-users was USD 738 and USD 1239, respectively. Thus, CSP users were better endowed with livestock capital assets than their counterparts, as shown by the results of t-tests of differences in means. There was, however, a wide variation in the value of these assets, as indicated by the large standard deviations.

Table 4 Household ownership of various farm assets, by farmer category

Results also show that users of CSP owned more land than their counterparts. The differences in land ownership are also statistically significant. These findings indicate that there are systematic differences between users and non-users of CSP. In other words, the users and non-users of CSP were not drawn from the same population. Consequently, estimation of the impact/effect of CSP needs to take this difference into account—hence the use of propensity score matching technique.

Effect of using CSP seed on small farm households

Table 5 presents the results of the statistical tests of difference in the amount (in value terms) of common inputs used in potato production. It shows that users of CSP differ from their counterparts in their usage of some of the key agricultural inputs. Specifically, the results show that the amount of fertilizer and insecticides used by CSP users was significantly higher. Users of CSP spent USD 52/ha and USD4/ha more on fertilizers and insecticides, respectively, than their counterparts. There was, however, no a statistically significant difference in the amounts of fungicides applied by users and non-users of CSP. Fertilizer is often used for soil amendments and is an important input in increasing crop productivity in many African countries [17, 18]. However, use of fertilizer has been associated with non-point source pollution; hence, an increase in its use can degrade the environment [19, 20]. Results show that there was no significant difference in the value of manure used by the two groups of farmers.

Table 5 Test of differences in input use, in terms of ,value of inputs (USD), by users and non-users of CSP

The use of better CSP and increased use of yield-enhancing inputs are expected to influence potato yields among adopters; hence, the differences in yield between users and non-users of CSP were examined. Table 6 presents the results of the analysis of differences in the sale of both ware and seed potato and incomes. It shows that there is a statistically significant difference in all the three outcome variables (i.e., yield, sales, and income) between users and non-users of CSP. Specifically, users of CSP produced more potato per hectare of land, and hence, sold more, thus earning more income from sales than their counterparts.

Table 6 Potato production, utilization and incomes, comparison between users and non-users of Kisima CSP

The results of analysis of the effect of using CSP on household food security (based on HFIAP) between users and non-users of CSP are presented in Fig. 2. The HFIAP is computed following [21] and is a snapshot measure of the degree of food insecurity in a household based on a 30-day period. The figure shows that the higher proportion of non-users experienced food insecurity problems in the 30 days preceding this study. Specifically, more than 80% of non-users of CSP were either mildly or moderately food insecure as compared to about 59% of their counterparts. A Chi-square test of differences in the prevalence of food insecurity between users and non-users of CSP yielded a p value of 0.000, indicating that the observed differences are indeed statistically significant.

Fig. 2
figure 2

Prevalence of food insecurity in the study households: results of HFIAP analysis, % of responses. Chi-square p value = 0.000

The above section has demonstrated that there is some evidence that the use of CSP is associated with increases in the amount of pesticides used, potato yields, and household food security status. Does this evidence persist under the use of the more rigorous econometric techniques? To address this question, we used propensity score matching (PSM) technique to test whether purchase of CSP increases pesticide use, potato yield, and household food security status.

The results of the Probit regression model, estimated as the first step of PSM estimation method, are shown in Table 7. They show that households located further from Kisima Farm are less likely to purchase CSP, probably due to the bulky nature of the seed and hence the high transport costs. On the other hand, a higher value of assets and land size owned prior to the purchase of CSP increase the likelihood of adopting CSP. The likelihood of purchasing CSP is also affected by membership to a farmer group, perhaps because such groups organize joint purchase to reduce transport and other transaction costs. The results further show a very strong effect of asset ownership on the likelihood of adopting CSP. This indicates that better off households are more likely to use CSP and could be related to the high cost of seed, and the accompanying costs of transporting seed from source to the farm.

Table 7 Results of the probit regression model estimated to generate propensity scores for propensity score matching exercise

Table 8 presents the measures of impact (i.e., the average treatment effect on the treated—ATT) of using CSP, computed using three matching approaches (algorithms) namely NNM, RM and KBM to check the robustness of the findings. Results show that the use of CSP has a statistically significant increase in potato yield. Specifically, on average, the use of CSP increases productivity by 2975–9521 kg per hectare. Thus, these results corroborate those of descriptive analysis that showed that users of CSP had higher yields than their counterparts. Results also show that using CSP reduces household food insecurity. The HFIAP score, the measure of household food insecurity, fell between 2.2 and 2.5. The increase in yield and decline in food insecurity are consistent and statistically significant across all the three algorithms, indicating that these findings are robust. Table 8 further shows that the use of CSP increases the use of insecticides, albeit modestly, but has no significant effect on the quantity of fungicides applied. The value of insecticides used by purchasers of CSP increased by a modest amount of about USD 6/hectare.

Table 8 Results of propensity score matching analysis, for determining the effect of use of CSP

In order to check the sensitivity of the results to the unobserved variables, the Rosenbaum (rbounds) tests were conducted (Table 9). The test sensitivity of the results relating to the value of fungicides used is not included, since Rosenbaum bounds (rbounds) are meaningful only for treatment effects that are significantly different from zero [15]. The levels of gamma in the three algorithms are slightly higher for the NNM on pesticide use and household food insecurity. These results imply, for example, for the NNM, that the odds of the unobserved covariates would have to increase by about twofold before they can negate the estimated impact. The results of the sensitivity analysis therefore indicate that the finding that use of CSP increases yield is very robust. They further suggest that no ordinary changes in the unobserved variables can change the results considerably.

Table 9 Results of sensitivity analysis for the hidden bias in the propensity score matching model

Summary conclusions and implications

This study examined the effects of using CSP on farm households and on the use of yield-enhancing inputs with the potential to degrade the environment. Results of descriptive analysis indicate that use of CSP is associated with increased yields, sales, and hence income. In addition, users of CSP were less food insecure than their counterparts. These gains are, however, accompanied by a modest increase in the use of inputs, notably pesticides and fertilizer.

The results of the survey probit regression model estimated as the first step of implementing PSM indicated that farmer endowment with physical capital assets and group membership, distance to the source of CSP, and land ownership influence the likelihood of using CSP. Distance to source of CSP reduces the likelihood of its use, suggesting the significance of transaction costs in farmers’ decision to use of CSP, but may also reflect the bulky nature of the seed potato.

The findings of the PSM estimation are in line with those of descriptive analysis. They confirm that users of CSP obtained higher yields, pesticide usage, and household food security. Users of CSP specifically used a slightly higher amount of insecticides than their counterparts. They also slightly increased their usage of fertilizers.

This study therefore concludes that the use of CSP has positive welfare effects on small farm households in terms of higher productivity (yield) and hence incomes. However, these gains can lead to increased use of pesticides as farmers defend their investment in certified seed. This study therefore concludes that the use of CSP has positive welfare effects on small farm households in terms of higher productivity (yield) and hence incomes. However, these gains can lead to increased use of pesticides as farmers defend their investment in certified seed. The findings of this study have at least three implications. First, they imply that use of certified seed indeed has welfare benefits to smallholder households. Such benefits are in the form of increased productivity and, subsequently, increased income for those that participate in the market through sales. Hence, the findings imply the need to promote farmer use of quality/clean seed. Second, the finding that the use of certified seed could encourage the use of pesticides implies the need to train farmers on the safe use of pesticides in order to avoid potential negative effects exposure to such chemicals. Third, the strong correlation between asset ownership and access to CSP implies that access by poorer smallholder farmers may be constrained. This can, however, be overcome by organizing such farmers together as a group and sourcing seed jointly, thus enabling them to reduce per unit costs and benefiting through economies (due to discounts or lower per unit transportation costs).

This study has a few limitations. First, the study was conducted among a relatively small sample of potato growers and targeted only one region of Kenya, the Mt Kenya region. Thus, the findings are rather context specific. Future studies need to examine the effect of use of certified seed among a larger sample and in a broader context. Second, the study was conducted during a year that had fewer rains than usual. It is expected that the poor rains resulted in poor yields that would have been attained during normal year. Thus, the productivity and sales estimates presented in this study may have underestimated the actual performance under normal weather.