1 Introduction

There has been increasing interest in replicating the Asian Green Revolution in sub-Saharan Africa (SSA), where stagnant agricultural productivity has long been an impediment to economic growth and food security (Evenson and Gollin 2003; Dawson et al. 2016; Bachewe et al. 2018). Emerging studies have identified lowland rice as the most promising staple crop among major cereals in the SSA, because of the high transferability of improved technology from Asia (Otsuka and Larson 2013, 2016). While the application of modern seeds and increased use of chemical fertilizer have been emphasized, Asian-type yield-enhancing rice-growing technologies also include improved management practices, such as bunding, leveling, and straight-row transplanting.Footnote 1 Since these management practices are knowledge-intensive and require deep understanding and careful execution to fully exploit yield potentials, the dissemination of those technologies may be difficult without credible and persuasive sources of information. How to reduce information friction in the diffusion process remains an important area of research.

Agricultural extension services are designed to play such a role by delivering advanced knowledge from lab and experimental fields to farmers. Yet, directly training millions of small farmers may be prohibitively costly. An alternative method is farmer-to-farmer training, in which selected progressive (contact) farmers participate in training organized by extension workers and then are encouraged to share skills learned with others in their network. While this contact farmer approach is common in many developing countries, evidence of its efficacy is mixed. For example, Feder et al. (2004), Tripp et al. (2005), and Kondylis et al. (2017) found that there is limited impact from the performance of contact farmers who have adopted new agricultural technologies on other farmers. On the other hand, Krishnan and Patnam (2014) and Nakano et al. (2018a, b) show that contact farmers increasingly adopt new technologies immediately after training, and demonstrated improvements spill over to other farmers later through farmer-to-farmer training.

An apparent empirical challenge to the examination of the effectiveness of the farmer-to-farmer extension system is selection bias, which might be one of the potential reasons for mixed empirical results: since progressive, contact farmers tend to be more motivated than ordinary farmers, they would perform better even without training. If this is the case, simply mimicking contact farmers’ observed practices may not necessarily improve fellow farmers’ performance because there are differences in innate characteristics between the contact farmers and fellow farmers. Thus, it is of great importance to determine whether new technologies taught in the training period per se would have the intended positive impact. Rigorously implementing an impact evaluation of this kind, however, involves another empirical challenge in the presence of knowledge spillovers, because non-trained fellow farmers can indirectly benefit by imitation and learning from their trained peers, attenuating the true impact of training. This is a violation of the stable unit treatment value assumption (SUTVA) in social science, according to which the observation of one unit should not be affected by the treatment of other units.

To execute a rigorous impact evaluation of both extension services and the subsequent social learning in a unified framework, we implemented sequential field experiments related to rice production management training for this study, in collaboration with rice-growing farmers in Cote d’Ivoire. To mitigate selection bias, we conducted a randomized controlled trial (RCT), where a subset of randomly selected farmers from each sample site received training in the form of a short course in rice production management. To fulfill the SUTVA, farmers from the treated group were initially asked not to transmit information about their training to the control farmers and the latter were requested to refrain from asking treated farmers for agricultural advice. After one year of observation, we examined whether training had the intended positive impacts for trained farmers, such as the adoption of recommended agronomic practices, improved rice yields, and improved profit. Once the positive impacts were confirmed, we relaxed the restriction and started promoting spillovers by encouraging farmers to exchange information without any monetary incentive given to the trained farmers.

We conducted household surveys three times: at the baseline before the training (January 2015 to May 2015), at the midline one year after the start of training (March 2016 to May 2016), and at the endline two years after the end of the training (March 2017 to May 2017). Using these three-year panel data, we identified the evolution of both the intention-to-treat (ITT) effect and the treatment-on-the-treated (TOT) effect of the training. Furthermore, to explore the dynamics of information exchange between treated and control farmers in the first and second year of observation, we conducted a detailed analysis of information flows and the existence of peer effects using social network data collected in the mid- and endline surveys.

Our main findings are summarized as follows. We found that while the adoption rates of most improved rice management practices were unexpectedly high even at the baseline, the treated farmers were more likely to adopt improved practices by a year after training (i.e., the midline), such as transplanting in rows, canal/drainage construction, and field leveling after training. Higher adoption rates of those recommended agronomic practices lead to improved rice yield as well as increased income per hectare among treated farmers at the midline. Once all farmers were encouraged to exchange information later, the productivity gaps between treated and control farmers narrowed sharply by two years after training (i.e., the endline). It may seem reasonable to interpret this convergence as the sign of short-lived impacts of training where trained farmers dis-adopt new practices and return to traditional ones. However, we observed that trained farmers continued to adopt improved agronomic practices at the endline, and control farmers followed them. Our detailed network analysis based on a dyadic regression further revealed that information flows from treated to control farmers were less active than between control farmers (the reference group) at the midline but become more active at the endline. These results together suggest that farmers largely followed our guidance not to exchange information on rice production within the initial experimental phase, which mitigates estimation bias, if any, in our impact evaluation of training. Yet, once such a restriction is abolished and information exchange is encouraged, control farmers can successfully catch up with treated farmers through social learning. These results imply the importance of not only extension services to trigger the adoption of improved agricultural technologies, but also of social learning for their wider diffusion.

This study contributes to the literature on the role of social learning in the diffusion of agricultural technology (Foster and Rosenzweig 1995; Munshi 2004; Bandiera and Rasul 2006; Conley and Udry 2010). Most previous studies agree with the high potential of social learning, but there is little consensus as to the relative effectiveness of direct training by extension workers and learning from peer farmers. Moreover, a growing number of studies focus on determining who should be targeted to increase the initial adoption rate and to facilitate social learning (Beaman and Dillon 2018; BenYishay and Mobarak 2018; Macours 2019; Shikuku 2019). Our study demonstrates that information dissemination by farmers can be as effective at improving practices as the initial training provided by extension services, which is in line with the study of Nakano et al. (2018a, b). Unlike their study, however, our results show that the entry points to disseminate information are not necessarily the progressive, contact farmers, but can be the ordinary farmers who compose the majority of the rural community. Moreover, unlike BenYishay and Mobarak (2018), who revealed the importance of financial incentives for trained farmers to promote technology diffusion, our results also suggest that social learning can be facilitated by encouragement without any incentive.

The rest of this chapter is organized as follows. Section 3.2 explains the study setting, sampling framework, and experiment design, and examines the summary statistics of our sample. Section 3.3 explains our estimation strategy on the dynamic impact of training and discusses estimation results. Section 3.4 conducts a detailed analysis of the information network and explains the estimation results. Section 3.5 discusses the potentials and limitations of our study while referring to external validity and ethical concerns, and Sect. 3.6 concludes this chapter.

2 Survey and Experimental Design

2.1 The Study Setting

The study took place in the Bellier and Gbeke regions of Cote d’Ivoire, near the capital city of Yamoussoukro. Like other West African countries, rice is one of the major staple foods in this country and its consumption has exceeded domestic production (Fig. 3.1). The government has tried to increase rice yields to sustain food security and save foreign exchange reserves.

Fig. 3.1
A line graph of rice commercial process versus years with 3 plots. The plots of production, consumption, and import increased drastically after 2010.

Rice Production, Consumption, and Imports, Cote d’Ivoire (unit 1000 tons). Note The authors’ calculation from FAOSTAT data. Rice consumption is measured by apparent consumption, that is production plus imports minus exports

The adoption rate of modern inputs, such as improved seeds and chemical fertilizer, is higher in Cote d’Ivoire than in other rice-growing countries in the SSA. However, several recommended agronomic practices, including straight-row transplanting, that have been proven to boost rice yield in tropical Asia as well as other SSA countries, have not been adopted widely (David and Otsuka 1994; Otsuka and Larson 2013, 2016). There is thus room for management training to improve the performance of rice production.

The two regions, Bellier and Gbeke, were selected to improve domestic rice production and increase the quantity of marketed rice under a bilateral official development assistance (ODA) program between the Ivoirian and Japanese governments. Japanese technical experts were dispatched from 2014 to 2018 through an ODA scheme organized by the Japan International Cooperation Agency (JICA). There are a total of 107 production sites suitable for rice production within those two regions, which are all located in the lowlands. Some production sites have sufficient access to irrigation water and are able to cultivate rice twice in a good year. Others are in low-humidity zones (called bas-fonds) and are dependent on rainfall. The main rice cultivation season is roughly from July to December. If irrigated, a second cycle starts around January/February. When water is insufficient, farmers produce other crops, such as yams and peanuts, or leave the paddy field to fallow. Since these two regions are agro-climatically more favorable for rice production than other areas of the country, farmers have received various rice cultivation training packages provided by international donors, including JICA, World Bank, and AfricaRice (formerly known as WARDA [West Africa Rice Development Association], whose headquarters is located within Cote d’Ivoire), as well as local extension agencies, including the Agence Nationale d’Appui au Développement Rural (ANADER).

Out of 107 production sites, two were initially selected for the JICA project in 2014. Thereafter, the target area was expanded every year until 2018 to cover a total of 26 sites. This study relies on the data from the eight production sites selected in 2015. To choose our study sites, we closely collaborated with technical experts. Admittedly, the selection of study sites was not completely random because technical experts have a target to cover 1,500 hectares of land within the five-year project period. Thus, the study sites are relatively larger in operational size than the remaining sites in the Bellier and Gbeke regions. Since the impact of training may potentially vary by agro-ecological and institutional conditions, we classified all potential production sites in terms of access to irrigation and the existence of prior rice training. We selected two sites from each combination of with and without irrigation and prior training, generating a sample of eight production sites in total.Footnote 2

2.2 Sampling Structure and Experimental Design

Prior to the experiment, we had meetings with farmers belonging to agricultural cooperatives at each selected site.Footnote 3 The objective of the meeting was to explain our implementation plan and obtain consent from farmers (see Fig. 3.2 for the timeline, data type, and sample size at each implementation period).

Fig. 3.2
A timeline of different practices in rice cultivation related data management ranging from years 2014 to 2017.

Timeline of Implementation. Note HH refers to the household survey, while NT refers to the network survey

Based on an agreement with technical experts, we outlined our plan to the farmers as follows: (1) We would like to conduct a social experiment to assess the impact of training and ask farmers to cooperate with us; (2) We would group farmers randomly into two, with one eligible to receiving the training offered by JICA experts while the other was expected to apply the best management practice they had access to; (3) All farmers including control farmers were to be provided with the necessary inputs, such as improved seeds, chemical fertilizer, and herbicide, on creditFootnote 4; (4) The experimental phase was to last one year, during which farmers belonging to different groups would be expected not to exchange information about techniques and management practices taught in the training. Specifically, we requested treated farmers not to transmit such information and asked control farmers to refrain from asking treated farmers for agricultural advice; (5) Before and after the experiment, we would conduct household surveys for impact evaluation; (6) If farmers followed our guidance and treated farmers did not transmit information on rice production management, we would acquire precise and valuable knowledge regarding the effectiveness of the technological package taught in the training in their settings; (7) After the impact assessment, we would share which technology (i.e., conventional practice vs. one taught in training) was found to be superior; and (8) After the experimental phase, farmers would be encouraged to share information to facilitate knowledge diffusion.

While unequal treatment during the experimental phase could be a source of tension between treated and control farmers, we attempted to make them feel neither lucky nor unlucky in their treatment status. Rather, we emphasized that once we know which technology is better, everyone can benefit from such knowledge and that the success of this social experiment depends crucially on whether farmers exchange information or not within one year after the training, and also on whether they provide accurate information in the surveys. This sort of explanation seemed to ease tensions. Moreover, despite being uncommon, restrictions on information exchange were accepted by farmers without any revealed complaints once they understood the objectives of the research. Indeed, farmers showed a strong willingness to engage in this experiment.

After obtaining consent, we collected individual member lists from each agricultural cooperative. Out of 414 farmers on the shortlist, 275 households were found to be active rice producers who had cultivated rice at least once in the preceding year. These 275 households constitute the primary sample in this study to whom we assigned eligibility to participate in the training.Footnote 5 One half of the sample households were randomly selected as a treatment group and the other half selected as a control group at each site. Randomization was implemented at the farmer level within each site. Everyone in the sample knew not only own treatment status, but also that of their peers. We then conducted the baseline survey with those households from January 2015 to April 2015. The data obtained covered household demographic characteristics, details of rice production on all plots, other household income-generating activities, and household asset holdings.

Technical experts provided short classroom training sessions to extension agents of ANADER and three key farmers who were selected from each site.Footnote 6 Those extension agents and key farmers in turn offered on-site training to eligible farmers under the supervision of technical experts. This training consisted of (a) land preparation, including bunding and land leveling, which is crucial to reduce the amount of water wasted and to promote the even growth of rice plants, (b) water control, including canal construction and maintenance, which is important in the management of water levels in rice fields during the growth period, (c) seed selection and incubation, (d) fertilizer and herbicide application, (e) transplanting in rows, which can be adopted to facilitate other complementary management practices such as hand or rotary weeding and even the application of fertilizers, herbicides, or insecticides, and (f) harvest and post-harvest management. To mitigate noncompliance, particularly in the participation of the control farmers, local counterparts visited every session of the training and recorded who participated in it. The on-site training proceeded gradually to meet the actual rice cultivation cycle, and, in total, it was held at least six times from June to November 2015 to cover the key practices.Footnote 7

We conducted follow-up surveys twice, the first soon after the training period (March 2016 to May 2016) and, the second two years after training (March 2017 to May 2017), in which detailed data on information exchange across network members were also collected. Because there was a severe lack of rainfall during the 2015–2016 growing seasons, rice cultivation was difficult in those years. Most households could not cultivate rice in the sub-season. Even focusing on the main season, the number of sample plots cultivated for rice dropped sharply to 193 in the midline and further to 168 in the endline survey because of insufficient water. Due to the resulting lack of sufficient observations, we focused on the main season crop in the subsequent analysis and took attrition into consideration where relevant.

2.3 Descriptive Statistics and Balancing Test

Table 3.1, Columns (1)–(3) show the balance test on baseline characteristics for the sample household and plots in the main season. 275 households cultivated 328 rice fields in the baseline, of which 135 households and 160 plots belong to the treatment group.

Table 3.1 Baseline balance of sample plots by treatment and attrition status

About 72% of treated farmers had attended training at least once, while almost no (only two cases) control farmers had done so, indicating that control farmers largely adhered to our request and did not participate in the training. The attendance rate at the training, a percentage representing how many times out of six key training opportunities each farmer attended, was about 38% among treated farmers. On average, households were large (about nine persons) and headed by a male in their mid-40 s with minimal or no formal education.

We conducted a joint significant test except for the treatment and attendance variables in Column (3), demonstrating that we failed to reject the zero-null hypothesis. This suggests that our randomization was mostly successful.

We present the remaining data using plots rather than households as the unit of observation because some characteristics vary at the plot level and because the main regression analysis is conducted at this level. The data include all rice plots cultivated by sample farmers. The average plot size was relatively small, approximately 0.5 hectare. Although the treatment status was randomized, the difference in the plot size between treatment and control farmers was found to be statistically significant. Most land was operated under owner cultivation. If rented, this was generally a fixed-rent contract. The joint significant test in Column (3) again shows that these plot-level variables are jointly statistically not different by the treatment status.

Columns (4)–(6) of Table 3.1 compare the baseline characteristics of attrition and non-attrition samples with a t-test of the equality of the mean between the two and the associated joint significance F-test. Out of 328 plots, 204 continued rice cultivation in either the midline or the endline year, or both.

While attrition is not sensitive to treatment status, many observable characteristics notably differ between attrition and non-attrition samplesFootnote 8: on average, attrition samples were more likely to be female-headed with less education, larger in household size, smaller in the number and size of cultivation plots, and were more likely to own rice plots. The joint significance test shows that the zero-null hypothesis is strictly rejected, implying that attrition is non-random. This non-random sample attrition is a potential threat to causal inference, and should be addressed in the econometric analysis.

Table 3.2 presents the changes in outcome variables of interest regarding the rice management practices and productivity of non-attrition samples over time. We again show the results of t- and F-tests for treated and control plots. In addition, columns (10) and (11) present an unconditional difference-in-differences (DID) regression estimate of the treatment effect (i.e., the difference in the time trend between treated and control plots).

Table 3.2 Changes in outcome variables by treatment status: baseline, midline, and endline

The adoption of recommended management practices was generally quite high even in the baseline (Panel A). Because of its proximity to AfricaRice, adoption of the modern variety of rice was complete and uptake rates had reached 100%.Footnote 9 The use of chemical fertilizers was also remarkably high by SSA standards: on average, more than 200 kg/ha of fertilizer, such as NPK and UREA, were applied. In addition to these external inputs, the adoption rate of improved agronomic practices was mostly high. In our sample, about 90% of plots had selected better seeds by water or winnowing and constructed/repaired water canal/drainage systems in the baseline. It seems that the initial adoption rates were relatively low for leveling fields and greatly so for transplanting in a row, which might be technologies with some room for further improvement from training.

Panel B shows the rice productivity and profitability of sample plots. Gross production value per hectare is computed by multiplying the rice yield (1000 kg/ha) with the price received (CFAF/kg).Footnote 10 Rice income per hectare is equal to the gross production value minus paid-out costs, including land rent, irrigation fees, costs of purchased chemicals, and machinery rental, divided by the plot size. Profits per hectare are equal to rice income minus imputed family labor costs, divided by the plot size. To impute family labor costs, we used the typical prevailing hired wage rate for transplanting in each village.Footnote 11 The average yield exceeds 3.4 tons/ha which is significantly higher than the average of other countries in SSA of just above 2 tons/ha (Otsuka and Larson 2016). The average gross output value, rice income, and profits per hectare were about 600 thousand CFAF (or approximately 1,065 USD), 405 thousand CFAF (or 719 USD), and 320 thousand CFAF (or 568 USD), respectively.

The table also shows that while there were no statistically significant differences in the baseline adoption rate of recommended practices, treated farmers were more likely to adopt leveling, canal/drainage construction/repairs, and transplanting in rows at the time of the midline survey (Column (6)). Looking at each practice in detail, the increased adoption rate among treated farmers relative to control farmers stems partly from the fact that the control farmers did not continue several management practices, while the treated farmers did. This applies to leveling and canal/drainage construction/repairs. It is likely that insufficient rainfall, which tends to attenuate the impact of those improved practices on productivity and profitability, significantly reduced their adoption among control farmers, holding other things constant.

Meanwhile, control farmers increasingly adopted seed selection and transplanting in rows by the midline survey, and once information sharing was encouraged, the incremental adoption rate of most management practices between the mid- and endline was higher for control farmers (Column (11)). These observations suggest the existence of spillovers not only in the endline, but also the midline despite the restriction of information exchange during that period: indeed, there is room for seed selection and transplanting in rows to spill over because seed selection is relatively easy to imitate, and because transplanting depends largely on hired laborers who can assist both treatment and control farmers. Whether such technology dissemination, if any, significantly alters our view of the impact of training is one of the major issues addressed in later sections of this chapter.

On the other hand, all outcome variables except rice yield were not significantly different between the treated and control samples in the baseline, and no outcomes were significantly different in the midline and endline surveys. The unconditional DID estimates show that treatment plots increase rice yield and revenue between the baseline and midline more than control plots, while the reverse was true between the midline and endline surveys.

These results suggest that while the initial adoption rates of most management practices were already high, the treated farmers further improved or continued their rice management practices and performed better than control farmers in the first year after training. However, control farmers caught up with treated farmers in the following year. While these observations support the existence of positive training impacts in the first year and social learning in the following year, there are some reservations about the descriptive statistics. The following sections examine in more detail whether training has real impact and whether spillovers exist especially between the midline and endline surveys.

3 Dynamic Impacts of Training

3.1 Estimation Strategy

To identify the causal relationships between the provision of training and outcomes of interest, we estimate intention-to-treat (ITT) and treatment-on-the-treated (TOT) effects. We examine the average impacts of all production sites, allowing the impacts to vary across time. We are particularly interested in whether the training brings intended positive impacts in the first year after the training and whether the gap generated by the experiment decreases over time through spillovers in the next year. Following McKenzie (2012), we employ an analysis of covariance (ANCOVA) model in the form of:

$${Y}_{ijt}={\beta }_{0}+{\gamma Y}_{ij0}+{\beta }_{1}{T}_{t}+{{\beta }_{2}D}_{ij}+{{\beta }_{3}{(T}_{t}\times {D}_{ij})+X}_{ij0}\delta +{\mu }_{j}+{\varepsilon }_{ijt}$$
(3.1)

where \({Y}_{ijt}\) and \({Y}_{ij0}\) are the post- and pre-treatment outcome variables of plot i in production site j at time t (i.e., either midline or endline) and time 0 (i.e., baseline), respectively; \({T}_{t}\) is a dummy variable for the endline data; \({D}_{ij}\) is a dummy variable equal to one if a household is eligible to participate in the training (ITT estimate) or a continuous variable for the attendance rate of training, instrumented by the treatment status (TOT estimate)Footnote 12; \({X}_{ij0}\) is a set of baseline control variables; \({\mu }_{j}\) is the time-invariant fixed effect at the production site; and \({\varepsilon }_{ijt}\) is the unobserved error term. The parameters of interest are \({\beta }_{2}\) and \({\beta }_{3}\). The former is expected to capture the short-term impacts of training under the imposition of the SUTVA, while the latter represents the mixture of the longer-term training impacts and spillover effects when the SUTVA is relaxed. We note that the pure training impact can be estimable only in the short term.

As outcome variables, we focus on the use of chemical fertilizer (kg/ha), the adoption of seed selection by water or winnowing (=1), leveling (=1), canal/drainage construction/repairing (=1), and transplanting in row (=1) as well as rice yield (ton/ha), gross output value (‘000CFAF), rice income (‘000CFAF), and rice profit (‘000CFAF) per hectare. When the outcome is binary, we apply a linear probability model. As baseline control variables, we include household size, household head’s characteristics (including age, gender, and years of education), plot characteristics (including parcel size and tenure status dummies), and the logged value of household assets at the baseline survey. We cluster all standard errors within production sites.Footnote 13

While the random assignment of treatment status should make the treatment and control groups similar in all dimensions, the estimated parameters may be biased due to non-random sample attrition. To adjust for that, we use the inverse-probability weighting method suggested by Wooldridge (2010). Specifically, we ran a probit regression to compute the predicted probability of non-attrition at the plot level and used the inverse of this as weights in the main equation. This first-stage probit regression result is presented in Appendix 1.

3.2 Estimation Results

Table 3.3 shows the estimation results for the dynamic impacts of management training on rice productivity and profitability with coefficients on control variables suppressed for the sake of brevity. Note that the sample size here is 353, smaller by 8 from the sum of 193 (midline) and 168 (endline) observations due to missing explanatory variables in these few plots.

Table 3.3 Estimated results on the dynamic impacts of training: rice productivity

It is clear that training had positive and significant impacts on rice productivity by the midline, with the rice yield increasing by 0.75 ton/ha, the gross output value per hectare by 140 thousand CFAF, and rice income per hectare by 103 thousand CFAF. These improvements correspond to 20%, 24%, and 29% of the control means, respectively, suggesting that management training was effective in our context.Footnote 14 This improvement in productivity, however, did not lead to an increase in profits. As we will see, this might be because trained farmers test a larger number of improved management practices than control farmers, which require more family labor inputs. Qualitatively similar results were observed for TOT estimates. The fact that we found quantitatively larger magnitudes of impacts in TOT than in the ITT estimates suggests that actual training participation rather than simple eligibility is important in the improvement of production performance.

Notably, the coefficient estimates on the interaction term were negative and significant for rice yield and gross output value per hectare. This indicates that the improvement of performance among treatment groups from the mid- to endline was lower than for the control groups. The Wald test shows that we cannot reject the null hypothesis that the total training effect is zero in most specifications, implying that treated farmers are no better than control farmers by the endline. We could interpret this negative interaction term, \({\beta }_{3}\), as reflecting either the short-lived training effects or the existence of spillover effects. If training impacts do not last long, however, we would have observed some signals, such as a declining adoption rate of improved management practices among treated farmers from the mid- to endline surveys. We did not observe clear dis-adoption patterns among the data presented in Table 3.2. Thus, this finding seems consistent with the operation of a mechanism wherein control farmers improve their performance by learning from treated farmers after the SUTVA is relaxed.

Table 3.4, which shows the estimated impacts of training on the adoption of improved agronomic practices, also provides supportive evidence for the existence of spillovers. When the information exchange between treated and control farmers was restricted during the year after the training, a positive training impact on the adoption of improved management practices, such as leveling, canal/drainage construction/repairing, and straight-row transplanting is observed among treated farmers (ITT estimate) and training participants (TOT estimate). These results are in line with the expectation drawn from Table 3.2 that leveling and straight-row transplanting have a relatively large opportunity to assist improvement.

Table 3.4 Estimated results on the dynamic impacts of training: agronomic practice

On the other hand, we did not observe the same positive effects on fertilizer use and seed selection, presumably because there is little room for improvement due to the high initial adoption rates of those practices at the baseline. Also, given that the same amount of fertilizer was provided to both treated and control groups in the experimental phase, and that it is easy for farmers to imitate seed selection techniques, it seems reasonable to observe negligible effects on these practices.

Once the restriction was lifted two years after the training, control farmers caught up with treated farmers in the adoption of recommended practices, as reflected in the negative and significant coefficients on the interaction term, \({\beta }_{3}\). The Wald tests also revealed that in most outcomes we failed to reject the hypothesis of zero training impact in the longer term.

Note also that, if spillovers exist, the average performance of control groups would improve over time, which should be reflected in \({\beta }_{1}\) (the endline dummy) > 0. \({\beta }_{1}\) is positive for most outcome variables and statistically significant for the adoption of canal/drainage construction/repairing and straight-row planting for TOT estimation, further supporting our interpretation in favor of the existence of spillovers.Footnote 15

Taken together, we confirm that, after removing selection bias using the randomized experiment, training has positive impacts in the short term not only on the adoption of improved rice management practices, but also on rice productivity. Spillovers do not exist or at least do not matter much to completely cancel out positive training impact in the initial phase of the experiment. Our further intervention encouraging farmers to spread information, however, improved control farmers’ practices and contributed to helping them catch up with treated farmers, presumably through spillovers.Footnote 16

4 Spillover Effects

4.1 Information Network Analysis

4.1.1 Learning Link Data

Having outlined the treatment effects across time, we now examine whether social networks actually mediate information spillovers from treated to control farmers, using the detailed learning link data.

An empirical challenge on this topic is how to correctly specify social networks. Asking respondents about their social networks by arbitrarily setting a cap on the number of links may result in truncation bias, while asking an open-ended question tends to capture only the strong links, ignoring the weaker ones (see, for example, Maertens and Barrett (2013) for a thorough discussion of potential bias in empirically eliciting the true social network structure). To address this concern, we exploited a “random matching within sample” technique to elicit social networks, following, among others, Conley and Udry (2010), Maertens and Barrett (2013), and Mekonnen et al. (2018). More specifically, we matched each sample respondent with six other survey respondents randomly drawn from the sample at the same production sites and asked for details of the (non)existence of information exchange about agronomic practices between samples of farmers. We considered that a learning link between a respondent farmer i and a matched farmer j is established if i has ever asked j for advice at some time before interviews. This includes learning even before the training. We did not limit our interviews to only the post-training period, because respondents who violated the no-information-sharing rule, but were eager to satisfy the researchers’ expectations, would likely manipulate their answers if they were asked about their behavior only after our intervention. If the reference period includes the non-intervention phase, they are more likely to freely provide the actual answer.Footnote 17

To examine the differential roles played by treatment and control peers, we selected three matches from treated farmers and another three from control farmers. To capture changes in the network of interactions over time, we collected the learning link data in both the mid- and endline surveys.

Table 3.5 presents summary statistics of the learning link data for each year, separately for the probability of sample farmers knowing their match and the probability of sample farmers asking for agricultural information on the match, conditional on the former knowing the latter. Because of the attrition of own and paired sample households, we had a total of 1,664 and 1,607 observations in the mid- and endline surveys, respectively. These samples included farmers who do not have rice production data in the baseline.

Table 3.5 Summary statistics of network data

Conditional on sample farmers knowing their matches, about 80 to 90% of farmers asked their match for advice on agronomic practice, such as land preparation, transplanting, and fertilizer application in the midline. These results look high at first glance but seem to be reasonable because they reflect the probability that a sample farmer has ever asked matched farmers for agricultural advice. Thus, it may be more useful to focus on the differential probability of information exchange rather than the absolute level.

Compared with Control-Control pairs, Control-Treatment pairs are less likely to ask for advice in the midline, but more likely to do so in the endline. All other pairs have similar trends: they are more eager to ask advice by the endline surveys.

4.1.2 Dyadic Regression

We then ran a dyadic regression for those who know their matches to characterize the flow of information about management practices between farmers over time.Footnote 18 Formally, let \({L}_{ijt}\) be equal to one if a respondent farmer i has ever asked a farmer j (conditional on i knows j) for advice by time t. We explore the correlates of learning links by including the attributes of a household i and j as:

$$ \begin{aligned} L_{ijt} = & \, {\updelta } + {\gamma T}_{\text{t}} + {\upalpha }_1 D_{ij}^1 + {\upalpha }_2 D_{ij}^2 + {\upalpha }_3 D_{ij}^3 \\ & {{ + \beta }}_1 \left( {D_{ij}^1 \times {\text{T}}_{\text{t}} ) + {\upbeta }_2 (D_{ij}^2 \times {\text{T}}_{\text{t}} } \right) + {\upbeta }_3 \left( {D_{ij}^3 \times {\text{T}}_{\text{t}} } \right) \\ & + \left( {X_i + X_j } \right)\rho + \left( {X_i - X_j } \right)\tau + W_{ij} \pi + \varphi + u_{ijt} \\ \end{aligned} $$
(3.2)

where \({D}_{ij}^{1}\), \({D}_{ij}^{2}\), and \({D}_{ij}^{3}\) are a combination of the treatment status of households i and j with [treated, treated], [treated, control], and [control, treated]. The remaining combination [control, control] is a reference group; T is a binary indicator for the endline survey; \({X}_{i}\) and \({X}_{j}\) denote a vector of baseline controls for farmers i and j characteristics, respectivelyFootnote 19; \({W}_{ij}\) describes a dummy equal to one if the gender of both farmers is the same; \(\varphi \) is the production site fixed effect; and \({u}_{ijt}\) is a random disturbance. Following Attanasio et al. (2012) and Takahashi et al. (2019a, b), standard errors are clustered at the production site level to allow for possible correlations not only within dyadic pairs but also across all dyads in the same location.

Table 3.6 presents estimated results using a linear probability model. The coefficient estimate on the [treated, treated] dummy, \({\mathrm{\alpha }}_{1}\), is positive and statistically significant, but its interaction term with the endline data dummy, \({\upbeta }_{1}\), is statistically insignificant. This indicates that information exchange between treated farmers is more active than between control counterparts at the same production site, and this tendency does not systematically change over time. It might be that treated farmers are more likely to exchange information with each other to reinforce agricultural skills taught in the training, although we cannot completely deny the possibility that there was some baseline imbalance in the randomized matching process.

Table 3.6 Estimated results on the dyadic regression

On the other hand, and consistent with our expectations, we observed a negative and significant coefficient on the [control, treated] dummy, \({\mathrm{\alpha }}_{3}\), and a positive and significant coefficient on the interaction term with the endline data dummy, \({\upbeta }_{3}\). The results suggest that controlled farmers refrained from asking agricultural advice from treated farmers or the latter refrained from disclosing management information to the former in the first year after training, but that they were eager and active in doing so in the second year after training. This indicates that impact evaluation in the initial phase was less likely to be undermined by spillovers, supporting our claim that the recommended practices are more productive. It also supports our main finding that there were information spillovers after the relaxation of the SUTVA in the two years after training, which would facilitate control farmers to improve their rice management practices and performance through social learning.

The insignificant effects of the [treated, control] dummy, \({\mathrm{\alpha }}_{2}\), and its interaction with the endline dummy, \({\upbeta }_{2}\), are also broadly consistent with our expectations because treatment farmers may have no more incentive to ask control farmers for advice than control farmers do. Considering that our network data were intended to capture the one directional flow of information, non-symmetric results of \({\mathrm{\alpha }}_{2}\) and \({\mathrm{\alpha }}_{3}\) seem reasonable.

4.2 Extension to the Linear-In-Mean Model

While our analysis so far supports the existence of social learning, one may wonder if social learning actually plays a significant role. If this were so, we might observe the influences of peer behavior and performance on one’s own behavior. As a final robustness check to verify this possibility, we employed an extended linear-in-mean model.

We restricted the observation of this analysis to the endline year as this reflects the normal condition without the prohibition of information exchange in which spillovers are more likely to take place. We also restricted the outcome variables to rice yield, gross output value per hectare, the adoption of field leveling, canal/drainage construction/repairing, and straight-row planting, for which strong information spillovers from treated to control farmers seem to exist as observed in Tables 3.3 and 3.4.

To disentangle social effects from other confounders, we modify Eq. (1) as follows:

$$ \begin{aligned} Y_i = & \, \gamma_0 + X_{i0} \delta_1 + \overline{X}_{ - i0}^N \delta_2 + \gamma_1 Y_{i0} + \gamma_2 \overline{Y}_{ - i, t - 1}^N \\ & + {\text{Networ}}\,k_i + \frac{{\# {\text{Treatment}}_i^N }}{{\# {\text{Network}}_i }} + \mu + \epsilon_i \\ \end{aligned} $$
(3.3)

where \({\overline{X} }_{-i0}^{N}\) denotes the average values of baseline observable characteristics in i’s information network excluding i’s own value, regardless of whether network peers are treated or control farmers. For simplicity, we omit the subscript j to denote a production site. As in the previous sub-section, we define network peers as those persons i had asked for agricultural advice by the time of the end-survey. \({\overline{X} }_{-i0}^{N}\) are then computed using the baseline data for each network peer’s characteristics. This, along with the production site fixed effects, \(\mu \), serves to control for environmental and institutional factors that lead farmers to behave in a similar fashionFootnote 20; \({\overline{Y} }_{-i,t-1}^{N}\) is the average productivity or technology adoption in i’s network at the midline, regardless of their treatment status. This allows us to explore whether their peers’ average behavior and performance directly affect farmer i’s performance. Following Mekonnen et al. (2018), we use lagged rather than contemporaneous values of mean group performance or behavior in the recognition that information on agricultural technology cannot be diffused quickly; and \(Networ{k}_{i}\) is the network size (i.e., max six), while \(\frac{\#Treatmen{t}_{i}^{N}}{\#Networ{k}_{i}}\) is the share of treated farmers in i’s information network. We expect the latter to capture peer effects, especially those mediated by treated farmers. This is akin to the methodology used by Kremer and Miguel (2007) and Oster and Thornton (2012). The original intuition behind this method is that once we control for network size (which could be potentially endogenous),Footnote 21 the share of network peers in the treatment group is random because of the randomized experiment. This exogenous variation can be then used to identify peer effects.

Note that the average peer performance and the share of treated farmers in i’s network are expected to reflect different channels of peer effects; the former may partly capture learning by direct observation even without mouth-to-mouth communication, while the latter may partly capture knowledge transmission from treated farmers even when treated farmers do not actually adopt new technologies.

The estimated results in Table 3.7 show that the share of treated farmers significantly and positively affected their own behavior and performance for gross output value per hectare, leveling, and straight-row planting. We also observed a positive effect on the average performance of network members in most specifications, although this was statistically insignificant except for leveling, perhaps due partly to low statistical power and partly to difficulties in mimicking new technologies without learning through deep communication. While these results should be interpreted with caution to avoid strong causal inference, it seems to be no exaggeration to argue that the results provide further suggestive evidence on the existence of spillovers, especially mediated through treated farmers.Footnote 22

Table 3.7 Estimated results on the linear-in-mean model

5 Discussion

Before concluding this chapter, we must note the several caveats in our study. First, the performance of the rice-growing farmers in our study was better than we expected, due presumably to past training provided by local governmental and international organizations such as AfricaRice. Therefore, many recommended practices were known and practiced by sample farmers even before the training program, except for straight-row transplanting and, to a lesser extent, leveling. This limited our scope, since we could not explore the variations and magnitudes of the spillover effects of different cultivation practices when they are introduced to “virgin land.”

Second, while rice production is sensitive to weather conditions, especially rainfall, there was a significant lack of rainfall during the growing seasons in the midline and endline surveys, which resulted in many farmers halting their rice production during our observation periods. Weather conditions cannot be controlled, so we made attempts to mitigate potential estimation bias. Nevertheless, the conclusions may have been more solid and credible if the experiments had been conducted in more ideal settings.

Third, we proposed a new experimental design to implement rigorous impact evaluation and the promotion of spillovers in a unified framework. To achieve the same dual objectives, many existing studies use two-step randomization, in which they randomly select treated and control villages first, and then treat and control individuals within treated villages, allowing spillovers within the treated village. However, this type of cluster-level randomization is often costly, since it requires a larger sample size than individual-level randomization to have sufficient statistical power. We added to the literature by showing an alternative, less costly approach in collaboration with farmers.

One may argue that our approach, especially the restriction of information exchange between farmers, may pose an ethical concern if the training impacts are a priori known to be positive. Although our expectation of training impact was positive as recommended rice practices were mostly established in experimental stations and several countries in SSA have successfully improved rice productivity (Otsuka and Larson 2013, 2016), we were not sure whether that was the case at our study sites. As a result, we felt it was important to evaluate the training impact through an RCT because it is common to observe differences between on-farm and on-station results as well as across countries.

We also wondered whether management practices taught in our training were ineffective for those who had already received similar training in the past or those whose productivity was already close to the production possibility frontier. Thus, we believe that our approach did not conflict with the “do no harm” principle. Rather, once positive impacts were confirmed, we encouraged information exchange between treated and control farmers. By doing so, we were able to successfully reduce any inequalities between treated and control farmers generated by our experiment. It might be valuable to build in this kind of mechanism in other models to allow control groups to catch up with treated ones, which is often overlooked in existing RCTs.

Finally, while we carefully executed an RCT to establish internal validity, we are not fully confident of the external validity of our method. First, our sample farmers were extremely collaborative, which may not always be the case. Second, we provided control farmers with improved seeds, fertilizer, and herbicide on credit. We took this approach because we wanted to isolate the impact of management training from the use of current inputs and also enhance the cooperation of control farmers. But if the provision of credit induces farmers to be more keen to learn technologies, or there is complementarity between current inputs and management practices (e.g., Ragasa and Mazunda 2018), our findings may not be reproduceable in other experiments or scaled-up implementation without this input provision.

6 Conclusions

This chapter executed an RCT to examine whether rice production management training has positive impacts on the adoption of recommended practices and productivity in the short term as well as whether social learning can be effective for the wider diffusion of recommended practices by facilitating information spillovers from treated to control farmers in the subsequent period. By using a random assignment of farmer’s eligibility for training participation we attempted to reduce selection bias in impact evaluation. Also, by asking farmers not to exchange information in the initial phase of impact evaluation, we attempted to maintain the SUTVA.

We found positive and significant short-term effects from this training, which widen the gap in yield by 20%, the gross output value per hectare by 24%, and the adoption rates of selected rice management practices between treated and control farmers. However, after the restriction on information exchange was removed, control farmers improved their performance significantly, and, as a result, the gap between treatment and control-group farmers becomes virtually zero in the longer term. This suggests that information dissemination by farmers can be as effective in improving practices as the initial training provided by extension services. Although the generalizability of our findings may be questioned, Nakano et al. (2018a, b) found similar results in Tanzania. Our detailed analysis of learning link data and peer effects provides further supportive evidence for the existence of information and technology spillovers.

Our experiment relies on random assignments of management training without any monetary incentive scheme. For future research, it seems to be vital to inquire in different contexts how to best select treated nodes and whether monetary or other incentives should be given to them to maximize social benefit as well as to examine the external validity of our research findings (Kondylis et al. 2017; Maertens 2017; Barrett et al. 2018; Beaman and Dillon 2018; BenYishay and Mobarak 2018; Shikuku 2019).