Abstract
Precise agricultural statistics are necessary to track productivity and design sound agricultural policies. Yet, in settings where multi-cropping is prevalent, even crop yield—perhaps the most common productivity metric—can be challenging to measure. In a survey of the literature on crop yield in low-income settings, we find that scholars specify how they estimate the area denominator used to measure yield in under 10% of cases. Using household survey data from Tanzania, we consider four alternative methods of allocating land area on multi-cropped plots, ranging from treatment of the entire plot as the yield denominator to increasingly precise approaches that account for the space taken up by other crops. We then explore the implications of this measurement decision for analyses of yield, focusing on one staple crop that is often grown on its own (rice) and one that is frequently found on mixed plots and in intercropped arrangements (maize). A majority (64%) of cultivated plots contain more than one crop, and average yield estimates vary with different methods of calculating area planted—particularly for maize. Importantly, the choice among area methods influences which of these two crops is found to be more calorie-productive per hectare. This choice also influences the statistically significant correlates of crop yield, such that the benefits of intercropping and including legumes on a maize plot are only evident when using an area measure that accounts for mixed cropping arrangements. We conclude that the literature would benefit from greater clarity regarding how yield is measured across studies.
1 Introduction
Precise agricultural statistics are necessary to measure and track productivity, allocate scarce resources effectively, and design policies and investments aimed at agricultural sector development in low-income countries. The need for reliable agricultural data has been noted since the 1990s (Kelly et al. 1995; Diskin 1997) and continues to be emphasized to this day (FAO 2010; Carletto et al. 2015a). To address what had been a decline in the quantity and quality of agricultural statistics in low-income countries (and particularly in Africa) (FAO 2010), the Living Standards Measurement Study Integrated Surveys on Agriculture (LSMS-ISA) were launched in 2008 (Carletto et al., 2010). These nationally representative household data sets gather detailed information on agricultural production at the plot level. However, gaps remain in our understanding of how to use household survey data to generate accurate agricultural statistics, with limited attention given to the implications of different choices around how some common variables (such as crop yield) are constructed.
In this paper, we examine the challenge of estimating crop yield on plots that contain more than one crop. Such plots are extremely common in low-income countries, for reasons ranging from risk reduction when crops exhibit different sensitivities to climate variation or disease, to labor constraints, to the maximal utilization of limited space, to the productivity benefits of certain intercropping arrangements (Kelly et al. 1995; FAO 2017).Footnote 1 On mixed plots, area data will be misleading and yields underestimated if the presence of other crops is not accounted for, making it necessary to somehow apportion the cropped area among the component crops. However, no consensus exists on how a plot’s land area ought to be attributed to each crop when calculating yields (Fermont and Benson 2011; FAO 2017). Furthermore, agricultural economists often do not specify how they allocate land area among different crops in mixed cropping systems.
Poor or imprecise yield estimates can potentially affect the quality of research on agricultural systems. Thus, a study of the yield effects of climate variability (Rowhani et al., 2011) necessarily rests on having an accurate record of crop yields. Research evaluating the potential for yield improvement (GYGA 2018) similarly requires a clear view of actual yields. Studies of the returns to investments in agricultural research and development often rely on yield measures (Maredia and Raitzer 2010; Perez and Rosegrant 2015), and the calculated benefits of a fertilizer subsidy program are grounded in the measured effects of fertilizer on crop yield (Jayne et al. 2013). Research on the yield effects of intercropping in smallholder systems also hinges on good measures of crop yield (Himmelstein et al. 2017).
There are several options to use as the denominator in a measure of crop yield (quantity harvested per area planted) in the presence of mixed cropping. Analysts may use the whole plot for each crop found on the plot and then report average crop yields by intercropping status, although this does not allow for aggregating crop areas at a higher level (Fermont and Benson 2011). A second option is to divide the area evenly among all crops present on the plot, although the validity of assuming that crops share the land equally is in doubt (Diskin 1997). A third option is to proportionally allocate the plot area among various crops, essentially adjusting the area and yield values to pure stand estimations. Toward this end, the plot area can be divided based on a visual assessment of the proportion occupied by each crop, or with reference to seeding rates or objective measures of row ratios or crop densities (Fermont and Benson 2011; Sud et al. 2016; FAO 2017). The latter methods are considered preferable, though expensive and time-consuming (FAO 2017).
In this paper, we first conduct a survey of the agricultural economics literature to quantify how often authors specify how yield measurements in the presence of multi-cropped plots are constructed. We then consider four alternative methods of allocating land area on mixed plots in Tanzania, focusing on one crop that is often grown on its own (rice) and one that is frequently found on mixed plots and in intercropped arrangements (maize). We ask whether the average crop yield differs with each approach to allocating plot areas; whether the choice of method affects which crop is found to be more productive, or which region of the country is more favorable for a given crop; and whether the statistically significant correlates of crop yield differ in magnitude with different methods. As expected, we find that average yield estimates do vary with different methods of calculating area planted, although this pattern is more pronounced for maize. The choice among methods also influences which of these two staple crops is found to be more calorie-productive per ha, as well as the extent to which inputs, such as fertilizer, are expected to be productivity-enhancing for maize production in Tanzania. Fewer differences are evident when comparing yield estimates with an equal-allocation method versus a more complex proportional scaling method. This suggests that the simpler approach may be adequate when using household survey data.
The remainder of the paper is organized as follows: Section 2 describes the outcome of a survey of the literature to determine how crop areas on multi-cropped plots are typically estimated. Section 3 introduces the data set used in analysis. A description of the variables constructed is provided in section 4, along with an overview of the empirical approach used for econometric analyses. Descriptive and econometric results are found in sections 5 and 6, respectively. Section 7 concludes with a discussion of implications.
2 Literature survey
We begin with a survey of the literature to tabulate how authors appear to account for multi-cropping in their yield calculations. We considered the top five most highly-ranked journals in agricultural economics (based on the 2012–2016 average impact factors, per InCites). These include Food Policy, Applied Economic Perspectives and Policy, the American Journal of Agricultural Economics, Agricultural Economics, and the Journal of Agricultural Economics. Within each journal, a Scopus search was conducted for the term “yield” in the paper title, abstract, and keywords, for all papers published from 2008 to 2017. This produced 222 papers. Among these, we identified those papers that focus on a low- or middle-income country, that refer to household survey data in a quantitative measure of crop yield, and that focus on field crops. This reduced the sample to 40 papers.
Among these, we tabulated how often the paper either uses the entire plot area as the denominator in a yield measure, adjusts the area under a given crop for the presence of other crops on the plot, or does not specify how the yield denominator is defined. We found that three (7.5%) papers specify that they use the entire plot size as a denominator, and none specify explicitly that they adjust the crop areas to account for intercropping. The remaining 37 (92.5%) do not specify what is used as a denominator in the analysis. In some cases, we can discern clues regarding whether the crop area was adjusted to correct for the presence of other crops on the plot. For example, in three papers, intercropped status was included as an explanatory variable in a yield function, with a positive yield effect. This suggests that the yield denominators were likely adjusted. However, in no paper did we find an explanation of how crop areas were captured in the survey or adjusted in the analysis.Footnote 2 Thus, it is worthwhile to consider the implications of various choices these authors might have made.
3 Data
The National Panel Survey (NPS) of Tanzania is implemented by the Tanzania National Bureau of Statistics and is a research initiative within the Development Economics Research Group of the World Bank. This nationally representative data set captures a rich set of information on farm-household agricultural production at the plot level. We focus primarily on the 2014/15 survey wave, which reflects the main growing season of 2013/14. For this season, the sample contains 2048 households that produced crops. We focus on maize and rice as staple crops that exhibit different cropping patterns. The sample contains 1430 households that produced maize (on 2088 individual plots) and 432 households that produced rice (on 491 plots) in the main season. Some observations are dropped due to incomplete surveys, leaving somewhat smaller sample sizes for analysis. In addition to the estimates of plot area provided by respondents, 71.5% of cultivated plots were measured by Global Positioning Systems (GPS). Population weights are applied in all analyses. The Stata code used in analysis is available from the authors upon request.
4 Variables and empirical approach
In the Tanzania NPS, a plot is defined as a contiguous piece of land of uniform tenure (NBS 2017),Footnote 3 and respondents list the crops grown on a given plot and identify whether each crop is intercropped. For seasonal crops, respondents also estimate the proportion of plot area that was cultivated with each crop, with options of one-quarter, one-half, three-quarters, and the entire plot. Where crops are intercropped, however, this estimate necessarily includes the area shared with other crops. Thus, if a 1 hectare (ha) plot is intercropped with maize and beans, both crops are recorded as being planted on 1 ha of land. We focus only on the main growing season in this paper. The proportion of plot area under fruit trees or permanent crops (such as banana or cassava) is not captured in the survey, although respondents do list the number of plants or trees found on the plot.
Several issues arise when estimating the area under each crop. First, the smallest area estimate possible for seasonal crops is one quarter of the plot, which may overestimate the area for marginal crops. Even with no intercropping, the summed area estimates for various crops on a plot can potentially exceed the plot size. Another challenge is that the area under permanent crops can only be estimated with per-plant (or per-tree) area estimates, which are not provided with the data set. Some crops in this category, including fruit trees, are likely to be present on the farm in small numbers, but others may claim a non-negligible amount of space. Finally, it is not obvious how to allocate the land area where multiple crops are intercropped, as the survey does not capture which crops are intercropped together on the same section of the plot, and at what planting density. For example, a plot that is intercropped with one row of maize followed by one row of beans is described in the survey data in an identical manner as another plot that is intercropped with two rows of maize followed by one row of beans.
We examine the implications of four alternative approaches to estimate the area under crops in the presence of mixed cropping (Table 1). Method 1 considers the area under each crop on the plot to be the entire plot area. In cases where more than one crop is present, each crop is assigned the entire plot area. With this crude approach, the summed areas under crops would necessarily exceed the actual plot size whenever more than one crop is present. Note that any fallow land on the plot (which is possible with the plot definition used in the Tanzania NPS) is treated as if it were cropped. Method 2 utilizes the information collected on the proportion of plot area that was planted with a given seasonal crop. As noted, this area may include within it other crops, and as such, the total summed areas under crops may again exceed the plot size. With this method, the area under crops necessarily excludes any fallow areas. Method 3 assumes that the plot area is divided equally among all seasonal crops present, in addition to several permanent crops that are likely to occupy non-negligible areas on multi-cropped plots, including banana and cassava (both staple crops in Tanzania), as well as pigeon pea and pineapple. Other fruit trees / permanent crops are understood to be less likely to affect the area estimates for the crops being evaluated (maize and rice), and this method of allocating areas therefore ignores their presence on the plot. As with method 1, any fallow land on the plot is treated as if it were cropped.
Method 4 accounts for the estimated proportion of the plot on which each seasonal crop is planted and combines this information with per-plant (or per-tree) area estimates for all fruit trees / permanent crops (NBS 2011). Specifically, crops that are monocropped are first assumed to cover the entire quarter-based proportion of plot area (or, in the case of permanent crops, the area estimated using per-plant area estimates). If rice is present on one-quarter of a 1 ha plot and is not intercropped, we assume it covers one-quarter ha (see Fig. 1). Then, the residual area on the plot (not accounted for by monocropped crops) is allocated among the remaining crops. Where the summed area estimates for these crops exceed the residual area on the plot, the area estimates are scaled down proportionally to equal this residual area. In Fig. 1, consider a 1 ha plot with rice that is monocropped on 0.25 ha. In the residual area (not monocropped), maize and beans are intercropped, such that the proportion under each crop is reported as 0.75 ha. These last two values are scaled down to each equal \( \frac{0.75}{2} \) ha. Fallow land is accounted for only if the areas under crops sum to less than the plot size.
No method listed above is “perfect,” and even method 4 (utilizing the most detailed information available) is unlikely to be perfectly accurate. However, by applying the four methods to the same data set, we can comment on some of the implications of choosing one method rather than another.
As noted, the size of all plots is estimated by respondents. However, survey respondents often overestimate the area of small plots—particularly by rounding up to an even value, such as one acre—and underestimate the area of large plots (de Groote and Traore 2005; Carletto et al. 2015b). We therefore use the GPS area measures that are available for nearly three-quarters of cultivated plots. The sizes of plots that were not measured are imputed using the respondents’ plot size estimates, along with information derived from local measures of other plots.Footnote 4
The definitions of other variables used in analysis are provided in Table 2. The four approaches to estimating area under crops, as outlined in Table 1, are used to produce four estimates of crop yield (kg harvested / ha under crop). These are numbered 1 through 4, reflecting the methods used to compute area. Input application rates are also computed with reference to crop area estimates 1 through 4. Seed kg/ha uses crop-specific seed amounts and areas (i.e., areas 1–4). However, the use of inorganic fertilizer, manure, and labor were captured only at the plot level, and the denominator for these application rates is the summed area that was cultivated on a given plot. This denominator is the plot size for methods 1 and 3 and also for method 2 whenever the summed areas exceed plot size. For method 4, the per-crop areas are summed within each plot. This approach assumes that fertilizer and labor are allocated equally across crops on a plot, and therefore the application rate applies to every crop observation.
Respondents listed up to two household members that held decision-making power over what to plant on each plot. This information is used to categorize plots by the gender of decision-maker, with categories including women-only or men-only/joint management involving both men and women.
To understand the implications of applying each method to estimate the area under crops, we explore in section 5 a range of descriptive statistics for maize and rice. For example, we compare the mean yield estimates and consider which crop would be viewed as most productive (in terms of calorie production) with each yield measure. In section 6, we also consider whether the detected correlates of crop yield vary with different methods of measuring yield, using a set of linear regressions in which different yield measures are used as the dependent variable.Footnote 5 The following equation is used:
where Yijr is a measure of crop yield for crop i on plot j in region r; Areaijr is the estimate of area under the crop that corresponds to the method used to estimate Yijr; Inputsijr is a vector of corresponding input intensities; Plot _ characteristicsjr is a vector of plot characteristics; τr are region fixed effects; and εijr is a stochastic term. The region fixed effects control for broad geographic differences in seasonal weather outcomes in this growing season. This model is estimated separately for maize and rice.Footnote 6
5 Descriptive results
In the 2013/14 main season in Tanzania, 64% of cultivated plots contain more than one crop, and because plots containing a single crop tend to be smaller than others, roughly three-quarters of the area under crops (depending on which area method is used) contains multiple crops. This makes the choice of how to allocate plot area among multiple crops highly consequential when generating agricultural statistics. Descriptive statistics in Table 3 illustrate some of these consequences for maize and rice. As expected, the estimated area under each crop is adjusted downward over areas 2–4 to reflect, in different ways, the presence of other crops on the plot and the plot areas that are left uncultivated. It follows that estimated yields will generally increase when using these methods.
Several other plot characteristics are relevant to yield. It is much more common for rice to be planted on its own, with 73% of rice observations on pure stand plots, while just 21% of maize observations are pure stand. We therefore expect the different decisions regarding area estimates under each crop to be more relevant for maize than for rice. Because legumes fix nitrogen, they are understood to enhance the yield of other crops when intercropped together (Dakora and Keya 1997; Snapp et al. 2010). In 40% of maize cases, a legume is also found on the plot. (The definition of a “plot” in this survey, which can include multiple cropping regimes, leaves us uncertain as to whether an intercropped maize crop is intercropped directly with the legume or is planted adjacent to it.)
As shown in Table 4, mean yield estimates generally increase over yields 1–4. Particularly for maize, accounting for the space taken by other crops on the plot (yields 3 and 4) produces much higher average yields than are otherwise estimated. In the bottom panel of Table 4, we test whether the mean yields that are derived with different methods are statistically significantly different. For maize, it seems that all four methods produce mean values that differ from one another. For rice, yield 1 (using the entire plot size as the area under crop) does differ significantly from the other methods. However, the other three yield measures do not significantly differ from one another, indicating that a decision among these may not be very consequential for analyses of rice yield.
Policy makers, agricultural research stations, or development practitioners may decide to prioritize one crop over the other based on an understanding of their relative productivity. We next consider whether relative productivities differ with different approaches to measuring yield. Because kilograms cannot readily be compared across crops, productivity is measured here as the mean calories produced per ha cultivated, with per-kg calorie values taken from Wu Leung et al. (1968). The results in Table 5 show that, while rice is estimated to be more productive with yields 1 and 2, maize emerges as the more productive crop with yields 3 and 4. Because maize is so often grown with other crops, its superior productivity is obscured until the space claimed by other crops on the plot is somehow addressed.
Policy makers may also allocate resources to different geographic regions with consideration of their relative productivity for a given crop. We next ask, does this calculus shift with different yield measures? Table 6 displays the mean maize yields found in two of the primary maize-producing zones in Tanzania, the Southern Highlands and Northern zone. In Northern zone, maize is somewhat more likely to be intercropped (60%) and to have a higher number of crops sharing the plot (mean = 3.34). For the Southern Highlands, these figures are 55% and 2.64, respectively. For all four yield measures, the Southern Highlands exhibit higher average maize yields. However, for yield 1, this value is 20.86% higher than the average value observed in the Northern zone, while it is just 4.90% higher with yield 3. Furthermore, the differences in mean yields are not statistically significant when using yield 3 or yield 4. Therefore, the extent to which the Southern Highlands are found to be more favorable for maize production does vary, depending on how yield is measured.
Because some area measures do not account for mixed cropping, different measures will necessarily produce divergent stories regarding the aggregate areas found under each crop. (In fact, the commonly-cited FAOSTAT database (FAO 2019) does not seem to provide guidance for how countries should report the area harvested of a given crop, defining this only as “the area from which a crop is gathered.”) Using the Tanzania NPS data, Fig. 2 displays the total areas cultivated with maize and rice in the main season. Methods 1 and 2 necessarily double-count areas that are shared by multiple crops, resulting in high country-level area estimates for each crop. Thus, method 2 results in an aggregate area under maize that is 42.38% greater than method 4. For rice, which is usually planted alone, this difference is just 6.48%. Because method 3 assumes the entire plot is cultivated, the estimated total area cropped is a bit larger than for method 4.
We next consider possible variation in farm and management factors correlated with yield using two measures that vary dramatically from one another, area 2 (the un-adjusted share of plot area under a given crop) and area 4 (with crop areas that have been scaled down proportionally to plot size). Table 7 presents the average yields for these two measures for various subgroups of crop observations. The average difference between yield 4 and yield 2 for a given subgroup is given in the last column, with asterisks that denote a statistically significant difference in this value between the two mutually exclusive subgroups. For example, for intercropped maize observations, the average difference between yield estimates is 798.23 kg/ha, and this is statistically significantly different (at the 1% level) from the average difference seen for monocropped observations (134.85 kg/ha).
As expected, the difference between yield 4 and yield 2 is larger, on average, for crop observations that are found on plots that contain other crops, relative to those with only one crop. For maize, the average difference between yield measures is larger on small plots (≤ 0.5 ha) than on large plots. Descriptively, this suggests that the inverse area-productivity relationship that is commonly observed in developing countries (Larson et al. 2014; Wineman and Jayne 2017) would be more intense when measured with yield 4, as compared to yield 2. For both maize and beans, the average difference in yield estimates between yield 4 and yield 2 is slightly smaller for plots on which men are involved in decision-making, as compared with those on which only women are involved. However, this difference in the average difference is never statistically significant. This suggests that any detected gender gap in crop production would likely be unaffected by the alternative methods used to measure yield.
6 Econometric results
We now apply an econometric analysis to understand whether the method applied for crop area estimation also influences our understanding of the most important correlates of crop yield. Equation (1) is used for each area method, in turn. Recall that the different area methods affect the dependent variable (yield), as well as the estimates for area planted and seed, fertilizer, manure, and labor intensities.
Results for maize yield are given in Table 8. An informal comparison of coefficients across columns suggests that the intensity of the inverse area-productivity relationship is greater (the magnitude of the coefficient is larger) for methods 3 and 4, as compared with methods 1 and 2. Thus, another ha planted is associated with a decrease in maize yield of 28.31 kg/ha (when using method 1) or 164.27 kg/ha (when using method 4). This is consistent with the descriptive results of Table 4. The relationship between seeding rate and yield also varies, depending on the method used. Thus, method 3 indicates that another kg of maize seed/ha is associated with a yield increase of 9.59 kg/ha, though this coefficient is much smaller (2.13) and not statistically significant in column 4. This could have implications for the optimal seeding rate suggested by agricultural extension agents. The coefficient on the use of improved seed varies considerably across equations (by up to 55%), and this may have implications for the estimated returns to maize seed intensification in this setting (Arslan et al. 2015; Komarek et al. 2018). Likewise, the relationship between fertilizer application rate and maize yield also varies with different area methods, a point which is likely to affect analyses of fertilizer profitability (e.g., Burke et al. 2017; Komarek et al. 2018; Liverpool-Tasie 2017; Theriault et al. 2018).
Using methods 1 and 2, which do not adjust for intercropping, it would appear that intercropping is negatively correlated with maize yield. However, once we account for the presence of multiple crops on the same plot, intercropping appears to be beneficial for yields. Along the same lines, the positive yield effect of having legumes present on the plot is not evident (in a statistically significant manner) until columns 3–4, when it becomes clear that planting maize alongside legumes is associated with higher yields on the order of 600 kg/ha. This is consistent with research on the maize yield benefits of intercropping with legumes (Snapp et al. 2010; Arslan et al. 2015; Droppelmann et al. 2017). The coefficient on having only women as plot managers is negative and statistically significant across all models, and this is consistent with other research that finds a gender productivity gap in agriculture (Peterman et al. 2011; Slavchevska 2015).Footnote 7
The same exercise is repeated for rice yield in Table 9. Again, the inverse relationship between area and yield emerges as least intense when using method 1. Again, intercropping initially appears to be detrimental to rice yield (columns 1–2) in a statistically significant manner until we account for the space taken up by other crops. Surprisingly, the use of improved rice seed is not statistically significantly associated with yield, although Wineman et al. (2019) suggest that farmers’ identification of improved varieties may not always be correct.
We next test for statistically significant differences in coefficients across different columns of Tables 8, 9. In a seemingly unrelated estimation of yield functions, we compare the coefficients from column 2 and column 4 (Table 10, top panel). The P-values reported indicate that, across these two models, the coefficients often differ for maize yield, though they rarely differ for rice. It seems clear that, for crops that are often grown on mixed plots, the method applied to estimate area under the crop will affect an analyst’s conclusions regarding the correlates of crop yield and especially the magnitude of these relationships. In the lower panel of Table 10, the same exercise is repeated with the coefficients from columns 3 and 4. In this exercise, we see far fewer small P-values, indicating fewer instances where the magnitude of the coefficients differs across models in a statistically significant manner. This suggests that even the relatively simplistic approach of method 3 may be adequate to discern the correlates of yield.
7 Conclusion
This paper is motivated by the question of whether construction choices for a very common metric of productivity—crop yield—could affect empirical analyses and thereby have policy and investment implications (Anderson et al. 2015). We examine several questions of relevance to policy makers and development partners. For example, when allocating scarce resources earmarked for agricultural development, which crop is more productive? Which region of the country is associated with the highest returns for a given crop? And what is the relationship between certain inputs and crop yield, with implications for economically sound strategies to promote crop production?
We generally find that the method used to assign area (the denominator in crop yield) to a given crop does affect the conclusions derived regarding yield patterns and correlates of higher yields. The most obvious difference is simply around the estimation of average yields, which increase once the space taken by other crops is deducted from the denominator. This is less relevant for rice, which is often grown on its own, than for maize. As a result, cross-crop comparisons of productivity will produce different “winners”, depending on how yield is measured. Another obvious consequence of the choice among area methods is the very different views that emerge regarding the aggregate emphasis on different crops, in terms of production areas. Specifically, the gap between areas devoted to maize versus rice production in Tanzania is much reduced when accounting for the mixed cropping arrangements that are most common for maize.
We further find that, without accounting for the space taken up by other crops on the plot, the anticipated benefits of intercropping and especially including legumes on a maize plot may not be evident. And while the statistical significance of various correlates of crop yield are often consistent, regardless of how crop area is captured, the magnitude of these relationships does shift. This may affect the extent to which a particular input, such as fertilizer used in maize production, appears to be profitable for crop-producing households. Interestingly, the results are fairly (though not perfectly) consistent when a crop’s area is estimated with method 3 and method 4. When detailed information is not available, it seems that the simpler approach—equal division of the plot area among crops present— may be acceptable for this type of basic analysis. Furthermore, the information used in method 3 likely represents a lighter reporting burden for respondents.
It should be noted that the method used to allocate plot area among crops is not the only challenge related to estimating crop yields from household surveys. The approach to plot area measurement can also shift yield estimates (Carletto et al. 2015b). In addition, because some planted area may be left unharvested for reasons that extend beyond production failures, such as theft, wildfire, or wildlife pests (Anderson et al. 2015), analysts may want to use area harvested rather than area planted in the denominator. Furthermore, there is some debate regarding how the numerator in crop yield (total quantity harvested) should be estimated. While attention has been given to the possibility that farmer estimates may be biased (Gourlay et al. 2017), other authors note that crop cuts may also be biased or produce results that are not representative of the entire plot (especially when planting densities are uneven), or may be inconsistent with what a farmer would consider to be harvestable (Diskin 1997; Fermont and Benson 2011; Sud et al. 2016). It can be especially difficult to estimate harvests from crops with an extended harvest period, such as cassava, or to identify the area claimed by crops that are produced more than once in a growing season. It is unclear how crop byproducts, such as the harvesting of pumpkin leaves in addition to the fruit produced, should be treated in yield estimates (Kelly et al. 1995). Finally, it is unclear how to aggregate portions of harvest that are of different forms because the harvests took place at different times during the season. For example, maize in Tanzania is often partly collected before the main harvest and consumed as “green maize”.
A final lesson of this paper is that readers should be somewhat cautious when comparing results of yield research, as different authors may make different construction decisions for estimating yield. This applies to all topics mentioned in the previous paragraph, in addition to decisions around how to account for the presence of multiple crops on a plot. Given our finding that construction decisions can influence the results of analysis, it follows that the literature would benefit from greater clarity regarding how yield is measured across studies.
Notes
The term “mixed cropping” refers to multiple crops being grown together, often with their seed mixed before being broadcast. In contrast, “intercropping” refers to multiple crops being cultivated in a definite pattern (Kelly et al. 1995; FAO 2017). In this paper, we use the terms “multi-cropping” or “mixed plots” to refer to multiple crops being found on the same plot, regardless of their arrangement.
Note that mixed cropping / intercropping may not be relevant for all crops. In particular, irrigated rice production may not be conducive to intercropping. When all studies of rice yield are removed from the analysis, we are left with 29 papers, of which 27 (93.10%) do not specify how they estimate the area under the crop being studied.
In some other surveys, the definition of an agricultural “plot” refers to an area of uniform crop management, such as monocrop or an intercrop regime that is uniform throughout the plot area. Unlike in the Tanzania NPS, respondents in the Uganda LSMS-ISA report a continuous percent of plot area allocated to each crop, and permanent crops are treated as any other crop when estimating areas. Thus, the challenges associated with measuring crop yield are somewhat unique to each survey.
Specifically, we divide plot size estimates into “units”, such as one-quarter acre, one-half acre, or five acres. Thus, a plot that is listed as being 0.75 acres is understood to be comprised of one and a half one-half acre units. The median size of each unit is captured using the GPS measures of other plots, and we apply the median value from the smallest local geographic unit for which 10 measurements have been taken. If a plot is estimated as 0.75 acres, and the median measured value of a one-half acre unit in the same enumeration area is 0.3 acres, then the plot size is imputed as 0.45 acres.
Although nonlinear production functions are often used, a simple, linear model is preferred here for clarity and for purposes of illustration (consistent with Liverpool-Tasie (2017)).
Note that crop yield observations are specific to seed type, such that if two seed types for the same crop are grown on the same plot, it is treated as two observations.
A robustness check using a Cobb-Douglas production function leads to a similar conclusion that the approach to estimating area can sometimes influence the input elasticities in a statistically significant manner.
References
Anderson, C. L., Reynolds, T., Biscaye, P., Harris, K. P., Slakie, E., & Merfeld, J. (2015). How common crop yield measures misrepresent productivity among smallholder farmers. Technical Report No. 303. Seattle: Evans Policy Analysis & Research Group at the University of Washington.
Arslan, A., McCarthy, N., Lipper, L., Asfaw, S., Cattanei, A., & Kokwe, M. (2015). Climate smart agriculture? Assessing the adaptation implications in Zambia. Journal of Agricultural Economics, 66(3), 753–780.
Burke, W. J., Jayne, T. S., & Black, R. (2017). Factors explaining the low and variable profitability of fertilizer application to maize in Zambia. Agricultural Economics, 48, 115–126.
Carletto, G., Beegle, K., Himelein, K., Kilic, T., Murray, S., Oseni, M., Scott, K., & Steele, D. (2010). Improving the availability, quality and policy relevance of agricultural data: The Living Standards Measurement Survey—Integrated Surveys on Agriculture. Third Wye City Group Global Conference on Agricultural and Rural Household Statistics. http://www.fao.org/fileadmin/templates/ess/pages/rural/wye_city_group/2010/May/WYE_2010.2.1_Carletto.pdf. Accessed 19 September 2018.
Carletto, G., Jolliffe, D., & Banerjee, R. (2015a). From tragedy to renaissance: Improving agricultural data for better policies. Policy Research Working Paper No. 7150. Washington, D.C.: World Bank Group.
Carletto, C., Gourlay, S., & Winters, P. (2015b). From guesstimates to GPStimates: Land area measurement and implications for agricultural analysis. Journal of African Economies, 24(5), 1–36.
Dakora, F. D., & Keya, S. O. (1997). Contribution of legume nitrogen fixation to sustainable agriculture in sub-Saharan Africa. Soil Biology and Biochemistry, 29(5–6), 809–817.
de Groote, H., & Traore, O. (2005). The cost of accuracy in crop area estimation. Agricultural Systems, 84, 21–38.
Diskin, P. (1997). Agricultural Productivity Indicators Measurement Guide. Food and Nutrition Technical Assistance Project. Washington, D.C.: Academy for Educational Development.
Droppelmann, K. J., Snapp, S. S., & Waddington, S. R. (2017). Sustainable intensification options for smallholder maize-based farming systems in sub-Saharan Africa. Food Security, 9(1), 133–150.
Fermont, A., & Benson, T. (2011). Estimating yield of food crops grown by smallholder farmers. IFPRI Discussion Paper No. 01097. Washington, D.C.: International Food Policy Research Institute.
Food and Agriculture Organization of the United Nations (FAO). (2010). Global Strategy to Improve Agricultural & Rural Statistics. Report No. 56719-GLB. Rome: The World Bank, FAO, and United Nations.
Food and Agriculture Organization of the United Nations (FAO). (2017). Methodology for Estimation of Crop Area and Crop Yield under Mixed and Continuous Cropping. Improving Agricultural & Rural Statistics Global Strategy Technical Report No. GO-21-2017. The World Bank, FAO, and United Nations: Rome.
Food and Agriculture Organization of the United Nations (FAO). (2019). FAOSTAT database. FAO: Rome. Accessed April 17, 2019 at http://www.fao.org/faostat/en/#data.
Global Yield Gap Atlas (GYGA). (2018). Food security analysis: from local to global. http://www.yieldgap.org/web/guest/home. Accessed 19 September 2018.
Gourlay, S., Kilic, T., & Lobell, D. (2017). Could the debate be over? Errors in farmer-reported production and their implications for the inverse scale-productivity relationship in Uganda. Paper prepared for the CSAE Conference on Economic Development in Africa, March 19–21. Oxford: Center for the Study of Africa Economies.
Himmelstein, J., Ares, A., Gallagher, D., & Myers, J. (2017). A meta-analysis of intercropping in Africa: impacts on crop yield, farmer income, and integrated pest management effects. International Journal of Agricultural Sustainability, 15(1), 1–10.
Jayne, T. S., Mather, D., Mason, N., & Ricker-Gilbert, J. (2013). How do fertilizer subsidy programs affect total fertilizer use in sub-Saharan Africa? Crowding out, diversion, and benefit/cost assessments. Agricultural Economics, 44, 687–703.
Kelly, V., Hopkins, J., Reardon, T., & Crawford, E. (1995). Improving the measurement and analysis of African agricultural productivity: Promoting complementarities between micro and macro data. International Development Paper No. 16. East Lansing: Michigan State University.
Komarek, A. M., Koo, J., Wood-Sichra, U., & You, L. (2018). Spatially-explicit effects of seed and fertilizer intensification for maize in Tanzania. Land Use Policy, 78, 158–165.
Larson, D. F., Otsuka, K., Matsumoto, T., & Kilic, T. (2014). Should African rural development strategies depend on smallholder farms? An exploration of the inverse-productivity hypothesis. Agricultural Economics, 45, 355–367.
Liverpool-Tasie, S. L. O. (2017). Is fertilizer use inconsistent with expected profit maximization in sub-Saharan Africa? Evidence from Nigeria. Journal of Agricultural Economics, 68(1), 22–44.
Maredia, M. K., & Raitzer, D. A. (2010). Estimating overall returns to international agricultural research in Africa through benefit-cost analysis: a ‘best-evidence’ approach. Agricultural Economics, 41(1), 81–100.
National Bureau of Statistics (NBS). (2011). Technical and Operation Report, Agriculture Sample Census Survey 2007/08. Dar es Salaam: NBS.
National Bureau of Statistics (NBS). (2017). Enumerator Manual, National Panel Survey 2014–15. Dar es Salaam: NBS.
Peterman, A., Quisumbing, A. R., Behrman, J. A., & Nkonya, E. (2011). Understanding gender differences in agricultural productivity in Uganda and Nigeria. Journal of Development Studies, 47(10), 1482–1509.
Perez, N. D., & Rosegrant, M. W. (2015). The impact of investment in agricultural research and development and agricultural productivity. IFPRI Discussion Paper No. 01447. Washington, D.C.: International Food Policy Research Institute.
Rowhani, P., Lobell, D., Linderman, M., & Ramankutty, N. (2011). Climate variability and crop production in Tanzania. Agricultural and Forest Meteorology, 151(4), 449–460.
Slavchevska, V. (2015). Gender differences in agricultural productivity: The case of Tanzania. Agricultural Economics, 46, 335–355.
Snapp, S. S., Blackie, M. J., Gilbert, R. A., Bezner-Kerr, R., Kanyana-Phiri, G. Y., & Kates, R. W. (2010). Biodiversity can support a greener revolution in Africa. Proceedings of the National Academy of Sciences, 107(48), 20840–20845.
Sud, U. C., Ahmad, T., Gupta, V. K., Chandra, H., Sahoo, P. M., Aditya, K., Singh, M., Biswas, A., & ICAR-Indian Agricultural Statistics Research Institute. (2016). Synthesis of Literature and Framework for Research on Improving Methods for Estimating Crop Area, Yield and Production under Mixed, Repeated and Continuous Cropping. Working Paper No. 5. Global Strategy for Improving Agricultural and Rural Statistics. Rome: FAO.
Theriault, V., Smale, M., & Haider, H. (2018). Economic incentives to use fertilizer on maize under differing agro-ecological conditions in Burkina Faso. Food Security, 10(5), 1263–1277.
Wineman, A., and Jayne, T. S. (2017). Factor market activity and the inverse farm size-productivity relationship in Tanzania. Research Paper No. 79. Michigan State University Innovation Lab for Food Security Policy: East Lansing, MI.
Wineman, A., T. Njagi, C. L. Anderson, T. Reynolds, D. Alia, P. Wainaina, E. Njue, P. Biscaye, and M. Ayieko. (2019). A case of mistaken identity? Measuring rates of improved seed adoption in Tanzania using DNA fingerprinting. Forthcoming in the Journal of Agricultural Economics.
Wu Leung, W., Busson, F., & Jardin, C. (1968). Food Composition Table for Use in Africa. Bethesda, MD and Rome: U.S. Dept. of Health, Education, and Welfare and the Food and Agriculture Organization of the United Nations.
Acknowledgements
The authors would like to thank Didier Yélognissè Alia of the University of Washington for helpful feedback on this paper. The graduate student researchers of the Evans Policy Analysis & Research Group (EPAR), based at the Daniel J. Evans School of Public Policy and Governance at the University of Washington, also provided constructive comments. The views expressed in this study are those of the authors only.
Funding
This work was supported by The Bill and Melinda Gates Foundation [OPP1135685]. The funders had no role in study design, data analysis, decision to publish, or preparation of the manuscript.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of Interest
The authors declared that they have no conflict of interest.
Rights and permissions
Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
About this article
Cite this article
Wineman, A., Anderson, C.L., Reynolds, T.W. et al. Methods of crop yield measurement on multi-cropped plots: Examples from Tanzania. Food Sec. 11, 1257–1273 (2019). https://doi.org/10.1007/s12571-019-00980-5
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s12571-019-00980-5
Keywords
- Agricultural systems
- Crop yield
- Measurement
- Tanzania