Introduction

In many developing countries, dairy provides a regular income, as well as an affordable source of protein. Economic growth and urbanization have fueled demand for animal products (Delgado et al. 1999; FAO 2019). Thus dairy faces a growing market in developing countries and is often associated with "pro-poor growth" (Ravallion and Chen 2003) and a candidate for value chain development action (Swinnen and Maertens 2007; Springer-Heinz 2018, p. 4). Dairy is a rapidly growing industry in Tanzania, and smallholders dominate production. Smallholders’ milk is thought to primarily be consumed on-farm or sold within communities (Makoni et al. 2013). Variations in the volumes and shares of smallholder production that are sold through the informal market are known to occur across regions and between seasons, but some 90% followed this channel at the last national-level review (National Bureau of Statistics 2003). A more recent study by Niras (2010) estimates that some 98% of milk is produced in "traditional" farming systems and that 67.5% of the milk is marketed through informal market channels to urban and peri-urban areas. The formal processing sector handles about 3% of national production (Makoni et al. 2013).

Maintaining and growing the dairy industry in Tanzania requires milk in higher volumes, and dairy cow productivity is vital in this endeavor. Constraints on the development of the Tanzanian dairy industry do, however, extend into numerous spheres. They have been summarized (Shapiro 2017) as being due to, among other things, poor animal nutrition, breeding and health, inadequate financial and credit facilities, poorly organized milk collection and distribution, scarce processing milk handling facilities, difficulties in securing milk supply, and inadequate support services. Research work with Tanzanian dairy specifically on constraints identified land, water, and labor as fundamental constraints (Baker et al. 2015), and this reinforced the importance of improved dairy productivity in Tanzania.

Meeting the changing needs of poor milk producing households is increasingly associated with commercialization. Commercialization is usually defined as a form of organization around production that is targeted at markets, rather than sales of an unplanned or episodic surplus over subsistence needs (Pingali and Rosegrant 1995; Pingali 1997). In practical terms, it relates to producers’ orientation to, access to, and utilization of product and input markets (Gebremedhin and Jaleta 2010). Although, in general, commercialization might be viewed as an alternative to subsistence production, much development effort centers on achieving a transition between these two states (von Braun 1995). A broader view of commercialization might extend to off-farm income as a source of funds for inputs, and to a reorganization of the smallholder household in line with skills and commercial predispositions.

The effect of the scale of operation on milk-producing households’ welfare remains unclear: Gelan and Muriithi (2015) identified no scale-related benefits to producers from increasing herd size, while Twine and Katjiuongua (2015) project substantial aggregate social benefit from scale effects. For Tanzanian dairy, there are interactions between scale, milk volume, and marketing (more milk produced throughout the year attracts buyers’ attention), and scale of operation reduces recurrent and fixed costs per unit of milk. A potentially virtuous circle can operate where the resulting higher margins generate cash for the purchase of inputs and longer-term investment, such as genetic improvement: these, in turn, boost productivity. Group action by farmers in East African dairy settings mimics scale effects to reduce unit costs (including transaction costs) and facilitate technology uptake and training (Rao et al. 2016) but do not necessarily facilitate commercial linkages to producers nor increased productivity (Omondi et al. 2017).

The current study identifies the influences on milk-producing farms dairy productivity as a guide to more effective investment in dairy development. It employs data from a random sample of milk-producing households across a spectrum spanning pre-commercial through to commercial production in four districts of Tanzania’s Morogoro and Tanga regions. Stochastic Frontier Analysis (SFA) is used to derive a statistical measure of technical efficiency (a “frontier”) and identify efficiency drivers. The frontier represents a best case, and points lying within the frontier reveal their inefficiencies as a “distance” within the frontier (Otieno et al. 2012; Bahta et al. 2015). Investigation of the determinants of technical efficiency then enables policy advocacy for approaches to enhance dairy production efficiency and inform investments in Tanzanian dairy production and value chains. In particular, we identify relationships between commercial behavior and its precursors and efficiency.

The current paper contributes to the understanding of the nature and role of commercialization in productivity and pro-poor development. Subsequent parts of the paper are organized as follows: “Materials and Methods” section describes the methodological approach, which includes the empirical estimation followed in the study and the descriptive analysis of the study area and data. “Results and Discussion” section discusses the results, and “Conclusion and Recommendations” section offers conclusions.

Materials and Methods

Empirical Measurement and Analysis of Efficiency

Measurement of farmers’ technical efficiency has long been a research preoccupation with agricultural economists. Commonly, empirical results indicate that large proportions of sample farms fall well within the efficient frontier locus (Laha 2014). Such measurement of efficiency requires estimation of an efficient production function, and this can be achieved by non-parametric and parametric methods (Latruffe 2010).

The non-parametric approach uses mathematical programming techniques, of which the most widely used is data envelopment analysis (DEA) (Mester 2003). DEA has the desirable empirical property of imposing neither a specified functional form nor assumptions about the nature of the residual terms from regression analysis (Tran 2009). However, its limitations include non-inclusion of prices, which means that it accounts only for technical inefficiency in the form of excessive inputs use or producing too few outputs (Mester 2003). DEA is also limited by its implicit assumption of the absence of random errors. It uses a two-stage estimation procedure where the production (or profit) function is estimated to derive efficiency scores as the first stage. In the second stage, the derived efficiency scores are used as explanatory variables in a profit function to be estimated econometrically. Further, DEA ignores a management-related issue in that the firm's input choices are potentially affected by that firm's knowledge of its current level of technical efficiency (Chirwa 2007). Wang and Schmidt (2002) extend this reasoning in a critique of the two-stage estimation process. Parametric analysis of efficiency uses econometric techniques to estimate a frontier function. SFA (Aigner et al. 1977) is the most commonly used parametric approach, and it requires the specification of a functional form for the frontier production function.

The primary point of difference between the two approaches lies in the nature of the assumptions made in identifying the frontier, and the assumptions about distributions of the error terms (Hyuha 2006). The trade-off between using SFA and DEA relates then to the consideration of biases arising from errors in specification on one hand and measurement on the other (Erkoc 2012). In the context of developing countries, SFA is prominent due to its accommodation of data quality and accuracy issues provided that misspecification is not serious (Coelli 1995). Hence, the major advantage of SFA lies in its (single step) estimation procedure and its accommodation of measurement error and other noise in the data (Kolawole 2006).

SFA entails the simultaneous estimation of the frontier production function and an inefficiency model. Usefully, inefficiency effects are specified as a function of other variables (Chirwa 2007). It effectively specifies the relationship between output and input levels and decomposes the error term into separate components representing random errors and inefficiency. By taking into account some management and policy-relevant variables on both the internal (farm management) and external (value chain and service) environments, the current study addresses steps toward commercialization.

The stochastic production frontier analysis begins with specifying a log-linear production function as follows:

$$Y_{i} = \alpha + x^{\prime}_{i} \beta + \varepsilon_{i}$$
(1)
$$\varepsilon_{i} = v_{i} - u_{i}$$
(2)

where \(Y_{i}\) represents the natural logarithm of the observed output of the \(i{\text{th}}\) household, \(x_{i} { }\) is a vector of the natural logarithms of the N inputs for the \(i{\text{th}}\) household, and \(\beta\) is a vector of unknown parameters. The error term \(\varepsilon_{i}\) has two components, \(u_{i}\) and \(v_{i}\). \(u_{i}\) is a non-negative random variable measuring inefficiency, and \(v_{i}\) is a stochastic disturbance term assumed to be independently and identically distributed as \(N\left( {0,{ }\sigma_{v}^{2} } \right)\) across observations.

The probability density of \(Y_{i}\) in Eq. (1) is revealed by computing the joint density of \(\varepsilon_{i}\). Following Greene (2008),

$$f_{\varepsilon , u} \left( {\varepsilon_{i} , u_{i} } \right) = f_{ u} \left( {u_{i} } \right)f_{ v} \left( {\varepsilon_{i} + u_{i} } \right)$$
(3)

Integrating Eq. (3) with respect to \(u_{i} { }\) then gives the marginal density of \({ }\varepsilon_{{\text{i}}}\). This measures the contribution of observation \(i\) to the log-likelihood (ibid).

$$\ln L_{ i} (\alpha , \beta , \sigma_{v}^{2} , \sigma_{u}^{2} |Y_{i, } , X_{i} ) = \ln f_{\varepsilon } (Y_{i} - \alpha - \beta X_{i} \alpha , \beta | \sigma_{v}^{2} , \sigma_{u}^{2} )$$
(4)

The random variable, which is also the inefficiency term \(u_{i}\), can take a variety of distributions: exponential (Meeusen and van den Broeck 1977), half-normal (Aigner et al. 1977), truncated-normal (Stevenson 1980) as well as gamma (Greene 2003). The half-normal distribution is the most commonly used specification in cross-section studies such as ours (Coelli 1995; Bravo-Ureta and Pinheiro 1993; Bauer 1990). Hence, we assume that \(u_{i}\) in Eq. (4) is independently half normally \(N^{ + } \left( {{ }\mu_{i} ,{ }\sigma_{u}^{2} } \right)\) distributed. Logic then enables heteroscedasticity in \(u_{i}\) to be modeled as a function of explanatory variables that may cause inefficiency. Such variables are introduced as follows (Kumbhakar and Hjalmarsson 1991; Huang and Liu 1994):

$$\mu_{i} = z^{\prime}_{i} \eta$$
(5)

where \(\mu_{i}\) is the mean of a pre-truncated inefficiency distribution, \({\text{z}}_{{\text{i}}}\) is a vector of household-specific explanatory variables that affect household-level inefficiency, and \(\eta { }\) is an unknown vector of coefficients to be estimated.

The estimable log-likelihood half-normal specification, with the required heterogeneity in the inefficiency term, will have the following form (Greene 2008):

$$\ln L_{ } \left( {\alpha , \beta , \sigma , \lambda , \eta } \right) = - N\left[ {\ln \sigma + \frac{1}{2}\ln 2\pi + \ln \Phi \left( {\mu_{i} /\sigma_{u} } \right)} \right] + \mathop \sum \limits_{i = 1}^{N} \left[ { - \frac{1}{2} \left( {\frac{{\varepsilon_{i} + \mu_{i} }}{\sigma }} \right)^{2} + \ln \Phi \left( {\frac{{\mu_{i} }}{\sigma \lambda } - \frac{{\varepsilon_{i} \lambda }}{\sigma }} \right)} \right]^{^{\prime}}$$
(6)

where \(\lambda = \sigma_{u} /\sigma_{v}\), \(\sigma^{2}\) = \(\sigma_{u}^{2} + \sigma_{v}^{2}\), \(\sigma_{u} = \lambda \sigma /\sqrt {1 + \lambda^{2} }\), and \(\varepsilon_{i}\) = \(Y_{i} - \alpha - x_{i}^{^{\prime}} \beta\).

The log-likelihood function in Eq. (6) can then be estimated using Stata (Stata Inc. 2013).

Finally, the technical efficiency (TE) of an individual household can be calculated using the following formula (Jondrow et al. 1982):

$$TE_{i} = E\left\{ {\exp \left( { - u_{i} } \right)|\varepsilon_{i} } \right\} = E\left\{ {\frac{{ 1 - \Phi \left( {\sigma_{*} - \mu_{*i/} \sigma_{*} } \right)}}{{1 - \Phi \left( { - \mu_{*i/} \sigma_{*} } \right)}}} \right\}\exp \left( { - \mu_{*i} + \frac{1}{2}\sigma_{*}^{2} } \right)$$
(7)

where \(\mu_{*i}\) = \(\varepsilon_{i} \sigma_{u}^{2} /\sigma^{2}\) and \(\sigma_{*}^{2} = \sigma_{u}^{2} \sigma_{v}^{2} /\sigma^{2}\)

Within this framework, the TE of farms ranges from 0 to 1. Best practice farms score close to 1, and the least efficient close to zero.

Empirical Model

Although the Cobb–Douglas production function follows some restrictive assumptions such as perfect competition in the factor and product markets and constant return to scale (Murthy 2002), the study adopted the functional form after testing for appropriate functional form.Footnote 1 Following Russell and Young (1993), we also assume that functional specification has a discernable but rather small impact on efficiency.

Thus, we employ the semi-log-linear Cobb–Douglas production function as the empirical version of the stochastic frontier production model. Among other useful and convenient properties, this functional form allows direct derivation and interpretation of coefficients as elasticities.

The production function model is specified as follows:

$$\ln {\text{TOTM}}_{i} = \beta_{0} + \beta_{1} x_{i} + \cdots + \beta_{n} x_{n} + \varepsilon_{i}$$
(8)

where \({\text{TOTM}}_{i}\) is total annual milk production by the \(i{\text{th}}\) household during the year 2013 production seasonFootnote 2 in liters; \(x_{n}\) is the natural logarithm of each of N inputs (total cattle, and the number of cows, as measures of scale of operation, percent of cattle that are crossbred as a measure of innovation behavior, as well as input use such as veterinary expenses and feed expenses) considered for the model. \(\beta_{i}\) are unknown coefficients to be estimated and \(\varepsilon_{i}\) is the compound error term specified in Eq. (2).

The following model (Eq. 9) is specified to capture the possible effects of the exogenous variables that affect technical inefficiency.

$$\mu_{i} = \eta_{0} + \eta_{1} z_{1} \ldots \eta_{n} z_{n} + \omega_{i}$$
(9)

where \(\eta_{i}\) are unknown coefficients of the inefficiency effect to be estimated corresponding to each exogenous variable \(z_{n}\), and \(\omega_{i}\) is a stochastic error term that captures the effect of unaccounted-for household-specific variables. The exogenous variables considered for model 1 include indicators of commercial behavior such as credit use, capacity for purchasing—as opposed to producing—additional inputs by utilizing non-cattle income, marketing group membership for inputs and outputs, use of female labor, and participation in training. In model 2, we have introduced a commercialization indicator, market participation variable as motivated in Tirkaso and Hess (2018). Following Wang and Schmidt (2002), Eqs. (8) and (9) are estimated simultaneously.

Data and Descriptive Analysis

Data and Study Area

This paper uses farm-level cross-sectional survey data collected under the auspices of a research project that tested dairy interventions.Footnote 3 The project assembled detailed information on costs of and returns to livestock production, along with selected technical, physical, demographic, and value chain-related variables for farm household operations across groups representing different farm types.

The survey was undertaken in two milk-producing regions of Tanzania—Morogoro, and Tanga—in 2012 and 2013. The specific study sites (Mvomero and Kilosa in Morogoro and Handeni and Lushoto in Tanga) provided suitable variation in the level of commercialization, with selection based on spatial maps' overlaysFootnote 4 of socio-economic data (human population and poverty, consumption levels, and market access), livestock density and profile (% of herd held as milking cows, numbers of exotic breeds), and types of livestock production (particularly investment in improved feeding and animal health) and marketing (such as marketing group membership) system types (Fig. 1).

Table 1 Descriptive statistics of sample characteristics

The Mvomero district which comprises four divisions (Turiani, Mvomero, Mlali, and Mgeta) and has a total cattle population of 187,350, of which 5% (9314 cows) are of improved dairy genetic makeup. Agro-pastoralists keep the majority of the remaining 178,036 indigenous cattle in extensive systems. Kilosa district has a total cattle population of 215,100 of just which 1% (2405) is improved dairy. Similarly, in Mvomero, the majority of the 210,627 indigenous cattle are kept by agro-pastoralists, including the Maasai ethnic group.

For the Tanga region, the Handeni district area is mainly inhabited by the ethnic Zigua tribe who are both arable farmers and sedentary livestock keepers. The total cattle population is 126,780, of which 1% (1045) are improved dairy. Again, the majority of the 124,735 indigenous cattle are kept by agro-pastoralists in extensive systems. Finally, the Lushoto district which is situated in the northern part of Tanga at the foot of the western Usambara Mountains. Lushoto has 119,492 cattle, of which 24% (29,200) are improved cattle. Lushoto’s improved cattle are found mostly in the highland areas where 65% of households own cattle.

Descriptive Analysis

Table 1 presents selected farm household characteristics of the sample of 469 households used for the current study. These characteristics reflect the substantial variation in the factors likely to affect productivity, as well as in elements of commercialization.

The sample's typical milk-producing farm household has about 7.1 persons (minimum is one person, and maximum is 20 person per household), which is slightly higher than the national average of about 6 (Katjiuongua and Nelgen 2014), and owns about 16 acres of cropland, of which 2.2 acres are used for growing improved pasture. The maximum land size and pasture land observed are 300 ha and 50 ha, respectively.

Our average farm household owns 35 cattle, of which about 40% (12.7 in number) are cows, and 19% are crossbred. The ownership of cattle and breed composition ranges are from 5–1600 to 0–100%, respectively. These cows, on average 12.7 per household, produce annually about 576 L of milk of which some is sold and delivered to a buyer or milk collection point, situated on average about 1.45 km away.

The average commercialization index, calculated as the ratio of total milk sales (fresh and fermented) to the total milk production, is 30%. This indicates that, on average, about 30% of the milk produced is sold to both formal and informal markets. Table 1 also shows only about 48% of the milk producers are market participants or have a commercialization index of more than zero.

The average total veterinary and breeding expenditure is 354,221 Tanzanian Shillings (TZS) (222.8 USD), of which de-ticking followed by treatment expenditures is the largest item. The average cost for artificial insemination (AI) services is about 606.6 TZS, while the feed costs that milk producers in the study area incur is on average, 35,090.8 TZS. The maximum total veterinary and breeding expenditure is 34,500,000 TZS.

Table 1 also shows that most farm work is conducted by family labor. A typical milk producer farm household in the study area works about 90 h per week and uses hired labor only for about one day (9 h) per week. The male household members work on the farm on average for 57.8 h per week and female household members 42.2 h per week. Some members of the farm household also engage in non-cattle activities and earn, together as a household, on average, about 4,140,560 TZS (2605 USD) per annum.

Results and Discussion

Production Function Estimates

Results from the SFA are shown in Table 2. The estimated value of γ for both models (gamma = 0.011 and gamma = 0.018, for models 1 and 2, respectively) allows us to reject the hypothesis (H0: γ = 0) of no significant technical inefficiency among Tanzanian milk producers. We also observe that the variation in technical inefficiency among milk producers in Tanzania occurs due to both farmer inefficiency and exogenous factors outside the farmers’ control, rather than from random variation.

Table 2 Results of the stochastic frontier analysis

The strong statistical significance of Sigma-squared (σ2) on both models indicates that our technical efficiency analysis can explain the differences between farms’ technical efficiency and the technical efficiency locus (points on the frontier). Moreover, the significance of σ2 confirms that the frontier model is stochastic rather than deterministic.

As shown in Table 2, the LR testFootnote 5 for models 1 and 2 are 15.2 and 24.6, respectively, with p < 0.05. We thus fail to reject the null hypothesis that the SFA model provides a better specification of the sample data than does our model. This test draws on Table 1 of Kodde and Palm (1986) for a comparison of critical values with the degree of freedom equal to 7 (k = δ + 1). Thus, the null hypothesis is firmly rejected, leading to the acceptance of the full model, which incorporates production inefficiency effects and extends to variables associated with the observed range of commercialization.

All the estimated coefficients for first-order explanatory variables (mostly input variables) have positive signs. They are statistically significant in both models (Table 2), suggesting that increased usage of these inputs would yield more output as postulated by theory and assuming that producers are rational (Coelli et al. 2005). That is, an increase in the number of cattle (a scale effect), cows and crossbreeds (indicators of commercialization), and additional veterinary and feed input usage (commercial behavior), increase productivity until the stage of diminishing marginal returns where marginal output starts to decrease with every additional unit of the inputs.

Determinants of Technical Efficiency

Table 2 presents the determinants of technical inefficiency. It should be noted that in the stochastic frontier estimation, the parameter for inefficiency level usually enters the inefficiency effects component of the model (second stage) as the dependent variable. Therefore, a negative sign on the coefficient variable is interpreted as a positive influence on efficiency (i.e., a reduction in inefficiency). Conversely, a positive coefficient is interpreted as a negative influence on efficiency (an increase in inefficiency) (Brummer and Loy 2000; Coelli et al. 2005).

Results for the estimated inefficiency effects in the stochastic frontier of both models show (see the lower part of Table 2) that many of the technical inefficiency explanatory variables are statistically significant and have the expected signs, except for non-cattle income. Hence training (Model 1) as well as commercialization index (as shown in Model 2) and other indicators of commercialization such as credit access and group membership (model 1) and use of female household labor (a commercial venture as outlined below, on both models) would significantly improve efficiency (reduce inefficiency). In contrast, increases in non-cattle income would reduce efficiency (increase inefficiency).

The finding that access to credit positively influences technical efficiency (model 1) is intuitively linked to the utilization of agricultural inputs and supports earlier results from Brummer and Loy (2000) and Obwona (2000). Group membership, a form of collective action, is likely to reduce transaction costs and hence enhance market access. Gyau et al. (2014) argue on this and extended to include group activities such as training in production methods, negotiation skills, product grading, and sorting and group dynamics, which subsequently enhance bulk marketing of products by members, exploit economies of scale. The effect on efficiency is confirmed in Table 2, which reveals that milk producers that are a member of a group or organization are technically more efficient than those who are not. This finding is in line with Twine et al. (2019), which shows group membership increases the probability of borrowing and the number of funds borrowed by milk-producing households in Tanzania. The group membership variable is highly correlated with the commercialization index dummy variable; hence it was dropped in model 2. Since many of the households (> 50%) have zero commercialization index (Table 1), following (Tirkaso. and Hess 2018) a dummy variable for market participation (1 = sellers, 0 = non-sellers) is used. The result in Model 2 can be interpreted as those milk producers who are highly commercially oriented, sell some of their milk output, are technically more efficient than those who do not sell or non-participants. Further examination of the commercialization variable (Table 3) confirms that about 29.6% of the milk sellers or market participants scored more than 95% technical efficiency. A closer look at the technical efficiency scores of milk sellers and non-sellers also (Fig. 2) shows that a more significant proportion of sellers scored more than 80% technical efficiency. In comparison, a significant proportion of the non-sellers have a technical efficiency in the rage of 55–80%. The finding suggests the importance of increasing milk market participation by farmers in order to improve productivity, supporting the argument that commercialization improves farms’ productivity by increasing their income and access to healthy and nutritious food (Pingali and Rosegrant 1995). Our finding is consistent with existing literature; for example, Tirkaso and Hess (2018) found that farmers with a higher degree of commercialization were technically more efficient compared to those with lower market participation.

Fig. 1
figure 1

Map of selected field sites in Tanzania indicating the locations of selected villages for piloting dairy market hubs. Key: R-to-R: Rural production to rural consumption (pre-commercial), R-to-U: Rural production to urban consumption (more commercial)

Fig. 2
figure 2

Technical efficiency category (55–80% and greater than 80%) of milk sellers and non-sellers

Table 3 Characteristics of dairy-producing households in Tanzania by technical efficiency class that are significantly different at p < 0.10

Farmers’ training is also found to be a significant factor (Model 1) affecting technical efficiency. Examination of the data confirms that more than 88% of the farmers who received training scored more than 65% technical efficiency (Table 3). Despite women spending less of their labor time on animal production and husbandry than do men, the results in Table 2 showed that the time spent by female household labor is a determining factor of technical efficiency in milk producing farms. This finding is in line with Covarrubias et al. (2012), who found that women's participation in livestock activities in Tanzania tends to be associated with selling milk.

Further, consistent with this finding, a baseline study of 634 cattle-keeping households in Tanzania (Katjiuongua and Nelgen 2014)Footnote 6 Showed that women report more significant participation in the sale of milk than do men, and allocate more labor to selling milk than do men in Kilosa, Handeni, and Mvomero districts.

Non-cattle income, which includes off-farm income, has a detrimental impact on the technical efficiency of milk-producing households in Tanzania. It was expected that spending time off the farm might improve farmer's production and management capability through the acquisition of information and knowledge, and hence positively influence farm performance (Bahta and Malope 2014). Further, off-farm work can assist in accumulating capital, which, when invested on the farm, can increase efficiency.

Otieno et al. (2012) and Bahta and Malope (2014) found a positive relationship between off-farm income and efficiency, and argue that this is evidence of re-investment of off-farm earnings into farm production, as a commercial venture. Perhaps in the case of milk producers of Tanzania, the unexpected influence of non-cattle income on efficiency might be attributed to, as noted by Rakipova et al. (2003) and Bahta et al. (2015), the time spent doing off-farm work which reduces time spent on efficiency-improving managerial activities.

Figure 3 and 4 shows the kernel density distribution of the technical efficiency by district and for all sample, respectively.

Fig. 3
figure 3

Kernel density of technical efficiency scores by Districts

Fig. 4
figure 4

Kernel density of technical efficiency, overall average TE = 80%

In general, milk-producing farms in Kilosa are more technically efficient than a farm in other districts. This also implies that milk-producing farms in Kilosa are relatively more homogeneous than are other subsamples. Figures 5, 6, 7, and 8 show the proportion of milk producers by district and technical efficiency (TE) class.

Fig. 5
figure 5

District 1 (Kilosa, N = 63; cumulative proportion of sampled dairy farmers above 90% technical efficiency = 27%; modal class = 70–80%)

Fig. 6
figure 6

District 2 (Mvomero, N = 192; cumulative proportion of sampled dairy farmers above 90% technical efficiency = 18%; modal class = 70–80%)

Fig. 7
figure 7

District 3 (Lushoto, N = 121; cumulative proportion of sampled dairy farmers above 90% technical efficiency = 24%; modal class = 70–80%)

Fig. 8
figure 8

District 4 (Handeni, N = 93; cumulative proportion of sampled dairy farmers above 90% technical efficiency = 22%; modal class = 80–90%)

Similarly, the proportion of milk producers scoring more than 90% TE is relatively higher in Kilosa (27%) and Lushoto (24%). The modal technical efficiency class is higher, 80 to 90%, in Handeni and the same range (70–80%) for the other districts (Kilosa, Mvomero, and Lushoto).

The results in Table 3 present the characteristics of milk-producing households in Tanzania, subdivided by TE class. The F statistics show that milk-producing households falling into the three technical efficiency classesFootnote 7 have significantly different characteristics in terms of their ownership of small ruminants, their breeding and veterinary expenditure, their training, and their technical efficiency scores. Farmers who additionally engage in other agricultural activities such as small stock production can, when necessary, access cash quickly from their sales (Bahta et al. 2015), and this cash could be mobilized to cover farm input expenses.

This assertion is strengthened by the finding in Table 3, which shows a four times higher ownership of small ruminants by farms with technical efficiency scores above 65%. The average number of small ruminants owned by the three technical efficiency classes are significantly different and relatively higher (1.94 TLU) for the milk-producing farms belonging to TE class 2 (TE ≥ 65% and < 95%) followed by class 3 (TE ≥ 95%), who own on average about 1.81 TLU small ruminants.

In line with the findings in Table 2, the total veterinary and breeding expenditure, which is an indicator of the magnitude of veterinary inputs used and breeding technologies accessed, increases with an increase in technical efficiency. Such expense for TE class 3 milk-producing farms (TE scores higher than 95%) is approximately 3.5 and 4 times higher than those farms in TE class 2 and 1, respectively. Similarly, the overall proportion of milk producers who received training is about 10%, but they represent 88.8% and 20% of the farmers that achieved an efficiency score of above 65% and 95%, respectively.

In terms of overall efficiency analysis, Table 3 also shows that the majority (78.5%) of milk producing households have technical efficiency scores between 65 and 95%, with their average at 80.2%. About 11.2% of the farm households scored a technical efficiency of more than 95%, and the rest of the farm households have technical efficiency of less than 65%.

The total average technical efficiency is about 79.88%, implying that there is still considerable scope to improve dairy production in Tanzania, under the current production and input mix and technology and management: by up to 20.12% of the total potential. For those milk producers with a technical efficiency score of less than 65%, such scope of improvements extends to 40.82%.

We find that variables associated with commercialization play a substantial role both in the production function and in the explanation of milk producer household efficiency. In the production function, we observe that parameter estimates for commercialization indicators (numbers of crossbred cows and selected expenditure values) are smaller than for more technical variables such as scale. For the determinants of efficiency, however, the parameter estimate for an indicator of commercialization (group membership in model 1 and commercialization index dummy in model 2) had the most significant magnitude of any in the analysis.

Conclusion and Recommendations

This study employs SFA to evaluate TE and its determinants in Tanzania's milk-producing households. The inference drawn for the study then refers to supporting actions to mobilize commercialization among such households.

Our results support much existing work to show that, assuming milk producers’ rationality, increasing the numbers of cattle, cows, and crossbreeds, and use of additional veterinary and feed inputs all would increase productivity. Our analysis emphasizes the role to be played by behavior change, rather than stand-alone management change involving scale, technology, or organization, in implementing such change. The significance and magnitude of estimates of the importance of the indicators of commercialization that we develop appear throughout the analysis suggesting that technical and organizational change toward market orientation is needed.

Our sample of 469 farms exhibit an average TE estimated at 80%, which suggests that there is still room to improve dairy production in milk-producing farms in Tanzania by about 20%. Moving to value chain influences, the estimated inefficiency effects of the stochastic frontier model show that credit access, training, group membership, and female household labor would improve technical efficiency (reduce inefficiency). In contrast, non-cattle income would reduce efficiency in milk production in Tanzania. These results suggest the importance of actions beyond those of the individual milk producers. By convention, such calls are made for government action. However, our analysis and reasoning extends to facilitation actions and partnerships of actors in the value chain and beyond in order to advance elements of commercialization.

There is a need to enhance access to dairy production inputs and services (feed, veterinary, breeding, training, and credit services) for dairy farmers. To an extent this embodies changes in organization, marketing and service content and orientation, and credit availability. Although future research would productively examine factors affecting uptake of such services, the current study has identified their uptake as a major determinant of productivity and hence advocates for accelerated access. Further, increases in productivity will have feedback effects in terms of economies of scale—particularly milk volumes—and its effect on unit costs, which our study did not investigate but are essential for future research.

We also recommend strengthening credit provision, possibly using local initiatives like SACCOs (Savings and Credit Co-operative Societies) or VICOBA (Village Community Banks) to provide capital to milk-producing households that would facilitate access to better production technologies, inputs, and markets. For larger-scale investments (inherent in crossbreeding or transforming access to inputs or markets), we advocate improving access to formal banks or by enabling check-off (retained milk payments) systems and innovation in collateral recognition for milk producers. In this way, the commercialization offers both direct (via market access) and indirect (via the utilization of market transactions) roles in increasing productivity.

Our results highlight the role of training in improving productivity. The congruity of the effects of training with those of developmental actions listed above further magnifies the role of training in the commercialization of milk producers. Although conventional thinking on public goods would support government provision of training, other options are available. The options include enlistment of dairy processing and input provision actors, and channeling of training and technology via small-scale supported ventures—such as seen in Business Development Services and NGO actions. This illustrates a further potential synergy between commercialization and enhanced productivity.

We identify a positive influence on productivity of membership of a producer group. This effect has been little examined in reviewed research, and yet supports much development impetus for commercialization. Future research should identify the mechanism of this impact, for example by examining the form, role, and size of producer groups for their influence on productivity, as well as that of the specific aspects of commercialization (reduced transaction costs, changed marketing practice, mode of contracting, and others). Moreover, producer groups provide a potential organizational form for commercialization actions and the accelerated information and material flows that it embodies. Our study confirms the previously observed importance of women in dairy production and marketing reported by others, but extends this knowledge to include women’s positive influence on productivity. Women’s reported concentration of dairy-related effort on marketing predisposes to group action in marketing and input supply due to women’s social networks at a micro-level. Although there was some attempt in the past, future research should address this subject. At a grander scale, the extent to which commercialization led or facilitated by women generates better nutritional and social outcomes than does that led by men.

Finally, we recommend any policy efforts aimed at creating an efficient tie between the farm sector and the market will improve the performance of dairy production. Hence, policy measures directed at increasing the market participation rate of farmers, by providing information and other market-related support services, will significantly contribute to improvements in agricultural productivity.