1 Introduction

One of the famous workhorses in economic theory, used in uncountable many applications at all levels of aggregation (economy, industry, enterprise, plant, or whatever), despite its limitations, is the Cobb-Douglas (CD) (production) function.Footnote 1 Its ‘discovery’ was reported by Cobb and Douglas (1928)Footnote 2 and the subsequent success story told in Douglas’s presidential address at the 1947 meeting of the American Economic Association, as reported in Douglas (1948). The years after 1947 were covered in a posthumous article by Douglas (1976) — both articles are still fascinating pieces to read. The colourful history from its start in the mid-1920s to the late 1960s is to be found in the recent book by Biddle (2021).

The universal ‘law’ one believed to have uncovered – Biddle (2021) makes us aware of the fact that the concept of ‘production function’ only gradually crept into the language used – relates production P, measured as an index number of real (i.e., deflated) value added, to an index number of labour input L and an index number of capital stock K by means of the function \(P = aL^{\lambda }K^{1-\lambda }\), where a is some constant. Based on empirical evidence the labour exponent \(\lambda\) was set equal to 0.75, whereas a appeared to be approximately equal to 1. In time-series as well as cross-sectional studies of the manufacturing industry at various levels of aggregation time and again it appeared that \(\lambda\) was remarkably well approximated by the (nominal) share of labour cost in value added (which in turn was conceived as the sum of labour and capital input cost). In the 1947 address Douglas considered the slightly more general formula \(P = aL^{\lambda }K^{\mu }\), but then more often than not it turned out that the sum of the exponents \(\lambda\) and \(\mu\) was close to 1, which was taken as signalling constant returns to scale. Quoting Douglas (1976, 914):

“A considerable body of independent work tends to corroborate the original Cobb-Douglas formula, but, more important, the approximate coincidence of the estimated coefficients with the actual shares received also strengthens the competitive theory of distribution and disproves the Marxian.”

In a relatively unknown article, Tinbergen (1942a) pointed out a number of restrictive implications of the CD function as well as a number of statistical estimation problems, especially in a cross-section context. One of these implications is that labour productivity, P/L, is related to capital intensity, K/P, by \(P/L = a^{1/\lambda }(K/P)^{(1-\lambda )/\lambda }\); for \(a=1\) and \(\lambda = 0.75\) this means that \(P/L = (K/P)^{1/3}\), which implies that in a time-series context the increase of capital intensity would be the sole driving force for the increase of labour productivity. Where is technological progress?

It was more or less customary among economists of those days to see technological progress as part of accumulation of the capital stock and/or as increase of the quality of labour. Tinbergen however suggested to introduce time as an additional variable in the production function; for instance, by replacing the scalar a by the simple function \(a^{t}\), as in Tinbergen (1942b). This can be seen as the beginninng of a development in which the simple function \(a^{t}\) is replaced by a more intricate function A(t), defining ‘total factor productivity’, or ‘efficiency’, or ‘state of technology’, as in Solow (1957).

From the start, the project of Douglas and his followers has met with approval and criticism, both from econometric and conceptual perspectives, as documented extensively in Biddle (2021) and Felipe and McCombie (2013, Chapter 4).Footnote 3 Many empirical applications took place in agricultural economics because of the availability of more and better (firm-level) data; here estimation could be based on less dubious assumptions.

In the meantime theoretical work showed that the conditions for the existence of an aggregate production function are unlikely to being satisfied in any realistic situation — see the lemma on ”Aggregation (production)” in The New Palgrave by Felipe and Fisher (2008). But then: why always the good fit? Is there a sort of law of production that supports the input–output data, or is the observed, remarkably stable, relationship between those data some statistical artefact? But what precisely causes this artefact then to emerge?

Important contributions to the solution of this puzzle were provided first by Phelps Brown (1957),Footnote 4 followed by Simon and Levy (1963), Fisher (1971), Shaikh (1974), and again Simon (1979). It gradually began to dawn that the responsibility rests with the underlying accounting relation stating that (nominal) value added equals (nominal) labour input cost and (nominal) capital input cost. Simon (1979, 473) concluded by saying that

“Since the observed phenomena can as readily be explained on the weaker assumption that what is being observed is simply the accounting relation equating value of output to the sum of factor costs, the criterion of parsimony would lead us to prefer the latter explanation to the classical one.”

In an article commemorating Douglas’s work, Samuelson (1979, 933) also pointed to the role of “the accounting identity involved in the residual definition of profit”.

Further evidence, approaching a body of data with a CES production function, of which the CD function is a special case, was provided by Felipe and McCombie (2001). Their conclusion was that

“... Occam’s razor suggests that the CES production function,..., merely reflect the underlying accounting identity that value added equals the wage bill plus profits.”

Interesting insights can be found in a collection of symposium papers published in the Summer 2005 issue of the Eastern Economic Journal; in particular Felipe and Adams (2005), Shaikh (2005), and Felipe and McCombie (2005). Summaries of the state of the art were provided by Felipe and McCombie (2007) and Felipe and Fisher (2008), and much additional material by Felipe and McCombie (2013). Especially the final chapter of this book, entitled “Why have criticisms of the aggregate production function generally been ignored? On further misunderstandings and misinterpretations of the implications of the accounting identity”, is well worth (re-) reading.

Basically in all these publications it is shown that, in continuous time, the CD function can be derived from the accounting identity under some reasonable assumptions (factor shares are constant, and the weighted growth of the labour input price and the capital input price is constant). In this paper these results are generalized in three ways: (1) the accounting identity contains a (pure) profit term; (2) continuous time is replaced by discrete time periods; (3) additional assumptions appear to be superfluous.

The layout of this article is as follows. Section 2 first provides a sketch of the classic argument, and then shows in a more rigorous way how a CD function emerges from observable data. Section 3 provides a number of comments and extensions. Section 4 discusses the generalization from real value added to gross output. Section 5 goes beyond a single production unit and considers an ensemble of those. Section 6 contains some concluding thoughts.

2 The emergence of a function

Consider a production unit, operating on the market so that output prices are given (or can be imputed). Let t denote an accounting period of a certain length (say, a year). The ex post accounting relation according to the KL-VA model reads

$$\begin{aligned} C_{KL}^t + \Pi ^t = VA ^t, \end{aligned}$$
(1)

where \(C_{KL}^t\) denotes (nominal) primary inputs cost, \(VA ^t\) denotes (nominal) value added (that is, revenue minus intermediate inputs cost; assumed to be positive), and profit \(\Pi ^t\) is defined as the difference between value added and primary inputs cost. Thus the accounting relation is merely a definitional identity. Profit may be positive, zero, or negative. For a more detailed exposition the reader is referred to Balk (2021, Chapter 2). In particular one should note that revenue, intermediate inputs cost, capital input cost, and labour input cost are sums of prices times quantities. Usually the number of commodities (goods and services) making out each of these aggregates is huge. Data are therefore usually presented in the form of aggregate nominal values and corresponding price index numbers.

The reader is warned that there is no standardized terminology here; e.g., Felipe and Adams (2005) call value added minus labour cost, in our notation \(VA ^t - C_L^t\), total profits, which is then seen as remuneration of the capital stock.

Primary inputs concern owned capital and labour (including self-employed persons). Since nominal values are additive, we have, in obvious notation, that

$$\begin{aligned} C_{KL}^t = C_K^t + C_L^t. \end{aligned}$$
(2)

We consider two periods, which may or may not be adjacent. These periods are labeled 0 (base period) and 1 (comparison period).Footnote 5 Let us start by summarizing the classic argument.

2.1 The classic argument

The first assumption is that profit equals zero, \(\Pi ^t = 0\). Then, by combining expressions (1) and (2), the fundamental accounting relation reads

$$\begin{aligned} C_K^t + C_L^t = VA ^t. \end{aligned}$$
(3)

The three elements in this equation are nominal values. In order to turn them into real values we need price indices (aka deflators). Let \(P_K(t,b)\), \(P_L(t,b)\), and \(P_{ VA }(t,b)\) be suitable deflators for capital cost, labour cost, and value added, respectively, for period t relative to a certain reference period b (somewhere in the past); that is, \(P_K(b,b) = P_L(b,b) = P_{ VA }(b,b) = 1\). Real values \(X_K(t,b)\), \(X_L(t,b)\), and \(RVA (t,b)\) are then defined by the three identities

$$\begin{aligned} C_K^t= & {} P_K(t,b)X_K(t,b),\end{aligned}$$
(4)
$$\begin{aligned} C_L^t= & {} P_L(t,b)X_L(t,b),\end{aligned}$$
(5)
$$\begin{aligned} VA ^t= & {} P_{ VA }(t,b) RVA (t,b). \end{aligned}$$
(6)

The notation highlights the fact that unlike nominal values, which are observable, real values are outcomes of functions. Relative deflators for capital and labour cost, respectively, are defined by

$$\begin{aligned} {\tilde{P}}_K(t,b)\equiv & {} P_K(t,b)/P_{ VA }(t,b) \end{aligned}$$
(7)
$$\begin{aligned} {\tilde{P}}_L(t,b)\equiv & {} P_L(t,b)/P_{ VA }(t,b). \end{aligned}$$
(8)

By inserting expressions (4), (5), and (6) into expression (3), dividing both sides of the resulting identity by \(P_{ VA }(t,b)\), and using expressions (7) and (8), we see that the fundamental accounting relation in terms of real values reads

$$\begin{aligned} {\tilde{P}}_K(t,b)X_K(t,b) + {\tilde{P}}_L(t,b)X_L(t,b) = RVA (t,b). \end{aligned}$$
(9)

The second assumption is that all the functions occurring in expression (9) are continuously differentiable functions of continuous time t. By differentiating both sides of this expression we then obtain the following identity,

$$\begin{aligned}{} & {} {\tilde{P}}_K(t,b)dX_K(t,b) + {\tilde{P}}_L(t,b)dX_L(t,b) + X_K(t,b)d{\tilde{P}}_K(t,b) + X_L(t,b)d{\tilde{P}}_L(t,b)\\{} & {} \quad = d RVA (t,b),\nonumber \end{aligned}$$
(10)

which can be rewritten as

$$\begin{aligned}{} & {} \frac{{\tilde{P}}_K(t,b)X_K(t,b)}{ RVA (t,b)}d\ln X_K(t,b) + \frac{{\tilde{P}}_L(t,b)X_L(t,b)}{ RVA (t,b)}d\ln X_L(t,b) \, \\{} & {} \quad +\frac{{\tilde{P}}_K(t,b)X_K(t,b)}{ RVA (t,b)}d\ln {\tilde{P}}_K(t,b) + \frac{{\tilde{P}}_L(t,b)X_L(t,b)}{ RVA (t,b)}d\ln {\tilde{P}}_L(t,b)\nonumber \\{} & {} \quad = d\ln RVA (t,b).\nonumber \end{aligned}$$
(11)

The third assumption is that the two factor shares are constant; that is, using expressions (4)-(8),

$$\begin{aligned} \frac{{\tilde{P}}_K(t,b)X_K(t,b)}{ RVA (t,b)}= & {} \frac{P_K(t,b)X_K(t,b)}{ VA ^t} = \frac{C_K^t}{ VA ^t} = \alpha _K \end{aligned}$$
(12)
$$\begin{aligned} \frac{{\tilde{P}}_L(t,b)X_L(t,b)}{ RVA (t,b)}= & {} \frac{P_L(t,b)X_L(t,b)}{ VA ^t} = \frac{C_L^t}{ VA ^t} = \alpha _L. \end{aligned}$$
(13)

The fundamental accounting relation in expression (3) then implies that \(\alpha _K + \alpha _L = 1\), which in turn implies that expression (11) reduces to

$$\begin{aligned}{} & {} \alpha _Kd\ln X_K(t,b) + (1 - \alpha _K)d\ln X_L(t,b) \, \\{} & {} \quad +\alpha _Kd\ln {\tilde{P}}_K(t,b) + (1 - \alpha _K)d\ln {\tilde{P}}_L(t,b)\nonumber \\{} & {} \quad = d\ln RVA (t,b).\nonumber \end{aligned}$$
(14)

The fourth, and final, assumption is that the weighted mean of the relative deflators is constant; that is,

$$\begin{aligned} \alpha _Kd\ln {\tilde{P}}_K(t,b) + (1 - \alpha _K)d\ln {\tilde{P}}_L(t,b) = \beta . \end{aligned}$$
(15)

The consequence is that expression (14) reduces to

$$\begin{aligned} d\ln RVA (t,b) = d\ln \left( X_K(t,b)^{\alpha _K} X_L(t,b)^{1 - \alpha _K}\right) + \beta . \end{aligned}$$
(16)

On the right-hand side of this identity, between the brackets, we meet the traditional CD function. It is interesting to notice that the fourth assumption basically means that the rate of value-added based price change, \(d\ln P_{ VA }(t,b)\), differs from the rate of capital plus labour price change, \(d\ln (P_K(t,b)^{\alpha _K}\) \(P_L(t,b)^{1 - \alpha _K})\), by a fixed percentage. Another interpretation is that dual total factor productivity change, as introduced by Jorgenson and Griliches (1967, 252), is constant.

This concludes the reproduction of the classic argument: if the nominal data are governed by an accounting identity like expression (3) then a CD function will almost necessarily give a good ‘fit’ to the real data, because the relation between these input–output data exhibits a CD-like form. In the next subsection it will be shown that the first three assumptions of the classic approach are unnecessary, and that the fourth assumption can be replaced by a more modest one.

2.2 A more rigorous approach

We drop all the assumptions made in the previous subsection. Specifically, we return to discrete time periods. It is assumed that the (nominal) value-added ratio of period 1 to period 0 can be decomposed into a price index and a quantity index,

$$\begin{aligned} \frac{ VA ^1}{ VA ^0} = P_{ VA }(1,0)Q_{ VA }(1,0), \end{aligned}$$
(17)

where (1, 0) is used as shorthand notation for the vectors of prices and quantities of the two periods. Likewise, it is assumed that the joint primary inputs cost ratio is decomposed as

$$\begin{aligned} \frac{C_{KL}^1}{C_{KL}^0} = P_{KL}(1,0)Q_{KL}(1,0). \end{aligned}$$
(18)

As in Balk (2021, 34) the value-added based (total factor) productivity index for period 1 relative to period 0 is then defined as

$$\begin{aligned} ITFPROD _{ VA }(1,0) \equiv \frac{Q_{ VA }(1,0)}{Q_{KL}(1,0)}. \end{aligned}$$
(19)

This index can be interpreted as the ‘quantity’ change of value added relative to the joint quantity change of primary inputs; or, as the index of real value added relative to the index of real primary inputs; or, as the real component of value-added-based profitability change.

Using the logarithmic mean.Footnote 6 and the relation in expression (2), the primary inputs cost ratio can be decomposed as follows:

$$\begin{aligned} \ln \left( \frac{C_{KL}^1}{C_{KL}^0}\right)= & {} \frac{C_{KL}^1 - C_{KL}^0}{ LM (C_{KL}^1,C_{KL}^0)}\nonumber \\= & {} \frac{C_{K}^1 - C_{K}^0}{ LM (C_{KL}^1,C_{KL}^0)} + \frac{C_{L}^1 - C_{L}^0}{ LM (C_{KL}^1,C_{KL}^0)} \nonumber \\= & {} \frac{ LM (C_{K}^1,C_{K}^0)\ln (C_{K}^1/C_{K}^0)}{ LM (C_{KL}^1,C_{KL}^0)} + \frac{ LM (C_{L}^1,C_{L}^0)\ln (C_{L}^1/C_{L}^0)}{ LM (C_{KL}^1,C_{KL}^0)}. \end{aligned}$$
(20)

It is useful to define \(\phi _K(1,0) \equiv LM (C_{K}^1,C_{K}^0)/ LM (C_{KL}^1,C_{KL}^0)\) and \(\phi _L(1,0) \equiv LM (C_{L}^1,C_{L}^0)/ LM (C_{KL}^1,C_{KL}^0)\). Thus \(\phi _K(1,0)\) is the mean (over two periods) share of capital in total primary inputs cost, and \(\phi _L(1,0)\) is the mean (over two periods) share of labour in total primary inputs cost. The previous expression can then be simplified to

$$\begin{aligned} \ln \left( \frac{C_{KL}^1}{C_{KL}^0}\right) = \phi _K(1,0)\ln \left( \frac{C_{K}^1}{C_{K}^0}\right) + \phi _L(1,0)\ln \left( \frac{C_{L}^1}{C_{L}^0}\right) \end{aligned}$$
(21)

The next step is to assume that the capital input cost ratio and the labour cost ratio can be decomposed in price and quantity indices as

$$\begin{aligned} \frac{C_{K}^1}{C_{K}^0}= & {} P_{K}(1,0)Q_{K}(1,0). \end{aligned}$$
(22)
$$\begin{aligned} \frac{C_{L}^1}{C_{L}^0}= & {} P_{L}(1,0)Q_{L}(1,0). \end{aligned}$$
(23)

Substituting the expressions (18), (22), and (23) into expression (21), rearranging the right-hand side, and exponentiating both sides, we get

$$\begin{aligned}{} & {} P_{KL}(1,0)Q_{KL}(1,0) \nonumber \\{} & {} \quad = \left( P_{K}(1,0)^{\phi _K(1,0)}P_{L}(1,0)^{\phi _L(1,0)}\right) \left( Q_{K}(1,0)^{\phi _K(1,0)}Q_{L}(1,0)^{\phi _L(1,0)}\right) . \end{aligned}$$
(24)

Now, if

$$\begin{aligned} P_{KL}(1,0) = P_{K}(1,0)^{\phi _K(1,0)}P_{L}(1,0)^{\phi _L(1,0)}, \end{aligned}$$
(25)

then

$$\begin{aligned} Q_{KL}(1,0) = Q_{K}(1,0)^{\phi _K(1,0)}Q_{L}(1,0)^{\phi _L(1,0)}. \end{aligned}$$
(26)

Technically seen, the last two equations mean that the joint capital-labour input price and quantity index numbers are equal to Montgomery-Vartia (MV) index numbers of the underlying, separate capital and labour price and quantity index numbers, respectively.Footnote 7 Thus, if the condition in expression (25) is satisfied then the productivity index defined by expression (19) becomes equal to

$$\begin{aligned} ITFPROD _{ VA }(1,0) = \frac{Q_{ VA }(1,0)}{Q_{K}(1,0)^{\phi _K(1,0)}Q_{L}(1,0)^{\phi _L(1,0)}}. \end{aligned}$$
(27)

Rearranging this expression in growth-accounting form, we obtain

$$\begin{aligned} Q_{ VA }(1,0) = ITFPROD _{ VA }(1,0)Q_{K}(1,0)^{\phi _K (1,0)}Q_{L}(1,0)^{\phi _L(1,0)}, \end{aligned}$$
(28)

which starkly looks like an unrestricted Cobb-Douglas production function. Notice that it is straightforward to reformulate this expression in terms of levels (real value added, real inputs). By taking logarithms we obtain the more familiar form

$$\begin{aligned} \ln Q_{ VA }(1,0) = \ln ITFPROD _{ VA }(1,0) + \phi _K(1,0)\ln Q_{K}(1,0) + \phi _L(1,0)\ln Q_{L}(1,0). \end{aligned}$$
(29)

Realizing that the logarithm of an index number in the neighbourhood of 1 can be interpreted as a growth rate (or, what is the same, a forward-looking percentage change), expression (29) can be read as saying that the growth rate of output is equal to the growth rate of productivity plus a weighted sum of the growth rates of capital and labour.Footnote 8 Notice, however, that these weights are not constant, but depend on the data of the two periods.

If these data – Recall that data are usually given in the form of index numbers. – were generated by a ‘genuine’ CD production function then we would have

$$\begin{aligned} \ln Q_{ VA }(1,0) = \alpha + \alpha _{K}\ln Q_{K}(1,0) + \alpha _{L}\ln Q_{L}(1,0) + \epsilon , \end{aligned}$$
(30)

where \(\alpha , \alpha _{K}, \alpha _{L}\) are constants, with the last two in the interval (0, 1), and \(\epsilon\) is a kind of noise. The coefficients \(\alpha _{K}\) and \(\alpha _{L}\) can be interpreted as marginal productivities. Expression (30) perfectly fits the following description by Douglas (1948, 6), the production unit there being American manufacuring:

“It was twenty years ago last spring that, having computed indexes for American manufacturing of the numbers of workers employed by years from 1899 to 1922, as well as indexes of the amounts of fixed capital in manufacturing deflated to dollars of approximately constant purchasing power, and then plotting these on a log scale together with the Day index of physical production for manufacturing, I observed that the product curve lay consistently between the two curves for the factors of production and tended to be approximately a quarter of the relative distance between the curve of the index for labor, which showed the least increase over the period, and that of the index for capital which showed the most.”

Or, quoting Samuelson (1979, 927-8),

“... the data Douglas presented Cobb practically commanded the C-D formula.”

It is important to notice that whereas expression (30) is the outcome of a model, expression (29) basically reflects an accounting relation. This relation imposes on the noise term of the model that

$$\begin{aligned} \epsilon= & {} (\ln ITFPROD _{VA}(1,0) - \alpha ) + (\phi _K(1,0) - \alpha _K )\ln Q_{K}(1,0) \nonumber \\{} & {} + \, (\phi _L(1,0) - \alpha _L )\ln Q_{L}(1,0). \end{aligned}$$
(31)

It is also interesting to consider what precisely happens when one assumes that the data are generated by a Translog production function; that is,

$$\begin{aligned} \ln Q_{ VA }(1,0)= & {} \alpha + \alpha _{K}\ln Q_{K}(1,0) + \alpha _{L}\ln Q_{L}(1,0) + \alpha _{KK}(\ln Q_{K}(1,0))^{1/2}\\{} & {} + \alpha _{LL}(\ln Q_{L}(1,0))^{1/2} + \alpha _{KL}\ln Q_{K}(1,0)\ln Q_{L}(1,0) + \delta ,\nonumber \end{aligned}$$
(32)

where the \(\alpha\)’s are constants, and \(\delta\) is also a kind of noise. It is straightforward to check that the accounting relation in expression (29) imposes the following relation between the Translog noise \(\delta\) and the CD noise \(\epsilon\),

$$\begin{aligned} \epsilon = \alpha _{KK}(\ln Q_{K}(1,0))^{1/2} + \alpha _{LL}(\ln Q_{L}(1,0))^{1/2} + \alpha _{KL}\ln Q_{K}(1,0)\ln Q_{L}(1,0) + \delta . \end{aligned}$$
(33)

It thus appears that the second-order terms of the Translog production function consume a great deal of the CD noise.

3 Comments and extensions

A couple of things are worth noting about the key result in expressions (28) or (29). First, due to the concavity of the logarithmic mean, the exponents \(\phi _K(1,0)\) and \(\phi _L(1,0)\) don’t add up to 1. In fact, their sum is smaller than 1,

$$\begin{aligned} \phi _K(1,0) + \phi _L(1,0) \le 1, \end{aligned}$$
(34)

but the discrepancy usually appears to be small (see Balk 2008, 88 for an example). Note that there is no reason whatsoever to interpret the sum of the exponents as measuring returns to scale.

Second, \(\phi _K(1,0)\) as well as \(\phi _L(1,0)\) are by definition ratios of logarithmic means. Using the Sato (1974) approximation, it appears that

$$\begin{aligned} \phi _K(1,0) \approx \frac{\frac{1}{3}\left( \frac{C_{K}^1 + C_{K}^0}{2} + 2(C_{K}^1C_{K}^0)^{1/2}\right) }{\frac{1}{3}\left( \frac{C_{KL}^1 + C_{KL}^0}{2} + 2(C_{KL}^1C_{KL}^0)^{1/2}\right) }. \end{aligned}$$
(35)

Thus a close approximation of \(\phi _K(1,0)\) is provided by

$$\begin{aligned} \phi _K(1,0) \approx \frac{(C_{K}^1C_{K}^0)^{1/2}}{(C_{KL}^1C_{KL}^0)^{1/2}} = \left( \frac{C_{K}^1}{C_{KL}^1}\frac{C_{K}^0}{C_{KL}^0}\right) ^{1/2}; \end{aligned}$$
(36)

that is, the geometric mean of the capital input cost shares of the two periods. Similarly, a close approximation of \(\phi _L(1,0)\) is provided by

$$\begin{aligned} \phi _L(1,0) \approx \frac{(C_{L}^1C_{L}^0)^{1/2}}{(C_{KL}^1C_{KL}^0)^{1/2}} = \left( \frac{C_{L}^1}{C_{KL}^1}\frac{C_{L}^0}{C_{KL}^0}\right) ^{1/2}; \end{aligned}$$
(37)

that is, the geometric mean of the labour input cost shares of the two periods. Now, recall that due to relation (2), capital input and labour cost shares add up to 1,

$$\begin{aligned} \frac{C_K^t}{C_{KL}^t} + \frac{C_L^t}{C_{KL}^t} = 1, \end{aligned}$$
(38)

so that we may expect that

$$\begin{aligned} \phi _L(1,0) \approx 1 - \phi _K(1,0). \end{aligned}$$
(39)

This would reduce expression (28) to a lookalike of the original, restricted Cobb-Douglas function, except that \(\phi _L(1,0)\) and \(\phi _K(1,0)\) are not constant. Also, there is no reason to interpret the fact that \(\phi _L(1,0)\) and \(\phi _K(1,0)\) are cost shares in terms of competitiveness, as Douglas would have done; this is just a matter of definition.Footnote 9

Third, it is important to note that in expression (29) the productivity factor, \(ITFPROD _{ VA }(1,0)\), cannot be conceived as an independent source of output growth. Basically, expression (29) is nothing but a rewritten version of the definition of productivity change. Put otherwise, \(ITFPROD _{ VA }(1,0)\) is the result of the interplay of output change, measured by \(Q_{ VA }(1,0)\), and input change, measured by \(Q_{K}(1,0)\) and \(Q_{L}(1,0)\). Contrast this with expression (30) and imagine an estimation setting in which \(\alpha\) represents an exogenous ‘productivity shock’. The fundamental challenge in production function estimation, already recognized in the 1940s, comes from the fact that inputs and outputs are simultaneously chosen by the production unit, in connection with a ‘productivity shock’ about which its management may or may not be informed. Thus the variables on the right-hand side of expression (30) are correlated and potentially endogenous. The recent literature contains several approaches to handle this challenge; however, none of these appears to be completely satisfactory according to Kim et al. (2019).

Fourth, it must be observed that expression (28) holds independent of the magnitude of profit \(\Pi ^t\) \((t=0,1)\), and also independent of the magnitudes of or eventual relations between the price indices \(P_{ VA }(1,0)\), \(P_K(1,0)\) and \(P_L(1,0)\). However, if profit \(\Pi ^t = 0\) \((t=0,1)\) then expression (28) may be replaced by the dual expression

$$\begin{aligned} P_{ VA }(1,0) = ITFPROD _{ VA }(1,0)^{-1}P_{K}(1,0 )^{\phi _K(1,0)}P_{L}(1,0)^{\phi _L(1,0)}. \end{aligned}$$
(40)

This can immediately be checked by using expressions (22), (23), (21), (17), and (3) respectively.Footnote 10 Thus, if profit is zero then input and output price indices are linked by total factor productivity.

Fifth, the functional form of the quantity indices \(Q_{ VA }(1,0)\), \(Q_K(1,0)\) and \(Q_L(1,0)\) is left unspecified. The only restriction is that the Product Test be satisfied according to expressions (17), (22) and (23). Dependent on the time distance between periods 0 and 1 the indices might be direct or chained.

Sixth, in the derivation of expression (28) the only, technical not behavioural, assumption made is that the joint capital-labour input price index number behaves as an MV index number of the underlying factors; recall expression (25). This assumption, however, is rather harmless. More generality is obtained by defining a factor \(\delta (1,0)\) such that

$$\begin{aligned} P_{KL}(1,0) = \delta (1,0) P_{K}(1,0)^{\phi _K(1,0)}P_{L}(1,0)^{\phi _L(1,0)}. \end{aligned}$$

As one easily checks, the consequence is that to the noise \(\epsilon\) must be added \(\ln \delta (1,0)\). Now it is well known that an MV index is a second-order differential approximation of symmetric indices such as Fisher or Törnqvist, and a first-order diferential approximation of asymmetric indices such as Laspeyres and Paasche (Diewert 1978). Thus if the time span between periods 0 and 1 is short enough then it does not matter much which index is employed for deflating primary inputs cost. Put differently, if the time span between periods 0 and 1 decreases then one generally expects \(\delta (1,0)\) to tend to 1.

Seventh, the extension of expression (28) to more than two (aggregate) inputs is straightforward. Let capital K consist of I types, and let labour L consist of J types. Then, in obvious notation,

$$\begin{aligned} C_K^t = \sum _{i=1}^{I}C_{Ki}^t \end{aligned}$$
(41)

and

$$\begin{aligned} C_L^t = \sum _{j=1}^{J}C_{Lj}^t. \end{aligned}$$
(42)

Suppose that each lower level cost ratio can be decomposed into a price index and a quantity index; that is,

$$\begin{aligned} \frac{C_{Ki}^1}{C_{Ki}^0} = P_{Ki}(1,0)Q_{Ki}(1,0) \,\, (i=1,\ldots ,I) \end{aligned}$$
(43)

and

$$\begin{aligned} \frac{C_{Lj}^1}{C_{Lj}^0} = P_{Lj}(1,0)Q_{Lj}(1,0) \,\, (j=1,\ldots ,J). \end{aligned}$$
(44)

Let the aggregate capital input quantity index be defined as a MV index of the lower level indices; that is,

$$\begin{aligned} Q_K(1,0) \equiv \prod _{i=1}^{I}Q_{Ki}(1,0)^{\phi _{Ki}(1,0)} \end{aligned}$$
(45)

where \(\phi _{Ki}(1,0) \equiv LM (C_{Ki}^1,C_{Ki}^0)/ LM (C_{K}^1,C_{K}^0)\) \((i=1,\ldots ,I)\). Similarly, let

$$\begin{aligned} Q_L(1,0) \equiv \prod _{j=1}^{J}Q_{Lj}(1,0)^{\phi _{Lj}(1,0)} \end{aligned}$$
(46)

where \(\phi _{Lj}(1,0) \equiv LM (C_{Lj}^1,C_{Lj}^0)/ LM (C_{L}^1,C_{L}^0)\) \((j=1,\ldots ,J)\). Substituting these two expressions into expression (28) delivers

$$\begin{aligned} Q_{ VA }(1,0)= & {} ITFPROD _{ VA }(1,0) \nonumber \\{} & {} \times \left( \prod _{i=1}^{I}Q_{Ki}(1,0)^{\phi _{Ki}(1,0)}\right) ^{\phi _K(1,0)} \left( \prod _{j=1}^{J}Q_{Lj}(1,0)^{\phi _{Lj}(1,0)}\right) ^{\phi _L(1,0)}. \end{aligned}$$
(47)

Notice that \(\phi _{Ki}(1,0)\phi _K(1,0) = LM (C_{Ki}^1,C_{Ki}^0)/ LM (C_{KL}^1,C_{KL}^0)\) \((i=1,\ldots ,I)\) and \(\phi _{Lj}(1,0)\phi _L(1,0) = LM (C_{Lj}^1,C_{Lj}^0)/ LM (C_{KL}^1,C_{KL}^0)\) \((j=1,\ldots ,J)\), so that we would have obtained expression (47) also when we immediately had acted on the situation that there are \(I+J\) inputs instead of 2. This is due to the fact that the MV indices are consistent-in-aggregation (see Balk 2008, 111).

4 Gross output

Our key expression (28) relates real value added to multiple inputs. Can the expression be extended to account for gross output? For this we have to go back to the definition of (nominal) value added as revenue minus the cost of intermediate inputs (encompassing energy E, materials M, and services S); that is,

$$\begin{aligned} VA ^t \equiv R^t - C_{EMS}^{t}. \end{aligned}$$
(48)

Following Balk (2021, 57-61) it is now supposed that the revenue ratio can be decomposed by price and quantity indices \(P_R(.)\) and \(Q_R(.)\), so that

$$\begin{aligned} \frac{R^1}{R^0} = P_R(1,0)Q_R(1,0), \end{aligned}$$
(49)

and that the intermediate inputs cost ratio likewise can be decomposed as

$$\begin{aligned} \frac{C_{EMS}^1}{C_{EMS}^0} = P_{EMS}(1,0)Q_{EMS}(1,0). \end{aligned}$$
(50)

Then, by following the same steps as leading to expression (21), except that a minus sign now replaces the plus sign, it appears that a value-added based quantity index can rather naturally be defined as a generalized MV index; that is,

$$\begin{aligned} Q_{ VA }(1,0) \equiv \frac{Q_R(1,0)^{\psi _{R}(1,0)}}{Q_{EMS}(1,0))^{\psi _{EMS}(1,0)}}, \end{aligned}$$
(51)

where \(\psi _{R}(1,0) \equiv LM (R^1,R^0)/ LM ( VA ^1, VA ^0)\), that is, mean revenue over mean value added, and \(\psi _{EMS}(1,0) \equiv LM (C_{EMS}^1,C_{EMS}^0)/ LM ( VA ^1, VA ^0)\), that is, mean intermediate inputs cost over mean value added. Notice that \(\psi _{R}(1,0)\) is an instance of the so-called Domar factor.

Substituting expression (51) into expression (28) delivers

$$\begin{aligned} Q_R(1,0)^{\psi _{R}(1,0)}= & {} ITFPROD _{ VA } (1,0)Q_{K}(1,0)^{\phi _K(1,0)}Q_{L}(1,0)^{\phi _L(1,0)} \nonumber \\{} & {} \times Q_{EMS}(1,0))^{\psi _{EMS}(1,0)}, \end{aligned}$$
(52)

or

$$\begin{aligned} Q_R(1,0)= & {} ITFPROD _{ VA }(1,0)^{1/\psi _{R} (1,0)}Q_{K}(1,0)^{\phi _K(1,0)/\psi _{R}(1,0)} \nonumber \\{} & {} \times Q_{L}(1,0)^{\phi _L(1,0)/\psi _{R}(1,0)}Q_{EMS}(1,0) )^{\psi _{EMS}(1,0)/\psi _{R}(1,0)}. \end{aligned}$$
(53)

This is the extension asked for.

However, we could have followed a direct route by considering that combining expressions (3), (2) and (48) lead to the KLEMS-Y accounting identity

$$\begin{aligned} C_{KLEMS}^t + \Pi ^t = C_{K}^t + C_{L}^t + C_{EMS}^t + \Pi ^t = R^t. \end{aligned}$$
(54)

The gross-output based (total factor) productivity index is then defined by

$$\begin{aligned} ITFPROD _{Y}(1,0) \equiv \frac{Q_{R}(1,0)}{Q_{KLEMS}(1,0)}, \end{aligned}$$
(55)

where \(Q_{KLEMS}(1,0)\) is the quantity index component of the total input cost ratio \(C_{KLEMS}^1/C_{KLEMS}^0\) (see Balk 2021, 19). As before, by following the same steps as leading to expression (21) it appears that a total input quantity index can rather naturally be defined as a MV index; that is,

$$\begin{aligned} Q_{KLEMS}(1,0) \equiv Q_K(1,0)^{\chi _{K}(1,0)}Q_L(1,0)^{\chi _{L}(1,0)}Q_{EMS}(1,0)^{\chi _{EMS}(1,0)}, \end{aligned}$$
(56)

where the exponents are, respectively, defined by

$$\begin{aligned} \chi _{K}(1,0)&\equiv LM (C_K^1,C_K^0)/ LM (C_{KLEMS}^1,C_{KLEMS}^0)\\ \chi _{L}(1,0)&\equiv LM (C_L^1,C_L^0)/ LM (C_{KLEMS}^1,C_{KLEMS}^0)\\ \chi _{EMS}(1,0)&\equiv LM (C_{EMS}^1,C_{EMS}^0)/ LM (C_{KLEMS}^1,C_{KLEMS}^0). \end{aligned}$$

By substituting expression (56) into expression (55), and rearranging, we obtain

$$\begin{aligned} Q_{R}(1,0) = ITFPROD _{Y}(1,0)Q_K(1,0)^{\chi _{K}(1,0)} Q_L(1,0)^{\chi _{L}(1,0)}Q_{EMS}(1,0)^{\chi _{EMS}(1,0)}. \end{aligned}$$
(57)

Notice the differences between expressions (53) and (57). If, however, profit \(\Pi ^t = 0\) (\(t = 0,1\)) then the corresponding exponents become the same; that is,

$$\begin{aligned} \phi _K(1,0)/\psi _{R}(1,0)= & {} \chi _{K}(1,0) \end{aligned}$$
(58)
$$\begin{aligned} \phi _L(1,0)/\psi _{R}(1,0)= & {} \chi _{L}(1,0) \end{aligned}$$
(59)
$$\begin{aligned} \psi _{EMS}(1,0)/\psi _{R}(1,0)= & {} \chi _{EMS}(1,0), \end{aligned}$$
(60)

and it turns out that

$$\begin{aligned} ITFPROD _{Y}(1,0) = ITFPROD _{ VA }(1,0)^{1/\psi _{R}(1,0)}. \end{aligned}$$
(61)

Recall that \(\psi _{R}(1,0)\) is an instance of the Domar factor (= revenue divided by value added). Expression (61) reflects Theorem 1 of Balk (2009) (see Balk 2021, 61-2).

5 An ensemble of production units

We now consider an ensemble \({\mathcal {K}}\) of production units. Think of an economy consisting of industries, or an industry consisting of enterprises. For each unit and period the KL-VA accounting identity in nominal values reads

$$\begin{aligned} C_{KL}^{kt} + \Pi ^{kt} = VA ^{kt} \, (k \in {\mathcal {K}}), \end{aligned}$$
(62)

which differs from expression (3) in that a superscript k has been added. Provided that there are no tax wedges between input and output prices and that capital and labour inputs are specific for each production unit, the individual accounting identities may simply be added up to the accounting identity for the aggregate,

$$\begin{aligned} C_{KL}^{{\mathcal {K}}t} + \Pi ^{{\mathcal {K}}t} = VA ^{{\mathcal {K}}t}, \end{aligned}$$
(63)

where \(C_{KL}^{{\mathcal {K}}t} = \sum _{k \in {\mathcal {K}}}C_{KL}^{kt}\), \(\Pi ^{{\mathcal {K}}t} = \sum _{k \in {\mathcal {K}}}\Pi ^{kt}\), and \(VA ^{{\mathcal {K}}t} = \sum _{k \in {\mathcal {K}}} VA ^{kt}\).

The aggregate value-added ratio, for period 1 relative to period 0, can be decomposed as follows (see Balk 2008, 85-86):

$$\begin{aligned} \frac{ VA ^{{\mathcal {K}}1}}{ VA ^{{\mathcal {K}}0}} = \prod _{k \in {\mathcal {K}}}\left( \frac{ VA ^{k1}}{ VA ^{k0}}\right) ^{\Psi ^{k}(1,0)}, \end{aligned}$$
(64)

where

$$\begin{aligned} \Psi ^{k}(1,0) \equiv \frac{ LM \left( \frac{ VA ^{k1}}{ VA ^{{\mathcal {K}}1}},\frac{ VA ^{k0}}{ VA ^{{\mathcal {K}}0}}\right) }{\sum _{k \in {\mathcal {K}}} LM \left( \frac{ VA ^{k1}}{ VA ^{{\mathcal {K}}1}},\frac{ VA ^{k0}}{ VA ^{{\mathcal {K}}0}}\right) } \, (k \in {\mathcal {K}}). \end{aligned}$$

Thus, aggregate nominal value-added change, measured as ratio, is equal to a weighted geometric mean of individual nominal value-added changes. Each coefficient \(\Psi ^{k}(1,0)\) is the (normalized) mean share of production unit k in aggregate nominal value added. Notice that these coefficients add up to 1; that is,

$$\begin{aligned} \sum _{k \in {\mathcal {K}}}\Psi ^{k}(1,0) = 1. \end{aligned}$$
(65)

For each individual unit it is assumed that the value-added ratio, the capital input cost ratio, and the labour cost ratio can be decomposed into price indices and quantity indices as in expressions (17), (22), and (23) respectively, with superscripts k added appropriately. In addition, it is assumed that the aggregate value-added ratio can be decomposed as

$$\begin{aligned} \frac{ VA ^{{\mathcal {K}}1}}{ VA ^{{\mathcal {K}}0}} = P_{ VA }^{{\mathcal {K}}}(1,0)Q_{ VA }^{{\mathcal {K}}}(1,0). \end{aligned}$$
(66)

Substituting expressions (66) and (17) into expression (64), and moving \(P_{ VA }^{{\mathcal {K}}}(1,0)\) from the left-hand side to the right-hand side, delivers

$$\begin{aligned} Q_{ VA }^{{\mathcal {K}}}(1,0) = \prod _{k \in {\mathcal {K}}}\left( \frac{P_{ VA }^{k}(1,0)}{P_{ VA }^{{\mathcal {K}}}(1,0)}Q_{ VA }^{k}(1,0)\right) ^{\Psi ^{k}(1,0)} \end{aligned}$$
(67)

as aggregation rule. Substituting now expression (28), with superscripts k added in appropriate places, into expression (67) delivers our final result:

$$\begin{aligned} Q_{ VA }^{{\mathcal {K}}}(1,0)= & {} \prod _{k \in {\mathcal {K}}}\left( \frac{P_{ VA }^{k}(1,0)}{P_{ VA }^{{\mathcal {K}}}(1,0)}\right) ^{\Psi ^{k}(1,0)} \prod _{k \in {\mathcal {K}}}\left( ITFPROD _{ VA }^{k}(1,0)\right) ^{\Psi ^{k}(1,0)} \\{} & {} \times \prod _{k \in {\mathcal {K}}}\left( Q_{K}^{k}(1,0)^{\phi _K^k(1,0)}\right) ^{\Psi ^{k}(1,0)} \prod _{k \in {\mathcal {K}}}\left( Q_{L}^{k}(1,0)^{\phi _L^k(1,0)}\right) ^{\Psi ^{k}(1,0)}. \nonumber \end{aligned}$$
(68)

A number of comments are worth making. First of all, it must be noted that, apart from the first factor on the right-hand side, expression (68) has the same structure as expression (28). This first factor measures differential price change and becomes unity if and only if

$$\begin{aligned} P_{ VA }^{{\mathcal {K}}}(1,0) = \prod _{k \in {\mathcal {K}}}\left( P_{ VA }^{k}(1,0)\right) ^{\Psi ^{k}(1,0)}; \end{aligned}$$
(69)

that is, \(P_{ VA }^{{\mathcal {K}}}(1,0)\) is a Sato-Vartia (SV) index of the underlying indices \(P_{ VA }^{k}(1,0)\) \((k \in {\mathcal {K}})\).Footnote 11

The second factor on the right-hand side of expression (68) is a weighted mean of individual value-added based productivity indices. The third factor is an aggregate measure of capital input quantity change, whereas the fourth factor is a similarly weighted measure of labour input quantity change. These weights deserve some attention.

Notice that the capital input weights \(\phi _K^k(1,0)\Psi ^{k}(1,0)\) as well as the labour input weights \(\phi _L^k(1,0)\Psi ^{k}(1,0)\) \((k \in {\mathcal {K}})\) do not add up to 1. Each capital input weight approximates a production unit’s share in aggregate value added times its share of capital in joint capital input and labour cost. Similarly, each labour input weight approximates a production unit’s share in aggregate value added times its share of labour in joint capital input and labour cost. Together, their sum is smaller than unity; that is,

$$\begin{aligned} \sum _{k \in {\mathcal {K}}}\left( \phi _K^k(1,0)\Psi ^{k}(1,0) + \phi _L^k(1,0)\Psi ^{k}(1,0)\right) \le 1. \end{aligned}$$
(70)

Again, there is no reason to interpret this inequality in terms of returns to scale as it is entirely due to the concavity of the logarithmic mean.

6 Some concluding thoughts

Why is the Cobb-Douglas (production) function so popular? Biddle (2021, 312) ascribes its success story to a number of factors: 1. the link to the neo-classical paradigm, 2. the flexibility/adaptability of the function, 3. the rhetorical talents of its protagonist, 4. the research and teaching activities of a (small) number of charismatic personalities. Add to this "the prima facie plausibility of the results of Douglas’s research with the regression", and those of many of his followers in a variety of applications.

Why this plausibility? The simple answer appears to be: because the CD structure is an alternative representation of the accounting identity coming with the data. At any level of aggregation, real input and output data are not observables but constructs. Given are nominal data, tied together by an accounting identity that in most general terms states that revenue equals cost plus profit. In this article it is rigorously demonstrated that the real version of this identity exhibits the CD pattern, whatever the underlying data-generating mechanism may be. This explains the good ‘fit’ one usually encounters in empirical work.Footnote 12

With hindsight, Phelps Brown (1957) must be given the credit for having noticed this for the first time, though his explanation was a bit heuristic and needed some assumptions. In deriving the key results in the previous sections of this paper no structural or behavioural assumptions were invoked. It was all a matter of accounting and playing with identities, realizing that addition and multiplication are isomorphic operations.

The history as documented by Biddle (2021) makes clear that through time the status of the CD function has changed from being a ’law’, ideally holding for the input and output of aggregates such as the manufacturing industry, to a handy measurement tool that can be used in very different research situations.Footnote 13 A parallel development was the transformation of the neo-classical paradigm from a set of testable propositions to a set of articles of faith.

Modern production theoryFootnote 14 starts with the concept of a technology set rather than a production function. A technology set contains all the input–output combinations that are feasible in a certain period of time, and to which all the production units under consideration have access. Under mild conditions such a set can be represented by a radial output distance function, of which the production function is a specific materialization. Basically, the production function specifies the boundary (aka frontier) of the technology set.

To be more specific, let us return to the setting of Section 5. We know that the nominal accounting identity in expression (62) implies that

$$\begin{aligned} (X_{K}^{k}(t,b),X_{L}^{k}(t,b), RVA ^{k}(t,b)) \in S^{t} \, \, (k \in {\mathcal {K}}), \end{aligned}$$

where real values were defined in expressions (4), (5) and (6), and \(S^{t}\) denotes the technology set of period t. The production functionFootnote 15 corresponding with \(S^{t}\) is given by

$$\begin{aligned} F^{t}(X_{K},X_{L}) \equiv \max \{ RVA \mid (X_{K},X_{L}, RVA ) \in S^{t}\}, \end{aligned}$$
(71)

where \(X_{K}\), \(X_{L}\), and \(RVA\) denote generic variables. It then immediately follows that

$$\begin{aligned} RVA ^{k}(t,b) \le F^{t}(X_{K}^{k}(t,b),X_{L}^{k}(t,b)) \, \, (k \in {\mathcal {K}}). \end{aligned}$$
(72)

Since \(RVA ^{k}(t,b) = VA ^{kb}Q_{ VA }^{k}(t,b)\), by substituting expression (28) we obtain

$$\begin{aligned}{} & {} VA ^{kb} ITFPROD _{ VA }^{k}(t,b) Q_{K}^{k}(t,b)^{\phi _K^{k}(t,b)}Q_{L}^{k}(t,b)^{\phi _L^{k}(t,b)}\\{} & {} \quad \le F^{t}(X_{K}^{k}(t,b),X_{L}^{k}(t,b)) \, \, (k \in {\mathcal {K}}),\nonumber \end{aligned}$$
(73)

or

$$\begin{aligned}{} & {} ITFPROD _{ VA }^{k}(t,b) \le \frac{F^{t}(X_{K}^{k}(t,b),X_{L}^{k}(t,b))}{ VA ^{kb}Q_{K}^{k}(t,b)^{\phi _K^{k}(t,b)}Q_{L}^{k}(t,b)^{\phi _L^{k}(t,b)}} \, \, (k \in {\mathcal {K}}). \end{aligned}$$
(74)

Using the definitions of the real values in the numerator the last expression can be rewritten as

$$\begin{aligned} ITFPROD _{ VA }^{k}(t,b) \le \frac{F^{t}(C_{K}^{kb}Q_{K}^{k}(t,b),C_{L}^{kb}Q_{L}^{k}(t,b))}{ VA ^{kb}Q_{K}^{k}(t,b)^{\phi _K^{k}(t,b)}Q_{L}^{k}(t,b)^{\phi _L^{k}(t,b)}} \, \, (k \in {\mathcal {K}}). \end{aligned}$$
(75)

It is clear that there are several degrees of freedom here; most notably, the choice of production-unit-specific quantity indices (or, equivalently, deflators) for capital and labour. However, as these indices occur simultaneously in numerator and denominator, the upper bound on productivity growth should not be too dependent on particular choices. The specification of the function \(F^{t}(.)\), however, is more important. Thus, to avoid unwarranted impositions this calls for a specification of the technology set \(S^t\) as a close envelopment of the data, in line with the conclusion of Hildenbrand (1981).Footnote 16