Our empirical analysis is based on two firm-level equations which we specify and estimate jointly: a production function and a wage equation. The production function is used to capture productivity effects related to absenteeism and team work at the firm level, and the wage equation is to capture the corresponding wage effects. By simultaneously estimating the two equations, we can compare the productivity effects with wage effects to determine the equality of marginal productivity and wages. The traditional approach of estimating the wage equation alone to measure the impact of absenteeism does not fully capture productivity differentials associated with different levels of absenteeism.
We think it is useful to baseline our results with an estimate of economy-wide aggregate effects. Thus we begin by estimating a baseline model that restricts the effect of absenteeism to be the same for team workers and non-team workers and in small and large firms. We subsequently relax these restrictions by assuming that absenteeism affects team workers and non-team workers differently, and then by estimating our model separately for small and large firms.
Production function
Our baseline specification of the production function is an extension of the standard Cobb-Douglas [27, 28, 42, 43]. See Additional file 1: Appendix B for its complete deviation. Because the Cobb-Douglas form is restrictive, we assess the robustness of our estimates to more general alternatives described in Section 3.4.
For each workplace, we start with a simple Cobb-Douglas production function:
$$ \mathrm{In}\;{Q}_j=\alpha\;\mathrm{In}\;{L}_j^A+\beta\;\mathrm{In}\;{K}_j+\upeta {F}_j+{\mu}_j $$
(1)
where Q
j
is output, measured as value added by firm \( j,\ {L}_j^A \) is an aggregate labour input defined below, K
j
is the capital stock, F
j
is a matrix of various firm characteristics, α, β are the elasticity of output with respect to labour and capital, respectively, η is a vector of parameters for firm characteristics and μ
j
is the error term.
We divide the labour input into different worker types, that is, workers with different characteristics such as age, sex, education, occupation and team participation. If the total number of characteristics is I and workers are divided into V
i
categories by each characteristic i, then the total number of worker types will be \( {\displaystyle {\prod}_{i=1}^I{V}_i} \). Our aggregate labour input \( {L}_j^A \) can be simplified after making several assumptions: First, we assume perfect substitutability among all types of workers and different marginal productivity for each worker type [27, 28]. Second, we assume that the proportion or distribution of one type of worker defined by one characteristic is constant across all other characteristic groups, which is referred to as the equi-proportionate restriction [27, 28].Footnote 1 Third, we assume the relative marginal productivity of two types of workers within one characteristic group is equal to those within another characteristic group, which is referred to as the equal relative productivity restriction [27, 28].Footnote 2 Fourth, attendance rates have the same marginal impact on productivity for different worker types.
The aggregate labour input can then be written as (equation 8 from Additional file 1: Appendix B):
$$ {L}_j^A={\left(1-{a}_j\right)}^{\theta }{\lambda}_{0,I}{L}_j\left(1+\left({\gamma}_G-1\right){P}_{\;Gj}\right){\displaystyle \prod_{i=1}^{I-1}\left(1+{\displaystyle \sum_{v=1}^{Vi-1}\left({\gamma}_{iv}-1\right){P}_{ivj}}\right)} $$
(2)
where a
j
is the absence rate in firm j, L
j
is the number of all workers in the firm j, P
Gj
is the proportion of team workers among all workers in the firm j, i = 1, 2, …, I-1 indicates worker characteristics other than team participation, v
i
= 1, 2, …, V
i
-1 represents worker categories divided according to the worker characteristic i, \( {P}_{ivj}=\frac{L_{ivj}}{L_j} \) is the proportion of the worker type iv among all workers in the firm j, θ is the parameter of (1-absence rate), i.e., the attendance impact on the marginal productivity for any worker type, λ
0,I
is the marginal productivity for the reference group when work force is divided by I characteristics and absence rate = 0, γ
G
is the relative marginal productivity of team workers compared to non-team workers, and \( {\gamma}_{iv}=\frac{\lambda_{iv}}{\lambda_{io}} \) is the relative marginal productivity of one worker type iv to the worker type i0 for each characteristic i.
By substituting \( {L}_j^A \) into the simple production function, equation 1, we obtain our baseline specification (equations 9 and 10 from Additional file 1: Appendix B), i.e., a “restricted model” as follows:
$$ \begin{array}{l}\mathrm{In}\;{Q}_j={\beta}_0+\beta\;\mathrm{In}\;{K}_j+\alpha\;\mathrm{In}\;{L}_j+\alpha \theta\;\mathrm{In}\left(1-{\alpha}_j\right)+\alpha\;\mathrm{In}\left(1+\left({\gamma}_G-1\right){P}_{Gj}\right)\\ {}+\alpha {E}_j+\upeta Fj+\mu j\end{array} $$
(3)
Where
$$ {E}_j={\displaystyle \sum_{i=1}^{I-1}\mathrm{In}\left(1+{\displaystyle \sum_{v=1}^{Vi-1}\left({\gamma}_{iv}-1\right){P}_{ivj}}\right)} $$
(4)
E
j
refers to workforce characteristics other than team participation, and β
0 is a constant term that incorporates a In λ
0,I
.
In addition, we relax the fourth assumption for team-work participation, that is, the attendance impact on the marginal productivity for team workers (θ
G
) is different from that for non-team workers (θ
N
). A relatively “complete model” (equations 12 and 13 from Additional file 1: Appendix B) is therefore presented as:
$$ {L^A}_j={\lambda}_{0,I}{\left(1-{a}_j\right)}^{\theta_N}{L}_j\left(1+\left({\gamma}_G{\left(1-{a}_j\right)}^{\theta_G-{\theta}_N}-1\right){P}_{Gj}\right){\displaystyle \prod_{i=1}^{I-1}}\left(1+{\displaystyle \sum_{v=1}^{V_i-1}}\left({\gamma}_{iv}-1\right){P}_{ivj}\right) $$
(5)
and,
$$ \begin{array}{l} \ln {Q}_j={\beta}_0+\beta \ln {K}_j+\alpha \ln {L}_j\\ {}\kern1.68em +\alpha {\theta}_N \ln \left(1-{a}_j\right)+\alpha \ln \left(1+\left({\gamma}_G{\left(1-{a}_j\right)}^{\theta_G-{\theta}_N}-1\right){P}_{Gj}\right)\\ {}\kern1.68em +\upalpha {E}_j+\upeta {F}_j+{\mu}_j\end{array} $$
(6)
Wage equation
Applying the same approach as above, wage effects can be estimated through the relationship between payroll and average absence rate and share of workers participating in a team at the firm level. We write the aggregate wage w
j
as the sum of wage for each worker type. Applying the same assumptions in the production function, the aggregate wage can be simplified as:
$$ {w}_j={w}_{0,I}{\left(1-{a}_j\right)}^{\zeta }{L}_j\left(1+\left({\phi}_G-1\right){P}_{Gj}\right)\ {\displaystyle \prod_{i=1}^{I-1}}\left(1+{\displaystyle \sum_{v=1}^{V_i-1}}\left({\phi}_{iv}-1\right){P}_{ivj}\right) $$
(7)
where w
j
is the annual payroll of firm j, w
0,I
is the wage for the reference group when work force is divided by I characteristics, ζ is the parameter of attendance rate, i.e., the attendance impact on wages for any worker type, ϕ
G
is the relative wage of team workers to non-team workers, \( {\phi}_{iv}=\frac{w_{iv}}{w_{i0}} \) is the relative wage of one worker type iv to the worker type i0 for each characteristic i other than team participation.
After log transforming equation 7, the “restricted model” for wage equation is written as:
$$ \ln {w}_j={\beta}_{w0}+{\beta}_w \ln {K}_j+{\alpha}_w \ln {L}_j+\zeta \ln \left(1-{a}_j\right)+ \ln \left(1+\left({\phi}_G-1\right){P}_{Gj}\right)+{E}_{wj}+{\upeta}_{\mathrm{w}}{F}_j+{\mu}_{w,j} $$
(8)
where,
$$ {E}_{wj}={\displaystyle \sum_{i=1}^{I-1}} \ln \left(1+{\displaystyle \sum_{v=1}^{V_i-1}}\left({\phi}_{iv}-1\right){P}_{ivj}\right) $$
(9)
β
w0 is a constant term incorporating w
0,I
, α
w
, β
w
are the elasticity of wage with respect to labour and capital, respectively, ηw is a vector of parameters for firm characteristics and μ
w,j
is the error term.
Correspondingly, we assume the attendance impact on wages differ by team participation and thus the relatively “complete model” becomes:
$$ {w}_j={w}_{0,I}{\left(1-{a}_j\right)}^{\zeta_N}{L}_j\left(1+\left({\phi}_G{\left(1-{a}_j\right)}^{\zeta_G-{\zeta}_N}-1\right){P}_{Gj}\right){\displaystyle \prod_{i=1}^{I-1}}\left(1+{\displaystyle \sum_{v=1}^{V_i-1}}\left({\phi}_{iv}-1\right){P}_{ivj}\right) $$
(10)
and
$$ \begin{array}{l} \ln {w}_j={\beta}_{w0}+{\beta}_w \ln {K}_j+{\alpha}_w \ln {L}_j\\ {}\kern1.68em +{\zeta}_N \ln \left(1-{a}_j\right)+ \ln \left(1+\left({\phi}_G{\left(1-{a}_j\right)}^{\zeta_G-{\zeta}_N}-1\right){P}_{Gj}\right)\\ {}\kern1.68em +{E}_{wj}+{\upeta}_{\mathrm{w}}{F}_j+{\mu}_{w,j}\end{array} $$
(11)
where ζ
N
is the impact of attendance rate for non-team workers and ζ
G
is the impact of attendance rate for team workers.
Estimation
We estimate the production function and wage equation simultaneously via nonlinear least squares (NLS) [27, 28]., under the assumption that errors are correlated across equations (nonlinear seemingly unrelated regression).Footnote 3 All observations are weighted using linked weights provided by Statistics Canada. All standard errors are computed as Statistics Canada’s recommended procedure [44] using 100 sets of provided bootstrap sample weights.
Our null hypothesis of primary interest is that the attendance coefficient in the production function equals the coefficient in the wage equation. In the restricted model, the equality of marginal productivity and wage is tested by comparing the attendance coefficients, θ and ζ. In the complete model, we compare the two coefficients for team workers, θ
G
and ζ
G
, and those for non-team workers, θ
N
and ζ
N
, respectively. We also test the equality of relative productivity of team workers to non-team workers and their relative wage by comparing (λ
G
− 1) and (ϕ
G
− 1).
In order to examine whether parameter estimates vary by firm size, we conduct our analyses separately on two sub-samples: small firms with less than 20 employees and large firms (the remainder).
Robustness
We undertake further analyses to assess the robustness of our estimates. First, we relax restrictions on the functional form of our production function by estimating a specification using the much more flexible translog form. Second, we re-estimate our model using total compensation (payroll plus non-wage benefits) instead of payroll as the outcome of the wage equation.
Third, a key issue in the estimation of production functions is the potential correlation between input levels and unobserved firm-specific productivity shocks. Firms that have a large positive productivity shock may respond by using more inputs, giving rise to an endogeneity issue [45]. Following Hellerstein et al. [27], we address this issue by using value-added as the measure of output in the production function to avoid estimating a coefficient on materials. We also attempt to correct for the potential bias by estimating the model on first differences, which eliminates the effect of any time-invariant unobserved heterogeneity that jointly affects productivity and wages. We also apply Levinsohn and Petrin’s approach [46] using intermediate inputs (expenses on materials which are subtracted out in our value-added production function) to address the simultaneity problem. Specifically, we estimate parameters of our value-added production function using NLS by adding a third-order or a fourth-order polynomial approximation in capital and material inputs [47].
Finally, we conduct sensitivity analyses to examine the impacts of some of the assumptions embodied in our baseline specification. We relax the equi-proportionate restriction between occupation, age, sex, education (> university bachelor versus bachelor and below) and team participation, respectively.Footnote 4 That restriction also implies that the firm-average absence rate is common to all worker types. To test the impact of this assumption, we allow the average absence rate to differ for team workers and non-team workers in each firm. That is, the firm-average absence rate in the complete model is replaced with the firm-average absence rate of team workers and the absence rate of non-team workers, correspondingly, as follows.
$$ \begin{array}{l}{L^A}_j={\left(1-{a}_{Gj}\right)}^{\theta_G}{\lambda}_{G,0,I-1}{L}_{Gj}{\displaystyle \prod_{i=1}^{I-1}}\left(1+{\displaystyle \sum_{v=1}^{V_i-1}}\left({\gamma}_{iv}-1\right){P}_{ivj}\right)\\ {}\kern1.44em +{\left(1-{a}_{Nj}\right)}^{\theta_N}{\lambda}_{N,0,I-1}{L}_{Nj}{\displaystyle \prod_{i=1}^{I-1}}\left(1+{\displaystyle \sum_{v=1}^{V_i-1}}\left({\gamma}_{iv}-1\right){P}_{ivj}\right)\\ {}\kern1.56em ={\lambda}_{0,I}{\left(1-{a}_{Nj}\right)}^{\theta_N}{L}_j\\ {}\left(1+\left({\gamma}_G\frac{{\left(1-{a}_{Gj}\right)}^{\theta_G}}{{\left(1-{a}_{Nj}\right)}^{\theta_N}}-1\right){P}_{Gj}\right){\displaystyle \prod_{i=1}^{I-1}}\left(1+{\displaystyle \sum_{v=1}^{V_i-1}}\left({\gamma}_{iv}-1\right){P}_{ivj}\right)\end{array} $$
(12)
and
$$ \begin{array}{l} \ln {Q}_j={\beta}_0+\beta \ln {K}_j\\ {}\kern1.68em +\alpha \ln {L}_j+\alpha {\theta}_N \ln \left(1-{a}_{Nj}\right)+\alpha \ln \left(1+\left({\gamma}_G\frac{{\left(1-{a}_{Gj}\right)}^{\theta_G}}{{\left(1-{a}_{Nj}\right)}^{\theta_N}}-1\right){P}_{Gj}\right)\\ {}\kern1.68em +\upalpha {E}_j+\upeta {F}_j+{\mu}_j\end{array} $$
(14)