Introduction

Economic research is key to understanding sickness absence and its connection with the labor market (Boone & van Ours, 2006; Boone et al., 2011; Pichler, 2015). Workplace injuries have cost the European Union over EUR 460 billion in 2019 (European Commission, 2021), and behavioral effects play an important role in explaining this cost (Koning et al., 2022; Martin-Roman & Moral, 2016; Martín-Román & Moral, 2017). Despite these figures, the literature examining gender/sex differences in the duration of sick leave is lacking, considering the behavioral response of workers to a set of incentives and focusing on biomedical ex-post (after the injury has been reported) differences between male and female workers. Therefore, from a policy-making perspective, understanding the main drivers behind those differences is important to better implement measures aimed at enhancing social welfare.

This study aims to fill this gap by studying the durations of sick leave in men and women related to workplace accidents (hereafter, we will refer to it as “sick leave”). It is also worth mentioning that our main interest is not only measuring the differences between both sexes but also identifying the underlying physiological factors behind them. Moreover, we intend to measure differences as a result of different choices (i.e., behavioral responses) made by men and women, which lead to different outcomes.Footnote 1 Finally, we check whether these differences are the same at the top and bottom of the income distribution ladder, as the behavior of each individual is influenced by different economic incentives such as sick pay (Koning et al., 2022; Puhani & Sonderhof, 2010), salary (Brekke et al., 2020), or income in general (Ásgeirsdóttir & Jóhannsdóttir, 2017). Thus, the research questions of this study can be stated as follows:

  1. (1)

    Are there differences in the sick leave durations between male and female employees as a result of a workplace injury? If so, are these differences a consequence of biomedical traits or behavioral responses?

  2. (2)

    Are these differences constant across income percentile levels?

To answer the abovementioned research questions, we used the stochastic frontier methodology. This technique allows us to divide the total sick leave duration into two different parts: one associated with physiological factors and one related to behavioral responses of workers to socioeconomic incentives (Martín-Román & Moral, 2014, 2017). We draw the main conclusions obtained in this research by analyzing both components.

The results show that if we assume that men and women share the same biological characteristics (i.e., they recover at the same rate after an injury or illness), the behavioral responses were similar between the two groups when looking at the whole distribution.. However, our findings changed when considering statistically the existence of physiological differences. Overall, we observed that women had poorer health than men, resulting in longer physiological recovery periods than them for the same type of injury, severity, and other biological determinants. In turn, this shows that women exhibit less opportunistic behavior in general terms.

When looking at wage distribution, significant outcomes emerged. As the wage rises, different patterns of opportunistic behavior between men and women occur. The opportunistic behavior in men increased with wage monotonically, whereas in women such opportunistic behavior increased first, but after reaching a relative maximum, their behavioral response substantially declined in the upper part of the wage distribution level. Because of these patterns, the gender gap in the sick leave duration is of a significant size for high wage levels. Hence, the developed empirical evidence follows the observed gender differences in the labor market of Western economies, where women tend to face a glass ceiling in their professional advancement, and attempt to break it by trying harder than men.

This study includes valuable evidence to the literature that covers gender differences in the workplace. In particular, using a stochastic frontier approach, we clearly distinguished the physiological factors from behavioral factors for the duration of sick leave. This approach, to the best of our knowledge, implies a twofold novelty—a methodological advance and further evidence on the differences in potential opportunistic behavior between men and women.

Additionally, we also tested whether behavioral differences between men and women were constant across the wage distribution to better understand the whole phenomenon. This allowed us to check whether injured women behaviorally react to break the glass ceiling. Applying the theories of compensating wage differentials and efficiency wages, some firms may be willing to tolerate some level of shirking in exchange for lower wages or offer higher wages to reduce the probability of absenteeism (Suárez & Muñiz, 2018).

From a policy standpoint, the results of this study are valuable in designing well-targeted measures. On the one hand, policymakers may identify to what extent the length of sick leave is due to physiological or behavioral issues and how these two components affect each gender or sex differently. This evidence-based knowledge can lead to an improved public health policy design, which will help policymakers to decide whether to devote more financial resources to workplace safety instruments or to monitor activities aimed at reducing opportunistic behaviors. On the other hand, the results can also serve as the basis for implementing antidiscrimination policies, pointing toward different behavioral responses between men and women at the upper part of the wage distribution level.

The rest of the paper is organized as follows. Section two presents and discusses the literature related to health differences between men and women at work. In section three, we review the disability and work-related accident schemes in Spain. Sections four and five describe the database and accounts for the methodological approach employed in this research. Section six shows the main results obtained. Section seven includes a robustness check. Section eight discusses such results, emphasizing the policy implications, and provides some recommendations. Finally, section nine concludes.

Sex, gender, and sick leave

The duration of sick leave is a gender- and sex-dependent phenomenon. In general, it is well known that women tend to extend their recovery periods longer than men (Fontaneda et al., 2019; Moral de Blas et al., 2012) and there exist some hypotheses that explain this situation (Østby et al., 2018).

For example, women are more involved in domestic chores than men (such as cooking, cleaning, childcare.),Footnote 2 implying a double burden and higher depression levels when they are employed (Pinquart & Sörensen, 2006), leading to longer recovery periods.

Sex differences in labor participation also play a role in explaining the gender/sex gap in the duration of sick leave. The percentage of women actively working in Spain is lower compared with that of men,Footnote 3 and they are also more engaged in precarious jobsFootnote 4 (Franco & Winqvist, 2002; Kim et al., 2008; Menéndez et al., 2007), which leads to high stress levels and insecurity, which negatively affects their health (Rugulies et al., 2008).

Additionally, men and women do not present the same prevalence of illnesses. Women are more prone to present conditions that disable them on a daily basis, such as rheumatism, anemia, thyroid, eczema, headaches, or mental illnesses, whereas men present high odds of fatal illnesses, such as cardiovascular diseases, stroke, lung and kidney diseases, or liver cirrhosis (Case & Paxson, 2005; Macintyre et al., 1996). Women also tend to present higher rates of work disability, that is, physical or psychological symptoms that do not necessarily prevent them from returning to work but complicate their recovery return after their sick leave (Coutu et al., 2021).

Therefore, it is important to distinguish between sex and gender when tackling differences in the duration of sick leave. Sex refers to the physical and biological characteristics that define a male or female, which includes hormonal and anatomic differences, whereas gender refers to the roles, behaviors, and personal identity (Bechtel & Huffmyer, 2020).

Biological differences between men and women may partly explain why women present longer sick leave durations. Women are more likely to suffer from musculoskeletal diseases compared with men,Footnote 5 which leads them to require longer recovery periods after injury and surgery (Wolf et al., 2015). Because of reproductive biology and conditions specific to gender, they use more healthcare services than men (Kananurak, 2014). Women are also more prone to return to hospital after a cardiac surgery compared with their male counterparts because they have smaller coronary arteries and lower surface area, which complicates the efficacy of a bypass and grafts (Bechtel & Huffmyer, 2020). Moreover, women tend to present higher odds of dysphoria, anxiety, and depression after these types of surgical interventions or traumatic events, thus expanding their sick leave durations (Freedman et al., 2002; Kempen et al., 2003; Modica et al., 2014; Oksuzyan et al., 2018).

From a gender viewpoint, there still exists a traditional role attached to women that may lead them to extend their return to work (Côté & Coutu, 2010), as stated at the beginning of this section. However, there are also gender differences in behavior/preferences (Schünemann et al., 2017) affecting the health of each individual (Bauer et al., 2007; Nelson, 2014), accounting for the gender gap in sick leave duration as well. For example, women are more risk averse, less competitive, and more context-sensitive in making their decisions than men (Croson & Gneezy, 2009). Additionally, women present higher social risk aversion (i.e., when individual decisions influence not only the individual’s payoffs but also the payoffs of another individual) and higher inequality aversion compared with their male counterparts (Friedl et al., 2020).

These gender differences in behavior were also tested during the COVID-19 pandemic. Women were more likely to accomplish with the public measures imposed by the different governments than men (Galasso et al., 2020). This may be because women are more aware about their health status than men (Idler, 2003), which leads them to adopt “cautious behaviors” when returning to work after their sick leave.

Sick leave benefits in Spain

In Spain, when a worker is temporarily unable to work as a result of an illness or an accident, whether work-related or non-work-related, the Social Security Administration covers the medical expenses of the injured worker, as well as compensates the loss of income during his/her temporary incapacity (TI). The maximum duration of the TI is 365 days, potentially extendable 180 days more only if the worker is presumed to recover from his/her incapacity during this extension.Footnote 6 After this period, the National Institute of Social Security is the competent authority to determine whether the worker receives a medical discharge, or is transferred to the permanent disability system.

The TI benefits consist of a subsidy equivalent to a percentage of the contributory base, which is mainly the wage earned during the previous months. However, depending on whether the TI was caused by a common illness or by a working accident, or a professional illness, the rules of receiving the TI benefits change. If the TI is due to a common illness (i.e., non-work-related illness or injury), during the first three days of sick leave the worker does not receive any amount of sick leave. From the 4th and 20th day the worker receives the 60% of the reference wage (i.e., the previous month's wage), and finally, he/she will receive the 75% from the 21st day onwards. The employer has to pay the injured worker from the 4th to the 15th day, and from the 16th day onwards, the Social Security Administration assumes this payment. On the contrary, when the TI was caused by a working accident or professional illness, the worker receives the 75% of the reference wage from the day after the general practitioner issues the sick leave certificate, and this is paid by the mutual insurance company (see Fig. 1).

Fig. 1
figure 1

Source: Own elaboration

TI benefits.

For private sector employees, the law that regulates sick leave benefits is the Social Security General Law (SSGL) of 1994, and the rules affecting them have been unchanged since then. In the case of public sector workers, the law changed in 2012 in an attempt to limit public expenditure. The interested reader can find a recent study that explored the impact of this legal change on work absence in Spain (Marie & Vall Castello, 2022). In this paper we focus our attention on private sector workers who are on TI due to a working accident, therefore, no law change has occurred during the period analysis of our study. Article 156 of the SSGL defines a working accident as any bodily injury suffered by a worker that occurred because of executing paid employment.Footnote 7

Regarding contributory base, this is estimated as a function of the previous month's wage (with upper and lower limits) earned by the worker before the working accident. Upper limits of the contributory base are yearly established in the State General budget and they are the same for all professional categories and contingencies (Art. 148 of the SSGL). In the case of lower limits of the contributory base (Art. 19.2 of the SSGL), the amount corresponds to the current minimum wage of each year, increased by one-sixth, unless expressly provided otherwise. The contributory base has not changed during the period analysis of our study either.

Database

In this study, the dependent variable is the sick leave duration resulting from a workplace accident. As a result, the database offering the best information is the Statistics of Accidents at Work (SAW) published by the Spanish Ministry of Labor. This is an annual register of any occupational accident resulting in the inability of an individual to work in Spain. Moreover, this database provides information regarding the characteristics of the injured person, enterprise, workplace, accident, and information about the nature and seriousness of the injuries.

In this study, we used the 2011–2019 waves of the SAW. Figure 2 shows the detailed flow chart of the data cleansing process. We first filtered this database to eliminate register errors and to obtain homogeneous information that allows us to detect the causal effects we are interested in. Given that we aim to analyze the behavior of workers according to the economic compensation received when they are on leave, the compensation variable was the first one used to filter the sample. Thus, all those accidents that reported a compensation outside the limits registered by the Social Security were eliminated, and the existing inflation in the period was corrected.Footnote 8 This procedure does not imply that we have removed accidents of the high contributory base, but all those accidents that are attached to compensations that are not legally possible. We also excluded registration errors referring to nonexistent contracts or ages incompatible with the labor market. In the next step, data on fatal accidents were omitted because after these types of accidents, it is not possible to detect any worker’s behavior. Finally, part-time workers were eliminated because they might affect the values of the daily compensation received and the self-employed were also excluded because they follow a different incentive scheme compared with employees. Evidence from a Dutch sample suggested that their sick leave durations are not affected by moral hazard when considering the level of economic compensation (Spierdijk et al., 2009). After this data cleansing, the final database comprised 3,916,249 records of occupational accidents with sick leave.

Fig. 2
figure 2

Source: Own elaboration

Data cleansing flow chart.

Table 1 gives an initial review of the data regarding the length of each period of sick leave broken down into the following: the sex of the worker, the percentage of males, and the number of observations by different characteristics. Aggregate figures show that 72% of injured workers are male and that the average duration is 0.8 days longer for women. There is also a gradual increase in the duration over time. The sick leave duration among women increased by 4.5 days from 2011 to 2019, whereas in men, it was 3.5 days longer over the period analyzed.

Table 1 Mean durations by gender, percentage of males and number of injuries in terms of various characteristics. Source: Author’s own based on SAW data

Following the compensation variable, some interesting results were also observed. As the amount of compensation rose, the duration of sick leave was longer. Moreover, the percentage of men in higher percentile levels was wider than that of women. Finally, we observe an increase in the difference in duration between men and women as the amount of compensation rises. Under the median, the sick leave duration of women exceeded that of men in less than 1 day (0.3 days between percentiles 5 and 10). However, over percentile 50, this difference was over 2 days.

Methods

Different estimation techniques were applied to the empirical literature analyzing the duration of accident-related sick leave. Some authors used simple regression techniques, such as ordinary least squares (OLS) (Campolieti & Hyatt, 2006; Krueger, 1990) or hazard models (Corrales Herrero et al., 2008; Moral de Blas et al., 2012; Murcia López et al., 2016). Other works used the days out of work as a count variable and applied count data models (Delgado & Kniesner, 1997; Frick & Malo, 2008; Lechmann & Schnabel, 2014) or zero-inflated negative binomial models (Mukuria et al., 2017). There are also some studies that gauged on how changes in legislation affected sick leave duration using natural experiments (Goerke & Pannenberg, 2015; Meyer et al., 1995; Ziebarth & Karlsson, 2010). Fortin and Lanoie (2000) presented a detailed discussion on the different estimation methods within this context.

This paper adopts a different and more recent approach based on the stochastic frontier analysis. Although this type of analysis is more common in papers related to production or cost theory, there already exist previous studies that applied it to the duration of sick leave (Martín-Román & Moral, 2014, 2017). In this case, the variable of interest is the days a worker is absent from work due to an occupational accident. These absence days are considered a cost determined by medical variables such as the type of injury suffered, its severity, the injured part of the body, and other factors such as individuals’ age or gender if we consider physiological differences between men and women. All these variables form the inputs of our function. With this approach, the efficient durations (i.e., cost efficiency) would correspond to those injured workers who present a shorter recovery period in similar circumstances (inputs). This similarity covers presenting the same physiological characteristics (age and sex), suffering from the same injury with the same severity, and that affects the same part of the body. The distance between that minimum duration and the actual duration of sick leave determines the degree of inefficiency related to the recovery process of injured workers.

From this interpretation of the stochastic frontiers, this methodology allows us to obtain a lower boundary attributed to medical and physiological factors labeled as “standard sick leave duration” \({\text{(D}}_{\rm i}^{{\rm s}}{)}\). This minimum recovery spell required to return to work is defined by the following expression:

$${\text{d}}_{{\rm i}}^{{\rm s}} = {\text{X}}_{{\rm i}}\beta + {\text{v}}_{{\rm i}},$$
(1)

where \({\text{d}}_{{\rm i}}^{{\rm s}}\) is the logarithm of the minimum duration (\({{\text{d}}_{{\rm i}}^{{\rm s}}{ = ln(D}}_{{\rm i}}^{{\rm s}}{)}\)), \(\beta\) is a vector of coefficients, \({\text{X}}_{{\rm i}}\) is a vector of individual characteristics, and \({\text{v}}_{{\rm i}}\) is a random error of mean zero and variance \({\sigma}_{\text{v}}^{2}\).

However, there is an information problem that prevents the minimum duration being known. The insurer only perceives the actual duration \({\text{(D}}^{{\rm r}}{)}\), which tends to be longer than the standard duration \({\text{(D}}^{{\rm r}} \, \geq \, {\text{D}}^{{\rm s}}{)}\). In addition to medical and physiological aspects,\({\text{D}}^{{\rm r}}\) is the result of the ability of the workers to obtain an additional period of recovery. This ability reflects an opportunistic behavior on the part of the worker, which indicates how secure they perceive their job situation.Footnote 9 Obviously, greater job security would be linked to better working conditions, which enables the worker to engage more intensely in such opportunistic behaviors. This leads us to conclude that we face a problem of asymmetric information that insurance companies assume when monitoring workers during their work absence.

Therefore, there is a real duration that is the result of adding the standard duration to a non-negative random disturbance. As explained previously, the standard duration forms a lower frontierFootnote 10 that may be estimated based on physiological factors, such as age or the nature of the injury, and that determines the length of recovery time.

$${\text{d}}_{{\rm i}}^{{\rm r}} = {\text{d}}_{{\rm i}}^{{\rm s}} + {\text{u}}_{{\rm i}},$$
(2)

where \({\text{d}}_{{\rm i}}^{{\rm r}}\) is the logarithm of the actual duration \({({\text{d}}_{{\rm i}}^{{\rm r}} = \text{ln}({\text{D}}}_{{\rm i}}^{{\rm r}}\),\({)}{)}\) and \({\text{u}}_{{\rm i}}\) is another error term with a positive mean and variance \({\sigma}_{\text{u}}^{2}\). \({\text{u}}_{{\rm i}}\) must follow a certain statistical distribution to ensure that the model may be estimated. The literature offers several examples of distributions, such as half-normal (Aigner et al., 1977), exponential (Meeusen & van Den Broeck, 1977), truncated normal (Stevenson, 1980), or gamma (Greene, 1980a, 1980b). We decided to use exponential distribution following Meeusen and van Den Broeck (1977). Following (1) and (2), we obtain the following expression:

$${\text{d}}_{{\rm i}}^{{\rm r}} = {\text{X}}_{{\rm i}}{\beta} + {\text{v}}_{{\rm i}}+{\text{u}}_{{\rm i}}$$
(3)

Given that it comprises an error model and the disturbances and regressors are independent, performing an OLS regression would provide unbiased, consistent, and efficient estimators among linear regressions. However, there is inconsistency in the constant term and variances of both disturbances cannot be separated.Footnote 11 Therefore, when using stochastic frontier technique, it is more accurate to use maximum likelihood estimations than OLS.

Additionally, it is possible to calculate the cost efficiency value of each individual using stochastic frontier analysis with the following expression:

$$\text{EF} = \frac{{\text{f}}\left({\text{X}}_{{\rm i}}{\beta}\right)\text{exp}({\text{v}}_{{\rm i}}+{\text{u}}_{{\rm i}}{)}}{{\text{f}}\left({\text{X}}_{{\rm i}}{\beta}\right)\text{exp}({\text{v}}_{{\rm i}}{)}}= \text{exp} {(}{\text{u}}_{{\rm i}}{)}{.}$$
(4)

This efficiency measure is used in this study as an indicator of an opportunistic behavior.

Results

According to what was explained in the Methods section, a cost frontier implies a minimum duration for the target variable. In this concrete case, the estimated frontier represents the minimum duration of sick leave; thus, we only included the variables related to medical and physiological characteristics. These variables included the type of injury; the injured part of the body; age; whether the injured worker was initially treated in a hospital (hospital care); and whether the injury required hospitalization, was serious, or was a recurrence of a previous injury. We also regarded sex because the biological differences between men and women may explain differences in longer sick leave durations. In this case, we obtained different frontiers and recovery periods for the same injuries for men and women alike. To test the possible biological differences between men and women, we conducted some estimations by including interaction terms between the sex variable and each type of injury. This analysis allows us to know whether men and women present different recovery periods for the same injury. Results show that women experience longer sick leave durations for most injuries (Table 7 of the ″Appendix”). Men only present longer sick leave durations in five of the sixteen considered injuries,Footnote 12 suggesting they generally have a better health status than females.

Finally, occupational controls were included because not all jobs required the same level of recovery. Manual work necessarily requires the injured part of the body to be fully recovered to assure a proper return to work, whereas in the case of nonmanual work, this aspect is not essential; for example, a professor can perform his or her work with an arm or leg put in a cast, whereas this would not be the case for a construction worker or a waiter.

We conducted four different specifications to obtain the frontier indicating the minimum duration of sick leave (Table 2). Specification I only included the abovementioned variables, Specification II incorporated individuals’ occupation, Specification III changed Specification I by including individuals’ sex, and Specification IV includes individuals’ sex and occupation. Before the results analysis, it is important to check the existence of inefficiency. To do so, we performed a maximum likelihood test, wherein the null hypothesis was that the variance of the inefficiency term takes a value of zero. If the null hypothesis was true, the stochastic frontier model reduced to an OLS model with normal errors.Footnote 13 In our case, the test rejected the notion that the variance of the disturbance-measuring inefficiency was zero; therefore, in all cases, the stochastic cost frontier was at a 1% level of significance.

Table 2 Frontier estimations of the logarithm of the sick leave duration by specification. Source: Own elaboration based on SAW data

Across all specifications, the longest standard durations corresponded to accidents that led to fractures, traumatic amputation, and heart attacks. Additionally, the accidents that required hospitalization or were labeled as “serious” presented longer standard durations in all specifications. When we included individuals’ occupation, longer standard durations were found among skilled agriculture and fishing workers and machine operators (Specifications II & IV). Including sex in our frontier led men to experience slightly shorter standard sick leave durations compared with women (Specifications III & IV).

Once the standard durations were obtained for each individual, we calculated the estimated efficiency (EF). To do this, we used the quotient between the actual durations and those that would be expected because of medical or physiological factors presented in the Methods section. Thus, we can detect different behaviors between men and women globally and for different compensation levels.

Before obtaining the EF, we checked that all efficiency differences between men and women were statistically significant (see Table 3). We tested whether these differences were different from zero in all specifications and percentile levels and whether they were positive or negative, that is, “favoring” women or men. Analyzing the estimations for the whole distribution, we observed that women were more inefficient in Specifications I and II and more efficient in Specifications III and IV. In all cases, the test accepted the null hypothesis of different means for any significance level. Indeed, it was accepted in the first two specifications that the difference favors women, whereas in Specifications III and IV, the difference favors men. When considering compensation distribution, conclusions were not so homogeneous. In this case, we only considered Models I and III, given that the inclusion of occupational controls did not change the results. In Specification I, we found that men were more efficient if the compensation percentile was below 90. Between percentiles 90 and 95, there were no significant differences, and above percentile 95, women were more efficient. However, when we included the variable sex in the model (Specification III), women were always more efficient, although in the lower part of the distribution, this was significant at 5%.

Table 3 Equality of means hypothesis test for the estimated efficiency by sex

Once the existence of the significant differences was established, the EFs for males and females by compensation percentiles were obtained, as shown in Table 4 (and Figs. 7 and 8 in the ″Appendix”). In addition to performing the analysis for the whole sample of accidents, we also conducted all estimations only including injures that were “hard to diagnose,” that is, strains and sprains. This distinction was made because according to moral hazard literature, in this type of injury where the recovery period is not clearly determined, the existence of opportunistic behaviors among injured workers is common.

Table 4 Estimated efficiency by specification, percentile of the compensation, type of injury and sex

In general, the results show that both sexes extended their recovery periods longer across all percentile levels (i.e., EF > 1). The first two specifications that did not include sex in the frontier estimation indicate that women extended their sick leave duration longer compared with men across all percentile levels, except for the highest percentile level. This fact was reinforced among injuries hard to diagnose, where men extended their recovery periods longer from percentile 90 compared with women. This behavior was not detected when including sex in our frontier (Specifications III & IV). In these cases, men extended their recovery periods longer across all percentile levels. This is because women have longer standard durations.

To complete the analysis, Figs. 3 and 4 present predictions of the efficiency for the whole compensation distribution (for the total accidents and for the hard-to-diagnose groups). These predictions were obtained by regressing the estimated efficiency on compensation and compensation squared as can be seen in the following expression:

$${\text{EF}}_{{\rm i}} = {\beta}_{0}+ {\beta}_{1}{{\text{C}}}_{{\rm i}}+{\beta}_{2}{{\text{C}}}_{{\rm i}}^{2},$$
(5)

where \({\text{EF}}_{{\rm i}}\) is the efficiency measure obtained for an individual \({\text{i}}\) from Eq. 5, \({\text{C}}\) is the variable measuring the received compensation by the injured worker, and \({\beta}_{{\rm i}}\) are the coefficients to be estimated. The quadratic specification was chosen to detect nonlinearities because it fits the inefficiency patterns presented in Figs. 7 and 8 in the ″Appendix”. These predictions allow us to detect the point of the compensation distribution at which individuals become more (or less) inefficient than others. It is also possible to identify for which compensation these individuals change their behavior, from increasing to reducing the efficiency, or vice versa.

Fig. 3
figure 3

Source: Own elaboration based on SAW data

Quadratic prediction of efficiency using compensation by sex and type of injury: Specifications I and II. Notes: Dotted lines represent 95% confidence intervals (CIs).

Fig. 4
figure 4

Source: Own elaboration based on SAW data

Quadratic prediction of efficiency using compensation by sex and type of injury: Specifications III and IV. Notes: Dotted lines represent 95% confidence intervals (CIs).

Table 6 in the ″Appendix” provides detailed information of Figs. 3 and 4, namely, coefficients of the squared estimations, precise compensations where the maximum and minimum inefficiencies reached (depending on whether the quadratic fitting provided a concave or a convex function), and the cut points between both distributions when available.

When not including sex in our frontier and considering the total of accidents (Fig. 3), inefficiency in men increases as compensation is higher in both specifications (I and II). However, inefficiency in women only increases until compensation reaches values close to EUR 55. From that point, inefficiency starts decreasing until it takes values lower than those of men, specifically when compensation reaches over EUR 70. In cases of hard-to-diagnose accidents (right graphs of Fig. 3), the situation slightly changes. Men only increase their inefficiency when compensation is higher than EUR 35, whereas in women, it starts decreasing from almost the beginning of the compensation distribution (approximately EUR 38 in the Specification I and EUR 39 in the Specification II). Consequently, inefficiency in women is below that in men from compensations close to EUR 60.

When we include sex in our frontier, we also observed different behavioral patterns between men and women (Fig. 4). Combining all accidents together, inefficiency in men presents a continuous growth, which is accelerated as compensation increases. On the contrary, women only increase their inefficiency until the compensation reaches values between EUR 55 and EUR 60. From that point, their inefficiency starts to decrease. If we only consider hard-to-diagnose accidents (right graphs of Fig. 4), men only reduce their inefficiency until the compensation reaches values of EUR 36, and from that point, it starts increasing significantly. After a certain stability in lower compensation levels, we observe a reduction in the inefficiency levels of women at higher compensations.

Robustness analysis

To check the robustness of our results, we repeated the analysis for two concrete groups of injured workers—a specific part of the service sector workers (“Other services” sector specified in Table 1) and the most qualified ones.

The “other services” sector covers almost one million accidents that are more evenly distributed between men and women and between white-collar and blue-collar workers. These activities include workers in public administration, education, health, or domestic services. In general, these are non-male-dominated jobs where the gender bias in occupational accidents is corrected. The aim is to test whether the results obtained in the empirical section are conditioned by an asymmetric distribution of accidents highly concentrated in certain occupations, e.g., men develop most manual jobs.

The estimated coefficients are similar in sign and magnitude to those obtained in the total regression. However, when calculating the error terms, we observe a low contribution of inefficiency for both men and women. Figure 5 shows the results of the efficiency prediction by compensation distribution and workers’ sex for the four analyzed models. In the case of specifications I and II, those injured women at the lowest distribution scale present high inefficiency, corroborating our main results. The observed behavior change in the high levels of the compensation distribution is still plausible—the inefficiency is lower among women who receive compensations above EUR80 per day.

Fig. 5
figure 5

Source: Author’s own based on SAW data

Quadratic prediction of EF in the service sector using compensation by sex. Notes: Dotted lines represent 95% confidence intervals.

Secondly, we conducted another robustness analysis focusing on the most qualified workers. The prevalence of injuries differs across occupations, and the glass ceiling theory is associated with high-responsibility positions. We thus conducted additional estimations removing from our sample low-qualified workers.

As can be seen in Fig. 6, results back the patterns observed in our previous estimations, i.e., when considering the total of accidents and the specific part of the services sector. In specifications I and II, where not including the sex variable as a control, we again observe better behavior among men at low compensation levels and among women when the compensation is above EUR65. When including the sex variable (specifications III & IV), the inefficiency degree is higher among males than females in all cases, especially at high compensation levels. As with the service sector sample, we observe a reduction in the inefficiency of these workers.

Fig. 6
figure 6

Source: Author’s own based on SAW data

Quadratic prediction of EF for skilled workers using compensation by sex. Notes: Dotted lines represent 95% confidence intervals.

This robustness analysis allows us to conclude that the observed gender differences in behavior are not related to a higher concentration of accidents among males or to a higher amount of accidents affecting low-skilled workers.

Policy implications

There exist two areas in which this study may contribute to designing well-targeted policy interventions. On the one hand, our results shed light on the significant issues concerning the public health domain. On the other hand, labor policymakers may also find some of the outcomes obtained insightful.

From the public health perspective, a first outcome that can be useful is the division within sick leave duration between days associated with pure physiological determinants and days derived from behavioral aspects. This enables policymakers to address the question properly. For instance, if in a specific group the authentic problem is the opportunistic behavior that extends the sick leave excessively, it would be a mistake to devote a great proportion of the safety budget to develop group-specific safety-enhancing measures to avoid serious injuries. Vice versa, if the problem relies on the lack of preventive instruments and procedures, implementing a strict monitoring strategy to avoid absenteeism would not be appropriate.

Moreover, in this study, the econometric model estimated allows policymakers and insurance managers to compute what a standard duration (i.e., that duration depending only on physiological factors) should be for a set of explanatory variables. With this, it is possible to identify problematic situations from a moral hazard standpoint at the individual level. Hence, it is straightforward to pinpoint those people that may be behaving more opportunistically and take action to address the issue in an informed manner.

From a gender/sex perspective, our estimates show that there are significant differences between women and men regarding sick leave duration, thus posing important policy questions. As our estimates suggest, if women have worse health status in the workplace, gender-specific measures from a public health point of view should be implemented to tackle this issue.

Second, there also exist distinct behavioral responses between genders. Although a simple comparison between women and men regarding “behavioral” days off within sick leave shows no significant differences, this observation is hiding a twofold divergent pattern when considering the income distribution ladder. Opportunistic behavior is higher at the top of the wage distribution in case of men and for low wage levels in case of women. From a policy standpoint, these results suggest devoting more financial resources to screening activities to check low-income women and high-income men as well.

Our results also enable the implementation of fine-tuning policies, as it is possible to accurately determine some relevant thresholds. For instance, it is feasible to compute the wage level at which the opportunistic behavior in men stops decreasing and starts increasing (i.e., the arg min. value). Moreover, the arg max. value of the wage for women can be calculated (i.e., the wage at which the opportunistic behavior in women stops increasing and starts decreasing). Finally, policymakers and insurance managers can assess which group performs more opportunistic behavior at different wage levels, which, in turn, allows them to develop targeted monitoring policies enhancing efficiency.

From a labor relations point of view, our results put forward a response in the behavior of women as a result of the institutional and social barriers in the labor market. Two theories have been popularized in the last decades regarding the career advancement in women: the sticky floor and the glass ceiling hypotheses.

The sticky floor hypothesis posits that there is a discriminatory employment pattern that retains workers, mainly women, in lower ranks of the job scale, which leads to low wages, low job mobility, and invisible barriers to career advancement. This fact could demotivate low-wage women, preventing them from exerting high effort levels, and this is what we precisely observe in our data. The policy prescription would be to raise expectations relating to job promotions to encourage women and to achieve more work-committed low-wage female employees. The main institutional device to achieve this goal is to foster permanent employment among women. Fixed-term labor contracts are widespread within the Spanish labor market, disproportionately affecting women. This type of hiring is associated with low wages and poor career advancement perspectives. Thus, reducing overall temporary labor contracts, particularly among women, should help solve this problem.

The glass ceiling hypothesis postulates that there exists an unacknowledged barrier to advance in a profession, particularly affecting women and minority groups. It is often considered that the glass ceiling is particularly important in the higher ranks of the job scale. In this case, women might be exerting an additional effort (i.e., carrying out a “presentist behavior”) so as to tear down this discriminatory barrier. In other words, in an attempt to break the glass ceiling, women might be striving too hard, which is not efficient, creating room for policy action to fix this market failure. The gender gap in the behavioral component of the sick leave at the top of the wage distribution that we observe in our estimates seems to point toward that direction, supporting the existence of a significant glass ceiling in the Spanish labor market. The policy prescription would be to implement antidiscrimination measures targeted at workers at the top of the wage distribution.

Conclusions

The gender/sex gap in the duration of sick leave has been traditionally studied by analyzing ex-post biological differences between men and women. As behavioral responses are an important factor that may explain this gap, no prior studies have analyzed their role in explaining these differences. This study fills this gap in the literature by adopting a methodology scarcely used in this field of research, that is, the frontier analysis technique, which allows us to split sick absence days into two types of days—those that are exclusively related to biological characteristics and those that are derived from behavioral reasons.

We found that women present longer standard durations, that is, more days purely attached to physiological characteristics, compared with men, thus showing less opportunistic behaviors. However, when we assumed that there are no biological differences between men and women, behavioral differences were negligible.

Gender differences were more plausible when considering the amount of sick leave benefit. If we assume that men and women recover at the same rate from the same injury (i.e., without including the sex variable in our frontier), we found that women are more inefficient at lower levels of income, whereas in case of men, inefficiency is more pronounced at higher wage levels. However, when considering biological differences (i.e., including the sex variable in our frontier, and interaction terms between this variable and each type of injury), we found that women were more efficient than men across all the income distribution, especially at higher levels of income.

Therefore, we provide evidence that it is essential to consider biological and behavioral characteristics when analyzing the gender/sex gap in the duration of sick leaves as it gives a reasonable image of what these differences really hide.