Diabetes is one of the fastest growing chronic diseases in the United States and globally. It has been shown to be associated with increased healthcare costs, disability, lost productivity and premature mortality.1 , 2 Diabetes is the seventh leading cause of death in the United States and it imposes a substantial direct and indirect cost burden on the U.S economy.3 , 4

In 2010, diabetes was estimated to affect 25.8 million people, or 8.3 % of the US population.3 The prevalence among individuals aged 20 years or older has been increasing, which suggests that diabetes has become an important risk factor for the American workforce.3 , 5 Non-Hispanic Blacks (NHB), Hispanic, American Indians, and some Asian Americans are at particularly high risk for type 2 diabetes and its complications.1 , 3 , 6

Individuals with diabetes have been shown to have substantially higher rates of missed work, disability and probability of being early retired than those without diabetes.4 , 7 , 8 Based on 2000–2004 MEPS data, the combined job absenteeism costs of obesity, morbid obesity and diabetes were estimated to be $3.9 billion annually.9 Costs associated with diabetes are estimated to have risen 41 % from $174 Billion ($116 billion direct cost and $58 billion indirect cost) in 20078 to $245 Billion ($176 billion direct cost and $69 billion indirect cost) in 2012.4 Loss of productivity has been shown to be increasing over time and has become a major concern to our nation’s health system and for health policy.4 , 10

Previous studies have examined the association of diabetes with number and cost of missed workdays, work participation and productivity losses.4 , 7 , 9 13 There is a wide range of variation in the statistical models used to estimate incremental and aggregate cost of missed workdays for individuals with diabetes.4 , 7 , 9 , 10 , 13 The American Diabetes Association in 2012 used ordinary least square (OLS) regression with the 2009–2011 NHIS data, and showed that the annual cost of missed workdays per person with diabetes was $149 and the aggregate cost of missed workdays was $5 billion.4 Another7 used Health and Retirement Study data and a two-part model with OLS regression to examine the association of diabetes with the natural logarithm of income, and found that the average incremental lost income of sick days per person with diabetes was $630 and aggregate incremental loss income of sick days was $1.5 billion for 1992–2000.

The statistical model used to evaluate the incremental cost of a disease can have a major impact on the findings, particularly in cost variables with excess zero values and a skewed distribution.14 Number of missed workdays and cost data are typically right-skewed because a relatively small proportion of patients miss extremely high number of workdays and incur high costs. Estimates from the linear regression models are known to lead to biased and unstable estimates due to the skewness and kurtosis of the data distribution and heteroscedasticity.13 For such problems, prior studies of diabetes and associations with work activities modeled the second part of the two-part model using OLS regression of log-transformed workdays or costs.7 , 9 Similarly, previous studies used a two-part model proposed by Duan based on homoskedastic, normally distributed errors and a nonparametric approach, which provides estimates in the original scale after some retransforming.14 However, this retransformation from log scale to the original scale leads to a potential underestimation or overestimation of the number of missed workdays and cost of missed workdays.14

To avoid such problems, we used a generalized linear model (GLM) model with log-link and gamma distribution in the second part of the two-part model. This study contributes to the literature on diabetes by estimating the incremental cost of missed workdays using this novel GLM approach.

METHODS

Sample

We analyzed the responses of 14,429 employed individuals age ≥ 18 years, representing 150,378,648 individuals in 2011 consolidated data (HC-147) from the Medical Expenditure Panel Survey (MEPS). MEPS is an ongoing national household survey for the civilian non-institutionalized U.S. population.15 The data are collected through in-person interviews and include information on the respondents’ health status, demographic and socio-economic characteristics, employment, missed workdays, access to care and satisfaction with healthcare. The survey collects comprehensive data on healthcare utilization and expenditure and has a complex survey design, which includes multistage sampling, clustering and stratification with oversampling of minorities.13

Measures

All variables used for analysis were based on self-report:

Diabetes

Diabetes indicates a yes response to the question “Have you ever been diagnosed with diabetes?” MEPS data does not distinguish between type I and type 2 diabetes. However, MEPS 2011 data showed that 79 % of the respondents reported treating their diabetes with diet and oral medications, which suggests that most respondents have type 2 diabetes.

Missed Workdays

Missed workdays represent the number of times the person lost a half-day or more from work in 2011 because of illness, injury, mental or emotional problems during the study period. Because half-days and full-days lost were not distinguished in MEPS, all days lost were recorded as full-days lost, which is consistent with previous studies that used MEPS data.9 , 16 , 17 A response of “no workdays lost” was coded as zero.

Cost of Missed Workdays

Annual cost of missed workdays was based on multiplying the daily wage of each respondent by the number of missed workdays. The daily wage rate was calculated by multiplying the non self-employed hourly wage rate by mean hours worked per day. Self-employed individuals were excluded from the analysis because hourly wage was not asked of the self-employed.15 MEPS only collected data for usual hours worked per week, not hours worked per day. The mean hours worked per day was calculated by dividing usual hours worked per week by 5 days, similar to a previous MEPS study.17 To obtain the mean daily wage, we multiplied the mean hours worked per day by the hourly wage. To estimate the annual cost of missed workdays per individual, we multiplied mean daily wage by the annual missed workdays of the individual.

Comorbidities

Binary indicators of comorbidities were based on a positive response to the question “Have you ever been diagnosed with major depression, hypertension, coronary heart disease, angina, myocardial infarction, other heart diseases, stroke, emphysema, high cholesterol, joint pain, arthritis and asthma?” Self-reported comorbidities were used in order to control for confounding variables and examine their individual incremental effects on the number and cost of missed workdays in the fully adjusted regression. Previous studies showed that the binary indicators of disease are more effective in accounting for disease burden.18 , 19

Demographic and Socioeconomic Characteristics

Race/Ethnicity

MEPS defines race/ethnic groups as: White, Black, American Indian/Alaska Natives, Asian, Natives Hawaiian/Pacific Islander, Multiple race and Hispanic or not. We re-coded it into four groups: non-Hispanic White (NHW), non-Hispanic Black (NHB), Hispanic and other.

Education

Educational attainment was coded into four groups: less than high school (≤ grade 11), high school (grade 12), college (grade 13–16) and graduate school (≥ grade 17).

Marital Status

Marital status was coded into three groups: married, not married (widowed/divorced/separated) and never married.

Gender

Gender was coded with male as the reference group.

Age

Age was coded into four age groups: 18–24, 25–44, 45–64 and ≥ 65 years.

Census Region

Census region was coded as: Northeast, Midwest, South and West.

Metropolitan Statistical Area (MSA)

Metropolitan Statistical Area (MSA) was based on living in an MSA as of December 2011.

Health Insurance Status

Health insurance was coded into three categories: private, public only and uninsured.

Analyses

We estimated a two-part GLM allowing20 for mixed discrete-continuous variables. In the first part of the two-part model, a probit model is estimated for the probability of observing a zero versus positive value for number and cost of missed workdays. Then, conditional on a positive value, the second part of the model was estimated with a GLM, gamma distribution and log link for the positive values.14 To determine the family distribution for the GLM, we used the modified Park test.14 , 20 The results of the modified Park test verified that the use of a gamma distribution with a log link was the best–fitting GLM to get consistent estimation of coefficients and incremental effects.16 The gamma model is used for data situations in which the responses take only values greater than or equal to 0.21 Cost data are typically right-skewed because a relatively small proportion of patents incur extremely high cost, and the GLM with log link and gamma variance function takes this problem into account.13 Since the dependent variables in our study were characterized by a high concentration of observations with zeros, we used the two-part GLM to improve the precision of the estimates.16 Recently, regression-based methods such as the two-part model, and GLM with gamma distribution and log link, have emerged as a standard for estimating the incremental cost attributable to disease or risk factors.14 , 16 We then generated number and cost of missed workday variables to estimate unadjusted and adjusted means.

In order to generalize the study findings to the U.S population, the complex sampling design of MEPS data set was taken into account by using the sampling weights, variance estimation stratum and primary sampling unit (clustering). The weighted two-part model was used to estimate the association of number and cost of missed workdays with diabetes, adjusting for demographic factors and comorbidities, and to estimate the total burden of diabetes associated with missed workdays and cost on the US population. All statistical analysis was conducted using STATA 13.

RESULTS

Table 1 shows the characteristics for employed U.S. adults. Of the employed adult sample, 6.9 % reported having diabetes. The prevalence of diabetes was found to increase with age and presence of comorbidities, and varied by demographic and economic factors. The prevalence of diabetes tended to be higher for people in the middle and older age groups, NHBs, males and those who were married. Those with diabetes were more likely to be uninsured, to have lower income and more likely to have comorbid conditions.

Table 1 Sample Demographics by Diabetes Status Among Employed Adults, United States 2011

Table 2 shows the unadjusted mean and cost of annual missed workdays for individuals with and without diabetes. The unadjusted annual mean number of missed workdays per person was found to be 1.9 (95 % CI 1.3–2.6) for individuals with diabetes and 1.0 (95 % CI 0.87–1.1) for individuals without diabetes. The difference of annual mean number of missed workdays per person between the diabetes and no diabetes groups was about 0.90 days. The unadjusted annual mean cost of missed workdays per person per year was $277 (95 % CI $177–$378) for the diabetes group and $160 (95 % CI $130–$189) for those without diabetes. The difference in annual mean cost of missed workdays per person per year between the diabetes and no diabetes groups was about $117.

Table 2 Means and Proportions of Missed Workdays and Cost of Missed Workdays by Diabetes Status, United States 2011

Table 3 shows the incremental effect from the probit and GLM equations of the two-part model for the association between the number of missed workdays and diabetes after adjustment for covariates. After adjusting for other factors, having diabetes was associated with 0.84 (95 % CI 0.10–1.57) more missed workdays compared to similar individuals without diabetes. Women missed more workdays than men. Compared to NHW, NHBs missed less workdays. Compared to those with less than a high school education, individuals with a college degree missed less workdays. Compared to those privately insured, uninsured workers missed less workdays. Compared to those with no comorbidities, workers that had myocardial infarction, joint pain, arthritis and asthma missed more workdays..

Table 3 Two-Part Regression Model: Incremental Effect of Missed Workdays by Diabetes Status Accounting for Missed Workdays, United States 2011

Table 4 shows the incremental effect from both the probit and GLM version of the two-part model for the association of cost of missed workdays with diabetes and covariates. After adjusting for other factors, those with diabetes had $120 (95 % CI $30.7–$209.1) higher cost of missed workdays. Women had $48 higher cost of missed workdays. NHB had $59 lower cost of missed workdays compared to NHW. Compared to those with private insurance, uninsured workers had $98 lower cost of missed workdays. Compared to those with less than $25,000, cost of missed workdays was $56 higher in those who earned $25,000–$49,999, $146 higher in those who earned $50,000–$74,999, and $201 higher in those who earned $75,000–$296,955. Among those with chronic conditions, cost of missed workdays was $110 higher in those with asthma, $107 higher in those with arthritis, $96 lower in those with emphysema, $80 higher in those with joint pain, but $47 lower in those with high cholesterol. Based on the nationally representative employed diabetes population (9,774,612) in 2011, the unadjusted aggregate cost of missed workdays for the employed population in U.S with diabetes was estimated to be approximately $2.7 billion and the adjusted incremental burden was estimated to be $1.1 billion.

Table 4 Two-Part Regression Model: Incremental Effect of Cost of Missed Workdays by Diabetes Status Accounting for Cost of Missed Workdays, United States 2011

Appendices Tables 1 and 2 (available online) show the probability of non-zero number and cost of missed workdays, and the number and cost of missed workdays, based on fully adjusted estimates from the first and second parts of the two-part model, respectively.

DISCUSSION

Using novel methodology, this analysis provides updated information on the relationship between diabetes and missed workdays for the US population after controlling for important confounding factors. We found that 6.9 % had diabetes, which is consistent with national estimates of 7 % for diabetes in the US in 2012.4 Our estimate of missed workdays is consistent with the low range of previous studies reported in a recent systematic review,8 which showed that those with diabetes had 0.90–5.7 higher missed workdays than those without diabetes. Other studies reported more missing workdays for females.8 , 9 , 17 , 22 and conflicting evidence for minorities.

One study found NHBs had lower income, so that each day missed from work imposed a larger financial burden on the household.23 Others reported NHBs missing more workdays due to occupational illness or injury and fewer workdays due to influenza.17 , 24

Compared to those who were privately insured, we found uninsured workers had less missed workdays. There are higher out-of-pocket costs for uninsured employees to seek healthcare, and therefore missing work when ill is more of a financial burden for the uninsured. There is mixed evidence from the literature, with one study showing that having sick pay benefit was significantly associated with missed workdays.17 , 22 Another study reported that having health insurance was significantly associated with a decreased likelihood of missed workdays, but access to care was significantly associated with a greater number of missed workdays, and healthcare use was significantly associated with an increased likelihood of and greater number of missed workdays.22 Studies have noted in their limitations being unable to test if healthcare use and a chronic condition is endogenous with missed workdays.22 , 25 In our study, healthcare use is not incorporated within the study model, and future research needs to confirm the relationship among access to care, healthcare use and missed workdays to better understand the association of health insurance and missed workdays.

Our study estimated the cost burden of missed workdays for individuals with diabetes to be approximately $2.7 billion, which is lower than a previous study that estimated the absenteeism cost for employed individuals with diabetes in the year 2012 to be about $5 billion.4 However, the previous study was based on a number of different databases and included lost productivity and disability as well as missed workdays.4 The prevalence of diabetes is increasing in the US population.5 , 9 , 26 The economic burden associated with diabetes is likely to increase as diabetes becomes more prevalent.5 With rising prevalence of diabetes in the US, cost of missed workdays will continue to rise, and quantifying the incremental and aggregate cost of missed workdays is believed to be important to employers, to estimate potential cost savings of minimizing job absenteeism, along with interventions to reduce diabetes. Unless concerted efforts are undertaken, the economic burden of diabetes will continue to rise by contributing to work loss through absenteeism and health-related work limitations in the workplace.5 In addition, our findings suggest that disparities of missed workdays based on gender will assist policy makers to consider further efforts to improve women’s health, thereby reducing the economic burden to women associated with missed workdays due to injury/illness.

Key strategies to reduce missed workdays in people with diabetes include improving the clinical care of diabetes, increasing medication adherence, reducing complications and treating comorbid mental illnesses such as depression. Namely, adding laypersons to the primary care team of individuals with diabetes improved having a retinal examination, microalbuminuria testing, and pneumonia vaccination.27 Likewise, a pharmacist-provided medication therapy management intervention reduced A1c levels on average by 0.27 from their baseline values.28

Our study has a number of limitations. First, diabetes and comorbidity conditions were based on self-report, which was not verified clinically and may underestimate the findings. However, previous studies acknowledged the reliability of self-reported diabetes as a measure of diagnosis to provide unbiased estimates.12 , 29 , 30 Second, the cost of missed workdays estimates are for those individuals with diabetes and exclude the cost of missed workdays associated with family members who missed work to care for family members with diabetes. Hence, the associated cost of diabetes due to missed workdays may underestimate the true national cost estimates. Third, missed workdays and cost of missed workdays for the coexistence of diabetes and any comorbidity may differ from diabetes alone. It may be those comorbidity risk factors associated with uncontrolled diabetes, rather than diabetes itself, which are contributing to lost worked days. Fourth, the analysis was based on cross-sectional studies that are not dynamic. Therefore, longitudinal studies need to be conducted for further research that captures the changes over time. Fifth, we did not test for endogeniety and potential mediation in our model to validate the association between health insurance coverage with missed workdays and race/ethnicity with missed workdays, respectively.22

Despite these limitations, this study has important implications. Several previous studies assessed different areas of research of the association between productivity loss and socio-demographic characteristics in individuals with diabetes.4 , 5 , 8 , 9 , 13 However, we employed a novel two-part model to update previous MEPS estimates while accounting for the likelihood of having any missed workdays and the skewed distribution of cost of missed workdays. The aggregate cost of missed workdays was estimated be about $2.7 billion in adults with diabetes in the U.S population. Future research is needed to identify the best interventions to reduce missed workdays in employed individuals with diabetes and determine whether workplace interventions have a role.