Introduction

Health expenditure in the United States (U.S.) stands at approximately 15 per cent of GDP, and the expenditure growth rate has accelerated from a low of 1.9 per cent in 1995 to a peak of 9.3 per cent between 2001 and 2002.Footnote 1 However, there is significant heterogeneity in the patterns of expenditure growth; for instance, although pharmaceutical expenditure comprised approximately 10 per cent of total national health spending in 2004, this expenditure component contributed 14.7 per cent of total health care spending growth from 1994 to 2004.Footnote 2 While it has been established that insurance coverage is an important determinant of health expenditure growth,Footnote 3 whether the effects of insurance are heterogeneous across different types of health care is an empirical question that remains unanswered. If insurance induces consumers to substitute certain forms of care (e.g., inpatient care) for alternative treatments (e.g., pharmaceuticals), the impact of health insurance may vary by the type of health expenditure. There may also be heterogeneous effects associated with other demographic and socio-economic variables. For example, if age plays a role in explaining the heterogeneity of health and pharmaceutical expenditure, then while inpatient expenditure may vary significantly with proximity to death, pharmaceutical expenditure might not. Because of this possible heterogeneity, it is unclear whether previous empirical results hold if several forms of expenditure are jointly determined, suggesting that an alternative course of action, which captures possible correlations in the error terms of different equations, should be considered.

Insurance coverage may not be independent of health expenditure given that the decision to obtain insurance is influenced by the individual's expectations of how much medical care he needs to consume and given that insurance coverage may affect the prices of health care.Footnote 4 In other words, the insurance decision is potentially endogenous. Although researchers have tested for the endogeneity of insurance coverage in the context of drug expenditure,Footnote 5 the issue is still unresolved. Furthermore, we frequently observe health expenditure for those individuals who need health care and choose to purchase it at the point of need, but failing to correct for this effect may lead to biased estimates.Footnote 6

An analysis of various health expenditure components is timely as the current U.S. administration is pursuing a number of reforms to reduce health expenditure. Proposals include changing how states run the Medicaid programme, extending private insurance coverage through increased tax incentives, allowing insurance companies to operate across state lines, permitting small businesses to pool and purchase health insurance, and increasing Medicare premiums for high-income recipients.Footnote 7 Whether these proposals will offer much relief from health expenditure growth without hurting the most vulnerable populations is an open question. For example, if expenditure increases are driven by private insurance rather than public insurance, then proposals to increase private insurance coverage through tax credits could have the opposite effect of that intended. Moreover, targeting premium increases and coverage reductions at vulnerable populations (e.g., the elderly or the poor) could be harmful if there are barriers in access to care for these groups or if utilization is primarily driven by need rather than incentive inefficiencies like moral hazard. A better understanding of how public and private insurance influences expenditure and how various demographic, socio-economic, and health variables contribute to health expenditure would aid policymakers in determining the likely impact and success of specific proposals.

The main objective of this paper is thus to examine the factors that determine health expenditure, mainly insurance coverage and other socio-economic, demographic, and health variables, and to determine if these factors have differential effects on the main components of health expenditure, notably inpatient, outpatient, pharmaceutical, emergency care, and office visits. The paper contributes to the existing literature by employing an alternative methodology (multivariate regression analysis) to account for the heterogeneity of health expenditure and by further controlling for the endogeneity of insurance and sample selection. It draws upon the 1996–2003 Medical Expenditure Panel Survey, which contains a rich amount of private and public insurance expenditure information. Based on the main results from the analysis, the paper also seeks to contribute to the current policy debate in the U.S.

The following section briefly outlines the institutional background of health insurance coverage in the U.S. and offers a general overview of the empirical evidence relating to insurance and other predictors of health expenditure. The next section details the econometric model and discusses the endogeneity and sample selection corrections along with the intuition behind the multivariate regression. The following section discusses the data and presents the descriptive results, whereas the penultimate section reports the results of the econometric analysis. Finally, the last section concludes by discussing the policy implications.

Institutional background and evidence

Insurance coverage in the U.S. is fragmented with individuals receiving coverage from various private and public third-party payers and 16 per cent of the population having no insurance coverage.Footnote 8 Of the public third-party payers, Medicare mainly covers the elderly and certain other groups.Footnote 9 Hospital coverage is automatically extended to all beneficiaries, while outpatient and physician services and some medical services are available under voluntary Medicare coverage. Until 2006 most outpatient prescription drugs were not covered by the programme. The state-run Medicaid programme is intended to cover low-income and disabled Americans and their families, although each state establishes its own eligibility requirements.Footnote 10 States have considerable freedom in determining eligibility for coverage, although there are certain groups that states must cover in order to receive Federal funds.

Private insurance coverage is mainly in the form of employer-sponsored insurance, as there are tax incentives for employers that offer health insurance.Footnote 11 Virtually all covered workers in employer-based plans (98 per cent) had a prescription drug benefit in 2005. While approximately 60 per cent of the population had employer-sponsored insurance in 2004, only around 9 per cent of the population had non-group (individual) insurance,Footnote 12 which is an alternative form of coverage for those without access to employer-sponsored coverage. Only 80 per cent of enrollees with non-group health insurance had prescription drug coverage in 2000.Footnote 13

The relationship between health insurance and expenditure

A significant body of literature has researched the effect of insurance on medical care expenditure. The RAND experiment was a key study, and a number of studiesFootnote 14 have established that more generous insurance led to increased health care expenditure among the non-elderly using this data. Using alternative datasets, other researchersFootnote 15 have found that insurance coverage led to higher medical expenditure in the elderly. A number of other covariates have been found to increase health expenditure, including having higher income,Footnote 16 more education,Footnote 17 and being in poor health,Footnote 18 although being female led to lower health expenditure.Footnote 19

Some research has focused exclusively on the determinants of pharmaceutical expenditure, and the general finding is that insurance coverage increases total pharmaceutical spending. One finding is that elderly persons with private drug coverage had higher drug expenditure than those only covered by Medicare.Footnote 20 In addition, the type of insurance coverage was a significant predictor of expenditure: conditional on having at least one prescription, elderly Medicaid recipients had lower drug expenditure than individuals with no supplemental coverage.Footnote 21

In addition, researchers have found that elderly individuals with more limitations to activities of daily living (ADLs) and persons diagnosed with angina, congestive heart failure, hypertension, diabetes, or arthritis had higher prescription drug expenditure.Footnote 22 Furthermore, the more individuals in a household who claimed to be in poor health, the higher the pharmaceutical spending for that household.Footnote 23 Few other demographic variables have been found to impact expenditure, although expenditure was higher for elderly persons living in urban areas.Footnote 24 Other research has established that adult women spent twice as much on prescription medicines as menFootnote 25 and increases in the household size led to lower household prescription drug expenditure.Footnote 26 The general findings from the literature regarding pharmaceutical expenditure are reported in the Appendix.

The model

There are several important theoretical and empirical considerations related to the determinants of pharmaceutical and other health-care expenditure, which include the endogeneity of the insurance decision, the unobserved consumption for individuals who do not consume any medical care in a given year, and the inter-dependency of different sources of expenditure. The methodological approach proposed in this study relies on three main corrections. First, we correct for the endogeneity of insurance coverage by estimating a multivariate insurance decision, separated into the possible insurance coverage decisions within the U.S. healthcare context: Medicare only, other public insurance (includes individuals with Medicare and another form of public coverage, Medicaid, TRICARE, VA health insurance, or some other form of public coverage), private insurance (includes individuals with Medicare and private coverage or private coverage only), and no insurance. We then insert the estimated predictions into the expenditure equations after bootstrapping the standard errors. The rationale behind the correction is that because public insurance is intended for the most vulnerable populations, those with private insurance tend to be healthier, to have higher incomes and better education, and to be in full-time employment.Footnote 27 More specifically, insurance coverage may influence the consumption of medical care, but the amount of medical care that an individual expects to consume may influence his choice of insurance coverage. Moreover, we anticipate that individuals with private insurance would be less in need of medical care than those with public insurance, and indications of higher consumption among those with private insurance may be more reflective of moral hazard than need.

Second, given that health expenditure is significantly skewed due to the infrequency of the insurance purchase, the probability of any expenditure is estimated separately (the “selection equation”), and the sample selection correction (Mills lambda) is inserted into the expenditure equation to correct for selection bias.Footnote 28 This requires an identification variable that explains the generation of expenditure but is not related to the participation equation. The rationale behind the sample selection correction is that the individual makes marginal cost–marginal benefit comparisons when faced with the decision on whether to seek medical care. Because of the inability to observe whether a value of zero for expenditure is due to the individual never needing medical care (not a true zero) or the individual being prescribed treatment and choosing not to purchase it (a true zero), sample selection techniques are employed.

An alternative to the Heckman methodology is the two-part model,Footnote 29 which was created to model the demand for health care and is motivated by the conditional mean independence assumption:

There has been extensive debate regarding the choice between the two-part model and the sample selection model.Footnote 30 One argument is that two-part models are more appropriate for sequential decisions, although MaddalaFootnote 31 argues that even when decisions are sequential, the decisions will be correlated if there are omitted variables common to both. Leung and YuFootnote 32 used Monte Carlo simulations to compare these two differing methods and determined that the choice of model depends on the empirical context. Specifically, the sample selection model performs poorly when there is collinearity, which can arise in certain contexts: when there is a large degree of censoring, when there are few exclusion restrictions, when there is little variability between regressors, or when there are weak instruments.Footnote 33 In the absence of collinearity, which can be verified using the condition number, the t-test on the inverse Mills ratio is an indicator of which of the two specifications is more appropriate.

Third, because we estimate the determinants of different expenditure sources simultaneously, the expenditure equation is a multivariate regression model rather than a standard ordinary least squares (OLS) model. Multivariate regression models are a more general form of an empirical equation that allow for more than one endogenous variable. The advantage is that the several sources of expenditure can be disaggregated, permitting us to jointly estimate the model and control for possible unobserved heterogeneity. These models are typically implemented when there are two or more strongly correlated (paired) dependent variables. This is the case with health care decision-making where, for example, an individual's drug consumption depends on previously obtaining a prescription from the physician or hospital.

The insurance equation

We first proceed to estimate the determinants of insurance coverage using a multinomial logistic model where the individual chooses between Medicare only, other public insurance, private insurance, or no insurance. Let I ik represent a vector of k=4 insurance alternatives (I i MO, I i OP, I i PR, I i NI), where MO=Medicare only, OP=other public, PR=private, and NI=no insurance. Let w i denote a set of insurance determinants where it is possible to obtain the probability of each outcome or insurance option conditioned on a set of covariates p(I i MO, I i OP, I i PR, I i NI/w i ) such that the sum adds to unity. Accordingly:

Following Wooldridge,Footnote 34 the fitted probabilities can be used for prediction purposes .

The participation equation

After estimating the insurance decision, the next step is to correct for possible sample selection bias. We designate health expenditure by y i *=x i β+μ i , where j=1,…,5 refers to each type of expenditure measured as a latent variable (y*) and i represents different individuals across the 8 years of the sample. The observed probability of non-negative health expenditure is:

where Z i are vectors of exogenous variables measuring the probability of any expenditure, ɛ i refers to the normally distributed error terms with zero mean, and E(μ i /ɛ i )≠0. Hence, we estimate the Mills lambda (), which is used to correct for possible sample selection in the expenditure equation via the traditional two-step procedure.Footnote 35 Finally, we define y i =d i *y i *, which captures the observed health expenditure.

Expenditure equation and multivariate regression

Health care expenditure is affected by the presence of significant unobserved heterogeneity that results from the fact that expenditure on one source (e.g., drugs) is jointly formed with expenditure from another source (e.g., medical visits). Based on this observation, one possibility is to estimate a model in which multiple response variables are taken as dependent variables. The multivariate regression estimates the same coefficients and standard errors as that of separate OLS models, and given the joint estimation of the relevant equation, the multivariate regression also estimates the between-equation covariances, allowing us to test coefficients across equations. As a result, the only difference from the OLS regression is that the covariance matrix resulting from the dependent variable is a matrix of variables and not just a single vector. Multivariate regression models can be envisaged as a reduced form of a simultaneous equation model.Footnote 36 Define an n-dimensional (n=5) vector of expenditure is source variables y ni by:

where x i is an n*k-dimensional vector of expenditure determinants for each individual, λ is the n-dimensional sample selection correction, and μ i is an n-dimensional random term made up of unknown parameters N(0,Σ).

The data and descriptive statistics

The data

The study employs data from the 1996–2003 Medical Expenditure Panel Survey (MEPS), a study co-sponsored by the Agency for Healthcare Research and Quality (AHRQ) and the National Center for Health Statistics (NCHS). We also gathered data from the 2006 Statistical Abstract of the United States to create certain instrumental variables (IVs). The MEPS survey contains a nationally representative sample of the U.S. civilian, non-institutionalized population with over-sampling of Hispanics and blacks.Footnote 37 Each year, half of the households are replaced by a new sample of households, creating a longitudinal database. The dataset contains a number of socio-economic and demographic variables along with information on health status, expenditure on various categories of medical care, insurance coverage, and access to care.

Although MEPS contains household data, the data is also available at the individual level. As coverage for medical care is sometimes individual rather than group, which is the case with Medicare, Medigap, and other types of private insurance where the policyholder does not extend coverage to dependents, the unit of analysis in this research is the individual. The health expenditure variables were calculated by combining self-reported data on health care use with cost information reported by health care providers. Specifically, respondents were first asked to identify all medical visits and prescription drug purchases along with the medical practitioners associated with this care and the pharmacies where prescriptions were obtained. Then, the survey administrators contacted each relevant medical practitioner and pharmacy to obtain payment information.

As the panel was relatively short at only two years, we chose the most recent observation for each individual in the sample and pooled the cross-sectional data from 1996–2003. We also excluded respondents under the age of 19 years since a child's guardian typically chooses his health insurance coverage and because children are usually covered under a parent's health insurance policy or through public insurance. After excluding observations with missing values and individuals under the age of 19 years, our sample consisted of N=91,039 persons.

The MEPS database has a number of advantages over other U.S. databases that contain health care information, specifically the Panel Study of Income Dynamics (PSID), the National Ambulatory Medical Care Survey (NAMCS), and the Medicare Current Beneficiary Survey (MCBS). The PSID obtains information on medical expenditure by asking the respondent to recall his medical outlays during the past two years, which may lead to rough estimates of medical expenditure that are subject to considerable biases. Another shortcoming of the PSID survey is that expenditure is not separated into different components. Meanwhile, the NAMCS lacks information on prescription drug expenditure along with expenditure on other forms of health care and socio-economic variables. Finally, the MCBS only consists of individuals covered under the Medicare programme, mainly the elderly and some of the disabled. As disease patterns often commence before the start of Medicare eligibility, and this onset of chronic illness has an important impact on health expenditure, we were interested in including the non-elderly in our sample.

Explanatory variables

The impact of insurance on health expenditure can occur via several pathways. First, there is the moral hazard effect of insurance where individuals who face a lower marginal cost of care under insurance consume more health care than if they faced the full price of care.Footnote 38 Second, patients may have little incentive to consume lower-priced treatments as they do not face the full costs of these treatments; however, third-party payers have mechanisms such as gatekeeping, prior authorization, and tiered co-payments to partially counteract this effect. Third, insurance might influence the agency relationship,Footnote 39 where physicians and hospitals may have limited incentives to curtail unnecessary medical care. Accordingly, insurance companies have implemented managed care programmes that often pay physicians a capitated amount per patient rather than a fee-for-service amount per good provided or service rendered, or maintain fee-for-service reimbursement, subject to a volume-based adjustment of the relevant fees. Fourth, insurance influences the price of care; specifically, insurance companies, Pharmacy Benefit Managers, and public programmes tend to exert significant negotiating power on providers, and the cost of medical care is lower for third-party payers than for uninsured patients.Footnote 40 Although we would expect a positive effect of insurance on the probability of obtaining medical care, the overall effect of insurance on expenditure is unclear as it depends on whether the volume effect outweighs the price effect of insurance.

On the basis of the existing literature, income is expected to increase the probability of private insurance and negatively influence the probability of non-insurance, although income should have a negative effect on public insurance (excluding individuals with Medicare only). Age and gender are likely to determine the type of expenditure and insurance because of time horizon effects and age- and gender-related risk attitudes. We also expect poor health to increase the probability of insurance and health-care use.

Table 1 provides aggregate evidence suggesting three specific features. First, across all age groups and all types of insurance, those who have insurance coverage spend more on average than those who are uninsured. Second, expenditure appears to rise with age, although the effect is combined with that of insurance coverage. Finally, when we restrict our sample to those who have consumed any health care, expenditure is higher than for the consumption group.

Table 1 Health and pharmaceutical expenditures by age (1996–2003)

The dependent variables are the different types of expenditure, including prescription drugs, outpatient care, inpatient stays, emergency care, and office-based visits (all measured in logs). We find that the two largest components of health expenditure are drugs and office-based visits, which is expected as both are often consumed together. The independent variables were chosen according to their theoretical soundness, although we excluded other variables due to significant multicollinearity following the standard tests (condition index and variance inflation factor). (More information regarding the specific variables chosen for the model is available in the Appendix.)

The model also includes gender, which explains expenditure jointly with age effects. Besides demographic variables, we have included socio-economic variables, such as marital status, that determine health production and could be associated with the costs of health information. We also consider income per person to determine the individual's budget constraint for health and other goods. Income was defined in terms of poverty categories as employed by Medicaid. We also controlled for health status as insurance companies might engage in cherry picking and upwardly adjust the premium for those who are more likely to be sick, while poorer health status is related to higher expenditure. Another covariate is insurance which is split into Medicare only, other public coverage, private insurance, and none. Finally, the model includes time controls to capture any inter-temporal variability.

Preliminary evidence

Figure 1 exhibits health expenditure patterns by age groups. Consistent with Zweifel et al.,Footnote 41 inpatient expenditure increases steeply after the age of 60 years, suggesting that proximity to death matters; however, other forms of expenditure increase more consistently with age.

Figure 1
figure 1

Total expenditure by age (1996–2003) in 1996 U.S.$

Figure 2 reveals that despite some changes in the distribution of insurance, the share of insurance coverage from 1996 to 2003 has not significantly changed over the period. However, Figure 3 suggests that both public and private insurance expenditure have increased significantly, implying that expenditure increases are related more to factors such as moral hazard than to an expansion of insurance coverage.

Figure 2
figure 2

Per cent of U.S. individuals by source of cover. Note: individuals can have more than one type of coverage such that the percentages add up to more than 100. Source: U.S. Department of Commerce, 2006.

Figure 3
figure 3

Total value of different sources of insurance coverage in 1996 U.S.$ Source: U.S. Department of Commerce, 2006.

Results

In discussing the determinants of insurance coverage, drug consumption, and health expenditure, our focus is on the performance of the model and the factors which contribute to the various types of health expenditure. The first stage of the model predicts insurance coverage. Because we hypothesized that insurance was endogenous to the expenditure model, we employed the SmithFootnote 42 test for the weak exogeneity of insurance, which accounts for the censored nature of the data, and we rejected the hypothesis of exogeneity for each expenditure source. We also tested the exogeneity of insurance for each type of expenditure using the Hausman test, and again, we rejected the null hypothesis that insurance was exogenous. Therefore, we proceeded with a two-stage strategy by including the predictions from the insurance equation in the expenditure equation with adequate identification variables and bootstrapped standard errors. When making statistical inferences from finite sample sizes where there is an asymptotic distribution, bootstrapping is used to improve the reliability of standard error estimates.Footnote 43 We employed no more than 1,000 replications. Also, given the significant share of zeros in the data, we corrected for sample selection by restricting the sample to individuals with positive health expenditure and including the inverse Mills ratios from the predicted probability of positive expenditure in the main expenditure equations.

One of the computational difficulties in our empirical strategy is the selection of adequate instruments. Instrumental variable (IV) models overcome measurement error of several types, including simultaneity and omitted variables, and produce asymptotically unbiased parameter estimates by using an instrument that is not correlated with the measurement or equation error but correlated with the correctly measured variable.Footnote 44 Importantly, the instruments employed in this study are measured at the regional level which could produce a clustering problem and biased standard errors.Footnote 45 To correct for this possibility, the standard errors in both the insurance and probability of expenditure equations were made robust using the WhiteFootnote 46 correction. However, no attempts were made to draw any inferences from the coefficients on the regional instruments as these variables were used exclusively for instrumentation purposes.

The insurance equation contains three instruments, which are highly correlated with the insurance decision, are conceptually relevant, and exhibit a negligible correlation with the expenditure generation process. The first instrument is the percentage of firms in a region that offer health insurance to their employees, which is a measure of price competition in the area. A second instrument is the regional health insurance contribution, namely the average regional family premium per enrolled employee in private companies. This variable likely influences the uptake of different insurance types in the region. The third instrument is the regional age-adjusted mortality rate as there is no evidence of correlation between this instrument and health expenditure, an observation that is consistent with previous work.Footnote 47 We would expect health insurers to regionally adjust premiums and cost-sharing requirements according to expected mortality risks. We obtain a log pseudolikelihood value of −71,432 when we include the instruments in the insurance regression, and the correlation between these instruments and health expenditure was insignificant.

In terms of the sample selection equation, two instruments are employed. The first instrument is the regional concentration of doctors, which may be a supply-side factor that influences an individual's contact with the health care system. The second instrument is total regional health expenditure, which may be reflective of access to care within the region. The log pseudolikelihood value is −43,255, and again, the correlation between these instruments and health expenditure was insignificant.

The determinants of insurance coverage

Table 2 presents the results from the insurance equations. The dependent variable was categorized as Medicare only, other public insurance, private insurance, and no insurance, and the base outcome in Equation (1) was: “does not have insurance”. To avoid multicollinearity problems, we checked the condition index of the model. Overall, the diagnostic indicators suggest an acceptable goodness of the fit (pseudo-R2 of 0.231). Age has the same effect on all types of insurance coverage, indicating that compared with being uninsured, the probability of having Medicare only, other public, or private insurance increases with age. Males are less likely to be publicly or privately insured, possibly due to being less risk averse than women.Footnote 48 As expected, race/ethnicity is an important predictor of insurance: blacks are more likely to obtain other public insurance than whites, while blacks and Hispanics are less likely to have private insurance than whites. We categorized the income variable by the Federal poverty level ($7,955 for one individual in 1996), although education was excluded due to its multicollinearity with income. As expected, higher income individuals are less likely to have public insurance (excluding Medicare only and Medicare and private insurance) and more likely to have private insurance or Medicare only.

Table 2 Probability of insurance coverage (multinomial logit)

We find that poor health status is positively associated with public insurance and negatively associated with private insurance,Footnote 49 possibly indicating the presence of risk selection. However, individuals with at least one ADL are more likely to have any form of insurance. Living in an urban area is associated with a higher probability of having Medicare only or private insurance, potentially due to a greater density of insurers and providers in urban areas. Finally, the year controls, which are expected to capture time-specific changes in insurance conditions, are significant.

Effects on the probability of any health expenditure

The results of the probit regression for any health expenditure are listed in Table 3. Interestingly, age is not a significant predictor of any health-care use, possibly because individuals of all ages are likely to have some contact with the health care system. Consistent with prior evidence, women are more likely to incur any form of expenditure. Married individuals also have a lower likelihood of any expenditure, perhaps because spouses have to share income across at least two individuals. There is a negative relationship between income and the probability of expenditure, potentially because the primary route of income is through the choice of insurance. Individuals in poor health or with physical impairments (ADLs) consistently have a higher likelihood of expenditure. The effect of insurance is as expected, with individuals who have Medicare only or private health insurance having a higher likelihood of any health expenditure, while those with other forms of public insurance exhibiting lower probabilities of health-care consumption, possibly because of reduced access to care.

Table 3 Probability of any health expenditurea (1996–2003)

Effects on expenditure

Table 4 reports the expenditure estimates after correcting for endogeneity and sample selection. The sample selection coefficient (inverse Mills ratio) is significant for all sources of expenditure except inpatient care, suggesting that the selection correction influences the estimated coefficient. With the exception of inpatient care, individuals with Medicare only and individuals with private health insurance spend more on all types of health care than individuals with no insurance, while individuals with other public insurance spend less on all types of health care. For pharmaceutical care, there is a positive age effect, in that elderly patients spend more than non-elderly patients, males spend less than females, married individuals spend less than non-married persons, and individuals with incomes higher than the poverty level spend less than the poorest individuals. Medicare and private insurees also pay more for prescription medicines than the uninsured. For inpatient care, respondents with other public insurance or private insurance spend more than uninsured individuals, while respondents with Medicare only spend less, although this coefficient was not significant.

Table 4 Multivariate regression of all components of health expenditure (1996–2003)

There are a number of reasons why we might have observed these patterns. Although the poorest groups in the population tend to be less healthy than the rest of the population,Footnote 50 it appears that the poor may experience barriers in access to care, for instance, these individuals may not be able to afford to take time off from work for a few hours or days. Medicaid also pays lower prices for medical care than uninsured individuals because of strong negotiating power. The fact that individuals with Medicare only spend more on most types of care than the uninsured is not surprising, given that this population group is older, less healthy, and does not face the full price of care for most outpatient services. This group even spends more on prescription drugs even though Medicare did not offer an outpatient drug benefit during the study period, although this outcome was likely related to greater need for medications in this population. Yet, Medicare individuals incur lower inpatient expenditure than uninsured persons, perhaps because of Medicare's pricing power and because Medicare enrollees may see primary prevention as more important. Those with private insurance spend more on all types of medical care, even after accounting for health status and income. Thus, there appears to be a significant moral hazard effect associated with private insurance, and this volume effect generally outweighs the effect of price negotiations.

Both drug and inpatient expenditure increase with age, which is consistent with the preliminary evidence and the expected effect of ageing. However, age has the opposite effect on outpatient and emergency room expenditure, while it is not significant for office-based visits. Therefore, although ageing exerts an effect on expenditure, the effect is heterogeneously different across various sources of expenditure, suggesting that heterogeneity should be taken into account when examining expenditure data. Across all types of expenditure (except for inpatient), males and married individuals exhibit lower expenditure. After controlling for the endogeneity of insurance, the effect of income is systematically negative, indicating that higher income does not lead to higher health expenditure. The likely explanation is that higher-income individuals purchase more generous insurance, and the increase in consumption is channelled through the insurance effect rather than an income effect. Meanwhile, individuals living in an urban area spend less on all types of care, with the exception of office-based care. This conflicting result for office-based care might be related to a higher concentration of doctors in urban areas.

As expected, poor health status is always associated with higher expenditure, and although having an ADL limitation leads to lower expenditure for outpatient and office-based visits, neither of these results is significant. Finally, the time effects are consistent with the aggregate evidence, suggesting that expenditure peaks in 2002–2003 and indicating that time effects are related to the expansion of drug and outpatient expenditure.

Discussion

By using several years of micro data, the intent of this paper has been to analyse the effects of insurance and need-related determinants on pharmaceutical and other health expenditure and to explain the causes behind recent health expenditure growth in the U.S. In addressing the research questions, we have corrected for three issues: (i) the potential endogeneity of insurance, (ii) the joint determination of different types of health expenditure, and (iii) unobserved health care consumption for some individuals (sample selection). As hypothesized, the corrections suggest that there is an heterogeneous effect of insurance on health care expenditure. We find that the joint estimation of health expenditure as well as additional controls, such as endogeneity adjustments for insurance and sample selection corrections, influence the estimates for each type of expenditure. Furthermore, it appears that socio-economic, demographic, and health characteristics have heterogeneous effects on different components of health care expenditure. Yet, it is important to bear in mind that we are unable to fully capture the unobserved heterogeneity in health expenditure related to specific fixed effects. Future research could address this methodological issue.

The results obtained in this study have a number of policy implications. First, with the exception of inpatient care, age exhibits a non-linear pattern consistent with other studies, indicating some accumulation of health expenditure at early and later stages of life.Footnote 51 Accordingly, policymakers should be less concerned about the impact of an ageing population on health care expenditure and more concerned about other factors behind these costs. Policies aimed at reducing expenditure among the elderly and any proposals to increase premiums for the higher-income elderly may end up doing more harm than good and detract from the real drivers of expenditure increases.

Second, although drug expenditure is the primary source of overall health expenditure increases, it is not necessarily related to the expansion of public insurance coverage. Perhaps there are barriers to access among public insurance beneficiaries, or perhaps Medicaid programmes are relatively successful at containing expenditure increases. Nonetheless, there is evidence of ex-post moral hazard among the privately insured population, explaining the consumption of more expensive treatments rather than determining use. The latter is an important finding considering that these individuals are likely to be the healthier and wealthier Americans. This suggests that measures implemented by private insurance companies have not been completely effective at containing expenditure. The nature of the data does not permit us to determine the reasons for these insurance and expenditure-related findings. However, these results do imply that attempts to privatize Medicare and Medicaid and proposals that involve expanding health insurance coverage through tax credits may do little to contain health expenditure, and in fact, may only lead to higher costs. A less costly option may actually be to expand public insurance coverage, while ensuring that public programmes continue to impose tight reimbursement rules. Of course, there may be health and access implications from imposing these restrictions, but these considerations are outside the scope of the present study.

Third, after controlling for the endogeneity of insurance and sample selection, it appears that income has a negative impact on most expenditure. This may have occurred because health status was included in the analysis, and higher income tends to be positively correlated with health status. Another reason for this finding is that the primary effect of income appears to be through the purchase of insurance. However, the generally negative impact of income on expenditure does not imply that insurance coverage should not be targeted at the poorest groups in the population; on the contrary, it suggests that policymakers need to ensure that these individuals are covered as they are less likely to purchase private insurance and are also more likely to be in poorer health. The recent Federal bill aimed at increasing cost sharing for the Medicaid populationFootnote 52 is an example of a policy that will likely shift costs to the most vulnerable in the population and potentially reduce allocative efficiency.