Introduction

Information on hospital unit costs is valuable to health decision-makers and researchers for at least three purposes: budgeting (now receiving more attention with the availability of additional funds for health in poor countries through the Global Fund to Fight AIDS, Tuberculosis and Malaria); the assessment of hospital efficiency; and the assessment, by means of either cost-benefit or cost-effectiveness analysis, of the efficiency of different health interventions. Recognizing the need to make this information available on a country-specific basis, WHO has undertaken as part of the work programme WHO-CHOICE (CHO osing I nterventions that are C ost-E ffective – see http://www.who.int/evidence/cea), an extensive effort to collate all sources of data on unit costs from as many countries as possible [1]. Large gaps remain, however, particularly for developing countries. Although the long-term solution is that all countries perform their own costing studies, the question arises whether it is possible to predict unit costs for different countries in a standardized way for short-term use. The purpose of the work described in this paper is to use the data collected across countries to predict unit costs in countries for which data are not yet available (both point estimates and uncertainty intervals are reported).

Health economics has a long tradition of estimating hospital-cost functions econometrically [210]. Econometric models explain how total costs change in response to differences in service mix, inputs, input prices, and scale of operations. They allow cost and production functions to be specified with sufficient flexibility that a non-linear relationship can be demonstrated between costs and quantity of inputs: total costs can rise at a lower rate than prices[2].

Previous studies have commonly used microeconomic data to analyse and estimate hospital-cost functions. This literature indicates two main approaches: behavioural cost functions and cost minimization functions [2, 3, 9, 11]. Behavioural cost functions have been used to explain the variations in cost per unit of output among hospitals. They have used as determinants all variables for which a causal relationship to hospital costs is hypothesized and data are available – e.g., bed size, global indicators of hospital activity such as average length of stay and occupancy rate, dummy variables for teaching status, etc. On the other hand, the literature on cost minimization has described the minimum cost of providing a given volume of output as a function of an exogenous vector of input prices and the volume of output. The purpose is to determine whether hospitals are cost-minimizers (profit maximizers).

When testing the hypothesis of cost-minimization, the explanatory variables typically comprise only of output quantities (e.g., number of bed days) and input prices. The remaining variables used in the behavioural cost function specification are not part of the cost minimization question but can be used to explain deviation of observed unit costs from the theoretical minimum functions – e.g., possible reasons for inefficiency [3].

To our knowledge, all previous studies have used within-country data sets; we know of none that has attempted to estimate hospital-cost functions across countries. Such studies require a large number of observations from as many countries as possible.

The model described here follows the tradition of the behavioural cost function literature because its purpose is to estimate country-specific costs per bed-day, not to test the hypothesis of cost-minimization. The analysis controls for across-country price-level differences by using unit costs adjusted for purchasing-power parity, namely in international dollars; and for differences in quantity and complexity of resource use by using macro-level indicators such as per capita GDP [1214]. The model forms part of a series of models that can be used to predict country-specific unit costs for a number of purposes. They may be used, for instance, to estimate: (i) unit costs at different capacity levels for the purposes of efficiency analysis or economic evaluation of health interventions; (ii) the "hotel" component of average cost per bed-day for budgeting exercises; or (iii) unit costs excluding components that might be funded from other sources, such as drugs. The specific objectives of this paper are to:

  • explain the observed differences in hospital inpatient cost per bed-day across and within countries; and

  • use the results to predict cost per bed-day for countries for which these data are not yet available.

Methods

Data

The search sources used to obtain the data were: Medline, Econlit, Social Science Citation Index, regional Index Medicus, Eldis (for developing-country data), Commonwealth Agricultural Bureau (CAB), and the British Library for Development Studies Databases. The range of years was set at 1960 to the present. Data covering costs and charges were included.

The search terms used were: "costs and cost analysis" and hospital costs or health centre or the abbreviations HC (health centre) or PHC (primary health centre) or outpatient care. The language sources searched were English, French, Spanish and Arabic; no Arabic study was found. In addition, a number of studies were found in the grey literature, from such sources as electronic databases, government regulatory bodies, research institutions, and individual health economists known to the authors [2, 1554]. Also included were data from a number of WHO-commissioned studies on unit costs.

A standard template was used for extracting data from all sources. Database variables include: ownership; level of facility (see Additional file 1: Annex 1 for a definition of facility types as coded in the unit cost database); number of beds; number of inpatient and outpatient specialties; cost data (cost per bed-day, outpatient visit, and admission); utilization data (bed-days, outpatient visits, admissions); types of cost included in the cost analysis (capital, drugs, ancillary, food) and whether they were based on costs or charges; capacity utilization (occupancy rate, average length of stay, bed turnover, and average number of visits per doctor per day); reference year for cost data; currency, and methods of allocation of joint costs. The database consists of unit-cost data from 49 countries for various years between 1973–2000, totalling 2173 country-years of observations. Some studies provided information on 100% of the variables described above; at the other extreme, some provided information on less than 15%. The number of observations used in this analysis was 1171 (see Additional file 1: Annex 2 for the percentage of missing data in the model variables and Additional file 1: Annex 3 for the list of countries).

Data cleaning comprised consistency checks and direct derivation of some of the missing variables, when possible, from other variables from the same observation (e.g., occupancy rate was calculated from number of beds and number of bed-days). STATA software was used for data analysis [55].

Cost data were converted to 1998 International dollars by means of GDP deflators [56] and purchasing-power-parity exchange rates used for WHO's national health accounts estimates (PPP exchange rates used in this analysis are available from the WHO-CHOICE website: http://www.who.int/evidence/cea).

Data Imputation

Most statistical procedures rely on complete-data methods of analysis: computational programs require that all cases contain values for all variables to be analyzed. Thus, as default, most software programs exclude from the analysis observations with missing data on any of the variables (list-wise deletion). This can give rise to two problems: compromised analytical power, and estimation bias. The latter occurs, for example, if the probability that a particular value is missing is correlated with certain determinants. For example, if the complete observation sets tend to be from observations with unit costs that are systematically higher or lower than average, the conclusions for out-of sample estimation drawn from an analysis based on list-wise deletion will be biased upwards or downwards [57].

There is a growing literature on how to deal with missing data in a way that does not require incomplete observation sets to be deleted, and several software programs have been developed for this purpose. If data are not missing in a systematic way, missing data can be imputed using the observed values for complete sets of observations as covariates for prediction purposes. Multiple imputation is an effective method for general-purpose handling of missing data in multivariate analysis; it allows subsequent analysis to take account of the level of uncertainty surrounding each imputed value, as described below [5861]. The statistical model used for multiple imputation is the joint multivariate normal distribution. One of its main advantages is that it produces reliable estimates of standard errors: single imputation methods do not allow for the additional error introduced by imputation. In addition, the introduction of random error into the imputation process makes it possible to obtain largely unbiased estimates of all parameters [58].

In this study, multiple imputation was performed with Amelia, a statistical software program designed specifically for multiple imputation of missing data [57, 59, 62, 63]. First, five completed-data sets are created by imputing the unobserved data five times, using five independent draws from an imputation model. The model is constructed to approximate the true distributional relationship between the unobserved data and the available information. This reduces potential bias due to systematic difference between the observed and the unobserved data. Second, five complete-data analyses are performed by treating each completed-data set as an actual complete-data set; this permits standard complete-data procedures and software to be utilized directly. Third, the results from the five complete-data analyses are combined [64] to obtain the so-called repeated-imputation inference, which takes into account the uncertainty in the imputed values.

Model specifications

From the tradition of using cost functions to explain observed variations in unit costs, we estimate a long-run cost-function by means of Ordinary Least Squares regression analysis (OLS); the dependent variable is the natural log of cost per bed-day [2, 3, 68, 65]. The primary reason for using unit cost rather than total cost as the dependent variable is to avoid the higher error terms due to non-uniform variance (heteroscedasticity) in the estimated regression. This could arise if total cost were used as the dependent variable, as the error term could be correlated with hospital size [2, 3]. The reason for using cost per bed-day rather than cost per admission is that "bed-days" are better than "admissions" as a proxy for such hospital services as nursing, accommodation and other "hotel services" [3], permitting more flexibility in the use of estimated unit costs.

As the relationship between unit costs and the explanatory variables are expected to be non-linear, the Cobb-Douglas transformation was used to approximate the normal distribution of the model variables. Natural logs were used. The Cobb-Douglas functional form can be written as follows:

Equation 1

or,

Equation 2

ln (Y) = δ + α1 ln (X 1) + α2 ln (X 2)

where δ = ln (α0). This function is non-linear in the variables Y, X 1 and X 2, but it is linear in the parameters δ, α1, α2, and can be readily estimated using Ordinary Least Squares[66].

Log transformation has the added advantage that coefficients can be readily interpreted as elasticities[3, 66].

Therefore, the cost function specification of the OLS regression model may be written as:

Equation 3

Where UC i is the natural log (ln) of cost per bed-day in 1998 I $ in the ith hospital; X 1 is ln of GDP per capita in 1998 I $; X 2 is ln of occupancy rate; X 3,4 are dummy variables indicating the inclusion of drug or food costs (included = 1); X 5,6 are dummy variables for hospital levels 1–2 (the comparator is level 3 hospital); X 7,8 are dummy variables indicating facility ownership (comparator is private not-for-profit hospitals); X 9 is a dummy variable controlling for USA data (USA = 1); and e denotes the error term.

The choice of explanatory variables is partly related to economic theory and partly determined by the purpose of the exercise, which is to estimate unit costs for countries where the data are not available. In this case, the chosen explanatory variables must be available in the out-of-sample countries. Country-specific – or in the case of large countries such as China, province-specific – GDP per capita in international dollars (I $) is used as a proxy for level of technology [1214]; occupancy rate as a proxy for level of capacity utilization; and hospital level as a proxy for case mix. Unit costs are expected to be correlated positively with GDP per capita and case mix and negatively with capacity utilization.

The inclusion of the seven control variables makes it possible to estimate unit cost for different purposes to suit different types of analysis – for example, cost per bed-day in a primary-level hospital, which does not provide drugs or food; or the cost in a tertiary level hospital, with drugs and food included.

The dummy for the USA was included because all data were charges rather than costs and because there were a large number of observations from that country. Dummies for countries other than the USA with a large number of observations, such as China and the United Kingdom, were also tested as was the use of dummy variables to capture whether the cost estimates included capital or ancillary costs. These variables were not included in the model which best fit the data. Utilization variables, such as number of bed-days or outpatient visits, and hospital indicators, such as average length of stay, were not included as explanatory variables because most out-of-sample countries do not have data on these variables, and prediction of unit costs would, therefore, be impossible.

Model-fit

Regression diagnostics were used to judge the goodness-of-fit of the model. They included the tolerance test for multicollinearity, its reciprocal variance inflation factors and estimates of adjusted R square and F statistics of the regression model.

Predicted values and uncertainty analysis

Two types of uncertainty arise from using statistical modes: estimation uncertainty arising from not knowing β and α perfectly – an unavoidable consequence of having a finite number of observations; and fundamental uncertainty represented by the stochastic component as a result of unobservable factors that may influence the dependent variable but are not included in the explanatory variables [62]. To account for both types of uncertainty, statistical simulation was used to compute the quantities of interest, namely average cost per bed-day and the uncertainty around these estimates. Statistical simulation uses the logic of survey sampling to learn about any feature of the probability distribution of the quantities of interest, such as its mean or variance [62].

It does so in two steps. First, simulated parameter values are obtained by drawing random values from the data set to obtain a new value of the parameter estimate. This is repeated 1000 times. Then the mean, standard deviation, and 95% confidence interval around the parameter estimates are computed. Second, simulated predicted values of ŷ (the quantity of interest) are calculated, as follows: (1) one value is set for each explanatory variable; (2) taking the simulated coefficients from the previous step, the systematic component (g) of the statistical model is estimated, where g= f (X,B); (3) the predicted value is simulated by taking a random draw from the systematic component of the statistical model; (4) these steps are repeated 1000 times to produce 1000 predicted values, thus approximating the entire probability distribution of ŷ. From these simulations, the mean predicted value, standard deviation, and 95% confidence interval around the predicted values are computed. In this way, this analysis accounts for both fundamental and parameter uncertainty.

The predicted log of cost per bed day, ln

, can then be calculated from:

Equation 4

where

and are the estimated parameters, and X i..n are the independent variables. If and , back-transforming Equation 4 (reduced to 1 independent log-transformed variable for simplicity) gives the power function.

Equation 5

where

denotes a biased estimate of the mean cost per bed-day due to back-transformation. This is because one of the implicit assumptions of using log-transformed models is that the least-squares regression residuals in the transformed space are normally distributed. In this case, back-transforming to estimate unit costs gives the median and not the mean. To estimate the mean it is necessary to use a bias correction technique. The smearing method described by Duan (1983) was used to correct for the back-transformation bias [67]. The smearing method is non-parametric, since it does not require the regression errors to have any specified distribution (e.g., normality). If the n residuals in log space are denoted by r i , and b is the base of logarithm used, the smearing correction factor, , for the logarithmic transformation is given by:

Equation 6

Multiplying the right side of Equation 5 by Equation 6 almost removes the bias, so that:

Equation 7

The smearing correction factor (

) for our model was 1.25.

Results

Table 1 shows the variable names, description, mean and standard error, estimated after combining the results of the five datasets of the multiple imputation estimates. Table 2 presents the results of the best-fit regression model. The adjusted R square of the combined regressions is 0.80, with an F statistic of 509 (p < 0.0001), indicating that the model explains a large part of the variation of the cost per bed-day across countries [68]. The signs of the coefficients are consistent with the earlier hypotheses. For example, the GDP per capita is positively correlated with cost per bed-day, while the lower the occupancy rate the higher is the cost per bed-day. Unit costs are lower in level-one hospitals than in those of levels two and three. The coefficients for the two main explanatory variables (GDP per capita and occupancy rate) are highly significant (p < 0.0001), as well as most of the control dummies, e.g., hospital level. The coefficient for food costs is not significant at the 5% level but was included in the model because it added to its explanatory power.

Table 1 Descriptive statistics of the multiple imputation estimates N = 1171
Table 2 Multiple Imputation regression coefficients and SE Dependent variable: Natural log of cost per bed-day in 1998 I $ N: 1171

The tolerance test and its reciprocal variance inflation factors (VIF) showed no evidence for multicollinearity between the model variables (tolerance ranged between 0.20 and 0.89, mean VIF 1.97; tolerance less than 0.05 and VIF more than 20 indicate the presence of multicollinearity).

The only country dummy that was included in the final model was for the USA. The most plausible explanation for the positive, highly significant coefficient for the USA dummy is that USA was the only large data set where charges were reported rather than costs. In this case, the coefficient for the USA could be interpreted as a cost-to-charge ratio, estimated as 1:1.74. In other words, costs represent 57% of the charge on average. This is consistent with published national reports on the average cost-to-charge ratio for the USA such as that published by the United States General Accounting Office (63%) [69].

Figure 1 shows the three regression lines of levels one, two and three hospitals, respectively, plotted against the log of GDP per capita (the Y-axis is log of cost per bed-day). The regression lines were estimated for public hospitals, with occupancy rate of 80%, including food costs and excluding drugs. Because the original data had a lower average occupancy rate (mean 71%, SD 39%), and most observations included drug costs, it is to be expected that the regression lines will be slightly lower than the actual data points in the database. The regression lines do not pass through the USA data points situated at the upper right side of the graph because they have been calculated for the case where the US dummy was set at zero.

Figure 1
figure 1

Regression lines for level one, two and three hospitals against the natural log of GDP per capita. (The Y-axis is the dependent variable: natural log of cost per bed day) Cost in 1998 I$ N = 1171

Overall, Figure 1 shows that the regression lines have a good fit with the data used to develop the model. They not only illustrate the relationship between cost per bed-day, hospital level and GDP per capita, but also show that there remains substantial variation in unit costs for any given level of GDP per capita. It would be inadvisable, therefore, to base cost estimates on a single estimate of hospital costs in a particular setting, something that is a common feature of cost-effectiveness studies.

To use the equation reported in Table 2 to predict unit costs for a number of in and out-of-sample countries, with the appropriate uncertainty interval, requires consideration of the probability distributions of the predicted unit costs, given a specified level of the model variables. In order to derive these distributions, simulation techniques were used following the steps described in the Methods section. Table 3 presents for selected countries in different regions of the world the average simulated predicted values and 95% uncertainty intervals. The estimates are presented in 2000 I $, based on the 2000 GDP per capita in I $ and assuming that the estimated coefficients will remain constant over a short time period. They are specific to public hospitals, at an occupancy rate of 80%, excluding drug, but including food costs. Regional estimates of cost per bed day, with the same characteristics described above, are available from the WHO-CHOICE website: http://www.who.int/evidence/cea.

Table 3 Predicted cost per bed-day (i) in 2000 I $

Discussion

This paper describes recent work on developing models to predict country-specific hospital unit costs, by level of hospital and ownership, for countries where these data are not available. The main purpose of this work was to feed into estimates of the costs and effects of many types of health interventions in different settings. Estimates are typically available for variables such as the number of days in hospital, or the number of outpatient visits, for certain types of interventions, but unit prices are not available for many countries. The model presented in this paper used all data on unit costs that could be collected after a thorough search to estimate costs for countries where this information does not exist. Data imputation techniques were used to impute missing data, which has the advantage of eliminating the bias introduced by list-wise deletion of observations in cases where information for some of the variables required by the model is missing.

The goodness-of-fit of the model was tested by various regression diagnostic techniques including the tolerance test for multicollinearity, adjusted R square and F statistic. All suggested a good fit of the model with the data and that GDP per capita could be used to capture different levels of technology use across countries. Although this is the first time that costs have been compared across countries, the signs of the coefficients are consistent with results from previous microeconomic studies within countries. For example, these studies have found that occupancy rate was negatively correlated with cost per bed-day while hospital level had the opposite relationship, something also found in the model presented in this paper [70, 71]. This adds confidence to the estimated results.

In addition, the estimates produced by this model were sent to health economists and researchers in different countries to check their face validity. Experts from countries in all WHO regions, covering wide differences in GDP per capita and in technologies typically found in hospitals were consulted, including Benin, Canada, Ecuador, Egypt, Kenya, Netherlands and Thailand. They were provided with a description of the estimated unit cost (e.g., which costs were included) and were asked whether they thought they approximated the average cost per bed-day in their countries. All indicated that the results had face validity.

It is of particular note that the model incorporates a more extensive database on unit costs by hospital level and ownership than has previously been available. Increasing the range of observations will increase the validity of extrapolations of cost estimates for countries in which these data are not available. Additional sources of data are being sought for this purpose and to assist countries to develop their own studies. As this body of information grows, the predictive power of unit-cost models will continue to increase.

There are other possible uses of this model such as estimating the possible costs of scaling-up health interventions for the poor, which is receiving increasing attention with the activities of such bodies as the Global Fund to Fight AIDS, Tuberculosis and Malaria. This can be done in many ways, according to the objectives of the analysis. It may be used, for instance, to estimate:

  • unit costs at different capacity levels for purposes of efficiency analysis or economic evaluation of health interventions;

  • the "hotel" component of average cost per bed-day;

  • unit costs, excluding specific items such as drugs or food costs.

Finally, it must be emphasized that there is wide variation in the unit costs estimated from studies within a particular country (Figure 1). These differences are sometimes of an order of magnitude, and cannot always be attributed to different methods. This implies that analysts cannot simply take the cost estimates from a single study in a country to guide their assessment of the cost-effectiveness of interventions, or the costs of scaling-up. In some cases, they could be wrong by an order of magnitude.

Conflict of Interest

None.