Introduction

The National Health Service (NHS) in the UK delivers universal tax-funded healthcare to the population, free at the point of delivery. Within the current structure of the NHS in England, about two thirds of the £106 billion NHS budget is allocated to Clinical Commissioning Groups (CCGs), which are responsible for the commissioning of healthcare services including secondary care, rehabilitation services and mental health services to their local population [1]. In allocating funds to CCGs, NHS England considers several factors including population size, geographical variation in the costs of providing healthcare, population age structure and other socio-economic factors [2]. However, there is considerable discretion about how funds are used, which has led to concern about unwarranted variation in activity and the need to improve efficiency [1]. The current focus on efficiency is also driven by the sustained period of financial austerity in which the NHS is operating [3].

CCGs are membership organisations, representing general medical practices within geographical areas, which design, plan and commission local health services to meet the needs of their populations [4]. In 2013, 211 CCGs were created through the Health and Social Care Act (2012) replacing 152 Primary Care Trusts (PCTs). The reforms brought about by the Act were intended to make the NHS more responsive, efficient and accountable [4]. Measures relating to the quality of health services provided by CCGs and associated health outcomes are published in the CCG Outcomes Indicator Set collected by NHS Digital, formerly known as the Health and Social Care Information Centre (HSCIC) [5]. The indicator set includes a range of data on different measures of quality and outcomes including emergency hospital admissions, mortality rates, and cancer survival rates. However, it does not provide a single measure of overall or comparative efficiency across all indicators at CCG-level.

Data envelopment analysis (DEA) is one method that can be employed to measure efficiency, particularly when comparing the efficiency of organisations that use multiple inputs to produce multiple outputs as is the case in healthcare [6]. Using DEA, this paper aims to estimate the comparative efficiency of English CCGs and assess the impact of environmental factors on efficiency scores using regression analysis.

Methods

DEA is a linear programming technique that is concerned with measuring the relative efficiency of a given set of organisations, entities or programmes which are referred to as “Decision Making Units” (DMUs) [6]. It assumes that the DMUs produce homogenous outputs using the same inputs and provides a way to compare them against each other. It has been applied in many different industries and activities in different countries and contexts [7]. It is computed by the following:

$$ { \max\ \mathrm{h}}_0=\frac{\sum_{r=1}^s{u}_r\ {y}_{r0}}{\sum_{i=1}^m{v}_i\ {x}_{i0}} $$

Subject to

$$ \frac{\sum_{r=1}^s{u}_r\ {y}_{r j}}{\sum_{i=1}^m{v}_i\ {x}_{i j}}<1; j=1,\dots \dots \dots .. n $$
$$ {\mathrm{V}}_{\mathrm{r},}{\mathrm{u}}_{\mathrm{i}\ }\ge 0;\mathrm{r}=1\dots \dots \dots \dots \dots \mathrm{s}:\mathrm{i}=1\dots \dots \dots \dots .\mathrm{m} $$

where y rj and x ij are the known outputs and inputs of the jth DMU respectively and u r and v i are the weights assigned to the inputs and outputs [6, 7]. The weights take a positive value with each DMU having its own set of weights. Each DMU is given the most favourable weighting that constraints allow. The model illustrated above is output-orientated because it shows the maximum output that could be achieved for a given level of input if a DMU is operating efficiently. The inverse of the model would be input-orientated and it would allow for the estimation of the minimum level of input required to produce a level of output if operating at full efficiency. The original model assumes constant returns to scale (CRS). However, it was later extended to include a model that could accommodate variable returns to scale [8].

DMUs with efficiency scores of 1 (i.e. those exhibiting best practice) form a frontier or envelopment surface and any DMUs operating inefficiently will lie below the frontier. This is similar to the production possibility frontier used in economic theory. However, unlike the production functions used in economics, DEA frontiers are derived from empirical data and not economic theory. The constant returns to scale model is only appropriate for use where it is known that the DMUs are operating at an optimal scale. In many cases, there may be diminishing or increasing returns to scale, and it is necessary to use the variable returns to scale model.

DEA has some advantages and disadvantages over other methods of measuring efficiency such as stochastic frontier analysis (SFA) and corrected ordinary least squares (COLS) cost frontier analysis [9]. Mainly, it does not require any prior assumptions about the specification of cost/production functions and standard errors unlike econometric methods.

This study aims to use DEA to measure the efficiency of English CCGs and assess the impact of environmental factors. The decision-making units of concern in this study are English CCGs. Data for each CCG were collected for the financial year 2014/15 and in cases where this was not available, data for the 2014 calendar year were used. The main sources of data were NHS Digital (formerly HSCIC) and NHS England (as shown in the footnotes to Table 1). In all, data for 20 variables were collected, 2 of which were input variables, 7 were output variables and 11 were environmental variables.

Table 1 CCG-level descriptive statistics for input and output variables

The inputs included in this study represent labour and financial inputs, while the outputs include a small selection of the CCG Outcome Indicator Set currently used by NHS Digital to measure the quality of primary care. The outcome indicator selection process began by identifying issues which were leading priorities within the NHS in England [10]. The selected outcome measures were premature mortality, long term conditions and patient satisfaction. The reduction of premature mortality is high on the agenda in the UK [11] and in this paper is assumed to be a key output of the healthcare system. Cancer, respiratory disease and cardiovascular disease are among the top 5 causes of premature mortality in England [11]. A reduction in respiratory disease and cancer mortality was assumed to be a key output of CCG commissioning. Coronary heart disease (CHD) survival was excluded because model stepwise and correlation analysis model selection techniques showed that it added no explanatory power. Additionally, quality and outcomes framework (QOF) achievements in the management of cancer, chronic obstructive pulmonary disease (COPD) and cardiovascular disease were included as short term outputs in these specific disease areas. As mortality is a long term output, it may fail to capture any improvements in the short term, hence the inclusion of QOF outcomes. The proportion of patients who would recommend their general practitioner (GP) practice to others was included as a proxy measure of patient satisfaction. Quality of life of individuals with long term conditions was chosen as an output for the management of long term conditions in the community.

CCGs with missing data on any of the variables were excluded from analysis; this applied to 3 out of a total 211 CCGs in England, thus bringing the total units of analysis to 208. Table 1 shows the descriptive statistics for all variables.

Environmental variables are variables external to the production process that may still affect the utilisation or availability of resources [16]. In this study, economic, social, physical and lifestyle characteristics that are deemed to be wider determinants of health [17,18,19] and are likely to confound any measure of efficiency have been included. Additionally, some disease prevalence variables have been included, as they may influence disease-specific mortality outcomes. The environmental variables included are a subset of the public health outcomes framework, and were selected on the basis that they reflect deprivation, population demographics, and burden of disease and lifestyle factors on CCG resources. The variables included are by no means an exhaustive list of potential candidates. Table 2 presents the descriptive statistics for the variables used.

Table 2 CCG-level descriptive statistics for environmental variables

All mortality variables were transformed into survival rates by converting the values to their inverse form. Additionally, to address the magnitude imbalance within the dataset, all input and output variables were mean normalised.

DEA models were run using MaxDEA 6.6 software [22]. QGIS software [23] was used to map the results of the final DEA model.

Regression analysis was used to quantify the relationship between efficiency and environmental variables. The efficiency scores computed from DEA were used as the dependant variable and the environmental factors used as explanatory variables. As efficiency scores are limited to values between 0 and 1, Tobit regression was used to estimate the relationship between the scores and environmental variables using Stata 13.1 software [24].

Results

The average technical efficiency score for all CCGs in the model was 0.92. A total of 47 CCGs had a score of 1 (Table 3). The remaining 161 CCGs were inefficient and had an average score of 0.9. This means that on average, the inefficient CCGs could reduce their inputs by 10% and still produce the same level of outputs if they were operating efficiently. The CCG that was the least efficient compared to others had a score of 0.75.

Table 3 Distribution of efficiency scores

Geographical distribution

The geographical inequalities that exist in health outcomes are well documented [25]. To explore the implications of this geographical variation on the efficiency of CCGs, technical efficiency scores were plotted on a map of England. Map 1 shows that a high number of CCGs with the lowest technical efficiency scores are clustered in Northern England while CCGs with higher technical efficiency scores are mostly in Central and Southern regions of England.

Map 1
figure 1

Geographical distributions of technical efficiency scores

The percentage of CCGs experiencing technical efficiency scores of 1 in each deprivation quintile varied (Table 4). It appears that the most deprived CCG quintile in England had the lowest mean efficiency score and the fewest efficient CCGs compared to the other quintiles (Table 4). This relation is explored further using the Tobit regression model.

Table 4 Mean technical efficiency scores by deprivation quintile

Environmental factors that affect the efficiency of a CCG

The results of the Tobit regression model are shown in Table 5. The categorical variable for the population size quintile had negative coefficients on quintiles 2–5 with quintile 1 as the reference. Quintile 1 had the lowest population size, therefore being in a quintile with a higher population caused lower efficiency scores. This relationship was statistically significant for quintiles 4 and 5 at the 5% level but not for quintiles 2 and 3. A 1 % increase in population in quintile 4 is associated with a 5.2% fall in efficiency (Table 5).

Table 5 Regression analysis results for predictors of technical efficiency at CCG level

The coefficient for the proportion of children registered within the CCG had a positive sign and was statistically significant. Higher proportions of under 18 s registered in the practice resulted in higher efficiency scores. The rate of unemployment and prevalence of COPD had the opposite effect, with negative coefficients.

IMD ranking, the proportion of patients aged 65 plus, patients with long term conditions, smoking and the prevalence of obesity, cancer and CHD were not found to be statistically significant predictors of technical efficiency.

Discussion

Using DEA, this study has computed the technical efficiency scores of 208 English CCGs. The findings indicate that CCGs serving smaller populations appear to be more efficient. This goes against the a priori assumption that CCGs with larger populations would be able take advantage of economies of scale, and therefore be more efficient than smaller ones. When looking at the geographical distribution of efficiency scores, there appeared to be a negative relationship between deprivation and efficiency scores. However, regression analysis showed that the relationship at CCG-level was not statistically significant.

A further a priori assumption was that the burden of disease (i.e. the prevalence of long term conditions, CHD, COPD and cancer) would have a negative impact on the efficiency of CCGs due to the increased demand for services. Regression analysis showed that this anticipated impact, whilst there, was not statistically significant. The only exception was the prevalence of COPD, which had a significant negative relationship with technical efficiency scores. Lifestyle factors such as smoking and obesity were also expected to increase the burden of disease and lead to lower efficiency scores, however regression analysis showed that there was no statistically significant link. Having high proportions of unemployed residents led to a statistically significant increase in inefficiency. This was not surprising as individuals from low socioeconomic groups are known to experience less favourable health outcomes than more affluent groups. The prevalence of COPD and the rate of unemployment were found to be statistically significant predictors of efficiency. These can be improved through targeted public health interventions to reduce smoking and other lifestyle factors as well as economic investment in affected areas.

This study is the first to use DEA to quantify the efficiency of CCGs in England, and assess environmental factors that impact on efficiency. However, the study has a number of limitations. Firstly, the DEA programme has been allowed full flexibility in assigning weights to input and output variables. As DEA methods aim to paint a DMU in the best light possible, some inputs may be completely ignored (i.e. given a weight of 0) for some DMUs to make them as efficient as possible.

Another limitation was the unavailability of data for particular variables which ideally would have been included in the analyses. For example, the quality of hospital infrastructure’s at CCG level i.e. a new 100 bed hospital is expected to be much better placed than a Victorian 100 bed hospital to provide services. Several environmental variables were obtained from surveys and may not be a true representation of the population. This means that estimated relationships may be constrained by the accuracy of available data. It is possible that some important variables are missing from the models and that some unnecessary variables have been included.

This study found average efficiency scores that are similar to those of previous studies in the UK at the primary care administrative body level. Salinas-Jimenez and Smith [26], Giuffrida and Gravelle [27] and Martin and Smith [28] found that inefficient units had average efficiency scores of 0.93, 0.99 and 0.90 respectively. This study is also comparable to previous studies in its findings around environmental variables. Martin and Smith [28], using a Tobit regression similar to the one in this study, found that deprived PCTs were significantly less efficient than those with lower levels of deprivation. In contrast to this study, they did not find an association between population size and efficiency scores. Varela et al. [29] found the population size of municipalities in Brazil had a strong link to their technical efficiency in delivering healthcare, with smaller municipalities found to be more efficient than larger ones.

This study’s finding that the relationship between deprivation and efficiency is not statistically significant suggests that NHS England’s efforts to adjust for environmental factors, such as those used here, are broadly reflected within the CCG-level per capita budget allocation. The budget allocation formula aims to adjust for comparative need and unavoidable differences in costs for providers associated with their geographical location (known as the market forces factor) [30].

The Health and Social Care Act (2012) took public health departments out of NHS PCTs and placed them within local authorities. It is thought that local authorities are better placed to identify the broader needs of the population that impact on public health. Whether this will lead to improvements in the factors that affect CCG-level efficiency remains to be seen. Additionally, as the economy slowly recovers from the 2008/9 recession, unemployment rates are expected to decrease. However, this study suggests that policies designed to raise employment rates in disadvantaged areas are important.

Further research on appropriate measures of output in healthcare would be valuable to build on these results. Additionally, the development of guidelines on measuring efficiency in healthcare would promote robust methodological processes and enhance comparability across studies.