Background

The growth in pharmaceutical cost is a real problem in health care sustainability [1]. This is due to a number of factors: ageing of population, introduction of new medicines and changes in prescription practices and age-related patient complexity. Furthermore, studies focusing on a clear understanding of pharmaceutical consumption, cost and morbidity patterns are needed to implement effective cost control.

One of the widest used tools for cost control in health expenditure is that of risk adjustment, used to make capitation finance systems. These can be found worldwide for both clinical and pharmaceutical management. In health systems where competition between insurance companies exists, such as USA and Germany, capitation attempts to avoid adverse risk selection. In other countries with comprehensive national health systems, such as UK and Sweden, capitation is used for an equitable distribution of resources [2].

Early approaches to adjustment of health expenditure were based on demographic variables alone. However, the introduction of other clinical variables related with population health statuses has improved this adjustment in several countries.

The work of Mossey and Roos [3] was the starting point for different studies that use disease related cost for risk adjustment, using information from insurance companies. The first diagnostic based models for forecasting health expenditure were introduced in the 80s, employing AAPC (Adjusted Average Per Capita Cost) [4] and DCG (Diagnostic Cost Groups) [5].

In the last 20 years, various studies have been carried out on the use of cost indicators based on information available from electronic records previously used by clinical services. The three best known Diagnostic Based Risk Adjustment Systems (DBRAS) are: Diagnostic Cost Groups/Hierarchical Coexisting Conditions (DCG/HCC) developed by Pope et al. [6, 7], Adjusted Clinical Groups (ACG) developed by Starfield et al. [8] and Weiner et al. [9] at Johns Hopkins University in Baltimore and Clinical Risk Groups (CRG) developed by Hughes et al. [10]. All of these are based on the International Classification of Disease, 9th Revision, Clinical Modification (ICD-9-CM), the codes of which are recorded electronically. While the DCG is based on cost, ACG and CRG were developed to measure health statuses.

According to a study by Berlinguet, Preyra and Dean [11], the CRG have greater clinical relevance while offering a predictive power similar to the other two systems (ACG and DCG/HCC).

Another important classification system is the Chronic Disease Score (CDS), developed by Von Korff et al. [12] using pharmaceutical consumption to identify chronic conditions of patients. This uses pharmacy databases to estimate disease prevalence in the absence of diagnostic information. Moreover, these databases have the advantage of generally being complete, precise and reliable, while codification of diagnostics may only register those conditions treated during a clinical visit or hospital stay and, as such, not reflect other important chronic conditions [13].

Various later studies have analysed the validity of the four health state indicators above, perfecting them and adapting them to each specific situation. Thus, different models of pharmaceutical expenditure were obtained from the CDS [1419], DCG/HCC system [20, 21].

Although the ACG [22, 23], and CRG systems were developed to measure health status, their validity in explaining pharmaceutical expenditure has also been demonstrated [2426].

In the United States, Medicare uses the developed DCG/HCC model. In 2006 they implemented the model, CMS (Centres for Medicare and the Medicaid) prescription drug hierarchical condition categories RxHCC [27]. For capitation payments the CMS-HCC model [7], based only on diagnostics, is used. The use of differentiated models for capitation and medicine payments is based on the findings of Zhao et al. [28], who give better predictive power for future prescription drug costs for mixed models that combine diagnostic and drug use data.

Other countries have their own system, such as that in Germany, where a morbidity based risk adjustment was introduced in 2009, embedded in a broader reform of the statutory health insurance system. The new formula covers 80 “severe” or “costly and chronic” diseases structured in a system of hierarchical groups [29].

The CRG system in Spain was first implemented in the Baix Empordà Health Service (Serveis de Salut del BaixEmpordà) [30], and projects are under way in the Autonomous Communities of Catalonia and Madrid. Other Autonomous Communities like the Pais Vasco [31] and some health centres in the Balearics [32] have opted for ACG. Regardless of the system used, all these suppose a significant advance on previous systems based on epidemiological variables [33] or prescriptions [14].

Over recent years, the Valencian Community (VC) (East coast of Spain) has been interested in adapting health expenditure by linking it with population morbidity. Firstly, the General Directorate of Pharmaceutical and Health Products (DGFPS) of the Valencian Health Department designed a standardised amount indicator [34] which offered primary health care pharmaceutical expenditure data per patient covered in a year. This allowed the standardisation of the population based on two categories: patients covered by the health system who have the right to free medication (fundamentally pensioners) and patients covered by the system who must pay part of the cost of the medication from the pharmacy. According to this, in 2011 primary health care pharmaceutical expenditure per standardised patient was 338 Euros, compared with the 275 Euros obtained without standardisation.

This indicator supposes an improvement regarding precision of results and was achieved thanks to the development of appropriate information systems and more specifically that of the primary health care electronic health record [35].

Vivas et al. [36] developed a model using Anatomical Therapeutic Chemical Classification (ATC) from electronic prescriptions drug data to explain pharmaceutical expenditure at a local level.

Since 2010, the DGFPS has been developing a system of patient classification in the VC based on CRGs, which allows stratification of patients according to morbidity. The differences between the use of data from prescriptions and diagnostics that have been observed in the DCG models do not affect the models based on CRG. The predictive power of CRG for future pharmaceutical expenditure has been demonstrated in three prior studies [2426].

The goal of this work is to obtain a concurrent model of primary health care pharmaceutical expenditure for the entire region using CRGs. From this model we obtain weights for primary health care pharmaceutical expenditure per inhabitant and year based on the CRG, which may then be used to draw up budgets for the following year and control health spending in the 24 health districts of VC.

Methods

Data

The data was taken from the Population Information System’s (PIS) database of all patients registered and assigned to one of the 24 health districts in the VC, a region of 4.7 million inhabitants for the period 1st Jan to 31st Dec for 2012 and 2013. Although the initial number of patients was 5.2 million, 400,000 were discarded as non-residents with a stay of less than one month, leaving a total of 4.7 million for analysis. All information was made anonymous according to data protection regulations and our study was approved by the Behavioural Research Ethics Board at the Generalitat Valenciana.

For each patient we obtained the following information. Socio-demographic data: patient anonymisation code, age, sex, health centre, area, health district, PIS state (active/inactive) and pharmacy status (with or without co-payment). Usage data needed for the CRG grouper were: number of contacts in primary health care, number of hospital admissions, and days in hospital per admission (main diagnosis, coded according to ICD-9-CM). CRG data: CRG Base, ACRG1, ACRG2, ACRG3 (Table 1). Pharmaceutical cost data: cost of medicines according to the invoicing nomenclature of the Ministry for Health, Social Policies and Equality. These costs refer to the pharmaceutical expenditure in primary health care centres, which means the total cost is not included for health statuses 8 and 9, given that the majority of these medicines are provided by the hospitals.

Table 1 Population (N), annual pharmaceutical expenditure in Euros and age by CRG core health status and severity level (ACRG3) 2012

Data sources

Data for the study was obtained from the electronic health record for primary health care (SIA) and the Minimum Data Set (MDS) of hospitals. Data for primary health care pharmaceutical expenditure was obtained from the prescription module of the Pharmaceutical Provision Manager, GAIA. The full amount of each prescription expended during the study period was accounted for.

CRG calculation

To obtain the CRG we used 3 M™ Clinical Risk Grouping Software v.1.4. CRGs capture the resource utilization of all inpatient and ambulatory encounters. The groups identify individuals with multiple chronic co-morbid conditions and explicitly specify the severity of illness for each individual. The CRG system maps each diagnosis to one of 1,079 CRG groups that are similar in terms of relative severity, persistence, or recurrence, and health care resource expectations. CRGs, at the discretion of the user, can then be aggregated in order to reduce the number of groups. There are three tiers of aggregation. These are identified as ACRG1, ACRG2, and ACRG3 (Table 1). Each one progressively reduces the number of groups while maintaining, albeit with some adjustment, severity leveling. In the designed models we use 8 dummy variables - one for each core health status - plus 6 dummy variables for severity levels.

Statistical analysis

With the data for 2012, concurrent regression models were made using the total pharmaceutical expenditure by patient and year (C) as dependent variable. As C was not normally distributed, C is ln-transformed as a better approach to its normal distribution [37].

As 420,000 patients (8%) had cost 0, which results in ln -inf, the final dependent variable was considered to be C + 1, resulting in all cost values < 1 having a positive Ln value. Prediction from these models must then be retransformed by subtracting -1, to obtain estimates on the original scale.

Six models were made using the 2012 data, combining the following independent variables in each model:

  1. (i)

    age and sex (1 male and 0 female);

  2. (ii)

    age, sex, and 8 CRG core health statuses;

  3. (iii)

    8 CRG core health statuses alone;

  4. (iv)

    8 CRG core health statuses, only for the paediatric cohort;

  5. (v)

    8 CRG core health statuses excluding paediatrics cohort;

  6. (vi)

    8 CRG core health statuses, 6 severity levels, age and sex, excluding paediatric population.

In all models except (i), the healthy group (Health Status 1) was the control variable necessary in the regression model. Models (i), (ii) and (vi) were designed to observe if there is any difference between age and sex groups.

The models were estimated by means of ordinary least squares (OLS). The goodness of fit was determined through the corrected R2 value and the F Snedecor, mean squared error, and for each of β coefficients t-Student values were obtained to determine the level of significance. The assumptions of normality, homoscedasticity and linearity were considered. Split analysis validation to avoid overfitting was carried out. We used a random sample of 70% of the study subjects for model development and 30% for model validation.

The weight of each health status according to the pharmaceutical expenditure for this status was obtained by retransforming the predicted ln expenditures back into nominal using the smearing estimate provided by Duan (1983) [38]. With this nonparametric retransformation the homoscedastic error is corrected. Thus, the expression for this smearing estimate is:

E C =E e x i β ϵ
= 1 n i = 1 n e x i β + ϵ 1
= e x i β 1 n i = 1 n e ϵ 1 = e x i β γ
(1)

Where:

ϵ= the residual errors.

n = number of observations.

With model iii selected to predict the cost, we established a case mix (CM) system based on the weights of each of the health statuses. Thus, it is possible to calculate the CM for the region and each health district and compare the allocated budget with real expenditure.

The equation for CM j calculation for each health district j is:

C M j = i = 1 9 N i j W i i = 1 9 N i j
(2)

Where,

N ij = Number of population of group i in the health district j.

W i  = Weight of each i core health status group.

The process used for obtaining a predictive budget by health district was the following: Firstly, the weights were calculated for 2012 using the regression model with real data to establish the relative consumption for each CRG. Through the expression (2) the CM is calculated. The Health Authority then established an overall budget for pharmaceutical expenditure for 2013, partly by taking into account the prior year’s expenditure. Applying the weights, we calculated the number of adjusted patients, and divided the overall budget by this number, obtaining the standard price of an adjusted patient. This allows us to give a budget to each doctor or district according to the number of adjusted patients they have.

The model weights are recalibrated annually to introduce possible changes relating to health status, price changes and clinical practice. A cap is established, however, for maximum target budget. That is, a cost per patient adjusted by morbidity is set each year which is used to establish the budget.

Results

Patient stratification and pharmaceutical expenditure

Table 1 shows the number of patients, classified into each of the nine CRG core health statuses, percentage of patients and the average cost for the year 2012. In the graph for this data (Figure 1) it can be clearly seen how each strata of the population is related to the pharmaceutical expenditure. Health Status 6 (chronic disease in 2 or more organ systems) represents 48% of total primary health care pharmaceutical expenditure despite only representing 11% of population.Health status 5 (single dominant or moderate chronic disease) represents 26.4% of total primary health care pharmaceutical expenditure while only representing 15.1% of population. These two statuses together account 74.4% of total expenditure (Figure 1).

Figure 1
figure 1

Stratification of patients by core health status and pharmaceutical expenditure in 2012. Shows a graph for the number of patients, classified into each of the nine CRG core health statuses, and the average cost in this period. In the graph for this data it can be clearly seen how each strata of the population is related to the pharmaceutical expenditure.

Regression models

In Table 2 we show the model coefficients and other statistics. Of the proposed models, model (vi) achieved the best fit, with an R2 of 60.3%.

Table 2 Results of different predictive models for pharmaceutical expenditure per year and patient (C) in Euros 2012

Furthermore, within the Spanish health system it is important to analyse the pharmaceutical expenditure for patients under 14, as these patients are attended by paediatricians in primary health care. Applying the CRG model (iv) to this cohort, we found a very low level of explanation, 15.8%. Therefore a special predictive model must be developed for these patients.

In spite of models (ii), (v) and (vi) being better, we have taken the coefficients from model (iii), the R2 of which is 55% for reasons of operational and practical use and understanding by clinical users.

For the CM system implemented, relative weights were established by retransforming the coefficient for each health state through the smearing estimator and adding the value of 1 as presented in Table 3. The result of the smearing estimator (γ), the mean of the anti-ln of the residuals, was 1.693 (expression 1).

Table 3 Calculation for weights by CRG core health status from model 2012

It should be noted that for groups 8 and 9 we obtained lower weights than given by the original CRG. This is due to patients in these groups principally using hospital dispensaries, while our study drew data from primary health care only. These patients suffer from malignancies and catastrophic diseases such as renal failure or organ transplants.

Analysis by health district

Figure 2 shows the number of patients assigned to each health district, grouped in health statuses. The line represents the CM in the health departments, calculated as the summation of equivalent patients divided by real patients in each health district, expression (2).

Figure 2
figure 2

CRG core health status by health district and case mix 2012. Shows a stacked column chart, comparing the contribution of each value to a total across categories of CRGs core health statuses for each health districts. The x axis of the chart shows the health districts compared and the y axis represent a double scale with the case mix on the right and the n° of patient grouped by CRG core health status on the left.

Figure 3 shows the relation between predicted expenditure according to CM and real pharmaceutical expenditure in primary care for each health district. 13 health districts have spent less than was estimated by the proposed model considering the health status of patients assigned to them, and 11 have incurred higher costs than predicted. This means that with the same equivalent patients, some departments generate higher outpatient pharmaceutical spending than others and some health departments are managing pharmaceutical spending better than others (Table 4).

Figure 3
figure 3

Pharmaceutical expenditure real and predicted by health district 2013. Shows a scatter plot to display values for real (y axis) and predicted (x axis) pharmaceutical expenditure of each heath district.

Table 4 Real and predicted pharmaceutical expenditure and case mix adjusted by health district in 2013

Discussion

The study results show the basis for a pharmaceutical management model designed to improve efficiency in the use of medicines and allocation of budgets. The main innovation is the linking of pharmaceutical expenditure to patient morbidity, a factor not introduced until now in the majority of European health systems.

Discussion points may be centred around three aspects: the reliability of the classification system, the predictive capacity of the developed model, and practical utility.

The reliability of CRGs with respect to correct patient stratification depends on the appropriate inclusion of the diagnostics in the Electronic Health Record (EHR). One of the indicators for evaluating this is the percentage of healthy patients, both users and non-users. The results presented here have greatly improved with respect to stratifications undertaken in the trial period and those presented by other authors. The deficiencies in the initial diagnostics code gave this group as being 60% of the population, whereas with correct coding the value is 52%, representing the real proportion (34% healthy users and 16% non-users). This indicates a substantial improvement in the codification. Other authors, using data from 2008, give this status to 70% of the population [30].

The proposed model uses population stratification into risk groups based on CRGs, but develops its own weights for CRG core health statuses. If we compare the predictive capacity of this model with others described in the bibliography that use other patient classification systems, we see that it reaches, at minimum, the same level of explanation in terms of R2 and t-student statistics. Thus, the models based on patient classification using ATC [36] achieve an R2 of 57%, the models based on ACG [23] 35.4%, and those that use DxRx-PMs [39] 42.6%.

As the goal of our model is to assign predictive budgets with an objective level of expenditure for the health districts and primary health care physicians, it should be noted that the general model does not serve for the population of under-14s attended by paediatricians. As such, this is one of the weaknesses of a CRG based model. This also occurs in the ACG system, as indicated by Aguado et al. [23], who note that in children there is greater variability among physicians and centres not related to case-mix.

For a target consumption based on forecasts by the model, there are two other variables to consider in pharmaceutical expenditure in budgetary adjustments: the cost of the medicines and the goals of the adjustment. Over recent years we have effectively seen drops of around 7% in the price of medicines, driven by the increase in the consumption of generics and the reduction in prices from the Ministry for Health, Social Policies and Equality. During the last three years there has been a decrease every month in the prices set by the Spanish Agency for Medicines (Agencia Española de Medicamentos).

The other factor is the adjustment of pharmaceutical expenditure made by the health authorities via the protocols of rational use that prevent multiple medication of a patient and the use of medicines not based on evidence, especially in chronic pathologies such as hypercholesterolemia or osteoporosis. As such, in establishing expenditure goals by district and doctor from the specific experience from the VC for 2013, the forecast expenditure of the model decreases by 15%.

We observed the need for advanced and efficient IT development as a condition for the introduction of this system. Firstly, for the stratification of patients using the CRG system and then a further programme to communicate the classifications and predictions to the health districts and health workers. The IT system developed allows us to know the chronic diseases and co-morbidity of the patients included in each health status. This is of the greatest use when managing patients in programmes for the most prevalent chronic diseases. The system is linked to the EHR, making it possible to know all the diagnoses, treatments, hospital stays, etc.

Conclusions

The developed model based on CRGs can be of great use in managing the pharmaceutical spending in integrated health services such as the National Health Service (NHS). In future development hospital pharmaceutical expenditure must be included to better explain the weights of statuses 8 and 9.

The predictive power of the developed model is similar to other models based on diagnostics, validating its use in managing primary health care pharmaceutical expenditure.

The general case-mix model is not applicable for establishing expenditure goals for paediatrics, as the CRG classification system is not valid for isolated patients of under 14 years of age.

The predictive models must consider adjustments which include variations in the price of medicines and rationalisation measures in pharmaceutical expenditure, so as to produce incentive-based targets.

Authors’ information

David Vivas-Consuelo (DVC) and Natividad Guadalajara-Olmeda (NGO) are PhD Professors in the Research Centre for Health Economics and Management at Universitat Politècnica de València (Spain) and Carla Sancho-Mestre is a PhD candidate of the same university. José Luis Trillo-Mata (JLTM) and Ruth Usó-Talamantes (RUT) are Director and Deputy Director of the General Direction of Pharmacy and Pharmaceutical Products in the Valencian Health Departament (Spain). Laia Buigues-Pastor (LBP) works as an economist at the Pharmacoeconomics office in the same institution.