Study design and cohort
This retrospective cohort study used data extracted from the Clinical Practice Research Datalink (CPRD) database [17]. CPRD is a longitudinal database containing anonymised medical records on approximately 12.5 million acceptable patients (December 2012 build) registered across 661 general practices located throughout the UK (April 2013 build). Included in the study were asthma patients aged between 12 and 65 years whose records in CPRD fell within the study period 1997–2010 and who were consented for administrative linkage to their hospital episode statistics (HES) secondary care inpatient records, and who were without chronic obstructive pulmonary disease (COPD). Patients were followed from their respective index date (when the patient met the inclusion criteria for entry into the sample frame) up until either: (a) the end date of the study, or (b) when they reached their 65th birthday, or (c) were diagnosed with COPD, or (d) died or were transferred out of their GP practice. Approval for the study was granted by the Independent Scientific Advisory Committee of the Medicines and Healthcare products Regulatory Agency (protocol number 13_036R).
Data management
Included patients’ ICS prescriptions were collected and the prescribing date and duration (number of days prescribed) were then used to calculate PPR. Missing values for prescribing duration were imputed by calculating the number of doses prescribed (quantity of packs multiplied by its number of doses) divided by the recorded daily prescribed dose. Errors (such as duplications, swaps, or missing values) in the number of doses in the pack (pack type) were checked and any outlying values were corrected based on pack information taken from the British National Formulary [18]. Missing values for the daily prescribed dose were imputed using the patient’s prior prescription records or, if unavailable, by substitution of the sample median of the daily prescribed dose by dosage form.
Asthma exacerbation and severity of asthma
Asthma exacerbations were distinguished by whether they were hospital-recorded (the primary diagnosis ICD-10 coding in HES episode data was J45 or lower) or were managed in primary care (identification of oral prednisolone use to treat exacerbation in the CPRD therapy file). In addition, keywords (“asthma” and “exacerbation”, “emergency prednisolone”, “admit to hospital”, etc) were matched with relevant Read codes to identify occurrences of them in the CPRD clinical file, which were then classified as exacerbation treatment within primary or secondary care.
To identify oral prednisolone prescribing to treat an exacerbation, criteria considering the duration (less than 10 days per prescription, less than 90 days per year) and quantity/dose (qty of less than or equal to 20 and strength is 25 mg, or qty of less than or equal to 112 and strength is 5 mg) were used. Any patient-years with a prednisolone prescription which failed to meet these criteria were not considered to be indicative of an exacerbation. Instead, these patient-years were classified as being treated within step 5 of the British Thoracic Society and Scottish Intercollegiate Guidelines Network (BTS/SIGN) guidelines [11]. These guidelines are such that a patient’s asthma severity is increasing in treatment step (ranging from steps 1 to 5) where additional medicines or higher doses are required to achieve control. A patient presenting at step 1 has their asthma controlled with a short-acting β2-agonist (SABA) alone, whereas a step 5 patient requires routine daily prednisolone treatment in order to control their asthma.
Measuring adherence
In this study, a patient’s adherence to ICS prescriptions was measured using PPR. This was calculated by dividing ‘number of days prescribed during calendar year’ by ‘number of days in the interval’ and thus was the proportion of days in the year where medicine was prescribed [8]. Expressed as a percentage, it was constructed as follows:
$$ PPR = 100 \times \frac{{{\text{Number}}\,\,{\text{of}}\,\,{\text{days}}\,\,{\text{prescribed}}\,{\text{during}}\,\,{\text{calendar}}\,\,{\text{year}}}}{{{\text{Number}}\,\,{\text{of}}\,\,{\text{days}}\,\,{\text{in}}\,\,{\text{the}}\,\,{\text{interval}}}} $$
Several approaches were considered when evaluating the numerator, distinguished by whether or not to include or exclude the overlap in prescription days, and whether to pass excess prescription days over to the next interval or to share these proportionally between intervals (see Fig. 1). The denominator was set to 365 days for an annual interval, but was adjusted at the beginning or end for when a patient entered or left the follow-up, or for missing data in number of doses.
By combining these approaches in differing ways, four possible strategies were defined for calculating PPR (see top half of Table 1), of which the first—strategy 1 (including overlapping days, passing excess days to the next interval, and adjustments to the beginning and end intervals)—represents the base case. A fifth strategy imposed a censoring rule on the base case, namely, any computed value of PPR that exceeded 100 % is reset to 100 %. PPR was calculated for each patient annually using each strategy (1–5) and the results presented descriptively.
Table 1 Descriptive statistics
Modelling adherence
Prescription possession ratio serves as a proxy variable for adherence (we denote adherence by A and note that it is unobservable) and so it is subject to error when using it as a measure of adherence. Let the relationship between these variables be:
where the unobservable error term U is assumed to have zero mean. We constructed and estimated inferential models designed to explain adherence A in terms of its dynamic behaviour through time as well as to demonstrate its relationship to clinical outcomes and other factors. In particular, we use a panel data model to match the study’s data structure (unbalanced panel data) and consider fixed effects representations, for example:
$$ PPR_{it} \,= \,\alpha PPR_{i,t - 1} + \gamma Y_{it} + \beta_{0} + \beta_{1} X_{1it} + \cdots + \beta_{k} X_{kit} + \lambda_{i} + U_{it} $$
where indexes \( i = 1, \ldots ,N \) patients and \( t = 1, \ldots ,T \) time periods. The unknown coefficients to be estimated include: \( \alpha \) the coefficient of the lagged dependent variable \( PPR_{ - 1} \) designed to capture any dynamic relationships adherence may have with itself over time; \( \gamma \) the coefficient of Y that represents clinical outcome and which arguably has a feedback causal effect with adherence; \( (\beta_{0} ,\beta_{1} , \ldots ,\beta_{k} ) \) as the set of \( k + 1 \) coefficients on a set of independent regressors \( (1,X_{1} ,X_{2} , \ldots ,X_{k} ) \) where 1 denotes the intercept and \( (X_{1} ,X_{2} , \ldots ,X_{k} ) \) are patient-measured attributes (see Table 2). The patient-specific, arbitrarily distributed individual fixed effect is represented by \( \lambda \). The error term U can be heteroscedastic and autocorrelated. This type of panel data model has been extensively studied and applied in numerous studies [19–21]. System generalised method of moments (system GMM) is an appropriate estimator for fixed effects models with a mix of lagged dependent variables, endogenous and predetermined regressors as well as strictly exogenous regressors. We implemented this estimator using Roodman’s xtabond2 algorithm [22] which is an add-on to the STATA software.