As mentioned above, the prospective gathering of drug use facilitates unbiased risk estimates provided exposure is precisely defined by reference to a well-defined event with a clearly recognizable onset. Thanks to prospectively gathered and complete medication histories, exposure status can be assessed on every day of the follow-up. This is a great advantage over population-based studies where drug use is assessed on the basis of interview during repeated rounds of cross-sectional measuring. Although the analyses in this paper pertain to cohort studies, this includes nested case–control studies where the prospective exposure data come from the cohort but there are efficiency reasons to perform a case–control analysis. This may occur, for instance, when tissue samples have to be taken or when additional data gathering from medical records makes it unfeasible to perform this in the whole study cohort.
Unlike constant features such as sex and certain genetic traits, drug use is essentially a time-varying determinant. In a traditional Cox regression model [15], the hazard function in the total population at time point t is defined as:
$$ \lambda \left( t \right) = \lambda_{0} \left( t \right) * \exp \left( {\beta x} \right) $$
(1)
in which λ(t) represents the event rate at time t conditional on being still event free before time t. In this model the event rate is assumed to be equal to a baseline risk λ0(t), which is the same for everybody in the population, i.e., independent of the determinants. This baseline risk is multiplied by a term exp(βx), dependent on the determinants x, which are different between individuals. The parameters β quantify the effect of the determinants on the event rate. They have to be estimated from the data, together with the baseline risk λ0(t). There are different choices possible for the time scale t, for instance, t = age (when age is strongly and exponentially associated with event occurrence), t = time since entry in the cohort, or t = calendar time. In the simplest case, x represents only one determinant x
1, for instance sex, with x
1 = 1 (males) and x
1 = 0 (females). Then λ(t) gives the hazard function for developing the event at time point t in males or females. For females, the hazard is λ0(t) and for males the hazard is λ0(t) multiplied by exp(β1). In this model, the determinants x are not necessarily constant during follow-up, but may vary in time, such as drug use. In a study sample, the unknown hazards are then estimated from the data as:
$$ h\left( t \right) = h_{0} \left( t \right) * \exp \left( {b_{1} x_{1} } \right) $$
(2)
Suppose that we performed a study in which m individuals developed the event of interest during follow-up. The follow-up times at which the events (the “cases”) occurred are denoted with t
1,…, t
m
. (for simplicity, we assume that events do not coincide). In a Cox proportional hazards regression analysis with drug exposure as a time-varying determinant [16], the exposure status x
1[t
j] on the index day t
j
of the case number j, is compared to the exposure status of all other cohort members on the same day of the follow-up. In this way, j = 1,…, m strata are formed of one case each and the other cohort members who were still in the follow-up and event free at time t
j
as controls. In an earlier analysis in The Rotterdam Study [17], for instance, it was investigated whether thiazide diuretics protect against hip fracture, thanks to their calcium-retaining effect [18]. An analytical matrix would look like the ones given in Table 1a and b. On the index date t
j
, all cohort members have a history of thiazide use up to that time. In its simplest form, we can characterize this history as use on the index date as 1 (‘yes’) or 0 (‘no’) like in Table 1a. If i denotes the number of an arbitrary cohort member that is under follow-up at the index date t
j
, the model states for this individual i
$$ h\left( {t_{i} } \right) = h_{0} \left( {t_{j} } \right) * \exp \left( {b_{1} x_{1i} \left[ {t_{j} } \right]} \right) $$
(3)
in which the time-varying determinant x
1i
[t
j
] has the numerical value ‘1’ (exposed) or ‘0’ (unexposed) depending on whether cohort member i is exposed or non-exposed at time point t
j
. For each event time t
j
, there is a set R
j
(the “risk set”) containing all individuals who were under observation at t
j
. So, R
j
contains case number j and its corresponding controls. Given the event at t
j
, the conditional probability that out of all cohort members in R
j
the cohort member with number j (the one who was observed to develop the event) will develop the event is:
$$ h_{0} \left( {t_{j} } \right) * \exp \left( {b_{1} x_{1j} \left[ {t_{j} } \right]} \right)/\sum\limits_{{i\,{\text{from}}\,R_{j} }} {\left\{ {h_{0} \left( {t_{j} } \right) * \exp \left( {b_{1} x_{1i} \left[ {t_{j} } \right]} \right)} \right\}} \, $$
(4)
Table 1 Apart from unique patient number, sex and age in years, the columns respectively represent: case status (1 = ‘yes’; 0 = ‘no’); stratum; follow-up in days; cumulative number of days of current use; number of days since last intake in past users; defined daily dose (DDD) [for hydrochlorothiazide: 25 mg and for chlorothiazide: 500 mg]; and total numbers of days of use since study entry
Notice that the baseline hazard rate h
0(t
j
) is present in numerator and denominator and cancels out. Therefore, the conditional likelihood function of all the data, defined as the product of the probabilities as given in (4) over all event times t
j
, is equal to:
$$ L\left( \beta \right) = \mathop \Uppi \limits_{j = 1}^{m} \{ \exp \left( {b_{1} x_{1j} \left[ {t_{j} } \right]} \right)/\sum\limits_{{ \, i\,{\text{from}}\,R_{j} }} {\exp \left( {b_{1} x_{1i} \left[ {t_{j} } \right]} \right)} \} $$
However, this straightforward but simple analysis with only the status exposed/unexposed would mean that we use only a very limited part of the information that is contained in the thiazide use history and thereby introduce non-differential misclassification. As can be seen in Table 1a, knowing the numbers of days of continuous use on the index date of each cohort member facilitates calculation of more valid risk estimates as it is unlikely that only 1 day of thiazide use would already be protective while the model would consider even those people as exposed who started thiazides 1 day before the index date. There are pharmacological reasons to assume that a protective effect on hip fracture may become visible only after at least 6 weeks of calcium retention by thiazide treatment and reaches a maximum after ~1 year. Hence, more information, and consequently less non-differential misclassification, is obtained with the introduction of extra determinants x
2, and x
3, where the cumulative continuous exposure to thiazides at the index date is categorized as: x
1 = 1 through 42 days; x
2 = 42 through 365 days; x
3 > 365 days.
$$ h\left( t \right) = h_{0} \left( t \right) * \exp \left( {b_{1} x_{1} \left[ t \right] + b_{2} x_{2} \left[ t \right] + b_{3} x_{3} \left[ t \right]} \right) $$
(5)
In this way, the risk for these two exposure categories is expressed in comparison to non-use and yields a more valid representation of the drug-event association than in (3) or when thiazide exposure in days would be introduced as a continuous exposure determinant.
Even more information, and therefore less non-differential misclassification, may follow from the introduction of determinants for ‘past use’ when one expects that a carry-over period should be taken into account. For instance, if thiazide use for more than 1 year results in a higher calcification of the hip, it will take a certain time period before discontinuation of thiazides results in returning to the situation before starting treatment. This can be done by introducing determinants for past exposure, defined as the number of days since last intake of thiazides (Table 1b). In the previously mentioned study, additional categorical determinants were created as extra determinants x
4, x
5, and x
6 where the number of days since last intake of thiazides, counting back from the index date, was categorized as: x
4 = 1 through 60 days; x
5 = 61 through 365 days; and x
6 > 365 days.
For obvious reasons, the determinants x
1 through x
6 should be introduced in one model. After all, it is important that the complete follow-up time of each study member is expressed in mutually exclusive episodes of non-use, past use, and current use to decrease the degree of non-differential misclassification as much as possible. Then, the full model is:
$$ h\left( t \right) = h_{0} \left( t \right) * \exp (b_{1} x_{1} \left[ t \right] + b_{2} x_{2} \left[ t \right] + b_{3} x_{3} \left[ t \right] + b_{4} x_{4} \left[ t \right] + b_{5} x_{5} \left[ t \right] + b_{6} x_{6} \left[ t \right]) $$
(6)
This model can be extended with the inclusion of other non-time-varying determinants such as gender and baseline age x
a
, x
b
, …, x
j
, …, x
z
, provided usual precautions against overfitting of the model are taken into account. Adjusting for dosage may be performed by including it as a continuous determinant in mg/day or categorized, for instance by splitting current use as: current use with >1 defined daily dose (DDD); current use with ≤1 DDD.
The analytical matrix in Table 1c facilitates different type of analyses. For instance, would we be interested to find out whether cumulative use of nonsteroidal antiinflammatory drugs (NSAID) are associated with an increased risk of cancer, we might prefer to use the determinant ‘total use’. However, if we would be interested in induction, rather than promotion, we might subtract a theoretical episode of 5 years from the index date of cancer diagnosis and calculate total use in days until that date, or in dose as cumulative DDDs. We would do this to avoid non-differential misclassification by restricting ourselves to the induction period. Would we only be interested in promotion, we would treat NSAID as an effect modifier and restrict our analysis to total use in the 5 years before cancer diagnosis because we would expect that malignant cells would already be present during that latent period.