Subjects and Data
Data from two previous large thorough QT studies were available for this modeling study [5]. Together, the studies investigated 523 healthy subjects (254 females), with a mean age of 33.5 years and an interquartile range (IQR) of age of 26.6–40.1 years. Both source thorough QT studies were appropriately approved by regulatory and relevant ethics bodies, but since we used only their drug-free QT/RR data, their other details are not relevant. No subject participated in both source studies.
In each subject, multiple (average n = 1263, IQR 1060–1437) baseline drug-free daytime QT interval measurements were available together with corresponding QT/RR hysteresis corrected RR intervals [6] representing the underlying heart rate at which the QT interval was measured. These drug-free QT/RR measurements covered wide ranges of heart rate in each subject. The average minimum heart rate in these QT/RR measurements was 51.9 beats per minute (bpm) (IQR 47.8–56.1); the average maximum heart rate was 112.5 bpm (IQR 102.9–122.9).
The QT/RR hysteresis correction was based on individual models of exponential decay of the influence of RR intervals preceding the QT interval measurements. For each QT interval measurement, the corresponding RR interval was obtained as a weighted average of RR intervals within 5 min preceding the QT measurement.
The dense baseline QT/RR data distribution allowed us to describe the drug-free QT/RR relationship in each subject using the previously published curvilinear regression formula [7]:
$${\text{QT}} = {\text{QTcI}} + (\delta /\gamma )({\text{RR}}^{\gamma } - 1),$$
which corresponds to the correction formula:
$${\text{QTcI}} = {\text{QT}} + (\delta /\gamma )(1 - {\text{RR}}^{\gamma } ),$$
where QTcI is the individually corrected QT interval and individually optimized parameters δ and γ represent the slope and the curvature of the QT/RR relationship, respectively (QT and RR interval measurements are expressed in seconds).
In each subject, we used ten baseline datapoints distributed through the daytime hours. This modeled drug-free points during a QT investigation. The subjects were in supine resting positions for at least 5 min before as well as during these datapoints. The electrocardiogram (ECG) measurements of each of the selected datapoints consisted of five replicated QT and RR measurements (the RR values were again QT/RR hysteresis corrected). That is, the selected datapoints provided 50 QT and RR measurements per subject. The intra-subject spread of heart rates measured during these baseline datapoints corresponded to the usual data distribution seen in supine datapoints of clinical investigations of QT interval changes [6]. The average minimum and maximum heart rates of the datapoints were 55.5 bpm (IQR 50.1–60.0) and 75.8 bpm (IQR 69.3–81.8), respectively.
Statistical Modeling Experiments
To model the situation in which restricted baseline data are available for the design of subject-specific and population-specific heart rate corrections, we considered two types of modeling experiments in which the corrections were derived from the selected baseline datapoints, i.e., in which 50 pairs of QT and RR measurements were available in each subject.
The experiments of the first type (Type 1) modeled the situations of standard QT studies with relatively uniform heart rate and QT interval changes over all datapoints. This approximated the investigations after multiple drug doses when the drug plasma concentrations (and, correspondingly, drug effects on heart rate and on the QT interval) do not change during the final dosing day. For each such experiment, we considered 50 participants randomly selected from the 523 healthy subjects whose data were available. This size reasonably modeled the usual thorough QT investigations [1].
The experiments of the second type (Type 2) modeled situations in which only one drug dose is given to a relatively small number of individuals. In these experiments, we took the ten datapoints corresponding to 0, 1, 2, 3, 4, 5, 6, 8, 10, and 12 h after dosing and assumed that the averaged plasma concentrations (and, correspondingly, the heart rate and QT interval effects) followed the modeled concentration profile shown in Fig. 1a. For each of these experiments, we randomly selected ten participants, which approximately corresponded to the size of individual dose investigations in early clinical studies [3].
This means that in the experiments of the first type, the differences between the baseline and on-treatment QT/RR values were similar (subject to the modeled inaccuracy, as described further in Sect. 2.3), while in the experiments of the second type, the differences between the baseline and on-treatment data followed the modeled single-dose plasma concentration (i.e., ranged between no change at time 0 and maximum change at the point of maximum plasma concentration—again, subject to modeled inaccuracy). The experiments also differed in the number of subjects.
QT/RR Changes
In all experiments, we modeled situations in which each subject also provided ten ‘on-treatment’ datapoints corresponding one-to-one to the selected baseline datapoints. Further, in each subject, the known values of averaged QTcI and of the coefficients δ and γ allowed us to estimate the true QT interval duration at any given heart rate. In individual experiments, we were therefore able to simulate situations when the on-treatment datapoints differed from the baseline datapoints by a prescribed amount. More specifically, for a selected datapoint measurement replicate with ECG baseline measurements of \({\text{RR}}_{b}\) and \({\text{QT}}_{b}\), we considered an on-treatment datapoint replicate with ECG measurements as follows:
$${\text{RR}}_{t} = 60/(60/{\text{RR}}_{b} + {\mathbb{C}}\varPhi_{\text{HR}} + \varepsilon_{\text{HR}} ),\,{\text{and}}$$
$${\text{QT}}_{t} = {\text{QT}}_{b} + (\delta /\gamma )({\text{RR}}_{t}^{\gamma } - {\text{RR}}_{b}^{\gamma } ) + {\mathbb{C}}\varPhi_{\text{QTc}} + \varepsilon_{\text{QTc}} ,$$
where \(\varPhi_{\text{HR}}\) and \(\varPhi_{\text{QTc}}\) were parameters of the experiment (modeling the drug-induced changes in the heart rate and of the QTc interval at 100% of maximum drug concentration), \({\mathbb{C}}\) was a drug concentration at the time of the given timepoint (expressed as a proportion to the maximum concentration), and \(\varepsilon_{\text{HR}}\) and \(\varepsilon_{\text{QTc}}\) were random inaccuracy coefficients. Since the modeling experiments need to operate on the uncorrected QT and RR values, the second formula was used for uncorrected \({\text{QT}}_{t}\) intervals. It was derived from the modeling assumption that:
$${\text{QTcI}}_{t} - {\text{QTcI}}_{b} = {\mathbb{C}}\varPhi_{\text{QTc}} + \varepsilon_{\text{QTc}} ,$$
from which the formula can be derived using the optimized correction:
$${\text{QTcI}} = {\text{QT}} + (\delta /\gamma )(1 - {\text{RR}}^{\gamma } ).$$
The principle of modeled QT and heart rate changes is shown schematically in Fig. 1b.
In the experiments of the first type, we assumed that all on-treatment measurements were influenced by the same maximum plasma concentration, whereas in the experiments of the second type we used the modeled plasma profile (Fig. 1a) and randomly assigned 10% variation of the plasma profile between individual subjects. The inaccuracy coefficients \(\varepsilon_{\text{HR}}\) and \(\varepsilon_{\text{QTc}}\) were introduced to model not only the measurement inaccuracies but also approximate the inter- and intra-subject differences and variability in the response as well as the differences between baseline and placebo (with no changes of heart rate or QT interval). Nevertheless, for the simplicity of interpretation of the results of the experiments, the \(\varepsilon_{\text{HR}}\) and \(\varepsilon_{\text{QTc}}\) coefficients were always obtained from uniformly distributed random numbers within ± 2 bpm and ± 5 ms, respectively.
Heart Rate Corrections
In each modeling experiment, both subject-specific and population-specific heart rate corrections were considered. The subject-specific corrections considered the selected baseline datapoints of each subject separately; the population-specific corrections pooled the baseline datapoints of the subjects randomly selected for the experiment (i.e., of either 50 or ten subjects).
To reflect frequent practice, linear and log-linear correction formulas were derived. The linear correction formulas had the form of QTc = QT + α(1 − RR) and were derived from linear regressions QT = α0 + αRR; the log-linear correction formulas had the form of QTc = QT/RRβ and were derived from linear regressions log(QT) = β0 + βlog(RR).
This means that in each modeling experiment, once the group of subjects was randomly selected and the coefficients α and β were obtained for each subject separately and subsequently, another pair of coefficients α and β was obtained for the group of the experiment pooled together.
To assess the impact of using limited versus full baseline QT/RR data, individualized corrections in the form \({\text{QTcI}} = {\text{QT}} + (\delta /\gamma )(1 - {\text{RR}}^{\gamma } )\) with the parameters optimized for each subject were also derived in the experiments.
Organization of Experiments and Statistics
Experiments of both types were conducted for different combinations of programmed heart rate and QT interval changes. The parameters \(\varPhi_{\text{HR}}\) and \(\varPhi_{\text{QTc}}\) of heart rate and QTc interval changes were ranged systematically between − 10 and + 25 bpm in 0.1 bpm steps and between − 25 and + 25 ms in 0.1 ms steps, respectively. For each combination of the \(\varPhi_{\text{HR}}\) and \(\varPhi_{\text{QTc}}\) parameters, both types of experiment (i.e., the first type assuming the same heart rate and QTc interval changes at all selected timepoints, and the second type assuming changes according to the development of plasma concentrations) were repeated 50,000 times with different selections of modeled study populations. A Mersenne Twister random number generator was used [8].
In each individual experiment, the coefficient of subject-specific and population-specific heart rate corrections was optimized and applied to estimate the QTc interval changes between the baseline timepoints and corresponding on-treatment timepoints. In the experiments of the first type, the upper single-sided 95% confidence interval (CI) of the QTc interval changes was calculated for each of the timepoints and the resulting estimate of QTc interval changes was the maximum value of these upper CIs over all timepoints. This corresponded to the standard evaluation of thorough QT studies and resulted in the modeled estimates of the upper CI of \(\Delta \Delta {\text{QTc}}\) values (i.e., QTc changes corrected for both baseline and placebo) expected for thorough QT studies [1]. In the experiments of the second type, the on-treatment timepoint of the maximum plasma concentration was used in each subject and in these timepoints, the upper single-sided 95% CI of the QTc interval changes was calculated. In both types of experiments, the upper CIs were calculated from values in individual subjects assuming normal distribution.
The repetition of modeling experiments led to 50,000 results (for experiments of both the first and second type) for each combination of programmed \(\varPhi_{\text{HR}}\) and \(\varPhi_{\text{QTc}}\) changes. Of these, the median value and the 5th, 10th, 20th, 80th, 90th, and 95th percentiles were obtained.
The principal outcomes of the study were the differences between the QTc interval changes reported by the investigated correction formulas and the initially programmed \(\varPhi_{\text{QTc}}\) values. In ideal situations, the modeled \(\Delta \Delta {\text{QTc}}\)values should be the same as the initially programmed \(\varPhi_{\text{QTc}}\) parameters of the experiments. The differences therefore showed how well or poorly the set-up of the investigated corrections represented the populations of the experiments. The relationships between these \(\Delta \Delta {\text{QTc}} - \varPhi_{\text{QTc}}\)inaccuracies and the programmed \(\varPhi_{\text{HR}}\) heart rate changes were displayed graphically.
Supplementary Analyses
To understand the reasons for inaccuracies in heart rate corrections, the linear and log-linear QT/RR slopes (i.e., the coefficients α and β as described previously) were compared when derived from full QT/RR profiles and from QT/RR data restricted to the selected baseline timepoints. The coefficients were compared using paired two-sided t tests. Where appropriate, data are presented as mean ± standard deviation.
Subsequently, in 100,000 randomly selected groups of ten and 50 subjects, the QT/RR data restricted to the selected baseline timepoints were used and population-specific linear and log-linear QT/RR slopes were compared with the averages of the subject-specific linear and log-linear slopes. That is, for each of these randomly selected groups, coefficients α and β were obtained for each subject as well as for the pool of baseline QT/RR data of all subjects together. The average of the individual coefficients was compared with the population coefficients.