FormalPara Key Points

Inconsistent doses of hydroxychloroquine are included in national guidelines and clinical study protocols for the management of COVID-19 disease

Modeling and simulation approaches have recently been proposed for dose selection, but (external) clinical validation was either lacking or carrying important limitations and unverified assumptions

We propose a population pharmacokinetic model developed using prospectively collected hydroxychloroquine concentrations in COVID-19 patients. This model constitutes a basis for dose optimization

Simulation results showed that previously developed models in rheumatoid arthritis and lupus erythematosus over-predicted COVID-19 patient concentrations

1 Introduction

Hydroxychloroquine (HCQ) is a 4-aminoquinolein drug approved and used for decades for the treatment of malaria [1], rheumatoid arthritis (RA) [2], and cutaneous lupus erythematosus (CLE) [3]. During the first peak of the disease, HCQ was extensively used off-label for the management of coronavirus pandemic, as one of the drugs previously approved in other indications and being repurposed to treat COVID-19 patients. The first evidence for potential efficacy originated from in vitro experiments in kidney cells (VeroE6) that suggested that HCQ might be effective against COVID-19 [4,5,6]; however, the in vivo and clinical relevance of these experiments were challenged in recent publications on macaques [7]. Based on the results of in vitro studies showing antiviral activity of HCQ on SARS-CoV-2, and based on some preliminary results from small clinical studies [8, 9], interim guidelines in several countries recommended treating hospitalized patients suffering from moderate to severe COVID-19 with HCQ at various dosing regimens [10]. Studies completed thus far have used different dosing regimens, and show variable and inconsistent results regarding the efficacy of HCQ in COVID-19 based on clinically relevant endpoints, such as mortality: some studies showed positive efficacy results in COVID-19 patients, whereas several others failed to demonstrate any added-value for the use of HCQ as compared to standard of care or no treatment. The overall evidence on the drug efficacy remains controversial, with conflicting results reported across trials [8, 9, 11,12,13,14,15,16,17], and a lack of robust data.

The safety profile of HCQ is considered relatively good at the approved dose in inflammatory diseases. However, some recent data have also suggested that the safety of HCQ in COVID-19 patients might be different from the inflammatory diseases [18] There is currently a need for the characterization of HCQ safety, and in particular cardiac safety, in COVID-19 patients. [18].

The HCQ dosing regimen still needs to be optimized in order to improve the drug benefit/risk balance, which is known to be highly dependent on the dose used.

The pharmacokinetics of HCQ have been described in inflammatory indications but not so far in COVID-19-disease. Therefore, reference pharmacokinetic exposures in COVID-19 patients are unknown and so are clinical target exposures for disease cure. The doses implemented are inconsistent across publications, national guidelines, and ongoing clinical use. There is an urgent need for HCQ pharmacokinetic characterization in COVID-19, as acknowledged in recent publications. [18,19,20].

Population pharmacokinetic (POPPK) modeling and simulation can be considered as the most appropriate approach to obtain relevant pharmacokinetic information during the pandemic crisis, given the ability to perform acceptable analysis from sparse or random blood samples, overcoming the need for large sampling (needed with traditional pharmacokinetics), that would be very complicated to implement in the context of an outbreak.

A population approach was used to describe the pharmacokinetics of HCQ in inflammatory diseases. Two population pharmacokinetic models have been published for HCQ in patients with rheumatoid arthritis and lupus erythematosus using whole-blood HCQ concentrations. [2, 3] Four additional models describe plasma concentrations or merged blood and plasma concentrations [1, 3, 21, 22] in healthy volunteers and patients. These models have been used to simulate COVID-19 patient exposures at different doses in a recent publication, under the unverified assumption that the pharmacokinetics of HCQ would be similar in other diseases and in COVID-19 (see, e.g., [18, 23]).

The aim of this paper is to describe HCQ pharmacokinetics in COVID-19 patients and to characterize reference pharmacokinetic exposures in this population. A model-based approach is used for this purpose. Simulations of different scenarios are performed to illustrate how the model can be used for dose optimization.

2 Patients and Methods

2.1 Patients

The pharmacokinetic data (blood concentrations) originated from 33 patients enrolled in an open-label, single-arm study evaluating the pharmacokinetics, safety, and efficacy of HCQ in COVID-19 patients in Saint-Luc University Hospital, a tertiary hospital in Brussels (Belgium). HCQ therapy was administered as per the Belgian national protocol guidance: 400 mg twice daily (BID) on day 1, followed by 200 mg BID the subsequent days for a 5-day total duration of treatment. Patients received Plaquenil® tablets, each tablet having 200 mg of racemic HCQ sulfate, equivalent to 155 mg of racemic HCQ base. This study was conducted in order to characterize the pharmacokinetics of HCQ that can serve to optimize the dosing regimen. This study was planned to enrol 50 patients with COVID-19 infection. Pharmacokinetic assessment of HCQ required at least 1 scheduled blood sampling and 1 optional blood sampling via venipuncture or venous catheter. The first sample was an opportunistic sample drawn within the first 4 h after dose administration, after an accurate record of the drawing time. This sample was supplemented by a blood sample at the end of treatment or at the time of treatment interruption for safety reasons.

The protocol (Eudract 2020-001434-35) was approved by the local ethical committee, and a written informed consent was obtained from each patient taking part to the study, in compliance with the International Council for Harmonisation Good Clinical Practice Guideline.

The pharmacokinetic concentrations from the 33 patients enrolled in the study were supplemented by concentrations collected from 23 patients as part of standard of care at Saint-Luc Hospital, and at Saint-Pierre Hospital, another tertiary hospital in Brussels (Belgium).

2.2 Analytical Method

Determination of the whole-blood HCQ concentrations at the different time points was obtained in the laboratory of Analytical Chemistry of Saint-Luc University Hospital, Department of Clinical Chemistry, by a validated liquid chromatography–tandem mass spectrometric method, on a Waters Xevo TQ-S micro, in ESI+ mode (Waters, Milford, MA, USA). Briefly, the red blood cells were hemolyzed by methanol containing HCQ-d4, used as an internal standard. The resulting sample underwent a liquid–liquid extraction using dichloromethane/ether/hexane (30/50/20) + 0.5% methylbutan-1-ol, at a basic pH. The next step was the evaporation of the organic phase followed by the reconstitution of the dry residue in the mobile phase. The reconstituted sample was injected into a BEHC18 UPLC column, 2.1 × 100, 1.7 µm f(Waters). The mobile phase consisted of a buffer, ammonium formate 5 mM, at pH3 with acetonitrile + 0.1% formic acid. The method was linear over the concentration range of 100–3000 ng/mL. Between-day precision measured using 3 internal quality control samples were all within 10%. Accuracy results were also very satisfactory with a bias of < 0.3% obtained with external quality control sample.

2.3 POPPK Model Development and Validation

Data were analyzed using nonlinear mixed effects modeling in NONMEM software, v.7.3 (Icon Development Solutions, Ellicott City, MD, USA). Model parameters were estimated based on data collected during the study (33 patients) supplemented by additional data from routine clinical practice (15 patients). Base and final model selection were based on the results of likelihood ratio tests. The model appropriateness was further assessed using numerical and graphical tools, as described below.

Based on previously published population pharmacokinetic models in malaria, RA, and systemic lupus erythematosus (SLE) and, on exploratory analysis, one-compartment and two-compartment structural models with first-order absorption and elimination were tested. The need for an absorption lag time parameter was tested.

Parameter estimation was performed in NONMEM using First-Order Conditional Estimation with the INTERACTION option.

Exponential models were used to describe the inter-individual variability (IIV) for the structural model parameters. Regardless of model form, IIV parameters was introduced and retained provided that estimates were neither very small nor caused instability in the model-fitting procedure.

Proportional and combined additive and proportional models were investigated to describe the residual variability. The residual variability model was developed until a smoothing spline of the absolute individual weighted residuals versus individual predictions was sufficient without trends.

Comparison between various potential models was based on a likelihood ratio test using the difference in the NONMEM objective function value (∆OFV) for two hierarchical competing models and where the number of degrees of freedom is equal to the difference in parameter numbers between the two models.

The covariate model development for HCQ was performed using the dataset. The covariates that were to be tested were: age, body weight, and sex.

The covariate model building was performed using stepwise covariate modeling: a forward inclusion phase and a backward elimination phase. In each step of the forward phase, covariate–parameter relationships were tested one at a time. The most significant relationship, given that it is statistically significant, was retained in the next step. Each remaining covariate–parameter relationship was again added one at a time to the model and the most significant was retained. This process was repeated until no more relationships were significant, at which point the full model was established. The full model was subsequently subjected to a stepwise backward elimination of relationships. Each covariate–parameter relationship was omitted from the model one at a time. The least significant relationship, given that it is not statistically significant, was removed from the model. This was repeated until no more relationships could be removed, at which point the final model was established. For the forward selection step, a p < 0.05 (\(\chi^{2}_{\text{p}\,=\,0.01, \nu \,=\,1}\,=\,3.84\)) was used while the backwards deletion step used a p < 0.001 (\(\chi^{2}_{\text{p}\,=\,0.01, \nu \,=\,1}\,=\,10.83\)).

An allometric scaling model was used to describe the influence of body size and age on the clearance and volume parameters using Eq. 1:

$$P_{i} = \theta_{1} \times \left( {\frac{{{\text{COV}}_{i} }}{{{\text{COV}}_{\text{POP}} }}} \right)^{{\theta_{2} }} \times e^{{\eta_{i} }}$$
(1)

where θ1 is the population value of the estimated parameter, Pi is the individual-specific realization for the ith subject with the value of the covariate (\({\text{COV}}_{i}\)) scaled to a population typical value (), \(\theta_{2}\) is the allometric effect and ηi is the individual-specific realization of a random variable \(\eta \sim {\mathcal{N}}\left( {0,\omega^{2} } \right)\) (IIV).

A linear model was used to describe the influence of sex on the clearance and volume parameters.

The final model will contain estimates for the model parameters and their associated IIV, and estimates for the residual variability and covariate effects. The criteria for accepting the NONMEM model estimation as the final run included the following:

  • A ‘successful minimization’ statement issued by the NONMEM program, indicating convergence of maximum likelihood estimation.

  • The standard error of estimates (if any) should preferably be less than half the estimate itself.

  • There should be no unacceptable trends in the basic goodness-of-fit plots.

In order to provide an additional assessment of parameter uncertainty not relying on normality assumptions, the final model was bootstrapped 300 times.

Model evaluation included graphical [i.e., goodness-of-fit plots, prediction-corrected visual predictive checks (pcVPCs), normalized prediction distribution errors (NPDEs)] and numerical (bootstraps) evaluation.

External validation was performed by overlapping model predictions with additional data from 8 patients as part of routine clinical practice.

2.4 Simulation Approaches to Inform Dose Optimization

The obtained final POPPK model was used to discuss whether the dosing strategies currently applied are adequate in producing HCQ concentration profiles consistent with the concentration ranges in responders to treatment. Simulations were used to compare the exposure ranges with alternative dosing regimens.

3 Results

A total of 84 concentration samples were obtained from the 33 patients enrolled in the clinical study and from the additional 23 patients treated for COVID-19 disease in St Luc and St-Pierre Hospitals. None of the concentrations measured were above or below the limit of quantification. An exploratory plot of the observed concentrations is shown in Supplementary Figure S1. Body weight values were missing for 2 patients. They were replaced by the median value in the initial dataset. The remaining patients were used for external validation. Characteristics of patients included the model building, and validation datasets are described in Table 1, together with their baseline clinical symptoms and viral load. A one-compartment model with first-order absorption and elimination best described the observed data. The addition of a lag time component did not improve the model fitting performance, so this parameter was therefore not kept in the model for subsequent steps. Correlation was visually tested between candidate covariates, and the results are shown in Fig. 1. Good correlation was found between body weight and sex. Age was not correlated with body weight or sex.

Table 1 Summary baseline characteristics of patients included in the model building and external validation datasets
Fig. 1
figure 1

Visual inspection of correlation between the patient characteristics: a boxplot of weight vs. sex, b boxplot age vs. sex, c age vs. weight. In the boxplots, the ends of the box are the 1st and the 3rd quartiles (comprise 50% of the observations), the central line represents the median and the whiskers correspond to 1.5 times the interquartile range (dots are the outliers)

During the covariate model building, body weight, and sex were found to significantly influence the clearance, with a higher drop in the objective function value for the body weight effect. However, because of the correlation between both, only body weight was kept in the final model. The influence of age on clearance did not reach the predefined statistical significance level to be kept in the final model. The final POPPK model was therefore a one-compartment model with the body weight effect on clearance. Final model parameter estimates are summarized in Table 2, which also includes bootstrap results and corresponding parameters in the previously published model in RA [2], for easy comparison. All the parameters were plausible given the data, and adequately estimated as also supported by the bootstrap results. The model adequately fitted the observed data, as shown by the basic goodness-of-fit plots (Fig. 2a–d), the NPDEs plots (Fig. 2e–f), and the pcVPCs based on 1000 simulations of the used dataset and with binning on time (Fig. 3). Weight influences the clearance value through an allometric relationship. Figure 4 shows the post hoc clearance values with respect to the patient body weights.

Table 2 Final model parameter estimates of hydrochloroquine
Fig. 2
figure 2

Basic goodness-of-fit plots and normalized prediction distribution errors (NPDEs) plots: a observed hydrochloroquine concentration vs. population prediction, b observed concentration vs. Individual prediction, c residuals vs. population prediction, d residuals vs. time, e NPDEs vs. TIME, f histogram of NPDEs. Filled circles, open squares, and crosses correspond to patients with WT ≤ 80 kg, WT > 80 kg, and missing WT, respectively. Solid lines are either the line of identity (a, b) or the line \(x = 0\) (ce)

Fig. 3
figure 3

Prediction-corrected VPC (1000 simulations of the dataset) with binning on time. Width of each bin is 24 h. Red and blue lines are the median and the 5th and 95th percentiles of the prediction-corrected observed blood concentration of hydrochloroquine (HCQ) in each bin. Red and blue areas are the simulation-based 90% confidence interval for the model predicted median and 5th and 95th percentiles of each bin. Points represent the prediction-corrected observations

Fig. 4
figure 4

Individual-specific hydrochloroquine clearance parameter value (post hoc) vs. body weight

This model was used to simulate different dosing scenarios (for 5 and 10 days) as implemented in clinical study protocols and national guidelines. The results of the different simulations are shown in Fig. 5, and related probability of target attainment based on in vitro vero cells EC50 are provided in Table S1.

Fig. 5
figure 5

Simulations of hydrochloroquine (HCQ) blood concentrations (\(n_{\text{sim}}\) = 1000) using the final model for two representative patients of 50 kg (a, b) and 150 kg (c, d). The horizontal line represents the projected target total whole blood concentrations based on EC50 values from Yao et al. [4] (a, c) and Liu et al. 5 (b, d). Blue curve median pharmacokinetic profile simulated according to the Belgian protocol, i.e., 400 twice daily (BID) on day 1, 200 mg BID the subsequent days. Orange curve median pharmacokinetic profile simulated according to a dosing regimen of 800 mg followed by 800 mg 6 h later and 400 mg 6 h later on day 1, 400 mg BID the subsequent days. Yellow curve median pharmacokinetic profile simulated according to a dosing regimen of 600 mg BID on day 1, 200 mg BID the subsequent days. Purple curve median pharmacokinetic profile simulated according to a dosing regimen of 200 mg TID on day 1, 200 mg BID the subsequent days. Green curve median pharmacokinetic profile simulated according to a dosing regimen of 200 mg BID. Black red curve median pharmacokinetic profile simulated according to a dosing regimen of 400 BID on day 1, 400 mg daily the subsequent days. Solid lines treatment for 10 days. Dotted lines treatment for 5 days

Figure 6 shows the predictions obtained with the model by Carmichael et al. [2] using the dosing regimen of 400 mg twice daily (BID) on day 1, followed by 200 mg BID the subsequent days (Belgian protocol). The observed concentrations in our model building dataset are overlaid.

Fig. 6
figure 6

Predictions of hydrochloroquine (HCQ) blood concentrations (\(n_{sim}\) = 1000) using the model of Carmichael et al. [1] Blue lines represent the median of the simulations and black lines are the 5th and 95th percentiles (90% prediction interval). Points correspond to the observations in our study. Filled circles, open squares, and crosses correspond to patients with WT \(\le\) 80 kg, WT >80 kg and missing WT, respectively

4 Discussion

Six POPPK models are available in the literature for HCQ. [1,2,3, 20, 21] Two of these were developed using whole-blood HCQ concentrations in patients with RA [2] and CLE [3]. The four additional models describe plasma concentrations, or merged blood and plasma concentrations, in different indications [1, 3, 21, 22]. The estimated apparent clearance values were quite consistent across previously published POPPK models: 10–14 L/h when whole-blood concentrations were analyzed [1, 2] versus 11–68 L/h when plasma concentrations were analyzed [1, 3, 21, 22]. We are hereby reporting for the first time a POPPK model in COVID-19 patients. The fitting of model by Carmichael et al. [2] without parameter re-estimation resulted in an over-estimation of the concentrations observed in our study (see Fig. 6). This may be due to the fact that, unlike the previously published models, a strong effect of body weight was found on clearance in our final model (see Table 2). The predictive performances of our model were confirmed on an external set of data collected as part of standard of care and not used for model building (the result is shown in Supplementary Figure S2). It should, however, be noted that this model still carries a moderate to high unexplained variability component on the volume of distribution and clearance parameters, which can be explained by untested covariates, such as concomitant use of CYP2D6 modulators and underlying renal impairment. Another difference from previously published models is the higher estimate of Ka in our model. This value can be explained by tablet crunching and administration through nasogastric tubes in some of the patients who were in ICU or unable to swallow the tablets. However, this information was not consistently collected and could therefore not be adequately tested during the covariate analysis.

The actual value of the present model resides in its ability to inform the characterization of reference concentrations for efficacy and/or safety of HCQ in COVID-19, and therefore informs the characterization of the therapeutic range and the (range of) safe and effective doses. For this, there is a need for an exposure–response rationale regarding drug efficacy and drug safety. Good understanding of the determinants of clinical efficacy and safety (QT prolongation) would be the next step.

The mechanism of action of HCQ in the context of SARS-CoV-2 remains poorly defined. Although hydroxychloroquine’s effect on direct inhibition of the virus and its anti-inflammatory and immunomodulatory activities are known, whether these mechanisms play a role against COVID-19 remains to be determined. [24].

While the application of HCQ to COVID-19 stems is justified by its past use as antiviral, including for SARS-CoV-1, there has been conflicting evidence on HCQ effects in reducing viral load with different half-maximal effective concentration (EC50) or 90% of the maximal effective concentration (EC90) values reported [4, 5]. Moreover, according to the recent paper by Maisonnasse et al. [7], HCQ showed antiviral activity in African green monkey kidney cells (VeroE6) but not in a model of reconstituted human airway epithelium, and not in macaques after different treatment strategies were tested in comparison to placebo, before and after peak viral load, alone, or in combination with azithromycin. If their clinical relevance is established, reported EC50 and/or EC90 values can be used for in vitro and in vivo extrapolation to inform the dose selection in conjunction with the pharmacokinetic information provided by our model. For graphical and numerical comparative purposes, EC50 values by Yao et al. [4] (0.72 µM) and Liu et al. [5] (4.51 µM), scaled to whole-blood concentrations (assuming 50% protein binding in patients [25] and a serum/whole-blood ratio of 0.53 [26]), are included in Fig. 5. Information about the proportion of simulated subjects achieving target concentrations are provided in Supplementary Table S1. It should be noted that, if such an exercise is implemented, the dosing regimen that would lead to an average free minimum (Ctrough) plasma concentration higher than the reported in vitro EC50 or EC90 would be higher than the currently tested doses in most of clinical protocols and national guidelines so far published. This would have a particular impact in over-weight patients who would need higher doses due to body weight effects on clearance (see Fig. 4). One should, however, keep in mind that an immunomodulatory effect of HCQ in COVID-19 disease cannot be excluded. Similar to its effects in RA and immunological diseases, HCQ is supposed to have some effect on immune activation. It has been described that HCQ may inhibit production of pro-inflammatory cytokines, [24] and therefore decrease the later inflammatory stage of the disease. An additive (or synergistic) anti-inflammatory effect could thereby support the need for doses lower than those determined based only on the potential antiviral effects.

As regards drug safety, matched concentration and QT data in the same patients are needed to develop a concentration–QT model that could be used in conjunction with this model to characterize unsafe doses as regards QT prolongation.

A firm dose recommendation cannot be provided based solely on our pharmacokinetic results, given the remaining uncertainties in the characterization of the exposure–response for HCQ in COVID-19 for both efficacy and safety. Optimal dose selection should be informed by the pharmacokinetics and the exposure–response analyses. Adequately conducted exposure–response analyses, essential to characterize the target concentrations for drug efficacy and safety, are currently lacking for HCQ in COVID-19. When available, these would complement our findings and permit reliable dose recommendations for HCQ in COVID-19.

That said, some learning and clinical implications are already clear from our findings. The first is that, if one bases the dosing recommendation only on the in vitro results obtained with kidney cells (VeroE6), our simulation results show that the doses used in most of the protocols and clinical studies are suboptimal for HCQ efficacy. The second implication is that, given the important influence of body weight on drug clearance, this should be taken into account for dosing recommendations for HCQ in COVID-19 patients.

5 Conclusion

The developed models provided useful insight for the dosing optimization of HCQ in COVID-19 which is still an unmet and urgent need. The present results should be used in conjunction with exposure–efficacy and exposure–safety data to inform optimal dosing of HCQ in COVID-19.