Background

Combination therapy for drug-susceptible tuberculosis (TB) using isoniazid (H), rifampicin (R), pyrazinamide (Z) and ethambutol (E) has been the standard of care for several decades. While there is recognised toxicity associated with some of the individual drugs (for example hepatotoxicity [1], peripheral neuropathy [2], and gastrointestinal upset [3]) there are few prospective studies of regimen-related toxicity [4,5,6,7]. Consequently, clinicians treating tuberculosis largely rely on retrospective or anecdotal evidence to guide their choices when patients experience adverse events.

Previous reports include female gender, alcohol use, human immunodeficiency virus (HIV) infection and certain ethnic groups [3,4,5,8,9,10,11] as risk factors for experiencing adverse events during therapy. Accurate definition of the risk is confounded by the varying definitions of adverse events or drug-related toxicity, and by uncertainty about the recording mechanism or its completeness. For example, reported rates of ethambutol-related impairment of visual acuity vary from 0.02% [12] to 9.4% [13] despite the importance of the complication and the simplicity of its measurement.

To overcome the difficulties of biased reporting we used the consistent adverse event reporting system of the REMoxTB trial [14] to investigate drug-related toxicity, as we believe that this is the most comprehensive source of safety data for standard tuberculosis therapy currently available. The aim of the paper was to accurately characterise the patients at greatest risk and the incidence and nature of the toxicity related to standard TB therapy, and to investigate the impact of toxicity on treatment outcomes. Additionally, the paper aimed to compare the incidence of toxicity for standard TB therapy and the two experimental arms in REMoxTB.

Methods

REMoxTB trial

The REMoxTB trial [14] was a double-blind, placebo-controlled, randomised phase III trial to investigate two experimental moxifloxacin (M)-containing treatment regimens to treat pulmonary tuberculosis. There were 1931 patients randomised between 2007 and 2012 with 655 assigned to the “isoniazid arm” (2MHRZ/2MHR), 636 assigned to the “ethambutol arm” (2EMRZ/2MR), and 639 allocated to standard TB therapy as a control (2EHRZ/4HR). Patients were followed for 18 months after randomisation. We included all randomised patients in the REMoxTB trial who had received at least one dose of their allocated treatment.

Handling of safety data

Adverse events (AEs) were defined as any untoward medical occurrence in a patient administered the trial medication (with or without a causal relationship to the drugs) and were graded on a severity scale of 1 (least severe) to 4 (most severe) based on the Division of AIDS of the National Institute of Allergy and Infectious Diseases criteria [15]. The seriousness of an event (e.g. hospitalisation, life-threatening, death) irrespective of the severity was defined according to standard criteria. The local clinicians made the relatedness assessment for each event and those that were classified as “possibly”, “probably” or “definitely” related to drug therapy were considered to be “related” for the purposes of this analysis. Events that were assessed as either “unlikely related” or “not related” were considered to be “not related”. During the trial, sites were regularly monitored for data collection, and serious AEs (SAEs) were discussed in detail between the study medical monitor and the local clinician before being discussed in a safety board meeting with senior clinical specialists in the trial consortium to ensure quality control. Analyses and data handling were done using Stata statistical software version 14.1 (StataCorp, Texas).

Baseline characteristics in standard therapy

Baseline characteristics for all the patients assigned to standard therapy, and the characteristics for those patients who experienced one or more related or unrelated grade 3/4 AE and those who experienced one or more related grade 3/4 AE were tabulated. Univariable logistic regression was performed using each of the baseline characteristics for patients receiving standard therapy in the table against a binary outcome for experiencing one or more total or related grade 3/4 AE. Those variables with a p-value of < 0.10 were manually selected for inclusion in a multivariable model, with age, gender and baseline weight included regardless of univariable p value due to their clinical relevance. Random-effects multivariable logistic regression was used to test for associations between the selected variables and grade 3/4 AEs with trial centre used as the panel variable to account for any effect from the individual sites.
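As an illustration only, the variable selection rule described above can be sketched in Python. The analysis itself was run in Stata, so this is not the code used for the paper; the function name, covariate labels and example p-values are hypothetical, and only the 0.10 threshold and the three forced covariates are taken from the text.

```python
def select_covariates(univariable_p, threshold=0.10,
                      forced=("age", "gender", "baseline_weight")):
    """Choose covariates for the multivariable model.

    univariable_p maps each baseline characteristic to its univariable
    p-value. Variables with p < threshold are kept, and the forced
    covariates are always included regardless of their p-value.
    """
    selected = [name for name, p in univariable_p.items() if p < threshold]
    for name in forced:
        if name not in selected:
            selected.append(name)
    return selected


# Hypothetical p-values for illustration (not trial results).
example_p = {"age": 0.40, "gender": 0.03, "baseline_weight": 0.02,
             "hiv_positive": 0.001, "smoking": 0.55}
print(select_covariates(example_p))
```

In this toy example "smoking" is dropped, while "age" is retained despite its p-value because it is one of the covariates included on clinical grounds.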

Adverse events in standard therapy over time

Incidence of grade 3/4 AEs and serious adverse events of any severity grade (SAEs) by treatment phase of standard therapy were categorised based on the grade 3/4 AE start date: intensive (weeks 0–8), continuation (weeks 9–26), and follow-up (week 27-month 18 after randomisation). MedDRA coding for System Organ Class and Preferred Term was used to identify the most common classes of adverse event. Patients were categorised based on the number of grade 3/4 AEs experienced in each phase of treatment. The mean number of SAEs per patient was calculated by dividing the total number of SAEs by the total number of patients with one or more SAE in each treatment phase. Patients who were withdrawn or died in the previous treatment phase were not included in later phases in order to present an accurate denominator for the number of patients at risk of an event in each of the three treatment phases. To illustrate the risk of grade 3/4 AE occurrence by time on standard treatment we constructed an Epanechnikov kernel smoothed hazard estimate with 95% confidence intervals and plotted the hazard function on the y-axis and the number of weeks from first dose on the x-axis.
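The phase assignment and the mean number of SAEs per affected patient can be sketched as below. This is a minimal Python illustration under stated assumptions, not the Stata code used for the analysis: event timing is assumed to be expressed as whole weeks since randomisation, the function names and example records are hypothetical, and the exclusion of patients who were withdrawn or died in an earlier phase is not modelled.

```python
from collections import Counter


def treatment_phase(week):
    """Map weeks since randomisation to a phase of standard therapy."""
    if week < 0:
        raise ValueError("week must be non-negative")
    if week <= 8:
        return "intensive"        # weeks 0-8
    if week <= 26:
        return "continuation"     # weeks 9-26
    return "follow-up"            # week 27 to month 18


def mean_saes_per_affected_patient(sae_records):
    """sae_records: list of (patient_id, week) pairs, one per SAE.

    Returns, for each phase, the total number of SAEs divided by the
    number of patients with one or more SAE in that phase.
    """
    totals = Counter()
    patients = {}
    for patient_id, week in sae_records:
        phase = treatment_phase(week)
        totals[phase] += 1
        patients.setdefault(phase, set()).add(patient_id)
    return {phase: totals[phase] / len(patients[phase]) for phase in totals}


# Hypothetical records: patient A has two SAEs in the intensive phase.
records = [("A", 2), ("A", 5), ("B", 7), ("C", 12)]
print(mean_saes_per_affected_patient(records))
```

Note that the denominator here is patients with at least one SAE in the phase, not all patients at risk, which matches the definition given above.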

Adverse events and treatment outcomes on standard therapy

The number of grade 3/4 AEs reported by each patient taking standard therapy was related to their microbiological outcome at 18 months after starting treatment. Patients were scheduled for 8 weekly visits followed by 8 visits until 18 months after randomisation, and early morning and spot sputum samples were to be collected (where possible) at each visit. Sputum culture results were available for both solid and liquid media in the trial database.

For this analysis, cure was defined as patients who were culture negative at either 18 months or when they were last reviewed in the trial with at least two consecutive negative cultures on both solid and liquid media prior to their final negative result. This outcome was based exclusively on recorded culture status and was independent of the patient’s outcome in the original publication [14]. Patients were grouped according to whether they had experienced ≥1 total or related grade 3/4 AE, and the proportions of cured patients in these groups were tabulated. The Chi square test was used to test for significance and binary logistic regression with cure as an outcome was used to test the association between experiencing ≥1 total or related grade 3/4 AE and odds of cure. Sex, age, baseline weight and HIV status were included in a multivariable logistic model.
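One possible reading of the cure definition above is sketched below in Python. It assumes that each visit has a result on both media, that a visit counts as negative only when both solid and liquid cultures are negative, and that the two consecutive negative visits may fall anywhere before the final visit. Missing results, which would need their own rule, are not handled, and the function name is hypothetical.

```python
def is_cured(visits):
    """visits: chronological list of (solid_negative, liquid_negative).

    Returns True when the final visit is negative on both media and at
    least two consecutive earlier visits were also negative on both.
    """
    if not visits:
        return False
    negative = [solid and liquid for solid, liquid in visits]
    if not negative[-1]:
        return False
    run = 0
    for is_negative in negative[:-1]:
        run = run + 1 if is_negative else 0
        if run >= 2:
            return True
    return False


# Hypothetical culture histories for illustration.
print(is_cured([(False, False), (True, True), (True, True), (True, True)]))
print(is_cured([(True, True), (True, False), (True, True), (True, True)]))
```

The first history meets the definition under these assumptions; the second does not, because the discordant liquid culture at the second visit breaks the run of negative results.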

Adverse events in all treatment arms

The incidence and classification of grade 3 or 4 adverse events (grade 3/4 AEs) and number of patients affected were calculated across the treatment arms according to the timing of the event: weeks 0–8 (EHRZ received on standard arm; MHRZ on isoniazid arm; EMRZ on ethambutol arm), weeks 9–17 (HR on standard arm; MHR on isoniazid arm; MR on ethambutol arm), weeks 18–26 (HR on standard arm; placebo on both isoniazid and ethambutol arms), and months 7–18 (no treatment administered for any arm and in trial follow-up). Grade 3/4 AEs were considered “clinically significant”. The proportion of patients with one or more grade 3/4 AE was compared across the treatment arms at each time window using the Chi square test. Time to first grade 3/4 AE in days after the first dose received was taken as the event of interest and Kaplan-Meier curves were constructed to illustrate the timing of grade 3/4 AEs in the three arms. The log rank test was used to compare the time to event in the standard therapy arm against the experimental arms individually.
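For readers unfamiliar with the Kaplan-Meier method, the product-limit calculation behind such curves can be sketched in plain Python. This is an illustration under stated assumptions rather than the Stata routine used for the analysis: the function name and the toy data are hypothetical, and a patient censored on the same day as an event is treated as still at risk on that day.

```python
def kaplan_meier(times, events):
    """Product-limit estimate of the proportion still event-free.

    times:  days from first dose to first grade 3/4 AE or to censoring
    events: True if the AE was observed, False if the patient was censored
    Returns a list of (time, proportion_event_free) at each event time.
    """
    data = sorted(zip(times, events))
    at_risk = len(data)
    survival = 1.0
    curve = []
    i = 0
    while i < len(data):
        t = data[i][0]
        n_events = 0
        n_total = 0
        # Group all patients sharing the same time point.
        while i < len(data) and data[i][0] == t:
            n_events += data[i][1]
            n_total += 1
            i += 1
        if n_events:
            survival *= 1 - n_events / at_risk
            curve.append((t, survival))
        at_risk -= n_total
    return curve


# Hypothetical data for five patients (not trial data).
curve = kaplan_meier([5, 10, 10, 20, 30], [True, True, False, True, False])
for day, proportion in curve:
    print(day, round(proportion, 3))
```

Each step multiplies the running estimate by the fraction of at-risk patients who remained event-free at that time, which is why the curve only drops on days when an event occurs.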

Ethics approval and participant consent

The REMoxTB study was carried out with approval from the ethics board at University College London, and this included approval for the use of data and samples collected in other studies to improve the diagnosis and treatment of tuberculosis. All randomised patients agreed to any data and samples collected as part of the trial being used in further studies to improve the diagnosis and treatment of tuberculosis, as stated on the informed consent form for the study. All the research activities and data collection for the study were compliant with the Helsinki Declaration and the principles of Good Clinical Practice.

Results

Baseline characteristics for patients allocated to standard therapy

Of 639 patients taking standard therapy 57 (8.9%) experienced one or more grade 3/4 AEs judged to be related to their treatment, compared to 45 (6.9%) of 655 in the isoniazid and 40 (6.3%) of 636 in the ethambutol arm (p = 0.21, see Tables 1 and 5). Baseline weight as a categorical variable (OR 0.79, 95% CI 0.65–0.97), female sex (OR 1.60, 95% CI 1.06–2.39), and HIV infection (OR 3.45, 95% CI 1.86–6.42) were significantly associated with ≥1 grade 3/4 AE in univariable logistic regression. However, only HIV infection was significantly associated with experiencing any grade 3 or 4 AE in a multivariable model (adjOR 3.43, 95% CI 1.82–6.49). Female sex (adjOR 1.97, 95% CI 0.91–1.83) and HIV infection (adjOR 3.33, 95% CI 1.55–7.14) were significantly associated with grade 3/4 AEs considered related to standard therapy after being selected for inclusion in the multivariable model (both p values < 0.05, see Table 2).

Table 1 Baseline characteristics of patients in the standard therapy arm
Table 2 Logistic Regression output to test the association between baseline characteristics and the risk of experiencing one or more grade 3 or 4 Adverse Event (AE) on standard TB therapy

Adverse events in standard therapy

Among the 113 related grade 3/4 AEs in the standard therapy group, 80 (70.8%) were reported in the intensive phase of treatment (months 1 and 2) as shown in Table 3 and illustrated in Fig. 1. Of the 57 patients who experienced ≥1 related grade 3/4 AE on treatment, 47 (82.5%) experienced an event in the intensive phase. The related adverse events most commonly reported were elevated liver enzymes (38 of 38 “hepatobiliary” events), arthralgia (15 of 22 “musculoskeletal” events), and diabetic complications (4 of 12 events attributed to “metabolism & nutrition”) (see Table 3). There was one case of deterioration in visual acuity reported in the standard arm (data not shown).

Table 3 Events in standard arm by treatment phase
Fig. 1

Hazard Curve for Related Grade 3 & 4 Adverse Events. Hazard Curve for Grade 3 or 4 Related Adverse Events According to Number of Weeks Since First Dose of Standard Therapy. Hazard function for the occurrence of a grade 3 or 4 related adverse event (with 95% confidence intervals) is plotted on the y axis, with the number of weeks following the first dose of standard tuberculosis therapy on the x axis. The rise in the hazard function after week 25 is accounted for by 2 events reported as “possibly” related to study drug

While the majority of SAEs were reported during therapy, 10 of the 16 deaths in this treatment arm occurred after treatment was completed (see Table 3). The most common causes of death were trauma, suicide or an unknown but presumed violent cause (8 of 16), and TB disease (3 of 16). Right heart failure, sepsis of unknown origin, and uncontrolled hypertension accounted for the remaining three deaths. None of the deaths in the standard therapy group were assessed as related to treatment.

Treatment outcomes for patients on standard therapy

Patients who had one or more total or related grade 3/4 AE were less likely to achieve microbiological cure compared to patients who did not experience a grade 3/4 AE (see Table 4). Of patients taking standard TB therapy, 21.1% (27 of 128) of patients with ≥1 grade 3/4 AE were not cured, compared to 9.2% (47 of 511) of patients who did not experience any grade 3/4 AE. Similarly, 26.3% (15 of 57) of patients who experienced ≥1 grade 3/4 AE considered related to treatment did not achieve cure compared to 10.1% (59 of 582) of patients who did not experience a related grade 3/4 AE (p value < 0.001).
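The comparison for related grade 3/4 AEs can be checked from the counts quoted above with a short Python sketch. It uses Pearson's Chi square test for a 2 × 2 table without continuity correction, which is an assumption, since the exact variant used in the analysis is not stated; the function name is hypothetical.

```python
import math


def chi_square_2x2(a, b, c, d):
    """Pearson Chi square test (no continuity correction) for the table
        [[a, b],
         [c, d]]
    Returns (statistic, p_value) on 1 degree of freedom.
    """
    n = a + b + c + d
    stat = n * (a * d - b * c) ** 2 / ((a + b) * (c + d) * (a + c) * (b + d))
    # With 1 degree of freedom, P(X > stat) = erfc(sqrt(stat / 2)).
    p_value = math.erfc(math.sqrt(stat / 2))
    return stat, p_value


# Related grade 3/4 AE: 15 of 57 not cured.
# No related grade 3/4 AE: 59 of 582 not cured.
stat, p = chi_square_2x2(15, 57 - 15, 59, 582 - 59)
print(round(stat, 2), p < 0.001)
```

Under these assumptions the statistic is approximately 13.3, which is consistent with the reported p value of < 0.001.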

Table 4 Rates of microbiological cure according to number of Grade 3 or 4 adverse events experienced by patients taking standard TB therapy

Experiencing ≥1 related or unrelated grade 3/4 AE was significantly associated with not being cured of TB in a multivariable logistic regression model (adjOR 2.60, 95% CI 1.52–4.46, p < 0.001). A similar relationship was seen between ≥1 related-only grade 3/4 AE and an outcome of not cured (adjOR 3.11, 95% CI 1.59–6.10, p value < 0.001). The multivariable model included sex, age and baseline weight (clinical significance) and HIV status (due to earlier reported association with AE incidence).
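As a rough check on these estimates, crude (unadjusted) odds ratios can be computed from the counts reported in the preceding results paragraph. The Python sketch below uses the Woolf (log) method for the 95% confidence interval; it is not the multivariable model, so the figures are expected to be close to, but not identical with, the adjusted values. The function name is hypothetical.

```python
import math


def crude_odds_ratio(a, b, c, d, z=1.96):
    """Unadjusted odds ratio with a Woolf (log-based) confidence interval.

    exposed:   a with the outcome, b without
    unexposed: c with the outcome, d without
    Returns (odds_ratio, lower, upper).
    """
    odds_ratio = (a * d) / (b * c)
    se_log = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)
    lower = math.exp(math.log(odds_ratio) - z * se_log)
    upper = math.exp(math.log(odds_ratio) + z * se_log)
    return odds_ratio, lower, upper


# Outcome: not cured.
# >=1 grade 3/4 AE: 27 of 128; no grade 3/4 AE: 47 of 511.
any_ae = crude_odds_ratio(27, 128 - 27, 47, 511 - 47)
# >=1 related grade 3/4 AE: 15 of 57; no related grade 3/4 AE: 59 of 582.
related_ae = crude_odds_ratio(15, 57 - 15, 59, 582 - 59)
print([round(x, 2) for x in any_ae])
print([round(x, 2) for x in related_ae])
```

The crude odds ratios come out at roughly 2.6 and 3.2, in line with the adjusted estimates, which suggests that adjustment for sex, age, baseline weight and HIV status changed the association very little.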

Adverse events across all treatment arms over time

Most grade 3/4 AEs occurred during the intensive phase for all regimens (see Table 5) with 80 (73.4%), 51 (81.0%) and 44 (67.7%) related grade 3/4 AEs during the intensive phase in the standard, isoniazid, and ethambutol arms respectively. Both experimental arms had lower numbers of related grade 3/4 AEs (64 and 66 in the isoniazid and ethambutol arms vs 113 during standard therapy). There was a significant difference in the proportion of patients experiencing ≥1 related grade 3/4 AE in the intensive phase (p value 0.03) with the smallest proportion in the ethambutol arm (25 of 636 [3.9%], see Table 5). In all treatment arms the most common type of related grade 3/4 AE was “hepatobiliary disorders” (40.7% in standard therapy, and 42.2% and 37.9% in the isoniazid and ethambutol arms respectively).

Table 5 Comparing adverse events in treatment arms

There was no difference in either the overall total or related total of grade 3/4 AEs between the three treatment arms during weeks 18–26 when patients in the experimental regimens were receiving placebo. The Kaplan-Meier curves in Fig. 2 illustrate the majority of events occurring in the intensive phase followed by a plateau from approximately 9 weeks after starting treatment (log rank p = 0.19 for comparing standard therapy and isoniazid arm; p = 0.07 for standard therapy and ethambutol arm). The drop seen at 8 weeks of treatment in the number of patients at risk was driven by one site reporting 40 grade 3/4 AEs (30 considered related) from all treatment arms in a 30-week time window of the trial between May and December 2010 (the site reported a total of 146 grade 3/4 AEs). Of these 40 events, 36 (90%) were reported in the intensive phase of the patient’s treatment.

Fig. 2

Related Grade 3 or 4 Adverse Events By Treatment Arm. Kaplan-Meier Curves for Time to First Event for Related Grade 3 or 4 Adverse Events in the Treatment Arms. The time to first event is plotted for all the patients at risk in the standard (blue), isoniazid (red) and ethambutol (green) arms. The y axis plots the proportion of the patients still at risk, and the risk table presents this numerically. The data were censored at 200 days after the first dose for all three arms, and neither the isoniazid arm (p = 0.19) nor the ethambutol arm (p = 0.07) differed significantly from standard therapy using the log rank test.

Discussion

In a study encompassing a large number of drug-sensitive TB patients from across the world there is evidence that almost a tenth of patients experienced serious side effects due to their TB medication. The existing literature quotes a rate of approximately 5–20% for significant toxicity from standard TB therapy [5, 9, 16,17,18,19,20,21] and hepatotoxicity is the most frequently detected [1, 22, 23]. The liver enzyme profile on treatment for standard TB therapy and the experimental arms in REMoxTB has been described in more detail elsewhere [24]. As the exclusion criteria removed those with severe disease or concomitant diseases, our estimate must be considered a minimum [14]; however, the follow-up period was comparable to two other recent large tuberculosis trials [25, 26].

We observed that most of the AEs in standard therapy occurred in the intensive phase. The explanation for this is uncertain, and could involve a degree of survivorship bias or increased tolerance of side effects, but may relate to the presence of pyrazinamide in all three regimens. Pyrazinamide is a drug with a well-recognised toxicity profile [11], while ethambutol (the other drug only present for the intensive phase) has few reported side effects [27]. Additionally, hepatotoxicity and arthralgia were among the most common events and these are frequently reported side effects of pyrazinamide [28, 29]. The sterilising activity of pyrazinamide makes it an essential component of standard therapy [30], but there is still some uncertainty surrounding its ideal dosing [31] and there is evidence of a dose-response relationship with toxicity [32]. There is a pressing need to direct more research to optimise the most effective and least toxic dose alongside the other components of the standard regimen [33].

It is perhaps significant that there was little difference in the number of related AEs in months 5 and 6 between those receiving active treatment and those on placebo. This could emphasise the importance of TB-induced pathology on the presence and reporting of significant medical events. Reducing toxicity associated with medication is one of the factors driving the development of shorter treatment regimens for TB [34]; however, this finding suggests that concerns about toxicity may not be as important as previously thought. While the experimental arms were less toxic, it should be noted that they were also less effective, even though both reduced the bacterial load more quickly than the standard regimen [14]. Whether there is a causal relationship between these observations is not known. This means, perhaps, that the motivation for shortening treatment needs to focus on patient acceptability and the logistical benefits of fewer doses, fewer visits to clinics and enhanced adherence.

We found female patients and HIV-positive patients to be at significantly higher risk of toxicity. Existing guidelines acknowledge the issues surrounding TB-HIV co-infection [35, 36], but these do not reference female gender as a risk factor for a more complicated treatment course (outside of pregnancy). It is unclear if reporting bias has played a role in AE recording for the trial, as discrepancies between the genders with regard to healthcare-seeking behaviour have been noted previously [37, 38]. Nonetheless, clinicians should consider closer monitoring of both HIV-positive and female patients taking HRZE, especially in the intensive phase of treatment.

Those patients reporting one or more related grade 3/4 AE were more likely to fail to achieve sustained sputum culture negative status. This is an important observation and emphasises the need to detect toxicity early and manage it properly. The reasons for this difference in outcome are uncertain and would merit further investigation in prospective studies. It may be that better management of drug toxicity in tuberculosis treatment could deliver better outcomes.

It is notable that the majority of deaths occurred after completing treatment and were unrelated to trial medication, emphasising the importance of social context of TB infection. It may be that this is due to other underlying conditions that also contribute to a poor outcome or that experiencing toxicity reduces adherence to therapy. This relationship may explain why the cure rate with standard therapy for drug-sensitive disease can be as low as 80% in real-world settings [39].

This study is limited by innate reporting bias and reliance on a subjective assessment of severity in many cases (for example, pain scores). An example of this is the reporting activity at one site in the trial. After a trial pause this site reported almost one third of its total grade 3/4 AEs in a 30-week period, and assessed 75% of them as being related to treatment. Attributing causality to AEs has been shown to produce unreliable and subjective data [40] and caution has been advised when using trial data to evaluate drug safety profiles [41]. Given the proximity of the recent pause in trial recruitment it could be that there was concern over the safety of the experimental regimens and that in a double-blind trial this translated into a lower threshold to both report events and to attribute causality to the drugs.

While there is still merit in using AEs to investigate drug safety profiles, the often subjective nature of the reporting is a limitation. We are also aware of the potential dangers of drawing conclusions based on relatedness assessments for AEs [40, 42] and to this end have presented both total and related AEs in the analysis. Overall, the careful and consistent way in which data were recorded for this large number of patients does mean, however, that we are able to generate important observations and suggest future research.

In this paper we have shown that most adverse events occur in the intensive phase of treatment, with female patients and those who are HIV positive constituting a demographic that should be closely monitored for toxicity. We have also found that those who experience clinically significant drug-related toxicity while taking standard TB therapy are at greater risk of failing treatment. From this we conclude that we need to improve our methods of detecting and managing patients experiencing toxicity, and that there is real need for novel drugs with more favourable toxicity profiles. Our data provide an evidence base to plan future research and to support improved treatment guidelines. Tuberculosis remains a global health threat, predominantly affecting a vulnerable and disadvantaged population, and this paper illustrates the need for clinicians to be quick to respond to side effects from treatment to ensure their patients have the best chance of achieving a cure.