Introduction

Many drugs are associated with the risk of QT prolongation and torsades de pointes (TdP) [1]. TdP is a rare polymorphic ventricular tachycardia that can cause reversible syncope, ventricular fibrillation, and death [2, 3]. According to the QTDrugs Lists of CredibleMeds, about 200 drugs are associated with the risk of QT prolongation and/or cause TdP [1]. Based on ongoing and systematic analysis of available evidence, drugs may be placed into four QTDrugs Lists on the CredibleMeds website [1]. List 1 contains drugs that prolong the QT interval and are known for their risk of TdP, even when taken as recommended [1].

Several risk factors for QT prolongation and TdP are known [3, 4]. QT prolonging drugs were responsible for 48% of the cases of QT prolongation in a Canadian study [5]. Several non-pharmacological factors, including female sex and age ≥ 65 years, several diseases, as well as electrolyte disturbances were found as risk factors in a quite recent systematic review [4]. Patients with multiple clinically recognizable risk factors, like older adults with polypharmacy and comorbidities, have an increased risk for TdP [6, 7].

Concomitant use of QT prolonging drugs is common and has increased in in- and outpatients [810]. When contemplating pharmacotherapy with a QT prolonging drug, patient-specific risk factors need to be considered [6, 11]. However, the ECG screening rate of emergency department patients receiving QT prolonging drugs is considered low (20.9%) [9]. The American Heart Association and the American College of Cardiology Foundation have released recommendations on the prevention and management of QT prolonging pharmacotherapy and drug-induced TdP, mainly concerning ECG monitoring in an inpatient setting rather than other strategies that may identify patients at higher risk [6, 12]. The QT interval corrected for heart rate (QTc) is an indicator of TdP risk, but it does not fully account for all the risk by itself [6, 11, 13]. Use of a risk assessment tool (RAT) may be a more effective way to identify patients at risk of QT prolongation and TdP for whom repetitive or continuous ECG monitoring, discontinuation of QT prolonging drugs, or serum electrolyte concentration monitoring may be necessary [12, 14].

The aim of this review was to systematically summarize the evidence of different RATs for QT prolonging pharmacotherapy.

Material and methods

Data sources and retrieval of material

This systematic review was conducted according to the PRISMA guidelines [15, 16]. PubMed and Scopus databases were searched for potential studies from 2005 to 2017 (Supplementary Table S1). Author SS worked in a research team helping in the search process (author TT and an information specialist at the Medical Library, University of Helsinki). In PubMed, MeSH (Medical Subject Headings) terms were used and all fields were searched. In Scopus, the fields’ title, abstract, and keywords were searched. To avoid searching for duplicates in Scopus, the search was filtered with “AND NOT INDEX (MEDLINE).” The literature search began on September 22, 2017. Alerts on new articles from the databases were sent to the author’s (SS) e-mail. The search was updated in August 9, 2021.

Inclusion and exclusion criteria

Any study design, excluding narrative reviews, was included. The RAT for QT prolonging pharmacotherapy could be used by any healthcare professional, in any healthcare settings. The RAT could involve anything from a risk score calculator to using computer software or databases to assess the risk. Studies involving risk assessment without using a tool or studies only involving ECG measurement and QT correction were excluded. The studies were included if they involved older adults (≥ 65-year-olds [17]) or the mean/median age of the study participants was ≥ 65 years. Articles were included if written in English and if full-text was available through the University of Helsinki Library.

Selection of studies

Titles and abstracts of studies were read by author SS/TT. In cases of unclear articles, other authors also read the articles. Reference lists of included studies were reviewed. Studies were selected based on title and the article abstract. Full texts were retrieved for selected studies.

Quality assessment of included studies

The included studies were assessed for strengths and limitations by considering the Grading of Recommendations Assessment, Development, and Evaluation (GRADE) approach [18]. The study design was considered for each included study and authors estimated the evidence level and risk of bias, if possible.

Data extraction and analysis

Full-text articles were read (SS/TT), and the following data were extracted and analyzed qualitatively: country; study design, and setting; cohort or sample; number of patients; definition of long QT interval; QT correction formula used; description of the RAT used, strengths and limitations of the RAT, and the study and the primary user of the RAT. The studies were categorized and analyzed according to the RAT used. A detailed summary table of the included studies was made (Supplementary Table S2).

Results

Study selection and description of included studies

The PRISMA diagram of literature search and inclusion process is shown in Fig. 1 [16]. In total, 21 studies were included in the systematic review in which 11 different RATs for prolonged QT were used [12, 14, 1924, 3133, 36, 40, 43, 44]. Most commonly used tools were risk scores (n = 9) [12, 14, 2127], computerized physician order entry systems (n = 3) [3133], and clinical decision support systems (n = 6) [19, 20, 3639]. Of the studies, eight were conducted in the USA [12, 21, 22, 32, 33, 36, 38, 44], four in Belgium [14, 23, 24, 27], two in Australia [25, 40, 43], one in Germany [20], four in the Netherlands [26, 31, 37, 39], and one in Sweden [19]. The RATs were developed mainly for physicians and pharmacists. Most of the studies were conducted in inpatient settings. Of the studies, one applied a prospective controlled interventional study design [20], while the majority were observational studies (Supplementary Table S2).

Fig. 1
figure 1

PRISMA diagram of literature search and inclusion process. PRISMA, preferred reporting items for systematic reviews and meta-analysis

Comparison of studies including risk scores (n = 9)

Nine studies described a risk score for QT prolongation as a RAT (Table 1; Supplementary Table S2) [12, 14, 2127]. In all nine studies, ECG data were available [12, 14, 2127]. The definition of long QT interval differed slightly between the studies (Table 1). The QT interval was corrected for heart rate with three different correction formulae, Bazett’s [28], Fridericia’s [29], and Rautaharju’s correction formulae [30] (Table 1).

Table 1 Studies in which risk scores were applied as a risk assessment tool (n = 9)

In addition to QT prolonging drugs, the risk scores included non-pharmacological risk factors for QT prolongation such as demographic factors, QT interval affecting clinical conditions, and morbidities and electrolyte disturbances (Table 1). QT prolonging drugs were mainly identified using the QTDrugs Lists of CredibleMeds [1, 12, 14, 2127]. Tisdale et al. [12] stated that evidence of QT prolonging drugs was taken from published trials and case reports, and CredibleMeds [1] was a used source. Included QT interval affecting diagnoses differed between the risk scores used in the studies (Table 1). The original RISQ-PATH score by Vandael et al. [14, 24] contained the highest number of risk factors, obtained from a systematic review [4], and the least number of co-morbidities was found in the risk score by Tisdale et al. [12]. The RISQ-PATH score also included risk factors that have not been validated as risk factors (hypertension, cigarette smoking, obesity, and others).

A limitation of five of the studies [2125] was the lack of information on predictive performance (Table 1). The predictive performance was presented in four studies (Table 1) [12, 14, 26, 27]. In the study on the pro-QTc score, an institution-wide QT alert system for assessment of mortality was developed [21]. The system screened all ECGs performed and alerted physicians if finding QTc ≥ 500 ms. The risk scores by Vandael et al. [14, 23, 24, 27] were developed in various hospital wards with different specialties, which makes them more generalizable than the risk score by Tisdale et al. [12], developed in cardiac critical care units. Selection bias may be present in some of the studies [12, 2123]. The RISQ-PATH score developed by Vandael et al. [14] consists of many predictors that are frequently not available, which may challenge the use in clinical practice. The risk of QT prolongation or TdP may have been underestimated in the studies including the preliminary RISQ-PATH score due to missing lab results, exclusion of patients, or unrecognized TdP cases because of lack of information in the patient files or wrongly coded cases [14, 24]. In a later study by Vandael et al. [27], the RISQ-PATH score was optimized and validated in a large patient cohort, and an algorithm was proposed to be used in clinical decision support systems to create smart QT alerts. Risk factors excluded from the original RISQ-PATH score [14] were diabetes, number of drugs in list 2 of CredibleMeds, and thyroid disturbances.

Bindraban et al. [26] developed and validated two risk models (complete and simplified), both having a lower number of risk factors than the RISQ-PATH score [14]. Risk models used variables that usually are automatically available in healthcare information systems, and therefore, the risk models could be implementable in a clinical decision support system.

Comparison of studies on computerized physician order entry systems (n = 3)

Computerized physician order entry systems (CPOE), used to reduce the risk of QT prolongation and related morbidities, were studied by van der Sijs et al. [31], Muzyk et al. [32], and Sorita et al. [33] (Table 2; Supplementary Table S2). The studies did not measure outcomes such as occurrence of cardiovascular illness, TdP, or mortality. The retrospective studies [31, 32] used small sample sizes; thus, their generalizability is limited and selection bias probable. The quasi-experimental study [33] was conducted in an institution-wide setting and could not exclude confounding factors and dependency of data due to the study design [33]. The CPOE QT alert override rate was 77%.

Table 2 Studies using computerized physician order entry systems (CPOE) as a risk assessment tool (n = 3)

Comparison of studies on clinical decision support systems (n = 6)

Clinical decision support systems (CDSS) were studied in six studies [19, 20, 3639] (Table 3; Supplementary Table S2). The interventional study by Bertsche et al. [20] investigated drug-drug interactions (DDIs) and DDI-related adverse drug events in intensive care patients with a CDSS (Table 3). The content of the CDSS was developed by an interdisciplinary team with systematic literature searches. Randomization and blinding were not stated in the article; therefore, selection bias cannot be excluded.

Table 3 Studies researching clinical decision support systems (CDSS) (n = 6)

Tisdale et al. [36] investigated the effectiveness of a CDSS including a risk score [12], for reducing the risk of QT prolongation (Table 3). The pre-intervention group and the implementation group differed significantly in some respects producing a risk of selection bias. Both the CDSS [36] and the included validated risk score [12] were developed in cardiac care units, limiting the external validity. Another limitation was the alert fatigue, as the override rate was 82%. Of all alerts triggered, 13% resulted in additional patient monitoring and 18% resulted in discontinued mediation orders.

Böttiger et al. [19] described the development of PHARAO (Pharmacological Risk Assessment Online), a CDSS presenting a risk profile for adverse events associated with combined effects of concomitantly used drugs (Table 3). A multidisciplinary team searched pharmacological handbooks, summaries of product characteristics, evaluations from European Medicines Agency, and articles through PubMed for evaluation and scoring of 1427 substances. PHARAO did not use pop-ups to alert physicians. To decrease alert fatigue, a restrictive approach was taken to what drug pairs would be included.

Berger et al. [37] developed a tool based on risk factors associated with QTc prolongation determined in a prospective study on QT-DDIs in a university medical center in the Netherlands. The main outcome measure was QTc prolongation defined as a QTc interval > 450 ms for males and > 470 ms for females. Review from literature was conducted on additional risk factors.

Chernoby et al. [38] conducted a retrospective study to evaluate provider response to CDS alert. The tool used the risk score by Tisdale et al. [12]. The primary outcome was the proportion of orders triggering QTc alerts that were continued without intervention in the active phase compared to the silent phase. The proportion reduced 55%.

Berger et al. [39] studied the use of the CDS tool (consisting a paper-based flowchart) to assess the risk of QT drug-drug interactions in community pharmacies in the Netherlands. All QT-DDIs that occurred during a pre- and post-CDS tool period of 3 months were included. The QTc-prolonging drugs involved in the QT-DDIs are listed at the CredibleMeds. Pharmacist intervened in 43.0% and 37.5% of the QT-DDIs pre- and post-CDS tool.

Risk assessment tools focusing on ECG parameters (n = 3)

The QT nomogram [40], the ½ RR rule [43], and the T-wave analysis software [44] were developed in retrospective studies. The RATs [40, 43, 44] focused on ECG parameters.

Chan et al. [40] developed a QT nomogram, based on a cloud diagram [41], for assessing the risk of TdP in a retrospective case-controlled study. Cases of drug-induced TdP (n = 130) were found in a systematic review (329 full-text articles reviewed) [40]. Controls (n = 318) were patients overdosing on non-cardiotoxic drugs, obtained from a previous study [42]. The systematic review included in the retrospective case-controlled study lacked description of included studies and assessment of risk of bias [40].

QT interval and heart rate combinations of cases and controls were plotted with the QT nomogram [40]. For comparison, two curves were plotted corresponding to Bazett’s correction formula [28] at QTc values of 440 ms (medium-risk value) and 500 ms (high-risk value). The validity of some case points of tachycardia was questionable. The sensitivity and the specificity of the QT nomogram and Bazett’s QTc [28] can be seen compared to the ½ RR rule [43] and Fridericia’s QTc [29] in Table 4.

Table 4 Sensitivity and specificity of the QT nomogram [40] and the ½ RR rule [43] compared to QT correction formulae

The ½ RR rule defines an abnormal QT interval as a QT greater than half of the RR interval on an ECG and does not require heart rate correction [43]. Berling and Isbister [43] conducted a study for determination of the sensitivity and specificity and comparison of the ½ RR rule with other methods (Table 4). The obtainment of TdP cases (n = 129) and controls (n = 316) for their study from the literature was described by Chan et al. [40]. Additionally, the study by Berling and Isbister [43] calculated the agreement in eight different sample sets of QT-heart rate pairs from psychotropic medication overdoses. The ½ RR rule [43] and the compared QT nomogram [40] both had poor negative and positive agreement, of which the latter was worse [43]. The ½ RR rule misclassified patients without QT prolongation. Data was collected prospectively [43]. The sensitivity and specificity may be biased for all methods in Table 4 since the cases of TdP were taken from the literature.

Sugrue et al. developed a computer-based repolarization measurement tool to identify electrocardiographic predictors of torsadogenic risk [44]. The software provided extraction of information from automatic 12-lead ECGs. A retrospective T wave analysis of TdP cases and controls using prescribed dofetilide and sotalol was conducted in Mayo Clinic, USA. The sample size was small (ncases = 13, ncontrols = 26). There is potential for confounding by certain clinical factors that differed between the groups, as only sex and age were matched (within 2 years). QTc interval discriminated TdP cases from controls in 79%. The sensitivity and specificity for QTc (Bazett’s) alone were 88.1% and 72.0%, respectively. The positive and negative predictive values were 85.0% and 76.9%, respectively. The sensitivity and specificity in predicting TdP was 79.7% and 46.0%, respectively, for the T wave right slope. The positive and negative predictive values were 58.9% and 70.0%, respectively. Adding the T wave right slope in the analysis with QTc, the discrimination increased to 88%.

Discussion

To our knowledge, this is the first systematic review that summarizes and compares tools for assessing safety risks of patients using QT prolonging drugs. Nine RATs were found of which those applying risk scores were most common, although QT risk assessment was also integrated in some CDSS and CPOE systems. Most of the risk assessment methods were used in inpatient care primarily by physicians and pharmacists. Many of the methods used the QTDrugs Lists of CredibleMeds [1] for identification of QT prolonging drugs. In addition to drugs associated with QT prolongation and TdP, the risk scores considered a wide range of non-pharmacological risk factors [12, 14, 2127].

The risk scores [12, 14, 2127] had similarities regarding the design of the risk score and included risk factors. A strength of the RISQ-PATH score [14, 24] was that it included a high number of risk factors and evidence of the included risk factors was assessed in a previously conducted systematic review [4]. Points were allocated according to evidence level [14]. The risk score by Tisdale et al. [12] allocated weighted points based on log odds ratios for each risk factor. As there is lower evidence of certain risk factors and certain drugs associated with QT prolongation and TdP, it is sensible to allocate fewer points to these variables or exclude them completely from the RAT for better predictive performance. The risk score by Tisdale et al. [12] had good predictive performance and was also included in a CDSS [36]. The RISQ-PATH score had high sensitivity and negative predictive value [14]. The risk scores still need to be validated in prospective studies applying various patient groups and larger sample sizes.

The inclusion of patient-specific risk factors for QT prolongation and TdP in CPOE systems and CDSS varied [19, 20, 3133, 36]. CPOE systems and CDSS could be further developed to minimize alert fatigue [45]. In two included studies, alerts related to the risk for QT prolongation were overridden to a high degree [33, 36]. Two of the studies indicated a reduction in the orders for QT prolonging drugs because of the alerts, but patient outcomes such as TdP were not measured [33, 38]. Böttiger et al. [19] restricted the drug pairs that were included in the CDSS as well as avoided pop-up alerts to avoid a high override rate.

These findings indicate the importance of designing CPOE systems and CDSS with increased sensitivity and specificity and to evaluate them regarding unnecessary generated signals. However, restricting drug pairs seems not to be the only solution to better performance of the RATs. In the updated version of the CPOE in the study by van der Sijs et al. [31], several drugs generating QT alerts were deleted, but the sensitivity problem of the CPOE remained. On the other hand, the RATs including other patient-specific risk factors have been found to excel in outcomes [12, 14, 33, 36]. Thus, further research is needed to optimize the tools for identifying clinically significant risks and patients at risk for QT prolongation and TdP. Future research should also be extended to cover the evaluation of impact of these tools on patient outcomes.

QT correction formulae were not assessed as RATs in our systematic review, but they were mentioned for studies using a correction formula. QT correction formulae have been previously studied [46, 47]. Many of the studies included in our systematic review used Bazett’s correction formula [12, 19, 21, 23, 26, 28, 32, 36, 38, 40, 43, 44]. Bazett’s correction formula has a problem with undercorrection and overcorrection of the QT interval [45]. Fridericia’s correction formula seems to have better performance [47]. This formula was used in five of the included studies [14, 24, 27, 37, 43].

The QT nomogram performed better than Bazett’s correction formula [40]. However, the QT nomogram [40], the ½ RR rule [43], and the T wave analysis tool [44] did not include other patient-specific risk factors than ECG parameters. The QTc is an indicator of risk of TdP, but it does not identify all risks [6, 11, 12]. If these RATs are used, patient-specific risk factors need to be assessed along with other RATs or based on clinical judgment. A survey from 2005 showed that the majority of the healthcare practitioners responding to the questionnaire could not correctly identify factors and drugs that may prolong the QT interval [49]. The T wave analysis tool [44] has potential as a RAT, but if used without considering QTc, it had lower predictive performance [44]. Further studies are needed of the QT nomogram [40] and the T wave analysis tool [44] that apply various patient groups and larger sample sizes, as well as other QT prolonging drugs.

Overall, the RATs may increase safety of patients at risk of QT prolongation or TdP as the studies either showed that a wide range of patient-specific risk factors were considered: ordering of QT prolonging drugs decreased, monitoring of patients increased, advice was given, or healthcare providers were trained about risks associated with QT prolongation. As older adults have an increased risk for QT prolongation and TdP due to the presence of multiple risk factors [6, 7], RATs, considering a wide range of patient-specific risk factors, may be useful when contemplating the use of QT prolonging drugs with these patients. Based on this systematic review, various RATs may be used in combination, e.g., risk scores for QT prolongation and TdP may be incorporated into CDSS and CPOE systems for easier access and risk assessment. Using RATs in combination may also identify high-risk patients from low risk patients and may reduce unnecessary alerts.

This systematic review may assist in the decision to select and use RATs for QT prolonging pharmacotherapy. The included RATs could be further developed to fit different health information systems. There is potential for the RATs to be adapted to outpatient settings and furthermore assist other healthcare providers other than physicians, e.g., community pharmacists [19, 39]. However, no guidelines for managing risks of QT prolongation in primary care or community pharmacies exist and more training on the topic is needed to adopt the tools in clinical practice [4850].

The review was conducted according to the PRISMA guidelines [15, 16]. In the process, the GRADE approach was considered [18]. In the search process, the included studies found in PubMed were searched for additional studies. Since only duplicates were found in the search of the Scopus database, another database could possibly have been searched. Most of those included were observational studies, which are more prone to bias because of their design [51]. However, since TdP occurs rarely [40, 52], an observational study design can be useful in finding these events. Clinical validation of QT-RATs with RCT studies is needed, as our systematic review shows that studies are mainly observational or quasi-experimental studies and RCT studies were not found. A comprehensive bias assessment using a bias assessment tool would be useful in future studies, as it was not conducted in this study. Risk of bias assessment was conducted only by the authors, considering bias assessment presented in the included studies.

Conclusions

Most of the RATs for QT prolonging pharmacotherapy give a comprehensive overview on patient-specific risks of QT prolongation and TdP and reduce modifiable risk factors and actual events. There is potential for the RATs to be adapted to different health information systems in inpatient and outpatient settings. Studies on outcomes of using individual RATs and combining various tools and clinical validation of QT-RATs with RCT studies are needed.