Does Central Statistical Monitoring Improve Data Quality? An Analysis of 1,111 Sites in 159 Clinical Trials

de Viron, Sylviane; Trotta, Laura; Steijn, William; Young, Steve; Buyse, Marc

doi:10.1007/s43441-024-00613-w

Does Central Statistical Monitoring Improve Data Quality? An Analysis of 1,111 Sites in 159 Clinical Trials

Original Research
Open access
Published: 09 February 2024

Volume 58, pages 483–494, (2024)
Cite this article

Download PDF

You have full access to this open access article

Therapeutic Innovation & Regulatory Science Aims and scope Submit manuscript

Does Central Statistical Monitoring Improve Data Quality? An Analysis of 1,111 Sites in 159 Clinical Trials

Download PDF

Sylviane de Viron ORCID: orcid.org/0000-0003-3890-557X¹,
Laura Trotta¹,
William Steijn¹,
Steve Young² &
…
Marc Buyse^1,3,4

1935 Accesses
1 Altmetric
Explore all metrics

Abstract

Background

Central monitoring aims at improving the quality of clinical research by pro-actively identifying risks and remediating emerging issues in the conduct of a clinical trial that may have an adverse impact on patient safety and/or the reliability of trial results. This paper, focusing on statistical data monitoring (SDM), is the second of a series that attempts to quantify the impact of central monitoring in clinical trials.

Material and Methods

Quality improvement was assessed in studies using SDM from a single large central monitoring platform. The analysis focused on a total of 1111 sites that were identified as at-risk by the SDM tests and for which the study teams conducted a follow-up investigation. These sites were taken from 159 studies conducted by 23 different clinical development organizations (including both sponsor companies and contract research organizations). Two quality improvement metrics were assessed for each selected site, one based on a site data inconsistency score (DIS, overall -log₁₀ P-value of the site compared with all other sites) and the other based on the observed metric value associated with each risk signal.

Results

The SDM quality metrics showed improvement in 83% (95% CI, 80–85%) of the sites across therapeutic areas and study phases (primarily phases 2 and 3). In contrast, only 56% (95% CI, 41–70%) of sites showed improvement in 2 historical studies that did not use SDM during study conduct.

Conclusion

The results of this analysis provide clear quantitative evidence supporting the hypothesis that the use of SDM in central monitoring is leading to improved quality in clinical trial conduct and associated data across participating sites.

The impact of clinical trial monitoring approaches on data integrity and cost—a review of current literature

Article 04 January 2016

Does Central Monitoring Lead to Higher Quality? An Analysis of Key Risk Indicator Outcomes

Article Open access 21 October 2022

Evaluating Source Data Verification as a Quality Control Measure in Clinical Trials

Article Open access 01 November 2014

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

For years, regulatory agencies such as FDA and EMA have required that the conduct and the progress of clinical trials be monitored to ensure patient protection and the reliability of trial results [1, 2]. Until recently, the primary approach to meeting this requirement included frequent visits to each investigative site by designated site monitors who manually reviewed all of the patient source data to ensure it was reliably reported to the trial sponsor—a practice known as 100% source data verification (SDV) [3,4,5,6,7]. However, a major revision to the ICH GCP guidance was published in 2016 which encouraged the use of central monitoring to support a more effective and efficient approach to monitoring trial conduct across all sites [8].

Central monitoring, which is a component of risk-based quality management (RBQM), aims to detect emerging quality-related risks (either pre-identified or unanticipated risks) proactively during a clinical trial, resulting in study team mitigating risks and addressing any confirmed issues and therefore drive higher-quality outcomes [1]. By doing so, central monitoring limits the amount of data impacted by confirmed issues and prevents these issues from affecting future data.

The term “quality” as used here and throughout this paper is intended to refer to the extent to which the observed conduct of a study is contributing to the GCP imperative of “human subject protection and reliability of trial results.” [8].

A variety of tools may be applied to support central monitoring, but the following two methods are most commonly used [9]:

(a)
Statistical Data Monitoring (SDM)—The execution of a number of statistical tests against some or all of the patient data in a study, which are designed to identify highly atypical data patterns at sites that may represent various systemic issues in the conduct of the study. The types of issues identified may include fraud, inaccurate data recording, training issues, and study equipment malfunction or miscalibration [1, 3, 10,11,12,13,14,15].
(b)
Key Risk Indicators (KRIs)—Metrics that serve as indicators of risk in specific targeted areas of study conduct. Sites that deviate from an expected range of values (i.e., risk thresholds) for a given KRI are flagged as “at risk.” The risk thresholds can be discrete values (e.g., procedure compliance rate < 10%) or statistically determined (e.g., P-value < 0.05) based on a comparison of data between the site and the trend across all sites in the study [1, 10,11,12,13, 16]. Note that quality tolerance limits (QTLs) as referenced in ICH E6 (R2) are quite similar in concept to KRIs and may be considered a designated type of study-level KRI [17].

Few analyses to date have assessed the impact and the performance of central monitoring for a large pool of studies. There are examples of analyses focusing on the quantitative performance of central monitoring but they have typically analyzed one or only a few datasets or studies retrospectively and generally used simulated data [6, 18, 19]. Papers analyzing ongoing studies have more often explored the adoption of central monitoring rather than its impact on quality [9]. Hence, efforts are still required to quantify the impact of central monitoring on improving quality in clinical trials.

This paper is the second of a series that aims to quantify the impact of central monitoring on quality in clinical studies. The first paper focused on KRIs and found that the use of KRIs is leading to quality improvement in the majority of at-risk study sites (83%) [16]. This paper presents the results of an analysis of quality improvement metrics associated with the use of SDM as part of central monitoring. Specifically, our hypothesis is that the detection of risks (i.e., potential issues) identified through the use of SDM and acted upon by study teams result in higher levels of quality as measured by two metrics that are described in the ‘Quality Improvement Analysis’ section of this article. To assess whether the obtained results might be due to random play of chance, we further applied the same methods to two studies available to us that did not use central monitoring—neither SDM nor KRIs—during the conduct phase of the studies. While the number of studies available for comparison was small, it provided an interesting set of data and results for comparison with the primary analysis.

Materials and Methods

Central Monitoring Solution

A central monitoring software built on statistical algorithms [3,4,5,6,7] was used to generate the data for this analysis. The platform was launched in 2015 to support various RBQM processes, including central monitoring and SDM [14, 16].

Central monitoring including SDM typically involves the analysis of data at regular intervals (e.g., monthly) during the conduct of a study. SDM analyzes clinical data collected from various sources, including electronic Case Report Forms (eCRFs), central laboratories, electronic Patient-Reported Outcome (ePRO) and electronic Clinical Outcome Assessment (eCOA) systems, and wearable technologies. When SDM identifies a site that has exceeded a risk alert threshold (e.g., P-value < 0.05) for a specific statistical test, the system triggers the creation of a risk signal for review and follow-up by members of the study team. Based on an initial review of the risk signal, the study team decides to close it directly if they conclude that it does not represent an actual issue or to open it if further investigation and/or remediation is needed. A risk signal typically remains open until the study team determines that it is either resolved or no longer applicable (e.g., site or study closure and inability to remediate) [16]. All risk signals are closed by the end of the study.

The SDM tool referenced in this analysis applies a battery of standard statistical tests that detect sites with atypical data patterns compared to all study sites [3,4,5,6,7]. The approach is considered “unsupervised” since the same set of standard tests run against all of the clinical data for each study and not pre-directed by a study-specific assessment of risk. Additionally, the tool computes a “Data Inconsistency Score” (DIS) for each site summarizing all statistical test results, which is used to rank sites from the most atypical to the least atypical [6]. The approach is based on the following principles:

(a) data coming from the various sites participating in a clinical trial should be largely similar, within normal variability limits (e.g., in multi-regional clinical trials) [15];

(b) a battery of standard statistical tests are applied to the patient data, where each test compares the distribution of the data in one site (“observed value”) with all study sites (“expected value”) (e.g., compare the mean systolic blood pressure of patients in one site to the mean across all study sites) [4];

(c) tests that are relevant given the type of each variable (continuous, categorical, or date variable) are systematically applied to all patient-level data in a completely unsupervised manner, regardless of their clinical importance, meaning or potential impact on the outcome of the trial [6]. For each variable in a trial between 1 and 8 statistical tests are applied;

(d) mixed-effects models (including both fixed and random effects) are used to allow for the natural variations between the sites, as data coming from all sites should be comparable and statistically consistent [5, 7, 14]; and.

(e) an overall “Data Inconsistency Score” (DIS) is computed for each site across all statistical tests performed on each clinical variable collected in the trial (at least 500 different P-values are typically computed for each site) to provide a summary metric at the site level. It is computed as the mean, on a log scale, of the P-values of all statistical tests performed. For each site, a weighted geometric mean is calculated with down-weighting of highly correlated tests and a resampling procedure is used to assign a P-value to the weighted geometric mean as described by Trotta et al. [6]. The DIS is the −log₁₀[P-value] transformation. A DIS of 1.3 or larger corresponds to an overall P-value less than 0.05 and as such it flags a site whose data significantly differ from the data of all study sites. Note that the DIS is not adjusted for multiplicity, so it may flag more than 5% of the sites even if none are truly atypical. This feature of the SDM tool is considered desirable to err on the side of conservatism (i.e., flagging of too many sites for further inspection).

Selection of Data

The analysis was performed using data collected in the platform from September 1st, 2015 up to February 1st, 2023. The scope of the analysis included sites meeting the following criteria (Figs. 1 and 2):

1.
Site belongs to a study that is completed and for which all risk signals were closed.
2.
One or more risk signals were created for the site based on the SDM test results and were investigated (signal opened) and eventually closed by the study team.
3.
The site’s DIS was > 1.3 at the time of risk signal(s) creation.

These criteria were defined to ensure availability of evidence covering the full history of each SDM risk signal processed by the study team from initiation through closure and where remediation of site issues and subsequent improvement of data quality would be an expected outcome.

Quality Improvement Analysis

The first step in the analysis was to compute the following two quality improvement metrics for the selected sites:

Site DIS Improvement Rate: The total percent change in the site’s DIS from the time the site was first detected at risk (snapshot of data when the first risk signal was opened) until all risk signals at the site were closed by the study team (snapshot of data when the last risk signal is closed). The following formula was applied:

$$\frac{{ - \left( {DIS_{{\text{C}}} - DIS_{O} } \right)}}{{DIS_{O} }}$$

where: DIS_O—Site’s DIS score when it exceeded 1.3 and at least one risk signal was open (“opening DIS”). DIS_C—Site’s DIS score when all risk signals at the site were closed (“closing DIS”).

Figure 1 illustrates this formula with an example, where at month 8 a site was detected with a DIS over 1.3 (DIS_O = 1.57) and 3 signals were opened. Six months later, all risk signals for the site were closed (DIS_C = 1.04). The site DIS improvement rate in this example is $\frac{-(1.04 -1.57)}{1.57}$ which is equal to 34%. Note that in cases where a site’s DIS increases over this period of time (DIS_C > DIS_O), the site DIS improvement rate will take on a negative value indicating a degradation in quality rather than an improvement.

Observed Value Improvement Rate: For each statistical test linked to a risk signal for a given selected site, the percent change of the observed value relative to the overall study estimate (i.e., expected value), from the time the risk signal was first opened until it was closed by the study team. The following formula is applied:

$$- \left[ {\frac{{\frac{{\left( {O_{C} { } - { }E_{C} } \right)}}{{E_{C} }}{ } - \frac{{\left( {O_{O} { } - { }E_{O} } \right)}}{{E_{O} }}{ }}}{{\frac{{\left( {O_{O} { } - { }E_{O} } \right)}}{{E_{O} }}}}} \right]$$

where: O_O—The site’s observed value when the risk signal was opened. E_O—The expected value when the risk signal was opened. O_C—The site’s observed value when the risk signal was closed. E_C—The expected value when the risk signal was closed.

Figure 3 illustrates the formula with an example of one observed value linked to a risk signal for a site. In particular, at month 8 the site was detected to have a low mean patient temperature (O_O = 35.5 °C) for their patients compared to the study average (E_O = 36.9 °C). Based on that finding, the study team opened a risk signal to investigate and remediate with the site as needed. The risk signal was closed six months later and the mean patient temperature at closure was significantly closer to the study average (O_C = 36.3 °C vs. E_C = 37.0 °C). The observed value improvement rate in this is example is

$$- \left[ {\frac{{\frac{{\left( {36.3 - 37.0} \right)}}{37.0} - \frac{{\left( {35.5 - 36.9} \right)}}{36.9}}}{{\frac{{\left( {35.5 - 36.9} \right)}}{36.9}}}} \right]$$

which is equal to 50%, meaning that the site’s observed value was 50% closer to the expected value than when the risk signal was first opened. Note that in cases where a site’s observed value moves further away from the study average over this period of time, the site observed value improvement rate will take on a negative value indicating a degradation in quality rather than an improvement.

Additionally, 95% Wilson score confidence intervals were used to estimate the rate of sites DIS improvement and observed value improvement [20].

Comparison to Studies with No SDM

Data from two historical studies were available to us for which no central monitoring (i.e., SDM or KRIs) was performed during the conduct of the studies. These two studies were used for a comparison analysis to assess the difference in rate of site DIS improvement between studies using SDM and those not using SDM. Study 1 was a neurological study that included 60 sites and 7000 patients. Study 2 was a study in endocrinology that included 370 sites and 3500 patients.

A post-hoc SDM analysis was performed on the final completed study database for each of the two comparison studies, and then iteratively re-executed (retrospectively) on 3 versions of the trial database representing progressively earlier timepoints in the progression of each study. The calendar date by which a specified percentage of the total patient visits had been conducted was used as the cut-off date for each earlier version, and only patient data generated up to this calendar date were included in the analysis of that version.

For each of the two studies, we calculated the Site DIS improvement rate using the same formula as in the main analysis, with slight adaptations of DIS_O and DIS_C definitions (Fig. 4):

DIS_O—Site’s DIS score when it first exceeded 1.3; i.e., the earliest database iteration at which this was observed. This represents the point in time when a study team would have typically opened risk signals for the site if central monitoring and SDM had been employed.
DIS_C—Site’s DIS score on the final completed study database. Since no risk signals were opened or closed for sites on these two studies, there is no meaningful milestone earlier than the end of the study at which to assess a site’s “closing” DIS. The DIS observed at the end of the study is used instead, which enables an assessment of the natural evolution of an “at-risk” site’s DIS in the absence of SDM.

Additionally, 95% Wilson score confidence intervals were used to estimate the DIS improvement in both studies [20].

Results

In total, 1111 sites across 159 studies using SDM were selected (from 23 different sponsor and contract research organizations) (Table 1).

Table 1 Characteristics of the included studies

Full size table

The overall landscape of clinical trials was fairly represented, with studies selected from a broad range of therapeutic areas and study sizes (number of patients and sites). Infectious disease was the highest represented therapeutic area with 45 studies (28%), which included a median of 1,440 patients and 48 sites. Additionally, all clinical phases were represented in the 159 studies selected from phase 1 (N = 13, 8.2%) to phase 3 (N = 98, 62%) (Table 1).

Quality Improvement Analysis

Overall, a lower DIS (i.e., quality improvement) was observed in 83% of the sites (95% CI, 80–85%). Additionally, 64% of the sites had a closing DIS lower than 1.3. Across all sites, the site DIS improvement rate was 46% on average. Those results remained very similar across therapeutic areas and study phases (Table 2).

Table 2 Rate of sites with improved DIS and site DIS Improvement rate

Full size table

For the two comparison studies (for which central monitoring including SDM had never been used), a lower DIS was observed in 56% of the sites (95% CI, 41–70%) and the site DIS improvement rate was 17% (Table 2).

For the sites with improving DIS, 71% of the observed values moved closer to the expected values and 51% of them were no longer statistically significant when the risk signal was closed. Note that 20% of the observed values had no change in the number of records from risk signal open to close. Hence, in these cases there was no opportunity for the observed values to improve except for the possibility of data entry corrections to existing data records. Additionally, the observed values were on average 45% closer to the expected values when the risk signal was closed. The rate of improving observed values remained very similar across the different statistical tests and dataset domains (Table 3).

Table 3 Rate of observed values that improved and observed value improvement rate among sites with an improved DIS

Full size table

Two Sample Sites

Figure 5 displays the evolution of the DIS and risk signal scores for two sample sites from this analysis. The first site shows a DIS improvement and the second one a worsening DIS. Figure 5.A shows a site with improving DIS and risk signal scores in a dermatology study. The site was first flagged with a DIS of 1.45 (P = 0.035). Two risk signals were created and both risk signals were no longer significant at the time of risk signal closure. Additionally, when the risk signals were closed, the DIS was no longer statistically significant (DIS_C = 0.75, P = 0.18). The first risk signal represented a very high disease response rate. After investigation, it appears that it was due to a data entry error. The error was corrected and at the time of the risk signal closure 39 additional disease response scores were added. The second risk signal flagged a low volume of drug dispensation among the different patients of the site. The Clinical Research Associate (CRA) checked weighing techniques and the calibration of the tool. After investigation, the issue was due to a misunderstanding of the reporting requirements. At risk signal closure, no additional erroneous results were reported.

Figure 5.B shows a site in a gastroenterology study with a DIS that did not improve (DIS_O = 1.6, P = 0.025; DIS_C = 3.31, P = 0.0005). In that site, a total of 19 risk signals were created. Fifteen risk signals were not selected in the current analysis as they were all immediately closed (i.e., not investigated as they do not represent quality risk for the study team). Those risk signals clearly described a site with an atypically unhealthy population. Additionally, the site belonged to a country with specific protocol requirements in which some assessments were not applicable. Neither of these explanations represented data quality issues and therefore, as expected, most of the 15 risk signals did not show improvement at closure. The remaining 4 risk signals were investigated by the study team (i.e., opened) and 3 of them improved at closure. Those risk signals flagged missing data along with AEs that were not reported when expected. At the time of risk signal closure, the AE reporting rate increased and the missing data were provided.

Discussion

Central monitoring, including the use of SDM software, is generally designed for the purpose of continually identifying sites that are deviating from an expected pattern of quality behavior, so that study teams can intervene at those sites and address any confirmed issues [1, 2, 10]. The results of the current analysis provide clear evidence that a majority of the sites flagged by this approach show a significant level of quality improvement, across all therapeutic areas and study phases.

This conclusion relies on the premise that the two metrics used in this analysis are valid indicators of quality improvement. Site DIS provides a measure of the overall level of atypicality of the patient data (i.e., risk data quality issue) reported from each site, which is not by itself a conclusive indicator of poor quality. Indeed, as shown in the second example (Fig. 5.B), some sites will have a high DIS because they enrolled an atypical group of patients (e.g., older and more severe condition or disease at baseline) which the study team determines does not represent an actual quality issue. Nevertheless, when investigation of atypical data patterns leads to the confirmation of quality issues at a site, those atypical data patterns become a definitive indicator of poor quality. It is then clearly expected that remediation of the identified issues should result in generation of less atypical data at the site and a correspondingly lower site DIS. The same expectation follows for the site’s observed values on which the data atypicality is measured; i.e., those values should move closer to the expected estimate across all sites in the study. For example, if the rate of patient adverse events (AEs) reported at a site was atypically low and confirmed to be an issue (e.g., site mistakenly thought that they were only supposed to report serious AEs), re-training of the site should result in a subsequent increase in the observed AE reporting rate bringing it closer to the average rate across the study.

A theoretical concern exists that sites observed with a high DIS at one timepoint will naturally tend toward a lower DIS subsequently due to a regression-to-the-mean effect [21]. Indeed, selecting sites with a high DIS means by definition that we are selecting sites at the tail of the distribution. Therefore, by play of chance, there is a high probability that the DIS of the same site becomes less extreme in subsequent timepoints, resulting in an improving DIS for the site Without taking into account the regression-to-the-mean effect, the baseline assumption (i.e., if central monitoring had not been used) is that a site DIS has a 50% chance on average to improve, as a coin flip probability. The results of the analysis on two historical studies not using SDM showed that only slightly more than half of the sites (56%, 95% CI 41–70%) with an initially high DIS were observed to have a lower DIS at study closure, which points to a dominant effect of regression-to-the-mean. However in studies using SDM, this rate increased to 83% (95% CI 80–85%). The confidence intervals from studies not using SDM and those using SDM do not overlap, which suggests that DIS improvements are seen as a result of the SDM approach. However, the non-randomized nature of this comparison, and the limited evidence from studies not using SDM, both call for caution in interpreting this observed difference.

While the results of the current analysis are quite positive—83% of sites with improved DIS and 71% of observed values improving—one might ask why the level of improvement was not even better than this? In particular, 17% of the flagged sites did not end up with an improved DIS and 29% of the observed values did not improve. There are actually multiple factors that explain why some sites do not show improvement. First, as previously mentioned, data atypicality is not a definitive indicator of poor quality and in some cases, the observed atypicality is found to be explainable and simply does not reflect a quality issue. In these cases we would not expect the observed atypicalities to moderate on average following study team review. This is illustrated in a study in which a site recruited mostly older (though still eligible) patients who accordingly had a higher number of medical histories and higher rate of safety and efficacy findings [3].

A second reason for lack of observed improvement is in situations where, by the time the data atypicalities at a site are investigated and issues confirmed, all patients at that site have completed their participation in the study. In such cases there is no further patient data to be generated and/or reported from the site and therefore no opportunity to observe improvement in the data. While we could not quantify the contribution of these two factors on the overall results of our study, it can be hypothesized that they explain some of the observed non-improvement and that the actual rate of improvement is higher than that observed.

We observed no marked difference in the rate of improving sites or the size of improvement across the different therapeutic areas or study phases. This supports a conclusion that SDM is beneficial in a broad range of clinical trials, which is consistent with FDA and EMA recommendations [1, 2]. All data collected during a clinical trial are at risk of data quality issues [4, 14, 15]. However, some factors may increase those risks, including the following: complex study protocols, complicated eCRF and database designs, and poor site training [22, 23]. This is why identifying and controlling risks related to a clinical trial both prior to and during the trial is essential.

A limitation of the current analysis is that we assessed metrics of improved quality only for risks that were identified by the SDM solution and subsequently acted upon by the study team. Therefore, the analysis did not assess how effective SDM is at identifying all of the relevant issues in a clinical trial. Instead, it assesses to what extent study team follow-up on identified risks is resulting in improved quality.

A second limitation is related to the comparison with studies that did not use SDM, for which data from only two studies were available. Although the comparison is based on a more limited volume of data, the results do suggest that the significant level of improvement observed for SDM studies is not due to the play of chance.

This paper complements a previous paper showing that the use of KRIs was effective at improving quality in clinical trials. As mentioned in that paper, "it is important to recognize that improved quality does not come automatically through implementation of central monitoring. The degree of success achieved is highly dependent on the thoughtful design and implementation of all central monitoring tools (including KRIs) and risk follow-up processes.” [16].

Conclusion

These results provide quantitative evidence that central monitoring including SDM, which is recommended by regulatory agencies [1, 2], is resulting in improved quality. When properly implemented, managed and followed-up, SDM enables a targeted approach to identifying and addressing emerging quality-related risks during a study.

Conflict of interest

SdV, LT, WS, and SY are employees of CluePoints. LT, SY, and MB hold stock in CluePoints.

Data Availability

Not applicable.

References

US Department of Health and Human Services, Food and Drug Administration. Guidance for Industry: Oversight of Clinical Investigations – A Risk-Based Approach to Monitoring [Internet]. 2013 [cited 2023 Feb 16]. Available from: https://www.fda.gov/media/116754/download
EMA Guidance. Reflection paper on risk based quality management in clinical trials [Internet]. 2013 [cited 2023 Feb 16]. Available from: https://www.ema.europa.eu/en/documents/scientific-guideline/reflection-paper-risk-based-quality-management-clinical-trials_en.pdf
de Viron S, Trotta L, Schumacher H, Lomp HJ, Höppner S, Young S, et al. Detection of fraud in a clinical trial using unsupervised statistical monitoring. Ther Innov Regul Sci. 2022;56(1):130–6.
Article PubMed Google Scholar
Venet D, Doffagne E, Burzykowski T, Beckers F, Tellier Y, Genevois-Marlin E, et al. A statistical approach to central monitoring of data quality in clinical trials. Clin Trials Lond Engl. 2012;9(6):705–13.
Article Google Scholar
Desmet L, Venet D, Doffagne E, Timmermans C, Burzykowski T, Legrand C, et al. Linear mixed-effects models for central statistical monitoring of multicenter clinical trials. Stat Med. 2014;33:5265–79.
Article CAS PubMed Google Scholar
Trotta L, Kabeya Y, Buyse M, Doffagne E, Venet D, Desmet L, et al. Detection of atypical data in multicenter clinical trials using unsupervised statistical monitoring. Clin Trials Lond Engl. 2019;16(5):512–22.
Article Google Scholar
Desmet L, Venet D, Doffagne E, Timmermans C, Legrand C, Burzykowski T, et al. Use of the beta-binomial model for central statistical monitoring of multicenter clinical trials. Stat Biopharm Res. 2017;9(1):1.
Article Google Scholar
Integrated Addendum to ICH E6(R1): Guideline for Good Clinical Practice: E6(R2) [Internet]. [cited 2023 Feb 16]. Available from: https://www.ich.org/page/efficacy-guidelines
Adams A, Adelfio A, Barnes B, Berlien R, Branco D, Coogan A, et al. Risk-Based Monitoring in Clinical Trials: 2021 Update. Ther Innov Regul Sci [Internet]. 2023;57(3):529–37.
Article PubMed Google Scholar
Wilson B, Provencher T, Gough J, Clark S, Abdrachitov R, de Roeck K, et al. Defining a central monitoring capability: sharing the experience of transcelerate biopharma’s approach, Part 1. Ther Innov Regul Sci. 2014;48(5):529–35.
Article PubMed Google Scholar
Gough J, Wilson B, Zerola M, Wallis P, Mork L, Knepper D, et al. Defining a central monitoring capability: sharing the experience of transcelerate biopharma’s approach, Part 2. Ther Innov Regul Sci. 2016;50(1):8–14.
Article PubMed Google Scholar
Barnes S, Katta N, Sanford N, Staigers T, Verish T. Technology Considerations to Enable the Risk-Based Monitoring Methodology. Ther Innov Regul Sci. 2014;48(5):536–45.
Article PubMed Google Scholar
TransCelerate. Position paper: risk-based monitoring methodology [Internet]. 2013 [cited 2023 Feb 16]. Available from: https://pdf4pro.com/amp/view/position-paper-risk-based-monitoring-methodology-21432f.html
Timmermans C, Venet D, Burzykowski T. Data-driven risk identification in phase III clinical trials using central statistical monitoring. Int J Clin Oncol. 2016;21(1):38–45.
Article PubMed Google Scholar
George SL, Buyse M. Data fraud in clinical trials. Clin Investig. 2015;5(2):161–73.
Article CAS Google Scholar
de Viron S, Trotta L, Steijn W, Young S, Buyse M. Does central monitoring lead to higher quality? an analysis of key risk indicator outcomes. Ther Innov Regul Sci. 2023;57(2):295–303.
Article PubMed Google Scholar
Wolfs M, Bojarski Ł, Young S, Cesario L, Makowski M, Sullivan LB. Quality tolerance limits’ place in the quality management system and link to the statistical trial design: case studies and recommendations from early adopters. Ther Innov Regul Sci. 2023;57(4):839–48.
Article PubMed PubMed Central Google Scholar
van den Bor RM, Vaessen PWJ, Oosterman BJ, Zuithoff NPA, Grobbee DE, Roes KCB. A computationally simple central monitoring procedure, effectively applied to empirical trial data with known fraud. J Clin Epidemiol. 2017;1(87):59–69.
Google Scholar
Goldstein M, Uchida S. A Comparative evaluation of unsupervised anomaly detection algorithms for multivariate data. PLoS ONE. 2016;11(4):e0152173.
Article PubMed PubMed Central Google Scholar
Wallis S. Binomial confidence intervals and contingency tests: mathematical fundamentals and the evaluation of alternative methods. J Quant Linguist. 2013;20(3):178–208.
Article Google Scholar
Barnett AG, van der Pols JC, Dobson AJ. Regression to the mean: what it is and how to deal with it. Int J Epidemiol. 2005;34(1):215–20.
Article PubMed Google Scholar
Adachi K, Shirase M, Kimura Y, Kuboki Y, Yoshino T. What and how will the Risk Based Approach to Monitoring change? Survey of RBM in medical institutions. J Soc Clin Data Manag [Internet]. 2022. https://doi.org/10.47912/jscdm.18.
Article Google Scholar
Stokman PG, Ensign L, Langeneckhardt D, Mörsch M, Nuyens K, Herrera D, et al. Risk-based Quality Management in CDM An inquiry into the value of generalized query-based data cleaning. J Soc Clin Data Manag [Internet]. 2021. https://doi.org/10.47912/jscdm.20.
Article Google Scholar

Download references

Funding

This research received no funding other than from the authors’ companies. The data analyzed in this paper were generated by CluePoints’ risk-based quality management (RBQM) platform.

Author information

Authors and Affiliations

CluePoints S.A, Avenue Albert Einstein, 2a 1348, Louvain-la-Neuve, Belgium
Sylviane de Viron, Laura Trotta, William Steijn & Marc Buyse
CluePoints Inc, King of Prussia, USA
Steve Young
International Drug Development Institute (IDDI), Louvain-la-Neuve, Belgium
Marc Buyse
Interuniversity Institute for Biostatistics and Statistical Bioinformatics (I-BioStat), Hasselt University, Hasselt, Belgium
Marc Buyse

Authors

Sylviane de Viron
View author publications
You can also search for this author in PubMed Google Scholar
Laura Trotta
View author publications
You can also search for this author in PubMed Google Scholar
William Steijn
View author publications
You can also search for this author in PubMed Google Scholar
Steve Young
View author publications
You can also search for this author in PubMed Google Scholar
Marc Buyse
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

All authors contributed to the design of this study and participated to the interpretation of the results. SdV, LT, and WS performed the analyses. SdV drafted the manuscript. All authors revised and approved the final version of the paper and are accountable for all aspects of the work.

Corresponding author

Correspondence to Sylviane de Viron.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

de Viron, S., Trotta, L., Steijn, W. et al. Does Central Statistical Monitoring Improve Data Quality? An Analysis of 1,111 Sites in 159 Clinical Trials. Ther Innov Regul Sci 58, 483–494 (2024). https://doi.org/10.1007/s43441-024-00613-w

Download citation

Received: 26 May 2023
Accepted: 08 January 2024
Published: 09 February 2024
Issue Date: May 2024
DOI: https://doi.org/10.1007/s43441-024-00613-w

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Does Central Statistical Monitoring Improve Data Quality? An Analysis of 1,111 Sites in 159 Clinical Trials