Introduction

Randomized clinical trials (RCTs) are the study design of choice for drawing inferences about a potential causal relationship between treatment and patient outcomes1. However, in clinical practice settings, personalized or precision medicine, tailored to individual patients' characteristics, has called into question the value of the average treatment effects estimated by RCTs when dealing with target populations that usually differ from those represented by RCT participants2.

In this regard, some of the shortcomings of conventional medicine, which personalized medicine seeks to address, include differences in treatment response and in the incidence of adverse reactions based on individual variation. In personalized medicine, the focus is on identifying which interventions will be effective for patients based on their genetic, environmental, and lifestyle factors. By carrying out heterogeneous treatment effect analysis3, researchers can stratify individuals into subpopulations that differ in their susceptibility to a particular disease or in their response to a specific treatment, and identify who benefits most from a particular treatment instead of relying on an average effect estimated in a general population.

The need for new tools to store, manage, and analyze big data has been identified as a critical factor in personalized medicine’s implementation and success4. Much progress is expected from the digitalization of clinical research and reuse of de-identified open data for secondary research purposes in a wide area of health applications5. Given an appropriate data quality level, data-intensive research using machine learning (ML) could be a turning point for biomedical research and personalized medicine6.

ML is an interdisciplinary field aimed at developing models with maximal predictive accuracy, and it is closely tied to the concept of personalized medicine7,8. The distinctive feature of ML algorithms is their capability to improve their predictive performance through experience9. Typical applications include searching for novel patterns10, making a diagnosis or outcome prediction11, and optimizing treatment decisions12. For these reasons, ML is increasingly applied to clinical studies, and it represents a new approach to conducting medical research and developing ways to predict individual outcomes13.

One of the biggest promises of ML is to assist medical decision-making in many domains. A core problem that arises in most data-driven personalized medicine scenarios is the estimation of heterogeneous treatment effects. It arises in RCTs where the goal is to estimate the effect of a treatment on the clinical response as a function of patient characteristics.

Here we discuss both the opportunities and challenges, namely the validation of findings, posed to personalized medicine by the increasing ability to access and analyze open data from RCTs.

This paper aims to investigate the predictive capabilities of ML in clinical trials to find evidence of patient-specific treatment effects (heterogeneity) and to target responsive subgroups of patients. The paper is organized as follows: the Materials and Methods section briefly introduces the illustrative example and presents the ensemble model of supervised ML algorithms. The strategy used to investigate the model's predictive capabilities to find evidence of heterogeneous treatment effects and identify the most responsive patients is also presented.

Methods

Illustrative example

A common concern in applying RCT-based estimates to a target population is that many clinical features that differ between the RCT study population and the target population modify the treatment effect. Our illustrative example is a sub-analysis of a large RCT examining whether DPP-4 inhibitors provide better glycemic control than conventional therapy in patients with type 2 diabetes. In this example, we exploited ML capabilities to identify systematic variation in treatment outcome, separate it from variation due to sampling error, and target responsive subgroups of patients.

To conduct such a heterogeneous treatment analysis, we focused on the PROLOGUE RCT14. The PROLOGUE study is among the largest trials investigating whether DPP-4 inhibitors provide cardiovascular protective effects to patients with type 2 diabetes by slowing the progression of carotid stiffness compared with conventional diabetes treatment.

The study participants were allocated either to add-on DPP-4 inhibitor (Sitagliptin) treatment or to continued therapy with conventional anti-diabetic agents. The primary endpoint was the annual change in arterial stiffness, which did not significantly differ between the two groups. However, the study showed that the decrease in Glycated Haemoglobin (HbA1c) in patients treated with Sitagliptin was greater than with conventional therapy, indicating better glycemic control. As a sub-analysis of the PROLOGUE study, we therefore investigated a potential heterogeneous effect of Sitagliptin on improving HbA1c.

ML algorithms need to learn the statistical dependencies between clinical features and patients' treatment outcomes; we therefore focused on the SAIS1 RCT15 to train the outcome prediction model. SAIS1 is a multicenter, prospective, randomized parallel-group study comparing the effects of the DPP-4 inhibitor Sitagliptin and the sulfonylurea Glimepiride on endothelial function in patients with type 2 diabetes.

Both the SAIS1 and the PROLOGUE RCTs have collected a common subset of patient measures and share the same inclusion and exclusion criteria (see Supplementary Table S1), making them suitable for our investigation’s purpose.

Thus, to evaluate the predictive capabilities of ML to find evidence of heterogeneous treatment effects in an RCT setting, our primary strategy was to train an ML model to learn the statistical dependencies between the reduction of HbA1c at 6 months (outcome) and the clinical characteristics of patients in the treatment arm (i.e., Sitagliptin) of the SAIS1 RCT, and to assess its accuracy. Then, we used the resulting predictive outcome model to compute, for each patient in the PROLOGUE study, the probability of lowering HbA1c. By selecting different probability values defining responders, we identified subgroups of the most responsive patients, on whom we estimated the Sitagliptin effect, assessing the presence of a heterogeneous treatment effect.
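The cross-trial strategy can be sketched end to end. Below is a minimal Python illustration on synthetic data, with a plain gradient-descent logistic regression standing in for the SL ensemble; all data, sizes, and names here are hypothetical stand-ins, not the paper's actual R implementation.

```python
import numpy as np

def fit_logistic(X, y, lr=0.1, n_iter=2000):
    """Plain gradient-descent logistic regression (stand-in for the SL ensemble)."""
    Xb = np.c_[np.ones(len(X)), X]          # add intercept column
    w = np.zeros(Xb.shape[1])
    for _ in range(n_iter):
        p = 1.0 / (1.0 + np.exp(-Xb @ w))   # predicted responder probabilities
        w -= lr * Xb.T @ (p - y) / len(y)   # gradient of the negative log-likelihood
    return w

def predict_proba(w, X):
    Xb = np.c_[np.ones(len(X)), X]
    return 1.0 / (1.0 + np.exp(-Xb @ w))

# Step 1: learn an outcome model on the treatment arm of trial A (SAIS1 stand-in)
rng = np.random.default_rng(1)
X_train = rng.normal(size=(43, 3))
y_train = (X_train[:, 0] + rng.normal(scale=0.5, size=43) > 0).astype(float)
w = fit_logistic(X_train, y_train)

# Step 2: score every patient of trial B (PROLOGUE stand-in)
X_target = rng.normal(size=(385, 3))
prob = predict_proba(w, X_target)

# Step 3: patients above a probability cut-off form the responder subgroup
responders = prob >= 0.5
```

Within each subgroup so defined, the treatment effect can then be re-estimated, as described in the Statistical analysis section.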

All the methods were performed following relevant guidelines and regulations.

Machine learning approach

No single ML algorithm is universally the best-performing technique for all datasets16. We therefore adopted a weighted combination of algorithms, also known as an ensemble. Ensemble algorithms have proved to give accurate estimates across many different fields. The ensemble approach broadens the scope from one to many potential learners, each built on its own assumptions. In fact, despite their flexibility, the performance of ML algorithms on a given problem depends on how well their assumptions fit the data.

We build on the ensemble algorithm called the Super Learner (SL), which uses a cross-validated measure of prediction performance to weight each algorithm's contribution to the final prediction. As with any predictive model, the SL requires relevant predictors to be included. The ensemble approach is a weighted average that allows multiple models to contribute to a prediction in proportion to their estimated performance.

Building an SL requires defining a set of algorithms or learners \(({\Psi}_{1},\dots,{\Psi}_{L})\) appropriate for the classification task. Their classification error is assessed using fivefold cross-validation. All the learners are trained on the same four folds, and their out-of-fold predictions are retained. Then, for each algorithm, the error is estimated: the difference between each observation and the corresponding prediction in the out-of-fold set is averaged. In other words, the mean squared error between the observed outcomes in the out-of-fold set and those predicted by the algorithms fitted on the training folds is estimated.

The estimated error is then averaged across the out-of-fold sets to obtain the cross-validated prediction error of each algorithm. Finally, to compute the contribution of each candidate algorithm to the final Super Learner prediction, non-negative least squares is used to regress the actual outcome against the cross-validated predictions. The resulting SL is theoretically proven to be asymptotically at least as good as the best candidate learner17. The ensemble model obtained can then be used to make predictions on new data.
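The cross-validate-then-weight procedure can be sketched in a few lines. The following Python illustration uses two toy base learners (an OLS model and a constant mean predictor) and `scipy.optimize.nnls` for the meta-level regression; it is a simplified stand-in for the actual SuperLearner implementation used in the paper, not a reproduction of it.

```python
import numpy as np
from scipy.optimize import nnls

def kfold_indices(n, k=5, seed=0):
    """Shuffle indices and split them into k roughly equal folds."""
    rng = np.random.default_rng(seed)
    return np.array_split(rng.permutation(n), k)

def fit_linear(X, y):
    """Ordinary least squares via the pseudo-inverse; returns a predict function."""
    Xb = np.c_[np.ones(len(X)), X]
    beta = np.linalg.pinv(Xb) @ y
    return lambda Xnew: np.c_[np.ones(len(Xnew)), Xnew] @ beta

def fit_mean(X, y):
    """Trivial learner: always predicts the training mean."""
    m = y.mean()
    return lambda Xnew: np.full(len(Xnew), m)

def super_learner(X, y, learners, k=5):
    """Cross-validated stacking: out-of-fold predictions -> NNLS meta-weights."""
    n, L = len(y), len(learners)
    Z = np.zeros((n, L))                        # out-of-fold prediction matrix
    folds = kfold_indices(n, k)
    for i in range(k):
        test = folds[i]
        train = np.concatenate([folds[j] for j in range(k) if j != i])
        for l, fit in enumerate(learners):
            model = fit(X[train], y[train])
            Z[test, l] = model(X[test])
    w, _ = nnls(Z, y)                           # regress outcome on CV predictions
    w = w / w.sum() if w.sum() > 0 else np.full(L, 1.0 / L)
    fitted = [fit(X, y) for fit in learners]    # refit each learner on all data
    def predict(Xnew):
        return sum(wl * m(Xnew) for wl, m in zip(w, fitted))
    return predict, w

rng = np.random.default_rng(0)
X = rng.normal(size=(60, 2))
y = X @ np.array([1.0, -0.5]) + rng.normal(scale=0.1, size=60)
predict, weights = super_learner(X, y, [fit_linear, fit_mean])
```

Learners that predict the out-of-fold outcomes poorly receive weight zero (NNLS can zero out coefficients), so the ensemble never does worse than simply ignoring a weak candidate.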

For a sample size of about 50–70 patients, fivefold cross-validation is suggested to assess the accuracy error of the SL18.

A Statistical Model (SM) is a family of probability distributions, indexed by a set of parameters, which embodies the data-generating mechanism19. ML, by contrast, is taken to mean an algorithmic approach that does not use traditionally identified statistical parameters and for which no preconceived structure is imposed on the relationships between predictors and outcomes20.

In the following, a short description of the statistical models (SMs) and ML algorithms used as base learners is provided:

Gradient Boosting Machine (GBM) is a tree-based ML model that recursively adds trees fitted to the residuals of the current model. At each step, it fits a tree-based model to the residuals using the specified list of variables, explaining the variance in the residuals. The total number of trees was set to 500, with an interaction depth of 5, and the learning rate of each iteration was 0.121.

Generalized Linear Model (GLM) with elastic net regularization is a regression method, and as such an SM, that linearly combines the L1 and L2 penalties of the lasso and ridge methods, applied together with a link function and a variance function to overcome limitations of the linear model (such as constant variance around the mean and normality of the data)22.
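For reference, the elastic net estimator minimizes a penalized least-squares criterion (standard textbook form, not reproduced from the trial analyses; \(\lambda\) controls the overall penalty strength and \(\alpha \in [0,1]\) the lasso/ridge mix):

```latex
\hat{\beta} = \arg\min_{\beta}
  \left\{ \sum_{i=1}^{n} \left( y_i - x_i^{\top}\beta \right)^2
  + \lambda \left[ \alpha \lVert \beta \rVert_1
  + \frac{1-\alpha}{2} \lVert \beta \rVert_2^2 \right] \right\}
```

Setting \(\alpha = 1\) recovers the lasso and \(\alpha = 0\) recovers ridge regression.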

Multivariate Adaptive Regression Splines (MARS) is an SM that uses a non-parametric regression method to model nonlinearities and interactions between covariates23.

Random Forest (RF) is a typical ML technique that works by recursively creating decision trees. It selects a subset of the available features and recursively partitions the data in the regression space until the variation within each subspace is small enough. Tree growing is a greedy procedure and, as a result, does not necessarily converge to the globally optimal solution. Bagging methods, which ensemble locally optimal trees, mitigate this problem. The ensemble of such trees is known as a forest24,25.

Classification and Regression Trees (CART) are ML methods that construct prediction models by recursively partitioning the data space and fitting a simple prediction model within each partition. As a result, the partitioning can be represented graphically as a decision tree26.

Bayesian Additive Regression Trees (BART) is an ML ensemble method that relies on a prior-regularized sum-of-trees model, which prevents individual fitted trees from being dominant, powered by an iterative Bayesian back-fitting MCMC algorithm based on a likelihood built on the data in the terminal nodes. Sums of regression trees have a remarkable ability to capture interactions, non-linearities, and additive effects27.

Support-Vector Machine (SVM) is an ML method based on projecting the feature space into a sufficiently high-dimensional space (possibly of infinite dimension) in which the classes become linearly separable by a hyperplane. SVMs are among the most widely used ML techniques for classification since they ensure low computational complexity28.

Statistical analysis

We set as outcome an improvement of at least 0.5% in HbA1c (i.e., ΔHbA1c ≤ −0.5%), obtaining a dichotomized outcome, according to guidelines that consider a difference of 0.5% (5.5 mmol/mol) to be clinically significant29.
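As a minimal illustration of the dichotomization rule (the ΔHbA1c values below are made up for the example), the outcome is a simple threshold on the observed change:

```python
import numpy as np

# Hypothetical 6-month HbA1c changes (percentage points) for four patients
delta_hba1c = np.array([-0.8, -0.3, -0.5, 0.1])

# Responder = reduction of at least 0.5%, i.e. change <= -0.5
responder = (delta_hba1c <= -0.5).astype(int)
print(responder)  # → [1 0 1 0]
```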

As predictive covariates, we used the common clinical patient characteristics collected by the two RCTs: age (years), gender (female, male), Body Mass Index (BMI, kg/m2), Systolic Blood Pressure (SBP, mmHg), Diastolic Blood Pressure (DBP, mmHg), hypertension (SBP ≥ 130 mmHg or DBP ≥ 80 mmHg), Low-Density Lipoprotein (LDL, mg/dl), High-Density Lipoprotein (HDL, mg/dl), HbA1c (%), Fasting Plasma Glucose (FPG, mmol/l), dyslipidemia (LDL ≥ 130 mg/dl, or HDL < 35 mg/dl, or triglycerides ≥ 150 mg/dl, or total cholesterol (= LDL + HDL + Triglycerides/5) ≥ 200 mg/dl), and adiponectin (mg/l).

Five out of 48 patients in the Sitagliptin arm of the SAIS1 study for whom HbA1c measures were not collected during the follow-up were excluded from the analysis.

To handle missing values on covariates in the PROLOGUE study, we used a Multivariate Imputation by Chained Equations (MICE) approach30 as the imputation strategy, with random forests24,25 as the elementary imputation method. We performed the imputation with a monotone visit sequence, i.e., the variables were sorted by increasing amount of missingness to impute the data during each pass through the data30.
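The chained-equations idea can be sketched as follows. This minimal Python version substitutes ordinary least squares for the random forests used in the paper (purely to keep the sketch self-contained) and visits columns in the monotone order described above:

```python
import numpy as np

def mice_impute(X, n_iter=5):
    """Minimal chained-equations imputation sketch: each variable with missing
    values is regressed on the others (OLS here; the paper used random forests),
    visiting columns in a monotone sequence, i.e. by increasing missingness."""
    X = X.astype(float).copy()
    miss = np.isnan(X)
    # Initialize missing entries with column means
    col_means = np.nanmean(X, axis=0)
    for j in range(X.shape[1]):
        X[miss[:, j], j] = col_means[j]
    # Monotone visit sequence: least missing first
    order = np.argsort(miss.sum(axis=0))
    for _ in range(n_iter):
        for j in order:
            if not miss[:, j].any():
                continue
            obs = ~miss[:, j]
            others = np.delete(X, j, axis=1)          # all other (current) columns
            A = np.c_[np.ones(len(X)), others]
            beta = np.linalg.pinv(A[obs]) @ X[obs, j]  # fit on observed rows only
            X[miss[:, j], j] = A[miss[:, j]] @ beta    # fill in the missing rows
    return X
```

Each pass re-imputes every incomplete column using the current completed data, so the imputations stabilize over the iterations.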

The learners were considered both on the full set of variables and on subsets selected by a random forest. Supplementary Table S2 reports the variables involved in the training of each base learner. Overall, 26 (i.e., 13 × 2) different algorithms were evaluated to build the SL. Table 1 lists the learners employed and their implementations. They were combined in the SL using the Non-Negative Least Squares algorithm as a meta-learner, i.e., the weights with which they contribute are estimated to minimize the squared prediction error. The weight-computation procedure starts by assigning each model a weight equal to 1/n, where n is the number of learners. Next, it evaluates the prediction performance and modifies the weights accordingly. Since the prediction performance is assessed using a fivefold CV procedure, on a sample size of 43 patients each validation set comprises 8 or 9 patients, so the AUC varies in steps of 0.125 or 0.111, respectively. Given that the AUCs of the learners at the first step (weights equal to 1/n) are similar, we can argue that their performance on the validation sets at each CV step remains very similar, so it is reasonable that the final combination also consists of equal weights.

Table 1 Base learners and super learner (SL) training performance.

Hyperparameter tuning of the SL was conducted by a fivefold cross-validation process in which each fold was balanced to maintain the same outcome ratio as the overall dataset in every training and validation sample. The overall performance was measured by the Area Under the Receiver Operating Characteristic Curve (AUC-ROC).
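A balanced (stratified) fold assignment of the kind described above can be sketched as follows; this is an illustrative Python helper, not the fold construction actually used in the R analysis:

```python
import numpy as np

def stratified_folds(y, k=5, seed=0):
    """Split indices into k folds, keeping each fold's responder/non-responder
    ratio close to that of the full dataset."""
    rng = np.random.default_rng(seed)
    folds = [[] for _ in range(k)]
    for label in np.unique(y):
        # Shuffle indices of this class, then deal them out across the folds
        idx = rng.permutation(np.flatnonzero(y == label))
        for i, chunk in enumerate(np.array_split(idx, k)):
            folds[i].extend(chunk.tolist())
    return [np.array(f) for f in folds]
```

With a dichotomized outcome, stratification prevents a small validation fold from containing only one class, which would make the AUC undefined for that fold.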

We used the outcome prediction model developed on the SAIS1 study to assign to each patient in the (imputed) PROLOGUE dataset the probability of being a responder (i.e., achieving a reduction in HbA1c of at least 0.5% at 12 months).

Using distinct probability thresholds to predict which PROLOGUE patients respond to the therapy (patients successfully achieving a reduction of \(\Delta \mathrm{HbA1c} \le -0.5\%\)), we subset the PROLOGUE patients into nested groups. The cut-off values used for the AUC-ROC computation on the SAIS1 study were selected as probability thresholds to define nested sub-groups of responsive patients in the PROLOGUE study.
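The nesting property follows directly from the thresholding rule: raising the cut-off can only remove patients from a subgroup. A minimal sketch (with hypothetical probabilities and thresholds):

```python
import numpy as np

def nested_subgroups(prob, thresholds):
    """For each probability cut-off, return the indices of patients whose
    predicted responder probability is at least that cut-off. Increasing
    cut-offs yield nested, shrinking subgroups."""
    return {t: np.flatnonzero(prob >= t) for t in sorted(thresholds)}

# Hypothetical predicted responder probabilities for five patients
prob = np.array([0.1, 0.3, 0.5, 0.7, 0.9])
groups = nested_subgroups(prob, [0.2, 0.6])
```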

For each subgroup of patients, an independent GLM was used to estimate the effect of Sitagliptin on the 12-month variation of HbA1c (on the continuous scale), adjusted for the characteristics used in the PROLOGUE RCT to balance the allocation of patients into the two arms, i.e., treatment of diabetes mellitus before randomization (pharmacological or not), use of statins, age, gender, office SBP, baseline HbA1c, and maximum common carotid intima-media thickness.
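The per-subgroup adjusted estimate can be illustrated with an ordinary least-squares stand-in for the GLM (identity link): the coefficient on the treatment indicator estimates the covariate-adjusted effect within the subgroup. The data below are synthetic and purely illustrative.

```python
import numpy as np

def adjusted_effect(y, treat, covars):
    """Adjusted treatment effect from the linear model y ~ treatment + covariates:
    the coefficient on the treatment indicator is the adjusted effect estimate."""
    X = np.c_[np.ones(len(y)), treat, covars]
    beta = np.linalg.pinv(X) @ y
    return beta[1]  # position 1 = treatment indicator

# Synthetic subgroup: true treatment effect of -0.5 on the continuous outcome
rng = np.random.default_rng(0)
n = 200
covars = rng.normal(size=(n, 2))
treat = rng.integers(0, 2, size=n)
y = -0.5 * treat + covars @ np.array([1.0, -2.0]) + 3.0
effect = adjusted_effect(y, treat, covars)
```

In the actual analysis, this model is refitted independently on each nested subgroup, yielding the effect-versus-cut-off profile shown in Figure 1.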

R software31, version 4.0.3, was used for the analysis.

Results

The PROLOGUE study initially involved and analyzed data on n = 385 patients with type 2 diabetes, at least 30 years of age, with baseline HbA1c levels between 6.2 and 9.4%. Patients were allocated to conventional therapy (n = 193) or to Sitagliptin (n = 192). Follow-up information was collected at 12 and 24 months (data available at https://datadryad.org/resource/doi:10.5061/dryad.qt743/2) (Table 2).

Table 2 SAIS1 and PROLOGUE patients’ characteristics.

The SAIS1 study involved n = 103 patients with type 2 diabetes, between 20 and 75 years of age, with baseline HbA1c levels between 6.9% and 8.4% (data available at https://doi.org/10.1371/journal.pone.0164255.s004), who were allocated to receive Glimepiride (n = 55) or Sitagliptin (n = 48). Follow-up information was collected once, at 6 months (data available at https://doi.org/10.1371/journal.pone.0164255.s005) (Table 2).

Outcome results of both the SAIS1 and the PROLOGUE study are reported in Table 3.

Table 3 Outcome results in SAIS1 and PROLOGUE Study.

The performance of the outcome prediction model developed on the SAIS1 study patients was measured by the cross-validated AUC-ROC, which was 92.05%. Table 1 reports each learner's error rate and the weight with which it contributes to the ensemble SL.

The cut-off values used for the AUC-ROC computation on the SAIS1 study were selected as probability thresholds to define nested sub-groups of responsive patients. Figure 1 shows the treatment effect estimated for different sub-groups of responders, selected by varying the probability value that determines the patients responsive to Sitagliptin. At value 0, the treatment effect is estimated on all 385 patients. Overall, 376 of the 385 patients have a probability of achieving a reduction of HbA1c (\(\Delta\) HbA1c ≤ − 0.5%) of at least 19.3%. Then, 259 of the 385 patients have a probability of achieving \(\Delta\) HbA1c ≤ − 0.5% of at least 27.5%. The best treatment effect is achieved in a sub-group of 253 patients selected at the probability value of 41.3%.

Figure 1

Treatment effect and 95% confidence intervals achieved by varying the probability (cut-off level) that defines the patients most responsive to the DPP-4 inhibitor Sitagliptin. Below the cut-off levels, the number of responsive patients is reported.

In this sub-group of patients (best responders in the PROLOGUE study, Table 3), the median reduction of glycated haemoglobin at 12 months (Δ0–12 HbA1c) is − 0.2 (IQR: − 0.5; 0) among the 122 best responsive patients in the conventional group and − 0.4 (IQR: − 0.7; − 0.2) among the 131 best responsive patients in the Sitagliptin group. The difference between arms remains statistically significant, p = 0.013. Moreover, the two arms of best responders remain balanced for baseline characteristics (data not shown).

Figure 2 shows the distribution of the HbA1c difference among patients retained in the conventional and Sitagliptin arms at different pre-specified levels of the probability of being a responder. The treatment effect, assessed on each subset of retained patients, is not constant across these patient subpopulations. This can be attributed to heterogeneity of the treatment effect and suggests an interaction between treatment and patient characteristics.

Figure 2

Distribution of the HbA1c difference at 12 months (Δ0–12 HbA1c) among patients targeted as best responsive in both the conventional and Sitagliptin arms, according to increasing levels (from bottom to top) of the predictive probability of lowering HbA1c by more than 0.5% (red dashed line). At level 0, the distributions of Δ0–12 HbA1c are based on the entire RCT sample.

Discussion

Precision medicine aims to target the right treatment to the right patients. As such, the identification of non-random variation in the direction or magnitude of a treatment effect for subgroups within a population is the basis of precision medicine. In clinical trials, individual response to treatment can also be used to improve patient enrollment and identify patient sub-populations.

In a recent scoping review3, Rekkas and colleagues identified many methodological approaches for assessing the heterogeneity of treatment effects in RCTs developed in the past 20 years. They grouped predictive models into three broad categories (i.e., risk-based, treatment effect modelling and optimal treatment regimen methods) depending on whether and how they incorporated prognostic variables and relative treatment effect modifiers.

Senn et al.32 showed how to estimate the component of variation corresponding to a patient-by-treatment interaction and investigated the possibility of individual response to treatment from a replicate cross-over study.

In the present work, we illustrated an ML framework for carrying out a heterogeneous treatment analysis in the context of RCTs. We took advantage of data made publicly available upon publication of two clinical trials (the SAIS1 and PROLOGUE studies) that share inclusion and exclusion criteria and a set of common clinical patient features. One of them (the SAIS1 study15) was used to train an outcome prediction model, which was subsequently applied to the patients enrolled in the second trial (the PROLOGUE study14).

Whereas in Senn et al.32 the patient-by-treatment interaction turned out to be unimportant, in our case the heterogeneous treatment analysis made it possible to identify a subgroup of best responders to the treatment. This illustrates the potential applicability of ML to the problem of finding evidence of individual patient response to treatment.

As clinical research becomes increasingly patient-driven, opportunities to deploy artificial intelligence, especially ML, are growing rapidly from the perspective of precision medicine. In the last decade, cutting-edge ML techniques have reached a degree of maturity that allows them to be employed under real-world conditions to assist decision-making in medical and healthcare settings33. Their added value must be demonstrated through external validation and benchmarked in an explainable, ethical, repeatable, and scalable way.

To address the problem that no single ML algorithm performs best in all situations, and thus to avoid building several models and choosing the best-performing one, we used an ensemble approach called the Super Learner. The SL has the advantage of giving greater weight to the algorithms that contribute more accurately to the final estimate, without forcing the choice of an individual model/algorithm.

Our framework focused on openly and publicly available clinical trial data. As medical research becomes more patient-driven, the need for broader access to clinical trial data grows more urgent. Although not yet widely adopted, open data policies34 have renewed the focus on sharing clinical trial data in peer-reviewed scientific journals, with profound implications for clinical practice and research.

Following this approach, ML can be adopted into the clinical trial ecosystem step by step, shifting the focus from the framework of clinical trials to personalized medicine8,13. RCTs generate immense amounts of operational data, and consolidating all data, whatever the source, on a shared analytics platform supported by open data standards can foster collaboration and knowledge. Furthermore, incorporating a self-learning system designed to improve predictions can proactively deliver reliable analytics insights to users.