Skip to main content
Log in

Efficacy and toxicity monitoring via Bayesian predictive probabilities in phase II clinical trials

  • Original Paper
  • Published:
Statistical Methods & Applications Aims and scope Submit manuscript

Abstract

Bayesian monitoring strategies based on predictive probabilities are widely used in phase II clinical trials that involve a single efficacy binary variable. The essential idea is to control the predictive probability that the trial will show a conclusive result at the scheduled end of the study, given the information at the interim stage and the prior beliefs. In this paper, we present an extension of this approach to incorporate toxicity considerations in single-arm phase II trials. We consider two binary endpoints representing response and toxicity of the experimental treatment and define the result as successful at the conclusion of the study if the posterior probability of an high efficacy and that of a small toxicity are both sufficiently large. At any interim look, the Multinomial-Dirichlet distribution provides the predictive probability of each possible combination of future efficacy and toxicity outcomes. It is exploited to obtain the predictive probability that the trial will yield a positive outcome, if it continues to the planned end. Different possible interim situations are considered to investigate the behaviour of the proposed predictive rules and the differences with the monitoring strategies based on posterior probabilities are highlighted. Simulation studies are also performed to evaluate the frequentist operating characteristics of the proposed design and to calibrate the design parameters.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2

Similar content being viewed by others

References

  • Alvares D, Armero C, Forte A (2018) What does objective mean in a Dirichlet-multinomial process? Int Stat Rev 86:106–118

    Article  MathSciNet  Google Scholar 

  • Bernardo JM, Smith AFM (1994) Bayesian theory. Wiley, Chichester

    Book  Google Scholar 

  • Brutti P, Gubbiotti S, Sambucini V (2011) An extension of the single threshold design for monitoring efficacy and safety in phase II clinical trials. Stat Med 30(14):1648–1664

    Article  MathSciNet  Google Scholar 

  • Bryant J, Day R (1995) Incorporating toxicity considerations into the design of two-stage phase II clinical trials. Biometrics 51:1372–1383

    Article  MathSciNet  Google Scholar 

  • Conaway MR, Petroni GR (1995) Bivariate sequential design for phase II trials. Biometrics 51:656–664

    Article  Google Scholar 

  • Conaway MR, Petroni GR (1996) Design for phase II trials allowing for a trade-off between response and toxicity. Biometrics 52:1375–1386

    Article  MathSciNet  Google Scholar 

  • Hobbs BP, Chen N, Lee JJ (2016) Controlled multi-arm platform design using predictive probability. Stat Methods Med Res 27:65–78

    Article  MathSciNet  Google Scholar 

  • Lee JJ, Liu DD (2008) A predictive probability design for phase II cancer clinical trials. Clin Trials 5(2):93–106

    Article  Google Scholar 

  • R Core Team (2018) R: a language and environment for statistical computing. R Foundation for Statistical Computing, Vienna

    Google Scholar 

  • Sambucini V (2008) A Bayesian predictive two-stage design for phase II clinical trials. Stat Med 27:1199–1224

    Article  MathSciNet  Google Scholar 

  • Sambucini V (2017) Bayesian vs frequentist power functions to determine the optimal sample size: testing one sample binomial proportion using exact methods. Bayesian Inference Javier Prieto Tejedor, IntechOpen, pp 77–95

  • Sambucini V (2019) Bayesian predictive monitoring with bivariate binary outcomes in phase II clinical trials. Comput Stat Data Anal 132:18–30

    Article  MathSciNet  Google Scholar 

  • Saville BR, Connor JT, Ayers GD, Alvarez J (2014) The utility of Bayesian predictive probabilities for interim monitoring of clinical trials. Clin Trials 11(4):485–493

    Article  Google Scholar 

  • Shi H, Yin G (2017) Bayesian two-stage design for phase II clinical trials with switching hypothesis tests. Bayesian Anal 12(1):31–51

    Article  MathSciNet  Google Scholar 

  • Simon R (1989) Optimal two-stage designs for phase II clinical trials. Control Clin Trials 10:1–10

    Article  Google Scholar 

  • Tan SB, Machin D (2002) Bayesian two-stage designs for phase II clinical trials. Stat Med 21:1991–2012

    Article  Google Scholar 

  • Teramukai S, Daimon T, Zohar S (2015) An extension of Bayesian predictive sample size selection designs for monitoring efficacy and safety. Stat Med 34(22):3029–3039

    Article  MathSciNet  Google Scholar 

  • Thall PF, Simon R (1994) Practical guidelines for phase IIB clinical trials. Biometrics 50:337–349

    Article  MathSciNet  Google Scholar 

  • Thall PF, Simon R, Estey EH (1995) Bayesian sequential monitoring designs for single-arm clinical trials with multiple outcomes. Stat Med 14:357–379

    Article  Google Scholar 

  • Thall PF, Sung HG (1998) Some extensions and applications of a Bayesian strategy for monitoring multiple outcomes in clinical trials. Stat Med 17:1563–80

    Article  Google Scholar 

  • Thall PF, Simon R, Estey EH (1996) New statistical strategy for monitoring safety and efficacy in single-arm clinical trials. J Clin Oncol 14:296–303

    Article  Google Scholar 

  • Tournoux C, De Rycke Y, Medioni J, Asselain B (2007) Methods of joint evaluation of efficacy and toxicity in phase II clinical trials. Contemp Clin Trials 28:514–524

    Article  Google Scholar 

  • Yin G, Chen N, Lee JJ (2012) Phase II trial design with Bayesian adaptive randomization and predictive probability. J R Stat Soc Ser C 61:219–235

    Article  MathSciNet  Google Scholar 

  • Yin H, Wang W, Zhang Z (2019) On construction of single-arm two-stage designs with consideration of both response and toxicity. Biom J 61:1462–1476

    Article  MathSciNet  Google Scholar 

  • Zhou H, Lee JJ, Yuan Y (2017) BOP2: Bayesian optimal design for phase II clinical trials with simple and complex endpoints. Stat Med 36(21):3302–3314

    Article  MathSciNet  Google Scholar 

  • Zhou M, Tang Q, Lang L, Xing J, Tatsuoka K (2018) Predictive probability methods for interim monitoring in clinical trials with longitudinal outcomes. Stat Med 37(14):2187–2207

    Article  MathSciNet  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Valeria Sambucini.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Appendices

Appendix 1: Detailed calculations to obtain the predictive probability of interest

It can be useful to illustrate the detailed calculations necessary to obtain the predictive probability in (5). We use very small and unrealistic sample sizes with the only purpose of exemplifying. More specifically, we set the total sample size equal to \(N=10\) and assume that at the interim stage the data \({\varvec{x}}= (1,4,0,2)\) have been observed among \(n=7\) current patients. The first column of Table 10 provides all the possible results \({\varvec{Y}}\) obtainable among the 3 future patients, that is all the possible entries of a \(2 \times 2\) contingency table consistent with a total equal to 3. For each possible outcome, the second column provides its posterior predictive probability computed by using the Multinomial-Dirichlet distribution in (3). Then, the following two columns contain the posterior probabilities of an high efficacy (i.e. \(Pr(p_E> p_E^* |{\varvec{\alpha }}, {\varvec{x}}, {\varvec{y}})\)) and of a small toxicity (i.e. \(Pr(p_T< p_T^* |{\varvec{\alpha }}, {\varvec{x}}, {\varvec{y}})\)), based on the target values \(p^*_E= 0.4\) and \(p^*_T= 0.3\). The thresholds \(\lambda _E\) and \(\lambda _T\) are both set equal to 0.9 and in the table the color gray is used to highlight the cases in which the posterior quantities of interest are above 0.9. The final results are considered positive if they satisfy both the conditions on the posterior probabilities. In our case, we have three positive results and, simply by summing their predictive probabilities, we obtain the predictive probability of interest. Thus, in this fictitious example, the value of PP is equal to 0.5324 and, for reasonable choices of \(\theta _L\) and \(\theta _U\), the decision is to continue the trial to gather additional information before drawing a final conclusion.

Table 10 PP computation when \({\varvec{x}}= (1,4,0,2)\), \(n=7\), \(N=10\), \(p^*_E= 0.4\), \(p^*_T= 0.3\), \(\lambda _E=\lambda _T=0.9\) and \({\varvec{\alpha }} = (0.25,0.25,0.25,0.25)\)

Appendix 2: Description of the software tool to compute PP

We provide a very easy-to-use software tool to compute the predictive probability in (5) to implement the proposed decision rules. The program is coded in the R-programming language and uses the package gWidgets, which offers a relatively simple way of writing graphical user interfaces (GUIs). This package is therefore supposed to be correctly installed within R, as well as the packages gWidgetsRGtk2 and RGtk2. The file containing the R script is called “PP-BivEndpoints.R”.

The code can be run directly using R or its integrated development environment RStudio. When the script is contained in the current working directory, it is sufficient to let R run the source file by using the command line

> source("PP-BivEndpoints.R")

As a result a graphical interface is generated, where the user can insert the design parameters of interest. For instance, in Fig. 3 we show the GUI with the input parameters that correspond to the top-left cell of Table 3.

Fig. 3
figure 3

Graphical user interface to compute the predictive probability in (5)

By clicking on the button “Compute PP”, the output produced on the R console summarizes the design parameters choices and provides the predictive probability of interest. For instance, using the input entries in Fig. 3, we obtain the following output:

figure a

Finally, by selecting “Show additional results”, more details about the calculations performed to obtain the value of PP are displayed. More specifically, the output provides the list of the future data that satisfy the conditions for a successful treatment, their posterior predictive probabilities and the corresponding posterior quantities of interest. In other words, results similar to those presented in Table 10 are displayed with reference only to the future outcomes that fulfill the conditions in (4)

Appendix 3: Numerical algorithm to evaluate the operating characteristics of the proposed design

First of all, we set suitable values for the minimum sample size \(N_{min}\), the maximum planned sample size N, the target values \(p_E^*\) and \(p_T^*\), the vector of hyperparameters \({\varvec{\alpha }}\) and the probability thresholds \(\lambda _E\), \(\lambda _T\) and \(\theta _L\). We also need to fix an assumed scenario by setting the true value of \({\varvec{p}}\), here denoted by \({\varvec{p}}^{\varvec{true}}\), and the number of simulated trials B.

Once these design parameters have been fixed, a single clinical trial with stopping rules based on the proposed approach can be simulated through the following steps:

  1. (a1)

    Set \(n_{ASS}=0\) (the actually achieved sample size) and \(j=1\) (an auxiliary index for indicating potential additional patients)

  2. (a2)

    Enrol the first \(N_{min}\) patients into the trial by simulating the current data \({\varvec{x}}_{N_{min}}\) from a Multinomial distribution with index \(N_{min}\) and parameter \({\varvec{p}}^{\varvec{true}}\).

    Compute PP based on \({\varvec{x}}_{N_{min}}\) by using formula (5) and

    • if \(PP \le \theta _L\), the algorithm stops and set \(n_{ASS}=N_{min}\);

    • if \(PP > \theta _L\), the algorithm proceeds to the following step.

  3. (a3)

    Enrol the \((N_{min}+j)\)th patient into the trial by simulating a value from a Multinomial distribution with index 1 and parameter \({\varvec{p}}^{\varvec{true}}\). Add the simulated value to the previous current data, obtaining the data \({\varvec{x}}_{N_{min}+j}\).

    Compute PP based on \({\varvec{x}}_{N_{min}+j}\) by using formula (5) and

    • if \(PP \le \theta _L\), the algorithm stops and set \(n_{ASS}=N_{min}+j\);

    • \(PP > \theta _L\), set \(j=j+1\) and, if \(j \le N-N_{min}-1\), repeat the step (a3), otherwise proceeds to the following step.

  4. (a4)

    If \(j = N-N_{min}\), the maximum sample size has been reached; therefore set \(n_{ASS}=N\).

We simulate B clinical trials using the steps described above. Each simulated trial provides the value of the actually achieved sample size \(n_{ASS}\). A value of \(n_{ASS}\) less than N indicates that the trial has been terminated early because at a certain interim look the predictive probability of interest was below the desired threshold. In this case \(H_0\) is not rejected and the experimental treatment is considered not sufficiently promising. When, instead, \(n_{ASS}=N\) the null hypothesis is rejected and the new treatment is declared worthy of further investigation in phase III trials.

Therefore, the frequency of simulated trials where \(n_{ASS}=N\) represents an empirical evaluation of the Type I error rate, under a scenario where \(H_0\) is true, and of the statistical power when we assume that \(H_0\) is not true.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Sambucini, V. Efficacy and toxicity monitoring via Bayesian predictive probabilities in phase II clinical trials. Stat Methods Appl 30, 637–663 (2021). https://doi.org/10.1007/s10260-020-00537-3

Download citation

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10260-020-00537-3

Keywords

Navigation