INTRODUCTION

Censored data is a practical and overriding question when analyzing data. There is no unique and simple answer on how to handle these data (1). If not considered properly, censoring may cause bias and misspecification during data analysis (2), so it is important to apprehend this problem and to propose strategies to get the best of experimental data.

Traditionally for pharmacokinetic studies and more recently for pharmacodynamic studies, a major source of censored data is concentrations below the limit of quantification (LOQ) of the analytical method. In this context, several approaches were proposed to handle observed concentrations reported as below quantification limit (BQL) (35). These approaches have various degrees of complexity. The simpler method consists in discarding BQL values (the so-called M1 method) and applying the method of maximizing the conditional likelihood only for the observations above the LOQ. A more complex approach consists in the simultaneous modeling of above and below quantification limit observations, considering for the latter, the likelihood of the observation to be BQL given the predicted value (the so-called M3 method). Not considering BQL observations in PK studies can produce biased parameter estimations and biased interpretations. In tumor growth inhibition experiments, we were also facing a kind of censoring specifically induced by the measurement technique.

The xenografted animal model (generally on mice or rats) (6) is a widely used experimental model to evaluate anticancer effect of new compounds and possibly predict the efficacy in human thanks to translational approaches (7).

During tumor growth experiment on xenografted animals, the tumor size is measured by a technical operator using a caliper, and reported as the measures of two diameters, often converted into tumor volume or tumor weight. One of the issues with this type of experimentation is the precision of the caliper measurement, reasonable for larger diameters (precision ±0.1 mm) but questionable for smaller ones, whose measurement if often limited by operational difficulties (skin thickness, difficulties to delimit the tumor and to maintain correctly the tumor in the caliper …), and only rounded values for tumor diameters (1, 2, 3,…5 mm) are reported. When converting into tumor volumes, rounded values for small tumor diameters result into discrete values and consequently to interval-censored tumor volumes.

The appreciation of the antitumor effect of a drug being essentially informed when the tumor size is at the lowest; the imprecision of small tumor sizes can largely affect the estimation of the drug effect.

A new approach, the so-called interval-M3 method, adapted to the censoring induced by the caliper measurement is being proposed and evaluated. This article reports the comparison of the performance of several methods (including methods discarding data, methods considering a single limit in the censoring process, and also methods considering interval censoring in its founding) to take into account the impact of interval censoring in tumor growth inhibition modeling exercise. We simulated replicated data of a tumor growth inhibition studies in xenografted mice and estimated model parameters under 12 different approaches for censored data in order to better estimate model parameters and thus to better quantify the antitumor effect.

MATERIAL AND METHODS

Issues of Interval Censoring in Tumor Sizes

In many tumor growth studies in xenografted animals, the dimensions of the tumors are measured with a caliper, whose precision is generally about 0.1 mm, and tumor volume, expressed in cubic millimeter, is often calculated, assuming the tumor has an ovoid form, following the approximation formula (demonstration in Appendix) (8,9):

$$ \mathrm{Tumor}\;\mathrm{volume}\;\left({\mathrm{mm}}^3\right)=\frac{\mathrm{length}\ \left(\mathrm{mm}\right) \cdot {\mathrm{width}}^2\left({\mathrm{mm}}^2\right)}{2} $$

However, due to the imprecision induced by skin thickness and palpability of the tumor, when the length and width are less than 5 mm, the measure of the caliper is then rounded with a precision of 0.5 mm, and reported tumor sizes seem to take discrete values, such as 1 × 1, 2 × 2, 3 × 3, 4 × 4, and 5 × 5 assuming the tumor has a spherical shape.

For example, if we consider a tumor of 2.2 mm long, and a second tumor of 2.8 mm long, the first one will be reported as a 2 × 2 tumor, equivalent to a 4 mm3 tumor while the second one will be reported as a 3 × 3 leading to a calculated tumor volume of 13.5 mm3. Similarly, taking into account the censoring problem, 2 × 2 tumors would include diameters between 1.5 and 2.5 mm and tumor volume ranging from \( \frac{1.5 \times {1.5}^2}{2}=1.69\ {\mathrm{mm}}^3 \) to \( \frac{2.5 \times {2.5}^2}{2}=7.8\ {\mathrm{mm}}^3 \), reported as 4 mm3.

All the possible discrete values (experimental data) of tumor volume reported by the technical operator, according to length and width, and the corresponding “real” (uncensored) tumor volume intervals are summarized in Table I.

Table I Description of All Discrete Values of Tumor Volume and the Corresponding Range of Real Tumor Volume

This kind of interval censoring may result in biased parameter estimations and inflation of residual error if not correctly handled.

Handling Interval Censoring in Tumor Size: Comparison Strategy

Stochastic Simulations and Estimations (SSE) methodology was used to compare and evaluate the different methods to handle interval-censored tumor volumes. SSE are composed of two steps: the first step consists in generating 1000 simulated replicates of datasets using a “true” reference model. The second step is an estimation step where both the reference and alternative models are fitted to the replicates. The alternative models are then compared to the reference model in terms of parameter values and precision, estimated in the 1000 replicates.

SSE process was performed with a Nonlinear Mixed Effects approach (10) using the sse script of PsN suite (Pearl-speaks to NONMEM) (11,12) and NONMEM software (version 7.3) (13).

In order to perform SSE, the following are to be defined: (i) the experimental study design; (ii) the reference simulation model; (iii) the alternative estimation methods to be compared.

Definition of the Experimental Study Design

In each replicated study, four groups of 8 xenografted mice were simulated: a control group, and three groups receiving an anticancer drug dose of either 20, 45, or 100 μg/kg. Tumors were implanted at time t = 0 h, and tumor size monitored every 24 h for 1000 h (42 days). Treatment was given orally for 7 days, from t = 624 h (26 days) to t = 792 h (33 days) every 24 h.

Reference Model: Pharmacokinetic and Tumor Growth Inhibition Models

Model Structure

The pharmacokinetic model is a one-compartment disposition model with both first-order absorption and elimination processes.

$$ \begin{array}{c}\hfill \frac{d{A}_1}{dt}=-{k}_a\cdot {A}_1\hfill \\ {}\hfill \frac{d{A}_2}{dt}={k}_a\cdot {A}_1-{k}_{el}\cdot {A}_2\hfill \\ {}\hfill {C}_p=\frac{A_2}{V_c}\hfill \end{array} $$

With ka (h−1) the first-order rate of absorption, kel (h−1) the first-order rate of elimination, Vc (L/kg) the apparent volume of distribution, Cp (ng/mL) the plasma concentration, A1 (μg/kg) and A2 (μg/kg) the amount of drug respectively in the depot compartment and in the central compartment. At each time of treatment administration, the corresponding dose was added to the amount A1 in the depot compartment.

The unperturbed tumor growth, without treatment, was described using an exponential tumor growth model (λ parameter, h−1), reasonable for smaller tumor sizes (14), and initial tumor size TG0 (mm3), as follows:

$$ \begin{array}{l}\frac{dTG}{dt}=\lambda \cdot TG\hfill \\ {}T{G}_{t=0}=T{G}_0\hfill \end{array} $$

The cytotoxic effect of the drug (k2, (ng/mL)−1.h−1) was modeled proportional to plasma concentration Cp and tumor size TG, as follows:

$$ \begin{array}{c}\hfill \frac{dTG}{dt}=\lambda \cdot TG - {k}_2\cdot {C}_p\cdot TG\hfill \\ {}\hfill T{G}_{t=0}=T{G}_0\ \hfill \end{array} $$

Typical profiles of the model for each dose are presented in Fig. 1.

Fig. 1
figure 1

Typical profiles for each dose group (control group in black, 10 μg/kg in red, 45 μg/kg in green, and 100 μg/kg in blue) of the tumor growth model

Parameter Distributions in the Population

An exponential model was used for interindividual variability (IIV) on k2 and λ parameters, which implies an assumed log-normal distribution of the parameters between the 32 individuals in the population.

$$ {p}_i={p}_{pop}\cdot \exp \left({\eta_p}_i\right) $$

With ppop the typical value of p parameter, pi the value of p parameter for the ith individual and η pi a random variable quantifying the variation from the typical (average) individual. The \( {\upeta}_{{\mathrm{p}}_i} \) are assumed to be normally distributed, with a mean of zero and a variance of ωp 2.

Interval Censoring and Residual Error

When simulated tumor volumes were below 62.5 mm3 (corresponding to a measured 5 × 5 tumor), the reported values were altered following the aforementioned interval-censoring limitation of caliper measurement for smaller tumors (as shown in Table I and Fig. 2).

Fig. 2
figure 2

Observed versus reported tumor growth volumes: visualization of the interval censoring

Above this limit, the residual unexplained error linking the observed value to the corresponding predicted values was considered as a proportional error model, as follows:

$$ {y}_{ij}={f}_{ij}\cdot \left(1+{\varepsilon}_{ij}\right) $$

With y ij , the jth observed tumor volume for the ith individual and fij is the corresponding model predicted value, ε ij a random variable with a normal distribution of mean 0 and standard deviation σ.

Parameter distributions (typical values and variabilities) are presented in Table II. Since PK samples are not drawn in most of tumor growth experiments, PK parameters were fixed to their typical values. The parameters to be estimated during the SSE exercise were the typical values of (TG0, λ, k2), the variances of (λ, k2) and the standard deviation of the proportional residual error, σ.

Table II Parameters of the Simulated Model Used for the Stochastic Simulation and Estimation (SSE) Process

Alternative Error Models: Approaches to Handle Interval-Censored Tumor Size

Twelve different methods were evaluated to handle smaller tumor size values (Table III). In all cases, the model structure and the model for the interindividual variability of the parameters were the same as the reference model. Only the residual error model and method to handle interval-censored data were challenged. Observations greater than the limit of quantification were fitted using a proportional residual error model, unless otherwise noted. The twelve different methods were as follows:

Table III Description of All Estimation Methods
  • (a) Fitting of the whole dataset, not taking into account the censoring, using a combined (additive + proportional) residual error model.

  • (b) and (c) ignoring tumor volumes inferior to LOQ (corresponding to the so-called M1 method (3)), defining LOQ to 0.5 and 4 mm3 respectively (in order to consider several proportions of censored data), using a combined residual error model for observations above LOQ.

  • (d) and (e) replacing values below LOQ by LOQ/2 and using for values below LOQ an additive error model with standard deviation equals to LOQ/4 (corresponding to the so-called M5 method (3)), defining LOQ to 0.5 and 4 mm3 respectively.

  • (f) and (g) taking into account the likelihood of the data to be below LOQ (corresponding to the so-called M3 method (3)), defining LOQ to 0.5 and 4 mm3 respectively.

  • (h) considering a quadratic residual error model (being more flexible than a combined residual error model) for observations corresponding to simulated values below 83.2 mm3 (i.e., 5 × 5 tumors), and a proportional model for observations above this limit; the variance of the quadratic error can be written a + b*f + c*f 2, with a the additive term, b the first-order term and c the second-order term, and f the predicted tumor volume.

  • (i) similar to h, with the first-order term b fixed to 0.

  • (j) taking into account the interval censoring (1 × 1 = 0.5 mm3, 2 × 2 = 4 mm3, …, up to 5 × 5 = 83.2 mm3) and defining a separate additive error model for each interval, setting the standard deviation to the quarter of the width of the corresponding interval (further referred to as multi-additive method) implying that 95% of the values belong to the interval. For example, observed values of 3 × 3 = 13.5 mm3, corresponding to the censoring of the interval [7.8; 21.4 mm3], are associated to an additive residual error with a standard deviation fixed to \( \frac{21.4-7.8}{4}=3.4\ {\mathrm{mm}}^3 \).

  • (k) similar to j but defining a separate proportional error model for each interval (further referred to as multi-proportional method). For example, observed values of 3 × 3 = 13.5 mm3, corresponding to the censoring of the interval [7.8; 21.4], are associated to a proportional residual error with a coefficient of variation fixed to \( \frac{3.4}{13.5}=25.2\ \% \).

  • (l) taking into account the likelihood of the data to be in each censored interval (further referred to as interval-M3 method). Therefore, for observed values below 5 × 5 = 83.2 mm3, the probability of the observed tumor volume (TGobs) to be in each censored interval has to be defined:

    ▪ Below 1 × 1: P(0 < TGobs < 0.0625)

    ▪ 3 × 3: P(7.8 < TGobs < 21.4)

    ▪ 1 × 1: P(0.0625 < TGobs < 1.69)

    ▪ 4 × 4: P(21.4 < TGobs < 45.5)

    ▪ 2 × 2: P(1.69 < TGobs < 7.8)

    ▪ 5 × 5: P(45.5 < TGobs < 83.2)

Implementation of method (l) in NONMEM is detailed in an Appendix.

Performance Evaluation

Several criteria were considered to compare and evaluate the performances of the different methods. The ratio of parameter estimates (ppop, ωp 2 and σ) to the true value in the N = 1000 replicates of the pseudo-experimental study were depicted in box-plots for graphical comparison of the different methods.

Parameters estimated with the 12 methods were also compared in terms of relative bias, and relative Root Mean Squared Error (RMSE).

$$ \begin{array}{c}\hfill \mathrm{Relative}\ \mathrm{bias} = 100\%\cdot \frac{1}{N}{\displaystyle \sum_i}\frac{es{t}_i-tru{e}_i}{tru{e}_i}\hfill \\ {}\hfill \mathrm{Relative}\ \mathrm{RMSE}=100\%\cdot \sqrt{\frac{1}{N}\cdot {\displaystyle \sum_i}\frac{{\left(es{t}_i-tru{e}_i\right)}^2}{tru{e}_i}}\hfill \end{array} $$

The rate of successful minimization runs was also used for the comparison of all methods. Finally, the magnitudes of the residual error were compared to judge the impact of the censoring handling of low values on the prediction of larger values of tumor volumes.

Categorical Visual Predictive Check methodology (VPC) was also used for one example dataset for method (a) (the combined residual error model method) and method (l) (the interval-M3 method). Five hundred simulated replicates of the dataset were generated using the corresponding estimated set of parameters. Evolving over time, the observed proportion of data belonging to each category (i.e., intervals of tumor volume) was compared to the 95% confidence interval of the corresponding proportions computed from simulated data. Simulations from estimated models were also compared to the “true” model to better highlight the impact of biased parameters as well as the calculation of the threshold concentration (also known as CT which is a secondary parameter derived from the model (8)) representing concentration at steady state which eradicates the tumor.

The performance of the new method was also evaluated on two other tumor growth inhibition models, as proposed in published works (15,16): in this way, the methods were compared using different tumor growth models, different set of parameters, and different study designs.

Application to a Real Dataset

Finally, the method was tested with data issued from a real experiment on a new compound currently under development. Data were extracted from an experiment involving three groups (each composed of 4 mice): a control group and two dose groups in which the drug was given orally once a day, 5 days a week for 2 weeks, and 47% of the observations were concerned by the interval censoring.

RESULTS

Simulated Data Description

The proportions of tumor volumes being in each interval after the 1000 simulations are summarized in Table IV. The median frequency in each censored interval was about 10% each, leading to a total interval censoring, i.e., values below 83.2 mm3, of about 60%.

Table IV Distribution of the Proportion of Simulated Tumor Volumes in Each Censored Interval: Median, First and Third Quartiles of the Proportion Among the 1000 Replicates

Performance of Each Method

The box plots of all methods for each estimated parameter are shown in Fig. 3. Table V shows the results about relative bias and relative RMSE.

Fig. 3
figure 3

Box plots of ratio of parameter estimates divided by the true parameter for each tested method. λ is the exponential growth parameter, TG0 is the initial tumor size, k2 is the cytotoxic effect parameter, and ωλ 2 and ωk2 2 the corresponding inter individual variance parameters; σProp is the magnitude of the proportional component of the residual error model

Table V Relative Bias and Relative Root Mean Squared Error (RMSE) for Each Methods of the Estimation of the Parameters and Proportional Component of the Residual Error for Tumor Volumes Superior to 83.2 mm3 for Each Method

The typical value for the tumor growth rate λ did not seem to be affected by the censoring of lower values and was well estimated with all methods. Both relative bias and relative RMSE were about the same value for all methods. Indeed, the information necessary to estimate this parameter was well and sufficiently supported by data from the control group and larger tumors, and as expected censoring had low impact on this parameter. The least biased method was the interval-M3 method (l) with a relative bias of −0.53%.

Typical values of the cytotoxic effect parameter k 2 and the initial tumor volume TG 0 appeared to be more impacted by the censoring of the data, being generally underpredicted and overpredicted, respectively. Indeed, the most informative data were expected to be in the range of the interval-censored area for both parameters. The multi-additive and multi-proportional methods (j and k) gave the worst estimations for TG 0 in terms of relative bias (respectively −23.3 and 22.7%) and relative RMSE (respectively 26.1 and 27.1%) while interval-M3 method (l) gave the best results with a relative bias of −0.27% and a relative RMSE of 11.8%. Regarding the typical value of k 2 , M5-methods (d and e) were the most biased methods (with a relative bias respectively of −14.1 and −16.5%), and the best estimations were given by the interval-M3 method (l) with the smallest relative bias of 0.49% and the most precise estimation with a relative RMSE equal to 4.06%.

Regarding interindividual variability parameters, similarly, the variance in λ parameter, ω λ 2, was not much impacted by the interval censoring, and all methods gave approximately the same range of estimates, but the estimations were generally more biased and less precise than typical value estimations. The imprecision of IIV variance parameters was certainly due to the small number of animals in the simulated study design.

The interindividual variance on k 2 , ω k2 2, was more sensitive to the interval censoring of the data. Estimations of ω k2 2 were more biased with the M5-methods (d and e) (relative bias of 69.0 and 222%), and the largest relative RMSE were found with M5-methods (d, e), M3-methods (f, g) and quadratic methods (h, i) (from 48.4 to 260%). For ω k2 2 parameter, the less biased method was the combined error method (a) with a relative bias equal to 0.99%.

We noticed that method (c) gives the worst results regarding bias and precision which can be probably due to the omission of 20% of the data (LOQ being fixed to 4 mm3).

In summary, the interval-M3 method (l) was found to be the most adequate method, performing well in terms of bias and precision for all the parameters, resulting in the lowest relative RMSE and relative bias for all typical parameters.

Visual Predictive Check

Categorical VPC for the combined error model method (a) and interval-M3 method (l) are presented in Fig. 4. The interval-M3 method (l) better described the evolution of the proportion of tumor volumes to be in each interval than method (a) especially in the range [0; 6.8] mm3. It is to say, in general, classical methods implementing a combined error model would be poorly informed by tumor size observations belonging to the interval [0; 4σadd] mm3 (where σadd is the standard deviation of the additive component of the combined residual error model).

Fig. 4
figure 4

Categorical Visual Predictive Check for method (a) and method (l) for a given dataset. The panels show simulation based 95% confidence intervals (area) around the observations (solid line) for the fraction of censored observations

In addition, the comparison of tumor growth typical profiles for method (a) and method (l) with the true profile is shown in a supplemental figure (Supplemental Figure 1). The profile simulated with the interval M3 method appears to be the closest to the true profile and especially for high dose for which we observe the lowest tumor volumes. The CT defined by Simeoni et al. (8,17) was also impacted. For the “true model” CT = 890 ng ml−1, with the set of parameters estimated with method (a) CT = 974 ng ml−1 and with method (l) CT = 886 ng ml−1.

Magnitude of the Residual Error for Larger Tumors

The magnitudes of the proportional residual error, σ Prop , for larger observations obtained from each method are presented in Table V. This parameter represented the value of the residual error for non-censored tumor sizes and as a result the impact of the interval censoring of smaller tumors on the predictions of larger tumor volumes. Predictions for larger tumors were more altered with methods considering the censoring as a single limit (range from 23 to 49%). Methods considering the interval censoring due to caliper measurement limitation (h, i, j, k, l) gave a residual error value close to 20% (range from 20 to 23%), 20% being the value of the residual error in the reference model. This suggests that understanding and taking into account the source of error when measuring improve the quality of prediction of both smaller and larger tumor volumes.

Rate of Minimization Successful Runs

The rates of minimization successful runs for each method are summarized in a supplemental material (supplemental table 1). We observed the lowest rate for the methods using Laplacian estimation methods with the lowest value for method (l) (42%). This implies that the complexification of the estimation method has an impact on the rate of successful minimization runs. Decreasing the proportion of interval-censored values results in an increase of successful convergence rate (not shown).

The additional comparisons performed using other tumor growth parameters from published works (including the Simeoni model (15) with λ0, λ1, ω2λ0, ω2λ1, TG0, k1, k2, ω2k2, and ω2k1 and including an example with a K-PD model (16)) led to similar results (as shown in supplemental material), reinforcing the benefit of the use of the interval-M3 method when interval censoring occurs in the data.

Application to a Real Dataset

The real dataset has been modeled using a bi-phasic tumor growth model (9) and the cytotoxic effect has been described using a sigmoidal function. It appeared that the use of the interval-M3 method (l) improved the fit of the data with a significant decrease of the objective function (889 for method (a), 839 for method (g) and 783 for method (l)). In addition, the precision of the estimated parameters (as shown in supplemental material) were the best when using the interval-M3 method which is probably due to a better consideration of the information provided by the data. A comparison of the CT parameter calculated with methods (a), (g), and (l) was also done, and a fourfold difference was found between the different methods (1393 vs. 501 vs. 330 ng.ml−1 respectively for method (a), (g), and (l)).

DISCUSSION

Data censoring, if not handled properly, may alter the quality of parameter estimation and model development. In this work, stochastic simulations and estimations were performed to investigate the impact of interval censoring in smaller tumor measurements on the estimation of model parameters and evaluate different approaches to handle these data. Clearly, not considering the underlying process of measurement censor can lead to biased parameters and perhaps incorrect parameter interpretations. A novel approach, the so-called interval-M3 method, is proposed and has been evaluated by stochastic simulations and estimations, taking into account the interval censoring of smaller tumors, due to caliper measurement limitations in xenografted animals for tumor growth inhibition experiments.

Comparison of different methods showed there was a large difference in the quality of estimations (regarding relative bias and relative RMSE) depending on the handling method for tumor size censoring and particularly for the drug cytotoxic effect parameter k 2 , which is the parameter of interest when evaluating drug efficacy. The use of the interval-M3 method (method l), combining the modeling of continuous and categorical data and taking into account the interval censoring process allowed an unbiased and precise estimation of all model parameters. Moreover, this method can be easily implemented in NONMEM (version 6 or later) using the variable F_FLAG to simultaneously fit continuous (F_FLAG = 0) and categorical data (F_FLAG = 1).

The results presented in this article were issued from an example where 60% of observations were interval-censored (<83.2 mm3). Similar analyses were performed, varying the proportion of interval-censored data (data not shown): the larger the proportion of censored observations, the larger the bias in parameter estimation. In any case, the interval-M3 method (l) appeared to be the best method with less biased parameter estimations. However, it is important to note that bias of parameters decreases with the proportion of censored data; when few tumor measurements are interval-censored, handling those values with the use of the interval-M3 method does not have a significant impact on model building.

The most commonly used method (the combined error method (a)) showed some limitations regarding tumor volume predictions (as shown on categorical VPC, Fig. 4), especially for smaller tumor sizes corresponding to the interval [0; 4σadd]. Consequently, this method gives imprecise predictions in this interval. The structural model used in this work was simpler than those usually required, explaining the small value of the additive component. Probably, the use of our interval-M3 method will be even more beneficial than classical methods as the value of σadd increases.

The modeling exercise performed on real data showed the impact on a real situation and the difference in parameter estimates that can occur and especially for CT parameter determination which is one of the key parameter in a xenograft experiment.

A possible limitation in the use of the interval-M3 method (method l) could be the higher rate of non-successful minimization runs (in a Supplemental Table 1). In fact, we noticed a higher non-successful termination rate for methods requiring LAPLACIAN option. This fact may be explained by different elements. First, the Laplacian method runs more complex computations than FOCE-I since Laplacian is a second-order approximation method for the integral of the likelihood while FOCE-I is a first-order approximation (18). It has been also shown that methods maximizing the likelihood for the censored data have a lower rate of successful termination than methods which only apply extended least squares and do not properly consider the censoring (1,4). Moreover, these methods need a longer computation time; a twofold factor was found compared to methods using FOCE-I method (data not shown). Nevertheless, in our simulation-estimation exercise, initial values were not optimized for each dataset, and no retry was done although LAPLACIAN methods are known to be more sensitive to initial values. In total, given the non-negligible gain in terms of bias and precision in parameter estimations, the interval-M3 method seems valuable.

The interval-M3 method is an extension of the M3-method used generally for BQL values. In this approach, there is no natural difference between BQL and rounded values since they are handled in the same way: BQL data corresponding to the lowest interval and the borders of the interval-censoring method being adjustable as needed.

In addition, another interesting area to dig in would be to explore the performances of our interval-M3 method with more complex models such as models dealing with combination therapies which could exhibit more efficacy (and thus more interval-censored data) due to synergistic effect.

CONCLUSION

In conclusion, the interval-M3 method appeared to perform the best (the most precise and the least biased) to take into account interval censoring due to caliper measurement limitations with smaller tumors. This advantage should be important when evaluating antitumor drug effect in tumor growth inhibition experiments in xenografted animals.