Calibration of agent based models for monophasic and biphasic tumour growth using approximate Bayesian computation

Wang, Xiaoyu; Jenner, Adrianne L.; Salomone, Robert; Warne, David J.; Drovandi, Christopher

doi:10.1007/s00285-024-02045-4

Calibration of agent based models for monophasic and biphasic tumour growth using approximate Bayesian computation

Open access
Published: 15 February 2024

Volume 88, article number 28, (2024)
Cite this article

Download PDF

You have full access to this open access article

Journal of Mathematical Biology Aims and scope Submit manuscript

Calibration of agent based models for monophasic and biphasic tumour growth using approximate Bayesian computation

Download PDF

854 Accesses
6 Altmetric
1 Mention
Explore all metrics

Abstract

Agent-based models (ABMs) are readily used to capture the stochasticity in tumour evolution; however, these models are often challenging to validate with experimental measurements due to model complexity. The Voronoi cell-based model (VCBM) is an off-lattice agent-based model that captures individual cell shapes using a Voronoi tessellation and mimics the evolution of cancer cell proliferation and movement. Evidence suggests tumours can exhibit biphasic growth in vivo. To account for this phenomena, we extend the VCBM to capture the existence of two distinct growth phases. Prior work primarily focused on point estimation for the parameters without consideration of estimating uncertainty. In this paper, approximate Bayesian computation is employed to calibrate the model to in vivo measurements of breast, ovarian and pancreatic cancer. Our approach involves estimating the distribution of parameters that govern cancer cell proliferation and recovering outputs that match the experimental data. Our results show that the VCBM, and its biphasic extension, provides insight into tumour growth and quantifies uncertainty in the switching time between the two phases of the biphasic growth model. We find this approach enables precise estimates for the time taken for a daughter cell to become a mature cell. This allows us to propose future refinements to the model to improve accuracy, whilst also making conclusions about the differences in cancer cell characteristics.

Molecular Dynamics Simulations: Concept, Methods, and Applications

Introduction to Bioinformatics

Free-energy calculations in condensed matter: from early challenges to the advent of umbrella sampling

Article Open access 12 June 2024

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

Cancer is a disease that arises through the progressive alteration of normal cells. Solid cancers form what is commonly referred to as tumours, which are populations of cancerous cells joined together with connective tissue (Weinberg and Weinberg 2006). Over time, these tumours expand and take up residency in healthy tissue. To elucidate the mechanisms of the cancer pathophysiology, researchers have mainly focused on the analysis of aberrant cell dynamics within tumour structures. The pivotal role these abnormal cells play in oncogenesis constitutes a salient focus of these investigations (Noble 2002; Markowetz 2017).

Mathematical modelling of cancer growth and development has been used to study the dynamical process of cancer cells for many years (Altrock et al. 2015; Beerenwinkel et al. 2015; Barbolosi et al. 2016; Tabassum et al. 2019). Deterministic approaches, such as systems of ordinary differential equations (ODEs) and systems of partial differential equations (PDEs), have been used successfully to model cancer growth (Villasana and Radunskaya 2003; Yafia 2011; Tao et al. 2014; Jenner et al. 2020b; Dehingia et al. 2021; Klowss et al. 2022; VandenHeuvel et al. 2022). While insightful, generally these models do not capture the phenotypical and spatial heterogeneity that arises through stochastic processes of tumour growth, or consider the behaviour at an individual cancer cell level (Irurzun-Arana et al. 2020). In turn, such models often do not capture dynamical processes such as individual cell proliferation (Iyer et al. 2011; Sahoo et al. 2011) and cell movement (Groh and Louis 2010). For instance, Warne et al. (2022) highlight that continuum models need to align with specific assumptions that can diverge from reality. Specifically, standard deterministic continuous models that are typically used to represent cell invasions often require cell motility rates to dominate proliferation rates. Such conditions may not hold true for slow-growing tumours. In such tumours, pressure-driven mobility can serve as the principal catalyst for tumour growth. Consequently, stochastic discrete models provide a more adaptable framework for modelling various spatiotemporal patterns, such as clustering. As a result, parameter estimates relating to the proliferation rate of tumor cells are often underestimates of the true proliferation rate due to spatial clustering inhibiting growth through contact inhibition. By explicitly modelling the proliferation and motility processes of a spatially heterogeneous tumor using an ABM the accurate parameter estimates and realistic uncertainty quantification can be obtained, even in situations with spatially irregular boundaries with fingering patterns or multiple local clusters.

More recently, agent-based models (ABMs) have become a popular approach for capturing the stochastic nature of tumour growth (Ozik et al. 2018; Metzcar et al. 2019; Macnamara 2021; Klowss et al. 2022). ABMs model cells as agents, and the interactions of cells are governed by probabilistic rules that depend on physically-meaningful parameters (An 2012). There are two popular paradigms of ABMs: on-lattice and off-lattice models (Railsback and Grimm 2019). On-lattice models restrict movement of cells to neighbouring points on a pre-defined lattice. Whereas cell movement in off-lattice models are governed by interaction forces between neighbouring cells and the surrounding environment. Although, one drawback of ABMs is that model simulation can be more computationally expensive than solving a deterministic system of differential equations.

This work focuses on investigating and extending the Voronoi cell-based model (VCBM), an off-lattice ABM of tumour growth in 2-dimensions based on the model by Jenner et al. (2020a), with the concept originally introduced by Kansal et al. (2000b), Kansal et al. (2000a), Schaller and Meyer-Hermann (2005), Kempf et al. (2010), Fletcher et al. (2013) and more recently investigated by Cleri (2019), Germano et al. (2022). Agents are defined as either healthy cells or tumour cells. The movement of cells through the domain is modelled off-lattice using a force-balance equation based on Hooke’s law to account for cell adhesive and repulsive forces. Cell boundaries are then defined by a Voronoi tessellation.

Many growth factors are known to influence cell behaviour in a biphasic manner, causing an increase or decrease in cellular division contingent upon their concentration (Konstorum et al. 2013). A biphasic relationship has been observed between the speed of an invading tumour front and the concentration of collagen in the surrounding gel, whereby two growth phases can be observed in in vitro or in vivo tumour growth (Konstorum et al. 2013). This biphasic relationship could be due to the complex interaction between cell proliferation and migration (Perumpanani and Byrne 1999). This insight led to the development of mathematical models that could test these hypotheses (Marchant et al. 2006; Konstorum et al. 2013).

Marchant et al. (2006) successfully devised a theoretical model of malignant invasion that accurately reproduces the biphasic dependency of tumour cell invasion speed on the density of the surrounding normal tissue. To encapsulate this biphasic phenomenon effectively within the context of the VCBM, we introduce the concept of a switching time, such as in Murphy et al. (2022). This switching time refers to a specific point in time in which the tumour evolution transitions between two distinct growth phases, that is, it reflects transition of tumour growth in vivo from an ‘establishment’ phase, characterized by initial proliferation and localization, to an ‘expansion’ phase, where rapid and invasive growth occurs (Marchant et al. 2006). This transition point, derived from in vivo measurements, provides a biologically significant depiction of the tumour’s temporal evolution. Thus, such an approach enhances the VCBM’s ability to simulate the complex dynamics of tumour growth accurately.

Cancer cell proliferation is largely the driving factor of tumour growth and is impacted by spatial limitations and nutrient sources (Iyer et al. 2011; Sahoo et al. 2011; Jenner et al. 2020a). As such, the key mechanism of tumour growth in the VCBM is cancer cell proliferation (Cheng et al. 2012; Benzekry et al. 2014). Calibrating the cellular proliferation parameters to tumour growth measurements would provide a better understanding of the differences between tumour types. However, obtaining parameter estimates for an ABM is not trivial. This work proposes a Bayesian framework to calibrate cancer cell proliferation in the VCBM and capture the tumour growth in ovarian, pancreatic and breast tumour cell lines implanted in vivo.

ABMs are simulation-based models, for which inference can be challenging as the likelihood function for such models is often intractable. Hence, approximate Bayesian computation (ABC) (Sisson et al. 2018) is proposed to bypass the likelihood function and estimate the model parameters. ABC requires that the model can be simulated, but does not require evaluation of its corresponding likelihood function. ABC has been previously used to calibrate a small number of biological ABMs. For example, Lambert et al. (2018) apply ABC to a lattice model and Ross et al. (2017) use ABC to improve the experimental design of a wound-healing assay by calibrating a cell-cell adhesion model (Khain et al. 2007; Ross et al. 2015). Rocha et al. (2021) use ABC to calibrate to identify cell motility parameters using biological experiments in their ABM developed in PhysiCell (Ghaffarizadeh et al. 2018). However, they use simple ABC algorithms which require a large number of model simulations, and are thus computationally expensive when the model simulation is not computationally trivial. We calibrate the VCBM to tumour growth time series data using sequential Monte Carlo (SMC)-ABC, specifically the algorithm of Drovandi and Pettitt (2011), since it is much more efficient than the standard ABC rejection algorithm, and it can easily take advantage of parallel computing resources.

Understanding the limitations and flexibility of the VCBM is important to be able to determine its reliability as a predictor of tumour growth. The main contribution of this paper is to provide an efficient Bayesian approach for fitting off-lattice stochastic models of tumour growth to time series data of tumour volumes. As a case study, we extend the VCBM proposed by Jenner et al. (2020a) to a biphasic VCBM, which has not been previously calibrated to data in the literature. By using an efficient Bayesian algorithm, we are also able to thoroughly investigate if the VCBM is flexible enough to recover real tumour volume time series data across a range of cancers. Specifically, we calibrate the model to in vivo measurements for breast cancer, ovarian cancer and pancreatic cancer in mice obtained from Wade (2019), Wade et al. (2020) and Kim et al. (2011). Calibration results suggest that subcutaneous growth leads to less spatial inhibition as the tumour grows. Biomechanically, this suggests that the growth dynamics evolve alongside the tumour’s progression. Subcutaneously grown tumours are known for exhibiting negligible propensity for metastasis due to a lack of spatial constraints, which is one of the causes of metastases associated within an intra-organ growth (Schmidt et al. 2016). Furthermore, it is apparent that cellular proliferation changes over time due to potential cellular mutations that result in faster or slower cell proliferation (Bozic et al. 2010) or even necrosis from vascular limitations. Our calibration approach reveals that the model is sufficiently robust in capturing both monophasic and biphasic growth occurring across data sets.

We now describe the structure of the paper. In Sect. 2, we describe the data we analyse, explain the VCBM that we calibrate to the data and provide details of the ABC calibration method that we adopt for fitting the VCBM to data. Section 3 presents the results of the calibration process. In Sect. 4, we discuss the findings further, the limitations of our study, and directions for future research.

2 Methods

2.1 Experimental measurements: in vivo tumour volume

Measurements from three independent in vivo cancer datasets (Fig 1) are used to calibrate parameters in the VCBM: breast cancer cells (Kim et al. 2011), ovarian cancer cells (Kim et al. 2011), and pancreatic cancer cells (Wade 2019; Wade et al. 2020). In these experiments, tumour volume (mm$^3$) is approximated using caliper measurements in two-dimensions:

$$\begin{aligned} \textrm{volume} = \textrm{width} \times \frac{\textrm{length}^2}{2}, \end{aligned}$$

(1)

where $\textrm{width}$ (mm) is the longest tumour measurement and $\textrm{length}$ (mm) is the tumour measurement along a perpendicular axis (Kim et al. 2011). We also generate a synthetic dataset designed to mirror the quantity of data points present in experimental data. This approach enables an evaluation of the calibration method’s ability to accurately recover known parameter values. The synthetic and experimental datasets are briefly described below. Full details of the experimental data can be found in Kim et al. (2011) and Wade (2019), Wade et al. (2020).

2.1.1 Breast cancer data

Kim et al. (2011) measure the in vivo breast cancer tumour volume over 66 days, with measurements being recorded every second day. Each mouse is injected with Her2/neu-expressing human breast cancer cells MDA-MB435, and tumour volume measurements commenced when the seeded tumours reach 100–120 mm$^3$. Figure 1a depicts tumour growth data for four mice with breast cancer. We remove measurements from mouse two from day 58 onward as the tumour volume began to exhibit a decline, which we deem biologically infeasible for our chosen model assumptions. For more details, see Kim et al. (2011).

2.1.2 Ovarian cancer data

Kim et al. (2011) measure the in vivo ovarian cancer tumour volume over 25 days, with measurements being recorded daily in mice injected with SK-OV3 cells. The tumour volume measurements commence when the seeded tumours reach 100–120 mm$^3$. Figure 1b displays tumour growth data for three mice with ovarian cancer. For more details, see Kim et al. (2011).

2.1.3 Pancreatic cancer data

Tumour growth measurements in mice seeded with pancreatic cancer cells Mia-PaCa-2 are collected by Wade (2019), Wade et al. (2020). Figure 1c displays the four lines representing the tumour volume for four different mice over 33 days which were recorded every day. The cessation of tumour volume measurements occurred once the volume exceeded a given threshold. Hence, the final observation time varies between mice.

2.1.4 Synthetic cancer data

In this study, we generate three synthetic datasets using the model with predetermined parameter values, as outlined in Table S1. Specifically, we generate three synthetic cancer datasets utilizing a biphasic model with a fixed switching time. To ensure consistency with actual datasets that exhibit biphasic growth, which we observe to have a measurement length of 32 days, we set the length of these three synthetic datasets to 32 days. Additionally, we generate two synthetic datasets demonstrating monotonic exponential growth in order to validate the original VCBM.

2.2 Voronoi cell-based model (VCBM)

ABMs are popular computational models that model cells as agents with associated probabilistic sets of rules for the purpose of capturing stochastic cellular dynamics. In this study, we model tumour growth using an ABM as it allows us to capture the spatial evolution of a tumour. In addition, ABMs enable simulation of an array of stochastic processes and can account for individual cell behaviours, including low movement rates, and their interactions within the surrounding microenvironment. This is a more detailed description of the tumour compared to an analogous deterministic system (e.g. ODE or PDE). To enable flexibility in cellular movement processes, we use an off-lattice ABM. Although on-lattice ABMs (Lundh 2007; Lowengrub et al. 2009; Cristini and Lowengrub 2010; Voss-Böhme 2012; Wang et al. 2015; Poleszczuk et al. 2016; Norton et al. 2019) are less expensive to simulate, off-lattice ABMs can be more effective at simulating the heterogeneous nature of the tumour microenvironment.

Here, we describe the VCBM, an off-lattice model developed by Jenner et al. (2020a), Jenner et al. (2022) that simulates the 2-dimensional cellular dynamics of tumour formation. In this study, we simplify their model to capture control in vivo tumour growth only, omitting the treatment-related agents of virus-infected cancer cells, dead cancer cells, and empty space. As a result, we only investigate three primary dynamics governing tumour growth: cancer cell proliferation, cell movement, and cell invasion into healthy tissue. We assume that no cells die during the simulation and that healthy cells are pushed outward solely by the force of multiplying cancer cells.

2.2.1 Voronoi tessellation

The 2-dimensional spatial position of the ith cell agent at time t is defined by a center point $\mathrm {\varvec{r}}_{i}(t) \in \mathbb {R}^2$. We consider a lattice as the set of cell agent points represented by the vector $\mathrm {\varvec{r}}(t)$, where the size of $\mathrm {\varvec{r}}(t)$ depends on the number of cells in the domain at time t. We consider the area of a cell to be the region of space enclosed by a Voronoi tessellation formed by the cell centre point. A Voronoi tessellation is used to define the edges of a cell (see Fig. 2), i.e. the boundary of cell i is the line equidistant between the cell’s centre point and one of its connected cell’s centre points. Voronoi cells on the boundary have an infinite area, and these cells are not explicitly modelled as contributing to the VCBM dynamics. Compared to on-lattice models, the VCBM does not require fixed cell positions. We define the ith cell’s neighbourhood, $\mathcal {N}(i)$, as the set of cells connected to that cell by a Delaunay trangulation (Van Liedekerke et al. 2015).

2.2.2 Cancer cell movement

In the VCBM, cancer cell proliferation is the primary driver of cell movement. The network of forces interacting on cell i is modelled by Hooke’s Law (Fig. 3a), which computes the effective displacement of the cell and we use this to update the cell’s position. The force between cell i and its connected neighbour cell j is modelled by a damped spring (Fig. 3b), and the spring connecting cell i and cell j has a rest length $\textrm{s}_{i,j}(t)$, where we assume the rest length can be a function of time t. The displacement of the ith cell is given by

$$\begin{aligned} m_i\frac{\textrm{d}^2 \varvec{r}_i}{\textrm{d}t^2} = \sum _{j}\varvec{\textrm{F}}_{i,j}^{\textrm{I}} + \varvec{\textrm{F}}_{i}^{\textrm{V}}, \end{aligned}$$

(2)

where $m_i$ is the mass of cell i, $\varvec{r}_i$ is the position of ith cell’s centre point, $\varvec{\textrm{F}}_{i,j}^{\textrm{I}}$ is the interaction force between a neighbourhood cell j connected to cell i and $\varvec{\textrm{F}}_{i}^{\textrm{V}}$ is the viscous force acting on cell i.

Following Jenner et al. (2020a), Kansal et al. (2000a), Kansal et al. (2000b), Schaller and Meyer-Hermann (2005), Kempf et al. (2010), Fletcher et al. (2013), the total interaction force $\varvec{\textrm{F}}_{i}^{\textrm{I}}(t)$ acting on cell i at time t is given by the sum of the individual forces from the neighbourhood connected cells:

$$\begin{aligned} \varvec{\textrm{F}}_{i}^{\textrm{I}}(t) = \sum _{j}\varvec{\textrm{F}}_{i,j}^{\textrm{I}} = \mu \sum _{\forall j} \frac{\varvec{r}_{i,j}(t)}{\Vert \varvec{r}_{i,j}(t)\Vert }(s_{i,j}(t) - \Vert \varvec{r}_{i,j}(t)\Vert ), \end{aligned}$$

(3)

where $\mu $ is the spring constant, $\varvec{r}_{i,j}(t)$ is the vector from the ith point to jth point at time t, $s_{i,j}(t)$ is the spring rest length between cell i and j at time t and $\Vert \varvec{r}_{i,j}(t)\Vert $ is the Euclidean norm of $\varvec{r}_{i,j}(t)$. The viscous force acting on cell i is

$$\begin{aligned} \varvec{\textrm{F}}_i^{\textrm{I}} = -\varvec{\textrm{F}}_i^{\textrm{V}} = \mu \varvec{v}_i, \end{aligned}$$

(4)

(Galle et al. 2005; Macklin et al. 2012; Jenner et al. 2020a) where $\varvec{v}_i$ is the velocity of the ith point. By taking a small time interval $\Delta t$, the effective displacement of the ith point is

$$\begin{aligned} \varvec{r}_i(t+\Delta t) = \varvec{r}_i(t) + \frac{1}{\mu }\varvec{\textrm{F}}_i(t)\Delta t = \varvec{r}_i(t) + \lambda \sum _{\forall j} \frac{\varvec{r}_{i,j}(t)}{\Vert \varvec{r}_{i,j}(t)\Vert }(s_{i,j}(t) - \Vert \varvec{r}_{i,j}(t)\Vert ),\nonumber \\ \end{aligned}$$

(5)

where $\mu $ is the damping constant. This formulation is readily used in other ABM settings, for example see Murray et al. (2012), Murphy et al. (2019), Browning et al. (2019).

2.2.3 Cancer cell proliferation

For modelling cancer cell proliferation, the Euclidean distance, d, between cell i’s centre point $\varvec{r}_i(t)$ and the tumour boundary is used as a proxy for the nutrient source. The boundary of the tumour is defined as the set of tumour cells that reside on the tumour periphery (proliferating edge). We assume cell i can proliferate when $d < d_{\textrm{max}}$, where $d_{\textrm{max}}$ is the maximum radial distance a cell can be from the boundary and still proliferate. The probability of a cell dividing in a given time step $\Delta t$ is

$$\begin{aligned} \textrm{p}_d = p_0 \left( 1 - \frac{d}{d_{\textrm{max}}}\right) , \end{aligned}$$

(6)

where $p_0$ is a proliferation constant. In this way, we are defining the probability of proliferation as a function of a cell’s distance to the tumour boundary/nutrient source, as has been done similarly in Kansal et al. (2000a), Jiao and Torquato (2011).

Figure 3c shows that when cell i divides it creates two daughter cells i and l, which are placed at a random orientation equidistant from the original cell i’s position. To simulate the growth of a daughter cell to a full grown cell, the resting spring length between cell i and cell l is initialised as a value less than s and increases over time until $s_{i,l} = s$. Consider $t_d$ is the time since the last cell division, then

$$\begin{aligned} s_{i,l}(t) = \left\{ \begin{array}{cc}\frac{t_ds}{t_{\textrm{age}}}, &{} \ \ \ \ \text{ if } \text{ time } \text{ since } \text{ last } \text{ division } t_d\le t_{\textrm{age}} \\ s &{} \text{ otherwise }, \end{array}\right. \end{aligned}$$

(7)

where s is the mature resting spring length and $t_{\textrm{age}}$ is the time taken for a cell to grow to its full size. In other words, initially at the first time step immediately after division $t_d = 1$ hour and $s_{i,l} = s/t_{\textrm{age}}$. In the following timestep, the spring length will increase to $s_{i,l} = 2\times s/t_{\textrm{age}}$. The value for $s_{i,l}$ increases for $t_{\textrm{age}}$ time steps, i.e. until $t_d =t_{\textrm{age}}$. In this way, we approximate cell growth from a daughter cell size to an adult size which takes $t_{\textrm{age}}$ time steps. This variable spring length is what makes the rest spring length a function of time, i.e. $s_{i,l}(t)$. Once it has been $t_{\textrm{age}}$ time steps since division we have $s_{i,l}(t) = s$. Another factor for cancer cell proliferation is the time taken for a cell to be able to divide into two daughter cells. In the VCBM, the daughter cells take $g_{\textrm{age}}$ time steps to be able to divide again. Note that $t_{\textrm{age}} < g_{\textrm{age}}$.

2.2.4 Cell invasion

Due to the invasiveness of cancer, we assume a cancer cell’s daughter cell can replace a healthy cell with probability $p_{\textrm{psc}}$. In our model, this property refers to the probability that invasiveness occurs at the boundary of tumour tissue. This assumption is inspired by previous ABM work by Jiao and Torquato (2011).

2.2.5 Simulation

The model uses a time-step of one hour, i.e. $\Delta t = 1$, which is based off the knowledge that the normal cell cycle is about 24 h (Bernard and Herzel 2006) and ovarian cancer’s cell cycle length is greater than 16 h (Fisi et al. 2016). The VCBM simulation is started by first initialising a square domain with cells arranged in a hexagonal lattice. The cell closest to the centre of the domain is designated as a cancer cell and the remainder are designated as healthy cells. The VCBM is then simulated until it reaches a tumour volume of 100 mm$^2$, which is based off the experiment measurements (Wade 2019; Kim et al. 2011). Once the tumour reaches this volume, the lattice for healthy and cancerous cells is stored and this lattice is used to simulate the growth of a tumour over the required number of days for each simulation of the model. The model evolves by first checking whether any cancer cell proliferates using Eq. 6. For cancer cells that are not proliferating in that timestep, they are checked for whether they differentiate into an invasive cell using $p_{\textrm{psc}}$. Following this, all cells (healthy, cancerous) are moved using Hooke’s law, Eq. 5. The values of the underlying mechanical model (i.e. $\lambda $ and $\mu $) were taken from previous estimations in the literature (see Meineke et al. 2001; Jenner et al. 2020a). Comprehensive descriptions of the model simulation are available in (Jenner et al. 2020a, 2022).

2.2.6 Biphasic model

We find for some datasets that the rate of tumour growth appears to change over time (as shown in Fig. 1c). Here, we use a biphasic model to capture the change of growth dynamics. To investigate the biphasic tumour growth phenomenon we assume at time $t = \tau $ days, that the parameters can change value. We assume that there are four parameters, i.e. $\varvec{\theta } = (p_0,p_{\textrm{psc}},d_{\textrm{max}},g_{\textrm{age}})$ in the VCBM. In the biphasic version of the model, for $t < \tau $, the VCBM is governed by the parameters $\varvec{\theta }_1 = (p_0^1,p_{\textrm{psc}}^1,d_{\textrm{max}}^1,g_{\textrm{age}}^1)$, whilst for $t > \tau $ the VCBM is governed by the parameters $\varvec{\theta }_2 = (p_0^2,p_{\textrm{psc}}^2,d_{\textrm{max}}^2,g_{\textrm{age}}^2)$. If the tumour growth is monophasic, there are four unknown parameters $\varvec{\theta }$; if it is biphasic, there are nine unknown parameters $(\varvec{\theta }_1,\varvec{\theta }_2,\tau )$.

2.3 Bayesian inference with intractable likelihood

In Bayesian inference, the posterior distribution of the parameters $\varvec{\theta }$ is given by

$$\begin{aligned} \textrm{P}(\varvec{\theta }{\mid }\varvec{y}) \propto \textrm{P}(\varvec{y}{\mid }\varvec{\theta })\textrm{P}(\varvec{\theta }) , \end{aligned}$$

(8)

where the prior distribution $\textrm{P}(\varvec{\theta })$ is updated by the observation data $\varvec{y} = (y_1,y_2,\dots ,y_T)$ with length T via the likelihood function $\textrm{P}(\varvec{y}{\mid }\varvec{\theta })$. In this paper, the parameters for the monophasic VCBM are $\varvec{\theta } = (p_0,p_{\textrm{psc}},d_{\textrm{max}},g_{\textrm{age}})$ and for the biphasic VCBM are $\varvec{\theta }_1 = (p_0^1,p_{\textrm{psc}}^1,d_{\textrm{max}}^1,g_{\textrm{age}}^1)$, $\varvec{\theta }_2 = (p_0^2,p_{\textrm{psc}}^2,d_{\textrm{max}}^2,g_{\textrm{age}}^2)$ and $\tau $. However, the complexity of the biological data simulating process may render the corresponding likelihood function computationally infeasible (Beaumont 2019). When the likelihood function is intractable, standard Bayesian inference methods cannot be used to sample the posterior distribution. However, even when the likelihood function is intractable, it often remains feasible to generate simulations from the model.

2.3.1 Approximate Bayesian computation (ABC)

A method called approximate Bayesian computation (ABC) has been proposed to solve this problem by avoiding evaluation of the likelihood function (Sisson et al. 2018). The idea is to generate approximate samples from the parameter posterior by repeatedly simulating data $\varvec{x}$ from the model for different sets of parameter values and assessing how ‘similar’ it is with the observed data $\varvec{y}$. Parameter values that generate simulated data that are similar enough to the observed data are retained in the posterior sample. There are two main challenges in the successful implementation of ABC: choosing informative summary statistics (Prangle 2015) and defining a suitable distance function (Drovandi and Frazier 2022), $\rho (\varvec{y},\varvec{x})$, that measures the ‘closeness’ of observed data with the simulated data. In our paper, the summary statistics are the observed data itself since the length of tumour growth time series data is short (i.e. T is small), and we select the distance function as

$$\begin{aligned} \rho (\varvec{y},\varvec{x}) = \sum _{t=1}^T \big ({\textrm{log}(y_t) - \textrm{log}(x_t)}\big )^2. \end{aligned}$$

(9)

We compare the logarithm of the observed and simulated data since the tumour sizes tend to increase quickly with time. The approximate posterior implied by ABC is given by

$$\begin{aligned} \textrm{P}_{\epsilon }(\varvec{\theta } {\mid } \varvec{y}) \propto \textrm{P}(\varvec{\theta })\int \textrm{K}_{\epsilon }\big (\rho (\varvec{y},\varvec{x})\big )\textrm{P}(\varvec{x}{\mid }\varvec{\theta })d\varvec{x}, \end{aligned}$$

(10)

where $\textrm{K}_{\epsilon }: \mathbb {R}_+ \mapsto \mathbb {R}_+$ is called a kernel function with tolerance parameter $\epsilon $. Then, the approximate likelihood $\int \textrm{K}_{\epsilon }\big (\rho (\varvec{y},\varvec{x})\big )\textrm{P}(\varvec{x}{\mid }\varvec{\theta })d\varvec{x}$ can be estimated unbiasedly by using Monte Carlo via simulating $\varvec{x}$ from the model. Here, the kernel function $\textrm{K}_{\epsilon }$ is a weighting function that assigns higher weight to simulated data $\varvec{x}$ that is closer to observed data $\varvec{y}$. In this paper, we choose the indicator function as the kernel function, i.e., $\textrm{K}_{\epsilon }\big (\rho (\varvec{y},\varvec{x})\big ) = \mathbb {I}\big (\rho (\varvec{y},\varvec{x}) < \epsilon \big )$.

Some popular algorithms like the ABC-rejection algorithm (Sisson et al. 2018; Warne et al. 2019) and Markov Chain Monte Carlo (MCMC)-ABC (Marjoram et al. 2003; Bortot et al. 2007; Sisson and Fan 2011) have been used for parameter estimation in the context of biology (Beaumont 2010; Csilléry et al. 2010; Sunnåker et al. 2013; Beaumont 2019). Since ABC rejection usually generates proposed values of $\varvec{\theta }$ from the prior, it can be inefficient if the target posterior differs substantially from the prior. The MCMC-ABC method is often computationally more efficient than ABC-rejection, since it aims to search locally around areas of high posterior probability density. However, MCMC-ABC can also be computationally costly, since the Markov chain can become stuck in low posterior regions, and, due to its serial nature, cannot easily exploit parallel computing architectures. On the other hand, the SMC-ABC algorithm can be more efficient since it can easily harness parallel computing, and it only proposes from the prior in the first iteration, and sequentially improves the proposal distribution. The SMC-ABC replenishment algorithm of Drovandi and Pettitt (2011) is used in a cell biology application in Carr et al. (2021), and we use the same algorithm to calibrate the VCBM. The SMC-ABC replenishment algorithm is explained in more detail in the next section.

2.3.2 SMC-ABC replenishment algorithm

SMC-ABC samples from a sequence of increasingly accurate ABC posteriors based on defining a sequence of non-increasing tolerances $\epsilon _1 \ge \epsilon _2 \ge \dots \ge \epsilon _T$:

$$\begin{aligned} \textrm{P}_{\epsilon _t}(\varvec{\theta }{\mid }S(\varvec{y})) \propto \textrm{P}(\varvec{\theta })\int \mathbb {I}\big ( \rho (S(\varvec{y}),S(\varvec{x})) < \epsilon _t \big )\textrm{P}(S(\varvec{x}){\mid }\varvec{\theta }) dS(\varvec{x}), \text { for } t = 1,\dots ,T.\nonumber \\ \end{aligned}$$

(11)

The algorithm first draws N independent samples from the prior $\textrm{P}(\varvec{\theta })$, denoted here as $\{\varvec{\theta }^i\}_{i=1}^N$. Then, for each draw of $\varvec{\theta }^i$ (referred to as a particle), we simulate the stochastic model and compute its corresponding discrepancy $\rho ^i$ to produce $\{\varvec{\theta }^i,\rho ^i\}_{i=1}^N$. The particle set $\{\varvec{\theta }^i,\rho ^i\}_{i=1}^N$ is then sorted by the discrepancy $\rho $ such that $\rho ^1<\rho ^2<\cdots <\rho ^N$. We then set the first tolerance threshold as $\epsilon _1 = \rho ^N$, i.e. the largest discrepancy value in the particle set. In order to propagate particles through the sequence of target distributions, the next tolerance is defined dynamically as $\epsilon _{t} = \rho ^{N-N_a}$ (where initially $t=2$) where $N_a = \lfloor Na \rfloor $ and a is a tuning parameter that controls the adaptive selection of discrepancy thresholds and $\lfloor \cdot \rfloor $ is the floor function. Effectively, at each SMC iteration, we discard $a\times 100$% of the particle set with the largest value of the discrepancy.

After discarding $a\times 100$% of the particle set, there will only be $N-N_a$ particles remaining. In order to rejuvenate the particle set back to size N, we resample $N_a$ times with equal probability and replacement from the kept set of particles $\{\varvec{\theta }^{i},\rho ^{i}\}_{i=1}^{N-N_a}$, where both the parameter and discrepancy values are copied. Although the resampling step ensures that there are N particles in the set, it creates particle duplication.

In order to diversify the particle set, we apply an MCMC kernel to each of the resampled particles. The tuning parameters of the MCMC proposal distribution $q_{t}(\cdot {\mid }\cdot )$ can be obtained from the set of the particles following resampling, which are already distributed according to the ABC target with tolerance $\epsilon _{t}$. For example, if a multivariate normal random walk proposal is used, its covariance $\Sigma _t$ can be tuned based on the sample covariance computed from the particle set. Defining the current value of a particle we wish to move as $\varvec{\theta }$, we accept a proposal of parameter $\tilde{\varvec{\theta }} \sim \mathcal {N}({\varvec{\theta }},\varvec{\Sigma })$ and simulated data $\tilde{\varvec{x}} \sim \textrm{P}(\varvec{x}{\mid }\tilde{\varvec{\theta }})$ according to the probability

$$\begin{aligned} p_t = \textrm{min}\left( 1,\frac{\textrm{P}(\tilde{\varvec{\theta }})}{\textrm{P}(\varvec{\theta })} \mathbb {I}(\rho (\varvec{y},\tilde{\varvec{x}}) < \epsilon _t) \right) . \end{aligned}$$

(12)

The proposal densities do not appear in the above acceptance probability due to the symmetry of the multivariate normal random walk proposal.^{Footnote 1}

However, the above procedure may reject proposals and we may fail to move a large proportion of the resampled particles. Thus, we propose to apply $R_t$ iterations of the MCMC kernel to each resampled particle, so that there is a high probability of moving each resampled particle at least once. Specifically, we set $R_t = \lceil \frac{\log (c)}{\log (1-p_t^{\textrm{acc}})}\rceil $ where c is a tuning parameter of the algorithm and is, theoretically, the probability that a particle is not moved in the $R_t$ iterations. Here, $p_t^{\textrm{acc}}$ is the expected MCMC acceptance probability at the tth SMC iteration. Since $p_t^{\textrm{acc}}$ is unknown, we estimate it from $S_t < R_t$ trial MCMC iterations. Once it is estimated, we compute $R_t$ and perform the remaining $R_t - S_t$ MCMC iterations on each of the resampled particles. For the next iteration we set the number of trial MCMC iterations to $S_{t+1} = \lceil R_t/2\rceil $.

There are two ways the algorithm can be stopped. One stopping rule is activated if some desired tolerance $\epsilon _T$ is reached, i.e. the maximum discrepancy value in the particle set is below $\epsilon _T$. The second stopping rule is activated when the overall MCMC acceptance probability at a given SMC iteration falls below some user-defined $p_{\textrm{acc}}$, implying that too much computation is required to progress the algorithm further. The overall acceptance rate can be estimated by

$$\begin{aligned} \hat{p}_t^{\textrm{acc}} = \sum _{j=N-N_a+1}^N \sum _{k=1}^{R_t}p_t^{j,k}/\big (R_t(N-N_{a})\big ), \end{aligned}$$

(13)

where $p_t^{j,k}$ is the MCMC acceptance rate for particle j at MCMC iteration k and is obtained based on (12). For our study, we set the tuning parameters to be $a = 0.5$ and $c = 0.01$. We use the MCMC acceptance rate as the stopping rule of the algorithm and set $p_{\textrm{min}} = 0.005$.

2.4 Prior knowledge

The parameters $p_0$ and $p_{\textrm{psc}}$ are constrained to lie between zero and one as they are the probabilities of cell proliferation and cell invasion, respectively. Vague independent priors are used for each of these parameters, which, in each case, is a uniform distribution constrained by 0 and 1. For the switching time $\tau $, we set its prior as a uniform distribution constrained by 2 and the time the maximum measurement taken. Since $\tau $ refers to the day that switching occuring, we take $\tau $ as integer. We assign a uniform distribution constrained by 0 and the maximum measurement day times 24 h as prior to $g_\textrm{age}$ so that the prior has a relatively large variance to create a reasonably vague prior.

Figure S1–2 depicts the distribution of distances d for the population of tumour cells based on the volume of the tumour. The d values obtained are proportional to the volume of the tumour, so as the tumour expands, more cells will be located further from its edge. It appears that d remains between 0 and 30 for ranges of tumour volumes relevant to the in vivo datasets. In other words, no cell has a distance greater than about $d = 30$. Hence, we assign a uniform distribution constrained by 0 and 50 as the prior for $d_\textrm{max}$.

3 Results

In this section, we investigate the SMC-ABC algorithm’s ability to calibrate the VCBM. First, the SMC-ABC algorithm is applied to synthetic data generated from the model with known parameter values. Then, SMC-ABC is applied to three real tumour growth datasets to explore whether the VCBM can capture the tumour growth pattern for each type of cancer. The code and data is available via https://github.com/john-wang1015/Calibration_BVCBM.

We report the details of the computational time required for simulation and inference, so that extensions to our modeling and/or statistical methods may be considered. We calculate the computational cost of 1000 simulations, each with a time series length of 32 days, and parameter configurations drawn from the prior distribution. We report a computational cost range of 1.76–137.27 s per simulation. A significant driver of computational time is the variable $p_{\textrm{psc}}$, which leads to a notable change in computational time as its value increases. For the inference task, we utilize an Intel(R) Xeon(R) Gold 6140 CPU at 2.3 GHz, and parallelising across the 16 cores results in the total computation time for the SMC-ABC algorithm to be approximately 11 h when calibrating with the ovarian cancer dataset, approximately 34 h when calibrating with the breast cancer dataset, and approximately 9 to 16 h (depending on the length of each dataset) when calibrating with the pancreatic cancer dataset.

3.1 Validation with synthetic data

Prior to applying SMC-ABC to the observed experimental data sets, we first perform a preliminary investigation using synthetic datasets generated via simulation from the VCBM. This allows us to verify that the proposed SMC-ABC algorithm is able to produce expected results under simple settings, i.e. exhibit posterior concentration around the true values that were used to generate the data and for the posterior predictive distribution to be consistent with the generated data.

To validate the SMC-ABC method, we generate five sets of synthetic data using five “true” parameter settings (see Table S1 and plots for time series data in Fig. S4). Three of the datasets are simulated with biphasic growth and two with monophasic growth, as the real data (see Fig. 1) appear to exhibit both cases. For example, some mice in the pancreatic dataset appear to exhibit biphasic growth, whereas the mice in the breast cancer dataset appear to exhibit monophasic growth. The biphasic datasets use different values of $g_{\textrm{age}}$ for the two phases. During the generation of simulated datasets, we initialize the tumour volume at 200 ${{mm}}^3$, consistent with the real data.

We use the posterior samples to approximate the posterior predictive distribution of the tumour volumes. In the supplementary document, we plot the (0.25, 0.75), (0.1, 0.9) and (0.025, 0.975) posterior predictive intervals. Our results (see Figs. S6a, S7a, S8a, S9a and S10a) show that SMC-ABC can recover every synthetic dataset with reasonable accuracy, as the associated tumour volume falls within at least one of the intervals in the posterior predictive plots.

The estimated univariate posterior distributions of synthetic time series (see Figs. S6–S10) indicate that $g_{\textrm{age}}$ is the largest driving force for tumour growth in the sense that the posterior is substantially more concentrated compared to the prior. However, the posteriors for $p_0$ and $d_{\textrm{max}}$ are not substantially different to the prior, and thus cannot be identified from the data.

Although Eq. 6 has been widely used in ABM context to model tumour growth (Kansal et al. 2000a, b; Jiao and Torquato 2011; Pourhasanzade et al. 2017; Jenner et al. 2020a), the sensitivity of the parameter $d_{\textrm{max}}$ requires confirmation through simulation. Our simulation shows that $d_{\textrm{max}}$ is only sensitive at a small scale, as seen in Fig. S3a and S3b, but not for large values, such as 100 mm. This insensitivity may be due to the spring-based movement of cells, which allows new cells introduced through proliferation to have minimal impact on cells at the exterior. Consequently, the tumour edge and volume remain unaffected. As a result, the probability of cell division is not sensitive, as shown in Figure S1.

3.2 Monotonic tumour growth data

In this section, we use SMC-ABC to estimate parameters of the monophasic VCBM from the ovarian and breast cancer datasets shown earlier in Fig. 1a and b. We use only the monophasic model for these data sets, as we find that the standard VCBM provides a good fit to the data. Furthermore, the computational cost of fitting the standard VCBM with four parameters is substantially lower than fitting the biphasic VCBM with nine parameters. The parameter $g_{\textrm{age}}$ appears to be the most sensitive parameter driving tumour growth in breast and ovarian datasets (see Figs. 4 and 5). In the supplementary document, we show the posterior distributions and the bivariate plots for ovarian and breast tumour datasets from Figs. S11–22.

The posterior predictive distributions for the ovarian and breast cancer datasets are shown in Figs. 4a and 5a, respectively. Firstly, it is evident that the VCBM provides a good fit to both cancer datasets, as the observed datasets lie comfortably within the prediction intervals. However, two of three datasets (first and third one) in the ovarian cancer dataset show a linear trend in volume size whereas the VCBM predicts exponential growth, and so the model is not as accurate at predicting the volume sizes at later times.

For both ovarian and breast cancer, the posterior distributions for $p_{\textrm{psc}}$ and $d_{\textrm{max}}$ are very similar to the prior, indicating that the data are not providing any additional information about these parameters. The posterior distribution for $p_0$ is also similar to the prior for most datasets, except for some breast and ovarian datasets where there is some preference for smaller values of $p_0$.

In contrast, the data are providing substantial information about $g_{\textrm{age}}$, as indicated by a much more precise posterior distribution compared to the prior as shown in Figs. 4a and 5b. This indicates that $g_{\textrm{age}}$ strongly drives the dynamics of the VCBM, at least in terms of the tumour volume it produces. We also find there is positive correlation between $p_{\textrm{psc}}$ and $g_{\textrm{age}}$ in Fig. S18, S20 and S22 in the ovarian cancer dataset. This indicates that as the probability of cell invasion increases, the growth rate of tumour cell will decrease to compensate.

3.3 Biphasic tumour growth data

In this section, SMC-ABC is employed to analyze the pancreatic cancer datasets, fitting the data with the biphasic VCBM. As observed in Fig. 1c, the first mouse in the pancreatic cancer dataset exhibits a change in growth pattern prior to day 15. Given this, we apply the biphasic VCBM to the entire pancreatic cancer dataset.

The posterior predictive distribution demonstrates that the biphasic VCBM offers a suitable fit for the four pancreatic datasets, as evidenced by the observed datasets falling within the prediction intervals as shown in the first column of Fig. 6.

Our analysis of pancreatic tumor growth using the biphasic VCBM reveals that $g_{\textrm{age}}$ is an important parameter describing the tumor growth dynamics. Changes in the $g_{\textrm{age}}$ parameter can reflect alterations in the tumor microenvironment and genetic or epigenetic alterations in the tumor cells. For the first two mice in the pancreatic dataset, there is a significant difference in $g_{\textrm{age}}$ between the two phases of growth (as exhibited by non-overlapping posterior distributions). Furthermore, for these two mice, the switching time between the two growth phases is reasonably well identified. For the third and fourth mice in the pancreatic dataset, the posterior distribution of $g_{\textrm{age}}$ between the two growth phases has significant overlap, and the posterior distribution of the switching time is close to uniform over the experiment. This suggests that these two mice exhibit monophasic growth, and demonstrates how the biphasic VCBM can effectively reduce to a standard VCBM. In the supplementary document, we present the bivariate plots for parameters in Figs. S23–S26. A follow-up analysis could then apply the standard VCBM for these two mice, as the extra complexity of the biphasic VCBM is not warranted for these data sets.

4 Discussion

In this research, we develop a biphasic off-lattice ABM based on a Voronoi tessellation, which is an extension of Jenner et al. (2020a). We demonstrate the utility of the new model and the standard VCBM by calibrating them to real world tumour growth time series data.

The Voronoi Cell-Based model (VCBM) describes the stochastic nature of cancer cell proliferation and has an intractable likelihood function making it challenging to fit to data. The SMC-ABC replenishment algorithm suggested by Drovandi and Pettitt (2011) is suitable for calibrating the VCBM since the likelihood of the VCBM is intractable. While the SMC-ABC replenishment technique has been widely employed Carr et al. (2021), Varghese et al. (2020), Vo et al. (2015), Warne et al. (2020), this is one of the first times that an ABM like this has been calibrated to actual tumour growth data. The outcome of our calibration not only verifies the conclusion of Jenner et al. (2020a) that $g_{\textrm{age}}$ is the most sensitive parameter, but also demonstrates the biphasic VCBM can capture the switching of tumour growth dynamics and precisely estimate the value of $g_{\textrm{age}}$ in different phases.

In our study, we use breast cancer, pancreatic cancer and ovarian cancer tumour growth measurements in vivo to examine the robustness and flexibility of the VCBM. It is evident that $d_{\textrm{max}}$ and $p_{\textrm{psc}}$ are not informed by any of the data sets. This is most likely due to the fact that tumour volume is most impacted by the expansion of the tumour periphery, which only requires the cells on the periphery to proliferate. As such, modelling the probability of proliferation as a function of the distance to the tumour periphery is not informed by simply tumour volume measurements and requires more informative data collection. In contrast, $g_{\textrm{age}}$ provides rich information about the average time a cell takes to proliferate.

It is evident that some of the pancreatic datasets appear to have biphasic tumour growth patterns. To investigate this, we assumed there was some time at which tumour growth could vary and introduce $\tau $ which is a parameter that control the time at which the tumour growth dynamics change. Performing SMC-ABC, we saw that $d_{\textrm{max}}$ and $p_{\textrm{psc}}$ are not able to provide any information as the posterior for both phases are close to the prior. However, the $g_{\textrm{age}}$ posteriors indicate clearly that the average time for cell proliferation is different based on the two phases. Where it is clear that in the first phase, the average time to proliferation is slower than in the second phase. This highlights a shift in tumour growth which may relate to the expansion of the tumour in the subcutaneous tissue or potentially a mutation in the cell cycling. However, to conclude the cause of this shift needs more investigation from both an experimental and modelling point of view.

To reduce computational cost, we implemented the VCBM in only 2-dimensional space. In this way, we approximate tumour growth in vivo in 2-dimensions which we feel matched the experimental measurements which are also only in two dimensions. However, having the third dimension measurement of the tumour in vivo would provide us with an understanding of the irregularity in tumour volume that we may be missing by only considering 2-dimensional tumour growth. In future work, we hope to use 3-dimensional tumour volume measurements, such as Magnetic resonance imaging (MRI), to calibrate the 3-dimensional form of the VCBM and improve model accuracy. This would allow us to recapitulate tumour shape irregularities, which we currently assume are negligible in the 2-dimensional VCBM.

As our results suggest, the pancreatic cancer data set suggests that biphasic tumour growth can occur. The next steps for our model are to reformulate the cellular proliferation to be a function of tumour space, so that as the tumour grows we may be able to capture multiphasic growth. One idea for this, would be to consider sub-clonal populations within the tumour that arise stochastically with varying proliferation constants.

While the SMC-ABC is a relatively fast off-line algorithm, it is still computationally expensive as for every proposed parameter value generated in a Bayesian algorithm, a full dataset must be simulated. When simulation time is non-trivial, this then creates a computationally intensive calibration algorithm. On the other hand, online algorithms can iteratively update model parameter estimates as data are introduced sequentially. Such an approach would be useful for our application, since we would only need to simulate data forward one time step at each iteration and future work hopes to investigate this further.

Overall, given the slow adoption of likelihood-free algorithms that can infer parameters in ABMs, we feel our manuscript provides inspiration for others using ABMs applied in a biological context where data is available to attempt parameter inference. In turn, our results suggest that not all parameters are practically identifiable in the VCBM, which is information only gained through attempting to infer these parameters to data. Since the underlying structure of the VCBM is used in many different applications, this finding has a flow-on effect for currently published models, whose parameters may not be identifiable. It also motivates future experiments that may be used to identify these parameters, such as time-series flow cytometry measurements to identify cell proliferation markers. Lastly, the clear ability of our algorithm to identify the biphasic switching time of in vivo tumour growth suggests biologically that tumours grown subcutaneously in mice may in reality exhibit two phases. This then allows us the understand more deeply how in vivo tumour growth may differ from in situ tumour growth in humans and help inform our understanding of experimental findings.

In conclusion, we have applied SMC-ABC to calibrate the standard VCBM and biphasic extension with real breast cancer, pancreatic cancer and ovarian cancer tumour growth datasets. It is evident that $g_{\textrm{age}}$ is the informative parameter for the VCBM and allows it to recapitulate in vivo tumour growth data. Unfortunately, $p_{\textrm{psc}}$ and $d_{\textrm{max}}$ are non-identifiable. We also find the biphasic VCBM shows the ability to extract information from biphasic tumour growth datasets like a pancreatic tumour. We intend to extend the VCBM so that it can capture potentially multiphasic tumour growth pattern and also improve the computational cost of SMC-ABC by moving the algorithm online.

Notes

An optimal scaling of the covariance matrix for symmetric multivariate Gaussian proposals in the Metropolis–Hastings algorithm is $(2.38)^2/d$ where d is the number of parameters. For more information, see Gelman et al. (1996).

References

Altrock PM, Liu LL, Michor F (2015) The mathematics of cancer: integrating quantitative models. Nat Rev Cancer 15(12):730–745
CAS PubMed Google Scholar
An L (2012) Modeling human decisions in coupled human and natural systems: review of agent-based models. Ecol Model 229:25–36
Google Scholar
Barbolosi D, Ciccolini J, Lacarelle B, Barlési F, André N (2016) Computational oncology-mathematical modelling of drug regimens for precision medicine. Nat Rev Clin Oncol 13(4):242–254
PubMed Google Scholar
Beaumont MA (2010) Approximate Bayesian computation in evolution and ecology. Annu Rev Ecol Evol Syst 41:379–406
Google Scholar
Beaumont MA (2019) Approximate Bayesian computation. Annu Rev Stat Appl 6:379–403
MathSciNet Google Scholar
Beerenwinkel N, Schwarz RF, Gerstung M, Markowetz F (2015) Cancer evolution: mathematical models and computational inference. Syst Biol 64(1):e1–e25
CAS PubMed Google Scholar
Benzekry S, Lamont C, Beheshti A, Tracz A, Ebos JM, Hlatky L, Hahnfeldt P (2014) Classical mathematical models for description and prediction of experimental tumor growth. PLoS Comput Biol 10(8):e1003800
PubMed PubMed Central ADS Google Scholar
Bernard S, Herzel H (2006) Why do cells cycle with a 24 hour period? Genome Inform 17(1):72–79
CAS PubMed Google Scholar
Bortot P, Coles SG, Sisson SA (2007) Inference for stereological extremes. J Am Stat Assoc 102(477):84–92
MathSciNet CAS Google Scholar
Bozic I, Antal T, Ohtsuki H, Carter H, Kim D, Chen S, Karchin R, Kinzler KW, Vogelstein B, Nowak MA (2010) Accumulation of driver and passenger mutations during tumor progression. Proc Natl Acad Sci 107(43):18545–18550
CAS PubMed PubMed Central ADS Google Scholar
Browning AP, Woodhouse FG, Simpson MJ (2019) Reversible signal transmission in an active mechanical metamaterial. Proc R Soc A 475(2227):20190146
MathSciNet PubMed PubMed Central ADS Google Scholar
Carr MJ, Simpson MJ, Drovandi C (2021) Estimating parameters of a stochastic cell invasion model with fluorescent cell cycle labelling using approximate Bayesian computation. J R Soc Interface 18(182):20210362
PubMed PubMed Central Google Scholar
Cheng L, Yang K, Chen Q, Liu Z (2012) Organic stealth nanoparticles for highly effective in vivo near-infrared photothermal therapy of cancer. ACS Nano 6(6):5605–5613
CAS PubMed Google Scholar
Cleri F (2019) Agent-based model of multicellular tumor spheroid evolution including cell metabolism. Eur Phys J E 42:1–15
CAS Google Scholar
Cristini V, Lowengrub J (2010) Multiscale modeling of cancer: an integrated experimental and mathematical modeling approach. Cambridge University Press, Cambridge
Google Scholar
Csilléry K, Blum MG, Gaggiotti OE, François O (2010) Approximate Bayesian computation (ABC) in practice. Trends Ecol Evol 25(7):410–418
PubMed Google Scholar
Dehingia K, Sarmah HK, Jeelani MB (2021) A brief review on cancer research and its treatment through mathematical modelling. Ann Cancer Res Ther 29:34–40
Google Scholar
Drovandi C, Frazier DT (2022) A comparison of likelihood-free methods with and without summary statistics. Stat Comput 32(3):1–23
MathSciNet Google Scholar
Drovandi CC, Pettitt AN (2011) Estimation of parameters for macroparasite population evolution using approximate Bayesian computation. Biometrics 67(1):225–233
MathSciNet CAS PubMed Google Scholar
Fisi V, Kátai E, Bogner P, Miseta A, Nagy T (2016) Timed, sequential administration of paclitaxel improves its cytotoxic effectiveness in a cell culture model. Cell Cycle 15(9):1227–1233
CAS PubMed PubMed Central Google Scholar
Fletcher AG, Osborne JM, Maini PK, Gavaghan DJ (2013) Implementing vertex dynamics models of cell populations in biology within a consistent computational framework. Prog Biophys Mol Biol 113(2):299–326
CAS PubMed Google Scholar
Galle J, Loeffler M, Drasdo D (2005) Modeling the effect of deregulated proliferation and apoptosis on the growth dynamics of epithelial cell populations in vitro. Biophys J 88(1):62–75
CAS PubMed Google Scholar
Gelman A, Roberts G, Gilks W (1996) Efficient metropolis jumping rules. Bayesian Stat 5(599–608):42
Google Scholar
Germano DP, Zanca A, Johnston ST, Flegg JA, Osborne JM (2022) Free and interfacial boundaries in individual-based models of multicellular biological systems. bioRxiv, pp 2022–12
Ghaffarizadeh A, Heiland R, Friedman SH, Mumenthaler SM, Macklin P (2018) PhysiCell: an open source physics-based cell simulator for 3-d multicellular systems. PLoS Comput Biol 14(2):e1005991
PubMed PubMed Central ADS Google Scholar
Groh A, Louis AK (2010) Stochastic modelling of biased cell migration and collagen matrix modification. J Math Biol 61(5):617–647
MathSciNet PubMed Google Scholar
Irurzun-Arana I, Rackauckas C, McDonald TO, Trocóniz IF (2020) Beyond deterministic models in drug discovery and development. Trends Pharmacol Sci 41(11):882–895
CAS PubMed PubMed Central Google Scholar
Iyer K, Sankaran S, Athale R (2011) Stochastic modelling of tumour immune interactions. In: Proceedings of the international conference on bioinformatics & computational biology (BIOCOMP), p 1
Jenner AL, Frascoli F, Coster AC, Kim PS (2020a) Enhancing oncolytic virotherapy: observations from a Voronoi cell-based model. J Theor Biol 485:110052
CAS PubMed Google Scholar
Jenner AL, Frascoli F, Yun C-O, Kim PS (2020b) Optimising hydrogel release profiles for viro-immunotherapy using oncolytic adenovirus expressing IL-12 and GM-CSF with immature dendritic cells. Appl Sci 10(8):2872
CAS Google Scholar
Jenner A, Kelly W, Dallaston M, Araujo R, Parfitt I, Steinitz D, Pooladvand P, Kim PS, Wade SJ, Vine KL (2022) Examining the efficacy of localised gemcitabine therapy for the treatment of pancreatic cancer using a hybrid agent-based model. BioRxiv
Jiao Y, Torquato S (2011) Emergent behaviors from a cellular automaton model for invasive tumor growth in heterogeneous microenvironments. PLoS Comput Biol 7(12):e1002314
CAS PubMed PubMed Central ADS Google Scholar
Kansal A, Torquato S, Harsh Iv G, Chiocca E, Deisboeck T (2000a) Cellular automaton of idealized brain tumor growth dynamics. Biosystems 55(1–3):119–127
CAS PubMed Google Scholar
Kansal AR, Torquato S, Harsh Iv G, Chiocca E, Deisboeck T (2000b) Simulated brain tumor growth dynamics using a three-dimensional cellular automaton. J Theor Biol 203(4):367–382
CAS PubMed ADS Google Scholar
Kempf H, Bleicher M, Meyer-Hermann M (2010) Spatio-temporal cell dynamics in tumour spheroid irradiation. Eur Phys J D 60(1):177–193
CAS ADS Google Scholar
Khain E, Sander LM, Schneider-Mizell CM (2007) The role of cell–cell adhesion in wound healing. J Stat Phys 128(1):209–218
ADS Google Scholar
Kim P-H, Sohn J-H, Choi J-W, Jung Y, Kim SW, Haam S, Yun C-O (2011) Active targeting and safety profile of peg-modified adenovirus conjugated with herceptin. Biomaterials 32(9):2314–2326
CAS PubMed Google Scholar
Klowss JJ, Browning AP, Murphy RJ, Carr EJ, Plank MJ, Gunasingh G, Haass NK, Simpson MJ (2022) A stochastic mathematical model of 4d tumour spheroids with real-time fluorescent cell cycle labelling. J R Soc Interface 19(189):20210903
PubMed PubMed Central Google Scholar
Konstorum A, Sprowl SA, Waterman ML, Lander AD, Lowengrub JS (2013) Predicting mechanism of biphasic growth factor action on tumor growth using a multi-species model with feedback control. J Coupled Syst Multiscale Dyn 1(4):459–467
PubMed PubMed Central Google Scholar
Lambert B, MacLean AL, Fletcher AG, Combes AN, Little MH, Byrne HM (2018) Bayesian inference of agent-based models: a tool for studying kidney branching morphogenesis. J Math Biol 76(7):1673–1697
MathSciNet PubMed PubMed Central Google Scholar
Lowengrub JS, Frieboes HB, Jin F, Chuang Y-L, Li X, Macklin P, Wise SM, Cristini V (2009) Nonlinear modelling of cancer: bridging the gap between cells and tumours. Nonlinearity 23(1):R1
MathSciNet Google Scholar
Lundh T (2007) Cellular automaton modeling of biological pattern formation: characterization, applications, and analysis authors: Andreas deutsch and sabine dormann, Birkhäuser, 2005, xxvi, 334 p, 131 illus
Macklin P, Edgerton ME, Thompson AM, Cristini V (2012) Patient-calibrated agent-based modelling of ductal carcinoma in situ (DCIS): from microscopic measurements to macroscopic predictions of clinical progression. J Theor Biol 301:122–140
MathSciNet PubMed PubMed Central ADS Google Scholar
Macnamara CK (2021) Biomechanical modelling of cancer: agent-based force-based models of solid tumours within the context of the tumour microenvironment. Comput Syst Oncol 1(2):e1018
Google Scholar
Marchant BP, Norbury J, Byrne HM (2006) Biphasic behaviour in malignant invasion. Math Med Biol 23(3):173–196
PubMed Google Scholar
Marjoram P, Molitor J, Plagnol V, Tavaré S (2003) Markov chain Monte Carlo without likelihoods. Proc Natl Acad Sci 100(26):15324–15328
CAS PubMed PubMed Central ADS Google Scholar
Markowetz F (2017) All biology is computational biology. PLoS Biol 15(3):e2002050
PubMed PubMed Central Google Scholar
Meineke FA, Potten CS, Loeffler M (2001) Cell migration and organization in the intestinal crypt using a lattice-free model. Cell Prolif 34(4):253–266
CAS PubMed PubMed Central Google Scholar
Metzcar J, Wang Y, Heiland R, Macklin P (2019) A review of cell-based computational modeling in cancer biology. JCO Clin Cancer Inform 2:1–13
Google Scholar
Murphy RJ, Buenzli PR, Baker R, Simpson MJ (2019) A one-dimensional individual-based mechanical model of cell movement in heterogeneous tissues and its coarse-grained approximation. Proc R Soc A 475(2227):20180838
MathSciNet CAS PubMed PubMed Central ADS Google Scholar
Murphy RJ, Maclaren OJ, Calabrese AR, Thomas PB, Warne DJ, Williams ED, Simpson MJ (2022) Computationally efficient framework for diagnosing, understanding and predicting biphasic population growth. J R Soc Interface 19(197):20220560
PubMed PubMed Central Google Scholar
Murray PJ, Edwards CM, Tindall MJ, Maini PK (2012) Classifying general nonlinear force laws in cell-based models via the continuum limit. Phys Rev E 85(2):021921
ADS Google Scholar
Noble D (2002) The rise of computational biology. Nat Rev Mol Cell Biol 3(6):459–463
PubMed Google Scholar
Norton K-A, Gong C, Jamalian S, Popel AS (2019) Multiscale agent-based and hybrid modeling of the tumor immune microenvironment. Processes 7(1):37
CAS PubMed Google Scholar
Ozik J, Collier N, Wozniak JM, Macal C, Cockrell C, Friedman SH, Ghaffarizadeh A, Heiland R, An G, Macklin P (2018) High-throughput cancer hypothesis testing with an integrated Physicell-EMEWS workflow. BMC Bioinform 19(18):81–97
Google Scholar
Perumpanani A, Byrne H (1999) Extracellular matrix concentration exerts selection pressure on invasive cells. Eur J Cancer 35(8):1274–1280
CAS PubMed Google Scholar
Poleszczuk J, Macklin P, Enderling H (2016) Agent-based modeling of cancer stem cell driven solid tumor growth. In: Stem cell heterogeneity. Springer, pp 335–346
Pourhasanzade F, Sabzpoushan S, Alizadeh AM, Esmati E (2017) An agent-based model of avascular tumor growth: Immune response tendency to prevent cancer development. Simulation 93(8):641–657
Google Scholar
Prangle D (2015) Summary statistics in approximate Bayesian computation. arXiv preprint arXiv:1512.05633
Railsback SF, Grimm V (2019) Agent-based and individual-based modeling: a practical introduction. Princeton University Press, Princeton
Google Scholar
Rocha HL, Godet I, Kurtoglu F, Metzcar J, Konstantinopoulos K, Bhoyar S, Gilkes DM, Macklin P (2021) A persistent invasive phenotype in post-hypoxic tumor cells is revealed by fate mapping and computational modeling. iScience 24(9):102935
CAS PubMed PubMed Central ADS Google Scholar
Ross RJ, Yates CA, Baker RE (2015) Inference of cell-cell interactions from population density characteristics and cell trajectories on static and growing domains. Math Biosci 264:108–118
MathSciNet PubMed Google Scholar
Ross RJ, Baker RE, Parker A, Ford M, Mort R, Yates C (2017) Using approximate Bayesian computation to quantify cell-cell adhesion parameters in a cell migratory process. NPJ Syst Biol Appl 3(1):1–10
Google Scholar
Sahoo S, Sahoo A, Shearer S (2011) Stochastic modelling of avascular tumour growth and therapy. Phys Scr 83(4):045801
ADS Google Scholar
Schaller G, Meyer-Hermann M (2005) Multicellular tumor spheroid in an off-lattice Voronoi–Delaunay cell model. Phys Rev E 71(5):051910
MathSciNet ADS Google Scholar
Schmidt KM, Geissler EK, Lang SA (2016) Subcutaneous murine xenograft models: a critical tool for studying human tumor growth and angiogenesis in vivo. In: Tumor angiogenesis assays: methods and protocols. Springer, pp 129–137
Sisson SA, Fan Y (2011) Likelihood-free MCMC. Handbook of Markov Chain Monte Carlo, pp 313–335
Sisson SA, Fan Y, Beaumont M (2018) Handbook of approximate Bayesian computation. CRC Press, Boca Raton
Google Scholar
Sunnåker M, Busetto AG, Numminen E, Corander J, Foll M, Dessimoz C (2013) Approximate Bayesian computation. PLoS Comput Biol 9(1):e1002803
MathSciNet PubMed PubMed Central ADS Google Scholar
Tabassum S, Rosli NB, Mazalan MSAB (2019) Mathematical modeling of cancer growth process: a review. In: Journal of physics: conference series, vol 1366. IOP Publishing
Tao Y, Guo Q, Aihara K (2014) A partial differential equation model and its reduction to an ordinary differential equation model for prostate tumor growth under intermittent hormone therapy. J Math Biol 69(4):817–838
MathSciNet PubMed Google Scholar
Van Liedekerke P, Palm M, Jagiella N, Drasdo D (2015) Simulating tissue mechanics with agent-based models: concepts, perspectives and some novel results. Comput Part Mech 2(4):401–444
Google Scholar
VandenHeuvel DJ, Drovandi C, Simpson MJ (2022) Computationally efficient mechanism discovery for cell invasion with uncertainty quantification. bioRxiv
Varghese A, Drovandi C, Mira A, Mengersen K (2020) Estimating a novel stochastic model for within-field disease dynamics of banana bunchy top virus via approximate Bayesian computation. PLoS Comput Biol 16(5):e1007878
CAS PubMed PubMed Central ADS Google Scholar
Villasana M, Radunskaya A (2003) A delay differential equation model for tumor growth. J Math Biol 47(3):270–294
MathSciNet PubMed Google Scholar
Vo BN, Drovandi CC, Pettitt AN, Pettet GJ (2015) Melanoma cell colony expansion parameters revealed by approximate Bayesian computation. PLoS Comput Biol 11(12):e1004635
PubMed PubMed Central ADS Google Scholar
Voss-Böhme A (2012) Multi-scale modeling in morphogenesis: a critical analysis of the cellular Potts model
Wade SJ (2019) Fabrication and preclinical assessment of drug eluting wet spun fibres for pancreatic cancer treatment
Wade SJ, Sahin Z, Piper A-K, Talebian S, Aghmesheh M, Foroughi J, Wallace GG, Moulton SE, Vine KL (2020) Dual delivery of gemcitabine and paclitaxel by wet-spun coaxial fibers induces pancreatic ductal adenocarcinoma cell death, reduces tumor volume, and sensitizes cells to radiation. Adv Healthc Mater 9(21):2001115
CAS Google Scholar
Wang Z, Butner JD, Kerketta R, Cristini V, Deisboeck TS (2015) Simulating cancer growth with multiscale agent-based modeling. In: Seminars in cancer biology, vol 30. Elsevier, pp 70–78
Warne DJ, Baker RE, Simpson MJ (2019) Simulation and inference algorithms for stochastic biochemical reaction networks: from basic concepts to state-of-the-art. J R Soc Interface 16(151):20180943
PubMed PubMed Central Google Scholar
Warne DJ, Ebert A, Drovandi C, Hu W, Mira A, Mengersen K (2020) Hindsight is 2020 vision: a characterisation of the global response to the Covid-19 pandemic. BMC Public Health 20:1–14
Google Scholar
Warne DJ, Baker RE, Simpson MJ (2022) Rapid Bayesian inference for expensive stochastic models. J Comput Graph Stat 31(2):512–528
MathSciNet Google Scholar
Weinberg RA, Weinberg RA (2006) The biology of cancer. WW Norton & Company, New York
Google Scholar
Yafia R (2011) A study of differential equation modeling malignant tumor cells in competition with immune system. Int J Biomath 4(02):185–206
MathSciNet Google Scholar

Download references

Acknowledgements

We thank the computational resources provided by QUT’s High Performance Computing and Research Support Group (HPC). We also thank the authors of the previously published experimental data Dr Chae-Ok Yun, Dr Kara Perrow and Dr Samantha Wade for supplying the original published data sets. Xiaoyu Wang and Christopher Drovandi were supported by an Australian Research Council Future Fellowship (FT210100260). Adrianne L. Jenner was supported by the QUT Early Career Researcher Scheme. The project was partly supported by the Centre for Data Science First Byte grant.

Funding

Open Access funding enabled and organized by CAUL and its Member Institutions.

Author information

Authors and Affiliations

School of Mathematical Sciences, Queensland University of Technology, Brisbane, QLD, Australia
Xiaoyu Wang, Adrianne L. Jenner, David J. Warne & Christopher Drovandi
Centre for Data Science, Queensland University of Technology, Brisbane, QLD, Australia
Xiaoyu Wang, Adrianne L. Jenner, Robert Salomone, David J. Warne & Christopher Drovandi
School of Computer Science, Queensland University of Technology, Brisbane, QLD, Australia
Robert Salomone

Authors

Xiaoyu Wang
View author publications
You can also search for this author in PubMed Google Scholar
Adrianne L. Jenner
View author publications
You can also search for this author in PubMed Google Scholar
Robert Salomone
View author publications
You can also search for this author in PubMed Google Scholar
David J. Warne
View author publications
You can also search for this author in PubMed Google Scholar
Christopher Drovandi
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Xiaoyu Wang.

Ethics declarations

Conflict of interest

The authors have no competing interests to declare that are relevant to the content of this article.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Below is the link to the electronic supplementary material.

Supplementary file 1 (pdf 3127 KB)

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Wang, X., Jenner, A.L., Salomone, R. et al. Calibration of agent based models for monophasic and biphasic tumour growth using approximate Bayesian computation. J. Math. Biol. 88, 28 (2024). https://doi.org/10.1007/s00285-024-02045-4

Download citation

Received: 28 June 2023
Revised: 25 October 2023
Accepted: 27 December 2023
Published: 15 February 2024
DOI: https://doi.org/10.1007/s00285-024-02045-4

Keywords

Mathematics Subject Classification

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Calibration of agent based models for monophasic and biphasic tumour growth using approximate Bayesian computation

Abstract

Similar content being viewed by others

Molecular Dynamics Simulations: Concept, Methods, and Applications

Introduction to Bioinformatics

Free-energy calculations in condensed matter: from early challenges to the advent of umbrella sampling

1 Introduction

2 Methods

2.1 Experimental measurements: in vivo tumour volume

2.1.1 Breast cancer data

2.1.2 Ovarian cancer data

2.1.3 Pancreatic cancer data

2.1.4 Synthetic cancer data

2.2 Voronoi cell-based model (VCBM)

2.2.1 Voronoi tessellation

2.2.2 Cancer cell movement

2.2.3 Cancer cell proliferation

2.2.4 Cell invasion

2.2.5 Simulation

2.2.6 Biphasic model

2.3 Bayesian inference with intractable likelihood

2.3.1 Approximate Bayesian computation (ABC)

2.3.2 SMC-ABC replenishment algorithm

2.4 Prior knowledge

3 Results

3.1 Validation with synthetic data

3.2 Monotonic tumour growth data

3.3 Biphasic tumour growth data

4 Discussion

Notes

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Supplementary Information

Supplementary file 1 (pdf 3127 KB)

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Mathematics Subject Classification

Search

Navigation