Calibration of stochastic, agent-based neuron growth models with approximate Bayesian computation

Duswald, Tobias; Breitwieser, Lukas; Thorne, Thomas; Wohlmuth, Barbara; Bauer, Roman

doi:10.1007/s00285-024-02144-2

Calibration of stochastic, agent-based neuron growth models with approximate Bayesian computation

Open access
Published: 08 October 2024

Volume 89, article number 50, (2024)
Cite this article

Download PDF

You have full access to this open access article

Journal of Mathematical Biology Aims and scope Submit manuscript

Calibration of stochastic, agent-based neuron growth models with approximate Bayesian computation

Download PDF

2 Accesses
Explore all metrics

Abstract

Understanding how genetically encoded rules drive and guide complex neuronal growth processes is essential to comprehending the brain’s architecture, and agent-based models (ABMs) offer a powerful simulation approach to further develop this understanding. However, accurately calibrating these models remains a challenge. Here, we present a novel application of Approximate Bayesian Computation (ABC) to address this issue. ABMs are based on parametrized stochastic rules that describe the time evolution of small components–the so-called agents–discretizing the system, leading to stochastic simulations that require appropriate treatment. Mathematically, the calibration defines a stochastic inverse problem. We propose to address it in a Bayesian setting using ABC. We facilitate the repeated comparison between data and simulations by quantifying the morphological information of single neurons with so-called morphometrics and resort to statistical distances to measure discrepancies between populations thereof. We conduct experiments on synthetic as well as experimental data. We find that ABC utilizing Sequential Monte Carlo sampling and the Wasserstein distance finds accurate posterior parameter distributions for representative ABMs. We further demonstrate that these ABMs capture specific features of pyramidal cells of the hippocampus (CA1). Overall, this work establishes a robust framework for calibrating agent-based neuronal growth models and opens the door for future investigations using Bayesian techniques for model building, verification, and adequacy assessment.

Mathematical Models in Neuroscience: Approaches to Experimental Design and Reliable Parameter Determination

Neural Modeling

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

The brain is the human’s most complex organ and comprises roughly 86 billion neurons (Azevedo et al. 2009; Herculano-Houzel 2009), each of which is connected to hundreds or thousands of others via synapses. Brain regions accommodate different neuron types, providing specialized function for specific tasks. For example, the primate neocortex is made up of pyramidal cells (70%) as well as calretinin, calbindin, and parvalbumin-expressing interneurons (30%) (DeFelipe and Fariñas 1992; Markram et al. 2004; Elston 2011; DeFelipe 1997; Torres-Gomez et al. 2020). Researchers differentiate neurons by their location in the brain and their morphology, i.e., their shape and form, but even neurons originating from the same species and brain region may show significant morphological differences (Deitcher et al. 2017). Theoretical considerations, e.g., regarding the information capacity of the genome, have led various scholars to conclude that the brain’s wiring and the neurons’ morphologies likely emerge from simple developmental rules (Linsker 1986; Hassan and Hiesinger 2015; Zador 2019). Mechanistic, agent-based neuron growth models hold the potential to investigate this hypothesis and unlock a deeper understanding of how neurons grow and build their elaborate networks.

A rich set of mathematical models has been established to capture the diverse properties of neurons, contributing to a more comprehensive understanding of these complex cells. Early research efforts focused on understanding responses to external electrical stimuli, leading to the development of influential models such as the Hodgkin-Huxley model (Hodgkin and Huxley 1952). Simple, rate-based models capture the signal processing capabilities of neurons embedded in networks and became the workhorse of modern artificial intelligence applications, driving much of success in processing images and text (LeCun et al. 2015). Conceptually different methods have been developed to recreate structures in line with the characteristic neuronal morphologies. Notable examples use L-systems (Lindenmayer 1968; Hamilton 1993; Ascoli et al. 2001), statistically sample components from data (Torben-Nielsen et al. 2008), or derive the structure from optimal wiring principles (Cuntz et al. 2010). While these approaches successfully model the morphology, they provide limited insights into the fundamental processes driving the growth (Zubler and Douglas 2009).

In this work, we focus on mechanistic, agent-based models (ABM) simulating neuronal growth. In contrast to previously-mentioned approaches, such models are based on first principles, act on local information, and simulate growth in a biologically realistic manner. The simulation begins with a simple initial configuration of a single neuronal soma. Afterward, the dendrites and the axons form and extend from the soma to shape the neuron. The models are based on discrete compartments, so-called agents. These agents act independently based on internal state variables and external, local information. Stochastic rules define their behavior, e.g., the rules may encode stochastic branching processes or random walk models. A comprehensive description of the mathematics governing these mechanistic ABMs may be found in the work of Zubler and Douglas (2009). Ultimately, these models yield artificial neurons, which can be compared to real neurons. This comparison is non-trivial and carried out by reducing the neurons to their morphometrics (Torben-Nielsen and Cuntz 2014), i.e., a set of quantities of interest that capture the structural information of the neuron.

Early ABMs and related modelling studies for neuronal growth explored and explained various developmental aspects such as cell proliferation, polarization, and migrations (Ryder et al. 1999; Shinbrot 2006; Samuels et al. 1996; Cai et al. 2006) as well as growth cone steering and neurite extension (Krottje and van Ooyen 2007; Goodhill et al. 2004; Kiddie et al. 2005). Later works composed increasingly extensive models, e.g., Bauer et al. (2014a, 2014b) presented models probing how connectivity arises in the neocortex and how first principles can lead to winner-takes-it-all circuits. Using similar techniques, Kassraian-Fard et al. (2020) presented an ABM explaining axonal branching. More recently, Shree et al. (2022) observed growing neurons in-vivo over time and derived a detailed ABM for sensory neurons.

While different growth models have been proposed, inferring their (latent) parameters presents a significant challenge because the models are strongly stochastic and data is usually limited. Mathematically, we face a stochastic inverse problem: given some data $y_{ obs }$, find the distribution of the model parameters $p(\Theta | y_{ obs })$ which best explain the data. Simple optimization-based procedures for identifying parameters may give false confidence in their values as they typically do not account for uncertainties. However, Bayesian methods have shown significant advances over the past three decades (Martin et al. 2020, 2024), such that solving the inverse problem for neuronal structures in the presence of uncertainties is within reach.

In this work, we advocate embedding the growth models into a Bayesian framework to better understand the problem’s characteristics, fusing ideas from theoretical neuroscience, computer science, and statistics to address the stochastic inverse problem. We describe an abstract concept termed resource-driven neuron growth model motivated through experimental findings highlighting the effect of the neuron’s transport system on its morphology and consider two simple representatives thereof. We propose using Approximate Bayesian Computation (ABC) (Tavaré et al. 1997; Pritchard et al. 1999; Beaumont et al. 2002; Marjoram et al. 2003; Csilléry et al. 2010; Beaumont 2010; Sisson et al. 2007, 2019) combined with a selected set of morphometrics and statistical distances (Bernton et al. 2019; Nadjahi et al. 2020; Jiang 2018; Fujisawa et al. 2021) to find approximations to the posterior distribution $p(\Theta | y_{ obs })$. To this end, we employ Del Moral’s SMCABC algorithm (Del Moral et al. 2012) based on sequential Monte Carlo (SMC) sampling (Del Moral et al. 2006) with modifications to the kernel and the distance metric as proposed by Bernton et al. (2019). These modifications allow us to bypass the definition of summary statistics and measure the discrepancy between data and simulation directly with the Wasserstein distance (or similar). The algorithm is inherently parallel and, thus, scalable on modern computing resources. We leverage the MPI-parallel implementation of ABCpy (Dutta et al. 2021) and embed computational models implemented with the highly efficient BioDynaMo (Breitwieser et al. 2021, 2023) framework into the inner loop of the algorithm resulting in a high-throughput implementation. The data from the study is retrieved from neuromoropho.org (Ascoli et al. 2007), the most extensive database for neuronal morphologies. We demonstrate that the method can find adequate posterior distributions through computational experiments on synthetic data, i.e., data generated via the models, and subsequently show that the models can simulate pyramidal cell morphologies in agreement with experimental data collected by Benavides-Piccione et al. (2019). We share our implementation (see Duswald et al. 2024)), and future research may leverage the framework to calibrate different models efficiently.

The article is structured as follows. We begin with a technical overview of the project, briefly explaining which components are relevant and how they connect. Afterward, we explain them in detail, i.e., we discuss neuron growth models, data sources, sensitivity analysis, and ABC algorithms. We then show numerical experiments that investigate the models’ stochastic components and sensitivities before approaching the inverse problem with synthetic and experimental data. On synthetic data, we explore how the choice of morphometrics, statistical distances, and sample size affect the algorithm. We conclude the manuscript by critically reflecting on the results and embedding the findings in a broader context.

2 Materials and methods

Our principal goal is to determine the parameters and uncertainties of mechanistic neuron growth models for given data. We choose to address the problem in a Bayesian setting. Given a stochastic, computational model parameterized through parameters $\Theta \in \mathbb {R}^N$, we strive to find the posterior distribution $p(\Theta |y_{ obs })$, which is the probability distribution describing the parameters $\Theta $ after observing data $y_{ obs }$. The posterior encapsulates all available information on the parameters, including the most probable values and associated uncertainties. Formally, the solution to this problem is given by Bayes’ theorem

$$\begin{aligned} p(\Theta |y_{ obs }) = \frac{p(y_{ obs } | \Theta ) p(\Theta )}{p(y_{ obs })} \,, \end{aligned}$$

(1)

which defines the posterior in terms of the following three components: the probability distribution of the observed data $p(y_{ obs })$, the prior distribution of the parameters $p(\Theta )$ containing all available knowledge about the parameters prior to the calibration, and the likelihood function $p(y_{ obs } | \Theta )$ which describes how likely observations $y_{ obs }$ are under given parameters $\Theta $. In practice, i.e., in all but the most straightforward cases, finding a closed-form solution to (1) is impossible. Hence, we numerically approximate the posterior distribution with algorithms whose details are presented and discussed in later sections.

As we face a stochastic inverse problem, our study consists of four major components: mechanistic neuron growth models (Sect. 2.1), data sources and processing (Sect. 2.2), a numerical method solving (1) (Sect. 2.3), and efficient software implementations and interfaces (Sect. 4). In the following, we detail the different components and their links. Figure 1 gives an overview and shows how the components interact.

2.1 Mechanistic neuron growth models

This work is concerned with 3D agent-based, mechanistic neuron growth models. Figure 2 displays an example of such a simulation. Panel (a) illustrates the ABM-discretization and the simulated neuron in an early stage of the simulation, panel (b) depicts the final simulated neuron, and (c) shows an experimentally observed pyramidal cell in the mouse hippocampus (Benavides-Piccione et al. 2019). The initial conditions of the simulation are not displayed but may be obtained by stripping all agents from (a) not directly attached to the soma. We will first introduce some biological background, continue presenting the ABM discretization, and then explain the high-level abstraction of a resource-driven neuron growth model. Lastly, we consider two simple model realizations corresponding to the upper and lower parts of panel (b) relevant to the subsequent numerical experiments.

2.1.1 Biological background

Despite a good understanding of the neurons’ properties and their networks, the processes by which neurons develop their morphologies are only partially comprehended. A neuron’s journey begins during neurogenesis-the process in which new neurons form from neural stem or progenitor cells. Afterward, the neuron begins to extend dendrites and axons from the soma. The tips of dendrites and axons display a growth cone, a highly polarized, dynamic structure that guides the growth. External substances serve as attracting and repulsive guidance cues. Examples include netrins, semaphorins, ephrins, and the brain-derived neurotrophic factor. Neurons integrate into complex networks and, depending on the network’s need, they may begin extending or retracting parts of their dendritic or axonal outgrowth.

Ultimately, growth requires resources; consequently, the morphology strongly depends on how resources get distributed within the neuron. The neuron’s transport system is primarily built from three superfamilies of so-called molecular motor proteins: kinesins, cytoplasmic dynein, and myosins. These proteins convert energy stored in adenosine triphosphate into mechanical energy, which they use to move along the cell’s cytoskeletal tracks carrying cargo. Tubulin is a cargo often considered in this context since it is a building block for the cytoskeleton and, thus, required for growth. It has been the driving factor for a set of mathematical models in the past (van Ooyen et al. 2001; McLean et al. 2004; Graham and van Ooyen 2006; Hjorth et al. 2014). The influence of the motor proteins has been extensively reviewed by Hirokawa et al. (2010), concluding that they play a significant role in the intercellular transport, control of neuronal function, morphogenesis, and survival. They also point out that expression levels differ from neuron to neuron (Kamal et al. 2000), hinting that transport systems differ between neurons. Experimental evidence appears in multiple studies. Satoh et al. (2008) showed that gene mutations affecting the motor proteins lead to significant morphological changes in the dendritic arbor in drosophila melanogaster, e.g., an overall shorter arbor and different branching pattern. Zheng et al. (2008) independently observed identical effects. Moreover, Hirokawa et al. (2009) demonstrated that suppressing Kif2a, part of the kinesin superfamily, caused abnormal axon branching, leading to significantly more branching points in the axon, see (Hirokawa et al. 2009, Fig. 5c). Ryu et al. (2006) observed that myosin IIB influences the morphology of dendritic spines, and Lyons et al. (2009) found that suppression of Kif1b hinders the normal outgrowth of axons in zebrafish. We conclude that differences in the generation and distribution of resources significantly impact the morphology of different neurons. This should be reflected in the mathematical models and the resources become key-attributes for the agents.

2.1.2 Agent-based neuron discretization

We consider growth that starts from a few neurites attached to the soma, where the latter is stationary, i.e., it does not change throughout the simulation. The neurites are spatially discretized into cylindrical agents representing small dendritic tree compartments; recall Fig. 2a. The agents reside in a tree-like data structure, i.e., each agent has one mother and either zero, one, or two daughters. If an agent has no daughter, we refer to it as a tip agent. These are particularly important because the tips primarily drive the growth (Shree et al. 2022). For all models, a cylindrical agent is characterized by its position, i.e., the start and end point of the cylinder, as well as their diameter. The orientation and length are implicitly contained in the start and end points.

The total simulation time T is discretized into small time steps $\Delta t$ of equal duration. During each time step, the agents independently execute their stochastic rules governing their behavior. These rules may depend on local information, such as substance concentrations or gradients. Moreover, the rules may further depend on whether the agent has daughters; in other words, the rules may differ between regular and tip agents. It is desireable to parametrize the rules such that different choices for $\Delta t$ yield statistically identical (or at least similar) results. We further note that the stochastic processes encoded in the rules may restrict $\Delta t$ to a specific range.

This modeling approach is generally considered to be biologically feasible because all agents act on locally available information rather than globally optimizing specific properties. Algorithm 1 in Appendix C gives an overview of the simulation logic of ABMs implemented with BioDynaMo (Breitwieser et al. 2021). By implementing the models with BioDynaMo, we implicitly include the neurons’ mechanistic properties into the model. In this work, these properties play a minor role and we refer to Zubler and Douglas (2009) for additional details.

2.1.3 Resource-driven neuron growth model

We introduce a high-level model description termed the resource-driven neuron growth model. It is rooted in the realization that the molecular motors and further transport-related quantities differ in their expression levels between neuron types and the fact that they influence the morphology. Various models presented in the literature, e.g., (Shree et al. 2022; Kassraian-Fard et al. 2020), fit within the descriptions; here, we attempt to phrase it in a general parametric way.

The model needs to account for (1) the migration of the tips (elongation and retraction), (2) external guidance cues, (3) resource availability and transport, and (4) rules for branching and bifurcating. Different external guidance cues are described with scalar fields $\phi _i(x,t)$, and different resources of the agent-j are denoted as $r_i^{(j)}$.

Shree et al. (2022) observed that tips either elongate, stall, or retract. They suggest a model defining the stochastic transitions between the three states. Additionally, they assume that contact of tips with other neurites causes them to retract. When the tip retracts, it simply follows the path; when elongating, the tips execute a persistent, biased, random walk (biased through the gradient of external guidance cues; see (Codling et al. 2008; Hannezo et al. 2017) for details on such walks). We add that, in general, the state transitions may depend on resource availability. Furthermore, the state may influence the resource availability, e.g., it seems natural to assume that elongation reduces and retraction frees resources.

The time dependent resource distribution can be modeled in vastly different ways ranging from heuristic rules (Kassraian-Fard et al. 2020; van Ooyen et al. 2001) defining how the resources propagate when branching or bifurcating to intricate transport equations (Graham et al. 2006; McLean et al. 2004; Hjorth et al. 2014; Qian et al. 2022). Both approaches may capture the branches’ competition for resources. Generally, branching is modelled as a (Poisson-like) stochastic process and the probability of branching per time step may depend on resource availability or external guidance cues. For example, the branching probability may increase with decreasing resource, leading to more, smaller branches towards the distal end.

These high-level requirements allow the construction of complex models with a high degree of incorporated biological information. Nonetheless, the key concern of this work is approximately solving the stochastic inverse problem, and we do not strive to create the biologically most detailed model. We therefore restrict ourselves two particularly simple representatives of such a model, which trace back to (Zubler and Douglas 2009; Breitwieser et al. 2021). Both models use a single resource type, one guidance cue, and only consider the elongating and idle states. We describe the algorithm assuming some (simple) initial neuron structure is present, e.g., a structure as in Fig. 2a.

Example: Model 1. We first note that only tip-agents actively change; others remain untouched. Furthermore, only tip-agents whose resource satisfies $r > r_{\min }$ change; thus, if the resource of a tip-agent falls below the threshold $r_{\min }$, it becomes idle as well. In other cases (tip-agent with a sufficiently large resource), the agent elongates in the direction of a vector $\textbf{d}$ which is composed out of (1) a random component, (2) the current orientation, and (3) the direction of $\nabla \phi $ at the agent’s position. Elongation means that the endpoint of the cylinder is shifted by $v \cdot \textbf{d} / || \textbf{d} ||_2$, where v is the elongation speed parameter. Since the elongation models a notion of stretching, the resource is decreased, i.e., $r(t_i + \Delta t) = r(t_i) - R$, where R is the resource consumption parameter. Each tip-agent can also branch with a constant probability $p_{bra}$. The two daughters created during branching inherit the resources of the mother.

A simulation of the branching process is depicted in Fig. 2b, where the structures below the soma are generated with Model 1. The symmetric resource distribution yields a balanced tree structure, e.g., two daughters of the same mother progress similarly. A pseudo-code representation of this description is given in Appendix C, Algorithm 2.

Example: Model 2. Model 2 is similar to Model 1. In contrast to Model 1, all agents in Model 2 keep decreasing their resources until they reach or fall (slightly) below $r_{\min }$. Additionally, the branching rules differ, i.e., the neurite continues in a straight line and adds a new branch rather than symmetrically splitting into two. Sometimes this behavior is referred to branching, while Model 1’s symmetric splitting is referred to as bifurcating. The asymmetry of the branching also reflects in how the resource is distributed; while the agent of the extended branch inherits the resource of the mother, the agent of newly-created branch initializes it to a fixed resource value of $r_0$, an additional parameter of Model 2. Hence, the resource distribution is asymmetric. We refer to Kassraian-Fard et al. (2020) for a more involved asymmetric model.

While the differences between Model 1 and 2 are subtle, these differences may result in vastly different structures, e.g., Fig. 2b shows the structures arising from Model 2 above the soma. The asymmetry of the resource distribution while branching leads to a more extended main branch with different shorter outgrowths. A pseudo-code representation of this description is given in Appendix C, Algorithm 3.

2.2 Data sources and processing

In this study, we consider two types of datasets: synthetic and experimental data. The former serves as a test case for the algorithm and the choice of morphometrics; the latter is used to identify which models can describe the morphology of real neurons. The comparison of different neuronal structures is facilitated via the morphometrics.

2.2.1 Morphometrics

Morphometrics generally refers to the study of the size and shape of objects. We restrict ourselves to neurons and use the term morphometrics to describe features that quantify the morphology. The morphometrics attempt to answer the following question: How can we map a given neuron morphology onto a vector $x \in \mathbb {R}^n$ that adequately characterizes the neuron? This question is inherently challenging because neurons show very complicated shapes and forms. A short overview of popular morphometrics may, for instance, be found in the work of Torben-Nielsen and Cuntz (2014 Table 1.1) or Deitcher et al. (2017, Table 2). Popular features are total length, number of branches, mean branching length, and many more. Table 1 gives an overview of the simple morphometrics used in this work.

Table 1 Overview of the subset of morphometrics used in this work

Full size table

If we abstractly consider the morphometrics (or a combination thereof) as a mapping $\mathcal {M}$ projecting from the space of neuronal morphologies $\mathcal {N}$ into $\mathbb {R}^n$, i.e., $\mathcal {M}: \mathcal {N} \rightarrow \mathbb {R}^n$, then the mapping does not possess an inverse. In other words, it is impossible to reconstruct the morphology from the morphometrics; however, similar neurons map to points that are close in $\mathbb {R}^n$. In general, the mapping consists of a combination of morphometrics. Describing the different morphometrics (e.g., branching length) as a map $\mathcal {M}_i: \mathcal {N} \rightarrow \mathbb {R}^{n_i}$, we define the morphometrics mapping as

$$\begin{aligned} \mathcal {M} = \mathcal {M}_1 \times \dots \times \mathcal {M}_k: \mathcal {N} \rightarrow \mathbb {R}^n = \mathbb {R}^{n_1 + \dots + n_k} \,. \end{aligned}$$

(2)

While the mathematical nature of $\mathcal {M}$ may be complicated, the algorithmic implementation for computing $x = \mathcal {M}_i ( y \in \mathcal {N})$ is usually straightforward. Following the jargon of predictive computational sciences, we will refer to x as quantity of interest (QoI) and use this term interchangeably with the morphometrics.

2.2.2 Synthetic data

We choose a stochastic computational model to generate a synthetic dataset by repeatedly executing it with different random seeds. M model runs will result in M distinctively different neuron samples. For each sample, we compute the morphometrics either with custom, unit-tested C++ code integrated into the model evaluation or offline with NeuroM (Palacios et al. 2021). This Python package allows the analysis of neuron morphologies saved in the SWC format and the extraction of their morphometrics. Finally, we structure the data in a spreadsheet-like data structure (see Fig. 1) to conveniently retrieve a given neuron’s morphometrics.

2.2.3 Experimental data

Table 2 Overview of the experimental datasets collected by Benavides-Piccione et al. (2019) retrieved from neuromorpho.org (Ascoli et al. 2007)

Full size table

For the experimental data, we proceed akin to the synthetic data. We retrieve the data from neuromopho.org (Ascoli et al. 2007), an online database storing roughly 260 thousand digital reconstructions of neurons (as of October 2023). Typically, we select specific experiments and references or queries based on neuron type, species, and brain region. NeuroMorpho provides full access to the data; thus, we retrieve a set of neuronal morphologies in the SWC file format after selection. We use NeuroM (Palacios et al. 2021) to verify that the morphologies are correct and that no errors in the file could harm the results. Subsequently, we extract the morphometrics and organize the data in a spreadsheet format. We consider two different datasets for our computational experiments. Table 2 gives an overview of them.

2.3 Numerical methods

The third cornerstone of this work are numerical methods fostering the understanding of the model’s inherent stochasticity as well as those (approximately) solving the statistical inverse problem defined in (1). We use sensitivity analysis algorithms to understand the models’ behavior due to parameter variations and verifying which QoIs are informative for the parameter inference. The inference is conducted with Approximate Bayesian Computation.

2.3.1 Sensitivity analysis

Sensitivity analysis (SA) measures how strongly individual parameter influence the prediction of specific QoIs of a complex mathematical model (Saltelli et al. 2008). In this work, we employ a global Sobol SA. We first define the (bounded) parameter space $\Omega $ for a given model. We use Saltelli’s method (Saltelli 2002; Saltelli et al. 2010) to draw K samples from $\Omega $. Afterward, we use the model to generate a synthetic dataset of ${M^\prime }$ neurons for each sample drawn. We determine the QoIs and subsequently compute their expected value $\mathbb {E}[\text{ QoI}]$. We use Sobol’s method (Sobol 2001) to estimate the sensitivity indices $S_1$ and $S_{tot}$ together with their 95% confidence intervals. Here, $S_1$ is the index indicating how much of the variance in a QoI can be attributed to a given parameter (first-order sensitivity). The index $S_{tot}$ accumulates the first-order and higher-order indices to give an idea of the importance of a parameter; it is called the total-effect index. As the sensitivity measures variance, it may indicate which QoIs contain relevant information for parameter inference.

2.3.2 Bayesian computation

Bayesian computation attempts to find numerical solutions and approximations to the Bayesian inverse problem (1). Prominent candidates are Markov Chain Monte Carlo (MCMC) methods (Brooks et al. 2011) such as the Metropolis–Hastings algorithm (Metropolis et al. 1953; Hastings 1970) or Gibbs sampling (Geman and Geman 1984; Gelfand and Smith 1990). These methods are, however, limited to a small subset of real-world problems–the ones with tractable likelihood $p(y_{ obs } | \Theta )$. Nevertheless, many meaningful problems have an intractable likelihood, i.e., there may not be a closed form, or it may be too expensive to evaluate. This realization gave rise to a set of methods commonly referred to as likelihood-free methods. Instead of evaluating the likelihood, the algorithms in this category operate under the assumption that simulating data under the model (or a surrogate thereof) facilitates an understanding of the likelihood. Representatives for these algorithms are Bayesian synthetic likelihood (Price et al. 2018), specific versions of Variational Bayes (Beal and Ghahramani 2003; Jordan et al. 1999; Blei et al. 2017), Integrated nested Laplace (Rue et al. 2009), and, possibly the most popular one, Approximate Bayesian computation (ABC) (Tavaré et al. 1997; Pritchard et al. 1999; Beaumont et al. 2002; Marjoram et al. 2003; Csilléry et al. 2010; Beaumont 2010; Sisson et al. 2007, 2019). In this work, we focus exclusively on ABC, which has proven to facilitate successful calibration in the context of ABMs in biological applications, e.g., (Lambert et al. 2018; Wang et al. 2024). For more information on the historical development of Bayesian computation, we refer to Martin et al. (2020, 2024) and Sisson et al. (2019, Chapter 2).

2.3.3 Approximate Bayesian computation

ABC is based on the fundamentally simple idea of simulating data $y_{ sim }$ under the model and comparing its output against the observed data $y_{ obs }$. ABC algorithms must find N simulations close to the data to obtain a Monte Carlo approximation to the posterior. Whether samples are accepted or not depends on a criterion involving a distance metric d and a function $\eta : \mathbb {R}^n \rightarrow \mathbb {R}^m$ ($m \ll n$) called summary statistics.^{Footnote 1} Simulations are considered close if

$$\begin{aligned} d(\eta (y_{ obs }), \eta (y_{ sim })) < \epsilon \,, \end{aligned}$$

(3)

where $y_{ obs }$ and $y_{ sim }$ denote datasets, i.e, they contain M and ${M^\prime }$ k-dimensional random variables, respectively. The function $\eta $ summarizes their statistics, allowing us to search for close points in a lower dimensional space, significantly speeding up the search. For instance, we may calibrate the parameter m of a Gaussian model $y \in \mathbb {R} \sim \mathcal {N}(m,\sigma )$ by choosing $\eta = \left( \sum _{i=1}^{M^\prime }y_i\right) /{M^\prime }$ since the mean adequately summarizes the statistics of the data.

Choosing appropriate or even sufficient summary statistics for arbitrary models remains one of the biggest challenges when employing ABC in practice. To overcome this limitation, Bernton et al. (2019) suggested using the Wasserstein distance to directly measure the discrepancy between simulated and observed data. Their approach generalizes the use of order statistics to arbitrary dimensions. The Wasserstein distance between two probability distributions measures how much work is necessary to turn one into the other. Hence, it is often called earth-movers distance and is computationally related to optimal transport problems. The distance is sometimes also called the Kantorovich-Rubinstein metric. Other authors promoted similar ideas around the same time: Park et al. (2016) suggested using MMD, Genevay et al. (2018) used Sinkhorn divergences, and Jiang (2018) employed the Kullback–Leibler divergence. Later work considered the sliced-Wasserstein distance (Nadjahi et al. 2020) and $\gamma $-divergence (Fujisawa et al. 2021) targeting certain shortcoming of other distances.

The different statistical distances have similar effects in the ABC context but their interpretation differs; for instance, KL and $\gamma $ divergence measure the information loss when one distribution is used to approximate another. Moreover, their naming convention highlights a subtile mathematical difference; the Wasserstein distance is a metric while KL and $\gamma $ represent a divergence. The former is therefore non-negative, symmetric, and satisfies the triangle equation while the latter share the non-negativity but are non-symmetric and do not obey the triangle equality (e.g., (Amari 2016, Chapter 1)). Their different definitions yield qualitatively different behavior; e.g., considering two multivariate normal distributions $\mathcal {N}_1(\mu _1,\Sigma _1)$ and $\mathcal {N}_2(\mu _2,\Sigma _2)$, the Wasserstein distance scales linearly with $||\mu _1 - \mu _2||_2$ (Dowson and Landau 1982), whereas straightforward calculation shows that the KL divergence scales quadratically with it.

Besides defining appropriate metrics for comparing simulated and observed data, designing algorithms that efficiently propose suitable samples has been a long-standing challenge in ABC. Over the past two decades, researchers derived many different ABC algorithms (Csilléry et al. 2010; Sisson et al. 2019) and Sequential Monte Carlo (SMC) samplers (Del Moral et al. 2006, 2012) became a potent tool. SMC samplers represent the parameter distribution with N particles in the corresponding vector space. Instead of directly moving from the prior to the posterior, SMC algorithms propagate the particles through many intermediate probability distributions that change slowly from iteration to iteration, keeping the sampling efficient. Effectively, the sequence of distributions corresponds to a sequence of thresholds in the acceptance criterion (3), i.e., the sampler sequentially moves through distributions defined by the thresholds

$$\begin{aligned} \epsilon _0 = \infty> \epsilon _1> \dots > \epsilon _{k} = \epsilon \,. \end{aligned}$$

(4)

First, the particles are sampled from the prior ($\epsilon _0$). By reducing $\epsilon _i$ from iteration to iteration, the posterior approximation through particles improves from iteration to iteration until eventually reaching the prescribed quality defined via $\epsilon _{k}$. Del Moral’s algorithm (Del Moral et al. 2006, 2012) bypasses the a priori definition of the approximation levels $\epsilon _{i}$ by demanding that the effective sample size (ESS) of iteration $i+1$ is a certain fraction $\alpha \in (0,1)$ of the ESS of iteration i. The threshold $\epsilon _{i+1}$ can be adaptively computed from the ESS, $\epsilon _{i}$, and $\alpha $ (see (Del Moral et al. 2012, Eq. 12) for details).

To this end, we use Del Moral’s SMCABC algorithm (Del Moral et al. 2006, 2012) with modifications proposed by Bernton et al. (2019), i.e., we favor statistical distances over summary statistics and choose the arguably more efficient r-hit kernel ($r=2$) (Lee 2012; Lee and Latuszynski 2014). We choose to use $N=2^{10}$ particles and $\alpha = 0.6$.

To assess the quality of the resulting posterior distribution, we perform a predictive check. This check involves drawing samples from the posterior, evaluating the model for each sample, and computing the QoIs. We then compare the simulated QoIs to the data; since the QoIs form a high dimensional space, we show the marginals of the distributions, i.e., the projection on one coordinate axis in the QoI space.

We emphasize that ABC only yields an approximation of the actual posterior distribution since it involves several assumptions. First, the bound $\epsilon $ appearing in the distance criterion (3) introduces an approximation: if $\epsilon = 0$, the ABC algorithms would sample from the true posterior; however, for $\epsilon > 0$, the algorithms draw samples from an approximation to the true posterior. Instead of reaching a desired target $\epsilon $, the algorithm is often stopped after a fixed amount of dataset simulations contributing to the same error category. Second, the use of summary statistics introduces another level of approximation. This also holds for statistical distances; for instance, we need a sufficient number of data points in both sets to accurately estimate the Wasserstein distance between two distributions. Figure 3 illustrates this problem by displaying the relative error $ | \mathcal {W}_ true - \mathcal {W}_ num | / \mathcal {W}_ true $ of the numerically computed Wasserstein distance for two multivariate Gaussian distributions. Lastly, the approximation of the posterior in terms of particles and kernel choice may affect the approximation (Sisson et al. 2019).

2.4 Software implementation and interfaces

We leverage the implementation of the Del Moral’s and Bernton’s SMCABC algorithm (Del Moral et al. 2006, 2012; Bernton et al. 2019) provided in the ABCpy python package (Dutta et al. 2021). Propagating the particles is the most expensive step of the algorithm because it involves the simulation of data under the model, i.e., at least $N \cdot {M^\prime }$ model evaluations per iteration for N particles and ${M^\prime }$ samples per parameter. In the overview given in Fig. 1, this step is labeled with . However, the algorithm is inherently parallel in the particle updates, and the implementation offers parallel backends via Spark and MPI, of which we decided to use the latter. For code availability, we refer to the https://doi.org/10.5281/zenodo.13810423).

In order to maximize the utilization of parallel computing resources, it is best practice to parallelize the outer loops and optimize the repeatedly executed code. Our implementation follows this logic by allowing ABCpy to parallelize the particle updates via MPI and implementing the ABM models with the highly efficient BioDynaMo (Breitwieser et al. 2021, 2023) C++ simulation platform.

During our initial computational experiments, we discovered a bottleneck in the current BioDynaMo version. Most of the execution time was spent starting the BioDynaMo simulation engine, more precisely, initializing the C++ interpreter cling (Vasilev et al. 2012). Initialization became the dominant factor since the simulation of a single neuron is very fast compared to the extensive simulations with billions of agents that BioDynaMo supports. To mitigate this performance bottleneck, we avoid launching a new simulation process for each parameter set. Instead, we start a persistent BioDynaMo simulation process for each MPI rank at the beginning of the calibration (here, one per core), keep it alive, and exchange data with ABCpy using IO-redirection. Figure 4 summarizes this concept and the implemented interface between the ABCpy and BioDynaMo software packages.

As ABCpy distributes the computation between the workers, a custom class facilitates the communication and data exchange with the active BioDynaMo processes. If a given SMCABC-MPI rank requires a model run for a set of parameters, this controller class requests the simulation from the BioDynaMo process running on the same core and waits until the simulation has been completed. The process writes the results, i.e., the SWC file of the synthetic neuron and possibly the associated morphometrics, to a RAM disk. After the simulation, the controller allows ABCpy to proceed–it retrieves the results, possibly applies some post-processing, and evaluates if the proposal parameters are accepted. Avoiding the repeated startup overhead, we measured 10x speedup compared to its initial version for simple models. We note that the more expensive the model, the less the coupling affects the runtime.

The implementation of the Sobol SA is analogous; here, the parameters are known a priori such that we assign different parameter combinations to different ranks and process them one after another using the same interface. The initial parameters are obtained from the python library SALib (Herman and Usher 2017).

3 Results

In this section, we explore the numerical experiments and their results conducted to foster an understanding of the stochastic models and the inverse problem. We first investigate the models’ stochasticity to understand how the stochastic components of the model influence the QoIs for a fixed parameter choice. With a Sobol SA, we allow the model parameters to vary and explore how the variations affect the model outputs. We then study the stochastic inverse problem for the mechanistic neuronal growth Model 2 for synthetic data; in particular, we investigate how different choices of morphometrics, statistical distances, and simulated data set sizes affect our ability to recover the data-generating parameters. Afterward, we treat experimental data and calibrate Model 1 and 2 such that they mimic pyramidal cells in the human and mouse hippocampus (CA1 region) and extend our analysis beyond the QoIs with 3D visualizations. Lastly, we comment on runtime and computational costs.

3.1 Model stochasticity

We analyze the influence of the stochastic model components on the QoIs (morphometrics). We choose a fixed parameter vector $\Theta ^\star $ for each model and simulate different neurons (i.e., same model, same parameter, different random seed). Table 3 in Appendix B summarizes the parameter choice for each model and are taken from Breitwieser et al. (2021). We generate $10^4$ artificial neurons per model and compute the QoIs. We then determine and visualize each model’s (marginal) distribution of the QoIs.

Figure 5 shows histograms of the mean and standard deviation of the segment lengths together with the number of segments and total length. Across all the QoIs, Model 1 shows distributions that are significantly wider, less symmetric, and have a heavier tail compared to the output of Model 2. The distributions of Model 2 are, in most cases, symmetric and centered. The marginals of Model 2 appear to be similar to the characteristic shape of the normal distribution; nonetheless, the Anderson-Darling test allows us to reject the hypothesis that Model 2’s marginals follow a normal distribution at a significance level of 1%. Table 4 in Appendix B additionally depicts the descriptive statistics of the marginals depicted in Fig. 5. These statistics underline and quantify the previous observations, for instance, the apparently wider distributions of Model 1 reflect in a larger standard deviation. We remark that although the models are similar and mainly differ in how they distribute resources while branching, the QoI marginals show significant differences. Moreover, the presented data shows that the neuron models are truly stochastic and they must consequently be treated in a probabilistic setting.

3.2 Sensitivity analysis

We continue analyzing the behavior of the forward models, now striving to understand if some parameter influences a given QoI and, if so, by how much. Therefore, we probe the models with a global Sobol SA (Sobol 2001; Saltelli 2002; Saltelli et al. 2010). We define each model’s parameter domain $\Omega \in \mathbb {R}^n$ according to Table 5 in Appendix B.

For each model, Saltelli sampling results in $K=N \cdot (2 \cdot \dim (\Omega ) + 2) = 32768$ parameter combinations for which we need to evaluate the model (we chose $N=4096$). To account for the stochasticity of the model, we simulate ${M^\prime }=20$ samples for each parameter combination to estimate the expectation value of the different QoIs. Thus, we evaluate $6.6 \cdot 10^5$ artificial neurons per model to measure the first oder and total sensitivity indices ($S_1$, $S_{tot}$). We analyze the influence of the branching probability ($p_{bra}$), the resource consumption (R), and the elongation speed (v) on the QoIs. We restrict ourselves to this set of parameter for two reasons. First, simple experiments with the models showed that these parameter have a profound impact on the generated morphology and, second, the models differ in the resource reallocation while branching but tip agents in both models execute the same persistent biased walk. We thus decided to focus on the parameter that differ between the models. The results are displayed in Fig. 6.

We begin with discussing the similarities highlighted by the sensitivity analysis. Figure 6c shows that the mean segment length of both models reacts similarly to the parameters. For both models, this QoI is primarily influenced by $p_{bra}$ and v while R has no influence. This can be understood from realizing that the mean segment length is the average distance that a tip agent migrates between two branching events. The parameter v is the speed of the tip migration while $p_{bar}$ defines the likelihood of branching per time unit. The resource consumption does not influence this pattern and, thus, the corresponding sensitivity indices are very low. Both models share the mathematical description of the tip migration which reflects in the sensitivity analysis. This argument extends to the standard deviation of the segment length.

In contrast to the mean and standard deviation of the segment length, the number of segments and the total length in Fig. 6a, b respond differently to the parameter variations. The analysis shows that Model 1 is much more sensitive to the parameter R. Considering the regime for very low values of R, Model 1 can repeatedly split newly created branches leading to an exponential growth in the number of branches. In the same scenario, Model 2 remains well controlled and continues to produce a main brach with short side extensions. This realizations explains why Model 1 is more sensitive to R. For Model 2, the number of segments is primarily influenced by $p_{bra}$. Similar arguments hold for the total length, here the exponential growth outweighs the linear growth parameter v for Model 1 while Model 2 is mostly sensitive to it. We further observe that Model 1 shows statistically significant gaps between the first order and total sensitivity index, hinting at higher order effects. In its sum, these observations underline the significant impact that the resource distribution has on the morphology.

3.3 Solving the stochastic inverse problem with SMCABC

We study the stochastic inverse problem arising for mechanistic neuronal growth models with the SMCABC algorithms altering the statistical distance measures, algorithmic parameters, morphometrics, and data sets. We first treat synthetic data to verify that we may successfully recover the data-generating parameter in a setting where the model can reproduce the data well. The synthetic data sets used in the calibration are subsets of the ones in the section analyzing the models’ stochasticity. We perform extensive numerical experiments to showcase how the calibration algorithm behaves under different circumstances. Afterward, we apply the SMCABC algorithm to the data sets defined in Table 2 to determine which models adequately describe given experimental data. The majority of the computational experiments deal with Model 2 because it models the structurally more complex part of the neuron (i.e., top part in Fig. 2).

3.3.1 Synthetic data

We begin with calibrating Model 2 with synthetic data comprised of 500 synthetic neurons. We attempt to find the posterior distribution for the same model parameters ($p_{bra}$, R, and v) considered in the SA. The synthetic data was generated with the parameter choice $p_{bra}^\star = 0.38 \cdot 10^{-1}$, $R^\star = 0.71 \cdot 10^{-3}$, and $v^\star = 10^{2}$; recovering these parameter serves as test case throughout this section. We employ the morphometrics to map the simulated neurons to $\mathbb {R}^n$, effectively reducing a neuron morphology to a n-dimensional random variable ($n=1,\dots ,4$). We provide uniform priors for all parameters, use $2^{10}$ particles to approximate the posterior and run the algorithm for a fixed budget of $5\cdot 10^7$ growth-model simulations. We do not interrupt the algorithm in between SMC iterates; we either automatically stop the calibration after the iteration that has exhausted the simulation budget, or if the algorithm’s runtime surpasses a threshold $T_{max}$.

Effect of the morphometrics

First, we investigate the effect of the choice of morphometrics. Intuitively, they capture the information of the neuron morphology. If the morphometrics capture insufficient information, the algorithm will not be able to recover the data-generating parameter for the simple reason that the information needed for their inference is not considered. Thus, the first experiment explores the convergence of the posterior for different combinations of morphometrics, specifically $\mathcal {M}$1, 2, 3, and 4, i.e., the number of segments, the mean and standard deviation of the segment lengths, and the total length. This experiments uses the Wasserstein distance to evaluate (3) and ${M^\prime }= 50$ samples per parameter.

Figure 7 shows how different morphometrics choices affect the posterior marginals. From (a) to (d), we add one dimension to the morphometrics at a time. We begin with the mean of the segment lengths (a) and add the standard deviation (b), the number of segments (c), and, lastly, the total length (d). All four plots depict the evolution of the posterior marginals over the SMC iterations. The marginals are Gaussian kernel density estimates (KDEs) computed from the particles and associated weights. Early iterations appear dark (black, purple), and final iterations in bright colors (orange, yellow). For convenience, the data-generating parameters $p_{bra}^\star $, $R^\star $, and $v^\star $ are indicated with vertical, dashed, and black lines. We judge the algorithm performance by its ability to recover $p_{bra}^\star $, $R^\star $, and $v^\star $, i.e., good algorithmic setups are expected to show posterior marginals condensing around the vertical black lines. This type of visualization will reoccur in the other numerical experiments.

From Fig. 7, we see that the mean segment length is not sufficient to recover the parameter. The posterior in (a) is very wide indicating high parameter uncertainties. Adding the standard deviation of the segment length improves upon the previous case, we find sharper posterior marginals, however, their peaks show a slight offset incorrectly identifying the parameters v and R. The parameter $p_{bra}$ centers around the correct value, however, the width of the posterior again indicates low confidence. Adding the number of branches to the QoIs significantly improves the marginals’ quality; all three parameter peak at the data-generating parameter and are strongly centered indicating good confidence in the identified parameters. We emphasize this substantial improvement in identifying the branching probability which can be understood from the results of the SA; it demonstrated that the number of branches of Model 2 is very informative concerning this parameter. Lastly, adding the total dendritic length seems to neither harm nor further improve the posterior.

To further verify the posterior quality, we measured the Wasserstein distance between a second dataset generated with identical parameters and the one used for calibration. The measured distance is in line with the final $\epsilon $ value supporting the claim that the algorithm found a good posterior distribution whose simulations are close to indistinguishable from the calibration data. While there are barely differences in the posterior quality of (c) and (d), we favour the latter for computational reasons. Using four instead of three morphometrics converged quicker and the overall runtime was roughly the half. For all subsequent experiments, we use the four morphometrics $\mathcal {M}1,2,3,4$.

Effect of different statistical distances

We proceed with analyzing how different statistical distances (Wasserstein (Bernton et al. 2019), sliced-Wasserstein (Nadjahi et al. 2020), KL (Jiang 2018), $\gamma $ (Fujisawa et al. 2021)) affect the calibration results. We run the SMCABC algorithm with the same parameters but vary the statistical distances. Figure 8 displays our findings. The interpretation of the graphs follows the previous section; each plot shows the evolution of the posterior marginals over the SMC iterations with the data-generating parameter indicated by vertical, dashed, and black lines.

The experiment shows that the Wasserstein distance in Fig. 7d as well as the KL and $\gamma $-divergence in Fig. 8a and (b), respectively, recover the parameter well and find concentrated posterior distributions around the reference values. KL and $\gamma $ divergence seem to require slightly fewer SMC iterations to concentrate around the data-generating parameter of the elongation speed v and resource consumption R. This can be seen from comparing Fig. 7d, iteration 82 with Fig. 8a, iteration 78 and b, iteration 76, although this observation is unlikely to be statistically significant. Evidently, the Wasserstein distance outperformed the other distances in identifying the branching parameter.

Neither the KL divergence nor the $\gamma $ divergence exhausted the total number of dataset simulations. After they used 82% and 85% of their budget, respectively, individual particles got trapped in regions with low posterior probability, which caused a significant load imbalance, making it infeasible to let the algorithm run until the end. Similar problems were not observed for the Wasserstein-based inference, which reliably converged in all calibration runs. The sliced-Wasserstein distance failed to discover the data-generating parameter in this particular case (Fig. 13 in Appendix A). While the posterior marginals peak at the correct values, the posterior is significantly wider than for the other statistical distances.

Effect of simulated dataset size

The previous experiments used ${M^\prime }=50$ synthetic neurons per parameter. This number was motivated by the original work on Wasserstein-ABC (Bernton et al. 2019) in which the authors used a sample size of ${M^\prime }=100$ to calibrate the mean vector of a bivariate Gaussian model. Since generating a neuron morphology is disproportionally more expensive, we began experiments with ${M^\prime }=50$ neurons and allowed the same number of dataset simulations ($10^6$) as Bernton et al. (2019). To reduce the algorithm’s runtime, Bernton et al. (2019) considered less expensive approximations to the Wasserstein distance, e.g., Hilbert and swapping distances, because its computation accounted for large fractions of the overall runtime. Here, simulating 50 synthetic neurons is roughly 100 times more time-consuming than the distance computation; thus, we investigate how the sample size ${M^\prime }$ affects the posterior quality in this practical example. Allowing the same number of total neuron simulations ($5\cdot 10^7$), we run the algorithm with ${M^\prime }\in \{10,25,50, 100\}$ samples per parameter while using $\mathcal {M}1,2,3,4$ and the Wasserstein distance.

Figure 9 shows the evolution of the posterior marginals considering synthetic datasets $y_{ sim }$ of cardinality a 100, b 25, and c 10. Figure 7d shows the identical experiment for 50 simulated neurons per parameter which recovers the data-generating parameter well. First, in Fig. 9a, doubling the number of samples from 50 to 100 gives more confidence in the prediction of the model for a given set of parameters; however, the computational cost double and render the execution of many SMC iterations infeasible. After 67 iterations, the algorithm exhausts its simulation budget and yields insufficient posterior marginals to identify the data-generating parameters. Comparing similar iterations (62, ${M^\prime }=50$; 67, ${M^\prime }=100$), we may see a slight advantage for using more samples.

Next, we reduce the sample size to (b) 25 and (c) 10 per parameter. We expect that lowering the sample size allows the number of SMC iterations to increase. Simultaneously, the statistics of the model at a given parameters set are more uncertain; colloquially, we may say that the coupling between parameter and QoI space is looser. For both cases, the posterior marginals in Fig. 9 initially converge quickly. Towards the end, the sampling got inefficient and developed a load imbalance with individual particles holding up the algorithm. We eventually stopped the algorithm after it ran on the server as long as the reference results for 50 samples. To find a posterior distribution similar in quality, the algorithm with $n=25$ and $n=10$ samples used only 69% and 41% of the simulation budget, respectively. While 50 samples per parameter took roughly 4.5 days to recover the parameter, (b) and (c) achieved the same in roughly 2 days. Thus, the runtime is significantly reduced by lowering the sample size while still achieving a similar posterior quality. Additional benefits are not observed, i.e., the posterior is similar but does not show higher densities.

3.3.2 Experimental data

We now shift our attention towards experimental data. We begin with calibrating Model 2 on the apical dendrites of the data sets listed in Table 2 using the morphometrics $\mathcal {M}$1, 2, 3, and 4 (see Table 1). The apical dendrites refers to the structures displayed above the soma in Fig. 2c. We approximate the posterior with $2^{10}$ particles, run the algorithm for a fixed budget of $5 \cdot 10^{7}$ neuron simulations ($10^6$ data sets $y_{ sim }$ of cardinality 50), and use the Wasserstein distance. In contrast to using synthetic datasets, the reference parameters to judge the quality of the solution are unknown; hence, we fall back to the predictive check described earlier, i.e., we sample $2^{10}$ parameter from the posterior, evaluate the model ${M^\prime }=50$ times per sample, compute the QoIs, and compare them with the data. Our focus lies on finding the posterior distribution for a given model–whether or not the model adequately describes the data is beyond the scope of the present article and designing rules for specific resource-driven neuron models for given data is left for future work.

$\mathcal {D}1$- pyramidal cells in human hippocampus (CA1). Fig. 10 shows the result of calibrating Model 2 with data $\mathcal {D}$1. Panel (a) shows the familiar evolution of the posterior marginals over the SMC iterations. The marginals show a convergent behavior and accumulate in certain regions, specifically the KDEs are centered around $v^\star \approx 0.83 \cdot 10^{2}$, $R^\star \approx 0.44 \cdot 10^{-3}$, and $p_{bra}^\star \approx 0.72 \cdot 10^{-1}$. Panel (b) shows the results of the predictive check – data in blue (histogram) and orange (KDE), model predictions in green (KDE). The center (mean) of the marginal distributions of data and predictions match well. The number of segments and the total segment length seem to behave similar for the experiment and the calibrated simulation. The mean and standard deviation centers correctly but the model’s prediction are too narrow indicating that additions to Model 2 may be necessary to better describe the data.

$\mathcal {D}2$- pyramidal cells in mouse hippocampus (CA1). Figure 11 shows the result of calibrating Model 2 with data $\mathcal {D}$2. The interpretation is analogous to Fig. 10. In contrast to $\mathcal {D}1$, we obtain posterior marginals more concentrated around specific values, which can be seen from the higher density values of marginals. In particular, these densities are higher for the elongation speed and the resource consumption. Moreover, the shape of the QoI marginals appears to match the shape of the data better compared to $\mathcal {D}1$. The width of the standard deviation is, again, underestimated by the model. For $\mathcal {D}2$, the marginals peak at $v^\star \approx 0.58 \cdot 10^{2}$, $R^\star \approx 0.59 \cdot 10^{-3}$, and $p_{bra}^\star \approx 0.62 \cdot 10^{-1}$. It appears as if Model 2 is better suited to describe pyramidal cells found in mice than the ones found in humans.

Comparison beyond the QoIs

The selected QoIs ($\mathcal {M}1,2,3,4$) reduce the neuron morphology to a point in $\mathbb {R}^4$, representing a significant, irreversible compression of the information. To further investigate if Models 1 and 2 can produce meaningful, synthetic neurons, we visualize a set of neurons after calibration together with the calibration data.

We proceed as follows. First, we use Model 1 and 2 to describe the pyramidal cell’s basal and apical parts, respectively, akin to the concept presented in Fig. 2b. Using the Wasserstein ABC algorithm, we then calibrate both models as in the previous section. The calibration of Model 1 follows the calibration of Model 2 with datasets extracted from the basal structures. We select a parameter set from the resulting posterior marginals via a maximum a posteriori estimate and subsequently simulate $10^3$ neurons using the fixed parameter set. Compared to Fig. 2b, we lower the bias and increased the persistence of the random walk governing the growth. Lastly, we collect the basal and apical QoIs of the data and simulations, normalize them concerning the variance of the data marginals, and obtain two point clouds in $\mathbb {R}^8$. We assign a simulation to each data point via the shortest Euclidean distance to form data-simulation pairs.

Figure 12 compares simulated and experimental pyramidal cells by displaying their full 3D structure. Panels (a)–(f) shows neurons that appear to be similar regarding the characteristic morphology. Closer investigation also reveals differences, in particular, the apical section (top part) of the experimental neurons shows a characteristic, straight main branch. The simulated neurons develop a similar hierarchy, e.g., a main branch with extensions; however, this feature is more emphasized in the experimental neurons and the main branch barely deviates from a straight line.

While (a)-(f) yield good a agreement, some neurons show morphological features that are beyond the QoIs. For instance, in Panel (g) and (h), the experimental neurons show a main branch that effectively splits into two. Such features are not captured by the selected set of QoIs; we may not expect that calibration drives the model parameter in a suitable range. Moreover, it is questionable if the simplistic Model 2 can produce such growth. Lastly, Panel (i) shows an example for which the curvy main branch folds, a feature that can occur in the simulation but is not readily observed in the data. These limitations seem inherent to the simplistic model and can be partially bypassed with different weights for the bias and persistence of the walk.

3.4 Runtime and computational cost

We omit detailed benchmarks since they strongly depend on the model, and thus, precise numbers may not be adequate estimates for future research endeavors. Nonetheless, we would like to give a few numbers for ballpark estimates.

The numerical experiments were conducted on two different systems. System 1 is an Intel Xeon E7-8890 CPU with 72 physical cores distributed among four sockets with 1 TB RAM. The system’s maximal clock frequency is 3.3 GHz. System 2 is a compute node with two sockets, each hosting an AMD Epyc 7713 CPU with 64 physical cores and 1 TB RAM; hence, a total of 128 cores and 2 TB RAM. The system’s maximal clock frequency is 3.7 GHz. On both systems, we run the experiments in Docker containers.

For our calibration, we capped the runs at 50 million simulated neurons. We stop the calibration after the iteration crossing this threshold has finished. For experiments with the Wasserstein distance and synthetic data from Model 2, the calibration took roughly 95 h on system 1 (using 72 cores) and 62 h on system 2 (using 96 cores). Taking into consideration the total number of neurons sampled during this time, system 1 and system 2 achieve a throughput of roughly 148 and 225 neurons per second, respectively. Per core, the two systems yield a throughput of 2.06 and 2.35 neurons per second.

We note that the previous measurements include the Wasserstein distance calculation. Bernton et al. (2019) pointed out that computing the Wasserstein distance is expensive compared to evaluating summary statistics and suggested cheaper alternatives such as the Hilbert or swapping distance. However, they considered models that are significantly faster to compute (e.g., a bivariate normal distribution); for the case at hand, we usually simulate datasets of ${M^\prime }=50$ neurons, which takes approximately 20–25 s. Computing the Wasserstein distance numerically as in (Flamary et al. 2021; Dutta et al. 2021; Bonneel et al. 2011) takes significantly less than a second and, thus, the computational time of the Wasserstein distance computation is none of our concerns.

We further remark that, in some cases, we observe substantial imbalances in the computational workload across the MPI ranks (similar to (Dutta et al. 2021, Fig. 5)). For instance, some SMC iterations in the experiment displayed in Fig. 8c ran on only four ranks for extended periods of time. Scalability issues were partially addressed in (Dutta et al. 2021, Sect. 3) suggesting using dynamic work sharing in an MPI context. However, the load imbalances seem inherent to the algorithm more than they can be attributed to the implementation.

4 Discussion

Before reflecting the individual computational experiments, we wish to emphasize that our experiments collectively demonstrate that ABC combined with morphometrics and statistical distances presents a potent tool for addressing the statistical inverse problem for neuronal growth models. The remainder of the section structurally follows the preceding one.

4.1 Sensitivities

We analyzed the influence of parameter variations on the expectation value of the QoIs, utilizing Saltelli’s method and computing the Sobol indices. The results showed that almost all QoIs are influenced by a set of parameters and cannot typically be explained by a single parameter in isolation. The discrepancies between the total effect index and the first-order sensitivity index hint at higher-order effects. In sum, this shows that even simple models yield complex emerging behavior.

We recall that sensitivities measure variation and, thus, information. If certain QoIs are sensitive to parameter variations, these QoIs likely contain information that may be leveraged during Bayesian inference to find the posterior distribution of the parameter. On the contrary, QoIs that are insensitive to the model parameters are unlikely to add information that can be leveraged during inference. Such QoIs should be omitted because their statistical fluctuations may pollute the distance metric. Here, our sensitivities verified that the selected morphometrics contain relevant information for inferring the parameters.

4.2 SMCABC on synthetic data

We performed various numerical experiments with synthetic data and Model 2, investigating the choice of morphometrics, influences of statistical distances, and the number of samples per parameter. The results are depicted in the Figs. 7, 8, and 9, respectively. We highlight that the experiments clearly demonstrate that ABC with statistical distances and morphometrics can uncover the posterior distributions for simple, resource-driven neuronal growth models.

Effect of the morphometrics

To infer parameters, the QoIs must be informative for the considered parameter set. This effect is demonstrated in Fig. 7 in which we gradually increase the information available for parameter inference by adding one QoI at a time. By leveraging the statistical distances, changing datasets and QoIs becomes trivial endeavor as they automatically consider the distribution without the need of defining appropriate summary statistics for the additional QoIs. This highlights the versatility of the proposed framework and how easy it is to employ in practice. We accentuate that this feature allows the method to easily generalize to more complex neuron models. For a given model, SA helps quantifying which QoIs are sensitive to parameter variations. These QoIs may then be chosen for the inference–ABC based on statistical distances then automatically accounts for the added information in the comparison step (3).

We initialized all experiments in Fig. 7 with uniform priors. In practical application, Bayesian frameworks support leveraging previous information. As new data becomes available or as new QoIs are computed, the previously determined posterior may serve as a prior to the new inference problem. This technique may be relevant when approaching more intricate models requiring more intensive calculations. We redirect the reader to Oden et al. (2017) and Lima et al. (2018) for more details.

Effect of different statistical distances

Figure 8 shows the convergence of the posterior marginals for different statistical distances. Despite their different characteristics, the SMCABC sampler (Del Moral et al. 2012; Bernton et al. 2019) finds posterior distributions in agreement with the data-generating parameter across most statistical distances. Comparing the Wasserstein distance to the KL and $\gamma $ divergence, we observe that the latter requires fewer SMC iterations to concentrate around the data-generating parameter for the elongation speed and resource consumption. However, the Wasserstein distance better identifies the branching probability. Surprisingly, the sliced Wasserstein performed poorly even though it approximates the Wasserstein distance in high-dimensional settings; it appears as if this approximation failed capture sufficient information in this low-dimensional use-case to infer the parameters. Overall, the Wasserstein distance resulted in the most efficient and reliable calibration algorithm, i.e., it found the best posterior distributions as measured by the density at the data-generating parameters while being reliable in its convergence.

When running with KL and $\gamma $, we encountered issues with the SMCABC sampler. In particular, we aborted both algorithms after exhausting 82% and 85% of their simulation budget, respectively, because individual particles got trapped in low probability regions preventing the algorithm from further progressing. Here, we used $\alpha = 0.6$ for all four algorithms, determining how aggressively the SMC sampler moves forward. We suspect that a different choice of $\alpha $ would aid the convergence of these algorithms. A closer investigation of this hypothesis is of interest for the future but outside the scope of this manuscript. Since the purpose of this experiment was to better understand the effects of different distance measures, we fixed all parameters, including $\alpha $, between experiments.

Effect of simulated dataset size

The posterior quality of ABC algorithms generally improves with the computing resources that can be allocated to it. The more data available, the better simulations can be compared with data. The more SMC iterations are executed, the better the results. The more simulations are executed per parameter, the more profound the understanding of the model’s statistical properties. Moreover, the more simulated and observed data is available, the better the estimates of the statistical distances, see Fig. 3.

From our experiments in Fig. 9, we conclude that Wasserstein ABC finds the data-generating parameter even when only small sample sizes are used, e.g., ${M^\prime }= 10$ or ${M^\prime }= 25$. In the absence of computational limitations, it is clear that larger sample sizes are favorable. We decided to use ${M^\prime }=50$ samples per parameter for the experiments on experimental data as this seems to be the best compromise between computational runtime and reliability given our computational resources.

4.3 SMCABC on experimental data

In both cases, i.e, the apical dendrites of human and mouse pyramidal cells in the hippocampus, Wasserstein ABC finds sharp posterior distributions indicating significant confidence in the parameter estimation, see Fig. 10 and 11. The posterior marginals of the mouse data show higher densities and, thus, indicate more confidence in the parameter values. The predictive check shows that the model’s prediction and the data agree concerning their mean values but also reveals that the predictions underestimate the width of some QoI distributions, for instance, the standard deviation of the segment lengths. This also applies to the mean segment length of the human pyramidal cells. The datasets show larger variations than the models predictions for these cases.

Figure 12 presented nine pairs of simulated and experimental neurons to facilitate a more detailed comparison. Some of the pairs, i.e., (a)-(f) underline the similarities between simulation and experiment while others expose shortcoming of the models. We point out the examples (g) and (h) for which the main branch of the apical dendrites breaks into two parts. Neither Model 2 nor the selected QoIs reflect this feature, thus, we cannot expect to find it in the calibrated neurons. Model extensions considering random walk weights and branching probabilities depending on the resource parameters bear the potential to reduce this reality gap in future work.

George Box famously described this ubiquitous reality gap between natural phenomena and mathematical models as all models are wrong, but some are useful. Whether any model, e.g., Model 1 or 2, is a useful approximation to real neurons depends on the subsequent downstream applications. In other words, different applications require the models to accurately mimic distinct neuronal properties that reflect in a set of carefully chosen, application-dependent QoIs. For instance, simulating the detailed electro-physiology of a single neuron imposes different requirements on synthetic neurons than simulating a cortical region involving thousands of neurons. Researchers implementing a simulation based on synthetic neurons must critically reflect on whether the neuron’s QoIs match the problem sufficiently well.

While the models are simplistic and impose a limit on the agreement of data and simulation within the predictive check, the experiment demonstrated that Wasserstein ABC can find the posterior distributions explaining limited features (selected QoIs) of the observed data for two different types of pyramidal cells.

5 Conclusion

This investigation explored approximate solutions for the Bayesian inverse problem encountered in the context of neuron growth models. We investigated the ability of morphometrics to extract essential characteristics of neuronal morphology and explored how statistical distances can be effectively used to incorporate this information into SMCABC samplers. To achieve this, we initially focused on simplified mathematical models for illustrative purposes. We embedded these models into a more general abstract concept–the resource-driven neuron growth model–drawing upon and summarizing existing literature. Finally, to enhance accessibility and potential adoption of this methodology, we presented an implementation efficiently coupling BioDynaMo and ABCpy.

Our investigations on synthetic data demonstrate the effectiveness of ABC with various statistical distances, e.g., the Wasserstein distance and KL/$\gamma $ divergence, in recovering the data-generating parameters. While Wasserstein yielded superior results within the specific experimental setup, further investigation is necessary to confirm general advantages due to potential bias towards specific parameters, particularly the SMC parameter $\alpha = 0.6$ used in this study. The proposed framework facilitates the seamless integration of additional structural information (morphometrics) into the statistical analysis, demonstrating the flexibility of the approach. Additionally, our findings reveal that the employed SMC sampler achieves satisfactory convergence even with few simulations per parameter. Applications to experimental data confirm successful model calibration through the algorithm. We further find that the simple mathematical models describe data derived from mouse pyramidal cells better than human counterparts. Beyond the analyzed quantities of interest, a comprehensive comparison of complete 3D structures revealed additional similarities and discrepancies between simulation and experiment.

Our framework holds significant promise for advancing research in mechanistic, agent-based neuron growth models and neuroscience. The abstract, resource-driven growth model is a robust foundation for systematically exploring diverse realizations and constructing detailed representations of specific neuron types. This endeavor is supported by the flexibility of the framework pairing morphometrics and statistical distances as it allows for seamless adaptation to new data and relevant quantities of interest. Furthermore, the method paves the way for applying Bayesian model selection and other computational techniques from related fields such as predictive computational science (Oden et al. 2017). We believe the community can significantly benefit from adopting these ideas; however, as highlighted by Robert et al. (2011), critical assessment of the approximations inherent in ABC remains crucial when employing the posterior for model selection.

In a broader context, the proposed framework possesses the potential to contribute to the classification of diverse neuronal types by linking the neuron’s morphology (captured through morphometrics), its functional role, and mathematical models. Moreover, this approach could support the simulation of large-scale brain structures, enabling the exploration of phenomena such as cortical lamination (Bauer et al. 2021) by calibrating neuron models for different brain regions. Additionally, a proper understanding of the stochastic processes driving neuronal growth may empower the development of heuristic, stochastic, and biologically-inspired algorithms for designing neural network architectures tailored for specific tasks.

While our experiments effectively demonstrate the method’s potential, we acknowledge certain limitations in our approach. The utilized models and quantities of interest are intentionally kept simple for illustrative purposes and likely require further refinement for broader applicability. Furthermore, the employed datasets are relatively small, and additional data points would facilitate a more thorough comparison of simulation and experiment. Additionally, the algorithm’s runtime scales linearly with the model’s runtime. More complex models may require increased computational resources or the exploration of computationally cheaper surrogates (e.g., Gaussian processes (Rocha et al. 2022)) in place of the current models. Lastly, the SMC algorithm occasionally encounters convergence issues (specifically with KL/$\gamma $ divergence) where particles become trapped in low-probability regions.

Addressing the above limitations presents promising directions for future research. In particular, developing more detailed and complex neuron growth models based on the presented abstract concept, as well as their verification with the SMCABC framework, is a crucial step for further improvements. Different models and neuron types beyond pyramidal cells deserve attention and should be the subject of future studies. Extensions considering growth of multiple, interacting neurons or activity dependent growth presents another interesting avenue. With new neuron types and more extensive simulations, additional, more involved morphometrics must be considered when mapping neurons to a low-dimensional vector space. Such simulations may also benefit from hierarchical Bayesian calibration procedures allowing to tackle a sequence of simpler sub-problems (Oden et al. 2017; Lima et al. 2018). Concerning the ABC algorithm, one should address the issue of single particles limiting calibration progress. Intuitively, many failed attempts to update a single particle signal that it is trapped in a low probability region. Quantifying this information and including it in the resampling step seems a promising avenue for operating the algorithm even more smoothly. Lastly, the posterior distribution contains information that can be used to choose between different mathematical models (see OPAL algorithm presented by Oden et al. (2017)), which would be a substantial step forward. These efforts would collectively support the method’s robust application in more complex scenarios.

In conclusion, this work establishes SMCABC based on statistical distances and morphometrics as a potent tool for approximating the solution of the Bayesian inverse problem and, therefore, calibrating neuronal growth models.

Data availability

This research did not generate new data. All datasets used in the computational experiments are publicly available and can be accessed through neuromorpho.org.

Notes

The ABC literature often refers to QoIs that summarize the statistical characteristics of a simulation (e.g., $\mathcal {M}2$) as summary statistics. To avoid any ambiguity in the terminology, we refer to values quantifying the simulated system’s properties and statistics as QoIs or morphometrics and use the term summary statistics exclusively for $\eta $.

References

Amari S (2016) Information geometry and its applications, vol 194. Springer, Cham. https://doi.org/10.1007/978-4-431-55978-8
Book Google Scholar
Ascoli GA, Krichmar JL, Scorcioni R, Nasuto SJ, Senft SL, Krichmar GL (2001) Computer generation and quantitative morphometric analysis of virtual neurons. Anat Embryol 204:283–301. https://doi.org/10.1007/s004290100201
Article Google Scholar
Ascoli GA, Donohue DE, Halavi M (2007) Neuromorpho.org: a central resource for neuronal morphologies. J Neurosci 27:9247–9251. https://doi.org/10.1523/JNEUROSCI.2055-07.2007
Article Google Scholar
Azevedo FAC, Carvalho LRB, Grinberg LT, Farfel JM, Ferretti REL, Leite REP, Filho WJ, Lent R, Herculano-Houzel S (2009) Equal numbers of neuronal and nonneuronal cells make the human brain an isometrically scaled-up primate brain. J Comp Neurol 513:532–541. https://doi.org/10.1002/cne.21974
Article Google Scholar
Bauer R, Zubler F, Hauri A, Muir DR, Douglas RJ (2014) Developmental origin of patchy axonal connectivity in the neocortex: a computational model. Cereb Cortex 24:487–500. https://doi.org/10.1093/cercor/bhs327
Article Google Scholar
Bauer R, Zubler F, Pfister S, Hauri A, Pfeiffer M, Muir DR, Douglas RJ (2014) Developmental self-construction and -configuration of functional neocortical neuronal networks. PLoS Comput Biol 10:e1003994. https://doi.org/10.1371/journal.pcbi.1003994
Article Google Scholar
Bauer R, Clowry GJ, Kaiser M (2021) Creative destruction: a basic computational model of cortical layer formation. Cereb Cortex 31:3237–3253. https://doi.org/10.1093/cercor/bhab003
Article Google Scholar
Beal MJ, Ghahramani Z (2003) Bayesian statistics 7, chapter the variational Bayesian EM algorithm for incomplete data: with application to scoring graphical model structures. Oxford University Press, Oxford, pp 453–463. https://doi.org/10.1093/oso/9780198526155.003.0025
Book Google Scholar
Beaumont MA (2010) Approximate Bayesian computation in evolution and ecology. Annu Rev Ecol Evol Syst 41:379–406. https://doi.org/10.1146/annurev-ecolsys-102209-144621
Article Google Scholar
Beaumont MA, Zhang W, Balding DJ (2002) Approximate Bayesian computation in population genetics. Genetics 162:2025–2035. https://doi.org/10.1093/genetics/162.4.2025
Article Google Scholar
Benavides-Piccione R, Regalado-Reyes M, Fernaud-Espinosa I, Kastanauskaite A, Tapia-González S, León-Espinosa G, Rojo C, Insausti R, Segev I, DeFelipe J (2019) Differential structure of hippocampal CA1 pyramidal neurons in the human and mouse. Cereb Cortex 30(2):730–752. https://doi.org/10.1093/cercor/bhz122
Article Google Scholar
Bernton E, Jacob PE, Gerber M, Robert CP (2019) Approximate Bayesian computation with the Wasserstein distance. J R Stat Soc Ser B Stat Methodol 81:235–269. https://doi.org/10.1111/rssb.12312
Article MathSciNet Google Scholar
Blei David M, Alp K, McAuliffe Jon D (2017) Variational inference: a review for statisticians. J Am Stat Assoc 112:859–877. https://doi.org/10.1080/01621459.2017.1285773
Article MathSciNet Google Scholar
Bonneel N, van de Panne M, Paris S, Heidrich W (2011) Displacement interpolation using Lagrangian mass transport. In: Proceedings of the 2011 SIGGRAPH Asia conference, ACM, pp 1–12. https://doi.org/10.1145/2024156.2024192
Breitwieser L, Hesam A, de Montigny J, Vavourakis V, Iosif A, Jennings J, Kaiser M, Manca M, Di Meglio A, Al-Ars Z, Rademakers F, Mutlu O, Bauer R (2021) Biodynamo: a modular platform for high-performance agent-based simulation. Bioinformatics 38(2):453–460. https://doi.org/10.1093/bioinformatics/btab649
Article Google Scholar
Breitwieser L, Hesam A, Rademakers F, Gómez Luna J, and Mutlu O (2023) High-performance and scalable agent-based simulation with biodynamo. In: Proceedings of the 28th ACM SIGPLAN annual symposium on principles and practice of parallel programming, ACM, pp 174–188. https://doi.org/10.1145/3572848.3577480
Brooks S, Gelman A, Jones G, Meng X-L (eds) (2011) Handbook of Markov chain Monte Carlo. Chapman and Hall/CRC, Boca Raton. https://doi.org/10.1201/b10905
Book Google Scholar
Cai AQ, Landman KA, Hughes BD (2006) Modelling directional guidance and motility regulation in cell migration. Bull Math Biol 68:25–52. https://doi.org/10.1007/s11538-005-9028-x
Article MathSciNet Google Scholar
Codling EA, Plank MJ, Benhamou S (2008) Random walk models in biology. J R Soc Interface 5:813–834. https://doi.org/10.1098/rsif.2008.0014
Article Google Scholar
Csilléry K, Blum MGB, Gaggiotti OE, François O (2010) Approximate Bayesian computation (ABC) in practice. Trends Ecol Evol 25:410–418. https://doi.org/10.1016/j.tree.2010.04.001
Article Google Scholar
Cuntz H, Forstner F, Borst A, Häusser M (2010) One rule to grow them all: a general theory of neuronal branching and its practical application. PLoS Comput Biol 6:e1000877. https://doi.org/10.1371/journal.pcbi.1000877
Article MathSciNet Google Scholar
DeFelipe J (1997) Types of neurons, synaptic connections and chemical characteristics of cells immunoreactive for calbindin-d28k, parvalbumin and calretinin in the neocortex. J Chem Neuroanat 14:1–19. https://doi.org/10.1016/S0891-0618(97)10013-8
Article Google Scholar
DeFelipe J, Fariñas I (1992) The pyramidal neuron of the cerebral cortex: morphological and chemical characteristics of the synaptic inputs. Prog Neurobiol 39:563–607. https://doi.org/10.1016/0301-0082(92)90015-7
Article Google Scholar
Deitcher Y, Eyal G, Kanari L, Verhoog MB, Kahou GAA, Mansvelder HD, de Kock CPJ, Segev I (2017) Comprehensive morpho-electrotonic analysis shows 2 distinct classes of l2 and l3 pyramidal neurons in human temporal cortex. Cereb Cortex 27:5398–5414. https://doi.org/10.1093/cercor/bhx226
Article Google Scholar
Del Moral P, Doucet A, Jasra A (2006) Sequential Monte Carlo samplers. J R Stat Soc Ser B Stat Methodol 68:411–436. https://doi.org/10.1111/j.1467-9868.2006.00553.x
Article MathSciNet Google Scholar
Del Moral P, Doucet A, Jasra A (2012) An adaptive sequential monte carlo method for approximate bayesian computation. Stat Comput 22:1009–1020. https://doi.org/10.1007/s11222-011-9271-y
Article MathSciNet Google Scholar
Dowson DC, Landau BV (1982) The fréchet distance between multivariate normal distributions. J Multivar Anal 12:450–455. https://doi.org/10.1016/0047-259X(82)90077-X
Article Google Scholar
Dutta R, Schoengens M, Pacchiardi L, Ummadisingu A, Widmer N, Künzli P, Onnela J-P, Mira A (2021) Abcpy: a high-performance computing perspective to approximate Bayesian computation. J Stat Softw 100:107. https://doi.org/10.18637/jss.v100.i07
Duswald T, Breitwieser L, Thorne T, Wohlmuth B, Bauer R (2024) Supplementary information– Calibration of stochastic, agent-based neuron growth models with approximate Bayesian computation. Zenodo. https://doi.org/10.5281/zenodo.13810423
Elston GN (2011) Pyramidal cells in prefrontal cortex of primates: marked differences in neuronal structure among species. Front Neuroanat 5:2. https://doi.org/10.3389/fnana.2011.00002
Article Google Scholar
Flamary R, Courty N, Gramfort A, Alaya MZ, Boisbunon A, Chambon S, Chapel L, Corenflos A, Fatras K, Fournier N, Gautheron L, Gayraud NTH, Janati H, Rakotomamonjy A, Redko I, Rolet A, Schutz A, Seguy V, Sutherland DJ, Tavenard R, Tong A, Vayer T (2021) Pot: Python optimal transport. J Mach Learn Res 22(78):1–8
Google Scholar
Fujisawa M, Teshima T, Sato I, Sugiyama M (2011) $\gamma $-abc: outlier-robust approximate bayesian computation based on a robust divergence estimator. In: Banerjee A, Fukumizu K (eds) Proceedings of the 24th international conference on artificial intelligence and statistics, proceedings of machine learning research, vol 130. PMLR, pp 1783–1791
Gelfand AE, Smith AFM (1990) Sampling-based approaches to calculating marginal densities. J Am Stat Assoc 85:398. https://doi.org/10.2307/2289776
Article MathSciNet Google Scholar
Geman S, Geman D (1984) Stochastic relaxation, Gibbs distributions, and the Bayesian restoration of images. IEEE Trans Pattern Anal Mach Intell 6:721–741. https://doi.org/10.1109/TPAMI.1984.4767596
Article Google Scholar
Genevay A, Peyre G, Cuturi M (2018) Learning generative models with Sinkhorn divergences. In: Storkey A, Perez-Cruz F (eds) Proceedings of the 21st international conference on artificial intelligence and statistics, proceedings of machine learning research, vol 84. PMLR, pp 1608–1617
Goodhill GJ, Ming G, Urbach JS (2004) Predicting axonal response to molecular gradients with a computational model of filopodial dynamics. Neural Comput 16:2221–2243. https://doi.org/10.1162/0899766041941934
Article Google Scholar
Graham BP, van Ooyen A (2006) Mathematical modelling and numerical simulation of the morphological development of neurons. BMC Neurosci 7:S9. https://doi.org/10.1186/1471-2202-7-S1-S9
Article Google Scholar
Graham BP, Lauchlan K, Mclean DR (2006) Dynamics of outgrowth in a continuum model of neurite elongation. J Comput Neurosci 20:43–60. https://doi.org/10.1007/s10827-006-5330-3
Article MathSciNet Google Scholar
Hamilton P (1993) A language to describe the growth of neurites. Biol Cybern 68:559–565. https://doi.org/10.1007/BF00200816
Article Google Scholar
Hannezo E, Scheele CLGJ, Moad M, Drogo N, Heer R, Sampogna RV, van Rheenen J, Simons BD (2017) A unifying theory of branching morphogenesis. Cell 171:242–25527. https://doi.org/10.1016/j.cell.2017.08.026
Article Google Scholar
Hassan BA, Robin Hiesinger P (2015) Beyond molecular codes: simple rules to wire complex brains. Cell 163:285–291. https://doi.org/10.1016/j.cell.2015.09.031
Article Google Scholar
Hastings WK (1970) Monte Carlo sampling methods using Markov chains and their applications. Biometrika 57:97. https://doi.org/10.2307/2334940
Article MathSciNet Google Scholar
Herculano-Houzel S (2009) The human brain in numbers: a linearly scaled-up primate brain. Front Hum Neurosci. https://doi.org/10.3389/neuro.09.031.2009
Article Google Scholar
Herman J, Usher W (2017) Salib: an open-source python library for sensitivity analysis. J Open Source Softw 2:97. https://doi.org/10.21105/joss.00097
Article Google Scholar
Hirokawa N, Noda Y, Tanaka Y, Niwa S (2009) Kinesin superfamily motor proteins and intracellular transport. Nat Rev Mol Cell Biol 10:682–696. https://doi.org/10.1038/nrm2774
Article Google Scholar
Hirokawa N, Niwa S, Tanaka Y (2010) Molecular motors in neurons: transport mechanisms and roles in brain function, development, and disease. Neuron 68:610–638. https://doi.org/10.1016/j.neuron.2010.09.039
Article Google Scholar
Hjorth JJJ, van Pelt J, Mansvelder HD, van Ooyen A (2014) Competitive dynamics during resource-driven neurite outgrowth. PLoS ONE 9:e86741. https://doi.org/10.1371/journal.pone.0086741
Article Google Scholar
Hodgkin AL, Huxley AF (1952) A quantitative description of membrane current and its application to conduction and excitation in nerve. J Physiol 117:500. https://doi.org/10.1113/jphysiol.1952.sp004764
Article Google Scholar
Jiang B (2018) Approximate Bayesian computation with Kullback–Leibler divergence as data discrepancy. In: Storkey A, Perez-Cruz F (eds) Proceedings of the 21st international conference on artificial intelligence and statistics, proceedings of machine learning research, vol 84. PMLR, pp 1711–1721
Jordan MI, Ghahramani Z, Jaakkola TS, Saul LK (1999) An introduction to variational methods for graphical models. Mach Learn 37(2):183–233
Article Google Scholar
Kamal A, Stokin GB, Yang Z, Xia C-H, Goldstein LSB (2000) Axonal transport of amyloid precursor protein is mediated by direct binding to the kinesin light chain subunit of kinesin-i. Neuron 28:449–459. https://doi.org/10.1016/S0896-6273(00)00124-0
Article Google Scholar
Kassraian-Fard P, Pfeiffer M, Bauer R (2020) A generative growth model for thalamocortical axonal branching in primary visual cortex. PLoS Comput Biol 16:e1007315. https://doi.org/10.1371/journal.pcbi.1007315
Article Google Scholar
Kiddie G, McLean D, Van Ooyen A, Graham B (2005) Biologically plausible models of neurite outgrowth. In: Development, dynamics and pathiology of neuronal networks: from molecules to functional circuits, progress in brain research, vol 147. Elsevier, pp 67–80. https://doi.org/10.1016/S0079-6123(04)47006-X
Krottje JK, van Ooyen A (2007) A mathematical framework for modeling axon guidance. Bull Math Biol 69:3–31. https://doi.org/10.1007/s11538-006-9142-4
Article MathSciNet Google Scholar
Lambert B, MacLean AL, Fletcher AG, Combes AN, Little MH, Byrne HM (2018) Bayesian inference of agent-based models: a tool for studying kidney branching morphogenesis. J Math Biol 76:1673–1697. https://doi.org/10.1007/s00285-018-1208-z
Article MathSciNet Google Scholar
LeCun Y, Bengio Y, Hinton G (2015) Deep learning. Nature 521:436–444. https://doi.org/10.1038/nature14539
Article Google Scholar
Lee A, Latuszynski K (2014) Variance bounding and geometric ergodicity of Markov chain Monte Carlo kernels for approximate Bayesian computation. Biometrika 101:655–671. https://doi.org/10.1093/biomet/asu027
Article MathSciNet Google Scholar
Lee A (2012) On the choice of MCMC kernels for approximate Bayesian computation with SMC samplers. In: Proceedings of the 2012 winter simulation conference (WSC), IEEE, pp 1–12. https://doi.org/10.1109/WSC.2012.6465212
Lima EABF, Ghousifam N, Ozkan A, Oden JT, Shahmoradi A, Rylander MN, Wohlmuth B, Yankeelov TE (2018) Calibration of multi-parameter models of avascular tumor growth using time resolved microscopy data. Sci Rep 8:14558. https://doi.org/10.1038/s41598-018-32347-9
Article Google Scholar
Lindenmayer A (1968) Mathematical models for cellular interactions in development i. Filaments with one-sided inputs. J Theor Biol 18:280–299. https://doi.org/10.1016/0022-5193(68)90079-9
Article Google Scholar
Linsker R (1986) From basic network principles to neural architecture: emergence of spatial-opponent cells. Proc Natl Acad Sci 83:7508–7512. https://doi.org/10.1073/pnas.83.19.7508
Article Google Scholar
Lyons DA, Naylor SG, Scholze A, Talbot WS (2009) Kif1b is essential for MRNA localization in oligodendrocytes and development of myelinated axons. Nat Genet 41:854–858. https://doi.org/10.1038/ng.376
Article Google Scholar
Marjoram P, Molitor J, Plagnol V, Tavaré S (2003) Markov chain Monte Carlo without likelihoods. Proc Natl Acad Sci 100:15324–15328. https://doi.org/10.1073/pnas.0306899100
Article Google Scholar
Markram H, Toledo-Rodriguez M, Wang Y, Gupta A, Silberberg G, Caizhi W (2004) Interneurons of the neocortical inhibitory system. Nat Rev Neurosci 5:793–807. https://doi.org/10.1038/nrn1519
Article Google Scholar
Martin GM, Frazier DT, Robert CP (2020) Computing Bayes: Bayesian computation from 1763 to the 21st century
Martin GM, Frazier DT, Robert CP (2024) Approximating Bayes in the 21st century. Stat Sci 39(1):20–45. https://doi.org/10.1214/22-STS875
Article MathSciNet Google Scholar
McLean DR, van Ooyen A, Graham BP (2004) Continuum model for tubulin-driven neurite elongation. Neurocomputing 58–60:511–516. https://doi.org/10.1016/j.neucom.2004.01.088
Article MathSciNet Google Scholar
Metropolis N, Rosenbluth AW, Rosenbluth MN, Teller AH, Teller E (1953) Equation of state calculations by fast computing machines. J Chem Phys 21:1087–1092. https://doi.org/10.1063/1.1699114
Article Google Scholar
Nadjahi K, De Bortoli V, Durmus A, Badeau R, şimşekli U (2020) Approximate Bayesian computation with the sliced-wasserstein distance. In: ICASSP 2020–2020 IEEE international conference on acoustics, speech and signal processing (ICASSP), pp 5470–5474. https://doi.org/10.1109/ICASSP40776.2020.9054735
Oden JT, Babuska I, Faghihi D (2017) Predictive computational science: computer predictions in the presence of uncertainty. Encyclopedia of computational mechanics, 2nd edn. Wiley, New York, pp 1–26. https://doi.org/10.1002/9781119176817.ecm2101
Chapter Google Scholar
Palacios J, lidakanari, Zisis E, MikeG, Coste B, asanin epfl, Vanherpe L, jdcourcol, Arnaudon A, haleepfl, Getta P, Povolotsky AV, Sato A, alex4200, Berchet A, Amsalem O, tomdele (2021) Bluebrain/neurom: v3.1.0
Park M, Jitkrittum W, Sejdinovic D (2016) K2-abc: approximate Bayesian computation with kernel embeddings. In: Gretton A, Robert CC (eds) Proceedings of the 19th international conference on artificial intelligence and statistics, proceedings of machine learning research, Cadiz, Spain, vol 51. pp 398–407
Price LF, Drovandi CC, Lee A, Nott DJ (2018) Bayesian synthetic likelihood. J Comput Graph Stat 27:1–11. https://doi.org/10.1080/10618600.2017.1302882
Article MathSciNet Google Scholar
Pritchard JK, Seielstad MT, Perez-Lezaun A, Feldman MW (1999) Population growth of human y chromosomes: a study of y chromosome microsatellites. Mol Biol Evol 16:1791–1798. https://doi.org/10.1093/oxfordjournals.molbev.a026091
Article Google Scholar
Qian K, Pawar A, Liao A, Anitescu C, Webster-Wood V, Feinberg AW, Rabczuk T, Zhang YJ (2022) Modeling neuron growth using isogeometric collocation based phase field method. Sci Rep 12:8120. https://doi.org/10.1038/s41598-022-12073-z
Article Google Scholar
Robert CP, Cornuet J-M, Marin J-M, Pillai NS (2011) Lack of confidence in approximate Bayesian computation model choice. Proc Natl Acad Sci 108:15112–15117. https://doi.org/10.1073/pnas.1102900108
Article Google Scholar
Rocha HL, Silva JVO, Silva RS, Lima EABF, Almeida RC (2022) Bayesian inference using gaussian process surrogates in cancer modeling. Comput Methods Appl Mech Eng 399:115412. https://doi.org/10.1016/j.cma.2022.115412
Article MathSciNet Google Scholar
Rue H, Martino S, Chopin N (2009) Approximate Bayesian inference for latent gaussian models by using integrated nested Laplace approximations. J R Stat Soc Ser B Stat Methodol 71:319–392. https://doi.org/10.1111/j.1467-9868.2008.00700.x
Article MathSciNet Google Scholar
Ryder EF, Bullard L, Hone J, Olmstead J, Ward MO (1999) Graphical simulation of early development of the cerebral cortex. Comput Methods Programs Biomed 59:107–114. https://doi.org/10.1016/S0169-2607(98)00096-0
Article Google Scholar
Ryu J, Liu L, Wong TP, Dong Chuan W, Burette A, Richard Weinberg Yu, Wang T, Sheng M (2006) A critical role for myosin IIB in dendritic spine morphology and synaptic function. Neuron 49:175–182. https://doi.org/10.1016/j.neuron.2005.12.017
Article Google Scholar
Saltelli A (2002) Making best use of model evaluations to compute sensitivity indices. Comput Phys Commun 145:280–297. https://doi.org/10.1016/S0010-4655(02)00280-1
Article Google Scholar
Saltelli A, Ratto M, Andres T, Campolongo F, Cariboni J, Gatelli D, Saisana M, Tarantola S (2008) Global sensitivity analysis: the primer, vol 1, 1st edn. Wiley, New York
Google Scholar
Saltelli A, Annoni P, Azzini I, Campolongo F, Ratto M, Tarantola S (2010) Variance based sensitivity analysis of model output. design and estimator for the total sensitivity index. Comput Phys Commun 181:259–270. https://doi.org/10.1016/j.cpc.2009.09.018
Article MathSciNet Google Scholar
Samuels DC, Hentschel HGE, Fine A (1996) The origin of neuronal polarization: a model of axon formation. Philos Trans R Soc Lond Seri B Biol Sci 351:1147–1156. https://doi.org/10.1098/rstb.1996.0099
Article Google Scholar
Satoh D, Sato D, Tsuyama T, Saito M, Ohkura H, Rolls MM, Ishikawa F, Uemura T (2008) Spatial control of branching within dendritic arbors by dynein-dependent transport of rab5-endosomes. Nat Cell Biol 10:1164–1171. https://doi.org/10.1038/ncb1776
Article Google Scholar
Shinbrot T (2006) Simulated morphogenesis of developmental folds due to proliferative pressure. J Theor Biol 242:764–773. https://doi.org/10.1016/j.jtbi.2006.04.021
Article MathSciNet Google Scholar
Shree S, Sutradhar S, Trottier O, Yuhai T, Liang X, Howard J (2022) Dynamic instability of dendrite tips generates the highly branched morphologies of sensory neurons. Sci Adv 8:0080. https://doi.org/10.1126/sciadv.abn0080
Article Google Scholar
Sisson SA, Fan Y, Tanaka MM (2007) Sequential Monte Carlo without likelihoods. Proc Natl Acad Sci 104:1760–1765. https://doi.org/10.1073/pnas.0607208104
Article MathSciNet Google Scholar
Sisson SA, Fan Y, Beaumont M (2019) Handbook of approximate Bayesian computation, 1st edn. CRC Press, Boca Raton
Google Scholar
Sobol IM (2001) Global sensitivity indices for nonlinear mathematical models and their Monte Carlo estimates. Math Comput Simul 55:271–280. https://doi.org/10.1016/S0378-4754(00)00270-6
Article MathSciNet Google Scholar
Tavaré S, Balding DJ, Griffiths RC, Donnelly P (1997) Inferring coalescence times from DNA sequence data. Genetics 145:505–518. https://doi.org/10.1093/genetics/145.2.505
Article Google Scholar
Torben-Nielsen B, Cuntz H (2014) The computing dendrite, chapter introduction to dendritic morphology. Springer, New York, NY, pp 3–22. https://doi.org/10.1007/978-1-4614-8094-5_1
Book Google Scholar
Torben-Nielsen B, Vanderlooy S, Postma EO (2008) Non-parametric algorithmic generation of neuronal morphologies. Neuroinformatics 6:257–277. https://doi.org/10.1007/s12021-008-9026-x
Article Google Scholar
Torres-Gomez S, Blonde JD, Mendoza-Halliday D, Kuebler E, Everest M, Wang XJ, Inoue W, Poulter MO, Martinez-Trujillo J (2020) Changes in the proportion of inhibitory interneuron types from sensory to executive areas of the primate neocortex: Implications for the origins of working memory representations. Cereb Cortex 30:4544–4562. https://doi.org/10.1093/cercor/bhaa056
Article Google Scholar
van Ooyen A, Graham BP, Ramakers GJA (2001) Competition for tubulin between growing neurites during development. Neurocomputing 38–40:73–78. https://doi.org/10.1016/S0925-2312(01)00487-8
Article Google Scholar
Vasilev V, Ph Canal A, Naumann PR (2012) Cling–the new interactive interpreter for root 6. J Phys: Conf Ser 396:052071. https://doi.org/10.1088/1742-6596/396/5/052071
Article Google Scholar
Wang X, Jenner AL, Salomone R, Warne DJ, Drovandi C (2024) Calibration of agent based models for monophasic and biphasic Tumour growth using approximate Bayesian computation. J Math Biol 88(3):28. https://doi.org/10.1007/s00285-024-02045-4
Article MathSciNet Google Scholar
Zador AM (2019) A critique of pure learning and what artificial neural networks can learn from animal brains. Nat Commun 10(12):3770. https://doi.org/10.1038/s41467-019-11786-6
Article Google Scholar
Zheng Y, Wildonger J, Ye B, Zhang Y, Kita A, Younger SH, Zimmerman S, Jan LY, Jan YN (2008) Dynein is required for polarized dendritic transport and uniform microtubule orientation in axons. Nat Cell Biol 10(1172–1180):10. https://doi.org/10.1038/ncb1777
Article Google Scholar
Zubler F, Douglas R (2009) A framework for modeling the growth and development of neurons and networks. Front Comput Neurosci 3:757. https://doi.org/10.3389/neuro.10.025.2009
Article Google Scholar

Download references

Acknowledgements

The work of T.D. has been sponsored by the Wolfgang Gentner Programme of the German Federal Ministry of Education and Research (grant no. 13E18CHA). The work of B.W. was partly funded by the German Research Foundation by grants WO671/11-1. R.B. was supported by the Medical Research Council (grant no. MR/N015037/1) and the Engineering and Physical Sciences Research Council (grant nos. EP/S001433/1 and EP/S001433/2).

Funding

Bundesministerium für Bildung und Forschung (13E18CHA), Deutsche Forschungsgemeinschaft (WO671/11-1), Medical Research Council (MR/N015037/1), Engineering and Physical Sciences Research Council (EP/S001433/1), Engineering and Physical Sciences Research Council (EP/S001433/2).

Author information

Barbara Wohlmuth and Roman Bauer share senior authorship.

Authors and Affiliations

CERN, Geneva, Switzerland
Tobias Duswald
School of Computation, Information, and Technology, Technical University of Munich, Munich, Germany
Tobias Duswald & Barbara Wohlmuth
Department of Information Technology and Electrical Engineering, ETH Zurich, Zurich, Switzerland
Lukas Breitwieser
School of Computer Science and Electronic Engineering, University of Surrey, Guildford, UK
Thomas Thorne & Roman Bauer

Authors

Tobias Duswald
View author publications
You can also search for this author in PubMed Google Scholar
Lukas Breitwieser
View author publications
You can also search for this author in PubMed Google Scholar
Thomas Thorne
View author publications
You can also search for this author in PubMed Google Scholar
Barbara Wohlmuth
View author publications
You can also search for this author in PubMed Google Scholar
Roman Bauer
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Tobias Duswald.

Ethics declarations

Conflict of interest

The authors declare no conflict of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Appendices

Appendix A: Sliced-Wasserstein distance

Calibration of Model 2 with the sliced-Wasserstein distance and synthetic data.

See Fig. 13.

Appendix B: Parameter and descriptive statistics

See Tables 3, 4, 5.

Table 3 Parameter sets for model stochasticity study, see Fig. 5

Full size table

Table 4 Descriptive statistics of the distributions depicted in Fig. 5

Full size table

Table 5 Sensitivity analysis: parameter bounds for $\Omega $ for the different models

Full size table

Appendix C: Algorithms

Simulation logic and agent algorithms. This appendix gives pseudo-code of the simulation logic and the algorithms that govern the agents during the neuronal growth simulation.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Duswald, T., Breitwieser, L., Thorne, T. et al. Calibration of stochastic, agent-based neuron growth models with approximate Bayesian computation. J. Math. Biol. 89, 50 (2024). https://doi.org/10.1007/s00285-024-02144-2

Download citation

Received: 22 May 2024
Revised: 22 May 2024
Accepted: 31 August 2024
Published: 08 October 2024
DOI: https://doi.org/10.1007/s00285-024-02144-2

Keywords

Mathematics Subject Classification

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Calibration of stochastic, agent-based neuron growth models with approximate Bayesian computation

Abstract

Similar content being viewed by others

Mathematical Models in Neuroscience: Approaches to Experimental Design and Reliable Parameter Determination

Mathematical Models in Neuroscience: Approaches to Experimental Design and Reliable Parameter Determination

Neural Modeling

1 Introduction

2 Materials and methods

2.1 Mechanistic neuron growth models

2.1.1 Biological background

2.1.2 Agent-based neuron discretization

2.1.3 Resource-driven neuron growth model

2.2 Data sources and processing

2.2.1 Morphometrics

2.2.2 Synthetic data

2.2.3 Experimental data

2.3 Numerical methods

2.3.1 Sensitivity analysis

2.3.2 Bayesian computation

2.3.3 Approximate Bayesian computation

2.4 Software implementation and interfaces

3 Results

3.1 Model stochasticity

3.2 Sensitivity analysis

3.3 Solving the stochastic inverse problem with SMCABC

3.3.1 Synthetic data

3.3.2 Experimental data

3.4 Runtime and computational cost

4 Discussion

4.1 Sensitivities

4.2 SMCABC on synthetic data

4.3 SMCABC on experimental data

5 Conclusion

Data availability

Notes

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Appendices

Appendix A: Sliced-Wasserstein distance

Appendix B: Parameter and descriptive statistics

Appendix C: Algorithms

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Mathematics Subject Classification

Search

Navigation