Probabilistic solvers enable a straight-forward exploration of numerical uncertainty in neuroscience models

Oesterle, Jonathan; Krämer, Nicholas; Hennig, Philipp; Berens, Philipp

doi:10.1007/s10827-022-00827-7

Probabilistic solvers enable a straight-forward exploration of numerical uncertainty in neuroscience models

ORIGINAL ARTICLE
Open access
Published: 06 August 2022

Volume 50, pages 485–503, (2022)
Cite this article

Download PDF

You have full access to this open access article

Journal of Computational Neuroscience Aims and scope Submit manuscript

Probabilistic solvers enable a straight-forward exploration of numerical uncertainty in neuroscience models

Download PDF

2773 Accesses
1 Citation
3 Altmetric
Explore all metrics

A Correction to this article was published on 26 June 2023

This article has been updated

Abstract

Understanding neural computation on the mechanistic level requires models of neurons and neuronal networks. To analyze such models one typically has to solve coupled ordinary differential equations (ODEs), which describe the dynamics of the underlying neural system. These ODEs are solved numerically with deterministic ODE solvers that yield single solutions with either no, or only a global scalar error indicator on precision. It can therefore be challenging to estimate the effect of numerical uncertainty on quantities of interest, such as spike-times and the number of spikes. To overcome this problem, we propose to use recently developed sampling-based probabilistic solvers, which are able to quantify such numerical uncertainties. They neither require detailed insights into the kinetics of the models, nor are they difficult to implement. We show that numerical uncertainty can affect the outcome of typical neuroscience simulations, e.g. jittering spikes by milliseconds or even adding or removing individual spikes from simulations altogether, and demonstrate that probabilistic solvers reveal these numerical uncertainties with only moderate computational overhead.

Stochastic Ion Channel Gating and Probabilistic Computation in Dendritic Neurons

Monosynaptic inference via finely-timed spikes

Article 28 January 2021

Modeling and characterizing stochastic neurons based on in vitro voltage-dependent spike probability functions

Article 12 June 2021

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

Computational neuroscience is built around computational models of neurons that allow the simulation and analysis of signal processing in the central nervous system. These models can describe neural computations on different levels of abstraction. On the statistical level, e.g. generalized linear models have been used to provide a probabilistic model mapping environmental variables to neural activity (Pillow et al., 2008). For such statistical models, quantifying the uncertainty of the parameters can be achieved using Bayesian approaches (Gerwinn et al., 2008). On the mechanistic level, the models typically take the form of systems of coupled ordinary differential equations (ODEs), which describe the dynamics of the membrane potential and give rise to the spike-times (Gerstner & Kistler, 2002; Izhikevich, 2007). Recently, likelihood-free inference approaches have made it possible to perform uncertainty-aware inference even for such complicated mechanistic models (Gonçalves et al., 2020; Oesterle et al., 2020; Papamakarios et al., 2018).

However, mechanistic models of neurons are subject to an additional source of uncertainty: the numerical error caused by the solution of the model’s ODEs with a concrete algorithm (Hennig et al., 2015). This arises because all numerical solvers are necessarily run with finite time and limited resources, so their estimate diverges from the true solution of the ODE, even if the problem is well-posed. When simulating neurons, one would like to compute a numerical solution close to the true solution of the ODE, to ensure that conclusions drawn from the simulations are based on the mechanisms described by the model rather than the specific choice, setting and implementation of the ODE solver.

Many of the well-established numerical solvers do report a global error estimate and a corresponding tolerance that can be set by the user (Hairer et al., 1993, Chapter II.4). This global scalar error, though, does not capture how the numerical error arising from finite step-sizes used in practice affects crucial quantities of interest in the simulation, such as spike-times or the number of spikes. In practice, it can therefore be challenging to select a tolerance that strikes a good balance between run time and accuracy.

For some of the most common mechanistic models in neuroscience like the Hodgkin–Huxley or Izhikevich neuron model, errors in numerical integration have been studied in detail for a range of solvers and different integration step-sizes (Stewart & Bair, 2009; Borgers & Nectow, 2013; Chen et al., 2020). These studies have shown that standard solvers are often not the best choice in terms of accuracy or the accuracy vs. run time tradeoff. Therefore, the authors of these studies proposed to use specific solvers for the analyzed models, e.g. the Parker–Sochacki method for the Hodgkin–Huxley and Izhikevich neuron (Stewart & Bair, 2009), an exponential midpoint method (Borgers & Nectow, 2013) or second-order Strang splitting (Chen et al., 2020) for Hodgkin–Huxley-like models. While improving computations for the specific problems, applying these to other scenarios requires a detailed understanding of the kinetics of the neuron model of interest; and while choosing a “good” solver for a given model is important, it is typically not necessary to choose the “best” ODE solver. In many cases, it can be sufficient to ensure that the computed solution is within a certain accuracy.

As a more general approach to quantify the numerical uncertainty in mechanistic models in neuroscience, we therefore propose to use probabilistic ODE solvers (Hennig et al., 2015; Oates & Sullivan, 2019; Cockayne et al., 2019). In contrast to classical ODE solvers, this class of solvers does not only yield a single solution, but instead a distribution over solutions that quantifies the numerical uncertainty.

Several frameworks for probabilistic ODE solvers have been proposed, which differ mostly in the tradeoff between computational cost and flexibility of the posterior, from fast Gaussian filters (Schober et al., 2019; Tronarp et al., 2019; Krämer et al., 2021) to sampling-based approaches (Conrad et al., 2017; Chkrebtii et al., 2016; Teymur et al., 2016, 2018; Abdulle & Garegnani, 2020). These solvers have been mostly tested for well-behaved systems with well-behaved solutions, but the ODEs used to simulate neural activity model the non-linear membrane dynamics that underlie the all-or-none nature of an action potential. Here, we use two related approaches of probabilistic ODE integration, the state perturbation proposed by Conrad et al. (2017) and the step-size perturbation of Abdulle and Garegnani (2020). Both build on existing explicit, iterative ODE solvers and stochastically perturb the numerical integration of individual steps taken by the underlying solvers. These perturbations make the solution of every step probabilistic and therefore also the solution as a whole. The magnitude of the perturbation has to be calibrated, such that the solver’s output distribution reflects the numerical uncertainty in the solution.

Here, we explore the potential of probabilistic ODE solvers for neuron models. We show how the state and step-size perturbation methods can be used to quantify and reveal numerical uncertainty caused by the numerical ODE integration and demonstrate that the solver outputs are easy to interpret. For this, we simulate typical neuron models, namely the Izhikevich neuron model (Izhikevich, 2004), as a representative of leaky-integrate-and-fire neuron models, single-compartment Hodgkin–Huxley models (Hodgkin & Huxley, 1952) and a model with three synaptically coupled Hodgkin–Huxley-like neurons (Prinz et al., 2004) as an example of a neuronal network model. Lastly, we discuss practical considerations and limitations of these probabilistic solvers such as the calibration of the perturbation and the computational overhead.

Taken together, our results suggest that probabilistic ODE solvers should be considered as a useful tool for the simulation of neuronal systems, to increase the quality and reliability of such simulations over those achieved with classic solvers.

2 Methods and models

2.1 Probabilistic solvers

Simulating neuron models typically amounts to solving an initial value problem (IVP) based on a set of coupled ODEs. In abstract form, an initial value problem is given by

$$\begin{aligned} \dot{\mathbf {x}}(t) = {f}(t, \mathbf {x}(t)), \quad \mathbf {x} (t_0) = \mathbf {x}_0, \end{aligned}$$

(1)

where f, $\mathbf {x}_0$ and $t_0$ are known and $\mathbf {x}(t)$ for $t>t_0$ is the quantity of interest. The solution to the initial value problem at time $t + \Delta t$ provided the solution at time t, is given by integrating Eq. (1) from t to $t + \Delta t$:

$$\begin{aligned} \mathbf {x}(t + \Delta t) = \mathbf {x}(t) + \int _{t}^{t + \Delta t} f(s, \mathbf {x}(s)) \,\text {d} s. \end{aligned}$$

(2)

Except for special cases, this integral has no analytic form and must be solved numerically. For example, the forward Euler method approximates the integral as $\int _{t}^{t + \Delta t} f(s, \mathbf {x}(s)) \,\text {d} s \approx \Delta t \cdot f(t, \mathbf {x}(t))$. To simulate a neuron, Eq. (2) is solved iteratively, which results in a sequence of solutions $X = [\mathbf {x}(t_0), \mathbf {x}(t_1), \mathbf {x}(t_2), ..., \mathbf {x}(t_M)]$ for a set of time points with $t_{i+1} > t_i$ and a maximum time point $t_M$. Standard solvers yield a deterministic solution in every step, and therefore for the solution X as a whole. In contrast, the probabilistic solvers used in this study stochastically perturb the numerical integration used to approximate Eq. (2), which makes the solution of every step—and therefore of the whole solution—probabilistic. For a given IVP and solver, one can therefore generate a sample distribution of solutions X by repeating the iterative numerical integration from $t_0$ to $t_M$ multiple times. To create these probabilistic solvers, we implemented the state perturbation algorithm of Conrad et al. (2017) and the step-size perturbation algorithm of Abdulle and Garegnani (2020).

In the state perturbation algorithm (Conrad et al., 2017), in each step of the numerical integration, a small independently drawn noise term $\varvec{\xi }_t$ is added to the solution $\mathbf {x}_\text {det.}(t+\Delta t)$ of a corresponding deterministic integration scheme:

$$\begin{aligned} \begin{aligned} \mathbf {x}_\text {prb}(t+\Delta t)&= \mathbf {x}_\text {det.}(t+\Delta t) + \varvec{\xi }_t, \\ \varvec{\xi }_t&\sim \mathcal {N}(\varvec{0}, \text {diag}(\varvec{\nu }_t)^2), \end{aligned} \end{aligned}$$

(3)

where $\varvec{\nu }_t$ controls the magnitude of the perturbation. The perturbation is only efficient when $\varvec{\nu }_t$ is of the right order: if chosen too small, the uncertainty will be underestimated; if chosen too large, it will render the solver output useless. Conrad et al. (2017) suggested calibrating $\varvec{\nu }_t$ to replicate the amount of error introduced by the numerical scheme. We chose $\varvec{\nu }_t = \sigma \varvec{\varepsilon }_t$ using the error estimator $\varvec{\varepsilon }_t$ readily available in methods that were developed for step-size adaptation (see Sect. 5.1), and a scalar perturbation parameter $\sigma$ that can be adjusted to calibrate the perturbation. If not stated otherwise, we used $\sigma =1$. An example of this perturbation method is shown in Fig. 1A for a single integration step and in Fig. 1C for an Izhikevich neuron model.

A related approach to stochastically perturbing the numerical integration was proposed by Abdulle and Garegnani (2020), where noise is added to the integration step-size (i.e. to the “input” of the solver, rather than the “output”, cf. Fig. 1B). The numerical integration is performed using the perturbed step-size $\zeta _t$, but the computed solution is treated as the solution for the original step-size $\Delta t$:

$$\begin{aligned} \mathbf {x}_\text {prb}(t+\Delta t) = \mathbf {x}_\text {det.}(t + \zeta _t), \quad \zeta _t \sim \mathcal {P}, \end{aligned}$$

(4)

where $\zeta _t$ is the perturbed step-size drawn from a distribution $\mathcal {P}$ and $\mathbf {x}_\text {det.}(\bullet )$ is a deterministic integration scheme that approximates Eq. (2). For example, for the forward Euler method Eq. (4) would be computed as $\mathbf {x}_\text {prb}(t+\Delta t) = \mathbf {x}_\text {det.}(t) + \zeta _t \cdot f(t, \mathbf {x}_\text {det.}(t))$. Abdulle and Garegnani (2020) defined three properties the i.i.d. random variables $\zeta _t$ should fulfill:

$\mathcal {P}(\zeta _t > 0) = 1$,
there exists $\Delta t$ such that $\mathbb {E}[\zeta _t] = \Delta t$, and
there exist $p \ge 0.5$ and $C > 0$ independent of t such that $\mathbb {E}[(\zeta _t - \Delta t)^2] = C \cdot {\Delta t}^{2p+1}$.

Based on these restrictions, they proposed, as an example, to use a log-normal distribution:

$$\begin{aligned} \zeta _t \sim \mathcal {LN}_t(m, s^2), \end{aligned}$$

(5)

where m and s are the mean and standard deviation of the underlying normal distribution, which are given as:

$$\begin{aligned} \begin{aligned} m&= \ln ({\Delta t}^2/\phi ), \\ s&= \sqrt{2\ln (\phi /{\Delta t})}, \\ \phi&= \sqrt{\mathbb {E}^2[\zeta _t] + \text {Var}[ \zeta _t]} = \sqrt{{\Delta t}^2 + C \cdot {\Delta t}^{2p+1}}. \end{aligned} \end{aligned}$$

(6)

Using $p \le O$, where O is the order of the method, ensures that the mean-squared convergence order of the method is not changed. We used $p=O$ throughout to maximize the effect of the perturbation without changing the convergence order. We further generalized the example provided by Abdulle and Garegnani (2020) in which $C=1$ to a parametrized distribution by setting $C = \sigma ^2$, i.e. setting $\phi = \sqrt{{\Delta t}^2 + \sigma ^2 \cdot {\Delta t}^{2 O+1}}$. The introduction of the perturbation parameter $\sigma$ allows to—similarly to the perturbation parameter used in the state-perturbation—adjust and calibrate the magnitude of perturbation. If not stated otherwise, we used $\sigma =1$. The perturbation of a single step is illustrated in Fig. 1B.

2.2 Choice of solvers

We used the perturbation methods described above to create probabilistic versions of the solvers listed in Table 1.

Table 1 Summary of the ODE solvers used in this paper

Full size table

The usage of fixed (f) and adaptive (a) step-sizes is indicated with subscripts, and the perturbation method is indicated using the superscripts—x for the state perturbation (Conrad et al., 2017) and t for the step-size perturbation (Abdulle & Garegnani, 2020)—meaning that e.g. FE_f^x is referring to a forward Euler method using fixed step-sizes and the state perturbation. For the exponential integrators, we chose to only use the step-size perturbation because it preserves the important property of these solvers that the activation and inactivation variables cannot leave the interval [0, 1], and also because there are no established methods for local error estimation for these methods.

The second order exponential integrator EEMP was implemented based on the version by Börgers and Nectow (2013) (Sect. 5.2), which is a modification of the midpoint method by Oh and French (2006). Computation of Runge–Kutta steps and step-size adaptation were based on the respective scipy implementations (Virtanen et al., 2020). To avoid computational overhead, we only computed the local error estimates when necessary, i.e. for adaptive step-sizes or the state perturbation.

2.3 Interpolation

The iterative solvers used in this study yield solutions for $\mathbf {x}(t)$ on either a fixed and equidistant grid of time points T or, in the case of adaptive step-size solvers, on a finite set of time points T automatically chosen by the solver. To interpolate these solutions for example for spike-time estimation (see Sect. 2.4), we used linear interpolation for FE, EE and EEMP between solutions of single steps. To interpolate the steps of the Runge–Kutta methods we utilized the “dense output” implemented in the respective scipy methods (Virtanen et al., 2020). These “dense outputs” allow evaluating the solution between two steps $\mathbf {x}(t_i)$ and $\mathbf {x}(t_{i+1})$ for any t with $t_i \le t \le t_{i+1}$ without any additional ODE evaluation. To not discard the effect of the state perturbation during interpolation, we defined the dense output $\hat{d}_\text {RK}(t, t_i, t_{i+1})$ for a state perturbed Runge–Kutta step from time $t_i$ to $t_{i+1}$ as:

$$\begin{aligned} \begin{aligned}&\hat{d}_\text {RK}(t, t_i, t_{i+1}) = d_\text {RK}(t, t_i, t_{i+1}) + \frac{t-t_i}{t_{i+1} - t_i} \varvec{\xi }_{t_i}, \end{aligned} \end{aligned}$$

(7)

where $d_\text {RK}(t, t_i, t_{i+1})$ is the dense output of the respective deterministic Runge–Kutta step and $\varvec{\xi }_{t_i}$ is the perturbation noise that was added to this step to compute $\mathbf {x}(t_{i+1})$ (see Eq. (3)). This is a simplified version of the continuous-time output proposed by Conrad et al. (2017).

2.4 Spike-time estimation

To determine spike-times based on simulated voltage traces v(t), we interpolated the ODE solutions for all steps where v(t) started from below and ended above a certain threshold voltage $v_\text {th}$. For lineally interpolated solutions (Sect. 2.3) we computed spike-times as follows. For every step from a time $t_i$ to $t_{i+1}$ with $v(t_i) < v_\text {th} \le v(t_{i+1})$ we estimated the respective spike-time $t_\text {spike}$ as:

$$t_\text{spike} = {t_{i} + ( t_{i+1}-t_{i})} {\frac{v_\text{th} - v(t_i)}{v(t_{i+1}) - v(t_i)}}$$

(8)

To estimate spike-times for Runge–Kutta methods with “dense-outputs”, we utilized scipy’s “brentq” root finding algorithm to determine the time point $t_\text {spike}$ when the threshold is reached, i.e. $|v(t_\text {spike}) - v_\text {th}| < \epsilon$, with $\epsilon =1e-12$.

2.5 Common ODE models in computational neuroscience

In this study, we use probabilistic ODE solvers to analyze the effect of numerical uncertainty in the following neuroscience models:

The Izhikevich neuron model with a range of dynamics,
the Hodgkin–Huxley neuron model,
and a small network of Hodgkin–Huxley neurons.

We picked these models to cover both single neuron models and models of neuronal networks.

2.5.1 Single Izhikevich neurons

The Izhikevich neuron (IN) model is a simplified non-linear single neuron model that has been used e.g. to build large-scale models of the brain (Izhikevich & Edelman, 2008) and to understand oscillatory phenomena in the cortex (Izhikevich, 2003; Domhof & Tiesinga, 2021) and the olfactory bulb (Galán et al., 2006). An attractive property of the IN is that a whole range of different response dynamics can be simulated (Fig. S1) depending on the setting of the parameters $\varvec{\theta } = [a, b, c, d]$ (Izhikevich, 2004). The IN is described by the following pair of ODEs (Izhikevich, 2003):

$$\begin{aligned} \begin{aligned} \dot{v}(t, v, u)&= 0.04 \cdot v^2 + 5 \cdot v - u + I_{\text {Stim}}(t), \\ \dot{u}(t, v, u)&= a (b \cdot v -u), \end{aligned} \end{aligned}$$

(9)

where v is the membrane potential, u is a recovery variable and $I_{\text {Stim}}$ is a given input current. Whenever the spike threshold, a spike is triggered and the neuron is reset in the next time step of the simulation:

$$\begin{aligned} \begin{aligned} v(t+\Delta t_\text {Sp})&= c, \\ u(t+\Delta t_\text {Sp})&= u(t)+d, \end{aligned} \end{aligned}$$

(10)

where $\Delta t_\text {Sp} \ge 0$. Following the original implementation, we set the threshold to 30. Typically, $\Delta t_\text {Sp} = \Delta t$ is used, but to facilitate the comparison between different step-sizes we used $\Delta t_\text {Sp} = 0$ instead. In the original implementation, the reset can only be triggered after a full integration step. So, whenever $v(t_{i+1}) \ge 30$ after a step from $t_{i}$ and $t_{i+1}$, the neuron is reset as described above, i.e. $v(t_{i+1}+\Delta t_\text {Sp}) = c$ and $u(t_{i+1}+\Delta t_\text {Sp}) = u(t_{i+1})+d$. This is problematic, because it introduces an error of order $O(\Delta t)$ (Stewart & Bair, 2009), independent of the solver scheme.

Therefore, in addition to this discrete version of resetting the neuron, we implemented a continuous version based on two complementary strategies. Fist, we adapted Eq. (9) such that whenever $\dot{v}$ and $\dot{u}$ would have been evaluated for $v(t) \ge 30$—which can only happen for multi-stage methods—the derivatives were evaluated for $v(t)=30$ instead. Second, we implemented the strategy suggested by Stewart and Bair (2009): Every step resulting in a reset is split into two intermediate steps, a step until the threshold is reached, and a step after the reset. For this, the spike-time $t_\text {spike}$ during such as step was estimated as described in Sect. 2.4 with a threshold of $v_\text {th} = 30$. Then, the pre-reset step solution $\mathbf {x}(t_\text {spike})$ was approximated based on the interpolation strategies described in Sect. 2.3. And finally, the post-reset step solution $\mathbf {x}(t_{i+1})$ was computed by resetting (see Eq. (10)) and integrating $\mathbf {x}$ from $t_\text {spike}$ to $t_{i+1}$.

2.5.2 Single Hodgkin–Huxley neurons

Hodgkin–Huxley (HH) models (Hodgkin & Huxley, 1952) are widely used to simulate single and multi-compartment neurons. We study both the classical HH neuron (Hodgkin & Huxley, 1952) and a single compartment HH-like neuron model (Prinz et al., 2003) prominently used to study the stomatogastric ganglion (STG) (Prinz et al., 2004). Both models are described by ODEs that include, among other state variables, the membrane potential v(t):

$$\begin{aligned} \dot{v}(t) = \left( I_{\text {Stim}}(t) - \textstyle {\sum _{i}} I_i(\mathbf {x}) \right) / C, \end{aligned}$$

(11)

where C is the membrane capacitance, $I_{\text {Stim}}$ is the stimulation current and $I_i$ are membrane currents. These membrane currents are described by the following equation:

$$\begin{aligned} I_i(\mathbf {x}) = \bar{g}_i \cdot m_i(\mathbf {x})^{p_i} \cdot h_i(\mathbf {x}) \cdot (v - E_i), \end{aligned}$$

(12)

where $E_i$ is the reversal potential of the current, $\bar{g}_i$ is the maximum channel conductance, $p_i$ are integer exponents, and $m_i$ and $h_i$ are activation and inactivation functions. $m_i$ and $h_i$ were modeled by the following differential equations:

$$\begin{aligned} \begin{aligned} \dot{m}(v)&= \left( m_\infty (v) - m \right) /\tau _m(v), \\ \dot{h}(v)&= \left( h_\infty (v) - h \right) /\tau _h(v), \end{aligned} \end{aligned}$$

(13)

where $m_\infty$, $\tau _m$, $h_\infty$, and $\tau _h$ are voltage dependent functions defining the channel’s kinetics. For non-inactivating channels, $h_i$ is removed from Eq. (12). In the classical HH model, this amounts to a 4-dimensional ODE (Ermentrout & Terman, 2010). For the STG neuron, which has eight instead of two membrane currents and also implements a model for the intracellular calcium concentration, the ODE is 13-dimensional (Prinz et al., 2003). The respective parametrizations can be found in Sect. 5.3.

We simulated the HH neuron’s response to two different input currents $I_{\text {Stim}}$, a step and a noisy step stimulus. Both stimuli were 200 ms long, with $I_{\text {Stim}}(t) = 0$ for $t < t_\text {onset}$ and $t \ge t_\text {offset}$, where $t_\text {onset} = 10 \text { ms}$ and $t_\text {offset} = 190 \text{ ms}$. The amplitude of the step stimulus for $t_\text {onset} \ge t < t_\text {offset}$ was $I_{\text {Stim}}(t) = 0.2 \text { mA}$. The amplitude of the noisy step stimulus were created by drawing 100 values from a uniform distribution between 0.0 mA and 0.4 mA that were spaced equidistantly between $t_\text {onset}$ and $t_\text {offset}$. These points were interpolated using a cubic spline with endpoints at $t_\text {onset}$ and $t_\text {offset}$. At the endpoints both $I_{\text {Stim}}$ and its derivative were set to zero. The single STG neuron was simulated for 3 s using a step stimulus starting at $t_\text {onset} = 0.9\text { s}$ with an amplitude of $I_{\text {Stim}}(t) = 3 \text { nA}$.

2.5.3 STG model

The STG neuron model described above was used by Prinz et al. (2004) in a network of three synaptically coupled neurons, ABPD, LP and PY, to study their firing patterns in dependence of the synaptic and neuronal parametrizations. In the model, there are seven synapses connecting the neurons, that are either modeled as slow or fast synapses. The postsynaptic input current $I_i$ to a neuron is described by:

$$\begin{aligned} I_i(\mathbf {x}) = \bar{g}_i \cdot s_i(\mathbf {x}) \cdot (v - E_i), \end{aligned}$$

(14)

where, similarly to Eq. (12), $E_i$ is the reversal potential of the current, $\bar{g}_i$ is the synapse’s maximum conductance, v is the membrane potential of the postsynaptic neuron and s is the activation function of the synapse. s is described by the following differential equation:

$$\begin{aligned} \begin{aligned} \dot{s}&= \left( \bar{s} - s \right) / \tau _s,\\ \bar{s}&= \left( 1+ \exp ((-35 \text { mV}-v_\text {pre})/5 \text { mV}) \right) ^{-1},\\ \tau _s&= (1 - \bar{s}) / f_s, \end{aligned} \end{aligned}$$

(15)

where $v_\text {pre}$ is the membrane potential of the presynaptic neuron and $\tau _s$ and $f_s$ are constants (see Sect. 5.3).

2.6 Quantifying numerical uncertainty

2.6.1 Reference solutions

None of the aforementioned neuron models has an analytical solution. It is therefore not possible to compare simulations to the true solutions of the respective IVPs. As a substitute, we computed reference solutions using a deterministic RKDP_a solver with a tolerance of $\kappa ={1}\mathrm {e}{-12}$ and a maximum step-size dependent on the model investigated (0.01 ms for IN and HH; 0.1 ms for the STG model). To obtain a reference solution at the same time points of a given fixed step-size solution $X = [\mathbf {x}(t_0), ..., \mathbf {x}(t_M)]$, we forced the reference solver to evaluate $\mathbf {x}(t)$ at least at all time points $T = [t_0, ..., t_M]$ of the given solution. For this, in every step in which the adaptive reference solver automatically picked a step-size that would skip any $t_i$ in T by taking a too large step-size ${\Delta t}_{i-1}$, the step-size ${\Delta t}_{i-1}$ was clipped such that the step was evaluated exactly at $\mathbf {x}(t_i)$. All solutions $\mathbf {x}(t)$ for t not in T were dropped before the comparison. To compare adaptive step-size solvers to reference solutions, we also forced these solvers to evaluate time points on a grid $T = [t_0, ..., t_M]$ with time points space equidistantly using a distance of 1 ms.

2.6.2 Distance metrics

To estimate the uncertainty for a given neuron model and solver, we computed multiple solutions (samples) with the same probabilistic solver to obtain a distribution of solutions. Based on these sample distributions and the respective reference solutions, we evaluated the distributions of sample-sample distances and sample-reference distances using two different distance measures. As a general measure, we computed Mean Absolute Errors (MAEs) between single traces. If not stated otherwise, MAEs were computed on the simulated membrane potentials v(t), because this is typically the quantity of interest. For two traces of equal size $\mathbf {a}=[a_0, ..., a_M]$ and $\mathbf {b}=[b_0, ..., b_M]$ the MAE was defined as:

$$\begin{aligned} \text {MAE} = \frac{1}{M} \mathord {\textstyle \sum }_{i=0}^{M} |a_i - b_i |. \end{aligned}$$

(16)

For n samples from a probabilistic solver, we computed the sample-sample distance distribution MAE_SMas the n MAEs between single samples and the mean trace of the other $n-1$ samples. Sample-reference distance distributions MAE_SRwere computed as the n MAEs between single samples and the reference solution. In some cases, we also computed the distance between the solution of a corresponding deterministic solver to the reference solution, abbreviated as MAE_DR.

As a second metric, we computed “SPIKE-distances” between the spike-times of different solutions (Kreuz et al., 2013). The SPIKE-distance is a bounded distance measure between zero and one that quantifies the dissimilarity between two (or more) spike-trains based on the distances between neighboring spikes. Here, we used an open-source python implementation (Mulansky & Kreuz, 2016).

For plotting, we also computed spike density functions (SDFs) of sample distributions as Gaussian kernel density estimates with a bandwidth optimized through grid-search and tenfold cross-validation using the Scikit-learn toolbox (Pedregosa et al., 2011).

3 Results

In this study, we explored the potential of probabilistic ODE solvers in computational neuroscience. First, we study the effect of numerical uncertainty on simulations of neuron models and qualitatively show that probabilistic solvers can reveal this uncertainty in a way that is easy to interpret. Second, we provide examples and guidelines where probabilistic solvers can be useful when conducting a new study. Third, we analyze potential drawbacks of probabilistic solvers, such as computational overhead.

3.1 Probabilistic solvers can reveal numerical uncertainty in neuron models

To demonstrate the effect of numerical uncertainty on simulations of single neuron models, we first simulated the classical HH neuron with the step stimulus (Fig. 2A). We computed solutions with a deterministic and probabilistic EE solver for a step-size of $\Delta t=0.25 \text { ms}$. Additionally, we computed a reference solution. We found that the exact spike-times of the deterministic EE solver differed substantially from the reference solution (spike-time difference $t^\text {det.}_\text {spike} - t^\text {ref.}_\text {spike}$ of the first three spikes: 0.6 ms, 2.3 ms, 4.0 ms). The probabilistic solver revealed this numerical uncertainty with spike-times varying substantially between samples (standard deviation (SD) of the spike-time $t_\text {spike}$ for the first three spikes over all $100$ samples: 0.2 ms, 0.9 ms, 1.1 ms).

Next we simulated single INs with different parametrizations $\theta _i$ and response dynamics (Izhikevich, 2004). Using the original step-sizes $\Delta t$ and input currents $I_i$, we compute solutions with the original solver scheme—which is related to a FE_f solver (Sect. 5.2)—a deterministic FE scheme and a probabilistic FE_f^t solver. We found, that for the “Inhibition-induced spiking” neuron all solvers produced similar spiking patterns in response to a negative current step (Fig. 2B). However, the original solver produced longer intervals between the spikes compared to the reference, resulting in only three instead of four spikes. The deterministic FE solution matched the reference better (e.g. both had four spikes), but the spike-times were still off by several milliseconds (spike-time difference $t^\text {det.}_\text {spike} - t^\text {ref.}_\text {spike}$ of the last two spikes: 8.2 ms, 3.9 ms). The probabilistic solver revealed this numerical uncertainty (SD of the spike-time $t_\text {spike}$ of the first three spikes: 3.0 ms, 5.6 ms, 7.1 ms).

Similarly, for the “Inhibition-induced bursting” neuron the solution from the original solver and the deterministic FE solver were qualitatively broadly consistent with the reference solution (Fig. 2C). In all simulations, the neuron responded with spike bursts to a negative stimulus current step. The spike-times and the number of spikes of the original solution ($n_\text {spikes} = 11$) and the deterministic FE solution ($n_\text {spikes} = 14$) differed substantially from the reference ($n_\text {spikes} = 33$) though, with the FE solution having only two bursts instead of three during the simulated period. Here, the probabilistic solver revealed the substantial uncertainty in the spike-times and number of spikes ($\overline{n}_\text {spikes}=13.8$ (SD 2.5), where $\overline{n}$ denotes the sample mean. See also histogram in Fig. 2C), with around $33\%$ of the samples having a third burst (Fig. 2C, bottom). All 16 simulated parametrizations are shown in Fig. S1.

To provide an example of a neuronal network, we simulated the STG model for two parametrizations (Fig. 2D and E) that only differ in their synaptic conductances (see Sect. 2.5.3). We computed solutions with a reference solver, a deterministic and a probabilistic EE solver. We focused the analysis on the LP neuron for simplicity. For the first parametrization (Fig. 2D), the LP neuron showed continuous spiking in all simulations. Similar to the HH neuron, we found differences in the exact spike-times and number of spikes between the reference ($n_\text {spikes}=17$) and the deterministic EE solution ($n_\text {spikes}=13$). The uncertainty was again revealed by the probabilistic solver ($\overline{n}_\text {spikes} = 14.6$ (SD 1.3), Fig. 2D). The second parametrization resulted in a different spiking behavior of the LP neuron (Fig. 2E). Here, the neuron started to fire at a high frequency for a prolonged time after approximately two seconds. In the reference solution, the neuron continued to fire. In contrast, in the deterministic solution, the neuron stopped after about another two seconds to then start another burst shortly later. While this also happened in all generated samples from the probabilistic solvers, the sample distribution still indicated a high uncertainty about the duration of the firing periods (Fig. 2E). Simulations of all five synaptic parametrizations from the original paper (Prinz et al., 2004) are shown in Fig. S2.

Finally, we turned to a single STG neuron and stimulated the response to a step stimulus (Fig. 3) based on the original publication (Prinz et al., 2003). Here, we compared the numerical uncertainty in two different state variables, namely the voltage v(t) (Fig. 3A) and the intracellular calcium $\text {Ca}(t)$ (Fig. 3B). We found that the numerical uncertainty differed strongly between these state variables, and was much higher for v(t) (Fig. 3). While this is expected because of the transient and brief nature of spikes in contrast to the slower changing calcium, it highlights the power of probabilistic ODE solvers, as they can guide the choice of the solver and step-size parameter dependent on the quantity of interest and the desired accuracy without requiring detailed knowledge about the model and its kinetics.

All the examples in Fig. 2 used first order methods. To also provide an example where higher order solvers with low tolerances yield solutions qualitatively different from the reference solution, we simulated the classical HH neuron’s response for 50 ms to a step stimulus with an amplitude of $0.022406\text { mA}$ and $t_\text {onset}=10\text { ms}$ and $t_\text {offset}=40\text { ms}$. This amplitude did not evoke a single spike in the reference solver (Fig. 4A), but was very close to the threshold, i.e. slightly larger amplitudes (e.g. $0.022410\text { mA}$) did produce a spike for the reference solver. When simulating this model with a RKDP_a solver, we found that for tolerances of $\kappa ={1}\mathrm {e}{-3}$ and $\kappa ={1}\mathrm {e}{-5}$ the solutions did contain a spike (Fig. 4A and B). Only a tolerance as small as $\kappa ={1}\mathrm {e}{-7}$ yielded a solution with no spike for this solver (Fig. 4C). Simulating the model with probabilistic solvers revealed this numerical uncertainty for both $\kappa ={1}\mathrm {e}{-3}$ and $\kappa ={1}\mathrm {e}{-5}$, with a fraction of samples containing one and a fraction containing zero spikes in both cases (Fig. 4D).

3.2 Probabilistic solvers can guide solver selection

To demonstrate how probabilistic ODE solvers can be used to compare the accuracy vs. run time tradeoff between different solver schemes, we simulated the HH neuron’s response to the noisy step stimulus (Fig. 5A) using the following probabilistic solvers: EE_f^t, EEMP_f^t, FE_f^t and RKBS_a^x. To this end, we computed sample-reference Mean Absolute Errors ($\text {MAE}_\text {SR}$) and sample-reference SPIKE-distances (see Sect. 2.6.2) for each solver as an estimate of the numerical error induced. We compared these errors to the number of ODE evaluations a corresponding deterministic solver would need. We found that the exponential integrators EE and EEMP allowed computing solutions with the fewest ODE evaluations, as they terminated successfully even for the relatively large step-size $\Delta t=0.5\text { ms}$ (Fig. 5B). In contrast, when using the FE solver, all step-sizes $\Delta t \gg {0.05}\text { ms}$ resulted in floating-point overflow errors and therefore in both useless and incomplete solutions. However, when choosing a sufficiently small step-size of $\Delta t \le {0.05}\text { ms}$ the samples obtained with the FE method had on average a smaller error compared to the EE method (Fig. 5B and E). From the methods tested, the adaptive RKBS method was the most efficient one, i.e. it produced the most accurate solutions for the fewest number of ODE solutions, but it also required a substantially higher number of minimum ODE evaluations to successfully terminate compared to the exponential integrators (Fig. 5B and E).

In principle, a very similar analysis could also have been done with deterministic solvers. However, probabilistic solvers have two advantages. First, they yield sample distributions instead of single solutions which make it possible to compute confidence intervals etc. when comparing different solver outputs. Second, and more crucially, probabilistic solvers do not require a reference solution to estimate how numerical errors in a solution affect quantities of interest such as spike-times. For a sufficiently calibrated probabilistic solver, the sample distribution, i.e. the solver’s output, can be used to estimate the numerical error of the solver. In Fig. 5C and F we computed the sample-sample distances which are independent of the reference, for the same samples used in Fig. 5B and E. We found that the mean sample-reference distances were highly similar to the respective mean sample-sample distances for all solvers for both the Mean Absolute Error (Fig. 5B-D) and the SPIKE-distance (Fig. 5E-G). Therefore, the solver comparison described above could have also been based on sample-sample distance instead of the sample-reference distances (e.g. $\text {MAE}_\text {SM}$ instead of $\text {MAE}_\text {SR}$), and thus would not have required a reference solution.

3.3 Calibration of probabilistic solvers

The mean sample-sample distance (e.g. measured as $\overline{\text {MAE}}_\text {SM}$) is only then a good approximation to the mean sample-reference distance (e.g. measured as $\overline{\text {MAE}}_\text {SR}$), when the probabilistic solver is well calibrated. Ideally, the magnitude of the perturbation is large enough to capture the numerical uncertainty of the underlying numerical integration, but it is not too large to severely reduce the accuracy of the integration scheme. To quantify the calibration of different solvers, we therefore defined two metrics, the ratio $R_S=\overline{\text {MAE}}_\text {SM}/\overline{\text {MAE}}_\text {SR}$ and the ratio $R_D = \text {MAE}_\text {DR} / \overline{\text {MAE}}_\text {SR}$, where $\text {MAE}_\text {DR}$ is the distance between a corresponding deterministic solution and the reference. $R_S$ is close to zero if the perturbation is too small (i.e. the sample-sample distance is much smaller than the sample-reference distance) and close to one if the perturbation is sufficiently large to not underestimate the numerical uncertainty (i.e. the sample-sample distance can be used as an approximate measure of the sample-reference distance)^{Footnote 1}. However, $R_S$ is also close to one, when the perturbation is too large and the solver output is dominated by the perturbation rather than the underlying ODE. This is why we also consider $R_D$ to evaluate the calibration. $R_D$ is close to one if the perturbation is either too small to affect the model output (i.e. all samples are approximately equal to the deterministic solution) or if samples are on average approximately equally close to the reference than the deterministic solution. $R_D$ is close to zero, if the perturbation is too large and the perturbation severely reduces the solver accuracy^{Footnote 2}.

For a well calibrated solver, $R_S$ is close to one, such that the sample-reference distance $\overline{\text {MAE}}_\text {SR}$ can be estimated from sample-sample distance $\overline{\text {MAE}}_\text {SM}$ while $R_D$ is close to or larger than one, such that the perturbation does not decrease the solver accuracy.

The magnitude of the perturbation can be adjusted with the perturbation parameter $\sigma$ that we defined for both the state and step-size perturbation (see Sect. 2.1). To analyze how the parameter $\sigma$ affects the calibration of the perturbation and to test for which $\sigma$ the solvers are well calibrated, we simulated the classical HH neuron in response to the noisy and the step stimulus with probabilistic solvers for a range of perturbation parameters (Fig. 6). First, we used a probabilistic EE_f^t solver and computed MAE_SM, MAE_SRand MAE_DRand the ratios of the distribution means $R_S$ and $R_D$ for different $\sigma$ ranging from 0.0625 to 16 for the noisy stimulus (Fig. 6A-C). As expected, we found that with increasing $\sigma$, the mean sample-sample distance $\overline{\text {MAE}}_\text {SM}$ converged to the mean sample-reference distance $\overline{\text {MAE}}_\text {SR}$, and for sufficiently large $\sigma$ the mean sample-sample distance $\overline{\text {MAE}}_\text {SM}$ could therefore be used as an approximate measure of mean sample-reference distance $\overline{\text {MAE}}_\text {SR}$ (Fig. 6A and B). For example, for $\sigma =0.25$, the perturbation magnitude was too small and the solver was underestimating the numerical uncertainty: Here, the mean sample-sample distances was much smaller $\overline{\text {MAE}}_\text {SM}$ (0.33) than the mean sample-reference distance $\overline{\text {MAE}}_\text {SR}$ (4.35) (Fig. 6A) with all sample-reference distances distributed narrowly ($\text {MAE}_\text {SR}$ 10th to 90th percentiles: [4.14, 4.56]) around the deterministic-reference distance MAE_DR(4.36), indicating that all samples were very close to the deterministic solution, despite the numerical error. When using $\sigma \ge 4$, the mean sample-reference distance was higher than the deterministic-reference distance (Fig. 6A and C), indicating a loss of solver accuracy caused by the perturbation (e.g. $R_D = 0.51$ for $\sigma =8$). Here, the best calibration was achieved with $\sigma = 2$, with distributions of MAE_SMclose to MAE_SR($R_S$: 0.83; Fig. 6A and B) and with the mean sample accuracy close to the accuracy of the deterministic solution ($R_D$: 1.08; Fig. 6A and C).

To provide an overview of the calibration for different solvers settings, we defined a scalar measure for the “goodness of calibration” $R^c_S R^c_D$, where $R^c_D = \min (R_D, 1)$ and $R^c_S = 1 - |1 - R_S |$, which is ideally close to one. $R^c_S R^c_D$ is close to zero for either an underestimation of the numerical uncertainty ($R^c_S \approx 0$) or for a too strong perturbation that renders the solver output useless ($R^c_D \approx 0$). We used $R^c_S$ and $R^c_D$ instead of $R_S$ and $R_D$, because in some cases $R_S$ and $R_D$ took values larger than one (for example see Fig. S3) which would make their product more difficult to interpret. We simply clipped $R_D$, because here we only cared about the samples being at least as close to the reference solution as the deterministic solution. For $R_S$ on the other hand, values larger than one are not beneficial, as $R_S$ approximates how well the sample-reference distance can be estimated from the sample distribution alone. $R^c_S$ therefore measures the deviance of $R_S$ from one.

We computed $R^c_S R^c_D$ for different probabilistic solvers and step-sizes—including the EE_f^t solver with $\Delta t={0.025}\text { ms}$ used above—for the HH neuron stimulated with the step and noisy step stimulus (Fig. 6D). The respective values for $R_S$ and $R_D$ are shown in Fig. S3. We found that using a perturbation parameter of $\sigma =1$, which we used as default in other experiments, produced reasonably calibrated solutions overall. However, in most cases $\sigma =1$ was also not ideal. For example for EE_f^t, larger values (e.g. $\sigma =2$ or $\sigma =4$) resulted in better calibration, whereas for FE_f^t the calibration was improved using smaller values (e.g. $\sigma =0.5$ or $\sigma =0.25$). For the state perturbed Runge–Kutta methods RKBS and RKDP with fixed step-sizes, the best $\sigma$ were close to one, but it was both step-size and stimulus dependent if slightly larger or smaller values would result in better calibration. The same Runge–Kutta methods with adaptive step-sizes were well calibrated for a wide range of perturbation parameters $\sigma$, including very small ones (e.g. $\sigma =0.0625$), especially in the high tolerance case ($\kappa =1e-2$). This is likely because even small perturbations cause the solvers to take different step-sizes and therefore to evaluate the ODE at different time points. When using the state perturbation on a new neuron model, setting $\sigma =1$ may therefore be a good strategy to get a solver with reasonable, though often not ideal, calibration.

For the step-size perturbation on the other hand, setting $\sigma =1$ can also result in extremely poor calibration. To illustrate this, we simulated the Hodgkin–Huxley model with different units for the time t, meaning that we rescaled the values for t and $\dot {\mathbf {x}}$ equally. As expected, this rescaling did not affect the solutions of deterministic or state perturbed solvers. For the step-size perturbation method, however, the step-size $\Delta t$ is treated as a unit-less quantity during the perturbation step (Eq. (4)) which makes the method sensitive to such unit changes. We simulated the model in milliseconds, microseconds and seconds for a wide range of perturbation parameters $\sigma$ and computed the goodness of calibration $R^c_S R^c_D$ for an EE_f^t (Fig. 6E) and an EEMP_f^t solver (Fig. 6F). As expected, the perturbation parameter $\sigma$ resulting in the best calibration differed by orders of magnitude (Fig. 6E and F). This is, because here the best calibration did always result in the same ratio between the step-size $\Delta t$ and the standard deviation of the perturbation distribution $\mathcal {P}$. Obviously, the step-size perturbation could be made invariant to such unit changes, but this would not address the underlying problem: A priori it is not clear how $\sigma$, or more generally the distribution $\mathcal {P}$, should be set. Given that the time unit in which a model is simulated is a relatively arbitrary choice, the reasonable calibration achieved with $\sigma =1$ in examples above was at least partially a side effect of all models being simulated in milliseconds with comparable kinetic and is not a generally applicable rule.

Therefore, especially in the case of step-size perturbation, it is advisable to calibrate the perturbation before simulating a new model. In many practical applications, such as simulation-based parameter inference for neuron models (Oesterle et al., 2020), models are simulated repeatedly with slight modifications in their parametrization. Here, it would be impractical to recalibrate the solvers for every evaluation, but on the other hand, it would cause only little overhead to calibrate the perturbation once in the beginning. We therefore tested how transferable the calibration for one model parametrization is to a wider range of other parametrizations. First, we calibrated an EE_f^t solver by maximizing the goodness of calibration $R^c_S R^c_D$ through a grid-search over $\sigma$ (Fig. S4A) on a HH neuron model using the default parameters for the conductances (Sect. 5.3) and a low frequency noisy step stimulus (Fig. S4). Using the optimized perturbation parameter $\sigma$, we computed the goodness of calibration for 100 different neuron parameter sets (Fig. S4B) which resulted in a variety of spiking patterns (Fig. S4C and D). For the vast majority of parametrizations, the perturbation was well calibrated (Fig. S4E): For almost half the parametrizations (39 out of 100) $R^c_S R^c_D$ was even higher than for the original parametrization and in all cases it was larger than 0.3.

3.4 Computational overhead

Probabilistic solvers based on state or step-size perturbation increase the computational costs for two reasons. First, they are sampling based and require computing multiple solutions for a single IVP. While this process can be parallelized, it nevertheless comes with a computational overhead, especially if it conflicts with other computations using parallelized model evaluation, e.g. in simulation-based inference where the same model is evaluated for different model parameters (Cranmer et al., 2020; Gonçalves et al., 2020). Second, probabilistic solvers induce a computational overhead per solution computed relative to their deterministic counterparts. We analyzed both aspects in the following.

3.4.1 Required number of samples

To empirically determine the number of samples necessary to obtain a reliable measure of numerical uncertainty, we simulated the classical HH neuron with probabilistic solvers for the step and noisy step stimulus. To this end, we computed mean sample-sample distances $\overline{\text {MAE}}_\text {SM-n}$ for small numbers of samples n, and divided them by the mean sample-sample distances for a much larger number of samples ($300$) to obtain normalized sample-sample distances $\Psi _\text {MAE}(n) = \overline{\text {MAE}}_\text {SM-n} / \overline{\text {MAE}}_\text {SM-300}$. Similarly, we computed normalized sample-sample SPIKE distances $\Psi _\text {SpD}$ for the spike times. We bootstrapped distributions of $\Psi _\text {MAE}$ (Fig. 7A and B) and $\Psi _\text {SpD}$ (Fig. 7C and D) to see how likely a certain number of samples n would result in a good estimate of the true sample-sample distance. We found that two samples were often already sufficient to estimate the sample-sample MAE. For example, for the step stimulus and $n=2$, more than $80\%$ of the bootstrapped $\Psi _\text {MAE}$ were in [0.5, 2.0] with little difference between the solvers EE_f^t (Fig. 7A) and RKBS_a^x (Fig. 7B). Reliably estimating the sample-sample SPIKE-distance required more samples, especially for the noisy stimulus. Still, only 10 samples were sufficient to obtain a $\Psi _\text {SpD}$ in [0.5, 2.0] with high probability ($>80\%$) for both solvers and stimuli.

3.4.2 Overhead per sample

In addition to the computational overhead caused by the computation of multiple samples, probabilistic methods also come with a computational overhead per solution. For the state perturbation this overhead has three components. First, one needs to compute the local error estimator, which only causes overhead for fixed step-sizes since for adaptive methods the local error estimator needs to be computed anyway. The second potential source of overhead is that the “First Same As Last” property—i.e. that the last stage in one step can be used as the first stage of the next step, which is used in RKBS and RKDP—is not applicable. This is because the last stage is computed before the perturbation, and after the perturbation the evaluation of the ODE is not valid anymore. Lastly, the perturbation itself, which includes sampling form a Gaussian, needs to be computed.

In total, this overhead is relatively small for higher order methods optimized for step-size adaptation like RKBS, RKCK and RKDP. For example, the state perturbation for RKDP_a increases the number of ODE evaluations per step from six to seven ($+16\%$) due to the loss of the First Same As Last property, and for RKCK_a—which does not make use of this property—no additional ODE evaluation is required. However, for first order methods like FE this overhead severely reduces the computational efficiency because instead of a single ODE evaluation per step, a state perturbed version needs two ($+100\%$). Additionally, lower order methods typically require more steps in total compared to higher order methods, because they are typically used in combination with smaller step-sizes. This increases the total computational costs of the perturbation itself, which is done once per step. For the step-size perturbation, the overhead is reduced to the perturbation and, for adaptive step-size methods, the loss of the First Same As Last property.

To quantify this overhead empirically, we simulated the HH neuron with different probabilistic solvers and their deterministic counterparts and compared the run times relative to each other. As expected, for the state perturbation, the computational overhead was larger for the lower order methods (Fig. 7E; on average for FE_f^x: $+113\%$, RKBS_f^x: $+50\%$, RKCK_f^x: $+6\%$, RKDP_f^x: $+23\%$). The adaptive methods—where the local error estimates were computed not only for the probabilistic, but also for the deterministic methods—showed the smallest increase in run times ($+14\%$ on average across all adaptive methods), with RKCK_a^x, not using the First Same As Last property, having the least overhead ($+5\%$). For the step-size perturbation, the increase in run times was on average smaller ($+16\%$ on average across all methods) and without large differences between the solver schemes and the usage of adaptive or fixed step-sizes.

3.5 Errors beyond numerical integration

Finally, we turned back to the “DAP” IN model (Fig. S1), to illustrate numerical uncertainties that cannot be fully captured by the perturbation methods because they are not caused by the numerical approximation of the integral in Eq. (2). For this neuron model we had found a large difference in the number of spikes for the fixed step-size methods, like the original solver, compared to the reference solution (Fig. 8A). While the reference solution had eight spikes during the simulated period, the original solution had only one and a deterministic FE solver had two. While the probabilistic solver FE_f^t arguably indicated some numerical uncertainty ($\overline{n}_\text {spikes}=2.3$ (SD 0.6)), the number of spikes was still much lower compared to the reference. To better understand the source of this numerical uncertainty, we simulated the “DAP” neuron model with different probabilistic solvers, FE_f^t, RKBS_f^x and RKDP_f^x.

First, we simulated the neuron for different fixed step-sizes. We found that all probabilistic solvers underestimated the true number of spikes when using relatively large fixed step-sizes (Fig. 6B). For the largest step-size tested, $\Delta t=0.5 \text { ms}$, only the FE_f^t solver indicated uncertainty in the number of spikes, whereas for RKBS_f^x and RKDP_f^x all samples had only a single spike. When using smaller step-sizes, the probabilistic solvers’ outputs were more indicative of the numerical uncertainty. For $\Delta t={0.02}\text { ms}$, all solvers produced outputs that were closer ($\overline{n}_\text {spikes} = 5.9$ for FE, $\overline{n}_\text {spikes} = 6.5$ for RKBS and $\overline{n}_\text {spikes} = 6.7$ for RKDP) to the reference solution and all methods indicated uncertainty in the number of spikes. With the very small step-size $\Delta t={0.002}\text { ms}$, all samples from all solvers showed the same number of spikes as the reference and the probabilistic solvers indicated no remaining uncertainty about the number of spikes here.

While these results may be unsatisfactory at first glance, they are not necessarily unexpected. The probabilistic solvers used here can only capture the uncertainty arising through the numerical integration; they cannot capture the error that is introduced by restricting spikes to occur only on a fixed time grid, which is the case for the fixed step-size solvers. We therefore simulated the neuron for the same solvers and step-sizes again, but allowed the solver to take intermediate steps (see Eq. (10)) every time a reset occurred. When using these “pseudo-fixed” step-sizes, we found that RKDP_f^x still did not indicate uncertainty in the number of spikes for any step-size tested, but now all samples had the same number of spikes as the reference (Fig. 8C). And while FE_f^t and RKBS_f^x still underestimated the number of spikes for larger step-sizes on average (e.g. for $\Delta t={0.5}\text { ms}$: $\overline{n}_\text {spikes}=3.5$ for FE and $\overline{n}_\text {spikes} = 6.3$ for RKBS), both indicated high numerical uncertainty (e.g. for $\Delta t={0.5}\text { ms}$: $q_{90}(n_\text {spikes})=6$ for FE and $q_{90}(n_\text {spikes}) = 8$ for RKBS, where $q_{90}$ is the 90th percentile).

4 Discussion

The outcome of neuron simulations is affected by numerical uncertainty arising from the inevitably finite step-sizes used in numerical ODE integration. With standard solvers there is no straightforward way to quantify how this uncertainty affects quantities of interest such as spike-times and the number of spikes.

In this study, we demonstrated how probabilistic solvers can be used to quantify and reveal numerical uncertainty in commonly used neuron models. Crucially, these solvers can be easily implemented and do not require a detailed understanding of the underlying kinetics of the neuron model of interest.

Further, we showed that numerical uncertainty can affect the precise timing and the number of spikes in simulations of neuron models commonly used in neuroscience. We also found that some models and parametrizations are more susceptible to numerical uncertainty than others, and that some solvers employed in the neuroscience literature yield rather large uncertainties. These findings highlight the need for a thorough quantification of numerical uncertainty in neuroscience simulations to strike an informed balance between simulation time and tolerated uncertainty.

The idea to quantify the accuracy or numerical errors of different solvers for mechanistic models in neuroscience is not new. For example, Butera and McCarthy (2004) showed that for small step-sizes, the forward Euler method produces more accurate solutions than the exponential Euler method, which is in agreement with our findings. Börgers and Nectow (2013) on the other hand argued that for Hodgkin–Huxley-like systems exponential integrators—such as exponential Euler and the exponential midpoint Euler—are often the best choice, as they allow for much larger step-sizes especially when high accuracy is not necessary, which is again what we observed. Stewart and Bair (2009) argued in favor of the Parker–Sochacki integration method and showed that it can be used to generate highly accurate solutions for both the Izhikevich and Hodgkin–Huxley model. However, this method has the disadvantage that the ODE system at hand has to be put into the proper form and therefore requires specific knowledge about the model and solver. In a more recent study, Chen et al. (2020) recommended to use splitting methods, such as second-order Strang splitting, instead of exponential integrators.

In contrast to these studies, probabilistic solvers offer a more general approach to tackle the problem of numerical uncertainty. Instead of finding the “best” solver for a specific problem, they produce an easy-to-interpret uncertainty measure that can be analyzed without specific knowledge about the solver or solved neuron model. This allows to easily assess if a solver is sufficiently accurate for a given research question. It can therefore facilitate both the choice of the solver and choice of solver settings such as the step-size.

In this study, we used two simple probabilistic solvers that build on deterministic solver and stochastically perturb the numerical integration. For both, the state (Conrad et al., 2017) and the step-size perturbation (Abdulle & Garegnani, 2020) method, it is crucial that the perturbation is of the right order to neither underestimate the numerical uncertainty nor to reduce the solver accuracy unnecessarily. To be able to adjust the perturbation we introduced a perturbation parameter for both the state and step-size perturbation. We found that using the default value for this parameter yielded good calibration for most of the models we simulated, but slight adjustments often improved the calibration further. However, especially for the step-size perturbation, the calibration can be strongly dependent on the model and a certain level of calibration by means of adapting the perturbation distribution to the problem is always recommended. Fortunately, calibration may often be transferable between models with similar kinetics, for example in simulation-based inference where models are repeatably evaluated for different model parametrizations.

A downside of the step-size perturbation is, that it requires a local error estimator to be calibrated. This can introduce a relatively large computational overhead per solution for lower order methods like FE. The step-size perturbation may therefore a better choice for lower order methods. However, for higher order methods like RKDP this difference vanishes and both approaches require an equally small computational overhead per solution. Another advantage of the step-size perturbation is that it preserves desirable properties of the underlying solver schemes (Abdulle & Garegnani, 2020). For example, when Hodgkin–Huxley-like models are solved with exponential integrators like EE or EEMP, the state variables of the activation and inactivation cannot leave their domain [0, 1] by design of the solvers, a property preserved by the step-size but not the state perturbation.

Beyond the two perturbation methods used and discussed in this study, there are other classes of probabilistic numerical algorithms applicable to the problem studied herein. For instance, Teymur et al. (2021) recently proposed a method that builds on Richardson’s deferred approach to the limit and replaces the deterministic interpolant with a stochastic process. Similar to the perturbation methods, this approach can be used to create probabilistic ODE solvers from established deterministic ones. Another class of probabilistic ODE solvers is constructed using techniques from (nonlinear) Gaussian filtering and smoothing (Schober et al., 2019; Kersting et al., 2019; Tronarp et al., 2019). These methods have the advantage that instead of repeatedly integrating the initial value problem, they only require a single forward integration and return local uncertainty estimates that are proportional to the local truncation error. The disadvantage of Gaussian ODE filters and smoothers is that the uncertainty estimates are Gaussian. This restriction can be lifted by replacing Gaussian filters and smoothers with particle filters and smoothers (Tronarp et al., 2019). These and other methods may further extend the applicability of probabilistic numerics in computational neuroscience. In particular for large neural network simulations, more efficient methods like the filtering approaches will be key in quantifying uncertainty. However, because these methods are not sampling based, their success for simulating neurons will depend on how well they can capture the numerical uncertainty arising from the all-or-none behavior of spikes.

Another aspect that will be crucial for the success of probabilistic numerics in computational neuroscience is to develop and test implicit probabilistic solvers for neuron models. For example, the ODEs of multi-compartment neuron models are typically stiff which makes implicit solvers the better choice for such models (Mascagni et al., 1989). A priori, it is often not easy to judge whether a ODE system is stiff or not. A noteworthy attempt to tackle this problem is the algorithm by Blundell et al. (2018) that automatically determines whether an implicit or an explicit solver should be used.

Code availability

The probabilistic solvers and models were implemented in Python and Cython. The code is publicly available at https://github.com/berenslab/neuroprobnum.

Change history

26 June 2023
A Correction to this paper has been published: https://doi.org/10.1007/s10827-023-00856-w

Notes

$R_S$ can take values larger than one, for example when the sample distribution is bimodal, the majority of samples is much closer to one of the modes (e.g. a fraction of samples has two and the larger fraction has three spike bursts as in Fig. 2C) and the reference solution is close to the larger mode.
In some cases, $R_D$ takes values larger than one, which means that the perturbation increases the solver accuracy on average (e.g. see Fig. 2C). This happens, for example, when the deterministic solution is missing a spike but almost reaches the model’s spike threshold, and the perturbation is strong enough to generate the missing spike in some samples.

References

Abdulle, A., & Garegnani, G. (2020). Random time step probabilistic methods for uncertainty quantification in chaotic and geometric numerical integration. Statistics and Computing.
Blundell, I., Plotnikov, D., Eppler, J. M., & Morrison, A. (2018). Automatically selecting a suitable integration scheme for systems of differential equations in neuron models. Frontiers in neuroinformatics, 12, 50.
Article PubMed PubMed Central Google Scholar
Bogacki, P., & Shampine, L. F. (1989). A 3 (2) pair of Runge-Kutta formulas. Applied Mathematics Letters, 2, 321–325.
Article Google Scholar
Borgers, C., & Nectow, A. R. (2013). Exponential time differencing for hodgkin-huxley-like odes. SIAM Journal on Scientific Computing, 35, B623–B643.
Article PubMed PubMed Central Google Scholar
Butera, R. J., & McCarthy, M. L. (2004). Analysis of real-time numerical integration methods applied to dynamic clamp experiments. Journal of Neural Engineering, 1, 187.
Article PubMed Google Scholar
Cash, J. R., & Karp, A. H. (1990). A variable order runge-kutta method for initial value problems with rapidly varying right-hand sides. ACM Transactions on Mathematical Software (TOMS), 16, 201–222.
Article Google Scholar
Chen, Z., Raman, B., & Stern, A. (2020). Structure-preserving numerical integrators for hodgkin-huxley-type systems. SIAM Journal on Scientific Computing, 42, B273–B298.
Article Google Scholar
Chkrebtii, O. A., Campbell, D. A., Calderhead, B., & Girolami, M. A. (2016). Bayesian solution uncertainty quantification for differential equations. Bayesian Analysis, 11, 1239–1267.
Article Google Scholar
Cockayne, J., Oates, C. J., Sullivan, T. J., & Girolami, M. (2019). Bayesian probabilistic numerical methods. SIAM Review, 64, 756–789.
Article Google Scholar
Conrad, P. R., Girolami, M., Särkkä, S., Stuart, A., & Zygalakis, K. (2017). Statistical analysis of differential equations: introducing probability measures on numerical solutions. Statistics and Computing, 27, 1065–1082.
Article PubMed Google Scholar
Cranmer, K., Brehmer, J., & Louppe, G. (2020). The frontier of simulation-based inference. Proceedings of the National Academy of Sciences, 117, 30055–30062.
Article CAS Google Scholar
Dayan, P., & Abbott, L. F. (2001). Theoretical neuroscience: computational and mathematical modeling of neural systems.
Domhof, J. W., & Tiesinga, P. H. (2021). Flexible frequency switching in adult mouse visual cortex is mediated by competition between parvalbumin and somatostatin expressing interneurons. Neural Computation, 33, 926–966.
Article PubMed Google Scholar
Dormand, J. R., & Prince, P. J. (1980). A family of embedded Runge-Kutta formulae. Journal of Computational and Applied Mathematics, 6, 19–26.
Article Google Scholar
Ermentrout, G. B., & Terman, D. H. (2010). The hodgkin–huxley equations. (pp. 1–28).
Galán, R. F., Fourcaud-Trocmé, N., Ermentrout, G. B., & Urban, N. N. (2006). Correlation-induced synchronization of oscillations in olfactory bulb neurons. Journal of Neuroscience, 26, 3646–3655.
Article PubMed Google Scholar
Gerstner, W., & Kistler, W. M. (2002). Spiking neuron models: Single neurons, populations, plasticity.
Gerwinn, S., Bethge, M., Macke, J. H., & Seeger, M. (2008). Bayesian inference for spiking neuron models with a sparsity prior. In Advances in Neural Information Processing Systems (pp. 529–536).
Gonçalves, P. J., Lueckmann, J.-M., Deistler, M., Nonnenmacher, M., Öcal, K., Bassetto, G., et al. (2020). Training deep neural density estimators to identify mechanistic models of neural dynamics. Elife, 9, e56261.
Hairer, E., Nørsett, S. P., & Wanner, G. (1993). Solving ordinary differential equations i – nonstiff problems.
Hennig, P., Osborne, M. A., & Girolami, M. (2015). Probabilistic numerics and uncertainty in computations. Proceedings of the Royal Society of London A: Mathematical, Physical and Engineering Sciences, 471.
Hodgkin, A. L., & Huxley, A. F. (1952). A quantitative description of membrane current and its application to conduction and excitation in nerve. The Journal of Physiology, 117, 500–544.
Article CAS PubMed PubMed Central Google Scholar
Izhikevich, E. M. (2003). Simple model of spiking neurons. IEEE Transactions on Neural Networks, 14, 1569–1572.
Article CAS PubMed Google Scholar
Izhikevich, E. M. (2004). Which model to use for cortical spiking neurons? IEEE Transactions on Neural Networks, 15, 1063–1070.
Article PubMed Google Scholar
Izhikevich, E. M. (2007). Dynamical systems in neuroscience.
Izhikevich, E. M., & Edelman, G. M. (2008). Large-scale model of mammalian thalamocortical systems. Proceedings of the National Academy of Sciences, 105, 3593–3598.
Article CAS Google Scholar
Kersting, H., Sullivan, T. J., & Hennig, P. (2020). Convergence rates of Gaussian ODE filters. Statistics and Computing, 30(6), 1791–1816.
Krämer, N., Bosch, N., Schmidt, J., & Hennig, P. (2022, June). Probabilistic ODE solutions in millions of dimensions. In International Conference on Machine Learning (pp. 11634–11649). PMLR.
Kreuz, T., Chicharro, D., Houghton, C., Andrzejak, R. G., & Mormann, F. (2013). Monitoring spike train synchrony. Journal of neurophysiology, 109, 1457–1472.
Article PubMed Google Scholar
Mascagni, M. V., Sherman, A. S. et al. (1989). Numerical methods for neuronal modeling. Methods in neuronal modeling, 2.
Mulansky, M., & Kreuz, T. (2016). Pyspike–a python library for analyzing spike train synchrony. SoftwareX, 5, 183–189.
Article Google Scholar
Oates, C. J., & Sullivan, T. J. (2019). A modern retrospective on probabilistic numerics. Statistics and Computing, 29, 1335–1351.
Article Google Scholar
Oesterle, J., Behrens, C., Schröder, C., Hermann, T., Euler, T., Franke, K., et al. (2020). Bayesian inference for biophysical neuron models enables stimulus optimization for retinal neuroprosthetics. Elife, 9, e54997.
Oh, J., & French, D. A. (2006). Error analysis of a specialized numerical method for mathematical models from neuroscience. Applied mathematics and computation, 172, 491–507.
Article Google Scholar
Papamakarios, G., Sterratt, D. C., & Murray, I. (2018). Sequential neural likelihood: Fast likelihood-free inference with autoregressive flows. arXiv:1805.07226.
Pedregosa, F., Varoquaux, G., Gramfort, A., Michel, V., Thirion, B., Grisel, O., et al. (2011). Scikit-learn: Machine learning in Python. Journal of Machine Learning Research, 12, 2825–2830.
Google Scholar
Pillow, J. W., Shlens, J., Paninski, L., Sher, A., Litke, A. M., Chichilnisky, E., & Simoncelli, E. P. (2008). Spatio-temporal correlations and visual signalling in a complete neuronal population. Nature, 454, 995–999.
Article CAS PubMed PubMed Central Google Scholar
Prinz, A. A., Billimoria, C. P., & Marder, E. (2003). Alternative to hand-tuning conductance-based models: construction and analysis of databases of model neurons. Journal of Neurophysiology, 90, 3998–4015.
Article PubMed Google Scholar
Prinz, A. A., Bucher, D., & Marder, E. (2004). Similar network activity from disparate circuit parameters. Nature Neuroscience, 7, 1345–1352.
Article CAS PubMed Google Scholar
Schober, M., Särkkä, S., & Hennig, P. (2019). A probabilistic model for the numerical solution of initial value problems. Statistics and Computing, 29, 99–122.
Article Google Scholar
Stewart, R. D., & Bair, W. (2009). Spiking neural network simulation: numerical integration with the parker-sochacki method. Journal of Computational Neuroscience, 27, 115–133.
Article PubMed PubMed Central Google Scholar
Teymur, O., Foley, C., Breen, P., Karvonen, T., & Oates, C. J. (2021). Black box probabilistic numerics. Advances in Neural Information Processing Systems, 34.
Teymur, O., Lie, H. C., Sullivan, T., & Calderhead, B. (2018). Implicit probabilistic integrators for odes. In Advances in Neural Information Processing Systems (pp. 7244–7253).
Teymur, O., Zygalakis, K., & Calderhead, B. (2016). Probabilistic linear multistep methods. (pp. 4314–4321).
Tronarp, F., Kersting, H., Särkkä, S., & Hennig, P. (2019). Probabilistic solutions to ordinary differential equations as nonlinear Bayesian filtering: a new perspective. Statistics and Computing, 29, 1297–1315.
Article Google Scholar
Virtanen, P., Gommers, R., Oliphant, T. E., Haberland, M., Reddy, T., Cournapeau, D., et al. (2020). SciPy 1.0: Fundamental Algorithms for Scientific Computing in Python. Nature Methods, 17, 261–272. https://doi.org/10.1038/s41592-019-0686-2
Article CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

This research was funded by the Deutsche Forschungsgemeinschaft through a Heisenberg Professorship (BE5601/8-1, PB), the Excellence Cluster 2064 “Machine Learning — New Perspectives for Science” (ref number 390727645, PB and PH), ADIMEM (01IS18052C and 01IS18052B to PB and PH) and the Tübingen AI Center (FKZ: 01IS18039A, PB and PH). NK and PH gratefully acknowledge financial support by the German Federal Ministry of Education and Research (BMBF) through Project ADIMEM (FKZ: 01IS18052B), as well as by the European Research Council through ERC StG Action 757275 / PANAMA, and funds from the Ministry of Science, Research and Arts of the State of Baden-Württemberg. The authors thank the International Max Planck Research School for Intelligent Systems (IMPRS-IS) for supporting Nicholas Krämer.

Funding

Open Access funding enabled and organized by Projekt DEAL.

Author information

Authors and Affiliations

Institute of Ophthalmic Research, University of Tübingen, Tübingen, Germany
Jonathan Oesterle & Philipp Berens
Department of Computer Science, University of Tübingen, Tübingen, Germany
Nicholas Krämer & Philipp Hennig
Max Planck Institute for Intelligent Systems, Tübingen, Germany
Philipp Hennig
Tübingen AI Center, University of Tübingen, Tübingen, Germany
Philipp Hennig & Philipp Berens

Authors

Jonathan Oesterle
View author publications
You can also search for this author in PubMed Google Scholar
Nicholas Krämer
View author publications
You can also search for this author in PubMed Google Scholar
Philipp Hennig
View author publications
You can also search for this author in PubMed Google Scholar
Philipp Berens
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Philipp Berens.

Ethics declarations

Conflict of interest

The authors have no competing interest to declare.

Additional information

Action Editor: David Golomb

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Below is the link to the electronic supplementary material.

Supplementary file1 (PDF 442 KB)

Supplementary file2 (PDF 188 KB)

Supplementary file3 (PDF 43 KB)

Supplementary file4 (PDF 402 KB)

Appendix

1.1 Local error estimation and step-size adaptation

To compute the local error estimator $\varvec{\varepsilon }_t$ for a single integration step, step solutions were computed with two different numerical methods that were run in parallel to provide two solutions $\mathbf {x}_a(t + \Delta t)$ and $\mathbf {x}_b(t + \Delta t)$ for every step, given t, $\Delta t$ and $\mathbf {x}(t)$ (see, for example, Dormand and Prince (1980)). The pairs of numerical methods used in this study are listed in Table 1 and described in more detail in Sect. 5.2. In every step, the local error estimator was computed as:

$$\begin{aligned} \varvec{\varepsilon }_t = |\mathbf {x}_a(t + \Delta t) - \mathbf {x}_b(t + \Delta t)|. \end{aligned}$$

(17)

For adaptive step-size methods, the error estimator $\varvec{\varepsilon }_t = [\varepsilon _t^1, ..., \varepsilon _t^d]^\top$ was used to compute an error norm $||\mathbf {e}||$ on $\mathbf {e} = [e_1, ..., e_d]^\top$, where d was the dimension of the state vector $\mathbf {x}(t) = [x_1(t), ..., x_d(t)]^\top$. For every state variable $x_i$, $e_i$ was computed as:

$$\begin{aligned} e_i = \frac{\varepsilon _t^i}{\kappa _a + \kappa _r \cdot \max (|x_i(t)|, |x_i(t+\Delta t)|) }, \end{aligned}$$

(18)

with $\kappa _a$ and $\kappa _r$ being the absolute and relative tolerance. For simplicity, we used $\kappa _a = \kappa _r$ in all simulations and therefore refer to these parameters as the tolerance $\kappa$.

$||\mathbf {e}||$ was computed as the root-mean-square of $\mathbf {e}$, i.e.:

$$\begin{aligned} ||\mathbf {e}|| = \sqrt{\frac{1}{d} \sum _i e_i^2}. \end{aligned}$$

(19)

If $||\mathbf {e}|| < 1$, the step was accepted, and rejected otherwise. In both cases, the step-size was adapted and the next step-size $\Delta t_\text {next}$ was computed as:

$$\begin{aligned} \Delta t_\text {next} = 0.9 \cdot \Delta t \cdot \min (\max (||\mathbf {e}||^{-1/k_\text {exp}}, k_\text {min}), k_\text {max}), \end{aligned}$$

(20)

where $k_\text {min}$ and $k_\text {max}$ are the minimum and maximum allowed change factors, that we set to typical values of 0.1 and 5 respectively (Hairer et al., 1993). $k_\text {exp}$ was 3 for RKBS, 4 for RKCK, and 5 for RKDP, corresponding to the order of the method. Furthermore, we limited the step-sizes to be always smaller or equal to a maximum step-size $\Delta t_\text {max}$, which we set to $\Delta t_\text {max} = {1}\text { ms}$ for all simulations.

1.2 Solver details

Runge–Kutta steps were implemented based on the scipy implementation (Virtanen et al., 2020). The Butcher tableau for the RKCK method was taken from (Cash & Karp, 1990). Heun’s method was used as an error estimator for the FE method and implemented as follows. Given t, $\Delta t$, $\mathbf {x}(t)$, $f(t, \mathbf {x}(t))$ and the deterministic FE solution $\mathbf {x}_\text {det.}^\text {FE}(t + \Delta t)$ the solution for Heun’s method was computed as:

$$\begin{aligned} \mathbf {x}(t + \Delta ) = \mathbf {x}(t) + \Delta t \, \left[ f(t, \mathbf {x}(t)) + f(t, \mathbf {x}_\text {det.}^\text {FE}(t + \Delta t)) \right] / 2 \end{aligned}$$

(21)

To use the exponential integrators EE and EEMP, the ODEs were cast into the following form:

$$\begin{aligned} \dot{z}(t, \mathbf {x}(t)) = \left[ z_\infty (\mathbf {x}) - z \right] / z_\tau (\mathbf {x}), \end{aligned}$$

(22)

where z is a state variable of $\mathbf {x}$ (e.g. the membrane potential) and $z_\infty (\mathbf {x})$ and $z_\tau (\mathbf {x})$ are functions depending on $\mathbf {x}$ but not explicitly on t. For a derivation of these functions for the HH model see for example Dayan and Abbott (2001). The EE step was implemented as:

$$\begin{aligned} z(t+\Delta t) = z_\infty (\mathbf {x}) + [z(t) + z_\infty (\mathbf {x})] \exp (-\Delta t / z_\tau (\mathbf {x})). \end{aligned}$$

(23)

The second order exponential integrator EEMP proposed by Börgers and Nectow (2013) builds on the EE method. Given t, $\Delta t$, $\mathbf {x}(t)$, the half-step EE solution $\tilde{\mathbf {x}} = \mathbf {x}_\text {det.}^\text {EE}(t + \Delta t / 2)$ and the evaluations of $z_\infty (\tilde{\mathbf {x}})$ and $z_\tau (\tilde{\mathbf {x}})$ at the half-step, the solution for z using the EEMP method was computed as:

$$\begin{aligned} z(t+\Delta t) = z_\infty (\tilde{\mathbf {x}}) + [z(t) + z_\infty (\tilde{\mathbf {x}})] \exp (-\Delta t / z_\tau (\tilde{\mathbf {x}})). \end{aligned}$$

(24)

We also implemented the solver used in the original implementation of the IN neurons, where the IVP was solved with a method similar to FE of fixed step-size $\Delta t$ (Izhikevich, 2004). The implementation differs from a standard FE scheme in so far, as v and u are updated subsequently:

$$\begin{aligned} \begin{aligned} v(t+\Delta t)&= v(t) + \Delta t \cdot \dot{v}(t,u(t),v(t)), \\ u(t+\Delta t)&= u(t) + \Delta t \cdot \dot{u}(t,u(t),v(t+\Delta t)). \\ \end{aligned} \end{aligned}$$

(25)

1.3 Neuron model parameters

The neuron models simulated in this study were parametrized as follows. The parameters $\varvec{\theta } = [a, b, c, d]$ and $I_\text {Stim}$ of the IN model and the respective original step-sizes $\Delta t$ were taken from https://www.izhikevich.org/publications/figure1.m (Izhikevich, 2004).

The three maximum conductances for the classical HH neuron (see Eq. (12)) were set to $\bar{g}_\text {Na} = 1.2\text { mS}$, $\bar{g}_\text {K}= 0.36 \text { mS}$ and $\bar{g}_\text {leak} = {0.003}\text { mS}$. The membrane capacitance was set to $C={0.01}\text { }\mu \text {F}$ (see Eq. (11)). For all STG neurons, we set the membrane area to $A={0.628e-3}\text { cm}^2$ and the membrane capacitance to $C = A \cdot {1}\mu \text {F}/\text {cm}^2$ (see Eq. (11)). An STG neuron has eight maximum channel conductances (see Eq. (12)):

$$\begin{aligned} \begin{aligned} \varvec{\theta }_\text {STG-neuron} = \\ [\bar{g}_\text {Na}, \bar{g}_\text {CaT}, \bar{g}_\text {CaS}, \bar{g}_\text {A}, \bar{g}_\text {KCa}, \bar{g}_\text {Kd}, \bar{g}_\text {H}, \bar{g}_\text {leak}]. \end{aligned} \end{aligned}$$

(26)

For the single STG neuron, we set $\varvec{\theta }_\text {STG-neuron} = A \cdot [400, 2.5, 10, 50, 20, 0, 0.04, 0] \text { mS} / \text {cm}^2$, taken from an example in Prinz et al. (2003). The STG neuronal network consists of three neuron models ABPD, LP and PY. The network is parametrized by the three neurons’ conductances:

$$\begin{aligned} \begin{aligned} \varvec{\theta }_\text {ABPD}&= A \cdot [100, 2.5, 6, 50, 5, 100, 0.01, 0.0 ]\, \text { mS} / \text {cm}^2, \\ \varvec{\theta }_\text {LP}&= A \cdot [100, 0.0, 4, 20, 0, 25, 0.05, 0.03]\, \text { mS} / \text {cm}^2, \\ \varvec{\theta }_\text {PY}&= A \cdot [100, 2.5, 2, 50, 0, 125, 0.05, 0.01]\, \text { mS} / \text {cm}^2, \\ \end{aligned} \end{aligned}$$

(27)

where $A={0.628e-3}\text { cm}^2$ and the synaptic conductances $\varvec{\theta }_\text {syn}$:

$$\begin{aligned} \begin{aligned} \varvec{\theta }_\text {syn} = [&\bar{g}^\text {fast}_\text {ABPD-LP}, \bar{g}^\text {slow}_\text {ABPD-LP}, \bar{g}^\text {fast}_\text {ABPD-PY}, \bar{g}^\text {slow}_\text {ABPD-PY}, \\&\bar{g}^\text {fast}_\text {ABPD-LP}, \bar{g}^\text {fast}_\text {LP-ABPD}, \bar{g}^\text {fast}_\text {LP-PY}, \bar{g}^\text {fast}_\text {PY-LP}], \end{aligned} \end{aligned}$$

(28)

where for example $\bar{g}^\text {fast}_\text {ABPD-LP}$ is the maximum conductance of the fast synapse connecting neuron ABPD (presynaptic) to neuron LP (postsynaptic). We simulated the network for five different synaptic parametrizations taken from the original publication (Prinz et al., 2004):

$$\begin{aligned} \begin{aligned} \varvec{\theta }_\text {syn}^\text {a}&= [10, 100, 10, 3, 30, 1, 3]\, \text {nS},\\ \varvec{\theta }_\text {syn}^\text {b}&= [3, 0, 0, 30, 3, 3, 0]\,\text {nS},\\ \varvec{\theta }_\text {syn}^\text {c}&= [100, 0, 30, 1, 0, 3, 0]\,\text {nS},\\ \varvec{\theta }_\text {syn}^\text {d}&= [3, 100, 10, 1, 10, 3, 10]\,\text {nS},\\ \varvec{\theta }_\text {syn}^\text {e}&= [30, 30, 10, 3, 30, 1, 30]\,\text {nS}.\\ \end{aligned} \end{aligned}$$

(29)

The frequencies $f_s$ (Eq. (15)) for the fast and slow synapses were 25 Hz and 10 Hz, and the reversal potentials $E_i$ (Eq. (14)) were -70 mV and -80 mV, respectively (Prinz et al., 2004).

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Oesterle, J., Krämer, N., Hennig, P. et al. Probabilistic solvers enable a straight-forward exploration of numerical uncertainty in neuroscience models. J Comput Neurosci 50, 485–503 (2022). https://doi.org/10.1007/s10827-022-00827-7

Download citation

Received: 15 December 2021
Revised: 14 May 2022
Accepted: 02 July 2022
Published: 06 August 2022
Issue Date: November 2022
DOI: https://doi.org/10.1007/s10827-022-00827-7

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Probabilistic solvers enable a straight-forward exploration of numerical uncertainty in neuroscience models

Abstract

Similar content being viewed by others

Stochastic Ion Channel Gating and Probabilistic Computation in Dendritic Neurons

Monosynaptic inference via finely-timed spikes

Modeling and characterizing stochastic neurons based on in vitro voltage-dependent spike probability functions

1 Introduction

2 Methods and models

2.1 Probabilistic solvers

2.2 Choice of solvers

2.3 Interpolation

2.4 Spike-time estimation

2.5 Common ODE models in computational neuroscience

2.5.1 Single Izhikevich neurons

2.5.2 Single Hodgkin–Huxley neurons

2.5.3 STG model

2.6 Quantifying numerical uncertainty

2.6.1 Reference solutions

2.6.2 Distance metrics

3 Results

3.1 Probabilistic solvers can reveal numerical uncertainty in neuron models

3.2 Probabilistic solvers can guide solver selection

3.3 Calibration of probabilistic solvers

3.4 Computational overhead

3.4.1 Required number of samples

3.4.2 Overhead per sample

3.5 Errors beyond numerical integration

4 Discussion

Code availability

Change history

26 June 2023

Notes

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher’s Note

Supplementary Information

Supplementary file1 (PDF 442 KB)

Supplementary file2 (PDF 188 KB)

Supplementary file3 (PDF 43 KB)

Supplementary file4 (PDF 402 KB)

Appendix

Appendix

1.1 Local error estimation and step-size adaptation

1.2 Solver details

1.3 Neuron model parameters

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation