Structure Preserving Polytopal Discontinuous Galerkin Methods for the Numerical Modeling of Neurodegenerative Diseases

Corti, Mattia; Bonizzoni, Francesca; Antonietti, Paola F.

doi:10.1007/s10915-024-02581-7

Structure Preserving Polytopal Discontinuous Galerkin Methods for the Numerical Modeling of Neurodegenerative Diseases

Open access
Published: 20 June 2024

Volume 100, article number 39, (2024)
Cite this article

Download PDF

You have full access to this open access article

Journal of Scientific Computing Aims and scope Submit manuscript

Structure Preserving Polytopal Discontinuous Galerkin Methods for the Numerical Modeling of Neurodegenerative Diseases

Download PDF

256 Accesses
1 Altmetric
Explore all metrics

Abstract

Many neurodegenerative diseases are connected to the spreading of misfolded prionic proteins. In this paper, we analyse the process of misfolding and spreading of both $\alpha $-synuclein and Amyloid-$\beta $, related to Parkinson’s and Alzheimer’s diseases, respectively. We introduce and analyze a positivity-preserving numerical method for the discretization of the Fisher-Kolmogorov equation, modelling accumulation and spreading of prionic proteins. The proposed approximation method is based on the discontinuous Galerkin method on polygonal and polyhedral grids for space discretization and on $\vartheta -$method time integration scheme. We prove the existence of the discrete solution and a convergence result where the Implicit Euler scheme is employed for time integration. We show that the proposed approach is structure-preserving, in the sense that it guarantees that the discrete solution is non-negative, a feature that is of paramount importance in practical application. The numerical verification of our numerical model is performed both using a manufactured solution and considering wavefront propagation in two-dimensional polygonal grids. Next, we present a simulation of $\alpha $-synuclein spreading in a two-dimensional brain slice in the sagittal plane. The polygonal mesh for this simulation is agglomerated maintaining the distinction of white and grey matter, taking advantage of the flexibility of PolyDG methods in the mesh construction. Finally, we simulate the spreading of Amyloid-$\beta $ in a patient-specific setting by using a three-dimensional geometry reconstructed from magnetic resonance images and an initial condition reconstructed from positron emission tomography. Our numerical simulations confirm that the proposed method is able to capture the evolution of Parkinson’s and Alzheimer’s diseases.

From a Microscopic to a Macroscopic Model for Alzheimer Disease: Two-Scale Homogenization of the Smoluchowski Equation in Perforated Domains

Article 18 February 2016

Smoluchowski Equation with Variable Coefficients in Perforated Domains: Homogenization and Applications to Mathematical Models in Medicine

Stability in distribution for a stochastic Alzheimer’s disease model with reaction diffusion

Article 06 April 2022

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

A neurodegenerative disease is a process that causes the progressive death or function loss of neurons. Many different pathologies belong to this group and some of them are called proteinopathies because their aetiology involves misfolding and aggregation of prions into toxic and insoluble proteins [28]. Typical examples of proteins that undergo this process are $\alpha $-Synuclein, related to Parkinson’s disease [27], and Amyloid-$\beta $, whose aggregation is a triggering mechanism of Alzheimer’s disease [25].

Recently, the mathematical modelling of prion dynamics has been studied to elucidate how the physical processes at the basis of the agglomeration and diffusion processes can be related to complex brain structure and functioning. A mathematical description of the spreading at the macroscopic level can be a useful tool in clinical practice, where the use of positron emission tomography imaging (PET) is often considered too invasive and expensive [23]. Moreover, for some pathologies, like $\alpha $-sinucleopathies, there not exist satisfactory chemical ligands [24] that prevent diagnostic investigations, and this computed-assisted modelling is mandatory.

Concerning the numerical modelling of neurodegeneration, the most diffused mathematical description of this phenomenon is based on the Fisher-Kolmogorov (FK) equation (also known as Fisher-KPP) [18, 19]. This model is a nonlinear diffusion–reaction equation that is particularly suited to describe biological species’ evolution [20, 33]. Many different numerical methods have been proposed to compute the approximate solution of the FK equation, also in the context of brain neurodegeneration. For example, we recall Finite Element Methods (FEM) [12, 38], Boundary Elements Methods (BEM) [17], and Discontinuous Galerkin (DG) methods [13].

In the context of modelling neurodegenerative disorders, the solution c of the FK problem represents the (relative) concentration of misfolded proteins, which needs to be non-negative. It can be shown that in the continuous formulation, the solution of the FK equation has two equilibrium states: $c=1$ and $c=0$ [35]. However, due to the unstable nature of the second equilibrium, at the discrete level, it is fundamental to construct a positivity-preserving numerical method to avoid numerical instabilities that lead to unphysical (negative) numerical solutions [13]. For this reason, some works analyze the construction of suitable positivity-preserving methods both within the context of finite differences [36] and DG [43] methods. The latter work uses a change of variable based on the exponential transformation to ensure positivity and entropy preservation at the discrete level. Indeed the construction of numerical methods that preserve some structures, such as entropy dissipation [45, 46], other physical constraints [44], or the invariant domain [47, 48], is a fundamental field in the actual research of numerical analysis.

Starting from the high-order idea of [43] - limited to simplicial meshes - in this work we present and analyse a DG formulation on polygonal/polyhedral grids (PolyDG). The proposed approach presents several advantages and novelties: (i) The flexibility in the construction of the mesh, based on mesh agglomeration [29]. This plays an important role, especially because of the complexity of the geometrical domain of the application at hand, i.e., the human brain; (ii) The freedom in the choice of discretization parameters, like the polynomial degree, which might locally change, from element to element [30]. In the context of brain neurodegeneration, where the geometrical complexity of the domain is an issue, the use of elementwise approximation orders allows us to reduce the computational cost, without affecting the correctness of wavefront propagation; (iii) The use of higher-order time integration. Indeed, to the timescales of the brain neurodegeneration process (that typically means over decades), the use of low-order time integration methods is not convenient to catch the wave propagation correctly. For this reason, we adopt a second-order time integration strategy; (iv) Finally, we consider spatially varying and discontinuous physical parameters, which are fundamental to correctly describe the axonal diffusion of prionic proteins [2, 38].

From the point of view of the analysis, we extend the proof of the existence of the numerical solution provided in [43] for the implicit Euler method, to the generic $\vartheta $-method. The proof of existence is based on the use of the Leray-Shauder fixed point theorem and relies on the coercivity and continuity of the diffusion term. Even though the convergence of the fully discrete numerical solution to the analytical one is not theoretically proved, it is numerically demonstrated in the case $\vartheta =0.5$ (Crank-Nicolson (CN) scheme), with application to brain neurodegenerative diseases, and it is shown that it outperforms first-order advancing schemes.

Concerning the application to the modelling of neurodegenerative disorders, the typical solution of the FK model is a wavefront propagating inside the brain geometry. For this reason, we analyze also the capabilities of our method in approximating wavefronts, providing also a comparison with the DG method proposed in [13], which is proven to suffer possible instabilities due to the fact it does not preserve the positivity when low order polynomial degrees are employed.

The paper is organized as follows. In Sect. 2, we introduce the FK mathematical model and discuss its application to neurodegeneration. In Sect. 3, we introduce the PolyDG space discretization and the time discretization using the $\vartheta $-method. Moreover, we show the coercivity and continuity of the variational forms in order to prove the existence of the discrete solution, and we discuss the extension of the convergence results of the fully discrete formulation. In Sect. 4, we present some convergence tests with a known exact solution and we discuss the accuracy of the proposed scheme in approximating travelling waves in a two-dimensional setting, making a comparison with the DG method of [13]. Section 5 is dedicated to the application of the proposed method to $\alpha $-Synuclein spreading in Parkinson’s disease in a two-dimensional framework, employing agglomerated polygonal meshes, and Amyloid-$\beta $ in Alzheimer’s disease in a three-dimensional patient-specific geometry, with initial conditions reconstructed from PET images. Finally, in Sect. 6, we draw some conclusions and discuss future developments.

2 The Mathematical Model

In this section, we present the FK equation to describe the reaction and diffusion of misfolded proteins. Given the final time $T>0$, the problem depends on the time $t\in (0,T]$ and space $\varvec{x}\in \Omega \subset \mathbb {R}^d$ ($d=2,3$) variables. The unknown is the relative concentration of the misfolded protein $c=c(\varvec{x},t)$, taking values in the interval [0, 1]. A detailed derivation of the model can be found in [38]. The problem in its strong formulation reads as follows: Find $c=c(\varvec{x},t)$ such that:

$$\begin{aligned} {\left\{ \begin{array}{ll} \dfrac{\partial c}{\partial t} =\nabla \cdot (\textbf{D} \nabla \, c) + \alpha \,c(1-c) + f, &{} \textrm{in}\ \Omega \times (0,T], \\ (\textbf{D}\nabla c) \cdot \varvec{n} = 0, &{} \textrm{on}\;\Gamma _N\times (0,T], \\ c = c_\textrm{D}, &{} \textrm{on}\;\Gamma _D\times (0,T], \\ c(0)=c_0, &{} \textrm{in}\;\Omega , \\ \end{array}\right. } \end{aligned}$$

(1)

where $\alpha =\alpha (\varvec{x})$ is the reaction parameter, representing the local conversion rate of the proteins from the healthy to the misfolded state, modelling also the clearance mechanisms [3, 4], and $\textbf{D}=\textbf{D}(\varvec{x})\in \mathbb {R}^{d\times d}$ is the diffusion tensor, denoting the spreading of misfolded protein. Finally, $f=f(\varvec{x},t)$ is the forcing term modelling the external addition of mass. Concerning the boundary conditions, we impose null flux at the boundary $\Gamma _N$ of the domain, while $c_\textrm{D}$ fixes the value of concentration on a part of the boundary $\Gamma _D$, where $\{\Gamma _D,\Gamma _N\}$ forms a partition of $\partial \Omega $, namely, $\Gamma _D \cup \Gamma _N = \partial \Omega $, $\Gamma _D \cap \Gamma _N = \emptyset $, and $|\Gamma _D|>0$.

Due to the physical meaning of the solution c, we aim to construct a positivity-preserving numerical scheme. Following [43], we apply the exponential transformation $c=e^\lambda $, where $\lambda =\lambda (\varvec{x},t)$ becomes the new unknown of the problem. As a result, we obtain the following strong formulation of the problem: Find $\lambda =\lambda (\varvec{x},t)$ such that:

$$\begin{aligned} {\left\{ \begin{array}{ll} \dfrac{\partial e^\lambda }{\partial t} =\nabla \cdot (e^\lambda \textbf{D} \nabla \, \lambda ) + \alpha \,e^\lambda (1-e^\lambda ) + f, &{} \textrm{in}\,\Omega \times (0,T], \\ (\textbf{D}\nabla \lambda ) \cdot \varvec{n} = 0, &{} \textrm{on}\;\Gamma _N\times (0,T], \\ \lambda = \lambda _\textrm{D}, &{} \textrm{on}\;\Gamma _D\times (0,T], \\ \lambda (0)=\lambda _0, &{} \textrm{in}\;\Omega , \\ \end{array}\right. } \end{aligned}$$

(2)

The homogeneous Neumann boundary condition in problem (1) reflects a homogeneous Neumann boundary condition also in problem (2). Concerning the initial condition and the Dirichlet boundary term we impose that $c_0=e^{\lambda _0}$ and $c_\textrm{D}=e^{\lambda _\textrm{D}}$, respectively.

We make the following assumption on the data regularity.

Assumption 1

(Data’s regularity) We assume the following regularity on the data appearing in (1):

$\alpha \in L^\infty (\Omega )$;
$\varvec{\textrm{D}}\in L^\infty (\Omega ,\mathbb {R}^{d\times d})$, and $\exists d_0,D_0>0\;\forall \varvec{\xi }\in \mathbb {R}^d:\; d_0|\varvec{\xi }|^2 \le \varvec{\xi }^\top \textbf{D}\varvec{\xi } \le D_0|\varvec{\xi }|^2;$
$f\in L^2((0,T],L^2(\Omega ))$;
$\lambda _\textrm{D} \in L^2((0,T]; H^{1/2}(\Gamma _D))$.

3 Numerical Discretization

This section presents the discretization of the continuous problem (2), which is based on the polygonal discontinuous Galerkin method for the space discretization and the $\vartheta -$method for the time advancement.

3.1 Discrete Setting and Preliminary Estimates

Let $\mathscr {T}_h$ be a polytopic mesh partition of the domain $\Omega $, being the collection of disjoint polygonal/polyhedral elements K. For each element $K\in \mathscr {T}_h$, |K| denotes the Hausdorff measure of the element, and $h_K$ denotes its diameter. We set $h=\max _{K\in \mathscr {T}_h} h_K$. Given two neighboring elements $K_1,\, K_2\in \mathscr {T}_h$, their interface is defined as the intersection of their $(d-1)-$dimensional facets. In the case of $d=2$, the interface is a collection of line segments and the set of all of them is denoted with $\mathscr {F}_h$. In the case $d=3$, the interface can be a generic polygon; for this reason, we introduce a decomposition of the polygon in planar triangles collected in the set $\mathscr {F}_h$. Finally, we decompose $\mathscr {F}_h$ into the union of interior faces ($\mathscr {F}^\textrm{I}_h$) and boundary faces ($\mathscr {F}^\textrm{B}_h$), i.e. $\mathscr {F}_h= \mathscr {F}^\textrm{I}_h\cup \mathscr {F}^\textrm{B}_h$. Moreover, we assume that $\mathscr {F}^\textrm{B}_h$ can be further split according to the corresponding boundary condition: $\mathscr {F}^\textrm{B}_h= \mathscr {F}_h^D\cup \mathscr {F}_h^N$, where $\mathscr {F}_h^D$ and $\mathscr {F}_h^N$ are the boundary faces contained in $\Gamma _D$ and $\Gamma _N$, respectively. The last assumption implies that any $F\in \mathscr {F}^\textrm{B}_h$ is contained in either $\Gamma _D$ or $\Gamma _N$.

Assumption 2

(Mesh Regularity [40]) The mesh sequence $\{\mathscr {T}_h\}_h$ satisfies the following properties:

1.
Shape Regularity: $\forall K\in \mathscr {T}_h\;it\;holds:\quad h_K^d\lesssim q|K|\lesssim h_K^d$.
2.
Contact Regularity: $\forall F\in \mathscr {F}_h$ with $F\subseteq \overline{K}$ for some $K\in \mathscr {T}_h$, it holds $h_K^{d-1}\lesssim |F|$, where |F| is the Hausdorff measure of the face F.
3.
Submesh Condition: There exists a shape-regular, conforming, matching simplicial submesh $\widetilde{\mathscr {T}_h}$ such that:
- $\forall \widetilde{K}\in \widetilde{\mathscr {T}_h}\;\exists K\in \mathscr {T}_h:\quad \widetilde{K}\subseteq K$.
- The family $\{\widetilde{\mathscr {T}_h}\}_h$ is shape and contact regular.
- $\forall \widetilde{K}\in \widetilde{\mathscr {T}_h}, K\in \mathscr {T}_h$ with $\widetilde{K} \subseteq K$, it holds $h_K \lesssim h_{\widetilde{K}}$.

Remark 1

We remark that most of the analysis is valid also under milder assumptions on the mesh [6]; however in this work, we need to refer to the ones in Assumption 2. The technical point is the validity of (7) that holds under mesh assumptions of Assumption 2.3. However, we notice that from the numerical results of Sects. 4 and 5, the assumption seems not to be needed in practice.

Concerning the space discretization, we introduce the following discontinuous finite element spaces with an elementwise variable polynomial degree:

$$\begin{aligned}{} & {} W_{h,p}^\textrm{DG}= \{w\in L^2(\Omega ):\quad w|_K\in \mathbb {P}_{p_K}(K)\quad \forall K\in \mathscr {T}_h\}, \\{} & {} \textbf{W}_{h,p}^{\textrm{DG}} = \{\textrm{W}\in L^2(\Omega ;\mathbb {R}^{d\times d}):\quad \textrm{W}|_K\in \mathbb {P}_{p_K}^{d\times d}(K)\quad \forall K\in \mathscr {T}_h\}, \end{aligned}$$

where $\mathbb {P}_{p_K}(K)$ is the space of polynomials of total degree $p_K\ge 1$ over a mesh element K. Concerning the physical data, we assume $\textbf{D}\in \textbf{W}_{h,p}^{\textrm{DG}}$ and $\alpha \in W_{h,p}^\textrm{DG}$. We introduce the following trace operators [42]. Let $F\in \mathscr {F}^\textrm{I}_h$ be a face shared by the elements $K^\pm $ and let $\varvec{n}^\pm $ be the unit normal vector on face F pointing exterior to $K^\pm $, respectively. Then, for sufficiently regular scalar-valued functions v and vector-valued functions $\varvec{q}$, we define:

the average operator $\{\!\!\{{\cdot }\}\!\!\}$ on $F\in \mathscr {F}^\textrm{I}_h$: $\{\!\!\{{v}\}\!\!\}= \dfrac{1}{2} (v^+ + v^-), \quad \{\!\!\{{\varvec{q}}\}\!\!\}= \dfrac{1}{2} (\varvec{q}^+ + \varvec{q}^-)$;
the jump operator $[\![{\cdot }]\!]$ on $F\in \mathscr {F}^\textrm{I}_h$: $[\![{v}]\!]= v^+\varvec{n}^+ + v^-\varvec{n}^-, \quad [\![{\varvec{q}}]\!]= \varvec{q}^+\cdot \varvec{n}^+ + \varvec{q}^-\cdot \varvec{n}$.

The superscripts ± denote the traces of the functions on F taken within the interior to $K^\pm $. In an analogous way, on the face $F\in \mathscr {F}_h^D$ associated with the cell $K\in \mathscr {T}_h$ with $\varvec{n}$ outward unit normal on $\partial \Omega $, we define:

the average operator $\{\!\!\{{\cdot }\}\!\!\}$ on $F\in \mathscr {F}_h^D$: $\{\!\!\{{v}\}\!\!\}= v, \quad \{\!\!\{{\varvec{q}}\}\!\!\}= \varvec{q}$;
the standard jump operator $[\![{\cdot }]\!]$ on $F\in \mathscr {F}_h^D$, with Dirichlet conditions g, $\varvec{g}$: $[\![{v}]\!]= (v-g)\varvec{n}, \quad [\![{\varvec{q}}]\!]= (\varvec{q}-\varvec{g})\cdot \varvec{n}$.

Let us introduce the following broken Sobolev spaces for an integer $r\ge 1$: $H^r(\mathscr {T}_h) = \{w_h\in L^2(\Omega ): w_h|_K\in H^r(K)\quad \forall K\in \mathscr {T}_h\}$. Moreover, we introduce the shorthand notation for the $L^2$-norm $\Vert \cdot \Vert =\Vert \cdot \Vert _{L^2(\Omega )}$ and for the $L^2$-norm on a set of faces $\mathscr {F}$ as $\Vert \cdot \Vert _\mathscr {F}=\left( \sum _{F\in \mathscr {F}}\Vert \cdot \Vert _{L^2(F)}^2\right) ^{1/2}$. We define the following penalization function $\eta :\mathscr {F}^\textrm{I}_h\cup \mathscr {F}_h^D\rightarrow \mathbb {R}_+$:

$$\begin{aligned} \eta (\lambda ) = \zeta {\left\{ \begin{array}{ll}\max \{(e^\lambda )_{+},(e^\lambda )_{-}\}^2\max \left\{ e^{\Vert \lambda \Vert _{L^\infty (K_{+})}},e^{\Vert \lambda \Vert _{L^\infty (K_{-})}}\right\} &{} \textrm{on}\; F\in \mathscr {F}^\textrm{I}_h, \\ (e^\lambda )^2 e^{\Vert \lambda \Vert _{L^\infty (K)}} &{} \textrm{on}\; F\in \mathscr {F}_h^D, \end{array}\right. } \end{aligned}$$

(3)

where $\zeta = \zeta (p,h,D)$ is a parameter that depends on the discretization parameters and the diffusion tensor, defined as

$$\begin{aligned} \zeta = \zeta (p,h,D) = \eta _0 {\left\{ \begin{array}{ll} \{D_K\}_{\textrm{A}}\dfrac{\{p_K^2\}_\textrm{A}}{\{h_K\}_\textrm{H}}, &{} \textrm{on}\; F\in \mathscr {F}^\textrm{I}_h\\ D_K\dfrac{p_K^2}{h_K}, &{} \textrm{on}\; F\in \mathscr {F}_h^D\end{array}\right. }. \end{aligned}$$

(4)

We point out that in Eq. (3), we are considering both the harmonic average operator $\{\cdot \}_\textrm{H}$ and the arithmetic average operator $\{\cdot \}_\textrm{A}$ on $F\in \mathscr {F}^\textrm{I}_h$ and $\eta _0$ is a parameter at our disposal (to be chosen large enough to ensure stability). Moreover, we are defining $D_K = \Vert \sqrt{\textbf{D}}|_K\Vert ^2$.

Remark 2

The choice of using the harmonic average on the mesh size $h_K$ and not an arithmetic one is fundamental in the theoretical analysis of the next sections. Indeed, in the proof of coercivity and continuity of $\mathscr {A}$, we will exploit the following relation:

$$\begin{aligned} \dfrac{\{h_K\}_\textrm{H}}{ \{D_K\}_\textrm{A} \{p_K^2\}_\textrm{A}} \le 4 \min \left\{ \dfrac{h_{K_-}}{D_{K_-} p^2_{K_-}},\dfrac{h_{K_+}}{D_{K_+} p^2_{K_+}}\right\} . \end{aligned}$$

(5)

This relation cannot be obtained using the arithmetic average of $h_K$, because, in general, the arithmetic average cannot be bounded by a constant multiplied by the minimum of the two terms.

Next, we define the following DG-norm:

$$\begin{aligned} \Vert c\Vert _{\textrm{DG}}^2 = \left\| \sqrt{\textbf{D}}\nabla _h c \right\| ^2 + \Vert \sqrt{\zeta }[\![c]\!]\Vert _{\mathscr {F}^\textrm{I}_h\cup \mathscr {F}_h^D}^2 \qquad \forall c\in H^1(\mathscr {T}_h). \end{aligned}$$

(6)

The choice of using this combination of harmonic and arithmetic averages is fundamental to obtaining the coercivity and continuity bounds of Propositions 1 and 2 below.

Finally, we recall the result of inverse trace inequality [31, 32]:

$$\begin{aligned} \exists C_I >0: \qquad \Vert v\Vert ^2_{L^2(\partial K)} \le C_I \dfrac{p^2_K}{h_K}\Vert v\Vert _{L^2(K)}\qquad \forall v\in \mathbb {P}^{p_K}(K),\;K\in \mathscr {T}_h. \end{aligned}$$

(7)

3.2 PolyDG Semi-Discrete Formulation

To construct the semi-discrete formulation, we first introduce the interior penalty DG discretization of the nonlinear diffusion term $\mathscr {A}:W_{h,p}^\textrm{DG}\times W_{h,p}^\textrm{DG}\times W_{h,p}^\textrm{DG}\rightarrow \mathbb {R}$ as:

$$\begin{aligned} \begin{aligned} \mathscr {A}(u;v,w) =&\int _{\Omega } e^u \left( \textbf{D}\nabla _h v\cdot \nabla _h w\right) + \sum _{F\in \mathscr {F}^\textrm{I}_h\cup \mathscr {F}_h^D}\int _{F}\eta (u) [\![v]\!]\cdot [\![w]\!]\textrm{d}\sigma \\ -&\sum _{F\in \mathscr {F}^\textrm{I}_h\cup \mathscr {F}_h^D}\int _{F}\left( \{\!\!\{e^u \textbf{D} \nabla v\}\!\!\}\cdot [\![w ]\!]+ [\![v]\!]\cdot \{\!\!\{e^u \textbf{D} \nabla w\}\!\!\}\right) \textrm{d}\sigma \quad \forall u,v,w\in W_{h,p}^\textrm{DG}, \end{aligned} \end{aligned}$$

(8)

where $\nabla _h \cdot $ is the elementwise gradient [39] and $\eta $ is defined as in Eq. (3). The semi-discrete PolyDG formulation reads as follows:

For any $t\in (0,T]$, find $\lambda _h(t)\in W_{h,p}^\textrm{DG}$ such that:

$$\begin{aligned} {\left\{ \begin{array}{ll} \left( \dfrac{\partial e^{\lambda _h(t)}}{\partial t},\varphi _h\right) _\Omega + \mathscr {A}(\lambda _h(t);\lambda _h(t),\varphi _h) - \left( \alpha e^{\lambda _h}\left( 1-e^{\lambda _h}\right) ,\varphi _h\right) _\Omega = F(\varphi _h) &{} \forall \varphi _h\!\in \! W_{h,p}^\textrm{DG}, \\ \lambda _h(0)\!=\!\lambda _{0h} \end{array}\right. } \end{aligned}$$

(9)

where $\lambda _{0h}\in W_{h,p}^\textrm{DG}$ is a suitable approximation of $\lambda _0\in W$. In this work, we choose as $\lambda _{0h}$ the $L^2$-projection of $\lambda _0$ on the space $W_{h,p}^\textrm{DG}$.

Remark 3

We remark that the proposed formulation is not robust with respect to high-contrast or strongly anisotropic diffusion tensor $\textbf{D}$. However, notice that the biological application considered in this work does not feature large jumps in the diffusion tensor $\textbf{D}$. None the less, using the weighted average and jump operators as proposed in [21, 22] should lead to a formulation that also features such robustness.

Remark 4

Notice that, due to the change of variable $c=e^\lambda $, the errors in the estimation of the solution $\lambda $ are exponentially amplified. However, considering that the final goal is to estimate the value of the concentration c, we focus our attention both theoretically and numerically on the computation of the error $\Vert c-e^{\lambda _h}\Vert $.

We next show some preliminary estimates that will be needed in the forthcoming well-posedness and convergence analysis.

Proposition 1

(Coercivity of $\mathscr {A}$) The form $\mathscr {A}$, defined in Eq. (8), satisfies for all $v\in W_{h,p}^\textrm{DG}$:

$$\begin{aligned} \mathscr {A}(v;v,v) \ge \dfrac{1}{2} \Vert e^{v/2}\Vert _\textrm{DG}^2, \end{aligned}$$

(10)

under the assumption on the penalty parameter value $\eta _0 \ge 16 C_I^2D_0$, where $d_0$ and $D_0$ are defined as in Assumption 1, and $C_I$ is the inverse trace inequality constant of relation (7).

Proof

Taking $u=v=w$ in Eq. (8), we have:

$$\begin{aligned} \begin{aligned} \mathscr {A}(v;v,v)&= \underset{\mathrm {(I)}}{\underbrace{\int _\Omega e^v(\textbf{D}\nabla _h v) \cdot \nabla _h v}} + \sum _{F\in \mathscr {F}^\textrm{I}_h\cup \mathscr {F}_h^D} \int _F \eta (v) |[\![v]\!]|^2 \textrm{d}\sigma \\ {}&\quad - \underset{\mathrm {(II)}}{\underbrace{2 \sum _{F\in \mathscr {F}^\textrm{I}_h\cup \mathscr {F}_h^D} \int _F \{\!\!\{e^v\textbf{D}\nabla v\}\!\!\}\cdot [\![v]\!]\textrm{d}\sigma }}. \end{aligned} \end{aligned}$$

(11)

By treating each term separately, we obtain for $\mathrm {(I)}$ the following estimate:

$$\begin{aligned} \mathrm {(I)} \ge \int _\Omega e^v|\sqrt{\textbf{D}}\nabla _h v|^2 = \int _\Omega |\sqrt{\textbf{D}}e^{v/2}\nabla _h v|^2 = 4 \int _\Omega |\sqrt{\textbf{D}}\nabla _h e^{v/2}|^2. \end{aligned}$$

(12)

Then we control the term $\mathrm {(II)}$ by means of the Young’s inequality:

$$\begin{aligned} |\mathrm {(II)}| \le \underset{\mathrm {(III)}}{\underbrace{\sum _{F\in \mathscr {F}^\textrm{I}_h\cup \mathscr {F}_h^D}\int _F \beta _F |\{\!\!\{e^v\textbf{D}\nabla v\}\!\!\}|^2 \textrm{d}\sigma }} + \sum _{F\in \mathscr {F}^\textrm{I}_h\cup \mathscr {F}_h^D} \int _F \dfrac{1}{\beta _F} |[\![v]\!]|^2 \textrm{d}\sigma , \end{aligned}$$

(13)

where $\beta _F>0$ is a parameter we define as follows:

$$\begin{aligned} \beta _F = \dfrac{\min \left\{ e^{-\Vert v\Vert _{L^\infty (K_{+})}},e^{-\Vert v\Vert _{L^\infty (K_{-})}}\right\} }{8 D_0 C_I^2\max \{(e^v)_{+},(e^v)_{-}\}^2} {\left\{ \begin{array}{ll} \dfrac{\{h_K\}_\textrm{H}}{\{D_K\}_\textrm{A} \{p_K^2\}_\textrm{A}}, &{} \textrm{on}\; F\in \mathscr {F}^\textrm{I}_h,\\ \dfrac{h_K}{D_K p_K^2}, &{} \textrm{on}\; F\in \mathscr {F}_h^D. \end{array}\right. } \end{aligned}$$

(14)

In (14) $d_0$ and $D_0$ are defined as in Assumption 1 and $C_I$ is the inverse trace inequality constant of relation (7). Then, by applying the inverse trace inequality and relation (5) we obtain:

$$\begin{aligned} \begin{aligned} \mathrm {(III)} \le&\sum _{K\in \mathscr {T}_h} \dfrac{1}{8 D_0}\int _{\partial K} 4\dfrac{h_K\,e^{-\Vert v\Vert _{L^\infty (K)}}}{C_I^2 D_K p_K^2} |\textbf{D} \nabla v|^2 \textrm{d}\sigma \\ \le&\sum _{K\in \mathscr {T}_h} \dfrac{1}{2}\int _{K} e^{-\Vert v\Vert _{L^\infty (K)}} |\sqrt{\textbf{D}}\nabla v|^2 \le \dfrac{1}{2}\int _{\Omega } e^v |\sqrt{\textbf{D}}\nabla _h v|^2= 2\int _{\Omega } |\sqrt{\textbf{D}}\nabla _h e^{v/2}|^2. \end{aligned} \end{aligned}$$

Inserting the above estimates in Eq. (10), we obtain:

$$\begin{aligned} \mathscr {A}(v;v,v) \ge 2\int _{\Omega } |\sqrt{\textbf{D}}\nabla _h e^{v/2}|^2 + \sum _{F\in \mathscr {F}^\textrm{I}_h\cup \mathscr {F}_h^D} \int _F \left( \eta (v)-\dfrac{1}{\beta _F}\right) |[\![v]\!]|^2 \textrm{d}\sigma . \end{aligned}$$

(15)

For $F\in \mathscr {F}^\textrm{I}_h$, the second integral on the rhs of Eq. (15) is positive provided that:

$$\begin{aligned}{} & {} \eta (v)-\dfrac{1}{\beta _F} \\{} & {} \quad = \left( \eta _0 - 8C_I^2D_0\right) \dfrac{\{D_K\}_{\textrm{A}}\{p_K^2\}_\textrm{A}}{\{h_K\}_\textrm{H}}\max \{(e^v)_{+},(e^v)_{-}\}^2\max \left\{ e^{\Vert v\Vert _{L^\infty (K_{+})}},e^{\Vert v\Vert _{L^\infty (K_{-})}}\right\} >0. \end{aligned}$$

The same bound can be obtained on $F\in \mathscr {F}_h^D$. By taking $\eta _0 \ge 16C_I^2D_0$ the positivity is guaranteed, and by exploiting the following relation

$$\begin{aligned} \max \{(e^v)_{+},(e^v)_{-}\}\max \left\{ e^{\Vert v\Vert _{L^\infty (K_{+})}},e^{\Vert v\Vert _{L^\infty (K_{-})}}\right\} \ge 1 \end{aligned}$$

we obtain:

$$\begin{aligned} \begin{aligned} \mathscr {A}(v;v,v)&\ge 2\int _{\Omega } |\sqrt{\textbf{D}}\nabla _h e^{v/2}|^2 + \sum _{F\in \mathscr {F}^\textrm{I}_h} \dfrac{\eta _0}{2} \int _F \{D_K\}_{\textrm{A}}\dfrac{\{p_K^2\}_\textrm{A}}{\{h_K\}_\textrm{H}} |[\![e^{v/2}]\!]|^2 \textrm{d}\sigma \\ {}&\quad + \sum _{F\in \mathscr {F}_h^D} \dfrac{\eta _0}{2} \int _F D_K\dfrac{p_K^2}{h_K} |e^{v/2}|^2 \textrm{d}\sigma = 2\int _{\Omega } |\sqrt{\textbf{D}}\nabla _h e^{v/2}|^2 \\ {}&\quad + \sum _{F\in \mathscr {F}^\textrm{I}_h\cup \mathscr {F}_h^D} \dfrac{1}{2} \int _F \zeta |[\![e^{v/2}]\!]|^2 \textrm{d}\sigma \ge \dfrac{1}{2} \Vert e^{v/2}\Vert _\textrm{DG}^2, \end{aligned} \end{aligned}$$

(16)

where $\zeta $ has been defined in (4). $\square $

Proposition 2

(Continuity of $\mathscr {A}$) The form $\mathscr {A}$, defined in Eq. (8), satisfies for all $u, v\in W_{h,p}^\textrm{DG}$:

$$\begin{aligned} \left| \mathscr {A}(u;u,v)\right| \le \mu \max _{K\in \mathscr {T}_h} \{e^{\Vert u\Vert _{L^\infty (K)}}\} \Vert e^u\Vert _{\textrm{DG}} \,\Vert u\Vert _{\textrm{DG}} \,\Vert v\Vert _{\textrm{DG}} \,, \end{aligned}$$

(17)

with $\mu := \max \left\{ 1,\sqrt{\frac{4D_0C_I^2}{d_0\eta _0}}\right\} $, where $d_0$ and $D_0$ are defined as in Assumption 1, $C_I$ is the inverse trace inequality constant in (7), and $\eta _0$ is the penalty constant introduced in 4.

Proof

From Eq. (8) we obtain:

$$\begin{aligned} \begin{aligned} \mathscr {A}(u;u,v)&= \underset{\mathrm {(I)}}{\underbrace{\int _\Omega e^u(\textbf{D}\nabla _h u) \cdot \nabla _h v}} + \sum _{F\in \mathscr {F}^\textrm{I}_h\cup \mathscr {F}_h^D} \underset{\mathrm {(II)}}{\underbrace{\int _F \eta (u) [\![u]\!]\cdot [\![v]\!]\textrm{d}\sigma }} \\ {}&\quad - \underset{\mathrm {(III)}}{\underbrace{\int _F \{\!\!\{e^u\textbf{D}\nabla u\}\!\!\}\cdot [\![v]\!]\textrm{d}\sigma }} -\underset{\mathrm {(IV)}}{\underbrace{ \int _F \{\!\!\{e^u\textbf{D}\nabla v\}\!\!\}\cdot [\![u]\!]\textrm{d}\sigma }} \end{aligned} \end{aligned}$$

(18)

By treating each term separately, we obtain for $\mathrm {(I)}$ the following estimate using the regularity assumption on $\textrm{D}$ in Assumption 1:

$$\begin{aligned} |\mathrm {(I)}| \le \int _\Omega e^u|\sqrt{\textbf{D}}\nabla _h u|\ |\sqrt{\textbf{D}}\nabla _h v| = \int _\Omega |\sqrt{\textbf{D}}\nabla _h e^u|\ |\sqrt{\textbf{D}}\nabla _h v| \le \Vert \sqrt{\textbf{D}}\nabla _h e^u\Vert \,\Vert \sqrt{\textbf{D}}\nabla _h v\Vert . \nonumber \\ \end{aligned}$$

(19)

Then, we control the term $\mathrm {(II)}$ by means of the Young’s inequality:

$$\begin{aligned} |\mathrm {(II)}| \le \max _{K\in \mathscr {T}_h} \{e^{\Vert u\Vert _{L^\infty (K)}} \} \Vert \sqrt{\zeta } [\![u]\!]\Vert _{\mathscr {F}^\textrm{I}_h\cup \mathscr {F}_h^D}\Vert \sqrt{\zeta } [\![v]\!]\Vert _{\mathscr {F}^\textrm{I}_h\cup \mathscr {F}_h^D}. \end{aligned}$$

(20)

The bound on term $\mathrm {(III)}$ follows thanks to the Young’s inequality:

$$\begin{aligned} |\mathrm {(III)}| \le \sum _{F\in \mathscr {F}^\textrm{I}_h\cup \mathscr {F}_h^D} \Big (\underset{\mathrm {(V)}}{\underbrace{\int _F \gamma _F |\{\!\!\{e^u\textbf{D}\nabla u\}\!\!\}|^2 \textrm{d}\sigma }}\Big )^{1/2}\Big (\int _F \dfrac{1}{\gamma _F} |[\![v]\!]|^2 \textrm{d}\sigma \Big )^{1/2}, \end{aligned}$$

(21)

where $\gamma _F>0$ is defined as follows:

$$\begin{aligned} \gamma _F = \dfrac{d_0^2}{8 D_0 C_I^2} {\left\{ \begin{array}{ll} \dfrac{\{h_K\}_\textrm{H}}{\{D_K\}_\textrm{A} \{p_K^2\}_\textrm{A}}, &{} \textrm{on}\; F\in \mathscr {F}^\textrm{I}_h,\\ \dfrac{h_K}{D_K p_K^2}, &{} \textrm{on}\; F\in \mathscr {F}_h^D. \end{array}\right. } \end{aligned}$$

(22)

Then, by applying the inverse trace inequality in relation (7) and relation (5) we obtain:

$$\begin{aligned} |\mathrm {(V)}|\le & {} \sum _{K\in \mathscr {T}_h} \dfrac{d_0^2}{8 D_0}\int _{\partial K} 4\dfrac{h_K}{C_I^2 D_K p_K^2} |e^u \textbf{D} \nabla u|^2 \textrm{d}\sigma \nonumber \\\le & {} \sum _{K\in \mathscr {T}_h} \dfrac{d_0}{2}\int _{K} |\sqrt{\textbf{D}}\nabla e^u|^2 = \dfrac{d_0}{2} \Vert \sqrt{\textbf{D}}\nabla _h e^u\Vert ^2. \end{aligned}$$

From the above estimates, it follows:

$$\begin{aligned} |\mathrm {(III)}| \le \sqrt{\dfrac{4D_0C_I^2}{d_0\eta _0}} \Vert \sqrt{\textbf{D}}\nabla _h e^u\Vert \,\Vert \sqrt{\zeta }[\![v]\!]\Vert ^2_{\mathscr {F}^\textrm{I}_h\cup \mathscr {F}_h^D}. \end{aligned}$$

Finally, we estimate the term $\mathrm {(IV)}$ by applying the definition of $\gamma _F$, and relation (5) on the first integral. Then, we apply the inverse trace inequality in relation (7), to obtain:

$$\begin{aligned} \begin{aligned} |\mathrm {(IV)}| \le&\sum _{F\in \mathscr {F}^\textrm{I}_h\cup \mathscr {F}_h^D} \Big (\int _F \gamma _F |\{\!\!\{e^u\textbf{D}\nabla v\}\!\!\}|^2 \textrm{d}\sigma \Big )^{1/2} \Big (\int _F \dfrac{1}{\gamma _F} |[\![u]\!]|^2 \textrm{d}\sigma \Big )^{1/2} \\ \le&\Big (\sum _{K\in \mathscr {T}_h} \dfrac{d_0^2}{8 D_0}\int _{\partial K} 4\dfrac{h_K}{C_I^2 D_K p_K^2} |e^u \textbf{D} \nabla v|^2 \textrm{d}\sigma \Big )^{1/2} \sqrt{\dfrac{8D_0C_I^2}{d_0^2\eta _0}}\Vert \sqrt{\zeta }[\![u]\!]\Vert _{\mathscr {F}^\textrm{I}_h\cup \mathscr {F}_h^D} \\ \le&\sqrt{\max _{K\in \mathscr {T}_h} \{e^{\Vert u\Vert _{L^\infty (K)}}\}\dfrac{4D_0C_I^2}{d_0\eta _0}}\Vert \sqrt{\textbf{D}}\nabla _h v\Vert \,\Vert \sqrt{\zeta }[\![u]\!]\Vert _{\mathscr {F}^\textrm{I}_h\cup \mathscr {F}_h^D}. \end{aligned} \end{aligned}$$

Finally, putting together all the previous bounds, we obtain:

$$\begin{aligned} \left| \mathscr {A}(u;u,v)\right| \le \max \left\{ 1,\sqrt{\dfrac{4D_0C_I^2}{d_0\eta _0}}\right\} \max _{K\in \mathscr {T}_h} \{e^{\Vert u\Vert _{L^\infty (K)}}\} \Vert e^u\Vert _{\textrm{DG}} \,\Vert u\Vert _{\textrm{DG}} \,\Vert v\Vert _{\textrm{DG}} , \end{aligned}$$

(23)

and the proof is complete. $\square $

3.3 Fully Discrete Formulation

To discretize Eq. (9) in time, we consider the $\vartheta -$method scheme. We remark that due to the nonlinear nature of the strong formulation with the change of variable, we need a nonlinear solver, and therefore using an implicit scheme for time integration does not affect the computational cost. In this section, we consider homogeneous Dirichlet conditions $\lambda _D=0$ for simplicity in the calculations. However, the results can be extended to the non-homogeneous case with proper regularity assumptions on $\lambda _D$.

Let $\{t_\ell \}_{\ell =0}^{N_t}$ be the uniform partition of the time interval [0, T] into $N_t$ intervals with length $dt=\frac{T}{N_t}$, namely, $0=t_0<t_1<...<t_{N_t}=T$ and $t_\ell =\frac{\ell T}{N_t}$ for $\ell =0,...,N_t$. Let us introduce a parameter $\varepsilon >0$. Then, the fully discrete formulation of problem (9) reads: given the initial condition $\lambda _h^0=\lambda _{0h}$, find $\lambda ^{k+1}_h$ for $k=0,...,N_t-1$, such that:

$$\begin{aligned} \begin{aligned}&\Bigg ( \dfrac{e^{\lambda _h^{k+1}} -e^{\lambda _h^{k}}}{\Delta t},\varphi _h\Bigg )_\Omega - \left( \alpha \left( \vartheta e^{\lambda _h^{k+1}}+(1-\vartheta ) e^{\lambda _h^{k}}\right) \left( 1-\left( \vartheta e^{\lambda _h^{k+1}}+(1-\vartheta ) e^{\lambda _h^{k}}\right) \right) ,\varphi _h\right) _\Omega \\&\quad + \dfrac{\varepsilon }{\Delta t} (\lambda _h^{k+1},\varphi _h)_\Omega + \dfrac{\varepsilon }{\Delta t} (\textbf{D}\nabla _h \lambda _h^{k+1},\nabla _h \varphi _h)_\Omega + \dfrac{\varepsilon }{\Delta t} (\zeta [\![\lambda _h^{k+1}]\!], [\![\varphi _h]\!])_{\mathscr {F}^\textrm{I}_h\cup \mathscr {F}_h^D} \\&\quad + \vartheta \mathscr {A}(\lambda _h^{k+1};\lambda _h^{k+1},\varphi _h) + (1-\vartheta )\mathscr {A}(\lambda _h^k;\lambda _h^k,\varphi _h) = \vartheta F^{k+1}(\varphi _h) + (1-\vartheta ) F^{k}(\varphi _h). \end{aligned} \end{aligned}$$

(24)

The introduction of the additional regularizing terms proportional to the parameter $\varepsilon >0$ is fundamental to prove the existence of the solution via the Leray-Schauder fixed-point theorem [43]. However, from a numerical point of view, the presence of a $\varepsilon > 0$ is not really needed and it can be chosen equal to 0 in the simulations (see Sect. 4).

We next prove that formulation (24) admits a solution.

Proposition 3

(Existence of a solution) Let $\varepsilon >0$. Given $\lambda _h^{k}\in W_{h,p}^\textrm{DG}$, then the fully discrete formulation in Eq. (24) admits a solution $\lambda _h^k\in W_{h,p}^\textrm{DG}$, provided that Assumptions 1 and 2 hold and the penalty constant $\eta _0$ defined as in (4) is chosen sufficiently large.

Proof

The proof is based on the application of the Leray-Schauder theorem. For clarity, we subdivide the proof into 3 steps. $\square $

3.3.1 Step 1: Definition of the Operator $\Phi $

First of all, let us introduce the fixed point operator $\Phi :W_{h,p}^\textrm{DG}\times [0,1]\rightarrow W_{h,p}^\textrm{DG}$ such that $\Phi (w,\sigma )=v$ with $v\in W_{h,p}^\textrm{DG}$ being the unique solution of the linear problem:

$$\begin{aligned} \begin{aligned} \varepsilon (v,\phi )_\Omega +&\varepsilon (\textbf{D}\nabla _h v,\nabla _h \phi )_\Omega + \varepsilon (\zeta [\![v]\!], [\![\phi ]\!])_{\mathscr {F}^\textrm{I}_h\cup \mathscr {F}_h^D}\\&=\sigma (e^{\lambda _h^{k}}-e^w,\phi )_\Omega \\&\quad + \sigma \left( \alpha \Delta t(\vartheta e^w+(1-\vartheta ) e^{\lambda _h^{k}}) (1-(\vartheta e^w+(1-\vartheta ) e^{\lambda _h^{k}})),\phi \right) _\Omega \\&\quad - \sigma \vartheta \Delta t\mathscr {A}(w;w,\phi ) - \sigma (1-\vartheta )\Delta t\mathscr {A}(\lambda _h^k;\lambda _h^k,\phi ) \\&\quad + \sigma \vartheta \Delta t F^{k+1}(\phi ) + \sigma (1-\vartheta )\Delta t F^{k}(\phi )\qquad \forall \phi \in W_{h,p}^\textrm{DG}. \end{aligned} \end{aligned}$$

(25)

3.3.2 Step 2: Compactness of $\Phi $

$\Phi $ is well defined by the Lax-Milgram lemma, thanks to the coercivity and continuity on $W_{h,p}^\textrm{DG}$ of the left-hand side of (25) and to the continuity of the right-hand side of (25). Finally, we observe that $\Phi (w,0)=0$. Due to the finite dimension of the space $W_{h,p}^\textrm{DG}$, these properties are enough to prove also the compactness of the operator.

3.3.3 Step 3: Uniform Bound for all the Fixed Points

To prove the property of uniform bound we take $v\in W_{h,p}^\textrm{DG}$ and $\sigma \in [0,1]$ such that $v=\Phi (v,\sigma )$. First of all, let us notice that we can bound the right-hand side of (25) by using the coercivity of $\mathscr {A}$ and the existence of a constant $M=M(\lambda _h^{k})$ such that $\alpha \Delta t(\vartheta e^v+(1-\vartheta ) e^{\lambda _h^{k}}) (1-\vartheta e^v)v \le M(\lambda _h^{k})$. Indeed, there holds

$$\begin{aligned} \begin{aligned}&\varepsilon \Vert v\Vert ^2 + \varepsilon \Vert v\Vert ^2_{\textrm{DG}} = \sigma (e^{\lambda _h^{k}}-e^v,v)_\Omega - \sigma \vartheta \Delta t\mathscr {A}(v;v,v) - \sigma (1-\vartheta )\Delta t\mathscr {A}(\lambda _h^k;\lambda _h^k,v) \\&\quad + \sigma \left( \alpha \Delta t(\vartheta e^v+(1-\vartheta ) e^{\lambda _h^{k}}) (1-(\vartheta e^v+(1-\vartheta ) e^{\lambda _h^{k}})),v\right) _\Omega \\ {}&\quad + \sigma \vartheta \Delta t F^{k+1}(v) + \sigma (1-\vartheta )\Delta t F^{k}(v) \\&\quad \le \sigma (e^{\lambda _h^{k}}-e^v,v)_\Omega - \sigma \left( \alpha \Delta t (1-\vartheta ) e^{\lambda _h^{k}}(\vartheta e^v+(1-\vartheta ) e^{\lambda _h^{k}}),v\right) _\Omega + \sigma M(\lambda _h^{k})\\&\qquad - \sigma (1-\vartheta )\Delta t\mathscr {A}(\lambda _h^k;\lambda _h^k,v) + \sigma \vartheta \Delta t F^{k+1}(v) + \sigma (1-\vartheta )\Delta t F^{k}(v). \end{aligned} \end{aligned}$$

(26)

Then, by introducing the function $s(x)=x(\log (x)-1)+1\ge 0$ and exploiting its convexity we obtain:

$$\begin{aligned} (e^{\lambda _h^{k}}-e^v)v = (e^{\lambda _h^{k}}-e^v)s'(e^v) \le s(e^{\lambda _h^{k}})-s(e^v). \end{aligned}$$

(27)

Thus, using also the fact that $-s(e^v)\le 0$ and relation (26) we obtain:

$$\begin{aligned} \begin{aligned}&\varepsilon \Vert v\Vert ^2 + \varepsilon \Vert v\Vert ^2_{\textrm{DG}} \le \sigma \int _\Omega \left( s(e^{\lambda _h^{k}}) (1 + \alpha \Delta t \vartheta (1-\vartheta ) e^{\lambda _h^{k}}) \right) - \sigma \left( \alpha \Delta t (1-\vartheta )^2 e^{2\lambda _h^{k}},v\right) _\Omega \\&\quad + \sigma M(\lambda _h^{k}) - \sigma (1-\vartheta )\Delta t\mathscr {A}(\lambda _h^k;\lambda _h^k,v) + \sigma \vartheta \Delta t F^{k+1}(v) + \sigma (1-\vartheta )\Delta t F^{k}(v). \end{aligned} \end{aligned}$$

Using Eq. (17) and the Young’s inequality with suitable coefficients $\epsilon _1$ and $\epsilon _2$, we get:

$$\begin{aligned} \begin{aligned}&\Big (\varepsilon - \dfrac{3}{2}\sigma \Delta t \epsilon _2\Big ) \Vert v\Vert ^2 + \left( \varepsilon -\dfrac{\sigma (1-\vartheta )\Delta t \mu \epsilon _1}{2}\right) \Vert v\Vert ^2_{\textrm{DG}} \le - \sigma \alpha \Delta t (1-\vartheta ) \Vert e^{2\lambda _h^{k}}\Vert ^2 \\&\quad + \sigma \int _\Omega \left( s(e^{\lambda _h^{k}}) (1 + \alpha \Delta t \vartheta (1-\vartheta ) e^{\lambda _h^{k}}) \right) \\&\quad + \dfrac{\sigma \Delta t}{2\epsilon _2} \left( \vartheta \Vert f(t^{k+1})\Vert ^2 + (1-\vartheta )\Vert f(t^{k+1})\Vert ^2 \right) \\&\quad + \sigma M(\lambda _h^{k}) + \dfrac{\sigma (1-\vartheta )\Delta t}{2\epsilon _1} \mu \max _{K\in \mathscr {T}_h} \{e^{\Vert \lambda _h^k\Vert _{L^\infty (K)}}\}^2 \Vert e^{\lambda _h^k}\Vert _{\textrm{DG}}^2 \,\Vert \lambda _h^k\Vert _{\textrm{DG}}^2. \end{aligned} \end{aligned}$$

(28)

By applying the Leray-Schauder theorem [35] we derive the existence of a solution for problem (24), and the proof is complete. $\square $

3.4 Convergence of the Discrete Solution

In this section, we prove the convergence of the solution to the PolyDG fully discrete formulation in Eq. (24) with $\vartheta =1$ (Implicit Euler method) to the solution of the continuous problem. An additional assumption we make in this proof is the forcing-free model $f=0$. The result follows by extending the convergence theorem proved in [43] to the case of polytopal/polyhedral meshes and high-order approximations.

Let us start introducing the notion of entropy $S:[0,T]\rightarrow \mathbb {R}$ of the system [41], namely

$$\begin{aligned} S(t) = \int _\Omega \left( u(t)(\log (u(t))-1)+1\right) . \end{aligned}$$

(29)

To prove the convergence of the numerical solution, we need to show that the discrete entropy $S_h^{k} = \int _\Omega (e^{\lambda _h^k}({\lambda _h^k}-1)+1)$ decays as $k\rightarrow \infty $ [43], and that the DG norm (see Eq. (6)) of the discrete solution is uniformly bounded.

Remark 5

The analysis in this section is performed only for the case $\vartheta =1$. The treatment of the general case $\vartheta \in [0,1]$ is not straightforward, due to the presence of the components from the previous timestep that cannot be easily treated and prevent to recover the decay of the discrete entropy. Nevertheless, as it will be demonstrated in the numerical result sections, the scheme exhibits optimal convergence rates for any $\vartheta \in [0,1]$. The extension of the analysis to the case $\vartheta \ne 1$ is under investigation and will be the subject of future research.

Lemma 1

(Discrete entropy inequality [43]) Let $\varepsilon >0$ and let $\lambda ^{k+1}_h\in W_{h,p}^\textrm{DG}$ be the solution to (24) with a sufficiently large $\eta _0$. Then,

$$\begin{aligned} S_h^{k+1} + C_0\Delta t \int _\Omega \left| e^{\lambda _h^{k+1}/2}-\dfrac{1}{|\Omega |}\int _\Omega e^{\lambda _h^{k+1}/2} \right| ^2 +\Delta t \int _\Omega e^{\lambda _h^{k+1}}\left( e^{\lambda _h^{k+1}}-1\right) \lambda _h^{k+1} \le S_h^{k}, \nonumber \\ \end{aligned}$$

(30)

where the constant $C_0>0$ only depends on the constants of inverse trace inequality in Eq. (7) and on the Poincaré-Wirtinger inequality [40].

Proposition 4

Let Assumptions 1 and 2 hold and let $\eta _0$ defined as in Eq. (4) be chosen sufficiently large. Let $\lambda _h^{k+1}$ be the solution to problem (24) with $\vartheta =1$, $\varepsilon >0$ and homogeneous forcing term $f=0$. Then:

$$\begin{aligned} \left\| e^{\lambda ^{k+1}_h/2}\right\| _\textrm{DG}^2 \le \dfrac{ 2 S_h^0}{\Delta t}, \end{aligned}$$

(31)

where $S_h^0$ is the initial discrete entropy.

Proof

Let us consider the problem (24) with $\vartheta =1$ and $\varphi _h = \lambda _h^{k+1}$:

$$\begin{aligned} \begin{aligned} \Delta t&\left( \alpha e^{\lambda _h^{k+1}} \left( e^{\lambda _h^{k+1}}-1\right) ,\lambda _h^{k+1}\right) _\Omega + \varepsilon \Vert \lambda _h^{k+1}\Vert ^2 + \varepsilon \Vert \lambda _h^{k+1}\Vert _\textrm{DG}^2 \\ +&\Delta t\mathscr {A}(\lambda _h^{k+1};\lambda _h^{k+1},\lambda _h^{k+1}) = \left( e^{\lambda _h^{k}}-e^{\lambda _h^{k+1}},\lambda _h^{k+1}\right) _\Omega . \end{aligned} \end{aligned}$$

By observing that $e^v \left( e^v-1\right) v\ge 0$ for each $v\in W_{h,p}^\textrm{DG}$ and by using Eq. (10), we obtain:

$$\begin{aligned} \frac{\Delta t}{2} \Vert e^{\lambda _h^{k+1}/2}\Vert _\textrm{DG}^2 \le \left( e^{\lambda _h^{k}}-e^{\lambda _h^{k+1}},\lambda _h^{k+1}\right) _\Omega . \end{aligned}$$

Exploiting the convexity of the density of entropy function $s(v) = v(\log (v)-1)+1$ and noticing that $v = s'(v)$, we obtain:

$$\begin{aligned} \frac{\Delta t}{2} \Vert e^{\lambda _h^{k+1}/2}\Vert _\textrm{DG}^2 \le S_h^k - S_h^{k+1} \le S_h^k \le S_h^0, \end{aligned}$$

(32)

where in the last step we used the discrete entropy inequality in Lemma 1. From Eq. (32), the thesis follows. $\square $

Theorem 1

(Convergence) Let Assumptions 1 and 2 hold and let $\eta _0$ be sufficiently large. Let $\varepsilon > 0$, $\vartheta =1$, $\Delta t \in (0,1)$, and let $\lambda _h^{k+1}\in W_{h,p}^\textrm{DG}$ be a solution to (24) with homogeneous forcing term $f=0$. Assume that $\lambda _h^k\in W_{h,p}^\textrm{DG}$ is such that $e^{\lambda _h^k}\rightarrow c^{k}$ strongly in $L^2(\Omega )$ as $(\varepsilon ,h)\rightarrow 0$. Then there exists a unique strong solution $c^{k+1}\in H^2(\Omega )$ to:

$$\begin{aligned} {\left\{ \begin{array}{ll} \dfrac{c^{k+1}-c^{k}}{\Delta t} =\nabla \cdot (\textbf{D} \nabla \, c^{k+1}) + \alpha \,c^{k+1}(1-c^{k+1}), &{} \textrm{in}\,\Omega ,\\ c^{k+1} = c_\textrm{D} = e^{\lambda _\textrm{D}}, &{} \textrm{on}\,\Gamma _D,\\ (\textbf{D} \nabla c^{k+1})\cdot \varvec{n} = 0, &{} \textrm{on}\,\Gamma _N,\\ \end{array}\right. } \end{aligned}$$

(33)

such that $e^{\lambda _h^{k+1}}\rightarrow c^{k+1}$ strongly in $L^2(\Omega )$ as $(\varepsilon ,h)\rightarrow 0$.

The proof follows the same steps as in [43] and it makes use of Propositions 1 and 4, as well as of the extensions of variational inequalities valid for polygonal/polyhedral meshes.

4 Numerical Results: Verification

In this section, we aim at verifying the accuracy of the method presented in section 3.

4.1 Test Case 1: Convergence Analysis in Two Dimensions

For the numerical tests in this section, we use Lymph library [34] to solve the FK equation $(d=2)$. We define the domain $\Omega =(0,1)^2$, which we discretize by means of a polygonal mesh obtained by using PolyMesher [37]. Concerning the time discretization, we use a timestep $\Delta t = 10^{-6}$ and the final time $T=2\times 10^{-5}$. We consider the following manufactured exact solution:

$$\begin{aligned} \lambda (x,y,t) = \log \left( (\cos (\pi x)\cos (\pi y)+2) e^{-t}\right) . \end{aligned}$$

(34)

We fix the physical parameters as follows: $\textbf{D}=\textbf{I}$ and $\alpha =0.1$. The forcing term and the Dirichlet boundary conditions are derived accordingly. To solve the resulting nonlinear system we adopt a Newton method with tolerance equal to $\epsilon = 10^{-10}$.

In Fig. 1, we report the computed errors in both the DG and $L^2$ norms at the final time. We have performed the convergence test keeping fixed the polynomial order of the space approximation $p_K=p=1,...,6$ $\forall K \in \mathscr {T}_h$ and using different mesh refinements $(N_\textrm{el}= 30,100,300,1000)$. It can be observed that the slope of error decrease is equal to the polynomial degree p for the DG-norm and equal to $p+1$ for the $L^2$-norm.

In Fig. 2a, we report the computed errors with respect to the timestep $\Delta t$, using both Crank-Nicolson $(\vartheta =0.5)$ and the Implicit Euler $(\vartheta =1)$ schemes. The space discretization is computed on a mesh of $N_\textrm{el}=1000$ elements and with polynomial degree $p=6$. As expected, the use of the Crank-Nicolson method leads to a second-order convergence whereas the error decays with a first-order rate if the implicit Euler scheme is employed. We remark that the case $\vartheta =1$ is fully covered by our theoretical analysis, whereas the proof of convergence for $\vartheta =1/2$ is under investigation.

A convergence analysis with respect to the polynomial order p is also performed on a coarse mesh of 30 elements and with a time integration based on the Crank-Nicolson scheme with timestep $\Delta t = 10^{-6}$. The results are reported in Fig. 2b, where can observe exponential convergence can be observed.

4.2 Test Case 2: Travelling Waves in Two Dimensions

In this section, we exploit the positivity-preserving PolyDG formulation to simulate a traveling-wave solution of the FK equation in 2D, with the aim of comparing the formulation we propose in this article, with the (non-positivity-preserving) scheme introduced in [13]. The manufactured solution is of the form:

$$\begin{aligned} e^{\lambda (x,y,t)} = c(x,y,t) = \psi (x-vt) = \psi (\xi ). \end{aligned}$$

(35)

By substituting it in Eq. (1) with $f=0$, we obtain the following equivalent system of ordinary differential equations:

$$\begin{aligned} {\left\{ \begin{array}{ll} \chi '(\xi ) = -\dfrac{v}{d}\chi (\xi ) + \dfrac{1}{d}\psi (\xi )(\psi (\xi )-1) &{} \xi \in (0,T), \\ \psi '(\xi ) = \chi (\xi ) &{} \xi \in (0,T), \\ \end{array}\right. } \end{aligned}$$

(36)

where we have used the assumption of isotropic diffusion tensor $\textbf{D}=d\textbf{I}$. In particular, we fix $d=10^{-3}$, $\alpha =1$ and penalty parameter $\eta _0=1$. Concerning the wave’s parameters we take the speed $v=0.1$ and the initial data $\psi (0)=1$ and $\chi (0)=-10^{-2}$. We consider a rectangle $\Omega =(0, 5)\times (0, 1)$ as domain. We present the results of two simulations, with different final times $T=5$ and $T=10$ and timestep $\Delta t = 10^{-2}$. Concerning the nonlinear Newton solver, we fix a tolerance $\epsilon =10^{-6}$.

Table 1 Comparison of the computed errors in the DG-norm based on employing the proposed positivity-preserving scheme and the DG method of [13], for different polynomial degrees, different mesh sizes, and different final times

Full size table

Table 2 Comparison of the computed errors in the DG-norm based on employing the proposed positivity-preserving scheme and the DG method of [13], for different polynomial degrees, different mesh sizes, and different final times

Full size table

In Tables 1 and 2, we report the computed errors in the $L^2$ and DG norms computed at the final times $T=5$ and $T=10$, respectively. In particular, we compare the results obtained by using our positivity-preserving method (24) and the DG method proposed in [13], with a semi-implicit treatment of the nonlinearity and a penalty parameter $\eta =10$. We can observe that, also using low order polynomials $(p=1)$, our method can correctly represent the wave propagation front and leads to smaller errors (one order of magnitude). On the contrary, the method in [13] fails to correctly simulate the wavefront because it does not preserve the positivity of the solution and the equilibrium $c=0$ is unstable.

Moreover, from the results of Tables 1 and 2, we can observe for $p=4$ and $T=10$ that the proposed positivity-preserving scheme does not lead to a reduction of the error compared with the results obtained for $p=3$. Indeed, we can observe in Fig. 3 that for $p=3$ we have the formation of some small oscillations around the equilibrium $c=1$. This is probably due to Newton’s iterations that might be badly conditioned for large values of polynomial degrees. The effect of this problem cannot be observed in the method of [13], but in this case, the positivity of the solution cannot be guaranteed.

In Fig. 4, we report the values of $H=\eta (\lambda _h)/\zeta $ computed on the mesh skeleton for the case of 100 mesh elements and $p=1,2,3,4$. We observe that, in most of the domain, the values assumed by H are small. The largest penalization is applied near the wavefront to stabilize the most delicate region of the physical phenomenon at each time step. This follows from definition (3); indeed near the wavefront the value $e^{\Vert \lambda _h\Vert _{L^\infty (K)}}$ can assume large values on one element while $e^{\lambda _h}$ can be much smaller. We notice that where the solution is near 0, the penalty parameter assumes much smaller values if compared to the $\eta $ to assume the same order of magnitude of the other integrals of $\mathscr {A}$. Moreover, in the right column of Fig. 4, we show the boxplot associated with the mean values of H on all the edges for both the meshes analyzed in the test case. With only 30 elements, the outlier values of the penalty can reach values higher than $10^3$. This fact is coherent because the larger area of the elements reflects the large difference between the maximum and minimum values reached by the solution in a single element. This impacts the values of the penalty, which are particularly high. It can also be noticed that the fine mesh is associated with a smaller average value of the penalty. Finally, we do not notice significant differences changing the value of p.

5 Numerical Results: Brain Applications

In this section, we present the numerical results obtained in two test cases: a two-dimensional simulation of a sagittal section of a brain and a three-dimensional simulation of brain geometries reconstructed from Magnetic Resonance Images (MRI).

In the prions’ spreading applications, the diffusion tensor is typically modelled as the superimposition of an extracellular diffusion effect with magnitude $d_\textrm{ext}$ and an axonal diffusion with magnitude $d_\textrm{axn}$ [38]; for this reason, in this section, we assume that $\textbf{D}$ has the following structure:

$$\begin{aligned} \textbf{D} = d_\textrm{ext}\textbf{I} + d_\textrm{axn}(\varvec{n}\otimes \varvec{n}), \end{aligned}$$

(37)

where $\varvec{n}=\varvec{n}(\varvec{x})$ is the axonal fibres direction in the point $\varvec{x}\in \Omega $ and $d_\textrm{ext}, d_\textrm{axn} \ge 0$. The axonal direction is derived from Diffusion Weighted Imaging (DWI) and represents the principal orientation of the connections between the neurons (axons). Most of the spreading of the prions seems to happen through the axons [38], however, due to the brain structure, this is true only in white matter, while in grey matter, the diffusion can be considered to be isotropic.

In order to construct the axonal component of the diffusion tensor $\textbf{D}$, we derive the diffusion tensor from DWI medical images by using Freesurfer and Nibabel [1]. The principal eigenvector $\varvec{n}$ of the tensor is then computed elementwise to find the diffusion tensor in Eq. (37). We refer to [5] for more details on the reconstruction of $\textbf{D}$ starting from medical images. Concerning the forcing term we fix $f=0$ and we impose homogeneous Neumann boundary conditions in both test cases.

Concerning test case 3 in Sect. 5.1, we simulate the spreading of $\alpha $-Synuclein in Parkinson’s disease in a two-dimensional brain section. The simulation starts with a concentration of the misfolded proteins only at the base of the brainstem, so an initial stage of the pathology, and it requires many years ($\simeq 25$ years) of development. On the contrary, test case 4 in Sect. 5.2, refers to Alzheimers’s disease in a three-dimensional brain. The initial concentration is diffused and derived from a Positron Emission Tomography (PET) image of an 83 years old patient with advanced pathological symptoms.

5.1 Test Case 3: Spreading of $\alpha $-Synuclein in a Two-Dimensional Brain Section

In this section, we address a numerical simulation of the spreading of $\alpha $-Synuclein in Parkinson’s disease on a polygonal agglomerated grid of a sagittal 2D brain section. The geometry is segmented from a structural MRI of a brain from the OASIS-3 database [14] by means of Freesurfer [16]. The construction of the final mesh of a slice of the brain is performed by using VMTK [15]. The resulting triangular mesh is composed of $43\,402$ triangles, and each element of the mesh is labelled to be in white or grey matter, according to the MRI segmentation, as in Fig. 5a. However, the generality of the PolyDG method allows us to use mesh elements of any shape and the use of a smaller number of elements allows saving computational cost. For this reason, by using ParMETIS [11], we agglomerate the initial triangular mesh into a polygonal mesh of 534 elements, as shown in Fig. 5b. In particular, the agglomeration procedure is performed in a segregated way for the white and the grey matter, in this way we are sure to correctly describe both the domain boundary and the interface between grey/white matters. Finally, in Fig. 5c, we report the axonal directions computed in the white matter starting from DWI.

Concerning the physical parameters, we fix the reaction coefficient $\alpha = 0.45/\textrm{year}$ in grey matter, and $\alpha = 0.9/\textrm{year}$ in white matter [8]. Moreover, we impose a constant isotropic diffusion $d_\textrm{ext} = 8\,\textrm{mm}^2/\textrm{year}$, and axonal diffusion which is 10 times faster than the isotropic one in the white matter ($d_\textrm{axn} = 80\,\textrm{mm}^2/\textrm{year}$) and is negligible in the grey matter ($d_\textrm{axn} = 0\,\textrm{mm}^2/\textrm{year}$) [8]. In this simulation, we fix $\Delta t=0.01\,\textrm{years}$ and $p=1$, moreover the penalty parameter $\eta _0=1$.

The simulation of $\alpha $-Synuclein diffusion in Parkinson’s disease starts from an initial condition, with concentration located in the dorsal motor nucleus [9]. In Fig. 6, we report both the initial condition and the computed solution at different times $t=0,5,10,15,20,25$ years. First of all, it can be observed that the directions of protein propagations are coherent with the medical literature [10]. Indeed, the activation of brain regions follows the Braak staging theory [9]. Moreover, we can notice that the heterogeneity of the reaction parameters causes an earlier activation of the white matter in general, which is clearly visible in the frontal cortex at time $t=20$. By making a comparison with the literature results of [13], we have that the reduced reactivity and diffusion inside grey matter causes a slowing of the disease progression times, starting with the same initial condition and an agglomerated mesh with comparable refinement level.

In Fig. 7a, we report the average concentration of misfolded protein $\overline{e^{\lambda _h(t)}}$ inside the brain with respect to the time t. Moreover, we compute the average concentrations in white and grey matter separately. As we can observe, in the first years, the increase in the concentration is almost equivalent for the two regions, after 14 years we have a clear distinction. In particular, the higher reactivity and diffusion of the white matter tissue causes a faster increase in the concentration. Moreover, we compute the activation time of the pathology as:

$$\begin{aligned} \hat{t}(\varvec{x},t) = \chi _{\{e^{\lambda _h(\varvec{x},t)}>c_\textrm{crit}\}} (\varvec{x},t) \qquad \varvec{x}\in \Omega \quad t\in [0,T], \end{aligned}$$

(38)

where $\chi $ is the indicator function and $c_\textrm{crit}=0.95$ is the critical value of $\alpha $-Synuclein concentration. We report the computed activation time in Fig. 7b. From a pathological perspective, high concentrations of $\alpha $-Synuclein alter the electric signal transport. The indicator (38) measures the time after which a region of the brain can be affected by pathological electric stimuli. The result is qualitatively similar to the literature results [13, 38]. Comparing the result with respect to [13], we can notice a longer activation time, due to the reduced reactivity and diffusion in grey matter, introduced in this work.

5.2 Test Case 4: Spreading of Amyloid-$\beta $ in a Three-Dimensional Brain Geometry

In this section, we present a numerical simulation of the spreading of the Amyloid-$\beta $ on a three-dimensional domain, reconstructed starting from an MRI taken from OASIS-3 database [14]. The medical images are associated with an 83-year-old patient, who is diagnosed to be affected by Alzheimer’s disease at the moment of the acquisition. The geometry is segmented by means of Freesurfer [16] and then is used to construct a mesh grid of 323’014 tetrahedral elements, using SVMTK library [5]. The resulting mesh is reported in Fig. 8a. The problem is solved with the use of a FEniCS code [7] (version 2019).

Concerning the parameters of the model, in this test case, for simplicity, we do not make any distinction between white and grey matters, choosing $\alpha = 0.9/\textrm{year}$, $d_\textrm{ext} = 8\,\textrm{mm}^3/\textrm{year}$, and $d_\textrm{axn} = 80\,\textrm{mm}^3/\textrm{year}$ [8, 13].

To set up the initial condition for the FK problem in a patient-specific setting, we estimate the function $\lambda _0(\varvec{x})$ of Amyloid-$\beta $ protein at the initial time $t=0$. To do that, we project the clinical data derived from PET images with Pittsburgh compound B (PET-PiB) [23]. The PET-PiB adopts a radioligand, which identifies the presence of Amyloid-$\beta $ plaques inside the brain parenchyma (for the specifics about the acquisition techniques of the image used in this work we refer to [14]). We report the result of the initial concentration rescaled between 0 and 1 and projected on the mesh grid in Fig. 8b. In particular, we can observe the presence of large damaged regions $(c\simeq 1)$ in the brainstem and in the thalamus.

Starting from pathology in an advanced state, we set up a simulation with a final time $T=2$ years and a timestep $\Delta t = 0.01$ years. Concerning the space discretization we adopt the DG method for $p=1$. The nonlinear solver for the resulting system is based on the relaxed Newton method with absolute tolerance equal to $10^{-10}$ and relaxation parameter $\omega =0.75$.

The results are reported in Fig. 9 for different times ($t=0,0.5,1,1.5,2$ years). The solution is visualized on many slices inside the brain geometry on the three different planes: horizontal, coronal and sagittal. The results show a propagation of the Amyloid-$\beta $ concentration inside the parenchyma, following the typical paths of the pathology [23]. In particular, we can observe a late activation of the cerebellum in the slice along the coronal plane of Fig. 9 (middle line). This is coherent with Braak’s stages of Alzheimer’s pathology, which show the presence of Amyloid-$\beta $ accumulation only in the last stages of the pathological development [26]. Moreover, coherently to the clinical stage of the pathology we are simulating (due to the presence of evident symptoms from the patient’s documentation), we can find a generalised misfolding after a few years from the PET acquisition and this is also coherent with what we expected in the disease evolution [23].

6 Conclusions

In this work, we have proposed a positivity-preserving DG method on polygonal and polyhedral grids for the solution of the FK model. The main applicative motivation is the modelling of neurodegeneration caused by the spreading of prionic proteins, such as $\alpha $-synuclein protein in Parkinson’s disease and amyloid-$\beta $ in Alzheimer’s disease. We have analyzed the existence of the discrete solution by means of the Leray-Schauder theorem and we have discussed the convergence of the numerical scheme.

Numerical tests have been presented both in two and three dimensions. In particular, we have analyzed the convergence in space both with respect to the mesh size and the polynomial order of the method on polygonal grids. Then, we have discussed the convergence in time, by making a comparison between implicit Euler and Cranck-Nicolson schemes. Finally, we have performed a numerical simulation to test the capabilities of the proposed formulation to approximate propagating wavefronts in two dimensions. In this test, we have compared the proposed positivity-preserving method with the polyDG method introduced in [13], highlighting the advantages and disadvantages of both formulations.

Finally, we have presented two applications of the proposed scheme in the framework of neurodegenerative diseases. In particuarl, we have performed a simulation of $\alpha $-synuclein spreading on a slice of a real brain in the sagittal plane, constructing a polygonal agglomerated mesh that preserves the quality of both domain boundaries and the interface between white matter and grey matter. Moreover, starting from initial amyloid-$\beta $ concentrations derived from PET images, we have simulated the spreading of amyloid-$\beta $ in a three-dimensional brain in a patient-specific Alzheimer’s disease setting. The results obtained in both patient-specific settings are coherent with the clinical literature, showing that the proposed approach is a valuable instrument that can be employed for patient-specific computed-assisted simulations of the evolution of Parkinson’s and Alzheimer’s neurodegenerative disorders.

A possible future development of this work consists in extending the convergence analysis to the general $\vartheta $-method, by proving a discrete entropy decay. Another possibility can be the use of PET images at different times of the disease to calibrate the physical parameters of the Fisher-Kolmogorov model, for example by means of inverse uncertainty quantification methods.

Data Availability

Enquiries about data availability should be directed to the authors.

References

Brett, M., Markiewicz, C., Hanke, M., Côté, M., Cipollini, B., McCarthy, P., Cheng, C.: NiBabel 4.0.0: access a cacophony of neuro-imaging file formats. (2022) https://github.com/nipy/nibabel
Brennan, G., Thompson, T., Oliveri, H., Rognes, M., Goriely, A.: The role of clearance in neurodegenerative diseases. J. SIAM Appl. Math. (2023). https://doi.org/10.1137/22M1487801
Article Google Scholar
Ringstad, G., Valnes, L., Dale, A., Pripp, A., Vatnehol, S., Emblem, K., Mardal, K., Eide, P.: Brain-wide glymphatic enhancement and clearance in humans assessed with MRI. JCI Insight. 3, e121537 (2018)
Article Google Scholar
Hornkjøl, M., Valnes, L., Ringstad, G., Rognes, M., Eide, P., Mardal, K., Vinje, V.: CSF circulation and dispersion yield rapid clearance from intracranial compartments. Front. Bioeng. Biotechnol. 10, 1–14 (2022)
Article Google Scholar
Mardal, K., Rognes, M., Thompson, T., Magnus Valnes, L.: Mathematical modeling of the human brain - from magnetic resonance images to finite element simulation. (Springer, 2021)
Cangiani, A., Dong, Z., Georgoulis, E.: hp-version discontinuous Galerkin methods on essentially arbitrarily-shaped elements. Math. Comput. 91, 1–35 (2022)
Article MathSciNet Google Scholar
Alnaes, M., Blechta, J., Hake, J., Johansson, A., Kehlet, B., Logg, A., Richardson, C., Ring, J., Rognes, M., Wells, G.: The FEniCS project version 15. Arch. Numer. Softw. 3(100), 9–23 (2015)
Google Scholar
Schäfer, A., Weickenmeier, J., Kuhl, E.: The interplay of biochemical and biomechanical degeneration in alzheimer’s disease. Comput. Methods Appl. Mech. Eng. 352, 369–388 (2019)
Article MathSciNet Google Scholar
Braak, H., Tredici, K., Rüb, U., De Vos, R., Jansen Steur, E., Braak, E.: Staging of brain pathology related to sporadic parkinson’s disease. Neurobiol. Aging 24, 197–211 (2003)
Article Google Scholar
Goedert, M.: Alzheimer’s and Parkinson’s diseases: the prion concept in relation to assembled A$\beta $, tau, and $\alpha $-synuclein. Science 349, 6248 (2015)
Article Google Scholar
Karypis, G., Schloegel, K., Kumar, V.: Parmetis. Parallel graph partitioning and sparse matrix ordering library, Version 2. (2003)
Engwer, C., Wenske, M.: Estimating the extent of glioblastoma invasion. J. Math. Biol. 82, 10 (2021)
Article Google Scholar
Corti, M., Bonizzoni, F., Dede’, L., Quarteroni, A., Antonietti, P.: Discontinuous Galerkin methods for Fisher-Kolmogorov equation with application to $\alpha $-synuclein spreading in Parkinson’s disease. Comput. Method. Appl. Mech. Eng. 417, 116450 (2023)
Article MathSciNet Google Scholar
LaMontagne, P., Benzinger, T., Morris, J., Keefe, S., Hornbeck, R., Xiong, C., Grant, E., Hassenstab, J., Moulder, K., Vlassenko, A., Raichle, M., Cruchaga, C., Marcus, D.: OASIS-3: Longitudinal Neuroimaging, Clinical, and Cognitive Dataset for Normal Aging and Alzheimer Disease. MedRxiv. (2019)
Antiga, L., Piccinelli, M., Botti, L., Ene-Iordache, B., Remuzzi, A., Steinman, D.: An image-based modeling framework for patient-specific computational hemodynamics. Med. Biol. Eng. Comput. 46, 1097–1112 (2008)
Article Google Scholar
Dale, A., Fischl, B., Sereno, M.: Cortical surface-based analysis: I. segmentation and surface reconstruction. Neuroimage 9, 179–194 (1999)
Article Google Scholar
Gortsas, T., Tsinopoulos, S., Polyzos, D.: A local domain boundary element method for solving the nonlinear fisher KPP diffusion-reaction equation. Eng. Anal. Boundary Elem. 138, 177–188 (2022)
Article MathSciNet Google Scholar
Fisher, R.: The wave of advance of advantageous genes. Ann. Eugen. 7, 353–369 (1937)
Article Google Scholar
Kolmogorov, A., Petrovskii, I., Piskunov, N.: Etude de la diffusion avec croissance de la quantité de matière et son application à un problème biologique. Mosc. Univ. Math. Bull. 1, 1–25 (1937)
Google Scholar
Fornari, S., Schäfer, A., Jucker, M., Goriely, A., Kuhl, E.: Prion-like spreading of Alzheimer’s disease within the brain’s connectome. J. Royal Soc. Interface 16, 20190356 (2019)
Article Google Scholar
Dong, Z., Georgoulis, E.H.: Robust interior penalty discontinuous Galerkin methods. J. Sci. Comput. 92(2), 57 (2022)
Article MathSciNet Google Scholar
Ern, A., Stephansen, A.F., Zunino, P.: A discontinuous Galerkin method with weighted averages for advection-diffusion equations with locally small and anisotropic diffusivity. IMA J. Numer. Anal. 29(2), 235–256 (2009)
Article MathSciNet Google Scholar
Van Oostveen, W., De Lange, E.: Imaging techniques in Alzheimer’s disease: a review of applications in early diagnosis and longitudinal monitoring. Int. J. Mol. Sci. 22, 2110 (2021)
Article Google Scholar
Korat, S., Bidesi, N., Bonanno, F., Nanni, A., Hoàng, A., Herfert, K., Maurer, A., Battisti, U., Bowden, G., Thonon, D., Vugts, D., Windhorst, A., Herth, M.: Alpha-synuclein PET tracer development-an overview about current efforts. Pharmaceuticals 14, 847 (2021)
Article Google Scholar
Bloom, G.: Amyloid-$\beta $ and tau: the trigger and bullet in Alzheimer disease pathogenesis. JAMA Neurol. 71, 505–508 (2014)
Article Google Scholar
Koychev, I., Hofer, M., Friedman, N.: Correlation of Alzheimer disease neuropathologic staging with amyloid and tau scintigraphic imaging biomarkers. J. Nucl. Med. Oct. 61, 1413–1418 (2020)
Article Google Scholar
Stefanis, L.: $\alpha $-Synuclein in Parkinson’s disease. Cold Spring Harbor Perspective In Medicine 2, a009399 (2012)
Article Google Scholar
Walker, L., Jucker, M.: Neurodegenerative Diseases: Expanding the Prion Concept. Annu. Rev. Neurosci. 38, 87–103 (2015)
Article Google Scholar
Antonietti, P., Farenga, N., Manuzzi, E., Martinelli, G., Saverio, L.: Agglomeration of polygonal grids using graph neural networks with applications to multigrid solvers. Comput. Math. Appl. 154, 45–57 (2024)
Article MathSciNet Google Scholar
Antonietti, P., Facciolà, C., Houston, P., Mazzieri, I., Pennesi, G., Verani, M.: High-order Discontinuous Galerkin methods on polyhedral grids for geophysical applications: seismic wave propagation and fractured reservoir simulations. Polyhedral Method. Geosci. (2021). https://doi.org/10.1007/978-3-030-69363-3_5
Article Google Scholar
Rivière, B., Wheeler, M., Girault, V.: A priori error estimates for finite element methods based on discontinuous approximation spaces for elliptic problems. SIAM J. Numer. Anal. 39, 902–931 (2002)
Article MathSciNet Google Scholar
Warburton, T., Hesthaven, J.S.: On the constants in hp-finite element trace inverse inequalities. Comput. Methods Appl. Mech. Eng. 192(25), 2765–2773 (2003)
Article MathSciNet Google Scholar
Corti, M., Bonizzoni, F., Antonietti, P., Quarteroni, A.: Uncertainty quantification for Fisher-Kolmogorov equation on graphs with application to patient-specific Alzheimer’s disease. Mathematical Modelling And Numerical Analysis. in press, ESAIM (2023)
Book Google Scholar
Antonietti, P., Bonetti, S., Botti, M., Corti, M., Fumagalli, I., Mazzieri, I.: lymph: discontinuous poLYtopal methods for Multi-PHysics differential problems. ArXiv (2024)
Salsa, S.: Partial differential equations in action: from modeling to theory. (Springer,2016)
Macías-Díaz, J., Puri, A.: An explicit positivity-preserving finite-difference scheme for the classical Fisher-Kolmogorov-Petrovsky-Piscounov equation. Appl. Math. Comput. 218, 5829–5837 (2012)
Google Scholar
Talischi, C., Paulino, G., Pereira, A., Menezes, I.: PolyMesher: a general-purpose mesh generator for polygonal elements written in Matlab. Struct. Multidiscip. Optim. 45, 309–328 (2012)
Article MathSciNet Google Scholar
Weickenmeier, J., Jucker, M., Goriely, A., Kuhl, E.: A physics-based model explains the prion-like features of neurodegeneration in Alzheimer’s disease, Parkinson’s disease, and amyotrophic lateral sclerosis. J. Mech. Phys. Solids. 124, 264–281 (2019)
Article Google Scholar
Quarteroni, A.: Numerical Models for Differential Problems. (Springer,2017)
Di Pietro, D., Droniou, J.: The Hybrid High-Order Method for Polytopal Meshes: Design, Analysis, and Applications. (Springer,2020)
Jüngel, A.: Entropy methods for diffusive partial differential equations. (Springer,2016)
Arnold, D., Brezzi, F., Cockburn, B., Marini, D.: Unified analysis of discontinuous Galerkin methods for elliptic problems. SIAM J. Numer. Anal. 39, 1749–1779 (2002)
Article MathSciNet Google Scholar
Bonizzoni, F., Braukhoff, M., Jüngel, A., Perugia, I.: A structure-preserving discontinuous Galerkin scheme for the Fisher-KPP equation. Numer. Math. 146, 119–157 (2020)
Article MathSciNet Google Scholar
Dhaouadi, F., Dumbser, M.: A structure-preserving finite volume scheme for a hyperbolic reformulation of the Navier-Stokes-Korteweg equations. Mathematics 11, 876 (2023)
Article Google Scholar
Cancès, C., Guichard, C.: Numerical analysis of a robust free energy diminishing finite volume scheme for parabolic equations with gradient structure. Found. Comput. Math. 17, 1525–1584 (2017)
Article MathSciNet Google Scholar
Lemaire, S., Moatti, J.: Structure preservation in high-order hybrid discretisations of potential-driven advection-diffusion: linear and nonlinear approaches. Math. Eng. 6, 100–136 (2024)
Article MathSciNet Google Scholar
Ern, A., Guermond, J.: Invariant-domain-preserving high-order time stepping: I. explicit Runge-Kutta schemes. SIAM J. Sci. Comput. 44, A3366–A3392 (2022)
Article MathSciNet Google Scholar
Ern, A., Guermond, J., Wang, Z.: Asymptotic and invariant-domain preserving schemes for scalar conservation equations with stiff source terms and multiple equilibrium points. ArXiv (2023)

Download references

Acknowledgements

The brain MRI images were provided by OASIS-3: Longitudinal Multimodal Neuroimaging: Principal Investigators: T. Benzinger, D. Marcus, J. Morris; NIH P30 AG066444, P50 AG00561, P30 NS09857781, P01 AG026276, P01 AG003991, R01 AG043434, UL1 TR000448, R01 EB009352. AV-45 doses were provided by Avid Radiopharmaceuticals, a wholly-owned subsidiary of Eli Lilly.

Funding

Open access funding provided by Politecnico di Milano within the CRUI-CARE Agreement. This research has been funded by the European Union (ERC, NEMESIS, project number 101115663). Views and opinions expressed are however those of the author(s) only and do not necessarily reflect those of the European Union or the European Research Council Executive Agency. PFA has been partially funded by PRIN2020 n. 20204LN5N5 “Advanced polyhedral discretisations of heterogeneous PDEs for multiphysics problems” research grant, funded by the Italian Ministry of Universities and Research (MUR). FB has received support from the project PRIN2022, MUR, Italy, 2023–2025, P2022N5ZNP “SIDDMs: shape-informed data-driven models for parametrized PDEs, with application to computational cardiology”. FB is partially funded by “INdAM - GNCS Project”, codice CUP E53C22001930001. The present research is part of the activities of “Dipartimento di Eccellenza 2023-2027”, MUR, Italy, Dipartimento di Matematica, Politecnico di Milano. The authors are members of INdAM-GNCS.

Author information

Authors and Affiliations

MOX-Dipartimento di Matematica, Politecnico di Milano, Piazza Leonardo da Vinci 32, 20133, Milan, Italy
Mattia Corti, Francesca Bonizzoni & Paola F. Antonietti

Authors

Mattia Corti
View author publications
You can also search for this author in PubMed Google Scholar
Francesca Bonizzoni
View author publications
You can also search for this author in PubMed Google Scholar
Paola F. Antonietti
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Mattia Corti.

Ethics declarations

Conflict of interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this article.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Corti, M., Bonizzoni, F. & Antonietti, P.F. Structure Preserving Polytopal Discontinuous Galerkin Methods for the Numerical Modeling of Neurodegenerative Diseases. J Sci Comput 100, 39 (2024). https://doi.org/10.1007/s10915-024-02581-7

Download citation

Received: 27 January 2024
Revised: 04 May 2024
Accepted: 21 May 2024
Published: 20 June 2024
DOI: https://doi.org/10.1007/s10915-024-02581-7

Keywords

Mathematics Subject Classification

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Structure Preserving Polytopal Discontinuous Galerkin Methods for the Numerical Modeling of Neurodegenerative Diseases

Abstract

Similar content being viewed by others

From a Microscopic to a Macroscopic Model for Alzheimer Disease: Two-Scale Homogenization of the Smoluchowski Equation in Perforated Domains

Smoluchowski Equation with Variable Coefficients in Perforated Domains: Homogenization and Applications to Mathematical Models in Medicine

Stability in distribution for a stochastic Alzheimer’s disease model with reaction diffusion

1 Introduction

2 The Mathematical Model

Assumption 1

3 Numerical Discretization

3.1 Discrete Setting and Preliminary Estimates

Assumption 2

Remark 1

Remark 2

3.2 PolyDG Semi-Discrete Formulation

Remark 3

Remark 4

Proposition 1

Proof

Proposition 2

Proof

3.3 Fully Discrete Formulation

Proposition 3

Proof

3.3.1 Step 1: Definition of the Operator \(\Phi \)

3.3.2 Step 2: Compactness of \(\Phi \)

3.3.3 Step 3: Uniform Bound for all the Fixed Points

3.4 Convergence of the Discrete Solution

Remark 5

Lemma 1

Proposition 4

Proof

Theorem 1

4 Numerical Results: Verification

4.1 Test Case 1: Convergence Analysis in Two Dimensions

4.2 Test Case 2: Travelling Waves in Two Dimensions

5 Numerical Results: Brain Applications

5.1 Test Case 3: Spreading of \(\alpha \)-Synuclein in a Two-Dimensional Brain Section

5.2 Test Case 4: Spreading of Amyloid-\(\beta \) in a Three-Dimensional Brain Geometry

6 Conclusions

Data Availability

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Mathematics Subject Classification

Search

Navigation