Abstract
Confronted with the biological problem of managing plasticity in cell populations, which is in particular responsible for transient and reversible drug resistance in cancer, we propose a rationale consisting of an integro-differential and a reaction-advection-diffusion equation, the properties of which are studied theoretically and numerically. By using a constructive finite volume method, we show the existence and uniqueness of a weak solution and illustrate by numerical approximations and their simulations the capacity of the model to exhibit divergence of traits. This feature may be theoretically interpreted as describing a physiological step towards multicellularity in animal evolution and, closer to present-day clinical challenges in oncology, as a possible representation of bet hedging in cancer cell populations.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction: biological background
One of the main theories explaining the origins of cancer states that the hallmark capabilities of cancer are based on latent functions already existing in the genome of normal human cells, and that cancer represents a reversion to a less differentiated and less cooperative cellular behavior. This theory is usually called the atavistic model of cancer (Davies and Lineweaver 2011). It is opposed to the more commonly admitted somatic mutation theory (Lineweaver and Davies 2020), that states that cancer originated in a single cell, following a catastrophic sequence of stochastic tumorigenic mutations, and from which Darwinian selection produced by divisions from this unicellular basis a more or less organised society of cheating cells, more and more escaping controls by the host organism, i.e., a cancer. In the atavistic theory, accompanied or induced by blockade of differentiation or reverse differentiation of normally maturing cells, societies of cells in a multicellular organism (cancer is always a disease of multicellular organisms) somehow, in some location of the organism, escape the fine control under which they are normally placed and revert to a previous, coarse and disorganised state of multicellularity (Davies and Lineweaver 2011). This may be understood as a process of “deDarwinisation”, through which cancer cells gain a state of plasticity (Clairambault 2020; Shen and Clairambault 2020; Hanahan 2022; Yuan et al. 2019) representative of a former state in the evolution of multicellularity.
The atavistic theory has thus deep connections with the theory of evolution to multicellularity. The passage from unicellular organisms to multicellular ones led to the regulation of capabilities, resulting in controlled proliferation and differentiation of cells leading to specialisation and cooperation between specialised cells. The role of environment-driven cellular stress in this process of specialisation has recently been stressed by various authors (Nedelcu and Michod 2020; Wagner et al. 2019). The new genes responsible for these regulations became tumour suppressors. The atavistic theory states that if these new suppressors become damaged for some reason, then latent genes, associated with functions from unicellular organisms, will reappear and dominate the scenery, thus resulting in the unconstrained proliferation and the lack of cooperation with the other cells of the host organism, as actually found in tumours.
Understanding how evolution led to the emergence of multicellularity then becomes a problem closely related to that of understanding cancer. Firstly because unravelling in detail what is lost (cohesion) and what is gained (plasticity) in this reverse evolutionary process may help understand the nature of cancer from a functional point of view. Secondly because it is reasonable to assume that Darwinian selection in tumours starts from a primitive state of multicellularity in which cells are very plastic with respect to their phenotypes, which sends us back to states in evolution close to the emergence of multicellularity. The division of work through cell differentiation achieved by coherent multicellularity (i.e., designing a stable, cohesive multicellular organism) (Müller 2001) was of vital importance in order to evolve into more complex and more functionally efficient organisms. In a first stage, this differentiation was very likely reversible, due to the high plasticity that cells were endowed with in a primitive state of multicellularity. Under these conditions, one can reasonably assume that these primitive organisms adopted bet hedging strategies, i.e., common risk-diversifying strategies in unpredictably changing and often aggressive environments, in order to maximise their phenotypic fitness (Philippi and Seger 1989; Seger 1987).
In more detail, bet hedging in cell populations may be defined as a diversification of phenotypes in a cohesive (or at least bound by a common fate community, usually of genetic nature) cell population to optimise its fitness, in particular a minima to ensure its survival in a life-threatening environment, in other words to design a fail-safe strategy for the preservation of a propagating element; one cell may be enough to achieve this goal. Enhancing the ability to (quickly) diversify phenotypes by changing differentiation paths (by reversal, i.e., vertical de-differentiation, or by horizontal trandsdifferentiation in a differentiation tree) may in particular be achieved at the chromatin level by means of epigenetic enzymes, molecular instances of cell plasticity at work (Shen and Clairambault 2020).
Among such commonly described strategies of living organisms (unicellular or multicellular) meant to ensure survival in changing environments have been classically described fright, fight and flight. Fright (or freeze) is not likely to induce phenotype evolution. Fight (establishing barriers, secreting poisons, gathering in colonies) and flight (motility to escape unbeatable predators) can. Differentiation between somatic and germinal cells is also a major step in evolution. Bet hedging strategies were not only present at elementary stages of evolution. They are a common adaptive tool that can still be found in nature at different levels of complexity, from prokaryotic organisms to vertebrate ones. In between, tumour cells, thanks to their high plasticity, in the presence of an aggressive environment provided by immune response of the host body or of any anti-cancer treatment, may adopt bet hedging as a strategy to guarantee a prolonged survival of their colony. The wide presence of bet hedging in nature as an evolutionary mechanism, and its many links to the development of cancer is what motivates us in the present attempt towards a mathematical model representing some of the factors that influence this phenomenon (natural selection, epimutations and environmental stress).
In our modelling choices, we have privileged, for the sake of biological interpretation, as in Chisholm et al. (2015), two phenotypes that are often identified as such in theoretical ecology: viability (potential to resist a deadly insult: the elephant strategy) and fecundity (potential to proliferate, or in a way, surviving by numbers, even under hard environmental conditions: the rat strategy). They may influence cell behaviour in opposed directions, as two different choices, incompatible for the same cell, in the same way as fecundity and motility are notably incompatible (cells that divide do not move, and vice versa). To take into account the faculty of rapidly changing phenotypes in case of a life-threatening insult, a capacity reported about many cancer cells (e.g., epithelial to mesenchymal transition or drug-induced drug persistence) (Shen and Clairambault 2020), here we have added plasticity as a complementary structuring phenotype (or trait) of cells. Among questions at stake we will in particular deal with is the optimisation of fitness strategy by concentration of phenotypes, either in two or more ranges surrounding fixed points (multimodality of traits, i.e., bet hedging) or around a central optimal point (unimodality of trait, i.e. no bet hedging when it is not favourable).
2 The model
We consider a population (not necessarily of tumour cells) in which each individual has three defining traits: viability associated with the variable \(x\in [0,1]\) which reflects the potential to resist deadly insults, fecundity associated with the variable \(y\in [0,1]\) representing the potential to proliferate and plasticity associated with the variable \(\theta \in [0,1]\) which represents the potential to continue to differentiate (or de-differentiate, or transdifferentiate, as long as it has not been fixed at its lower bound \(\theta \)=0) within a differentiation tree. We assume furthermore that for a certain regular function \(C:{\mathbb {R}}^2\rightarrow {\mathbb {R}}\) and a positive constant K, \((x,y)\in \Omega :=\{C(x,y)\leqslant K\}\), so \(z=(x,y,\theta )\) ranges over the set \(D:=\{\Omega \times [0,1] \}\). We then consider the evolution problem (1), (2), (3) on the density of population \(n=n(t,z)\geqslant 0\).
In the above equation, chosen for the sake of simplicity as diagonal, the matrix
gives the speed at which non-genetic epimutations occur, otherwise said it is a minimally simple representation of how the internal plasticity trait \(\theta \) affects the non-genetic instability of traits x and y, by tuning the diffusion term; the function
represents the sensitivity, meant here as the force of external evolutionary pressure, of the population to abrupt changes in the environment;
stands for the total amount of individuals in the population at time t.
From a biological point of view, the matrix \(A(\theta )\) represents random epimutations leading to non genetic changes, increasing with plasticity \(\theta \), in the traits viability x and fecundity y, while V stands for an external factor (abrupt biophysical changes in the ecosystem, exposure to life-threatening drugs in the case of cancer cell populations) inducing cellular stress in the population. Throughout the paper, we refer to \(\nabla \cdot (Vn)\) as advection term, or drift term, indistinctly.
We chose the term \(d(z)\rho (t)\) here as the simplest way to represent a death term in the evolution of the population. This multiplicative representation is classic, but most of the results we show in what follows can be extended to more general reaction terms.
Throughout our work we assume
-
(H1)
For some \(p>2\), the initial population density \(n_0(z)\) belongs to \(L^p(D)\).
-
(H2)
The intrinsic growth rate in absence of competition r(z) and the death rate d(z) due to competition for individuals with trait z are positive bounded functions that satisfy \(0<r^-\leqslant r(z)\leqslant r^+\) and \(0<d^-\leqslant d(z)\leqslant d^+\).
-
(H3)
The diffusion parameters \(a_{11}(\theta )\), \(a_{22}(\theta )\) and \(a_{33}\) are strictly positive, with \(a_{11}(\theta )\) and \(a_{22}(\theta )\) being non decreasing with respect to \(\theta \). Hence, the matrix \(A(\theta )\) is elliptic for all values of \(\theta \).
-
(H4)
The function V(t, z) is continuously differentiable for all values of \(t>0\) and \(z\in D\).
Under these hypotheses we are able to prove the existence and uniqueness of a solution for the problem (1–3) using the finite volume method in order to obtain a convergent semi-discrete scheme.
The problem (1–3) underlying hypotheses (H1) to (H4) sets a structured population model of evolution that, as mentioned before, takes into account some of the factors that might lead to the occurrence of “bet hedging”. Amongst the first works in this topic, we can find (Cohen 1966), where is studied the fraction of seeds that germinate and the fraction that remains dormant, in order to maximise the long term expectation of growth. In the same work, the similarities of this phenomenon with economic decision making under risk (so-called “fail-safe strategies”) are noted. Other early works on the subject are Philippi and Seger (1989) and Seger (1987). The reaction term in (1) is a simple way of modelling the selection principle. This term and more general ones are used in Perthame (2006) in order to study some basic properties of structured populations undergoing this type of behaviour. The same reaction term is also used in Pouchol and Trélat (2018), where are analysed the global asymptotic stability properties for integro-differential systems of N species structured by different sets of traits. A similar competition term is used in Desvillettes et al. (2008) to provide results about the long time behaviour of such reaction models. The diffusion term here models non-genetic instabilities (also known as epimutations), which constitute the drift of phenotype without alteration of the genotype. In Perthame (2006) an integral operator in order to model mutations arising during reproduction is used, and something similar could be done for the epimutations. The second order operator used in (1) can be obtained then after re-scaling the time variable. The effect of this phenomenon in cancer development is discussed in Huang (2013) from a biological point of view. Epimutations can also occur because of external stress, and this is represented in (1) by means of the advection terms. A biological example for a population changing phenotype due to external stress can be found in Matuła et al. (2019), where the effect of physical stress on the shape and the cell wall thickness of E.coli bacterias is discussed. Two different models are used in Chisholm et al. (2015) to conclude that the three mechanisms described above might reversibly push an actively-proliferating, and drug-sensitive, cell population to transition into a weakly-proliferative and drug-tolerant state, which will eventually facilitate the emergence of more potent, proliferating and drug-tolerant cells. One of such models is an integro-differential model very similar to (1–3), but without including the effect of plasticity on the evolution of the population and without assuming the existence of a constraint between the traits x and y.
The main results of this paper involve the variational formulation of (1–3), which we now introduce. Denote \(H=L^2(D)\), with \((\cdot ,\cdot )_H\) the usual scalar product in that space, and \({\mathcal {V}}=H^1(D)\) with \(\langle \cdot ,\cdot \rangle =\langle \cdot ,\cdot \rangle _{{\mathcal {V}}'\times {\mathcal {V}}}\) being the duality product in \({\mathcal {V}}\).
For any given \(n_0\in H\), \(T>0\), we say that
is a variational solution of the problem (1–3) if it is a solution in the following weak sense
where
for any \(\varphi \in X_T\). We say that n is a global solution if it is a solution on [0, T] for any \(T>0\).
Theorem 1
For all non-negative \(n_0\in L^p(D)\), \(p>2\), there exists a unique global non-negative weak solution for problem (1–3) in the sense of (4).
We focus on giving a proof for this theorem using a discretised version of problem (1–3) after applying the Finite Volume Method to it. For this purpose, we define a set \(D_h\supset D\), that can be covered by the union of N disjoint cubic cells, denoted as \(D_j\), of side length h. After integrating the Eq. (1) over each of the cells \(D_j\) we derive the system of first-order differential equations
where \(\nu _j\) is an approximation of the average of the solution n(t, z) over \(D_j\), \(N_j\) is the set of indexes corresponding to the neighbours of \(D_j\) and the coefficients \(M_j\) and \(B_{jl}\) are functions of V(t, z), \(A(\theta )\), r(z) and d(z). A full detailed derivation of the scheme is given in Sect. 3. We can then introduce the following result involving the solution for this system:
Theorem 2
For all non-negative \(n_0\in L^p(D)\), \(p>2\), there exists a unique non-negative solution for problem (5–6). Furthermore, the function \({\tilde{n}}_h(t,z)\) defined by
converges in \(L^2(D_T)\) to the unique non-negative weak solution of (1–3) as h goes to zero.
The existence and non-negativity of the solution for (5–6) results from the Cauchy-Lipschitz theorem, while the convergence of \({\tilde{n}}_h(t,z)\) is the consequence of the compactness of the sequence.
The existence result in Theorem 1 will be treated in Sect. 3, but the uniqueness can be directly obtained from the variational formulation with the help of some a posteriori estimates. Let us proceed then to prove the uniqueness before addressing the existence. Assume that there exists a non-negative weak solution n for (1–3). Assume as well that there exist \(0\leqslant t_0<t_1\) such that \(\rho (t)\leqslant \max \{\rho (0),\frac{r^+}{d^-}\}\) for \(t\in [0,t_0]\), and \(\rho (t)>\max \{\rho (0),\frac{r^+}{d^-}\}\) for \(t\in (t_0,t_1)\). We may take on the variational formulation \(\varphi (t,z)=\chi _{\varepsilon }(t)\), where \(\chi _{\varepsilon }(t)\) is a sequence satisfying \(\chi _{\varepsilon }(t)\rightarrow \mathbb {1}_{t>t_0}(t)\) and \(\chi '_{\varepsilon }(t)\rightarrow \delta _{t_0}(t)\). For example, we could consider \(\chi _{\varepsilon }(t)=\frac{1}{2}(1-\cos (\frac{\pi (t-t_0+\varepsilon )}{2\varepsilon }))\mathbb {1}_{t\in [t_0,t_0+\varepsilon ]}+\mathbb {1}_{t>t_0+\varepsilon }\). This leads to the equality
Taking the limit when \(\varepsilon \) goes to 0, we obtain that, for all \(t>t_0\)
where \(\rho (t)=\int \limits _{D}n(t,z)dz\) is a continuous function due to the fact that \(n(t,z)\in {\mathcal {C}}([0,T],H)\). We can write the previous relation as
which, thanks to the hypothesis on \(\rho (t)\) over \((t_0,t_1)\), leads to
for all \(t\in (t_0,t_1)\). This represents a contradiction, and implies that
for all values of t.
Taking now \(\varphi =n\) on the variational formulation, using standard arguments to bound the linear terms from Q[n] and the estimate (7) for the non-linear part together with the Gronwall Lemma, we obtain the relation
for some real positive number a. Using Gronwall’s lemma, this relation implies that
for all values of t. Finally assume the existence of two non-negative weak solutions \(n_1\) and \(n_2\) for the same initial data \(n_0\). Taking the difference between their respective variational formulations, choosing \(\varphi =n_1-n_2\), we get the equality
Once again, the linear part of \(Q [n_1]-Q [n_2]\) can be easily bounded using standard methods, leading to
Consequently, thanks to Gronwall’s lemma, \(\Vert n_1-n_2\Vert ^2_H=0\) for all t, therefore, the solution is unique.
As stated before, the proof of existence of a weak solution will be carried on in Sect. 3. Following the ideas from Carrillo et al. (2020), the semi-discrete numerical scheme (5–6) is developed and the convergence of its solution to the solution of (1–3) is demonstrated. A fully discrete numerical scheme is obtained starting from the semi-discrete one and its convergence is also proved. Section 4 is devoted to the numerical simulations, starting with some numerical computations of the approximation error by comparing the results with an exact solution. Finally, the solutions of some examples with biological meaning is presented.
3 Existence of a weak solution and numerical approximation
We aim to use the Finite Volume method in order to find a sequence of problems whose solutions converge to the solution of (4).
3.1 Preliminaries on the finite volume method
Consider \(h=1/M\) where M is a natural number and define the mesh
with \(i,j,k=0,\dots ,M-1\), such that \(\bigcup \limits _{i,j,k}C_{ijk}=[0,1]^3\).
Now introduce the sets
and
For simplicity, we define \(N:=|{\mathcal {M}}|\) as the amount of elements in \({\mathcal {M}}\), and denote each of its elements as \(D_j\), for \(j=1,\ldots ,N\). For each \(D_j\), we denote its centre of mass as \(\mathbf {z_j}:=(x_j,y_j,\theta _j)\). For each j, define \(N_j\) as the set of indexes l such that \(D_l\) and \(D_j\) share a common boundary. Denote such common boundary as \(\Gamma _{jl}\), its centre of mass as \(\mathbf {z_{jl}}\) and \({\mathbf {n}}_{jl}\) the outer normal vector of \(D_j\), in the direction of \(D_l\). We remark that the distance between the centres of two neighbouring cells \(D_j\) and \(D_l\) will be equal to \(|\mathbf {z_l}-\mathbf {z_j}|\). Having cubes as the mesh cells guarantees that the condition
is fulfilled.
It is important to remark that if D has a regular enough boundary (for example: smooth or polygonal), then the area of \(D_h\setminus D\), which we denote as \(|D_h\setminus D|\), will converge to zero as h vanishes. The approximated problem (1–3) is then given by
where \({\tilde{\rho }}_h(t)=\int \limits _{D_h} {\tilde{n}}_h(t,z)dz\). We propose a classical finite volume method based on local averages of the unknown density over cell grids defined by
Assume that the coefficients and the solution from Eq. (9) are smooth. Then, integrating it over a cell \(D_j\) yields the equality
After integrating by parts and using the boundary conditions (10), we get
For a real function f(t), define the positive and negative part of f as
and
respectively. Using an upwind approximation technique for the advection term, we conclude
where \(u_{jl}(t)=V(t,\mathbf {z_{jl}})\cdot \vec {{\mathbf {n}}}_{jl}\). On the other hand, we have
As \(A(\theta )\) is a diagonal matrix and \({\mathbf {n}}_{jl}\) is either one of the vectors from the euclidean canonical base, or one of their opposites, we have the relation
where \(A_{jl}=(A(\theta _{jl}){\mathbf {n}}_{jl}\cdot {\mathbf {n}}_{jl})\). Notice that, thanks to hypothesis (H3), there exists \(\alpha >0\) such that \(A_{jl}\geqslant \alpha \), for all j and l. So that, together with the expression of the normal vectors, (8) this implies
Due to the approximation of the gradient
we can finally write
Taking into account that
the reaction terms can be easily approximated
where we have adopted the notation \(r_j:=r(\mathbf {z_j})\) and \(d_j:=d(\mathbf {z_j})\). Finally, using again (12), we get
Consequently, collecting (13), (14) and (15), and getting rid of the approximation orders we obtain the semi-discrete scheme
where
This system of equations can be complemented with the set of initial data
where \({\overline{n}}_0(z)\) is the extension by 0 of \(n_0(z)\) to all of \({\mathbb {R}}^3\).
3.2 Global existence, uniqueness, positivity and boundedness of the solution for the semi-discrete scheme
We prove the local existence and uniqueness of the solution for the problem (16–17), by using the Cauchy-Lipschitz theorem. Then, such solution is proved to be non-negative and, as a consequence, bounded independently of t. Finally the boundedness property is used to prove the global existence of the solution.
Proposition 3
(Local existence of solution) For all sets of initial data \(\{\nu ^0_{j}\}\), there exists \(0<T^*<\infty \) such that the problem (16–17) has an unique solution over \([0,T^*)\). Furthermore, if \(\nu _{j}^0\geqslant 0\) for all j, then \(\nu _j(t)\geqslant 0\) for all time \(t\in [0,T^*)\) and all j.
Proof
The RHS term in (16) is Lipschitz continuous for all values of t and \(\nu _j\), therefore the existence and uniqueness of solution over a certain interval \([0,T^*)\) is a direct consequence of the Cauchy-Lipschitz theorem.
On the other hand, consider a strictly positive set of initial values \(\nu _{j}^0\) and define the continuous function \(f(t)=\min \limits _{j}\nu _{j}(t)\). If \(f(t)\geqslant 0\) for all \(t<T^*\), then the solution remains positive at all times. If \(f(t)<0\) for some \(t\in (0,T^*)\), then there exists \(t_0>0\) such that \(f(t_0)=0\) and \(f(t)\geqslant 0\) for \(t<t_0\). This implies the existence of \(j_0\) such that \(\nu _{j_0}(t_0)=0\) with \(\nu '_{j_0}(t_0)\leqslant 0\). If \(\nu _l(t_0)>0\) for some \(l\in N_{j_0}\), then, thanks to (16) we have
which is a contradiction with the previously established fact that \(\nu '_{j_0}(t_0)\leqslant 0\). Consequently \(\nu _l(t_0)=0\) for all \(l\in N_{j_0}\). Furthermore, from the definitions of f(t) and \(t_0\), we also have that \(\nu '_l(t_0)\leqslant 0\) for all \(l\in N_{j_0}\). We can iterate the previous argument in order to obtain that \(\nu _j(t_0)=0\) for all j and consequently, thanks to the uniqueness of the solution, \(\nu _j(t)\equiv 0\) for all j and all t. This is a contradiction with the assumption that f(t) is negative for some value of t and consequently \(f(t)\geqslant 0\) for all t.
Finally, for non-negative initial values \(\nu _j^0\) and \(\varepsilon \) small enough, we define \(\nu _j^{\varepsilon }\) as
Thanks to the previous step, the solution of (16) associated to \(\nu _j^{\varepsilon }\) remains non-negative for all t and all \(\varepsilon \). Hence, thanks to the continuous dependence of the solution of a system with respect to its initial data, we conclude that \(\nu _{j}^0\geqslant 0\) implies \(\nu _j(t)\geqslant 0\) for all \(t\in [0,T^*)\). \(\square \)
Proposition 4
(Discrete \(L^1\) bound) The \(L^1\) norm \({\tilde{\rho }}_h(t):=\sum \limits _lh^3\nu _l(t)\) satisfies the bounds
where \(r^-\), \(r^+\), \(d^-\) and \(d^+\) are the bounds given in (H1) for r(z) and d(z) respectively.
Proof
Multiplying (16) by \(h^3\) for each j, adding up all the equations, recalling that \(|\Gamma _{jl}|=|\Gamma _{lj}|\), \(u^+_{jl}=-u^-_{lj}\), \(A_{jl}=A_{lj}\) and \(|\mathbf {z_l}-\mathbf {z_j}|=|\mathbf {z_j}-\mathbf {z_l}|\) we obtain that
The non-negativity of the solution implies then
These differential inequalities directly imply the bounds over \({\tilde{\rho }}_h(t)\). \(\square \)
Notice that the \(L^1\) bounds are independent of t. Additionally, the upper bound also implies that
So that in general \(\nu _j(t)\leqslant \frac{{\overline{\rho }}}{h^3}\), which is the key estimate in order to prove global existence of solution for (16–17).
Proposition 5
(Global existence of solution) For all sets of initial data \(\{\nu ^0_{j}\}\), there exists a unique solution of problem (16–17) for all \(t>0\). Such solution is non-negative and satisfies the estimate (18).
Proof
For each h, assume that there exists a finite maximal time of existence \(T_h\). However, the estimate (18) on \( {\tilde{\rho }}_h(t)\) implies that for all j, \(\nu _j(T_h)\leqslant \frac{{\overline{\rho }}}{h^3}<\infty \), which, thanks to the Lipschitz continuity of the right hand side of (16) allows to extend to solution to a certain interval \([T_h,T^*_h)\), contradicting this way the maximality of \(T_h\). \(\square \)
3.3 Discrete gradient, \(L^2\) norm estimate and compactness result
In this section we introduce some piecewise constant functions depending on the solution of (16–17) together with some estimates related to such functions. Then, some compactness properties will be proved in order to ensure that such functions converge to some function that will be proved to be a weak solution of (1–2), and their derivatives, respectively.
We first introduce \(n_h(t,z)\) defined as
Notice that \(\Vert n_h\Vert _{L^1}={\tilde{\rho }}_h(t)\). Now, for each \(l\in N_j\), define the polygonal subsets of \(D_{j}\), denoted \(D_{jl}\), having \(\Gamma _{jl}\) as the common side and \(\mathbf {z_j}\) as a vertex. The subsets \(D_{jl}\) are pyramids of area \(s_{jl}=\frac{|\Gamma _{jl}|d(\mathbf {z_j},\Gamma _{jl})}{3}=\frac{h^3}{6}\). Let us define the piecewise constant function
![](http://media.springernature.com/lw408/springer-static/image/art%3A10.1007%2Fs00285-022-01820-5/MediaObjects/285_2022_1820_Equ180_HTML.png)
This function can be regarded as a discrete gradient for \(n_h(t)\).
Proposition 6
(\(L^2\) bound) For each value of h, define the space \(H_h:=L^2(D_h)\). Then, there exists positive constants a and b, independent of h, such that the functions \(n_h(t,z)\) and \(v_h(t,z)\) satisfy the following estimate
Proof
Multiplying Eq. (16) by \(h^3\nu _j(t)\) for each j, and adding them up, we obtain the relation
where
Let us proceed to estimate each of these terms. In order to simplify the notation, we define
and write
Consequently, knowing that \(-u^+_{jl}=u^-_{lj}\leqslant 0\), we have
where we used that \(|\mathbf {z_l}-\mathbf {z_j}|=2|\mathbf {z_{jl}}-\mathbf {z_j}|\). From the definition of \(u_{jl}\), we conclude that
where \({\overline{V}}:=\max \limits _{z,t}|V(t,z)|\). This implies
Then, for all \(\varepsilon >0\), Young’s inequality implies
On the other hand
The ellipticity and boundedness of the matrix \(A(\theta )\) imply that there exist positive constants \({\underline{\alpha }}\), \({\overline{\alpha }}\) such that \({\underline{\alpha }}\leqslant A_{jl}\leqslant {\overline{\alpha }}\) for all j and l. From this and the value of \(s_{jl}\) we deduce that
and consequently
notice that the last identity is true since \(v_h\) is a piecewise constant function. Using the bounds over r(z) and the positiveness of d(z) and \({\tilde{\rho }}_h(t)\) we see that
Using (21), (22) and (23) in (20), with the relation
yields the differential inequality
which, after taking \(\varepsilon =\frac{{\underline{\alpha }} }{3{\overline{V}}}\) and using Gronwall’s lemma, leads to the estimate (19). \(\square \)
The result of Proposition 6 can be easily generalised if instead of multiplying each Eq. (16) by \(h^3\nu _j(t)\), we multiply by \(h^3\nu ^{p-1}_j(t)\), for any \(p>1\), which would lead to the following uniform \(L^p\) bound:
Proposition 7
(\(L^p\) bound) There exists positive constants \(a_p\) and \(b_p\), independents of h, such that the functions \(n^p_h(t,z)\) and \(v^p_h(t,z)\) defined as
![](http://media.springernature.com/lw444/springer-static/image/art%3A10.1007%2Fs00285-022-01820-5/MediaObjects/285_2022_1820_Equ181_HTML.png)
satisfy the following estimate
Before stating the compactness result that will allow us to extract a convergent subsequence from \(n_h\) we give two important results that are a consequence from estimate (19).
Proposition 8
For all vectors \(\eta \in {\mathbb {R}}^3\) define the translation operator \(\pi _{\eta }:L^2({\mathbb {R}}^3)\rightarrow L^2({\mathbb {R}}^3)\) as \(\pi _{\eta }(u)(z)=u(z+\eta )\). Then
uniformly in h, where \({\overline{n}}_h\) represents the extension by 0 of \(n_h\) to all of \({\mathbb {R}}^3\).
Proof
The proof of this proposition follows the same steps as Lemma 18.3 and Remark 18.8 in Eymard et al. (2000). In particular, thanks to the discrete trace inequality given in the Lemma 10.5 within the same reference, we can prove the existence of a constant C, independent of h and \(\eta \), such that
and we get the result from Proposition 8 thanks to (19). \(\square \)
From now on we introduce the notation \(D_T:=(0,T)\times D\).
Proposition 9
There exists a constant \(C_1\), independent of h, such that for all \(\psi \in {\mathcal {D}}(D_T)\),
Proof
Using the scheme (16), we have that, for all \(\psi \in {\mathcal {D}}(D)\)
where \(\psi _j=h^{-3}\int \limits _{D_j}\psi dz\) is the mean value of \(\psi \) over \(D_j\). Reordering the terms in this equality we have
We can bound each term on this equality as follows: From the definition of \(u_{jl}(t)\) we get that \(|u^{\pm }_{jl}|\leqslant \Vert V\Vert _{L^{\infty }(D_T)}\) and consequently
Using the previously established relation \(A_{jl}\leqslant {\overline{\alpha }}\) we get
Finally, using the boundedness of r(z), d(z) and \({\tilde{\rho }}_h\) we see that
Putting everything together, we obtain the existence of a constant C independent of h, such that
Finally, using the inclusions \(H^3(D)\subset {\mathcal {C}}^{1,\frac{1}{2}}(D)\subset {\mathcal {C}}^1(D)\) that hold true in any open subset of \({\mathbb {R}}^3\) with smooth enough boundary thanks to the Sobolev inequalities, integrating over (0, T), using the Cauchy-Schwartz inequality and estimate (19), we get to the result stated in the proposition. \(\square \)
Proposition 10
If \(n_0\in L^p(D)\) for some \(p>2\), then there exists a function \(n\in L^2(0,T;{\mathcal {V}})\) such that, up to the extraction of a sub-sequence, the sequence of functions \(n_h\) strongly converges to n in \(L^2(D_T)\) and \(v_h\) weakly converges to \(\nabla n\) in \(L^2(D_T)\).
Proof
Propositions 8 and 9 give all the necessary tools in order to ensure the relative compactness of the set of functions \(\{n_h\}_h\) in \(L^2(D_T)\). In order to prove that such set is indeed compact, we will use a variant adapted to our purposes of the proof of Moussa (2016, Theorem 3) to show that \(\{n_h\}_h\) is a Cauchy sequence in \(L^2(D_T)\). We consider a sequence of mollifiers \(\Phi _{\varepsilon }(z)=\varepsilon ^{-3}\Phi (\varepsilon ^{-1}z)\) for a positive, symmetric function \(\Phi \in {\mathcal {D}}(B(0,1))\) satisfying \(\int _{{\mathbb {R}}^{3}}\Phi (z)dz=1\).
Step 1: We claim that
uniformly on h.
We have
thanks to the Cauchy-Schwarz inequality and the value of the integral of \(\Phi _{\varepsilon }(y)\). Consequently
and due to Proposition 8, we obtain the strong convergence of \(\Phi _{\varepsilon }*{\overline{n}}_h\) to \({\overline{n}}_h\) in \(L^2((0,T)\times {\mathbb {R}}^3)\), uniformly in h.
Step 2: We prove that for every fixed \(\varepsilon \) and any compact \(\omega \subset D\), the sequence \(\Phi _{\varepsilon }*{\overline{n}}_h\) is uniformly bounded in \(H^1((0,T)\times \omega )\).
Thanks to Young’s inequality, for a fixed \(\varepsilon \),
Furthermore, for any compact \(\omega \subset D\), \(\psi \in {\mathcal {D}}((0,T)\times \omega )\) and \(\varepsilon \) small enough, we have \(\Phi _{\varepsilon }*\psi \in {\mathcal {D}}(D_T)\) and consequently, using Proposition 9 and Young’s inequality for the convolution product, we get
Then, by duality, for a fixed \(\varepsilon \), \(\frac{d }{dt}\Phi _{\varepsilon }*{\overline{n}}_h\) is uniformly bounded in \(L^{\infty }((0,T)\times \omega )\) and we conclude that for each fixed \(\varepsilon \), \(\Phi _{\varepsilon }*{\overline{n}}_h\) is uniformly bounded in \(H^1((0,T)\times \omega )\).
Step 3: We claim that, for all compacts \(\omega \subset D\), the sequence \({\overline{n}}_{h}\) is a Cauchy sequence in \(L^{2}((0,T)\times \omega )\).
Write
Thanks to Step 1, for all \(\eta >0\), we can fix \(\varepsilon \) in such a way that
Thanks to Step 2 and Rellich-Kondrachov’s theorem, we know that for a fixed \(\varepsilon \), \(\Phi _{\varepsilon }*{\overline{n}}_h\) is a Cauchy sequence in \(L^{2}((0,T)\times \omega )\), hence, for \(h_1\) and \(h_2\) close enough
Consequently, for all positive \(\eta >0\), there exists \(h_1\) and \(h_2\) such that
which proves that \({\overline{n}}_{h}\) is a Cauchy sequence.
Step 4: The sequence \({\overline{n}}_{h}\) is a Cauchy sequence in \(L^{2}(D_T)\).
Fix a natural number m, define \(\omega _m:=\{z\in D:d(z,\partial D)\geqslant 1/m\}\), \(\omega ^T_m:=(0,T)\times \omega _m\) and write
Taking \(q>1\) and using Hölder’s inequality we have
Hence, for \(q=p/2\), thanks to (25), we can conclude that \(\sup \limits _h\Vert n_h\Vert _{L^2((0,T)\times (D\setminus \omega _m))}\) goes to 0 when m goes to infinity. Consequently, for all \(\eta >0\), we can fix m big enough so that the second term in (26) is smaller than \(\eta /2\) and then, thanks to Step 3, we can choose \(h_1\) and \(h_2\) in such a way that the first term is also smaller than \(\eta /2\), which proves that \(\{n_h\}_h\) is a Cauchy sequence in \(L^{2}(D_T)\).
Step 5: Let n be the limit of a suitable subsequence of \(n_h\) and \(v:=(v^1,v^2,v^3)\) the weak limit of \(v_h\) (such limit exists due to the fact that \(v_h\) is bounded in \((L^2(D_T))^3\)). We claim that \(n\in L^2(0,T;{\mathcal {V}})\) and \(v=\nabla n\).
For all \(\phi (t,z)\in {\mathcal {C}}([0,T];{\mathcal {D}}(D))\), the convergence of \(n_h\) to n implies that
where \(\partial _1\phi (z)=\partial _x\phi (z)\), \(\partial _2\phi (z)=\partial _y\phi (z)\) and \(\partial _3\phi (z)=\partial _{\theta }\phi (z)\). Writing \(\partial _i\phi (z)=\nabla \cdot (\phi (t,z) \mathbf {e_i})\), where \(\{{\mathbf {e}}_i\}\) is the eucledian canonical basis, we have that
Therefore, we obtain
Recalling that \(\mathbf {z_{jl}}\) is the center of mass of \(\Gamma _{jl}\), we may estimate the integral over \(\Gamma _{jl}\) as
as done in (12) with the integral over \(D_j\). Using the fact that \(|\Gamma _{jl}|=h^2\) and the amount of cells on the mesh is \({\mathcal {O}}(h^{-3})\), we have
Reordering the terms of this last sum, we have
where
![](http://media.springernature.com/lw238/springer-static/image/art%3A10.1007%2Fs00285-022-01820-5/MediaObjects/285_2022_1820_Equ182_HTML.png)
This function strongly converges in \(L^2(D_T)\) to \(\phi \). Substituting (29) in (28) and then in (27), and using the weak convergence of \(v_h\) to v, we obtain that, for all \(\phi \in {\mathcal {C}}([0,T];{\mathcal {D}}(D))\)
Taking \(\phi (t,z)=\varphi (z)\chi (t)\), with \(\varphi \in {\mathcal {D}}(D)\) and \(\chi (t)\in {\mathcal {C}}([0,T])\), this last equality becomes
for all \(\varphi \in {\mathcal {D}}(D)\) and \(\chi (t)\in {\mathcal {C}}([0,T])\), which implies that for each \(\varphi \in {\mathcal {D}}(D)\)
The separability of \({\mathcal {D}}(D)\) finally implies that
As \(v(t,z)\in L^2(D)\) for almost all \(t\in [0,T]\), n(t, z) belongs to \({\mathcal {V}}\) for almost all \(t\in [0,T]\), which proves the statements of the Proposition. \(\square \)
An immediate consequence of Proposition 10, together with estimate (19) is that
Noticing that
the definition of \(D_h\) and Proposition 10 implies that the sequence of functions \(\rho _h(t)\) strongly converges to \(\rho (t):=\int \limits _Dn(t,z)dz\) in \(L^2((0,T))\).
3.4 Existence of weak solution
This section is devoted to prove that the function n is a weak solution of problem (1–2).
Proposition 11
The function n satisfies
for all \(\varphi \in {\mathcal {C}}^1_c([0,T),{\mathcal {V}})\).
Proof
First consider \(\varphi \in {\mathcal {C}}^1_c([0,T),{\mathcal {C}}^{\infty }_c({\mathbb {R}}^3))\), and for each j, multiply Eq. (16) by \(h^3\varphi _j(t):=h^3\varphi (t,\mathbf {z_{j}})\), and add them up for all j, obtaining the relation
where
![](http://media.springernature.com/lw388/springer-static/image/art%3A10.1007%2Fs00285-022-01820-5/MediaObjects/285_2022_1820_Equ183_HTML.png)
Reordering the terms from \(A_{\varphi }(t)\), we get
where
Thanks to the boundedness of \(u_{jl}\) and the regularity of \(\varphi \), this term satisfies
Recalling the definition of \(u_{jl}\) and the property (8), we get from (32)
Defining \(V_j(t):=V(t,\mathbf {z_j})\) allows us to write
with
Thanks to the regularity of \(\varphi \) and V, we have
Finally, we write
where
Again, the regularity of \(\varphi \) and the boundedness of V imply
And this together with (33), (34) and estimate (19) ensures that there exists a constant \(C_A\) independent of h such that
On the other hand, we can rewrite the remaining term as
In Eymard et al. (2006), it was proven that
consequently
where
![](http://media.springernature.com/lw290/springer-static/image/art%3A10.1007%2Fs00285-022-01820-5/MediaObjects/285_2022_1820_Equ184_HTML.png)
Moreover, the sequence \(\Big (V\cdot \nabla \varphi \Big )_h(t)\) strongly converges to \(V\cdot \nabla \varphi \) in \(L^2(D_T)\), which implies that
Reordering as well the terms from \(D_{\varphi }\), we get
where
Thanks to the boundedness of the coefficients \(A_{jl}\) and the regularity of \(\varphi \) we get
Recalling the definition of \(A_{jl}\), the fact that \(A(\theta )\) is a diagonal matrix and the normal vectors to the boundary of \(D_j\) are elements of the canonical euclidean basis or their opposites, we have that, for all j and l
Consequently, we infer that
where
![](http://media.springernature.com/lw346/springer-static/image/art%3A10.1007%2Fs00285-022-01820-5/MediaObjects/285_2022_1820_Equ185_HTML.png)
strongly converges to \( A(\theta )\nabla \varphi (z)\) in \(\Big (L^2(D_T)\Big )^3\). The regularity of \(\varphi \), the boundedness of \(v_h\) in \(L^2(D_T)\) and (36) guarantee the existence of \(C_D\) such that
so that, consequently,
The sequence of functions
belongs to \(L^{\infty }(D_T)\) and strongly converges in \(L^2(D_T)\) to \(r(z)-d(z)\rho (t)\), which implies that
Finally, we conclude that
which converges to
Putting this result together with (35), (37) and (38), we get that, for all \(\varphi \in {\mathcal {C}}^1_c([0,T),{\mathcal {C}}^{\infty }_c({\mathbb {R}}^3))\)
As \({\mathcal {C}}^1_c([0,T),{\mathcal {C}}^{\infty }_c({\mathbb {R}}^3))\) is dense in \({\mathcal {C}}^1_c([0,T),H^1(\omega ))\) for all compacts \(\omega \subset D \), and the functions n and \(\nabla n\) belong to \(L^2(D_T)\), (39) also holds for all \(\varphi \in {\mathcal {C}}^1_c([0,T),{\mathcal {V}})\). \(\square \)
Proposition 12
The function n belongs to \(H^1((0,T);{\mathcal {V}}')\).
Proof
Taking \(\varphi :=\chi (t)\psi (z)\) with \(\chi \in {\mathcal {D}}((0,T))\) and \(\psi \in {\mathcal {V}}\) in Eq. (31), we get
As this holds true for all \(\psi \in {\mathcal {V}}\), this equation is equivalent with
or in other words
The estimate (30) implies that \(Q[ n]\in L^2((0,T);{\mathcal {V}}')\), consequently, n belongs to \(H^1((0,T);{\mathcal {V}}')\). \(\square \)
Proposition 13
The function n belongs to \({\mathcal {C}}([0,T],L^2(D))\).
Proof
From Proposition 8 we have that \(n\in L^2((0,T),{\mathcal {V}})\). Define \({\overline{n}}(t,z)=n(t,z)\mathbb {1}_{[0,T]}(t)\) and the approximation to the identity sequence
where \(\Phi (t)\) is a mollifier with compact support included in \((-1,-1/2)\). The sequence \(n_{\varepsilon }(t):=n*_t\Phi _{\varepsilon }\) belongs to \({\mathcal {C}}^1({\mathbb {R}},{\mathcal {V}})\), \(n_{\varepsilon }\rightarrow n\) a.e. on [0, T] and in \(L^2((0,T),{\mathcal {V}})\). For a fixed \(\tau \in (0,T)\) and for any \(t\in (0,\tau )\) and any \(0<\varepsilon <T-\tau \), we have \(s\rightarrow \Phi _{\varepsilon }(t-s)\in {\mathcal {D}}(0,T)\), since \(\text{ supp } \Phi _{\varepsilon }(t-\cdot )\subset [t+\varepsilon /2,t+\varepsilon ]\subset [\varepsilon /2,\tau +\varepsilon ]\). Therefore, we get
As a consequence \(n'_{\varepsilon }\rightarrow n'\) a.e. and in \(L^2((0,\tau ),{\mathcal {V}}')\). Now fix \(\tau \in (0,T)\) and \(\varepsilon ,\varepsilon '\in (0,T-\tau )\), and compute
so that for any \(t_1,t_2\in [0,\tau ]\)
Since \(n_{\varepsilon }\rightarrow n\) a.e. in \([0,\tau ]\) in V, fix \(t_1\) such that \(n_{\varepsilon }(t_1)\rightarrow n(t_1)\) in V, so as a consequence of \(n_{\varepsilon }\rightarrow n\) in \(L^2((0,T),{\mathcal {V}})\) and \(n'_{\varepsilon }\rightarrow n'\) in \(L^2((0,\tau ),{\mathcal {V}}')\) we have
So that \(n_{\varepsilon }\) is a Cauchy sequence in \({\mathcal {C}}([0,\tau ],L^2(D))\), and then \(n_{\varepsilon }\) converges in \({\mathcal {C}}([0,\tau ],L^2(D))\) to a limit \({\tilde{n}}\in {\mathcal {C}}([0,\tau ],L^2(D))\). That proves \(n={\tilde{n}}\) a.e. and \(n\in {\mathcal {C}}([0,\tau ],L^2(D))\). By taking \(\Phi (-t)\) as the mollifier function in the previous proof, and choosing \(\tau \in (0,T)\) and \(\varepsilon \in (0,\tau )\) can be proven that \(n\in {\mathcal {C}}([\tau ,T],L^2(D))\), and consequently \(n\in {\mathcal {C}}([0,T],L^2(D))\). \(\square \)
Proposition 14
The function n is a weak solution of problem (1–2).
Proof
Assume first \(\varphi \in {\mathcal {C}}_c([0,T),H)\cap L^2((0,T),{\mathcal {V}})\cap H^1([0,T],{\mathcal {V}}') \). We define \(\varphi _{\varepsilon }(t)=\varphi *_t \Phi _{\varepsilon }\) for a mollifier \(\Phi _{\varepsilon }\) with compact support included in \((0,\infty )\) so that \(\varphi _{\varepsilon }\in {\mathcal {C}}^1_c([0,T);{\mathcal {V}})\) for any \(\varepsilon >0\) small enough and
Writing the Eq. (31) for \(\varphi _{\varepsilon }\) and passing to the limit \(\varepsilon \rightarrow 0\) we get that (31) also holds true for \(\varphi \).
Assume now \(\varphi \in X_T=C([0,T],H)\cap L^2((0,T),{\mathcal {V}})\cap H^1([0,T],{\mathcal {V}}')\). Fix \(\chi \in {\mathcal {C}}^1({\mathbb {R}})\), such that \(\text{ supp }\, \chi \subset (-\infty ,0)\), \(\chi '\leqslant 0\), \(\chi '\in {\mathcal {C}}_c((-1,0))\), and such that the integral of \(\chi '\) is \(-1\). For example, fix \(\delta <1/2\) and define
Now define \(\chi ^t_{\varepsilon }=\chi (\frac{s-t}{\varepsilon })\), so that \(\varphi _{\varepsilon }:=\varphi \chi ^t_{\varepsilon }\in {\mathcal {C}}_c([0,T);H)\) and \(\chi ^t_{\varepsilon }\rightarrow {\mathbf {1}}_{[0,t]}\), \((\chi ^t_{\varepsilon })'\rightarrow -\delta _t\) as \(\varepsilon \rightarrow 0\). Equation (31) for the function \(\varphi _{\varepsilon }\) writes
Passing to the limit when \(\varepsilon \) goes to 0 we obtain that n is a solution for the variational formulation. \(\square \)
3.5 A discrete implicit scheme
Once we have established the proof of existence of a solution for problem (16–17), together with the obtention of a semi-discrete scheme in order to approximate such solution, we proceed to derive an implicit discrete scheme starting from problem (16–17), and to prove its convergence.
Consider a natural number K and define \(\Delta t=\frac{T}{K}\) and \(t_k=k\Delta t\), \(\nu ^k_j:=\nu _j(t_k)\), \(k=1,\ldots ,K\). Using a forward difference approximation for the time derivative in (16) we get the implicit scheme
where
Theorem 15
Let \(\nu ^0_j\) be non-negative initial data with mass \(\rho _0=\sum \limits _{j=1}^Nh^3\nu ^0_j\) and assume that
then there exists a unique non-negative solution \(\nu ^k_j\), \(k=1,\ldots ,N\) to scheme (40). Furthermore, for each h, the sequence of piecewise constant functions
strongly converges to the solution of (16–17) in \((L^2((0,T)))^N\) when \(\Delta t\) goes to 0.
Proof
For all \(\Delta t\) satisfying (41), there exists \(\lambda <1\) such that
where \({\overline{\rho }}_{\lambda }=\frac{{\overline{\rho }}}{1-\lambda }\). Consider the set
and assume \(\nu ^k\in {\mathbb {R}}^M\) to be the solution of (40) for a previous iteration, having all non-negative components and satisfying \(\sum \limits _jh^3\nu ^n_j\leqslant {\overline{\rho }}\). Define the operator \(\nu =F(\eta ):{\mathcal {X}}\rightarrow {\mathbb {R}}^M\) as the solution of the linear system
where the components of matrix \(M(\eta )\) are defined as
From the definition of \({\mathcal {X}}\) and the choice of \(\Delta t\) we have that \(P_{jl}(\eta )\) is positive for all \(\eta \) if \(j=l\), and non-positive if \(j\ne l\). Furthermore \(P(\eta )\) is a column-dominant matrix, so that we may conclude that \(P(\eta )\) is an M-matrix, which implies that its inverse exists and has only non-negative entries. As \(\nu ^k\) has all non-negative components, then \(\nu =F(\eta )\) also has non-negative components for all \(\eta \in {\mathcal {X}}\). Multiplying system (43) by \(h^3\) and adding up all equations, thanks to (42) we obtain
This implies that \(\sum \limits _{j=1}^Nh^3\nu _j<{\overline{\rho }}_{\lambda } \) and consequently, \(F(\eta )\) is a continuous application going from \({\mathcal {X}}\) to itself, and thanks to Brouwer’s fixed point theorem, \(F(\eta )\) has at least a fixed point on \({\mathcal {X}}\). Furthermore, a fixed point of \(F(\eta )\) will satisfy
which in turn implies \(\Vert \nu \Vert _1\leqslant {\overline{\rho }}\). As a consequence, the implicit Euler scheme satisfies
For the uniqueness, assume there exists two different solutions \(\nu \) and \(\mu \) to scheme (40). Let us denote the sign of \(\beta \in {\mathbb {R}}\) as \(sign(\beta )\). Taking the difference for each equation, multiplying by \(h^3 sign(\nu _j-\mu _j)\), adding up all the equations and recalling that
we obtain that
The condition over \(\Delta t\) then implies that \(\Vert \nu -\mu \Vert _1=0\) and consequently \(\nu =\mu \).
To prove the convergence of \(\nu ^j_{\Delta t}\) to \(\nu _j\), we define the sequences of continuous functions
These sequences are in \({\mathcal {C}}([0,T])\) and are uniformly bounded because for all values of j and k, \(\nu ^k_j\leqslant \frac{{\overline{\rho }}}{h^3}\) due to (44). Furthermore, thanks to (40)
where C is independent of \(\Delta t\). Consequently, for each j, when \(\Delta t\) goes to 0 both sequences \(\nu ^j_{\Delta t}\) and \(\mu ^j_{\Delta t}\) strongly converge in \(L^2((0,T))\) to a certain continuous functions \(\nu ^*_j(t)\).
We consider now a function \(\varphi \in {\mathcal {C}}^1_0((0,T))\) and we define \(\varphi _k:=\varphi (t_k)\). For all k, multiply (40) by \(\Delta t\varphi _k\) and add over k in order to obtain
or, after reordering the sum on the left side
For all \(\varphi \) in \({\mathcal {C}}^1_0((0,T))\), the sequence
strongly converges in \(L^2((0,T))\) to \(\varphi '\). The boundedness of the coefficients \(M^{k}_{j}\) and \(B^k_{jl}\) together with strong convergence of \(\nu ^j_{\Delta t}\) imply that
strongly converges to \(M_{j}(t,\rho _h(t))\nu ^{*}_j+\sum \limits _{l\in N_j}B_{jl}(t)\nu ^{*}_l\), so, taking the limit in (45), we get
which implies that \(\nu ^*_j\) is in \({\mathcal {C}}^1\) and is a solution for (16). Furthermore, as \(\nu ^*_j\) is the point-wise limit of \(\mu ^j_{\Delta t}\), it also satisfies the initial conditions (17). \(\square \)
4 Simulations
The first part of this section is devoted to the numerical analysis of the approximation error. For certain values of the coefficients of problem (1–3), it is possible to obtain an analytical solution, which we will use in order to compare with our numerical approximation.
Assume that \(r(x,y,\theta )\) and \(d(x,y,\theta )\) are constants, that \(V(t,x,y,\theta )\) is independent of t and that exists \(W(x,y,\theta )\) such that
Assume as well that
Then, the solution for problem (1–3) is
where
The existence of an analytic solution allows us to compare its values with those obtained from solving (16) for different values of h, and this way, numerically establish the error order of the method. Choosing
\(r=d=1\) and \(n_0=\Big (\pi (1-e^{-1/2})(e-1)\Big )^{-1}e^{-\frac{x^2}{2}-\frac{y^2}{2}+\theta }\), the exact solution for (1–3) is
For a grid of points \(\{(t_k,z_j)\}\) with \(t_k=k\Delta t\), \(k=1,\ldots ,K\), \(\Delta t>0\) and \(j=1,\ldots ,N\), we define the discrete \(L^2(D_T)\) error for the semi-discrete scheme (16) as
where \(\nu (t)\) is the solution of the scheme for the functions introduced above. We set \(\Delta t=0.01\) and in Fig. 1 we show the dependence in log-log scale of \(E_h=E(0.01,h)\) with respect to the inverse of the cell size \(M=1/h\).
In the same way, we define the discrete \(L^2(D_T)\) error for the discrete scheme (40) as
where \(\nu ^k_j\) is the solution of (40). We set \(h=1/50\) and plot the dependence of \(E^2_{\Delta t}:=E^2(\Delta t,0.02)\) with respect to \(\Delta t=1/M_1\), again in log-log scale (Fig. 2).
4.1 Phenotypic dimorphism
We present now several examples illustrating the effect of the environment on a population and how the plasticity trait plays a role in surviving effects.
4.1.1 Monomorphic population
On the first place, we show the evolution of a population which is under the effects of natural selection and non-genetic epimuations, but without considering plasticity as a trait nor accounting for the environmental pressure. Specifically, we solve the problem (1–2) over the domain
This choice of domain represents the existence of a trade-off between traits and it is evidenced here by noticing that the individuals of the population which are close to the maximal value of one of the traits (\(x=1\) or \(y=1\)), must forcibly be close as well to the minimal value of the other trait (\(y=0\) or \(x=0\)). We take the growth rate as \(r(x,y)=e^{-(x-0.1)^2-(y-0.1)^2}\), the death rate as \(d(x,y)=0.5\), the diffusion parameters \(a_{11}(\theta )=a_{22}(\theta )=10^{-6}\), and the drift terms \(V(t,x,y)=(0,0)\). We take an initial condition (3) given by the expression
with \(f(x,y)=\frac{(x-0.25)^2+(y-0.25)^2}{(0.025)^2}\). We choose the value of a in such a way that \(\rho _0=\int _{\Omega }n_0(x,y)dxdy=1\).
With this choice of \(n_0\) we intend to represent a “strongly” monomorphic population. This is, a population where most of the individuals are concentrated around a single set of traits.
We observe in Fig. 3 that the dominant phenotype slowly converges to the point which maximises the fitness, in this case, the point (0.1, 0.1), which maximises the growth rate r(x, y).
4.1.2 Dimorphism due to the effect of the environment
For a second example we keep the same parameters, but add a drift term accounting for the effect of the environment (biologically, a “cellular stress”). Specifically we choose
Notice that we still are not including plasticity as a trait in our analysis. Also notice that the function V(t, x, y) is not continuous, while the results presented in previous sections needed V(t, z) to be smooth in order to ensure the existence and uniqueness of solution for the problem, as well as the convergence of the finite volume method. This is not a big issue, because the conditions of our problem allow us to use a density argument in order to extend our results to any \(V(t,z)\in {\mathcal {C}}([0,T),L^2(D))\), and in any case all numerical approximations are smoothing approximations of the drift V.
The choice of V can be seen (and experimentally replicated) as a certain type of “training”: all the individuals of the population are “pushed” in the direction where they show their largest potential.
We can appreciate in Fig. 4 that adding a drift term resulted in the appearance of dimorphism: the final population is concentrated around two different trait configurations. This evolution into dimorphism due the action of environment, contrasts with the results obtained in Lorenzi and Pouchol (2020), where it was proved that dimorphism can occur in the absence of a drift term, given that the growth rate r(x, y) has several maximum point satisfying certain conditions.
4.1.3 Plasticity, environmental effect and dimorphism
We will now consider plasticity as a trait and modify the parameters from the previous examples accordingly. We first consider the growth rate as
and keep the constant death rate \(d(x,y,\theta )=0.5\). We consider the diffusion matrix
and the drift term
This choice of V is similar to the one shown before, only that now differentiation imposes a cost on adaptability: the more specialised you are, the harder it gets to adapt to new situations. Notice that a higher plasticity increases the effect of non-genetic epimutations (given by the diffusion term) and stress induced mutations (given by the drift term). For the initial data (3) we take a function of the form
with \(f_i(z)=\frac{\Vert z-z_i\Vert ^2}{(0.025)^2}\), \(i=1,2\), \(z_1=(0.25,0.25,0.25)\) and \(z_1=(0.25,0.25,0.75)\). The value of a is again chosen in a way that the total population size over \(D=\Omega \times [0,1]\) is equal 1.
We observe in Fig. 5 that for this set of parameters, the sub-population with the lowest plasticity quickly gets extinct (notice the scale of the density values), while the emergence of dimorphism can be appreciated for the one with higher plasticity.
This kind of behaviour persists up to the final stages of the evolution, when we can observe in Fig. 6 that the low plasticity sub-population is completely extinct while the high plasticity one has completed the differentiation process.
4.1.4 Another example of emergence of dimorphism
We present now a different example of emergence of dimorphism, but this time, not as a result of a response to the effect of the environment, but as a consequence of the existence of two maximum points for the growth rate. We will observe how a population initially concentrated around a single phenotype configuration, will evolve with time into a dimorphic population, in which each upcoming sub-population is more specialised and less plastic that in the initial configuration. For this purpose, over the domain \(D=\Omega \times [0,1]\) we consider an initial density given by the expression
with \(f(z)=\frac{\Vert z-z_0\Vert ^2}{(0.025)^2}\), where \(z_0=(0.25,0.25,0.5)\) and \(\Vert \cdot \Vert \) is the euclidean norm. We choose the value of a in such a way that \(\rho _0=\int _{D}n_0(z)=1\).
We set the growth rate and the death rate as
We choose the diffusion matrix
and finally the drift term
This time, the “push” towards specialisation imposed by V is inversely proportional to the current set of traits (individuals with traits (x, y) are specialising with a rate proportional to \((-y,-x)\). The growth rate was chosen in such a way that it satisfies the sufficient conditions given in Lorenzi and Pouchol (2020) in order to guarantee the appearance of phenotypic polymorphism.
We show in Fig. 7 that initially the population is concentrated around the phenotype \(z_0=(0.25,0.25,0.5)\), and gradually starts differentiating while loosing plasticity.
Initial stages of the population density for different values of \(\theta \): The differentiation process starts. At around \(t=250\) (bottom left) most of the population has already concentrated around the plasticity level \(\theta =0.4375\) and around \(t=300\) (bottom right) we observe that the migration towards a less plastic state continues
As the two new sub-populations become more and more differentiated, the loss in plasticity becomes more evident, and we see in Fig. 8 that most of the mass is migrating towards less plastic states, while the differentiation process continues.
Finally we observe in Fig. 9 that once the sub-populations are fully specialised, the concentration process continues and at the final stage \(t=1000\) we have a dimorphic population which is more specialised but less plastic that the initial one.
Final stages of the population density for different values of \(\theta \): Around \(t=900\) (bottom left) the differentiation process is over and most of the population has reached the plasticity level \(\theta =0.25\). At \(t=1000\) (bottom right) we observe that the population concentrated around any other level of plasticity is almost extinct, and only the one around \(\theta =0.25\) survives
Overall, in the absence of a drift term, the effects of the growth and death rate, with small (or zero) diffusion has been widely studied, for example, see (Perthame 2006; Pouchol and Trélat 2018; Lorenzi and Pouchol 2020). In general, the points where the population concentrates are entirely determined by the reaction part, while the diffusion coefficients determine how concentrated the population is.
The shape of the domain \(\Omega \) directly affects where the fitness function (which depends on the birth and death rates) attains its maxima, therefore, affecting as well where the concentration phenomena occurs.
The choice of initial state does not appear to affect the final configuration, but rather the dynamics of the population during the evolution in time. We have shown that if we start with an already concentrated population, we witness the continuous “migration” of the the population towards the fittest state.
Finally, if we consider the presence of the advection term, we observe that it may affect not only the amount of selected sets of traits, but their positions and the dynamics of convergence towards them as well.
5 Concluding remarks
The validity of the model we constructed is strengthened by the different evolutionary mechanisms described in Wagner et al. (2019), where the authors focus their attention on Stress-Induced Evolutionary Innovation, and compare it to plasticity-based models, in particular the Plasticity First Hypothesis. Quoting the authors: “SIEI and PFH are not competing models but explain different kinds of evolutionary processes that are sometimes distinct and sometimes combined over evolutionary time”. Similar mechanisms were taken into consideration in the construction of our model, environmental stress (aka environmental pressure, biologically, at the single-cell level, “cellular stress”) in the form of an advection term and and mutations thanks to plasticity in the form of a diffusion term, both accompanied by natural selection in the form of reaction term.
It is important to highlight the novelty that represents the inclusion of plasticity as a trait, which has not been considered before (with the exception of Bouin et al. 2012; Bouin and Calvez 2014 and the subsequent works on the cane toad spreading rate, where a similar structural variable is used, but to denote spatial diffusivity, and not adaptability potential, as in our case). Another novelty is the inclusion of constraints between traits modelled through a certain relation between the structural variables. Most of the previous work, and in particular (Chisholm et al. 2015), have considered the variable space as the entire unit square, or all of \({\mathbb {R}}^d\), \(d\in {\mathbb {N}}\), disregarding the existence of constraints between different traits, and the possible effects on the dynamics of the population this might have.
The finite volume method offers a powerful tool in order to numerically approximate the solution of integro-differential or reaction-diffusion equations, such as the one treated in the present paper. The preservation of the structure of the original problem at a semi-discrete level and the excellent approximation for the non-local terms are just two of the reasons why we chose this method. This way, we were able to obtain two numerical schemes in order to approximate the solution for an evolution problem modelling bet hedging strategies.
We proved the existence and uniqueness of solutions for such schemes, and constructed sequences of functions converging to the solution of the original problem. We approximated the convergence error by establishing a comparison with an exact solution. It is worth mentioning that the constructive character of the proofs may provide new and interesting tools in order to obtain further theoretical results.
After simulating various situations, we observed different ways in which a population can respond to external stress, depending on the plasticity levels of its individuals. A highly plastic sub-population can quickly adapt to its surrounding environment, guaranteeing this way its survival, while a less plastic sub-population might go extinct under the same external factor. Another strategy consists in “trading” some of the plasticity by a higher differentiation level.
Furthermore, the emergence of dimorphism as a consequence of external stress, not only is an interesting alternative to the previously established results from Lorenzi and Pouchol (2020), but also shows that bet-hedging strategies are a suitable response to (abrupt) external changes in the environment, and, at the same time, a possible way to survive them. It is fair mentioning that, throughout all the simulations, the symmetry hypothesis required in the reference (Lorenzi and Pouchol 2020) in order to observe dimorphism were respected. It remains to establish what are the essential conditions that will lead to the appearance of dimorphism when an advection term is present.
We thus provided here a rigorous model for the study of the emergence of dimorphism, an event that is likely to have been at the evolutionary origin of multicellularity by divergence of phenotypes and may thus provide a rationale for a renewed conception of animal evolution towards multicellular organisms, and, more pragmatically and consistently with the atavistic theory of cancer, for a possible origin of phenotype bet hedging in cancer cell populations.
Bet hedging in cancer cell populations is indeed a strategy susceptible to yield maximal probabilities of survival to a plastic cell population exposed to life-threatening insults such as by drugs or other deadly therapies. The modelling setting presented here may thus help in the future to test and optimise combined anticancer therapies involving chemotherapies, targeted therapies, and - what is likely still ahead of us for the present time - possible control of cell plasticity by epigenetic drugs.
Availability of data and materials
Not applicable.
Code availability
Not applicable.
References
Bouin E, Calvez V (2014) Travelling waves for the cane toads equation with bounded traits. Nonlinearity 27(9):2233–2253. https://doi.org/10.1088/0951-7715/27/9/2233
Bouin E, Calvez V, Meunier N, Mirrahimi S, Perthame B, Raoul G, Voituriez R (2012) Invasion fronts with variable motility: phenotype selection, spatial sorting and wave acceleration. CR Acad Sci Paris 350(15–16):761–766. https://doi.org/10.48550/ARXIV.1207.2355
Carrillo JA, Filbet F, Schmidtchen M (2020) Convergence of a finite volume scheme for a system of interacting species with cross-diffusion. Numerische Mathematik. https://doi.org/10.1007/s00211-020-01121-3
Chisholm RH, Lorenzi T, Lorz A, Larsen AK, Almeida LNd, Escargueil A, Clairambault J (2015) Emergence of drug tolerance in cancer cell populations: an evolutionary outcome of selection, nongenetic instability, and stress-induced adaptation. Cancer Research 75(6), 930–939. https://cancerres.aacrjournals.org/content/75/6/930.full.pdf. https://doi.org/10.1158/0008-5472.CAN-14-2103
Clairambault J (2020) Stepping from modeling cancer plasticity to the philosophy of cancer. Front Genet 11:1160. https://doi.org/10.3389/fgene.2020.579738
Cohen D (1966) Optimizing reproduction in a randomly varying environment. J Theor Biol 12(1):119–129. https://doi.org/10.1016/0022-5193(66)90188-3
Davies PCW, Lineweaver CH (2011) Cancer tumors as Metazoa 1.0: tapping genes of ancient ancestors. Phys Biol 8(1):015001. https://doi.org/10.1088/1478-3975/8/1/015001
Desvillettes L, Jabin PE, Mischler S, Raoul G (2008) On selection dynamics for continuous structured populations. Commun Math Sci 6(3):729–747. https://doi.org/10.4310/CMS.2008.v6.n3.a10
Eymard R, Gallouët T, Herbin R (2000) Finite volume methods 7:713–1018. https://doi.org/10.1016/S1570-8659(00)07005-8
Eymard R, Gallouët T, Herbin R (2006) A cell-centred finite volume approximation for second order partial derivative operators with full matrix on unstructured meshes in any space dimension. IMA J Numer Anal 26:326–353. https://doi.org/10.48550/arXiv.math/0505109
Hanahan D (2022) Hallmarks of Cancer: New Dimensions. Cancer Discovery 12(1), 31–46 https://aacrjournals.org/cancerdiscovery/article-pdf/12/1/31/3052722/31.pdf. https://doi.org/10.1158/2159-8290.CD-21-1059
Huang S (2013) Genetic and non-genetic instability in tumor progression: link between the fitness landscape and the epigenetic landscape of cancer cells. Cancer Metastasis Rev 32(3–4):423–448. https://doi.org/10.1007/s10555-013-9435-7
Lineweaver CH, Davies PCW (2020) Comparison of the atavistic model of cancer to somatic mutation theory: Phylostratigraphic analyses support the atavistic model, Chap. 12. In: Gerstman BS (ed) The Physics of Cancer. Research Advances. World Scientific, Singapore, pp 243–261. https://doi.org/10.1142/9789811223495_0012
Lorenzi T, Pouchol C (2020) Asymptotic analysis of selection-mutation models in the presence of multiple fitness peaks. Nonlinearity 33(11):5791–5816. https://doi.org/10.1088/1361-6544/ab9bad
Matuła K, Richter Ł, Janczuk-Richter M, Nogala W, Grzeszkowiak M, Peplińska B, Jurga S, Wyroba E, Suski S, Bilski H et al (2019) Phenotypic plasticity of Escherichia coli upon exposure to physical stress induced by ZnO nanorods. Sci Rep. https://doi.org/10.1038/s41598-019-44727-w
Moussa A (2016) Some variants of the classical Aubin-Lions lemma. J Evol Equ 16(1):65–93. https://doi.org/10.1007/s00028-015-0293-3
Müller WEG (2001) Review: How was metazoan threshold crossed? The hypothetical Urmetazoa. Comp Biochem Physiol Part A Molecular Integr Physiol 129(2):433–460. https://doi.org/10.1016/S1095-6433(00)00360-3
Nedelcu AM, Michod RE (2020) Stress responses co-opted for specialized cell types during the early evolution of multicellularity. BioEssays 42(5):2000029. https://doi.org/10.1002/bies.202000029
Perthame B (2006) Transport equations in biology. Springer, Berlin. https://doi.org/10.1007/978-3-7643-7842-4
Philippi T, Seger J (1989) Hedging one’s evolutionary bets, revisited. Trends Ecol Evolut 4(2):41–44. https://doi.org/10.1016/0169-5347(89)90138-9
Pouchol C, Trélat E (2018) Global stability with selection in integro-differential Lotka–Volterra systems modelling trait-structured populations. J Biol Dyn 12(1):872–893. https://doi.org/10.1080/17513758.2018.1515994
Seger J (1987) What is bet-hedging? Oxf Surv Evol Biol 4:182–211
Shen S, Clairambault J (2020) Cell plasticity in cancer cell populations. F1000Research. https://doi.org/10.12688/f1000research.24803.1
Wagner GP, Erkenbrack EM, Love AC (2019) Stress-induced evolutionary innovation: a mechanism for the origin of cell types. BioEssays 41(4):180–188. https://doi.org/10.1002/bies.201800188
Yuan S, Norgard RJ, Stanger BZ (2019) Cellular plasticity in cancer. Cancer Discov. 9(7):837–851. https://doi.org/10.1158/2159-8290.CD-19-0015
Acknowledgements
The authors are gratefully indebted to Camille Pouchol and Tommaso Lorenzi for their constructive remarks about the model. JAC would like to thank the funding received from the Volunteer Lecturer Program of the IMU Commission for Developing Countries to give a numerical analysis course at Universidad de la Habana in March–April 2019. The research reported in this paper started during this period. JAC and FEAB thank the Facultad de Matemáticas y Ciencias de la Computación de la Universidad de la Universidad de La Habana for their hospitality.
Funding
JAC was supported by the Advanced Grant Nonlocal-CPD (Nonlocal PDEs for Complex Particle Dynamics: Phase Transitions, Patterns and Synchronization) of the European Research Council Executive Agency (ERC) under the European Union’s Horizon 2020 research and innovation programme (Grant Agreement No. 883363). This program has received funding from the European Union’s Horizon 2020 research and innovation programme under the Marie Skłodowska-Curie Grant Agreement No 754362.
The authors would like to thank the Isaac Newton Institute for Mathematical Sciences, Cambridge, for support and hospitality during the programme “Frontiers in kinetic theory: connecting microscopic to macroscopic scales - KineCon 2022” where work on this paper was undertaken. JAC was also partially supported by the EPSRC grant numbers EP/T022132/1 and EP/V051121/1.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors have no conflicts of interest to declare. All co-authors have seen and agree with the contents of the manuscript and there is no financial interest to report.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Alvarez, F.E., Carrillo, J.A. & Clairambault, J. Evolution of a structured cell population endowed with plasticity of traits under constraints on and between the traits. J. Math. Biol. 85, 64 (2022). https://doi.org/10.1007/s00285-022-01820-5
Received:
Revised:
Accepted:
Published:
DOI: https://doi.org/10.1007/s00285-022-01820-5