Abstract
We consider numerical approximations of ill-posed elliptic problems with conditional stability. The notion of optimal error estimates is defined including both convergence with respect to discretisation and perturbations in data. The rate of convergence is determined by the conditional stability of the underlying continuous problem and the polynomial order of the approximation space. A proof is given that no approximation can converge at a better rate than that given by the definition without increasing the sensitivity to perturbations, thus justifying the concept. A recently introduced class of primal-dual finite element methods with weakly consistent regularisation is recalled and the associated error estimates are shown to be optimal in the sense of this definition.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction
Arguably one of the most fundamental results in finite element analysis is the best approximation result for the Galerkin method, known as Cea’s lemma [20], which together with approximation estimates for finite element functions results in quasi-optimal error estimates for finite element methods [3, 27, 44, 48]. This result, that we will review below, essentially says that if a (2m)-order elliptic problem, \(m\ge 1\), is approximated with \(H^m\)-conforming finite elements of local polynomial order p the error in \(H^m\)-norm is of the order \(h^{p+1-m}\) for a sufficiently smooth solution and that this rate is optimal compared to approximation: the best interpolant of the exact solution has similar accuracy.
For ill-posed elliptic problems the situation is different. On the continuous level existence can only be guaranteed after regularisation of the problem. The two main approaches are Tikhonov regularisation [45] and quasi-reversibility [31]. These two approaches are strongly related (see for instance [7]). The main effort in the error analysis has been to estimate the perturbation induced by the addition of regularisation, and how to choose the associated regularisation operator or parameter [6, 28, 35, 38, 43]. The error due to approximation in finite dimensional spaces of such regularised problems has also been analysed [24, 36, 41].
There is also a rich literature on projection methods for ill-posed problems where the discretisation serves as regularisation and refinement has to stop as soon as the effect of perturbations in data becomes dominant [22, 23, 26, 30, 42]. These methods are often based on least squares methods and the convergence of the approximate solution to the exact solution for unperturbed data has been proven in several works. There are also different stopping criteria for mesh refinement in order to avoid degeneration due to pollution from perturbations. However no results on rates of convergence where the discretisation errors and the perturbation errors are both included appear in these references.
The use of conditional stability (continuous dependence on data under the assumption of a certain a priori bound) to obtain more complete error estimates has been proposed in [9,10,11, 18] for a class of finite element methods based on weakly consistent regularisation/stabilisation in a primal-dual framework. Here stability is obtained through a combination of consistent stabilisation and Tikhonov regularisation, scaled with the mesh parameter to obtain weak consistency. The upshot is that for this class of methods an error analysis exists, where the computational error is bounded in terms of the mesh parameter and perturbations of data, with constants depending on Sobolev norms of the exact solution. Similarly to the well-posed case, the error estimates for this approach combine the stability of the physical problem with the numerical stability of the computational method and the approximability of the finite element space. Contrary to the well-posed case, numerical stability can not be deduced from the physical stability, but has to be a consequence of the design of the stabilisation terms. This means that the stabilisation in this framework is bespoke, and must be designed to combine optimal (weak) consistency and sufficient numerical stability. There is often a tension between these two design criteria. As noted above, sometimes Tikhonov regularisation, scaled with the mesh parameter, may be used in the framework. An interesting feature is that the bespoke character also allows for the integration of the dependence of the estimates on physical parameters and different problems regimes [16, 17]. Other physical models that have been considered in this framework include data assimilation for fluids [5, 13, 18], or wave equations [12]. Common for all these references is the fact that the error estimates reflect the stability of the continuous problem and the approximation order of the finite element space, which seems to be an optimality property of the methods. No rigorous proof, however, has been given for this optimality. The objective of the present work is to show, in the model case of unique continuation for Laplace equation, that the proposed error estimates are indeed optimal.
For ill-posed PDEs that are conditionally stable, error estimates in terms of the modulus of continuity in the conditional stability, the consistency error and the best approximation error have also been obtained in [21]. Based on least squares with the norms and the regularisation term dictated by the conditional stability estimate, this variation of quasi-reversibility relies on working with discrete dual norms and constructing Fortin projectors. By choosing the regularisation parameter in terms of the consistency error and the best approximation error, the obtained error bound reflects the conditional stability estimate (qualitatively optimal). Conditional stability estimates have also been used to obtain some bounds on the generalisation error for physics-informed neural networks solving ill-posed PDEs [39]. The question of optimality for both these kind of methods is included in our discussion.
Another well-known ill-posed problem is analytic continuation, which, similarly to unique continuation, possesses conditional stability under the assumption of an a priori bound. We will not discuss this problem here; for its conditional stability/conditioning and numerical approximations, we refer the reader to [46, 47] and the references therein.
1.1 Unique Continuation Problem
Let \(0< r_1< r_2 < R\) and write \(\omega := B(r_1)\) and \(B:= B(r_2)\) where B(r) is the open ball of radius \(r > 0\), with the centre at the origin in \({\mathbb {R}}^n\). The objective is to solve the continuation problem: given the restriction \(u|_\omega \) to the subset \(\omega \), find the restriction \(u|_B\) when u satisfies \(\Delta u = 0\) in B(R).
Further, letting \(r_2< r_3 < R\) and writing \(\Omega := B(r_3)\), it is classical [29] that the following conditional stability estimate holds:
where \(\alpha \in (0,1)\) and the implicit constant do not depend on the harmonic function u. Estimate (1) is often called a three-ball inequality and in this case the constants can be given explicitly, see Theorem 1 below. We may view the unique continuation problem as finding \(u \in H^1(\Omega )\) such that
with a priori knowledge on the size of the solution in \(\Omega \) as prescribed by the \(L^2(\Omega )\)-norm in (1).
1.2 Motivation and Outline
The motivation of this paper comes from error estimates obtained for primal-dual finite element methods applied to this problem, with perturbed data \(u\vert _\omega =q + \delta q\), of the form
which have been shown in [14, 16, 17] in different variations of the second order elliptic equation. Here \(\alpha \in (0,1)\) is the exponent in (1) and \(h>0\) denotes the mesh parameter defining the characteristic length scale of the finite dimensional space. This is in the case of piecewise affine approximation, however the estimate generalises in a natural way to higher order approximation, as we shall see below. One can obtain a similar bound in the \(H^1\)-norm over B. In the counterfactual case that \(\alpha = 1\) one would then recover an estimate that is optimal compared to interpolation. Hence a natural question is if the bound (3) in the \(L^2\)-norm can be improved upon, since it is suboptimal with one order in h when compared to interpolation in the case \(\alpha =1\).
We show in this paper that if the coefficient \(\alpha \) in (1) is optimal and depends continuously on \(r_3\) (Theorem 1), then regardless of the underlying method, no sequence of approximations to (2) can converge with a rate better than that given by (3) (Theorem 2) without increasing the sensitivity to perturbations. We also point out that although the discussion focuses on the finite element method, the definition of optimal convergence given and the proof of optimality hold for any method producing an approximating sequence in \(H^1\) (or relying on such a sequence for the analysis).
The paper is organised as follows. In Sect. 2 we will discuss the notion of optimality of finite element approximations. First we revisit the classical finite element analysis for well-posed problems. In Sect. 2.2 we then discuss how the ideas of the well-posed case translate to the ill-posed case. This leads us to a definition of optimal approximation for the problem (2) and we prove in Sect. 2.3 that no approximation method can converge in a better way than that given by this definition. Finally in Sect. 3 we show that optimality can indeed be attained by presenting a finite element method with optimal error estimates which extend (3) for higher order approximations.
2 Optimal Error Estimates for Elliptic Problems
In this section we will first briefly recall the theory of optimal error estimates for Galerkin approximations of well-posed second order elliptic problems. We then consider the ill-posed model problem (2) and discuss how the construction that led to optimal approximation in the well-posed case can be adapted to this situation. This leads us to a definition of optimality of approximate solutions in the ill-posed case. We let \(V:=H^1(\Omega )\) and \(V_0:=H^1_0(\Omega )\).
For simplicity we consider the Poisson problem for \(f \in V_0'\):
We define the associated weak formulation by: find \(u \in V_0\) such that
where \(a(u,v):= \int _\Omega \nabla u \cdot \nabla v ~\text{ d }x\) and \(\ell (v):= \langle f,v\rangle _{V'_0,V_0}\). It is well known that a is a coercive (V-elliptic), continuous bilinear form on \(V_0 \times V_0\), and \(\ell \) is bounded and continuous on \(V_0\).
It then follows from Lax-Milgram’s lemma [32] that the weak formulation admits a unique solution satisfying the stability estimate
where \(\Vert \cdot \Vert _V:= \sqrt{a(\cdot ,\cdot )}\) and the dual norm is defined by
Introducing a finite dimensional subspace \(V_{0h} \subset V_0\) we define the Galerkin method, find \(u_h \in V_{0h}\) such that
where \(\ell _\delta \) denotes a perturbed right hand side \(\ell _\delta (v):= \langle f + \delta f,v\rangle _{V_0',V_0}\), with \(\delta f \in {V_0'}\) and we assume \(\Vert \delta f\Vert _{{V_0'}}\) to be known. The associated linear system is invertible since a is coercive.
Let \(\bar{u}_h\in V_{0 h}\) be the solution for the unperturbed right hand side, satisfying
Then we have that \(\left\| u-\bar{u}_h\right\| _V=\inf _{w_h \in V_{0\,h}}\left\| u-w_h\right\| _V\), by Galerkin orthogonality, and that \(\left\| u_h-\bar{u}_h\right\| _V \le \Vert \delta f\Vert _{V_0^{\prime }}\). Since \(\Delta : V_0 \rightarrow V_0^{\prime }\) is an isomorphic isometry, an application of the triangle-inequality for the approximation error \(e:= u- u_h\) gives that
This is equivalent to the classical result of Cea’s lemma, but written in a form suitable for our purposes. If \(u\in H^{k+1}(\Omega )\) and \(V_{0\,h}\) is the space of \(H^1\)-conforming piecewise polynomial finite elements of order k we immediately have by approximation that
where \(|\cdot |_{H^{k+1}(\Omega )}\) stands for the seminorm. Observe how the Lipschitz stability of (6) combines with the approximation properties of the finite element space to yield an optimal error estimate. Perturbations in data lead to stagnation of the error at the level of the perturbation.
2.1 Optimal Three-Ball Estimate
Three-ball estimates such as (1) for solutions of second-order elliptic equations are well-known in the literature, see e.g. the review [1] or [8]. However, such results typically contain constants that depend implicitly on the geometry and the coefficients of the differential operator, and whose optimality is not clear [8]. We aim here to give a result in the case of the Laplace operator which, barring optimality, is a variation of existing results in the literature, see [37, Theorem 1] and [2, Eq. (1.2)]. We will consider only the two and three dimensional cases, for which we prove the following three-ball estimate in \(L^2\)-norms with optimal explicit constants.
Theorem 1
Let \(n\in \{2,3\}\) and \(B(r) \subset {\mathbb {R}}^n\) be the open ball of radius \(r > 0\). Let \(0< r_1< r_2 < r_3\). Then for all harmonic functions u there holds
where
Moreover, there does not exist \({\tilde{\alpha }} > \alpha \) such that
Proof
For any \(r > 0\) and \(\theta \in (0,1)\) there holds
see e.g. [2, Eq. (1.2)]. We aim to transform this estimate into (10). We take the logarithm of (13) and write \(f(t) = \log \left\| u \right\| _{L^2(B(\exp (t)) )}\) to obtain
Notice that \(\log (\theta r) + \log (\theta ^{-1} r) = 2 \log (r)\), so that writing \(t = \log (\theta r)\) and \(s = \log (\theta ^{-1} r)\), we obtain that
yielding convexity of f. Hence, for every \(\alpha \in (0,1)\) and \(s,t \in {\mathbb {R}}\)
We now set \(r = r_2\) and \(\theta = r_1 / r_2\). Then \(r_1 = \theta r\) and \(r_3 = \theta ^{-\beta } r\), with \(\beta \) given by (11). Taking \(s = \log (\theta r)\) and \(t = \log (\theta ^{-\beta } r)\) there holds
since \((1-\alpha ) \beta = \alpha \). With this choice, taking the exponential of (15) gives (10).
Suppose now that (12) holds for some \({\tilde{\alpha }} > 0\). We will show that \({\tilde{\alpha }} \le \alpha \). Let us consider first the two dimensional case. Identifying \({\mathbb {R}}^2\) with \({\mathbb {C}}\), consider the function \(u(z) = z^{n-1}\) which is harmonic for \(n \in {\mathbb {N}}\). The following argument is similarly valid for its real part. Using polar coordinates we have that, for \(\rho >0\),
Notice that we have equality in (10) for \(\alpha = \beta /(1+\beta ) = \frac{\log (r_3) - \log (r_2)}{\log (r_3) - \log (r_1)}\).
Recalling that \(r_1/r_2 = \theta ,\, r_3/r_2 = \theta ^{-\beta }\), estimate (12) reads as
As \(n \in {\mathbb {N}}\) is arbitrary and \(\theta \in (0,1)\) we must have \({\tilde{\alpha }} - \beta (1-{\tilde{\alpha }}) \le 0\). In other words,
We turn to the three dimensional case, and consider the function
As above, this is harmonic for \(n \in {\mathbb {N}}\). Passing to spherical coordinates there holds
where the constant \(c_n\) can be written using the Gamma function
The conclusion follows as in the two dimensional case. \(\square \)
Note that the same explicit constants as in Theorem 1 appear in Hadamard’s three-circle theorem (in \(L^{\infty }\)-norms) for holomorphic functions.
Remark 1
In Theorem 1 we proved the optimality and continuous dependence of the exponent \(\alpha \) for unique continuation subject to the Laplace equation. In a more general setting, a discussion of optimality of three-ball inequalities can be found in [25] for elliptic and parabolic problems. In [33, 34] some cases in fluid mechanics and elasticity are considered for which optimality is claimed.
2.2 Definition of Optimal Convergence for Ill-Posed Problems with Conditional Stability
In this section we will try to mimic the development in the well-posed case for the problem (2) and point out where things go wrong. We will do this with minimal reference to a particular approximation method to keep the discussion general. However, in Sect. 3 we introduce a method for which the programme can be carried out.
First we will derive a weak formulation. This time, since no boundary conditions are set on u, we must consider the trial space V. To make form a consistent with the problem, the test space must be chosen to be \(V_0\), as keeping V would imply a homogeneous Neumann condition on the boundary. We may then write a weak formulation of the problem (2), find \(u \in V\) such that \(u\vert _{\omega } = q\) and
We know that the exact solution satisfies this formulation and that (1) holds. Assume now that we have an approximation \(u_h \in V_h\) obtained using the perturbed data \({\tilde{q}}:= q + \delta q\), where \(\delta q \in L^2(\omega )\). Observe that although for this data, most likely, no exact solution will exist, a discrete approximation of the unperturbed exact solution u may still be constructed. Similarly as before the error \(e:=u-u_h\) satisfies
where
Observe that even if \(u_h\) is produced using a Galerkin procedure we can not use here the same techniques as when proving (8), since the trial space V in this case is bigger than the test space \(V_0\).
As before we would now like to apply a stability estimate, this time (1), using the right hand side on the perturbation. However this is not possible, since there is no right hand side in (2) and (1). Instead we first decompose \(e = e_0 + {\tilde{e}}\), where \(e_0 \in V_0\) solves the well-posed problem
and \({\tilde{e}}\) solves (2) with \({\tilde{e}}\vert _{\omega } = (e - e_0)\vert _{\omega }\). Using the triangle inequality and then applying (6) to \(e_0\) and (1) to \({\tilde{e}}\) we arrive at
Using once again the triangle inequality this leads to
We conclude that any approximation \(u_h\) must satisfy the bound
If we assume that the term \(\Vert e\Vert _{L^2(\Omega )}\) is bounded, then inequality (20) gives an a posteriori bound for the error on B in the \(L^2(B)\)-norm.
For the sake of discussion, we will, for a moment, consider an approximation \(u_h\) satisfying certain properties. These properties can be thought of as design criteria for the numerical method, since as it turns out they lead to optimal convergence. In Sect. 3 we construct a finite element method with these properties.
-
1.
Bound on the equation residual:
$$\begin{aligned} \Vert \Delta u_h\Vert _{{V_0'}} \lesssim C h^k|u|_{H^{k+1}(\Omega )} + \Vert \delta q\Vert _{L^2(\omega )}. \end{aligned}$$(21)Observe that this means that the residual convergence in the ill-posed case is as good as the residual convergence in the well-posed case, see (9).
-
2.
Bound on the data fitting term:
$$\begin{aligned} \Vert q- u_h\Vert _{L^2(\omega )} \lesssim C h^k|u|_{H^{k+1}(\Omega )}+ \Vert \delta q\Vert _{L^2(\omega )} \end{aligned}$$(22)This term is suboptimal by one order in h compared to interpolation, but nothing can be gained by assuming better convergence since the term always is dominated by the contribution from \(\Vert \Delta u_h\Vert _{{V_0'}}\) in the bound (20). Strengthening the norm on \(\omega \) on the other hand is possible provided the perturbation \(\delta q\) also has additional smoothness.
-
3.
Finally we need to assume an a priori bound on \(u_h\):
$$\begin{aligned} \Vert u_h\Vert _{L^2(\Omega )} \lesssim |u|_{H^{k+1}(\Omega )} +h^{-k} \Vert \delta q\Vert _{L^2(\omega )}. \end{aligned}$$(23)The rationale for this choice is that it is the strongest control that can be achieved through Tikhonov regularisation without affecting the convergence order when the data is unperturbed, assuming that the previous two assumptions hold.
Injecting these three bounds in (20) we get the error estimate
A generic version of this estimate can be obtained by decoupling the rate of convergence from the sensitivity to perturbations, by considering the following bound
for \(\alpha _1,\, \alpha _2\in (0,1)\). Denoting the upper bound here by
we see that E has a unique critical point
which is a minimum since \(E''(h_{\min }) > 0\). Hence from (25) we get that
with \({\tilde{\alpha }}:= \frac{\alpha _1}{1+\alpha _1 - \alpha _2}\). Notice that \(\tilde{\alpha } = \alpha \) when \(\alpha _1 = \alpha _2 = \alpha \). Considering convergence with respect to perturbations \(\delta q\) in this bound, one would like to have \(\tilde{\alpha }\) as large as possible.
Based on the discussion above we here propose a definition of what it means that a family of approximations \(\{u_h\}\) to an ill-posed problem of the form (2) is optimally convergent.
Definition 1
Assume that \(u\in H^{k+1}(\Omega ),\, k\in {\mathbb {N}}\), solves the unique continuation problem (2). Let \(\alpha \in (0,1)\) be the largest value for which the conditional stability estimate (1) holds. Let \(\{u_h\}_{h>0}\) be a family of functions in \(H^1(\Omega )\). If the family \(\{u_h\}\) satisfies the inequality (25) with \(\frac{\alpha _1}{1+\alpha _1 - \alpha _2} = \alpha \), then we say that its convergence is optimal.
Remark 2
An optimal \(\alpha \in (0,1)\) in the stability estimate (1) is provided in Theorem 1. We prove below that, independently of the method used, no family \(\{u_h\} \subset H^1(\Omega )\) of approximations to the solution of (2) can satisfy (25) with \(\frac{\alpha _1}{1+\alpha _1 - \alpha _2} > \alpha \). In particular, no method can exceed the convergence rate in (24) without increasing the sensitivity to data perturbations nor can it improve this sensitivity without decreasing the convergence rate, i.e. there exist no \(\alpha _1, \alpha _2 \in [\alpha ,1)\) with \(\alpha _1 > \alpha \) or \(\alpha _2 > \alpha \), such that
Remark 3
The question of constructing such an optimal method with \(\alpha _1 > \alpha \) is currently an open question. For a small enough noise level, such a method would realize the best possible error upper bound (27) with a larger \(h_{\min }\), i.e. at a lower computational cost. Indeed, if
then setting \(\frac{\alpha _1}{1+\alpha _1-\alpha _2} = \alpha \) we see that (26) becomes
2.3 Proof of Optimality
The following Caccioppoli-type inequality is known but we give a short proof for the convenience of the reader.
Lemma 1
Let \(r_3< r_4 < R\) and \(k \ge 0\). Then for all \(w \in H^{k+1}(B(R))\) satisfying \(\Delta w = 0\) in B(R) there holds
Proof
Divide the interval \((r_3,r_4)\) in \(k+1\) subintervals \((R_j,R_{j+1})\) of equal length with \(R_j=r_3 + j \delta R\), \(R_0=r_3\), \(R_{k+1} = r_4, \delta R = R_{j+1} - R_j = (r_4 - r_3)/(k+1)\). For an index \(j=0,\ldots ,k\) choose a \(\chi \in C_0^\infty (B(R_{j+1}))\) such that \(\chi = 1\) in \(B(R_j)\) and write \(v = \chi y\), where \(y \in H^1(B(R_{j+1}))\) and \(\Delta y = 0\). Then, if \([\Delta , \chi ]\) denotes the commutator \(\Delta \chi - \chi \Delta \),
and therefore by (6) we have that
Here in the last step we used that
Let \(y= D^{k-j} w\) where \(D^{k-j}\) denotes an arbitrary partial derivative of order \(k-j\), \(j = 0,\ldots ,k\). Then \(\Delta y = 0\). It follows from equation (28) that
By applying this to all partial derivatives of order \(k-j\), we see that
Hence by applying this inequality sequentially for \(j = 0,\ldots ,k\) we see that
This concludes the proof. \(\square \)
Theorem 2
Let \(0<r_1<r_2<r_3<R\) and let \(\omega = B(r_1)\), \(B = B(r_2)\), \(\Omega = B(r_3)\). Let \(\alpha \in (0,1)\) be the optimal exponent in Theorem 1. Let \(u\in H^{k+1}(B(R))\) satisfy \(\Delta u = 0\) in B(R) and let \(q = u|_\omega \). Consider a family of mappings \(\{F_h\}_{h>0},\, F_h: L^2(\omega ) \rightarrow H^1(\Omega ),\, F_h(q + \delta q) =: u_h \), for all \(\delta q \in L^2(\omega )\). Then there exist no \(\alpha _1, \alpha _2 \in (0,1)\) with \(\frac{\alpha _1}{1+\alpha _1 - \alpha _2} > \alpha \), such that
In particular, there exist no \(\alpha _1, \alpha _2 \in [\alpha ,1)\) with \(\alpha _1 > \alpha \) or \(\alpha _2 > \alpha \) such that (29) holds.
Proof
We give a proof by contradiction. Assume that there exist \(\alpha _1, \alpha _2 \in (0,1)\) with
such that (29) holds. Taking \(u=0\) and \(\delta q = {\tilde{u}}|_\omega \) for \({\tilde{u}}\) satisfying \(\Delta {\tilde{u}} = 0\) in B(R), the estimate (29) reduces to
Using (29) again with \(u = {\tilde{u}}\) and \(\delta q = 0\), we get
Hence
We will write \(u = {\tilde{u}}\) from now on, and recall that u is an arbitrary solution to \(\Delta u = 0\) in B(R). For a nonzero u, we define
and choose \(h > 0\) such that
that is,
With this choice, inequality (30) reduces to
which trivially holds for the zero solution also. Observe that (31) would right away contradict the optimality of \(\alpha \) in Theorem 1 if the \(H^{k+1}\)-norm on its right-hand side was an \(L^2\)-norm. To weaken this norm, we can use Lemma 1 to get that \(\Vert u\Vert _{H^{k+1}(B(r_3))} \lesssim \Vert u\Vert _{L^2(B(r_4))}\) for \(r_3< r_4 < R\). Hence, using this bound in (31) leads to
We now denote by \(\hat{\alpha }\) the optimal exponent corresponding to \(r_4\) in the three-ball estimate in Theorem 1, for which
for any harmonic function u. This means that such an inequality cannot hold with an exponent larger than \(\hat{\alpha }\). However, since \(\hat{\alpha }\) depends continuously on \(r_4\), by considering \(r_4 > r_3\) sufficiently close to \(r_3\) we can get \(\hat{\alpha }\) arbitrarily close to \(\alpha \), i.e. \(\tilde{\alpha }> \hat{\alpha } > \alpha \). Thus inequality (32) holds with \(\tilde{\alpha } > \hat{\alpha }\), which contradicts the optimality of \(\hat{\alpha }\) in (33).
Let us finally show that if \(\alpha _1, \alpha _2 \in [\alpha ,1)\) with \(\alpha _1 > \alpha \) or \(\alpha _2 > \alpha \), then \({\tilde{\alpha }} > \alpha \). Consider first the case \(\alpha _1 < \alpha _2\). As \(\alpha _j \in [\alpha ,1)\), \(j=1,2\), there holds \(\alpha _2 - \alpha _1 \in (0,1)\) and
For the case \(\alpha _1 \ge \alpha _2\), we have that \(\alpha _1 = \alpha + \epsilon \) for some \(\epsilon > 0\), and
which concludes the proof. \(\square \)
Remark 4
The proof of Theorem 2 is still valid if we assume (29) to hold with a weaker norm instead of \(\Vert u\Vert _{H^{k+1}(\Omega )}\).
Remark 5
The approximation \(u_h\) in Theorem 2 depends only on h and \(q+\delta q\). This result does not exclude the possibility of a regularisation method that uses more information, for example the size of the perturbation \(\delta q\). The optimal method that we present in the following section can also use this information, see Remark 6.
3 Primal-Dual Finite Element Methods with Weakly Consistent Regularisation
In this section we will use a finite element method with weakly consistent stabilisation to construct a sequence of approximate solutions for unique continuation (2) that satisfy the error estimate (24), showing that the optimal convergence for this ill-posed problem can be attained by a discrete approximation method. This discussion is based on ideas from [11, 14], modified to match the assumptions of the theoretical developments above.
Let \(\{{\mathcal {T}}\}\) be a quasi-uniform family of triangulations of \(\Omega \), where triangles T with curved boundaries are allowed so that the the covering of \(\Omega \) is exact [4, 49]. On these meshes we define a \(C^0\) finite element space \(V_h \subset H^1(\Omega )\), consisting of piecewise polynomials of order k (after mapping of the triangles to a reference element). We also let \(V_{0h} = V_h \cap H^1_0(\Omega )\). It then follows that there exist interpolants \(\Pi _h:H^{1}(\Omega ) \mapsto V_h\) [4, Corollary 4.1] and \(\Pi _h^0: H^1_0(\Omega ) \mapsto V_{0\,h}\) [4, Corollary 5.2] for which the following interpolation estimates hold
where \(\Delta _T:= \{T' \in \mathcal {T}: T' \cap T \ne \emptyset \}\) and \(D^2 u\) is the Hessian of u, and
We will also use the broken norm defined by
To set up the numerical method, we formulate the continuation problem (2) as pde-constrained optimisation and consider the Lagrangian \(L_h: V_h \times V_{0h} \rightarrow {\mathbb {R}}\),
By taking its saddle points, we define the finite element method as follows: find \((u_h,z_h) \in V_h \times V_{0h}\) such that
for all \((v_h,w_h) \in V_h \times V_{0h}\), with
where F denotes a face of a triangle T and the jump of the gradient over a face F is defined by \(\llbracket \nabla u_h \rrbracket _F:= \nabla u_h\vert _{T_1} n_{T_1} + \nabla u_h\vert _{T_2} n_{T_2}\) for \(F = {\bar{T}}_1 \cap {\bar{T}}_2\), with \(n_{T_m}\) the outward pointing unit normal of the triangle \(T_m\). For a more compact formulation we introduce the global form \(A_h\),
to write: find \((u_h,z_h) \in V_h \times V_{0h}\) such that
for all \((v_h,w_h) \in V_h \times V_{0h}\). Observe that this form satisfies the consistency property
To show that this method satisfies the error bound (24), we only need to verify that it satisfies (21), (22) and (23) (which represent the design criteria for the method). To this end we introduce the norm
and we observe that the formulation satisfies the positivity property
which ensures the existence of a discrete solution \((u_h, z_h) \in V_h \times V_{0\,h}\) for all \(h>0\).
We proceed by first proving convergence in the S-norm, which immediately gives (22). The proof of the other two bounds and satisfaction of (24) then follow as a corollary. First we establish an approximation result for the S norm.
Lemma 2
Let \(v \in H^{k+1}(\Omega )\), then there holds
Proof
By the definition of the S-norm we see that
By the approximation property (34) we have that
For the term measuring the jump of \(\Pi _h v\) over element faces we note that
where we used the regularity of v and the trace inequality [40]
We conclude by applying (34) once again and summing over all the faces. \(\square \)
Proposition 1
Let \((u_h,z_h)\) denote the solution to (38) and let \(u \in H^{k+1}(\Omega )\) be the solution to (2), then there holds
Proof
First we decompose the error \(u -u_h = u - \Pi _h u + \underbrace{\Pi _h u - u_h}_{=:e_h}\) in the continuous and discrete parts. By the triangle inequality and Lemma 2 it is enough to bound \(|\!|\!|(e_h,z_h)|\!|\!|_S\). Using (40) and (39) we have
For the last two terms on the right hand side we have
Finally, the following continuity holds
where
To prove the continuity (41) recall that by definition
Using the Cauchy–Schwarz inequality we have that
and
We end the proof by observing that by equation (34) and Lemma 2 there holds
\(\square \)
Corollary 1
Under the same hypothesis as for Proposition 1 there holds
and
Finally \(u_h\) satisfies the error bound (24).
Proof
First we observe that the third claim is an immediate consequence of the first two and Proposition 1. Indeed, this follows from the discussion of Sect. 2.2, using the error bound (20) and Eqs. (21)–(23).
The first inequality is immediate by Proposition 1 observing that
and, for the first term in the right hand side,
For the second inequality, by definition
Using (39), followed by integration by parts, we see that for all \(w_h \in V_{0h}\)
Choosing \(w_h = \Pi ^0_h w\) and using the Cauchy–Schwarz inequality in the first term of the right hand side and the continuity of a in the second, followed by (35), we see that
The conclusion now follows using Proposition 1 to obtain the desired bound
\(\square \)
Remark 6
Both for the well-posed problem (4) and the ill-posed problem (2) there is a lower bound for how well the exact solution can be approximated if the data are perturbed. In the well-posed case the limit is trivially given by \(\Vert \delta f\Vert _{{V_0'}}\) in (9), whereas in the ill-posed case the lower bound occurs when
see (26). If \(h_{\min }\) is known, the numerical scheme can be designed to stagnate at the level of the best approximation, by modifying the last term in the definition of the stabilisation (37) to read \(\max (h,h_{\min })^{2k} (u_h,v_h)_{L^2(\Omega )} \). This shows the connection between this stabilising term and classical Tikhonov regularisation and similar tools as for the latter can be applied here to optimise the parameter compared to perturbations in data. It is straighforward to show that this leads to stagnation at
Here the implicit constant may depend on k. We see that increasing k will increase the value of \(h_{\min }\), so that the best approximation is obtained on a coarser mesh. However due to the k dependence of \(\Vert u\Vert _{H^{k+1}(\Omega )}\) stagnation may take place on a different, potentially higher, value of the error. A similar kind of bound was obtained in [21, Theorem 2.2].
4 Conclusion
In this paper we have shown that the convergence order of the approximation error for unique continuation problems, obtained by combining the approximation orders of the data fitting and the pde-residual with the conditional stability, can not be improved without increasing the sensitivity to perturbations. This shows that the asymptotic accuracy of the methods for unique continuation discussed in [10, 11, 14,15,16,17, 21] is optimal, in the sense that it is impossible to design a method with better convergence properties. The only remaining possibilities to enhance the accuracy of approximation methods is either to resort to adaptivity, or to introduce some additional a priori assumption to make the continuous problem more stable, such as finite dimensionality of target quantities (see [19]).
References
Alessandrini, G., Rondi, L., Rosset, E., Vessella, S.: The stability for the Cauchy problem for elliptic equations. Inverse Problems 25(12), 123004, 47 (2009). https://doi.org/10.1088/0266-5611/25/12/123004.
Armstrong, S., Kuusi, T., Smart, C.: Optimal unique continuation for periodic elliptic equations on large scales (2021). Preprint arXiv:2107.14248
Babuška, I.: Error-bounds for finite element method. Numer. Math. 16, 322–333 (1970/71). https://doi.org/10.1007/BF02165003.
Bernardi, C.: Optimal finite-element interpolation on curved domains. SIAM J. Numer. Anal. 26(5), 1212–1240 (1989). https://doi.org/10.1137/0726068.
Boulakia, M., Burman, E., Fernández, M.A., Voisembert, C.: Data assimilation finite element method for the linearized Navie-Stokes equations in the low Reynolds regime. Inverse Problems 36(8), 085003–85024 (2020). https://doi.org/10.1088/1361-6420/ab9161.
Bourgeois, L.: A mixed formulation of quasi-reversibility to solve the Cauchy problem for Laplace’s equation. Inverse Problems 21(3), 1087–1104 (2005). https://doi.org/10.1088/0266-5611/21/3/018.
Bourgeois, L., Recoquillay, A.: A mixed formulation of the Tikhonov regularization and its application to inverse PDE problems. ESAIM Math. Model. Numer. Anal. 52(1), 123–145 (2018). https://doi.org/10.1051/m2an/2018008.
Brummelhuis, R.: Three-spheres theorem for second order elliptic equations. Journal d’Analyse Mathematique 65(1), 179–206 (1995)
Burman, E.: Stabilized finite element methods for nonsymmetric, noncoercive, and ill-posed problems. Part I: Elliptic equations. SIAM J. Sci. Comput. 35(6), A2752–A2780 (2013). https://doi.org/10.1137/130916862.
Burman, E.: Error estimates for stabilized finite element methods applied to ill-posed problems. C. R. Math. Acad. Sci. Paris 352(7-8), 655–659 (2014). https://doi.org/10.1016/j.crma.2014.06.008.
Burman, E.: Stabilised finite element methods for ill-posed problems with conditional stability. In: Building bridges: connections and challenges in modern approaches to numerical partial differential equations, Lect. Notes Comput. Sci. Eng., vol. 114, pp. 93–127. Springer, [Cham] (2016)
Burman, E., Feizmohammadi, A., Oksanen, L.: A finite element data assimilation method for the wave equation. Math. Comp. 89(324), 1681–1709 (2020). https://doi.org/10.1090/mcom/3508.
Burman, E., Hansbo, P.: Stabilized nonconforming finite element methods for data assimilation in incompressible flows. Math. Comp. 87(311), 1029–1050 (2018). https://doi.org/10.1090/mcom/3255.
Burman, E., Hansbo, P., Larson, M.G.: Solving ill-posed control problems by stabilized finite element methods: an alternative to Tikhonov regularization. Inverse Problems 34(3), 035004, 36 (2018). https://doi.org/10.1088/1361-6420/aaa32b.
Burman, E., Larson, M.G., Oksanen, L.: Primal-dual mixed finite element methods for the elliptic Cauchy problem. SIAM J. Numer. Anal. 56(6), 3480–3509 (2018). https://doi.org/10.1137/17M1163335.
Burman, E., Nechita, M., Oksanen, L.: Unique continuation for the Helmholtz equation using stabilized finite element methods. J. Math. Pures Appl. (9) 129, 1–22 (2019). https://doi.org/10.1016/j.matpur.2018.10.003.
Burman, E., Nechita, M., Oksanen, L.: A stabilized finite element method for inverse problems subject to the convection-diffusion equation. I: diffusion-dominated regime. Numer. Math. 144(3), 451–477 (2020). https://doi.org/10.1007/s00211-019-01087-x.
Burman, E., Oksanen, L.: Data assimilation for the heat equation using stabilized finite element methods. Numer. Math. 139(3), 505–528 (2018). https://doi.org/10.1007/s00211-018-0949-3.
Burman, E., Oksanen, L.: Finite element approximation of unique continuation of functions with finite dimensional trace (2023). Preprint arXiv:2305.06800
Céa, J.: Approximation variationnelle des problèmes aux limites. Ann. Inst. Fourier (Grenoble) 14(fasc. 2), 345–444 (1964)
Dahmen, W., Monsuur, H., Stevenson, R.: Least squares solvers for ill-posed PDEs that are conditionally stable. ESAIM Math. Model. Numer. Anal. 57(4), 2227–2255 (2023). https://doi.org/10.1051/m2an/2023050.
Engl, H.W.: Regularization by least-squares collocation. In: Numerical treatment of inverse problems in differential and integral equations (Heidelberg, 1982), Progr. Sci. Comput., vol. 2, pp. 345–354. Birkhäuser Boston, Boston, MA (1983)
Engl, H.W., Neubauer, A.: On projection methods for solving linear ill-posed problems. In: Model optimization in exploration geophysics (Berlin, 1986), Theory Practice Appl. Geophys., vol. 1, pp. 73–92. Friedr. Vieweg, Braunschweig (1987)
Engl, H.W., Neubauer, A.: Convergence rates for Tikhonov regularization in finite-dimensional subspaces of Hilbert scales. Proc. Amer. Math. Soc. 102(3), 587–592 (1988). https://doi.org/10.2307/2047228.
Escauriaza, L., Fernández, F.J., Vessella, S.: Doubling properties of caloric functions. Appl. Anal. 85(1-3), 205–223 (2006). https://doi.org/10.1080/00036810500277082.
Hämarik, U., Avi, E., Ganina, A.: On the solution of ill-posed problems by projection methods with a posteriori choice of the discretization level. Math. Model. Anal. 7(2), 241–252 (2002)
Helfrich, H.P.: Optimale lineare Approximation beschränkter Mengen in normierten Räumen. J. Approximation Theory 4, 165–182 (1971). https://doi.org/10.1016/0021-9045(71)90027-x.
Ito, K., Jin, B.: Inverse problems, Series on Applied Mathematics, vol. 22. World Scientific Publishing Co. Pte. Ltd., Hackensack, NJ (2015). Tikhonov theory and algorithms
John, F.: Continuous dependence on data for solutions of partial differential equations with a prescribed bound. Comm. Pure Appl. Math. 13, 551–585 (1960)
Kaltenbacher, B.: Regularization by projection with a posteriori discretization level choice for linear and nonlinear ill-posed problems. Inverse Problems 16(5), 1523–1539 (2000). https://doi.org/10.1088/0266-5611/16/5/322.
Lattès, R., Lions, J.L.: The method of quasi-reversibility. Applications to partial differential equations. Translated from the French edition and edited by Richard Bellman. Modern Analytic and Computational Methods in Science and Mathematics, No. 18. American Elsevier Publishing Co., Inc., New York (1969)
Lax, P.D., Milgram, A.N.: Parabolic equations. In: Contributions to the theory of partial differential equations, Annals of Mathematics Studies, no. 33, pp. 167–190. Princeton University Press, Princeton, N. J. (1954)
Lin, C.L., Nakamura, G., Wang, J.N.: Optimal three-ball inequalities and quantitative uniqueness for the Lamé system with Lipschitz coefficients. Duke Math. J. 155(1), 189–204 (2010). https://doi.org/10.1215/00127094-2010-054.
Lin, C.L., Uhlmann, G., Wang, J.N.: Optimal three-ball inequalities and quantitative uniqueness for the Stokes system. Discrete Contin. Dyn. Syst. 28(3), 1273–1290 (2010). https://doi.org/10.3934/dcds.2010.28.1273.
Lukas, M.A.: Convergence rates for regularized solutions. Math. Comp. 51(183), 107–131 (1988). https://doi.org/10.2307/2008582.
Mathé, P., Pereverzev, S.V.: Optimal discretization of inverse problems in Hilbert scales. Regularization and self-regularization of projection methods. SIAM J. Numer. Anal. 38(6), 1999–2021 (2001). https://doi.org/10.1137/S003614299936175X.
Miller, K.: Three circle theorems in partial differential equations and applications to improperly posed problems. Ph.D. thesis, Rice University (1962)
Miller, K.: Stabilized quasi-reversibility and other nearly-best-possible methods for non-well-posed problems. In: Symposium on Non-Well-Posed Problems and Logarithmic Convexity (Heriot-Watt Univ., Edinburgh, 1972), pp. 161–176. Lecture Notes in Math., Vol. 316 (1973)
Mishra, S., Molinaro, R.: Estimates on the generalization error of physics-informed neural networks for approximating a class of inverse problems for PDEs. IMA Journal of Numerical Analysis 42(2), 981–1022 (2022). https://doi.org/10.1093/imanum/drab032.
Monk, P., Süli, E.: The adaptive computation of far-field patterns by a posteriori error estimation of linear functionals. SIAM J. Numer. Anal. 36(1), 251–274 (1999). https://doi.org/10.1137/S0036142997315172.
Natterer, F.: The finite element method for ill-posed problems. RAIRO Anal. Numér. 11(3), 271–278 (1977). https://doi.org/10.1051/m2an/1977110302711.
Natterer, F.: Regularisierung schlecht gestellter Probleme durch Projektionsverfahren. Numer. Math. 28(3), 329–341 (1977). https://doi.org/10.1007/BF01389972.
Natterer, F.: Error bounds for Tikhonov regularization in Hilbert scales. Applicable Anal. 18(1-2), 29–37 (1984). https://doi.org/10.1080/00036818408839508.
Nitsche, J.: Lineare Spline-Funktionen und die Methoden von Ritz für elliptische Randwertprobleme. Arch. Rational Mech. Anal. 36, 348–355 (1970). https://doi.org/10.1007/BF00282271.
Tikhonov, A.N., Arsenin, V.Y.: Solutions of ill-posed problems. V. H. Winston & Sons, Washington, D.C.: John Wiley & Sons, New York-Toronto, Ont.-London (1977). Translated from the Russian, Preface by translation editor Fritz John, Scripta Series in Mathematics
Trefethen, L.N.: Quantifying the ill-conditioning of analytic continuation. BIT Numerical Mathematics 60(4), 901–915 (2020). https://doi.org/10.1007/s10543-020-00802-7.
Trefethen, L.N.: Numerical analytic continuation. Japan Journal of Industrial and Applied Mathematics 40(3), 1587–1636 (2023). https://doi.org/10.1007/s13160-023-00599-2.
Zlámal, M.: On the finite element method. Numer. Math. 12, 394–409 (1968). https://doi.org/10.1007/BF02161362.
Zlámal, M.: Curved elements in the finite element method. I. SIAM J. Numer. Anal. 10, 229–240 (1973). https://doi.org/10.1137/0710022.
Acknowledgements
The authors thank Tuomo Kuusi for valuable advice on three-ball inequalities, and the two anonymous reviewers for their valuable comments and suggestions, which have improved the paper.
Author information
Authors and Affiliations
Corresponding author
Additional information
Communicated by Rob Stevenson.
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
E.B.: supported by the EPSRC Grants EP/T033126/1 and EP/V050400/1. M. N.: this work was supported by the project “The Development of Advanced and Applicative Research Competencies in the Logic of STEAM + Health” /POCU/993/6/13/153310, project co-financed by the European Social Fund through The Romanian Operational Programme Human Capital 2014–2020. L. O.: Co-funded by the European Union (ERC, LoCal, 101086697). Views and opinions expressed are however those of the authors only and do not necessarily reflect those of the European Union or the European Research Council. Neither the European Union nor the granting authority can be held responsible for them. Co-funded by the Research Council of Finland (347715, 3530969).
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Burman, E., Nechita, M. & Oksanen, L. Optimal Approximation of Unique Continuation. Found Comput Math (2024). https://doi.org/10.1007/s10208-024-09655-w
Received:
Revised:
Accepted:
Published:
DOI: https://doi.org/10.1007/s10208-024-09655-w
Keywords
- Unique continuation
- Ill-posed problems
- Conditional stability
- Approximation methods
- Finite element methods
- Stabilised methods
- Regularisation
- Error estimates
- Optimality
- Optimal convergence