1 Introduction

One main area of application for stiff integration methods is semi-discretizations in space of time-dependent partial differential equations in the method-of-lines (MOL) approach. In order to test new methods in this area, one may rely on PDE test problems with a known analytic solution or on reference solutions computed by other numerical methods. However, both approaches have their drawbacks: for PDE problems the accuracy is limited by the level of the space discretization error, and for computed reference solutions one has to trust the reliability of the code used. This was our motivation to develop the present test problems with exact discrete solutions for a finite difference semi-discretization in space on arbitrarily fine grids.

It is known that, in contrast to multistep-type methods, one-step methods may suffer from order reduction when applied to MOL systems, especially with time-dependent boundary conditions, see [7, 8]. Motivated by our recent work [5, 6] on Peer two-step methods in optimal control, the present example is formulated as a problem with boundary control.

The paper is organized as follows. In Sect. 2, we apply a finite difference discretization on a shifted equi-spaced grid to the 1D heat equation with general Robin boundary conditions and derive exact formulas for the solutions of the discrete heat equation and of an optimal boundary control problem. In Sect. 3, these analytical solutions are used in a sparse setting to study the numerically observed convergence orders of several one-step and two-step integration methods that are suitable for optimal control. Conclusions are given in Sect. 4.

2 A Discrete Heat Equation with Boundary Control

2.1 Finite Difference Discretization of the 1D Heat Equation

We consider the initial-boundary-value problem for a function \(Y(x,t)\) governed by the heat equation

$$\begin{aligned} \partial _tY(x,t) =&\,\partial _{xx}Y(x,t),\ (x,t)\in [0,1]\times [0,T], \end{aligned}$$
(1)
$$\begin{aligned} \partial _xY(0,t) =&\,0,\; \beta _0 Y(1,t)+\beta _1 \partial _xY(1,t)=u(t),\\ Y(x,0)=&\, \varPsi (x),\nonumber \end{aligned}$$
(2)

where \(\varPsi (x)\) and u(t) are given functions. The homogeneous Neumann condition at \(x=0\) may be considered as a shortcut for space-symmetric solutions \(Y(-x,t)\equiv Y(x,t)\). The coefficients of the general Robin boundary condition are nonnegative, \(\beta _0,\beta _1\ge 0\), and nontrivial, \((\beta _0,\beta _1)\not =(0,0)\).

Equation (1) is approximated by finite differences with a shifted equi-spaced grid with step size \(h=1/m\), \(m\in {{\mathbb {N}}} \):

$$\begin{aligned} x_j=\left( j-\frac{1}{2}\right) h,\ j=1,\ldots , m. \end{aligned}$$

For the approximation of the boundary conditions, the outside points \(x_0=-h/2\) and \(x_{m+1}=1+h/2\) are also considered temporarily. In the method-of-lines approach with central differences, approximations \(y_j(t),\,j=1,\ldots ,m\), are defined by the differential equations

$$\begin{aligned} y_j'=\frac{1}{h^2}\left( y_{j-1}-2y_j+y_{j+1}\right) ,\ j=2,\ldots ,m-1, \end{aligned}$$
(3)

for the grid points away from the boundary. The symmetric difference approximation \(0{\mathop {=}\limits ^{!}}h\,\partial _xY(0,t)\cong y_1-y_0\) leads to the symmetry condition \(y_0\equiv y_1\) and yields the MOL equation

$$\begin{aligned} y_1'=\frac{-y_1+y_2}{h^2}. \end{aligned}$$
(4)

In a similar way, the Robin boundary condition is approximated by the equation

$$\begin{aligned} \beta _0\frac{y_m+y_{m+1}}{2}+\beta _1\frac{y_{m+1}-y_m}{h}=u(t), \end{aligned}$$

which may be solved for \(y_{m+1}\) by

$$\begin{aligned} y_{m+1}=\frac{2\beta _1-\beta _0h}{2\beta _1+\beta _0h}y_m+ \frac{2h}{2\beta _1+\beta _0h}u(t). \end{aligned}$$

Thus, \(y_{m+1}\) may be eliminated from Eq. (3) with \(j\!=\!m\) yielding

$$\begin{aligned} y_m'=&\frac{1}{h^2}(y_{m-1}-\theta y_m)+\gamma \,u(t) \end{aligned}$$
(5)

with

$$\begin{aligned} \theta =\frac{2\beta _1+3\beta _0h}{2\beta _1+\beta _0h}=3-\frac{4\beta _1}{2\beta _1+\beta _0h}, \quad \gamma =\frac{2}{(2\beta _1+\beta _0h)h}. \end{aligned}$$
(6)

Hence, we have \(\theta \!=\!3\) for the Dirichlet condition (\(\beta _1=0\)) and \(\theta \!=\!1\) for the pure Neumann condition (\(\beta _0=0\)). Collecting Eqs. (3), (4) and (5), the following MOL system for the vector \(y(t)=\big (y_j(t)\big )_{j=1,\ldots ,m}\) is obtained:

$$\begin{aligned} y'=&My+\gamma e_mu(t), \end{aligned}$$
(7)
$$\begin{aligned} M =&\frac{1}{h^2}\begin{pmatrix} -1&{}1\\ 1&{}-2&{}1\\ &{}&{}\ddots &{}\ddots &{}\ddots \\ &{}&{}&{}1&{}-2&{}1\\ &{}&{}&{}&{}1&{}-\theta \end{pmatrix}, \end{aligned}$$
(8)

where \(e_m\) is the m-th unit vector. The initial conditions are simple evaluations of the function \(\varPsi \) on the grid,

$$\begin{aligned} y(0)=\psi ,\quad \psi =\left( \varPsi (x_j)\right) _{j=1}^m. \end{aligned}$$
(9)
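For illustration, a minimal sketch of how the system (7)–(9) may be assembled is given below (Python/NumPy). The function and variable names are ours and not part of the paper; this is an assumption-laden illustration, not the authors' code.

```python
import numpy as np

def build_mol_system(m, beta0, beta1):
    """Assemble M, gamma, and the shifted grid of (7)-(9) for Robin coefficients beta0, beta1."""
    h = 1.0 / m
    theta = (2*beta1 + 3*beta0*h) / (2*beta1 + beta0*h)      # Eq. (6)
    gamma = 2.0 / ((2*beta1 + beta0*h) * h)                  # Eq. (6)
    M = np.diag(-2.0*np.ones(m)) + np.diag(np.ones(m-1), 1) + np.diag(np.ones(m-1), -1)
    M[0, 0] = -1.0            # Neumann condition at x=0, Eq. (4)
    M[-1, -1] = -theta        # Robin condition at x=1, Eq. (5)
    M /= h**2
    x = (np.arange(1, m+1) - 0.5) * h                        # x_j = (j - 1/2) h
    return M, gamma, x

# Dirichlet example (beta0=1, beta1=0): theta = 3 and gamma = 2/h^2.
M, gamma, x = build_mol_system(10, 1.0, 0.0)
psi = np.ones(10)             # y(0) = (Psi(x_j))_j for Psi(x) = 1, cf. (9)
```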

The basis of our construction is that the eigenvalues and eigenvectors of the symmetric matrix M can be given in closed form; for special values of \(\theta \), at least, they are well known.

Lemma 2.1

For \(m\ge 2\), the eigenvalues of the matrix \(M\in {{\mathbb {R}}} ^{m\times m}\) from (8) are given by

$$\begin{aligned} \lambda _k=-4m^2\sin ^2\left( \frac{\omega _k}{2m}\right) ,\ k=1,\ldots ,m, \end{aligned}$$
(10)

where \(\omega _k,\,k=1,\ldots ,m,\) are the m first nonnegative solutions of the equation

$$\begin{aligned} \tan (\omega )\tan \left( \frac{\omega }{2m}\right) =\frac{\beta _0}{2m\beta _1}, \end{aligned}$$
(11)

with the convention that \(\omega _k=(k-\frac{1}{2})\pi \), \(k=1,\ldots ,m,\) for \(\beta _1=0\). The corresponding normalized eigenvectors \(v^{[k]}\) have the components

$$\begin{aligned} v_j^{[k]}=&\nu _k\cos \left( \omega _k\frac{2j-1}{2m}\right) ,\ j=1,\ldots ,m, \end{aligned}$$
(12)

with constants \(\nu _k=2/\sqrt{2m+\sin (2\omega _k)/\sin (\omega _k/m)}\).

Proof

In the main Eq. (3), the ansatz \(v=\big (\Re e^{i\omega x_j}\big )_{j=1}^m\) gives

$$\begin{aligned} \frac{1}{h^2}(v_{j-1}-2v_j+v_{j+1}) =&\frac{1}{h^2}\Re e^{i\omega x_j}\left( e^{-i\omega h}-2+e^{i\omega h}\right) =-\frac{4}{h^2}\sin ^2\left( \frac{\omega h}{2}\right) v_j. \end{aligned}$$

In the first equation, we have

$$\begin{aligned} \frac{1}{h^2}(-v_1+v_2)&=\frac{1}{h^2}\left( -\cos {\frac{\omega h}{2}}+ \cos \left( 3\frac{\omega h}{2}\right) \right) =\frac{4}{h^2}\left( \cos ^3\left( \frac{\omega h}{2}\right) -\cos {\frac{\omega h}{2}}\right) \\&=-\frac{4}{h^2}\sin ^2\left( \frac{\omega h}{2}\right) v_1, \end{aligned}$$

with the same factor \(\lambda \!:=\!-(4/h^2)\sin ^2\big ({\omega h}/2\big )\). In order to satisfy the eigenvalue condition in the last component, we consider the equation \(0=e_m^\textsf{T}(Mv-\lambda v)\), i.e.,

$$\begin{aligned} 0{\mathop {=}\limits ^{!}}&\,v_{m-1}-\left( \theta +\lambda h^2\right) v_m =\cos \left( \omega (x_m-h)\right) -\left( \theta +\lambda h^2\right) \cos (\omega x_m)\\ =&\,\left( \cos (\omega h)+4\sin ^2\left( \frac{\omega h}{2}\right) -\theta \right) \cos (\omega x_m)+\sin (\omega h)\sin (\omega x_m)\\ =&\,\left( 1 +2\sin ^2\left( \frac{\omega h}{2}\right) -\theta \right) \cos (\omega x_m)+\sin (\omega h)\sin (\omega x_m), \end{aligned}$$

since \(\cos (\omega h)\!=\!1-2\sin ^2(\omega h/2)\). The last grid point is \(x_m\!=\!1-h/2\) and with the trigonometric formulas for \(\cos (\omega -\omega h/2),\sin (\omega -\omega h/2)\) and the identity \(\sin (\omega h)=2\sin (\omega h/2)\cos (\omega h/2)\), we may proceed with

$$\begin{aligned} 0=&\,\left( 1+2\sin ^2\left( \frac{\omega h}{2}\right) -\theta \right) \left( \cos (\omega )\cos \left( \frac{\omega h}{2}\right) + \sin (\omega )\sin \left( \frac{\omega h}{2}\right) \right) \\&\,+2\sin \left( \frac{\omega h}{2}\right) \cos \left( \frac{\omega h}{2}\right) \left( \sin (\omega )\cos \left( \frac{\omega h}{2}\right) -\cos (\omega )\sin \left( \frac{\omega h}{2}\right) \right) \\ =&\,(1-\theta )\cos \left( \frac{\omega h}{2}\right) \cos (\omega )\\&\,+\left( 1+2\sin ^2\left( \frac{\omega h}{2}\right) -\theta +2\cos ^2\left( \frac{\omega h}{2}\right) \right) \sin \left( \frac{\omega h}{2}\right) \sin (\omega )\\ =&\,(1-\theta )\cos \left( \frac{\omega h}{2}\right) \cos (\omega ) +(3-\theta )\sin \left( \frac{\omega h}{2}\right) \sin (\omega ). \end{aligned}$$

Hence, the different versions of \(\theta \) in (6) verify the condition (11) for \(m=1/h\). Rearranging (11) as \(\tan (\omega )=\beta _0/(2m\beta _1)\cot (\omega /(2m))\), for \(\beta _0>0\) it is seen that exactly m solutions exist in \((0,m\pi )\) since the function \(\omega \mapsto \cot (\omega /(2m))\) is monotonically decreasing and positive. Finally, the vector norms are computed for \(\omega \not =0\). Abbreviating \(\omega /m=:\varOmega \) and using \(\cos ^2(x)=(1+\cos (2x))/2\), we get

$$\begin{aligned} \sum _{j=1}^m\cos ^2\left( \left( j-\frac{1}{2}\right) \frac{\omega }{m}\right) =&\,\frac{m}{2}+\frac{1}{2}\sum _{j=1}^m\cos \left( (2j-1)\varOmega \right) \\ =&\,\frac{m}{2}+\frac{1}{2}\Re \sum _{j=1}^me^{i(2j-1)\varOmega } =\frac{m}{2}+\frac{1}{2}\Re \frac{e^{i2\varOmega m}-1}{e^{i\varOmega }-e^{-i\varOmega }}\\ =&\,\frac{m}{2}+\frac{1}{4}\Im \frac{e^{i2\varOmega m}-1}{\sin (\varOmega )} =\frac{m}{2}+\frac{1}{4}\frac{\sin (2\omega )}{\sin (\varOmega )}, \end{aligned}$$

which leads to the value of the normalizing factor \(\nu \) in (12). \(\square \)

For later use, we introduce the diagonal matrix \(\Lambda =\,\text { diag}(\lambda _k)\) and the orthogonal matrix \(V=(v^{[1]},\ldots ,v^{[m]})\) satisfying \(M=V\Lambda V^\textsf{T}\).
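As a quick numerical sanity check (ours, not from the paper), the formulas of Lemma 2.1 may be evaluated for the Dirichlet case \(\beta _1=0\), where \(\omega _k=(k-\frac{1}{2})\pi \) and hence \(\nu _k=\sqrt{2/m}\), and compared with the assembled matrix. The helper name is hypothetical.

```python
import numpy as np

def eigenpairs_dirichlet(m):
    """Eigenvalues (10) and normalized eigenvectors (12) for beta1 = 0, omega_k = (k-1/2)*pi."""
    omega = (np.arange(1, m+1) - 0.5) * np.pi
    lam = -4.0 * m**2 * np.sin(omega / (2*m))**2
    j = np.arange(1, m+1)
    V = np.sqrt(2.0/m) * np.cos(np.outer((2*j - 1) / (2*m), omega))   # column k = v^[k]
    return lam, V

m = 10
h = 1.0 / m
lam, V = eigenpairs_dirichlet(m)
M = (np.diag(-2.0*np.ones(m)) + np.diag(np.ones(m-1), 1) + np.diag(np.ones(m-1), -1)) / h**2
M[0, 0] = -1.0 / h**2          # Eq. (4)
M[-1, -1] = -3.0 / h**2        # theta = 3 for Dirichlet
print(np.max(np.abs(M @ V - V * lam)))          # round-off level: M V = V Lambda
print(np.max(np.abs(V.T @ V - np.eye(m))))      # round-off level: V orthogonal
```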

Remark 2.2

The well-known frequencies are \(\omega _k=(k-\frac{1}{2})\pi \) for Dirichlet boundary conditions and \(\omega _k=(k-1)\pi \), \(k=1,\ldots ,m\), for Neumann conditions. For general values \(\beta _0,\beta _1>0\), Eq. (11) may be rewritten in the fixed point form

$$\begin{aligned} \omega =f_k(\omega ):=(k-1)\pi +\arctan \left( \frac{\beta _0}{2m\beta _1}\cot \left( \frac{\omega }{2m}\right) \right) , \quad k=1,\ldots ,m. \end{aligned}$$

The functions \(f_k\) are monotonically decreasing in \(\omega \), and an iteration with initial value \(\omega =\max \{1,(k-1)\pi \}\) converges to the desired solution \(\omega _k\) at least for \(2m>\beta _0/\beta _1\ge 1\), since \(1/|f_k'(\omega )|=(4m^2\beta _1/\beta _0)\sin ^2(\omega /(2m))+(\beta _0/\beta _1)\cos ^2(\omega /(2m))\).
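A minimal sketch of this fixed-point iteration is given below (our own illustration, assuming \(\beta _1>0\); a fixed iteration count is used instead of a convergence test, and the function name is hypothetical).

```python
import numpy as np

def robin_frequencies(m, beta0, beta1, iters=100):
    """Fixed-point iteration omega = f_k(omega) from Remark 2.2 (requires beta1 > 0)."""
    omega = np.empty(m)
    for k in range(1, m + 1):
        w = max(1.0, (k - 1)*np.pi)                    # initial value suggested in the remark
        for _ in range(iters):
            w = (k - 1)*np.pi + np.arctan(beta0/(2*m*beta1) / np.tan(w/(2*m)))
        omega[k - 1] = w
    return omega

m, beta0, beta1 = 20, 1.0, 1.0
om = robin_frequencies(m, beta0, beta1)
# residual of Eq. (11); should be small for converged frequencies
print(np.max(np.abs(np.tan(om) * np.tan(om/(2*m)) - beta0/(2*m*beta1))))
```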

Remark 2.3

The eigenvalues \(\lambda _k\) in (10) are \(O(h^2)\)-approximations of the exact eigenvalues \(\hat{\lambda }_k\!=\!-\varphi _k^2\), where \(y_k(x)=\cos (\varphi _k x),\,k\in {{\mathbb {N}}} ,\) are the eigenfunctions of the boundary value problem with frequencies \(\varphi _k\) satisfying \(\varphi \tan (\varphi )=\beta _0/\beta _1\). For Dirichlet (\(\beta _1=0\)) and Neumann (\(\beta _0=0\)) conditions, the discrete frequencies are exact, \(\omega _k=\varphi _k,\,k=1,\ldots ,m\). Here, Taylor expansion in (10) shows that \(\lambda _k=-4h^{-2}\sin ^2(h\omega _k/2)\cong -\omega _k^2(1-h^2\omega _k^2/12)=\hat{\lambda }_k+O(h^2\omega _k^4)\). This shows convergence of second order for fixed k. However, for \(k\rightarrow m\) the estimate becomes meaningless since then \(h\omega _k=O(1)\). For general Robin conditions, \(\beta _0,\beta _1>0\), an additional error is added since (11) corresponds to \(\beta _0/\beta _1=2h^{-1}\tan (h\omega /2)\tan (\omega )\cong \omega \tan (\omega )(1+h^2\omega ^2/12)\), which is an \(O(h^2)\)-perturbation of the condition for the exact frequencies \(\varphi _k\).
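This second-order behaviour is easy to observe numerically. A small check (ours) for the lowest Dirichlet eigenvalue, where \(\omega _1=\varphi _1=\pi /2\) is exact, so the whole error stems from replacing \(-\omega _1^2\) by \(-4m^2\sin ^2(\omega _1/(2m))\):

```python
import numpy as np

# The error lambda_1 - hat(lambda)_1 should behave like h^2 * omega_1^4 / 12 to leading order.
omega = 0.5 * np.pi
for m in (10, 20, 40, 80):
    lam = -4.0 * m**2 * np.sin(omega / (2*m))**2
    err = abs(lam + omega**2)
    print(m, err, err / (omega**4 / (12.0 * m**2)))   # last ratio tends to 1
```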

2.2 Exact Solution of the Discrete Heat Equation

Knowing the eigenvectors and eigenvalues of the linear problem (7), the computation of its solution is straightforward. The representation \(y(t)=\sum _{k=1,\ldots ,m}\eta _k(t)v^{[k]}\) leads to

$$\begin{aligned} \sum _{k=1}^m\eta _k'(t)v^{[k]}=\sum _{k=1}^m\lambda _k\eta _k(t) v^{[k]}+\gamma e_mu(t). \end{aligned}$$

Since the matrix M is symmetric, the inner product with \(v^{[j]}\) yields the decoupled equations \(\eta _j'(t)=\lambda _j\eta _j(t)+\gamma v_m^{[j]}u(t)\), which can be solved easily leading to the following result.

Lemma 2.4

With the data from Lemma 2.1, the solution of the initial value problem (7), (9) is given by

$$\begin{aligned} y(t)=\sum _{k=1}^m\left( e^{\lambda _kt}{v^{[k]}}^\textsf{T}\psi +\gamma v_m^{[k]}\int _0^t e^{\lambda _k(t-\tau )}u(\tau )\,\textrm{d}\tau \right) v^{[k]}. \end{aligned}$$
(13)
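The following sketch (ours, not the authors' code) evaluates (13) for the Dirichlet case with a general callable control u(t), approximating the convolution integrals by adaptive quadrature. For very stiff modes and large t, a closed-form evaluation of the integral for the specific control would be preferable; the quadrature here is only for illustration.

```python
import numpy as np
from scipy.integrate import quad

def exact_state(t, psi, u, m):
    """Evaluate (13) for beta0 = 1, beta1 = 0 (Dirichlet); u is a callable control."""
    omega = (np.arange(1, m+1) - 0.5) * np.pi
    lam = -4.0 * m**2 * np.sin(omega / (2*m))**2
    V = np.sqrt(2.0/m) * np.cos(np.outer((2*np.arange(1, m+1) - 1) / (2*m), omega))
    gamma = 2.0 * m**2                                   # gamma = 2/h^2 for Dirichlet
    eta = np.exp(lam * t) * (V.T @ psi)                  # homogeneous part of (13)
    for k in range(m):                                   # control contribution, mode by mode
        integral, _ = quad(lambda s: np.exp(lam[k]*(t - s)) * u(s), 0.0, t)
        eta[k] += gamma * V[-1, k] * integral
    return V @ eta

m = 20
y = exact_state(0.1, np.ones(m), lambda s: np.sin(np.pi * s), m)
```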

Remark 2.5

The presence of the terms \(v_m^{[k]}\) indicates that simple sparse solutions with only a few terms in (13) may not exist due to the inhomogeneous boundary condition (2).

2.3 Exact Solution of an Optimal Control Problem

The inhomogeneity u(t) inherited from the boundary condition (2) may be considered as a control to approach a given target profile \({\hat{y}}\in {{\mathbb {R}}} ^m\) at some given time \(T>0\). In an optimal control context, one seeks a control that minimizes an objective function such as

$$\begin{aligned} C=\frac{1}{2}\Vert y(T)-{\hat{y}}\Vert _2^2+\frac{\alpha }{2}\int _0^T u(t)^2dt, \end{aligned}$$

with the Euclidean vector norm \(\Vert \cdot \Vert _2\) in \( {{\mathbb {R}}} ^m\) and \(\alpha >0\). The unique optimal solution may be computed by using a multiplier function p(t) for the ODE constraint (7) and considering the Lagrangian

$$\begin{aligned} L:=&\,C+\int _0^Tp^\textsf{T}\bigl (y'-My-\gamma e_mu\bigr )\,dt+p^\textsf{T}(0)(y(0)-\psi ) \\ =&\,C -\int _0^T\left( \left( p'\right) ^\textsf{T}y+p^\textsf{T}\bigl ( My+\gamma e_mu \bigr )\right) dt+p^\textsf{T}(T)y(T)-p^\textsf{T}(0)\psi . \end{aligned}$$

The partial derivatives of the Lagrangian L with respect to p(t) and p(0) recover (7) and (9); the remaining ones are

$$\begin{aligned} \partial _{y(t)}L=&\,-p'-M^\textsf{T}p=-p'-M p, \end{aligned}$$
(14)
$$\begin{aligned} \partial _{y(T)}L=&\,y(T)-{\hat{y}}+p(T), \nonumber \\ \partial _{u(t)}L=&\,\alpha u-\gamma e_m^\textsf{T}p. \end{aligned}$$
(15)

Hence, the Karush–Kuhn–Tucker conditions, \(\partial _{(\cdot )}L=0\) in (14)–(15), show that the control u(t) may be eliminated by

$$\begin{aligned} u(t)=\frac{\gamma }{\alpha } e_m^\textsf{T}p(t)=\frac{\gamma }{\alpha }p_m(t), \end{aligned}$$
(16)

and a necessary condition for the optimal solution is that it solves the following boundary value problem:

$$\begin{aligned} y'=&\,My+\frac{\gamma }{\alpha }e_me_m^\textsf{T}p,\quad y(0)=\psi , \end{aligned}$$
(17)
$$\begin{aligned} p'=&\,-Mp,\quad p(T)={\hat{y}} - y(T). \end{aligned}$$
(18)

The homogeneous differential equation (18) for p has the simple solution

$$\begin{aligned} p(t)=e^{(T-t)M}p(T)=\sum _{\ell =1}^m e^{\lambda _\ell (T-t)}v^{[\ell ]}{v^{[\ell ]}}^\textsf{T}({\hat{y}} - y(T)) \end{aligned}$$

with the matrix exponential

$$\begin{aligned} e^{sM} =&\,V \,\text { diag}\left( e^{\lambda _ks}\right) V^\textsf{T}, \quad 0\le s\le T. \end{aligned}$$

With u given by (16), this solution may be used in (13) to yield the solution for (17) with the coefficient functions

$$\begin{aligned} \eta _k(t)=&\,e^{\lambda _k t}\eta _k(0)+\frac{\gamma ^2}{\alpha }v_m^{[k]}\int _0^t e^{\lambda _k(t-\tau )}p_m(\tau )\textrm{d}\tau \nonumber \\ =&\,e^{\lambda _k t}\eta _k(0)+\frac{\gamma ^2}{\alpha }v_m^{[k]}\sum _{\ell =1}^m v_m^{[\ell ]}\int _0^te^{\lambda _k t+\lambda _\ell T-(\lambda _k+ \lambda _\ell )\tau }\textrm{d}\tau \cdot {v^{[\ell ]}}^\textsf{T}({\hat{y}} - y(T)). \end{aligned}$$
(19)

Considering the function \(\varphi _1(z)=\int _0^1 e^{zt}dt\) satisfying \(\varphi _1(z)=(e^z-1)/z\) for \(z\not =0\) and \(\varphi _1(0)=1\), the integral may be written as

$$\begin{aligned} \int _0^te^{\lambda _k t+\lambda _\ell T-(\lambda _k+\lambda _\ell )\tau }\textrm{d}\tau =te^{\lambda _\ell (T-t)}\varphi _1\left( (\lambda _k+\lambda _\ell )t\right) . \end{aligned}$$

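In floating-point arithmetic, \(\varphi _1\) should be evaluated with some care near \(z=0\); a small helper (ours, with a hypothetical name) based on expm1 avoids the cancellation in \((e^z-1)/z\):

```python
import numpy as np

def phi1(z):
    """phi_1(z) = (exp(z) - 1)/z with phi_1(0) = 1; expm1 avoids cancellation for small |z|."""
    z = np.asarray(z, dtype=float)
    safe = np.where(z == 0.0, 1.0, z)
    return np.where(z == 0.0, 1.0, np.expm1(safe) / safe)

print(phi1(np.array([0.0, 1e-12, -50.0])))   # 1.0, ~1.0, ~0.02
```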
The result (19) may be used in different ways. The first application is computing the solution for a given target profile \({\hat{y}}\).

Lemma 2.6

Let \({\hat{y}}\in {{\mathbb {R}}} ^m\) be given. Then, the coefficient vector \(\eta (T)\) of the solution \(y(t)=V\eta (t)\) of the boundary value problem (17), (18) is given by the unique solution of the linear system

$$\begin{aligned} (I+Q)\eta (T)=e^{T\Lambda }\eta (0)+QV^\textsf{T}{\hat{y}}, \end{aligned}$$
(20)

with the positive semi-definite matrix \(Q=(q_{k\ell })_{k,\ell =1}^m\) having the elements

$$\begin{aligned} q_{k\ell }=\frac{\gamma ^2T}{\alpha }v_m^{[k]}\varphi _1\left( (\lambda _k+\lambda _\ell )T\right) v_m^{[\ell ]},\ k, \ell =1,\ldots ,m. \end{aligned}$$
(21)

Proof

At the end point T, the formula (19) simplifies to

$$\begin{aligned} \eta _k(T)=&\,e^{\lambda _k T}\eta _k(0)+\frac{\gamma ^2T}{\alpha }v_m^{[k]}\sum _{\ell =1}^m v_m^{[\ell ]}\varphi _1\left( (\lambda _k+\lambda _\ell )T\right) \cdot \left( {v^{[\ell ]}}^\textsf{T}{\hat{y}}-\eta _\ell (T)\right) . \end{aligned}$$

This equation may be reordered to the form given in (20) with the matrix elements (21). Finally, we consider the quadratic form of the matrix Q with some vector \(w=(w_j)\), obtaining

$$\begin{aligned} w^\textsf{T}Qw=&\,\frac{\gamma ^2T}{\alpha }\sum _{k,\ell =1}^m v_m^{[k]}w_k \int _0^1e^{(\lambda _k+ \lambda _\ell )T\tau }\textrm{d}\tau \cdot v_m^{[\ell ]}w_\ell \\ =&\,\frac{\gamma ^2T}{\alpha }\int _0^1\sum _{k,\ell =1}^m (e^{\lambda _k T\tau }v_m^{[k]}w_k) ( e^{\lambda _\ell T\tau } v_m^{[\ell ]}w_\ell ) \textrm{d}\tau \\ =&\,\frac{\gamma ^2T}{\alpha }\int _0^1\Big (\sum _{k=1}^m e^{\lambda _k T\tau }v_m^{[k]}w_k\Big )^2 \textrm{d}\tau \ge 0. \end{aligned}$$

This means that Q is positive semi-definite, \(I+Q\) is positive definite, and the system (20) always has a unique solution. \(\square \)
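A compact sketch (ours) of Lemma 2.6 for the Dirichlet case: assemble Q from (21), solve (20), and return \(y(T)=V\eta (T)\). The \(\varphi _1\) helper from above is inlined; the target \(\hat{y}\) in the usage line is an arbitrary illustrative choice, and all names are hypothetical.

```python
import numpy as np

def optimal_terminal_state(m, T, alpha, psi, yhat):
    # Dirichlet eigen-data as in Lemma 2.1 with omega_k = (k-1/2)*pi
    omega = (np.arange(1, m+1) - 0.5) * np.pi
    lam = -4.0 * m**2 * np.sin(omega / (2*m))**2
    V = np.sqrt(2.0/m) * np.cos(np.outer((2*np.arange(1, m+1) - 1) / (2*m), omega))
    gamma = 2.0 * m**2
    phi1 = lambda z: np.where(z == 0.0, 1.0, np.expm1(np.where(z == 0.0, 1.0, z))
                              / np.where(z == 0.0, 1.0, z))
    Z = (lam[:, None] + lam[None, :]) * T                              # (lambda_k + lambda_l) T
    Q = (gamma**2 * T / alpha) * np.outer(V[-1, :], V[-1, :]) * phi1(Z)  # Eq. (21)
    rhs = np.exp(T * lam) * (V.T @ psi) + Q @ (V.T @ yhat)
    etaT = np.linalg.solve(np.eye(m) + Q, rhs)                         # Eq. (20)
    return V @ etaT

yT = optimal_terminal_state(m=50, T=1.0, alpha=1.0, psi=np.ones(50), yhat=np.zeros(50))
```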

In general, solutions computed with (20) will not be sparse, i.e., they will have m nontrivial basis coefficients in the state y and the Lagrange multiplier p. Due to the special inhomogeneity in (17), sparse solutions for the state y probably do not exist. However, by adjusting the target profile \({\hat{y}}\), one may simply start with a sparse multiplier p(t) with, for instance, two terms only,

$$\begin{aligned} p(t)= \delta _1 e^{\lambda _1(T-t)}v^{[1]}+\delta _2 e^{\lambda _2(T-t)}v^{[2]}, \end{aligned}$$
(22)

with coefficients \(\delta _1,\delta _2\) chosen to produce some reasonable form of the control u. Then, by the terminal condition in (18), the corresponding target profile has the form

$$\begin{aligned} {\hat{y}}=y(T)+\delta _1 v^{[1]}+\delta _2 v^{[2]}, \end{aligned}$$
(23)

where, by (19), the coefficients of y(T) are given by

$$\begin{aligned} \eta _k(T)=&\,e^{\lambda _k T}\eta _k(0)+\frac{\gamma ^2T}{\alpha }v_m^{[k]} \sum _{\ell =1}^2\delta _\ell v_m^{[\ell ]}\varphi _1\left( (\lambda _k+\lambda _\ell )T\right) . \end{aligned}$$
(24)

We will use this construction in our numerical example.
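A sketch of this construction (ours), specialized for brevity to the Dirichlet case used in the numerical example below: given \(\delta _1,\delta _2\), compute the coefficients (24) of y(T) and the target \(\hat{y}\) from (23). Function and variable names are hypothetical.

```python
import numpy as np

def sparse_target(m, T, alpha, psi, delta):
    """delta = (delta_1, delta_2); Dirichlet eigen-data as in Lemma 2.1."""
    omega = (np.arange(1, m+1) - 0.5) * np.pi
    lam = -4.0 * m**2 * np.sin(omega / (2*m))**2
    V = np.sqrt(2.0/m) * np.cos(np.outer((2*np.arange(1, m+1) - 1) / (2*m), omega))
    gamma = 2.0 * m**2
    phi1 = lambda z: np.where(z == 0.0, 1.0, np.expm1(np.where(z == 0.0, 1.0, z))
                              / np.where(z == 0.0, 1.0, z))
    eta0 = V.T @ psi
    # coefficients (24) of y(T): only the modes l = 1, 2 of the multiplier contribute
    w = delta * V[-1, :2]                                  # delta_l * v_m^[l]
    etaT = np.exp(lam * T) * eta0 \
        + (gamma**2 * T / alpha) * V[-1, :] * (w @ phi1(np.add.outer(lam[:2], lam) * T))
    yT = V @ etaT
    yhat = yT + V[:, 0] * delta[0] + V[:, 1] * delta[1]    # Eq. (23)
    return yhat, yT

m = 250
yhat, yT = sparse_target(m, T=1.0, alpha=1.0, psi=np.ones(m), delta=np.array([-1/75, -1/75]))
```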

3 Test Case: Dirichlet Boundary Control Problem

To illustrate an application of the derived expressions for the exact discrete solutions of the linear heat equation equipped with different boundary conditions, we consider the following ODE-constrained optimal control problem with a boundary control of Dirichlet type:

$$\begin{aligned} \min _{(y,u)} C&\,:=\frac{1}{2}\Vert y(T)-\hat{y}\Vert ^2_2 +\frac{\alpha }{2} \int _0^T u(t)^2\,dt\\ \text {subject to } y'(t)&\, = My(t)+\gamma e_m u(t),\quad t\in (0,T],\\ y(0)&\, = \mathbbm {1}, \end{aligned}$$

with \(T\!=\!1\), \(\gamma \!=\!2/h^2\), \(\alpha \!=\!1\), \(\mathbbm {1}=(1,\ldots ,1)^T\in {{\mathbb {R}}} ^m\), state vector \(y(t)\in {{\mathbb {R}}} ^m\), and M as defined in (8) with \(\theta \!=\!3\). We set \(\delta _1\!=\!\delta _2\!=-\!1/75\) in (22) and compute the target profile \(\hat{y}\in {{\mathbb {R}}} ^m\) from (23) with coefficients for y(T) defined in (24).
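For reference, the exact optimal control of this test case follows from (16) and the two-term multiplier (22). A small sketch of its evaluation (ours; the function name and default arguments are hypothetical, reflecting the test-case data above):

```python
import numpy as np

def exact_control(m, T=1.0, alpha=1.0, d1=-1.0/75, d2=-1.0/75):
    """u(t) = (gamma/alpha) p_m(t) with p from (22); Dirichlet case, gamma = 2/h^2."""
    omega = (np.arange(1, m+1) - 0.5) * np.pi
    lam = -4.0 * m**2 * np.sin(omega / (2*m))**2
    vm = np.sqrt(2.0/m) * np.cos(omega * (2*m - 1) / (2*m))   # m-th components of eigenvectors
    gamma = 2.0 * m**2
    return lambda t: (gamma/alpha) * (d1 * np.exp(lam[0]*(T - t)) * vm[0]
                                      + d2 * np.exp(lam[1]*(T - t)) * vm[1])

u = exact_control(250)
print(u(0.0), u(1.0))
```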

We will compare numerical results for four time integrators of classical order four: the symmetric 2-stage Gauss method (Appendix: Table 1), the symmetric 3-stage partitioned Runge–Kutta pair Lobatto IIIA-IIIB (Appendix: Table 2) and our recently developed two-step Peer methods AP4o43bdf and AP4o43dif [6]. The two one-step methods are symplectic and therefore well suited for optimal control [4, 9].

Two test scenarios are considered. First, the accuracy of the numerical approximations of y(T) and p(0) is studied, where the exact control \(u(t)=\gamma p_m(t)/\alpha \) is used. The terminal value for the multiplier is set to \(p(T)={\hat{y}} - y_\tau (T)\), where \(y_\tau (T)\) is the approximation of y(T) obtained with time step \(\tau \). In this case, the Karush–Kuhn–Tucker system decouples and only two systems of linear ODEs have to be solved. In the second scenario, the optimal control problem is solved for all unknowns (y, p, u) by a gradient-based interior point algorithm as implemented in the MATLAB routine fmincon, see, e.g., [1, 2] for more details, and the errors in the control are discussed.

We first reduce the objective function C(y, u) to the so-called Mayer form, which uses terminal solution values only. Introducing an additional differential equation \(y'_{m+1}(t)=u(t)^2\) with initial value \(y_{m+1}(0)\!=\!0\) and the extended state vector \({\tilde{y}}=(y^\textsf{T},y_{m+1})^\textsf{T}\), the new objective function reads \({\tilde{C}}=(\Vert y(T)-\hat{y}\Vert ^2_2+\alpha y_{m+1}(T))/2\). Let now \(U\in {{\mathbb {R}}} ^{sN}\) denote the vector of approximate control values at the nodes \(t_{ni}\!=\!t_n+c_i\tau \), \(i\!=\!1,\ldots ,s,\) used by an s-stage time integrator on a time grid \(\{t_0,\ldots ,t_N\}\) with step size \(\tau \) [3, 6], and let \({\tilde{C}}(U):={\tilde{C}}({\tilde{y}}(T))\) be the value of the cost functional associated with these discrete controls. The Karush–Kuhn–Tucker system then provides a convenient way to compute the gradient \(\nabla _{U}{\tilde{C}}(U)\) for a given iterate \(U^{(k)}\) in an iterative optimization algorithm: first the forward Eq. (7) is solved for y with the given intermediate values of the control to obtain approximations \(y_\tau \), then the backward Eq. (18) is solved for p using \(y_\tau (T)\) as approximation for y(T). For Runge–Kutta methods, it holds \(\nabla _{u_\tau (t_{ni})}{\tilde{C}}(U)=-\tau b_ip_\tau (t_{ni})^\textsf{T}\nabla _uf(y_\tau (t_{ni}),u_\tau (t_{ni}))\) [3, Formula (27)], where \(y_\tau (t_{ni})\), \(p_\tau (t_{ni})\), and \(u_\tau (t_{ni})\) are the approximations of (y(t), p(t), u(t)) at \(t=t_{ni}\), and \(b_i\) are the weights of the Runge–Kutta method, see Appendix. Similar formulas can be derived for other time integrators. Eventually, we set \(U^{(0)}=0\) as initial guess and call fmincon, providing the gradient of the objective function for each iterate \(U^{(k)}\).

In both test cases, we use \(m\!=\!250\) and \(m\!=\!500\) to also study the influence of the system size. The number of time steps is \(N=2^k\) with \(k=4,\ldots ,11\).
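To give an idea of the first test scenario, the following sketch (ours; not the authors' code) integrates the forward problem \(y'=My+\gamma e_mu(t)\) with the 2-stage Gauss method for one step size, exploiting that the stage system is linear with a constant matrix. Combined with the helpers sketched earlier (build_mol_system, sparse_target, exact_control, all hypothetical names), the error \(\Vert y(T)-y_\tau (T)\Vert _\infty \) over the step sequence \(N=2^4,\ldots ,2^{11}\) reproduces the type of study shown in Fig. 1.

```python
import numpy as np

# 2-stage Gauss method (classical order 4), standard Butcher coefficients.
r = np.sqrt(3.0) / 6.0
A = np.array([[0.25, 0.25 - r], [0.25 + r, 0.25]])
b = np.array([0.5, 0.5])
c = np.array([0.5 - r, 0.5 + r])

def gauss2_linear(M, g, y0, T, N):
    """Fixed-step integration of y' = M y + g(t); the linear stage system
    (I - tau A (x) M) Y = (1 (x) y_n) + tau (A (x) I) G is solved directly."""
    m = len(y0)
    tau = T / N
    S = np.linalg.inv(np.eye(2*m) - tau * np.kron(A, M))   # constant; an LU factorization would be preferable
    AI = np.kron(A, np.eye(m))
    y = y0.copy()
    for n in range(N):
        tn = n * tau
        g1, g2 = g(tn + c[0]*tau), g(tn + c[1]*tau)
        Y = S @ (np.concatenate([y, y]) + tau * (AI @ np.concatenate([g1, g2])))
        f1 = M @ Y[:m] + g1
        f2 = M @ Y[m:] + g2
        y = y + tau * (b[0]*f1 + b[1]*f2)
    return y

# usage idea (with the hypothetical helpers sketched earlier):
#   M, gamma, _ = build_mol_system(250, 1.0, 0.0); u = exact_control(250)
#   em = np.zeros(250); em[-1] = 1.0
#   yT_tau = gauss2_linear(M, lambda t: gamma * u(t) * em, np.ones(250), 1.0, 64)
```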

Fig. 1

Dirichlet heat problem with \(m=250,500\) spatial points and given exact control u(t): \(\Vert y(T)-y_\tau (T)\Vert _\infty \) (left) and \(\Vert p(0)-p_\tau (0)\Vert _\infty \) (right)

Fig. 2

Dirichlet heat problem with \(m=250,500\) spatial points, solved by MATLAB’s gradient-based fmincon with interior point algorithm for (y, p, u): \(\max _{n,i}|u(t_{ni})-u_\tau (t_{ni})|\)

In Fig. 1, results for the first test scenario are shown. Not surprisingly, the serious order reduction for the symplectic one-step Runge–Kutta methods is clearly seen. This phenomenon is well understood and occurs particularly drastically for time-dependent Dirichlet boundary conditions [8]. This drawback is shared by all one-step methods due to their insufficient stage order. Note that the number of affected time steps increases when the system size is doubled. In contrast, the newly designed two-step Peer methods for optimal control problems work quite close to their theoretical order four for the state y and the adjoint p. The order reduction for the one-step methods is also visible for the more challenging fully coupled problem. The results plotted in Fig. 2 show a reduction to first order for the approximation of the control, whereas the two-step methods perform with order two for this problem. We refer to [3] for a discussion of the convergence order for general ODE constrained optimal control problems. Once again, the range of the affected time steps depends on the problem size. It increases for finer spatial discretizations.

4 Conclusions

We have derived exact formulas for the solution of an optimal boundary control problem constrained by a one-dimensional discrete heat equation, including Dirichlet and general Robin boundary conditions. These solutions have been used to compare symplectic Runge–Kutta methods and recently developed Peer two-step methods of order four. The numerically observed convergence orders illustrate a serious order reduction for Runge–Kutta methods, which is much less severe for our Peer two-step methods.