Abstract
In this paper we analyse the wellposedness of the Cauchy problem for a rather general class of hyperbolic systems with spacetime dependent coefficients and with multiple characteristics of variable multiplicity. First, we establish a wellposedness result in anisotropic Sobolev spaces for systems with upper triangular principal part under interesting natural conditions on the orders of lower order terms below the diagonal. Namely, the terms below the diagonal at a distance k to it must be of order \(\,k\). This setting also allows for the Jordan block structure in the system. Second, we give conditions for the Schur type triangularisation of general systems with variable coefficients for reducing them to the form with an upper triangular principal part for which the first result can be applied. We give explicit details for the appearing conditions and constructions for \(2\times 2\) and \(3\times 3\) systems, complemented by several examples.
Similar content being viewed by others
1 Introduction
The main aim of this paper is to consider the Cauchy problem for hyperbolic systems
with the usual notation \(D_t=\,\mathrm{i}\partial _t\) and \(D_x=\,\mathrm{i}\partial _x\). Here, we assume that \(A(t,x,D_x) = \big [ a_{ij}(t,x,D_x) \big ]_{i,j=1}^m\) is an \(m\times m\) matrix of pseudodifferential operators of order 1, i.e. \(a_{ij} \in C([0,T],\Psi _{1,0}^1(\mathbb R^n))\) with possibly complex valued symbols. In the first part of the paper we will also assume that
with realvalued symbols in
and
Finally, we assume that
is an \(m\times m\) matrix of pseudodifferential operators of order 0 with possibly complex valued symbols. We can take any \(n\ge 1\) and we can assume that \(m\ge 2\) since in the case \(m=1\) there are no multiplicities and thus much more is known. It is also wellknown that even if all the coefficients in A and B depend only on time, due to multiplicities, the best one can hope for is the wellposedness of the Cauchy problem (1) in suitable classes of Gevrey spaces. Thus, the main questions that we address in this paper are:

(Q1)
Under what structural conditions on the zero order part \(B(t,x,D_x)\) is the Cauchy problem (1) wellposed in \(C^\infty \) or, even better, in suitable scales of Sobolev spaces?

(Q2)
Under what conditions on the general matrix \(A(t,x,D_x)\) of first order pseudodifferential operators can we reduce it (microlocally) to another system with A satisfying the upper triangular condition (2)?
Note that this paper is part of a wider analysis of hyperbolic systems with multiplicities. Here we investigate the wellposedness of these systems. In the second part of this paper we plan to carry out the microlocal analysis of their solutions.
In the case of \(2\times 2\) systems the questions above have been analysed with the answer to (Q1) given by the following theorem:
Theorem A
([27, Theorem 7.2]) Let \(m=2\). Suppose that the pseudodifferential operator \({b}_{21}\) is of order not greater than \(\,1\). Then the Cauchy problem (1) is wellposed in \(C^\infty \). Moreover, it is wellposed in the anisotropic Sobolev space \(\begin{bmatrix} H^{s_1}(\mathbb R^n) \\ H^{s_2}(\mathbb R^n) \end{bmatrix}\) provided \(s_2s_1 \ge 1\). In that case the solution satisfies the following estimates:
for \(u_j^0 \in H_{comp}^{s+j1}(\mathbb R^n)\), \(j=1,2\), with \(c>0\) depending on s, T, and the support of the initial data.
The case of systems of general size but for coefficients depending only on t and for \(n=1\) was also considered. More precisely, in [26] the authors considered the Cauchy problem
with \(A(t) = \big [ a_{ij}(t) \big ]_{i,j=1}^m \in C([0,T])^{m \times m}\) in the form
similar to (2). They showed the following result in the absence of lower order terms and for zero Cauchy data:
Theorem B
([26, Proposition 1]) Let \(B(t) \equiv 0\) and let \(s \in \mathbb R\). Then the Cauchy problem (3) is \(C^\infty \)wellposed. Moreover, there exist \(r_1\),..., \(r_{m1} \in [0,1]\) such that for every \(f \in C(\mathbb R, (H^s(\mathbb R))^m)\) identically 0 at \(t=0\) it admits a unique solution \(u \in C(\mathbb R,({\mathcal {S}}'(\mathbb R))^m)\) satisfying
for \(j = 1, \ldots , m1\), and identically 0 at \(t=0\). In particular, if \(\lambda _j(t) \ne \lambda _k(t)\), \(t \in \mathbb R\), \(1 \le j < k \le m\), no loss of anisotropic regularity appears.
The case of (microlocally) diagonalisable systems of any order with fully variable coefficients was considered by Rozenblum [41] under the condition of transversality of the intersecting characteristics. Also allowing the variable multiplicities, this transversality condition was later removed in [32, 33] with sharp \(L^p\)estimates for solutions, with further applications to the spectral asymptotics of the corresponding elliptic systems.
Before stating our main results and collecting some necessary basic notions we give a brief overview of the state of the art for hyperbolic equations and systems. We have a complete understanding of strictly hyperbolic systems, i.e., systems without multiplicities, with \(C^\infty \)coefficients. This starts with the groundbreaking work of Lax [35] and Hörmander [28] and heavily relies on the modern theory of Fourier integral operators (FIO). Wellposedness is here obtained in the space of distributions \({\mathcal {D}}'\). There are also wellposedness results for less regular coefficients with respect to t. For instance, wellposedness with loss of derivatives has been obtained by Colombini and Lerner [9] for second order strictly hyperbolic equations with LogLipschitz coefficients with respect to t and smooth in x. It is possible to further drop the regularity in t (for instance Hölder), however, this has to be balanced by stronger regularity in x (Gevrey) and leads to more specific (Gevrey) wellposedness results (see [3, 31] and references therein). Paradifferential techniques have been recently used for this kind of strictly hyperbolic equations by Colombini et al. [6, 7].
The analysis of hyperbolic equations with multiplicities (weakly hyperbolic) has started with the seminal paper by Colombini et al. [5] in the case of coefficients depending only on time. Profound difficulties in such analysis have been exhibited by Colombini et al. [4, 8] showing that even the second order wave equation in \({\mathbb {R}}\) with smooth timedependent propagation speed (but with multiplicity) and smooth Cauchy data need not be wellposed in \({\mathcal {D}}'\). However, they turn out to be wellposed in suitable Gevrey classes or spaces of ultradistributions. In the last decades many results were obtained for weakly hyperbolic equations with tdependent coefficients ([3, 11, 16, 18,19,20, 34], to quote only very few). More recently, advances in the theory of weakly hyperbolic systems with tdependent coefficients have been obtained for systems of any size in presence of multiplicities with regular or low regular (Hölder) coefficients [16, 22, 23]. In addition, in [17] precise conditions on the lower order terms (Levi conditions) have been formulated to guarantee Gevrey and ultradistributional wellposedness. Previously very few results were known in the field for systems of a certain size (\(2\times 2\), \(3\times 3\)) [12, 13] or of a certain form (for instance without lower order terms or with principal part of a certain form) [44].
Weakly hyperbolic equations with xdependent coefficients were considered for the first time in the celebrated paper by Bronshtein [2]. As shown already in some earlier works by Ivrii, the corresponding Cauchy problem is wellposed under “almost analytic regularity”, namely, if the coefficients and initial data are in suitable Gevrey classes. Bronshtein’s result was extended to (t, x)dependent scalar equations by Ohya and Tarama [38] and to systems by Kajitani and Yuzawa [31]. The regularity assumptions are always quite strong with respect to x (Gevrey) and not below Hölder in t. See also [10, 37]. Geometrical and microlocal analytic approaches are known for equations or systems under specific assumptions on the characteristics and/or lower order terms. See [29, 30, 33, 36, 39], to quote only a few. Timedependent coefficients of low regularity (distributional) have been considered in [21].
In this paper we will be interested in the case of coefficients depending on both t and x and we will make use of the usual definitions of symbol classes. We say that a (possibly) complex valued function \(a=a(x,\xi ) \in C^{\infty }(\mathbb R^n \times \mathbb R^n)\) belongs to \(S^m_{1,0}( \mathbb R^n \times \mathbb R^n)\) if there exist constants \(C_{\alpha ,\beta } >0\) such that
The set of pseudodifferential operators associated to the symbols in \(S^m_{1,0}( \mathbb R^n \times \mathbb R^n)\) is denoted by \(\Psi ^m_{1,0}( \mathbb R^n \times \mathbb R^n)\).
If there is no question about the domain under consideration, we will abbreviate the symbol and operatorclasses by \(S^m_{1,0}\) and \(\Psi ^m_{1,0}\), respectively, or simply by \(S^m\) and \(\Psi ^m\).
We also denote by \(C([0,T], S_{1,0}^m(\mathbb R^n \times \mathbb R^n))\) the space of all symbols \(a(t,x,\xi )\in S_{1,0}^m(\mathbb R^n\times \mathbb R^n)\) which are continuous with respect to t. The set of operators associated to the symbols in \(C([0,T], S_{1,0}^m(\mathbb R^n \times \mathbb R^n))\) is denoted by \(C([0,T], \Psi _{1,0}^m(\mathbb R^n \times \mathbb R^n))\).
Again, if there is no question about the domain under consideration, we will abbreviate the symbol and operatorclasses by \(C S_{1,0}^m\) and \(C\Psi ^m_{1,0}\), respectively, or simply by \(C S^m\) and \(C\Psi ^m\).
Let us give our main result concerning the first question (Q1) for the systems with the principal part A satisfying the upper triangular condition (2). Here, \(f_k\), \(u_k\) and \(u_k^0\), for \(k=1,\ldots ,m\), stand for the components of the vectors f, u and \(u_0\), respectively.
Theorem 1
Let \(n\ge 1\), \(m\ge 2\), and let
where \(A(t,x,D_x) \in (CS^1)^{m \times m}\) is an uppertriangular matrix of pseudodifferential operators of order 1 in the form (2), and \(B(t,x,D_x) \in (CS^0)^{m \times m}\) is a matrix of pseudodifferential operators of order 0, continuous with respect to t. Hence, if
\(u^0_k\in H^{s+k1}(\mathbb R^n)\) and \(f_k\in C([0,T],H^{s+k1})\) for \(k=1,\ldots ,m\), then (4) has a unique anisotropic Sobolev solution u, i.e., \(u_k\in C([0,T], H^{s+k1})\) for \(k=1,\ldots , m\).
Remark 1
As stated earlier, we allow A and B to have complex valued symbols as long as the symbols of \(\Lambda \) in (2), i.e. the eigenvalues of \(A(t,x,\xi )\), are real valued.
The main condition of Theorem 1 for the Sobolev wellposedness is that the pseudodifferential operator \(b_{ij}\) below the diagonal (i.e. for \(i>j\)) must be of order \(ji\). In other words, the terms below the diagonal at a distance k to it must be of order \(\,k\).
In solving the Cauchy problem (4) we will make use of Fourier integral operators depending on the parameter \(t\in [0,T]\). Namely, we will work with operators of the type
where \(\varphi \) is the solution of a certain eikonal equation and the symbol a is determined via asymptotic expansion and transport equations. In Sect. 2.1 we will recall some wellknown Sobolev estimates for this type of operators.
In Sect. 2 we will prove Theorem 1 after we explain its idea in the cases of \(m=2\) and \(m=3\).
Consequently, in Sect. 3 we give an answer to the second question (Q2) above in the form of a suitable variable coefficients extension of the Schur triangularisation. For constant matrices such a procedure is well known (see e.g. [1, Theorem 5.4.1]).
Theorem C
(Schur’s triagularisation theorem) Given a (constant) \(m \times m\) matrix A with eigenvalues \(\lambda _1, \ldots , \lambda _m\) in any prescribed order, there is a unitary \(m \times m\) matrix T such that \(R = T^{1} A T\) is upper triangular with the diagonal elements \(r_{ii} = \lambda _i\). Furthermore, if the entries of A and its eigenvalues are all real, T may be chosen to be real orthogonal.
It follows that R can be written as \(D+N\), where \(D={{\mathrm{diag}}}(\lambda _1,\ldots ,\lambda _m)\) and N is a nilpotent upper triangular matrix.
If the matrix A depends on one or several parameters, namely \(A=A(t,x,\xi )\), the situation becomes less clear and it is difficult to give a complete description, in particular together with a prescribed regularity of the involved transformation matrices. The regularity of the matrix A and the desire to maintain it through the transformation puts already constrains on the matrix as, in general, the eigenvalues can only be expected to be Lipschitz continuous in the parameters even if all the entries depend smoothly on the parameters (see, e.g., [2, 40] and the references therein). In the sequel, we will present some sufficient conditions to ensure the existence of an upper triangularisation for \(A(t,x,\xi )\) which respects its regularity. For example, it will apply to the case when A is a matrix of first order symbols continuous with respect to t, i.e., \(A(t,x,\xi ) \in \big ( C S^1 \big )^{m \times m}\).
Our main result for this part of the problem is the following theorem.
Theorem 2
Let \(A(t,x,\xi ) \in (C S^1)^{m \times m}\), be a \(m \times m\)matrix with eigenvalues \(\lambda _1, \ldots , \lambda _{m} \in C S^1\), and let \(h_1 , \ldots , h_{m1} \in \big ( C S^0 \big )^m\) be the corresponding eigenvectors. Suppose that for \(e_1=\big [1,0,\ldots ,0 \big ]^T \in \mathbb R^{mi+1}\) the condition
holds for all \(i=1,\ldots , m1\), with the notation for \(h^{(i)}\) explained in (37). Then, there exists a matrixvalued symbol \(T(t,x,\xi ) \in (C S^0)^{m \times m}\), invertible for \((t,x,\xi ) \in [0,T] \times \mathbb R^n \times \{ \xi  \ge M\}\) with \(T^{1}(t,x,\xi ) \in (CS^0)^{m \times m}\), such that
for all \((t,x,\xi ) \in [0,T] \times \mathbb R^n \times \{ \xi  \ge M \}\), where
and
and N is a nilpotent matrix with entries in \(C S^1\).
Furthermore, there is an expression for the matrix symbol T which will be given in Theorem 6. Also, the assumption (5) can be relaxed, see Remark 6. In Sect. 3 we will prove this result as well as describe the procedure how to obtain the desired upper triangular form. Moreover, we work out in detail the cases of \(m=2\) and \(m=3\) clarifying this Schur triangualisation procedure and give a number of examples.
The results and techniques of this paper are a natural outgrowth of the paper [27] where the case \(m=2\) was considered and to which the results of the present paper reduce in the case of \(2\times 2\) systems. It is with great sorrow that we remember the untimely departure of our colleague and friend Todor Gramchev who was the inspiration for both [27] and the present paper.
2 Wellposedness in anisotropic Sobolev spaces
This section is devoted to proving the wellposedness of the Cauchy problem (1). For the reader’s convenience we first give a detailed proof in the cases \(m=2\) and \(m=3\). This will inspire us in proving Theorem 1. We note that the case \(m=2\) has been studied in [27] and we will briefly review its derivartion. However, first we collect a few results about Fourier integral operators that we will need in the sequel.
2.1 Auxiliary remarks
In solving the Cauchy problem (1), we will deal with solutions of certain scalar pseudodifferential equations. For each characteristic \(\lambda _j\) of A, we will be denoting by \(G^0_j\theta \) the solution to
and by \(G_j g\) the solution to
The operators \(G^0_j\) and \(G_j\) can be microlocally represented by Fourier integral operators
and
with \(\varphi _j(t,s,x,\xi )\) solving the eikonal equation
and with the notation
Here we also have the amplitudes \(A_{j,k}(t,s,x,\xi )\) of order \(\,k\), k \(\in \mathbb N\), giving \(A_j \sim \sum _{k=0}^{\infty } A_{j,k}\), and they satisfy the transport equations with initial data at \(t=s\), and we have \(a_j(t,x,\xi ) = A_j(t,0,x,\xi )\).
If \(a_j\in S^m\), i.e. if the amplitude \(a_j\) in (6) is a symbol of order m, we will write \(G^0_j\in I_{1,0}^{m}.\) However, in the above construction of propagators for hyperbolic equations, we have \(a_j\in S^0\), so that \(G^0_j \in I_{1,0}^{0}\).
By \(I^m_{1,0}\), we denote the class of Fourier integral operators with amplitudes in \(S^m_{1,0}\). For further information, the reader may consult [15, 42, 43] and the references therein.
With that, we can record the following estimate:
Lemma 1
For any \(\sigma \in {\mathbb {R}}\), for sufficiently small t, we have
This statement follows from the continuity of \(\lambda _j, \varphi _j, a_j, A_j\) with respect to t and from the \(H^\sigma \)boundedness of nondegenerate Fourier integral operators, see e.g. [15] (there are also surveys on such questions [42, 43]). It is important to note that the constant for the estimate for \(G_j\) does not depend on the initial data of the Cauchy problem; see also Remark 2.
2.2 The case \(m=2\)
To motivate the higher order cases, here we review the construction for \(2\times 2\) systems adapting it for the subsequent higher order arguments. Hence, in this subsection we follow the proof in [27]. Thus, we consider the system
where \(u_0(x) = \big [u_1^0(x),u_2^0(x) \big ]^T\), \(f(t,x) = \big [ f_1(t,x) ,f_2(t,x) \big ]^T\), and with the operators \(A(t,x,D_x)\) and \(B(t,x,D_x)\) given by
and
We suppose that all entries of \(A(t,x,D_x)\) belong to \( C \Psi _{1,0}^1\) and all entries of \(B(t,x,D_x)\) belong to \( C \Psi _{1,0}^0\). By using the operators \(G^0_j\) and \(G_j\) introduced in Sect. 2.1, we can reformulate the Eq. (7) as
where
Plugging (10) in (9), we obtain
where
Using the rules of composition of Fourier integral operators, see e.g. [15], and by Lemma 1, we get that the operator \(G_1 \circ a_{12} \circ G_2 \circ b_{21}\) in (12) acts continuously on \(H^s\) if it is of order 0. Since \(a_{12} \in C \Psi _{1,0}^1\) we therefore need to assume that \(b_{21} \in C \Psi _{1,0}^{1}\).
The operator \(G_1 \circ b_{12} \circ G_2 \circ b_{21}\) belongs to \( C I_{1,0}^{1}\) since \(b_{21} \in C \Psi _{1,0}^{1}\) and \(b_{12} \in C \Psi _{1,0}^{0}\).
We now introduce the following scale of Banach spaces \(X^s(t) := C([0,t],H^s)\), \(t \in [0,T]\), equipped with the norm
Let
It follows that (12) can be written as
By composition of Fourier integral operators and Lemma 1 we have that the 0order Fourier integral operator \(\mathcal G_1^0\) maps \( C([0,T],H^s)\) continuously into itself and for small time interval it is a contraction, in the sense that there exists \(T^*\in [0,T]\) such that
with \(C_{A,s}T^*< 1\). Banach’s fixed point theorem ensures the existence of a unique fixed point \(u_1\) for the map \(\mathcal G_1^0\). Hence, by assuming that the initial data \({\tilde{U}}^0_1\) belongs to \( C([0,T^*],H^s)\) we conclude that there exists a unique \(u_1\in C([0,T^*],H^s) \) solving (12). Note that the same argument proves that the operator \(I{\mathcal {G}}_1^0\) is invertible on a sufficiently small interval in t since \({\mathcal {G}}_1^0=I\) at \(t=0\). From formula (13) it is clear that in order to get \({\tilde{U}}^0_1\) to belong to \( C([0,T^*],H^s)\) we need to assume that \(U^0_2\in H^{s+1}\). Finally, we get \(u_2\) by substitution of \(u_1\) in (10).
Remark 2
Note that the constant \(T^*\) depends only on A and s. Thus, the argument above can be iterated by taking \(u(T^*,x)\) as new initial data. In this way one can cover an arbitrary finite interval [0, T] and obtain a solution in \( C([0,T],H^s)\times C([0,T],H^{s+1}) \).
Remark 3
Since \(a_{12}(t,x,D_x)\) is a first order operator combining (11) with (13) we easily see that in order to get Sobolev wellposedness of order s we need to take initial data \(u_1^0\) and \(u_2^0\) in \(H^s\) and \(H^{s+1}\), respectively, and right handside functions \(f_1\) and \(f_2\) in \(C([0,T],H^s)\) and \(C([0,T],H^{s+1})\), respectively.
We have therefore proved the following theorem stated for the first time in [27, Theorem 7.2].
Theorem 3
Consider the Cauchy problem (7), with the \(2 \times 2\) matrices
where A is of the form (2). Assume that \({b}_{21} \in C([0,T],\Psi ^{1}_{1,0})\), the right handside functions \(f_1\) and \(f_2\) belong to \(C([0,T],H^s)\) and \(C([0,T],H^{s+1})\), respectively, and the initial data \(u_1^0\) and \(u_2^0\) belong to \(H^s\) and \(H^{s+1}\), respectively. Then, (7) has a unique solution in \(C([0,T],H^s)\times C([0,T],H^{s+1}) \). More generally it is wellposed in the anisotropic Sobolev space \(C([0,T],H^{s_1})\times C([0,T],H^{s_2})\), provided \(s_2s_1 = 1\).
Remark 4
It was also shown in [27] that the solution u satisfies the estimate
for \(u_j^0 \in H^{s+j1}_{comp}\), \(j=1,2\) with \(c>0\) depending on s, T, and the support of the initial data. Since wellposedness is obtained for any Sobolev order s it follows that the Cauchy problem (7) is also \(C^\infty \) wellposed.
2.3 The case \(m=3\)
In this section we will extend the construction to the case of \(3\times 3\) systems. In the argument there is an additional substitution and a fixed point argument step compared to the case \(m=2\). The advantage of giving the case of \(m=3\) here is that we can make the argument more concrete compared to the more abstract construction in the general case that will be given in the following section. Thus, let
where \(u_0(x) = \big [u_1^0(x),u_2^0(x),u_3^0(x) \big ]^T\), \(f(t,x) = \big [ f_1(t,x) ,f_2(t,x),f_3(t,x) \big ]^T\), \(A(t,x,D_x)\) is defined by the matrix
and
We assume that all the entries of \(A(t,x,D_x)\) and \(B(t,x,\xi )\) belong to \( C \Psi _{1,0}^1\) and \( C \Psi _{1,0}^0\), respectively. Using the notations introduced earlier, we can write
where
Now, we plug \(u_3\) into \(u_1\) and \(u_2\) in formula (16) and, thus, obtain
where
We introduce the operator \({\mathcal {G}}^0_2\) by setting
and in analogy with the case \(m=2\) we define
By Lemma 1 we have that for any s, \({\mathcal {G}}^0_2\) has the operator norm in \(H^s\) strictly less than 1 on a sufficiently small interval \([0,T^*]\), so \(L_2\) is a perturbation of the identity operator. By the Neumann series it follows that \(L_2\) is invertible as a continuous operator from \( C([0,T^*],H^s)\) to \( C([0,T^*],H^s)\). Noting now that
we have that
Since this expression depends only on \(u_1\), we can plug it into the formula for \(u_1\) in (18) and obtain
By collecting now the terms with order \(\le 0\) we can simplify the previous formula as follows:
Looking at the terms
and keeping in mind that in order to get the right Sobolev regularity we need to have operators of order 0, we deduce that \(b_{21}\) and \(b_{32}\) must have order \(\,1\) while \(b_{31}\) must have order \(\,2\). Considering now the initial data
by using (17) we obtain
Combining these formulas with an analysis of the term \(G_1(a_{12}L_2^{1}{\widetilde{U}}_2^0)\) we deduce that \({\widetilde{U}}_2^0\) must belong to \(H^{s+1}\). This implies \(U_2^0\in H^{s+1}\) and \(U_3^0\in H^{s+2}\). Concluding, similarly to the case \(m=2\), that is by the Banach fixed point theorem argument on \(u_1\) and substitution in \(u_2\) and \(u_3\), we get anisotropic Sobolev wellposedness by assuming \(u^0_1\) and \(f_1\) in \(H^s\), \(u^0_2\) and \(f_2\) in \(H^{s+1}\), and \(u^0_3\) and \(f_3\) in \(H^{s+2}\). This wellposedness is obtained by means of one invertible operator \(L_2\), and in analogy with case \(m=2\) the wellposedness can be extended to the whole interval [0, T] by an iterated argument. This proves Theorem 1 in the case \(m=3\).
2.4 The general case
We are now ready to prove the main result of our paper in the general case of an uppertriangular \(m\times m\) matrix, i.e, a matrix A of the type
For the convenience of the reader we recall here the statement of Theorem 1.
Theorem 1
Let
where \(A(t,x,D_x)\) is an uppertriangular matrix of pseudodifferential operators of order 1 in the form (2), and \(B(t,x,D_x)\) is a matrix of pseudodifferential operators of order 0, continuous with respect to t. Hence, if the lower order terms \(b_{ij}\) belong to \(C([0,T], \Psi ^{ji})\) for \(i> j\), \(u^0_k\in H^{s+k1}\) and \(f_k\in C([0,T],H^{s+k1})\) for \(k=1,\ldots ,m\) then (20) has a unique anisotropic Sobolev solution u, i.e., \(u_k\in C([0,T], H^{s+k1})\) for \(k=1,\ldots , m\).
Proof
Making use of the notations introduced earlier we can write the components of the solution u as
where
and \(G_i, G_i^0\) are Fourier integral operator of order 0 for \(i=1,\ldots , m\). Note that from the fact that \(b_{ij}\) is a symbol of order 0 for every i, j and, in particular, of order \(ji\) for \(j<i\) we obtain that the operator \(G_i(b_{ij})\) is of order \(ji\) for \(j<i\), while \(G_i(a_{ij}+b_{ij})\) is, in general, of order 1. To simplify the argument we introduce the notations \(G^{ji}_{i,j}\) and \(G^1_{i,j}\) for the operators \(G_i(b_{ij})\) and \(G_i(a_{ij}+b_{ij})\), respectively. Here the superscript stands to remind us of the order of the operator. Hence,
for \(i=1,\ldots ,m\). By begin by substituting
into
We get
Note that it is enough to assume \(U^0_m\in H^{s+1}\) and \(U^0_{m1}\in H^s\) to obtain \(U_{m1}^0+G^1_{m1,m}U_m^0\in H^{s}\). Since all the operators above are of order \(\le 0\) we conclude that the operator
is invertible on a sufficiently small interval [0, T] and, therefore,
yields
with \({\widetilde{U}}^0_{m1}\) and \({\widetilde{G}}^{jm+1}_{m1}\) defined by the righthand side of (22). We now substitute \(u_{m}\) and \(u_{m1}\) into \(u_{m2}\) making use of (23). We obtain
We set
The operators \(G^1_{m2,m1}L^{1}_{m1}\) and \(G^1_{m2,m}\) in (25) are of order 1. Keeping in mind that we already assumed \(U^0_{m}\in H^{s+1}\) and \(U^0_{m1}\in H^s\), in order to obtain Sobolev order s the initial data \(U^0_m\), \(U^{0}_{m1}\) and \(U^0_{m2}\) must belong to \(H^{s+2}\), \(H^{s+1}\) and \(H^s\), respectively. Thus,
where \({\mathcal {G}}^0_{m2}\) is a zero order operator defined by
and the last summand in (26) is obtained by collecting all the operators acting on \(u_j\) with \(j<m2\) in (24). Since the norm of \({\mathcal {G}}^0_{m2}\) can be taken strictly less than one in a sufficiently small interval [0, T] we have that the operator
is invertible and, therefore,
Note that \( {\widetilde{U}}^0_{m2} \in H^s\) if \(U^0_{m}\in H^{s+2}\), \(U^0_{m1}\in H^{s+1}\) and \(U^0_{m2}\in H^s\). By iterating the same procedure we deduce that
where \({\widetilde{U}}^0_k\) depends on \(U^0_k\), \(U^0_j\) and \({\widetilde{U}}^0_j\) with \(j>k\) and \({\mathcal {G}}^0_k\) is a zero order operator defined by using invertible operators \(L_{m1}\), \(L_{m2}\),..., \(L_{k}\). In addition, we obtain \({\widetilde{U}}^0_k\in H^s\) since \(U^0_m\in H^{s+mk}\), \(U^0_{m1}\in H^{s+mk1},\ldots , U^0_k\in H^s\). It follows that for \(k=2\) we have
where the operator \({\mathcal {G}}^0_2\) is of zero operator and defined by invertible operators \(L_{m1}, L_{m2}, \ldots ,L_{2}\), \({\widetilde{G}}^{1}_{2}\) is of order \(\,1\), and \({\widetilde{U}}^0_2\in H^s\) since \(U^0_m \in H^{s+m2}\), \(U^0_{m1}\in H^{s+m3},\ldots , U^0_2\in H^s\). Hence, by inverting the operator \(L_2=I{\mathcal {G}}^0_2\) on a sufficiently small interval [0, T] we have
Now by substitution of \(u_2, u_3, \ldots , u_m\) in the equation of \(u_1\) we arrive at the formula (28) with \(k=1\), i.e.,
where \({\widetilde{U}}^0_{1}\in H^s\) since \(U^0_m\in H^{s+m1}\), \(U^0_{m1}\in H^{s+m2},\ldots , U^0_2\in H^{s+1}, U^0_1\in H^s\). Concluding, by the Banach fix point argument we prove that there exists a unique \(u_1\in C([0,T], H^s)\) solving the equation above with the given initial conditions. By substitution in the equations for \(u_2, \ldots , u_{m1}, u_m\) we arrive at the desired Sobolev wellposedness with \(u_k\in C([0,T], H^{s+k1})\) for \(k=2,\ldots , m\). Note that, since the sufficiently small interval [0, T] where we get wellposedness does not depend on the initial data, by a standard iteration argument we can achieve wellposedness on any bounded interval [0, T] as stated in the theorem. \(\square \)
3 Schur decomposition of \(m \times m\) matrices
In this section we investigate how to reduce an \(m\times m\) matrix to the upper triangular form. We recall that such decomposition is wellknown for constant matrices and goes under the name of Schur’s triangularisation, with its statement given in Theorem C.
One of the difficulties when dealing with variable multiplicities is the loss of regularity in the parameters at the points of multiplicities. In the following, we will assume that A is a matrix of (possibly) complex valued first order symbols, continuous with respect to t, i.e., \(A(t,x,\xi ) \in \big ( C S^1 \big )^{m \times m}\).
We will now develop a parameter dependent extension of the Schur triangularisation procedure and we will describe it step by step. Then we will give an example for it for the systems of low sizes, namely, for \(m=2\) and \(m=3\).
In the case of \(m=2\) the construction below was introduced in [27] and now we give its general version for systems of any size.
Normal forms of matrices depending on several parameters have a long history and are notoriously involved; for some remarks and related works, we refer the reader to [14, 24, 25, 45].
3.1 First step or Schur step
The first step in our triangularisation follows the construction in the constant case except that we will not get a unitary transformation matrix. For this reason we talk of a Schur step. Throughout this paper \(e_i\) denotes the ith vector of the standard basis of \(\mathbb R^n\) with an appropriate dimension n.
Proposition 1
(Schur step) Let the \(m \times m\) matrix valued symbol \(A(t,x,\xi ) \in (C S^1)^{m \times m}\), have a real eigenvalue \(\lambda \in C S^1\) and a corresponding eigenvector \(h \in \big ( C S^1 \big )^{m}\) such that there exists \(j \in \{ 1, \ldots , m \}\) with
for a sufficiently large \(M>0\). Then there exist an \(m \times m\) matrix valued symbol \(T(t,x,\xi ) (C S^0)^{m \times m}\), invertible for \((t,x,\xi ) \in [0,T] \times {\mathbb {R}}^n \times \{ \xi  \ge M \}\) with \(T^{1} \in (C S^0)^{m \times m}\), and an \((m1) \times (m1)\) matrix valued symbol \(E(t,x,\xi ) \in (C S^1)^{m \times m}\), such that
for all \((t,x,\xi ) \in [0,T] \times {\mathbb {R}}^n \times \{ \xi  \ge M \}\).
Proof
First let us note that we can assume that \(j=1\) in (29). If that is not the case, we can exchange the rows 1 and j as well as columns 1 and j to move the jth component of the eigenvector to the first component.
We define the rescaled eigenvector \(\mu \) componentwise by
Now we set
Since \(\mu _1\equiv 1\) it follows that
where \(I_{m1}\) is the \((m1)\times (m1)\) identity matrix. By direct computations we get
where we used that
and denoted the ith column of A by \(A_{(i)}\). The equations in (30) are given by the eigenvalue equation \(A \mu = \lambda \mu \). Further, from \(\mu _{1} \equiv 1\) we obtain
which concludes the proof. Note that by construction the matrix E has entries in \(C S^1\) which depend on A. In particular its eigenvalues are the eigenvalues of A excluding \(\lambda \) (counted as many times as they occur). \(\square \)
Applying Proposition 1 repeatedly for \(m2\) times to E, we obtain a full Schur transformation of A, that is a full reduction to an upper triangular form. In the next subsection we describe this iteration in detail. This triangularisation procedure is summarised in Theorem 6 where sufficient conditions on the eigenvectors of A are given.
3.2 The triangularisation procedure
The reduction to an upper triangular form or the Schur transformation of A is possible under certain conditions on its eigenvectors. More precisely, let
be \(m1\) eigenvectors of \(A(t,x,\xi ) = [a_{ij}(t,x,\xi )]_{i,j=1}^m\), \(a_{ij} \in C S^1\), corresponding to the eigenvalues \(\lambda _1(t,x,\xi )\), \(\ldots \), \(\lambda _{m1}(t,x,\xi ) \in C S^1\). To formulate the sufficient conditions for the existence of such Schur transformation, we introduce a set of auxiliary vectors \(h^{(i)}\), \(i=1,\ldots ,m1\), which depend only on \(h_i\) and the previous vectors \(h^{(j)} \in C S^0\), \(j=1,\ldots ,i1\). When \(i=1\) we set \(h^{(1)}=h_1\).
As in Proposition 1 we begin by assuming
for \((t,x,\xi ) \in [0,T] \times {\mathbb {R}}^n \times \{ \xi  \ge M \}\).
Remark 5
As noted in the proof of Proposition 1, we could have that
for another arbitrary \(j \in \{ 1, \ldots ,m \}\). Then, we could transform the matrix \(A(t,x,\xi )\) by a constant permutation matrix P such that \(P^{1}h^{(1)}\) is eigenvector of \(P^{1}AP\) corresponding to \(\lambda _1\) which satisfies \(\left\langle P^{1} h^{(1)}(t,x,\xi )  e_1 \right\rangle \ne 0\). For this reason we state (32) with \(h^{(1)}\) and \(e_1\).
 Step 1 :

By Proposition 1 there exists a matrix \(T_1\) such that
$$\begin{aligned} T^{1}_1 A T_1 = \begin{bmatrix} \lambda _1&\quad a_{12}&\quad \cdots&\quad a_{1m} \\ 0&\quad&\quad&\quad \\ \vdots&\quad&\quad E_{m1}&\quad \\ 0&\quad&\quad&\quad \end{bmatrix}. \end{aligned}$$The matrix \(T_1\) is given by
$$\begin{aligned} T_1 = \begin{bmatrix} \omega _1&\quad e_2&\quad \ldots&\quad e_m \end{bmatrix}, \quad \omega _1 = \begin{bmatrix} \omega _{11}&\quad \ldots&\quad \omega _{1m} \end{bmatrix}^T \end{aligned}$$with
$$\begin{aligned} \omega _{1j} = \frac{\left\langle h^{(1)}(t,x,\xi )  e_j \right\rangle }{\left\langle h^{(1)}(t,x,\xi )  e_1 \right\rangle }. \end{aligned}$$In the sequel we make use of the projector \(\Pi _k : \mathbb R^m \rightarrow \mathbb R^{mk}\), \(0 \le k \le m1\), defined by
$$\begin{aligned} \Pi _k \begin{bmatrix} x_1 \\ \vdots \\ x_m \end{bmatrix} = \begin{bmatrix} x_{k+1} \\ \vdots \\ x_m \end{bmatrix}. \end{aligned}$$Note that \(\Pi _0\) is the identity map \(I_m:\mathbb R^m\rightarrow \mathbb R^m\).
 Step 2 :

Since \(h_2\) is an eigenvector of A with eigenvalue \(\lambda _2\) we get that \(T_1^{1}h_2\) is an eigenvector of \(T^{1}_1A T_1\) with eigenvalue \(\lambda _2\) as well. By the structure of \(T^{1}_1A T_1\) we easily see that \(h^{(2)} := \Pi _1 T_1^{1} h_2\) is an eigenvector of \(E_{m1}\), corresponding to \(\lambda _2\). Arguing as in Remark 5 we assume that
$$\begin{aligned} \left\langle \Pi _1 T_1^{1} h_2  e_1 \right\rangle \ne 0 \quad \forall (t,x,\xi ) \in [0,T] \times {\mathbb {R}}^n \times \{ \xi  \ge M \}, \end{aligned}$$(33)to be able to apply Proposition 1 to \(E_{m1}\). We get that there exists an \((m1) \times (m1)\) matrix \({\tilde{T}}_2\) such that \({\tilde{T}}_2^{1}E_{m1}{\tilde{T}}_2\) is of form
$$\begin{aligned} \begin{bmatrix} \lambda _2&\quad *&\quad \ldots&\quad *\\ 0&\quad&\quad&\quad \\ \vdots&\quad&\quad E_{m2}&\quad \\ 0&\quad&\quad&\quad \end{bmatrix}, \end{aligned}$$where in the first row the first row of \(E_{m1}\) appears. Thus, setting
$$\begin{aligned} T_2 = \begin{bmatrix} 1&\quad 0&\quad \ldots&\quad 0 \\ 0&\quad&\quad&\quad \\ \vdots&\quad&\quad {\tilde{T}}_2&\quad \\ 0&\quad&\quad&\quad \end{bmatrix}, \end{aligned}$$we obtain
$$\begin{aligned} T_2^{1}T_1^{1} A T_1 T_2 = \begin{bmatrix} \lambda _1&\quad *&\quad *&\quad \ldots&\quad *\\ 0&\quad \lambda _2&\quad *&\quad \ldots&\quad *\\ 0&\quad 0&\quad&\quad&\quad \\ \vdots&\quad \vdots&\quad&\quad E_{m2}&\quad \\ 0&\quad 0&\quad&\quad&\quad \end{bmatrix}. \end{aligned}$$(34)Note that in (34) we write explicitly only the entries most relevant to our triangularisation. To compute the matrix \({\tilde{T}}_2\), we set
$$\begin{aligned} \omega _2 = \begin{bmatrix} \omega _{22}&\ldots&\omega _{2m} \end{bmatrix}^T, \end{aligned}$$where
$$\begin{aligned} \omega _{2j}(t,x,\xi ) := \frac{\left\langle h^{(2)}(t,x,\xi )  e_j \right\rangle }{\left\langle h^{(2)}(t,x,\xi )  e_1 \right\rangle } , \quad j=2,\ldots ,m, \end{aligned}$$and then
$$\begin{aligned} {\tilde{T}}_2 = \begin{bmatrix} \omega _2&e_2&\ldots&e_{m1} \end{bmatrix}. \end{aligned}$$It is clear that \(T_2\) has the same structure as \(T_1\), i.e., it is defined via a rescaled eigenvector as the first column and an identity matrix (\(I_{m1}\) for \(T_1\) and \(I_{m2}\) for \(T_2\)).
 Step K :

By iterating the method \(k1\) times we can find \(k1\) matrices \(T_1, T_2, \ldots , T_{k1}\) of size \(m\times m\) such that
$$\begin{aligned}&T_{k1}^{1} \cdot \ldots \cdot T_1^{1} A T_1 \cdot \ldots \cdot T_{k1} = \\&\qquad \begin{bmatrix} \lambda _1&\quad *&\quad *&\quad \ldots&\quad \ldots&\quad *\\ 0&\quad \ddots&\quad *&\quad \ldots&\quad \ldots&\quad *\\ 0&\quad 0&\quad \lambda _{k1}&\quad *&\quad \ldots&\quad *\\ 0&\quad 0&\quad 0&\quad&\quad&\quad \\ \vdots&\quad \vdots&\quad \vdots&\quad&\quad E_{m{k+1}}&\quad \\ 0&\quad 0&\quad 0&\quad&\quad \end{bmatrix}, \end{aligned}$$where \(E_{mk+1}\) is a \((mk+1) \times (mk+1)\) matrix and the equality is true on \([0,T]\times {\mathbb {R}}^n\times \{\xi \ge M\}\). Since \(h_k\) is an eigenvector of A corresponding to \(\lambda _k\), the vector
$$\begin{aligned} T_{k1}^{1} T_{k2}^{1} \cdot \cdots \cdot T_1^{1} h_k \end{aligned}$$is an eigenvector of
$$\begin{aligned} T_{k1}^{1} T_{k2}^{1} \cdot \cdots \cdot T_1^{1} A T_1 T_2 \cdot \cdots \cdot T_{k1} \end{aligned}$$and
$$\begin{aligned} h^{(k)} := \Pi _{k1} T_{k1}^{1} T_{k2}^{1} \cdot \cdots \cdot T_1^{1} h_k \in \big ( C S^0 \big )^{mk+1} \end{aligned}$$an eigenvector of \(E_{mk+1}\) corresponding to \(\lambda _k\). Thus, to satisfy the assumptions of Proposition 1 and keeping in mind Remark 5, we require that
$$\begin{aligned} \left\langle h^{(k)}(t,x,\xi )  e_1 \right\rangle \ne 0 \quad \forall (t,x,\xi ) \in [0,T] \times {\mathbb {R}}^n \times \{ \xi  \ge M \}. \end{aligned}$$(35)It follows that there exists an \((mk+1) \times (mk+1)\) transformation matrix \({\tilde{T}}_k\) such that \({\tilde{T}}^{1}_{k} \ldots {\tilde{T}}_1^{1}A{\tilde{T}}_{1} \ldots {\tilde{T}}_k\) is of the form
$$\begin{aligned} \begin{bmatrix} \lambda _k&\quad *&\quad \ldots&\quad *\\ 0&\quad&\quad&\quad \\ \vdots&\quad&\quad E_{mk}&\quad \\ 0&\quad&\quad&\quad \end{bmatrix}. \end{aligned}$$and set
$$\begin{aligned} T_k = \begin{bmatrix} I_{k1}&\mathbf {0} \\ \mathbf {0}&{\tilde{T}}_k \end{bmatrix}. \end{aligned}$$The matrix \({\tilde{T}}_k\) is defined by
$$\begin{aligned} {\tilde{T}}_k = \begin{bmatrix} \omega _k&\quad e_2&\quad \ldots&\quad e_{mk+1} \end{bmatrix}, \quad \omega _k = \begin{bmatrix} \omega _{kk}&\quad \ldots&\quad \omega _{km} \end{bmatrix}^T, \end{aligned}$$where
$$\begin{aligned} \omega _{kj} = \frac{\left\langle h^{(k)}(t,x,\xi )  e_j \right\rangle }{\left\langle h^{(k)}(t,x,\xi )  e_1 \right\rangle }, \quad j = k, \ldots , m. \end{aligned}$$  Step m1 :

This is the last step as \(E_2\) is a \(2 \times 2\) matrix. We have that
$$\begin{aligned} h^{(m1)} = \Pi _{m2} T_{m2}^{1} \cdot \cdots \cdot T_{1}^{1} h_{m1} \in \big ( C S^0 \big )^{2} \end{aligned}$$is an eigenvector of \(E_2\) corresponding to \(\lambda _{m1}\) and that \({\tilde{T}}_{m1}\) exists as before if
$$\begin{aligned} \left\langle h^{(m1)}(t,x,\xi )  e_1 \right\rangle \ne 0 \quad \in \forall (t,x,\xi ) \in [0,T] \times {\mathbb {R}}^n \times \{ \xi  \ge M \}. \end{aligned}$$(36)The matrix \({\tilde{T}}_{m1}\) is given by
$$\begin{aligned} {\tilde{T}}_{m1} = \begin{bmatrix} \omega _{m1}&e_{2} \end{bmatrix} = \begin{bmatrix} \omega _{m1,m1}&\quad 0 \\ \omega _{m1,m}&\quad 1 \end{bmatrix}, \end{aligned}$$where
$$\begin{aligned} \omega _{m1,j} = \frac{\left\langle h^{(m1)}(t,x,\xi )  e_j \right\rangle }{\left\langle h^{(m1)}(t,x,\xi )  e_1 \right\rangle } , \quad j =m1,m, \end{aligned}$$and then
$$\begin{aligned} T_{m1} = \begin{bmatrix} I_{m2}&\quad \mathbf 0 \\ \mathbf 0&\quad {\tilde{T}}_{m1} \end{bmatrix}. \end{aligned}$$
We are now ready to state Theorem 6 which summarises the triangularisation procedure explained above. For the convenience of the reader we recall the notations introduced so far:

\(h_1, \ldots , h_{m1}\) are the eigenvectors of the matrix A corresponding to the eigenvalues \(\lambda _1, \ldots , \lambda _{m1}\).

\(h^{(1)}=h_1\) and
$$\begin{aligned} h^{(i)} = \Pi _{i1}T_{i1}^{1} T_{i2}^{1} \cdot \,\cdots \, \cdot T_1^{1} h_i \in \big ( C S^0\big )^{mk+1} , \end{aligned}$$(37)for \(i=2, \ldots ,m1\).

the matrices \(T_k\) are inductively defined as follows: \(T_0 = I_{m}\) and
$$\begin{aligned} T_k = \begin{bmatrix} I_{k1}&\quad \mathbf 0 \\ \mathbf 0&\quad {\tilde{T}}_k \end{bmatrix}, \quad {\tilde{T}}_k = \begin{bmatrix} \omega _{k}&\quad e_{2}&\quad \ldots&\quad e_{mk} \end{bmatrix}, \quad e_i \in \mathbb R^{mk}, \end{aligned}$$where
$$\begin{aligned} \omega _{kj} = \frac{\left\langle h^{(k)}(t,x,\xi )  e_j \right\rangle }{\left\langle h^{(k)}(t,x,\xi )  e_1 \right\rangle }, \quad j = k,\ldots ,m. \end{aligned}$$
Finally, we note that \(h^{(k)}\) depends only on \(T_{k1}\), \(\ldots \), \(T_1\) and, thus, only on the eigenvectors \(h^{(k1)}\), \(\ldots \), \(h^{(1)}\).
Summarising, we can formulate a more precise version of Theorem 2.
Theorem 6
(Schur Decomposition) Let \(A(t,x,\xi ) \in (C S^1)^{m \times m}\) be a matrix with eigenvalues \(\lambda _1, \ldots ,\lambda _{m} \in C S^1\), and let \(h_1 , \ldots , h_{m1} \in \big ( C S^0 \big )^m\) be the corresponding eigenvectors. Suppose that for \(e_1 \in \mathbb R^{mi+1}\) the condition
holds for all \(i=1,\ldots , m1\), with the notation explained above. Then, there exists a matrixvalued symbol \(T(t,x,\xi ) \in (C S^0)^{m \times m}\), invertible for \((t,x,\xi ) \in [0,T] \times \mathbb R^n \times \{ \xi  \ge M\}\), \(T^{1}(t,x,\xi ) \in (CS^0)^{m\times m}\), such that
for all \((t,x,\xi ) \in [0,T] \times \mathbb R^n \times \{ \xi  \ge M \}\), where
and
and N is a nilpotent matrix with entries in \(C S^1\). Furthermore, the matrix symbol T is given by
with the notation explained above.
Remark 6
Taking into account Remark 5, let us stress that condition (38) is not restrictive as it can be replaced by the following: suppose that there exist \(m1\) numbers \(j_i \in \{ 1,\ldots , mi+1 \}\), \(i=1, \ldots m1\), such that for all \(i=1, \ldots , m1\)
holds.
Remark 7
If \(A(t,x,\xi )\) has complex symbols (as allowed in Theorem 1, see also Remark 1) and real eigenvalues, the eigenvalues of the Schur transformed system clearly remain real. The upper triangular entries may still be complex valued symbols.
Remark 8
Theorem 6 is quite general in the sense that the functions \(a_{ij}\) could be complexvalued. In this paper, we are concerned with hyperbolic matrices, i.e. we assume that the eigenvalues \(\lambda _1, \ldots , \lambda _m\) are real. We stress that the Schur transform does not change the hyperbolicity of the matrix as the eigenvalues of \(T^{1} A T\) are also \(\lambda _1, \ldots , \lambda _m\).
Remark 9
For our applications in this and future work it is important that the transform T in Theorem 6 keeps the regularity of the original matrix A, i.e. that the elements of the Schur transform \(T^{1} A T\) are in the same class as the elements of A. Here, we stated everything with \(C S^1\) and \(C S^0\) as that is the regularity considered in this paper. Note that one could replace C with \(C^k\) or \(C^\infty \) and find a matrix T such that the transformed matrix \(T^{1}AT\) inherits the same regularity with respect to t. In addition, one could also drop the regularity in t to \(L^\infty \) and the triangularisation procedure would still work preserving the boundedness in t through every step.
For the sake of simplicity and the reader’s convenience, in the next subsections we analyse Theorem 6 in the special cases of \(m=2\) and \(m=3\).
3.3 The case \(m=2\)
We now formulate Theorem 6 in the special case \(m=2\). In this way we recover the formulation given in [27].
Theorem 7
([27, Theorem 7.1]) Suppose that \(A(t,x,\xi ) \in (CS^1)^{2\times 2}\) admits eigenvalues \(\lambda _j(t,x,\xi ) \in C S^1\), \(j=1,2\), and an eigenvector \(h(t,x,\xi ) \in (C S^0)^2\) satisfying
for \(j=1\) or \(j=2\). Then, we can find a \(2 \times 2\) matrix valued symbol \(T(t,x,\xi ) \in (C S^0)^{2\times 2}\), invertible for \(\{\xi  \ge M \}\) with \(T^{1}(t,x,\xi ) \in (CS^0)^{2\times 2}\), such that
for all \((t,x,\xi ) \in [0,T] \times \mathbb R^n \times \{ \xi  \ge M \}\).
Proof
For \(2\times 2\) matrices the triangularisation procedure described in the previous subsection can stop at Step 1. By Remark 5, we may assume that (40) holds for the eigenvector h corresponding to \(\lambda _1\) and for \(j=1\). We set \(h=h_1\) and \(h^{(1)}= h_1\). The vector
belongs to \(C S^0\) and is an eigenvector of A associated to \(\lambda _1\). We then set
With that, we obtain
and finally, with
we obtain
By construction, we have
and \(\omega _1=1\). This yields \(a_{11}\omega _{11} + a_{12}\omega _{12} = \lambda _1 \omega _{11} = \lambda _1\) and
Using \(a_{11}+a_{22} = \lambda _1 + \lambda _2\), we obtain
Thus, we get that
for \((t,x,\xi ) \in [0,T] \times \mathbb R^n \times \{ \xi  \ge M \}\). This concludes the proof. \(\square \)
3.3.1 Example

(i)
By direct computations we can easily see that if \(h_1=[h_{11}\,\, h_{12}]^T=e_1\) then the matrix A is automatically in the upper triangular form. Indeed,
$$\begin{aligned} a_{21}h_{11}+a_{22}h_{12}=\lambda _1 h_{12} \end{aligned}$$implies \(a_{21}=0\). A typical example (already discussed in [27]) is the Jordan block matrix
$$\begin{aligned} A=\begin{bmatrix} 0&\quad 1\\ 0&\quad 0 \end{bmatrix}, \end{aligned}$$where \(\lambda _1=0\) is an eigenvalue with eigenvector \(h_1=e_1\).

(ii)
Condition (40) is trivially fulfilled when \(\mathrm{det A}\equiv 0\) and A is of the form
$$\begin{aligned} \begin{bmatrix} a&\quad a\\ a&\quad a \end{bmatrix}, \end{aligned}$$for \(a=a(t,x,\xi )\). Indeed, also in this case one can take 0 as an eigenvalue with eigenvector \(h_1= [1 \,\, 1]^T\).
3.4 The case \(m=3\)
With the notation introduced in Sect. 3.2, we assume that the \(3 \times 3\) matrix \(A(t,x,\xi ) \in (CS^1)^{3 \times 3}\) admits three eigenvalues \(\lambda _i(t,x,\xi ) \in C S^1\), \(i=1,2,3\), and two corresponding eigenvectors \(h_i(t,x,\xi ) \in \big ( C S^0)^3\), \(i=1,2\). Then, we set \(h^{(1)} := h_1\) and, as in Remark 6 we suppose that there is a \(j_1 \in \{1,2,3\}\) with
Thus, we can set
Now, we rearrange the matrix A such that the first component of \(\omega _1\) becomes identically equal to 1. Then, with \(j_2,j_3 \in \{ 1,2,3 \} {\setminus } \{ j_1 \}\), we can write
which leads to
We then get
and the condition (38) that there exists \(j \in \{ 1,2 \}\) such that
translates to: either
or
holds. Thus, assuming that (41) holds, the matrix \({\tilde{T}}_2\) is given by
and the matrix \(T_2\) by
Thus, we obtain
If we have (42) instead of (41), then we would need a permutation matrix
in (43), i.e.
and
See also Remark 6.
Thus, we can state
Theorem 8
Suppose that \(A(t,x,\xi ) \in (CS^1)^{3 \times 3}\) admits three eigenvalues \(\lambda _i \in C S^1\), \(i=1,2,3\), and two corresponding eigenvectors \(h_i(t,x,\xi ) \in \big ( C S^1 \big )^3\), \(i=1,2\). Suppose that there exists a \(j_1 \in \{ 1, 2, 3\}\) such that
Further suppose that there exists \(j_2 \in \{ 1,2,3\} {\setminus } \{ j_1 \}\) such that
Then, there exists a matrixvalued symbol \(T(t,x,\xi ) \in (C S^0)^{3\times 3}\), invertible for all \((t,x,\xi ) \in [0,T] \times \mathbb R^n \times \{ \xi  \ge M \}\) with \(T^{1}(t,x,\xi ) \in (CS^0)^{3 \times 3}\), such that
holds for all \((t,x,\xi ) \in [0,T] \times \mathbb R^n \times \{ \xi  \ge M \},\) where \(\Lambda (t,x,\xi ) = {{\mathrm{diag}}}(\lambda _1, \lambda _2, \lambda _3)\) and
We end this subsection by discussing some examples of \(3\times 3\) matrices fulfilling the assumptions above on their eigenvalues.
3.4.1 Examples

(i)
If the matrix A has eigenvectors
$$\begin{aligned} h_1= \begin{bmatrix} 1\\ 0\\ 1 \end{bmatrix} \quad \text {and} \quad h_2= \begin{bmatrix} 1\\ 1\\ 0 \end{bmatrix} \end{aligned}$$then conditions (44) and (45) are easily fulfilled with \(j_1=1\) and \(j_2=2\). Indeed, \(h_{11}=1\) and
$$\begin{aligned} h_{22}h_{11}h_{12}h_{21}=h_{22}h_{11}=1. \end{aligned}$$More in general to satisfy (44) and (45) it would be enough to have two eigenvectors
$$\begin{aligned} h_1= \begin{bmatrix} h_{11}\\ h_{12}\\ h_{13} \end{bmatrix} \quad \text {and} \quad h_2= \begin{bmatrix} h_{21}\\ h_{22}\\ h_{23} \end{bmatrix} \end{aligned}$$with \(h_{11}\ne 0\), \(h_{22}\ne 0\) and \(h_{12}=0\).

(ii)
A matrix with eigenvectors
$$\begin{aligned} h_1= \begin{bmatrix} 1\\ 0\\ 1 \end{bmatrix} \quad \text {and} \quad h_2= \begin{bmatrix} 1\\ 1\\ 0 \end{bmatrix} \end{aligned}$$has a special form. Indeed, for \(\lambda _1\) and \(\lambda _2\) eigenvalues corresponding to \(h_1\) and \(h_2\), respectively, by using the eigenvector equations we obtain
$$\begin{aligned} \begin{aligned} a_{13}&=\lambda _1a_{11},\\ a_{23}&=a_{21},\\ a_{33}&=\lambda _1a_{31}, \end{aligned} \end{aligned}$$and
$$\begin{aligned} \begin{aligned} a_{12}&=\lambda _2a_{11},\\ a_{22}&=\lambda _2a_{21},\\ a_{32}&=a_{31}. \end{aligned} \end{aligned}$$Hence
$$\begin{aligned} A=\begin{bmatrix} a_{11}&\quad \lambda _2a_{11}&\quad \lambda _1a_{11}\\ a_{21}&\quad \lambda _2a_{21}&\quad a_{21}\\ a_{31}&\quad a_{31}&\quad \lambda _1a_{31} \end{bmatrix}. \end{aligned}$$
References
Bernstein, D.S.: Matrix Mathematics—Theory, Facts, and Formulas, 2nd edn. Princeton University Press, Princeton (2009)
Bronshtein, M.D.: Smoothness of roots of polynomials depending on parameters. Sibirsk. Mat. Zh., 20(3), 493–501, (1979). Sib. Math. J. 20, 347–352 (1980). (English transl)
Colombini, F., Kinoshita, T.: On the Gevrey well posedness of the Cauchy problem for weakly hyperbolic equations of higher order. J. Differ. Equ. 186, 394–419 (2002). https://doi.org/10.1016/S00220396(02)000098
Colombini, F., Spagnolo, S.: An example of a weakly hyperbolic Cauchy problem not well posed in \(C^\infty \). Acta Math. 148, 243–253 (1982). https://doi.org/10.1007/BF02392730
Colombini, F., De Giorgi, E., Spagnolo, S.: Sur les équations hyperboliques avec des coefficients qui ne dépendent que du temps. Ann. Sc. Norm. Super. Pisa Cl. Sci. 6, 511–559 (1979)
Colombini, F., Del Santo, D., Fanelli, F., Métivier, G.: Timedependent loss of derivatives for hyperbolic operators with non regular coefficients. Commun. Partial Differ. Equ. 38(10), 1791–1817 (2013). https://doi.org/10.1080/03605302.2013.795968
Colombini, F., Del Santo, D., Fanelli, F., Métivier, G.: A wellposedness result for hyperbolic operators with Zygmund coefficients. J. Math. Pures Appl. 9(100), 455–475 (2013). https://doi.org/10.1016/j.matpur.2013.01.009
Colombini, F., Jannelli, E., Spagnolo, S.: Nonuniqueness in hyperbolic Cauchy problems. Ann. Math. 126, 495–524 (1987). https://doi.org/10.2307/1971359
Colombini, F., Lerner, N.: Hyperbolic operators with nonLipschitz coefficients. Duke Math. J. 77(3), 657–698 (1995). https://doi.org/10.1215/S0012709495077217
Colombini, F., Nishitani, T.: Second order weakly hyperbolic operators with coefficients sum of powers of functions. Osaka J. Math. 44(1), 121–137 (2007)
D’Ancona, P., Kinoshita, T.: On the wellposedness of the Cauchy problem for weakly hyperbolic equations of higher order. Math. Nachr. 278, 1147–1162 (2005). https://doi.org/10.1002/mana.200310299
D’Ancona, P., Kinoshita, T., Spagnolo, S.: Weakly hyperbolic systems with Hölder continuous coefficients. J. Differ. Equ. 203(1), 64–81 (2004). https://doi.org/10.1016/j.jde.2004.03.016
D’Ancona, P., Kinoshita, T., Spagnolo, S.: On the 2 by 2 weakly hyperbolic systems. Osaka J. Math. 45(4), 921–939 (2008)
Dieci, L., Eirola, T.: On smooth decompositions of matrices. SIAM J. Matrix Anal. Appl. 20(3), 800–819 (1999). https://doi.org/10.1137/S0895479897330182
Duistermaat, J.J.: Fourier Intergal Operators Progress in Mathematics, vol. 130. Birkhäuser Boston, Inc, Boston (1996)
Garetto, C.: On hyperbolic equations and systems with nonregular time dependent coefficients. J. Differ. Equ. 259(11), 5846–5874 (2015). https://doi.org/10.1016/j.jde.2015.07.011
Garetto, C., Jäh, C.: Wellposedness of hyperbolic systems with multiplicities and smooth coefficients. Math. Ann. 369(1–2), 441–485 (2017). https://doi.org/10.1007/s0020801614368
Garetto, C., Ruzhansky, M.: Wellposedness of weakly hyperbolic equations with time dependent coefficients. J. Differ. Equ. 253(5), 1317–1340 (2012). https://doi.org/10.1016/j.jde.2012.05.001
Garetto, C., Ruzhansky, M.: Weakly hyperbolic equations with nonanalytic coefficients and lower order terms. Math. Ann. 357(2), 401–440 (2013). https://doi.org/10.1007/s0020801309109
Garetto, C., Ruzhansky, M.: A note on weakly hyperbolic equations with analytic principal part. J. Math. Anal. Appl. 412(1), 1–14 (2014). https://doi.org/10.1016/j.jmaa.2013.09.011
Garetto, C., Ruzhansky, M.: Hyperbolic second order equations with nonregular time dependent coefficients. Arch. Ration. Mech. Anal. 217(1), 113–154 (2015). https://doi.org/10.1007/s0020501408301
Garetto, C., Ruzhansky, M.: On hyperbolic systems with time dependent Hölder characteristics. Ann. Mat. Pura Appl. 196(1), 155–164 (2017). https://doi.org/10.1007/s1023101605676
Garetto, C., Ruzhansky, M.: On \(C^\infty \) wellposedness of hyperbolic systems with multiplicities. Ann. Mat. Pura Appl. 196(5), 1819–1834 (2017). https://doi.org/10.1007/s1023101706392
Gingold, H.: On continuous triangularization of matrix functions. SIAM J. Math. Anal. 10(4), 709–720 (1979). https://doi.org/10.1137/0510065
Gingold, H., Hsieh, P.F.: Globally analytic triangularization of a matrix function. Linear Algebra Appl. 169, 75–101 (1992). https://doi.org/10.1016/00243795(92)901727
Gramchev, T., Orrú, N.: Cauchy problem for a class of nondiagonalizable hyperbolic systems. Discret. Contin. Dyn. Syst., 533–542 (2011). https://doi.org/10.3934/proc.2011.2011.533
Gramchev, T., Ruzhansky, M.: Cauchy problem for \(2 \times 2\) hyperbolic systems of pseudodifferential equations with nondiagonalisable principal part. Studies in phase space analysis with applications to PDEs. Progr. Nonlinear Differ. Equ. Appl. 84, 129–144 (2013)
Hörmander, L.: The Analysis of Linear Partial Differential Operators, vol. I–IV. Springer, Heidelberg (1985)
Hörmander, L.: Hyperbolic systems with double characteristics. Comm. Pure Appl. Math. 46, 261–301 (1993). https://doi.org/10.1002/cpa.3160460207
Ivrii, V.Ya., Petkov, V.M.: Necessary conditions for the correctness of the Cauchy problem for nonstrictly hyperbolic equations. (Russian). Russ. Math. Surv. 29, 3–70 (1974). https://doi.org/10.1070/RM1974v029n05ABEH001295
Kajitani, K., Yuzawa, Y.: The Cauchy problem for hyperbolic systems with Hölder continuous coefficients with respect to the time variable. Ann. Sc. Norm. Super. Pisa Cl. Sci. (5) 5(4), 465–482 (2006)
Kamotski, I., Ruzhansky, M.: Estimates and spectral asymptotics for systems with multiplicities. Funct. Anal. Appl. 39, 308–310 (2005). https://doi.org/10.1007/s1068800500522
Kamotski, I., Ruzhansky, M.: Regularity properties, representation of solutions and spectral asymptotics of systems with multiplicities. Comm. Partial Differ. Equ. 32, 1–35 (2007). https://doi.org/10.1080/03605300600856816
Kinoshita, T., Spagnolo, S.: Hyperbolic equations with nonanalytic coefficients. Math. Ann. 336, 551–569 (2006). https://doi.org/10.1007/s0020800600097
Lax, P.: Asymptotic solutions of oscillatory initial value problems. Duke Math. J. 24, 627–646 (1957). https://doi.org/10.1215/S0012709457024717
Melrose, R.B., Uhlmann, G.A.: Microlocal structure of involutiveconical refraction. Duke Math. J. 46, 571–582 (1982). https://doi.org/10.1215/S0012709479046301
Nishitani, T.: On the Cauchy problem for \(D^2_tD_x a(t, x)^n D_x\). Ann. Univ. Ferrara Sez. VII Sci. Mat. 52(2), 395–430 (2006)
Ohya, Y., Tarama, S.: The Cauchy Problem with multiple characteristics in the Gevery class–Hölder coefficients in \(t\). Hyperbolic Equations and Related Topics. Kataka/Kioto, pp. 273–306 (1984)
Parenti, C., Parmeggiani, A.: On the Cauchy problem for hyperbolic operators with double characteristics. Commun. Partial Differ. Equ. 34, 837–888 (2009). https://doi.org/10.1080/03605300902892360
Parusinski, A., Rainer, A.: Regularity of roots of polynomials. Ann. Sc. Norm. Super. Pisa Cl. Sci. (5) 16(2), 481–517 (2016)
Rozenblum, G.: Spectral asymptotic behaviour of elliptic systems (Russian). Zap. LOMI 96, 255–271 (1980)
Ruzhansky, M.: Singularities of affine fibrations in the theory of regularity of Fourier integral operators. Russ. Math. Surv. 55(1), 93–161 (2000). https://doi.org/10.1070/RM2000v055n01ABEH000250
Ruzhansky, M.: Regularity theory of Fourier integral operators with complex phases and singularities of affine fibrations. CWI Tract, 131. Stichting Mathematisch Centrum, Centrum voor Wiskunde en Informatica, Amsterdam (2001)
Yuzawa, Y.: The Cauchy problem for hyperbolic systems with Hölder continuous coefficients with respect to time. J. Differ. Equ. 219(2), 363–374 (2005). https://doi.org/10.1016/j.jde.2004.12.006
Wasow, W.: On holomorphically similar matrices. J. Math. Anal. Appl. 4(2), 202–206 (1962). https://doi.org/10.1016/0022247X(62)900501
Author information
Authors and Affiliations
Corresponding author
Additional information
Communicated by Y. Giga.
Michael Ruzhansky was supported in parts by EPSRC Grant EP/R003025/1 and by the Leverhulme Grant RPG2017151. No new data was collected or generated during the course of research.
Inspired by our colleague and friend Todor Gramchev (1956–2015).
Rights and permissions
Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
About this article
Cite this article
Garetto, C., Jäh, C. & Ruzhansky, M. Hyperbolic systems with nondiagonalisable principal part and variable multiplicities, I: wellposedness. Math. Ann. 372, 1597–1629 (2018). https://doi.org/10.1007/s0020801816721
Received:
Revised:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s0020801816721