CTRW modeling of quantum measurement and fractional equations of quantum stochastic filtering and control

Kolokoltsov, Vassili

doi:10.1007/s13540-021-00002-2

CTRW modeling of quantum measurement and fractional equations of quantum stochastic filtering and control

Original Article
Open access
Published: 07 February 2022

Volume 25, pages 128–165, (2022)
Cite this article

Download PDF

You have full access to this open access article

Fractional Calculus and Applied Analysis Aims and scope Submit manuscript

CTRW modeling of quantum measurement and fractional equations of quantum stochastic filtering and control

Download PDF

Vassili Kolokoltsov^1,2,3

1636 Accesses
Explore all metrics

Abstract

Initially developed in the framework of quantum stochastic calculus, the main equations of quantum stochastic filtering were later on derived as the limits of Markov models of discrete measurements under appropriate scaling. In many branches of modern physics it became popular to extend random walk modeling to the continuous time random walk (CTRW) modeling, where the time between discrete events is taken to be non-exponential. In the present paper we apply the CTRW modeling to the continuous quantum measurements yielding the new fractional in time evolution equations of quantum filtering and thus new fractional equations of quantum mechanics of open systems. The related quantum control problems and games turn out to be described by the fractional Hamilton-Jacobi-Bellman (HJB) equations on Riemannian manifolds. By-passing we provide a full derivation of the standard quantum filtering equations, in a modified way as compared with existing texts, which (i) provides explicit rates of convergence (that are not available via the tightness of martingales approach developed previously) and (ii) allows for the direct applications of the basic results of CTRWs to deduce the final fractional filtering equations.

Quasifree Stochastic Cocycles and Quantum Random Walks

Article Open access 02 May 2019

Large Deviations at Level 2.5 for Markovian Open Quantum Systems: Quantum Jumps and Quantum State Diffusion

Article Open access 09 July 2021

The law of large numbers for quantum stochastic filtering and control of many-particle systems

Article 16 July 2021

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

Direct continuous observations are known to destroy quantum evolutions (so-called quantum Zeno paradox), so that continuous quantum measurements have to be indirect, and the results of the observation are assessed via quantum filtering. Initially developed in the framework of quantum stochastic calculus by Belavkin in the 80s of the last century in [6,7,8], see [12] for a readable modern account, the main equations of quantum stochastic filtering, often referred to as the Belavkin equations, were later on derived via more elementary approach, as the limit of standard discrete measurements under appropriate scaling, see e.g. [9, 10, 39]. The scaling arises from the basic Markovian assumption that the times between measurement are either fixed or exponentially distributed, like in a standard random walk. Since such Markovian assumption has no a priori justification, in many branches of modern physics it became popular to extend random walk modeling to the continuous time random walk (CTRW) modeling, where the time between discrete events is taken to be non-exponential, usually from the domain of attraction of a stable law. In the present paper we apply the CTRW modeling to the continuous quantum measurements yielding the new fractional in time evolution equations of quantum filtering in the scaling limit. The related quantum control problems turn out to be described by the fractional Hamilton-Jacobi-Bellman (HJB) equations on Riemannian manifolds (complex projective spaces in the case of finite-dimensional quantum mechanics) or the fractional Isaacs equation in the case of competitive control. By-passing we provide a full derivation of the standard quantum filtering equations (explaining from scratch all underlying quantum mechanical rules used) in a slightly modified and simplified way yielding also new explicit rates of convergence (which are not available via the tightness of martingales approach developed previously) and tailored in a way that allows for the direct applications of the basic results of CTRWs to deduce the final fractional filtering equations.

Several general comments on a wider context are in order.

(i)
The fractional equations of quantum stochastic filtering derived here can be considered as an alternative formulation of fractional quantum mechanics, which is different from the framework of fractional Schrödinger equations suggested in [31] and extensively studied recently. This leads also to a different class of quantum control problems, as those related to fractional Schrödinger formulation, as discussed e.g. in [45].
(ii)
The fractional versions of the classical stochastic filtering (see [2] for the basics) has been actively studied recently, see e.g. [44].
(iii)
The quantum mean-field games as developed by the author in [25] can now be extended to the theory of fractional quantum mean-field games. The classical versions of fractional mean-field games just started to appear in the literature, see [13]. On the other hand, the application of classical stochastic filtering in the study of mean-field games has also started to appear, see [42].
(iv)
Fractional modeling and CTRW become very popular in almost all domains of physics, as well as economics and finances, see e.g. [3, 36, 43, 46] for some representative references.

The contents of the paper is as follows. In Section 2 we recall the basic notions and notations of finite-dimensional quantum mechanics, and in Section 3 we introduce the Markov chain of sequential indirect quantum measurements, which is the standard starting point for dealing with continuous measurements. In Sections 4 and 5 we derive the main quantum filtering equations in the cases of so-called counting and diffusive observations. As was already mentioned, though the derivation of the filtering equations from the approximating Markov chain is well known by now (see e.g. [38]) our approach is new and yields explicit rates of convergence. In Section 6 the limiting equation is derived in a general case of mixed counting and diffusive observations via a multichannel measuring device. This preparatory work allows us to derive our main results, fractional equations of quantum filtering and control, in a more or less straightforward way, by applying the established techniques of CTRW to the setting of the Markov chains of sequential quantum measurements, as developed in Sections 4 - 6. This is done in Sections 7 and 8. In Section 9 we briefly describe a slightly different Markov chain approximation to continuous measurement that can be used to derive filtering equations in certain cases of unbounded operators involved. In Appendices A,B,C several (known) probabilistic techniques are presented in a concise form tailored to our purposes. They are used in the main body of the paper.

Some basic notations to be used throughout the text are as follows.

For two Banach spaces B and D equipped with norms $\Vert .\Vert _B$ and $\Vert .\Vert _D$ respectively, let us denote by ${\mathcal {L}}(D,B)$ the Banach space of bounded linear operators $D \rightarrow B$ equipped with the usual operator norm $\Vert .\Vert _{D\rightarrow B}$. We shall also write ${\mathcal {L}}(B)$ for ${\mathcal {L}}(B,B)$.

The scalar product of operators in a Hilbert space is given by the trace: $(R,S)=\mathrm{{tr}} (RS)$.

For $K={\mathbf {R}}^d$ or a convex closed subset of ${\mathbf {R}}^d$ we denote C(K) the Banach space of continuous bounded functions on K, equipped with the sup-norm and $C^k(K)$ the Banach space of k times continuously differentiable functions on K (with the derivatives at the boundary understood as the continuous extensions of the derivatives in the inner points), with the norm being the sum of the sup-norms of the functions and all their partial derivatives of order not exceeding k.

2 Notations for quantum states and tensor products

Recall that a general isolated quantum system is described by a Hilbert space ${\mathcal {H}}$ and a self-adjoint operator H in it, the Hamiltonian. The pure states of the system are unit vectors in ${\mathcal {H}}$ and the general mixed states are density matrices, that is, non-negative operators in ${\mathcal {H}}$ with unit trace. Let us denote S(H) the set of all such mixed states in H. To a pure state there corresponds a density matrix according to the rule $\psi \rightarrow \gamma =\psi \otimes {\bar{\psi }}$, also denoted in Dirac’s notation as $|\psi \rangle \langle \psi |$. This density matrix is the one-dimensional orthogonal projector on the line generated by $\psi $. Pure states evolve in time according to the rule $\psi \rightarrow e^{-itH} \psi $ and the mixed state according to the rule $\gamma \rightarrow e^{-itH} \gamma e^{itH}$.

If two systems living in spaces ${\mathcal {H}}_0$ and ${\mathcal {H}}_1$ are brought to interaction, the combined system has the tensor product Hilbert space ${\mathcal {H}}_0 \otimes {\mathcal {H}}_1$ as the state space. Recall that, in the coordinate description of tensor products, if ${\mathcal {H}}_0$ and ${\mathcal {H}}_1$ have orthonormal bases $\{e_j\}$ and $\{f_j\}$ respectively, the tensor product is the space with an orthonormal basis $\{e_k\otimes f_j\}$. In particular, if ${\mathcal {H}}_0$ and ${\mathcal {H}}_1$ have finite dimensions n and k, the space ${\mathcal {H}}_0 \otimes {\mathcal {H}}_1$ has the dimension nk. The operators A in ${\mathcal {H}}_0 \otimes {\mathcal {H}}_1$ can be given by matrices $A^{i_1i_2}_{j_1j_2}$, so that

$$\begin{aligned} A (e_{i_1}\otimes f_{i_2})=\sum _{j_1,j_2} A_{i_1i_2}^{j_1j_2} e_{j_1}\otimes f_{j_2}. \end{aligned}$$

Or equivalently, if $X\in {\mathcal {H}}_0 \otimes {\mathcal {H}}_1$ has coordinates $X^{kj}$ in the basis $\{e_k\otimes f_j\}$, the vector AX has the coordinates $\sum _{m,l} A^{kj}_{ml} X^{ml}$ in this basis.

A product $A\otimes B$ of two operators A and B acting in ${\mathcal {H}}_0$ and ${\mathcal {H}}_1$ respectively is defined by its action on tensor products as

$$\begin{aligned} (A\otimes B)(e\otimes f)=Ae\otimes Bf. \end{aligned}$$

In the coordinate description $A\otimes B$ has the matrix elements expressed as $A^{i_1}_{j_1}B^{i_2}_{j_2}$ in terms of the matrix elements of A and B.

An operator A in ${\mathcal {H}}_0$ has the natural lifting $A\otimes I$ (where I is the unit operator) to ${\mathcal {H}}_0\otimes {\mathcal {H}}_1$. Similarly an operator B in ${\mathcal {H}}_1$ has the natural lifting $I\otimes B$ to ${\mathcal {H}}_0\otimes {\mathcal {H}}_1$.

The key notion of the theory of interacting systems is that of the partial trace. For an operator A in ${\mathcal {H}}_0 \otimes {\mathcal {H}}_1$ the partial trace with respect to the second system is the operator $\mathrm{{tr}}_{p1} A$ in ${\mathcal {H}}_0$ given by the matrix

$$\begin{aligned} (\mathrm{{tr}}_{p1} A)^i_j=\sum _k A^{ik}_{jk}. \end{aligned}$$

(2.1)

This partial trace is interpreted as the state of the first system given the state of the coupled one. Therefore it can be looked at as the quantum analog of the notion of marginal distribution of classical probability. Similarly, the partial trace with respect to the first system is the operator $\mathrm{{tr}}_{p0} A$ in ${\mathcal {H}}_1$ given by the matrix

$$\begin{aligned} (\mathrm{{tr}}_{p0} A)^i_j=\sum _k A^{ki}_{kj}. \end{aligned}$$

Clearly,

$$\begin{aligned} \mathrm{{tr}} (\mathrm{{tr}}_{p0} A)= \mathrm{{tr}} (\mathrm{{tr}}_{p1} A)= \mathrm{{tr}} (A). \end{aligned}$$

In a two-dimensional Hilbert spaces ${\mathbf {C}}^2$ one usually chooses the standard basis $e_0=(1,0)$, $e_1=(0,1)$, and represents the Hilbert product space ${\mathcal {H}}_0\otimes {\mathbf {C}}^2$ by the natural decomposition

$$\begin{aligned} {\mathcal {H}}_0 \otimes {\mathbf {C}}^2={\mathcal {H}}_{00}\oplus H_{01}={\mathcal {H}}_0 \otimes e_0 \oplus {\mathcal {H}}_0 \otimes e_1. \end{aligned}$$

Every operator A in this space has the block decomposition

$$\begin{aligned} A= \begin{pmatrix} A_{0\rightarrow 0} &{} A_{1\rightarrow 0} \\ A_{0\rightarrow 1} &{} A_{1\rightarrow 1} \end{pmatrix} = \begin{pmatrix} (A_{j0}^{i0}) &{} (A_{j1}^{i0}) \\ (A_{j0}^{i1}) &{} (A_{j1}^{i1}) \end{pmatrix}\, , \end{aligned}$$

where the operators $A_{i\rightarrow j}$ act from ${\mathcal {H}}_{0i}$ to ${\mathcal {H}}_{0j}$, $i,j=0,1$. The trace (2.1) gets the expression

$$\begin{aligned} (\mathrm{{tr}}_{p1} A)^i_j=A^{i0}_{j0}+A^{i1}_{j1}. \end{aligned}$$

(2.2)

In particular, we shall use the following block representations:

$$\begin{aligned} A\otimes I= & {} \begin{pmatrix} A &{} 0 \\ 0 &{} A \end{pmatrix}, \quad A\otimes \varOmega = \begin{pmatrix} A &{} 0 \\ 0 &{} 0 \end{pmatrix},\nonumber \\ C\otimes \begin{pmatrix} 0 &{} 0 \\ 1 &{} 0 \end{pmatrix}= & {} \begin{pmatrix} 0 &{} 0 \\ C &{} 0 \end{pmatrix}, \quad C\otimes \begin{pmatrix} 0 &{} 1 \\ 0 &{} 0 \end{pmatrix} =\begin{pmatrix} 0 &{} C \\ 0 &{} 0 \end{pmatrix}. \end{aligned}$$

(2.3)

More generally, if $B=(B^i_j)$ is a matrix in ${\mathbf {C}}^2$, then the matrix of $I\times B$ in ${\mathcal {H}}\otimes {\mathbf {C}}^2$ has the block decomposition

$$\begin{aligned} \begin{pmatrix} B^0_0 I &{} B_1^0 I \\ B^1_0 I &{} B_1^1 I \end{pmatrix}. \end{aligned}$$

(2.4)

To conclude this section, let us write down the simple small time asymptotic formula for the evolutions $e^{-itH}$ that we shall use repeatedly. Namely, up to the terms of order higher than $t^2$ in small t, we have

$$\begin{aligned} e^{-itH}\rho e^{itH}= & {} \left( 1-it H -\frac{1}{2} t^2 H^2\right) \rho \left( 1+it H-\frac{1}{2} t^2 H^2\right) \nonumber \\= & {} \rho -it [H,\rho ]-\frac{1}{2} t^2 H^2\rho -\frac{1}{2} t^2 \rho H^2+t^2 H\rho H\nonumber \\= & {} \rho -it [H,\rho ]+t^2 \left( H\rho H-\frac{1}{2} \{H^2,\rho \}\right) . \end{aligned}$$

(2.5)

3 The starting point: Markov chains of sequential indirect observations

Here we describe the Markov chains of sequential indirect observations (rather standard by now, at least after paper [1]) in discrete and continuous time recalling first quickly the main notions related to quantum measurements.

Physical observables are given by self-adjoint operators A in ${\mathcal {H}}$. If A has a discrete spectrum (which is always the case in finite-dimensional ${\mathcal {H}}$, that we shall mostly work with), then A has the spectral decomposition $A=\sum _j \lambda _j P_j$, where $P_j$ are orthogonal projections on the eigenspaces of A corresponding to the eigenvalues $\lambda _j$. According to the basic postulate of quantum measurement, measuring observable A in a state $\gamma $ (often referred to as the Stern-Gerlach experiment) can yield each of the eigenvalue $\lambda _j$ with the probability

$$\begin{aligned} \mathrm{{tr}} \, (\gamma P_j)=\mathrm{{tr}} \, (P_j \gamma P_j), \end{aligned}$$

(3.1)

and, if the value $\lambda _j$ was obtained, the state of the system changes (instantaneously) to the reduced state

$$\begin{aligned} P_j\gamma P_j/ \mathrm{{tr}} \, (\gamma P_j). \end{aligned}$$

In particular, if the state $\rho $ was pure, $\gamma =|\psi \rangle \langle \psi |$, then the probability to get $\lambda _j$ as the result of the measurement becomes $(\psi _,P_j\psi )$ and the reduced state also remains pure and is given by the vector $P_j\psi $. If the interaction with the apparatus was preformed ’without reading the results’, the state $\rho $ is said to be subject to a non-selective measurement that changes $\gamma $ to the state $\sum _j P_j\rho P_j$.

Indirect measurements of a chosen quantum system in the initial space ${\mathcal {H}}_0$, which we shall often referred to as an atom, are organised in the following way. One couples the atom with another quantum system, a measuring devise, specified by another Hilbert space ${\mathcal {H}}$. Namely the combined system lives in the tensor product Hilbert space ${\mathcal {H}}_0\times {\mathcal {H}}$ and its evolution is given by certain self-adjoint operator H in ${\mathcal {H}}_0\times {\mathcal {H}}$. In the measuring device some fixed vector $\varphi \in {\mathcal {H}}$ is chosen, called the vacuum and interpreted as the stationary state of the devise when no interaction is involved. The corresponding density matrix will be denoted $\varOmega =|\varphi \rangle \langle \varphi |$. Indirect measurements of the states of the atom are performed by measuring the coupled system via an observable of the second system and then projecting the resulting state to the atom via the partial trace.

Namely, it is described by an operator R in ${\mathcal {H}}$ with the spectral decomposition $R=\sum _j \lambda _j P_j$ and is performed in two steps: given a state $\gamma $ in ${\mathcal {H}}_0\times {\mathcal {H}}$ one performs a measurement of R lifted as $I\otimes R$ to ${\mathcal {H}}_0\times {\mathcal {H}}$ yielding values $\lambda _j$ and new states

$$\begin{aligned} (I\otimes P_j)\gamma (I\otimes P_j)/ \mathrm{{tr}} \, (\gamma (I\otimes P_j)) \end{aligned}$$

with probabilities $p_j= \mathrm{{tr}} \, (\gamma (I\otimes P_j))$, and then one projects these states to ${\mathcal {H}}_0$ via the partial trace producing the states

$$\begin{aligned} \mathrm{{tr}}_{p1} [(I\otimes P_j)\gamma (I\otimes P_j)/ \mathrm{{tr}} \, (\gamma (I\otimes P_j))]. \end{aligned}$$

(3.2)

The discrete time Markov chain of successive indirect observations (or measurements) evolves according to the following procedure specified by a triple: a self-adjoint operator H in ${\mathcal {H}}_0\times {\mathcal {H}}$, a self-adjoint operator R in ${\mathcal {H}}$ and the vacuum vector $\varOmega $ in ${\mathcal {H}}$. (i) Starting with an initial state $\rho $ of ${\mathcal {H}}_0$ one couples it with the device in its vacuum state $\varOmega $ producing the state $\gamma =\rho \otimes \varOmega $ in ${\mathcal {H}}_0\times {\mathcal {H}}$, (ii) During a fixed period of time t one evolves the system according to the operator H producing the state $\gamma _t=e^{-itH}\gamma e^{itH}$ in ${\mathcal {H}}_0\times {\mathcal {H}}$, (iii) One performs the indirect measurement with the state $\gamma _t$ yielding the states

$$\begin{aligned} \rho _t^j=\mathrm{{tr}}_{p1} \frac{(I\otimes P_j)\gamma _t (I\otimes P_j)}{p_j(t)} =\mathrm{{tr}}_{p1} \frac{(I\otimes P_j)e^{-itH}(\rho \otimes \varOmega ) e^{itH} (I\otimes P_j)}{p_j(t)}\nonumber \\ \end{aligned}$$

(3.3)

with the probabilities

$$\begin{aligned} p_j(t)=\mathrm{{tr}} \, (\gamma _t (I\otimes P_j)) =\mathrm{{tr}} \, (e^{-itH}(\rho \otimes \varOmega ) e^{itH} (I\otimes P_j)).\nonumber \\ \end{aligned}$$

(3.4)

Then the same repeats starting with $\rho _t$ as the initial state. Let us denote $U_t$ the transition operator of this Markov chain that acts on the set of continuous functions on S(H) as

$$\begin{aligned} U_t f(\rho )={\mathbf {E}}f(\rho _t)=\sum _j p_j(t) f(\rho _t^j). \end{aligned}$$

(3.5)

Similarly one can define the continuous time Markov chain of successive indirect observations (or measurements) $O^{\rho }_{t,\lambda }$ and the corresponding Markov semigroup $T_t^{\lambda }$ on C(H(S)) evolving according to the same rules, with only difference that the times t between successive measurements are not fixed, but represent exponential random variables $\tau $ with some fixed intensity $\lambda $: ${\mathbf {P}}(\tau >t)=e^{-\lambda t}$. The generator $L^{\lambda }$ of this Markov process is bounded in C(S(H)) and acts as

$$\begin{aligned} L^{\lambda }f(\rho )=\frac{(U_{\lambda }f-f)(\rho )}{\lambda }=\frac{1}{\lambda }\sum _j p_j(\lambda ) (f(\rho _{\lambda }^j)-f(\rho )). \end{aligned}$$

(3.6)

All “quantum content” of the theory is now captured in the explicit formula (3.3). What follows will be the pure classical probability analysis of these Markov chains, their scaling limits and control.

In this paper we shall work with the measuring devises of the simplest form living in two-dimensional Hilbert spaces ${\mathbf {C}}^2$ or more generally the tensor products of these spaces. Choosing the standard basis $e_0=(1,0)$, $e_1=(0,1)$, we shall use the decomposition

$$\begin{aligned} {\mathcal {H}}_0\otimes {\mathbf {C}}^2={\mathcal {H}}_{00}\oplus H_{01}={\mathcal {H}}_0 \otimes e_0 \oplus {\mathcal {H}}_0 \otimes e_1, \end{aligned}$$

and we shall choose the vacuum vector $\varphi =e_0$, so that

$$\begin{aligned} \varOmega =\begin{pmatrix} 1 &{} 0 \\ 0 &{} 0 \end{pmatrix}. \end{aligned}$$

4 Belavkin equations for a counting observation

For simplicity we shall work exclusively with finite-dimensional Hilbert spaces ${\mathcal {H}}_0={\mathbf {C}}^n$, making occasionally some comments about more general case. The set of states $S({\mathbf {C}}^n)$ is a compact convex set in the Euclidean space ${\mathbf {R}}^{n^2}$, the space of complex Hermitian $n\times n$ matrices.

Let us choose an arbitrary self-adjoint operator in ${\mathcal {H}}_0\otimes {\mathbf {C}}^2$ given by its matrix representation

$$\begin{aligned} H= \begin{pmatrix} A &{} 0 \\ 0 &{} B \end{pmatrix} + \begin{pmatrix} 0 &{} -iC^* \\ iC &{} 0 \end{pmatrix}. \end{aligned}$$

We are aiming at calculating the small time asymptotics of the Markov transition operators defined by (3.3).

The main idea for obtaining sensible asymptotic limits suggests enhancing the interaction part C of H by replacing it with the scaled version $C/\sqrt{t}$. Thus we choose the Hamiltonian in the form

$$\begin{aligned} H= \begin{pmatrix} A &{} 0 \\ 0 &{} B \end{pmatrix} + \frac{1}{\sqrt{t}} \begin{pmatrix} 0 &{} -iC^* \\ iC &{} 0 \end{pmatrix}. \end{aligned}$$

Remark 1

The idea of the scaling comes from the analysis of the so-called quantum Zeno paradox. Its essence is a rather simple observation that if one performs repeated measurements with reduction (3.1) and pass to the limit, as time between measurements tends to zero, then the state effectively remains in the initial state all the time irrespectively of the dynamics. This effect is also referred to as the watch dog effect. Therefore the only way to get a sensible dynamics that takes into account both dynamics and observation is to enhance the interaction part of the dynamics to make its effect comparable with that of the repeated reduction (3.1). Thus one can suggest scaling C as $C/t^{\alpha }$ with some $\alpha >0$. As calculations show (one can repeat the calculations below with an arbitrary $\alpha $) only with $\alpha =1/2$ a sensible limit is obtained.

By the second equation in (2.3), we get

$$\begin{aligned} \rho \otimes \varOmega= & {} \begin{pmatrix} \rho &{} 0 \\ 0 &{} 0 \end{pmatrix}, \quad \left[ H, \begin{pmatrix} \rho &{} 0 \\ 0 &{} 0 \end{pmatrix}\right] =\begin{pmatrix} [A,\rho ] &{} + i\rho C^*/\sqrt{t} \\ iC\rho /\sqrt{t} &{} 0 \end{pmatrix}\\ H\begin{pmatrix} \rho &{} 0 \\ 0 &{} 0 \end{pmatrix}H= & {} \begin{pmatrix} A\rho A &{} -i A\rho C^*/\sqrt{t}\\ iC\rho A/\sqrt{t} &{} C\rho C^*/t \end{pmatrix},\\ H^2= & {} \begin{pmatrix} A^2+C^*C/t &{} -i(AC^* +C^* B)/\sqrt{t} \\ i(CA+BC)/\sqrt{t} &{} B^2+CC^*/t \end{pmatrix},\\ \{H^2, \rho \otimes \varOmega \}= & {} \begin{pmatrix} \{A^2+C^*C/t,\rho \} &{} -i\rho (AC^*+C^*B)/\sqrt{t} \\ i(CA+BC)\rho /\sqrt{t} &{} 0 \end{pmatrix}, \end{aligned}$$

where $\{C,D\}=CD+DC$ denotes the anti-commutator. Using (2.5), and keeping terms of order not exceeding t we get the approximation

$$\begin{aligned} e^{-itH} (\rho \otimes \varOmega ) e^{itH} =\begin{pmatrix} \rho -it[A, \rho ] -\frac{1}{2} t\{C^*C,\rho \} &{} \sqrt{t} \rho C^* \\ \sqrt{t} C\rho &{} tC\rho C^* \end{pmatrix}, \end{aligned}$$

(4.1)

which is the key formula for what follows.

As it turns out, the limiting processes are of two types, depending on whether the projectors $P_0$ and $P_1$ of the spectral decomposition of R are diagonal, that is

$$\begin{aligned} P_0= \begin{pmatrix} 1 &{} 0 \\ 0 &{} 0 \end{pmatrix}, \quad P_1=\begin{pmatrix} 0 &{} 0 \\ 0 &{} 1 \end{pmatrix}, \end{aligned}$$

(4.2)

or otherwise. Let us start with the case of projectors (4.2).

We have

$$\begin{aligned} I\otimes P_0= \begin{pmatrix} I &{} 0 \\ 0 &{} 0 \end{pmatrix}, \quad I\otimes P_1 =\begin{pmatrix} 0 &{} 0 \\ 0 &{} I \end{pmatrix}, \end{aligned}$$

and

$$\begin{aligned} (I\otimes P_0) e^{-itH} \begin{pmatrix} \rho &{} 0 \\ 0 &{} 0 \end{pmatrix} e^{itH} (I\otimes P_0)= & {} \rho -it[A, \rho ] -\frac{1}{2} t\{C^*C,\rho \},\\ (I\otimes P_1) e^{-itH} \begin{pmatrix} \rho &{} 0 \\ 0 &{} 0 \end{pmatrix} e^{itH} (I\otimes P_1)= & {} tC\rho C^*. \end{aligned}$$

Hence the non-normalized new states are

$$\begin{aligned} {\tilde{\rho }}_1=\rho -it[A, \rho ] -\frac{1}{2} t\{C^*C,\rho \}, \quad {\tilde{\rho }}_2= tC\rho C^*, \end{aligned}$$

occurring with the probabilities

$$\begin{aligned} p_1=1-t \, \mathrm{{tr}} (C^*C \rho ), \quad p_2=t \, \mathrm{{tr}} (C^*C \rho ). \end{aligned}$$

Aiming at using Proposition 1 (ii) we are looking for the limit of the operator $(U_h-1)/h$ for $h \rightarrow 0$.

Denoting $T = \mathrm{{tr}} (C^*C \rho )$ we can write up to terms of order t that

$$\begin{aligned} \frac{U_h-1}{h} f(\rho )= & {} \frac{1}{h} (1-hT)\left[ f\left( \frac{{\tilde{\rho }}_1}{1-hT}\right) -f(\rho )\right] +\frac{1}{h} h\, T \left[ \left( f\left( \frac{{\tilde{\rho }}_2}{hT}\right) -f(\rho )\right) \right] \\\approx & {} \frac{1}{h} (1-hT)\left[ f(\rho -ih[A, \rho ] -\frac{1}{2} h\{C^*C,\rho \}+h\rho T)-f(\rho )\right] \\&+T \left[ f\left( \frac{C\rho C^*}{T}\right) -f(\rho )\right] , \end{aligned}$$

which equals approximately to

$$\begin{aligned} L_{count}f(\rho )=-\left( f'(\rho ), i[A, \rho ] +\frac{1}{2} \{C^*C,\rho \}-\rho T\right) +T \left[ f\left( \frac{C\rho C^*}{T}\right) -f(\rho )\right] . \end{aligned}$$

(4.3)

Summarising by looking carefully at the small terms ignored, we can conclude the following.

Lemma 1

Under the setting considered,

$$\begin{aligned} \left\| \frac{U_h-1}{h} f -L_{count}f\right\| \le \sqrt{h} \varkappa \Vert f\Vert _{C^2(S({\mathcal {H}}_0))} \end{aligned}$$

(4.4)

for $f\in C^2(S({\mathcal {H}}_0))$, with $L_{count}$ given by (4.3) and a constant $\varkappa $.

We can prove now our first result.

Theorem 1

Let ${\mathcal {H}}_0={\mathbf {C}}^n$ and A, C be $n\times n$ square matrices with A being Hermitian. Then :

(i)
The operator (4.3) generates a Feller process $O_t^{\rho }$ in $S({\mathcal {H}}_0)$ and the corresponding Feller semigroup $T_t$ in $C(S({\mathcal {H}}_0))$ having the spaces $C^1(S({\mathcal {H}}_0))$ and $C^2(S({\mathcal {H}}_0))$ as invariant cores, and $T_s$ are bounded in these spaces uniformly for $s\in [0,t]$ with any $t>0$.
(ii)
The scaled discrete semigroups $(U_h)^{[s/h]}$ converge to the semigroup $T_s$, as $h\rightarrow 0$, so that the corresponding processes converge in distribution, with the following rates of convergence:
$$\begin{aligned} \Vert (U_h)^{[s/h]} -T_sf\Vert \le \sqrt{h} s \varkappa (t) \Vert f\Vert _{C^2(S({\mathcal {H}}_0))}, \end{aligned}$$
(4.5)
where the constant $\varkappa (t)$ depends on the dimension n and the norms of A and C.
(iii)
The scaled semigroups $T_s^{\lambda }$ converge to the semigroup $T_s$, as $\lambda \rightarrow 0$, so that the corresponding processes converge in distribution, with the following rates of convergence:
$$\begin{aligned} \Vert T_s^{\lambda }f -T_sf\Vert \le \sqrt{\lambda }s \varkappa (t) \Vert f\Vert _{C^2(S({\mathcal {H}}_0))}. \end{aligned}$$
(4.6)

Proof

(i)
This is a consequence of Proposition 3. To make this conclusion one needs to show property (11.3) with $K=S({\mathbf {C}}^n)$ and
$$\begin{aligned} b(\rho )= -i[A, \rho ] -\frac{1}{2} \{C^*C,\rho \}+ \mathrm{{tr}} (C^*C\rho ) \rho . \end{aligned}$$
It is straightforward to see that the solutions to the ODE ${\dot{\rho }}=b(\rho )$ preserve the affine set of Hermitian matrices with unit trace. So the key point is the preservation of positivity. It turns out that a stronger version of (11.3) holds, namely that $d(\rho +h b(\rho ),K)=0$ for any $\rho $ from the boundary of K and all sufficiently small h. By the compactness of a unit ball in ${\mathbf {C}}^n$, this claim follows from the following one. If $\rho $ belongs to the boundary of K, that is, there exists a nonempty set $V(\rho )$ of unit vectors such that $\rho v=0$ for $v\in V(\rho )$, then $(v,(\rho +h b(\rho ))v)\ge 0$ for any unit vector v and all $h\le h(v)$ with some $h(v)>0$ (because, by compactness, then $\min _v h(v)>0$). But this property is obvious for $v\notin V(\rho )$. On the other hand $(v,b(\rho )v)=0$ for $v\in V(\rho )$ implying that $(v,(\rho +h b(\rho ))v)= 0$ for all $h>0$ and all $v\in V(\rho )$.
(ii)
This is a consequence of (i), Proposition 1 (ii) and the observation that (10.5) holds here with the triple of spaces $C^2(S({\mathcal {H}}_0))\subset C^1(S({\mathcal {H}}_0)) \subset C(S({\mathcal {H}}_0))$.
(iii)
This is a consequence of (i), formula (3.6) and Proposition 1 (i), with $B=C(S({\mathcal {H}}_0))$, $D=C^2(S({\mathcal {H}}_0))$. $\square $

Remark 2

This result extends almost automatically to the case of an arbitrary separable Hilbert space ${\mathcal {H}}_0$ and arbitrary bounded operators H, C, with the derivatives understood in the Fréchet sense. The only point where the finite-dimensional setting was used was in proving statement (i) using compactness of a unit ball in ${\mathbf {C}}^n$ and the Brezis theorem. In infinite-dimensional case one can use the compactness of a unit ball in a Hilbert space in the weak topology and the Banach-space version of the Brezis theorem, as presented in [32] and [30].

As is seen directly via Ito’s formula, the Feller process $O_t^{\rho }$ generated by (4.3) can be described as solving the jump type SDE

$$\begin{aligned} d\rho =\left( - i[A, \rho ] -\frac{1}{2} \{C^*C,\rho \}+ \mathrm{{tr}} (C\rho C^*) \rho \right) dt +\left( \frac{C\rho C^*}{\mathrm{{tr}} (C\rho C^*)}-\rho \right) dN_t,\nonumber \\ \end{aligned}$$

(4.7)

with the counting process $N_t$ with the position dependent intensity $\mathrm{{tr}} (C^*C\rho )$, so that the compensated process $N_t-\int _0^t \mathrm{{tr}} (C^*C\rho _s) \, ds$ is a martingale. Equation (4.7) is the Belavkin quantum filtering SDE corresponding to the counting type observation (because the driving process $N_t$ is a counting process). Representation via the generator is an equivalent way of specifying the process of continuous quantum observation and filtering.

Remark 3

Equation (4.7) is slightly nonstandard as the driving noise $N_t$ is itself position dependent. However there is a natural way to rewrite it in terms of an independent driving noise. Namely, with a standard Poisson random measure process $N(dx \,dt)$ on ${\mathbf {R}}_+\times {\mathbf {R}}_+$ (with Lebesgue measure as intensity) one can rewrite equation (4.7) in the following equivalent form:

$$\begin{aligned} d\rho= & {} \left( - i[A, \rho ] -\frac{1}{2} \{C^*C,\rho \}+ \mathrm{{tr}} (C\rho C^*) \rho \right) dt\nonumber \\&+\left( \frac{C\rho C^*}{\mathrm{{tr}} (C\rho C^*)}-\rho \right) {\mathbf {1}}(\mathrm{{tr}} (C^*C\rho )\le x) N(dx\, dt), \end{aligned}$$

(4.8)

see details of this construction in [38]. Alternatively, one can make sense of (4.7) in terms of the general theory of weak SDEs from [20].

Remark 4

The meaning of the term ’counting observation’ (as well as ’diffusive type’ of the next section) becomes more concrete in a more advanced treatment of the process of quantum measurement, see e.g. [12].

5 Belavkin equations for a diffusive observation

Let us turn to the second case of choosing orthogonal projectors $P_0,P_1$, when they differ from the diagonal choice (4.2).

General couple of two orthogonal projectors in ${\mathbf {C}}^2$ is easily seen to be of the form

$$\begin{aligned} P_0= & {} \begin{pmatrix} \cos ^2 \phi &{} \sin \phi \cos \phi e^{i\psi }\\ \sin \phi \cos \phi e^{i\psi } &{} \sin ^2\phi \end{pmatrix}, \\ P_1= & {} \begin{pmatrix} \sin ^2 \phi &{} -\sin \phi \cos \phi e^{i\psi }\\ -\sin \phi \cos \phi e^{i\psi } &{} \cos ^2\phi \end{pmatrix}. \end{aligned}$$

The phase terms with $\psi $ does not make much difference, so we choose further $\psi =0$. Moreover, to avoid diagonal case we assume $\phi \ne \pi k/2$, $k\in N$.

By (2.4),

$$\begin{aligned} I\times P_0= & {} \begin{pmatrix} \cos ^2 \phi I &{} \sin \phi \cos \phi I \\ \sin \phi \cos \phi I &{} \sin ^2\phi I\end{pmatrix},\\ I\times P_1= & {} \begin{pmatrix} \sin ^2 \phi I &{} -\sin \phi \cos \phi I \\ -\sin \phi \cos \phi I &{} \cos ^2\phi I\end{pmatrix}. \end{aligned}$$

Hence, for arbitrary matrices a, b, c, d, we have

$$\begin{aligned} (I\times P_0) \begin{pmatrix} a &{} b \\ c &{} d \end{pmatrix} = \begin{pmatrix} \cos ^2 \phi \, a + \sin \phi \cos \phi \, c &{} \cos ^2 \phi \, b + \sin \phi \cos \phi \, d \\ \sin \phi \cos \phi \, a + \sin ^2\phi \, c &{} \sin \phi \cos \phi \, b + \sin ^2\phi \, d\end{pmatrix} \end{aligned}$$

and

$$\begin{aligned} (I\times P_0) \begin{pmatrix} a &{} b \\ c &{} d \end{pmatrix} (I\times P_0) =\begin{pmatrix} \cos ^2 \phi \, \omega _{\phi } &{} \sin \phi \cos \phi \, \omega _{\phi } \\ \sin \phi \cos \phi \, \omega _{\phi } &{} \sin ^2\phi \, \omega _{\phi } \end{pmatrix} \end{aligned}$$

with

$$\begin{aligned} \omega _{\phi }=\omega _{\phi }(a,b,c,d)= \cos ^2 \phi \, a+ \sin \phi \cos \phi (b+c)+ \sin ^2\phi \, d. \end{aligned}$$

Since $P_1$ is obtained from $P_0$ by changing $\phi $ to $\phi +\pi /2$, it follows that

$$\begin{aligned} (I\times P_1) \begin{pmatrix} a &{} b \\ c &{} d \end{pmatrix} (I\times P_1) =\begin{pmatrix} \sin ^2 \phi \, {\tilde{\omega }}_{\phi } &{} -\sin \phi \cos \phi \, {\tilde{\omega }}_{\phi } \\ -\sin \phi \cos \phi \, {\tilde{\omega }}_{\phi } &{} \cos ^2\phi \, {\tilde{\omega }}_{\phi } \end{pmatrix} \end{aligned}$$

with

$$\begin{aligned} {\tilde{\omega }}_{\phi }=\omega _{\phi +\pi /2} = \sin ^2 \phi \, a -\sin \phi \cos \phi (b+c)+ \cos ^2\phi \, d. \end{aligned}$$

By (2.2) we get

$$\begin{aligned}&\mathrm{{tr}}_{p1} [(I\times P_0) \begin{pmatrix} a &{} b \\ c &{} d \end{pmatrix} (I\times P_0)]\\&\quad =\omega _{\phi } = \cos ^2 \phi \, a+ \sin \phi \cos \phi (b+c)+ \sin ^2\phi \, d,\\&\mathrm{{tr}}_{p1} [(I\times P_1) \begin{pmatrix} a &{} b \\ c &{} d \end{pmatrix} (I\times P_1)]\\&\quad ={\tilde{\omega }}_{\phi } = \sin ^2 \phi \, a- \sin \phi \cos \phi (b+c)+ \cos ^2\phi \, d. \end{aligned}$$

To get new states we have to take a, b, c, d from (4.1). Hence for the non-normalized states we get the approximate formulas (up to terms of order t):

$$\begin{aligned} {\tilde{\rho }}_1= & {} \cos ^2 \phi (\rho -it[A, \rho ] -\frac{1}{2} t\{C^*C,\rho \})\\&+ \sqrt{t} \sin \phi \cos \phi (\rho C^* + C\rho )+ t \sin ^2\phi \,C\rho C^*,\\ {\tilde{\rho }}_2= & {} \sin ^2 \phi (\rho -it[A, \rho ] -\frac{1}{2} t\{C^*C,\rho \})\\&- \sqrt{t} \sin \phi \cos \phi (\rho C^* + C\rho )+ t\cos ^2\phi \, C\rho C^*. \end{aligned}$$

These states occur with the probabilities

$$\begin{aligned} p_1= & {} \cos ^2\phi (1-tT)+\sqrt{t} \sin \phi \cos \phi \, \mathrm{{tr}} (\rho C^* + C\rho )+t T \sin ^2\phi ,\\ p_2= & {} \sin ^2\phi (1-tT)-\sqrt{t} \sin \phi \cos \phi \, \mathrm{{tr}} (\rho C^* + C\rho )+t T \cos ^2\phi . \end{aligned}$$

For arbitrary numbers a, b, c, one can write up to terms of order t, that

$$\begin{aligned} \frac{1}{a+b\sqrt{t} +ct} =\frac{1}{a} \frac{1}{1+(b/a) \sqrt{t}+(c/a) t} =\frac{1}{a}(1-(b/a) \sqrt{t}-(c/a) t+(b/a)^2 t). \end{aligned}$$

Consequently, with this order of approximation,

$$\begin{aligned} \frac{1}{p_1}= & {} \frac{1}{\cos ^2 \phi }(1- \tan \phi \sqrt{t} \, \mathrm{{tr}} (\rho C^* + C\rho )-T(\tan ^2\phi -1) t\\&+\tan ^2 \phi \, [\mathrm{{tr}} (\rho C^* + C\rho )]^2 t),\\ \frac{1}{p_2}= & {} \frac{1}{\sin ^2 \phi }(1+ \cot \phi \sqrt{t} \, \mathrm{{tr}} (\rho C^* + C\rho ) -T(\cot ^2\phi -1) t\\&+\cot ^2 \phi \, [\mathrm{{tr}} (\rho C^* + C\rho )]^2 t), \end{aligned}$$

and therefore the normalized states are given by the formulas

$$\begin{aligned} \rho _1= & {} \frac{{\tilde{\rho }}_1}{p_1} = [\rho -it[A, \rho ] -\frac{1}{2} t\{C^*C,\rho \} + \sqrt{t} \tan \phi (\rho C^* + C\rho )+ t \tan ^2\phi \,C\rho C^*]\\&\times (1- \tan \phi \sqrt{t} \, \mathrm{{tr}} (\rho C^* + C\rho )-T(\tan ^2\phi -1) t +\tan ^2 \phi \, [\mathrm{{tr}} (\rho C^* + C\rho )]^2 t)\\= & {} \rho + \sqrt{t} \tan \phi (\rho C^* + C\rho -\varOmega \rho )+tB_1 \end{aligned}$$

with

$$\begin{aligned} \varOmega = \mathrm{{tr}} (\rho C^* + C\rho ) \end{aligned}$$

and

$$\begin{aligned} B_1= -i[A, \rho ] -\frac{1}{2} \{C^*C,\rho \}+ T\rho +\tan ^2\phi (C\rho C^*- (\rho C^* + C\rho ) \varOmega -T\rho + \varOmega ^2 \rho ), \end{aligned}$$

and

$$\begin{aligned} \rho _2= & {} \frac{{\tilde{\rho }}_2}{p_2} = [\rho -it[A, \rho ] -\frac{1}{2} t\{C^*C,\rho \} - \sqrt{t} \cot \phi (\rho C^* + C\rho )+ t \cot ^2\phi \,C\rho C^*]\\&\times (1+ \cot \phi \sqrt{t} \, \mathrm{{tr}} (\rho C^* + C\rho )-T(\cot ^2\phi -1) t +\cot ^2 \phi \, [\mathrm{{tr}} (\rho C^* + C\rho )]^2 t)\\= & {} \rho - \sqrt{t} \cot \phi (\rho C^* + C\rho -\varOmega \rho )+tB_2 \end{aligned}$$

with

$$\begin{aligned} B_2= -i[A, \rho ] -\frac{1}{2} \{C^*C,\rho \}+T\rho + \cot ^2\phi (C\rho C^*- (\rho C^* + C\rho ) \varOmega -T\rho +\varOmega ^2 \rho ). \end{aligned}$$

The terms of order t in $p_j$ give contributions of lower order, so that to the main order in small h we have

$$\begin{aligned}&\frac{U_h-1}{h} f(\rho )\\&\quad =\frac{1}{h} p_1\left[ f(\rho _1)-f(\rho )\right] + \frac{1}{h} p_1\left[ f(\rho _2)-f(\rho )\right] .\\&\quad =\frac{1}{h}(\cos ^2\phi +\sqrt{h} \sin \phi \cos \phi \varOmega ) \bigl [(f'(\rho ),\sqrt{h} \tan \phi (\rho C^* + C\rho -\varOmega \rho )+tB_1)\\&\qquad +\frac{1}{2} \tan ^2\phi [(\rho C^* + C\rho -\varOmega \rho )f''(\rho ) (\rho C^* + C\rho -\varOmega \rho )]h\bigr ]\\&\qquad +\frac{1}{h}(\sin ^2\phi -\sqrt{h} \sin \phi \cos \phi \varOmega ) \bigl [ (f'(\rho ),- \sqrt{h} \cot \phi (\rho C^* + C\rho -\varOmega \rho )+tB_2)\\&\qquad +\frac{1}{2} \cot ^2\phi [(\rho C^* + C\rho -\varOmega \rho )f''(\rho ) (\rho C^* + C\rho -\varOmega \rho )]h\bigr ], \end{aligned}$$

where, for a matrix A,

$$\begin{aligned} {[}Af''(\rho ) A]=\sum _{ijkl} A_{ij} \frac{\partial ^2 f}{\partial \rho _{ij}\partial \rho _{kl}} A_{kl}. \end{aligned}$$

The terms of order $h^{-1/2}$ cancel and we get in the main term

$$\begin{aligned}&\frac{U_h-1}{h} f(\rho )\approx \frac{1}{2} [(\rho C^* + C\rho -\varOmega \rho )f''(\rho ) (\rho C^* + C\rho -\varOmega \rho )]\\&\quad +(f'(\rho ),\varOmega (\rho C^* + C\rho -\varOmega \rho )+\cos ^2\phi B_1+\sin ^2\phi B_2)\approx L_{dif}f(\rho ) \end{aligned}$$

with

$$\begin{aligned} L_{dif}f(\rho )= & {} \frac{1}{2} [(\rho C^* + C\rho -\varOmega \rho )f''(\rho ) (\rho C^* + C\rho -\varOmega \rho )]\nonumber \\&+\left( f'(\rho ),-i[A, \rho ] -\frac{1}{2} \{C^*C,\rho \}+ C\rho C^* \right) , \end{aligned}$$

(5.1)

which is remarkably independent of $\phi $! Thus, taking into account the terms that were ignored within the approximation, we obtained the following counterpart of Lemma 1:

Lemma 2

Under the setting considered, and for any $\phi \ne \pi k/2$, $k\in {\mathbf {Z}}$,

$$\begin{aligned} \left\| \frac{U_h-1}{h} f -L_{dif}f\right\| \le \sqrt{h} \varkappa \Vert f\Vert _{C^3(S({\mathcal {H}}_0))} \end{aligned}$$

(5.2)

for $f\in C^3(S({\mathcal {H}}_0))$, with $L_{dif}$ given by (5.1).

Unlike the jump-type limiting processes analysed in the previous section, where a straightforward pure analytic proof of the well-posedness of the process generated by L is available, here an approach using SDEs is handy. Ito’s formula shows that a process generated by (5.1) can arise from solving the following Ito’s SDE:

$$\begin{aligned} d\rho&=\left( -i[A, \rho ] -\frac{1}{2} \{C^*C,\rho \}+ C\rho C^* \right) \, dt\nonumber \\&\quad +\left( \rho C^* + C\rho -\mathrm{{tr}}\, (\rho C^* + C\rho )\rho \right) \, dW_t, \end{aligned}$$

(5.3)

where $W_t$ is a standard one-dimensional Wiener process. This SDE is the Belavkin quantum filtering SDE for normalized states corresponding to the diffusive type observation.

Theorem 2

Let ${\mathcal {H}}_0={\mathbf {C}}^n$ and A, C be $n\times n$ square matrices with A being Hermitian. Then:

(i)
The operator (5.1) generates a Feller process $O_t^{\rho }$ in $S({\mathcal {H}}_0)$ and the corresponding Feller semigroup $T_t$ in $C(S({\mathcal {H}}_0))$ having the spaces $C^2(S({\mathcal {H}}_0))$ and $C^3(S({\mathcal {H}}_0))$ as invariant cores, and $T_s$ are bounded in these spaces uniformly for $s\in [0,t]$ with any $t>0$. This process is given by the solutions to SDE (5.3), which is well posed as a diffusion equation in $S({\mathcal {H}}_0)$.
(ii)
The scaled discrete semigroups $(U_h)^{[s/h]}$ converge to the semigroup $T_s$, as $h\rightarrow 0$, so that the corresponding processes converge in distribution, with the following rates of convergence:
$$\begin{aligned} \Vert (U_h)^{[s/h]} -T_sf\Vert \le \sqrt{h} s \varkappa (t) \Vert f\Vert _{C^3(S({\mathcal {H}}_0))}, \end{aligned}$$
(5.4)
where the constant $\varkappa (t)$ depends on the norms of A and C.
(iii)
The scaled semigroups $T_s^{\lambda }$ converge to the semigroup $T_s$, as $\lambda \rightarrow 0$, so that the corresponding processes converge in distribution, with the following rates of convergence:
$$\begin{aligned} \Vert T_s^{\lambda }f -T_sf\Vert \le \sqrt{\lambda }s \varkappa (t) \Vert f\Vert _{C^3(S({\mathcal {H}}_0))}. \end{aligned}$$
(5.5)

Proof

Parts (ii) and (iii) are obtained by the same arguments as in the proof of Theorem 1. One only has to mention that estimate (10.3) needed to apply Proposition 1 follows from the standard fact of the theory of diffusion that ${\mathbf {E}}((X_t(x)-x)^2)\le Ct$ for any diffusion $X_t(x)$ with bounded smooth coefficients. So we need only to prove (i). All claims follow if one can construct a diffusion in $S({\mathcal {H}}_0)$ solving (5.3), because in $S({\mathcal {H}}_0)$ all coefficients are bounded, and then both the uniqueness of solution and the required smoothness of solutions with respect to initial data follow automatically from the smoothness of the coefficients by the standard tools of Ito’s SDEs. The main difficulty here lies in proving that solutions to (5.3) preserve the set of positive matrices. But the fact that SDE (5.3) is well-posed in $S({\mathcal {H}}_0)$ is a well known fact, see e.g. Section 3.4.1 in monograph [5]. Thus one can complete a proof of Theorem 2 by referring to this result. However, a proof of [5] is indirect, and the fact is really crucial. Therefore, for completeness we sketch below a different direct proof that the solutions to (5.3) preserve the set of positive matrices. In this approach we shall consider the coefficients of the equation (5.3) to be given as they are only for nonnegative $\rho $ of unit trace and continued smoothly to all Hermitian $\rho $ in such a way that these coefficients vanish outside some neighborhood of this set. The modified equations (5.3) have globally bounded smooth coefficients and hence have unique well defined global solutions. Thus we really only need to show the preservation of positivity.

Our method is based on the Stratonovich integral. Recall that the Stratonovich differential $\circ dX$ is lined with Ito’s differential by the formula $Z\circ dX =Z \, dX +(1/2)dZ \, dX$. Hence denoting

$$\begin{aligned} B(\rho )=\rho C^* + C\rho -\mathrm{{tr}}\, (\rho C^* + C\rho )\rho , \end{aligned}$$

equation (5.3) rewrites in Stratonovich form as

$$\begin{aligned} d\rho= & {} \left( -i[A, \rho ] -\frac{1}{2} \{C^*C,\rho \}+ C\rho C^*\right) \, dt +B(\rho ) \circ dW_t-\frac{1}{2} dB(\rho ) \,dW_t\\= & {} \left( -i[A, \rho ] -\frac{1}{2} \{C^*C,\rho \}+ C\rho C^* \right) \, dt +B(\rho ) \circ dW_t\\&-\frac{1}{2} [B(\rho ) C^*+CB(\rho )-\mathrm{{tr}} \,(\rho C^*+C\rho ) B(\rho ) - \mathrm{{tr}}\,(B(\rho )C^*+CB(\rho ))\rho ] dt. \end{aligned}$$

Using the fundamental result of the Stratonovich integral, stating that solutions to Stratonovich SDEs can be obtained as the limits of the solutions to the ODEs obtained by approximating the white noise with smooth functions, we can state that the solutions to this Stratonovich equation preserve positivity of matrices, if the equations

$$\begin{aligned} {\dot{\rho }}&=-i[A, \rho ] -\frac{1}{2} \{C^*C,\rho \}+ C\rho C^* +B(\rho ) \phi _t\nonumber \\&\quad -\frac{1}{2} [B(\rho ) C^*+CB(\rho )-\mathrm{{tr}} \,(\rho C^*+C\rho ) B(\rho ) - \mathrm{{tr}}\,(B(\rho )C^*+CB(\rho ))\rho ] \end{aligned}$$

(5.6)

preserve the set of positive matrices for any continuous function $\phi _t$. But this follows by the Brezis Theorem 6. To see this we substitute the expression for $B(\rho )$ in the first three places of the last square bracket yielding the equation

$$\begin{aligned} {\dot{\rho }}= & {} -i[A, \rho ] -\frac{1}{2} \{C^*C,\rho \} +B(\rho ) \phi _t\nonumber \\&-\frac{1}{2} [\rho (C^*)^2+C^2\rho +(\mathrm{{tr}} \,(\rho C^*+C\rho ))^2 \rho - \mathrm{{tr}}\,(B(\rho )C^*+CB(\rho ))\rho ]\qquad \quad \end{aligned}$$

(5.7)

(the key point is that the ’nasty’ term $C\rho C^*$ cancels). It is seen that Theorem 6 applies, because whenever $(v,\rho v)=0$, the r.h.s. $\omega _t(\rho )$ of equation (5.7) satisfies $(v,\omega _t(\rho ) v)=0$ for any function $\phi _t$. The details of the argument are the same as in the proof of Theorem 1. $\square $

Remark 5

Yet another way to prove the preservation of positivity can be carried out via the theory of boundary points. Namely, from Proposition 6.4.1 in [21] it follows that for any unit vector v the matrix $\rho $ of rank $n-1$ such that $(v,\rho v)=0$ belongs to the inaccessible boundary point for the domain $(v, \rho v)>0$. Hence for a dense countable set of unit vectors $\{v_j\}$ we can conclude that $(v_j,\rho _t v_j)>0$ for all j and t almost surely. Consequently $(v,\rho _t v)\ge 0$ for all v and t almost surely.

Remark 6

The methods developed can be used to extend this result to infinite dimensional ${\mathcal {H}}_0$. However, unlike the situation with counting observations, explained in Remark 2, there is some subtlety here in working with SDEs in the space of trace class operators, which we are not going to discuss in this paper.

A remarkable property of the SDEs (4.7) and (5.3) is that they preserve the pure states. Namely if the initial state $\rho $ was pure, $\rho =\psi \otimes {\bar{\psi }}$, then it remains pure for all times. Namely, one can check by a direct application of Ito’s formula that if $\phi $ satisfies the SDE

$$\begin{aligned} d\phi= & {} -[i(A-\langle Re \, C\rangle _{\phi } \, Im \, C)\nonumber \\&+\frac{1}{2}(C-\langle Re \, C\rangle _{\phi })^*(C-\langle Re \, C\rangle _{\phi })]\phi \, dt + (C-\langle Re \, C\rangle _{\phi })\phi \, dW_t, \end{aligned}$$

(5.8)

then $\rho =\psi \otimes {\bar{\psi }}$ satisfies equation (5.3). Equation (5.8) is the Belavkin quantum filtering equation for pure states. It looks much simpler for the most important case of self-adjoint C:

$$\begin{aligned} d\phi =-[iA+\frac{1}{2}(C-\langle C\rangle _{\phi })^2]\phi \, dt + (C-\langle C\rangle _{\phi })\phi \, dW_t. \end{aligned}$$

(5.9)

Another key observation is that there exists an equivalent linear version of (5.3). Namely assume that $\xi $ solves the following Belavkin quantum filtering SDE for non-normalized states:

$$\begin{aligned} d\xi =(-i[A, \xi ] -\frac{1}{2} \{C^*C,\xi \}+ C\xi C^* ) \, dt +(\xi C^* + C\xi ) \, dY_t, \end{aligned}$$

(5.10)

where $Y_t$ is a Brownian motion under a certain measure. Applying Ito’s formula to $\rho =\xi /\mathrm{{tr}} \, \xi $ one finds that $\rho $ satisfies (5.3) with the process W satisfying the equation

$$\begin{aligned} dW_t=dY_t-\mathrm{{tr}} \,(\xi C^* + C\xi ) \, dt. \end{aligned}$$

(5.11)

It follows from the famous Girsanov formula that if $Y_t$ was a Wiener process, then $W_t$ would be also a Wiener process under some different but equivalent measure with respect to one defining $Y_t$. Hence a solution $\xi _t$ to the linear equation (5.10) with some Brownian motion $Y_t$ yields the solution $\rho =\xi /\mathrm{{tr}} \, \xi $ to (5.3) with some other Brownian motion $W_t$.

6 Observations via different channels

Let us now extend the theory to the case of several channels of observation. Namely, we take

$$\begin{aligned} {\mathcal {H}}={\mathcal {H}}_0\otimes {\mathbf {C}}^2 \otimes \cdots \otimes {\mathbf {C}}^2, \quad (K \ \text {multipliers}\ {\mathbf {C}}^2), \end{aligned}$$

(6.1)

and the atom (system with Hilbert space ${\mathcal {H}}_0$) is supposed to interact with each of the K measuring devices with the state space ${\mathbf {C}}^2$. Each of the devises is equipped with the standard basis $(e_0^j,e_1^j)$ with $e_0^j$ chosen as a vacuum vector, that is as its stationary state, with the corresponding density matrix being $\varOmega _j= |e_0^j\rangle \langle e_0^j|$. The Hamiltonian is given by the sum $H=H_0+\sum _{k=1}^KH_k$, where $H_0=A\otimes I^{\otimes k}$ describes the free dynamics of the atom, and $H_j$ connects the atom with the jth device. The same scaling $1/\sqrt{t}$ applies to the interaction parts.

Thus H is specified by $k+1$ operators $A,C_1, \cdots , C_K$ in ${\mathcal {H}}_0$, so that $H_j$ are give by the formulas:

$$\begin{aligned} \begin{aligned}&H_0 (h\otimes e_{i_1}^1 \otimes \cdots \otimes e_{i_K}^K)=A h \otimes e^1_{i_1} \otimes \cdots \otimes e_{i_K}^K, \\&H_j (h\otimes e_{i_1}^1 \otimes \cdots \otimes e_{i_K}^K)|_{e_{i_j}^j=e_1^j} =-\frac{i}{\sqrt{t}} C_j^* h\otimes e_{i_1}^1 \otimes \cdots \otimes e_{i_K}^K)|_{e^j_{i_j}=e_0^j}, \quad j>0, \\&H_j (h\otimes e_{i_1}^1 \otimes \cdots \otimes e_{i_K}^K)|_{e_{i_j}^j=e_0^j} = \frac{i}{\sqrt{t}} C_j (h\otimes e_{i_1}^1 \otimes \cdots \otimes e_{i_K}^K)|_{e_{i_j}^j=e_1^j}, \quad j>0. \end{aligned}\nonumber \\ \end{aligned}$$

(6.2)

At a starting time of an interaction the devices are supposed to be set to their vacuum states, so that a state $\rho $ on ${\mathcal {H}}_0={\mathbf {C}}^n$ lifts to ${\mathcal {H}}$ as

$$\begin{aligned} \rho _{{\mathcal {H}}}=\rho \otimes \varOmega _1 \otimes \cdots \otimes \varOmega _K. \end{aligned}$$

The observation procedure can be specified by choosing two orthogonal projectors $P_0^j$ and $P_1^j$ in the space ${\mathbf {C}}^2$ of each device (that is in each channel of observation) arising from some observables with the spectral decompositions $\sum _l \lambda _l P_l^j$. This choice yields the totality of $2^K$ orthogonal projectors in ${\mathcal {H}}$,

$$\begin{aligned} I\otimes P_{i_1}^1 \otimes \cdots \otimes P_{i_K}^K, \end{aligned}$$

so that the possible new non-normalized states after each step of interaction and measurement are

$$\begin{aligned} {\tilde{\rho }}_t^{i_1 \cdots i_K} =\mathrm{{tr}}_{p1\cdots K} [(I\otimes P_{i_1}^1 \otimes \cdots \otimes P_{i_K}^K) e^{-itH} \rho _{{\mathcal {H}}} e^{itH} (I\otimes P_{i_1}^1 \otimes \cdots \otimes P_{i_K}^K)],\nonumber \\ \end{aligned}$$

(6.3)

where

$$\begin{aligned} \gamma _t= e^{-itH} \rho _{{\mathcal {H}}} e^{itH} = e^{-itH}(\rho \otimes \varOmega _1 \otimes \cdots \otimes \varOmega _K) e^{itH}, \end{aligned}$$

(6.4)

and $\mathrm{{tr}}_{p1\cdots K}$ is the partial trace with respect to all spaces, but for ${\mathcal {H}}_0$. These states may occur with the probabilities

$$\begin{aligned} p_{i_1 \cdots i_K}(t)=\mathrm{{tr}} \, [\gamma _t (I\otimes P_{i_1} \otimes \cdots \otimes P_{i_K})] =\mathrm{{tr}} {\tilde{\rho }}_t^{i_1 \cdots i_K}. \end{aligned}$$

(6.5)

Therefore the multichannel extension of the discrete time Markov chain of successive indirect observations given by (3.3) and (3.4) is given by $2^K$ possible transitions of $\rho $ to the states

$$\begin{aligned} \rho _t^{i_1 \cdots i_K}= \frac{1}{p_{i_1 \cdots i_K}} \mathrm{{tr}}_{p1\cdots K} [(I\otimes P_{i_1} \otimes \cdots \otimes P_{i_K}) \gamma _t (I\otimes P_{i_1} \otimes \cdots \otimes P_{i_K})], \end{aligned}$$

(6.6)

where $\gamma _t$ and the probabilities $p_{i_1 \cdots i_K}$ are given by (6.4) and (6.5). The transition operator of this Markov chain writes down as

$$\begin{aligned} U_t f(\rho )={\mathbf {E}}f(\rho _t)=\sum _{i_1 \cdots i_K} p_{i_1 \cdots i_K}(t) f(\rho _t^{i_1 \cdots i_K}). \end{aligned}$$

(6.7)

The operators in ${\mathcal {H}}$ are best described in terms of blocks. Namely, writing ${\mathcal {H}}=\oplus {\mathcal {H}}_{i_1 \cdots i_K}$, with ${\mathcal {H}}_{i_1 \cdots i_K}$ generated by ${\mathcal {H}}_0\otimes e_{i_1} \otimes \cdots \otimes e_{i_K}$, we can represent an operator ${\mathcal {L}}$ in ${\mathcal {H}}$ by $4^K$ operators $L_{i_1 \cdots i_K}^{j_1 \cdots j_K}$ in ${\mathcal {H}}$, so that

$$\begin{aligned} {\mathcal {L}}( h^{i_1 \cdots i_K}\otimes e_{i_1} \otimes \cdots \otimes e_{i_K}) =\sum _{j_1 \cdots j_K} L_{i_1 \cdots i_K}^{j_1 \cdots j_K} h^{i_1 \cdots i_K} \otimes e_{j_1} \otimes \cdots \otimes e_{j_K}. \end{aligned}$$

The composition and partial trace in this notations are expressed by the following formulas:

$$\begin{aligned} ({\mathcal {L}}_1 {\mathcal {L}}_2)_{i_1 \cdots i_K}^{j_1 \cdots j_K}= & {} \sum _{m_1 \cdots m_K} ({\mathcal {L}}_1)_{m_1 \cdots m_K}^{j_1 \cdots j_K} ({\mathcal {L}}_2)^{m_1 \cdots m_K}_{j_1 \cdots j_K}, \end{aligned}$$

(6.8)

$$\begin{aligned} \mathrm{{tr}}_{p1\cdots K} {\mathcal {L}}= & {} \sum _{j_1 \cdots j_K} L_{j_1 \cdots j_K}^{j_1 \cdots j_K}. \end{aligned}$$

(6.9)

For simplicity let us perform detailed calculations for $K=2$ (they are quite similar in the general case). Thus ${\mathcal {H}}={\mathbf {C}}^n\otimes {\mathbf {C}}^2 \otimes {\mathbf {C}}^2$ and $H=H_0+H_1+H_2$. Let us denote the bases of the two devices $\{e_k\}$ and $\{f_k\}$ respectively. Formulas (6.2) rewrite in a simpler way as

$$\begin{aligned} H_0 (h\otimes e_k \otimes f_j)= & {} A h \otimes e_k \otimes f_j,\\ H_1 (h\otimes e_1 \otimes f_j)= & {} -iC_1^* h \otimes e_0 \otimes f_j/ \sqrt{t}, \\ H_1 (h\otimes e_0 \otimes f_j)= & {} iC_1 h \otimes e_1 \otimes f_j \sqrt{t},\\ H_2 (h\otimes e_j \otimes f_1)= & {} -iC_2^* h \otimes e_j \otimes f_0 /\sqrt{t}, \\ H_2 (h\otimes e_j \otimes f_0)= & {} iC_2 h \otimes e_j \otimes f_1 /\sqrt{t}, \end{aligned}$$

With the chosen vacuum vectors $e_0=(1,0)$ in the first device and $f_0=(1,0)$ in the second device, a state $\rho $ on ${\mathcal {H}}_0={\mathbf {C}}^n$ lifts to ${\mathcal {H}}$ as

$$\begin{aligned} \rho _{{\mathcal {H}}}=\rho \otimes |e_0\rangle \langle e_0| \otimes |f_0\rangle \langle f_0|. \end{aligned}$$

The operators ${\mathcal {L}}$ in ${\mathcal {H}}$ are described by 16 operators $L_{jk}^{lm}$ in ${\mathcal {H}}$. To shorten the formulas, let us perform calculations without scaling $C_j$ (without the factor $1/\sqrt{t}$) and will restore the scaling at the end. In term of the blocks we can write:

$$\begin{aligned} (\rho _{{\mathcal {H}}})^{ml}_{jk}= & {} \delta ^m_0 \delta ^l_0 \delta ^0_j \delta ^0_k \rho .\\ (H_1)^{ml}_{jk}= & {} i\delta ^l_k \delta ^m_{{\bar{j}}}(C_1 \delta ^0_j- C_1^* \delta ^1_j), \quad (H_2)^{ml}_{jk}=i\delta ^m_j \delta ^l_{{\bar{k}}}(C_2 \delta ^0_k- C_2^* \delta ^1_k), \end{aligned}$$

where we have introduced the following notations: for i being 0 or 1 we denote ${\bar{i}}$ as being 1 and 0 respectively.

By (6.8) it follows that

$$\begin{aligned} {[}H_1, \rho _{{\mathcal {H}}}]^{ml}_{jk}= & {} i\sum \delta ^l_q \delta ^m_{{\bar{p}}}(C_1 \delta ^0_p- C_1^* \delta ^1_p)\, \delta ^p_0 \delta ^q_0 \delta ^0_j \delta ^0_k \rho \\&-i\sum \delta ^m_0 \delta ^l_0 \delta ^0_p \delta ^0_q \rho \, \delta ^q_k \delta ^p_{{\bar{j}}}(C_1 \delta ^0_j- C_1^* \delta ^1_j)\\= & {} i \delta ^0_k \delta ^l_0 (\delta ^0_j \delta ^m_1 C_1\rho +\delta ^1_j \delta ^m_0 \rho C_1^*) =i \delta ^0_k \delta ^l_0 \delta ^m_{{\bar{j}}} (\delta ^0_j C_1\rho +\delta ^1_j \rho C_1^*). \end{aligned}$$

$$\begin{aligned} (H_1^2)^{ml}_{jk}= & {} \sum (H_1)^{ml}_{pq} (H_1)^{pq}_{jk}\\= & {} -\sum \delta ^l_q \delta ^m_{{\bar{p}}}(C_1 \delta ^0_p- C_1^* \delta ^1_p) \delta ^q_k \delta ^p_{{\bar{j}}}(C_1 \delta ^0_j- C_1^* \delta ^1_j)\\= & {} -\delta ^l_k \delta ^m_j (C_1\delta ^1_j-C_1^*\delta _j^0)(C_1\delta ^0_j-C_1^* \delta ^1_j)\\= & {} \delta ^l_k \delta ^m_j ( \delta ^1_j C_1C_1^*+ \delta ^0_jC_1^* C_1),\\ (H_2^2)^{ml}_{jk}=\sum (H_2)^{ml}_{pq} (H_2)^{pq}_{jk}= & {} -\delta ^m_p \delta ^l_{{\bar{q}}}(C_2 \delta ^0_q- C_2^* \delta ^1_q) \delta ^p_j \delta ^q_{{\bar{k}}}(C_2 \delta ^0_k- C_2^* \delta ^1_k)\\= & {} -\delta ^m_j \delta ^l_k (C_2 \delta ^1_k- C_2^* \delta _k^0)(C_2 \delta ^0_k- C_2^* \delta ^1_k)\\= & {} \delta ^m_j \delta ^l_k (\delta ^1_k C_2 C_2^*+ \delta ^0_kC_2^* C_2),\\ (H_1 H_2)^{ml}_{jk}= & {} \sum (H_1)^{ml}_{pq} (H_2)^{pq}_{jk}\\= & {} -\sum \delta ^l_q \delta ^m_{{\bar{p}}}(C_1 \delta ^0_p- C_1^* \delta ^1_p) \delta ^p_j \delta ^q_{{\bar{k}}}(C_2 \delta ^0_k- C_2^* \delta ^1_k)\\= & {} -\delta ^l_{{\bar{k}}} \delta ^m_{{\bar{j}}}(C_1 \delta ^0_j- C_1^* \delta ^1_j)(C_2 \delta ^0_k- C_2^* \delta ^1_k),\\ (H_2 H_1)^{ml}_{jk}= & {} \sum (H_2)^{ml}_{pq} (H_1)^{pq}_{jk}\\= & {} -\delta ^m_p \delta ^l_{{\bar{q}}}(C_2 \delta ^0_q- C_2^* \delta ^1_q) \delta ^q_k \delta ^p_{{\bar{j}}}(C_1 \delta ^0_j- C_1^* \delta ^1_j)\\= & {} -\delta ^l_{{\bar{k}}} \delta ^m_{{\bar{j}}}(C_2 \delta ^0_k- C_2^* \delta ^1_k)(C_1 \delta ^0_j- C_1^* \delta ^1_j), \end{aligned}$$

and

$$\begin{aligned} (H_1\rho _{{\mathcal {H}}}H_1)^{ml}_{jk}= & {} (H_1\rho _{{\mathcal {H}}})^{ml}_{pq} (H_1)^{pq}_{jk} =-\delta ^0_q \delta ^l_0 \delta ^m_{{\bar{p}}} \delta ^0_p C_1\rho \, \delta ^q_k \delta ^p_{{\bar{j}}}(C_1 \delta ^0_j- C_1^* \delta ^1_j)\\= & {} \delta ^0_k \delta ^l_0 \delta ^1_j \delta ^m_1 C_1\rho C_1^*,\\ (H_2\rho _{{\mathcal {H}}}H_2)^{ml}_{jk}= & {} (H_2\rho _{{\mathcal {H}}})^{ml}_{pq} (H_2)^{pq}_{jk} =-\delta ^m_0\delta ^0_p \delta ^l_{{\bar{q}}}\delta ^0_q C_2\rho \, \delta ^p_j \delta ^q_{{\bar{k}}}(C_2 \delta ^0_k- C_2^* \delta ^1_k)\\= & {} \delta ^1_k \delta ^l_1 \delta ^0_j \delta ^m_0 C_2\rho C_2^*, \\ (H_1\rho _{{\mathcal {H}}}H_2)^{ml}_{jk}= & {} (H_1\rho _{{\mathcal {H}}})^{ml}_{pq} (H_2)^{pq}_{jk} =-\delta ^0_q \delta ^l_0 \delta ^m_{{\bar{p}}} \delta ^0_p C_1\rho \, \delta ^p_j \delta ^q_{{\bar{k}}}(C_2 \delta ^0_k- C_2^* \delta ^1_k) \\= & {} \delta ^1_0 \delta ^m_1 \delta ^0_j \delta ^1_k C_1\rho C_2^*, \\ (H_2\rho _{{\mathcal {H}}}H_1)^{ml}_{jk}= & {} (H_2\rho _{{\mathcal {H}}})^{ml}_{pq} (H_1)^{pq}_{jk} =-\delta ^m_0\delta ^0_p \delta ^l_{{\bar{q}}}\delta ^0_q C_2\rho \, \delta ^q_k \delta ^p_{{\bar{j}}}(C_1 \delta ^0_j- C_1^* \delta ^1_j) \\= & {} \delta ^1_1 \delta ^m_0 \delta ^1_j \delta ^0_k C_2\rho C_1^*. \end{aligned}$$

Therefore

$$\begin{aligned} (H_1+H_2)\rho _{{\mathcal {H}}}(H_1+H_2)^{ml}_{jk}= & {} \delta ^0_k \delta ^l_0 \delta ^1_j \delta ^m_1 C_1\rho C_1^* +\delta ^1_k \delta ^l_1 \delta ^0_j \delta ^m_0 C_2\rho C_2^* \\&+\delta ^1_0 \delta ^m_1 \delta ^0_j \delta ^1_k C_1\rho C_2^* + \delta ^1_1 \delta ^m_0 \delta ^1_j \delta ^0_k C_2\rho C_1^*. \end{aligned}$$

Next,

$$\begin{aligned} \{H_1^2, \rho _{{\mathcal {H}}}\}^{ml}_{jk}= & {} (H_1^2)^{ml}_{pq}(\rho _{{\mathcal {H}}})^{pq}_{jk} +(\rho _{{\mathcal {H}}})^{ml}_{pq}(H_1^2)^{pq}_{jk}\\= & {} \delta ^l_q \delta ^m_p (\delta ^1_p C_1C_1^* +\delta ^0_p C_1^*C_1) \, \delta ^p_0 \delta ^q_0 \delta ^0_j \delta ^0_k \rho \\&+\delta ^m_0 \delta ^l_0 \delta ^0_p \delta ^0_q \rho \, \delta ^q_k \delta ^p_j (\delta ^1_j C_1C_1^* +\delta ^0_j C_1^*C_1)\\= & {} \delta ^m_0 \delta ^l_0 \delta ^0_j \delta ^0_k\{C_1^* C_1, \rho \},\\ \{H_2^2, \rho _{{\mathcal {H}}}\}^{ml}_{jk}= & {} (H_2^2)^{ml}_{pq}(\rho _{{\mathcal {H}}})^{pq}_{jk} +(\rho _{{\mathcal {H}}})^{ml}_{pq}(H_2^2)^{pq}_{jk}\\= & {} \delta ^m_p \delta ^l_q (\delta ^1_q C_2 C_2^*+ \delta ^0_qC_2^* C_2)\, \delta ^p_0 \delta ^q_0 \delta ^0_j \delta ^0_k \rho \\&+\delta ^m_0 \delta ^l_0 \delta ^0_p \delta ^0_q \rho \, \delta ^p_j \delta ^q_k (\delta ^1_k C_2 C_2^*+ \delta ^0_kC_2^* C_2)\\= & {} \delta ^m_0 \delta ^l_0 \delta ^0_j \delta ^0_k\{C_2^* C_2, \rho \}, \end{aligned}$$

and

$$\begin{aligned} \{H_1 H_2, \rho _{{\mathcal {H}}}\}^{ml}_{jk}= & {} (H_1H_2)^{ml}_{pq}(\rho _{{\mathcal {H}}})^{pq}_{jk} +(\rho _{{\mathcal {H}}})^{ml}_{pq}(H_1H_2)^{pq}_{jk}\\= & {} -\delta ^l_{{\bar{q}}} \delta ^m_{{\bar{p}}}(C_1 \delta ^0_p- C_1^* \delta ^1_p)(C_2 \delta ^0_q- C_2^* \delta ^1_q) \delta ^p_0 \delta ^q_0 \delta ^0_j \delta ^0_k \rho \\&-\delta ^m_0 \delta ^l_0 \delta ^0_p \delta ^0_q \rho \delta ^q_{{\bar{k}}} \delta ^p_{{\bar{j}}}(C_1 \delta ^0_j- C_1^* \delta ^1_j)(C_2 \delta ^0_k- C_2^* \delta ^1_k) \\= & {} -\delta ^l_1 \delta ^m_1 \delta ^0_j \delta ^0_k C_1C_2\rho - \delta ^l_0 \delta ^m_0 \delta ^1_j \delta ^1_k \rho C_1^*C_2^*,\\ \{H_2 H_1, \rho _{{\mathcal {H}}}\}^{ml}_{jk}= & {} (H_2H_1)^{ml}_{pq}(\rho _{{\mathcal {H}}})^{pq}_{jk} +(\rho _{{\mathcal {H}}})^{ml}_{pq}(H_2H_1)^{pq}_{jk} \\&-\delta ^l_{{\bar{q}}} \delta ^m_{{\bar{p}}}(C_2 \delta ^0_p- C_2^* \delta ^1_q)(C_1 \delta ^0_p- C_1^* \delta ^1_q) \delta ^p_0 \delta ^q_0 \delta ^0_j \delta ^0_k \rho \\&-\delta ^m_0 \delta ^l_0 \delta ^0_p \delta ^0_q \rho \delta ^q_{{\bar{k}}} \delta ^p_{{\bar{j}}}(C_2 \delta ^0_k- C_2^* \delta ^1_k)(C_1 \delta ^0_j- C_1^* \delta ^1_j) \\= & {} - \delta ^l_1 \delta ^m_1 \delta ^0_j \delta ^0_k C_2C_1 \rho -\delta ^l_0 \delta ^m_0 \delta ^1_j \delta ^1_k\rho C_2^*C_1^*. \end{aligned}$$

Thus,

$$\begin{aligned} \{(H_1+H_2)^2,\rho _{{\mathcal {H}}}\}^{ml}_{jk}= & {} \{H_1^2+H_2^2+H_1H_2+H_2H_1, \rho _{{\mathcal {H}}}\}^{ml}_{jk}\\= & {} \delta ^m_0 \delta ^l_0 \delta ^0_j \delta ^0_k\{C_1^*C_1 +C_2^*C_2, \rho \}\\&-\delta ^l_1 \delta ^m_1 \delta ^0_j \delta ^0_k \{C_1,C_2\}\rho -\delta ^l_0 \delta ^m_0 \delta ^1_j \delta ^1_k\rho \{C_1^*, C_2^*\}. \end{aligned}$$

Thus all parts of (2.5) are collected.

Let us turn to (6.3). From the calculations with a single channel we know that one has to distinguish diagonal and non-diagonal projectors $P^j_k$. Let us start with the case, when in both devises the projectors are diagonal, that is

$$\begin{aligned} P_0^1=P_0^2=\begin{pmatrix} 1 &{} 0 \\ 0 &{} 0 \end{pmatrix}, \quad P_1^1=P_1^2=\begin{pmatrix} 0 &{} 0 \\ 0 &{} 1 \end{pmatrix}. \end{aligned}$$

Let us calculate

$$\begin{aligned} (I\otimes P_j^1 \otimes P_k^2) {\mathcal {L}}(I\otimes P_j^1 \otimes P_k^2) \end{aligned}$$

for arbitrary ${\mathcal {L}}$.

We have

$$\begin{aligned}&(I\otimes P_i^1 \otimes P_r^2) \sum h^{jk} \otimes e_j\otimes f_k=h^{ir},\\&(I\otimes P_i^1 \otimes P_r^2)^{ml}_{jk}=\delta ^i_j \delta ^r_k \delta ^m_i \delta ^1_r. \end{aligned}$$

So

$$\begin{aligned} ((I\otimes P_i^1 \otimes P_r^2) {\mathcal {L}})^{ml}_{jk} =(I\otimes P_i^1 \otimes P_r^2)^{ml}_{pq} {\mathcal {L}}^{pq}_{jk} =\delta ^i_p \delta ^r_q \delta ^m_i \delta ^1_r {\mathcal {L}}^{pq}_{jk} = \delta ^m_i \delta ^1_r {\mathcal {L}}^{ir}_{jk} \end{aligned}$$

and

$$\begin{aligned} ((I\otimes P_i^1 \otimes P_r^2) {\mathcal {L}}(I\otimes P_i^1 \otimes P_r^2))^{ml}_{jk}= & {} ((I\otimes P_i^1 \otimes P_r^2) {\mathcal {L}})^{ml}_{pq} (I\otimes P_i^1 \otimes P_r^2)^{pq}_{jk}\\= & {} \delta ^m_i \delta ^1_r {\mathcal {L}}^{ir}_{pq} \delta ^i_j \delta ^r_k \delta ^p_i \delta ^q_r =\delta ^m_i \delta ^1_r \delta ^i_j \delta ^r_k {\mathcal {L}}^{ir}_{ir}. \end{aligned}$$

Thus

$$\begin{aligned} \mathrm{{tr}}_{p12} ((I\otimes P_i^1 \otimes P_r^2) {\mathcal {L}}(I\otimes P_i^1 \otimes P_r^2)) ={\mathcal {L}}^{ir}_{ir}, \end{aligned}$$

and

$$\begin{aligned} {\tilde{\rho }}_{ir}=(e^{-itH} \rho e^{itH})^{ir}_{ir}, \quad p_{ir} =\mathrm{{tr}} (e^{-itH} \rho e^{itH})^{ir}_{ir}. \end{aligned}$$

Thus we have

$$\begin{aligned}&{[}H_1+H_2, \rho _{{\mathcal {H}}}]^{jk}_{jk}=0,\\&(H_1+H_2)\rho _{{\mathcal {H}}}(H_1+H_2)^{jk}_{jk}=\delta ^0_k \delta ^1_j C_1\rho C_1^* +\delta ^1_k \delta ^0_j C_2\rho C_2^*,\\&\{H_1^2+H_2^2+H_1H_2+H_2H_1, \rho _{{\mathcal {H}}}\}^{jk}_{jk} = \delta ^0_j \delta ^0_k\{C_1^*C_1 +C_2^*C_2, \rho \}. \end{aligned}$$

Restoring scaling $C \rightarrow C/\sqrt{t}$ yields approximately

$$\begin{aligned} (e^{-itH} \rho _{{\mathcal {H}}} e^{itH})^{jk}_{jk}= & {} (\rho _{{\mathcal {H}}}-it [H,\rho _{{\mathcal {H}}}]+t^2 (H\rho _{{\mathcal {H}}} H-\frac{1}{2} \{H^2,\rho _{{\mathcal {H}}}\}))^{jk}_{jk}\\= & {} \delta ^0_j \delta ^0_k (\rho -it[A,\rho ])+t \left[ \delta ^0_k \delta ^1_j C_1\rho C_1^*\right. \\&\left. +\delta ^1_k \delta ^0_j C_2\rho C_2^*-\frac{1}{2} \delta ^0_j \delta ^0_k\{C_1^*C_1 +C_2^*C_2, \rho \}\right] \end{aligned}$$

and thus

$$\begin{aligned} {\tilde{\rho }}_{jk}= & {} \delta ^0_j \delta ^0_k (\rho -it[A,\rho ])+t [\delta ^0_k \delta ^1_j C_1\rho C_1^* +\delta ^1_k \delta ^0_j C_2\rho C_2^*\\&-\frac{1}{2} \delta ^0_j \delta ^0_k\{C_1^*C_1 +C_2^*C_2, \rho \}],\\ p_{jk}= & {} \delta ^0_j \delta ^0_k +t [\delta ^0_k \delta ^1_j \mathrm{{tr}} (C_1\rho C_1^*) +\delta ^1_k \delta ^0_j \mathrm{{tr}} (C_2\rho C_2^*)-\delta ^0_j \delta ^0_k \mathrm{{tr}}((C_1^*C_1 +C_2^*C_2) \rho )]. \end{aligned}$$

Thus $p_{11}=0$,

$$\begin{aligned} \rho _{00}= & {} \frac{{\tilde{\rho }}_{00}}{p_{00}} =(\rho -it[A,\rho ] -\frac{1}{2} t\{C_1^*C_1 +C_2^*C_2, \rho \})(1+ t \,\mathrm{{tr}}((C_1^*C_1 +C_2^*C_2) \rho )),\\= & {} \rho -it[A,\rho ]-\frac{1}{2} t\{C_1^*C_1 +C_2^*C_2, \rho \}+t \, \mathrm{{tr}}((C_1^*C_1 +C_2^*C_2) \rho ) \rho ,\\ \rho _{10}= & {} \frac{{\tilde{\rho }}_{10}}{p_{10}}=\frac{C_1\rho C_1^*}{\mathrm{{tr}} (C_1\rho C_1^*)}, \quad \rho _{01}=\frac{{\tilde{\rho }}_{01}}{p_{01}}=\frac{C_2\rho C_2^*}{\mathrm{{tr}} (C_2\rho C_2^*)}. \end{aligned}$$

Thus we get, up to terms of order h in small h, that

$$\begin{aligned}&\frac{U_h-1}{h} f(\rho )\\&\quad =\frac{1}{h}\sum _{jk} p_{jk} \left[ f(\rho _{jk})-f(\rho )\right] \\&\quad =\frac{1}{h} p_{00}[f(\rho -ih[A,\rho ]-\frac{1}{2} h\{C_1^*C_1 +C_2^*C_2, \rho \}\\&\qquad +h \, \mathrm{{tr}}((C_1^*C_1 +C_2^*C_2) \rho ) \rho )-f(\rho )]\\&\qquad +\frac{1}{h} p_{10} \left[ f\left( \frac{C_1\rho C_1^*}{\mathrm{{tr}} (C_1\rho C_1^*)}\right) -f(\rho )\right] +\frac{1}{h} p_{01} \left[ f\left( \frac{C_2\rho C_2^*}{\mathrm{{tr}} (C_2\rho C_2^*)}\right) -f(\rho )\right] \\&\quad = \left( f'(\rho ), -\frac{1}{2} \{C_1^*C_1, \rho \}+ \mathrm{{tr}}(C_1 \rho C_1^*) \rho -\frac{1}{2} \{C_2^*C_2, \rho \}+ \mathrm{{tr}}(C_2 \rho C_2^*) \rho \right) \\&\qquad + \mathrm{{tr}} (C_1\rho C_1^*)\left[ f\left( \frac{C_1\rho C_1^*}{\mathrm{{tr}} (C_1\rho C_1^*)}\right) -f(\rho )\right] \\&\qquad +\mathrm{{tr}} (C_2\rho C_2^*) \left[ f\left( \frac{C_2\rho C_2^*}{\mathrm{{tr}} (C_2\rho C_2^*)}\right) -f(\rho )\right] . \end{aligned}$$

Summarising and extending to arbitrary number of channels k we can conclude that we proved the following extension of Lemma 1.

Lemma 3

Under the setting considered,

$$\begin{aligned} \Vert \frac{U_h-1}{h} f -L_{count}f\Vert \le \sqrt{h} \varkappa \Vert f\Vert _{C^2(S({\mathcal {H}}_0))} \end{aligned}$$

(6.10)

for $f\in C^2(S({\mathcal {H}}_0))$, with $L_{count}$ given by

$$\begin{aligned} L_{count}f(\rho )= & {} -i[A, \rho ] \, dt + \sum _{j=1}^K \left( f'(\rho ), -\frac{1}{2} \{C_j^*C_j, \rho \}+ \mathrm{{tr}}(C_j \rho C_j^*) \rho \right) \nonumber \\&+ \sum _{j=1}^K \,\mathrm{{tr}}\, (C_j\rho C_j^*)\left[ f\left( \frac{C_j\rho C_j^*}{\mathrm{{tr}} (C_j\rho C_j^*)}\right) -f(\rho )\right] . \end{aligned}$$

(6.11)

As a consequence we get the following direct extension of Theorem 1.

Theorem 3

Let ${\mathcal {H}}_0={\mathbf {C}}^n$ and $A,C_1, \cdots , C_K$ be operators in ${\mathcal {H}}_0$ with A being Hermitian. Let the projectors defining the measurements be chosen to be diagonal in each channel:

$$\begin{aligned} P_0^j=\begin{pmatrix} 1 &{} 0 \\ 0 &{} 0 \end{pmatrix}, \quad P_1^j=\begin{pmatrix} 0 &{} 0 \\ 0 &{} 1 \end{pmatrix} \end{aligned}$$

(6.12)

for all $j=1, \cdots , K$.

Then all statements of Theorem 1 hold for the operator (6.11) and Markov semigroups described by the transition operator (6.7). In particular, estimates (4.5) and (4.6) hold.

Remark 7

As explained in Remark 2 this result extends automatically to the case of arbitrary separable Hilbert space ${\mathcal {H}}$ and bounded operators $A,C_1,\cdots ,$ $C_K$ in it.

As in the case of a single channel, the process generated by (6.11) can be described by the solutions to the SDE of jump type, which takes now the form

$$\begin{aligned} d\rho= & {} - i[A, \rho ]\, dt +\sum _j \left( -\frac{1}{2} \{C^*_jC_j,\rho \}+ \mathrm{{tr}} (C_j\rho C^*_j) \rho \right) \, dt\nonumber \\&+\sum _j\left( \frac{C_j\rho C^*_j}{\mathrm{{tr}} (C_j\rho C^*_j)}-\rho \right) dN^j_t, \end{aligned}$$

(6.13)

with the counting processes $N_t^j$ are independent and have the position dependent intensities $\mathrm{{tr}} (C^*_jC_j\rho )$. Equation (6.13) is the Belavkin quantum filtering SDE corresponding to the counting type observation via several channels.

As suggested by Theorem 2, exploiting non diagonal pairs of projectors $P_0^j, P_1^j$ should lead to the limiting generator of diffusive type. In fact, performing similar calculations (which we omit) one arrives at the following general result.

Theorem 4

Let ${\mathcal {H}}_0={\mathbf {C}}^n$ and $A,C_1, \cdots , C_K$ be operators in ${\mathcal {H}}_0$ with A being Hermitian. Let the projectors defining the measurements are chosen to be diagonal, that is of type (6.12), for a subset $I\subset \{1, \cdots ,K\}$ of the set of channels. And for $j\notin I$ these channels are chosen as non-diagonal, that is of the form

$$\begin{aligned} P_0^j=\begin{pmatrix} \cos ^2 \phi _j &{} \sin \phi _j \cos \phi _j \\ \sin \phi _j \cos \phi _j &{} \sin ^2\phi \end{pmatrix}, \ \ P_1^j=\begin{pmatrix} \sin ^2 \phi _j &{} -\sin \phi _j \cos \phi _j \\ -\sin \phi _j \cos \phi _j &{} \cos ^2\phi _j \end{pmatrix}, \end{aligned}$$

(6.14)

with $\phi _j\ne k\pi /2$, $k\in {\mathbf {N}}$. Then the limiting generator for the semigroup with the transition operator (6.7) gets the expression

$$\begin{aligned} L_{mix}f(\rho )= & {} \sum _{j\in I} \left( f'(\rho ), -\frac{1}{2} \{C_j^*C_j, \rho \}+ \mathrm{{tr}}(C_j \rho C_j^*) \rho \right) \nonumber \\&+ \sum _{j\in I} \,\mathrm{{tr}}\, (C_j\rho C_j^*)\left[ f\left( \frac{C_j\rho C_j^*}{\mathrm{{tr}} (C_j\rho C_j^*)}\right) -f(\rho )\right] \nonumber \\&+\frac{1}{2} \sum _{j\notin I}[(\rho C_j^* + C_j\rho - \mathrm{{tr}} (\rho C^*_j\nonumber \\&+ C_j\rho ) \rho )f''(\rho ) (\rho C_j^* + C_j\rho - \mathrm{{tr}} (\rho C^*_j + C_j\rho ) \rho )]\nonumber \\&+\sum _{j\notin I} \left( f'(\rho ), -\frac{1}{2} \{C^*_jC_j,\rho \}+ C_j\rho C_j^* \right) -(f'(\rho ), i[A, \rho ]).\nonumber \\ \end{aligned}$$

(6.15)

This operator generates a Feller process $O_t^{\rho }$ in $S({\mathcal {H}}_0)$ and the corresponding Feller semigroup $T_t$ in $C(S({\mathcal {H}}_0))$ such that claims (ii) and (iii) of Theorem 2 hold. The Markov process generated by (6.15) can be given by the solutions of the following SDEs in $S({\mathcal {H}}_0)$:

$$\begin{aligned} d\rho&=-i[A, \rho ] \, dt + \sum _{j\in I} \left( -\frac{1}{2} \{C^*_jC_j,\rho \}+ \mathrm{{tr}} (C_j\rho C^*_j) \rho \right) \, dt\nonumber \\&\quad +\sum _{j\in I}\left( \frac{C_j\rho C^*_j}{\mathrm{{tr}} (C_j\rho C^*_j)}-\rho \right) dN^j_t\nonumber \\&\quad +\sum _{j\notin I}\left( -\frac{1}{2} \{C_j^*C_j,\rho \}+ C_j\rho C_j^* \right) \, dt\nonumber \\&\quad +\sum _{j\notin I}\left( \rho C_j^* + C_j\rho -\mathrm{{tr}}\, (\rho C_j^* + C_j\rho )\rho \right) \, dW^j_t, \end{aligned}$$

(6.16)

where $W_j$ are independent Wiener processes and $N_t^i$ independent jump process of intensity $\mathrm{{tr}} (C_j\rho C^*_j)$.

Proof

In the pure diffusive case, that is with empty I, the proof is exactly the same as in Theorem 2. For the general case one only has to show that operator $L_{mix}$ generates a Feller process in $S({\mathcal {H}}_0)$ preserving the sets of smooth functions (other arguments are again the same). Two proofs for proving this fact can be suggested. (i) One starts with generator ${\tilde{L}}_{mix}$ obtained from (6.15) by ignoring the jump part. This is a well-defined diffusion operator and by the same methods as in Theorem 2 one shows that it generates a Feller processes in $S({\mathcal {H}}_0)$. But the jump part of (6.15) is a bounded operator preserving positivity and smoothness. Hence it can be dealt with straightforwardly via the perturbation theory. (ii) Each of the two parts of (6.15), related to I and its complement, generates a well-defined Feller process in $S({\mathcal {H}}_0)$ preserving smoothness (of arbitrary order in fact). Hence one can derive that the sum of these operators generates a well-defined Feller process in $S({\mathcal {H}}_0)$ via the Lie-Trotter formula, namely from Theorem 5.3.1 of [21]. $\square $

Remark 8

The Markov chain of multichannel measurement that we are using is a bit different from the one used in [38], where measurement is based on a single operator R in the device (no different channels), and counting and diffusive parts of the generator arise from different projectors linked to different eigenspaces of this operator. As was already mentioned the method of [38] did not provide the rates of convergence.

When I is empty, $L_{mix}$ turns to $L_{dif}$ describing the multichannel observations of diffusive type.

7 Fractional quantum stochastic filtering

Now everything is ready for our main result: the derivation of the fractional equations of quantum stochastic filtering. As was shown above the standard Belavkin equations of quantum filtering can be obtained as the scaled limits of the sequences of discrete observations. The main assumption for each of the approximating processes was that the time between successive measurement is either constant (discrete Markov chain approximation) or is exponentially distributed (continuous time Markov chain approximation). Of course there is no a priori reasons for these assumptions. And in fact in several domains of physics it turned out to be more appropriate to model times between successive events by random variables from the domains of attraction of a stable law, that is via CTRW.

Our next result is a direct consequence of Theorem 4 and Proposition 5.

Theorem 5

Under the assumptions of Theorem 4 let the Markov chain (6.6) is modified in such a way that the laws of transitions $\rho \rightarrow \rho _t^{i_1 \cdots i_K}$ remain unchanged, by the time between transitions is taken as scaled random variable from the domain of attraction of a $\beta $-stable law, that is as $T_i^h=h^{1/\beta } T_i$ from Proposition 5. Then the corresponding generalized CTRW processes (12.3) built from the transition operator (6.7) converge to the process $O^{\rho }_{\sigma _t}$ obtained from the process $O^{\rho }_t$ of Theorem 4 via subordination by the inverse stable process $\sigma _t=\max \{y: S_y \le t\}$. Moreover, the functions $f_t(x)={\mathbf {E}}(T_{\sigma _t}f)(x)$ satisfy the fractional Caputo-Djerbashian equation (12.5) with the generator $L=L_{mix}$ given by (6.15).

As noted at the end of Appendix C, the fractional derivative $D^{\beta }_{0+\star }$ is a particular case of a class of mixed fractional derivatives (12.5). Therefore, under appropriately organised scaled times between the acts of measurements the limiting evolution will satisfy a more general fractional equation

$$\begin{aligned} D^{(\nu )}_{0+\star }f_t(x)=L_{mix}f_t(x), \quad f_0(x)=f(x), \end{aligned}$$

(7.1)

with $D^{\nu }$ given by (12.8).

When only one type of observation channels is used, equation (7.1) simplifies to the case, when either $L_{count}$ or $L_{dif}$ are places instead of $L_{mix}$.

Equations (7.1) (and their particular cases with fractional derivative $D^{\beta }$ of order $\beta $) represent the fractional analogs of the process of quantum stochastic filtering. These equations can be also considered as the new equations of fractional quantum mechanics. They are different from the fractional Schrödinger equations suggested in [31] and extensively studied recently.

Equations (7.1) describe the process of continuous quantum control and filtering on the level of the evolution of averages. On the ’micro-level’ of SDEs (6.16) these equations correspond to stopping the solutions of these SDEs at a random time $\sigma _t$ given by the inverse of a Lévy subordinator.

8 Fractional quantum control and games

The theory of quantum filtering reduces the analysis of quantum dynamic control and games to the controlled version of evolutions (6.16). The simplest situation concerns the case when the homodyne device is fixed, that is the operators $C_j$ and the projectors $P_i^j$ are fixed, and the players can control the individual Hamiltonian $H_0$ of the atom, say, by applying appropriate electric or magnetic fields to the atom. Thus equations (6.16) become modified by allowing $H_0$ to depend on one or several control parameters. The so-called separation principle states (see [11]) that the effective control of an observed quantum system (that can be based in principle on the whole history of the interaction of the atom and optical devices) can be reduced to the Markovian feedback control of the quantum filtering equation, with the feedback at each moment depending only on the current (filtered) state of the atom.

In the present case of CTRW modeling of the process of measurements the problem of control becomes the problem of control of scaled CTRW. The theory of such control was built in the series of papers [27,28,29]. The main result is that in the scaling limit the cost functions is a solution of the fractional Hamilton-Jacobi equation. In the present context and in game-theoretic setting it implies the following. Let us consider the controlled version of the process $O^{\rho }_{\sigma _t}$ from Theorem 5, where the individual Hamiltonian is now ${\tilde{H}}_0=H_0+uH_0^1+v H_0^2$ and it depends on control parameters u, v of two players from compact sets U and V respectively. Suppose that it is possible to choose new u, v directly after each act of measurement, and thus a control strategy is the sequence $(u_1, v_1), (u_2,v_2), \cdots )$ of controls applied after each act of measurement, with each $(u_j,v_j)$ applied after jth act of measurement and depending on the history of the process until that time. The case of a pure control (not a game) corresponds to the choice $V=0$ and is thus automatically included. Assume that players I and II play a standard dynamic zero-sum game with a finite time horizon T meaning that the objective of I is to maximize the payoff

$$\begin{aligned} P(t; u(.), v(.)) ={\mathbf {E}}\left[ \int _t^T \mathrm{{tr}} \, (J \rho _s) \, ds +\mathrm{{tr}} \, (F \rho _T)\right] , \end{aligned}$$

(8.1)

where J and F are some operators expressing the current and the terminal costs of the game (they may depend on u and v, but we exclude this case just for simplicity) and W is the collection of all noises involved in (6.16) (both diffusive and Poisson). Then under the scaling limit of Theorem 5 the optimal cost function

$$\begin{aligned} S_t(\rho ) =\max _{u(.)} \min _{v(.)}P(t; u(.), v(.))=\min _{v(.)} \max _{u(.)} P(t; u(.), v(.)) \end{aligned}$$

(8.2)

will satisfy the following fractional HJB-Isaacs equation of the CTRW modeling of quantum games:

$$\begin{aligned} D^{\nu }_{0+\star }S_t(\rho )= & {} \max _u (f'(\rho ), i[\rho , uH_0^1])\nonumber \\&+\min _v (f'(\rho ), i[\rho , vH_0^2]) +\mathrm{{tr}} \, (J \rho _t)+ L_{mix}S_t(\rho ). \end{aligned}$$

(8.3)

In [27] this equation was derived heuristically, in the general framework of controlled CTRW by the dynamic programming approach. As usual in optimal control theory, to justify the derivation one has to show the well-posedness of the limiting HJB equation and then to prove the verification theorem, a classical reference is [15]. For some cases of CTRWs this was performed in [29].

In the present fractional quantum case this problem will be considered elsewhere. The additional complexity of this equation is related to the fact that the state space is a rather nontrivial set of positive matrices with the unit trace. One can reduce the complexity by looking at the dynamics of pure states only. But the set of pure states is not a Euclidean space, but a manifold. In the finite-dimensional setting this manifold is the complex projective space ${\mathbf {C}}P^n$.

Let us mention that in the non-fractional case, that is with the usual derivative $\partial /\partial t$ instead of $D^{\nu }_{0+\star }$ in (8.3), the well-posedness of (the analogs of) equation (8.3) was proved in [18], for a special model of pumping a laser with a counting measurement, with some particular solutions calculated explicitly, and in [24], for a special arrangements of diffusive measuring devises that ensured that the diffusive part of operator $L_{dif}$ was nondegenerate and therefore the optimal control problem was reduced to the drift control of the diffusions on a Riemannian manifold ${\mathbf {C}}P^n$.

9 Other Markov approximations and unbounded generators

We commented above on the possible extension to infinite-dimensional Hilbert spaces. However, for all approximations the assumption of boundedness of all operators involved seemed to be essential in the derivation given, at least of the coupling operators $C_j$ (unboundedness of A can be possibly treated via the interaction representation). However, the quantum filtering equations are used also in the standard setting of quantum mechanics. The mostly studied case is that of the standard Hamiltonian $H=-\varDelta +V(x)$ in $L^2({\mathbf {R}}^d)$ and the coupling operators being either position (multiplication by x) or momentum operators. Different Markov chain approximations may be used to derive the filtering equation in this case.

A powerful approach was suggested by Belavkin in [9]: to use the von Neumann model of unsharp measurement. In this model the effect of measurement for the product state $\phi (x)f(y)$ of an atom and a measuring device, a pointer, is given by the shift

$$\begin{aligned} U: \phi (x) f(y) \mapsto \phi (x) f(y-ax). \end{aligned}$$

Here both $\phi $ and f are from $L^2({\mathbf {R}}^d)$, and $f>0$ describes the stationary state of a pointer (the analog of the vacuum state in our modeling above). Projecting on the state of an atom this yields the transition

$$\begin{aligned} G(y) :\quad \phi (x) \mapsto \phi _y(x)=\phi (x) f(y-ax)/f(y), \end{aligned}$$

(9.1)

depending on the observed position y of the pointer. Assuming the evolution of the atom during time t between the moments of measurements to be given by a Hamiltonian A, the transition of a Markov chain of sequential measurements become

$$\begin{aligned} \phi \mapsto \phi _{t,y}(x)=(e^{-iAt}\phi )(x) f(y-ax)/f(y). \end{aligned}$$

(9.2)

After an appropriate scaling from this Markov chain one derives the diffusive filtering SDE (5.9) with $C=x$ (the multiplication operator), that is directly the filtering equation for pure states, see detail in Appendix to [10]. The model can be extended to more general situations, but seems to be linked with a specific von Neumann instantaneous interaction. For the well-posedness of these kind of diffusive SDEs we can refer to [14, 17] and references therein.

The derivation of the fractional version of this equation, as well as the fractional control of Section 8 can be performed in this setting in the same way as above.

References

Attal, S., Pautrat, Y.: From repeated to continuous quantum interactions. Ann. Henri Poincaré 7, 59–104 (2006)
Article MathSciNet Google Scholar
Bain, A., Crisan, D.: Fundamentals of Stochastic Filtering. Ser. Stochastic Modelling and Applied Probability, 60, Springer, New York (2009)
Baleanu, D., Diethelm, K., Scalas, E., Trujillo, J.J.: Fractional Calculus. Models and Numerical Methods. 2nd Ed., Ser. on Complexity, Nonlinearity and Chaos, 5, World Scientific, Hackensack, NJ (2017)
Barchielli, A., Belavkin, V.P.: Measurements contunuous in time and a posteriori states in quantum mechanics. J. Phys A: Math. Gen. 24, 1495–1514 (1991)
Article Google Scholar
Barchielli, A., Gregoratti, M.: Quantum Trajectories and Measurements in Continuous Case. The Diffusive Case. Ser. Lecture Notes Physics, 782, Springer Verlag, Berlin (2009)
Belavkin, V.P.: Nondemolition measurement and control in quantum dynamical systems. In: Information Complexity and Control in Quantum Physics. CISM Courses and Lectures, 294 (Diner, S., Lochak, G., Eds.), 331–336, Springer-Verlag, Vienna (1987)
Belavkin, V.P.: Nondemolition stochastic calculus in Fock space and nonlinear filtering and control in quantum systems. In: Proc. XXIV Karpacz Winter School, Stochastic Methods in Mathematics and Physics (R. Guelerak, R., Karwowski, W., Eds.), 310–324, World Scientific, Singapore (1988)
Belavkin, V.P.: Quantum stochastic calculus and quantum nonlinear filtering. J. Multivar. Anal. 42, 171–201 (1992)
Article MathSciNet Google Scholar
Belavkin, V.P.: A dynamical theory of quantum measurement and spontaneous localization. Russian J. of Math. Phys. 3(1), 3–23 (1995)
MathSciNet MATH Google Scholar
Belavkin, V.P., Kolokoltsov, V.N.: Stochastic evolution as interaction representation of a boundary value problem for Dirac type equation. Infinite Dimensional Analysis, Quantum Probability and Related Fields 5(1), 61–92 (2002)
Article Google Scholar
Bouten, L., Van Handel, R.: On the separation principle of quantum control. arxiv:0511021v2 [math-ph] (2006)
Bouten, L., Van Handel, R., James, M.: An introduction to quantum filtering. SIAM J. Control Optim. 46(6), 2199–2241 (2007)
Article MathSciNet Google Scholar
Camilli, F., De Maio, R.: A time-fractional mean field game. Adv. Differential Equations 24(9–10), 531–554 (2019)
MathSciNet MATH Google Scholar
Fagnola, F., Mora, C.M.: Stochastic Schrödinger equations and applications to Ehrenfest-type theorems. ALEA Lat. Am. J. Probab. Math. Stat. 10(1), 191–223 (2013)
MathSciNet MATH Google Scholar
Fleming, W.H., Soner, H.M.: Controlled Markov Processes and Viscosity Solutions, 2nd edn. Sptinger (2006)
Gnedenko, B.V., Korolev, VYu.: Random Summation: Limit Theorems and Applications. CRC Press, Boca Raton, Florida (1996)
MATH Google Scholar
Holevo, A.S.: Statistical inference for quantum processes. In: Quanum Aspects of Optical Communications, 127–137, Springer LNP, Berlin 378 (1991)
Kolokoltsov, V.N.: The stochastic Bellman equation as a nonlinear equation in Maslov spaces. Perturbation theory. Dokl. Akad. Nauk 323(2), 223–228 (1992); Engl. transl. In: Sov. Math. Dokl. 45(2), 294–300 (1992)
Kolokoltsov, V.: Generalized Continuous-Time Random Walks (CTRW), Subordination by hitting times and fractional dynamics. Theory of Probabil. and its Appl. 53(4), 594–609 (2009)
Article Google Scholar
Kolokoltsov, V.N.: The Lévy-Khintchine type operators with variable Lipschitz continuous coefficients generate linear or nonlinear Markov processes and semigroups. Prob. Theory Related Fields 151, 95–123 (2011)
Article Google Scholar
Kolokoltsov, V.N.: Markov Processes, Semigroups and Generators. Studies in Math., De Gruyter (2011)
Kolokoltsov, V.N.: On fully mixed and multidimensional extensions of the Caputo and Riemann-Liouville derivatives, related Markov processes and fractional differential equations. Fract. Calc. Appl. Anal. 18(4), 1039–1073 (2015). https://doi.org/10.1515/fca-2015-0060
Article MathSciNet MATH Google Scholar
Kolokoltsov, V.N.: Differential Equations on Measures and Functional Spaces. Birkhäuser Advanced Texts, Birkhäuser (2019)
Kolokoltsov, V.N.: Dynamic quantum games, arxiv:2002.00271 (2020); Dynamic Games and Applications (Online first, Open Access) (2021). https://doi.org/10.1007/s13235-021-00389-w
Kolokoltsov, V.N.: Quantum mean field games. arxiv:2005.02350 (2020); To appear in: Adv. Appl. Probab
Kolokoltsov, V., Korolev, V., Uchaikin, V.: Fractional stable distributions. J. Math. Sci. (N.Y). 105(6), 2570–2577 (2001)
MathSciNet MATH Google Scholar
Kolokoltsov, V., Veretennikova, M.: Fractional Hamilton Jacobi Bellman equations for scaled limits of controlled Continuous Time Random Walks. Commun. in Appl. and Industr. Math. 6(1), e-484 (2014)
MathSciNet MATH Google Scholar
Kolokoltsov, V., Veretennikova, M.: Well-posedness and regularity of the Cauchy problem for nonlinear fractional in time and space equations. Fract. Diff. Calculus 4(1), 1–30 (2014)
MathSciNet MATH Google Scholar
Kolokoltsov, V.N., Veretennikova, M.: The fractional Hamilton-Jacobi-Bellman equation. J. of Appl. Nonlin. Dynamics 6(1), 45–56 (2017)
MathSciNet MATH Google Scholar
Lakshmikantham, V., Mutchell, R., Mitchell, R.W.: Differential equations in closed subsets of a Banach space. Trans. Amer. Math. Soc. 220, 103–113 (1976)
Article MathSciNet Google Scholar
Laskin, N.: Fractional Schrödinger equation. Phys. Rev. E 66, Art. 056108 (2002)
Martin, R.H., Jr.: Differential equations on closed subsets of a Banach space. Trans. Amer. Math. Soc. 179, 399–414 (1973)
Article MathSciNet Google Scholar
Meerschaert, M.M., Scheffler, H.-P.: Limit Theorems for Continuous-Time Random Walks with infinite mean waiting times. J. Appl. Prob. 41, 623–638 (2004)
Article MathSciNet Google Scholar
M.M. Meerschaert, H.-P. Scheffler, Limit Distributions for Sums of Independent Random Vectors. Wiley Ser. in Probability and Statistics, John Wiley and Sons (2001)
Meerschaert, M.M., Sikorskii, A.: Stochastic Models for Fractional Calculus. De Gruyter Studies in Mathematics, 43, NY (2012)
Metzler, R., Jeon, J.-H., Cherstvya, A.G., Barkai, E.: Anomalous diffusion models and their properties: non-stationarity, non-ergodicity, and ageing at the centenary of single particle tracking. Phys. Chem. Chem. Phys. 16, Art. 24128 (2014)
Montroll, E.W., Weiss, G.H.: Random walks on lattices. II. J. Math. Phys. 6, 167–181 (1965)
Article MathSciNet Google Scholar
Pellegrini, C.: Markov chains approximations of jump-diffusion stochastic master equations. Ann. Inst. H. Poincaré Probab. Statist. 46, 924–948 (2010)
Article MathSciNet Google Scholar
Pellegrini, C.: Poisson and diffusion approximation of stochastic Schrödinger equations with control. Ann. Henri Poincaré 10(5), 995–1025 (2009)
Article MathSciNet Google Scholar
Redheffer, R.M.: The theorems of Bony and Prezis on flow-invariant sets. The American Math. Monthly 79(7), 740–747 (1972)
Article Google Scholar
Saichev, A.I., Zaslavsky, G.M.: Fractional kinetic equations: solutions and applications. Chaos 7(4), 753–764 (1997)
Article MathSciNet Google Scholar
Sen, N., Caines, P.E.: Nonlinear filtering theory for McKean-Vlasov type stochastic differential equations. SIAM J. Control Optim. 54(1), 153–174 (2016)
Article MathSciNet Google Scholar
Uchaikin, V.V.: Fractional Derivatives for Physicists and Engineers, Vols. I and II. Ser. Nonlinear Physical Science. Higher Education Press, Beijing; Springer, Heidelberg (2013)
Umarov, S., Daum, F., Nelson, K.: Fractional generalizations of filtering problems and their associated fractional Zakai equations. Fract. Calc. Appl. Anal. 17(3), 745–764 (2014). https://doi.org/10.2478/s13540-014-0197-x
Article MathSciNet MATH Google Scholar
Wang, J., Zhou, Y., Wei, W.: Fractional Schrödinger equations with potential and optimal controls. Nonlinear Anal. Real World Appl. 13(6), 2755–2766 (2012)
Article MathSciNet Google Scholar
West, B.J.: Fractional Calculus View of Complexity. Tomorrow’s Science. CRC Press, Boca Raton, FL (2016)
Book Google Scholar
Wiseman, H.M., Milburn, G.J.: Quantum Measurement and Control. Cambridge Univesity Press (2010)

Download references

Acknowledgements

This research was funded by the Russian Science Foundation Project No. 20-11-20119.

Author information

Authors and Affiliations

Department of Statistics, University of Warwick, Coventry, CV4 7AL, UK
Vassili Kolokoltsov
Higher School of Economics, Moscow, Russia
Vassili Kolokoltsov
Petrozavodsk State University, Petrozavodsk, Russia
Vassili Kolokoltsov

Authors

Vassili Kolokoltsov
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Vassili Kolokoltsov.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Appendices

Appendix A. Convergence of semigroups

Here we collect the results on the convergence of Markov semigroups and CTRW, which form the the theoretical basis for our derivations of the filtering equations.

It is well known that the convergence of the generators on the core of the limiting generator implies the convergence of semigroups. We shall use a version of this result with the rates, namely the following result, given in Theorem 8.1.1 of [21].

Proposition 1

Let $F_t=e^{tL}$ be a strongly continuous semigroup in a Banach space B with a norm $\Vert .\Vert _B$, generate by an operator L, having a core D, which is itself a Banach space with a norm $\Vert .\Vert _D\ge \Vert .\Vert _B$ so that $L\in {\mathcal {L}}(D,B)$. Let $F_t$ be also a bounded semigroup in D such that $\Vert F_t\Vert _{D\rightarrow D} \le C_D(T)$ with a constant $C_D(T)$ uniformly for $t\in [0,T]$.

(i)
Let $F_t^h$, $h>0$, be a family of strongly continuous contraction semigroups in a Banach space B with bounded generators $L_h$ such that
$$\begin{aligned} \Vert L_hf-Lf\Vert _B \le \epsilon _h \Vert f\Vert _D \end{aligned}$$
for all $f\in D$ and some $\epsilon _h$ such that $\epsilon _h\rightarrow 0$ as $h\rightarrow 0$. Then the semigroups $F_t^h$ converge strongly to the semigroup $F_t$, as $h\rightarrow 0$, and
$$\begin{aligned} \Vert F_t^hf -F_tf\Vert _B \le t \epsilon _h C_D(T)\Vert L\Vert _{D\rightarrow B}. \end{aligned}$$
(10.1)
(ii)
Let $U_h$ be a family of contractions in B such that
$$\begin{aligned} \left\| \left( \frac{U_h-1}{h} -L\right) f\right\| _B \le \epsilon _h \Vert f\Vert _D, \end{aligned}$$
(10.2)
and
$$\begin{aligned} \left\| \left( \frac{F_h-1}{h} -L\right) f\right\| _B \le \varkappa _h \Vert f\Vert _D, \end{aligned}$$
(10.3)
with $\epsilon _h \rightarrow 0$ and $\varkappa _h\rightarrow 0$, as $h\rightarrow 0$. Then the scaled discrete semigroups $(U_h)^{[t/h]}$ converge to the semigroup $F_t$ and moreover
$$\begin{aligned} \sup _{s\le t}\Vert (U_h)^{[s/h]} -F_sf\Vert _B \le (\varkappa _h+\epsilon _h)t \Vert f\Vert _B. \end{aligned}$$
(10.4)

Additional condition (10.3) makes working with discrete approximation a bit more subtle, than with the continuous chain approximations. Effectively to get (10.3) one needs a deeper regularity. Namely one should have another core ${\tilde{D}}$ such that $D\subset {\tilde{D}}\subset B$ with $L\in {\mathcal {L}}(D,{\tilde{D}}) \cap {\mathcal {L}}({\tilde{D}},B)$. In this case it is easy to see that

$$\begin{aligned} \left\| \left( \frac{F_h-1}{h} -L\right) f\right\| _B \le h \Vert L\Vert _{D,{\tilde{D}}} \Vert L\Vert _{{\tilde{D}},B}\Vert f\Vert _D. \end{aligned}$$

(10.5)

Appendix B. Deterministic motions with random jumps

Let us look at the Cauchy problem

$$\begin{aligned} \frac{\partial f_t}{\partial t} =(\nabla f_t, b(x))+Lf_t(x), \quad f_0(x) \, \text {given}, \end{aligned}$$

(11.1)

with the simplest jump-type operator

$$\begin{aligned} L_f(x)=\sum _{j=1}^J f(Y_j(x)-x), \end{aligned}$$

where $x\in {\mathbf {R}}^d$, $\nabla f=\partial f/\partial x$ and $b,Y_j:{\mathbf {R}}^d\rightarrow {\mathbf {R}}^d$ are given bounded smooth functions. It is more or less obvious that the resolving operators of the Cauchy problem (11.1) form a semigroup of contractions in the space $C({\mathbf {R}}^d)$ preserving the spaces of smooth functions. Let us make a precise statement. The simplest way to see it is via the ’interaction representation’. Namely, let $X_t(x)$ denote the solution to the Cauchy problem $\dot{X}_t(x)=b(X_t(x))$, $X_0(x)=x$, and let us change the unknown function f in (11.1) to $\phi $ via the equation $f(x)=\phi (X_t(x))$. Direct substitution shows that $\phi $ solves the Cauchy problem

$$\begin{aligned} \frac{\partial \phi _t}{\partial t} =L_t\phi _t(x)=\sum _{j=1}^J \phi ((X_t(Y_j(X_{-t}(x))))-x), \quad \phi _0=f_0. \end{aligned}$$

(11.2)

Since $L_t$ is a bounded operator, this Cauchy problem can be solved by the convergence series over the powers of $L_t$. This leads to the following result.

Proposition 2

Let $b, Y_j\in C^2({\mathbf {R}}^d)$, $j=1, \cdots , J$. Then the resolving operators $R_t$ of the Cauchy problem (11.1) form a semigroups of contractions in $C({\mathbf {R}}^d)$ such that the spaces $C^1({\mathbf {R}}^d)$ and $C^2({\mathbf {R}}^d)$ are invariant and $R_t$ form semigroups of operators in these spaces that are uniformly bounded for $\in [0,T]$ with any T.

We need an extension of this result for the subsets of ${\mathbf {R}}^d$. The main tool is the following classical theorem of Brezis, which we formulate in its simplest form referring to proofs, extensions and history to [40].

Theorem 6

Let $b(x):K\rightarrow {\mathbf {R}}^d $ be a Lipschitz continuous function, where K is a convex closed subset of ${\mathbf {R}}^d$, such that

$$\begin{aligned} \lim _{h\rightarrow 0_+} \frac{d(y+hb(x),K)}{h}=0 \end{aligned}$$

(11.3)

for any $x\in K$, where d(z, K) denotes the distance between a point z and the set K. Then K is flow invariant. More precisely, for any $x\in K$ there exists a unique solution $X_t(x)$ of the equation $\dot{X}_t(x)=b(X_t(x))$ with the initial condition x that belongs to K for all t.

As a direct consequence we get the following extension of Proposition 2.

Proposition 3

Let K be a convex compact subset of ${\mathbf {R}}^d$ and $b:K\rightarrow {\mathbf {R}}^d$, $Y_j:K\rightarrow K$ be twice continuously differentiable functions. Let b satisfy the assumptions of Theorem 6. Then the resolving operators $R_t$ of the Cauchy problem (11.1) form a semigroups of contractions in C(K) such that the spaces $C^1(K)$ and $C^2(K)$ are invariant and $R_t$ are uniformly bounded operators in these spaces for $\in [0,T]$ with any T.

Appendix C. Position dependent CTRW

Here we recall the basic result on the convergence of continuous time random walks (CTRW).

Suppose $T_1^h,T_2^h, \cdots $ is a sequence of i.i.d. random variables in ${\mathbf {R}}_+$ such that the distribution of each $T_i^h$ is given by a probability measure $\mu _{time}^h (dt)$ on ${\mathbf {R}}_+$, that depend on a positive (scaling) parameter h. Let

$$\begin{aligned} N_t^h=\max \left\{ n: \sum _{i=1}^n T_i^h \le t\right\} . \end{aligned}$$

(12.1)

Suppose $X_1^h,X_2^h, \cdots $ is a sequence of i.i.d. random variables in ${\mathbf {R}}^d$, such that the distribution of each $X_i^h$ is given by a probability measure $\mu _{space}^h (dt)$, that depends on h. The standard (scaled) continuous time random walk (CTRW) is a random process given by the random sum

$$\begin{aligned} \sum _{j=1}^{N_t^h} X_i^h. \end{aligned}$$

In position dependent CTRW the jumps $X_i^h$ are not independent, but each $X_i^h$ depends on the position of the process before this jump. The natural general formulation can be given in terms of discrete Markov chains as follows. Let $U_h$ be a transition operator of a discrete time Markov chain $O^h_n(x)$ in ${\mathbf {R}}^d$ depending on a positive parameter h, so that

$$\begin{aligned} U_hf(x)={\mathbf {E}}O^h_1(x)=\int f(y) \mu ^h(x, dy), \end{aligned}$$

(12.2)

with some family of stochastic kernels $\mu ^h(x, dy)$ such that $U_h$ is a bounded operator either in the space C(K) with a compact convex subset K of ${\mathbf {R}}^d$ or in the space $C_{\infty }({\mathbf {R}}^d)$ of continuous functions vanishing at infinity. For our purposes we need only the operators of the type

$$\begin{aligned} U_hf(x)={\mathbf {E}}O^h_1(x)=\sum _{j=1}^J f(Y_j^h(x)) p_j^h(x), \end{aligned}$$

with a family of continuous maps $Y_j^h:{\mathbf {R}}^d\rightarrow {\mathbf {R}}^d$ and the probability laws $\{p_1^h(x), \cdots , p_J^h(x)\}$.

Suppose $T_1^h,T_2^h, \cdots $ is a sequence of random variables introduced above, and independent of $O^h_n(x)$. The process

$$\begin{aligned} O^h_{N_t^h}(x) \end{aligned}$$

(12.3)

is a generalized scaled (position dependent) continuous time random walk (CTRW) arising from $U_h$ and $\mu ^h_{time}$.

The CTRW were introduced in [37]. They found numerous applications in physics. The scaling limits of these CTRW were analysed by many authors, see e.g. [26, 33, 34]. The scaling limit for the position dependent CTRW was developed in [19]. Formally in [19] it was developed not in full generality, but for the case of the spacial process $O^h_n(x)$ converging to a stable process. However, the arguments of [19] were completely general and did not depend on this assumption. The only point used was that $O^h_n(x)$ converge in the sense of Proposition 1 (ii). For completeness let us formulate the result [19] in a slightly modified version that we need in this paper and present a short proof with essentially simplified arguments from [19] (see also Chapter 8 in [21]).

As an auxiliary result we need the standard functional limit theorem for the random-walk-approximation of stable laws, see e.g. [16] and [34] and references therein for various proofs.

Proposition 4

Let a positive random variable T belong to the domain of attraction of a $\beta $-stable law, $\beta \in (0,1)$, in the sense that

$$\begin{aligned} {\mathbf {P}}(T>m)\sim \frac{1}{\beta m^{\beta }} \end{aligned}$$

(12.4)

(the sign $\sim $ means here that the ratio tends to 1, as $m\rightarrow \infty $). Let $T_i$ be a sequence of i.i.d. random variables from the domain of attraction of a $\beta $-stable law and let

$$\begin{aligned} \varPhi _t^h=\sum _{i=1}^{[t/h]} h^{1/\alpha } T_i \end{aligned}$$

be a scaled random walk based on $T_i$, $h>0$, and $S_t$ a $\beta $-stable Lévy subordinator, that is a Lévy process in ${\mathbf {R}}_+$ generated by the stable generator

$$\begin{aligned} L_{\beta }(x)=\int \frac{f(x+y)-f(x)}{y^{1+\beta }} dy \end{aligned}$$

(which up to a multiplier represents the fractional derivative $d^{\beta }/d(-x)^{\beta }$). Then $\varPhi _t^h \rightarrow S_t$ in distribution, as $h\rightarrow 0$.

The next result is from [19], though modified and simplified.

Proposition 5

Let the random variables $T_i^h=h^{1/\beta } T_i$, where i.i.d. random variables $T_i$ belong to the domain of attraction of a $\beta $-stable law, $S_t$ be a $\beta $-stable Lévy suboridinator and

$$\begin{aligned} \sigma _y=\max \{t: S_t \le y\} \end{aligned}$$

be its inverse process. Let a family of contractions (12.2) satisfy (10.2) with an operator L generating a Feller process $F_t$. Then

$$\begin{aligned} {\mathbf {E}}U_h^s|_{s=[N_t^h/h]} \rightarrow {\mathbf {E}}F_{\sigma _t}, \quad h\rightarrow 0, \end{aligned}$$

strongly as contraction operators in C(K) or $C_{\infty }({\mathbf {R}}^d)$.

Remark 9

This proposition directly implies the following statement about the processes: the subordinated Markov chains (12.3), that is the scaled CTRW, converge in distribution to the process generated by L and subordinated by the inverse of the Lévy $\beta $-subordinator.

Proof

By the density arguments it is sufficient to show that

$$\begin{aligned} \Vert {\mathbf {E}}U_h^{[s/h]}|_{s=N_t^h}f - {\mathbf {E}}F_{\sigma _t}f\Vert \rightarrow 0 \end{aligned}$$

for functions f from the domain of L. We have

$$\begin{aligned} \Vert {\mathbf {E}}U_h^{[s/h]}|_{s=N_t^h}f - {\mathbf {E}}F_{\sigma _t}f\Vert \le I+II, \end{aligned}$$

with

$$\begin{aligned} I=\Vert {\mathbf {E}}U_h^{[s/h]}|_{s=N_t^h}f - {\mathbf {E}}F_{N^h_t}f\Vert , \quad II=\Vert {\mathbf {E}}F_{N^h_t}f - {\mathbf {E}}F_{\sigma _t}f\Vert . \end{aligned}$$

To estimate I we write

$$\begin{aligned} I= & {} \int _0^{\infty }(U_h^{[s/h]}f-F_sf)\mu ^h_t(ds)\\= & {} \int _0^K(U_h^{[s/h]}f-F_sf)\mu ^h_y(ds) +\int _K^{\infty }(U_h^{[s/h]}f-F_sf)\mu ^h_t(ds), \end{aligned}$$

where $\mu ^h_t$ is the distribution of $N_t^h$. Choosing K large enough we can make the second integral arbitrary small uniformly in h. And then by (10.4) we can make the first integral arbitrary small by choosing small enough h (and uniformly in t from compact sets). It remains II. Integrating by parts we get the following:

$$\begin{aligned} II= & {} \Vert {\mathbf {E}}e^{N_t^h L}f-{\mathbf {E}}e^{\sigma _t L} f\Vert \\= & {} \left\| \int _0^{\infty } \frac{\partial }{\partial s} (e^{sL}f) ({\mathbf {P}}(\sigma _t\le s) -{\mathbf {P}}(N_t^h\le s)) \, ds\right\| \\= & {} \left\| \int _0^{\infty } L e^{sL}f ({\mathbf {P}}(S_s> t) -{\mathbf {P}}(\varPhi _s^h>t)) \, ds\right\| . \end{aligned}$$

By (4), ${\mathbf {P}}(\varPhi _s^h>t) \rightarrow {\mathbf {P}}(S_s> t)$ as $h\rightarrow 0$. Therefore $II \rightarrow 0$ by the dominated convergence, as $h\rightarrow 0$. $\square $

Remark 10

From this proof it is seen how to get some explicit rates of convergence. We are not going to give details.

It is well known, see e.g. [41] and detailed presentations in monographs [23, 35], that the subordinated limiting evolution described by the operators ${\mathbf {E}}F_{\sigma _t}$ solves fractional in time differential equations. Namely, under the conditions of Proposition 5, the function $f_t(x)={\mathbf {E}}(F_{\sigma _t}f)(x)$ satisfies the equation

$$\begin{aligned} D^{\beta }_{0+\star }f_t(x)=Lf(x), \quad f_0(x)=f(x), \end{aligned}$$

(12.5)

where $D^{\beta }_{0+\star }$ is the Caputo-Djerbashian derivative of order $\beta $ acting on the variable t, and the operator L acts on the variable x.

Recall that a Lévy subordinator is a process generated by the operator

$$\begin{aligned} L_{\nu }f(x)=\int _0^{\infty } f(x+y)-f(x)) \nu (dy), \end{aligned}$$

(12.6)

where $\nu $ is a one-sided Lévy measure, that is, it satisfies the condition $\int \min (1,y) \nu (dy)<\infty $. Proposition 5 is based on the central limit for stable laws stating the convergence $\varPhi _t^h \rightarrow S_t$ of random walks approximations to a stable Lévy subordinator. If scaled random walks $\varPhi _t^h$ are designed in such a way that they approximate an arbitrary Lévy subordinator, that is, $\varPhi _t^h \rightarrow S_t$ with $S_t$ generated by (12.6), then similar arguments show that

$$\begin{aligned} {\mathbf {E}}U_h^s|_{s=[N_t^h/h]} \rightarrow {\mathbf {E}}F_{\sigma _t}, \quad h\rightarrow 0, \end{aligned}$$

where

$$\begin{aligned} \sigma _y=\max \{t: S_t \le y\}, \quad {\mathbf {N}}_y^h=\max \{t: \varPhi _t^h \le y\}. \end{aligned}$$

In this case the functions $f_t(x)={\mathbf {E}}(F_{\sigma _t}f)(x)$ satisfy the equation

$$\begin{aligned} D^{(\nu )}_{0+\star }f_t(x)=L_tf(x), \quad f_0(x)=f(x), \end{aligned}$$

(12.7)

see e.g. [19, 22], where $D^{(\nu )}_{0+\star }$ is the generalised Caputo-type mixed fractional derivative defined by the equation

$$\begin{aligned} D^{(\nu )}_{0+\star }f_t=\int _0^t (f_{t-s}-f_t)\nu (ds)+ (f_0-f_t)\int _t^{\infty } \nu (ds). \end{aligned}$$

(12.8)

The derivative $D^{\beta }_{0+\star }$ in (12.5) corresponds to $\nu (dy)=y^{-1-\beta } dy$.

Rights and permissions

This article is published under an open access license. Please check the 'Copyright Information' section either on this page or in the PDF for details of this license and what re-use is permitted. If your intended use exceeds what is permitted by the license or if you are unable to locate the licence and re-use information, please contact the Rights and Permissions team.

About this article

Cite this article

Kolokoltsov, V. CTRW modeling of quantum measurement and fractional equations of quantum stochastic filtering and control. Fract Calc Appl Anal 25, 128–165 (2022). https://doi.org/10.1007/s13540-021-00002-2

Download citation

Received: 20 August 2021
Revised: 18 October 2021
Accepted: 15 November 2021
Published: 07 February 2022
Issue Date: February 2022
DOI: https://doi.org/10.1007/s13540-021-00002-2

Keywords

Mathematics Subject Classification

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

CTRW modeling of quantum measurement and fractional equations of quantum stochastic filtering and control

Abstract

Similar content being viewed by others

Quasifree Stochastic Cocycles and Quantum Random Walks

Large Deviations at Level 2.5 for Markovian Open Quantum Systems: Quantum Jumps and Quantum State Diffusion

The law of large numbers for quantum stochastic filtering and control of many-particle systems

1 Introduction

2 Notations for quantum states and tensor products

3 The starting point: Markov chains of sequential indirect observations

4 Belavkin equations for a counting observation

Remark 1

Lemma 1

Theorem 1

Proof

Remark 2

Remark 3

Remark 4

5 Belavkin equations for a diffusive observation

Lemma 2

Theorem 2

Proof

Remark 5

Remark 6

6 Observations via different channels

Lemma 3

Theorem 3

Remark 7

Theorem 4

Proof

Remark 8

7 Fractional quantum stochastic filtering

Theorem 5

8 Fractional quantum control and games

9 Other Markov approximations and unbounded generators

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher's Note

Appendices

Appendix A. Convergence of semigroups

Proposition 1

Appendix B. Deterministic motions with random jumps

Proposition 2

Theorem 6

Proposition 3

Appendix C. Position dependent CTRW

Proposition 4

Proposition 5

Remark 9

Proof

Remark 10

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Mathematics Subject Classification

Search

Navigation