Abstract
We study an optimal liquidation problem with multiplicative price impact in which the trend of the asset price is an unobservable Bernoulli random variable. The investor aims at selling over an infinite time horizon a fixed amount of assets in order to maximise a net expected profit functional, and lump-sum as well as singularly continuous actions are allowed. Our mathematical modelling leads to a singular stochastic control problem featuring a finite-fuel constraint and partial observation. We provide a complete analysis of an equivalent three-dimensional degenerate problem under full information, whose state process is composed of the asset price dynamics, the amount of available assets in the portfolio, and the investor’s belief about the true value of the asset’s trend. Its value function and optimal execution rule are expressed in terms of the solution to a truly two-dimensional optimal stopping problem, whose associated belief-dependent free boundary \(b\) triggers the investor’s optimal selling rule. The curve \(b\) is uniquely determined through a nonlinear integral equation, for which we derive a numerical solution through an application of the Monte Carlo method. This allows us to understand the value of information in our model as well as the sensitivity of the problem’s solution with respect to the relevant model parameters.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction
In this paper, we consider an investor who possesses a fixed amount of assets and aims at selling them on the market. We assume that the investor faces the issue of causing an adverse price reaction, so that fast selling depresses the stock price while splitting the order over time may take too long. This problem – also known as the optimal execution problem in algorithmic trading – thus deals with the question of how to trade optimally in order to maximise a given profit, and therefore of how to determine the times as well as the sizes of the orders.
Dating back to the early works of Bertsimas and Lo [9], Almgren and Chriss [2] and Almgren [1], the study of optimal execution strategies has received much attention and resulted in a series of important contributions in various settings which, amongst other modelling features, can be distinguished with respect to the considered type of price impact: additive or multiplicative. A comprehensive discussion on the latter class of models can be found in Guo and Zervos [44], who also point out that models with multiplicative price impact seem to be more natural since they ensure prices to remain positive. Amongst those works dealing with multiplicative price impact, let us mention Bertsimas et al. [10] for a discrete-time framework, Forsyth et al. [37] for a continuous-time model à la Black–Scholes, and Guo and Zervos [44] and Becherer et al. [5] for settings involving singular stochastic controls.
A common feature in the literature is the assumption that the investor has full information on the trend of the asset. This, however, can be a strong requirement. As pointed out by Ekström and Lu [30], a statistical estimation of the drift is not an efficient procedure, and obtaining a reasonable precision would need decades or even centuries of data under the same market conditions, which is simply not feasible in reality (see also the discussion in Rogers [58, Sect. 4.2]). In some cases, such as initial public offerings, this price history does not even exist.
To account for this fact, we propose a model of optimal execution with multiplicative price impact in which the drift of the stock price dynamics is a random variable which is not directly observable by the investor. Through monitoring the evolution of the price on the market, the investor is able to update her belief regarding the drift value. However, that observation is noisy as the investor cannot perfectly distinguish whether price variations are caused by the drift or by the stochastic driver of the underlying dynamics. From a mathematical point of view, our model leads to a finite-fuel singular stochastic control problem under partial observation, and we investigate how the presence of incomplete information influences the selling strategy of the investor. In particular, we show that the flow of incoming information – through the observation of the asset’s market price – has a direct effect on the optimal execution rule. Indeed, differently from the case of full information treated in Guo and Zervos [44], the decision to sell is no longer triggered by a constant critical price, but the execution threshold changes dynamically depending on the investor’s current belief on the future trend of the asset. Our results show that the optimal execution strategy is in fact determined by a boundary that is increasing in the belief in the larger drift value, corresponding to the intuition that the decision maker chooses to delay selling assets if future prices are expected to increase.
In this regard, our work relates to the strand of economic and financial literature where questions of optimal decision-making under partial observation have been considered; amongst a large number of contributions, we refer to the seminal papers on portfolio selection by Detemple [26] and Gennotte [42], to Veronesi [64] for an equilibrium model with uncertain dividend drift, to Sass and Haussmann [59] for a terminal-wealth portfolio optimisation problem, and to the more recent paper by Colaneri et al. [15] for an optimal liquidation problem with rate strategies and partial observation. Notably, the recent papers Drissi [29] and Bismuth et al. [11] incorporate Bayesian learning in a model of multi-asset optimal execution, although restricting the agent to absolutely continuous (regular) controls.
Furthermore, we contribute to those models dealing with problems of optimal stopping and singular stochastic control. To name just a few recent works, we mention Callegaro et al. [13] for public debt control, De Angelis [19] and Décamps and Villeneuve [25] for dividend payments, Décamps et al. [24] for investment timing, Ekström and Lu [30] as well as Ekström and Vaicenavicius [31] for asset liquidation, Federico et al. [33] for inventory management, Johnson and Peskir [45] for quickest detection, Gapeev [39] for the pricing problem of perpetual commodity equities, and Gapeev and Rodosthenous [40] for a zero-sum optimal stopping game associated with perpetual convertible bonds.
1.1 Our model, approach and overview of the mathematical analysis
We now discuss the mathematical modelling and analysis. Consider an investor holding a fixed amount \(y\) of assets in her portfolio. In the absence of investor actions, the stock price evolves according to a geometric Brownian motion \(dS_{t} {=}\, \beta S_{t} dt \,{+}\, \sigma\! S_{t} d W_{t}\), where \(W\) is a standard Brownian motion and \(\sigma > 0\) a constant volatility parameter. Furthermore, the price process exhibits a random future trend \(\beta \), which is, however, unknown to the decision maker and is assumed to be a random variable, independent of the Brownian noise, taking two values \(\beta _{0} < \beta _{1}\) for some \(\beta _{0},\beta _{1} \in \mathbb{R}\) and \(\beta _{0} <0\).
The decision maker is able to sell the assets on the market over an infinite time horizon, and we denote by \(\xi _{t}\) the cumulative amount of assets liquidated up to time \(t\). Consequently, the remaining assets in the portfolio follow the dynamics \(Y_{t}^{\xi }= y - \xi _{t}\). Clearly, we must have \(\xi _{t} \leq y\) at any time \(t\geq 0\) (finite-fuel constraint) since no more than the initial amount of assets can be sold. As announced, we assume that the investor causes an adverse price reaction upon selling which, following Guo and Zervos [44], we assume to be of multiplicative type. Hence, the controlled asset price evolves as
where \(\alpha > 0\) denotes the parameter of price impact and the operator ∘ is defined in (2.3) below so as to take care of the continuous and jump components of any admissible selling strategy \(\xi \). As will become clear later, see (2.6), the multiplicative price impact structure allows expressing the asset price process as \(S^{\xi }= \exp (X^{\xi})\), where \(X^{\xi}\) is a linearly controlled drifted Brownian motion with volatility \(\sigma >0\) and drift value \(\mu = \beta - \frac{1}{2} \sigma ^{2}\).
The investor aims at maximising the total expected discounted reward upon selling, net of transaction costs; that is, she looks for
where the optimisation is taken over a suitable admissible class of selling strategies \(\xi \) and the investor discounts her future revenues with a strictly positive factor \(r>0\) that can be interpreted as her subjective impatience. This yields a finite-fuel singular stochastic control problem under partial observation.
By relying on classical filtering techniques (cf. Shiryaev [61, Sect. 4.2]), we begin by determining an equivalent Markovian problem – the so-called separated problem – under full information (see Fleming and Pardoux [35] as a classical reference on the separated problem). To this end, we introduce the process \(\Pi \) according to which the investor can update her belief regarding the true value of the drift. This is done by observing the evolution of the process \(X^{0}\) (denoting the uncontrolled version of the process \(X^{\xi}\)), whose natural filtration \((\mathcal{F}_{t}^{X^{0}})\) models the overall information available up to time \(t\). More precisely, after forming a prior \(\pi := \mathbb{P}[\mu = \mu _{1}] \in (0,1)\), the investor dynamically updates her belief upon the arrival of new information through observing the process \(X^{\xi}\) so that the belief process is given by \(\Pi _{t} = \mathbb{P}[\mu = \mu _{1} | \mathcal{F}_{t}^{X^{\xi}}]\). Notice that a value of \(\Pi \) close to 1 indicates a strong belief in the larger value of the drift, while \(\Pi \) close to 0 displays a strong belief in the lower value. Hence we expect the investor to change the liquidation strategy dynamically and base it not solely on the current price on the market, but also on the present belief at that time.
The separated problem turns out to be a three-dimensional degenerate finite-fuel singular stochastic control problem, so that obtaining explicit solutions through a traditional “guess-and-verify approach” is in general not feasible. We note that this approach would be applicable if we take \(\beta _{0} = - \beta _{1}\), which indeed allows a dimension reduction; see e.g. Décamps and Villeneuve [25]. In the present paper, however, we do not assume any relation between \(\beta _{0} \) and \(\beta _{1}\) other than \(\beta _{0} < \beta _{1}\).
In order to tame the multidimensional nature of the resulting optimal execution problem under full information, we then follow a direct approach which hinges on the study of a suitable optimal stopping problem, with value \(v\), that we expect to be associated to the singular stochastic control problem. This method was studied and refined by many authors such as Beneš et al. [6], El Karoui and Karatzas [32] and Karatzas and Shreve [49], or De Angelis [19], De Angelis et al. [20, 21] and Guo and Tomecek [43] for more recent contributions. The optimal stopping problem, which involves the underlying two-dimensional diffusion \((X^{0},\Pi )\) taking values in \(\mathbb{R} \times (0,1)\), can be interpreted as an optimal selling problem and exhibits a structure similar to that of the problem treated by Décamps et al. [24] (see also Ekström and Lu [30] for a parabolic version). We then solve the optimal stopping problem by relying on techniques from free-boundary theory (as illustrated in the monograph by Peskir and Shiryaev [57, Chap. 3]) and first show that the optimal stopping rule is characterised through a belief-dependent free boundary \(a(\pi )\) for \(\pi \in (0,1)\).
However, the coupled dynamics of the underlying processes \(X^{0}\) and \(\Pi \) as well as the fact that they are driven by the same Brownian motion makes a further study of the free boundary and the value function \(v\) not feasible. For that reason, we proceed by deriving two equivalent representations of the optimal stopping problem which allow a thorough analysis. First, via a change of measure, the state process \((X^{0},\Pi )\) is transformed into \((X^{0},\Phi )\) taking values in \(\mathbb{R}\times (0,\infty )\) and with decoupled dynamics. Here, the process \(\Phi \) is the so-called “likelihood ratio”. Again, we can express the optimal stopping strategy in terms of a free boundary \(\varphi \mapsto b(\varphi )\) which results from a simple transformation of the boundary \(\pi \mapsto a(\pi )\). Second, we pass to yet another formulation by deriving the intrinsic parabolic formulation of the stopping problem in coordinates \((X^{0},Z)\), in which the process \(Z\) now follows purely deterministic dynamics and takes values in ℝ. Even though the monotonicity result of the associated free boundary \(z \mapsto c(z)\) is not trivial to derive and calls for a rigorous technical analysis, it is in this formulation that we are able to provide further regularity results for \(c\) and for the transformed optimal stopping value function \(\widehat{v}\). In fact, borrowing arguments from De Angelis [19], suitably adapted to the present setting, we achieve a global regularity of \(\widehat{v}\), namely \(\widehat{v} \in C^{1} (\mathbb{R}^{2})\). The latter result also allows proving that \(\widehat{v}_{xx} \in L_{{\mathrm{{loc}}}}^{\infty }(\mathbb{R}^{2})\) and finally obtaining a nonlinear integral equation uniquely solved by the optimal stopping boundary \(c\). It is worth mentioning that this characterisation can be translated back to both optimal stopping boundaries \(b\) and \(a\) and is thus tantamount to a complete specification of the optimal stopping rule in the original \((x,\pi )\)-coordinates.
The thorough analysis developed for the optimal stopping problem is then exploited in order to identify an optimal execution strategy. In fact, the derived regularity results for \(\widehat{v}\) permit us to prove a verification theorem that identifies an optimal execution rule and shows that the optimal stopping value function \(v\) indeed coincides with a directional derivative of the separated problem’s value function \(V\). Namely, we show that
Notice that if \(\alpha \downarrow 0\), one finds \(V(x,y,\pi )=y v(x,\pi )\), which is the value of the problem in which the investor has no market impact.
The optimal execution rule can be thought of as a “myopic one”. Indeed, it prescribes to sell assets as if the size of the investor’s portfolio were infinite, and to stop selling once the asset’s inventory is depleted (see also Karatzas [47] and El Karoui and Karatzas [32]). The optimal selling rule involves lump-sum executions (whenever the asset price is sufficiently large) that could eventually result in an immediate depletion of the portfolio (if the initial portfolio size is sufficiently small). However, for relatively large portfolios, an initial lump-sum selling is followed by a policy of oblique reflection type. This is triggered by the belief-dependent boundary \(\varphi \mapsto b(\varphi )\) (equivalently, \(\pi \mapsto a(\pi )\)). Notably, given that all the transformations developed for the resolution of the optimal stopping problem are one-to-one and onto, the integral equation for the boundary \(z \mapsto c(z)\) yields an integral equation for \(\varphi \mapsto b(\varphi )\) and therefore a complete characterisation of the optimal execution rule. In order to provide insights about the sensitivity of the optimal decision mechanism of the investor with respect to the model parameters, we develop a recursive numerical scheme which relies on an application of the Monte Carlo method. Overall, we believe that the contributions of this paper are the following. Even though the literature on optimal execution problems is extensive (to name just a few, see Almgren and Chriss [2], Almgren [1], Becherer et al. [5], Bertsimas and Lo [9], Bertsimas et al. [10], Colaneri et al. [15], Gatheral and Schied [41], Guo and Zervos [44], Moreau et al. [54], Schied and Schöneborn [60]), the combination of incomplete information on the future price trend while allowing lump-sum as well as singularly continuous executions constitutes a novelty. Furthermore, the present study on the optimal execution strategy complements as well as extends the literature on problems with a similar structure under full information. As a matter of fact, the derived optimal execution rule exhibits a broader structure and prescribes to take actions depending on the current belief on the future trend of the asset.
1.2 Our contributions
From a mathematical point of view, to the best of our knowledge, ours is the first work providing a complete characterisation of the value function and of the optimal control rule in a finite-fuel singular stochastic control problem under partial observation (which, in the present setting, is equivalent to a three-dimensional degenerate singular stochastic control problem). Furthermore, we believe that the optimal stopping (selling) problem studied as a device to characterise the optimal solution of the optimal execution problem is of interest on its own. By performing a thorough analysis on the regularity of (a transformed version of) its value function and free boundary, we are able to provide a complete characterisation of the optimal selling rule through a nonlinear integral equation, thus extending the results of the related model studied by Décamps et al. [24]. Notice that an integral equation for the free boundary has been obtained also in Ekström and Lu [30] and Ekström and Vaicenavicius [31], but in settings where the parabolic nature of the problem arises because of an explicit time-dependence. Finally, the probabilistic numerical approach developed for the resolution of the free boundary’s integral equation allows understanding the dependence of the investor’s optimal execution strategy on relevant model parameters such as volatility and trend. Moreover, based on the numerical evaluation of the boundary, we can compare the value of the control problem with partial information with that of an associated average drift problem under full information. This allows us to numerically evaluate the question on whether the introduction of uncertainty over the drift actually harms or benefits the investor.
1.3 Organisation of the paper
The rest of the paper is organised as follows. In Sect. 2, we present our setting and first preliminary results. In Sect. 3, we investigate the benchmark problem under full information, before we consider a corresponding optimal stopping problem and its optimal boundary in Sect. 4. In Sects. 5 and 6, we derive two equivalent formulations of the optimal stopping problem under partial observation which allow a more thorough study. Eventually, in Sect. 7, we return to the optimal control problem and characterise the optimal selling rule of the investor. A numerical study based on the derived integral equation of the execution boundary in then carried out in Sect. 8.
2 Setting and problem formulation
Let \((\Omega , \mathcal{F}, \mathbb{P})\) be a complete probability space rich enough to accommodate a standard one-dimensional Brownian motion \((W_{t})_{t \geq 0}\) and an independent random variable \(\beta \) taking two values \(\beta _{0}\) and \(\beta _{1}\). We denote by \(\mathbb{F}^{W} := (\mathcal{F}_{t}^{W} )_{t \geq 0}\) the filtration generated by \((W_{t})_{t \geq 0}\) and augmented by the ℙ-nullsets of ℱ. We assume that in the absence of any actions of the investor, the asset price on the stock market evolves stochastically according to a geometric Brownian motion
where \(\sigma >0\) is a constant volatility. The investor holds a finite amount \(y \geq 0\) of assets which she is able to sell. We identify the cumulative amount of assets sold up to time \(t \geq 0\), which we denote by \(\xi _{t}\), as the investor’s control variable. We denote by \(\mathbb{F}^{Z} := (\mathcal{F}_{t}^{Z})_{t \geq 0}\) the natural filtration of any process \(Z\), augmented by the ℙ-nullsets of ℱ. The set of admissible execution strategies in this context is given by
where the last condition naturally arises from the fact that the investor cannot sell more than the initial amount of assets. Moreover, the remaining assets in the portfolio evolve according to the dynamics
where we stress the dependence on the selling strategy \(\xi \). Following Guo and Zervos [44], we assume in our model that the investor’s transactions on the market have a proportional impact on the asset price. More precisely, when selling a small amount \(\epsilon > 0\) of assets at time \(t\), the price exhibits a jump of size
for \(\alpha > 0\) denoting the parameter of permanent price impact (see Almgren and Chriss [2], Almgren [1] for early works and Becherer et al. [4], Ferrari and Koch [34], Guo and Zervos [44] for more recent contributions). Hence a small transaction is such that \(S_{t} = (1 - \alpha \epsilon ) S_{t-} \simeq e^{- \alpha \epsilon } S_{t-}\), and by interpreting a lump-sum sale of \(\Delta \xi _{t}\) shares as a sequence of \(N\) individual sales of size \(\epsilon = \Delta \xi _{t} / N\), we have
for \(N\) large enough. It follows that for any \(\xi \in \mathcal{A}(y)\), we can model the controlled asset price process by
where
\(\xi ^{c}\) denotes the continuous part of the process \(\xi \) and \(\Delta \xi _{t} := \xi _{t} - \xi _{t-}\). The solution to (2.2) can be explicitly determined via Itô’s formula and is given by
where \(S^{0}\) is the solution to (2.1) and we observe that the price impact of selling is additive with respect to the logarithm of the asset price.
We assume that the investor aims at maximising the total expected (discounted) profits, net of the total cost of selling, and thus seeks to solve
Here, \(\kappa > 0 \) is a proportional transaction cost which, thinking of \(S_{t}^{\xi}\) as the ask price of the stock at time \(t\), can also be interpreted as a constant bid–ask spread. Notice that the structure of the expected net-profit functional in (2.5) can also be justified through stability results in the Skorokhod \(M_{1}\)-topology in probability (see Becherer et al. [5]). Moreover, (2.5) has a finite value due to \(\xi _{t} \leq y\) a.s. Thanks to (2.4), we have \(S_{t}^{\xi }= \exp (X_{t}^{\xi })\), where
with \(x := \ln s\) and \(\mu := \beta - \frac{1}{2} \sigma ^{2}\). In particular, the drift can take two values \(\mu _{i} = \beta _{i} - \frac{1}{2} \sigma ^{2}\), \(i=0,1\). In the following, when needed, we let \(X^{0}\) denote the solution to (2.6) with \(\xi \equiv 0\), which is then an arithmetic Brownian motion. Furthermore, we state the following assumption.
Assumption 2.1
We have \(\beta _{1} > \beta _{0}\) and \(\beta _{0} < 0\), which implies \(\mu _{0} < 0\).
The maximisation problem (2.5) can thus be rewritten in terms of (2.6) as
Notice that for a constant non-random drift coefficient, a close variant of this problem was considered and solved by Guo and Zervos [44], who also incorporate the option of buying shares of assets and the constraint that the whole inventory must be depleted at the terminal time. However, due to the presence of incomplete information on the drift of the asset, (2.7) is not of Markovian nature and thus requires a different analysis. In order to obtain an equivalent Markovian formulation of (2.7), we rely on classical results from filtering theory, dating back to the contribution of A.N. Shiryaev in the context of quickest detection models (see Shiryaev [62] for a survey). To this end, we introduce the belief process
which reflects the conditional probability at time \(t\) that \(\mu = \mu _{1}\) given the observations of the price process up to that time (indeed, \(\mathbb{F}^{S^{0}} = \mathbb{F}^{X^{0}} = \mathbb{F}^{X^{\xi}} \), noting that \(\xi \) must be \(\mathbb{F}^{S^{0}}\)-adapted). According to this process, the investor is able to update the belief regarding the true value of the drift, based on the arrival of new information by observing the asset price evolution on the market. Notice that a large value of \(\Pi \) close to 1 implies a strong belief in the larger drift value \(\mu _{1}\), while a low value of \(\Pi \) implies the contrary. It follows (see e.g. Shiryaev [61, Sect. 4.2]) that the dynamics of \(X^{\xi}\), \(\Pi \) and \(Y^{\xi}\) can be written as
where \(\gamma = (\mu _{1} - \mu _{0})/\sigma \) is the signal-to-noise ratio and
denotes the innovation process which is an \(\mathbb{F}^{X^{0}}\)-Brownian motion on \((\Omega , \mathcal{F}, \mathbb{P})\). Moreover, notice that the value \(\pi := \mathbb{P}[\mu = \mu _{1}]\) reflects the initial subjective belief of the investor regarding the true value of the drift. We do not question the origin of this initial belief; this can be an instinctive decision or even the result of a constructive approach, for instance by observing the trends of similar assets over the past years. In the new formulation, the process \((X^{\xi }, Y^{\xi },\Pi )\) is an \(\mathbb{F}^{X^{0}}\)-adapted and time-homogeneous Markov process, as it is the unique strong solution to the system of stochastic differential equations in (2.8). Furthermore, we observe that the drift \(\mu \) is replaced by its conditional estimate, and the process \(\Pi \) is a bounded martingale valued in \([0,1]\) with \(\Pi _{\infty }\in \{0,1 \}\) as all information will eventually get revealed. Denoting \(\mathbb{E}_{x,y,\pi} [\, \cdot \,] = \mathbb{E} [ \,\cdot \, \vert X_{0-}^{ \xi }= x, Y_{0-}^{\xi }= y , \Pi _{0} = \pi ]\), we can thus reformulate the problem of incomplete information as a so-called separated problem (cf. Bensoussan [7, Chap. 7.1], and Fleming and Pardoux [35])
with
for any \((x,y,\pi ) \in \mathbb{R} \times (0, \infty ) \times (0,1)\). Notice indeed that \(\Pi _{t} \in (0,1)\) for all \(t \geq 0\) a.s. if \(\pi \in (0,1)\), while \(\Pi _{t} \equiv \pi _{0}\) for all \(t \geq 0\) a.s. if \(\pi _{0} \in \{ 0,1\}\). Problem (2.9) is equivalent to (2.5): They share the same value, and because of the uniqueness of the strong solution to (2.8), a control is optimal for (2.5) if and only if it is optimal for (2.9).
2.1 The Hamilton–Jacobi–Bellman equation
Problem (2.9) takes the form of a three-dimensional singular stochastic control problem with finite-fuel constraint (cf. Baldursson [3], Beneš et al. [6], El Karoui and Karatzas [32], Karatzas [46] and Karatzas et al. [48] for early contributions). We start our analysis by providing a heuristic derivation of the dynamic programming equation that we expect the value function \(V\) to satisfy. To this end, we notice that the investor is faced with two possible actions at the initial time. On the one hand, she could choose to wait for a short period of time \(\Delta t\), not sell any fraction of the assets and then continue with an optimal execution strategy (supposing that one exists). Since this strategy is not necessarily optimal, we obtain
If we assume that the value function \(V\) has enough regularity, we can apply Itô’s formula, divide by \(\Delta t\) and invoke the mean value theorem to let \(t \to 0\) and obtain
Here, \(\mathcal{L}_{X,\Pi}\) denotes the second-order differential operator acting on twice-continuously differentiable functions which is given by
On the other hand, the investor can instantaneously sell an amount \(\epsilon > 0\) of the assets and then proceed by following an optimal execution strategy. Again, this strategy is a priori suboptimal, and since this action is associated with the inequality
adding and subtracting \(V(x - \alpha \epsilon , y , \pi )\) and dividing by \(\epsilon \) yields
Hence by letting \(\epsilon \downarrow 0\), we obtain
Since only one of these actions should be optimal and given the Markovian setting of (2.9), we thus expect that the value function \(V\) should identify with an appropriate solution to the Hamilton–Jacobi–Bellman equation
with boundary condition \(u(x,0,\pi )=0\) since \(y=0\) implies \(\mathcal{A}(0) = \{\xi \equiv 0\}\) and \(J(x,0, \pi , 0) =0\). It is worth noticing that the variable \(y\) plays the role of a parameter in (2.11), which is then a two-dimensional elliptic partial differential equation with a state-dependent directional derivative constraint, parametrised by \(y > 0\). With reference to (2.11) and the reasoning above, we can introduce the waiting region
in which it is expected to be suboptimal to sell any assets, and the selling/execution region, where it should be profitable for the investor to sell a fraction of the assets, as
Due to the multidimensional structure of the problem, a traditional guess-and-verify approach as seen for instance in Guo and Zervos [44] and Ferrari and Koch [34] is not effective. In fact, this would require the construction of an explicit solution to the second-order PDE with state dependent gradient constraint seen in (2.11) above, which is not feasible in general. Instead, we use a different approach and construct an optimal stopping problem connected to the stochastic control problem (2.9), which is then of a simpler structure. Before we do so and in order to get insights from a benchmark problem, we briefly discuss the problem under full information, i.e., where the drift coefficient is constant and equal to either \(\mu _{0}\) or \(\mu _{1}\).
3 Benchmark problem under full information
Suppose that the initial subjective belief \(\pi = \mathbb{P}[\mu = \mu _{1}]\) is such that \(\pi \in \{ 0,1\}\). Observe that there exists no uncertainty in the model other than the Brownian one and the belief process \(\Pi \) will remain constant, as the investor is already certain at the initial time about the true value of the drift. Hence in this formulation, we are in the case of full information. The problem we address in this section has a similar structure to the ones studied by Guo and Zervos [44] as well as Koch [52, Chap. 2] and we therefore do not provide full details. Let us assume \(\pi = 0\). We thus obtain \(\Pi _{t} = 0\) for all \(t \geq 0\), and the dynamics of \(X^{\xi}\) and \(Y^{\xi}\) then write as
We denote the corresponding value function as
where \(\mathbb{E}_{x,y} [\, \cdot \, ] = \mathbb{E}[\, \cdot \, \vert \underline{X}_{0-}^{\xi }= x , Y_{0-}^{\xi }=y ]\). By similar arguments as in the case of incomplete information, we expect that \(V_{0}\) should identify with an appropriate solution to the HJB equation
and \(w(x,0) = 0\). Defining the associated waiting and selling regions as
we expect that the investor is only willing to sell shares of the asset when its price is sufficiently large. Hence we guess that for every \(y \geq 0\), there exists a critical price \(G(y)\) such that (3.4) and (3.5) rewrite as
Notice that the candidate value function should then satisfy \((\mathcal{L}_{\underline{X}} -r) w= 0\) on \(\mathbb{W}^{\mu _{0}}\). It is well known that the latter equation admits two fundamental strictly positive solutions; the only solution that remains bounded as \(x \downarrow - \infty \) is then given by
for some functions \(A: [0,\infty ) \to \mathbb{R}\) and where \(n\) is the positive solution to the quadratic equation \((\sigma ^{2}/2) n^{2} + \mu _{0} n -r = 0\). On the other hand, on \(\mathbb{S}^{\mu _{0}}\), we expect that the value function \(V_{0}\) should instead satisfy
In order to derive the solutions for \(A(y)\) and \(G(y)\), we evaluate the two previous formulas at \(x = G(y)\), require that \(A(0)=0\) and obtain
Notice that the optimal execution threshold – determining the price at which the investor should sell – is independent of the current amount of assets in the portfolio. Moreover, the selling region is partitioned into
and we expect that in \(\mathbb{S}_{1}^{\mu _{0}}\), it is optimal to sell the complete amount of assets instantaneously, while in \(\mathbb{S}_{2}^{\mu _{0}} \), the investor makes a lump-sum execution and then follows the strategy that keeps the process \((X,Y)\) inside \(\overline{\mathbb{W}}^{\mu _{0}}\) until all assets are sold. The candidate value function, according to our previous considerations, then takes the shape
and via a verification theorem (cf. Guo and Zervos [44, Prop. 5.1], Koch [52, Prop. 2.4.1]), one can indeed show that this \(w\) is a \(C^{2,1}\) solution to the HJB equation (3.3) and coincides with the value function \(V_{0}\) of (3.2). Moreover, the process
belongs to \(\mathcal{A}(y)\) and provides an optimal execution strategy for (3.2) (cf. Guo and Zervos [44, Prop. 5.1]; recall that here we are not assuming \(\lim _{T \uparrow \infty} Y_{T}^{\xi }= 0\) as admissibility condition, see also Remark 7.6 below). Figure 1 sketches the optimal execution strategy (3.8) for problem (3.2) under full information. We observe that for an initial price \(x\) strictly larger than \(x_{0}^{*}\), the investor immediately does a lump-sum execution. The latter can already deplete the whole portfolio, whenever \(y \leq \frac{1}{\alpha} (x- x_{0}^{*})\), or bring it to the level \((x_{0}^{*}, y - \frac{1}{\alpha}(x - x_{0}^{*}))\) otherwise. Afterwards, the optimal strategy prescribes to keep the state process \((X,Y)\) inside the waiting region \(\overline{\mathbb{W}}^{\mu _{0}}\) with minimal effort, by reflecting it in the direction \((-\alpha ,-1)\) according to a Skorokhod reflection-type policy (realised through the running supremum in (3.8)).
In light of our subsequent analysis, it is interesting to notice that the derivative \(\alpha \partial _{x} V_{0} + \partial _{y} V_{0}\) can be checked from (3.7) to identify with the value function of an optimal stopping problem. More precisely, for any \(x \in \mathbb{R}\), one has
where \(\underline{X}^{0}\) denotes the solution to (3.1) with \(\xi \equiv 0\), the optimisation is performed over all stopping times of the Brownian filtration and \(\mathbb{E}_{x}\) is the expectation under \(\mathbb{P}_{x} [\, \cdot \, ] = \mathbb{P}[\, \cdot \, | \underline{X}_{0}^{0} = x]\). Moreover, the stopping time
is optimal for (3.9). We can interpret (3.10) as the optimal time at which the investor should sell another unit of shares and notice that it in fact characterises the time at which the marginal expected profit \(\alpha \partial _{x} V_{0} + \partial _{y} V_{0}\) coincides with the marginal instantaneous net profit \(e^{x} - \kappa \) from selling.
Remark 3.1
It is easily checked that the results we obtained for the case \(\mu \equiv \mu _{0}\) can be replicated for the case \(\mu \equiv \mu _{1}\). More precisely, considering the dynamics
and the value function
we can verify the existence of an optimal execution threshold \(x_{1}^{*}\) which triggers the selling strategy of the investor through the optimal control \(\xi ^{\mu _{1}}\), which is of similar structure as (3.8) with \(\mu _{0}\) replaced by \(\mu _{1}\). Furthermore, we have
where \(\overline{X}^{0}\) is the solution to (3.11) with \(\xi \equiv 0\), and \(\tau _{1}^{*} (x) := \inf \{ t \geq 0 : \overline{X}_{t}^{0} \geq x_{1}^{*} \}\) \(\mathbb{P}_{x}\)-a.s. is the optimal stopping time for problem (3.12).
4 A related optimal stopping problem
Motivated by the observed connection to an optimal stopping problem in the benchmark problem of Sect. 3 (see (3.12)), we pursue the following approach in the subsequent analysis: (i) We introduce and study an optimal stopping problem, with value \(v\), that we expect to be associated to the singular stochastic control problem (2.9); (ii) We provide a complete analysis of the optimal stopping problem which is achieved by studying two equivalent formulations of it (cf. Sects. 5 and 6). More precisely, we derive regularity results for the value function (cf. Proposition 6.9) as well as an integral equation for the free boundary (cf. Proposition 6.11); (iii) We verify the expected connection to the original problem of (2.9) by showing that (cf. Theorem 7.3)
and that the optimal execution strategy is triggered by the optimal stopping boundary studied in step (ii). In fact, as in the benchmark case, we can interpret the optimal stopping problem as the marginal problem in the sense that its value \(v\) coincides with the derivative of the value \(V\) of (2.9) in the direction of actions/execution and its optimal stopping strategy characterises the time at which it is optimal to sell a unit of assets.
We recall that \((X^{0}_{t} , \Pi _{t})_{t \geq 0}\) is the two-dimensional strong Markov process solving
and in the following, in order to simplify notation, we write \(X\) instead of \(X^{0}\). For a stopping time \(\tau \) of the filtration \(\mathbb{F}^{X}\), we then define
and consider the optimal stopping problem
Above and in the following, \(\mathbb{E}_{x,\pi} [\, \cdot \, ] = \mathbb{E}[ \, \cdot \, \vert X_{0} = x, \Pi _{0} = \pi ]\). Also, denoting by \((X_{t}^{x,\pi} , \Pi _{t}^{\pi})_{t \geq 0}\) the unique strong solution to (4.1), we often employ the equivalent notation \(\mathbb{E}[f(X_{t}^{x,\pi} , \Pi _{t}^{\pi })] = \mathbb{E}_{x, \pi} [f (X_{t} , \Pi _{t})]\) for any integrable measurable function \(f: \mathbb{R} \times [0,1] \to \mathbb{R}\).
We make the next standing assumption.
Assumption 4.1
We assume that
Remark 4.2
(i) The different conditions we impose on the (subjective) discount factor \(r\) serve distinct purposes. Notice that the first condition is equivalent to imposing \(r > \beta _{1}\) and guarantees wellposedness of problem (4.2).
(ii) Moreover, the forthcoming analysis (in particular Sect. 6) reveals that the other two terms are sufficient to ensure monotonicity of (a transformation of) the optimal stopping boundary of problem (4.2) (cf. Propositions 6.3 and 6.5). This result is crucial when deriving the smooth-fit property and thus, by relying on arguments developed in De Angelis and Peskir [22], the global \(C^{1}\)-regularity of (a transformation of) the value function \(v\) of (4.2). When \(r\) does not satisfy Assumption 4.1, the monotonicity of the (transformed version of the) boundary is not clear and one needs an alternative route to achieve the needed regularity of \(\widehat{v}\). A possible approach could be to prove directly the (local) Lipschitz-regularity of the free boundary (cf. De Angelis and Stabile [23]) and then infer the \(C^{1}\)-property of \(\widehat{v}\) from the continuity of the optimal stopping time with respect to the initial data. Since this is not straightforward to obtain in our formulation, we leave it for future research.
In the following, we derive some preliminary results for the optimal stopping problem (4.2) and its associated free boundary. Noticing that \((x,\pi ) \mapsto X_{t}^{x,\pi}\) as well as \(\pi \mapsto \Pi _{t}^{\pi}\) are continuous and nondecreasing due to classical comparison theorems for strong solutions to stochastic differential equations, the proof of the following lemma follows from standard arguments and is therefore skipped.
Lemma 4.3
The value function \(v\) of (4.2) is such that
i) \(x \mapsto v(x, \pi )\) is nondecreasing;
ii) \(\pi \mapsto v(x, \pi )\) is nondecreasing.
Furthermore, as \((x, \pi ) \mapsto (X_{t}^{x,\pi}, \Pi _{t}^{\pi})\) is continuous ℙ-a.s., Assumption 4.1 and standard estimates using the fact that \(\Pi \) is bounded allow us to invoke dominated convergence and obtain that
is continuous, and hence \((x,\pi ) \mapsto v(x, \pi )\) is lower semicontinuous. As it is customary in optimal stopping theory, we introduce the continuation and stopping regions associated to \(v\) as
Then the continuation region \(\mathcal{C}_{1}\) is an open set while the stopping region \(\mathcal{S}_{1}\) in (4.4) is closed, and by Peskir and Shiryaev [57, Corollary 1.2.2.9], the stopping time
is optimal whenever it is ℙ-a.s. finite; otherwise it is an optimal Markov time. We set
with the convention \(\inf \emptyset = \infty \), and state the following lemma.
Lemma 4.4
It holds that
Proof
Recalling (2.10), an application of Dynkin’s formula yields
For \(x_{1} < x_{2}\) and \(\tau ^{*}\) optimal for \(v(x_{2} , \pi )\), we have by Assumption 4.1 that
For \((x_{1}, \pi ) \in \mathcal{S}_{1}\) and \(x_{2} > x_{1}\), we thus obtain \(0 \leq u(x_{2},\pi ) \leq u(x_{1}, \pi ) = 0\) so that \((x_{2}, \pi ) \in \mathcal{S}_{1}\). □
The free boundary \(a(\pi )\) thus splits \(\mathbb{R} \times (0,1)\) into the continuation and stopping region. In the following lemma, we derive some preliminary properties.
Lemma 4.5
One has the following properties:
i) \(\pi \mapsto a(\pi )\) is nondecreasing on \((0,1)\).
ii) \(\pi \mapsto a(\pi )\) is left-continuous on \((0,1)\).
iii) There exist constants such that \(x_{0}^{*} \leq a(\pi ) \leq x_{1}^{*}\) for all \(\pi \in (0,1)\).
Proof
i) If \(\pi _{2} > \pi _{1}\) and \((x,\pi _{2})\in \mathcal{S}_{1}\), we have \(x \geq a(\pi _{2})\) and \(v(x, \pi _{2})=e^{x}-\kappa \). Since \(\pi \mapsto v(x,\pi ) \) is nondecreasing, \(v(x,\pi _{1}) \leq v(x,\pi _{2}) = e^{x}-\kappa \), which together with \(v(x, \pi _{1}) \geq (e^{x}-\kappa )\) gives \((x, \pi _{1}) \in \mathcal{S}_{1}\). Therefore \(a(\pi _{2}) \geq a(\pi _{1})\).
ii) For a sequence \((\pi _{n})_{n \in \mathbb{N}}\) with \(\pi _{n} \uparrow \pi \), the sequence \((a(\pi _{n}))\) is increasing by i) and \(a(\pi _{n}) \leq a (\pi )\). So \(\lim _{n \to \infty} a(\pi _{n}) =: a(\pi -)\) exists and \(a( \pi -) \leq a(\pi )\). Because \(v(a(\pi _{n}), \pi _{n}) = e^{a(\pi _{n})} - \kappa \) for all \(n \in \mathbb{N}\), lower semicontinuity of \((x, \pi ) \mapsto v(x, \pi )\) yields \(v(a(\pi -), \pi ) = e^{a(\pi -)} -\kappa \). Hence \(a(\pi ) \leq a(\pi -)\) and \(\lim _{n \to \infty} a(\pi _{n}) = a( \pi )\).
iii) Recall from (3.9) and (3.12) the value functions \(v_{0}\) and \(v_{1}\) in the optimal stopping problems with full information when either \(\mu \equiv \mu _{0}\) or \(\mu \equiv \mu _{1}\). The associated continuation regions are given by
where \(x_{0}^{*}\) and \(x_{1}^{*} \) are the optimal execution thresholds (cf. (3.6) and Remark 3.1). Recalling \(\mu _{0} < \mu _{1}\) and \(\Pi _{t} \in (0,1)\) for \(\pi \in (0,1)\), we have \(\underline{X}^{0}_{t} \leq X_{t} \leq \overline{X}_{t}^{0}\) ℙ-a.s. for any \(t \geq 0\) by classical comparison arguments, where \(\underline{X}^{0}\) and \(\overline{X}^{0}\) denote the solutions to (3.1) and (3.11) with \(\xi \equiv 0\). Thus \(v_{0} (x) \leq v(\pi ,x) \leq v_{1} (x)\), which implies
and the latter combined with (4.5) allows concluding that \(x_{0}^{*} \leq a(\pi ) \leq x_{1}^{*}\). □
5 Decoupling change of measure and a new optimal selling problem
We notice that the underlying dynamics in (4.1) are coupled. In order to derive further results about the properties of the optimal stopping problem (4.2) and its associated free boundary, it is useful to address the problem under a different probability measure. With reference to related contributions (cf. De Angelis [19], Ekström and Lu [30], Johnson and Peskir [45] and Shiryaev [62] and references therein), we introduce the so-called likelihood ratio process via
Through an application of Itô’s formula, we can derive its associated dynamics given by
and we aim to remove its dependence on the process \(\Pi \) through a change of measure. For a fixed \(T>0\), we define the measure \(\mathbb{Q}_{T} \approx \mathbb{P}\) on \((\Omega , \mathcal{F}_{T})\) via the Radon–Nikodým derivative
and notice that the process
is a Brownian motion under \(\mathbb{Q}_{T}\) on \([0,T]\). Rewriting the state process \((X,\Phi )\) under \(\mathbb{Q}_{T}\) then yields
and we notice that the processes decouple under this formulation. In the following, when needed, we write \(\mathbb{E}_{x, \varphi}^{\mathbb{Q}_{T}} \) to denote the expectation under \(\mathbb{Q}_{T}\) conditionally on \(X_{0}=x\), \(\Phi _{0} = \varphi \). To rewrite (4.2) in terms of the new variables \((X, \Phi )\), we introduce
and by an application of Itô’s formula, it can be verified that \(\Theta \) admits the representation
Upon using (5.1) and (5.3), we find that
for any stopping time \(\tau \) and \((x, \varphi ) \in \mathbb{R} \times (0,\infty )\). With regard to (5.4), we introduce the stopping problems
and notice that (5.4) implies \(v^{\mathbb{Q}_{T}} (x, \varphi ; T)= (1 + \varphi ) v(x , \varphi /(1 + \varphi ) ; T)\) for fixed \(T>0\). However, since the measure \(\mathbb{Q}_{T}\) changes with \(T\), passing to the limit \(T \to \infty \) in (5.4) requires a bit of care. To this end, we define a probability space \((\widetilde{\Omega }, \widetilde{\mathbb{F}}, \widetilde{\mathbb{Q}})\) with a Brownian motion \(\widetilde{B}\) and a filtration \(\widetilde{\mathbb{F}} = (\widetilde{\mathcal{F}}_{t})_{t \geq 0}\). Moreover, we let \((\widetilde{X}, \widetilde{\Phi })\) be the strong solution to the stochastic differential equation (5.2) driven by the Brownian motion \(\widetilde{B}\) instead of \(B\). Let \(\widetilde{\mathbb{E}}_{x, \varphi} [\, \cdot \, ]\) denote the expectation under \(\widetilde{\mathbb{Q}}\) and define the stopping problems
Due to the equivalence in law of the process \((\widetilde{X}_{t}, \widetilde{\Phi }_{t}, \widetilde{B}_{t})_{t \geq 0}\) under \(\widetilde{\mathbb{Q}}\) and the process \((X_{t}, \Phi _{t} , B_{t})_{t \geq 0}\) under \(\mathbb{Q}_{T}\), both on \([0,T]\), we have \(v^{\mathbb{Q}_{T}} (x,\varphi ; T) = \overline{v}(x, \varphi ;T)\). Moreover, using Fatou’s lemma and simple comparison arguments, one can show that
Hence we finally obtain
For the sake of clarity – and with a slight abuse of notation –, we simply write from now on \((\Omega , \mathbb{F},(\mathcal{F}_{t})_{t \geq 0},\mathbb{Q}, \mathbb{E}^{\mathbb{Q}},X ,\Phi , B)\) instead of \((\widetilde{\Omega }, \widetilde{\mathbb{F}}, ( \widetilde{\mathcal{F}}_{t})_{t \geq 0}, \widetilde{\mathbb{Q}}, \widetilde{\mathbb{E}}, \widetilde{X}, \widetilde{\Phi }, \widetilde{B})\). Henceforth, we thus study the optimal stopping problem
In the sequel, we often write \(\mathbb{E}_{x,\varphi}^{\mathbb{Q}} [ f (X_{t} , \Phi _{t} )] = \mathbb{E}^{\mathbb{Q}} [f(X_{t}^{x}, \Phi _{t}^{\varphi})]\), where \((X_{t}^{x}, \Phi _{t}^{\varphi })_{t \geq 0}\) is the unique strong solution to (5.2). The continuation and stopping regions associated to this problem are then given by
Given the lower semicontinuity of \(v\) and (5.5), we find that \((x,\varphi ) \mapsto \overline{v}(x, \varphi )\) is lower semicontinuous as well. Hence the stopping region \(\mathcal{S}_{2}\) of (5.8) is a closed set, while the continuation region \(\mathcal{C}_{2}\) of (5.7) is open. Also,
is optimal by Peskir and Shiryaev [57, Chap. 1] whenever it is ℚ-a.s. finite. Furthermore, we define
with \(\inf \emptyset = \infty \). In the following lemma, we derive some preliminary properties of the value function (5.6). In light of the relation (5.5), we notice that some of the following results are a direct consequence of Lemma 4.3.
Lemma 5.1
The value function \(\overline{v}\) of (5.6) has the following properties:
i) \(0 \leq \overline{v}(x,\varphi ) \leq K_{1} e^{x} (1 + \varphi )\) for all \((x, \varphi ) \in \mathbb{R} \times (0, \infty )\) and some \(K_{1} > 0\).
ii) \(x \mapsto \overline{v}(x, \varphi )\) is nondecreasing.
iii) \(\varphi \mapsto \overline{v} (x, \varphi )\) is nondecreasing.
iv) \((x, \varphi )\mapsto \overline{v}(x, \varphi )\) is locally Lipschitz over \(\mathbb{R} \times (0,\infty )\).
v) \(\varphi \mapsto \overline{v}(x, \varphi )\) and \(x \mapsto \overline{v} (x, \varphi )\) are convex.
Proof
ii) follows from Lemma 4.3 i) upon using (5.5).
i) For the lower bound, note that \(\{ (x,\varphi ) \in \mathbb{R} \times (0,\infty ) : e^{x}- \kappa < 0 \} \subseteq \mathcal{C}_{2}\). Hence, since \(\Phi ^{\varphi }\geq 0\) a.s., we have \(\overline{v}(x, \varphi ) \geq 0\) for all \((x,\varphi ) \in \mathbb{R} \times (0, \infty )\). For the upper bound, we observe that for any stopping time \(\tau \), setting \(\pi = \varphi /(1 + \varphi )\) gives
where the last inequality follows from standard estimates upon using Assumption 4.1.
iii) Let \(\varphi , \varphi ' \in (0, \infty )\) with \(\varphi ' > \varphi \) and notice that \(\Phi _{t}^{\varphi }= \varphi e^{- \frac{1}{2} \gamma ^{2} t + \gamma B_{t}}\). For \(x\in \mathbb{R}\) and \(\tau ^{*} := \tau ^{*} (x, \varphi )\) optimal for \(\overline{v}(x,\varphi )\), we have
where the last inequality exploits that \(\{ (x,\varphi ) \in \mathbb{R} \times (0,\infty ): e^{x} - \kappa < 0 \} \subseteq \mathcal{C}_{2}\).
iv) Let \(x,x' \in \mathbb{R}\), \(\pi \in (0,1)\) and \(\varphi , \varphi ' \in (0, \infty )\). Recall \(v\) from (4.2). Again, standard estimates yield
for some \(K_{1}, K_{2} >0\). Hence using (5.5), we obtain
and the local Lipschitz property follows.
v) We first prove convexity with respect to \(\varphi \in (0,\infty )\). For \(\varphi _{1}, \varphi _{2} \in (0,\infty )\), \(x \in \mathbb{R}\) and \(\lambda \in (0,1)\), we set \(\overline{\varphi} := \lambda \varphi _{1} + (1- \lambda ) \varphi _{2}\) and obtain
Analogously, upon exploiting the convexity of \(x \mapsto e^{x}\), one can prove the convexity of \(x \mapsto \overline{v}(x, \varphi )\). □
Lemma 5.2
The continuation and stopping region regions in (5.7) and (5.8) satisfy
Proof
We proceed similarly to Lemma 4.4. We first notice that the second-order differential operator associated with the two-dimensional process \((X, \Phi )\) is given by
for \(f \in C^{2} ( \mathbb{R} \times (0,\infty ) )\), and we apply Dynkin’s formula to obtain
For \(x_{2} > x_{1}\) and \(\tau ^{*} := \tau ^{*} (x_{2},\varphi )\) optimal for \(\overline{v} (x_{2} , \varphi )\), we have
where the last inequality follows from \(X^{x_{2}} \geq X^{x_{1}}\) ℚ-a.s. and Assumption 4.1. Hence for \((x_{1}, \varphi ) \in \mathcal{S}_{2}\) and \(x_{2} > x_{1}\), we obtain \(0 \leq \overline{u}(x_{2} , \varphi ) \leq \overline{u}(x_{1}, \varphi ) = 0\) and the claim follows. □
It is interesting to notice that there exists a one-to-one correspondence between the continuation regions \(\mathcal{C}_{1}\) and \(\mathcal{C}_{2}\) of (4.3) and (5.7) as well as the stopping regions \(\mathcal{S}_{1}\) and \(\mathcal{S}_{2}\) of (4.4) and (5.8). Indeed, introducing the diffeomorphism
with inverse
one has
Furthermore, upon using Lemma 4.4 and Lemma 5.2, we find that
Due to this explicit relationship between the optimal stopping boundaries, we obtain some first results on \(b\) thanks to Lemma 4.5.
Lemma 5.3
The boundary \(b(\varphi )\) of (5.9) has the following properties:
i) \(\varphi \mapsto b(\varphi )\) is nondecreasing on \((0,\infty )\).
ii) \(\varphi \mapsto b(\varphi )\) is left-continuous.
iii) \(b\) is bounded by \(x_{0}^{*} \leq b(\varphi )\leq x_{1}^{*}\) for all \(\varphi \in (0,\infty )\), with \(x_{0}^{*}\) and \(x_{1}^{*}\) as in Lemma 4.5.
The relationship (5.14) and the transformation (5.13) allow us to translate back our results from this section – as well as from the following section – to the initial optimal stopping problem (4.2). Moreover, (5.14) turns out to be valuable in the proof of Lemma 5.3, since proving the monotonicity result i) as well as the boundedness iii) is not straightforward without exploiting the relation between \(b\) and \(a\) and the results of Lemma 4.5.
6 A parabolic formulation
Observe that the dynamics of the processes \(X\) and \(\Phi \) in (5.2) are driven by the same Brownian motion. In order to account for this degeneracy, we pass to yet another formulation of the optimal stopping problem. To this end, we rely on a transformation that reveals the true parabolic nature of the generator \(\mathcal{L}_{X,\Phi}\) as in (5.11), i.e., that poses it in its canonical form (cf. Strauss [63, Chap. 1.6]). Define
which is a diffeomorphism with inverse given by
With regard to the transformation (6.1), we introduce the process
and an application of Itô’s formula reveals that its dynamics are given by
Furthermore, we define the transformed version of the value function \(\overline{v}\) of (5.6) via
for \((x,z) \in \mathbb{R}^{2}\) and where now \(\mathbb{E}^{\mathbb{Q}}_{x,z} [\, \cdot \, ] = \mathbb{E}^{ \mathbb{Q}} [ \, \cdot \, \vert X_{0} = x, Z_{0} =z ]\). In light of this explicit relationship between the value functions \(\overline{v}\) and \(\widehat{v}\), we obtain the following result from Lemma 5.1.
Lemma 6.1
The value function \(\widehat{v}\) of (6.4) is locally Lipschitz-continuous on \(\mathbb{R}^{2}\).
The associated continuation and stopping regions are given by
where \(\mathcal{C}_{3}\) is open and \(\mathcal{S}_{3}\) is closed. Furthermore, the global diffeomorphism (6.1) implies that \(\mathcal{C}_{3} = \overline{T}(\mathcal{C}_{2})\) as well as \(\mathcal{S}_{3} = \overline{T}(\mathcal{S}_{2})\), with \(\mathcal{C}_{2}\) and \(\mathcal{S}_{2}\) as in (5.7) and (5.8). Notice that the second-order infinitesimal generator associated to the process \((X, Z)\) is now such that
We can rely on standard arguments from classical PDE theory as well as optimal stopping theory (see e.g. Karatzas and Shreve [50, Theorem 2.7.7]) and obtain the following lemma.
Lemma 6.2
The value function \(\widehat{v}\) of (5.5) is the unique classical \(C^{2,1}\)-solution to the boundary value problem
for \(\mathcal{L}_{X,Z}\) as in (6.7) and any open set ℛ such that its closure is contained in the continuation region \(\mathcal{C}_{3}\) of (6.5). In particular, \(\widehat{v} \in C^{2,1} (\mathcal{C}_{3})\).
In the following, we aim at investigating the geometry of the state space in the coordinates \((X,Z)\). To this end, we define the generalised inverse of the nondecreasing boundary \(b\) by
so that the continuation region \(\mathcal{C}_{2}\) of (5.7) is rewritten as
Since \(\varphi \mapsto b(\varphi )\) is nondecreasing by Lemma 5.3, we observe that
and by setting
we can rewrite (6.5) and (6.6) as
In contrast to the optimal stopping problems in the formulations (4.2) and (5.6), deriving the monotonicity of the boundary \(x \mapsto c^{-1}(x)\) is not straightforward. Moreover – and differently to related contributions such as Federico et al. [33] –, we cannot translate it back to the monotonicity of the boundary \(b\) of (5.9) since the generalised inverse \(b^{-1}\) is nondecreasing as well, and this does not imply monotonicity of \(x \mapsto c^{-1} (x)\). Therefore we follow and adapt arguments presented in De Angelis [19, Sect. 4.4] who studies separately the two cases in which the deterministic process \(Z\) in (6.3) is either increasing (\(\mu _{0} + \mu _{1} \geq 0\)) or decreasing (\(\mu _{0} + \mu _{1} < 0\)).
For the following analysis, it is useful to define
as well as
and we observe that an application of Dynkin’s formula implies that
In the following Propositions 6.3 and 6.5, we establish the existence of a monotone boundary \(c: \mathbb{R} \to \mathbb{R}\) that splits the state space into continuation and stopping region. As we verify later in Remark 6.6, the function \(c^{-1}\) in (6.9) is indeed the right-continuous inverse of this function.
Proposition 6.3
If \(\mu _{0} + \mu _{1} \geq 0\), there exists a nondecreasing function \(c: \mathbb{R}\to \mathbb{R}\) such that the continuation region \(\mathcal{C}_{3}\) of (6.5) is rewritten as
Proof
Let \((x_{0},z_{0}) \in \mathcal{S}_{3}\), \(x_{1} > x_{0}\) and notice that (6.10) implies \((-\infty , z_{0}] \times \{x_{0} \} \subseteq \mathcal{S}_{3}\). Furthermore, we have \(x_{0} > x_{0}^{*}\), and since the process \(Z\) is decreasing, we observe that the process \((X^{x_{1}} , Z^{z_{0}})\) crosses the half-line \((- \infty , z_{0}] \times \{x_{0}\}\) before reaching the level \(x_{0}^{*}\). Hence we have \(\mathbb{Q}_{x_{1} , z_{0} }[\tau ^{*} < \tau _{x_{0}^{*}}] =1\), where \(\tau _{x_{0}^{*}} :=\inf \{t \geq 0: X_{t}^{x_{1}} = x_{0}^{*} \}\) and \(\mathbb{Q}_{x_{1}, z_{0}} [\, \cdot \, ] = \mathbb{Q} [\, \cdot \, \vert X_{0} = x_{1} , Z_{0} = z_{0} ]\). Moreover, it can be verified that the second condition of Assumption 4.1 implies \(x_{0}^{*} > \tilde{x}\), with the latter given by
Consequently, we have \(\exp (X_{s}^{x_{1}}) (r - \frac{1}{2} \sigma ^{2} - \mu _{1}) > r \kappa \) for all \(s \in [0,\tau ^{*})\), and (6.12) and (6.13) imply \(\widehat{u}(x_{1} , z_{0}) \leq 0\) for all \(x_{1} > x_{0}\) and therefore \(\{z_{0}\} \times [x_{0}, \infty ) \subseteq \mathcal{S}_{3}\). We can thus define
and observe that (6.10) implies that \(z \mapsto c(z)\) is nondecreasing. □
In order to establish the same result in the case when \(\mu _{0} + \mu _{1} < 0\), we first state the following lemma.
Lemma 6.4
We have
for all \((x,z) \in \mathbb{R}^{2} \setminus \partial \mathcal{C}_{3}\) and \(\tau ^{*} := \tau ^{*} (x,z)\).
Proof
For \((x,z) \in \mathcal{S}_{3}\), the claim follows immediately since \(\mathbb{Q}_{x,z} [\tau ^{*} = 0] =1\). Hence we let \((x,z) \in \mathcal{C}_{3}\), and for \(\epsilon > 0\), we obtain
where the first inequality follows from the supermartingale property of the process \(( e^{-r (\tau \wedge t)} \widehat{v}(X_{\tau \wedge t}^{x} , Z_{ \tau \wedge t}^{z + \epsilon}) )\) and the martingale property of \(( e^{-r (\tau ^{*} \wedge t)} \widehat{v}(X_{\tau ^{*} \wedge t}^{x} , Z_{\tau ^{*} \wedge t}^{z} ) )\) for \(\tau ^{*} := \tau ^{*} (x,z)\). Upon employing a change of measure as in Sect. 5, we find
where \(\pi = e^{\frac{\gamma}{\sigma}(x+z)} /(1 + e^{\frac{\gamma}{\sigma}(x+z)} )\). It is then easy to verify that Assumption 4.1 implies
and hence applying dominated convergence in (6.18) as \(t \to \infty \) yields
Similar arguments show
and since \(\widehat{v} \in C^{2,1} (\mathcal{C}_{3})\) by Lemma 6.2, dividing (6.19) and (6.20) by \(\epsilon \) and letting \(\epsilon \downarrow 0\) yields the desired result. □
Proposition 6.5
If \(\mu _{0} + \mu _{1} < 0\), there exists a nondecreasing function \(c:\mathbb{R} \to \mathbb{R}\) (the same as in Proposition 6.3) such that the continuation region of (6.5) can be written as
Proof
Let \((x,z) \in \mathbb{R}^{2}\). Notice that \(x < x_{0}^{*}\) implies \((x',z) \in \mathcal{C}_{3} \) for all \(x' < x\) and \(z \in \mathbb{R}\) because of Lemma 4.5 and since the transformations \(T_{1}\) and \(\overline{T}_{1}\) of (5.13) and (6.1), respectively, are the identity; hence \(\{ (x,z): x < x_{0}^{*} \} \subseteq \mathcal{C}_{3}\). We can thus focus on the case \(x \geq x_{0}^{*}\) and distinguish two possibilities:
-
i)
\(\widehat{u}_{x} (x,z) \leq 0\) for all \(x \in (x_{0}^{*},\infty )\) such that \((x,z) \in \mathcal{C}_{3}\);
-
ii)
there exists \(x_{0} \in (x_{0}^{*}, \infty )\) such that \((x_{0},z) \in \mathcal{C}_{3}\) and \(\widehat{u}_{x} (x_{0} ,z) > 0\).
In case i), the map \(x \mapsto \widehat{u}(x,z)\) is decreasing for \(x \in (x_{0}^{*}, \infty )\) and \((x,z) \in \mathcal{C}_{3}\). Hence for any \((x,z) \in \mathcal{C}_{3}\), we obtain \((-\infty ,x] \times \{z\} \subseteq \mathcal{C}_{3}\) and the claim follows in the same spirit as in Proposition 6.3. In case ii), we establish a contradiction. As a first step, we show that ii) implies \([x_{0}, \infty ) \times \{ z \} \subseteq \mathcal{C}_{3}\) which will then lead to a contradiction. We start by noticing that Lemma 6.2 and (6.13) imply
for \((x_{0},z)\) as in ii). As \(\mu _{0} < 0\) and \(\widehat{u}_{x} (x_{0},z) > 0\), we have \(\mu _{0} \widehat{u}(x_{0},z) < 0\) and thus
Next, we notice that we can rewrite (6.17) as
and since
(6.23) gives
which is equivalent to
We can thus plug this last equality into (6.22) and obtain
where the last inequality follows from \(r > \frac{\gamma}{2 \sigma} \vert \mu _{0} + \mu _{1} \vert \) in Assumption 4.1 upon noticing that \(x_{0} > x_{0}^{*}\). We deduce that \(\widehat{u}_{x} (\, \cdot \, , z)\) increases in a right neighbourhood of \(x_{0}\), and repeating the arguments for every \(x > x_{0}\) yields \(\widehat{u}_{x} (\, \cdot \, , z) > 0\) on \([x_{0} , \infty )\). It follows that \(\widehat{u}(\, \cdot \, , z)\) is increasing on \([x_{0} , \infty )\) so that \([x_{0} , \infty ) \times \{z\} \subseteq \mathcal{C}_{3}\), and combining the latter with (6.10), we have \(\mathcal{A} := [x_{0} , \infty ) \times [z_{0} , \infty ) \subseteq \mathcal{C}_{3}\). However, this leads to a contradiction. To see this, let \((x,z) \in \mathcal{A}\) and \(\tau _{x_{0}} := \inf \{ t>0: X_{t}^{x} \leq x_{0} \}\). Since \(t \mapsto Z_{t}^{z}\) is increasing, the only possibility for the process \((X^{x},Z^{z})\) to exit \(\mathcal{A}\), and thus eventually the continuation region, is by passing through the horizontal line \([x_{0},\infty ) \times \{z_{0}\}\). We thus have \(\tau _{x_{0}} \leq \tau ^{*}\) \(\mathbb{Q}_{x,z}\)-a.s. and moreover, since \(\mu _{0}< 0\), the stopping time \(\tau _{x_{0}}\) is finite a.s. Upon using Lemma 5.1 i) and (6.4), it follows that
Let now \(\widehat{r} := r- \frac{\gamma}{2 \sigma} \vert \mu _{0} + \mu _{1} \vert > 0\), where the inequality is from Assumption 4.1, and denote by \(\phi _{r}\) (resp. \(\phi _{\widehat{r}}\)) the strictly decreasing solution to \(\frac{1}{2} \sigma ^{2} f_{xx} + \mu _{0} f_{x} - q f = 0\) for \(q \in \{r , \widehat{r} \}\). Then by results on hitting times for one-dimensional diffusions (see e.g. Borodin and Salminen [12, Chap. II.10]), the above inequality is equivalent to
which thus holds true for all \((x,z) \in \mathcal{A}\). Since \(\mathcal{A}\) is right-connected, we can let \(x \to \infty \) and notice that \((e^{x} -\kappa )(1 + e^{\frac{\gamma}{\sigma}(x+z)}) \to \infty \), while the right-hand side of (6.24) decreases to 0 due to the decreasing property of \(x \to \phi _{q} (x)\) for \(q>0\). We thus obtain a contradiction, which concludes our proof. □
Remark 6.6
Due to the implied geometry of the state space, as observed in Propositions 6.3 and 6.5, we notice that the function \(x \mapsto c^{-1} (x)\) is nondecreasing as well. Moreover, we notice that
and hence the function \(c^{-1}\) is the right-continuous inverse of \(c\) and thus admits the representation
In light of the connection (6.9) between \(c^{-1}\) and \(b^{-1}\) (the generalised inverse of the boundary \(b\)), (6.25) allows us to translate back our results to the formulation of Sect. 5 and then – through the representation (5.14) – to the original setting of Sect. 4.
6.1 Regularity of the value function and of the optimal stopping boundary
We have established the existence of a nondecreasing boundary \(z \mapsto c(z)\) such that \(\mathbb{R}^{2}\) is split into the continuation region \(\mathcal{C}_{3}\) of (6.5) and the stopping region \(\mathcal{S}_{3}\) of (6.6). In the following, we derive some further properties of the optimal stopping boundary and of the value function \(\widehat{v}\) of (6.4). We first state the following result, which will be helpful in the forthcoming analysis.
Lemma 6.7
We have \(\widehat{u}_{z} (x,z) \geq 0\) for \((x,z) \in \mathcal{C}_{3}\).
Proof
Due to (6.4) and (6.1), \(\overline{v}\) of (5.6) satisfies \(\overline{v}(x, \varphi ) = \widehat{v}(x, \frac{\sigma}{\gamma} \ln \varphi - x)\) for all \((x,\varphi ) \in \mathbb{R} \times (0, \infty )\). Since \(\widehat{v}_{z} \in C^{0} (\mathcal{C}_{3})\) by Lemma 6.2, we then also have \(\overline{v}_{\varphi }\in C^{0} (\mathcal{C}_{2})\). Furthermore, \(\varphi \mapsto \overline{v} (x, \varphi )\) is convex on \((0,\infty )\) by Lemma 5.1 iv) and thus so is \(\varphi \mapsto \overline{u}(x, \varphi )\) of (5.12). Then for \((x,\varphi ) \in \mathcal{C}_{2}\) and \(\varphi ' = b^{-1}(x)\) so that \((x, \varphi ') \in \partial \mathcal{C}_{2}\), we obtain, as \(\overline{u}_{\varphi }\in C^{0} (\mathcal{C}_{2})\) as well,
and \(\varphi > b^{-1} (x)\) implies \(\overline{u}_{\varphi }(x, \varphi ) \geq 0\) for \((x,\varphi ) \in \mathcal{C}_{2}\). In light of the relation (6.4), we then obtain \(\widehat{u}_{z} (x,z) \geq 0\) on \(\mathcal{C}_{3}\). □
Proposition 6.8
The optimal stopping boundary \(c(z)\) is such that \(x_{0}^{*} \leq c(z) \leq x_{1}^{*}\) for all \(z \in \mathbb{R}\) and with \(x_{0}^{*}\) and \(x_{1}^{*}\) as in Lemma 4.5. Furthermore, we have \(c \in C(\mathbb{R})\).
Proof
The first assertion follows from Lemma 5.3 iii) and by noticing that the transformation \(\overline{T}_{1}\) of (6.1) is the identity. We derive the continuity of \(z \mapsto c(z) \) in two steps.
1) Left-continuity. Let \(z_{0} \in \mathbb{R}\) and \(z_{n} \uparrow z_{0}\) as \(n \to \infty \). Since \(z \mapsto c(z)\) is nondecreasing and \(\mathcal{S}_{3}\) is closed, we obtain \(\lim _{n \to \infty} (c(z_{n}), z_{n}) = (c(z_{0} -), z_{0} ) \in \mathcal{S}_{3}\), where \(c(z_{0} -)\) denotes the left limit of \(c\) at \(z_{0}\). The definition of \(c\) in (6.16) implies that \(c(z_{0} - ) \geq c(z_{0})\); but since \(c\) is nondecreasing, we must have \(c(z_{0} -) = c(z_{0})\) and the claim follows.
2) Right-continuity. We argue by contradiction and assume there exists \(z_{0} \in \mathbb{R}\) with \(c(z_{0}) < c(z_{0} +)\). Using techniques developed in De Angelis [18], we take \(c(z_{0}) < x_{1} < x_{2} < c(z_{0} +)\) and a nonnegative function \(\phi \in C^{\infty}_{c} (x_{1},x_{2})\) such that \(\int _{x_{1}}^{x_{2}} \phi (x) dx =1\). Recalling (6.21), we have
for \((x,z) \in (x_{1},x_{2})\times (z_{0}, \infty )\). In the following, it is helpful to treat the cases i) \(\mu _{0} + \mu _{1} \geq 0\) and ii) \(\mu _{0} + \mu _{1} < 0\) separately. Let us start with i) and recall that \(\widehat{u}_{z} (x,z) \geq 0\) for \(x\) and \(z\) as above, due to Lemma 6.7. Integration by parts yields
Hence using dominated convergence as \(z \downarrow z_{0}\) and \(\widehat{u}(x,z_{0})=0\), we get
where the latter inequality follows from \(x_{1},x_{2} \geq x_{0}^{*}\) and Assumption 4.1, which implies \(x > \tilde{x}\) for all \(x \in [x_{1}, x_{2}]\) and \(\tilde{x}\) as in (6.15). We thus obtain a contradiction and \(c(z_{0}) = c(z_{0} + )\).
In case ii), we rely on classical results of internal regularity of PDEs (cf. Friedman [38, Theorem 3.10]) which allow taking derivatives in (6.26) with respect to \(x\) and have \(\widehat{u}_{x} \in C^{2,1}(\mathcal{C}_{3})\) solving
Then we obtain for \(z > z_{0}\) that
Let \(F_{\phi} (z) := \int _{x_{1}}^{x_{2}} \widehat{u}_{xz} (x,z) \phi (x) dx\). Integration by parts allows rewriting (6.27) as
and using dominated convergence as \(z \downarrow z_{0}\) as well as \(\widehat{u}(x,z_{0})=0\) results in
for some \(p_{0}\), where the first inequality again follows from Assumption 4.1. Thus there exists \(\epsilon > 0\) such that \(F_{\phi }(z) \geq p_{0}/2\) for all \(z\in (z_{0},z_{0} + \epsilon )\), and we finally obtain
where we used \(\widehat{u}(x,z_{0})=0\) as well as \(\widehat{u}_{x} (x,z) \leq 0\) for \(x \in [x_{1}, x_{2}]\) and \(z > z_{0}\) (cf. Proposition 6.5). Hence \(c(z) = c(z +)\) for all \(z \in \mathbb{R}\) and together with 1), we conclude that \(z \mapsto c(z)\) is continuous. □
In the next result, we derive the regularity of the value function. Its proof can be found in Appendix A.
Proposition 6.9
The value function \(\widehat{v}\) of (6.4) has \(\widehat{v} \in C^{1} (\mathbb{R}^{2})\) and \(\widehat{v}_{xx} \in L_{{\mathrm{{loc}}}}^{\infty }(\mathbb{R}^{2})\).
In light of Proposition 6.9, we are able to derive an integral equation for the free boundary \(c\). Let us first recall that by standard arguments based on the strong Markov property and Proposition 6.9, the value function \(\widehat{v}\) and the free boundary \(c\) solve the free-boundary problem
In the next result, with a suitable application of Itô’s lemma, we derive a probabilistic representation of the value function \(\widehat{v}\). Its proof is postponed to Appendix B.
Proposition 6.10
Recall the free boundary \(c\) of (6.16) and the function \(g\) of (6.12). For any \((x,z) \in \mathbb{R}^{2}\), the value function \(\widehat{v}\) can be written as
Denote now by
the density function of a Gaussian random variable with mean \(m\) and variance \(v^{2}\). Then from Proposition 6.10, we obtain the following result.
Proposition 6.11
Let
Then the free boundary \(c\) of (6.16) is the unique solution in ℳ to the integral equation
with \(g\) as in (6.12) and \(G\) as in (6.30).
Proof
We take \(x = c(z)\) in Proposition 6.10. Using the continuity of the value function, we find for \(z \in \mathbb{R}\) that
Because \(Z^{z}\) is deterministic and \(X_{s}^{c(z)}\) is Gaussian under ℚ with mean \(c(z) + \mu _{0} s\) and variance \(\sigma ^{2} s\), we can reformulate (6.32) as (6.31) upon using (6.30). To show uniqueness, one can employ a four-step-approach exploiting the superharmonic characterisation of \(\widehat{v}\) as originally developed in Peskir [55, Theorem 3.1]. Since the present setting does not exhibit additional challenges, we omit the details for the sake of brevity. □
Remark 6.12
As is turns out, the integral equation (6.31) allows us to derive an integral equation for the boundary \(b^{-1}\) of (6.8) as well. Indeed, taking \(z = c^{-1}(x)\) in (6.31) and using (6.9) yields
In particular, it follows from the latter that
Notice that the domain of \(b^{-1}\) is given by the interval \([x_{0}^{*}, x_{1}^{*}]\) (cf. Lemma 5.3), and hence we do not encounter any problems when dividing by \(e^{x} - \kappa \) since Assumption 4.1 guarantees that \(e^{x} - \kappa > 0\) for \(x \geq x_{0}^{*}\).
7 Solution of the optimal execution problem
In this section, we finally return to the optimal execution problem of Sect. 4 and provide its solution. Before we do so, it is helpful to transform the singular stochastic control problem (2.9) by arguing as for the optimal stopping problems in Sects. 5 and 6, respectively. Since the arguments are in the same spirit of those developed in Sect. 5, the details are omitted (see also Federico et al. [33, Sect. 4]). First, we make a change of measure as in Sect. 5, and for ℚ as introduced there, we let
denote the dynamics of the controlled process \(X^{\xi}\) under ℚ. Conditionally on \(X_{0-}^{\xi }= x\), \(Y_{0-}^{\xi }= y\) and \(\Phi _{0} = \varphi \), we then introduce the transformed optimal control problem
and observe that \(\overline{V}(x,y,\varphi ) = (1 + \varphi ) V(x,y, \frac{\varphi}{1 + \varphi})\). Furthermore, we set
for any \((x, \varphi )\in \mathbb{R} \times (0,\infty )\). An application of Itô’s formula then gives the dynamics
Finally, analogously to (6.4), we define, for \((x,y,z) \in \mathcal{O} := \mathbb{R} \times (0,\infty ) \times \mathbb{R}\),
where \(\mathbb{E}^{\mathbb{Q}}_{x,y,z}\) is the expectation conditionally on \(X_{0-}^{\xi }= x\), \(Y_{0-}^{\xi }=y\) and \(Z_{0-}^{\xi }= z\).
In the following, we introduce a candidate for the value function \(V\) of (2.9) and – through the explicit relationships between the value functions \(v\), \(\overline{v}\) and \(\widehat{v}\) – also for the value functions \(\overline{V}\) and \(\widehat{V}\) of (7.1) and (7.3). To this end, we set
where \(v\) denotes the value function of (4.2). Upon using the explicit relationship (5.5) of \(v\) and \(\overline{v}\), it follows that
which gives a candidate for the value function \(\overline{V}\) of (7.1). Furthermore, we let \(\widehat{U}(x,y,z) := \overline{U}(x,y, e^{\frac{\gamma}{\sigma}(x+z)} )\) and exploit the relationship (6.4) to derive
where the last equality follows from a change of variables. With regard to Proposition 6.9, we can state the following result; its proof is based on direct computations.
Lemma 7.1
The function \(\widehat{U}\) of (7.6) has \(\widehat{U} \in C^{1} (\mathcal{O})\). Moreover, \(\widehat{U}_{xy}, \widehat{U}_{yz} \in C(\mathcal{O})\) as well as \(\widehat{U}_{xx} , \widehat{U}_{xz} \in L_{{\mathrm{{loc}}}}^{\infty }( \mathcal{O})\).
Proof
Notice that (7.6) gives
so that \(\widehat{U}_{x}\), \(\widehat{U}_{y}\) and \(\widehat{U}_{z}\) are continuous due to Proposition 6.9. Moreover, Proposition 6.9 also implies that
is locally bounded, and the mixed derivatives
are continuous, while
is locally bounded. Furthermore, it is easy to see that \(\widehat{U}_{yx} = \widehat{U}_{xy}\), \(\widehat{U}_{xz} = \widehat{U}_{zx}\) and \(\widehat{U}_{yz} = \widehat{U}_{zy}\). □
The proof of the next result follows from (7.7), (7.8) and direct computations.
Corollary 7.2
One has
so that
with \(\mathcal{C}_{3}\) as in (6.5). Furthermore, \(\widehat{U}_{x} - \widehat{U}_{z} = \overline{U}_{x}\) as well as \(\widehat{U}_{y} = \overline{U}_{y}\), and we have
with \(\mathcal{C}_{2}\) from (5.7).
7.1 Construction of the optimal control for the state space process \((X,Y,\Phi )\)
Recall \(b\) from (5.9), which is nondecreasing and left-continuous by Lemma 5.3. Then we define for any \((x,y,\varphi ) \in \mathbb{R} \times (0,\infty )\times (0,\infty )\) the admissible control strategy
according to which the investor should only execute a lump-sum amount of shares whenever \(X_{t-}\) is strictly inside the selling region and hence strictly above the boundary \(b(\Phi _{t})\). More precisely, if \(y \leq \frac{1}{\alpha} (x - b(\varphi ))\), it is optimal to sell the complete amount of shares instantaneously, while for \(y > \frac{1}{\alpha} (x - b(\varphi ))\), the system is brought immediately to the level \((X_{0}, Y_{0}, \Phi _{0})= (b(\varphi ), y - \frac{1}{\alpha} (x - b( \varphi )), \varphi )\). Afterwards, the strategy (7.13) prescribes to take action whenever \(X_{t}\) approaches the boundary \(b(\Phi _{t})\) from below and the process \((X,Y)\) is obliquely reflected at the belief-dependent boundary \(b(\Phi )\) in the direction \((- \alpha , -1)\). Hence \(X_{t}\) is kept inside the interval \((-\infty , b(\Phi _{t})]\) with “minimal effort”. These actions are so-called Skorokhod reflection-type policies and caused by the continuous part \(\widehat{\xi }^{c}\) of the control \(\widehat{\xi }\). Notice that the nondecreasing process \(\widehat{\xi }\) and the induced random measure \(d\widehat{\xi }\) on \([0,\infty )\) are such that (recall (7.12))
Furthermore, due to (7.2), (7.3) and Corollary 7.2, we can express the control \(\widehat{\xi }\) equivalently in terms of the state-process \((X^{\widehat{\xi }} , Y^{\widehat{\xi }}, Z^{\widehat{\xi }})\) by (cf. (7.11))
In the following, we prove that \(\widehat{\xi }\) is in fact an optimal control for problem (7.3) and that \(\widehat{U} = \widehat{V}\). As an immediate consequence, we obtain \(\overline{U} = \overline{V}\) and \(U=V\).
Theorem 7.3
Let \((x,y,z) \in \mathbb{R} \times [0,\infty ) \times \mathbb{R}\) and \(\widehat{U}(x,y,z)\) as in (7.6). Then \(\widehat{U}(x,y,z) = \widehat{V} (x,y,z)\), and \(\widehat{\xi }\) in (7.13) is optimal for the singular control problem (7.3).
Proof
First of all, for \(y =0\), we have \(\widehat{U}(x,0,z) =0= \widehat{V}(x,0,z)\). Hence in the following, we assume \((x,y,z)\in \mathcal{O}\).
1) We prove \(\widehat{U} \geq \widehat{V}\). Take an arbitrary control \(\xi \in \mathcal{A}(y)\) and for \(R>0\) and \(N \in \mathbb{N}\), define \(\tau _{R,N} := \inf \{ s \geq 0: \vert (X_{s}^{\xi }, Z_{s}^{\xi}) \vert > R \} \wedge N\). Due to Lemma 7.1, we can proceed as in Fleming and Soner [36, Theorem 8.4.1] to obtain (after performing an approximation of \(\widehat{U}\) via mollifiers and taking limits)
Denote by \(M_{R,N}\) the \(dB\)-integral in (7.15). Notice that
Hence combining (7.15) and (7.16) and adding the term
on both sides yields
We observe that (7.7)–(7.9) imply
where the last inequality follows from the supermartingale property of the process \((e^{-rt} \widehat{v}(X_{t}, Z_{t}))\), combined with the regularity obtained in Proposition 6.9. Hence due to (7.10), \(\widehat{U} \geq 0\) and \(\mathbb{E}^{\mathbb{Q}}_{x,y,z} [M_{R,N}] = 0\), (7.17) is written as
Taking limits as \(R \uparrow \infty \) as well as \(N \uparrow \infty \) and invoking the dominated convergence theorem thanks to Assumption 4.1, we obtain
Since \(\xi \) was arbitrary, we have
for all \((x,y,z) \in \mathcal{O}\), that is, \(\widehat{U} \geq \widehat{V}\) on \(\mathcal{O}\).
2) We prove that \(\widehat{U} \leq \widehat{V}\). Let \(\widehat{\xi }\) satisfy the conditions in (7.14) and define \(\widehat{\tau }_{R,N} = \inf \{ t \geq 0: \vert (X_{t}^{ \widehat{\xi }} , Z_{t}^{\widehat{\xi }} ) \vert > R \} \wedge N\) for \(R>0\) and \(N \in \mathbb{N}\). Notice that the properties of \(\widehat{\xi }\) imply equalities in (7.10) and (7.18), where the equality in (7.18) follows from the monotonicity of \(c\), and we can deduce that \((x' , x+z -x') \in \mathbb{W}_{3}\) for \((x,y,z) \in \mathbb{W}_{3}\) and \(x' \leq x\). The same arguments as in the first part of the proof yield
It is thus left to prove that
because combining this with (7.19) implies that \(J(x,y,z,\widehat{\xi }) = \widehat{U} (x,y,z)\) and therefore \(\widehat{V}(x,y,z) \geq \widehat{U}(x,y,z)\) for all \((x,y,z) \in \mathcal{O}\). Combining the latter with (7.20) yields \(\widehat{U} = \widehat{V}\) on \(\mathcal{O}\).
In order to prove (7.21), we notice that Lemma 5.1 i), (6.4) and (7.6) imply that
and since \(y \mapsto \widehat{U}(x,y,z)\) is increasing, we obtain
where we used that \(X_{t}^{\xi }\leq X_{t}^{0}\) as well as \(X_{t}^{\xi }+ Z_{t}^{\xi }= X_{t}^{0} + Z_{t}^{0}\) a.s. Hence taking expectations yields
with \(\pi := e^{\frac{\gamma}{\sigma}(x+z)}/(1 + e^{\frac{\gamma}{\sigma}(x+z)} )\), where the last equality follows from a change of measure as in Sect. 5. Upon using Assumption 4.1, it is easy to check that (7.21) holds true, thus completing the proof. □
Remark 7.4
We can use the transformation (6.1) from \((x,z)\)- to \((x, \varphi )\)-coordinates in order to show that \(\widehat{\xi }\) is an optimal control for problem (7.1) as well. Indeed, recall (6.2) and the equality \(\widehat{V}(x,y,z) = \widehat{V}(x,y,\frac{\sigma}{\gamma} \ln \varphi -x) = \overline{V}(x,y,\varphi )\) to conclude
Furthermore, (7.22) and (7.5) imply that we have \(U(x,y,\pi )=V(x,y,\pi )\) for all \((x,y,\pi ) \in \mathbb{R} \times (0,\infty ) \times (0,1)\).
Remark 7.5
Letting \(\tilde{\tau}(x,y,\varphi ) := \inf \{ t \geq 0:x + \mu _{0} t + \sigma B_{t} \geq b(\Phi ^{\varphi}_{t}) \}\), the optimal execution strategy \(\widehat{\xi }\) as in (7.13) converges as \(\alpha \downarrow 0\) to the execution strategy
which prescribes to sell the total amount of shares instantaneously when the process \(X\) reaches the optimal execution boundary \(b(\Phi )\). It is interesting to notice that the optimal strategy and the value function are robust with respect to the parameter \(\alpha \). Indeed, by L’Hôpital’s rule, we see from (7.5) that \(\lim _{\alpha \downarrow 0} = y \overline{v}(x, \varphi )\) for \((x,y,\varphi ) \in \mathbb{R}\times (0,\infty )\times (0,\infty )\), and it is easy to show via a verification theorem that \(y \overline{v}(x,\varphi )\) and \(\widetilde{\xi }\) are the value function and the optimal execution rule in the problem with no market impact.
Remark 7.6
Let \(\widehat{\sigma } := \inf \{ t \geq 0: Y_{t}^{\xi }= 0 \}\) denote the time at which the portfolio is fully depleted. Imposing the constraint that the investor has to sell all assets until the terminal time (cf. Guo and Zervos [44]), we notice that for \(y \leq \frac{1}{\alpha} (x - b(\varphi ))\), the control strategy \(\widehat{\xi }\) of (7.13) still defines an optimal control as the complete amount of shares is sold immediately at time \(t=0\). However, for \(y > \frac{1}{\alpha} (x - b(\varphi ))\), simple calculations yield
and we notice that for increasing \(y\) and decreasing \(x\), the probability increases that the investor does not sell the entire amount of shares until the terminal time. Hence if we restrict the admissible strategies to all \(\xi \in \mathcal{A}(y)\) such that \(\lim _{T \to \infty} Y_{T}^{\xi }= 0\), the control strategy \(\widehat{\xi }\) of (7.13) does not provide an admissible execution strategy. In this case, arguing as in Guo and Zervos [44, Proposition 5.1], we can use \(\widehat{\xi }\) to construct a sequence of \(\epsilon \)-optimal strategies.
8 Numerical study
In this section, we perform a comparative statics analysis on the optimal execution boundaries \(a\) and \(b\) of (4.5) and (5.9), respectively, and also investigate the value of information in our model by comparing the value function \(V\) of (2.9) to the value of an average drift problem.
8.1 Comparative statics analysis
Based on the integral equation (6.31), we implement a recursive numerical scheme which relies on an application of the Monte Carlo method. Let \(\zeta \) denote an auxiliary exponentially distributed random variable with parameter \(r\) that is independent of the Brownian motion \(B\). Recalling that (6.31) can be reformulated as (6.33), we notice that the latter takes the shape of a fixed point problem
for \(x \in \mathbb{R}\) and \(b^{-1}\) being the generalised inverse of \(b\) as in (6.8). Here, the operator \(\Gamma \) is defined via
for \((x,\varphi ) \in \mathbb{R} \times (0,\infty )\) and a function \(f: \mathbb{R} \to (0,\infty ) \). By employing techniques seen in Christensen and Salminen [14], Dammann and Ferrari [17] and Detemple and Kitapbayev [27], we aim to solve (8.1) via an iterative scheme. To this end, we let
define a sequence of boundaries and – for a given boundary \((b^{-1})^{[k]}\) – we estimate the expectation in (8.2) by
where \(N\) denotes the number of realisations of the exponential random variable. The initial boundary \((b^{-1})^{[0]}\) can be chosen as a simple exponential function with \((b^{-1})^{[0]} (x_{0}^{*}) = 0\) and \((b^{-1})^{[0]} (x) \to \infty \) for \(x \uparrow x_{1}^{*}\) with \(x_{0}^{*}\) and \(x_{1}^{*}\) as in (3.6) and Remark 3.1, respectively. The numerical scheme (8.3) is then iterated on a grid over the interval \([x_{0}^{*},x_{1}^{*}]\) until the variation of the boundary points between steps falls below a predetermined level. Finally, we calculate \(b\) from its generalised inverse \(b^{-1}\) and transform the resulting boundary according to the explicit relationship (5.14). We can thus study the sensitivity of \(b(\varphi )\) as well as \(a(\pi )\) with respect to some of the model parameters. Furthermore, we can compare the belief-dependent boundaries to the strategy of a pre-committed agent, who – after forming an initial belief \(\pi = \mathbb{P}[\mu = \mu _{1} ]\) – restrains from updating her belief and thus acts as if the drift value was constant and equal to \(\mu _{1} \pi + \mu _{0} (1- \pi )\). The resulting strategy is then triggered by a constant execution threshold which has a similar structure as the one derived in Sect. 3. Consequently, such an agent cannot react to price movements on the market and is thus not able to decrease or increase the target price at which she would like to sell the asset.
8.1.1 Sensitivity with respect to the drift
In Fig. 2, we can observe the sensitivity of the optimal execution boundaries with respect to one of the possible drift values. Since an increase in \(\mu _{1}\) implies higher expected prices on the market, the investor delays her decision to sell a fraction of her shares and waits for higher prices to evolve. This effect is strongest for higher values of \(\pi \), which reflect a stronger belief in the drift \(\mu _{1}\). On the other hand, we notice that the lower bound \(x_{0}^{*}\) remains untouched by a change in \(\mu _{1}\), since it results from the case of full information when \(\mu = \mu _{0}\). Consequently, for a strong belief in the drift value \(\mu _{0}\), the investor does not significantly change her execution strategy.
8.1.2 Sensitivity with respect to the discount rate
Figure 3 shows the effect on the boundaries \(a\) and \(b\) of a change in \(r\); the latter can be interpreted as the subjective impatience of the investor. For a larger value of \(r\), the investor gets more impatient and discounts future revenues more heavily. Consequently, the investor is willing to liquidate her assets earlier, which is realised by decreasing the target price she aims at achieving on the market. This clear effect can be observed for every value of belief \(\pi \in [0,1]\).
8.1.3 Sensitivity with respect to the volatility
The sensitivity of the optimal execution boundaries \(a\) and \(b\) on the volatility of the underlying asset is more delicate. As pointed out by Décamps et al. [24] who consider an optimal stopping problem of a structure similar to the one in (4.2), the effect of an increase in volatility is ambiguous and cannot always be predicted with the help of standard real option models (see for example Dixit and Pindyck [28, Chap. 6], McDonald and Siegel [53]). In general, one expects a value function increasing with rising volatility, as this increases the spread of possible future values of the asset and thus the maximal possible profit, while the maximal possible loss remains unchanged. The investor exploits this upside potential by delaying her liquidation decision and increasing the target price she aims at realising on the market. This effect, widely known and referred to as the “real option effect” in Décamps et al. [24], can be observed in the benchmark case (3.2) as well as in the problem (2.9) under partial information, as Fig. 4 reveals.
However, this effect need not be robust. To understand how an increase in volatility might indeed harm the investor, we recall the dynamics of the belief process \(\Pi \) given by (2.8). In particular, we observe that increasing volatility lowers the signal-to-noise ratio \(\gamma = (\mu _{1} - \mu _{0})/\sigma \) (determining the variance of the process \(\Pi \)) and thus the efficiency of learning. The latter effect is in contrast to the mentioned real option effect, and the sensitivity of the value function with respect to an increase in volatility “depends on which of the real option and the inefficient learning effect dominates” (Décamps et al. [24, p. 487]). The overall impact of a change in volatility thus clearly depends on the parameter constellation of the model; however, a division of the parameter space is not straightforward. For a broader discussion on this subject, we refer to Décamps et al. [24, Sect. 6.2],
8.2 The value of information
Here, we want to address the question whether incomplete information about the drift actually harms or benefits the investor. To this end, we introduce the “average drift problem”, whose value is denoted by \(V^{A} (x,y)\) and modelled as in (3.2), but with constant and known drift \(\mu _{1} \pi + \mu _{0} (1-\pi )\), i.e., the average of \(\mu \) with respect to the prior Bernoulli distribution. We then investigate the preference of an investor faced with the decision of choosing between two portfolios containing assets with either an unknown drift coefficient, or with a constant and known average drift. An analytical attempt to answer this question is presented in Décamps et al. [24], although the derived result does not hold true in general, as pointed out by Klein [51]. Here, we are able to analyse this question with numerical methods based on the numerical evaluation of the optimal execution boundary (cf. Sect. 8.1) and the representation (6.29) of the optimal stopping value function \(\widehat{v}\). In order to accomplish that, we plug in the numerical evaluation of \(b^{-1}\) into (6.29) and we transform the result according to (5.13). This yields the value function \(v\) of (4.2), which can be finally integrated via (7.4) to obtain a numerical approximation of the control problem’s value function \(V\).
In general, the results derived in Décamps et al. [24] and Klein [51] suggest that the overall impact of introducing uncertainty over the drift is governed by two separate effects: the introduction of uncertainty in general and the impact of learning. If learning is efficient, which is achieved by – for example – specifying a small volatility coefficient \(\sigma \), the latter effect seems to outweighs the former and the investor indeed prefers the problem with only incomplete information on the return. We observe this overall effect in Fig. 5.
In their model, Décamps et al. [24] give an analytical proof to this observation in an optimal stopping environment, although restricting the possible drift values to 0 and 1. For small values of \(\sigma \), depending on the other parameters in the model, this result seems to hold true in our more generalised framework. Nevertheless, this effect cannot be expected to be robust over the whole parameter space. In an example where the parameter values are aligned such that \(\beta _{0} + \beta _{1} = \sigma ^{2}\) (and thus \(\mu _{0} = - \mu _{1}\) in our model), Klein [51] obtains an explicit solution to the optimal stopping problem and shows how the introduction of uncertainty might harm the decision maker. This effect appears to have the peculiarity of being, at least in some cases, dependent on the initial value of the price process, as it determines the distance to the target price at which the investor is willing to execute. We can observe an example of this in Fig. 6. In particular, if the asset price is close to the target value under the current belief and learning is inefficient, the investor will not choose a portfolio with drift uncertainty. This is due to the fact that the downside risk outweighs the upside potential, which could only be achieved if learning is efficient. On the other hand, we observe that for low prices the upside potential might still dominate and the investor is willing to choose the uncertain environment, even if learning is inefficient.
References
Almgren, R.: Optimal execution with nonlinear impact functions and trading-enhanced risk. Appl. Math. Finance 10, 1–18 (2001)
Almgren, R., Chriss, N.: Optimal execution of portfolio transactions. J. Risk 3, 5–40 (1999)
Baldursson, F.M.: Singular stochastic control and optimal stopping. Stoch. Int. J. Probab. Stoch. Process. 21, 1–40 (1987)
Becherer, D., Bilarev, T., Frentrup, P.: Optimal liquidation under stochastic liquidity. Finance Stoch. 22, 39–68 (2018)
Becherer, D., Bilarev, T., Frentrup, P.: Stability for gains from large investors’ strategies in \(M_{1} / J_{1} \) topologies. Bernoulli 25, 1105–1140 (2019)
Beneš, V.E., Shepp, L.A., Witsenhausen, H.S.: Some solvable stochastic control problems. Stoch. Int. J. Probab. Stoch. Process. 4, 39–83 (1980)
Bensoussan, A.: Stochastic Control of Partially Observable Systems. Cambridge University Press, Cambridge (1992)
Bensoussan, A., Lions, J.L.: Applications of Variational Inequalities in Stochastic Control. North-Holland, Amsterdam (1982)
Bertsimas, D., Lo, A.W.: Optimal control of execution costs. J. Financ. Mark. 1, 1–50 (1998)
Bertsimas, D., Lo, A.W., Hummel, P.: Optimal control of execution costs for portfolios. Comput. Sci. Eng. 1, 40–53 (1999)
Bismuth, A., Guéant, O., Pu, J.: Portfolio choice, portfolio liquidation, and portfolio transition under drift uncertainty. Math. Financ. Econ. 13, 661–719 (2019)
Borodin, A.N., Salminen, P.: Handbook of Brownian Motion: Facts and Formulae. Birkhäuser, Basel (2002)
Callegaro, G., Ceci, C., Ferrari, G.: Optimal reduction of public debt under partial observation of the economic growth. Finance Stoch. 24, 1083–1132 (2020)
Christensen, S., Salminen, P.: Multidimensional investment problem. Math. Financ. Econ. 12, 75–95 (2018)
Colaneri, K., Eksi, Z., Frey, R., Szölgyenyi, M.: Optimal liquidation under partial information with price impact. Stoch. Process. Appl. 130, 1913–1946 (2020)
Cox, A.M., Peskir, G.: Embedding laws in diffusions by functions of time. Ann. Probab. 43, 2481–2510 (2015)
Dammann, F., Ferrari, G.: On an irreversible investment problem with two-factor uncertainty. Quant. Finance 22, 907–921 (2021)
De Angelis, T.: A note on the continuity of free-boundaries in finite-horizon optimal stopping problems for one-dimensional diffusions. SIAM J. Control Optim. 53, 167–184 (2015)
De Angelis, T.: Optimal dividends with partial information and stopping of a degenerate reflecting diffusion. Finance Stoch. 24, 71–123 (2020)
De Angelis, T., Federico, S., Ferrari, G.: Optimal boundary surface with stochastic costs. Math. Oper. Res. 42, 1135–1161 (2017)
De Angelis, T., Ferrari, G., Moriarty, J.: A nonconvex singular stochastic control problem and its related optimal stopping boundaries. SIAM J. Control Optim. 53, 1199–1223 (2015)
De Angelis, T., Peskir, G.: Global \(C^{1}\) regularity of the value function in optimal stopping problems. Ann. Appl. Probab. 30, 1007–1031 (2020)
De Angelis, T., Stabile, G.: On Lipschitz continuous optimal stopping boundaries. SIAM J. Control Optim. 57, 402–436 (2019)
Décamps, J.P., Mariotti, T., Villeneuve, S.: Investment timing under incomplete information. Math. Oper. Res. 30, 472–500 (2005)
Décamps, J.P., Villeneuve, S.: Learning about profitability and dynamic cash management. J. Econ. Theory 205, 105522-1–46 (2022)
Detemple, J.: Asset pricing in a production economy with incomplete information. J. Finance 41, 383–391 (1986)
Detemple, J., Kitapbayev, Y.: The value of green energy under regulation uncertainty. Energy Econ. 89, 104807-1–14 (2020)
Dixit, A.K., Pindyck, R.S.: Investment Under Uncertainty. Princeton University Press, Princeton (1994)
Drissi, F.: Solvability of differential Riccati equations and applications to algorithmic trading with signals. Preprint (2022). https://arxiv.org/abs/2202.07478v2
Ekström, E., Lu, B.: Optimal selling of an asset under incomplete information. Int. J. Stoch. Anal. 2011, 543590-1–17 (2011)
Ekström, E., Vaicenavicius, J.: Optimal liquidation of an asset under drift uncertainty. SIAM J. Financ. Math. 7, 357–381 (2016)
El Karoui, N., Karatzas, I.: Integration of the optimal risk in a stopping problem with absorption. In: Azéma, J., et al. (eds.) Séminaire de Probabilités XXIII. Lecture Notes in Mathematics, vol. 1372, pp. 405–420. Springer, Berlin (1989)
Federico, S., Ferrari, G., Rodosthenous, N.: Two-sided singular control of an inventory with unknown demand trend. Preprint (2021). https://arxiv.org/pdf/2102.11555
Ferrari, G., Koch, T.: An optimal extraction problem with price impact. Appl. Math. Optim. 83, 1951–1990 (2021)
Fleming, W.H., Pardoux, E.: Optimal control for partially observed diffusions. SIAM J. Control Optim. 20, 261–285 (1982)
Fleming, W.H., Soner, H.M.: Controlled Markov Processes and Viscosity Solutions. Springer, Berlin (2006)
Forsyth, P.A., Kennedy, J.S., Tse, S.T., Windcliff, H.: Optimal trade execution: a mean-quadratic-variation approach. J. Econ. Dyn. Control 36, 1971–1991 (2011)
Friedman, A.: Variational Principles and Free Boundary Problems. Wiley, New York (1982)
Gapeev, P.V.: Discounted optimal stopping problems in continuous hidden Markov models. Stochastics 94, 335–364 (2022)
Gapeev, P.V., Rodosthenous, N.: Optimal stopping games in models with various information flows. Stoch. Anal. Appl. 39, 1050–1094 (2021)
Gatheral, J., Schied, A.: Optimal trade execution under geometric Brownian motion in the Almgren and Chriss framework. Int. J. Theor. Appl. Finance 14, 353–368 (2011)
Gennotte, G.: Optimal portfolio choice under incomplete information. J. Finance 41, 733–746 (1986)
Guo, X., Tomecek, P.: Connections between singular control and optimal switching. SIAM J. Control Optim. 47, 421–443 (2008)
Guo, X., Zervos, M.: Optimal execution with multiplicative price impact. SIAM J. Financ. Math. 6, 281–306 (2015)
Johnson, P., Peskir, G.: Quickest detection problems for Bessel processes. Ann. Appl. Probab. 27, 1003–1056 (2017)
Karatzas, I.: A class of singular stochastic control problems. Adv. Appl. Probab. 15, 225–254 (1983)
Karatzas, I.: Probabilistic aspects of finite-fuel stochastic control. Proc. Natl. Acad. Sci. USA 82, 5579–5581 (1985)
Karatzas, I., Ocone, D., Wang, H., Zervos, M.: Finite-fuel singular control with discretionary stopping. Stoch. Stoch. Rep. 71, 1–50 (2000)
Karatzas, I., Shreve, S.E.: Connections between optimal stopping and singular stochastic control I. Monotone follower problems. SIAM J. Control Optim. 22, 856–877 (1984)
Karatzas, I., Shreve, S.E.: Methods of Mathematical Finance. Springer, New York (1998)
Klein, M.: Comment on “Investment timing under incomplete information”. Math. Oper. Res. 34, 249–254 (2009)
Koch, T.: On some stochastic control problems arising in environmental economics and commodity markets. PhD Thesis, Universität Bielefeld (2020). https://pub.uni-Bielefeld.de/record/2942205
McDonald, R., Siegel, D.: The value of waiting to invest. Q. J. Econ. 101, 707–727 (1986)
Moreau, L., Muhle-Karbe, J., Soner, H.M.: Trading with small price impact. Math. Finance 27, 350–400 (2017)
Peskir, G.: On the American option problem. Math. Finance 15, 169–181 (2005)
Peskir, G.: Continuity of the optimal stopping boundary for two-dimensional diffusions. Ann. Appl. Probab. 29, 505–530 (2019)
Peskir, G., Shiryaev, A.N.: Optimal Stopping and Free-Boundary Problems. Birkhäuser, Basel (2006)
Rogers, L.C.: Optimal Investment. Springer, Berlin (2013)
Sass, J., Haussmann, U.G.: Optimizing the terminal wealth under partial information: the drift process as a continuous-time Markov chain. Finance Stoch. 8, 553–577 (2004)
Schied, A., Schöneborn, T.: Risk aversion and the dynamics of optimal liquidation strategies in illiquid markets. Finance Stoch. 13, 181–204 (2009)
Shiryaev, A.N.: Optimal Stopping Rules. Springer, New York (1978)
Shiryaev, A.N.: Quickest detection problems: fifty years later. Seq. Anal. 29, 345–385 (2010)
Strauss, W.A.: Partial Differential Equations: An Introduction. Wiley, New York (2007)
Veronesi, P.: Stock market overreactions to bad news in good times: a rational expectations equilibrium model. Rev. Financ. Stud. 12, 975–1007 (1999)
Acknowledgements
The authors would like to thank the Editor, an anonymous Associate Editor and two referees for their pertinent comments on an earlier version of this work.
Funding
Open Access funding enabled and organized by Projekt DEAL.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Competing Interests
The authors declare no competing interests.
Additional information
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
The authors gratefully acknowledge funding by the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation), Project-ID 317210226–SFB 1283.
Appendices
Appendix A: Proof of Proposition 6.9
The proof follows the lines of De Angelis [19, Sect. 4], suitably adapted to the present setting, and is obtained through a series of intermediate results. Fix \((x,z)\in \mathbb{R}^{2}\), set
and observe that \(\sigma _{*} = \tau _{*}\) ℚ-a.s. on \(\mathbb{R}^{2} \setminus \partial \mathcal{C}_{3}\) due to the continuity of paths. It is crucial to show that this equality also holds for the boundary points \((x_{0}, z_{0}) \in \partial \mathcal{C}_{3}\). As it turns out, the cases i) \(\mu _{0} + \mu _{1} \geq 0\) and ii) \(\mu _{0} + \mu _{1} < 0\) should be treated in different fashions, and ii) exhibits some more technical difficulties than i). We start with case i), in which the needed result follows upon using the law of the iterated logarithm.
Proposition A.1
Assume that \(\mu _{0} + \mu _{1} \geq 0\). Let \((x_{n},z_{n})_{n\in \mathbb{N}} \subseteq \mathcal{C}_{3}\) be a sequence with \((x_{n}, z_{n}) \to (x_{0},z_{0}) \in \partial \mathcal{C}_{3} \) such that \(x_{0} = c(z_{0})\). We then have \(\tau ^{*} (x_{n}, z_{n}) \downarrow 0\) as well as \(\widehat{\sigma }_{*} (x_{n}, z_{n}) \downarrow 0\), ℚ-a.s.
Proof
Fix \(\omega \in \Omega \) and assume that \(\limsup _{n \to \infty} \tau ^{*} (x_{n}, z_{n}) (\omega ) =: \delta > 0\). Hence there exists a subsequence (still labelled by \((x_{n},z_{n})\)) such that
which is equivalent to
Using that \(z \mapsto c (z)\) is continuous, we let \(n \to \infty \) and obtain for \(t \in [0, \delta /2]\) that
where the last inequality follows from \(\mu _{0} + \mu _{1} \geq 0\) and Proposition 6.3. On the other hand, by the law of the iterated logarithm, there exists for each \(\epsilon > 0\) a sequence \(t_{n} \downarrow 0 \) such that
Combining (A.2) and (A.3) implies
but since \(\sqrt{2 t \log ( \log (1/ t))}/t \to \infty \) for \(t \downarrow 0\), (A.1) can only happen on a ℚ-nullset. Thus we have \(\tau ^{*}(x_{n},z_{n}) \downarrow 0\), and by replacing “<” in (A.1) by “≤”, we obtain \(\widehat{\sigma }_{*} (x_{n} ,z_{n}) \downarrow 0\) as well. □
Notice that the proof of Proposition A.1 cannot be replicated for case ii) in which \(\mu _{0} + \mu _{1} < 0\) since the last inequality in (A.2) no longer applies. To prove the same result for case ii), we have to take a longer route. The reason lies in the fact that the process \((X,Z)\) is moving towards the right in the state space and hence – keeping in mind that the continuation region \(\mathcal{C}_{3}\) of (6.14) lies below the increasing boundary \(c\) – could possibly evade from the stopping set. In the following, we show that this is not the case by adapting the procedure in De Angelis [19, Sect. 4]. As a first step, we state the following result, whose proof follows the lines of Cox and Peskir [16, Corollary 8] and is thus omitted for the sake of brevity.
Lemma A.2
If \(\mu _{0} + \mu _{1} < 0\) and \(r > \frac{\gamma}{2 \sigma} \vert \mu _{0} + \mu _{1} \vert \), then \(\mathbb{Q}[ \sigma _{*} = \widehat{\sigma }_{*}]=1\).
In the next step, we aim at proving regularity of the boundary points for the stopping set \(\mathcal{S}_{3}\) in the sense of diffusions, that is, for \((x,z) \in \partial \mathcal{C}_{3}\), we have
It is clear from Blumenthal’s 0–1 law that \(\mathbb{Q}_{x,z} [\sigma _{*} > 0 ] =1\) if (A.4) does not hold. Due to the geometry of the problem already mentioned, which implies that the process \((X,Z)\) could evade from the stopping set when \(\mu _{0} + \mu _{1} < 0\), proving (A.4) is not straightforward since we cannot apply an argument similar to the one for Proposition A.1. Instead, we establish (A.4) in two steps and begin by showing that the classical smooth-fit property, i.e., continuity of \(\widehat{v}_{x} ( \, \cdot \, ,z)\), holds at the free boundary.
Lemma A.3
Assume that \(\mu _{0} + \mu _{1} < 0\) and \(r > \frac{\gamma}{2 \sigma} \vert \mu _{0} + \mu _{1} \vert \). For \(\widehat{v}\) of (6.4), we have \(\widehat{v}_{x} (\, \cdot \, , z) \in C(\mathbb{R})\) or, equivalently, \(\widehat{u}_{x} (\, \cdot \, , z) \in C(\mathbb{R})\) for \(\widehat{u}\) of (6.11).
Proof
From (6.21), we obtain
for \((x,z) \in \mathcal{C}_{3}\), and due to (5.10) (which implies an analogous result for \(\widehat{v}\)), we deduce that for a bounded set \(B\), \(\widehat{u}_{xx}\) is bounded on the closure of \(B \cap \mathcal{C}_{3}\). Moreover, we recall that \(\widehat{u}_{x} \leq 0\) in \(\mathcal{C}_{3}\) as verified in the proof of Proposition 6.5. Aiming for a contradiction, we now assume that for \((x_{0}, z_{0}) \in \partial \mathcal{C}_{3}\) so that \(x_{0} = c(z_{0})\), we have
for some \(\delta _{0} > 0\). We take a bounded rectangular neighbourhood \(B\) of \((x_{0},z_{0})\) and define the stopping time \(\tau _{B} := \inf \{ t > 0: (X_{t},Z_{t}) \notin B \}\). Notice that
from the supermartingale property of \((e^{-rt}\widehat{v} (X_{t}, Z_{t}))\). By Lemma 6.7 and because \(t \mapsto Z_{\tau _{B} \wedge t}\) is increasing, we have \(\widehat{u}(X_{\tau _{B} \wedge t}^{x_{0}} , Z_{\tau _{B} \wedge t}^{z_{0}} ) \geq \widehat{u}(X_{\tau _{B} \wedge t}^{x_{0}} , z_{0} )\) ℚ-a.s. Moreover, since the integrand on the right-hand side of (A.6) is bounded on \(B\), we obtain
where \(c_{B}\) is a constant depending on \(B\). Due to the previously discussed local boundedness of \(\widehat{u}_{xx}\), we can apply the Itô–Tanaka formula to the first term in the expectation of (A.7). Let \(\mathcal{L}_{X} := \frac{1}{2} \sigma ^{2} \partial _{xx} + \mu _{0} \partial _{x}\) and denote the local time of \(X\) at \(x_{0}\) by \(L^{x_{0}}\). Moreover, noticing that \(\widehat{u}_{xx} (\, \cdot \, ,z_{0}) = 0\) for \(x > x_{0}\), we obtain
and combining this with (A.7) as well as noticing that \((\mathcal{L}_{X} -r) \widehat{u}(X_{s} ,Z_{s})\) is bounded on \(B\), we find upon using the assumption (A.5) that
This implies \(c_{B} \mathbb{E}^{\mathbb{Q}}_{x_{0},z_{0}} [\tau _{B} \wedge t] \geq \delta _{0} e^{-rt} \mathbb{E}^{\mathbb{Q}}_{x_{0},z_{0}} [L_{ \tau _{B} \wedge t}^{x_{0}}]\), and because \(\mathbb{E}^{ \mathbb{Q}}_{x_{0},z_{0}} [\tau _{B} \wedge t] \approx t\) while \(\mathbb{E}^{\mathbb{Q}}_{x_{0},z_{0}} [L_{\tau _{B} \wedge t}^{x_{0}} ] \approx \sqrt{t}\) (see e.g. Peskir [56, Lemma 15]), we obtain the desired contradiction. Hence \(\widehat{u}_{x} (\, \cdot \, ,z) \in C(\mathbb{R})\). □
We can now state the regularity of the boundary points.
Proposition A.4
Assume that \(\mu _{0} + \mu _{1} < 0\) and \(r > \frac{\gamma}{2 \sigma} \vert \mu _{0} + \mu _{1} \vert \). Then all points \((x,z) \in \partial \mathcal{C}_{3}\) are regular, i.e., we have \(\mathbb{Q}_{x,z}[\sigma _{*} > 0]=0\).
Proof
We argue by contradiction and show that if \(\mathbb{Q}_{x_{0},z_{0}}[\sigma _{*} > 0]=1\) for a boundary point \((x_{0},z_{0}) \in \partial \mathcal{C}_{3}\), it follows that \(\widehat{u}_{x} (x_{0} - ,z_{0}) < 0\) which contradicts Lemma A.3. First, we establish an upper bound for \(\widehat{u}_{x}\). Fix \((x,z) \in \mathcal{C}_{3}\) with \(x > \tilde{x}\), with the latter given by (6.15). Define the stopping time \(\tau _{\epsilon }:= \tau _{\epsilon }(x) := \inf \{ t\geq 0:X_{t}^{x} = \tilde{x} + \epsilon \}\) and observe that by the strong Markov property, we have
Moreover, we let \(\tilde{\tau} := \tilde{\tau}(x) := \inf \{ t > 0:X_{t}^{x} = \tilde{x} \}\), and for
we obtain
Notice that \(\tau _{\epsilon }(x) = \tilde{\tau} (x-\epsilon )\) and \(\tau '\) is optimal for (A.8). Hence subtracting (A.9) from (A.8) yields
Since \((\tilde{x} + \epsilon , Z_{\tau _{\epsilon}}^{z} ) \in \mathcal{C}_{3} \) on \(\{\tau ' > \tau _{\epsilon }\}\) and \(\widehat{u}_{x} \leq 0\) in \(\mathcal{C}_{3}\) (see Proposition 6.5), we must have
and so we obtain
If we now divide by \(\epsilon > 0\) and let \(\epsilon \downarrow 0\), we obtain (since \(\tau _{\epsilon }\downarrow \tilde{\tau}\) and \(\tau ' = \tau ^{*} (x,z)\))
In the next step, we assume by contradiction that there exists \((x_{0},z_{0}) \in \partial \mathcal{C}_{3}\) with \(\mathbb{Q}_{x_{0},z_{0}}[\sigma _{*} > 0]=1\) and take an increasing sequence \(x_{n} \uparrow x_{0}\) such that \(x_{n} > \tilde{x}\) for all \(n \in \mathbb{N}\), which is possible due to Assumption 4.1. Let \(\tau _{n} := \tau ^{*} (x_{n}, z_{n})\) and notice that \(\tau _{n} = \sigma _{n} := \sigma _{*} (x_{n}, z_{0})\) for all \(n \in \mathbb{N}\) due to continuity of paths. Furthermore, \(\sigma _{n}\) decreases in \(n\) and \(\sigma _{n} \geq \sigma _{*} := \sigma _{*} (x_{0}, z_{0})\) since \(x \mapsto X_{t}^{x}\) is increasing. Set \(\tilde{\tau}^{n} := \tilde{\tau}(x_{n})\) and notice that \(\tilde{\tau}^{n} \uparrow \tilde{\tau}\). Moreover, we let \(\sigma ^{\infty }:= \lim _{n \to \infty} \sigma _{n}\) and have
We then obtain
where we used \(x_{0} > \tilde{x}\) as well as \(\tilde{\tau} \wedge \sigma ^{\infty }> 0\) thanks to our assumption that \(\mathbb{Q}_{x_{0} , z_{0}}[\sigma ^{\infty }\geq \sigma ^{*} > 0] =1\). As this contradicts Lemma A.3, the claim follows. □
As a corollary of Lemma A.2 and Proposition A.4, we obtain
Corollary A.5
Assume that \(\mu _{0} + \mu _{1} < 0\) and \(r > \frac{\gamma}{2 \sigma} \vert \mu _{0} + \mu _{1} \vert \). Then for all \((x,z) \in \mathbb{R}^{2}\), we have
This result allows us to state a continuity result for the optimal stopping time with respect to the initial data.
Lemma A.6
Assume that \(\mu _{0} + \mu _{1} < 0\) and \(r > \frac{\gamma}{2 \sigma} \vert \mu _{0} + \mu _{1} \vert \). Then we have
for any point \((x,z) \in \mathbb{R}^{2}\) and for any sequence \((x_{n},z_{n}) \to (x,z)\). In particular, if \((x,z)\in \partial \mathcal{C}_{3}\), the limit is zero.
Proof
Let \((x,z) \in \mathbb{R}^{2}\) and denote \(\tau _{n} := \tau ^{*} (x_{n} , z_{n})\) as well as \(\tau := \tau ^{*} (x,z)\) for simplicity. In order to show lower semicontinuity, we fix \(\omega \in \Omega \) outside of a nullset. For \(\tau (\omega ) = 0\), we are finished and thus assume \(\tau (\omega ) > \delta > 0\). Due to Proposition 6.8, there exists \(k_{\delta , \omega} > 0\) such that \(c ( Z_{t} (\omega ) ) - X_{t} (\omega ) > k_{\delta , \omega} \) for all \(t \in [0, \delta ]\). The map \((t,x,z) \mapsto c(Z_{t}^{z} (\omega )) - X_{t}^{x} (\omega )\) is uniformly continuous on any compact \([0, \delta ] \times K\); hence we can find \(N_{\omega }\geq 1\) such that for all \(n \geq N_{\omega}\) and \(t \in [ 0 ,\delta ]\), \(c ( Z_{t}^{z_{n}} (\omega ) ) - X_{t}^{x_{n}} ( \omega ) > k_{ \delta , \omega} \), and therefore \(\liminf _{n \to \infty} \tau _{n} (\omega ) \geq \delta \). Since \(\omega \) and \(\delta \) were arbitrary, we obtain \(\liminf _{n \to \infty} \tau _{n} \geq \tau \) ℚ-a.s. and thus lower semicontinuity. By employing similar arguments, we can show \(\limsup _{n \to \infty} \widehat{\sigma }_{n} \leq \widehat{\sigma }\) ℚ-a.s. and the claim thus follows together with Corollary A.5. □
Before we finally state the proof of Proposition 6.9, we give a probabilistic representation of \(v_{x}\) by arguments similar to those in the proof of Lemma 6.4.
Lemma A.7
For all \((x,z) \in \mathbb{R}^{2} \setminus \partial \mathcal{C}_{3}\), we have
We are now ready to prove Proposition 6.9.
Proof
of Proposition 6.9 The first statement trivially holds true for \((x,z) \in \text{int}(\mathcal{S}_{3})\) and \((x,z) \in \mathcal{C}_{3}\), due to Lemma 6.2. It thus remains to prove that \(\triangledown _{x,z} \widehat{v}\) is continuous across the boundary \(\partial \mathcal{C}_{3}\). Let \((x_{0},z_{0}) \in \partial \mathcal{C}_{3}\) and take a sequence \((x_{n}, z_{n}) \to (x_{0} ,z_{0})\) with \(\tau _{n} := \tau ^{*} (x_{n},z_{n})\). For a fixed \(t > 0\), we notice that \((X_{t}, Z_{t} )\in \mathcal{C}_{3}\) on \(\{ \tau _{n} > t \}\) and thus, upon using the tower and the Markov property, we obtain
Due to Assumption 4.1, we can invoke dominated convergence as well as Lemma A.6 to obtain
and hence the continuity of \(\widehat{v}_{x}\) across the optimal boundary. The continuity of \(\widehat{v}_{z}\) across the free boundary follows similarly. For the last claim, we observe that Lemma 6.2 implies
for all \((x,z) \in \mathcal{C}_{3}\). But the right-hand side of (A.10) only involves functions which are continuous on \(\mathbb{R}^{2}\); hence we deduce that \(\widehat{v}_{xx}\) admits a continuous extension to \(\overline{\mathcal{C}}_{3}\) and is therefore bounded there. It follows that \(\widehat{v}_{x} ( \, \cdot \, , z)\) is locally Lipschitz-continuous on \(\overline{\mathcal{C}}_{3}\), with a Lipschitz constant \(K(z)\) that is locally bounded on ℝ. Because \(\widehat{v}_{x}( \, \cdot \, , z)\) is infinitely many times continuously differentiable in the stopping region \(\mathcal{S}_{3}\) (and hence locally bounded there as well), we conclude that \(\widehat{v}_{xx} \in L_{{\mathrm{{loc}}}}^{\infty }(\mathbb{R}^{2}) \). □
Appendix B: Proof of Proposition 6.10
Let \(R >0\) and define \(\tau _{R} := \inf \{ t \geq 0: \vert X_{t} \vert \geq R \text{ or } \vert Z_{t} \vert \geq R \}\). Since \(\widehat{v} \in C^{1} (\mathbb{R}^{2} )\) and \(\widehat{v}_{xx} \in L_{{\mathrm{{loc}}}}^{\infty }(\mathbb{R}^{2})\), we can apply a weak version of Itô’s lemma (see e.g. Bensoussan and Lions [8, Lemma 2.8.1 and Theorem 2.8.5]) up to the stopping time \(\tau _{R} \wedge T\) for some \(T > 0\), which results in
The right-hand-side of (B.1) is well defined because \(Z\) is deterministic, \(X\) has an absolutely continuous transition density and \(\mathcal{L}_{X,Z} \widehat{v}\) is defined up to a set of zero Lebesgue measure. Since \(\widehat{v}\) solves the free-boundary problem (6.28), we have
for almost all \((x,z) \in \mathbb{R}^{2}\). Using again that the transition density of \(X\) is absolutely continuous with respect to Lebesgue measure, (B.1) becomes
Now upon employing a change of measure as in Sect. 5, we obtain
where \(\pi = e^{\frac{\gamma}{\sigma}(x+ z)}/(1 + e^{\frac{\gamma}{\sigma}(x+ z)})\). Due to Assumption 4.1, it is easy to verify that taking limits in (B.2) yields
Furthermore,
where \(\pi = e^{\frac{\gamma}{\sigma}(x+ z)}/(1 + e^{\frac{\gamma}{\sigma}(x+ z)})\) and the last inequality follows again from Assumption 4.1. Hence given the finiteness of the expectation in (B.4), we can apply dominated convergence to interchange expectation and limits as \(R \uparrow \infty \) and \(T \uparrow \infty \). Combining this with (B.3) gives (6.29), which completes our proof. □
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Dammann, F., Ferrari, G. Optimal execution with multiplicative price impact and incomplete information on the return. Finance Stoch 27, 713–768 (2023). https://doi.org/10.1007/s00780-023-00508-y
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00780-023-00508-y
Keywords
- Optimal execution problem
- Multiplicative price impact
- Singular stochastic control
- Partial observation
- Optimal stopping