1 Introduction

A classical topic in finance and actuarial science is that of optimal dividend distribution for a company, which can be phrased as the problem of determining the optimal timing and sizes of dividend payments in the presence of bankruptcy risk, where the usual objective is to maximize the expected value of the cumulative discounted dividend payments until bankruptcy. The earliest work in this setting can be traced back to De Finetti [8] who studied the dividend problem for an insurance company under the binomial model. In continuous time, the problem was posed and solved in a Brownian motion model for the cash reserves by Jeanblanc-Picqué and Shiryaev [20] and Asmussen and Taksar [2], using optimal control theory. Since then an extensive literature has appeared on the dividend problem and its extensions, including reinsurance (e.g. [26]), optimal investment of the reserves (e.g. [18]), tax and proportional cost (e.g. [6, 23]), and growth options (e.g. [7]).

In general, the form of the optimal dividend policy has been found to depend on the expected growth rate and variability of future revenues, and on the discount rate. These quantities will evolve in time, reflecting changing market and economic conditions, and those changes may happen gradually or occur abruptly and be more substantial. Here we focus on the changes of the latter type (also called regime shifts or switches) and model the cumulative net revenues of the company as a Brownian motion with the drift and volatility modulated by a finite state Markov chain, and the discount rate as a deterministic function of the chain. Since Hamilton [16, 17], a substantial econometric literature has appeared that supports the use of Markov regime-switching models to describe business cycles, term structure of interest rates and other macroeconomic quantities. Such models have been shown to be capable of capturing occasional simultaneous and substantial changes of the parameters. Regime-switching models also have the advantage of retaining a degree of analytical tractability, and models from this class can in principle approximate a given diffusion arbitrarily closely by taking the state space large enough and specifying the generator matrix appropriately. In the mathematical finance literature, regime-switching models have become more popular, and have found their applications in stock price models, interest rate models and the real option literature. See e.g. Boyarchenko and Levendorskiǐ [4], Buffington and Elliott [5], Driffill et al. [9], Duan et al. [10], Elliott et al. [12], Guo and Zhang [14], Jiang and Pistorius [21], Naik [24] for derivative pricing, Elliott and van der Hoek [11] and Guidolin and Timmermann [13] for asset allocation, Bäuerle [3], Li and Lu [22], Zhu and Yang [28] and Asmussen [1] for ruin and risk theory, and Guo et al. [15] for irreversible investment.

In this regime-switching setting, we consider the problem of the management of the company to find a dividend distribution policy that maximizes expected discounted dividend payments until bankruptcy, which is defined to occur at the first moment when the level of the cash reserves hits zero. We restrict ourselves to the case that the management can only control the timing and size of the dividend payments. In the case that the drift is positive in every regime, we show that it is optimal to adopt a barrier-type strategy at certain positive levels that depend on the current regime, that is, it is optimal to make the minimal payments needed to keep the cash reserves below these barrier levels. When a regime switch occurs, dividend payments are to be postponed or brought forward in time, according to whether the barrier jumps up or down, and in the latter case a lump sum should be paid if the reserves were above the new barrier at the moment of the switch. In the case of a single regime, this strategy reduces to the classical constant barrier strategy that was found before by Asmussen and Taksar [2].

After an adverse economic regime switch, it could happen that the expected net revenue of the company becomes negative, in which case the optimal strategy takes a different form. Intuitively, it is clear that if the drift is negative and the reserves are sufficiently small, it will be optimal to liquidate the company by paying out the reserves as a lump sum. In the absence of regime switching, this optimality actually holds irrespective of the size of the reserves. In the presence of regime switching, however, we find that it is optimal to continue the business if the drift is small and negative and the reserves are not too small: the prospect of switching to a better regime with suitable positive drift outweighs the risk of ruin. In this case, the value function is not concave, which differs from what is usually found in singular control problems. An explicit solution is derived in Sect. 5 in the case of two regimes.

The dividend optimization problem gives rise to a singular control problem, whose HJB equation takes the form of a coupled system of variational inequalities, due to the fact that the problem is driven by a two-dimensional Markov process. A commonly used direct approach for explicitly solving optimal control problems proceeds by guessing a candidate optimal solution, constructing a corresponding value function, assuming smoothness if necessary, and subsequently verifying its optimality by employing a verification result. Here we follow a different approach to construct the candidate value function, by directly employing a dynamic programming equation. We prove that the value function is the fixed point of a certain contraction operator, which is given explicitly in terms of the initial data, and derive an explicit iterative algorithm to calculate the value function, which ‘decouples’ the different regimes such that, at any stage, one-dimensional control problems are solved. This construction yields in particular that the value function is C 2, which implies that the value function is a classical solution of the HJB equation. At this point, it is worth mentioning that although it is possible to follow the direct approach, this seems to become intractable if the number of states is large, as it leads to a large collection of systems of coupled nonlinear equations (corresponding to different orderings of the dividend levels).

After the first version of this paper was written, we discovered a related work on optimal dividend problems by Sotomayor and Cadenillas [27]. In a setting that is a particular case of ours, with two regimes and constant rate of discounting, they solve three dividend distribution problems with bounded and unbounded dividend rates, and in the presence of fixed cost, respectively, under the assumption of existence of a solution to the smooth fit equation.

The remainder of the paper is organized as follows. In Sect. 2, we give a statement of the problem and present a dynamic programming equation and related theorem. In Sects. 3 and 4, we present the optimal solution and give a proof by constructing an iterative algorithm to calculate the value function V. Section 5 is devoted to a case study of the setting of two regimes, with a numerical illustration of the sensitivities of the optimal barrier levels to the different parameters. Section 6 concludes. Some proofs are presented in the Appendix.

2 Preliminaries and first results

2.1 Problem formulation

Let {W t :t≥0} be a Wiener process and {Z t :t≥0} a continuous time Markov chain with finite state space E and generator matrix Q=(q ij ) i,jE , independent of W. Assume that the cash reserves X={X t ,t≥0} evolve, in the absence of dividend payments, as a regime-switching linear Brownian motion, that is, X satisfies the SDE

$$\mathrm{d}X_t = \mu(Z_t)\,\mathrm{d}t +\sigma(Z_t)\,\mathrm{d}W_t, \qquad X_0=x>0,\qquad Z_0=i,$$

where Z represents the state of economy. For every state i in E, both drift parameter μ(i) and volatility parameter σ(i)>0 are assumed to be known constants. In case there is no notational confusion possible, we write μ i and σ i for μ(i) and σ(i), respectively. The processes X and Z are defined on some filtered probability space \((\varOmega , \mathcal{F}, \mathbf{F}, \mathbb{P})\) where \(\mathbf{F} =\{\mathcal{F}_{t}, t\geq0\}\) denotes the right-continuous completed filtration jointly generated by X and Z. We denote by ℙ x,i and ℙ x the measure ℙ conditioned on {X 0=x,Z 0=i} and {X 0=x}, respectively, and write \(\mathbb{E}_{x,i}\) and \(\mathbb{E}_{x}\) for the corresponding expectations. We assume that the processes X and Z are both fully observable to the shareholders, and that these decide on the dividend strategies on the basis of the available information.

A dividend strategy D is a nondecreasing and right-continuous stochastic process D={D t :t≥0} with D 0−=0. Here D t represents the cumulative amount of dividends that has been paid out until time t. We assume that, apart from reducing the reserves, dividend payments have no effect on the business and that there are no transaction costs associated to the payment or receipt of dividends. The dynamics of the risk reserve process U={U t :t≥0} in the presence of dividend payments are then given by

$$ \mathrm{d}U_t = \mathrm{d}X_t -\mathrm{d}D_t$$
(2.1)

for all t until the time τ of bankruptcy and dU t =0 for t after τ, where

$$\tau=\inf\{t\geq0:U_t = 0 \}$$

is the first time that U hits zero. To avoid degeneracies, only those dividend strategies will be considered that have no lump sum dividend payments larger than the current level of the reserves. A dividend strategy D is called admissible if D is F-adapted, dD t =0 for tτ and

$$ U_{t-}\geq D_{t}-D_{t-}\quad \mbox{for all}\ t<\tau.$$

Denoting by \(\mathcal{D}\) the set of admissible dividend strategies, the objective function of the shareholders is given by

$$ V(x,i) = \sup_{D\in\mathcal{D}} V_D(x,i),$$
(2.2)

where V D denotes the expected value of the discounted dividends until the time of ruin τ under the dividend strategy D,

$$V_D(x,i) = \mathbb{E}_{x,i} \biggl[\int _0^\tau\mathrm{e}^{-\int_0^tr(Z_s)\,\mathrm{d}s}\,\mathrm{d}D_t \biggr],$$

with r:E→(0,∞) the Markov-modulated rate of discounting. The problem for the shareholders is to identify a dividend strategy \(D^{*}\in\mathcal{D}\) that attains the supremum in (2.2), that is, \(V \equiv V_{D^{*}}\).

2.2 A priori bounds

Assume for the moment that there is only a single regime, E={i}. Then we are back in the classical linear Brownian motion setting that was investigated in Asmussen and Taksar [2]. They showed that if μ i >0, the optimal strategy is a constant barrier strategy at the level

$$ a_i^*=\frac{\sigma_i^2}{\sqrt{\mu_i^2+2r_i\sigma_i^2}}\ln\frac {\sqrt{\mu_i^2+2r_i\sigma_i^2}+\mu_i}{\sqrt{\mu_i^2+2r_i\sigma_i^2}-\mu_i}.$$
(2.3)

According to this strategy, the overflow of the reserves above the level \(a_{i}^{*}\) is immediately paid out as dividends. The corresponding value function is given by

$$ V^*_i(x)= \begin{cases}W^{(r_i)}_i(x)/W^{(r_i)\prime}_i(a_i^*)\,,&0\leq x\leq a_i^*,\\[3pt]x-a_i^*+\mu_i/r_i, &x\geq a_i^*,\end{cases} $$
(2.4)

where

$$ W^{(q)}_i(x) = \frac{2}{\sigma_i^2}\;\frac{\mathrm{e}^{\lambda _i^+ x}-\mathrm{e}^{\lambda_i^{-}x}}{\lambda_i^+ - \lambda_i^-}$$
(2.5)

and \(\lambda^{-}_{i}<0 < \lambda^{+}_{i}\) denote the roots of the equation \(\frac{1}{2} \sigma_{i}^{2} \lambda^{2}+\mu_{i} \lambda-q=0\), i.e.,

$$ \lambda_i^\pm= \lambda_i^\pm(q)= - \frac{\mu_i}{\sigma_i^2} \pm \sqrt{ \biggl(\frac{\mu_i}{\sigma_i^2}\biggr)^2+\frac{2q}{\sigma_i^2}}.$$
(2.6)

Equations (2.3)–(2.6) show that the value function and optimal level are both functions of the drift and of the rate of discounting per unit of squared volatility. This observation leads one to expect that V(x,i) is bounded above and below by the values V +(x) and V (x) of firms operating in a more or less favorable environment, with volatility constant equal to one and with drift and discounting equal to \((\frac{\mu_{+}}{\sigma^{2}_{+}},\frac{r_{+}}{\sigma^{2}_{+}}) =(\max_{i\in E}\frac{\mu_{i}}{\sigma^{2}_{i}},\min_{i\in E}\frac{r_{i}}{\sigma^{2}_{i}})\) and \((\frac{\mu_{-}}{\sigma^{2}_{-}},\frac{r_{-}}{\sigma^{2}_{-}}) = (\min_{i\in E}\frac{\mu_{i}}{\sigma^{2}_{i}}, \max_{i\in E}\frac{r_{i}}{\sigma^{2}_{i}})\), respectively. The following result confirms that these explicit bounds indeed hold true.

Proposition 2.1

If μ >0, we have

$$ V_{-}(x) \leq V(x,i) \leq V_{+}(x)$$
(2.7)

for all x≥0,iE.

The bounds in (2.7) will be employed in the construction of the optimal value function in Sect. 4.

2.3 Dynamic programming equation and comparison result

The following dynamic programming equation for the value function of the singular control problem (2.2) will form the basis for its solution.

Proposition 2.2

We have

$$ V(x,i) = \sup_{D\in\mathcal{D}} \mathbb{E}_{x,i} \biggl[\int_0^{\tau\wedge\zeta}\mathrm {e}^{-\varLambda _{t}}\,\mathrm{d}D_t+ \mathrm{e}^{-\varLambda _{\tau\wedge\zeta}}V(U_{\tau\wedge\zeta},Z_{\tau\wedge\zeta}) \biggr],$$
(2.8)

where ζ denotes the epoch of the first regime switch and \(\varLambda _{t} = \int_{0}^{t} r(Z_{s})\,\mathrm{d}s\).

The proof of Proposition 2.2 is given in the Appendix. This dynamic programming equation is associated with the Hamilton–Jacobi–Bellman equation for the value function given by

$$ \max \bigl\{\mathcal{G} w(x,i) - r(i) w(x,i), 1 -w'(x,i) \bigr\} =0,\quad x>0,\ i\in E,$$
(2.9)

where ′ denotes the partial derivative with respect to x and \(\mathcal{G}\) denotes the infinitesimal generator of (X,Z), which acts on functions w:[0,∞)×E→[0,∞) with w(⋅,i)∈C 2([0,∞)) for iE as \(\mathcal{G}w(x,i) = \mathcal{G}_{o} w(x,i) + \mathcal{G}_{s} w(x,i)\), where

$$ \mathcal{G}_o w(x,i) = \frac{\sigma_i^2}{2}w''(x,i)+ \mu_i w'(x,i), \qquad\mathcal{G}_s w(x,i)= \sum_{j\in E}q_{ij}\bigl[w(x,j) - w(x,i)\bigr].$$

The next result shows that any sufficiently regular supersolution of the HJB equation (2.9) dominates the value function.

Theorem 2.3

Assume that there exists a function w=(w(⋅,i),iE), with w(⋅,i), iE, C 1 functions on (0,∞) that are piecewise C 2 and satisfy for x>0

Then:

  1. (i)

    We have w(x,i)≥V(x,i) for all x≥0 and iE.

  2. (ii)

    If in addition w=V D for some \(D\in\mathcal{D}\), then D is an optimal strategy and Vw.

Proof

(i) Fix an arbitrary \(D\in\mathcal{D}\) and let U be the corresponding risk process. The statement will follow once we have shown that w(x,i)≥V D (x,i). Applying a generalized form of Itô’s lemma to the process \(\{\mathrm{e}^{-\varLambda _{T\wedge\tau}}w(U_{T\wedge\tau},Z_{T\wedge \tau}), T\ge0\}\), we find that

(2.10)

where \(D_{t} = D^{c}_{t} + \sum_{0\leq s\leq t}\varDelta D_{s}\) and (M Tτ ) is the local martingale with

Here the last integration is over the set [0,t]×[0,N] and \(\widetilde{\pi}= \pi-\nu\) is a compensated random measure,Footnote 1 where \(\pi(\mathrm{d}t,\mathrm{d}j)=\sum_{s\ge0}1_{\{\varDelta Z_{s}(\omega )\neq0\}} \delta_{(s,Z_{s}(\omega ))}(\mathrm{d}t,\mathrm{d}j)\), with δ (s,z) denoting the Dirac measure at the point (s,z), and the compensator ν is given by

$$\nu(\mathrm{d}t,\mathrm{d}j) = p_{Z_{t-}}(j) [-q_{Z_{t-},Z_{t-}}]\delta(\mathrm{d}j) \,\mathrm{d}t = q_{Z_{t-},j}\delta(\mathrm{d}j)\,\mathrm{d}t, \quad j\in E,$$

where \(p_{Z_{t-}}(j) = \frac{q_{Z_{t-},j}}{-q_{Z_{t-},Z_{t-}}} =P(Z_{t}=j|Z_{t-},\varDelta Z_{t}\neq0)\), where δ is the counting measure on E. Notice from (2.10) that as M is bounded below and M 0=0, M is a supermartingale with \(\mathbb{E}[M_{T\wedge\tau}]\leq0\). In view of the HJB equation (2.9), the first three terms of right-hand side of (2.10) are nonpositive, so that taking expectations yields that

$$w(x,i) \geq\mathbb{E}_{x,i} \bigl[ \mathrm{e}^{-\varLambda _{T\wedge\tau}}w(U_{T\wedge\tau},Z_{T\wedge \tau})\bigr] +\mathbb{E}_{x,i} \biggl[ \int_0^{T\wedge\tau}\mathrm {e}^{-\varLambda _s} \, \mathrm{d} D_s \biggr].$$

By letting T→∞ and invoking the monotone convergence theorem and the fact that w is nonnegative, we obtain w(x,i)≥V D (x,i) and hence w(x,i)≥V(x,i).

(ii) The equality follows since V D V by definition of V and V D V by part (i). □

3 The optimal dividend strategy

Following the classical approach to solving optimal control problems, we next construct a candidate optimal solution. In view of the fact that (U,Z) is a Markov process, we consider strategies that pay out the overflow of the cash reserves above a regime-dependent level.

Definition 3.1

A modulated barrier strategy at level b=(b(i),iE) is a dividend strategy \(D^{b}\in\mathcal{D}\) satisfying

where U b is the risk process (2.1) corresponding to D b.

According to this strategy, dividends are only paid out when U b is at the barrier b, which implies that the process D b is a local time (see Fig 1). It is straightforward to verify that D b can be explicitly expressed in terms of a running supremum as

$$ D^b_t = 0 \vee\sup_{0\leq s\leq t} \biggl\{x+\int_0^s\mu (Z_u)\,\mathrm{d}u+\int_0^s\sigma(Z_u)\,\mathrm{d}W_u - b(Z_s) \biggr\}.$$

Employing the heuristic ‘principle of C 2 fit’ of singular control allows us to define candidate optimal levels as the solution of the system of equations

$$ \bigl(V^b_i\bigr)^{\prime\prime}(b_i)= 0, \quad i\in E,$$
(3.1)

if such a solution exists. In fact, (3.1) follows from Lemma 4.5 and Proposition 4.1 as we shall see later. If the drift is positive in all regimes, this candidate solution is indeed optimal:

Fig. 1
figure 1

Illustrated is the cash reserves process corresponding to a modulated barrier strategy. The barrier levels are represented by horizontal lines. In this case, the barrier jumps down at the moment of the regime switch and a lump sum payment is made

Theorem 3.2

Suppose that we have μ i >0 for all iE. Then there exist levels \(b^{*}= (b^{*}_{i}, i\in E)\) with \(0<b^{*}_{i}<\infty\) that solve the system (3.1), and the following holds true:

  1. (i)

    The optimal value function V is a classical solution of the HJB equation (2.9). In particular, V is equal to the unique solution w={w(x,i),iE} with w(⋅,i)∈C 2([0,∞)) of the system

    (3.2)

    for iE, where w i (x)=w(x,i).

  2. (ii)

    The modulated barrier strategy at b is an optimal policy in (2.2).

If the positive drift condition is not satisfied, it is not necessarily optimal to adopt a modulated barrier strategy. Indeed, in Sect. 5 we show that in the case of two regimes with a small and negative drift in one state and a positive in the other, the optimal dividend barrier depends on the regime as well as on the level of the reserves. In the following section, we give a proof of Theorem 3.2 by presenting an iterative construction of the optimal value function.

4 Algorithm to compute the value function V

Throughout this section we assume that μ i >0 for all iE. We start by observing that the value function V b of a modulated barrier strategy at level b=(b i ,iE) solves a fixed point equation in terms of the function \(W_{i}^{(q)}\).

Proposition 4.1

For iE, we have

$$V^b=T_b\bigl(V^b\bigr),$$

where, for any f:[0,∞)×E→[0,∞),

$$ T_b(f) (x,i) = \begin{cases}W^{(\theta_i)}_i(x) A^f_i(b_i)\\[6pt]\quad-\sum_{j\neq i}q_{ij}\int_0^xW^{(\theta_i)}_i(x-y)f(y,j)\,\mathrm{d}y,& x\in[0,b_i],\\[6pt]x - b_i + T_b(f)(b_i,i), & x\ge b_i,\end{cases} $$
(4.1)

where θ i =r i q ii and, for any function f:[0,∞)×E→ℝ, \(A_{i}^{f}\) is given by

$$ A^f_i(y) = \frac{1}{W^{(\theta_i)\prime}_i(y)} \biggl[1 +\sum_{j\neq i}q_{ij}\int_0^yW^{(\theta_i)\prime}_i(y-z)f(z,j)\,\mathrm{d}z \biggr].$$
(4.2)

The previous result can be utilized to calculate the value function V b of the barrier strategy at b by iterating the map T b :vT b v. Denote by \(\mathcal{B}\) the set

$$\mathcal{B} = \bigl\{f: f_i\in C\bigl([0,\infty)\bigr),f_i(0)=0, f_i(x)/\bigl(1+|x|\bigr)\ \mbox{is bounded}\bigr\},$$

where f i =f(⋅,i), and let \(\|f\| =\max_{i\in E}\sup_{x\ge0}\frac{|f_{i}(x)|}{1+|x|}\) for \(f\in\mathcal{B}\).

Corollary 4.2

The map T b is a contraction on \(\mathcal{B}\) with respect to the norm ∥⋅∥. In particular, for \(f\in\mathcal{B}\) we have

$$ V^b(x,i) = \lim_{n\to\infty}T^n_b(f) (x,i),$$
(4.3)

where the convergence is in ∥⋅∥-norm and \(T^{n}_{b}(f) = T_{b}(T^{n-1}_{b}(f))\) for n>1 with \(T^{1}_{b}=T_{b}\).

Proof of Proposition 4.1

Denote by U i=X iD i the risk process corresponding to dividends D i being paid according to a constant barrier strategy at b i , with \(X^{i}_{t}=x+\mu_{i}t + \sigma_{i} W_{t}\). Let \(\tau^{b}=\inf\{t\ge0:U_{t}^{b}<0\}\) and \(\tau^{i} = \inf\{t\ge0: U^{i}_{t}<0\}\) be the ruin times of U b and U i, and denote by ζ the epoch of the first regime switch and by η(a) an independent exponential random time with mean 1/a. Then we find that the ensemble (U t ,Z 0=i,t<τ b∧ζ) is in distribution equal to \({(U^{i}_{t},t<\tau^{i}\wedge\eta(-q_{ii}))}\). Thus, the value z 1(x,i) of the discounted dividends received before ζ is given by

where θ i =r i q ii and in the last line we used (2.4). Similarly, the value z 2(x,i) of the discounted dividends received after ζ satisfies, in view of the Markov property,

Employing the identity (see e.g. Pistorius [25], Theorem 1) for x∈[0,b i ]

(4.4)

and the fact that V b(x,i)=z 1(x,i)+z 2(x,i), we find the result as stated. □

Proof of Corollary 4.2

Note that \(\mathcal{B}\) endowed with the norm ∥ ⋅ ∥ is a complete metric space and that T maps \(\mathcal{B}\) to itself, by definition of T and the fact that \(W^{(\theta_{i})}_{i}\) is C 1. Subsequently we see that

where h i is given in (4.4) (with b replaced by b i ) and \(C = \max_{i}\sum_{j\neq i}\frac {q_{ij}}{\theta_{i}}< 1\). Here we used that \(\int_{0}^{b_{i}} h_{i}(x,y)\,\mathrm{d}y = 1 -\mathbb{E}_{x,i}[\mathrm{e}^{-\theta_{i}\tau^{i}}]\). Thus it follows that T is a contraction on \(\mathcal{B}\), which implies the convergence in (4.3). □

4.1 Iteration

In a next step, we consider the auxiliary control problem with a prescribed payoff function v to be received at the epoch of the first regime switch ζ, i.e.,

$$ (Uv) (x,i) = \sup_{D\in\mathcal{D}} \mathbb{E}_{x,i}\biggl[\int_0^{\tau\wedge\zeta}\mathrm {e}^{-\varLambda _t}\,\mathrm{d}D_t + 1_{\{\zeta<\tau\}}\mathrm{e}^{-\varLambda _{\zeta}}v(U_{\zeta},Z_{\zeta}) \biggr].$$
(4.5)

This singular control problem can be solved explicitly if v lies in the set of smooth concave payoff functions \(\mathcal{C} =\{v\in\mathcal{B}: \text{$v_{i}$ is increasing and concave}, i\in E\}\).

Proposition 4.3

Let \(v\in\mathcal{C}\). Then Uv(⋅,i)∈C 2[0,∞) for iE, and the optimal strategy in (4.5) is given by a regime-switching barrier strategy at the levels \(b^{v}=(b^{v}_{i}, i\in E)\), \(0<b^{v}_{i}<\infty\), given by

$$ b^v_i = \inf\bigl\{b\ge0:A^v_i(b) \ge A^v_i(x)\ \mbox{\textit{for all} $x\ge 0$}\bigr\},$$
(4.6)

with A v given in (4.2).

Supposing that the map U:vUv preserves concavity and smoothness, this proposition can be applied iteratively, as follows: Initialize by setting n=0 and v=v 0 for some \(v_{0}\in\mathcal{B}\) and then

  1. (1)

    find \(b^{v}=(b^{v}_{i},i\in E)\) in (4.6);

  2. (2)

    set \(v \leftarrow T_{b^{v}}(v)\), nn+1 and v n v, and return to step (1).

The following result shows that the sequence (v n ) generated in this way converges to the value function V as n→∞.

Proposition 4.4

Let \(v_{0}^{\pm}\in\mathcal{C}\) and define \(v_{n}^{\pm}= Uv^{\pm}_{n-1}\) for n≥1. If \(v_{0}^{-}\leq V\leq v_{0}^{+}\), then \(v_{n}^{-} \leq V\leq v_{n}^{+}\) and

$$ V(x,i) = \lim_{n\to\infty} v_n^+(x,i) =\lim_{n\to\infty} v_n^-(x,i),$$

where the convergence is with respect to the norm ∥⋅∥. In particular, V is concave.

In fact, we shall show below that U is a contraction on \(\mathcal{C}\). Notice that Theorem 3.2(i) is now a direct consequence of these results. Indeed, by combining Proposition 4.3 and the dynamic programming equation (2.8), we see that the optimal strategy in (2.2) is given by a modulated barrier strategy at some positive finite levels. Explicit examples of initial functions \(v_{0}^{\pm}\) are the V ± given in Proposition 2.1.

4.2 Proofs

This subsection is devoted to the proofs of Propositions 4.3 and 4.4, which we split into a number of steps. The first step is to verify that the \(b^{v}_{i}\) as defined above are positive and finite, which is a matter of straightforward calculations using the explicit expression (2.5).

Lemma 4.5

(Existence of optimal barrier levels)

Let \(v\in\mathcal{C}\). Then \(b\mapsto A^{v}_{i}(b)\) attains its maximum at some finite and positive b i , which satisfy

$$T_b(v)_i''(b_i):=\frac{\partial^2}{\partial x^2}\, T_b(v) (x,i)\bigg|_{x=b_i} = 0,\quad i\in E.$$

In particular, \(T_{b^{v}}(\cdot, i)\in C^{2}[0,\infty)\) for iE.

The proof of Lemma 4.5 is given in the Appendix. The key step is to verify next that the value function of a barrier strategy at level b v with a concave payoff function v(⋅,i) is itself concave.

Lemma 4.6

(Preservation of concavity)

If \(v\in\mathcal{C}\), then \(T_{b^{v}}(v)\in\mathcal{C}\).

Proof

We first assume that \(v\in\mathcal{C}\cap C^{2}[0,\infty)\), and write b instead of b v to simplify the notation. In view of the smoothness of v and the definition of \(w_{i}(x):=(T_{b^{v}}v)(x,i)\), we can obtain from (2.5) and (4.1) that for x∈(0,b i ),

$$w_i'(x) =W^{(\theta_i)\prime}_i(x)A_i^v(b_i)-\sum_{j\neq i}q_{ij}\int_0^x W^{(\theta_i)\prime}_i(x-y)v(y,j)\,\mathrm{d}y$$

and

From these expressions, (2.5) and vC 2[0,∞), we have \(w_{i}|_{(0,b_{i})}\in C^{4}(0,b_{i})\). In addition, we have \(w_{i}'(b_{i})=1\) from the above expressions and (4.1), and \(w_{i}''(b_{i})=0\) by Lemma 4.5. As a result, w i is in C 2[0,∞).

An application of Itô’s lemma shows that w i satisfies the ODE, for x∈(0,b i ),

$$ f_i^v(x):= \frac{\sigma_i^2}{2}w''_i(x) + \mu_iw'_i(x) - (c_i-q_{ii})w_i(x) + \sum_{j\neq i} q_{ij}v_j(x) = 0,$$
(4.7)

with boundary conditions \(w_{i}(0)=0, w_{i}'(b_{i}) = 1\). Since w i (x)≥0 for x>0 and w i (0)=0, we deduce that \(w_{i}'(0+)\ge 0\). Furthermore, the continuity of w i and the fact that w i (0)=0 and v i (0)=0 imply that

$$\sigma_i^2w_i''(0+)+ 2\mu_i w_i'(0+)=0,$$

so that \(w_{i}''(0+)<0\), as μ i >0 by assumption.

Write now \(\xi_{i}(x)=w_{i}''(x)\) for x>0, and denote \(\xi_{i}(0)=w_{i}''(0+)\). By twice differentiating the first equation of the original system (3.2), which is justified since w i (x)∈C 4(0,b i ) as a consequence of the assumptions, we find that ξ i (x) satisfies the ODE

$$\left\{\begin{array}{l}\displaystyle\frac{\sigma_i^2}{2}\xi_i''(x) + \mu_i\xi_i'(x) -(c_i -q_{ii})\xi_i(x) +\sum_{j\neq i}q_{ij}v''_{j}(x)=0,\quad x\in(0,b_i),\\[9pt]\xi_i(0)< 0, \qquad\xi_i(b_i) = 0,\qquad\xi_i(x)=0,\quad x>b_i.\end{array}\right.$$

Another application of Itô’s lemma then yields for ξ the representation

$$\xi_i(x) = \mathbb{E}_{x} \bigl[\mathrm{e}^{-\theta_iT^i}\xi_i(X_{T^i}) \bigr] +\sum_{j\neq i} q_{ij}\mathbb{E}_{x}\biggl[\int_0^{T^i}\mathrm{e}^{-\theta_is}v_j''(X_s)\,\mathrm{d}s\biggr],$$

where θ i =(c i q ii ) and \(T^{i} = \inf\{t\ge0:X^{i}_{t}\notin(0,b_{i})\}\). Thus, since \(\xi_{i}(X_{T^{i}})\leq0\) and \(v_{j}''(x)\leq0\), it follows that ξ i (x) is nonpositive for all x∈(0,b i ) and iE. In particular, we deduce that xw i (x) is concave and increasing on [0,∞).

Suppose now that \(v\in\mathcal{C}\) and let \(v_{n}\in\mathcal{C}\cap C^{2}[0,\infty)\) be a sequence that pointwise increases to v. Then \(T_{b^{v}}(v)(x,i) = \lim_{n\to\infty} T_{b^{v}}(v_{n})(x,i)\), and the concavity of \(T_{b^{v}}(v)\) directly follows from the fact that the pointwise limit of concave functions is concave. □

We next verify that the modulated barrier strategy at b v is optimal for the problem (4.5).

Lemma 4.7

(Optimality of barrier strategies)

For \(v\in\mathcal{C}\), we have for x>0,iE that \(T_{b^{v}}(v)(x,i) = Uv(x,i)\).

Proof

Defining again \(w_{i}(x) = T_{b^{v}}(v)(x,i)\) and \(f_{i}^{v}(x)\) as in (4.7), we shall verify that \(f^{v}_{i}(x)\leq0\) for x>b i . Next we claim that

$$ f^{v\prime}_i(b_i+)\leq0,\quad i\in E.$$
(4.8)

The claim (4.8) is proved as follows. From the facts that \(w_{i}''(b_{i})=0\) and \(w_{i}''(x)\leq0\) for x<b i (as a consequence of the concavity of w i ), it follows that \(w_{i}'''(b_{i}-)\geq0\). Since both \(w_{i}''(x)\) and \(w_{i}'(x)\) are continuous at x=b i and \(w_{i}'''(b_{i}+)=0\), it follows by considering the left and right limits of \(f^{v\prime}_{i}(x)\) at x=b i that \(f^{v\prime}_{i}(b_{i}-)\geq f^{v\prime}_{i}(b_{i}+)\). Finally, differentiating the identity \(f^{v}_{i}(x)\equiv0\) for x∈(0,b i ) shows \(f^{v\prime}_{i}(b_{i}-)=0\) and thus (4.8) follows.

Noting that \(f^{v\prime}_{i}(x) = -(c_{i}-q_{ii}) + \sum_{j\in E}q_{ij}v_{j}'(x)\) for x>b i , the concavity of v together with (4.8) yields then that \(f^{v\prime}_{i}(x)\leq f^{v\prime}_{i}(b_{i}+)\leq0\) for x>b i , which implies that \(f^{v}_{i}(x)\leq0\) for xb i (noting that \(f^{v}_{i}(b_{i})=0\), by continuity).

Since the w i are C 2 and concave and satisfy (4.7), the assertion of the lemma follows by an argument similar to the one used in the proof of Theorem 2.3. Fix an arbitrary \(D\in\mathcal{D}\) and let U be the corresponding risk process. Applying a generalized form of Itô’s lemma to the process \(\{\mathrm{e}^{-\varLambda _{T\wedge\tau}}w(U_{T\wedge\tau},Z_{T\wedge\tau}), T\ge0\}\), taking expectations and using that \(f^{v}_{i}(x)\leq0\) as in the proof of Theorem 2.3, we find that

By letting T→∞ and invoking the monotone convergence theorem and the fact that w and v are nonnegative and \(f^{v}_{i}(x)\leq0\), we obtain w(x,i)≥Uv(x,i). Since the barrier strategy at level b v is an element of \(\mathcal{D}\), it also holds that Uv(x,i)≥w(x,i), so that w(x,i)=Uv(x,i). □

The convergence of the iteration procedure is an immediate consequence of the following contraction property of Uv.

Lemma 4.8

(Contraction)

The map vUv is a contraction on \(\mathcal{C}\) with respect to ∥⋅∥.

Proof

By Lemmas 4.7 and 4.6, we have \(Uv(x,i) = \sup_{b}T_{b}(v)(x,i) = (T_{b^{v}}v)(x,i)\) and \(Uv\in\mathcal{C}\) for \(v\in \mathcal{C}\). Hence it follows that for \(v,w\in\mathcal{C}\),

$$\|Uv - Uw\| \leq\sup_b \|T_bv-T_bw\| \leq C\|v-w\|,$$

where C<1 and the second inequality follows as in the proof of Corollary 4.2. Thus U is a contraction on \(\mathcal{C}\). □

Proof of Propositions 4.3 and 4.4

Proposition 4.3 directly follows by combining Lemma 4.5 with Lemma 4.7.

From the definition of U and the dynamic programming equation, we directly see that UvVUw if vVw. In particular, taking \(v=v_{0}^{-}\) and \(w=v_{0}^{+}\) and repeatedly applying the former inequality yields that \(v_{n}^{-}\leq V\leq v_{n}^{+}\). It follows from Lemma 4.8 that \((v_{n}^{+})\) and \((v_{n}^{-})\) converge to the unique fixed point of U, which is therefore equal to V. Next note that in view of Lemma 4.6, \(v_{n}^{\pm}\) are concave (as we took \(v_{0}^{\pm}\in \mathcal{C}\)), so that V, a pointwise limit of concave functions, is also concave. This completes the proof of Proposition 4.4. □

5 Case study: two regimes

5.1 Positive drifts

From now on, we restrict ourselves to the case of two regimes, E={0,1}. For the setting of positive drifts, μ 0,μ 1>0, we derive a system of two nonlinear equations for the optimal dividend barriers. We denote by F 0 and F 1 the quadratic polynomials given by

$$ F_k(\lambda) = \frac{1}{2} \sigma^2_k\lambda^2+\mu_k\lambda+q_{kk}-c_k,\quad k=0,1,$$

with two different real roots \(\lambda_{1}^{k}\) and \(\lambda_{2}^{k}\). Consider the fourth order polynomial

$$ F_{0,1}(\lambda) := F_0(\lambda)F_1(\lambda)-q_{00}q_{11}.$$

The equation F k (λ)=0 has two different roots \(\lambda_{-}^{k}<\lambda_{+}^{k}\) given in (2.6), and the equation F 0,1(λ)=0 has four real roots satisfying λ 1<λ 2<0<λ 3<λ 4.

Solving the systems of differential equations in Theorem 3.2 leads to the following result:

Proposition 5.1

Suppose that μ 0,μ 1>0 and let \(b_{0}^{*} < b_{1}^{*}\). Then \((b_{0},b_{1})=(b^{*}_{0},b^{*}_{1})\) solve the two nonlinear equations

where d=(d 1,…,d 4)′ solves the linear system Ad=h, where h=(0,0,1,0)′ and

$$A =\left (\begin{array}{c@{\quad}c@{\quad}c@{\quad}c}1 & 1 & 1 & 1 \\F_{0}(\lambda_1) & F_{0}(\lambda_2) & F_{0}(\lambda_3)& F_{0}(\lambda_4) \\\lambda_1\exp(\lambda_1b_{0}) & \lambda_2\exp(\lambda_2b_{0})& \lambda_3\exp(\lambda_3b_{0}) & \lambda_4\exp(\lambda_4b_{0}) \\\lambda_1^2\exp(\lambda_1b_{0}) & \lambda_2^2\exp(\lambda_2b_{0}) & \lambda_3^2\exp(\lambda_3b_{0}) &\lambda_4^2 \exp (\lambda_4b_{0})\end{array} \right ).$$

The proof of Proposition 5.1 is given in the Appendix.

5.1.1 Sensitivities of the optimal barriers

To illustrate the effects of regime switching and the sensitivities of the optimal barrier levels, we numerically solved the system of nonlinear equations in Proposition 5.1 for different parameter values, and compared the results with the explicit solutions (2.3) and (2.4) corresponding to the absence of regime switching. The nonlinear equations were solved using a Maple routine based on the standard quasi-Newton method. We chose the parameters as in Table 1 and varied μ 0, σ 0, q 00 and r 0 individually while keeping the other parameters fixed—the results are given in Table 2.

Table 1 The parameter set for the comparative statistics
Table 2 The optimal barrier levels for drifts μ 0, volatilities σ 0, transition rates −q 00 and discounting rates r 0

We see that when the drift parameter μ 0 is increased, then initially \(b^{*}_{0}\) and \(b^{*}_{1}\) increase, while they decrease when the drift μ 0 becomes very large. Apparently, for relatively low drift it is optimal to reduce the probability of ruin, while for large drift the effect of discounting takes priority. Table 2 also shows that the two barriers \(b^{*}_{0}\) and \(b^{*}_{1}\) monotonically increase when σ 0 increases. A larger volatility leads to a higher probability of ruin, requiring the company to raise the level of the barrier in order to protect its future operations. We can also observe the effect of the transition rates of the underlying Markov chain. For example, if the rate is −q 00=0.01, the chain spends a large part of the time in state 0 (in equilibrium, 3/3.01≈99.7% of the time), which we find back as \(b^{*}_{0}=1.014\) is very close to \(a_{0}^{*}=1.013\), whereas if −q 00 and −q 11 are of similar size, the chain spends on average similar amounts of time in both states and the level \(b^{*}_{0}\) differs substantially from \(a_{0}^{*}\). Finally, we note that both \(b^{*}_{0}\) and \(b^{*}_{1}\) decrease when the rate of discounting r 0 is increased; if the rate of discounting is higher, it is optimal to increase the dividend payments by lowering the dividend barriers.

5.2 Adverse regime shifts: negative drift

We next consider the case that the drift is positive in one state and negative in the other. Intuitively, it is clear that for sufficiently small reserves a quick bankruptcy of the company is quite likely if the drift is negative, so that it is optimal to liquidate the company by paying out the entire reserves as a lump sum. If, however, the negative drift is moderate and the reserves are not too small, the expected future gains from a regime switch to a ‘good’ state may outweigh the effect of the negative drift, and it may be optimal to continue the business. In that case, a sensible strategy could be to liquidate the company for small initial reserves, but to pay out dividends according to a modulated barrier strategy for larger levels of reserves, which we formalize as follows.

Definition 5.2

A modulated liquidation and dividend barrier strategy at levels d=(d(i),iE) and b=(b(i),iE) is a dividend strategy \(D^{d,b}\in\mathcal{D}\) satisfying

where U d,b is the insurance risk process (2.1) corresponding to D d,b.

Condition (iii) states that all the reserves are paid out as dividends once the risk reserves fall below the level d(Z t ). Define next the critical levels

$$ \varDelta _i = \inf \bigl\{x\ge0: Y_i(x)>0\bigr\},$$

where Y i (x):=μ i c i x+∑ ji q ij (V j (x)−x). Note that if μ i <0, Y i (x) is negative for all x small enough, which implies that Δ i ∈(0,∞]. If Δ i =+∞, which is the case if μ i <0 and |μ i | is sufficiently large, it is optimal in state i to liquidate the company for any level of the reserves, by immediately paying out all the reserves as dividends—this can be directly checked from Theorem 2.3. In the case that μ 0<0<μ 1 and Δ 0<∞ (the case μ 1<0<μ 0 follows by relabeling the states), it turns out that it is optimal to continue paying dividends if the reserves are large enough, where the ‘liquidation’ level \(d^{*}_{0}>0\) solves the smooth fit equation \(V_{0}'(d^{*}_{0}) = 1\). The solution is explicitly given as follows.

Proposition 5.3

Suppose that μ 0<0<μ 1 and Δ 0<∞.

  1. (i)

    The optimal strategy in (2.2) is given by the modulated liquidation and dividend barrier strategy at levels \(d^{*}=(d^{*}_{0},0)\) and \(b^{*}=(b^{*}_{0},b^{*}_{1})\) that solve the system

    $$V_0'\bigl(d^*_0\bigr) = 1,\qquad V_0''\bigl(b^*_0\bigr)=0,\qquad V_1''\bigl(b_1^*\bigr)=0.$$
  2. (ii)

    If \(b^{*}_{1}<b^{*}_{0}\), then \(d^{*}_{0},b^{*}_{0}\), and \(b^{*}_{1}\) solve the system of nonlinear equations

    where \(\epsilon_{i,j} = \mathrm{e}^{\lambda_{i}^{1}d_{0}}[(\lambda^{1}_{i})^{2} -\lambda_{i}^{1}\lambda_{j}]\), \(\phi= (\lambda_{1}^{1})^{2}\mathrm{e}^{\lambda_{1}^{1}d_{0}}- (\lambda_{2}^{1})^{2}\mathrm{e}^{\lambda_{2}^{1} d_{0}}\), and B=(B 1,…,B 4)′ solves the linear system

    $$ A^* B= h,$$

    with h=(d 0,1,q 00,0)′ and \(A^{*}=(A^{*}_{1}, A^{*}_{2}, A^{*}_{3}, A^{*}_{4})\) with columns given by

    $$A^*_i= \left(\begin{array}{c}\exp(\lambda_i d_0)\\\lambda_i\exp(\lambda_i d_0)\\ F_0(\lambda_i) \lambda_i \exp(\lambda_i b_1)\\F_0(\lambda_i) \lambda_i^2\exp(\lambda_i b_1)\end{array} \right) \quad i=1,2,3,4.$$

    The value functions are given by

    where \(\alpha= \frac{\lambda_{2}^{0}}{\lambda_{2}^{0}-\lambda_{1}^{0}}\), \(\beta_{i} = \frac{-q_{ii}}{c_{i}-q_{ii}}\) and

The proof is given in the Appendix. Observe that the value function V 0 is not concave, as there are two disjoint intervals where it has unit slope.

As illustration, we provide next a numerical example of a case where a modulated liquidation-dividend strategy is optimal.

Example

Consider the case where μ 0=−0.08, σ 0=0.40, q 00=−10, c 0=0.06, μ 1=0.14, σ 1=0.50, q 11=−0.001 and c 1=0.08. Numerically solving the system of nonlinear equations in Proposition 5.3, we obtained that d 0=0.086, b 0=1.418 and b 1=1.415, and that the value functions V 0 and V 1 (plotted in Fig. 2) are given by

$$V_0(x)= \begin{cases}x+0.317, & x\ge1.418, \\[2pt]-1174e^{-10.725x} +0.994x \\[2pt]\quad{}+2.43\cdot10^{-4}e^{11.725(x-1.418)}+0.325,& 1.415\leq x \leq1.418,\\[2pt]0.266e^{-10.725x}-1.252e^{-1.536x} + 1.039e^{0.417x},&0.086<x<1.415,\\[2pt]x, & 0\leq x \leq0.086\end{cases}$$

and

$$V_1(x)= \begin{cases}x+0.651, & x\ge1.415, \\[2pt]-2.04\cdot10^{-5} e^{-10.725x} -1.220e^{-1.536x} \\[2pt]\quad+1.222e^{0.417x}, & 0.086 \leq x \leq1.415,\\[2pt]-1.230e^{-1.541x}+1.209e^{0.421x}+0.012x+0.021, & 0\leq x \leq 0.086.\end{cases}$$
Fig. 2
figure 2

The value function V 0(x) and the corresponding optimal dividend barriers d 0,b 0,b 1 in the case where the parameters are μ 0=−0.08, σ 0=0.40, q 00=−10, c 0=0.06, μ 1=0.14, σ 1=0.50, q 11=−0.001, c 1=0.08. Note that V 0 is not concave

6 Conclusion

In this paper, we have shown that in the presence of regime shifts, the optimal dividend policy is given by a threshold strategy set at a level that is a function of the current regime. That is to say, the policy that maximizes the expectation of the net present value of the paid dividends until the moment of default consists of paying out as dividends the overflow of the cash reserves above a certain optimal threshold, where this threshold jumps up or down exactly at the moment when the regime shifts. Hence, at the moment of a regime shift when the key parameters such as drift, volatility and discounting may change, it may be optimal to make a lump sum dividend payment, namely when the threshold level jumps below the current level of the cash reserves. We presented a contraction algorithm for the computation of the optimal threshold levels. As a case study, we numerically investigated the parameter sensitivities of the levels in the case of two regimes. It would be desirable to systematically explore the dependence of the optimal threshold levels on key parameters and its financial significance, which could be achieved by an analytical investigation of its form in specific parametric models; this is a topic left for future research.