Skip to main content
Log in

A Continuous-Time Optimal Insurance Design with Costly Monitoring

  • Published:
Asia-Pacific Financial Markets Aims and scope Submit manuscript

Abstract

We provide a theoretical and numerical framework to study optimal insurance design under asymmetric information. We consider a continuous-time model where neither the efforts nor the outcome of an insured firm are observable to an insurer. The insured may then cause two interconnected information problems: moral hazard and fraudulent claims. We show that, when costly monitoring is available, an optimal insurance contract distinguishes the one problem from the other. Furthermore, if the insured’s downward-risk aversion is weak and if the participation constraint is not too tight, then a higher level of the monitoring technology can mitigate both problems.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2

Similar content being viewed by others

Notes

  1. In much of the literature on contract theory, communication games with costless reporting have been studied a lot in finite-horizon (typically, two or three period) discrete-time models. There are a few exceptions in a literature on insurance frauds; see e.g. Georges et al. (2009).

  2. Under each of the information problems, more comprehensive coverage is associated with high risk (Chiappori 2000). Accordingly,it is difficult to identify whether it is due to their ability, to their laziness, or neither.Much empirical insurance literature has differentiated moral hazard from ex-ante adverse selection by making use of some different dynamic properties of the incentive structures between the two information problems in insurance contracts for various exogenous cases (typically, a reform of regulatory framework); e.g., see Abbring et al. (2003). However, that literature has not used any dynamic optimal (i.e., endogenous) insurance properties.

  3. We focus on an European-type contract in which monitoring can be undertaken, depending deterministically on the reported wealth level, at time \(T\). Intermediate monitoring and stochastic monitoring are out of our scope in this paper, and will be our next research topics in future.

  4. They assume that the contract can depend on the whole path of the wealth \(X^u_\cdot \) whereas we assume that it depends only on its time-\(T\) value. Still, note that they show that the optimal contract depends only on the time-\(T\) value. In this sense, our paper is not a generalization of Cvitanić and Zhang (2007).

  5. Note that, in this model, the loss/gain process of the wealth is continuous. We may extend the process to have jumps under appropriate mathematical regularities. That will be our future work.

  6. The first three derivatives of a function \(f\) are denoted by \(f^{\prime }\), \(f^{\prime \prime }\) and \(f^{\prime \prime \prime }\), respectively.

  7. Much of the previous continuous-time optimal contracting literature assumes exponential utility forms; see e.g. Holmström and Milgrom (1987), Schättler and Sung (1993). However, those forms are of limited use in financial practices. By contrast, we assume general utility forms.

  8. As one referee points out, this implies \((\frac{1}{U_1^\prime })^\prime \ge 1\), i.e., \(\frac{1}{U_1^\prime }\) grows faster than linear utilities with positive slopes. This assumption is actually technical and is not standard in the utility theory where concave utility functions are usually assumed. In contrast to the standard utility theory, \(e^{U_1}\) is included in the insurer’s expected utility under \({{\mathrm{\mathbb {P}}}}\) in our asymmetric information model, as in Eq. (3.5). The concavity of \(U_1\) is then not enough to obtain a necessary and sufficient condition for optimality of a contract, as shown in Eq. (3.9). Instead, the concavity of \(e^{U_1}\) is sufficient for it.

  9. The assumption of the dependence on the time-\(T\) values might look restrictive. Cvitanić and Zhang (2007) show, in a model similar to ours, that it is not restrictive in equilibrium.

  10. For example, payoff functions are kinked in debt contracts and options.

  11. See e.g. Diamond (1984), Hellwig (2000), Williamson (1987).

  12. For example, consider the case that the insured could not commit to the contract at time \(T\). In the case, the insurer could enforce the insured to pay \(F\) in advance (i.e., at time \(0\)) in order to avoid failure to collect \(F\) at time \(T\).

  13. The penalty can be interpreted as reputation loss and the cost of being imprisoned, for example.

  14. As to the Clark–Ocone formula, see Appendix B.

References

  • Abbring, J. H., Heckman, J. J., Chiappori, P. A., & Pinquet, J. (2003). Adverse selection and moral hazard In insurance: Can dynamic data help to distinguish? Journal of the European Economic Association, 1, 512–521.

    Article  Google Scholar 

  • Chiappori, P.A. (2000) Econometric models of insurance under asymmetric information, In Georges, D (ed.) Handbook of Insurance. Norwell, MA: Kluwer Academic Publishers 365–392.

  • Cvitanić, J., & Zhang, J. (2007). Optimal compensation with adverse selection and dynamic actions. Mathematics and Financial Economics, 1, 21–55.

    Article  Google Scholar 

  • Diamond, D. W. (1984). Financial intermediation and delegated monitoring. Review of Economic Studies, 51, 393–414.

    Article  Google Scholar 

  • Georges, D., Giuliano, F., and Picard, P. (2009) Optimal auditing with scoring: Theory and application to insurance fraud, HEC Montreal: Mimeo.

  • Goldstein, R., Ju, N., & Leland, H. (2001). An EBIT-based model of dynamic capital structure. Journal of Business, 74, 483–512.

    Article  Google Scholar 

  • Harrington, S. E., & Niehaus, G. R. (2004). Risk management and insurance (2nd ed.). NY: McGraw-Hill/Irwin.

    Google Scholar 

  • Hellwig, M. F. (2000). Financial intermediation with risk aversion. Review of Economic Studies, 67, 719–742.

    Article  Google Scholar 

  • Holmström, B., & Milgrom, P. (1987). Aggregation and linearity in the provision of intertemporal incentives. Econometrica, 55, 303–328.

    Article  Google Scholar 

  • Karatzas, I., & Shreve, S. E. (1997). Brownian Motion and Stochastic Calculus Second Edition.  : Springer.

  • MacMinn, R., Garven, J. (2000) On corporate insurance, In Georges, D (ed.) Handbook of insurance Norwell, MA: Kluwer Academic Publishers. 541–564.

  • Misumi, T., Nakamura, H., Takaoka, K. (2013) Optimal risk sharing in the presence of moral hazard under market risk and jump risk, HCFR Working Paper G-1–4.

  • Ocone, D. L., & Karatzas, I. (1991). A Generalized Clark Representation Formula, with Application to Optimal Portfolios. Stochastics and Stochastics Reports, 34, 187–220.

    Google Scholar 

  • Revuz, D., & Yor, M. (1999). Continuous martingales and brownian motion (3rd ed.). Berlin: Springer.

    Book  Google Scholar 

  • Rogers, L. C. G., & Williams, D. (2000). Diffusions, markov processes and martingales (2nd ed.)., Itô calculus Cambridge: Cambridge University Press.

    Book  Google Scholar 

  • Rogerson, W. (1985). The first-order approach to principal-agent problems. Econometrica, 53, 1357–1368.

    Article  Google Scholar 

  • Schättler, H., & Sung, J. (1993). The first-order approach to the continuous-time principal-agent problem with exponential utility. Journal of Economic Theory, 61, 331–371.

    Article  Google Scholar 

  • Williamson, S. D. (1987). Costly monitoring, loan contracts, and equilibrium credit rationing. Quarterly Journal of Economics, 102, 135–146.

    Article  Google Scholar 

  • Zvonkin, A. K. (1974). A transformation of the state space of a diffusion process that removes the drift. Mathematics USSR (Sbornik), 22, 129–149.

    Article  Google Scholar 

Download references

Acknowledgments

Financial support from the Grants-in-Aid for Scientific Research (C) and the Dai-ichi Life Insurance Company, Limited and the Dai-ichi Life Research Institute Inc. is gratefully acknowledged.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Hisashi Nakamura.

Appendices

Appendix A: Proof of Proposition 3.1

As in Eq. (2.1), for \(u \in \mathcal{A}_1\), \(X_t^u = W_t + \int _0^t u_s {{\mathrm{\mathrm{d} }}}s\). As it was shown above, \(X^u\) is a driftless Brownian motion under the probability measure \({{\mathrm{\mathbb {P}}}}^u\) characterized by the Radon-Nikodym derivative:

$$\begin{aligned} \frac{{{\mathrm{\mathrm{d} }}}{{\mathrm{\mathbb {P}}}}^u}{{{\mathrm{\mathrm{d} }}}{{\mathrm{\mathbb {P}}}}} = \mathcal{E} \Big ( -\int \limits u {{\mathrm{\mathrm{d} }}}W \Big )_T. \end{aligned}$$

Define the martingale \(M^u\) as

$$\begin{aligned} M^u_t:={{\mathrm{\mathbb {E}}}}^u [ \, e^{ f(X_T^u)} \,| \, \mathcal{F}_t^{X^u} ], \quad t\in [0,T]. \end{aligned}$$

By the Martingale Representation Theorem, there exists an \( \mathbb {F} ^{X^u}\)-adapted process \(H^u \) such that

$$\begin{aligned} M^u_t=M^u_0+\int \limits _0^t H_s^u{{\mathrm{\mathrm{d} }}}X_s^u, \quad t\in [0,T]\,\text {a.s.}. \end{aligned}$$

We also define \(\gamma ^u\) as

$$\begin{aligned} \gamma _t^u:=\frac{H_t^u}{M^u_t}, \quad t\in [0,T]. \end{aligned}$$

Since \(\int \limits \gamma ^u {{\mathrm{\mathrm{d} }}}X^u= \int \limits \frac{{{\mathrm{\mathrm{d} }}}M^u}{M^u}\), it holds that

$$\begin{aligned} M^u=M^u_0\,\mathcal {E}\left( \int \limits \gamma ^u{{\mathrm{\mathrm{d} }}}X^u\right) \end{aligned}$$

and thus

$$\begin{aligned} e^{f(X_T^u)} ={{\mathrm{\mathbb {E}}}}^u [ e^{f(X_T^u)} ] \, \mathcal{E} \Big ( \int \limits \gamma ^u {{\mathrm{\mathrm{d} }}}X^u \Big )_T . \end{aligned}$$
(6.1)

Since the Clark–Ocone formula is applicable to \(e^{f(\cdot )}\),Footnote 14

$$\begin{aligned} H_t^u={{\mathrm{\mathbb {E}}}}^u [ \, e^{ f(X_T^u)} f'(X_T^u) \,| \, \mathcal{F}_t^{X^u} ]. \end{aligned}$$
(6.2)

Since \(X^u\) is the driftless Brownian motion under \({{\mathrm{\mathbb {P}}}}^u\), \(X_T^u \sim N(X_t^u, T-t)\) conditional on \(\mathcal{F}_t^{X^u}\). Hence,

$$\begin{aligned} \gamma _t^u=\frac{{{\mathrm{\mathbb {E}}}}^u [ \, e^{ f(X_T^u)} f'(X_T^u) \,| \, \mathcal{F}_t^{X^u} ]}{ {{\mathrm{\mathbb {E}}}}^u [ \, e^{ f(X_T^u)} \,| \, \mathcal{F}_t^{X^u} ]} =h(X_t^u, t), \end{aligned}$$
(6.3)

as in Eq. (3.1). Therefore, \(\gamma ^u\) is bounded. Taking logarithms on both sides of Eq. (6.1),

$$\begin{aligned} f(X_T^u)&= \log {{\mathrm{\mathbb {E}}}}^u [ e^{f(X_T^u)} ] + \int \limits _0^T \gamma _s^u {{\mathrm{\mathrm{d} }}}X_s^u - \frac{1}{2} \int \limits _0^T (\gamma _s^u)^2 {{\mathrm{\mathrm{d} }}}s \\&= \log {{\mathrm{\mathbb {E}}}}^u [ e^{f(X_T^u)} ] + \int \limits _0^T \gamma _s^u {{\mathrm{\mathrm{d} }}}W_s -\frac{1}{2} \int \limits _0^T \Big \{ (\gamma _s^u)^2 - 2 u_s \gamma _s^u \Big \} {{\mathrm{\mathrm{d} }}}s. \end{aligned}$$

Subtracting \( \frac{1}{2} \int _0^T (u_s)^2 {{\mathrm{\mathrm{d} }}}s \) from both sides and taking expectations under the probability measure \({{\mathrm{\mathbb {P}}}}\),

$$\begin{aligned}&{{\mathrm{\mathbb {E}}}}\left[ f\Big ( W_T + \int \limits _0^T u_s \, {{\mathrm{\mathrm{d} }}}s \Big ) - \frac{1}{2} \int \limits _0^T (u_s)^2 \, {{\mathrm{\mathrm{d} }}}s \right] = \log {{\mathrm{\mathbb {E}}}}^u [ e^{f(X_T^u)} ]\nonumber \\&-\frac{1}{2}{{\mathrm{\mathbb {E}}}}\left[ \int \limits _0^T (\gamma _s^u - u_s)^2 {{\mathrm{\mathrm{d} }}}s \right] \nonumber \\&\le \log {{\mathrm{\mathbb {E}}}}^u [ e^{f(X_T^u)} ]. \end{aligned}$$
(6.4)

Note that \(\log {{\mathrm{\mathbb {E}}}}^u [ e^{f(X_T^u)} ]\) on the right-hand side of Eq. (6.4) is independent of \(u\). If \(\gamma ^u = u\), i.e., if there exists some \(u \in \mathcal{A}_1 \) that satisfies the implementability condition (3.3), it is an optimal solution. Moreover,

$$\begin{aligned} \gamma ^u = u&\Longleftrightarrow&u_\cdot = h(X_\cdot ^u, \cdot ) \qquad \because \hbox {Eq}.(6.3) \nonumber \\&\Longleftrightarrow&X_\cdot ^u = W_\cdot + \int \limits _0^\cdot h(X_s^u, s) {{\mathrm{\mathrm{d} }}}s. \end{aligned}$$
(6.5)

It is well known that, by Zvonkin (1974)(see also Karatzas and Shreve (1997), Chapter 5, Notes, p.396), the last Markovian stochastic differential equation (6.5):

$$\begin{aligned} {{\mathrm{\mathrm{d} }}}X_t^u = {{\mathrm{\mathrm{d} }}}W_t + h(X^u_t, t) {{\mathrm{\mathrm{d} }}}t, \qquad X^u_0=0 \end{aligned}$$

has a unique strong solution \(X^*\). Since it is the strong solution, \( \mathbb {F} ^{X^*} = \mathbb {F} ^W\). \(\square \)

Remark 5.1

The remark right after the statement of Proposition 3.1 is proved as follows. For every adapted process \(u\) satisfying \( \mathbb {E} [\int \limits _0^T(u_t)^2 {{\mathrm{\mathrm{d} }}}t ]<\infty \), it follows from Lebesgue’s dominated convergence theorem that

$$\begin{aligned} J(u)=\lim _{n \rightarrow \infty } J(u^{(n)}) \end{aligned}$$

where \(u^{(n)}_t:=u_t \mathbf {1}_{\{|u_t|\le n\}}\). Since \(u^{(n)}\) is a bounded process, it satisfies Eq. (6.4). By taking the limit \(n \rightarrow \infty \), the process \(u\) also satisfies Eq. (6.4).

Appendix B: Supplementary Note on the Clark–Ocone Formula

The Clark–Ocone formula for general functionals is stated in terms of the Malliavin-Fréchet derivative (see e.g. Ocone and Karatzas 1991; Revuz and Yor 1999; Rogers and Williams 2000), but, in order to show our formula (A.2), the following simple argument is sufficient.

Let us define the function \(g: \mathbf {R} \times [0,T) \rightarrow \mathbf {R} \) by

$$\begin{aligned} g(x,t) := \int \limits _{-\infty }^\infty \frac{1}{\sqrt{2 \pi (T-t)}} e^{-\frac{y^2}{2(T-t)}} e^{f(x+y)} dy, \end{aligned}$$

which is a \(C^2\) function since it is the convolution of \(e^{f(x)}\) and \( \frac{1}{\sqrt{2 \pi (T-t)}} e^{-\frac{x^2}{2(T-t)}}. \) Also, we have

$$\begin{aligned} \mathbb {E} \big [ e^{f(W_T)} \big | \mathcal{F}_t \big ] = g(W_t, t) \quad \text{ a.s. }, \; 0 \le t < T \end{aligned}$$

and thus the process \(\{ g(W_t, t) \}_{0 \le t < T}\) is a martingale. It then follows from Itô’s formula that

$$\begin{aligned} g(W_t, t) = g(0,0) + \int \limits _0^t \frac{\partial g}{\partial x}(W_s ,s) dW_s, \quad 0 \le t < T, \;\; \text{ a.s. } \end{aligned}$$

It remains to prove that

$$\begin{aligned} \frac{\partial g(x,t)}{\partial x}&= \lim _{\epsilon \rightarrow 0} \int \limits _{-\infty }^\infty \frac{1}{\sqrt{2 \pi (T-t)}} e^{-\frac{y^2}{2(T-t)}} \frac{e^{f(x + y+\epsilon )} - e^{f(x+y)} }{\epsilon } {{\mathrm{\mathrm{d} }}}y \\&= \int \limits _{-\infty }^\infty \frac{1}{\sqrt{2 \pi (T-t)}} e^{-\frac{y^2}{2(T-t)}} f'(x+y) e^{f(x+y)} {{\mathrm{\mathrm{d} }}}y. \end{aligned}$$

This can be shown by the following two properties combined with Lebesgue’s dominated convergence theorem:

  • For almost every \(x \in \mathbf {R} ,\) \(\displaystyle \lim _{\epsilon \rightarrow 0} \frac{e^{f(x +\epsilon )} - e^{f(x)}}{\epsilon } = f'(x) e^{f(x)} ;\)

  • Since the Lipschitz constant of \(f\) is \(K:=\sup _{x\in \mathbf {R} \backslash \{x_1,x_2,\cdots ,x_n \}} \left| f^\prime (x)\right| <\infty ,\) we have, for every \(x \in \mathbf {R} ,\)

    $$\begin{aligned} \sup _{0<|\epsilon | < 1} \Big | \frac{e^{f(x + \epsilon )} - e^{f(x)} }{\epsilon } \Big |&= e^{f(x)} \sup _{0<|\epsilon | < 1} \Big | \frac{e^{f(x+\epsilon )-f(x)} - 1 }{\epsilon } \Big | \\&\le e^{f(x)} \sup _{0<|\epsilon | < 1} \frac{e^{|f(x+\epsilon )-f(x)|} - 1 }{|\epsilon |} \\&\le e^{f(x)} \sup _{0<|\epsilon | < 1} \frac{e^{K|\epsilon |} - 1 }{|\epsilon |} \\&= (e^K - 1 ) e^{f(x)}. \end{aligned}$$

Rights and permissions

Reprints and permissions

About this article

Cite this article

Nakamura, H., Takaoka, K. A Continuous-Time Optimal Insurance Design with Costly Monitoring. Asia-Pac Financ Markets 21, 237–261 (2014). https://doi.org/10.1007/s10690-014-9184-9

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10690-014-9184-9

Keywords

JEL Classification

Navigation