1 Introduction

The reward of a fund manager usually increases when the Asset Under Management (AUM) grows, while it decreases when the AUM shrinks. The AUM may grow either because of a higher value of the assets or because of new money flowing into the fund. Good performances of the fund with respect to its relative benchmark are likely to attract new investors. Therefore, contracts based on the AUM create an implicit incentive for the manager to beat the benchmark. We study the problem of a portfolio manager whose compensation depends on the AUM modelled through the relative performances with respect to a benchmark. This framework generalizes the setting of Basak et al. (2007), by considering a market model with one risk-free and one risky asset whose expected returns depend on an unobservable stochastic process, the “market price of risk”. We introduce the realistic assumption that the manager has a limited knowledge on the market, she can only observe stock prices and estimates the market price of risk from them. Therefore, the manager is facing an optimization problem under partial information.

Optimization problems under partial information are usually solved in two steps: the first step, called reduction, consists of deriving the conditional distribution of the market price of risk with respect to the observed information flow; the second step solves the equivalent problem under the observed information. An important feature of our setting is that, while the market of claims contingent to the knowledge of the market price of risk is incomplete, the market restricted to those claims contingent only to stock prices is instead complete. We will exploit this fact to solve the optimization problem applying a martingale approach with the unique equivalent martingale measure (under the restricted setting) and then using a concavification argument to determine the unique optimal solution.

Although both problems of relative incentives and of optimization under partial information have been separately addressed in many papers (see the literature review provided in Sect. 1.1), this one is, to the best of our knowledge, the first contribution that analyses the combined effect of such issues on the optimal strategy of a portfolio manager. We contribute to the literature by providing the solution to the optimization problem in semi-closed form and we present one example where we show that the optimal strategy depends on the risk aversion of the manager and on the economic situation of the market. When the risk aversion of the manager is larger (lower) than that of a manager equipped with a logarithmic utility, she will tend to decrease (increase) her investment in the risky asset to hedge against the future adjustment in the estimates of the unknown parameter. We will also see that the larger the uncertainty on the value of the market price of risk, the greater the impact on the strategy and the consequent benefits in terms of expected utility when following the optimal strategy. Finally, as expected, the effect of learning becomes more tangible for longer time spans.

The paper is organized as follows. After a literature review (Sect. 1.1), Sect. 2 presents the market model and the portfolio optimization problem faced by the manager. In Sect. 3 we solve the optimization problem in two steps. First, we derive the dynamics of the filtered estimate of market price of risk, in order to reduce the problem to a common information flow. This procedure allows to obtain market dynamics driven by a unique source of randomness and hence the market model under partial information turns out to be complete. Second, we apply the martingale method along with concavification to characterize the optimal final wealth and the optimal investments strategy. Section 4 contains a numerical illustration of our results. Conclusions and comments are provided in Sect. 5. Proofs and computations are collected in the Appendix.

1.1 Literature review

The structure of portfolio managers’ compensation is studied for instance in Ma et al. (2019), who show that performance based incentives represent the main form of compensation for portfolio managers in the US mutual fund industry. Of course, this is not the only type of incentive for fund managers. Option-like incentives of different nature (as for example management fees, investor’s redemption options or funding options by prime brokers) apply in fund managers compensation contract, and influence manager’s leverage decisions (see, e.g. Lan et al. 2013; Buraschi et al. 2014)

Basak et al. (2007) compute the optimal strategy followed by the manager under the assumption that she knows exactly the parameters driving the asset price process. They show that, when at an intermediate date the return of the fund is either very low or very large compared to the benchmark, the manager forgets about the implicit incentives determined by the fund-flows and reverts to the normal strategy, that is the one determined by Merton (1971). However, when the current return is closer to the benchmark, the manager tilts her strategy from the Merton level to try to beat the benchmark. Nicolosi et al. (2018) extend their framework to consider mean-reversion either in the market price of risk or in the volatility. Basak et al. (2008) introduce additional restrictions on the set of admissible strategies to contrast the tendency of managers to increase riskiness when their portfolio under-performs the benchmark, in order to align managers’ scope to that of investors. The optimal allocation problem for institutional investors concerned about their performance with respect to a benchmark index is studied in Basak and Pavlova (2013). Their objective is to show how incentives influence the prices of the assets hold by institutional investors. In particular the authors found that, differently from standard investors, institutions tend to form portfolios of stocks that compose the benchmark index, they push up prices of stocks in the benchmark index by generating excess demand for index stocks and induce excess correlation among these stocks. Carpenter (2000) analyses the optimal investment problem of a risk adverse manager who is compensated with a call option on the asset under management. In this paper the market model is assumed to be complete and the non-concavity of the objective function is addressed by introducing a concavification argument and showing that the optimal solution takes values on a set where the original non-concave objective function is equal to the minimal concave function dominating it. An explicit solution to this problem in the Black-Scholes setting is provided in Nicolosi (2018) while Herzel and Nicolosi (2019) extend the solution to the case of mean-reverting processes. The impact of commonly observed incentive contracts on managers’ decisions is also studied in Chen and Pennacchi (2009), where the authors found that for particular compensation structure, when a fund is performing poorly, the deviation from the benchmark portfolio is larger than in case of good performance.

Other important contributions on the literature of delegated portfolio management problem include Cuoco and Kaniel (2011), who investigate the case where managers receive a direct compensation, related to the performance, from investors and discuss asset price implications in equilibrium. Different compensation schemes have been considered, for instance, in Barucci and Marazzina (2016) in a portfolio optimization problem for a manager who is remunerated through a High Water Mark incentive fee and a management fee and in Barucci et al. (2019) where a penalty on the remuneration is applied if the fund value falls below a fixed threshold, namely a minimum guarantee.

Optimal asset allocation under partial information has been widely studied in the literature. Brendle (2006) considered the optimal investment problem for a partially informed investor endowed with bounded CRRA preferences in a market model driven by an unobservable market price of risk via the HJB approach. Hata and Sheu (2018) also included consumption. A more general setting, not necessarily Markovian, has been analysed for instance in Björk et al. (2010) and Lindensjo (2016), under the assumption of market completeness. The optimization problem in these papers is solved using the Martingale approach.

The partial information case in a delegated portfolio management has been considered in the recent literature by Barucci and Marazzina (2015) in a slightly different setting compared to ours, where market is subject to two regimes, modelled via a continuous time two-state Markov chain and in Huang et al. (2012) where investment learning is studied under a Bayesian approach.

Other contributions in the case where prices are modelled as diffusions are Lakner (1995, 1998). Brennan (1998) and Xia (2001) study the effect of learning on the portfolio choices, and Colaneri et al. (2020) address the problem of computing the price that a partially informed investor would pay to access to a better information flow on the market price of risk. Investment problems in a market with cointegrated assets under partial information are studied in some recent works as for instance Lee and Papanicolaou (2016) and Altay et al. (2018, 2020).

2 Market model and the portfolio optimization problem

We fix a probability space \((\varOmega , {\mathcal {F}}, \mathbf {P})\). Let \({\mathbb {F}}=\{{\mathcal {F}}_t, t\ge 0\}\) be a complete and right continuous filtration representing the global information. We consider a market model with one risky asset with price \(S_t\), the stock, and one risk-free asset with price \(B_t\). We assume that the price of the risk-free asset follows

$$\begin{aligned} \frac{d B_t}{B_t} = r dt \end{aligned}$$
(1)

with the constant \(r>0\) representing the constant interest rate. The risky asset price is modelled by a geometric diffusion

$$\begin{aligned} \frac{d S_t}{S_t} = \mu _t dt + \sigma dZ^S_t \end{aligned}$$
(2)

where \(Z^S_t\) is a one dimensional standard Brownian motion, \(\sigma >0\) is the constant volatility and the drift is the process

$$\begin{aligned} \mu _t = r + \sigma X_t, \end{aligned}$$
(3)

which depends linearly on the market price of risk \(X_t\). The process \(X_t\) satisfies

$$\begin{aligned} d{X_t} = -\lambda ({X_t}-{\bar{X}})dt+ \sigma _X dZ_t^X, \end{aligned}$$
(4)

with starting value \(X_0\) drawn from a normal distribution with mean \(\pi _0\) and variance \(R_0\). The parameter \(\lambda >0\) is a constant representing the strength of attraction toward the long term expected mean \({\bar{X}}\), \(\sigma _X > 0\) is the volatility of the market price of risk and \(Z^X_t\) is a one-dimensional standard Brownian motion correlated with \(Z^S_t\) with correlation \(\rho \in [-1,1]\). We assume that the market price of risk is a latent variable that is not directly observed, and its value can only be derived through the observation of \(S_t\). That means that the available information is given by the filtration \({\mathbb {F}}^S:=\{\mathcal {F}^S_t,\ t \in [0,T]\}\), generated by the process S.Footnote 1 Let us note that, since there are two risk factors \(Z^S_t\) and \(Z^X_t\), but only one traded asset besides the money market account, this market model is incomplete.

We study the problem of a fund manager who trades the two assets, \(S_t\) and \(B_t\), continuously in time on [0, T], starting from an initial capital w. We assume that the stock does not pay dividends before time T. We describe the trading strategy of the manager by a process \(\theta _t\) representing the fraction of wealth invested in the risky asset at any time \(t \in [0,T]\). We only consider trading strategies that are self-financing and based on the available information, hence defining an admissible strategy as a self-financing trading strategy, adapted to the filtration \(\mathbb {F}^S\) and, to prevent arbitrage from doubling strategies,Footnote 2 such that

$$\begin{aligned} \mathbf {E}\left[ \int _0^T\left( |\theta _t X_t|+\theta ^2_t\right) d t\right] <\infty . \end{aligned}$$
(5)

The set of all admissible strategies is denoted by \({\mathcal {A}}^{S}\). The wealth process generated by an admissible strategy \(\theta _t\) is

$$\begin{aligned} \frac{dW_t}{W_t} = (r + \theta _t \sigma X_t)dt+\theta _t\sigma dZ^S_t, \quad W_0=w>0. \end{aligned}$$
(6)

The manager’s compensation is determined by the value of the AUM at time T, expressed by \(f(W_T,Y_T) W_T\), where f is a function, to be defined below, that captures the effects of the relative performance with respect to the benchmark. The value of the benchmark \(Y_t\) is obtained by following the constant investment strategy \(\beta \) and hence it follows

$$\begin{aligned} \frac{dY_t}{Y_t} = (r + \beta \sigma X_t)dt+\beta \sigma dZ^S_t . \end{aligned}$$

The continuously compounded returns on the manager’s portfolio and on the benchmark over the period [0, t] are given by \(R^W_t = \ln \frac{W_t}{W_0}\) and \(R^Y_t = \ln \frac{Y_t}{Y_0}\), respectively. To compare relative performances, we set \(Y_0 = W_0\). The difference \(R^W_T-R^Y_T\) provides the tracking error of the final wealth relative to the benchmark. The funds flow to relative performance relationship is described by the function

$$\begin{aligned} f(W_T,Y_T) = \left\{ \begin{array}{c@{\quad }c@{\quad }c} f_L &{} \text {if} &{} R^W_T-R^Y_T<\eta _L \\ f_L + \psi \cdot (R_T^W -R_T^Y - \eta _L) &{} \text {if} &{} \eta _L \le R_T^W -R_T^Y <\eta _H \\ f_H := f_L + \psi \cdot (\eta _H - \eta _L) &{} \text {if} &{} R_T^W -R_T^Y \ge \eta _H \end{array} \right. \end{aligned}$$
(7)

with \(f_L> 0, \psi >0\), and \(\eta _L \le \eta _H\) and it is illustrated in Fig. 1. This simplified structure of the funds flow to relative performance relationship, called in the literature collar type, shows that if the manager return is below the benchmark return of at least \(\eta _L\) or above the benchmark return of at least \(\eta _H\), the flow rate received by the fund is flat (with different rates \(f_L<f_H\)). When the relative performance, measured in terms of tracking error, is between \(\eta _L\) and \(\eta _H\), the flow function is a linear segment with a positive slope. The function f also has two kinks when the difference \(R_T^W -R_T^Y\) reaches the levels \(\eta _L\) and \(\eta _H\). The funds flow to relative performance relationship in Eq. (7) was proposed by Basak et al. (2007) to describe an implicit incentive scheme and it is based on the empirical analysis of Chevalier and Ellison (1997). The idea is that, if the fund under-performs with respect to the benchmark, investors tend to withdraw their money, the AUM decreases, and the manager receives a lower compensation. The opposite happens in the case of over-performance. Citing Basak et al. (2007): “(...) this simple way of modeling fund flows is able to capture most of the insights pertaining risk-taking incentives of a risk averse manager”.

Fig. 1
figure 1

The funds flow \(f(W_T,Y_T)\) as a function of relative performance \(R_T^W -R_T^Y\), with parameters \(f_L=0.8, f_H=1.5, \eta _L=-0.08\) and \(\eta _H=0.08\)

The manager maximizes the expected utility of her implicit incentives over the set of admissible strategies \({\mathcal {A}}^{S}\),

$$\begin{aligned} \max _{\theta \in {\mathcal {A}}^{S}}\mathbf {E}\left[ u(W_T f(W_T,Y_T))\right] \end{aligned}$$
(8)

with initial budget \(W_0=w\). We assume that the manager is endowed with a power utility function

$$\begin{aligned} u(x) = \frac{1}{1-\gamma }x^{1-\gamma }, \end{aligned}$$
(9)

with nonnegative risk aversion parameter \(\gamma \ne 1\). The case \(\gamma =1\) corresponds to the logarithmic utility. Since the market price of risk is not observable, this is an optimization problem under restricted information. To solve it, we first reduce it to a setting with a common information flow by replacing the unobservable process \(X_t\) with its conditional expectation. This standard procedure allows us to consider an equivalent optimization problem under the available information, see, e.g. Fleming and Pardoux (1982). We characterize the conditional expectation of \(X_t\) in the next section via Kalman filtering.

3 Optimal wealth and strategies

In this section we solve the problem (8). The first step is to estimate the unobservable market price from stock prices. Applying the Kalman filtering theoryFootnote 3 we get that the conditional distribution of market price of risk is Gaussian with conditional mean \(\pi _t:=\mathbf {E}\left[ X_t | \mathcal {F}^S_t\right] \), and conditional variance \(R_t:= \mathbf {E}\left[ \left( X_t-\mathbf {E}[X_t|\mathcal {F}^S_t]\right) ^2|\mathcal {F}^S_t\right] \). To derive \(\pi _t\) and \(R_t\) we introduce the innovation process

$$\begin{aligned} I_t:= Z^S_t+\int _0^t (X_u-\pi _u) d u. \end{aligned}$$
(10)

It is well known (see, e.g. Lipster and Shiryaev 2001 or Ceci and Colaneri 2012, 2014) that \(I_t\) is a Brownian motion with respect to the observable filtration \(\mathbb {F}^S\). The proposition below, proved for instance in Lipster and Shiryaev (2001), provides the dynamics of \(\pi _t\) and \(R_t\).

Proposition 1

The conditional mean and variance of the market price of risk satisfy the equations

$$\begin{aligned} d \pi _t= & {} -\lambda (\pi _t- {{\bar{X}}}) d t + (R_t+\rho \sigma _X) d I_t, \quad \pi _0\in \mathbb {R}, \end{aligned}$$
(11)
$$\begin{aligned} d R_t= & {} \left[ \sigma _X^2 -2\lambda R_t -(R_t+\rho \sigma _X)^2\right] d t, \quad R_0\in \mathbb {R}^+. \end{aligned}$$
(12)

From (12) we see that the conditional variance of the market price of risk is deterministic and satisfies a Riccati ordinary differential equation. Using Eq. (10), we get the dynamics of the stock, of the wealth process and of the benchmark with respect to the innovation process

$$\begin{aligned} \frac{d S_t}{S_t}&= (r + \sigma \pi _t) d t + \sigma d I_t, \end{aligned}$$
(13)
$$\begin{aligned} \frac{d W_t}{W_t}&= (r + \theta _t \sigma \pi _t) d t + \theta _t \sigma dI_t, \end{aligned}$$
(14)
$$\begin{aligned} \frac{d Y_t}{Y_t}&= (r + \beta \sigma \pi _t) d t + \beta \sigma dI_t. \end{aligned}$$
(15)

Since all processes are only driven by the innovation, the sub-market restricted to claims that can be replicated by strategies in \({\mathcal {A}}^S\) is complete. We solve the optimization problem (8) using the martingale method (see, for instance Cox and Huang 1989), transforming the dynamic optimization problem (8) where the control variable is a strategy, into an equivalent static problem where the control variable is the terminal wealth.

To identify terminal wealths that are reachable from the initial budget w with feasible strategies, we introduce the unique state price density process

$$\begin{aligned} \frac{d \xi _t}{ \xi _t} = -r dt - \pi _t dI_t, \quad \xi _0=1. \end{aligned}$$
(16)

The static optimization problem, equivalent to (8) is

$$\begin{aligned} \max _{ W_T} \mathbf {E}[u( W_T f( W_T,Y_T))], \end{aligned}$$
(17)

with budget constraint

$$\begin{aligned} w = \mathbf {E}[\xi _T W_T] . \end{aligned}$$
(18)

The objective function in problem (17)–(18) is not concave in \(W_T\). To overcome this issue we apply the concavification procedure proposed by Carpenter (2000) (see Appendix D for more details on the procedure). Following the approach in Proposition 2 of Basak et al. (2007), we define the optimal final wealth relative to the benchmark, which is given by \(V_T=\frac{ W^\star _T}{Y_T}\), where \(W^\star _T\) is the optimal final wealth in problem (17)–(18). This quantity has an explicit representation (e.g. equation (A7) in Basak et al. 2007 or equation (6) in Nicolosi et al. 2018) given, for completeness, by Eq. (26) in Appendix A. One key characteristic is that \(V_T\) is a function of \(\zeta _T:= \xi _T Y_T^\gamma \) only. Computing \(V_T\), enables us to characterize the optimal terminal wealth. However this is not sufficient to obtain the trading optimal portfolio strategy, for which, we need to know the value of optimal wealth at any time \(t\in [0,T]\), and consequently, we must determine the relative wealth \(V_t= \frac{ W^\star _t}{Y_t}\). To compute \(V_t\) we consider the benchmarked market, where we discount all processes with the numéraire \(Y_t\).Footnote 4 Due to market completeness there exists an equivalent risk neutral probability measure \(\mathbf {Q}\) for the benchmarked market. Put in other words there exists a probability measure \(\mathbf {Q}\) which is equivalent to \(\mathbf {P}\) and such that the price process of any benchmarked traded asset (i.e. any traded asset discounted with \(Y_t\)) is a martingale under \(\mathbf {Q}\).Footnote 5

We define the process \(\zeta _t = \xi _t Y_t^{\gamma }\) and derive its distribution under \(\mathbf {Q}\). Let z be a complex number and define the conditional moment generating function of \(\ln (\zeta _T)\) under the measure \(\mathbf {Q}\) as

$$\begin{aligned} H(t,\zeta ,p;z) =\mathbf {E}^\mathbf {Q}\left[ \zeta _T^{z} \vert \mathcal {F}^S_t\right] = \mathbf {E}^\mathbf {Q}\left[ \zeta _T^{z} \vert \zeta _t = \zeta ,\pi _t = p \right] . \end{aligned}$$

The function \(H(t,\zeta ,p;z)\) plays a key role in solving the optimization problem (see Proposition 2 below). It is characterized in the following technical lemma.

Lemma 1

Let z be a complex number and let A(tz), B(tz) and C(tz) be the solutions of the system of Riccati equations (40)–(41)–(42) in Appendix B on the interval [0,T]. Under suitable regularity conditions (see Eq. (35) in Appendix B), the conditional moment generating function of \(\ln (\zeta _T)\) under the measure \(\mathbf {Q}\) is given by

$$\begin{aligned} H(t,\zeta ,p;z) = \zeta ^{z} e^{A(t; z)+B(t;z) p+\frac{1}{2} C(t;z) p^2}. \end{aligned}$$
(19)

The proof of Lemma 1 goes along the same lines as in Nicolosi et al. (2018). We remark that in this particular case, since both the drift and volatility in the dynamics of the filter \(\pi _t\) are not constant (11), the coefficients of the Riccati equations that characterize the functions AB and C are time-dependent. The solution of non-homogeneous Riccati equations are discussed in Appendix C.

In the next step we use Fourier Transform to compute the optimal relative wealth and the optimal strategy at any time \(t\le T\).

Proposition 2

Let \(K_j\), for \(j=1,2,3,4\) be real numbers such that \(K_1 < -1/\gamma \), \( K_4 > -1/\gamma \) and

$$\begin{aligned} H(t,\zeta ,p;K_j) = \mathbf {E}^\mathbf {Q}\left[ e^{ K_j \ln (\zeta _T)} \vert \zeta _t = \zeta ,\pi _t = p \right] < \infty . \end{aligned}$$
(20)

Then,

  1. (i)

    the relative wealth \(V_t\) is finite and given by

    $$\begin{aligned} V_t = \frac{1}{2\pi }\sum _{j=1}^4 \int _{-\infty }^{+\infty }{\hat{\varphi }}_j(u+iK_j) H(t,\zeta ,p;K_j-iu) du \end{aligned}$$
    (21)

    where the functions \({\hat{\varphi }}_j(z)\) of the complex variable z are given in Appendix A;

  2. (ii)

    the optimal strategy is

    $$\begin{aligned} \theta _t = \beta +\frac{1}{ \sigma V_t}\left( \frac{\partial V}{\partial \zeta }\zeta _t(\gamma \beta \sigma - \pi _t)+ \frac{\partial V}{\partial p} (R_t+\rho \sigma _X) \right) . \end{aligned}$$
    (22)

The proof of Proposition 2 is given in “Appendix B”. Notice that there are no conditions on constants \(K_2\) and \(K_3\), and for instance, if they are both equal to zero condition (20) is trivially satisfied. See the proof in Appendix B for further details. Examples of applications of the formulas for the optimal relative wealth (Eq. 21) and the optimal strategy (Eq. 22) provided by Proposition 2 are given in the next section.

4 A numerical illustration

Now we study the impact of uncertainty on the market price of risk on optimal strategies of a portfolio manager subject to implicit incentives. First we examine the combined effect of risk aversion and of market conditions, then we show that its relevance increases with the level of uncertainty and with the time span of the investment. In a relatively stable market (i.e. low volatility) with lower expected returns, the portfolio manager increases her exposure to the risky asset when underperforming and decreases it when overperforming. The opposite happens when the market is more volatile and expected returns are higher. Moreover, we see that risk-aversion has a direct influence on the views of the manager about the uncertain estimates. Managers with a risk-aversion parameter larger than 1 fear that the value of the market price of risk will turn out to be below their current estimate and consequently reduce their exposure to hedge for the future changes. Managers with risk aversion lower than 1 expect that the value may be higher than the estimate and tilt their strategy in the opposite way.

To illustrate such behavior with an example, we consider a simplified version of our model, where the market price of risk is constant but unknown, and given by a random variable \(X_0\), drawn from a Gaussian distribution with mean \(\pi _0\) and variance \(R_0\). This setting is analogous to that of Brennan (1998), where the manager does not know the value of the drift of the price process and can only estimate its expected value \(m_0\), which is related to the market price of risk \(\pi _0\) by

$$\begin{aligned} \pi _0= \frac{m_0-r}{\sigma }. \end{aligned}$$

Setting \(\lambda =0\) and \(\sigma _X=0\), we get that the conditional expected value and variance follow

$$\begin{aligned} d \pi _t= R_t d I_t, \quad \pi _0 \in \mathbb {R}, \end{aligned}$$
(23)

and

$$\begin{aligned} R_t=\frac{R_0}{R_0 t +1}. \end{aligned}$$
(24)

To highlight the effects of uncertainty on the market price of risk we also consider a manager who erroneously assumes that \(X_0=\pi _0\). We call this manager myopic because, unlike the far-sighting manager, she does not adjust her strategy to hedge for future changes on the estimates. We denote by \(\theta ^0_t\) the fraction of wealth invested by the myopic manager in the risky asset and by \(\theta _t\) the optimal strategy of the partially informed manager given by Eq. (22). The value of \(\theta ^0_t\) is obtained from Eq. (22), by setting \(R_t=0\) and \(\pi _t=\pi _0\) (of course, this also affects the relative wealth \(V_t\) ). As a comparison we also consider the Merton level \(\theta _N = \frac{1}{\gamma }\frac{\mu _0 - r}{\sigma ^2} \) corresponding to the optimal investment of the myopic manager who optimizes only the utility of terminal wealth, without other incentives.

Figure 2 represents the myopic strategy \(\theta ^0_t\) (dotted line) and the optimal strategy under partial information \(\theta _t\) (continuous line) as functions of the relative return of the portfolio with respect to the benchmark, that is \(R_t^W-R_t^Y\), at time \(t=0.25\), either for \(\gamma = 0.8\) (left panels) or for \(\gamma = 2\) (right panels). The parameters of the implicit incentives structure at time \(T=1\) in Eq. (7) are the same as in Fig. 1.

Fig. 2
figure 2

Optimal strategies for different economies and different managers. The optimal strategy \(\theta _t\) (continuous line) and the myopic one \(\theta ^0_t\) (dotted line) at time \(t=0.25\) are reported as functions of the relative return of the portfolio with respect to the benchmark, as well as the Merton level \(\theta _N\) (dashed line). Left panels represent the strategies of managers with risk aversion \(\gamma =0.8\), right panels those of more risk-averse managers (\(\gamma = 2\)). Top panels represent economy a (\(\theta _N > \beta =1\)), that is a less volatile market with higher returns (\(\sigma = 0.15\), \(\pi _t=0.667\)), the bottom panels are referred to economy b (\(\theta _N < \beta =1\)), a more volatile market with lower returns (\(\sigma = 1\), \(\pi _t=0.3\)). The parameters of the payoff function are the same as in Fig. 1. The others parameters are \(T=1\), \(r = 0\), and \(R_0=0.09\) so that the variance \(R_t\) is 0.088 according to (24)

The top panels show the case when the Merton level is above the investment in the risky asset of the benchmark portfolio, that is when \(\theta _N > \beta \). In the illustration we set \(\beta =1\). This setting corresponds to a market situation with relatively small volatility and returns, that is called economy (a) by Basak et al. (2007), obtained by taking \(\sigma = 0.15\), \(r = 0\) and \(m_0 = 0.1\) in our model.

The bottom panels show the situation when the Merton level is below the investment in the risky asset of the benchmark, that is when \(\theta _N < \beta \). This is the economy (b) in Basak et al. (2007), a more volatile and remunerative market, obtained by setting \(\sigma = 1\), \(r = 0\) and \(m_0 = 0.3\). The level of uncertainty on the initial estimate is given by \(R_0 = 0.09\) for all the panels.

The panels on the left represent a less risk averse manager \(\gamma =0.8\), those on the right a more risk-averse one \(\gamma =2\). By comparing the left to the right panels we see that the investment in risky asset decreases with risk aversion for both economies (a) and (b). The strategies for a myopic and a far-sighting manager are qualitatively similar to each other with the far-sighting manager taking either more or less risky positions than the myopic one, depending on the risk-aversion. When the risk aversion parameter \(\gamma \) is equal to 1 (that is the case of logarithmic utility), the two strategies coincide. A far-sighting manager with a risk aversion smaller than 1 (left panels) tends to be more exposed to the risky asset than a myopic manager with the same risk aversion. In this case, the far-sighting manager acts optimistically, as she believes that, increasing the precision of the estimates of the market price, the correct value will be higher than the current one. Instead, the more risk-averse manager (right panels) is pessimistic and reduces the exposition to the risky asset fearing that the future estimates will be lower than the current one. By comparing top to bottom panels in Fig. 2 we see the effects of the overall economic condition on the optimal strategy, depending on the current results of the portfolio management strategy. When the relative performance is either too low or too high for the incentives to have an effect on the final reward, the optimal strategy approaches a constant level that corresponds to the optimal risky exposure without incentives and hence the myopic investment converges to the Merton level. If the manager is underperforming but still hopes to recover, or if she is slightly ahead, but still fearing to end behind, she adjusts the portfolio strategy in a way that depends on the economic conditions. In the case of economy (a) (top panels), she increases the exposure when trailing and decreases it when leading. The economy (b), representing a more volatile market and higher expected returns (bottom panels), induces the same manager to take opposite choices.

Figure 3 represents, for the same scenarios in Fig. 2, the difference \(\theta _t^0-\theta _t\), at time \(t=0.25\), as a function of the relative return of the portfolios with respect to the benchmark, for some values of \(R_0\). We can clearly see that the larger the values of \(R_0\) the larger the differences, in absolute values, between the strategies.

Fig. 3
figure 3

Deviations \(\theta _t^0-\theta _t\), at time \(t=0.25\), as functions of the relative return of the portfolio with respect to the benchmark, for different levels of initial conditional variance \(R_0=[0.09,0.16,0.23]\). Left panels represent the strategies of managers with risk aversion \(\gamma =0.8\), right panels those of more risk-averse managers (\(\gamma = 2\)). Top panels represent economy a (\(\theta _N > \beta =1\)), that is a less volatile market with higher returns (\(\sigma = 0.15\), \(\pi _t=0.667\)), the bottom panels are referred to economy b (\(\theta _N < \beta =1\)), a more volatile market with lower returns (\(\sigma = 1\), \(\pi _t=0.3\)). The parameters of the payoff function are the same as in Fig. 1. The others parameters are \(T=1, r = 0\)

Fig. 4
figure 4

The impact of learning on the relative increment of the Certainty Equivalent as a function of the initial variance \(R_0\), for maturities 0.75 and 1.5 years. The other parameters are \(\gamma =1.5\), \(\sigma = 0.15\), \(\pi _0 = 0.667\)

To assess the impact of learning on the expected utility, we compute the Certainty Equivalent of the terminal wealth \(W_T\) , that is

$$\begin{aligned} CE(W_T):=u^{-1}\left( E(u(W_T)) \right) \end{aligned}$$

and the Relative Increment

$$\begin{aligned} RI:=\frac{CE(W_T^*)}{CE(W^0_T)}-1\end{aligned}$$
(25)

where \(W_T^*\) and \(W^0_T\) are the terminal wealths reached by the optimal strategy and by the myopic one. Therefore, RI measures the relative improvement of the Certainty Equivalent when adopting the optimal, far-sighting, strategy instead of the myopic one. To compute the terminal wealths we proceed by simulation, drawing first \(X_0\) from a normal random variable with mean \(\pi _0\) and variance \(R_0\), and simulating the risky asset S. Then, we get \(W_T^*\) and \(W^0_T\) from Eq. (26). The expected utilities are obtained from the sample mean of the terminal utilities.

Figure 4 shows the values of RI as a function of the variance of the market price of risk \(R_0\), for two values of T. We see that the relative performance improves as \(R_0\) increases and it is more pronounced for longer maturities. This is due to the fact that, while the far-sighting manager reacts to changes in market conditions by observing the asset prices and improves her estimates of the market price of risk, the myopic manager believes that the characteristics of the market are stable and never updates her estimates. Therefore, the two strategies are equal only in the case of zero uncertainty on the value of the market price of risk (when \(R_0 =0\)), and their difference grows as \(R_0\) and T increase.

5 Conclusions

We studied a portfolio optimization problem for a manager who is compensated according to the performance of her portfolio relative to a benchmark. The manager invests in a risk-free asset and in a risky asset whose return depends on a latent variable representing the market price of risk. Hence she solves an optimization problem under partial information. Due to the implicit incentives given by the funds flow to relative performance, the utility function of the manager is not concave and hence existence of the optimum does not trivially hold. We solve the optimization problem using the martingale approach and a concavification procedure. This approach can be successfully applied due to completeness of the market under partial information. Optimal wealth and consequently the optimal strategy are characterized in a semi-explicit form via Fourier transform. We illustrated our results with an example, where we assume that the market price of risk is constant but unknown. We observed that the level of risk aversion has an influence on the manager’s estimate of the market conditions, and consequently on her investment choices. Managers with a small risk aversion parameter are optimistic: they tend to increase their investment in the risky assets compared to myopic managers, believing that the true value of the market price of risk (and hence of the asset return) is more favourable than her estimate. It is also seen that if the market is not subject to large fluctuations, managers invest more in risky assets when they are underperforming the benchmark, in anticipation to retrieve benchmark revenues, and invest less in the risky asset when overperforming, to avoid possible downward movements of the market.

We have also analyzed the effect of uncertainty (i.e. \(R_0\)) on the optimal portfolio strategies and the optimal wealth by comparing the case of the myopic manager with the optimal partially informed investor. We have seen that the deviation between strategies amplifies as \(R_0\) increases. The comparison between Certainty Equivalents corresponding to the optimal wealth (under partial information) and the wealth of the myopic manager, goes in the same direction, that is, it grows with \(R_0\), and increases with time as well, as a consequence of the learning effect.

We conclude with a few clarifications on the setting under partial information and possible extensions. In this paper we assumed that the volatility is constant to end up in the setting of Kalman filtering. This is convenient for the optimization problems since in this case filter is finite dimensional. We also stress that if the volatility depends on the market price of risk (i.e. is a bijective function \(\sigma (X_t)\)) then we end up with a model under full information (see, e.g. Ceci and Colaneri 2017, Remark 3). Indeed, the volatility is always adapted to the filtration \(\mathbb {F}\) and hence observable, therefore X can be derived by function inversion. In this paper we have analyzed the optimal asset allocation under implicit incentives for the manager in the univariate case under partial information. The multivariate case is discussed under full information in Basak et al. (2007) with a constant market price of risk and Nicolosi et al. (2018) with a stochastic market price of risk. The generalization of our partial information problem to the multivariate case can be carried following the same steps of this paper; our choice of considering the one dimensional case, however, allows us to avoid technicalities and keep the notation simpler. For instance the multivariate case requires to filter the multi-dimensional market price of risk using the information coming from multiple risky assets. It is well known that, in such case, the conditional variance is characterized via a system of Riccati equations that does not have an explicit solution. We considered collar-type incentives, but our approach can be applied to study other incentive functions. Of course, a different incentive function will change the concavified objective function and, as a consequence, the functional form of the optimal wealth and the optimal strategies.