1 Introduction

When it comes to modelling trading behaviour, the standard economic paradigm is the maximisation of risk-averse agents’ expected utility in a frictionless market. This criterion, however, has been criticised on many levels. In terms of trading environment, financial friction is omnipresent in reality where transactions are subject to various costs. In terms of agents’ preferences, the behavioural economics literature suggests that many individuals do not make decisions in accordance to expected utility theory. First, utilities are not necessarily derived from final wealth, but typically what matters is the change in wealth relative to some reference point. Second, individuals are usually risk-averse over the domain of gains, but risk-seeking over the domain of losses—this can be captured by an S-shaped utility function. Finally, individuals may fail to take portfolio effects into account when making investment decisions, and this phenomenon is known as narrow framing. These psychological ideas are explored for example in the seminal work of Kahneman and Tversky [16], Tversky and Kahneman [26, 27] and Kahneman and Lovallo [15].

In this paper, we develop a tractable dynamic trading model which captures a number of stylised behavioural biases of individuals as well as market frictions. In our setup, trading is costly due to proportional transaction costs as well as a fixed market entry fee. The goal of an agent is to find the optimal time to buy and then to sell an indivisible risky asset to maximise the expected utility of the round-trip profit under the prospect theory preference of Tversky and Kahneman [27]. While a realistic economy can consist of multiple assets, we can interpret the assumption of a single indivisible asset as a manifestation of narrow framing such that the trading decisions associated with one particular unit of indivisible asset can be completely isolated from the other investment opportunities. We believe the model is the best suitable to describe the trading behaviour of speculative agents. These “less-than-fully rational” agents purchase and sell an asset with a narrow objective of making a one-off round-trip profit rather than supporting consumption or stipulating a long-term portfolio growth.

A sequential optimal stopping problem featuring an S-shaped utility function is solved to identify the entry and exit time of the market by the agent. The solution approach is based on a backward induction idea. In the first stage, we focus on the exit strategy of the agent: Conditionally on the ownership of the asset purchased at a given price level (which determines the agent’s reference point), the optimal liquidation problem is solved. Then the value function of this exit problem reflects the utility value of purchasing the asset at a different price level. Upon comparison against the utility value of inaction, we obtain the payoff function of the real option to purchase the asset which is then used in the second stage problem concerning the entry decision of the agent: The agent picks the optimal time to enter the trade so as to maximise the expected payoff of this real option to purchase the asset.

The traditional route to analyse an optimal stopping problem is to first conjecture a candidate optimal stopping rule, and then the dynamic programming principle is invoked to derive a free boundary value problem that the value function should satisfy. Then one can attempt to solve for the free boundaries via value matching and smooth pasting. For this approach to work, we need to correctly identify the form of the optimal stopping rule, but this exercise may be not trivial. As it turns out, the optimal continuation region of our entry problem can either be connected or disconnected, depending on the model parameters. It is thus difficult to adopt such a guess-and-verify approach since we do not know the correct form of the optimal stopping rule upfront. In our analysis, a martingale method is employed to solve the underlying optimal stopping problems, which has the important advantage that no a priori conjecture on the optimal strategy is required. The optimal continuation/stopping set can be deduced directly by studying the smallest concave majorant of a suitably scaled payoff function.

Despite its relatively simple nature, our model is capable of generating a rich variety of trading behaviours such as “buy and hold”, “buy low, sell high”, “buy high, sell higher” and “no trading”. The risk-seeking preference of a behavioural agent over the loss domain will typically encourage him to enter the trade, but his precise trading behaviour depends crucially on the level of transaction costs relative to his preference parameters. Generally speaking, a high proportional (fixed) transaction cost discourages trading at a high (low) nominal price. When proportional costs are high and the asset is expensive, the agent prefers waiting until the price level declines, and hence he is more inclined to consider a “buy low, sell high” strategy. But if instead the fixed entry fee is high and the asset is cheap, the agent might prefer delaying the purchase decision until the asset reaches a higher price level, and this leads to a trading pattern of “buy high, sell higher”.

Both behavioural preferences and market frictions are studied extensively as separate topics in the mathematical finance literature. To the best of our knowledge, however, their interaction has not been explored to date. Under prospect theory, the risk attitude of the agent is heavily influenced by the reference point. In our model, the reference point is endogenised so that it depends on the cost of purchase including the transaction cost paid. The level of transaction cost therefore has a direct impact on the agent’s risk preference. This subtle interaction between risk preference and transaction cost leads to interesting policy implications on how speculative trading can be curbed effectively. For example, a surprising result is that imposing a fixed market entry fee might indeed accelerate rather than cool down trading participation.

Our paper is closely related to the literature on optimal stopping under an S-shaped utility function. Kyle et al. [18] and Henderson [9] consider a one-off optimal liquidation problem in which the agent solves for the optimal time to liquidate an endowed risky asset to maximise an expected prospect theory utility. They do not consider the purchase decision, and the reference point is taken as some exogenously given status quo. A main contribution of our paper is that we further endogenise the reference point which depends on the purchase price of the asset, and the optimal purchase price must be determined as a part of the optimisation problem. The recent work of Henderson and Muscat [11] extends the model of Henderson [9] by considering partial liquidation of multiple indivisible assets. Both the present paper and [11] consider a sequential optimal stopping problem as the underlying mathematical framework. However, the economic natures of the problems are completely different—we study the sequential decision of purchase and sale, while they exclusively focus on sales.

Another relevant class of works is the realisation utility model which further incorporates a reinvestment possibility within a behavioural optimal stopping model, such as Barberis and Xiong [2], Ingersoll and Jin [14], He and Yang [8], Kong et al. [17] and Dai et al. [3]. In such models, the agent repeatedly purchases and sells an asset to maximise the sum of utility bursts realised from the gain and loss associated with each round-trip transaction. In a certain sense, these models consider an endogenised reference point which is continuously updated based on the historical prices within each trading episode. However, the purchase decision is exogenously given in many of the models where the agent is simply assumed to buy the asset again immediately after a sale. These cited papers on realisation utility models all feature transaction costs which are required to make the problems well posed. As a result, the purchase pattern is not entirely realistic: If the agent is willing to sell an asset and then instantaneously repurchase an identical (or a different, but statistically identical) asset, then the agent is essentially throwing away money in the form of transaction costs without altering his own financial position. Only He and Yang [8] carefully analyse the purchase decision of the agent, but in any case they find that the purchase strategy is trivial—the agent either buys the asset immediately after a sale or never enters the trade again. Our model differs from the realisation utility model in that we do not consider perpetual reinvestment opportunities (which can be understood as narrow framing that the agent only focuses on a single episode of the trading experience when evaluating the entry and exit strategies). Nonetheless, the optimal purchase region of our model is non-trivial under typical parameters and encapsulates many realistic trading strategies.

Beyond the context of behavioural economics, there are a few works attempting to model the sequential purchase and sale decisions under an optimal switching framework. However, identification of a modelling setup which can generate reasonable trading patterns proves to be much more difficult than expected. On the one hand, Zervos et al. [29] report that “… the prime example of an asset price process, namely, the geometric Brownian motion, does not allow optimal buying and selling strategies that have a sequential nature”. Indeed, existing literature which gives “buy low, sell high” as an optimal trading strategy often relies on extra statistical features of the asset price process such as mean reversion. See for example Zhang and Zhang [30], Song et al. [25], Leung et al. [20] and Leung and Li [19]. On the other hand, momentum-based trading strategies are also rarely studied in the mathematical finance literature. The scarce examples include the work of Dai et al. [5] and Dai et al. [4] who find that a trend-following strategy is optimal under a regime-switching model for the asset price. We contribute to this strand of literature by showing that a trading model based on a simple geometric Brownian motion can also generate many realistic trading patterns including both a reversal strategy (buy low, sell high) and a momentum strategy (buy high, sell higher). This is achieved via incorporating standard elements of behavioural preferences and market frictions.

The optimal investment rule in the classical Merton [23, 24] portfolio selection problem can also be viewed as a buy low, sell high strategy: Since the agent keeps a constant fraction of wealth invested in the risky asset, extra units of risky asset are sold (purchased) when the price increases (falls), ceteris paribus. In our paper, we focus on a single indivisible asset and do not consider portfolio effects.

The rest of the paper is organised as follows. Section 2 provides a description of the model and the underlying optimisation problem. In Sect. 3, we outline the solution methods to a standard optimal stopping problem and discuss heuristically how the solution to our sequential optimal stopping problem can be characterised via the idea of backward induction. The main results are collected in Sect. 4. Some comparative statics results and their policy implications are discussed in Sect. 5. Several extensions of the baseline model are discussed in Sect. 6. Section 7 concludes. A few technical proofs are deferred to the Appendix.

2 Problem description

Let \((\Omega , \mathcal{F}, (\mathcal{F}_{t}),\mathbb{P})\) be a filtered probability space satisfying the usual conditions and supporting a one-dimensional Brownian motion \(B=(B_{t})_{t\geq 0}\). There is a single indivisible risky asset in the economy. Its price process \(P=(P_{t})_{t\geq 0}\) is modelled by a one-dimensional diffusion with state space \(\mathcal{J}\subseteq \mathbb{R}_{+}\) and dynamics of

$$\begin{aligned} dP_{t}=\mu (P_{t})dt+\sigma (P_{t})dB_{t}, \end{aligned}$$

where \(\mu :\mathcal{J}\to \mathbb{R}\) and \(\sigma :\mathcal{J}\to (0,\infty )\) are Borel functions. We assume that \(\mathcal{J}\) is an interval with endpoints \(0\leq a_{\mathcal{J}}< b_{\mathcal{J}}\leq \infty \) and that \(P\) is regular in \((a_{\mathcal{J}},b_{\mathcal{J}})\), i.e., for any \(p,y\in (a_{\mathcal{J}},b_{\mathcal{J}})\), we have \(\mathbb{P}[\tau _{y}<\infty |P_{0}=p]>0\), where \(\tau _{y}:= \inf \{t\geq 0: P_{t}=y\}\).

We assume that the interest rate is zero in our exposition. For the non-zero interest rate case, one can interpret the process \(P\) as the numéraire-adjusted price of the asset. Then the drift term \(\mu (\,\cdot \,)\) can be viewed as the instantaneous excess return of the risky asset.

Trading in the asset is costly. If the agent wants to purchase the asset at its current price \(p\), he will need to pay \(\lambda p + \Psi \) to initiate the trade, where \(\lambda \in [1,\infty )\) so that \(\lambda - 1\) represents the proportional transaction cost on purchases and \(\Psi \geq 0\) represents a fixed market entry fee. When the agent sells the asset at price \(p\), he will only receive \(\gamma p\), where \(\gamma \in (0,1]\) so that \(1-\gamma \) represents the proportional transaction cost on sales.

The preference of the agent is described by the prospect theory of Tversky and Kahneman [27]. Under this framework, utility is derived from gains and losses relative to some reference point rather than from the total wealth. Individuals are typically risk-averse over the domain of gains and risk-seeking over the domain of losses. This can be captured by an S-shaped utility function \(U:\mathbb{R}\to \mathbb{R}\) with \(U(0)=0\) and such that \(U\) is concave (resp. convex) over \(\mathbb{R}_{+}\) (resp. \(\mathbb{R}_{-}\)). Finally, individuals also exhibit loss-aversion so that the negative utility increment brought by a unit of loss is much larger in magnitude than the positive utility increment from a unit of gain.

In the behavioural optimal liquidation literature such as Kyle et al. [18] and Henderson [9], the liquidation payoff is always compared against some exogenously given constant reference point. In our setup, we assume the reference point depends on both an exogenous constant \(R\) as well as on the amount paid by the agent to purchase the asset. Suppose the agent has executed a speculative round-trip trade where he has bought and then sold the asset at stopping times \(\tau \) and \(\nu \) (with \(\tau \leq \nu \)), respectively. The liquidation payoff \(\gamma P_{\nu}\) is evaluated against \(\lambda P_{\tau}+\Psi +R\) as the reference point, where \(\lambda P_{\tau}+\Psi \) is the capital spent on purchasing the asset and \(R\) is a constant outside the model specification. The parameter \(R\) can be interpreted as a preference parameter of the agent which reflects his “aspiration level” in the sense of Lopes and Oden [21], where a more motivated agent will set a higher economic benchmark as a profit target to beat. The realised utility of this round-trip trade is \(U(\gamma P_{\nu}-\lambda P_{\tau}-\Psi -R)\).

A caveat, however, is that the agent is not obliged to enter or exit the trade at all if it is undesirable to do so. A realisation of \(\tau =\infty \) refers to the case that the purchase decision is deferred indefinitely, which is economically equivalent to not entering the trade at all. The liquidation value is zero because there is nothing to be sold, and the reference point becomes \(R\) since the required cash outflow \(\lambda P_{\tau}+\Psi \) for the purchase has never materialised. Thus the prospect theory value under this strategy is simply \(U(-R)\). Similarly, the agent may enter the trade at some time point but never liquidate the asset. This corresponds to a realisation of \(\tau <\infty \) and \(\nu =\infty \). In this case, the liquidation value is again zero and evaluated against the reference point \(\lambda P_{\tau}+\Psi +R\). To summarise all the possibilities, the realised prospect theory utility associated with a trading strategy \((\tau ,\nu )\) is written as

$$\begin{aligned} \textstyle\begin{cases} U(\gamma P_{\nu}-\lambda P_{\tau}-\Psi -R) &\quad \text{if $\tau < \infty ,\nu < \infty $}, \\ U(-R) &\quad \text{if $\tau =\infty $}, \\ U(-\lambda P_{\tau}-\Psi -R) &\quad \text{if $\tau < \infty ,\nu =\infty $}. \end{cases}\displaystyle \end{aligned}$$
(2.1)

The objective of the agent is to find the optimal purchase time \(\tau \) and sale time \(\nu \) to maximise the expected value of (2.1). Define the objective function as

J(p;τ,ν):=E[U(γ P ν 1 { τ < , ν < } (λ P τ +Ψ) 1 { τ < } R)| P 0 =p].
(2.2)

Formally, the agent is solving the sequential optimal stopping problem

$$\begin{aligned} \mathcal{V}(p):=\sup _{\tau ,\nu \in \mathcal{T}:\tau \leq \nu}J(p; \tau ,\nu ), \end{aligned}$$
(2.3)

where \(\mathcal{T}\) is the set of \((\mathcal{F}_{t})\)-stopping times valued in \(\mathbb{R}_{+}\cup \{+\infty \}\). Problem (2.3) has two features which make it non-standard relative to a typical optimal stopping problem. First, the decision space is two-dimensional. Second, the objective function has an explicit dependence on the stopping times \(\tau ,\nu \) via the indicator functions, which further complicates the analysis.

Remark 2.1

Similarly to Henderson [9], Xu and Zhou [28] and Henderson et al. [10], we do not explicitly consider subjective discounting in our baseline model. On the one hand, our model features cash flows at different time points and it is not entirely clear what is the most appropriate way to apply subjective discounting, because the standard prospect theory framework is not directly applicable to intertemporal choices. On the other hand, under discounting, an impatient agent is much more inclined to delay losses and to realise profits earlier, which will lead to an extreme disposition effect which is not consistent with the empirical trading pattern of retail investors; see the discussion in Henderson [9]. At a mathematical level, introducing subjective discounting also makes our problem harder to be analysed in full generality. We briefly discuss in Sect. 6.2 how subjective discounting might be incorporated and explore (numerically in some cases) how it affects the optimal trading behaviour.

3 The solution methods

In this section, we give an overview of how problem (2.3) can be solved. We begin by offering a short summary about the solution approach to solve a standard optimal stopping problem for a one-dimensional diffusion.

3.1 The martingale method for optimal stopping problems

We review the martingale method to solve an undiscounted optimal stopping problem; this is based on Dayanik and Karatzas [6].

Consider a general problem in the form of

$$\begin{aligned} V(p)=\sup _{\tau \in \mathcal{T}}\mathbb{E}\left [G\left (P_{\tau} \right )|P_{0}=p\right ] \end{aligned}$$

for some payoff function \(G\). In the standard theory of optimal stopping, the optimal stopping time can be characterised as the first exit time of the process from some open set \(\mathcal{C}\), so that \(\tau =\inf \{t\geq 0: P_{t}\notin C\}\). In a one-dimensional diffusion setting, it is sufficient to consider stopping times which have the form \(\tau _{a,b}:= \tau _{a} \wedge \tau _{b}\), where \(\tau _{a}:= \inf \{t\geq 0: P_{t}=a\}\) and \(\tau _{b}:= \inf \{t\geq 0: P_{t}=b\}\) with \(a\leq p\leq b\). Here \([a,b]\subseteq \mathcal{J}\) is an unknown interval to be identified (and it depends on \(p\) in general).

Let \(s(\,\cdot \,)\) be the scale function of the process \(P\) (it is unique up to an affine transformation), defined as a strictly increasing function such that \(\Theta :=s(P)\) is a local martingale. A simple application of Itô’s lemma shows that \(s(\,\cdot \,)\) should solve the second order differential equation

$$\begin{aligned} \frac{\sigma ^{2}(p)}{2}s''(p)+\mu (p)s'(p)=0. \end{aligned}$$
(3.1)

Let \(\theta :=s(p)\). Then

$$\begin{aligned} J(p;\tau _{a,b})&:=\mathbb{E} [G(P_{\tau _{a,b}})|P_{0}=p ] \\ & \phantom{:} =\mathbb{E}\big[G\bigl(s^{-1}(\Theta _{\tau _{a,b}})\bigr)\big| \Theta _{0}=\theta \big] \\ & \phantom{:} =\mathbb{E} [\phi (\Theta _{\tau _{a,b}}) |\Theta _{0}=\theta ] \\ & \phantom{:} =\mathbb{P}[\tau _{a}< \tau _{b}|\Theta _{0}=\theta ]\phi \bigl(s(a) \bigr) +\mathbb{P}[\tau _{b}< \tau _{a}|\Theta _{0}=\theta ]\phi \bigl(s(b)\bigr) \\ & \phantom{:} =\frac{s(b)-\theta}{s(b)-s(a)}\phi \bigl(s(a)\bigr)+ \frac{\theta -s(a)}{s(b)-s(a)}\phi \bigl(s(b)\bigr), \end{aligned}$$

where \(\phi :=G\circ s^{-1}\). The above can be maximised with respect to \(a\) and \(b\). Moreover, the dummy variables \(a\) and \(b\) can be replaced by \(a'=s(a)\) and \(b'=s(b)\). Hence

$$\begin{aligned} V(p)&=\sup _{a,b:a\leq p\leq b}J(p;\tau _{a,b}) \\ &=\sup _{a',b':a'\leq \theta \leq b'}\bigg(\frac{b'-\theta}{b'-a'} \phi (a')+\frac{\theta -a'}{b'-a'}\phi (b')\bigg)=:v(\theta ), \end{aligned}$$

and thus \(V(p)=v(s(p))\). The scaled value function \(v(\theta )\) can be characterised as the smallest concave majorant of \(\phi (\theta )=G(s^{-1}(\theta ))\), the scaled payoff function over \(s(\mathcal{J})\), which is defined as an interval with endpoints \(s(a_{\mathcal{J}})\) and \(s(b_{\mathcal{J}})\). The continuation and stopping set associated with the optimal stopping rule are given by \(\mathcal{C}=\{p\in \mathcal{J}:v(s(p))>\phi (s(p))\}\) and \(\mathcal{S}=\{p\in \mathcal{J}:v(s(p))=\phi (s(p))\}\), respectively.

3.2 Decomposition of the sequential optimal stopping problem

In the following two subsections, we discuss heuristically how the value function of (2.3) can be constructed by considering two sub-problems based on the idea of backward induction. The wellposedness conditions as well as a formal verification of optimality are explored in Sect. 4 when we specialise the modelling setup.

3.2.1 Exit problem

Suppose for the moment that the agent has already purchased the asset at some known price level \(q\) via paying \(\lambda q+\Psi \) at some time in the past. Conditionally on this information, the reference point has been fixed at the known constant level \(H:=\lambda q + \Psi +R\). Suppose the current time is labelled as \(t=0\) and the current price of the asset is \(P_{0}=p\). The goal of the agent in the exit problem is to find the optimal time to sell this owned asset to maximise the expected prospect theory value of the sale proceeds relative to the reference point \(H\). If the asset is (ever) sold at time \(\nu \), the utility of gain and loss relative to the reference point is U(γ P ν 1 { ν < } H) after taking the transaction cost on a sale into account. Since the realised utility is increasing in \(P_{\nu}\) and the process \(P\) is nonnegative, there is in general no incentive for the agent to forgo the sale opportunity. Hence heuristically, one can drop the indicator function 1 { ν < } and it is sufficient to consider the objective function \(G_{1}(P_{\nu};H):=U(\gamma P_{\nu}-H)\). The agent then solves the optimal stopping problem

$$\begin{aligned} V_{1}(p; H):=\sup _{\nu \in \mathcal{T}}\mathbb{E} [G_{1}(P_{\nu};H) | P_{0}=p ]=\sup _{\nu \in \mathcal{T}}\mathbb{E} [U(\gamma P_{\nu}-H) | P_{0}=p ] \end{aligned}$$
(3.2)

to determine the optimal time of the asset sale. The value function of the exit problem is then given by \(V_{1}(p; H)=\bar{g}_{1}(s(p); H)\), where \(\bar{g}_{1}=\bar{g}_{1}(\theta ;H)\) is the smallest concave majorant of

$$\begin{aligned} g_{1}(\theta ;H):=G_{1}\big(s^{-1}(\theta ); H\big)=U\big(\gamma s^{-1}( \theta )-H\big). \end{aligned}$$

We write the optimiser to problem (3.2) as \(\nu ^{*}(p;H)\); it depends on the initial price level \(p\) and the given reference point \(H\).

3.2.2 Entry problem

Now we assume the agent does not own any asset to begin with. His objective is to determine the optimal time to purchase (and then to sell) the asset to maximise the expected utility of the liquidation proceeds relative to the endogenised reference point.

At a given current asset price level \(p\), there are two possible actions for the agent. First, he can opt to initiate the speculative trade by buying the asset now, which fixes the reference point as \(\lambda p+\Psi +R\), and then sell it later in the future. When the asset is liquidated at his choice of the sale time \(\nu \), the realised utility is \(U(\gamma P_{\nu}-\lambda p-\Psi -R)\). Conditionally on the decision to purchase the asset today at price \(p\), the agent can find the best time of sale to maximise his expected utility by solving problem (3.2) for \(H=\lambda p + \Psi +R\). Then the best possible expected utility he can attain is

$$\begin{aligned} \sup _{\nu \in \mathcal{T}}\mathbb{E} [U(\gamma P_{\nu}-\lambda p- \Psi -R) | P_{0}=p ]=V_{1}(p; \lambda p+\Psi +R), \end{aligned}$$

provided that he decides to enter the trade at the given price of \(p\).

Alternatively, the agent can forgo the opportunity to enter the trade and stay away from the market forever. In this case, the payoff is zero and the reference point is simply equal to \(R\). The utility he will receive is just the constant \(U(-R)\).

Therefore, the opportunity to enter the speculative trade can be viewed as a real option. At a given price level \(p\), the agent is willing to enter the trade only if the maximal expected utility of trading is not less than that of inaction, i.e., if

$$ V_{1}(p; \lambda p+\Psi +R)\geq U(-R). $$

This is similar to a financial option being in the money. The payoff of this real option to the agent in utility terms as a function of the price level \(p\) is given by

$$\begin{aligned} G_{2}(p):=\max \{V_{1}(p;\lambda p+\Psi +R),U(-R) \}. \end{aligned}$$
(3.3)

The entry problem for the agent is to find the optimal time to initiate the trade so as to maximise the expected value of (3.3). This is equivalent to solving

$$\begin{aligned} V_{2}(p)&:=\sup _{\tau \in \mathcal{T}}\mathbb{E} [G_{2}(P_{\tau}) |P_{0}=p ] \\ & \phantom{:} =\sup _{\tau \in \mathcal{T}}\mathbb{E} [\max \{V_{1}(P_{\tau}; \lambda P_{\tau}+\Psi +R),U(-R) \} |P_{0}=p ], \end{aligned}$$
(3.4)

if the exit problem value function \(V_{1}\) is well defined. We identify \(\bar{g}_{2}=\bar{g}_{2}(\theta )\) as the smallest concave majorant of

$$\begin{aligned} g_{2}(\theta ):=G_{2}\big(s^{-1}(\theta )\big)&=\max \big\{ V_{1} \bigl(s^{-1}(\theta ); \lambda s^{-1}(\theta )+\Psi + R\bigr),U(-R) \big\} \\ &=\max \big\{ \bar{g}_{1}\bigl(\theta ; \lambda s^{-1}(\theta )+\Psi + R \bigr),U(-R)\big\} . \end{aligned}$$

Then the value function of the entry problem is \(V_{2}(p)=\bar{g}_{2}(s(p))\).

Let the optimiser to (3.4) be \(\tau ^{*}(p)\). With \(p\) being the initial price of the asset at \(t= 0\), the agent will purchase the asset at the time \(t=\tau ^{*}(p)\). Then conditionally on the realisation of the entry price level \(P_{\tau ^{*}(p)}\), the agent solves the exit problem (3.2) with initial value \(P_{\tau ^{*}(p)}\) and reference point \(H=\lambda P_{\tau ^{*}(p)}+ \Psi +R\). The corresponding optimiser is given by \(\nu ^{*}(P_{\tau ^{*}(p)};\lambda P_{\tau ^{*}(p)}+ \Psi +R)\) and reflects the time lapse between the initiation and closure of the trade. In particular, the agent will sell the asset at the time \(t=\tau ^{*}(p)+\nu ^{*}(P_{\tau ^{*}(p)};\lambda P_{\tau ^{*}(p)}+ \Psi +R)\). This gives a complete characterisation of the optimal entry and exit strategy of the agent. Note that it is possible to have \(\mathbb{P}[\tau ^{*}(p)<\infty ]<1\). In other words, there is a possibility that the entry strategy is not executed in finite time, and hence there is no decision to sell (see the discussion in Sect. 4).

Intuitively, we expect \(\mathcal{V}(p)=V_{2}(p)\), where \(\mathcal{V}\) is the value function of the original sequential optimal stopping problem (2.3). This must be verified formally. Without any further specifications of the utility function \(U\) and the underlying price process \(P\), however, it is hard to make further progress. For example, it is not even clear upfront whether (2.3) is a well-posed problem with a finite value function.

4 Main results

The procedures described in Sect. 3 are very generic and can guide us to write down a candidate for the value function of the sequential optimal stopping problem under a range of model specifications. To derive stronger analytical results, we specialise in the rest of this paper to the piecewise power utility function of Tversky and Kahneman [27] in the form

$$\begin{aligned} U(x)= \textstyle\begin{cases} x^{\alpha }&\quad \text{for $x>0$}, \\ -k |x|^{\alpha }&\quad \text{for $x\leq 0$}. \end{cases}\displaystyle \end{aligned}$$

Here \(\alpha \in (0,1)\) so that \(1-\alpha \) is the level of risk-aversion and risk-seeking on the domains of gains and losses, and \(k>1\) controls the degree of loss-aversion. Experimental results in [27] give estimates of \(\alpha =0.88\) and \(k=2.25\).

The price process of the risky asset \(P=(P_{t})_{t\geq 0}\) is assumed to be a geometric Brownian motion,

$$\begin{aligned} dP_{t}=P_{t}(\mu dt +\sigma dB_{t}), \end{aligned}$$

with \(\mu \geq 0\) and \(\sigma >0\) being the constant drift and volatility of the asset. Define \(\beta :=1-\frac{2\mu}{\sigma ^{2}}\leq 1\). Then by substituting \(\mu (p)=\mu p\) and \(\sigma (p)=\sigma p\) in (3.1), the scale function of \(P\) can be found as

$$\begin{aligned} s(x)= \textstyle\begin{cases} x^{\beta} &\quad \text{if $\beta > 0$}, \\ \ln x &\quad \text{if $\beta =0$}, \\ x^{-\beta} &\quad \text{if $\beta < 0$}. \end{cases}\displaystyle \end{aligned}$$

Finally, we assume \(R>0\) so that the aspiration level of the agent is always positive. This is not unreasonable since this parameter can be understood as some performance benchmark that an agent wants to outperform, and such a goal is typically positive.

We begin with a necessary condition under which (2.3) is well posed.

Lemma 4.1

If \(\beta \leq 0\) or \(0<\beta <\alpha \), then there exists a sequence of stopping times \((\tau _{n},\nu _{n})_{n=1,2,\dots }\) such that \(J(p;\tau _{n},\nu _{n})\to +\infty \) as \(n \to \infty \), where \(J(p;\,\cdot \,,\,\cdot \,)\) is defined in (2.2).

Proof

Consider the sequence of stopping times defined by

$$\begin{aligned} \tau _{n}:=0,\qquad \nu _{n}:=\inf \{t\geq 0: P_{t} \geq n\}. \end{aligned}$$
(4.1)

If \(\beta \leq 0\), then

$$ P_{t}=P_{0}\exp \biggl(\Big(\mu -\frac{\sigma ^{2}}{2}\Big)t+\sigma B_{t} \biggr)=P_{0}\exp \biggl(\sigma \Big(-\frac{\sigma}{2}\beta t+ B_{t} \Big)\biggr)$$

so that the Brownian motion in the exponent has nonnegative drift. Hence \(P\) can reach any positive level in finite time, and so \(\nu _{n}<\infty \) and \(P_{\nu _{n}}=n\), almost surely. Hence

J ( p ; τ n , ν n ) = E [ U ( γ P ν n 1 { τ n < , ν n < } ( λ P τ n + Ψ ) 1 { τ n < } R ) | P 0 = p ] = U ( γ n λ p Ψ R ) +

as \(n\to \infty \).

If \(\beta >0\), then instead \(P\) cannot reach any arbitrary level above its starting value in finite time and we have \(\lim _{t\to \infty}P_{t}=0\) almost surely. Then \(\{v_{n}< \infty \}=\{P_{\nu _{n}}=n\}\), and for sufficiently large \(n>p\), one can compute that

$$\begin{aligned} \mathbb{P}[\nu _{n}=\infty |P_{0}=p]= \frac{n^{\beta}-p^{\beta}}{n^{\beta}}, \qquad \mathbb{P}[\nu _{n}< \infty |P_{0}=p]=\frac{p^{\beta}}{n^{\beta}}. \end{aligned}$$

Then

$$\begin{aligned} J(p;\tau _{n},\nu _{n})&=\mathbb{P}[\nu _{n}=\infty |P_{0}=p]U\left (- \lambda p-\Psi -R\right ) \\ & \phantom{=:} +\mathbb{P}[\nu _{n}< \infty |P_{0}=p]U\left (\gamma n-\lambda p-\Psi -R \right ) \\ &=-\frac{n^{\beta}-p^{\beta}}{n^{\beta}}k(\lambda p +\Psi +R)^{\alpha}+ \frac{p^{\beta}}{n^{\beta}}(\gamma n-\lambda p-\Psi -R)^{\alpha} \longrightarrow + \infty \end{aligned}$$

as \(n\to \infty \) if \(\beta <\alpha \). □

Mathematically speaking, the sequential optimal stopping problem (2.3) is ill posed under the parameter combination in Lemma 4.1, and its value function diverges to infinity. This arises when the performance of the asset is too good relative to the agent’s risk-aversion level over gains. Equation (4.1) corresponds to a “buy and hold” trading rule as a possible optimal strategy: the agent purchases the asset immediately from the outset, and the profit-target level of sale can be set arbitrarily high.

Remark 4.2

Empirically, historical returns on equities are excessively high relative to their risk level. For example, the annualised mean and standard deviation of the equity premium (i.e., excess return above the risk-free rate) of the U.S. market over the time period 1889–1978 are 6.18% and 16.67%, respectively (see Mehra and Prescott [22]), so that \(\beta =1-2\mu /\sigma \approx -3.45\), while the estimates based on the more recent time period 1950–2015 are 7.15% and 16.83% (see Bai and Lu [1]) so that \(\beta \approx -4.05\). Although this may cast doubt on the practical relevance of the condition \(0<\alpha \leq \beta \) (where buy and hold is an optimal strategy if the condition does not hold), we should like to emphasise that \(\beta \) in general is a noisy statistical quantity which is hard to forecast. In our model, \(\beta \) should be interpreted as the agent’s subjective assessment of the asset performance which can be much more conservative than the historical estimates. The parameter \(\mu \) may also encapsulate subjective discounting which further lowers its value; see Sect. 6.2.

From here onwards, we focus on the case \(0<\alpha \leq \beta \) which is a necessary condition for (2.3) to be well posed. The form of the solutions to the exit problem (3.2) and entry problem (3.4) are first provided, and then we discuss the economic intuition behind the associated trading strategies. Towards the end of this section, the optimality of the value function of the entry problem is formally verified to show that it indeed corresponds to the solution of the sequential optimal stopping problem (2.3).

We first state the solution to the exit problem (3.2); a similar result can be found in Henderson [9].

Lemma 4.3

For the exit problem (3.2), if \(0<\alpha \leq \beta \), the agent will sell the asset when its price level first reaches \(\frac{cH}{\gamma}\) or above, where \(c>1\) is a constant given by the solution to the equation

$$\begin{aligned} \frac{\alpha}{\beta} c(c-1)^{\alpha -1}-(c-1)^{\alpha}-k=0. \end{aligned}$$
(4.2)

The value function is given by

$$\begin{aligned} V_{1}(p; H)= \textstyle\begin{cases} \frac{\alpha}{\beta} c^{1-\beta}(c-1)^{\alpha -1}H^{\alpha -\beta}( \gamma p)^{\beta}-kH^{\alpha} &\quad \textit{for $p< \frac{cH}{\gamma}$}, \\ (\gamma p-H)^{\alpha}&\quad \textit{for $p\geq \frac{cH}{\gamma}$}. \end{cases}\displaystyle \end{aligned}$$
(4.3)

Proof

Recall the notation introduced in Sect. 3.2.1. For \(\beta >0\), the scaled payoff function of the exit problem is given by

$$\begin{aligned} g_{1}(\theta )=g_{1}(\theta ;H)=G_{1}\big(s^{-1}(\theta );H\big)&=U( \gamma \theta ^{\frac{1}{\beta}} -H) \\ &= \textstyle\begin{cases} -k(H - \gamma \theta ^{\frac{1}{\beta}})^{\alpha}&\quad \text{for $0\leq \theta < (\frac{H}{\gamma} )^{\beta}$}, \\ (\gamma \theta ^{\frac{1}{\beta}} -H)^{\alpha}&\quad \text{for $\theta \geq (\frac{H}{\gamma} )^{\beta}$}. \end{cases}\displaystyle \end{aligned}$$

It is straightforward to work out the derivatives of \(g_{1}\) as

$$\begin{aligned} g_{1}'(\theta )= \textstyle\begin{cases} \frac{k\alpha \gamma}{\beta}\theta ^{\frac{1}{\beta}-1}(H - \gamma \theta ^{\frac{1}{\beta}})^{\alpha -1} &\quad \text{for $0\leq \theta < (\frac{H}{\gamma} )^{\beta}$}, \\ \frac{\alpha \gamma}{\beta}\theta ^{\frac{1}{\beta}-1}(\gamma \theta ^{ \frac{1}{\beta}} -H)^{\alpha -1} &\quad \text{for $\theta \geq (\frac{H}{\gamma} )^{\beta}$}, \end{cases}\displaystyle \end{aligned}$$

and

$$\begin{aligned} g_{1}''(\theta )= \textstyle\begin{cases} \frac{k\alpha \gamma}{\beta}\theta ^{\frac{1}{\beta}-2}(H-\gamma \theta ^{\frac{1}{\beta}})^{\alpha -2}( \frac{\gamma (\beta -\alpha )}{\beta}\theta ^{\frac{1}{\beta}}+ \frac{1-\beta}{\beta}H) &\quad \text{for $0\leq \theta < (\frac{H}{\gamma} )^{\beta}$}, \\ \frac{\alpha \gamma}{\beta}\theta ^{\frac{1}{\beta}-2}(\gamma \theta ^{ \frac{1}{\beta}}-H)^{\alpha -2}(\frac{\gamma (\alpha -\beta )}{\beta} \theta ^{\frac{1}{\beta}}-\frac{1-\beta}{\beta}H) &\quad \text{for $\theta \geq (\frac{H}{\gamma} )^{\beta}$}. \end{cases}\displaystyle \end{aligned}$$

Given the standing assumption \(\beta \leq 1\) and the condition \(0<\alpha \leq \beta \), \(g_{1}\) is increasing concave for \(\theta >(\frac{H}{\gamma})^{\beta}\) and increasing convex for \(0\leq \theta <(\frac{H}{\gamma})^{\beta}\). The smallest concave majorant of \(g_{1}\) can be formed by drawing a straight line from \((0,g_{1}(0))\) which touches \(g_{1}\) at some \(\theta ^{*}>(\frac{H}{\gamma})^{\beta}\). In particular, \(\theta ^{*}\) is a solution to \(\frac{g_{1}(\theta )-g_{1}(0)}{\theta}=g_{1}'(\theta )\) for \(\theta >(\frac{H}{\gamma})^{\beta}\), i.e.,

$$\begin{aligned} \frac{\alpha \gamma}{\beta}\theta ^{\frac{1}{\beta}-1}(\gamma \theta ^{ \frac{1}{\beta}}-H)^{\alpha -1}= \frac{(\gamma \theta ^{\frac{1}{\beta}} -H)^{\alpha}+kH^{\alpha}}{\theta}. \end{aligned}$$

We conjecture that the solution has the form \(\theta ^{*}=c^{\beta }(\frac{H}{\gamma})^{\beta}\) for some constant \(c>1\). Then direct substitution shows that the constant \(c\) should solve (4.2). The smallest concave majorant of \(g_{1}\) is then

$$\begin{aligned} \bar{g}_{1}(\theta )&= \textstyle\begin{cases} g_{1}(0)+\theta g_{1}'(\theta ^{*}) &\quad \text{for $0\leq \theta < \theta ^{*}$}, \\ g_{1}(\theta ) &\quad \text{for $\theta >\theta ^{*}$}, \end{cases}\displaystyle \\ &= \textstyle\begin{cases} -kH^{\alpha}+\frac{\alpha}{\beta}H^{\alpha -\beta}c^{1-\beta}(c-1)^{ \alpha -1}\gamma ^{\beta}\theta &\quad \text{for $0\leq \theta < c^{\beta }(\frac{H}{\gamma} )^{\beta}$}, \\ (\gamma \theta ^{\frac{1}{\beta}} -H)^{\alpha}&\quad \text{for $\theta >c^{\beta }(\frac{H}{\gamma} )^{\beta}$}. \end{cases}\displaystyle \end{aligned}$$

The value function is given by \(V_{1}(p;H)=\bar{g}_{1}(s(p))=\bar{g}_{1}(p^{\beta})\), leading to (4.3). The corresponding optimal stopping time is

$$\begin{aligned} \tau &=\inf \left \{t\geq 0:\bar{g}_{1}(\Theta _{t})=g_{1}(\Theta _{t}) \right \} \\ &=\inf \{t\geq 0:\Theta _{t}\geq \theta ^{*} \} =\inf \bigg\{ t\geq 0:P_{t} \geq c \bigg(\frac{H}{\gamma}\bigg)\bigg\} . \end{aligned}$$

 □

The optimal sale strategy is a gain-exit rule where the agent is looking to sell the asset when its price is sufficiently high without considering stop-loss. Note that the gain-exit target \(\frac{cH}{\gamma}\) is increasing in transaction costs (i.e., decreasing in \(\gamma \)). This means that the agent tends to delay the sale decision in a more costly trading environment.

Remark 4.4

Inspired by the literature, we expect that more sophisticated exit strategies can be observed under alternative model setups. For example, the agent will consider stop-loss when the asset has negative drift (see Sect. 6.1 and Henderson [9]) or when there are multiple trading opportunities as in the realisation utility models (see Sect. 6.3.2 as well as Ingersoll and Jin [14] and He and Yang [8]). Introducing jump-diffusion price dynamics will further result in disconnected waiting regions where an agent who has recently suffered from a huge jump loss may refuse to stop-loss (Dai et al. [3] and Kong et al. [17]). We opt to work with a simpler baseline model to make the whole entry-and-exit problem as tractable as possible.

Remark 4.5

Since we require \(\beta >0\), if the initial price of the asset is below the gain-exit target, there is a strictly positive probability that the asset is never sold. Moreover, an agent who fails to sell the asset at his target gain-exit level will suffer a total loss in the long run.

Remark 4.6

The expression of the target gain-exit threshold and in turn the value function of the exit problem are available in closed form, thanks to the specialisation that the degree of risk-aversion over gains is the same as that of risk-seeking over losses. This allows us to make a lot of analytical progress when studying the entry problem. We also lose the closed-form expressions in Lemma 4.3 if a fixed transaction cost on sales is introduced; in this case, the agent will only sell the asset when the utility proceed \(U(\gamma p - H-\Gamma )\) from the sale is larger than that of inaction \(U(-H)\), where \(\Gamma \geq 0\) represents a fixed market exit fee. Then the payoff function of the exit problem becomes

$$ G_{1}(p;H):=\max \{U(\gamma p - H-\Gamma ),U(-H) \}.$$

We now proceed to describe the optimal entry strategy of the agent. The proofs of the two propositions below are given in the Appendix.

Proposition 4.7

Suppose \(\alpha \leq \beta < 1\). For the entry problem (3.4), the value function is given by \(V_{2}(p)=\bar{g}_{2}(p^{\beta})\), where \(\bar{g}_{2}=\bar{g}_{2}(\theta )\) is the smallest concave majorant of

$$\begin{aligned} g_{2}(\theta )&:=\max \{v_{1}(\theta ),-kR^{\alpha} \}:=\max \biggl\{ (R+ \Psi )^{\alpha}f\biggl(\Bigl(\frac{\gamma}{R+\Psi}\Bigr)^{\beta} \theta \biggr),-kR^{\alpha}\biggr\} \end{aligned}$$
(4.4)

with

$$\begin{aligned} f(x):= \frac{\frac{\alpha}{\beta}c^{1-\beta}(c-1)^{\alpha -1}x-k(\frac{\lambda}{\gamma} x^{1/\beta}+1)^{\beta}}{(\frac{\lambda}{\gamma} x^{1/\beta}+1)^{\beta -\alpha}}. \end{aligned}$$
(4.5)

There are three different cases:

(1) If \(\frac{\lambda}{\gamma}\leq (\frac{\alpha}{\beta k}c^{1-\beta}(c-1)^{ \alpha -1})^{\frac{1}{\beta}}\), there exists \(p_{1}^{*}\in [0,\infty )\) such that the agent will enter the trade when the asset price is at or above \(p_{1}^{*}\). The value function is

$$\begin{aligned} V_{2}(p)= \textstyle\begin{cases} \frac{\alpha}{\beta} c^{1-\beta}(c-1)^{\alpha -1}(\lambda p+\Psi +R)^{ \alpha -\beta}(\gamma p)^{\beta } \\ \quad -k(\lambda p+\Psi +R)^{\alpha} & \quad \textit{for $p\geq p_{1}^{*}$}, \\ (\frac{\alpha}{\beta} c^{1-\beta}(c-1)^{\alpha -1}(\lambda p_{1}^{*}+ \Psi +R)^{\alpha -\beta}(\gamma p_{1}^{*})^{\beta}& \\ \quad -k(\lambda p_{1}^{*}+\Psi +R)^{\alpha}+kR^{\alpha} ) \frac{p^{\beta}}{(p_{1}^{*})^{\beta}}-kR^{\alpha} &\quad \textit{for $p< p_{1}^{*}$}. \end{cases}\displaystyle \end{aligned}$$
(4.6)

(2) If \(\frac{\lambda}{\gamma}> (\frac{\alpha}{\beta k}c^{1-\beta}(c-1)^{ \alpha -1})^{\frac{1}{\beta}}\), there exists a constant \(C\in (0,\infty )\) independent of \(\Psi \) and \(R\) such that

(a) if \(\Psi < CR\), there exist \(0\leq p_{1}^{*}< p_{2}^{*}<\infty \) such that the agent will enter the trade when the asset price is within the interval \([p_{1}^{*},p_{2}^{*}]\). The value function is

$$\begin{aligned} V_{2}(p)= \textstyle\begin{cases} \frac{\alpha}{\beta} c^{1-\beta}(c-1)^{\alpha -1}(\lambda p_{2}^{*}+ \Psi +R)^{\alpha -\beta}(\gamma p_{2}^{*})^{\beta}& \\ \quad -k(\lambda p_{2}^{*}+\Psi +R)^{\alpha}&\quad \textit{for $p> p_{2}^{*}$}, \\ \frac{\alpha}{\beta} c^{1-\beta}(c-1)^{\alpha -1}(\lambda p+\Psi +R)^{ \alpha -\beta}(\gamma p)^{\beta}& \\ \quad -k(\lambda p+\Psi +R)^{\alpha}&\quad \textit{for $p_{1}^{*} \leq p \leq p_{2}^{*}$}, \\ (\frac{\alpha}{\beta} c^{1-\beta}(c-1)^{\alpha -1}(\lambda p_{1}^{*}+ \Psi +R)^{\alpha -\beta}(\gamma p_{1}^{*})^{\beta}& \\ \quad -k(\lambda p_{1}^{*}+\Psi +R)^{\alpha}+kR^{\alpha} ) \frac{p^{\beta}}{(p_{1}^{*})^{\beta}}-kR^{\alpha} &\quad \textit{for $p< p_{1}^{*}$}. \end{cases}\displaystyle \end{aligned}$$
(4.7)

(b) if \(\Psi \geq CR\), the agent will never enter the trade. The value function is \(V_{2}(p)=-kR^{\alpha}\). Moreover, in cases (1) and (2)(a), \(p_{1}^{*}=\frac{R+\Psi}{\gamma} (x_{1}^{*})^{1/\beta}\), where \(x^{*}_{1}\) is the smaller solution to the equation

$$\begin{aligned} k =&\bigg(1+\frac{\Psi}{R}\bigg)^{\alpha} \\ &\times \frac{k (\frac{\lambda}{\gamma}x^{\frac{1}{\beta}}+1 )^{\beta }(\frac{\lambda }{\gamma} (1-\frac{\alpha}{\beta} )x^{\frac{1}{\beta}}+1 )-\frac{\alpha}{\beta}c^{1-\beta}(c-1)^{\alpha -1}\frac{\lambda}{\gamma} (1-\frac{\alpha}{\beta} )x^{\frac{1}{\beta}+1}}{ (\frac{\lambda}{\gamma}x^{\frac{1}{\beta}}+1 )^{\beta -\alpha +1}}. \qquad \quad \end{aligned}$$
(4.8)

In case (2)(a), \(p_{2}^{*}=\frac{R+\Psi}{\gamma}(x_{2}^{*})^{1/\beta}\), where \(x_{2}^{*}\) is the unique solution to the equation

$$\begin{aligned} c^{1-\beta}(c-1)^{\alpha -1}\bigg(x^{-\frac{1}{\beta}}+ \frac{\lambda \alpha}{\gamma \beta}\bigg)-\frac{k\lambda}{\gamma} \bigg(x^{-\frac{1}{\beta}}+\frac{\lambda}{\gamma}\bigg)^{\beta}=0. \end{aligned}$$
(4.9)

In the special case \(\beta =1\) where the asset has zero drift, the results are slightly different from those in Proposition 4.7.

Proposition 4.8

Suppose \(\alpha < \beta = 1\). For the entry problem (3.4), we have:

(1) If \(\frac{\lambda}{\gamma}\leq \frac{\alpha}{k}(c-1)^{\alpha -1}\), there exists \(p_{1}^{*}\in [0,\infty )\) such that the agent will enter the trade when the asset price is at or above \(p_{1}^{*}\). The value function is given by (4.6) on setting \(\beta =1\).

(2) If \(\frac{\alpha}{k}(c-1)^{\alpha -1}<\frac{\lambda}{\gamma}< \frac{(c-1)^{\alpha -1}}{k}\), there exists a constant \(C\in (0,\infty )\) independent of \(\Psi \) and \(R\) such that

(a) if \(\Psi < CR\), there exist \(0\leq p_{1}^{*}< p_{2}^{*}<\infty \) such that the agent will enter the trade when the asset price is within the interval \([p_{1}^{*},p_{2}^{*}]\). The value function is given by (4.7) on setting \(\beta =1\).

(b) if \(\Psi \geq CR\), the agent will never enter the trade. The value function is \(V_{2}(p)=-kR^{\alpha}\).

(3) If \(\frac{\lambda}{\gamma}\geq \frac{(c-1)^{\alpha -1}}{k}\), the agent will never enter the trade. The value function is \(V_{2}(p)=-kR^{\alpha}\).

Moreover, \(p_{1}^{*}\) has the same form as in Proposition 4.7on setting \(\beta =1\), and the expression of \(p_{2}^{*}\) is available in closed form as

$$\begin{aligned} p_{2}^{*}= \frac{ (R+\Psi ) ((c-1)^{\alpha -1}-k\lambda /\gamma )}{\lambda (k\lambda /\gamma -\alpha (c-1)^{\alpha -1} )}. \end{aligned}$$

Remark 4.9

For certain types of agents such as retail investors, fixed transaction costs are typically insignificant so that their trading behaviour might be better described by the specialisation \(\Psi =0\). In this case, Propositions 4.7 and 4.8 can be simplified; see Corollary 4.11 and the discussion thereafter.

The value function of the entry problem is characterised as the smallest concave majorant of the payoff function in (4.4). Indeed, \(v_{1}\) defined in (4.4) is simply the scaled value function of the exit problem so that \(v_{1}(\theta )=V_{1}(\theta ^{1/\beta}; \lambda \theta ^{1/\beta}+ \Psi +R)\), as discussed in Sect. 3.2.2. At a mathematical level, the various cases arising in Propositions 4.7 and 4.8 are due to the different possible shapes of \(v_{1}\) under different combinations of parameters. Some illustrations are given in Fig. 1.

Fig. 1
figure 1

The scaled value functions of the entry problem for the different cases of Proposition 4.7. Base parameters used are \(\alpha =0.5\), \(k=2.25\), \(R=1\), \(\beta =0.85\)

Economically, the optimal entry strategy crucially depends on the level of transaction costs relative to the market and preference parameters. A fixed market entry fee in general discourages trading when the asset price is low. Paying a flat fee of $10 to purchase an asset at $20 is much less appealing compared to the case that the asset is trading at $1000, because in the former case the asset has to increase in value by 50% just to break even against the fixed transaction fee paid. Meanwhile, proportional transaction costs are the most significant for an asset trading at a high nominal price. A 10% transaction fee charged on a million worth of property is much more expensive in monetary terms relative to the same percentage fee charged on a penny stock.

In case (1) of both Propositions 4.7 and 4.8, the proportional transaction costs are relatively low. Hence the agent does not mind purchasing the asset at a high nominal price. He will just avoid purchasing the asset when its price is very low due to the consideration of fixed transaction costs, and therefore the purchase region has the form \([p_{1}^{*},\infty )\).

In case (2)(a), proportional transaction costs start becoming significant. On the one hand, the agent avoids initiating the trade when the asset price is too low since the fixed entry cost will be too large relative to the size of the trade. On the other hand, the agent does not want to trade an expensive asset when the proportional costs are large. Upon balancing these two factors, the agent will wait when the asset price is either too low or too high, and will only purchase the asset when the price first enters an interval \([p_{1}^{*}, p_{2}^{*}]\). A very interesting feature of the optimal entry strategy is that the waiting region here is disconnected.

In case (2)(b) of Proposition 4.7, or cases (2)(b) and (3) of Proposition 4.8, the overall level of transaction costs is too high and hence the agent is discouraged from entering the trade in the first place. The key difference between Propositions 4.7 and 4.8 is that when the asset has a strictly positive drift (\(\beta <1\), i.e., \(\mu >0\)), one must impose a strictly positive fixed entry cost in order to stop the agent from trading at all price levels (if \(\Psi =0\), then either case (1) or (2)(a) in Proposition 4.7 applies, in which case the agent is willing to enter the trade at a certain price level). When the asset is a statistically fair gamble (\(\beta =1\), i.e., \(\mu =0\)), a high proportional transaction cost alone is sufficient to discourage the agent from trading. It is interesting to note that the trading decision also depends on the agent’s aspiration level \(R\). Comparing cases (2)(a) and (2)(b) in Propositions 4.7 and 4.8, a low value of \(R\) will more often lead to the “no trading” case. The economic interpretation is that an agent with low aspiration level (i.e., a low target benchmark) is less likely to participate in trading, especially when the (proportional) costs of trading are high. In Sect. 6.4, we briefly discuss how the aspiration level \(R\) may be endogenised.

When viewed in conjunction with the results of wellposedness (Lemma 4.1) and the optimal exit strategy (Lemma 4.3), our model can encapsulate many styles of trading behaviour. First, if \(\beta \leq 0\) or \(\beta <\alpha <1\) so that the problem is ill posed, we have already shown that a simple “buy and hold” strategy of the form (4.1) is optimal in the sense that the attained utility level can be arbitrarily high.

In cases (1) or (2)(a) of Propositions 4.7 and 4.8, if the asset price starts below \(p_{1}^{*}\) at time zero, the agent will purchase the asset when its price level increases to \(p_{1}^{*}\). Note that similarly to Remark 4.5, the price process \(P\) may fail to reach a fixed level \(p_{1}^{*}>P_{0}\) in finite time. In this case, the entry strategy will not be executed and the payoff to the agent is zero. But otherwise, if a purchase is realised, then at the time of purchase, the reference point is set as \(H=\lambda p_{1}^{*}+\Psi +R\). Then due to Lemma 4.3, the agent is looking to sell this asset later when its price level further increases to \(\frac{cH}{\gamma}=\frac{c(\lambda p_{1}^{*}+\Psi +R)}{\gamma}\). Since \(c>1\), \(\gamma \leq 1\) and \(\lambda \geq 1\), it is clear that the target sale level \(\frac{c(\lambda p_{1}^{*}+\Psi +R)}{\gamma}\) is strictly larger than \(p_{1}^{*}\). This trading rule is thus a momentum strategy of the form “buy high, sell higher”.

If the asset price starts above \(p_{2}^{*}\) at time zero in case (2)(a), the agent will buy the asset when its price level drops to \(p_{2}^{*}\) and later sell when the price increases to \(\frac{c(\lambda p_{2}^{*}+\Psi +R)}{\gamma}>p_{2}^{*}\). This is a counter-trend trading strategy of the form “buy low, sell high”.

Finally, in the high transaction cost cases (2)(b) of Proposition 4.7 and (2)(b) or (3) of Proposition 4.8, the agent will never participate in trading at any asset price level.

The various cases above are generated by different levels of transaction costs relative to the other model parameters. The following two corollaries further highlight the role of transaction costs in relationship to the optimal trading strategies.

Corollary 4.10

If \(\lambda =\gamma =1\), the agent will purchase the asset when its price level is at or above \(p_{1}^{*}\) for some \(p_{1}^{*}\in [0,\infty )\).

Proof

The result follows if we can show that \((\frac{\alpha}{\beta k}c^{1-\beta}(c-1)^{\alpha -1})^{ \frac{1}{\beta}}>1\) so that case (1) of Propositions 4.7 and 4.8 always applies when \(\lambda =\gamma =1\). Rewriting the required inequality in equivalent ways gives

$$\begin{aligned} \bigg(\frac{\alpha}{\beta k}c^{1-\beta}(c-1)^{\alpha -1}\bigg)^{ \frac{1}{\beta}}>1 &\iff \frac{\alpha}{\beta }c^{1-\beta}(c-1)^{ \alpha -1}>k \\ &\iff \frac{\alpha}{\beta }c^{1-\beta}(c-1)^{\alpha -1}> \frac{\alpha}{\beta} c(c-1)^{\alpha -1}-(c-1)^{\alpha} \\ &\iff (c-1)^{\alpha}- \frac{\alpha}{\beta }c(c-1)^{\alpha -1}(1-c^{- \beta})>0, \end{aligned}$$

where we have used (4.2). Using simple calculus, we can show that

$$ F(x):=(x-1)^{\alpha}- \frac{\alpha}{\beta }x(x-1)^{\alpha -1}(1-x^{- \beta})>0$$

for all \(x>1\). This concludes the proof. □

Corollary 4.11

If the model parameters satisfy the conditions in cases (1) or (2)(a) in Propositions 4.7and 4.8and if \(\Psi =0\), then we have \(p_{1}^{*}=0\).

Proof

This follows immediately from Propositions 4.7 and 4.8 by observing that \(x= 0\) is the solution to (4.8) when \(\Psi =0\). □

From Corollary 4.10, if there is no proportional transaction cost, then the agent does not care about entering the trade at a high nominal price level because he no longer needs to worry about the large magnitude of the trading fee arising from the proportional nature of the transaction costs. Hence “buy low, sell high” will not be observed as an optimal strategy. Similarly, Corollary 4.11 suggests that in the absence of a fixed market entry fee, the agent is happy to purchase an asset at any arbitrarily low price (in the non-degenerate case) since now he does not need to take the size of the trade into account against any fixed cost for break-even considerations. Thus “buy high, sell higher” will not be an optimal strategy in this special case.

Remark 4.12

If we further assume \(R=\Psi =0\) (where the main results in Propositions 4.7 and 4.8 still hold except that (4.4) will have a different and simpler expression), then only case (1) or case (2)(b) of Proposition 4.7 can arise for \(\beta <1\). Alternatively, only cases (1), (2)(b) or (3) of Proposition 4.8 can arise for \(\beta =1\). But \(p_{1}^{*}=0\) when \(\Psi =0\) as in Corollary 4.11. The entry behaviour is thus trivial—the agent either is willing to purchase the asset at any price level if \(\frac{\lambda}{\gamma}\leq (\frac{\alpha}{\beta k}c^{1-\beta}(c-1)^{ \alpha -1})^{\frac{1}{\beta}}\), or never enters the trade if \(\frac{\lambda}{\gamma}>(\frac{\alpha}{\beta k}c^{1-\beta}(c-1)^{ \alpha -1})^{\frac{1}{\beta}}\). From the modelling perspective, the constant \(R+\Psi \) is a crucial component which enables the model to produce non-trivial purchase behaviour; see Sect. 6.3.2 as well.

The decomposition of the original purchase-and-sale problem (2.3) into two sub-problems of optimal exit and optimal entry is based on some economic heuristics described in Sects. 3.2.1 and 3.2.2. To close this section, we formally show that the value function of the entry problem in Propositions 4.7 and 4.8 indeed corresponds to the value function of the sequential optimal stopping problem (2.3).

Theorem 4.13

Recall the definition of \(V_{2}\) in Propositions 4.7and 4.8as the value function of the entry problem (3.4). We have \(\mathcal{V}(p)=V_{2}(p)\), where \(\mathcal{V}\) is the value function of the sequential optimal stopping problem (2.3). The optimal purchase and sale rules are given by

$$\begin{aligned} \tau ^{*}&:=\inf \{t\geq 0: V_{2}(P_{t})=G_{2}(P_{t}) \}, \\ \nu ^{*}&:=\inf \bigg\{ t\geq \tau ^{*}: P_{t} \geq \frac{c}{\gamma}( \lambda P_{\tau ^{*}}+\Psi +R) \bigg\} . \end{aligned}$$
(4.10)

Proof

Starting from (2.2), we have for any \(\tau ,\nu \in \mathcal{T}\) with \(\tau \leq \nu \) that

J ( p ; τ , ν ) = E [ U ( γ P ν 1 { τ < , ν < } ( λ P τ + Ψ ) 1 { τ < } R ) | P 0 = p ] E [ U ( ( γ P ν λ P τ Ψ ) 1 { τ < } R ) | P 0 = p ] = P [ τ < | P 0 = p ] E [ U ( γ P ν λ P τ Ψ R ) | P 0 = p , τ < ] = : + P [ τ = | P 0 = p ] U ( R ) = E [ E [ U ( γ P ν λ P τ Ψ R ) | P τ , τ < ] | P 0 = p , τ < ] P [ τ < | P 0 = p ] = : + P [ τ = | P 0 = p ] U ( R ) = E [ E [ G 1 ( γ P ν ; λ P τ + Ψ + R ) | P τ , τ < ] | P 0 = p , τ < ] P [ τ < | P 0 = p ] = : + P [ τ = | P 0 = p ] U ( R ) E [ sup ν T : ν τ E [ G 1 ( γ P ν ; λ P τ + Ψ + R ) | P τ , τ < ] | P 0 = p , τ < ] = : × P [ τ < | P 0 = p ] + P [ τ = | P 0 = p ] U ( R ) .

But using the Markovian structure of the problem,

$$\begin{aligned} &\sup _{\nu \in \mathcal{T}:\nu \geq \tau}\mathbb{E} [G_{1}(\gamma P_{ \nu};\lambda P_{\tau}+\Psi +R) | P_{\tau}=s,\tau < \infty ] \\ &=\sup _{\nu \in \mathcal{T}:\nu \geq 0}\mathbb{E} [G_{1}(\gamma P_{ \nu};\lambda P_{0}+\Psi +R) | P_{0}=s ] \\ &=V_{1}(s;\lambda s+\Psi +R). \end{aligned}$$

Then we have

$$\begin{aligned} J(p;\tau ,\nu )&\leq \mathbb{E} [V_{1}(P_{\tau};\lambda P_{\tau}+ \Psi +R) | P_{0}=p,\tau < \infty ] \,\mathbb{P}[\tau < \infty |P_{0}=p] \\ & \phantom{=:} +\mathbb{P}[\tau =\infty |P_{0}=p]U(-R) \\ &\leq \mathbb{E} [\max \{V_{1}(P_{\tau};\lambda P_{\tau}+\Psi +R),U(-R) \} | P_{0}=p ] \\ &\leq \sup _{\tau \in \mathcal{T}}\mathbb{E} [\max \{V_{1}(P_{\tau}; \lambda P_{\tau}+\Psi +R),U(-R) \} | P_{0}=p ] \\ &=\sup _{\tau \in \mathcal{T}}\mathbb{E}[G_{2}(P_{\tau})|P_{0}=p] \\ &=V_{2}(p). \end{aligned}$$

Taking the supremum on both sides leads to \(\mathcal{V}(p)\leq V_{2}(p)\).

To complete the proof, it is sufficient to show \(J(p;\tau ^{*},\nu ^{*})=V_{2}(p)\), where \(\tau ^{*},\nu ^{*}\) are defined in (4.10). This can be directly verified for the various cases covered in Propositions 4.7 and 4.8 with different initial price level \(p\).

As an example, we cover case (2)(a) in Proposition 4.7 and a level \(p< p_{1}^{*}\). We can deduce from the shape of \(V_{2}\) in this case that \(\tau ^{*}=\inf \{t\geq 0: P_{t}\notin (0,p_{1}^{*})\}\). Since \(\beta >0\), there are two possibilities: the asset price reaches the purchase target level \(p_{1}^{*}\) in finite time, which happens with probability \(\mathbb{P}[\tau ^{*}<\infty |P_{0}=p]= \frac{p^{\beta}}{(p_{1}^{*})^{\beta}}\), or it never reaches \(p_{1}^{*}\) in finite time so that the agent never enters a trade and faces a realised utility of \(U(-R)\), which happens with probability \(\mathbb{P}[\tau ^{*}=\infty |P_{0}=p]=1- \frac{p^{\beta}}{(p_{1}^{*})^{\beta}}\). In the first scenario, after the agent purchases the asset at price \(p_{1}^{*}\), he will sell the asset when its price further increases to \(x^{*}:=\frac{c}{\gamma}(\lambda p_{1}^{*}+\Psi +R)\). The conditional probability of a successful sale is \(\mathbb{P}[\nu ^{*}<\infty |P_{\tau ^{*}}=p_{1}^{*}]= \frac{(p_{1}^{*})^{\beta}}{(x^{*})^{\beta}}\), where the realised utility is \(U(\gamma x^{*}-\lambda p_{1}^{*}-\Psi -R)\). Otherwise, the target sale level is never reached with conditional probability \(\mathbb{P}[\nu ^{*}=\infty |P_{\tau ^{*}}=p_{1}^{*}]=1- \frac{(p_{1}^{*})^{\beta}}{(x^{*})^{\beta}}\), and the realised utility becomes \(U(-\lambda p_{1}^{*}-\Psi -R)\). Then we can directly compute

J ( p ; τ , ν ) = E [ U ( γ P ν 1 { τ < , ν < } ( λ P τ + Ψ ) 1 { τ < } R ) | P 0 = p ] = P [ τ = | P 0 = p ] U ( R ) = : + P [ τ < , ν = | P 0 = p ] U ( λ p 1 Ψ R ) = : + P [ τ < , ν < | P 0 = p ] U ( γ x λ p 1 Ψ R ) = ( p 1 ) β p β ( p 1 ) β k R α k p β ( p 1 ) β ( x ) β ( p 1 ) β ( x ) β ( λ p 1 + Ψ + R ) α = : + p β ( p 1 ) β ( p 1 ) β ( x ) β ( γ x λ p 1 Ψ R ) α = p β ( p 1 ) β ( ( p 1 ) β ( x ) β ( γ x λ p 1 Ψ R ) α k ( x ) β ( p 1 ) β ( x ) β ( λ p 1 + Ψ + R ) α ) = : ( p 1 ) β p β ( p 1 ) β k R α = p β ( p 1 ) β ( ( γ p 1 ) β c β ( ( c 1 ) α + k ) ( λ p 1 + Ψ + R ) α β k ( λ p 1 + Ψ + R ) α ) = : ( p 1 ) β p β ( p 1 ) β k R α = p β ( p 1 ) β ( ( γ p 1 ) β α β c 1 β ( c 1 ) α 1 ( λ p 1 + Ψ + R ) α β k ( λ p 1 + Ψ + R ) α ) ( p 1 ) β p β ( p 1 ) β k R α = p β ( p 1 ) β ( ( γ p 1 ) β α β c 1 β ( c 1 ) α 1 ( λ p 1 + Ψ + R ) α β = : k ( λ p 1 + Ψ + R ) α + k R α ) k R α = V 2 ( p ) ,

where we have used the definition \(x^{*}:=\frac{c}{\gamma}(\lambda p_{1}^{*}+\Psi +R)\) and the fact that \(c\) is the solution to (4.2). The other cases can be handled similarly. □

5 Comparative statics of the optimal trading strategies

The critical trading boundaries in Propositions 4.7 and 4.8, although not available in closed form in general, can be analysed to shed light on the comparative statics of the optimal trading strategies with respect to a few underlying model parameters. The proof of the following proposition is given in the Appendix.

Proposition 5.1

Away from case (2)(b) in Proposition 4.7and cases (2)(b) and (3) in Proposition 4.8, we have that

  1. 1)

    \(p_{1}^{*}\) is decreasing in \(\gamma \) and increasing in \(\Psi \);

  2. 2)

    \(p_{2}^{*}\) is decreasing in \(\lambda \), increasing in \(\gamma \) and increasing in \(\Psi \).

Figure 2(b) shows the critical purchase boundaries \(p_{1}^{*}\) and \(p_{2}^{*}\) as \(\gamma \) varies. For very large values of \(\gamma \) such that the condition in case (1) of Proposition 4.7 holds, the optimal strategy is to buy the asset when its price exceeds \(p_{1}^{*}\), and the agent is willing to enter the trade no matter how high the price is. Once \(\gamma \) is smaller than a certain critical value (labelled by the vertical dotted line in the figure), the parameter condition in case (2)(a) of Proposition 4.7 applies. The optimal strategy now becomes to purchase the asset only when its price is within a bounded range \([p_{1}^{*},p_{2}^{*}]\). As \(\gamma \) further decreases, \(p_{1}^{*}\) increases while \(p_{2}^{*}\) decreases so that the purchase region \([p_{1}^{*},p_{2}^{*}]\) shrinks. Once \(\gamma \) reaches another critical value, \(p_{1}^{*}\) and \(p_{2}^{*}\) converge and the purchase region vanishes entirely. This corresponds to case (2)(b) of Proposition 4.7 that the agent will not enter the trade at any price level. As a reminder, the constant \(C\) in case (2) of Propositions 4.7 and 4.8 depends on \(\lambda \) and \(\gamma \). Increasing \(\frac{\lambda}{\gamma}\) will result in a switch from case (2)(a) to case (2)(b).

Fig. 2
figure 2

Comparative statics of the optimal purchase boundaries \(p_{1}^{*}\) and \(p_{2}^{*}\). Base parameters used are \(\alpha =0.5\), \(k=2.25\), \(R=1\), \(\beta =0.85\), \(\lambda =1.05\), \(\gamma =0.95\), \(\Psi =5\)

We do not mention in Proposition 5.1 the effect of \(\lambda \) on \(p_{1}^{*}\). Indeed, \(p_{1}^{*}\) is not monotonic in \(\lambda \) in general; see Fig. 3. Hence when viewed in conjunction with \(p_{2}^{*}\), the purchase region \([p_{1}^{*}, p_{2}^{*}]\) does not necessarily shrink uniformly when the proportional cost on purchases increases, i.e., the agent need not delay the purchase decision. Similar observations regarding potential non-monotonicity of trading decisions with respect to (proportional) transaction costs are made by Hobson et al. [12, 13] in the context of portfolio optimisation.

Fig. 3
figure 3

An illustration that \(p_{1}^{*}\), the lower bound of the purchase region, need not be monotonic with respect to \(\lambda \). Base parameters used are \(\alpha =0.5\), \(k=2.25\), \(\beta =0.85\), \(\lambda =1.05\), \(\gamma =0.95\), \(\Psi =1\)

Similarly, we can also examine the impact of the fixed market entry cost on the purchase decision. As shown in Fig. 2(c), \(p_{1}^{*}\) and \(p_{2}^{*}\) are both increasing in \(\Psi \). The fact that \(p_{2}^{*}\) is increasing in \(\Psi \) is indeed somewhat counter-intuitive and has a few interesting policy implications. Suppose there is a market regulator who wants to discourage the agent from purchasing the asset (for example, as a means to cool down a highly speculative real estate market). A natural measure to curb trading participation is to increase transaction costs. However, Fig. 2 reveals that there is a subtle difference between the impact of proportional and fixed transaction costs on the agent’s trading behaviour.

From Fig. 2(b), the effect of increasing proportional transaction costs on sales (i.e., decreasing \(\gamma \)) is “monotonic” in terms of changing the trading decision of the agent. At any given current asset price level, decreasing \(\gamma \) (while all other parameters are held fixed) can only take the price from the purchase region to the no-trade region. Increasing proportional transaction costs on sales can therefore unambiguously suppress the trading activities in the market.

In contrast, the impact of the fixed market entry cost is somewhat unclear. Take Fig. 2(c) as an example and suppose the current price of the asset is $100. If there is no fixed market entry fee initially (i.e., \(\Psi =0\)), the agent will not participate in trading as the price is in the no-trade region. However, increasing \(\Psi \) from zero to $4 will now put the price in the purchase region so that the agent is willing to purchase the asset immediately (given that the current asset price stays the same at $100). This is exactly opposite to the intended outcome of the market regulator, because the increase in \(\Psi \) actually encourages trading participation.

The rationale behind this phenomenon is as follows. The speculative agent is evaluating the trading opportunity by comparing the potential sale proceeds against the reference point which is partly determined by the initial capital \(\lambda p + \Psi \) required to enter the trade if the purchase price is \(p\). Increasing the fixed market entry fee \(\Psi \) increases the total costs required to initiate the trade and results in a higher effective reference point. However, under a prospect theory framework, the agent’s risk attitude is tied to the level of the reference point. When the purchase price is kept the same, a large \(\Psi \) will put the agent deeper in the loss territory over which he becomes highly risk-seeking. Thus he will give a higher valuation to the opportunity to enter the speculative trade and hence is more prone to immediate trade participation.

Of course, increasing \(\Psi \) will also decrease the potential profit of the trade, which is economically unfavourable. As the fixed cost further increases, say from \(\Psi =4\) to \(\Psi =8\), the price will eventually enter the no-trade region again. Hence the precise effect of \(\Psi \) on the trading decision is ambiguous and governed by the two opposing forces of increasing the agent’s risk appetite versus decreasing profitability. When the economy is consisting of multiple agents with heterogeneous preferences, it is unclear whether increasing the fixed transaction costs can uniformly discourage trading participation for all agents.

The non-monotonicity of \(p_{1}^{*}\) with respect to \(\lambda \), the proportional transaction cost on purchases, also implies that an increase in \(\lambda \) can potentially bring certain prices from the no-trade region to the purchase region. The rationale is the same as above that \(\lambda \) partly determines the cost of purchase and in turn the reference point. Hence increasing \(\lambda \) might actually make an agent find a speculative opportunity attractive.

6 Extensions

In this section, we briefly discuss several variations of our baseline model.

6.1 Risky asset with negative drift and voluntary stop-loss

Among all the non-trivial strategies derived in our baseline setup, the agent will never voluntarily realise a loss. This is not consistent with real world trading behaviour given the prevalent usage among market participants of stop-loss orders. One way to enable the model to generate stop-loss behaviour is to allow the excess return of the asset to be negative as inspired by Henderson [9]. A negative excess return is equivalent to \(\beta :=1-\frac{2\mu}{\sigma ^{2}}>1\) in our setup.

Lemma 6.1

Suppose the model parameters are such that \(\beta >1\). For the exit problem (3.2), the agent will sell the asset when its price level first exits the interval \(\big(\frac{c_{1} H}{\gamma},\frac{c_{2} H}{\gamma}\big)\), where \(0< c_{1}<1<c_{2}\) are the solutions to the system of equations

$$ \frac{k\alpha}{\beta}c_{1}^{1-\beta}(1-c_{1})^{1-\alpha}= \frac{\alpha}{\beta}c_{2}^{1-\beta}(c_{2}-1)^{\alpha -1}= \frac{(c_{2}-1)^{\alpha}+k(1-c_{1})^{\alpha}}{c_{2}^{\beta}-c_{1}^{\beta}}.$$

The value function is

$$\begin{aligned} V_{1}(p;H)= \textstyle\begin{cases} -k(H-\gamma p)^{\alpha}&\quad \textit{for $p< \frac{c_{1} H}{\gamma}$}, \\ -kH^{\alpha}(1-c_{1})^{\alpha}& \\ \quad + \frac{H^{\alpha -\beta} ((c_{2}-1)^{\alpha}+k(1-c_{1})^{\alpha} ) ((\gamma p)^{\beta}-c_{1}^{\beta} H^{\beta} )}{c_{2}^{\beta}-c_{1}^{\beta}}& \quad \textit{for $\frac{c_{1} H}{\gamma}\leq p\leq \frac{c_{2} H}{\gamma}$}, \\ (\gamma p-H)^{\alpha}&\quad \textit{for $p>\frac{c_{2} H}{\gamma}$}. \end{cases}\displaystyle \end{aligned}$$
(6.1)

Proof

This follows from a slight extension of [9, Proposition 3]. □

Given that a purchase of the asset has occurred at some time \(\tau \) which determines the reference level for the exit problem via \(H=\lambda P_{\tau}+\Psi +R\), the agent is willing to impose a stop-loss level at \(\frac{c_{1} H}{\gamma}\) if the asset has a negative drift. Of course, it is not clear upfront whether the agent is willing to purchase an asset with negative drift in the first place. To understand the purchase behaviour, one needs to solve the entry problem

$$ V_{2}(p):=\sup _{\tau \in \mathcal{T}}\mathbb{E} [\max \{V_{1}(P_{ \tau};\lambda P_{\tau}+\Psi +R), U(-R) \} |P_{0}=p ], $$

where \(V_{1}\) is now given by (6.1). Although the principle of the martingale method still applies, we have so far not managed to analyse the problem thoroughly to explicitly characterise different possible shapes of the scaled payoff function under different combinations of model parameters. We hence opt to obtain numerical solutions by solving the underlying variational inequality

$$ \min \bigg\{ -\frac{\sigma ^{2} p^{2}}{2}V_{2}''(p)-\mu pV_{2}'(p),V_{2}(p)-G_{2}(p) \bigg\} =0$$

by the standard projected successive over-relaxation (PSOR) method and infer the optimal purchase (stopping) and no-trade (continuation) regions by numerically identifying the sets \(\{p\geq 0: V_{2}(p)=G_{2}(p)\}\) and \(\{p\geq 0: V_{2}(p)>G_{2}(p)\}\), respectively.

Figure 4(a) shows the optimal entry decisions as \(\beta \) varies. When \(\beta \leq 1\), the optimal purchase boundaries \(p_{1}^{*}\) and \(p_{2}^{*}\) are obtained semi-analytically from Propositions 4.7 and 4.8, while the values for \(\beta >1\) are obtained numerically by the PSOR method. The agent is willing to purchase the asset if and only if the current price level is between \(p_{1}^{*}\) and \(p_{2}^{*}\). As the drift of the asset becomes more and more negative (i.e., \(\beta \) increases), the purchase region \([p_{1}^{*},p^{*}_{2}]\) shrinks and eventually vanishes when \(\beta \) is around 1.4. Beyond this critical level, the agent will not purchase the asset at any price level because of its poor quality.

Fig. 4
figure 4

Optimal purchase and sale decisions as \(\beta \) varies; \(\beta < 1\) (resp. \(\beta >1\)) indicates that the asset has a positive (resp. negative) drift. Base parameters used are \(\alpha =0.5\), \(k=2.25\), \(R=1\), \(\lambda =1.05\), \(\gamma =0.95\), \(\Psi =1\)

In parallel, Fig. 4(b) plots the optimal sale boundaries in the form \(q^{*}_{i}:=\frac{c_{i} }{\gamma}\) so that if the asset has been purchased at level \(p\), it will be sold when its price level leaves the interval \((H q_{1}^{*},H q_{2}^{*})\), where \(H=\lambda p+\Psi +R\). If \(\beta \leq 1\), it is never optimal to voluntarily realise a loss, which is equivalent to \(q_{1}^{*}=0\). But for \(\beta >1\), \(q^{*}_{1}\) becomes strictly positive which represents a stop-loss level. Figure 4 demonstrates that there exist some combinations of model parameters such that the agent is willing to purchase the asset at some price level and subsequently liquidate it at a loss. This happens when \(\beta \) is between 1 and 1.4 in our numerical example. Explicitly characterising conditions on the model parameters where this phenomenon occurs is an interesting research question.

6.2 Subjective discounting

To investigate the effect of impatience on the optimal strategy, it is constructive to consider a version of the problem with discounting. However, to the best of our knowledge, there is no consensus in the literature how discounting should be incorporated within a prospect theory framework with intertemporal cashflows. We briefly present two possible modelling choices.

6.2.1 Profit-discounting

The first idea is that the S-shaped utility function is applied to the net present value of the trading proceed, which we term as “profit-discounting”. We consider the problem

sup τ , ν T : τ ν E[U(γ e δ ν P ν 1 { τ < , ν < } e δ τ (λ P τ +Ψ) 1 { τ < } R)| P 0 =p],

where \(\delta >0\) is the subjective discount factor. If we further assume \(\Psi =0\) (which is perhaps relevant in the context of retail trading where the fixed transaction cost is insignificant), then this problem is the same as the undiscounted one except that the drift of the asset is now replaced by \(\mu -\delta \) under the geometric Brownian motion assumption for the asset price. For small \(\delta \) such that \(\delta \leq \mu \), our baseline results under the standing assumption \(\beta \leq 1\) apply. Otherwise, when \(\delta >\mu \), the analysis becomes similar to the one in Sect. 6.1. Increasing \(\delta \) has the same effect as increasing \(\beta \), where an endowed asset tends to be liquidated sooner (lower gain-exit level and possibly higher stop-loss level) while the purchase region shrinks; see Fig. 5. This result is quite interesting because impatience affects sale and purchase decisions somewhat differently. Increasing impatience will cause an agent who already owns the asset to sell sooner, which is in line with common intuition; but surprisingly, a higher \(\delta \) will also delay the purchase decision. The economic rationale is that an increasing discount rate makes the opportunity to sell the asset less valuable, and hence the agent is more reluctant to enter the trade in the first place.

Fig. 5
figure 5

Optimal purchase and sale decisions as \(\delta \) varies under the profit-discounting criterion. Base parameters used are \(\alpha =0.5\), \(k=2.25\), \(R=1\), \(\lambda =1.05\), \(\gamma =0.95\), \(\Psi =0\), \(\mu =0.025\), \(\sigma =0.5\)

Remark 6.2

If we insist on \(\Psi >0\), then the objective function has an explicit dependence on \(\tau \) which will make the entry problem time-inhomogeneous. Such a problem is more difficult to be analysed analytically or numerically.

6.2.2 Utility-discounting

The second possibility to incorporate discounting is to assume the utility of the round-trip proceed is discounted by a single discount factor evaluated at the liquidation date. We call this approach “utility-discounting”. The problem is formulated as

sup τ , ν T : τ ν E[ e δ ν U( P ν 1 { τ < , ν < } ( λ P τ + Ψ ) 1 { τ < } R)| P 0 =p].
(6.2)

The downside of this approach is that it does not properly take into the account the timing of the cash outflow (incurred at \(\tau \)) and inflow (incurred at \(\nu \)), but an advantage is that this formulation resembles a standard discounted optimal stopping problem.

It turns out that introducing discounting in this fashion will drastically change the agent’s entry behaviour, as summarised by the result below whose proof is given in Appendix A.2.

Proposition 6.3

Suppose \(\delta >0\) and let \(\omega _{1}<0<\omega _{2}\) be the two distinct real roots of the quadratic equation \(\frac{\sigma ^{2}}{2}x^{2}+\big(\mu -\frac{\sigma ^{2}}{2}\big)x- \delta =0\). Under the assumption \(\alpha < \beta \), the pair of stopping times

$$\begin{aligned} \tau ^{*}=0, \qquad \nu ^{*}=\inf \bigg\{ t\geq \tau ^{*}: P_{t}\geq \frac{\omega _{2}(\lambda P_{\tau ^{*}}+\Psi +R) }{\gamma (\omega _{2}-\alpha )} \bigg\} \end{aligned}$$

is optimal for (6.2). In other words, the agent always enters the trade immediately at any price level and subsequently adopts a gain-exit strategy.

Proposition 6.3 applies to the case \(\beta >1\) as well. Unlike profit-discounting, the agent will never stop-loss under utility-discounting even when the asset drift is negative. More remarkably, the entry strategy now becomes trivial, provided that the standing assumption \(\alpha <\beta \) holds. It is also interesting to point out that the optimal strategy is not continuous at \(\delta =0\), in the sense that letting \(\delta \downarrow 0\) in Proposition 6.3 does not recover the no-discounting baseline results in Propositions 4.7 or 4.8. See Fig. 6 for an illustration where we show the optimal sale boundary as a function of \(\delta \) under utility-discounting. Once discounting is applied to the utility term (no matter how small \(\delta \) is), the impact of a poor trading performance in the form of negative utility can be mitigated by indefinitely deferring the realisation of a loss. The agent is then effectively protected from negative outcomes, and there is no downside to taking risk regardless of the asset quality or level of the transaction costs. The agent is impatient so that it is optimal to enter the trade as early as possible, and there is no reason to realise a loss thereafter because it can be discounted away.

Fig. 6
figure 6

Optimal sale decisions as \(\delta \) varies under the utility-discounting criterion, where the agent will sell the endowed asset whenever the price level is at or above \(q^{*}_{2} H\). There is a discontinuity at \(\delta =0\), where the star ∗ marks the optimal threshold for the undiscounted problem. Base parameters used are \(\alpha =0.5\), \(k=2.25\), \(\beta =0.85\), \(\lambda =1.05\), \(\gamma =0.95\), \(\Psi =1\)

In view of the above results, profit-discounting seems to yield more reasonable and flexible predictions of the agent’s optimal trading behaviour. Nonetheless, a proper understanding of the implications behind the two discounting approaches is of interest to the field of behavioural economics.

6.3 Multiple round-trip trades

We have exclusively focused on the case that the agent’s utility is derived from a single round-trip trade. But what if the agent can repeatedly purchase and sell the asset? Similarly to Sect. 6.2, we discuss two possible modelling choices how utility over multiple trades can be computed.

6.3.1 Optimisation of utility over total trading profit

The first possibility is to consider utility derived from the total net present value of the trading proceeds, which is similar to the profit-discounting idea in Sect. 6.2. Suppose the agent can perform \(N\) round-trip trades; then the objective function can be written as

$$\begin{aligned} V(p):=\sup _{\tau _{i},\nu _{i}\in \mathcal{T}:\,i=1,\dots ,N}J\big(p; (\tau _{i})_{i=1,\dots ,N},(\nu _{i})_{i=1,\dots ,N}\big) \end{aligned}$$
(6.3)

with

J ( p ; ( τ i ) i = 1 , , N , ( ν i ) i = 1 , , N ) : = E [ U ( i = 1 N ( γ P ν i 1 { τ i < , ν i < } ( λ P τ i + Ψ ) 1 { τ i < } ) R ) | P 0 = p ] .

Here \((\tau _{i})_{i=1,\dots ,N}\) (resp. \((\nu _{i})_{i=1,\dots ,N}\)) is an increasing sequence of stopping times representing the entry (resp. exit) time of the \(i\)th trade with

$$ 0\leq \tau _{1}\leq \nu _{1}\leq \tau _{2}\leq \nu _{2}\leq \tau _{3} \leq \cdots \leq \tau _{N}\leq \nu _{N}\leq +\infty .$$

In particular, the agent’s utility is now derived from the total trading profits from the \(N\) available round-trip trading opportunities.

The problem can still be approached by a similar backward induction principle which we outline below based on heuristics. Define

V b ( n ) ( p ; Q ) : = sup ( τ i , ν i ) i = 1 , , n E [ U ( i = 1 n ( γ P ν i 1 { τ i < , ν i < } = : ( λ P τ i + Ψ ) 1 { τ i < } ) Q ) | P 0 = p ]

and

V s ( n ) ( p ; H ) : = sup ν 1 , ( τ i , ν i ) i = 2 , , n E [ U ( i = 2 n ( γ P ν i 1 { τ i < , ν i < } ( λ P τ i + Ψ ) 1 { τ i < } ) = : + γ P ν 1 1 { ν 1 < } H ) | P 0 = p ] .

Here \(V^{(n)}_{b}(p;Q)\) represents the value function when there are \(n\) purchase and sale opportunities available under some reference point \(Q\), and \(V^{(n)}_{s}(p;H)\) represents the value function when the agent already owns the asset and there are \(n-1\) purchase and \(n\) sale opportunities available under some reference point \(H\). Then heuristically, we expect from the dynamic programming principle that

V b ( n ) (p;Q)= sup τ 1 E[ 1 { τ 1 = } U(Q)+ 1 { τ 1 < } sup ν 1 , ( τ i , ν i ) i = 2 , , n ϒ| P 0 =p],

where

ϒ : = E [ U ( i = 2 n ( γ P ν i 1 { τ i < , ν i < } ( λ P τ i + Ψ ) 1 { τ i < } ) = : + γ P ν 1 1 { ν 1 < } ( λ P τ 1 + Ψ ) Q ) | F τ 1 ] .

Hence we expect that

V b ( n ) ( p ; Q ) = sup τ 1 E [ 1 { τ 1 = } U ( Q ) + 1 { τ 1 < } V s ( n ) ( P τ 1 ; λ P τ 1 + Ψ + Q ) | P 0 = p ] = sup τ 1 E [ max { V s ( n ) ( P τ 1 ; λ P τ 1 + Ψ + Q ) , U ( Q ) } | P 0 = p ] .
(6.4)

The first equality is due to the definition of \(V^{(n)}_{s}\), and the second is expected to hold because for the optimal stopping problem in (6.4), it is clearly suboptimal to stop the process when \(V^{(n)}_{s}(P_{t};\lambda P_{t}+\Psi +Q)< U(-Q)\) and hence the payoff function can be replaced by \(\max \{V^{(n)}_{s}(P_{\tau _{1}};\lambda P_{\tau _{1}}+\Psi +Q),U(-Q) \}\).

Based on a similar reasoning, we expect for \(n>1\) that

V s ( n ) (p;H)= sup ν 1 E[ 1 { ν 1 = } U(H)+ 1 { ν 1 < } sup ( τ i , ν i ) i = 2 , , n Θ| P 0 =p],

where

Θ:=E [ U ( i = 2 n ( γ P ν i 1 { τ i < , ν i < } ( λ P τ i + Ψ ) 1 { τ i < } ) + γ P ν 1 H ) | F ν 1 ] .

Then we expect that

V s ( n ) ( p ; H ) = sup ν 1 E [ 1 { ν 1 = } U ( H ) + 1 { ν 1 < } V b ( n 1 ) ( P ν 1 ; γ P ν 1 + H ) | P 0 = p ] = sup ν 1 E [ V b ( n 1 ) ( P ν 1 ; γ P ν 1 + H ) | P 0 = p ] .

The last equality holds because \(V^{(n)}_{b}(p;Q)\) is obviously increasing in \(p\) and decreasing in \(Q\) so that \(V^{(n)}_{b}(p;-\gamma p+H)\geq V^{(n)}_{b}(0;H)= U(-H)\) for all \(p\) and \(n\). Finally, we obviously have

V s ( 1 ) ( p ; H ) = sup ν E [ U ( γ P ν 1 { ν 1 < } H ) | P 0 = p ] = sup ν E [ U ( γ P ν H ) | P 0 = p ] .

In summary, \(V^{(n)}_{s}\) and \(V^{(n)}_{b}\) should satisfy the recursion

V s ( n ) ( p ; H ) = sup ν E [ U ( γ P τ H ) | P 0 = p ] , n = 1 , V b ( n ) ( p ; Q ) = sup τ E [ max { V s ( n ) ( P τ ; λ P τ + Ψ + Q ) , U ( Q ) } | P 0 = p ] , n 1 , V s ( n ) ( p ; H ) = sup ν E [ V b ( n 1 ) ( P ν ; γ P ν + H ) | P 0 = p ] , n 2 .

The required value function for (6.3) with \(N\) round-trip trading opportunities is given by \(V^{(N)}_{b}(p;R)\). A formal verification of the above assertion as well as a thorough analysis of this recursive problem is beyond the scope of the current paper.

6.3.2 Optimisation of total utilities from individual trading episodes

One may also assume that a burst of utility is derived upon completion of each round-trip trade, i.e., the objective function with \(N\) round-trip trading opportunities is

J ( p ; ( τ i ) i = 1 , , N , ( ν i ) i = 1 , , N ) : = E [ i = 1 N U ( γ P ν i 1 { τ i < , ν i < } ( λ P τ i + Ψ ) 1 { τ i < } R ) | P 0 = p ] .

Unlike the criterion in Sect. 6.3.1 where a single utility is derived from the total profit of all trades, utility is now realised upon completion of each round-trip trade and the agent’s goal is to optimise the sum of those utilities. Define

V b ( n ) ( p ) : = sup ( τ i , ν i ) i = 1 , , n E [ i = 1 n U ( γ P ν i 1 { τ i < , ν i < } ( λ P τ i + Ψ ) 1 { τ i < } R ) | P 0 = p ]

and

V s ( n ) ( p ; H ) : = sup ν 1 , ( τ i , ν i ) i = 2 , , n E [ U ( γ P ν 1 1 { ν 1 < } H ) = : + i = 2 n U ( γ P ν i 1 { τ i < , ν i < } = : ( λ P τ i + Ψ ) 1 { τ i < } R ) | P 0 = p ] .

Here \(V^{(n)}_{b}(p)\) is the value function with \(n\) purchase and sale opportunities remaining, and \(V^{(n)}_{s}(p;H)\) represents the value function when agent has an endowed asset (with some given reference point \(H\) for the first trading episode) and there are \(n-1\) purchase and \(n\) sale opportunities remaining.

Based on the dynamic programming principle, one can heuristically write down the recursive system satisfied by the value functions as

$$\begin{aligned} V^{(1)}_{s}(p;H)&:=\sup _{\nu}\mathbb{E} [U(\gamma P_{\nu}-H) | P_{0}=p ], \end{aligned}$$
(6.5)
$$\begin{aligned} V^{(n)}_{b}(p)&:=\sup _{\tau}\mathbb{E}\big[\max \{V^{(n)}_{s}(P_{ \tau};\lambda P_{\tau}+\Psi +R),n U(-R)\}\big| P_{0}=p\big], \end{aligned}$$
(6.6)
$$\begin{aligned} V^{(n)}_{s}(p;H)&:=\sup _{\nu}\mathbb{E} \big[U(\gamma P_{\nu}-H)+V^{(n-1)}_{b}(P_{ \nu}) \big| P_{0}=p\big], \end{aligned}$$
(6.7)

where (6.6) holds for \(n\geq 1\) and (6.7) holds for \(n\geq 2\). For the “payoff function” in (6.7), the first term \(U(\gamma P_{\nu}-H)\) represents the utility burst when the endowed asset is sold, and the second term \(V_{b}^{(n-1)}(P_{\nu})\) reflects the maximal sum of expected utilities from the remaining \(n-1\) round-trip trading opportunities. Otherwise, if the agent does not own the asset to begin with and decides to purchase at time \(\tau \), the maximal expected utility attainable is given by \(V^{(n)}_{s}(P_{\tau};\lambda P_{\tau}+\Psi +R)\), where a new trading episode is initiated with reference point set to \(H=\lambda P_{\tau}+\Psi +R\) (the total cost of purchase at time \(\tau \) plus the exogenous aspiration level). But the agent can also choose not to purchase at all and forgo all the \(n\) remaining trading opportunities. This will result in a payoff of \(U(-R)\) for each trading opportunity given up. Hence in (6.6) the “payoff function” for the entry problem is given by \(\max \{V^{(n)}_{s}(P_{\tau};\lambda P_{\tau}+ \Psi + R),n U(-R) \}\). When \(n=1\), (6.5) and (6.6) agree with the sequential optimal stopping problem deduced in Sect. 3.

This generalisation is conceptually close to the realisation utility model in the literature. The canonical formulation (in our notation) of such a model is

$$\begin{aligned} \sup _{(\tau _{i},\nu _{i})_{i=1,2,3,\dots}}\mathbb{E}\bigg[ \sum _{i=1}^{ \infty }e^{-\delta \nu _{i}}U (G_{\nu _{i}-},Q_{\nu _{i}-} ) \bigg], \end{aligned}$$

where \(Q=(Q_{t})_{t\geq 0}\) is the reference point process so that \(Q_{t}\) is the benchmark to be used for performance evaluation at time \(t\), and \(G=(G_{t})_{t\geq 0}\) is the gain-and-loss process with \(G_{t}:= \gamma P_{t} - Q_{t}\) representing the size of realised gain-and-loss if the agent liquidates an owned asset at time \(t\). Typically, the function \(U(G,Q)\) is assumed to be homogeneous in \(Q\) so that \(U(G,Q)=Q^{\eta}u(G/Q)\) for some \(\eta \in (0,1]\) and \(u(\,\cdot \,)\) is S-shaped.

There are many choices for the reference point process \(Q\). Ingersoll and Jin [14] consider \(Q_{t}= P_{\tau _{i}}\) for \(t\in [\tau _{i},\tau _{i+1})\) (up to a constant multiplier), which is simply the most recent purchase price of the asset. Barberis and Xiong [2] and Dai et al. [3] take \(Q_{t}= P_{\tau _{i}} e^{r(t-\tau _{i})}\) for \(t\in [\tau _{i},\tau _{i+1})\), which is the most recent purchase price growing at the risk-free rate. He and Yang [8] incorporate an additional term which asymmetrically adapts to the paper gain-and-loss. Kong et al. [17] study a path-dependent reference point which is a weighted average of the asset prices throughout the current trading episode. A common feature of the papers cited above is that \(Q_{t}\) is proportional to \(P_{\tau _{i}}\) over \(t\in [\tau _{i},\tau _{i+1})\), and hence a dimension reduction is possible via introducing the new state variable \(X_{t}:=P_{t}/Q_{t}\). This greatly simplifies the entry problem, but it also trivialises the optimal strategy—one either immediately enters the trade again after a sale or never enters the trade in the first place. This observation remains the same even if one introduces additional modelling elements such as a Poisson random termination time and an extra utility term over final wealth; see Proposition 3.4 of He and Yang [8].

Our formulation can be seen as a version of the realisation utility model with a finite number of trading opportunities, where the reference level process is (see Remark 6.4 below as well)

$$\begin{aligned} Q_{t}:= \textstyle\begin{cases} \lambda P_{\tau _{i}} + \Psi +R &\quad \text{for $t\in [\tau _{i},\nu _{i})$,} \\ R &\quad \text{for $t\in [\nu _{i},\tau _{i+1})$,} \end{cases}\displaystyle \end{aligned}$$
(6.8)

the gain-and-loss process is G t :=γ P t 1 { t < } Q t , the utility function is given by \(U(G,Q)=Q^{\alpha}u(G/Q)\) with u(x):= x α 1 { x 0 } k | x | α 1 { x < 0 } , and the discount rate \(\delta \) is set to zero. The most important distinction of our framework from the existing realisation utility models is that our reference point consists of a constant component \(\Psi +R\) reflecting a fixed transaction cost and some baseline aspiration level. Without this component, the reference level over a particular trading episode is always proportional to the asset value at the beginning of the episode. Specifically, if \(\Psi =R=0\), then using (6.5)–(6.7), the scaling property of \(U\) and the geometric Brownian motion assumption of \(P\), we can inductively deduce for all \(n\) that \(V^{(n)}_{s}(p;H)=H^{\alpha}V(\frac{p}{H};1)\) and

$$\begin{aligned} V^{(n)}_{b}(p)&=\sup _{\tau}\mathbb{E}\big[\max \{V^{(n)}_{s}(P_{\tau}; \lambda P_{\tau}),0\}\big|P_{0}=p\big] \\ &=\sup _{\tau}\mathbb{E}\big[\max \{\lambda ^{\alpha} P_{\tau}^{ \alpha}V^{(n)}_{s}(1/\lambda ;1),0\}\big|P_{0}=p\big]. \end{aligned}$$

If \(V^{(n)}_{s}(1/\lambda ;1)<0\), then \(\tau \equiv +\infty \) is optimal and the associated value function for the entry problem is \(V^{(n)}_{b}(p)=0\). If instead \(V^{(n)}_{s}(1/\lambda ;1)>0\), then we have \(V^{(n)}_{b}(p)=\lambda ^{\alpha}V^{(n)}_{s}(1/\lambda ;1)\sup _{\tau} \mathbb{E}[P_{\tau}^{\alpha}|P_{0}=p]=\lambda ^{\alpha}V^{(n)}_{s}(1/ \lambda ;1)p^{\alpha}\) under the standing assumption \(\alpha \leq \beta \). The corresponding optimal entry strategy is \(\tau \equiv 0\).

The above observations in conjunction with our main theoretical results for \(n=1\) suggest that incorporating a constant component within the dynamic reference point (e.g. in the form of a fixed transaction cost or a default aspiration level) might enable a realisation utility model to generate more realistic and non-trivial purchase behaviours. For the sale decision, recall that in the baseline model with \(n=1\), the optimal sale strategy is a simple gain-exit rule (Lemma 4.3). We expect this to change when \(n>1\), since the effective payoff function of the exit problem (6.7) now contains an additional term \(V^{(n-1)}_{b}(p)\) which will drastically change the convexity/concavity of the scaled exit payoff function. We leave the full analysis of this problem for future work.

Remark 6.4

At first sight, (6.8) looks more complicated than the models proposed in the existing literature because our value of \(Q_{t}\) depends on whether the agent is inside a trading episode holding the asset (\(t\in [\tau _{i},\nu _{i})\) for some \(i\)) or outside a trading episode without any asset (\(t\in [\nu _{i},\tau _{i+1})\) for some \(i\)). From an economic point of view, the reference point should not depend on \(P_{\tau _{i}}\) any more once the \(i\)th trading episode is complete, and hence should be reset to the baseline aspiration level until the start of the \((i+1)\)th trading episode. In the special case \(\tau _{i+1}=\nu _{i}\) for all \(i\) so that there is no time delay between the exit of an existing trade and the entry of a new trade, (6.8) simplifies to \(Q_{t}=\lambda P_{\tau _{i}}+\Psi +R\) for \(t\in [\tau _{i},\tau _{i+1})\), which resembles the usual definition in the literature. In the absence of a constant component within the reference point process, the entry decision is trivial as \(\tau _{i+1}=\nu _{i}\) for all \(i\) is indeed optimal because of the scaling property discussed previously; so there is no need to “correctly” specify \(Q_{t}\) over \(t\in [\nu _{i},\tau _{i+1})\). But this result is no longer true when \(R\) or \(\Psi \) is non-zero, and hence it is necessary to model the reference point process more carefully.

6.4 Endogenous aspiration level

Among all the model parameters, the “aspiration level” \(R\) of the agent is the hardest one to be interpreted and estimated. It can be a purely psychological parameter representing the agent’s subjective threshold which distinguishes gains and losses. In some applications such as delegated portfolio management, \(R\) can also be the performance target imposed on the agent by a manager. Instead of calibrating \(R\) where the exercise can be very context-specific, one may seek to endogenise this parameter by introducing a further optimality criterion.

Consider a principal–agent setup as an example. The principal imposes an aspiration level on the agent in the form of a performance target. For a given \(R\), the agent’s optimal trading rule \((\tau ^{*}(R),\nu ^{*}(R))\) can be obtained by solving (2.3). If the principal has a utility function \(\tilde{U}(\,\cdot \,)\) over the trading profit, then a particular choice of \(R\) will bring the principal an expected utility level of

V ˜ (R):=E[ U ˜ (γ P ν ( R ) 1 { τ ( R ) < , ν ( R ) < } (λ P τ ( R ) +Ψ) 1 { τ ( R ) < } )| P 0 =p],

where we have suppressed the dependence of \(\tilde{V}\) on \(p\). The above can be maximised (numerically) with respect to \(R\), where the solution in general depends on the initial asset price \(P_{0}\). In Fig. 7, we consider a risk-averse principal with utility function \(\tilde{U}(x)=-e^{-\eta x}\), where \(\eta >0\) is the constant absolute risk-aversion level and \(\tilde{V}(R)\) can be maximised at some interior \(R\) in this particular example. The higher the Sharpe ratio of the asset (equivalent to a lower level of \(\beta \)), the higher the level of endogenous aspiration. In other words, a more aggressive performance goal is set in a bullish market. However, there are also examples where the principal’s maximisation problem is degenerate (e.g. \(\tilde{V}(R)\) being monotonically increasing or decreasing in \(R\)). We leave the complete analysis of such principal–agent problems for future research.

Fig. 7
figure 7

The principal’s certainty equivalent \(-\ln (- \tilde{V}(R))/\eta \) as a function of \(R\) when \(\tilde{U}(x)=-e^{-\eta x}\). Base parameters used are \(\alpha =0.5\), \(k=2.25\), \(\lambda =1.05\), \(\gamma =0.95\), \(\Psi =1\), \(\eta =0.01\), \(P_{0}=30\)

An alternative consideration to endogenise \(R\) is to modify the agent’s utility function such that a round-trip profit of \(x\) leads to a utility value of \(U(x-R,R)\), where \(U(\,\cdot \,,\,\cdot \,)\) is increasing in the first and decreasing in the second argument. The second argument of \(U\) can reflect the agent’s desire for “self-improvement” and “self-enhancement” which is achieved by choosing a high aspiration level \(R\); see for example Falk and Knell [7]. The optimal \(R\) can then be determined alongside with the agent’s optimal trading rule.

7 Concluding remarks

This paper considers a dynamic trading model under prospect theory preference with transaction costs. By solving a sequential optimal stopping problem, we find that the optimal trading strategy can have various forms depending on the model parameters and the price level of the asset. The impact of transaction costs is subtle. In contrast to conventional wisdom, increasing transaction costs does not necessarily deter economic agents from trading participation because the agents may face a higher reference point and in turn be more risk-aggressive in an expensive trading environment. These results could potentially be useful to policy makers to better understand how undesirable speculative trading behaviour in certain markets can be curbed effectively.

Our key mathematical results are derived under a somewhat stylised modelling specification. In particular, asymmetry of the degree of risk-aversion/seeking over gains/losses, a fixed transaction cost on sales and a negative aspiration level are currently omitted from the baseline analysis. While these omissions allow us to derive a sharp characterisation and comparative statics of the optimal trading rules, it will nonetheless be constructive to extend the model to further examine the impact of other economic factors. Section 6 also highlights a number of possible extensions for further rigorous mathematical analysis in the presence of a negative drift, subjective discounting, repeated trading opportunities and an endogenous aspiration level. In particular, a natural extension is to further explore the implications of our results for the literature of realisation utility, where we believe that a different specification of the reference point process (e.g. incorporation of a constant component) can lead to more realistic predictions of purchase behaviour.

A more ambitious goal is to further incorporate probability weighting within our continuous-time optimal stopping model (as in Xu and Zhou [28] and Henderson et al. [10]) to fully reflect the features of the cumulative prospect theory framework of Tversky and Kahneman [27]. However, technical difficulties are likely to arise due to the time-inconsistency brought by probability weighting. A precise formulation of the problem as well as the development of appropriate mathematical techniques should be an another interesting proposal for future research.

Finally, the surprising comparative statics documented in this paper also reveal that the economic interaction between market frictions and behavioural preferences can be subtle or even counter-intuitive. The model considered in this paper is just one of the many possible motivations that these two topics should not be studied in isolation, and this could potentially open up a new strand of literature to advance our understanding towards trading behaviour in a more realistic setup.