Markov Perfect Equilibria in Multi-Mode Differential Games with Endogenous Timing of Mode Transitions

We study Markov perfect equilibria (MPE) of two-player multi-mode differential games with controlled state dynamics, where one player controls the transition between modes. Different types of MPE are characterized distinguishing between delay equilibria, inducing for some initial conditions mode switches after a positive finite delay, and now or never equilibria, under which, depending on the initial condition, a mode switch occurs immediately or never. These results are applied to analyze the MPE of a game capturing the dynamic interaction between two incumbent firms among which one has to decide when to extend its product range by introducing a new product. The market appeal of the new product can be influenced over time by both firms through costly investments. Under a wide range of market introduction costs a now or never equilibrium coexists with a continuum of delay equilibria, each of them inducing a different time of product introduction.


Introduction
The main agenda of this paper are to improve our understanding of strategic effects arising in dynamic economic and managerial settings characterized by potential structural breaks, which induce jumps in the payoff functions of the economic actors, in the law governing the dynamics of relevant state variables, or both. Areas of application where such structural breaks, which we will refer to as mode changes, arise include environmental economics dealing with potential catastrophic transitions (e.g., [8]), financial portfolio and real investment problems incorporating potential crashes and macroeconomic regime shifts (e.g., [7,13]) or analyses of innovation dynamics capturing abrupt changes in the market structure due to the adoption of new technologies or the introduction of new products by some market participants ( [1,3,4]). Whereas all these different problems can be formulated as multi-mode models, crucial differences arise with respect to the way mode transitions are triggered. In particular, the timing and the type of mode transition might be deterministic or stochastic and might be purely exogenous or directly, respectively, indirectly (through the state dynamics) controlled by some economic agent(s).
In this paper, we focus on multi-mode settings with deterministic, controlled mode transitions and strategic interaction. In particular, we are interested in characterizing Markov perfect equilibria (MPE) in multi-mode differential games with a finite number of continuously evolving states, the dynamics of which are controlled by the actions of all players, and a set of modes, where the time of the transition between the modes is determined by one of the players; for easier exposition we assume this is player 1. Problems of this kind arise, for example, in dynamic competition models, like capital accumulation games or dynamic models of reputation formation, where one of the competitors through the introduction of new products or technologies to the market can change the demand structure. Whereas the literature on MPE in timing games with stochastic un-controlled state dynamics is large and well established (see, e.g., [9,10,17]), there is considerably less work dealing with situations with controlled mode transitions where the players can also influence the state dynamics. Intuitively, characterizing MPE in such a setting generates intricate strategic effects. The Markovian strategy determining the time of the mode switch induces a split of the state space in a given mode in regions with and without an immediate jump to an alternative mode. 1 In a setting where all players through their controls can influence the state dynamics, this implies that each player might influence the timing of the mode switch and this has to be taken into account when determining the optimal strategies. At the same time, player 1, when determining the optimal mode switching strategy, has to take into account the state dynamics under the equilibrium strategies of all players.
Few contributions have addressed these issues. Reddy et al. [15] and Gromov and Gromova [6] study multi-mode games with controlled state dynamics, where, however, the mode transition is not directly controlled by players but determined by state or time constraints. Hence, players can influence the timing of the mode transition only indirectly by steering the state variable through their control. In contrast to these papers, we consider a scenario where the timing of the mode transition is directly controlled by one of the two players and additionally both players can influence the dynamics of a continuously evolving state variable. Closest to our paper is Long et al. [14], where in a differential game model with multiple regimes, the concept of piecewise-closed-loop Nash equilibria (PCNE) is introduced. They consider a two-player multi-mode differential where both players can induce a change of the regime of the game and study piecewise-closed-loop Nash equilibria (PCNE). Under this equilibrium concept the state at which a player carries out a mode switch is derived from the condition, that it is optimal for the corresponding player to switch at that point, and the timing of the mode switches is determined as the point in time when the state variable under the equilibrium controls arrives at that switching state. However, in their setting, it is assumed that firms commit to their switching time in the sense, that they do not alter that time even if the other firm would deviate from its equilibrium control path and hence also the state variable would deviate from its equilibrium path. Therefore, the considered equilibrium is not fully Markov perfect with respect to the timing decision. Also in Dawid and Gezer [2], a multi-mode differential game with controlled state dynamics and mode switches is considered, but again the strategic interactions arising under Markovian strategies are not fully captured. In particular, the authors consider a strategy space where the players' controls influencing the state dynamics are determined by Markovian strategies, whereas the time of the regime switch is determined with full commitment at time zero and hence can be seen as following an open-loop strategy. To our best knowledge so far, no characterizations of full MPE in a setting similar to that considered in this paper are available in the literature.
The innovation of this paper relative to the literature discussed above is that we characterize a (simultaneous move) Nash equilibrium of multi-mode differential games where both the controls of the players steering the state variable and the decision, whether a mode switch occurs at given point in time, are determined by Markovian strategies and therefore depend solely on the current value of the state and the mode. Put differently, in contrast to the existing literature, we do not assume any kind of commitment of player 1 with respect to the timing of a mode switch. Since we consider an infinite horizon problem, where objective functions and state dynamics do not explicitly depend on time, these equilibrium profiles are also Markov Perfect. The equilibrium strategy profiles therefore correspond to Nash equilibrium behavior for any combination of values of the state and mode and this in particular implies, that, whenever player 1 triggers a mode switch, this change is (intertemporally) optimal for player 1 in light of the current state.
In light of the complexity associated with the characterization of the interplay of Markovian strategies determining controls that steer the state variable and such that determine the timing of mode switches, in this paper we restrict attention to a relative simple setting with two players, two modes and a one-dimensional state space in order to obtain first insights into the structure of MPE in multi-mode games with controlled states and mode switches. This is done mainly for reasons of tractability and easier exposition. Moreover, we believe that the main strategic effects occurring in such a setting and the types of equilibria arising can already be seen in such a relatively simple setup. We derive a set of sufficient conditions for a strategy profile to be a MPE of the game and, based on these conditions, identify different possible types of equilibria. There are delay equilibria, with the property that for some initial conditions the mode switch occurs after some finite delay, and now or never equilibria, under which the state space is partitioned in two areas with the property that for initial conditions in one area the game stays in the first mode forever, whereas for initial conditions in the other area player 1 already at t = 0 induces the switch to the second mode. Furthermore, we show that among the delay equilibria only for a special class of MPE, labeled as maximum delay equilibria, a standard smooth pasting condition holds for player 1. Such equilibria always exist if the game has any delay equilibria, but generically in addition to a maximum delay equilibrium there also exists a continuum of forced switch delay equilibria in which the mode transition is triggered earlier compared to the maximum delay equilibrium. For these equilibria, no smooth pasting conditions hold at the mode switching threshold. Intuitively, the mode switch by player 1 is triggered by the fact that, if the state would cross the switching threshold, while the game is in the initial mode, the action of player 2 would jump to an action with strong adversarial effects for player 1. Since under such an equilibrium strategy profile the game never stays for a positive amount of time at a state above the threshold in the first mode, any action of player 2 in this part of the state space and the first mode can be supported in a MPE. These arguments show that in the considered settings the fact that a strategy profile constitutes a Markov perfect equilibrium does not prevent the occurrence of what could be described as 'incredible threats' in the sense that a player's strategy prescribes an action which would be suboptimal for the player to implement if forced to do so for a time interval with positive measure.
We illustrate these findings by studying the optimal timing of new product introduction for a producer (firm 1) facing on an established market a competitor (firm 2), which has not developed a new product yet and therefore does not have the option of a new product introduction. The new product introduction corresponds to a mode change, and the state variable of the considered differential game is the appeal of the new product with consumers, which is influenced by the two firms through advertising (firm 1) and negative campaigning (firm 2). 2 Furthermore, the market introduction of the new product is associated with lumpsum costs. We show that for a wide range of values of the market introduction costs all different types of equilibria mentioned above coexist, and therefore, for a substantial subset of initial values of the new product's market appeal no clear cut prediction about the time of market introduction of the new product is possible. Furthermore, we determine a threshold such that for values of the market introduction costs above this value only now or never equilibria exist. Overall, this analysis highlights that multi-mode timing games of this kind almost generically give rise to multiplicity of Markov perfect equilibria. A potential implication of this insight is that more refined equilibrium concepts than MPE should be considered to analyze such games.
The remainder of the paper is organized as follows. In Sect. 2, we introduce the type of multi-mode games we are considering, and in Sect. 3, we derive necessary conditions for MPE in such a setting and also some additional results on equilibrium profiles. In Sect. 4, we study the optimal timing of new product introduction, thereby illustrating our general findings. A discussion of our results and conclusions is provided in Sect. 5. 'Appendix' contains all proofs and some additional analysis.

The Model
We consider a differential game between two players i = 1, 2, in which each player intends to maximize an infinite horizon discounted payoff stream of the form where x ∈ R is a one-dimensional state and m(t) ∈ {m 1 , m 2 } is the mode of the game.
The interval X = [x l , x u ] with x l < x u is the state-space of the game. 3 Here, 1 denotes the indicator function and τ = inf{t ≥ 0|m(t) = m 2 } is the point in time in which the mode process moves from m 1 to m 2 . In case no transition to m 2 occurs, we set τ = ∞. At time τ , transition costs of κ ≥ 0 arise for player 1. The vector u = (u 1 , u 2 ) denotes the controls of both players with u i ∈ U i ⊆ R n . We assume that F i (x, u, m) is continuous and differentiable with respect to x and u for each m ∈ {m 1 , m 2 }. The state evolves according tȯ where f (x, u, m) is Lipschitz continuous and differentiable with respect to x and u for all m ∈ {m 1 , m 2 }. The mode process m(t) initially is in m(0) = m 1 and is controlled by player 1. More precisely, player 1 can determine at which point in time the process jumps from m 1 to m 2 . Once m(t) = m 2 , no additional mode transitions are possible. 4 In what follows, we restrict attention to time-invariant Markovian strategies. More precisely, we consider strategy profiles (( 1 (x, m) , m(t)), i = 1, 2 for all t and the process m(·) jumps from m 1 to m 2 at t if and only if m(t) = m 1 and x(t) ∈ . Intuitively, the switching set , which is chosen by player 1, determines the set of states at which player 1 induces a switch from m 1 to m 2 and hence represents a Markovian strategy of that player. 5 A Markov perfect equilibrium of the (simultaneous move) differential game is a strategy profile such that each player solves the dynamic optimization problem defined by (1) and (2) given that the other player determines her control using her equilibrium strategy.

Markov Perfect Equilibria
In this section, we derive necessary conditions to be satisfied by a Markov Perfect equilibrium profile of the problem described in Sect. 2. The fact that one of the players controls the transition from mode m 1 to m 2 implies that, in addition to standard conditions characterizing the value functions and optimal strategies in each of the two modes, the effect of the endogenous timing of the mode transition has to be taken into account. Intuitively, this means that on the one hand the time of the mode transition has to be optimal for player 1, thereby fulfilling standard conditions for optimal stopping problems. On the other hand, for a given mode switching strategy of player 1, the opponent player 2 can also influence the dynamics of x, and thereby the time of the mode switch, by choosing her controls in mode m 1 . The interplay of these effects has to be taken into account when characterizing a MPE profile. In what follows, we focus on equilibria in which the mode switching set is of threshold type, i.e., it has the form = [x, x u ], where player 1 chooses the value ofx. 6 We denote such a threshold strategy characterized byx as x . The following set of sufficient conditions characterizes MPE of such threshold type. (The proof of this and all following propositions is given in 'Appendix' A.) Proposition 1 Consider a multi-mode differential game described in Sect. 2. If there exists a set of except at (x, m 1 ) everywhere continuous and continuously differentiable value functions V i : X ×{m 1 , m 2 } → R and a profile of almost everywhere continuous Markovian strategies (( 1 (x, m), x ), 2 (x, m)) such that the conditions below are satisfied for i = 1, 2, then this profile constitutes to a Markov perfect equilibrium of the game: where the inequality for player 2 has to hold as equality if there exists an > 0 such In order to interpret the conditions listed in the proposition, we first observe that conditions (i) and (ii) are standard conditions for a MPE of the game in mode m 2 . Since for any value of the state x above the thresholdx player 1 immediately switches to mode m 2 , the value functions in mode m 1 in this part of the state space coincide with that in mode m 2 net of the costs associated with the switch from m 1 to m 2 . For values of x belowx, the value function and the equilibrium strategies are characterized by standard Hamilton-Jacobi-Bellman (HJB) equations (see (iv) and (viii)). The boundary condition for these HJB equations is given by the value matching condition in (v) which guarantees that the value function of player 1 is continuous at the thresholdx, whereas the value function of player 2 might exhibit a jump at the thresholdx at which player 1 switches to mode m 2 . However, as we will illustrate below, a non-continuous value function for player 2 can arise only if the state dynamics in equilibrium is such that the switch to mode m 2 occurs either immediately or never, depending on the initial state, i.e., if the dynamics in mode m 1 leads the state away from the threshold x (cf. condition (v)). The optimality of the thresholdx from the perspective of player 1 is guaranteed by conditions (vi) and (vii), where condition (vi) ensures that it is not optimal for player 1 to switch to mode m 2 at any x ∈ [x l ,x) and (vii) guarantees that for any x >x it is optimal for player 1 to switch immediately to mode m 2 rather than remaining in mode m 1 .
Finally, we like to point out that the sufficient conditions for an MPE, provided in Proposition 1, impose hardly any restrictions on the strategies φ i (x, m 1 ) for x >x. This implies that in spite of the fact that we consider Markov perfect equilibria, which constitute equilibria of every subgame defined by the current state and mode, the players might use strategies which for x >x in mode m 1 induce actions that would not be optimal for that player if implemented for a positive amount of time. This is still optimal because for these states the game immediately switches to mode m 2 . We will discuss this issue in more detail below and illustrate that this feature might give rise to a wide range of coexisting equilibria.
For our further discussion, it is helpful to distinguish different types of equilibria that can arise in our setting. The main property to be considered is whether the state dynamics under the equilibrium strategies ( 1 (x, m 1 ), 2 (x, m 1 )) for x <x in the neighborhood of x points toward the thresholdx , i.e., whether f (x, ( 1 (x, m 1 ), 2 (x, m 1 )), m 1 ) > 0 for x ∈ (x − ,x). If this condition holds, then for x ini ∈ (x − ,x) the game switches to m 2 with some positive delay. We refer to such equilibria as delay equilibria. 7 On the other hand, if , then the game either switches immediately to mode m 2 , for x(0) ≥x or remains in mode m 1 forever (for x(0) <x). We denote such equilibria as now or never equilibria. The following definition puts these notions more formally.

Definition 1 Consider a MPE strategy profile
)) satisfying the conditions of Proposition 1 and denote by τ ( • the MPE profile P is called a now or never equilibrium if for every initial state x ini ∈ X either τ (x ini ) = 0 or τ (x ini ) = ∞. • the MPE profile P is called a delay equilibrium if there exist initial states x ini ∈ X such that 0 < τ(x ini ) < ∞. • a delay equilibrium profile P = (( 1 (.), x ), 2 (.)) is called a forced switch delay equilibrium if there is a small > 0 such that for any alternative strategy˜ 2 (.) of player • a delay equilibrium which is not a forced switch delay equilibrium is called maximum delay equilibrium.
Intuitively, a forced switch delay equilibrium has the property that the switch of player 1 to mode m 2 at the statex is triggered by the threat of a jump in the control player 2 would implement atx if the mode would stay in m 1 . 8 Formally in Definition 1, this is expressed by the fact that under a forced switch delay equilibrium, any continuous extension of player 2's strategy 2 (., m 1 ) beyondx would make it a best response for player 1 to switch to mode m 2 at some threshold abovex. Delay equilibria, where the mode switch by player 1 is not triggered by such a threat, are called maximum delay equilibria. In any delay equilibrium, the fact that player 1 has incentives to delay the switch to mode m 2 for any x ∈ (x − ,x) induces that the inequality The inequality has the standard interpretation that the left-hand side gives the (flow) profit player 1 receives in mode m 1 from marginally delaying the mode switch and the right-hand side the associated (discounted) loss in value received in m 2 . In general, (3) might hold as a strict inequality even in the limit as x converges tox, in particular if the strategy 2 (., m 1 ) of player 2 exhibits a jump atx. In an equilibrium with this property, player 1 has strict incentives to delay the switch to mode m 2 for any value of the state x up tox, and the switch to m 2 atx is then triggered by the jump the action of player 2 would exhibit if the game would remain in mode m 1 . From the perspective of player 2, the jump in 2 (., m 1 ) atx in such a setting, however, is optimal only because player 1 switches the mode to m 2 for any x ≥x such that player 2 is never required to implement the actions induced by 2 (x, m 1 ), x ≥x for a positive amount of time. In such a situation, if player 2 would move the jump of its control fromx tox + , this would also induce a change in the optimal switching strategy of player 1 in the sense that the optimal mode switching threshold for player 1 would move upward. These are the forced switch delay equilibria. If an equilibrium has the maximum delay property, and therefore the choice ofx by player 1 is not triggered by the threat of a jump of the action of player 2, standard optimal stopping arguments imply that player 1 delays the switch until a value of x is reached where (3) holds as equality. In such equilibria, also a standard smooth pasting condition for player 1 holds in the sense that the slope of the value function of player 1 at x =x in mode m 1 coincides with the slope in mode m 2 . In the following lemma, we state these observations more formally.
and lim In the remainder of this section, we discuss the implications of the different types of equilibria for the continuity of the players' controls over time. First, we observe that in case of now or never equilibria with continuous strategies i (., m 1 ) and i (., m 2 ) the trajectories of the actions of both players are continuous over time, since no switch from m 1 to m 2 occurs for t > 0. Hence, we focus here on delay equilibria and the question whether controls are continuous at the point of time when the mode is changed by player 1. Given that we consider deterministic models, where for a given strategy profile both player can perfectly predict the time of the switch to mode m 1 , intuitively one could expect that in equilibrium controls are continuous as long as the marginal effects of the controls on the players instantaneous profits and on the state dynamics is the same in both modes. As we show below, this intuition holds true for player 1 as long as we consider a maximum delay equilibrium where the smooth pasting condition holds for player 1 at the time of the mode switch and the optimal action of player 1 is not affected by potential jumps of the control of player 2. To show this formally, we introduce the following notions. Definition 2 A differential game of the structure described in Sect. 2 has • mode independent control effects if and only if • separable control effects if For twice differentiable functions F i and f , separable control effects are equivalent to the condition that the cross-derivatives of these functions with respect to the two controls vanish. In many types of games, including, for example, different variants of investment games, direct effect of the control of one player on the incentives of the other player indeed does not exist.
Even if we consider games that have both of these properties, equilibrium profiles, as characterized by Proposition 1, in general might exhibit jumps in the feedback strategies of both players such that lim x→x− i (x, m 1 ) = i (x, m 2 ). More precisely, in the case of forced switch delay equilibria, the actions of both players jump at the point in time when the modes switch to m 2 . Concerning the investment of player 2, who cannot directly control the switch from mode m 1 to m 2 , in general jumps in the control occur as the game moves from mode m 1 to mode m 2 in all types of delay equilibria. In equilibrium, player 2 perfectly predicts the time of the mode switch and also the value of its investment in mode m 1 for her future profits in mode m 2 . Therefore, at first sight the discontinuity of the investment of player 2 might be surprising. The jump in the action of player 2 results from the fact that in mode m 1 player 2 can influence the time of the switch to mode m 2 , since the choice of her control affects the dynamics of the state and thereby the time the state hits the thresholdx determined by the equilibrium strategy of player 1. This effect, which influences the optimal choice of the control of player 2, immediately disappears, at the point in time when the mode changes to m 2 . Hence, the equilibrium strategy of player 2 in general exhibits a jump at the statex in any delay equilibrium even if the game exhibits mode independent and separable control effects.
Considering the action of player 1, we have to distinguish between maximal delay equilibria and forced switch delay equilibria. Whereas Proposition 2 shows that the control of player 1 is continuous at the time of the regime switch in maximal delay equilibria, in general this does not hold for delay equilibria not having this property. Intuitively, in such an equilibrium player 1 has strict incentives to delay the jump to mode m 2 for all values of the state below the thresholdx, and, differently to the maximal delay equilibrium, this incentive to delay does not converge to zero as x approachesx. The reason for player 1 to switch to mode m 2 at x is that player 2 threatens to discontinuously change its action atx if the game would stay in mode m 1 . Hence, for a given strategy of player 2, the thresholdx at which player 1 switches to mode m 2 is determined by the state at which player 2 threatens to change its control. Based on this, player 1 has to adjust its investment in order to influence the duration of the game in mode m 1 . Similarly to what we described for player 2 above, this effect disappears as soon as the mode is m 2 and hence also the action of player 1 in general exhibits a jump at the mode switch. More formally, in the absence of a smooth pasting condition the state derivative of the value function of player 1 in general exhibits a jump as the mode switches from m 1 to m 2 . Therefore, the marginal return of investment for player 1 jumps as the game switches from mode m 1 to m 2 atx, which induces a jump in the control of player 1. In Sect. 4, we will illustrate these scenarios using a simple example analyzing the optimal timing of new product introduction in a duopoly.
The following proposition formally establishes that under maximum delay equilibria the action of player 1 is continuous over time although the mode changes at some positive t from m 1 to m 2 .
) is a maximum delay equilibrium profile of a game with mode independent and separable control effects such that in each mode m the strategies i , i = 1, 2 are continuous with respect to the state x and the right-hand side of condition (ii) in Proposition 1 has a unique maximizer at x =x, then The intuition for this result is straightforward. For the maximal delay equilibrium, the smooth pasting condition holds for player 1 at the thresholdx. This ensures that the slope with respect to the state of her value function is identical in modes m 1 and m 2 at this point. If the marginal effects of the own control on the instantaneous payoff and the state dynamics are not affected directly by the change in mode (mode independent control effect) and by the potential change in the other player's control (separable control effect) due to the mode switch, then the optimization problem which player 1 faces at statex is equivalent, no matter whether the firm is in mode m 1 or in m 2 . Hence, its equilibrium feedback strategy is continuous atx.

An Illustrative Example: Optimal Timing of New Product Introduction
To illustrate our general findings, we now consider the timing problem of a firm, denoted as firm 1, which has to decide when to introduce a new product, that it has developed. We assume that the firm is already active producing an established product and competes with a second firm (firm 2) on the market for the established product. Only firm 1 has developed the new product and therefore has the option to introduce that product at any point in time t ≥ 0.
Before the new product is introduced (mode m 1 ), the inverse demand for the established product is given by where p o denotes the price of established product and q io the quantity of that product supplied by firm i, i = 1, 2. After the introduction of the new product (mode m 2 ), the inverse demand changes to Here, α o is the reservation price of the established product, whereas α 0 n + α n denotes the reservation price of the new product. We write this reservation price as the sum of a constant parameter α 0 n , giving the minimal value of the reservation price, and the state variable α n (t) ≥ 0 capturing the effects of the efforts of the two competitors to influence the (inverse) demand for the new product. Firm 1, as the (potential) innovator, tries to increase the willingness to pay for the new product, whereas the competitor firm 2 might invest in reducing this willingness, e.g., by providing information to consumers about negative features of the new product or by negative campaigning. Both firms choose nonnegative effort h i , measured e.g., in work hours, to do advertising (firm 1), respectively, negative campaigning (firm 2) for the new product. The unit cost of such effort is denoted by c i > 0, i = 1, 2 ,and we assume that there are decreasing returns to effort in terms of the size of the effect on the inverse demand for the new product. More precisely, we assume that the change in reservation price is given byα Here, γ > 0 is a parameter, and δ is the rate by which the effect of firms' effort on inverse demand vanishes. We assume that the equation governing the dynamics of α n is the same in both modes, which captures a situation where firms might invest effort in influencing the reservation price of the new product also before the product is actually introduced to the market. 9 For analytical convenience, we transform the firms' control variables according to u i = √ 2h i to obtain the following formulation with linear state dynamicṡ and quadratic cost functions ξ i (u i ) = c i 2 u 2 i . In slight abuse of notation, we will refer to the controls u i as firm effort in the further analysis. Production costs for the established product are assumed to be symmetric across firms and given by ν o 2 q 2 io , i = 1, 2 and analogously for the new product firm 1 has production costs ν n 2 q 2 1n . Firms maximize profits by choosing the production quantities as well as the effort u i at every point in time t. In addition, firm 1 decides about the time at which the new product is introduced. The introduction of the new product is associated with lump-sum costs of κ > 0.
Since the quantity choice in this setting does not have any intertemporal effects, firms at t choose quantities according to the Cournot equilibrium, which depends on the value of α n (t) for all t after the introduction of the new product. Standard calculations (see 'Appendix' B for a derivation of the Cournot equilibrium quantities and profits in both modes) show that before the introduction of the new product we have and the market profit of each firm is given by π m 1 io = 1 + ν o 2 (q m 1 io ) 2 . Hence, we obtain for the instantaneous profit in mode m 1 In order to guarantee that after the introduction of the new product also a positive amount of this good is produced in equilibrium regardless of the value of α n , we assume that α 0 n > 3ηα o 3+ν o . Furthermore, we restrict attention to scenarios where in equilibrium both firms also sell a positive quantity of the established product. This is true as long as α n < α U B n := , we obtain as the equilibrium quantities in mode m 2 : The corresponding instantaneous profit in mode m 2 is Overall, the dynamic strategic interaction between the two firms constitutes a two-mode differential game of the form considered in Sect. 2 with the single state α n evolving according to (7) and the instantaneous profits in mode m 1 given by (9) and those in mode m 2 given by (11). Both firms decide on their effort u i as a function of the current state and mode and additionally firm 1 determines the timing of the switch from m 1 to m 2 , which means that firm 1 decides on a thresholdᾱ n such that it introduces the new product as soon as the reservation price of the new product α 0 n + α n is larger than α 0 n +ᾱ n . Formally, in accordance with Sect. 3 we consider profiles of Markovian strategies of the form (( 1 (α n , m), ᾱ n ), 2 (α n , m)) and in what follows characterize Markov perfect equilibria of this game.

MPE in Mode m 2
In order to characterize the different types of equilibria in our game, we first consider the final mode m 2 . Since the instantaneous profits given in (11) are quadratic functions of state and controls and the state Eq. (7) is linear, the game in mode m 2 is of linear quadratic structure. Following a wide range of the literature about such games (see, e.g., [5]), we assume that the game has a MPE with linear feedback strategies 10 , giving rise to quadratic value functions of both players. The value and feedback functions can then be determined by a guess and verify approach using the first-order conditions and Hamilton-Jacobi-Bellman equations, see 'Appendix' C for details.  Figure 1 shows the value functions of both players in the MPE for our benchmark parameter setting. 11 The green arrows indicate the direction of the state dynamics under the equilibrium feedback strategies. There is a unique stable steady state α * ,m 2 n , and it is easy to verify that α * ,m 2 n < α U B n under our parameter setting such that both firms sell positive quantities of the old product in the steady state. Figure 1 clearly shows that, as expected, the value of firm 1 increases with α n , whereas firm 2 is hurt by an increase in the attractiveness of the new product.

Different Types of MPEs in Mode m 1
In order to characterize MPE strategies in mode m 1 , we first determine the value functions of both players under the extreme scenarios in which the new product, regardless of the initial condition, is either immediately or never introduced. We denote the value function of firm i in the former case of immediate introduction by V 0 i (α n , m 1 ). Clearly, we have If the new product is never introduced, positive investment is never optimal for either firm, i.e., u i (t) = 0, i = 1, 2, t ≥ 0, and hence, the value functions read Note that V ∞ i is constant with respect to α n because the attractiveness of the new product is irrelevant if this product is never introduced. Given that V 0 1 (α n , m 1 ) increases with α n , it follows that V 0 and it is optimal for firm 1 to introduce the new product at some finite point in time t. If this inequality is violated, then in general it might depend on the initial value α n (0) whether the new product is introduced. In order to gain a better understanding of the potential structure of equilibria under which the new product introduction depends on the initial state, we first characterize the properties of equilibrium state dynamics on (0,ᾱ n ).
The following proposition shows that under an MPE profile the state α n is either monotonously increasing or decreasing on the entire interval below the threshold value at which firm 1 introduces the new product (i.e., for α ∈ (0,ᾱ n )). (( 1 (α n , m), ᾱ n ), 2 (α n , m)) is a MPE profile of the game, then it holds for all α 1 n , α 2 n ∈ (0,ᾱ n ) that sgn f (α 1 n , ( 1 (α 1 n , m 1 ), 2 (α 1 n , m 1 ))) = sgn f (α 2 n , ( 1 (α 2 n , m 1 ), 2 (α 2 n , m 1 ))) . The observation that in equilibrium the state α n is either strictly increasing or strictly decreasing on the entire interval (0,ᾱ n ) has several important implications. If in a given equilibrium the new product is introduced after a positive and finite delay for some initial value α n (0), then under this equilibrium profile the product is introduced after finite time regardless of the initial state α n (0). Conversely, if in a given equilibrium the product is never introduced for some initial value of the new market size, then under this equilibrium a positive finite delay in product introduction can never occur, regardless of α n (0). Relating to the different types of equilibria introduced in Sect. 3, the proposition shows that the property 'delay equilibrium,' which in Definition 1 was defined locally in the sense that for some initial condition the mode switches after some delay, in the framework of this model is actually a global property in the sense that in any delay equilibrium the switch to m 2 occurs after a positive finite delay for all α n ∈ [0,ᾱ n ). Similarly, in this model in any now or never equilibrium the switch never occurs for any initial value of the state below the thresholdᾱ n .

Proposition 3 If
In what follows, we illustrate our general results from Sect. 3 by showing that, depending on the size of the new product introduction costs κ, qualitatively different types of MPE constellations exist in in our model.

Small Costs of Market Introduction
If κ is sufficiently small, i.e., κ < κ := V 1 (0, m 2 ) − V ∞ 1 (0, m 1 ), then it follows directly that immediate introduction is more profitable for firm 1 than no introduction regardless of the value of α n . 12 Hence, in equilibrium the new product is always introduced. Concentrating first on a maximum delay equilibrium, we need to characterize the threshold value ofᾱ n , above which the product is introduced. Denoting the threshold in the maximum delay equilibrium byᾱ md n , we obtain that the following condition has to be satisfied for some positive value of u m 1 2 (see 'Appendix' D): Whereas the first equation captures the smooth pasting condition, the second is the HJB equation of firm 2 at α md n . To obtain these equations, we have used the fact that the equilibrium value function of firm 2 has to coincide between the two modes for all α n ≥ᾱ n (see It can be easily checked that these value functions in combination with feedback functions derived from the maximization of the right-hand side of the HJB equation on [0,ᾱ md n ] satisfy all conditions of Proposition 1 for any profile ( 1 , 2 ) satisfying 2 (α n , m 1 ) ≥ 2 (ᾱ md n , m 1 ) for all α n ≥ᾱ md n (in order to ensure condition (vii) of Proposition 1). The equilibrium investment functions under the maximum delay equilibrium are shown in Fig. 3a. It can be clearly seen that the investment of firm 1 in mode m 1 (solid black line) intersects with the equilibrium investment in mode m 2 (black dashed line) exactly at the thresholdᾱ md n above which firm 1 immediately introduces the new product. We do not show any values for the function 1 (α n , m 1 ) for α n ≥ᾱ md n since any choice of the function on this interval is compatible with equilibrium. The figure also illustrates the downward jump of the investment of firm 2 at the threshold α n =ᾱ md n where the mode switches from m 1 to m 2 . In mode m 1 , there is an additional incentive for player 2 to invest. Such investment decreases the speed of the increase of α n and delays the point in time when the state variable arrives atᾱ md n and the new product is introduced. Firm 2 has an incentive to delay the new product introduction, which provides additional investment incentives. Once the new product is introduced, this additional incentive vanishes which results in a downward jump in player 2's investment.
However, the maximum delay equilibrium is not the only MPE in our setting. Actually, for any thresholdᾱ d n ∈ [0,ᾱ md n ) there is a forced switch delay equilibrium such that firm 1 introduces the new product immediately for all α n ≥ᾱ d n . In Fig. 2, the value functions corresponding to such an equilibrium withᾱ d n = 0.01 are illustrated with coarsely dashed lines. It can be clearly seen that both value functions have a kink at α n =ᾱ d n such that also for firm 1 the smooth pasting condition does not hold at this threshold where the firm introduces the new product. The equilibrium feedback function corresponding to this MPE is shown in Fig. 3b. The figure illustrates that the investments of both firms jump at the point in time when the new product is introduced, where the jump is upward for firm 1 and downward for firm 2. Whereas the intuition for the downward jump of firm 2 is analogous to that developed for the maximum delay equilibrium, the upward jump for firm 1 is due to the fact that after the introduction of the new product an increase of α n has a positive impact on the instantaneous profit of firm 1, whereas in mode m 1 such an increase only has impact on the remaining time till the new product is introduced. If the new product is introduced at a level of α n , where the value function in mode m 2 is still strictly steeper than that in mode m 1 , the investment incentives jump upward at the point of market introduction. A crucial feature of this equilibrium profile is that 2 (α n , m 1 ) for α n ≥ᾱ d n is sufficiently large such that it is optimal for firm 1 to introduce the new product immediately atᾱ d n . In our example, this is guaranteed by setting 2 (ᾱ d n , m 1 ) > 1 γ 1 (ᾱ d n , m 1 ), which implies that under optimal investment by firm 1 the state variable α n does not move aboveᾱ d n . As discussed in Sect. 3, although such high investments by firm 2 would not be optimal, if it were to be carried out for a time interval with positive measure, in equilibrium the firm is never required to carry out such investment, regardless of the value of α n (0).

Intermediate Costs of Market Introduction
We now consider the case where κ > κ, which implies that for α n (0) = 0 immediate introduction of the new product yields a lower value for firm 1 than never introducing the new product and abstaining from any investment into the buildup of α n . Although immediate introduction of the new product is not optimal, an introduction with some delay might still be more profitable than no introduction. Taking into account that the largest value for firm 1 is obtained under the maximum delay equilibrium, V md is a necessary and sufficient condition for the existence of delay equilibria in our setting. It should be noted that V md 1 (0, m 1 ) is a decreasing function of κ such that this condition implies an upper bound κ for the costs of market introduction such that for all κ ≤κ there exists an equilibrium such that for all α n (0) ≥ 0 the new product is introduced to the market after finite time. For this subsection, we assume that κ ∈ [κ,κ], i.e., the market introduction costs are in an intermediate range such that for α n (0) = 0 immediate introduction is not optimal but there exists an equilibrium which induces new product introduction with some delay for this initial value.
In Fig. 4, this case is illustrated by showing the value functions for immediate market introduction (dashed lines), the maximum delay equilibrium (solid line) and no market introduction (dotted line). We denote byᾱ nn n the largest value of α n such that Clearly, for all values of α n <ᾱ nn n immediate market introduction is not optimal; therefore, the range of threshold valuesᾱ d n for which delay equilibria can exist is restricted toᾱ d n ∈ [ᾱ nn n ,ᾱ md n ]. However, for a delay equilibrium to exist, the associated value function must also satisfy V d where we denote by V d 1 (0, m 1 ;ᾱ d n ) the value function of firm 1 under a (candidate for a) delay equilibrium with thresholdᾱ d n . If this inequality does not hold, condition (ii) in Proposition 1 would be violated at α n = 0. This can be seen by realizing that by choosing u i = 0 for α n = 0 we haveα = 0 and the value of the right-hand side of the HJB equation becomes Since under the equilibrium feedback functions in the delay equilibrium the right-hand side of the HJB equation has to be equal to r V d 1 (0, m 1 ;ᾱ d n ) (see condition (i) in Proposition 1), the feedback function d (α n , m 1 ;ᾱ d n ) in that equilibrium candidate does not maximize the righthand side of the HJB equation. This shows that no delay equilibrium with V d can exist with the property that the new product is introduced after a positive delay for α n (0) = 0. Furthermore, we know from Proposition 3 that in any equilibrium the sign of the state dynamics cannot change on the interval [0,ᾱ n ]. This rules out any equilibrium profile under which the new product is never introduced (and hence α n decreases over time) for a small value of α n (0), but is introduced after a delay for larger values of α n (0). Put together these arguments establish that for any thresholdᾱ d n with V d 1 (0, m 1 ;ᾱ d n ) < V ∞ 1 (0, m 1 ) no delay equilibrium can exist. However, there exists a continuum of delay equilibria where at the market introduction threshold this inequality does not hold. In particular, there exists a unique α d n ∈ (ᾱ nn n ,ᾱ md n ] such that V d To see this, note that, since the smooth pasting condition is satisfied atᾱ md n , for allᾱ d n <ᾱ md n an equilibrium with a later switch to m 2 leads to a higher value forfirm 1. Hence, V d 1 (0, m 1 ;ᾱ d n ) Since without market introduction of the new product the value of firm 1 does not depend on In the two panels of Fig. 4, we illustrate this observation by showing, as coarsely dashed lines, the value functions of the two firms under the (forced switch) delay equilibrium with thresholdᾱ d n = α d n . For any threshold valuē α d n in the interval [α d n ,ᾱ md n ], a delay equilibrium profile can be constructed in the same way as discussed in the previous subsection.
However, in the case of intermediate values of market introduction costs considered here, there also exists a now or never equilibrium, as discussed in Sect. 3. Similarly to the forced switch delay equilibria, also this equilibrium is characterized by a threat of a strong investment of player 2 in m 1 as soon as the state variable α n is larger or equal thanᾱ nn n . Hence, firm 1 introduces the product immediately if α n ≥ᾱ nn n . As discussed above, under a potential delay equilibrium in which firm 1 invests positive amounts we would have for all α n <ᾱ nn n and therefore setting 1 (α n , m 1 ) = 0 is the optimal strategy for firm 1 on this interval. Hence, in equilibrium we also must have 2 (α n , m 1 ) = 0 for all α n <ᾱ nn n . We obtain a MPE profile under which for all α n (0) <ᾱ nn n both firms invest nothing and the new product is never introduced, whereas for α n (0) ≥ᾱ nn n the product is introduced at t = 0 and both firms afterward invest according to the MPE strategies in mode m 2 . The equilibrium feedback functions underlying this equilibrium are illustrated in Fig. 5, where, as in Fig. 3, we do not show any values of 1 (α n , m 1 ) for α n ≥ᾱ nn n since any choice is compatible with an equilibrium profile. Figure 6 illustrates the dynamics emerging under the different types of equilibria that coexist for an intermediate level of the market introduction costs. In particular, the trajectories of the market attractiveness (α n ) and of the investments of both firms (u 1 , u 2 ) are depicted for a small initial value of the market attractiveness. The dotted lines corresponding to the now or never equilibrium show that in such an equilibrium, due to absence of any investments the attractiveness of the new product decreases toward zero and the product is never introduced.  The dashed and the solid lines correspond to the dynamics under the (forced switch) delay equilibrium with the minimal possible threshold, i.e.,ᾱ d n = α d n , and the maximum delay equilibrium. The points in time at which the new product is introduced under the forced switch delay and the maximum delay equilibrium are denoted by τ d and τ md , where τ d < τ md . The figure illustrates that under both equilibria the state α n increases over time. In the time interval [0, τ d ], the increase is slower under the delay equilibrium than under the maximum delay equilibrium, because the investments of firm 2, which decrease the speed of growth of α n , are larger under the delay than under the maximum delay equilibrium. The investments of firm 1, fostering the growth of α n , are virtually identical under both equilibria. Intuitively, firm 2 invests more under the forced switch delay equilibrium, because the downward jump of its instantaneous profit associated with the switch to mode m 2 is much closer time-wise in the forced switch delay equilibrium than in the maximum delay equilibrium and therefore less heavily discounted. Hence, the incentive to invest in delaying the switch is stronger in the forced switch delay equilibrium. For firm 1, no such effect occurs and the incentive to invest during mode m 1 is hardly affected by the type of the delay equilibrium. Under the forced switch delay equilibrium, both controls exhibit a jump at τ d with the investment of firm 1 jumping upward and those of firm 2 jumping downward. As a result, α n grows faster after τ d compared to the time before the new product introduction and we observe a higher attractiveness of the new market under the forced switch delay equilibrium than under the maximum delay equilibrium. It should, however, be noticed that under both equilibria the state α n converges to the steady state in mode m 2 (α * ,m 2 n ) and therefore this difference between the two equilibria disappears in the long run. Finally, Fig. 6b illustrates again that in the maximum delay equilibrium the investment of firm 1 is continuous throughout the entire trajectory, i.e., also at period τ md at which the mode switches from m 1 to m 2 .

Large Costs of Market Introduction
If market introduction costs are large, i.e., κ >κ, then we have V md 1 (0, m 1 ) < V ∞ 1 (0, m 1 ), which means that any candidate for a delay equilibrium yields a smaller value for firm 1 at α n (0) = 0 than not investing in the build-up of α n and never introducing the new product. We illustrate the value functions of both firms for this case in Fig. 7. Due to V md in the buildup of α n and eventually introduces the new product. Using the same arguments as developed in the previous subsection, this rules out the existence of any MPE under which the product is introduced with delay for some initial value α n (0). Hence, the only equilibrium that still exists in this scenario is the now or never equilibrium, where firm 1 introduces the new product immediately for α n (0) ≥ᾱ nn n and never introduces it for α n (0) <ᾱ nn n . The value functions of both firms corresponding to this equilibrium are indicated in bold in Fig.  7. It should be noted that under this equilibrium the value function for firm 1 has a kink at α nn n (panel (a)), while the value of the game for firm 2 exhibits a downward jump as the initial state α n (0) crosses the thresholdᾱ nn n (panel (b)).

Discussion and Conclusions
In this paper, we study Markov perfect equilibria in two-player multi-mode differential games where both players can influence the dynamics of the state variable and one player controls the timing of the mode switches. We identify different types of equilibria which can occur in such games and our analysis of a simple model of dynamic competition under potential new product introduction illustrates the properties of these different types of equilibria. In particular, we show that forced switch delay equilibria, maximum delay equilibria and now or never equilibria might coexist for some parameter constellations. More precisely, we have demonstrated that the following three scenarios arise: (i) κ ≤ κ: There exists a maximum delay equilibrium with product introduction at α = α md n and a continuous investment path of player 1 across both modes of the game. Furthermore, for anyᾱ d n ∈ [0, α md n ) there exists a forced switch delay equilibrium with new product introduction atᾱ d n . In each of these equilibria, the investments of both players jump as the mode switches from m 1 to m 2 .
(ii) κ < κ ≤κ: There exists a maximum delay equilibrium plus for eachᾱ d n ∈ [α d n ,ᾱ md n ) there is also a forced switch delay equilibrium. Furthermore, in this case also a now or never MPE exists. (iii) κ >κ: Whereas strategy profiles inducing market introduction with delay for some initial value of α n do not constitute a MPE, the now or never equilibrium still exists.
In scenario (iii) for each initial condition, a unique prediction about the occurrence and timing of the new product introduction can be made if we restrict attention to the MPE analyzed here. This is not true in the first two cases. In case (ii), there is a set of initial conditions α n (0) ∈ [0,ᾱ nn n ] such that MPEs under which the new product is eventually introduced coexist with the now or never equilibrium under which no introduction of the new product occurs. For α n (0) >ᾱ nn n , the new product is introduced under all existing equilibria although the time of the introduction varies across equilibria. Similarly, under scenario (i) the product is eventually introduced for all initial conditions, where the time of the introduction depends on the chosen equilibrium.
Whereas all equilibria considered here are Markov perfect, intuitively the forced switch delay equilibria seem less plausible compared to the maximum delay and the now or never equilibrium. As discussed above, the equilibrium profiles underlying these equilibria include strategies for firm 2, under which in mode m 1 for α n ≥ᾱ d n investments are chosen which would not be optimal if the game would stay in mode m 1 for an amount of time with positive measure even after the state α n has crossed the thresholdᾱ d n . Since in equilibrium the state is never larger or equal thanᾱ d n in mode m 1 for a positive amount of time, this feature does not contradict the optimality of the strategy of firm 2. However, it nevertheless constitutes an 'incredible threat' in the sense that it would not be optimal for firm 2 to stick to this investment strategy, in case firm 1 would deviate from its own equilibrium strategy by increasing the threshold at which it switches to mode m 2 .
A main insight of our analysis is that a strategy profile might be Markov perfect even though the strategy of some players induces actions in some parts of the state space which would not be optimal for the player if carried out for a positive amount of time. Although this phenomenon has been demonstrated in this paper only for multi-mode games, it seems that it might arise also in other classes of differential games. For example, in games in which the state dynamics is controlled by potentially singular controls of both players MPEs might exist in which the strategy of one player in a certain region of the state space make it optimal for some other player to induce a jump of the state out of that region by means of a singular control. Similar to our setup, Markov perfection does not put any restrictions on the regular control of that player in the region in which the other player chooses a singular control. 13 Intuitively, under such a MPE profile the state always immediately jumps out of that singular control region such that the amount of time for which the actions induced by the players' strategies in that region are actually implemented has measure zero. Hence, also in the framework of such differential games incredible threats, in the sense that actions are chosen which would be suboptimal if implemented for a positive amount of time, can occur as part of Markov perfect equilibrium profiles. An interesting question for future research might be to explore whether such Markov perfect equilibria with 'incredible threats' in multi-mode or singular control games can be eliminated by appropriate equilibrium refinements. Developing such a refinement would also provide a valuable theoretical basis for alleviating the problem of non-uniqueness of predictions about equilibrium timing of regime switches in applications like the dynamic innovation model considered in this paper.
Funding Open Access funding enabled and organized by Projekt DEAL.

Data Availability Statement
The code for the numerical analysis used in the current study is available from the corresponding author on reasonable request.
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
Hence, switching to mode m 2 atx yields at least the same value as any such path staying in [x l ,x].
Consider now a potentially optimal path fromx which stays in mode m 1 and has x(t) >x for all t > 0. Taking into account the compactness of the state space and the monotonicity of any optimal path, there must exist a steady statex >x, associated with a steady-state control u 1 , of such a path and we must have since otherwise it would be optimal for player 1 to switch to mode m 2 atx. Taking into account that f (x, (ũ 1 , 2 (x, m 1 ))) = 0, we obtain which contradicts condition (vii). This shows that under the given conditions no path from x which never jumps to mode m 2 can be strictly better than switching to mode m 2 atx. The same arguments establish that this holds also for any x ini >x. Focusing on paths on which player 1 switches to mode m 2 , it can be shown that (vii) implies that by for x >x switching immediately is strictly better than marginally delaying the switch to mode m 2 .
To see this, consider for some x(t) >x the value in mode m 1 of delaying the switch to mode m 2 from t to t + which is denoted by V 1 (x(t), m 1 ): We need to show that V 1 (x(t), m 1 ) < V 1 (x(t), m 2 ) − κ for all (small) . Direct calculations yield where the last inequality follows from condition (vii). Hence, for all x >x it is optimal for player 1 to immediately switch to mode m 2 . Consider now a value x ini <x. Applying 1 (x, m 1 ) and switching to mode m 2 , once the generated path hits,x yields a value given by V 1 (x ini , m 1 ). Due to (vi), we have V 1 (x ini , m 1 ) > V 1 (x ini , m 2 ) − κ and therefore switching to mode m 2 at x ini is not optimal. This shows that the switching strategy x is indeed optimal for player 1.

Proof of Lemma 1
Assume that for a delay equilibrium profile P we have ( 1 (x, m 1 ), 2 (x, m 1 )), m 1 ) We show that this implies that P is a forced switch delay equilibrium. More precisely, we show that in this situation for sufficiently small > 0 and any strategy variation˜ 2 (.) for player 2, satisfying the conditions given in the definition of a forced switch delay equilibrium in Definition 1, there is an alternative strategy for player 1 with a later mode switch which is a better response than ( 1 (.), x ) for player 1. To show this, we first observe that sticking to the strategy profile ( 1 (.), x ) player 1 would switch to m 2 atx and hence the value at x =x would be given by V 1 (x, m 2 ) − κ since˜ 2 (x, m 2 ) = 2 (x, m 2 ) ∀x ∈ X . Now, consider an alternative strategy (˜ 1 (.), x ) for player 1 withx =x + and The value for player 1 under this strategy for an initial state x =x and mode m 1 is given bỹ whereτ ( ) is the time when the state hitsx under the profile (˜ 1 (x, m 1 ),˜ 2 (x, m 1 )) of player controls. Since P is a delay equilibrium, we have f (x, ( 1 (x, m 1 ), 2 (x, m 1 )), m 1 ) > 0 for all x ∈ (x − ,x) and by continuity for sufficiently small , we therefore must have , it is easy to see that this function is continuous in with (0) = 0 and where the second line follows from the continuity of˜ 1 ,˜ 2 atx and the inequality in the last line from (13). Note that it is assumed here that V 1 (x, m 2 ) is continuously differentiable atx. This shows that ( ) > 0 for sufficiently small and that for such value of the profile (˜ 1 (.), x ) is a better response for player 1 than ( 1 (.), x ) to all strategies˜ 2 which are locally continuous atx. Hence, P must be a forced switch delay equilibrium. This implies that for any maximum delay equilibrium (4) has to hold. Furthermore, since The HJB equation (iv) in mode m 1 for x <x implies which by continuity of V 1 (., m 1 ) implies Equations (14) and (15) together imply that the smooth pasting condition (5) holds.
Since both g 1 and g 3 are continuous with respect to x and u 1 , it follows that lim x→x− arg max where we have used that the argmax of g 3 has only one element. Hence, lim x→x− arg max and since φ 1 (x, m 1 ) is a maximizer of g 1 and φ 1 (x, m 2 ) is the maximizer of g 2 , this proves the proposition.

Proof of Proposition 3
We define as g(α n ) = f (α n , ( 1 (α n , m 1 ), 2 (α n , m 1 ))) the righthand side of the state dynamics under the equilibrium profile. Assume first that there exists a state α * n ∈ (0,ᾱ n ) such that in a neighborhood around α * n we have g(α n ) > 0 for all α n < α * n and g(α n ) < 0 for all α n > α * n . Then, this neighborhood is invariant under the state dynamics and for initial values α n (0) in this neighborhood the thresholdᾱ n is never reached and therefore the game never switches to mode m 2 . Given that m 2 is never reached, any positive investment u 1 > 0 is clearly suboptimal for player 1, which contradicts g(α n ) > 0 for α n < α * n . Therefore, the only remaining possibility for a scenario in which the direction of the state dynamics changes in the interval [0,ᾱ n ) is that there exists a unique point α * n ∈ (0,ᾱ n ) such that g(α n ) > 0 ∀α n > α * n and g(α n ) < 0 ∀α n < α * n . Assume that such an MPE exists and (˜ 1 (α 1 n , m 1 ),˜ 2 (α 1 n , m 1 ))) is the strategy profile giving rise to this pattern. We denote the value functions of the two firms corresponding to this profile byṼ i (α n , m 1 ), i = 1, 2.
The proof now proceeds by showing first that lim →0Ṽ2 (α * n − , m 1 ) > lim →0Ṽ2 (α * n + , m 1 ) and, second, that in light of this inequality the optimal strategy of player 2 at a state α * n + for sufficiently small has to be such that g(α * n + ) < 0, which contradicts our assumption and therefore rules out the existence of a state α * n with the properties given above. To show that lim , we first observe that in light of g(α n ) < 0 for all α n ∈ [0, α * n ) and α * n <ᾱ n the new product is never introduced under this investment profile. Therefore, any optimal investment strategy for firm 2 must have˜ 2 (α n , m 1 ) = 0 for all α n ∈ [0, α * n ) and therefore for any α n < α * n we must haveṼ 2 (α n , m 1 ) = V ∞ 2 (α n , m 1 ) = 1 + Furthermore, taking into account that for α n (0) > α * n , the thresholdᾱ n is reached in finite time under the strategies (˜ 1 (α n , m 1 ),˜ 2 (α n , m 1 )), at which point the game switches to mode m 2 . Hence, we obtain lim →0Ṽ 2 (α * n + ) = τ t=0 e −rt 1 + ν o 2 (q m 1 2o ) 2 − ξ 2 ( 2 (α n , m 1 ))dt where τ is the point in time when the mode switches from m 1 to m 2 . Taking into account that ξ 2 (u) ≥ 0 ∀u ≥ 0, it is sufficient for proving (19) to show that q m 2 2o (α n ) < q m 1 2o for all α n ≥ 0. To see this, we first observe that inserting α n = 0 and α 0 n = 3ηα o 3+ν o into (10) gives q m 2 2o (0) = q m 1 2o . Furthermore, and therefore, due to our assumption that α 0 n > 3ηα o 3+ν o we have q m 2 2o (α n ) < q m 1 2o for all α n ≥ 0. This establishes that the inequality (19) holds.