1 Introduction and motivation

The rapid growth of products and services marketed via the Internet increases the importance of considering revenues and profits generated by so called “superstars” or “superstar firms”. Examples are music streaming, search engines, Internet market places, news websites, where on the supply side goods can be marketed worldwide at almost zero marginal cost. At the same time consumers are often able to pick the—according to their perception—best product at no higher cost than the second best, third best etc. For example, since prices for streaming a film or listening to music on a particular platform are often fixed, there is no good reason to stream the second best film instead of the best (according to the consumer’s preferences), listen to the second best orchestra performing a particular piece of music instead of the best, etc. This is so, regardless of how small the perceived difference in quality between the films, orchestras, etc. is. Hence the ranking between competing artists, films, services, etc. becomes more important than the absolute quality level per se. Another important example are internet search engines. A keyword-search is “cost-less" (respectively the cost is always the same, users give away their data) and there is no good reason to use the second best search-engine instead of the best—at least this was the perception of most consumers during the early up to quite recent days of the internet economy.

By contrast, in the physical goods economy prices differ largely depending on quality levels or characteristics—that is why most of us are more likely to drive a medium-sized vehicle than a luxury car. Goods of similar quality can be sold at similar prices achieving at the same time similar market shares. Hence, for physical goods the payoffs depend continuously on the perceived quality level of these goods (see also heterogenous versions of the Cournot and the Bertrand model, e.g., [12] Chapter 16). Therefore, investing into firms that are active in the physical goods economy is much more predictable and less risky than investing into new economy firms.

Typically financing of projects related to Internet firms is performed by business angels and venture capitalists. In contrast to investment problems considered in the standard corporate finance literature, the investor is confronted with payoffs which depend on the rank of the corresponding project as already described in the above paragraphs. She faces cash flows arsing from investment projects where the main market share goes to the “winner”. Already the second-best makes a much smaller profit compared to the “winner”, let alone the lower-ranked competitors.

Gompers et al. [11] performed a survey among 885 venture capitalists. They [in their Table 5] compare the relevance of several factors, such as the team, the business model, the product, the market, etc. and find out that the founding team is the most important factor in the evaluation process. In addition, regarding valuation methods a bulk of venture capital firm use the net present value method or the internal rate of return, however “... 9% do not use any financial metrics”. By these two issues, a plausible way to approximate this behavior in a model is that investors are only able to rank the corresponding projects. Especially if the management team is very important, measuring the ability of a team on a cardinal scale is not very realistic, a ranking seems to be more plausible and provides an additional argument to consider rank dependent investment strategies.

As motivated by the above paragraphs, we study investment problems for superstar projects, that is, investment projects with rank dependent payoffs. To simplify the analysis, the financial resources the investor is able to spend is considered to be fixed, for example already provided by an investment fund. Entrepreneurs submit their business proposals. The venture capital investor is able to compare the corresponding projects with respect to their relative quality. That is, the relative rank of a project is observed. The profitability of a particular investment depends on its realized ranking among all projects. That is, the rank of the chosen project at the end of the investment horizon. Hence, the investment problem considered is different from standard portfolio optimization problems investigated in Finance (for a brief overview, see, e.g., [2] diversification effects can be expected by investing into multiplie investment opportunities in standard problems, while in our model investing into the best assests only is important). Due to competition among investors for superstar projects, the investor is forced to be quick with her decision. Taking this to the extreme, she has to decide on whether and how much to invest before she receives the next proposal. Based on the superstar property only those n alternatives with the highest quality characteristic generate positive returns in the future. Given these assumptions, this article obtains optimal investment strategies.

This article is related to the economics of superstars literature, finance literature, and to the mathematical literature on so called “secretary problems”: Older literature on the economics of superstars, such as Frank and Cook [9], considers optimal remuneration schemes and when/where they are rank dependent (see, e.g., Pancs [21]). So called winner-take-all markets (see, e.g., [9]) are characterized by returns depending chiefly on the relative ranking of competitors and much less on the absolute quality of the products. Instead of looking at optimal payoffs schemes as discussed in the economics of superstar literature, the focus of this article is on optimal investment schemes when the expected payoffs strongly depend on the ranking of the corresponding investment project.

Recently Opp [20] developed an interesting dynamic equilibrium model describing the venture capital market. The framework considered in Opp [20] allows to obtain asset prices and returns in closed form. In particular, Opp [20] also derives equilibrium prices and abnormal expected returns. In Opp [20] at each point of time the most productive entrepreneur is selected (see Section 1.3.3 and 1.3.4 in Opp [20]). By contrast, in our model the n best projects generate returns at the end of the investment horizon T. The main task of the investor in our model is to optimally select the investment opportunities, where an investment project can turn out to be completely disastrous, if too many better projects appear in the unforeseen future.

Korteweg and Nagel [14] claim that linear factor pricing methods—as applied in standard asset pricing—are quite difficult to be applied to venture capital investing. As an alternative they propose to use stochastic discount factors and applied this methodology to empirical data from venture capital funds and individual investment projects. First, Korteweg and Nagel [14] inferred strong positive abnormal returns arsing from venture capital investments, while for venture capital funds these abnormal returns are close to zero. Probably a lot of the abnormal return generated by the corresponding investment projects is soaked up by costs arsing for the venture capital firm to evaluate business plans, select projects, and support the enterprises, etc. Hence, finding good investment projects is complicated, especially if the decision has to be quite fast due to competition with other venture capital firms. Second, the authors used data from Sand Hill Econometrics to apply their evaluation method to empirical data. The data base considers cash flows from already realized investment projects, while this papers models the time period before and obtains optimal investment rules. In this paper we also perform a simulation study to get some further insights into the investment behavior implied by our optimal rules. To do this, estimates of the expected gross returns become necessary. To get estimates for gross returns of superstar projects we use estimates from Cochrane [7] in our simulation study.

In the Mathematics literature investment decisions where only the winner (\(n=1\)) obtains a positive return are solved in Bruss and Ferguson [3]. The case where the two best alternatives (\(n=2\)) generate a positive return is considered in Łebek and Szajowski [15]. To the best of our knowledge these two articles are the only papers that consider investment models with rank dependent payoffs. The optimal investment strategy for the linear case with general n is related to Mucci [18]’s solution of the secretary problem with generalized payoffs. That paper is a generalization of the classical secretary problem with rank dependent payoffs (see the seminal paper of [5]). In the latter two articles exactly one secretary has to be chosen, while in former papers and in the current article a budget exists which can be invested into multiple alternatives and need not be invested at all. In all the previously cited articles of this paragraph the returns are non-stochastic. In this article we allow for stochastic returns and extend the results obtained in Bruss and Ferguson [3] and Łebek and Szajowski [15] to the case where the n best alternatives result in a positive payoff.

This article is organized as follows: Sect. 2 describes the investment problem in more detail and provides an almost closed form solution for an investor with a monotone increasing and continuous Bernoulli utility function. Then Sect. 3 investigates the risk-neutral case for \(n \ge 1\) projects and deterministic returns. Section 4 considers stochastic returns. Finally, Sect. 5 presents results from a simulation study to investigate how optimal investment behavior changes with the attitude towards risk and the number of time periods considered.

2 The investment problem

In this section we follow Bruss and Ferguson [3] and consider \(t=1,\dots ,T\) periods of time. An investor is endowed with initial wealth \(w_0 >0\). At each period of time an investment opportunity, described by a uniformly iid random variable \(X_t\) on the unit interval [0, 1] occurs. The random variable \(X_t\) describes the unobserved quality level of the project occurring at time t. Let \(\rho _t \left( X_{s} \right) := \sum _{v=1}^{t} \textbf{1}_{X_v \ge X_s}\), \(s \le t\), and \(\rho _T \left( X_{s} \right) := \sum _{v=1}^{T} \textbf{1}_{X_v \ge X_s}\) denote the relative (at period t) and the absolute rank of \(X_s\). Correspondingly, for a realization \(x_s\), \(\rho _t \left( x_{s} \right) \) and \(\rho _T \left( x_{s} \right) \) denote the realized relative rank of \(x_s\) at time t, respectively the realized final rank. Hence the largest draw arising in \(1,\dots ,t\) has relative rank 1 and the second best relative rank 2, etc. At time t, the period where the kth best observation (of the draws that occurred so far) was realized is denoted by \(\tau _{t}^k\). Note that \(\tau _{t}^k\) depends on \(x_1,\dots ,x_{t}\) and is therefore a function of time t.

In any period t the investor can invest any amount \(b_t\), with \(0 \le b_t \le w_{t-1}\) into the risky project showing up at this moment. But at time t the investor cannot invest in projects that occurred in one of the previous periods (recall is impossible). We assume that at time t the investor remembers past investments \(b_s\) and the current relative ranks of the past realizations \(x_s\), \(s \le t-1\). However, she never observes the values \(x_s\), but only the ranks among the past realizations.Footnote 1 The investment history at time t is \(\textbf{b}_{t-1}:= \left( b_1,\dots ,b_{t-1} \right) \in \mathbb {R}^{t-1}_{\ge 0}\). By investing \(b_t\) the remaining budget is \(w_{t} = w_{t-1} - b_t\).

Returns can be deterministic or stochastic. In the deterministic case the gross-returns per unit invested are described by \(\left( \beta _t^i \right) _{t=1,\dots ,T; \ i=1,\dots ,n}\). In more detail, \(\beta _{t}^{j}\) abbreviates the deterministic return per unit invested in period t, if the draw of period t becomes the overall j-th best draw at time T. This is of course a simplifying assumption reflecting the idea that the selection period is relatively short compared to the future point in time when the returns finally realize as often observed with high-risk investment projects. If the lifetimes of the investment projects are different, this can be accounted for by discounting the payoffs to a common reference period. To keep the notation simple these discount factors are already contained in the \(\beta _t^i\), that is:

Remark 1

A deterministic discount factor \(\delta \), \(\delta \in (0,1]\), can be included in a straightforward way by considering \(\beta _{t}^{j} = \tilde{\beta }_{}^{j} \delta ^{t}\) or \(B_{t}^{j} = \tilde{B}_{}^{j} \delta ^{t}\), etc. Stochastic discount factors \(\delta _t\), and time dependent \(\tilde{\beta }_{}^{j}, \ \tilde{B}_{}^{j}\) can incorporated accordingly.

Our model has the following limitations: First, in contrast to what is often observed on the venture capital market, we abstract from investment stages. Second, we need to assume that there are no returns obtained before the last investment opportunity occurs. As long as all returns realize after the last investment opportunity shows up, differences in the time points of the realization of the returns can accounted for by choosing the \(\beta _j^t\) (or \(B_t^j\)) accordingly. In this case, \(\beta _j^t\) (or \(B_t^j\)) can also describe an accumulated return.

Suppose that the best n, \(n \le T\), projects yield a strictly positive return (almost surely non-negative and strictly positive with positive probability, in the case of stochastic returns). In the stochastic case considered in Sect. 4, the returns are denoted by \( \left( B_t^i \right) _{t=1,\dots ,T; \ i=1,\dots ,n}\). We assume that \(B_t^i\) only depends on t, the rank of the corresponding \(X_t\), and the (unobserved) realized value of \(X_t\). For any realization \(x_s\) with absolute rank \(\rho _T(x_s)\) larger than n the return is equal to zero. Note that by these assumptions on the returns the payoffs of the investment projects become more discontinuous. While small changes in technological shocks usually have small impacts on asset prices and returns in production based asset pricing models (see, e.g., [4] Chapter 7), in our model small changes in shocks \(X_t\) can have large effects if the ranking changes.

Next, we describe the \(n(n+1)/2\)-dimensional vector of potential payoffs \(\textbf{y}_{t-1}\) coming from past investments. Let \(y_{i| \ell }^{t} \ge 0\) denote the payoff of the currently i-th best alternative—after \(X_t\) realized—if it becomes \(\ell \)-th best at T. For example, \(y_{1|1}^{t} = \beta _{\tau ^{1}_{t}}^{1} b_{\tau ^{1}_{t}}\) is the payoff resulting from the corresponding investment \(b^{ \tau ^{1} _{t}}\) if the current best alternative (i.e. \(x_{\tau ^{1}_{t}}\), where \(\tau ^{1}_{t} \le t\)) remains the best at time T, \(y_{1|2}^{t}= \beta _{\tau ^{1}_{t}}^{2} b_{\tau ^{1}_{t}} \) is the payoff if the current best alternative becomes the second best at T, \(y_{2|2}^{t} = \beta _{\tau ^{2}_{t}}^{2} b_{\tau ^{2}_{t}} \) is the payoff if the current second-best alternative remains the second-best at T, etc.

Note that \(y_{i | \ell }^{t}\) where \(\ell < i\) is infeasible (since “better draws stay better”), while \(y_{i | \ell }^{t}= 0 \) for all \(\ell >n\) by the assumption that only the best n projects can result in a positive return and the returns of all other projects are zero. If \(t < n\) the corresponding \(y_{j|\cdot }^{t}\) with \(t < j \le n\) are set to zero. Let \(\ell _i\) denote the final rank (at time T) of the i-th best observation at time t. Hence, \(\ell _i \ge i\) and \(\ell _i \le i + T- t\) given that there are \(T- t\) remaining draws. For any pair \(y_{i | \ell _i}^{t}\), \(y_{j | \ell _j}^{t}\), where \(i<j\), we get \(\ell _i < \ell _j\). By considering the current n best draws after \(X_t\) realized, we obtain \(\frac{{n}({n}+1)}{2}\) terms \(y_{j | \ell _j}^{t}:= \beta _{\tau ^{j}_{t}}^{\ell _j } b_{\tau ^{j}_{t}} \ge 0\) and the lower triangular \(n \times n\) matrix

$$\begin{aligned}{} & {} \left( \begin{array}{ccccc} \beta _{\tau ^{1}_{t}}^1 b_{\tau ^{1}_{t}} &{} &{} \\ \beta _{\tau ^{1}_{t}}^2 b_{\tau ^{1}_{t}} &{} \beta _{\tau ^{2}_{t}}^2 b_{\tau ^{2}_{t}} &{} \\ \beta _{\tau ^{1}_{t}}^3 b_{\tau ^{1}_{t}} &{} \beta _{\tau ^{2}_{t}}^3 b_{\tau ^{2}_{t}} &{} \beta _{\tau ^{3}_{t}}^3 b_{\tau ^{3}_{t}} &{} \\ \vdots &{} \vdots &{} \vdots &{} \ddots &{} \\ \beta _{\tau ^{1}_{t}}^n b_{\tau ^{1}_{t}} &{} \beta _{\tau ^{2}_{t}}^n b_{\tau ^{2}_{t}} &{} \beta _{\tau ^{3}_{t}}^n b_{\tau ^{3}_{t}}&{} \dots &{} \beta _{\tau ^{n}_{t}}^n b_{\tau ^{n}_{t}} \end{array} \right) \nonumber \\= & {} \left( \begin{array}{ccccc} y_{1 | 1}^{t} &{} &{} \\ y_{1 | 2}^{t} &{} y_{2 | 2}^{t} &{} &{} \\ y_{1 | 3}^{t} &{} y_{2 | 3}^{t} &{} y_{3 | 3}^{t} &{} \\ \vdots &{} \vdots &{} \vdots &{} \ddots &{} \\ y_{1 | n}^{t} &{} y_{2 | n}^{t} &{} y_{3 | n}^{t} &{} \dots &{} y_{n | n}^{t} \end{array} \right) =: \mathcal {Y}_{t} . \end{aligned}$$
(1)

The remaining upper triangular part of this matrix is zero since \(y_{j | \ell _j}^{t} = 0\) for all \(\ell _j<j\). To obtain the row vector \(\textbf{y}_{t} \in \mathbb {R}^{n(n+1)/2}\) the lower triangular part of \(\mathcal {Y}_{t}\) is vectorized, in formal terms

$$\begin{aligned} \textbf{y}_{t} = \left( y_{1|1}^{t},\dots ,y_{1|n}^{t}, y_{2|2}^{t}, \dots , y_{2|n}^{t}, \dots , y_{n-1 | n-1}^{t}, y_{n-1 | n}^{t},y_{n|n}^{t} \right) . \end{aligned}$$

The following paragraph describes the transition of \(\mathcal {Y}_{t-1}\) to \(\mathcal {Y}_{t}\): Depending on the realization of \(X_t\), \(\mathcal {Y}_{t-1}\) changes to \(\mathcal {Y}_{t}\) in period t. Consider a realization \(x_t \) with realized rank \(\rho _t (x_t) = j\) and let \(\varvec{\beta }_{t}^{j:n}:= \left( \beta _{t}^{j},\dots ,\beta _{t}^{n} \right) \). Recall that \(\varvec{\tau }_t = \left( \tau ^{1}_{t},\dots ,\tau ^{n}_{t} \right) ^{{{\,\mathrm{\top }\,}}}\) denotes the points of time where the \(x_s\), \(s \le t\), with the ranks \(1,\dots ,n\) had realized. If the rank of \(x_t\) is larger than n, then \(\varvec{\tau }_{t-1}\) remains the same: \(\varvec{\tau }_{t} = \varvec{\tau }_{t-1}\). If \(x_t\) has rank j, \(1 \le j \le n\), then we obtain the new \(\varvec{\tau }_{t} = \left( \tau ^{1}_{t-1},\dots , \tau ^{j-1}_{t-1},t,\tau ^{j}_{t-1},\dots , \varvec{\tau }^{n-1}_{t-1} \right) ^{{{\,\mathrm{\top }\,}}}\). This update of \(\varvec{\tau }_{t-1}\) is abbreviated by \(\varvec{\tau }_{t-1} (x_t)\) resulting in the new \(\varvec{\tau }_t\). If the agent invests \(b_t\), then the new matrix of payoffs is provided by

$$\begin{aligned} \mathcal {Y}_{t}&= \mathcal {Y}_{t-1} \left( b_t \varvec{\beta }_{t}^{j:n} \right) \ , \text { where the function } \nonumber \\ \mathcal {Y}_{t-1} \left( b_t \varvec{\beta }_{t}^{j:n} \right)&:= \left( \begin{array}{ccccccc} y_{1 | 1}^{t-1} &{} &{} &{} &{} &{} &{}\\ y_{1 | 2}^{t-1} &{} y_{2 | 2}^{t-1} &{} &{} &{} &{} &{} \\ \vdots &{} \vdots &{} \ddots &{} &{} &{} &{} \\ y_{1|j-1}^{t-1} &{} y_{2|j-1}^{t-1} &{} \dots &{} y_{j-1|j-1}^{t-1} &{} &{} &{} \\ y_{1 | j}^{t-1} &{} y_{2 | j}^{t-1} &{} \dots &{} y_{j-1|j}^{t-1} &{} \beta _{t}^{j} b_t &{} &{} \\ y_{1 | j+1}^{t-1} &{} y_{2 | j+1}^{t-1} &{} \dots &{} y_{j-1|j+1}^{t-1} &{} \beta _{t}^{j+1} b_t &{} y_{j|j+1 }^{t-1} &{} \\ \vdots &{} \vdots &{} \vdots &{} \vdots &{} \ddots &{} \ddots &{} \\ y_{1 | n}^{t-1} &{} y_{2 | n}^{t-1} &{} \dots &{} y_{j-1|n}^{t-1} &{} \beta _{t}^{n} b_t &{} y_{j|n}^{t-1} \dots &{} y_{n-1 | n}^{t-1} \end{array} \right) = \left( \begin{array}{ccccc} y_{1 | 1}^{t} &{} &{} \\ y_{1 | 2}^{t} &{} y_{2 | 2}^{t} &{} &{} \\ y_{1 | 3}^{t} &{} y_{2 | 3}^{t} &{} y_{3 | 3}^{t} &{} \\ \vdots &{} \vdots &{} \vdots &{} \ddots &{} \\ y_{1 | n}^{t} &{} y_{2 | n}^{t} &{} y_{3 | n}^{t} &{} \dots &{} y_{n | n}^{t} \end{array} \right) . \ \ \ \end{aligned}$$
(2)

The dimension of \(\varvec{\beta }_{t}^{j:n}\) determines the column j where \(b_t \varvec{\beta }_{t}^{j:n} \) is inserted in (2). If \(\rho _t (x_t) > n\), then \(\mathcal {Y}_{t} =\mathcal {Y}_{t-1}\). We obtain the new vector of payoffs \(\textbf{y}_{t}\) by vectorizing \(\mathcal {Y}_{t-1} \left( b_{t} \varvec{\beta }_{t}^{j:n} \right) \). An alternative approach to storing \(\textbf{y}_t\) is to work with the investment histories \(\textbf{b}_t = \left( b_{1},\dots ,b_{t} \right) \). By means of \(\textbf{b}_t\) and \(\varvec{\tau }^{}_{t}\), \(\textbf{y}_t\) can be obtained in a straightforward way.

The investor maximizes expected utility. The Bernoulli utility function \(u(\cdot )\) is monotone increasing and continuous [that is, we allow for risk-averse, risk-neutral or risk-loving behavior].

Venture capitalists and business angels are typical investors in high-risk investment projects. Venture capital firms can be considered as institutionalized investors that have various opportunities to diversify, such that we consider them as risk-neutral investors. Business angels usually have less opportunities to diversity, hence we consider them as risk-averse.Footnote 2

The remaining budget after period \(T-1\) is \( w_{T-1}\) and the information on prior investments is provided by \(\textbf{y}_{T-1}\). Then, the term \(\mathscr {V}_{T-1} \left( w_{T-1}, \textbf{y}_{T-1} \right) \) denotes the conditional expected utility when applying an optimal investment strategy in period T.Footnote 3

Accordingly, \(\mathscr {V}_{t-1} (w_{t-1}, \textbf{y}_{t-1} )\) denotes the conditional expected utility of the final wealth given that the investor applies an optimal investment strategy from t onward [subject to an investment history described by \(w_{t-1}\) and \(\textbf{y}_{t-1}\)]. After investing \(b_t\), the remaining budget becomes \(w_t = w_{t-1} - b_t\) and \(\textbf{y}_{t-1}\) becomes \(\textbf{y}_{t}\), as already described above. In particular, the value functions \(\mathscr {V}_{t}(\cdot )\) for \(t=T\) and \(t=T-1\) are

$$\begin{aligned} \mathscr {V}_{T} (w_T,\textbf{y}_{T})= & {} u(w_T + \sum _{j=1}^n y_{j|j}^{T} ) \ \text { and } \nonumber \\ \mathscr {V}_{T-1} (w_{T-1},\textbf{y}_{T-1} )= & {} \frac{T-n}{T} u \left( w_{T-1} + \sum _{j=1}^n y_{j|j}^{T-1} \right) \nonumber \\{} & {} + \frac{1}{T} \sum _{j=1}^{n} u \left( w_{T-1} \max \{1, \beta _{T}^j\} + \sum _{i=1}^{j-1} y_{i|i}^{T-1} + \sum _{i=j}^{n-1} y_{i|i+1}^{T-1} \right) . \ \ \ \end{aligned}$$
(3)

To see this, note that at \(t=T_{}\) there are no more steps to go and we obtain a final wealth of \(w_{T} + y_{1|1}^{T} + \dots + y_{n|n}^{T}\). \(w_T\) is the remaining non-invested budget, and \(y_{j|j}^{T}\) the payoffs of the first to n-th best alternatives, when there are no further steps to go. If there is still one step to go, that is \(t=T-1\), we get \(w_{T-1} + y_{1|1}^{T-1} + \dots + y_{n|n}^{T-1}\) if the last draw \(X_T\) has a (final) rank larger than n. The probability of this event is \(\frac{T-n}{T}\). With a probability of 1/T we get a draw of rank j, where \(j \le n\). In this case the current best \(j-1\) draws remain the \(j-1\) best, the realization \(x_{T}\) becomes the j-th best, while the current j-th to n-th best draws become the \(j+1\) to \(n+1\)-th best. The payoff of the \(n+1\)-th best draw is zero by the model assumptions. At \(t=T\) the best strategy is to invest all the remaining budget \(w_{T-1}\) into this alternative, if and only if the relative (\(=\) absolute) rank of \(x_{T}\) is equal to \(j \le n\) and \(\beta _{T}^{j} \ge 1\). For example, with a probability of 1/T the T-th draw becomes the best one. In this case the former best draw becomes the new second best, yielding the payoff \(y_{1|2}^{T-1}\), while the former second best becomes third best, etc. For the value function we get the recursion

$$\begin{aligned}{} & {} \mathscr {V}_{t-1} (w_{t-1}, \textbf{y}_{t-1} ) \nonumber \\= & {} {\textrm{max}} \left\{ \frac{t-n}{t}, 0\right\} \mathscr {V}_{t} (w_{t-1},\textbf{y}_{t-1} ) \nonumber \\{} & {} + \frac{1}{t} {\textrm{max}}_{0 \le b_{t}\le w_{t-1 } } \mathscr {V}_{t} \left( w_{t-1}-b_{t}, \textbf{y}_{t-1} \left( b_t \varvec{\beta }^{1:n}_{t}\right) \right) + \nonumber \\{} & {} + \frac{1}{t} {\textrm{max}}_{0 \le b_{t}\le w_{t-1} } \mathscr {V}_{t} \left( w_{t-1} - b_{t},\textbf{y}_{t-1} \left( b_{t} \varvec{\beta }^{2:n}_{t} \right) \right) + \nonumber \\{} & {} \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \vdots \nonumber \\{} & {} + \frac{1}{t} {\textrm{max}}_{0 \le b_{t}\le w_{t-1} } \mathscr {V}_{t} \left( w_{t-1} - b_{t}, \textbf{y}_{t-1} \left( b_{t} \varvec{\beta }^{{\textrm{ min}} \{ n, t\} ):n}_{t} \right) \right) \nonumber \\= & {} {\textrm{ max}}\left\{ \frac{t-n}{t}, 0\right\} \mathscr {V}_{t} (w_{t-1},\textbf{y}_{t-1} ) \nonumber \\{} & {} + \frac{1}{t} \sum _{j=1}^{{\textrm{ min}} \{ n, t\} } {\textrm{max}}_{0 \le b_{t}\le w_{t-1} } \mathscr {V}_{t} \bigg ( w_{t-1} - b_{t}, \textbf{y}_{t-1} \left( b_{t} \varvec{\beta }^{j:n}_{t} \right) \bigg ). \end{aligned}$$
(4)

The current value described in (4) is obtained as follows: The first term accounts for draws worse than the n-th best, in which case \(w_t = w_{t-1}\) and \(\textbf{y}_{t} = \textbf{y}_{t-1}\). The probability of this event is \(\frac{t-n}{n}\) [provided that \(t > n\) and zero else]. The further n terms consider expected utility contributions arising if the relative rank of the draw \(x_t\) is \(j=1,\dots ,n\). If \(x_{t}\) is the j-th best, the payoffs resulting from investing \(b_t\) in this opportunity are described by \(b_{t} \varvec{\beta }_{t}^{j:n} \), in addition, in this case, we have \(y^t_{i+1|k} = y^{t-1}_{i|k}\) for \(i=j, \dots , n-1\), \(k=i+1,\dots ,n\), and \(\textbf{y}_{t} = \textbf{y}_{t-1} \left( b_{t} \varvec{\beta }_{t}^{j:n} \right) \).

For \(t=0\) we get

$$\begin{aligned} \mathscr {V}_{0} (w_0)= & {} {\textrm{ max}}_{0 \le b_{1} \le w_0} \mathscr {V}_{1} \left( w_0-b_{1}, b_{1} \varvec{\beta }_{1}^{1:n},0,\dots ,0 \right) . \end{aligned}$$
(5)

For \(t = 1\), we have \(\tau ^{1}_{t}=1\), \(\tau ^{2}_{t},\dots ,\tau ^{n}_{t}=0\), \(\textbf{y}_{t} = \textbf{y}_{1}=\left( {y}_{1|1}^1,\dots ,{y}_{1|n}^1,0,\dots ,0\right) \), and for \(n> 1\) we getFootnote 4

$$\begin{aligned}{} & {} \mathscr {V}_{1} \big (w_1, {y}_{1|1}^1, \dots , {y}_{1|n}^1, 0,\dots ,0 \big ) \\= & {} \frac{1}{2} \textbf{1}_{ (\rho _2 (x_2) = 1 ) } {\textrm{ max}}_{0 \le b_2 \le w_1} \mathscr {V}_{2} \left( w_1 -b_{2}, b_{2} \varvec{\beta }^{1:n}_{1}, y_{1|2}^1,\dots ,y_{1|n}^1, 0,\dots ,0 \right) \\{} & {} + \frac{1}{2} {\textrm{ max}}_{0 \le b_{2} \le w_1} \mathscr {V}_{2} \left( w_1-b_{2}, y_{1|1}^1,\dots ,y_{1|n}^1, b_{2} \varvec{\beta }_{1}^{2:n},0,\dots ,0 \right) \, \end{aligned}$$

while for \(n=1\) we obtain

$$\begin{aligned}{} & {} \mathscr {V}_{1} \big (w_1, {y}_{1|1}^1, \dots , {y}_{1|n}^1, 0,\dots ,0 \big ) \nonumber \\= & {} \frac{1}{2} \textbf{1}_{ (\rho _2 (x_2) = 1 ) } {\textrm{ max}}_{0 \le b_2 \le w_1} \mathscr {V}_{2} \left( w_1 -b_{2}, b_{2} \varvec{\beta }^{1:n}_{1}, y_{1|2}^1,\dots ,y_{1|n}^1, 0,\dots ,0 \right) \nonumber \\{} & {} + \frac{1}{2} \mathscr {V}_{2} \left( w_1, y_{1|1}^1 \right) . \end{aligned}$$
(6)

The value functions \(\mathscr {V}_{t} (w_{t},\textbf{y}_{t} )\), \(t < n\), also follow from (4) in the same way, where the last \(n-t\) columns of \(\mathcal {Y}_{t}\) are zero.

Suppose that we follow the strategy “do nothing in the periods \( > t\)”. Before \(X_{t+1}\) realizes the decision maker knows \(w_{t}\) and \(\textbf{y}_{t} = \left( y_{1|1}^t,\dots ,y_{n(t)|n}^t, 0, \dots ,0 \right) ^{{{\,\mathrm{\top }\,}}}\), where \(n(t):= {\text{ min }\{n, t\}}\). If we do nothing in the remaining periods, we obtain \(w_{t} + y_{1| \ell _1}^t +\dots +y_{n(t)|\ell _{n(t)} }^t\), where \(\ell _j\), \(j=1,\dots ,n(t)\), are the final ranks for best n(t) draws at the end of period t. The probabilities of these ranks \(\ell _j\), \(j = 1,\dots ,n(t)\), are

$$\begin{aligned} p_{t} \left( \ell _1,\dots ,\ell _{n(t)} \right) = \frac{ \left( \begin{array}{c} T - \ell _{n(t)}\\ t - n(t) \end{array} \right) }{ \left( \begin{array}{c} T \\ t \end{array} \right) } . \end{aligned}$$
(7)

These probabilities are derived in Appendix A. Then, the conditional expected utility of the strategy “do nothing in the remaining periods \( > t\)” is given by

$$\begin{aligned} \mathscr {W}_{t}^{} (w_{t}, \textbf{y}_{t}):= \sum _{ \begin{array}{c} \ell _1,\dots ,\ell _{n(t)}: \\ 0< \ell _{1}< \dots < \ell _{n(t)} \le T - t + n(t) \end{array}} p_{t} \left( \ell _1,\dots ,\ell _{n(t)} \right) u \left( w_{t} + \sum _{j=1}^{n(t)} y_{j | \ell _j}^{t} \right) . \end{aligned}$$
(8)

Let \(Z \left( \ell _1,\dots ,\ell _{n(t)+1};w_{t} - b_{t+1}, \textbf{y}_{t} ( b_{t+1} \cdot \varvec{\beta }_{t+1}^{j:n}) \right) := (w_{t} -b_{t+1}) + y_{1| \ell _1}^{t} + \dots + y_{j-1| \ell _{j-1} }^{t} + \beta _{t+1}^{\ell _j} b_{t+1} + {y}_{j | \ell _{j+1} }^{t} + \dots + {y}_{ n(t) |\ell _{n(t)+1} }^{t}\) denote a final payoff for the strategy where \(b_{t+1} > 0\) is invested, if past and current investments have final ranks \(\ell _1,\dots ,\ell _{n(t)+1}\) [and the time-\(t+1\) draw has rank j at period \(t+1\)] while \(Z \left( \ell _1,\dots ,\ell _{n(t)+1};w_{t}, \textbf{y}_{t} (0 \cdot \varvec{\beta }_{t+1}^{j:n} ) \right) := w_{t} + y_{1| \ell _1}^{t} + \dots + y_{j-1| \ell _{j-1} }^{t} + 0 + {y}_{j| \ell _{j+1} }^{t} + \dots + {y}_{ n(t) |\ell _{n(t)+1} }^{t}\) is the payoff for the strategy “doing nothing” after time t if past (up to and including time t) investments have final ranks \(\ell _1,\dots ,\ell _{j-1}, \ell _{j+1}, \dots ,\ell _{n(t)+1}\). Endowed with this notation for our investment problem we get:

Theorem 1

Consider a monotone increasing and continuous utility function \(u(\cdot )\) and the conditional expected utility of the strategy “do nothing in the periods \( > t\)” defined in (8). For \(t=0,1,\dots ,T\), the value function \(\mathscr {V}_{t} (w_{t},\textbf{y}_{t} )\) satisfies

$$\begin{aligned}{} & {} \mathscr {V}_{t} (w_{t}, \textbf{y}_{t} ) = \mathscr {W}_{t}^{} (w_{t}, \textbf{y}_{t} ) + r_{t} \left( w_{t}, \textbf{y}_{t} \right) . \end{aligned}$$
(9)

The residual terms \(r_{t} \left( w_{t}, \textbf{y}_{t} \right) \) are obtained as follows: \(r_{T} (w_T,\textbf{y}_{T}) = 0\) and the \(r_{t} \left( w_{t}, \textbf{y}_{t} \right) \), \(t=T-1,\dots ,1\), follow from the recursion

$$\begin{aligned} r_{t} (w_{t},\textbf{y}_{t} ) =&{\textrm{ max}}\left\{ \frac{t+1-n}{t+1}, 0 \right\} r_{t+1} ( w_{t},\textbf{y}_{t} ) \nonumber \\&\quad + \frac{1}{t+1} \; \sum _{j=1}^{{\textrm{ min}} \left\{ {t+1}, n \right\} } \max _{0 \le b_{t+1} \le w_{t} } \bigg [ r_{t+1} \left( w_{t} - b_{t+1}, \textbf{y}_{t} \left( b_{t+1} \varvec{\beta }^{j:n}_{t+1} \right) \right) \bigg . \nonumber \\&\quad \bigg .+ \sum _{\begin{array}{c} \ell _1,\dots ,\ell _{n (t)+1}: \\ 0< \ell _{1}< \dots < \ell _{n (t)+1} \le T - t + n(t) \end{array} } p_{t+1} \left( \ell _1,\dots ,\ell _{ n(t)+1 } \right) \left( u \left( \tilde{Z} \right) \right. \bigg . \nonumber \\&\quad \bigg . \left. - \; u \left( Z \left( \ell _1,\dots ,\ell _{ n (t) +1};w_{t}, \textbf{y}_{t} ( 0 \cdot \varvec{\beta }_{t+1}^{j:n} ) \right) \right) \right) \bigg . \bigg . \bigg ] \, \end{aligned}$$
(10)

where \(\tilde{Z}= Z \left( \ell _1,\dots ,\ell _{ n (t) +1 };w_{t} - b_{t+1}, \textbf{y}_{t} ( b_{t+1} \varvec{\beta }_{t+1}^{j:n} ) \right) \). We have \(r_{t} (w_{t},\textbf{y}_{t}) \ge 0\) for all \(t=1,\dots ,T\) and \(\mathscr {W}_{0} (w_0 ) = w_0\). The optimal investments \(b_{t+1}\), \(t=0,1,\dots ,T-1\), are the maximizers of the expressions inside the squared brackets in (10). If \(\beta _{t}^{j}<1\) for all t and j, then \(b_t=0\) for all \(t=1,\dots ,T\). \(\beta _{t}^{j} \ge 1\) for some t and j is necessary but not sufficient for \(b_t>0\).

For the proof see Appendix B.

The above theorem is an extension of Bruss and Ferguson [3] and Łebek and Szajowski [15], who consider the cases \(n=1\) and \(n=2\), respectively. We observe a split of the value function \(\mathscr {V}_{t}\left( w_{t}, \textbf{y}_{t} \right) \) into two parts. The term \(\mathscr {W}_{t}\left( w_{t}, \textbf{y}_{t} \right) \) is the expected utility, that after period-t uncertainty has resolved and period-t investments have been made, the investor expects to receive, if from that moment on she would passively lean back without investing any of the non-invested remaining budget \(w_t\). This is the expected utility coming from past and present investments. As (8) shows, this term is easy to compute. The term \(r_{t} \left( w_{t}, \textbf{y}_{t} \right) \) is the additional expected utility that the investor expects to receive if she would instead invest optimally in the future, that is, in all the remaining periods. Since “leaning back" is always a feasible strategy, this additional expected utility must be non-negative.

For the optimal investment strategies we observe: (i) The maximization problem arsing from the term in squared brackets in (10) is non-linear in the investments \(b_{t+1}\). Hence, in general, we get some optimal \(b_{t+1}^*\), where \(0 \le b_{t+1}^* \le w_t\). By contrast, for the risk-neutral case considered in Sect. 3, we show that a “bang-bang” strategy, that is \(b_{t+1}^*\) is either 0 or \(w_t=w_0\), is an optimal strategy.

(ii) Consider the case of constant relative risk-aversion. In a purely static standard investment problem the optimal investment weights b/w, do not depend on the size of w. In the current investment model the investment weights \(b_t/w_t\) do not only depend on the relative rank of \(x_t\) and time t, but also on prior investments \(\textbf{b}_{t-1}\). To see this, note that prior investments enter via \(\textbf{y}_{t-1}\) into the utility maximization problem contained in the recursions (10).

(iii) Let us now consider the case when the number of periods considered becomes large. For linear utility, Bruss and Ferguson [3] [see Example 2.3.1] derived a closed form expression for the value function when the number of periods \(T \rightarrow \infty \). In this case, given an initial wealth \(w_0\) and a return \(\beta _k^1 = \beta ^1 >1\) the value function is \(w_0 \beta ^1 \exp (- (\beta ^1-1)/\beta ^1) > w_0\). Hence, even if the number of periods approaches infinity the investor expects to receive a strictly positive net return. As an example, for \(\beta ^1=2\) and an initial wealth of \(w_0 = 1\), the value function becomes \(\beta ^1 \exp (- (\beta ^1-1)/\beta ^1)=1.21306\). To consider the non-linear case we proceed as follows: In the classical secretary problem the goal is to maximize the probability to get the element with rank one when only the relative ranks are observed. Lindley [16] has shown that it is optimal to reject the first \(r^*\) options and then take the first alternative where the realized relative rank is equal to one. The number \(r^*\) follows from \(r^* = \min \{ 1 \le r \le T | \sum _{k=r+1}^T \frac{1}{k-1} \le 1 \}\). If we let the number of periods \(T \rightarrow \infty \), \(r^*\) becomes \(\frac{T}{e}\). With this strategy the probability of selecting the overall best approaches \(\frac{1}{e}\) for \(T \rightarrow \infty \). Using this, let us briefly remark on the case when T approaches infinity: Assume that \(\beta ^1_t \ge 2\), for all t and the utility function \(u(\cdot )\) is continuously differentiable. Consider the investment strategy: “Invest \(\eta >0\) in the first investment opportunity with relative rank 1 occurring at times larger than \(\frac{T}{e}\), and otherwise do not invest at all”. According to the solution of the classical secretary problem, the investor invests in the opportunity with absolute rank 1 with a probability of \(\frac{1}{e}\). In this case the investor receives a payoff of \(w_0 + (\beta _t^1 -1) \eta \). With probability \(\frac{1}{e}\) the opportunity with rank 1 occurred among the first \(\frac{T}{e}\) periods and hence the investor will not invest and will end up with a final wealth of \(w_0\). With the remaining probability of \(\frac{e-2}{e}\) the investor invests and receives at least \(w_0 - \eta \). This strategy yields the expected utility of \(\frac{1}{e} u \left( w_0 + (\beta _t^1 -1) \eta \right) + \frac{1}{e} u \left( w_0 \right) + \frac{e-2}{e} u \left( w_0 - \eta \right) \). As \(\frac{e-2}{e}< \frac{1}{e}\), for some small \(\eta >0\) and a monotone increasing and continuously differentiable utility function \(u(\cdot )\), the inequality \(\frac{1}{e} u \left( w_0 + (\beta _t^1 -1) \eta \right) + \frac{1}{e} u \left( w_0 \right) + \frac{e-2}{e} u \left( w_0 - \eta \right) > u(w_0)\) is satisfied. If there are further positive payoffs \(\beta _t^j>1\), \(j>1\), the expected utility of this strategy is larger and hence the condition on \(\beta ^1\) can be relaxed. Note that this inequality is independent of T. Obviously the expected utility of the above described strategy is a lower bound for the expected utility of the optimal strategy. Hence, we have:

Remark 2

Under the above assumptions on the \(\beta _t^1\) and the utility function, the expected utility of the optimal strategy will be strictly larger than \(u(w_0)\), even if T approaches infinity. In particular in the risk-neutral case, \(\beta ^1_t > 1\), for all t, is sufficient for an expected payoff larger than \(w_0\), also for \(T \rightarrow \infty \).

One of the most unrealistic features of our model is that investment opportunities must be taken up immediately. One could modify this by allowing for each opportunity a fixed time window (e.g., 3 periods), in which the decision maker could invest in this opportunity. So, in period t one can invest into the opportunities that appeared in the periods \(t-2\), \(t-1\), and t. Then not much would change qualitatively. Of course, the investor would always wait till the last moment before investing in a given opportunity, to get as much information as possible.

3 Risk neutrality

This section considers the linear case, where \(u(\cdot )\) is the identity function and the investor maximizes expected wealth. The risk-neutral case is mainly import for institutional investors such as venture capital firms. Suppose that the investor either invests 0 or the remaining budget \(w_{t}\) (which will turn out to be a feature of some optimal strategy for a risk-neutral investor). Then, by applying this strategy at t, the remaining budget \(w_t\) is either equal to \(w_0\) [such that \(b_1,\dots ,b_{t-1}=0\) and \(y_{j|\ell _j} =0\)] or the remaining budget is \(w_t=0\). In the second case the investor already invested \(b_{s} = w_0\) in some period s, where \(1 \le s \le t-1\). Hence, \(\mathcal {Y}_{t-1}\) contains at most one non-zero column. The non-zero entries are \(w_0 \varvec{\beta }_{s}^{ \rho _{t-1} \left( x_{s} \right) :n } = \left( y_{ \rho _{t-1} \left( x_{s} \right) | \rho _{t-1} \left( x_{s} \right) },\dots , y_{ \rho _{t-1} \left( x_{s} \right) | n } \right) ^{{{\,\mathrm{\top }\,}}}\). For ranks larger than n the return of the investment is zero and knowledge of the rank of \(x_{s}\) is irrelevant. For the strategies “doing nothing in the remaining periods \(> t\)” and “invest \(w_0\) or 0” we obtain

$$\begin{aligned}{} & {} \mathscr {W}_{t}^{} (w_{t}, \textbf{y}_{t} ) \nonumber \\= & {} {\left\{ \begin{array}{ll} w_0 &{} \text {, if } b_{s} = 0 \text { for all }\, s=1,2,\dots ,t \\ &{} \quad \quad \text {and } w_{t}=w_0, \\ w_0 \sum _{\ell = \rho _{t} (x_{s} )}^{n (t) } \ p_{t} \left( \{ \rho _{T} \left( x_{s} \right) = \ell \} | \rho _{t} (x_{s} ) \right) \beta _{s}^{\ell } &{} \text {, if } b_{s} = w_0 \text { at } s \in \{ 1,\dots ,t\} \\ &{} \quad \quad \text {and } w_{t}=0. \end{array}\right. } \ \ \ \ \ \ \end{aligned}$$
(11)

The probabilities \(p_{t} \left( \{ \rho _{T} \left( x_{s} \right) = \ell \} | \rho _{t} (x_{s} ) \right) \) that \(x_{s}\) has final rank \(\ell = \rho _{t} (x_{s}), \rho _{t} (x_{s})+1, \dots \) given that its current rank [immediately after \(X_t\) realizes] is \(\rho _{t} (x_{s}) \) follows from (7).Footnote 5 That is,

$$\begin{aligned} p_{t} \left( \{ \rho _{T} \left( x_{s} \right) = \ell \} | \rho _{t} (x_{s} ) =j \right) = \frac{ \left( \begin{array}{c} \ell -1 \\ j-1 \end{array} \right) \left( \begin{array}{c} T - \ell \\ t - j \end{array} \right) }{ \left( \begin{array}{c} T \\ t \end{array} \right) } \, \text { for }\,\ell = j, \dots , j + T - t \;. \end{aligned}$$
(12)

We recursively define the additional value that one obtains when investing an amount of 1 optimally after period t. For \(t=T\) we have \(c_T=0\). For \(t <T \) we get

$$\begin{aligned} c_{t-1} =&{\textrm{ max}}\left\{ \frac{t-n}{t}, 0\right\} c_{t} \nonumber \\&\quad + \frac{1}{t} \sum _{j=1}^{{\textrm{ min}}\left\{ n, t\right\} } \max \bigg \{ \sum _{\ell = j }^{ n (t)} p_{t} \left( \{ \rho _{T} \left( X_{t} \right) = \ell \} | \rho _{t} \left( X_{t} \right) = j \right) \left( \beta _t^{\ell }-1 \right) , c_{t} \bigg \}. \ \ \ \end{aligned}$$
(13)

The residual term \(c_{t} \) is the expected contribution per unit of remaining capital to the value function by investing optimally instead of investing nothing in the last \(T-t \) periods. In the proof of Theorem 2 we observe that \(c_t = r_t(1,\textbf{0})\). Then, for the linear case we obtain:

Theorem 2

Consider a risk-neutral investor where \(u(\cdot )\) is the identity \(id(\cdot )\).

  1. (i)

    An optimal investment strategy is “invest \(b_t=w_t = w_0\) at the first time t where

    $$\begin{aligned} \sum _{\ell = \rho _t (x_t) }^{ n(t) } p_{t} \left( \{ \rho _{T} \left( x_{t} \right) = \ell | \rho _t (x_t) \} \right) \left( \beta _t^{\ell } -1 \right) \ge c_{t} \end{aligned}$$

    and 0 else“.

  2. (ii)

    The value function \(\mathscr {V}_{t} (w_{t},\textbf{y}_{t} ) \) satisfies

    $$\begin{aligned}{} & {} \mathscr {V}_{t}^{} (w_{t}, \textbf{y}_{t} ) = \mathscr {W}_{t}^{} (w_{t}, \textbf{y}_{t} ) + w_{t} c_{t} \nonumber \\= & {} {\left\{ \begin{array}{ll} w_{0} ( 1 + c_{t} ) &{} \text {, if } b_{s} = 0 \text { for all}\, s=1,\dots ,t \\ &{} \quad \quad \text {and } w_{t} = w_0, \\ w_0 \sum _{\ell = \rho _{t} (x_{s} )}^{n} \ p_{t} \left( \{ \rho _{T} \left( x_{s} \right) = \ell \} | \rho _{t} (x_{s} ) \right) \beta _{s}^{\ell } &{} \text {, if } b_{s} = w_{0} \text { for one }\, s \in \{ 1,\dots ,t \} \\ &{} \quad \quad \text {and } w_{t}=0. \end{array}\right. } \ \ \ \ \ \ \end{aligned}$$
    (14)

For proof see Appendix C.

The value function \(\mathscr {V}_{t} (w_{t},\textbf{y}_{t} )\) is the sum of the conditional expected utility of the strategy “we do nothing in the remaining periods \(> t\)\(\mathscr {W}_{t}^{} (w_{t}, \textbf{y}_{t} )\) and the contribution of investing the remaining budget optimally in the remaining periods, i.e. \(w_{t} c_{t} \). We observe that, if it is optimal to invest a positive amount, then it is also optimal to invest the whole remaining budget \(w_0\). The secretary problem with generalized payoffs was already investigated in Mucci [18]. The chief difference to our model under risk-neutrality is that in the secretary problem a candidate has to be hired, while in our model the investor has the option of non-investing.

Remark 3

Note that the residual term in the risk-neutral case, \(r_{t} (w_{t}) = w_{t} c_{t}\), does not depend on prior realizations of the random variable \(X_s\), \(s \le t\). By contrast, in the general case the residual term is of the form \(r_{t} (w_{t}, \textbf{y}_t)\) [see Eq. (10)]. Hence, in the linear case the complexity of the optimization problem is reduced enormously, also from a computational point of view as observed in Sect. 5.

Let us now assume that \(\beta ^1_t \ge \beta ^2_t \ge \dots \ge \beta ^n_t \ge 0\), for all \(t = 1,\dots ,T\). It is intuitive that if the risk-neutral investor finds it optimal to invest at time t all her remaining budget in the current draw given that this draw has relative rank j, then she finds it also optimal to invest all her budget, if the draw at t has relative rank i with \(i < j\). Her position is better in the latter scenario. This implies that for each point of time there exists a threshold \(d_t\), such that the investor (upon having not invested before) invests all her budget at time t if the time-t draw has relative rank \(d_t\) or smaller.

Now let us assume in addition that the \(\beta ^j_t\) do not depend on time. That is, there are \(\beta ^1 \ge \beta ^2 \ge \dots \ge \beta ^n \ge 0\), such that \(\beta ^j_t = \beta ^j\), for all \(j = 1,\dots ,n\) and all \(t = 1,\dots ,T\). Then it is also intuitive that if the risk-neutral investor finds it optimal to invest at time t all her remaining budget in the current draw of relative rank j, then she finds it also optimal to invest all her remaining budget at a later time, if that later draw has relative rank j. The reason for this is that in later periods there are less possibilities for the rank of a past investment to increase. This results in the following remark:

Remark 4

Consider a risk-neutral investor and assume that \(\beta ^1_t \ge \beta ^2_t \ge \dots \ge \beta ^n_t \ge 0\), for all \(t = 1,\dots ,T\).

  1. 1.

    Then there are rank-thresholds \(d_t\), \(t = 1,\dots ,T\) with \(0 \le d_t \le n\) such that upon not having invested so far, the investor invests all the budget \(w_0\) in period t if \(\rho _t (x_t)\le d_t\). If \(d_s = 0\) for some period s, then this means that the investor never invests in that period.

  2. 2.

    If we assume in addition that the \(\beta ^j_t\) do not depend on time, then these rank-thresholds are non-decreasing over time. That is \(1 \le t < s \le T\) implies \(d_t \le d_s\). That means, these thresholds become less and less stringent over time.

For the cases \(T=3\) and \(T=5\) Appendix F obtains these thresholds for the case where \(\beta ^1_t=3\) and \(\beta ^2_t=2\), for all t. For \(T=3\) we get \(d_1=0\), \(d_2=1\), and \(d_3=2\), while for \(T=5\) the thresholds are \(d_1=d_2=0\), \(d_3=d_4=1\), and \(d_5=2\). That is, given the returns \(\beta ^1=3\) and \(\beta ^2=2\) and an investment horizon of \(T=3\) periods, the investor never invests at \(t=1\), she/he invests all the remaining wealth at \(t=2\) if the relative rank of the second draw is one, while she/he does not invest at \(t=2\) if the rank of the draw is two. At \(t=3\), the investor invests all the remaining wealth if the rank of the third draw is one or two, otherwise she/he does not invest.

Finally, let us compare the limited information to the full information case in a small example (all the calculations are provided in Appendix F, the case where only the best draw results in a positive return was already investigated in Bruss and Ferguson [3]). We once again consider \(T=3\) and \(\beta ^1=3\), \(\beta ^2=2\). For the limited information case, that is where the decision rule has to be rank dependent, we get the thresholds \(d_1=0\), \(d_1=1\) and \(d_2=2\) and an expected wealth of \(7/3 \approx 2.33\). For the full information case, that is if the investor is able to observe \(x_t\) at t, an agent invests into the first draw if and only if \(x_1 \ge 0.679449\). At \(t=2\) she invests all the remaining wealth if and only if the relative rank is one and \(x_2 \ge \frac{1-x_1}{2}\). At \(t=3\) all the remaining wealth is invested if the rank of the third draw is either 1 or 2, otherwise we do not invest. By applying this strategy under full information, the expected wealth is 2.59603. That is, already for a small number of periods the fully informed investor expects an additional wealth of approximately 0.2627. Hence, if an investor is only able to rank the projects compared to evaluating them quantitatively (observing \(x_t\)) the difference is quite costly.

4 Stochastic returns

Suppose that the returns per unit are random variables. These random variables are \(B_1^j,\dots ,B_T^j\), \(j=1,\dots ,n\), the realizations are abbreviated \(\beta _t^j\). The stochastic analogon of \(\varvec{\beta }^{j:n}_t\) is \(\textbf{B}^{j:n}_t\). All the returns are realized and therefore known by the investor after T. We assume \(\mathbb {P} \left( B_t^j> 0 \right) >0\) and \(\mathbb {P} \left( B_t^j \ge 0 \right) =1\) for all \(j=1,\dots ,n\) and \(t=1,\dots ,T\). We assume that the distribution of the return \(B_t^j\) is allowed to depend on t, the final rank \(j = \rho _T \left( X_t \right) \), and the realization of \(X_t\). The results of Sect. 2 can be extended to this stochastic case. Here, the deterministic returns are replaced by the stochastic returns \(B_t^j\). The utility of the final wealth \(u \left( w_{t} + \sum _{j=1}^{{\textrm{ min}}\left\{ n, t \right\} } {y}_{j| \ell _1}^t +\dots +{y}_{j| \ell _{ n(t) }}^t \right) \) has to be replaced by the conditional expectationFootnote 6 of \(u \left( w_{t} + \sum _{j=1}^{{\textrm{ min}}\left\{ n, t \right\} } {Y}_{j| \ell _1}^t +\dots +{Y}_{j| \ell _{ n(t) } }^t \right) \), where \({Y}_{j| \ell }^t\) are the stochastic analogons of \({y}_{j| \ell }^t\) considered in (1). We assume that the corresponding expected values and conditional expectations exist in the following.Footnote 7 Let

$$\begin{aligned} \mathbb {E}_{t} \left( \mathscr {W}_{t}^{} \left( w_{t}, {\textbf{Y}}_{t} \right) \right):= & {} \sum _{ \begin{array}{c} \ell _1,\dots ,\ell _{n (t) }: \\ 0< \ell _{1}< \dots < \ell _{n (t)} \le T-t + n (t) \end{array}} p_{t} \left( \ell _1,\dots ,\ell _{n (t)} \right) \nonumber \\{} & {} \ \ \ \cdot \mathbb {E}_{t} \left( u \left( w_{t} + \sum _{j=1}^{{\textrm{ min}}\left\{ n, t \right\} } {Y}_{j| \ell _1}^t +\dots +{Y}_{j|\ell _{ n(t) } }^t \right) \right) \, \end{aligned}$$
(15)

denote the utility contribution of the strategy “do nothing in the remaining periods \(> t\)”. In the stochastic setup the vector \(\textbf{y}_t\) becomes the random vector \(\textbf{Y}_t\), with coordinates \({Y}_{j| \ell }\). In addition, the stochastic payoffs when investing \(b_t\) are \(Z_t \left( \ell _1,\dots ,\ell _{n(t)};w_{t-1} - b_t, \textbf{Y}_{t-1} ( b_{t} \textbf{B}_t^{j:n} ) \right) := (w_{t-1} -b_t) + Y_{1| \ell _1}^{t-1} + \dots + Y_{j-1| \ell _{j-1} }^{t-1} + B_{t}^{\ell _j} b_t + {Y}_{j| \ell _{j+1} }^{t-1} + \dots + {Y}_{ n(t) -1 |\ell _{n(t)} }^{t-1}\) denote a final payoff for the strategy where we invest \(b_t > 0\), while \(Z_t (\ell _1,\dots ,\ell _{ n (t) };w_{t-1}, \textbf{Y}_{t-1} ( 0 \cdot \textbf{B}_t^{j:n} ) ):= w_{t-1} + Y_{1| \ell _1}^{t-1} + \dots + Y_{j-1| \ell _{j-1} }^{t-1} + 0 + {Y}_{j| \ell _{j+1} }^{t-1} + \dots + {Y}_{n (t) -1 |\ell _{n(t)} }^{t-1}\). Endowed with this notation for our investment problem we obtain:

Theorem 3

Consider the investment problem investigated in Theorem 1, with stochastic returns \(B_t^j\) replacing the deterministic payoffs \(\beta _t^j\). The value contribution of “doing nothing in the remaining periods \(> t\)” follows from (15). For \(t=0,1,\dots ,T\), the value function \(\mathbb {E}_t \left( \mathscr {V}_{t} (w_{t}, {\textbf{Y}}_{t} ) \right) \) satisfies

$$\begin{aligned}{} & {} \mathbb {E}_{t} \left( \mathscr {V}_{t} (w_{t}, \textbf{Y}_{t} ) \right) = \mathbb {E}_{t} \left( \mathscr {W}_{t}^{} (w_{t}, \textbf{Y}_{t} ) \right) + \mathbb {E}_{t} \left( r_{t} \left( w_{t}, \textbf{Y}_{t} \right) \right) . \end{aligned}$$
(16)

For the residual term we get \(r_{T} \left( w_T,{\textbf{Y}}_{T} \right) = 0\) and the recursion

$$\begin{aligned} \mathbb {E}_t \left( r_{t} (w_{t},\textbf{Y}_{t} ) \right) =&{\textrm{ max}}\left\{ \frac{t+1-n}{t+1}, 0 \right\} \mathbb {E}_t \left( r_{t+1} ( w_{t},\textbf{Y}_{t} ) \right) \nonumber \\&\quad + \frac{1}{t+1} \; \sum _{j=1}^{{\textrm{ min}} \left\{ {t+1}, n \right\} } \max _{0 \le b_{t+1} \le w_{t} } \bigg [ \mathbb {E}_t \left( r_{t+1} \left( w_{t} - b_{t+1}, \textbf{Y}_{t} \left( b_{t+1} \textbf{B}^{j:n}_{t+1} \right) \right) \right) \bigg . \nonumber \\&\quad \bigg . + \sum _{\begin{array}{c} \ell _1,\dots ,\ell _{n (t ) + 1 }: \\ 0< \ell _{1}< \dots < \ell _{n (t) +1 } \le T- t + n(t) \end{array} } p_{t+1} \left( \ell _1,\dots ,\ell _{ n (t) +1} \right) \mathbb {E}_t \left( \left[ u \left( \tilde{Z} \right) \right. \right. \bigg . \nonumber \\&\quad \bigg . \left. \left. - \; u \left( Z \left( \ell _1,\dots ,\ell _{n(t)+1};w_{t}, \textbf{Y}_{t} ( 0 \cdot \textbf{B}_{t+1}^{j:n} ) \right) \right) \right] \bigg . \bigg . \right) \bigg ] \, \end{aligned}$$
(17)

where \(\tilde{Z} =Z \left( \ell _1,\dots ,\ell _{ n(t) +1};w_{t} - b_{t+1}, \textbf{Y}_{t} ( b_{t+1} \textbf{B}_{t+1}^{j:n} ) \right) \). \(\mathbb {E}_{t} \left( r_{t} (w_{t}, {\textbf{Y}}_{t} ) \right) \ge 0\) for all \(t=0,1,\dots ,T\). The optimal investments \(b_{t+1}\), \(t=0,\dots ,T-1\), are the \(b_{t+1}\) solving the maximization problems \(\max _{0 \le b_{t+1} \le w_{t}} \mathbb {E}_t \left[ \cdot \right] \) in (17).

For a risk-neutral investors Theorem 3 implies

Corollary 1

Consider a risk-neutral investor where \(u \left( \cdot \right) \) is the identity. Suppose that \(B_t^j\) only depends on the realized rank of the random variable \(X_t\). Then, we can follow the deterministic case investigated in Theorem 2. That is for any \(X_t\) with final rank j the expected return is \(\beta _t^j= \mathbb {E} (B_t^j)\). This reduces the numerical burden in the simulation analysis where we need not approximate the conditional expectation terms by means of numerical integration.

Finally, we show that for a time invariant distribution of the returns, the value function is non-increasing in T. Hence, in a market with more superstar candidates it is harder to successfully invest.

Theorem 4

Consider a monotone increasing and continuous utility function \(u(\cdot )\) and let \(T \ge n \ge 1\) and \(w_0>0\). Let \(\mathbb {E}_{0}^T \left( \mathscr {V}_{0} (w_{0}) \right) \) denote the expected value function of the T period problem. Suppose that the distribution of \(B_t^1,\dots ,B_t^n\) does not depend on t. Then \(\mathbb {E}_{0}^T \left( \mathscr {V}_{0} (w_{0} ) \right) \ge \mathbb {E}_{0}^{T+1} \left( \mathscr {V}_{0} (w_{0} ) \right) \).

Proof

The following arguments are based on Chow et al. [5] and Assaf and Samuel-Cahn [1]. Consider the \(T+1\) period problem, but where the investor is told at the beginning in which period the draw with final rank \(T+1\) will occur. Obviously the value of this modified problem is at least as high as the original \(T+1\) period problem. By ignoring the period of the worst draw, the investor faces the same problem as a T period investor. \(\square \)

5 Simulation analysis and optimal investment

Table 1 Descriptive statistics: we consider \(n=2\), and stochastic returns \(B_1\) and \(B_2\)

In this section we perform a simulation study for risk-neutral investors as well as risk-averse investors with Bernoulli utility function \(u(w) = \ln (w)\). These cases are motivated by possibly different risk attitudes of venture capital firms and business angels. The initial wealth is \({w}_{0} =1\). For the non-linear case the computation of the residual term \(r_{t} (w_{t}, \textbf{y}_{t})\) is numerically intensive. To see this, for each t, \(t=1,\dots ,T\), we have to obtain the optimal investment strategy \(b_t\), given various \(w_{t-1}\) and \(\textbf{y}_{t-1}\). That is, we construct a grid for \(w_{t-1}\) and possible prior investments \(\textbf{b}_{t-1} = \left( b_1,\dots ,b_{t-1} \right) \) resulting in \(\textbf{y}_{t-1}\). In particular, we work with a grid of 20 points including the end points 0 and \({w}_{0}=1\), such that \(w_{t-1}\) and \(b_t\) are multiples of \(1/(20-1) = 0.0526\). More details are provided in Appendix E. By starting with \(r_{T} = 0\), the residual terms \(\mathbb {E}_t \left( r_{t} (w_{t}, \textbf{y}_{t}) \right) \) can be obtained recursively, for \(t=T,T-1,\dots ,1\). For the linear case, the term \(c_t\) is computed within seconds. Since the computational burden increases rapidly in T and n, we work with \(n=2\) and \(T=5\), 10 and 20. For the linear case we additionally consider \(T=50\). The number of Monte-Carlo replications is M = 5000.Footnote 8

Note that the random variables \(B_t^1\) and \(B_t^2\) represent the gross-returns. Performing the simulation study with realistic values, requires estimates of the distribution of the gross returns \(B_t^{\ell }\). Estimates of sample means based on empirical data are e.g. provided in Korteweg and Nagel [14][Table IV]. According to literature, the log-normal distribution provides a reasonable approximation of the return distribution. To approximate returns we use estimates of log-returns provided in Cochrane [7][Table 4; ’Parameter estimates in the round-to-round sample’], where the sample mean of the log-returns is 0.2 and the estimated standard deviation is 0.84, such that the mean arithmetic net-return is close to 1.73 and the sample variance is 1.36.

By using these numbers we have a first estimate for gross-returns of superstar projects. Up to our knowledge more precise estimates on the expected gross-returns of the first best, the seconds best, etc. and the corresponding variances are not available. In the following simulation study we assume that the first best has a gross-return higher than the expected gross-return following from Cochrane [7][Table 4], while the second best has an expected return below this value. In particular, the expectation of the gross-returns \(B_t^1\) should be larger than the estimate \(1+1.73\) obtained in Cochrane [7], while the expectation of \(B_t^2\) should be smaller that 2.73. In particular, we assume that \(B_t^1\) is log-normally distributed with mean parameter 0.8 and variance parameter \(\sigma ^2= 0.8^2\), while \(B_t^2 \) is log-normally distributed with mean parameter \(\mu _2 = 0.2\) and variance parameter \(\sigma ^2 = 0.8^2\), for all \(t=1,\dots ,T\). The arithmetic gross-returns \(B_t^1\) have an expectation of \(\exp \left( \mu _1 + \frac{1}{2} \sigma ^2 \right) = 3.0649\) and variance \( \left( \exp \left( \sigma ^2 \right) -1 \right) \exp \left( 2 \mu _1 + \sigma ^2 \right) = 8.24 \), while \(B_t^2\) has expectation \(\exp \left( \mu _2 + \frac{1}{2} \sigma ^2 \right) = 1.6820\) and variance \( \left( \exp \left( \sigma ^2 \right) -1 \right) \exp \left( 2 \mu _2 + \sigma ^2 \right) = 2.54\), respectively. The probability \(\mathbb {P} \left( B^1_t \le 1 \right) \approx 15.87 \%\) while \(\mathbb {P} \left( B^2_t \le 1 \right) \approx 40.13 \%\). By the properties of the log-normal distribution \(\mathbb {P} \left( B^j_t> 0 \right) > 0\). Hence, in our simulation study we allow for realizations of \(B^j_t\) which are smaller than one [negative net-returns] also for investment projects with final best or second best rank with positive probability. By the properties of the log-normal distribution the returns \(B^j_t\) are positive (almost surely).

Table 1 presents the sample mean (mean), the sample standard deviations (sd), the minimum (min), the median, and the maximum (max) of realizations of the final wealth, \(w_T + Y_{1| 1}^{T} + Y_{2| 2}^{T} = w_T + b_{\tau _1^T} B_{\tau _1^T}^1 + b_{\tau _2^T} B_{\tau _2^T}^2\), the realizations of the final payoff, \(Y_{1| 1}^{T} + Y_{2| 2}^{T}\), and the final non-invested budget \(w_T\). For a risk-averse investor with logarithmic Bernoulli utility function we additionally consider realizations of final utility \(u \left( w_T + Y_{1| 1}^{T} + Y_{2| 2}^{T} \right) = \ln \left( w_T + Y_{1| 1}^{T} + Y_{2| 2}^{T} \right) \) [by “final” we mean the point of time, when all investment decisions haven taken place and the random variables \(B_t^1\) and \(B_t^2\) are realized].

Table 1 shows that for the linear and the risk-averse case the sample means of the final wealth are decreasing in T. The mean final payoff in the linear case and mean final utility in the non-linear case decrease as expected from Theorem 4. In addition, Table 1 presents the minimum, the maximum and the median of the realized final wealth, the final payoff and the non-invested budget.Footnote 9 Since in the linear case—conditional on investing—the investor invests all budget \(w_0\), we observe a minimum final budget of 0. The maximum of the non-invested final budget is \(w_T = w_0 = 1\). Hence, with \(T=5\), 10, 20 and 50, we observe paths of the random variable \(X_t\), \(t=1,\dots ,T\), where the investor does not invest. The maxima of the realizations of \(w_T + Y_{1| 1}^{T} + Y_{2| 2}^{T}\) and \(Y_{1| 1}^{T} + Y_{2| 2}^{T}\) obtained for the linear case are almost the same, differences are due to sampling effects. Comparing the linear to the risk-averse case—as can be expected by economic intuition—the mean final wealth is smaller while the average non-invested budget \(w_T\) is larger for the risk-averse investor (for any fixed T). In addition, we observe investment paths where the risk-averse investor invests all the remaining budget available. This happens if the relative rank is one and the number of remaining steps to go is one or zero, or if the relative rank is equal to two and there is no further step to go. For a final rank of \(x_T\) \(\le 2\), the decision of a log-utility investor to spend all the remaining budget into this risky alternative depends on the distribution of the returns and the remaining budget \(w_{T-1}\) (recall that the returns \(B_T^1\) and \(B_T^2\) are random variables).

Note that for the risk-neutral case and our assumption that the expectation of \(B_t^1\) is larger than two, the argument of Remark 2 applies here. That is, for \(T \rightarrow \infty \) the expected payoff is larger than the initial wealth w. In line with this, our simulation analysis shows that also for \(T=50\), the average final wealth remains significantly above the initial wealth \(w=1\) and the drop from \(T=10\) to \(T=50\) is rather small [around 4%].

By considering the Tables 2, 3, 4 and 5, we are able to have a closer look at the structure of the optimal investment paths. The tables show the average amounts invested at t in an investment alternative where the realized rank at t is 1 (or 2) [invest rank 1/invest in rank 2 in the corresponding tables]. For the log-utility case we also report the average of the corresponding amounts invested at period t (amount best/amount second best; for the linear case these amounts are equal to the average investments since \(w_0=1\) and conditional on investing to invest all is optimal). For the log-utility case, we report the sample average of \(b_t/w_t\) and the corresponding sample means if the rank is 1 or 2, respectively.

Consider for example the case where \(T=5\). A risk-neutral investor invests for the first time at \(t=3\) if the relative rank of the realization of \(X_3\) is equal to one. In the case of a risk-neutral investor all budget, i.e. \(w_0=1\), is invested at \(t=3\). The probability that the relative rank of \(X_3\) is equal to one is \(\frac{1}{3}\). The number observed, 0.3264, is close to \(\frac{1}{3}\). At \(t=4\) the investor invests in 17.14% of the random draws. Note that the probability that \(X_4\) has relative rank of one is 1/4. Since with a probability of \(\frac{1}{3}\) the risk-neutral investor invested all the budget before \(t=4\), the probability that the relative rank of \(X_4\) is one and the investor did not invest before is \(\frac{1}{4} \frac{2}{3} = \frac{1}{6} \approx 0.1667\). The value observed, 0.1714, is close to \(\frac{1}{6}\). For \(t=5\) the investors starts to invest into the second best opportunity. The probabilities for the best and second best in this period are \(\frac{1}{5}\) each time. With probability \(\frac{1}{2}\) no investment happened in the first four periods. Thus, the corresponding probabilities to invest into the best or second best alternative in \(t=5\) are \(\frac{1}{2} \frac{1}{5} = \frac{1}{10}\) each time. The corresponding numbers in the first and the second row of Table 2 are close to these values. With \(T=10\), 20 and 50, for the linear case investments in rank 1 follow from the probabilities that the relative rank is equal to one and the investor did not invest before. For \(T=10\), the probability that \(X_4\) has rank one is \(\frac{1}{4}\), etc. In addition, we observe that the risk-neutral investor starts to invest in periods larger than T/3 [of course this empirical observations depends on the distributions chosen for \(B_t^1\) and \(B_t^2\)].

Next, we consider the log-utility investor: For \(T=5\) the investor already starts to invest at \(t=2\) if the relative rank is one. This happens with probability \(\frac{1}{2}\) and the amount \(b_2 = 0.3158\) is invested. The relative frequencies where the risk-averse investor invests into an alternative with rank equal to one is close to \(\frac{1}{2}\) at \(t=2\). For \(t \ge 3\) the numbers in the ’invest rank 1’ row are higher than in the risk-neutral case. This is caused by the fact that the risk-averse investor did not invest all the budget available at \(t=2\). The investor invests on average 0.5823 if the relative rank of \(x_3\) is one and 0.0798 if the relative rank of \(x_3\) is two. The final rows show the proportions of the remaining budget invested in the current best or current second best alternative. Note that for \(t \ge 3\) the optimal investments \(b_3\) also depend on the distribution of \(Y_{1|1}^2\), \(Y_{1|2}^2\) and \(Y_{2|2}^2\). The ratio \(b_t/w_t\) is increasing in t for both, investing into the first best and investing into the second best draw, respectively.

Compared to the risk-neutral case, the risk-averse agent starts to invest in earlier periods. For example for \(T=10\), if the relative rank of \(x_3\) is one the amount invested is \(b_3=0.0526\) at \(t=3\). Hence, the risk-averse investor already invests smaller amounts into alternatives with rank one in earlier stages of the investment process. Since the expected returns are quite high, all the remaining budget is invested at \(t=T\) if the relative rank (\(=\) final rank) of \(x_T\) is one [to see this \(b_t/w_t=1\) for \(t=T\) with a rank of one]. For \(x_T\) with a rank of two we observe that \(b_T/w_T\) is close but not equal to one. This effect is caused by paths where a larger \(w_T\) is available. For the given payoff distribution of \(B_T^2\) the investor need not invest all the remaining budget. In our case, the probability that \(\mathbb {P} \left( B^2_T \le 1 \right) \approx 40.13 \%\). Hence, for a sufficiently large remaining budget \(w_{T-1}\), the risk-averse investor need not invest all the remaining \(w_{T-1}\).

Table 2 Investment paths: We consider \(n=2\), \(T=5\) and stochastic returns \(B_1\) and \(B_2\)
Table 3 Investment paths: We consider \(n=2\), \(T=10\) and stochastic returns \(B_1\) and \(B_2\)
Table 4 Investment paths: We consider \(n=2\), \(T=20\) and stochastic returns \(B_1\) and \(B_2\)
Table 5 Investment paths: We consider \(n=2\), \(T=50\) and stochastic returns \(B_1\) and \(B_2\)

6 Conclusion

In this article we consider superstar investment problems, where the payoff function depends on the final ranking of the project. Our model investigates high-risk investment opportunities, where an investor is only able to rank the projects seen so far. Investment decisions have to take place immediately after a project shows up. Returns realize at the end when all investment decisions have been taken. Only for a small number of superstars positive returns can be expected.

We obtain the value function and the optimal investment rules for expected utility investors for deterministic as well as stochastic returns. In any case (e.g. risk-neutrality and risk-aversion) the value can be split up into the expected utility given that the investor does nothing in the remaining periods and the additional gain arising from investing the remaining budget optimally. We prove that—as might be expected—the value function is non-increasing in the investment horizon. What is perhaps more surprising is that the expected utility of our optimal strategy is strictly larger than the utility of the initial endowment, even if the number of projects approaches infinity (where we keep the returns of the n best investment opportunities constant).

We show that for the risk-neutral case a simple rank-threshold strategy is optimal for time invariant returns. These rank thresholds are decreasing over time. That is, the conditions to invest are less and less demanding. To invest “all or nothing” – conditional on whether such a threshold is satisfied—is an optimal strategy.

For a risk-averse investor the optimal strategy becomes non-linear. The optimal investment in period t depends on the rank of the current project and the prior investments. For the special case of log-utility, except for very last few periods (at most n periods), it is never optimal to invest all the remaining wealth.

We numerically implement our optimal strategies for a risk-neutral and a log-utility investor. The number of superstars in our simulation analysis is two. The assumed distribution of the returns is based on estimates in Cochrane [7]. In line with our theoretical findings, we observe that the value function is decreasing in the number of investment periods. On average the risk-neutral investor invests a larger share of the initial budget available than a log-utility investor. The risk-averse investor starts to invest earlier, but the amounts invested are smaller than in the risk-neutral case, where—upon investing—without loss of generality all the total budget available is invested.

In the next steps the model considered in this article could be modified to become more realistic. For example, we assumed that due to competition the investor is forced to decide immediately whether to invest into the current investment project. This assumption could be relaxed by allowing the investor to decide to invest into the current project today and in the next \(m>0\) periods. A further possible modification is to consider multiple investment rounds as observed on the venture capital market. To do this, a second signal could be included which is correlated with the ranking of the project in question, where the investor—after some time span—has to decide to either quit investing into the corresponding project or to impute further capital.

Throughout our analysis the number of time periods (i.e. the number of possible investment projects) is fixed at T. Hence, instead of working with a deterministic number of projects, an extension would be to describe the number of projects by a random variable. A starting point to investigate problems with a stochastic number of projects could be Bruss and Ferguson [3][Section 6].

A further interesting way to extend our approach is to consider ambiguity-averse investors. For the classical secretary problem Chudjakow and Riedel [6] obtained an optimal threshold strategy also for ambiguity-averse agents. The optimal strategy was obtained by applying the optimal stopping results derived in Riedel [22]. An extension to the full information best choice problem—originally investigated in Gilbert and Mosteller [10] – was obtained in Obradović [19].

A natural way to extend our approach is to augment the approach of Chudjakow and Riedel [6] to the investment problem considered in this article. Thereby the question arises whether under risk-neutrality and ambiguity-aversion a strategy of the form “invest all or nothing (conditional on time and rank)” is still optimal and under which assumptions the investment thresholds are shifted to earlier or later periods. While we are optimistic that the risk-neutral case can be extended to ambiguity-aversion (at least for deterministic returns), we expect that the non-linear case becomes even more involved.