1 Introduction

Investment-cash flow sensitivities (ICFS) have attracted significant interest in the corporate finance literature and a growing number of contributions have documented empirical as well as theoretical evidence for non-linearities in the relationship between investment and cash flow (see e.g. Cleary et al. 2007; Guariglia 2008; Hovakimian 2009, or Lyandres 2007). In particular subsequent to the debate between Fazzari et al. (1988) and Kaplan and Zingales (1997), many contributions were putting their focus on the question if nonlinearities exist and if ICFS are useful for measuring financial constraints. In contrast, only little attention has been paid sofar to the question why nonlinearities occur and which economic mechanism is driving them.Footnote 1 We address this latter question by shedding new empirical as well as theoretical light in particular on the negative part of the investment-cash flow relationship. We do so by discriminating between two prominent explanations for a negative ICFS in the recent literature. On the one hand, Cleary et al. (2007) is a prominent contribution which put forward a model of a U-shaped investment curve where the negative ICFS is driven by the following economic mechanism: Firms with low internal funds prefer to invest more since their financing costs actually decrease due to the fact that creditors (as senior claimholders) are increasingly likely to capture the corresponding investment return in case of a default. This mechanism has been labeled the revenue effect. On the other hand, Hovakimian (2009) argues that the negative ICFS for low internal funds is likely to be the result of a corporate life-cycle pattern: In an early stage of the life-cycle, firms tend to display both, low (even negative) cash flows as well as high investments. A negative relationship between both variables then follows from their tendency to develop in opposite directions as the firm matures. Our empirical identification strategy to discriminate between both hypotheses builds upon the model of Cleary et al. (2007). While we adopt their basic model formulation, our innovation is to formulate and prove two empirically relevant extensions, which yield novel model predictions. Testing them on a comprehensive data sample – which covers in particular also small- and medium-sized enterprises – uncovers the following main result of our analysis: Although we do find a U-shaped investment curve similar to Cleary et al. (2007), we find no convincing evidence that supports the investment revenue effect, while we can largely confirm the life-cycle hypothesis of Hovakimian (2009).

More specifically, we contribute in the following ways to the existing literature. First, in order to derive testable predictions and to disentangle between the revenue effect and the corporate life-cycle effect, we extend the theoretical model by Cleary et al. (2007) along two dimensions: We show how ICFS vary with (i) the extent of a firm’s senior debt financing, and (ii) the riskiness of firm’s revenues. Both aspects are also directly related to the corporate life-cycle hypothesis, as smaller and younger firms with higher growth opportunities tend to display lower senior debt ratios and higher levels of revenue risk. We take both model predictions to the data and find contradictory results regarding the cost-revenue effect. Second, testing the corporate life-cycle hypothesis necessitates a dataset which covers companies in their early stages. Therefore, relying on the standard Compustat universe, as it is done in many studies (see e.g. Cleary et al. 2007; Aǧca and Mozumdar 2008), runs into severe limitations as it covers predominantly the segment of large mature US firms.Footnote 2 In contrast, to circumvent these limitations, we test our hypothesis on a comprehensive German dataset that explicitly covers the SME segment. We obtain a data sample of 75,692 firm year observations (after data cleansing) out of which approximately 65% can be classified as SME. We show that the evidence for non-linear ICFS is substantially stronger within the subset of SME companies. Thus, we find strong support for the arguments in Hovakimian (2009). By using a German dataset, we also add to the literature on international evidence on ICFS such as in Pawlina and Renneboog (2005) for UK data, Bertoni et al. (2010) for Italian evidence or Moshirian et al. (2017) for a global comparison. Third, by running spline regressions similar to Cleary et al. (2007), we confirm the nonlinear relationship of capital expenditures with internal funds more generally. Thereby we complement and strengthen the mounting evidence for negative ICFS as already documented in various papers (see e.g. Allayannis and Mozumdar 2004; Cleary et al. 2007; Aǧca and Mozumdar 2008; Hovakimian 2009).

Since the Cleary et al. (2007) model has received significant attention and is used amongst others in Chowdhury et al. (2016), our results imply that their model setup has to be used cautiously. To be clear, our results by no means invalidate the consistency of their model. However, they strongly question the interpretation that the revenue effect is a first-order explanation for the negative ICFS in the real data.

There exists a large bunch of literature on ICFS with a focus on financial constraints which displays partly conflicting results. On the one hand there is evidence of a positive correlation between cash flow and investment among financially constrained companies. For instance, Fazzari et al. (1988) and many subsequent studies such as e.g. Carpenter et al. (1998); Guaraglia (1998), or Carpenter and Petersen (2002), find that firms with low-dividend payout ratios have positive investment-cash flow sensitivities. On the contrary, Kaplan and Zingales (1997) showed that the investment rates of financially unconstrained firms exhibit a higher sensitivity to cash flow than the ones of financially constrained firms, thereby suggesting that ICFS should be used with caution when predicting financial constraints. Following the debate, one can easily ascertain that the dissimilarities in results can be attributed to the differences in the sub-samples used. While Fazzari et al. (1988) applied capital market imperfections to divide firms into sub-samples of financial constraints, Kaplan and Zingales (1997) used financial ratios that are highly correlated with internal funds. Subsequent to this debate, a prominent model by Cleary et al. (2007) proposes a theoretical model as well as empirical support which shows that investment is rather U‑shaped in internal funds. With this model they are – amongst others – able to reconcile the conflicting results of Fazzari et al. (1988) and Kaplan and Zingales (1997).

It is important to recognize, that the economic mechanism which drives the U‑shape is given by what they call the tradeoff between the cost and the revenue effects of investment. To put it simply, the negative ICFS for low level of internal funds derives from the fact that (within the model) the investor (i.e. creditor) will benefit to a larger extent from additional investments in the firm, the closer the firm gets to default because the creditor as senior claimholder will then capture the investment return. While Cleary et al.’s (2007) model setup follows a fairly standard textbook description of debt contracts, it is ultimately an empirical question if it describes a first-order effect in real-world decision-making.

An alternative explanation for a U-shaped investment curve has been put forward by Hovakimian (2009), who argues that the negative ICFS for low internal funds is caused by the corporate life-cycle pattern. At an early stage of the business life-cycle, firms tend to have both, low cash flows and high investment opportunities, which then tend to develop in opposite directions. The empirical results in Hovakimian (2009) as well as in Hovakimian and Hovakimian (2009) are strongly supporting her line of reasoning.

By considering these two competing rationales in the literature, the question arises which effect prevails and drives the nonlinearities in ICFS in a data sample containing both SMEs and large-sized firms. Our analysis attempts to contribute to the clarification of this particular question. Our focus is thereby on a cross-sectional perspective and does not address the temporal variation in ICFS, such as Aǧca and Mozumdar (2008); Brown and Petersen (2009) and recently Christodoulou and Artem Prokhorov (2022) who discuss the apparently declining evidence for ICFS.

The remainder of the paper is structured as follows. Sect. 2 formulates the extensions of the theoretical model by Cleary et al. (2007) and derives testable predictions. Sect. 3 presents the empirical analysis, including the strategy, the summary statistics and the results. Finally, Sect. 4 concludes. Proofs and further methodological details are contained in the Appendices.

2 Testable model predictions

This section outlines the derivation of our empirical hypotheses. To clearly identify the economic mechanism and to derive testable predictions, we review briefly the Cleary et al. (2007) model.Footnote 3 We then develop and proof two extensions to the model, which are able to yield testable hypotheses which can discriminate between the cost-revenue and the life-cycle hypothesis. To fix notation, consider a firm with access to a scalable investment opportunity \(I\), that yields a return of \(F(I,\theta)\) which depends on the state variable \(\theta\) (say, e.g. random demand) being distributed on \([\underline{\theta},\bar{\theta}]\) with pdf (cdf) \(\omega(\theta)\) (\(\Omega(\theta)\)). The firm has internal funds of \(W\) and finances the remaining \(I-W\) through a debt contract.Footnote 4 The firm defaults if \(\theta\) realizes below a critical threshold \(\hat{\theta}\), which in turn is determined by the (nominal) amount of debt financing \(D\). In case of default, the firm will be either liquidated at the current market value of assets \(L\) or is allowed to continue as going-concern with the former owners in which case the assets have a value of \(\pi_{2}\), with \(\pi_{2}> L\). The difference \(\pi_{2}-L\) can be interpreted either as private benefits of owners which are lost in default or as bankruptcy costs. Investors’ (break-even) participation constraint is therefore given by:

$$\int\limits_{\underline{\theta}}^{{\hat{\theta}}}\left({{{F}({I,\theta})}}+\frac{{{{D-F}({I,\theta})}}}{{{\pi}_{{2}}}}{L}\right){{\omega}({\theta})d{\theta}}+ \left( 1- {\Omega} ({\hat{\theta}})\right) {D}={{I-W}}.$$
(1)

The second term of the integrand represents the liquidation value \(L\) which is multiplied with the probability of liquidation which is \((D-F(I,\theta))/\pi_{2}\) and which follows from deriving the optimal debt contract for a situation of non-verifiable project returns such as in Diamond (1984) or Bolton and Scharfstein (1990). The LHS of (1) represents the expected payoff to investors which has to equal the provided investment amount \(I-W\) in order to break-even. However, note that the liquidation value \(L\) is not crucial to results and may be set to zero without changing the qualitative implications of the model.Footnote 5 It is only important that \(L<\pi_{2}\), which represents the threat for existing owners to loose their private benefits in the case of liquidation and which guarantees that owners repay creditors truthfully.

The payoff to owners in solvent states is simply the project returns plus their additional private benefits net of debt cost, i.e. \(F(I,\theta)+\pi_{2}-D\). In default states, the owners repay investors in full and retain their private benefits \(\pi_{2}\) only in case of continuation, which happens with probability \(1-(D-F(I,\theta))/\pi_{2}\). The owners objective function is therefore

$$\int_{\underline{\theta}}^{\hat{\theta}}\left(1-\frac{D-F(I,\theta)}{\pi_{2}}\right)\pi_{2}\omega(\theta)\mathrm{d}\theta+\int_{\hat{\theta}}^{\bar{\theta}}(F(I,\theta)-D+\pi_{2})\omega(\theta)\mathrm{d}\theta\Leftrightarrow\mathbb{E}(F(I,\theta))+\pi_{2}-D(I,W),$$
(2)

which is optimized over investment volume \(I\) and debt amount \(D\) (or equivalently the default threshold \(\hat{\theta}\)) given its level of internal funds \(W\).Footnote 6 The constraint maximization program in (1)-(2) yields the optimal pair \((I^{*},\hat{\theta}^{*})\), which can be considered as a function of \(W\) and therefore plotted against the exogenous \(W\). Cleary et al. (2007) show that under fairly general conditions the graph of \(I^{*}\) is convex, or more precisely U‑shaped in \(W\), which they term the U‑shaped investment curve.

It is important to recognize from (1) that debt financing costs are directly affected by the investment decision of the firm. As the first term on the LHS shows, creditors capture the investment returns in distress states (plus an additional liquidation value independent of investment). This dependence on the investment return is called the revenue effect. In the extreme case of imminent bankruptcy (i.e. as \(\hat{\theta}\rightarrow\bar{\theta}\)), creditors are essentially the new entrepreneurs and thus optimal investment will be first-best. Therefore, the revenue effect essentially makes expected (marginal) debt financing costs, and thus also the optimal investment, non-monotonic in internal funds. In particular, it displays a negative ICFS for low or negative values of internal funds.

The Cleary et al. (2007) model is consistent with empirical evidence in the sense of being able to generate the U‑shaped investment curve. However, other models such as Hovakimian (2009) or Lyandres (2007) are also consistent with the U‑shape. So, to discriminate between explanatory hypotheses and to derive further testable predictions, we need to extend the Cleary et al. (2007) model. While we follow the basic model setup of Cleary et al. (2007), our innovation is to provide extensions in two directions. As we want to disentangle between the cost-revenue effect by Cleary et al. (2007) and the corporate life-cycle hypothesis by Hovakimian (2009), we implement testable predictions that are also relevant for the corporate life-cycle hypothesis. First, we allow for different degrees of riskiness of the investment return, and second, we introduce outstanding senior debt.Footnote 7 Firms at an earlier stage of their life-cycle tend to exhibit more risky revenues, while their long-term debt ratios tend to be lower.

Model extensions and predictions

With respect to the riskiness, it is natural to analyze the change in the ICFS given a change in the variance of \(\theta\) by holding constant its mean; i.e. for a mean-preserving spread applied to \(\omega(\theta)\). To be consistent with Cleary et al. (2007), we adopt the same parametric specification, where \(F(I,\theta)=\theta\sqrt{I}\) and \(\theta\) is uniformly distributed, i.e. \(\theta\sim U[\underline{\theta},\bar{\theta}]\). For the uniform distribution, it is obvious that variance is governed by the choice of \(\underline{\theta}\) and \(\bar{\theta}\), so we denote \(\mu\) as its mean \(E(\theta)\) and define \(\underline{\theta}=\mu-\delta\) and \(\bar{\theta}=\mu+\delta\), so \(\delta\) represents the standard deviation of \(\theta\). As we prove in Appendix A.1, it can be shown that the default threshold \(\hat{\theta}\) increases with \(\delta\) (i.e. \(\mathrm{d}\hat{\theta}/\mathrm{d}\delta> 0\)), which makes the investor more sensitive to the investment return in the default states. The reasoning behind the relationship appears straightforward. Higher risk makes debt financing generally more expensive, i.e. investors demand a higher repayment for the additional risk in revenues thereby increasing the sensitivity of financing costs with respect to investment returns. More formally, we prove the following model prediction:

Prediction 1a:

As the riskiness of the state variable increases in the sense of a mean-preserving spread, the ICFS (\(I_{W}\)) always decreases, i.e. \(\frac{\mathrm{d}I_{W}}{\mathrm{d}\delta}<0\).

Furthermore, from the property of prediction 1a, it directly follows that the average investment decreases with risk, i.e. for a given investment curve, an increase in risk leads to an investment curve which always lies below the initial one. We summarize this as Prediction 1b:

Prediction 1b:

As the riskiness of the state variable increases in the sense of a mean-preserving spread, the investment curve always decreases, i.e. for \(\delta^{\prime}> \delta\), \(I(W,\delta^{\prime})<I(W,\delta),\forall W\).

Proofs are in Appendix A.1. We illustrate the predictions numerically in Fig. 1 by using the same parameter set as Cleary et al. (2007) in the left panel (\(\theta\) being uniformly distributed) and using an alternative specification (\(\theta\) being lognormally distributed) in the right panel. In both panels of Fig. 1, the solid (gray) line displays the curve with the highest level of risk, while the bold dashed line represents the curve with the lowest level of risk. As implied by the proofs of our predictions, Fig. 1 clearly shows that a higher riskiness (i.e. higher \(\delta\) or \(s\)) of the underlying state variable \(\theta\) leads to a higher convexity of the investment curve, i.e. the curve is steeper when the investment-cash flow sensitivity is negative. Furthermore, the overall level of investment decreases with riskiness as well, in line with the intuition from the theoretical model. Note that in the left graph of Fig. 1, the solid line represents the benchmark case of Cleary et al. (2007).

Fig. 1
figure 1

Riskiness and the U‑shaped investment curve. Notes: The graphs plot the investment curve \(I(W)\) for different levels of risk. The left panel uses the parameter set of Cleary et al. (2007) with \(\theta\) being uniformly distributed, i.e. \(\theta\sim U[\underline{\theta},\bar{\theta}]\), with \(\underline{\theta}=\mu-\delta\) and \(\bar{\theta}=\mu+\delta\), where \(\mu=2\) and \(\delta=2\) (solid line), \(\delta=1.8\) (gray dashed line), \(\delta=1.6\) (thick black line). The right panel uses an alternative parametric assumption with \(\theta\) being lognormally distributed, i.e. \(\theta\sim LogN(m,s^{2})\) with \(m=\{\)1; .895; .78\(\}\) and \(s=\{\)1; 1.1; 1.2\(\}\) leading to a constant \(E(\theta)=e^{(3/2)}\) for all three graphs but increasing variance. Again, the thick black line corresponds to the lowest level of risk

Note further, that the life-cycle hypothesis relies on the idea that younger firms tend to invest (proportionally) more while being at the same time exposed to a larger extent of business risk. As they mature, they tend to be able to consolidate their revenue streams and thus become safer. Empirically, this tendency would manifest as a (cross-sectionally) positive relation between risk and investment level.

Next, we turn to the second model extension concerning the pre-existence of senior debt. We modify the Cleary et al. (2007) model in a straightforward way by assuming that the firm has already senior debt outstanding, denoted by \(D^{s}\). The investment will be financed by junior debt, \(D^{j}\), so that total nominal debt equals: \(D=D^{s}+D^{j}\). We assume that strict seniority is enforced, so that from both, the investment return \(F(I,\theta)\) as well as the liquidation value \(L\), junior lenders only obtain the residual, i.e. junior lenders’ payoff is \(\max(F(I,\theta)-D^{s},0)\) and \(\max(L-D^{s},0)\) respectively. The junior investor’s participation constraint is therefore:

$$\begin{aligned}\int_{\underline{\theta}}^{\hat{\theta}}\left(\max(F(I,\theta)-D^{s},0)+\frac{D-F(I,\theta)}{\pi_{2}}\,\max(L-D^{s},0)\omega(\theta)\right)\mathrm{d}\theta+(1-\Omega(\hat{\theta}))D^{j}=I-W.\end{aligned}$$
(3)

Note that the critical (default) threshold \(\hat{\theta}\) follows from the condition that the firm is not able to meet total liabilities \(D\) out of their investment return, i.e. \(\hat{\theta}\) is implicitly defined from \(D=F(I,\hat{\theta})\).

In order to analyze the impact of senior debt \(D^{s}\) on the ICFS \(I_{W}\), we again use the same specification as Cleary et al. (2007), with \(F(I,\theta)=\theta\sqrt{I}\) and \(\theta\) being uniform, i.e. \(\theta\sim U[\underline{\theta},\bar{\theta}]\). We provide a formal proof in Appendix A.2 and numerical illustrations in Fig. 2 of our second prediction.

Prediction 2:

A firm with a larger extent of existing senior debt displays a more negative ICFS (\(I_{W}\)).

The proof in Appendix A.2 shows among others that the critical threshold \(\hat{\theta}\) always increases with \(D^{s}\), thereby making investors more sensitive to the investment return. The formal proof relies on the simplified model where we set the liquidation value \(L\) equal to zero,Footnote 8 while the numerical result in Fig. 2 also includes the case of \(L> 0\). In all three graphs, the solid line represents zero senior debt and the bold dashed line represents a senior debt level of \(D^{s}=0.2\). The qualitative prediction remains the same for different levels of \(L\) and is even more pronounced for the (more realistic) case of \(L> 0\). The convexity of the investment curve increases with senior debt when internal funds are low, i.e. the ICFS is more negative as implied by Prediction 2.Footnote 9 Note that in the left graph of Fig. 2, the solid line represents the benchmark case of Cleary et al. (2007). Note further, that from the perspective of the life-cycle hypothesis, we should expect an opposite prediction. Young firms, which tend to display a stronger ICFS, are less likely to be financed with large amounts of debt. In contrast, young growth firms are known to be rather equity-financed.

Fig. 2
figure 2

Senior debt and the U‑shaped investment curve. Notes: The graphs plot the investment curve \(I(W)\) for different levels of existing senior debt, and by using the parameter set of Cleary et al. (2007) with \(\theta\) being uniformly distributed, i.e. \(\theta\sim U[0,4]\). In all three panels we display three levels of senior debt: \(D^{s}=0\) (gray solid line), \(D^{s}=0.1\) (gray dashed line), and \(D^{s}=0.2\) (thick black line). Across panels we vary the liquidation value \(L\), with \(L=0\) (a), \(L=0.5\) (b), and \(L=1\) (c)

We next turn to the empirical testing.

3 Empirical analysis

3.1 Empirical strategy

To test the different predictions of the theoretical model empirically, we have to estimate investment-cash flow sensitivities which we operationalize as follows. We assess capital expenditures by using an error-correction model according to Bond et al. (2003); Mizen and Vermeulen (2005); Mulier et al. (2016) and Buca and Vermeulen (2017). The error-correction model goes back to Bean (1981) and was later taken up by Bond et al. (2003). In principle, the error-correction model links a long-run equilibrium relation of the capital stock with flexible short-run investment dynamics by using a regression model (Bond et al. 2003).Footnote 10

We assume that capital markets are not frictionless in the sense that firms’ investment decisions are not independent of its status of financial constraints. To account for these frictions as well as the nonlinearities in ICFS, we include operating cash flow (\(OCF\)) to the investment equation derived in Appendix B. Following Cleary et al. (2007), we use spline regressions, i.e. we divide our sample into terciles of \(OCF\). The regression equation to test the model predictions empirically is as follows:

$$\frac{ \textit{CAPEX}_{i,t}}{{K}_{i,t-1}}=\alpha_{1}\frac{ \textit{CAPEX}_{i,t-1}}{{K}_{{i,t-2}}}+\alpha_{2}\frac{ \textit{OCFT1}_{i,t}}{{K}_{i,t-1}}+\alpha_{3}\frac{ \textit{OCFT2}_{i,t}}{{K}_{i,t-1}}+\alpha_{4}\frac{ \textit{OCFT3}_{i,t}}{{K}_{i,t-1}}+\alpha_{5}{\mathrm{\Delta}{\ln S}_{i,t}}+\alpha_{6}{\mathrm{\Delta}{\ln S}_{i,t-1}}+\alpha_{7}({\ln K}_{{i,t-2}}-{\ln S}_{{i,t-2}})+\nu_{t}+\nu_{i}+\epsilon_{i,t},$$
(4)

where OCFT1-OCFT3 represent the different terciles of operating cash flow of equal size to measure investment as a piecewise linear and continuous function of operating cash flow. We focus on operating cash flow as it is supposed to be unaffected by investment decisions. At first glance, using a flow variable in the empirical test appears at odds with the interpretation of the theoretical model which uses a stock variable. However, within a one-period model, the distinction into flow and stock variable is rather pointless, and there are important empirical drawbacks of using cash holdings as stock variable as cash holdings suffer from endogeneity concerns. In line with Cleary et al. (2007) and much of the ICFS literature, we use the flow variable \(OCF\) for our empirical strategy.Footnote 11

\({ \textit{CAPEX}_{i,t}}\) is depreciation in year \({t}\) plus the change in tangible fixed assets from year \({t-1}\) to year \({t}\). The replacement value of the capital stock is computed as \(K_{i,t}=K_{i,t-1}\times(1-\delta)+{ \textit{CAPEX}_{i,t}}\). The first value of \({K}\) is quantified by the starting observation of tangible fixed assets. The depreciation rate \(\delta\) is set at a constant rate of 4.5%.Footnote 12\(\mathrm{\Delta}{{\ln S}_{i,t}}\) is the change in log sales. \(({{\ln K}_{{i,t-2}}}-{{\ln S}_{{i,t-2}}})\) denotes the error-correction parameter and assesses the long-run equilibrium between the natural logarithm of capital and its target value proxied by the natural logarithm of total sales.

We estimate the baseline model (Eq. (4)) with different approaches: (i) Simple OLS, (ii) the within estimator to control for time invariant and unobservable firm-specific characteristics and (iii) the two-step first-difference GMM estimator with lagged variables in levels as instruments for the first differences of the explanatory variables to control for autocorrelation and for endogeneity problems as developed by Arellano and Bond (1991). All further regressions are exclusively estimated with the GMM estimator.Footnote 13 Finally, to account for business cycle fluctuations and other life-cycle effects, we include year dummies.

3.2 Data

For our empirical analysis we use the Creditreform database provided by Creditreform AG, which is a comprehensive firm database containing annual balance sheet and income statement data of active companies registered in Germany.Footnote 14 The time frame we observe lasts from 2007 to 2015. To moderate misspecification, firm-year observations with negative values of total assets, total sales, tangible fixed assets and so on were deleted. We eliminate companies from the banking, finance, insurance and real estate sector as well as utilities because of their differences in balance sheets and income statements. We additionally exclude all firms that have switched in their size classification, which is defined below, i.e. from SME to large or vice versa. Finally, we only retain firms in our sample with leastwise four consecutive available observations to have at least two observations per firm. To control for outliers and potential measurement error in the capital ratio we trim dependent and independent variables at the third percentile in both tails.Footnote 15 After data cleansing and filters, we end up with an unbalanced sample containing 75,692 firm year observations on 19,201 firms. Table 1 presents the summary statistics of the variables used in the regressions for the full sample. Fig. 3 represents mean and median of investment to capital and cash flow to capital ratios sorted by percentiles of cash flow to capital. It clearly shows that the investment ratio decreases both in mean and median with increasing cash flows as long as cash flows are low, but increases when cash flows become larger which is in line with a non-monotonic investment curve as advocated by Cleary et al. (2007); Allayannis and Mozumdar (2004); Aǧca and Mozumdar (2008); Hovakimian (2009) and others.

Table 1 Summary statistics
Fig. 3
figure 3

Mean and median capex for percentiles of operating cash flow. Notes: This figure plots means and medians of \(\frac{{{ \textit{CAPEX}_{i,t}}}}{{{K}_{i,t-1}}}\) for percentiles of \(\frac{{{ \textit{OCF}_{i,t}}}}{{{K}_{i,t-1}}}\)

3.3 Results

3.3.1 Baseline results

First, we start our analysis by estimating the baseline model of Eq. (4) for the aggregate sample. Panel A of Table 2 summarizes the results. While Columns 1–3 report standard results of ICFS, Columns 4–6 illustrate the results of the spline regressions when dividing the sample into terciles of cash flow (\(\frac{{{ \textit{OCFT1--3}_{i,t}}}}{{{K}_{i,t-1}}}\)) (see e.g. Cleary et al. 2007). Columns 1–3 illustrate that the overall relation between cash flow and investment appears to be positive independent of which model has been chosen. However, having a closer look at cash flow, i.e. dividing it into terciles, exhibits a U-shaped investment-cash flow sensitivity in our data sample. All three models, OLS (Column 4), the within estimator (Column 5) as well as the GMM estimator (Column 6),Footnote 16 provide empirical evidence for a nonlinear relationship between cash flow and investment. Firms with cash flows being allocated in the first tercile show a significantly negative relation, i.e. investment is increasing with decreasing cash flows. In contrast, for firms in the second tercile the relation becomes significantly positive, thus increasing cash flows lead to increasing investments. The same holds true also for the third tercile, however the smaller coefficient implies that the positive relation decreases for high levels of internal liquidity. Thus, consistent with the model and results by Cleary et al. (2007), firms with low internal funds display a negative ICFS, while the relation is positive for firms with higher cash flows.Footnote 17

Guariglia (2008) states in her work that testing the nonlinearities among a sample of unquoted firms has the advantage to be able to build proxies of capital market frictions that vary in the cross-section. We make use of this advantage and split our sample into small- and medium-sized (SMEs) and large-sized firms.Footnote 18 Size is often used as a proxy for both measuring capital market frictions, i.e. information asymmetries, as well as growth opportunities. By splitting the sample into SMEs and large-sized firms, we want to test whether the U‑shape of investment-cash flow sensitivities behaves similar within different size classes. Panel B of Table 2 shows that the nonlinear relationship between cash flow and investment is strongly significant for the sample of SMEs. However, for large-sized firms we find correct signs but no statistical significance. At first sight, this result may seem still consistent with results by Cleary et al. (2007) as size could be considered as proxy for information asymmetries.Footnote 19 However, also the life-cycle hypothesis cannot be rejected, as SMEs are on average supposed to be at an earlier stage in their life-cycle compared to large-sized firms (see e.g. Hovakimian 2009). Thus, it is not yet clear whether the U‑shape of investment-cash flow sensitivities is driven by the cost-revenue effect or by the life-cycle hypothesis.

Table 2 Baseline results

3.3.2 Testing the model predictions

In this subsection, we empirically test the different predictions of the theoretical model, which we derived in Sect. 2. Prediction 1a states that firms with more risky revenues should display a more pronounced negative ICFS. To operationalize different levels of risk, we build mutually exclusive subsamples based on sales risk.Footnote 20 For sales risk, we generate a dummy variable which is one, if a firm’s standard deviation of total sales to total assets is above the two-digit industryFootnote 21 sample median and zero otherwise, i.e. we estimate the standard deviation for each firm and then the median within the firm’s two-digit industry. Columns 1–2 of Table 3 summarize the results of our estimations. The results show some tentative evidence that the negative relation between investment and cash flow when cash flows are low is only evident among firms with a high level of sales risk. The variable of particular interest is \(\frac{{{ \textit{OCFT1}_{i,t}}}}{{{K}_{i,t-1}}}\), which has a negative sign in both subsamples but turns out significant (at 10%) only in the high-risk subsample. Taking it at face value, the ICFS is more negative among risky firms in line with Prediction 1a of the theoretical model. However, investigating the subset of more risky firms, we also identify them to be younger and smaller in size, while they exhibit lower asset tangibility, i.e. they tend to be at an earlier stage of their corporate life-cycle. In this sense, we cannot reject the revenue effect nor the life-cycle hypothesis at this stage.

Table 3 Testing model predictions – Spline regressions

To further analyze the impact of risk, we test Prediction 1b, which states that firms with more risky revenues should invest dominantly less than firms with less risky revenues. Again, to consider different levels of revenue risk, we build mutually exclusive subsamples based on sales risk as outlined above. Table 4 reports means and medians as well as differences in means of our main dependent variable (capital expenditures) and of variables that proxy for a firm’s current status in its life-cycle within subsamples of high and low levels of sales risk. Results convincingly illustrate that both mean and median investments are significantly higher among firms with higher sales risk, which clearly contradicts Prediction 1b. A potential concern is that this result is caused by different levels of cash flow. However, if we plot investment against operating cash flow ranked according to percentiles of cash flow levels for both high and low levels of sales risk, we find that the corresponding graph for high-risk firms is always above the graph of low risk firms. (see Fig. 4). This finding is a first indication that the predictions of the theoretical model are not in line with empirical evidence, while the results are still consistent with the corporate life-cycle theory. As outlined above, the results in Table 4 highlight, that firms with a higher level of sales risk tend to be at an earlier stage of their corporate life-cycle. Thus, the negative relation can be caused by the fact that investment opportunities are high, while internal funds are low at the same time.

Table 4 Means and medians of investment – High vs. low levels of sales risk
Fig. 4
figure 4

Mean capex for percentiles of operating cash flow – High vs. low levels of sales risk. Notes: This figure plots means of \(\frac{{{ \textit{CAPEX}_{i,t}}}}{{{K}_{i,t-1}}}\) for percentiles of \(\frac{{{ \textit{OCF}_{i,t}}}}{{{K}_{i,t-1}}}\) for firms with high and low levels of sales risk. We sort according to sales risk, i.e. we generate a dummy variable which is 1 (High) if the firm’s overall standard deviation of sales is above the two-digit industry sample median and 0 (Low) otherwise. We refer to Table 8 in Appendix C for details on the respective variables.

In Prediction 2 we allow for senior debt in the theoretical model. We argue that firms with a higher senior debt ratio, should display a more convex investment curve when cash flows are low; in particular they should display a more negative ICFS compared to firms with a lower senior debt ratio. A simple explanation is given by the fact that allowing for senior debt increases the firm’s probability of default, which is why investors will demand higher compensation for their provision of funds. In case of default, junior debt holders have to share revenues from investment with senior debt holders and thus become more sensitive to investment returns when internal funds are low. To test this model prediction empirically, we split the sample with respect to long-term debt, which should come closest to senior debt, as creditors usually demand collateral. We build mutually exclusive subsamples of firms with high and low long-term debt ratios, i.e. we generate a dummy variable being one if the firm’s average long-term debt ratio lies above the sample median and zero otherwise.Footnote 22 Results are reported in Columns 3–4 of Table 3. Coefficients on the main variable of interest (\(\frac{{{ \textit{OCFT1}_{i,t}}}}{{{K}_{i,t-1}}}\)) are both negative, but while being far from significant in the subsample of high long-term debt firms, it is clearly significant (at 5%) for the low long-term debt firms. Therefore, the empirical evidence is opposite to the theoretical predictions and we have to reject Prediction 2. Investigating the firms in the low long-term debt sub-sample reveals that these firms tend to be younger, have higher sales growth and lower asset tangibility. Our findings are reminiscent of results such as e.g. Beck et al. (2006), who argue that obtaining external finance to fund investments is more difficult for young firms. Nevertheless, this limitation disappears when firms pass through the early stage of their life-cycle. Taken together, our empirical evidence again casts serious doubts that the cost-revenue effect is a first-order effect for the negative ICFS, while it provides support for the predictions of the corporate life-cycle hypothesis.

To strengthen our evidence and as robustness check, our final test focuses on direct test of the model’s predictions. Cleary et al. (2007) claim that as the probability of default rises, the return originated by the company’s investment becomes increasingly important to the investor who obtains the return in the event of default. An enhancement of investment, thus, reinforces the firm’s solvency to meet its financial obligations and increases the investor’s payoff in case of default. From this line of reasoning, we expect firms which are closer to default to increase their investments as cash flows are low. To account for default risk, we sort the sample according to Altman’s (1968) Z‑score which is a proxy for a firm’s bankruptcy probability. We use the Altman (2002) Z‑score for private firms which is considered to be more industry independent.Footnote 23 As higher Z‑scores indicate lower insolvency risk (Altman 1968), we create a dummy variable which is one (\(High\)) if the firm’s average Z‑score is higher than the sample median and zero otherwise. Columns 5–6 of Table 3 report results and show a significant relationship only among the high Z‑score firms. In particular, we can only find a significant negative coefficient on the first tercile, \(\frac{{{ \textit{OCFT1}_{i,t}}}}{{{K}_{i,t-1}}}\) in the high Z‑score sample, which is clearly a contradiction to the prediction. In line with our previous findings on long-term debt, the result implies that firms being closer to bankruptcy do not increase investments when internal funds are very low contradicting the theoretical predictions by Cleary et al. (2007). So far, our findings in this section can be interpreted as strong evidence that the revenue effect is not a first-order effect in the real data and thus unlikely to be driving the U‑shaped investment curve.

3.3.3 Testing the corporate life-cycle hypothesis

The previous section has documented contradictions to the predictions of the (extended) theoretical model of Cleary et al. (2007), while at the same time, we indicated that the evidence appears consistent with the corporate life-cycle theory, as argued by Hovakimian (2009). We next turn to a more rigorous test of the latter hypothesis. To account for the different stages of firms’ life-cycles and thus for their growth opportunities, we test whether the U‑shaped ICFS are more pronounced among firms that are at an earlier stage of their life-cycle compared to those that are at a later stage. We assume that the former ones possess high growth opportunities, which demand large investments, while they are at the same time constrained in internal funds. To account for the life-cycle, we estimate Eq. (4) on various mutually exclusive subsamples of firms sorted by different measures. We capture the stage of a firm’s life-cycle by the firm’s (i) size, (ii) age, (iii) size-age index, (iv) growth opportunities, and (v) industry asset specificity. First, size (SME) is defined as above and only reported for the sake of completeness. Second, we sort our sample with respect to Age. We generate a dummy variable which is one if the firm’s year of foundation is larger than the sample median. According to evidence by e.g. Dickinson (2011), both the firm’s size as well as its age are often related to the life-cycle. Third, we sort our sample according to Hadlock and Pierce (2010) and use the size-age index (SA-index) of German firms provided by Klepsch and Szabo (2016).Footnote 24 The dummy variable is one if the firm’s average size-age index is higher than the sample median and zero otherwise, while a higher value accounts for firms being younger and smaller in asset size. Fourth, we sort the sample according to growth opportunities (Growth), which we measure by future sales growth: \(SG_{i,t}=\frac{S_{i,t+1}-S_{i,t-1}}{S_{i,t-1}}\). We establish a dummy variable that is one if the firm’s average future sales growth lies above its two-digit industry sample median and zero otherwise. Fourth, we use industry asset specificity to implement a more exogenous proxy for a firm’s growth opportunities. We follow Acharya et al. (2007) and Garcia-Appendini (2018) and assign a dummy variable to be one if the two-digit industry median of tangible assets lies above the sample median and zero otherwise. The underlying reasoning is that, while this industry measure is more exogenous, growth opportunities tend to be lower in industries with highly specific assets and a high level of tangibility and vice versa.

Table 5 reports the results when we build mutually exclusive subsamples based on the life-cycle hypothesis and estimate Eq. (4). All five specifications confirm the assumptions of the corporate life-cycle theory. In particular, firms that are smaller in size, younger, have higher growth opportunities and belong to industries with low asset specificity exhibit negative investment-cash flow sensitivities when cash flows are low, while the relationship becomes positive when cash flows increase. In contrast, companies that are larger, older and have lower growth opportunities appear to be cash flow insensitive. According to the model of Cleary et al. (2007) the firms’ set of growth opportunities is exogenous and does not vary and thus has no impact on the model. However, Table 5 shows that this cannot be confirmed empirically within our sample of SMEs and large-sized firms.

Table 5 Life-cycle – Spline regressions

To further strengthen the empirical evidence of the corporate life-cycle theory, we also perform a sorting approach according to more than one of the above-mentioned measures, i.e. we sequentially sort according to size and age, according to size, age and the SA-index and finally according to size, age, the SA-index and growth opportunities. Table 6 illustrates distinct results. If we constrain the sample by additional measures to identify a firm’s stage of life-cycle, we can see that the negative coefficient between investment and cash flow for low cash flow levels increases in absolute size, i.e. gets more negative. We again underline the fact that the negative correlation between investment and cash flow is mainly determined by young companies with high growth opportunities and low internal financial resources.Footnote 25

Table 6 Life-cycle II – Spline regressions

We finally sort the sample according to both size and long-term debt to control for the corporate life-cycle hypothesis and the theoretical model by Cleary et al. (2007) within one regression. According to Prediction 2, the negative sensitivity should be more pronounced among firms with a high level of long-term debt independent of their size. In contrast, the corporate life-cycle hypothesis implies, that the negative effect should be particularly evident among SMEs with low levels of long-term debt, as these firms are the ones that exhibit higher growth opportunities, are younger and have lower asset tangibility. Table 7 illustrates the results. Again, we can show that the nonlinearities are evident among SMEs but not among large-sized firms. Further, and even more important, the negative relation between investment and cash flow is statistically significant only among SMEs with low levels of long-term debt and not so among SMEs that have high debt ratios conflicting the derived predictions of the theoretical model by Cleary et al. (2007).Footnote 26

Overall, our findings challenge the view that the revenue effect is a first-order effect for the negative ICFS in the real data, while we find strong support for the implications of the corporate life-cycle hypothesis.

Table 7 Senior debt and size – Spline regressions

4 Conclusion

This paper analyzes the nonlinearities of ICFS with a special focus on the negative part of the investment-cash flow relationship. In particular, we discriminate between two prominent explanations for a negative ICFS in the recent literature – the cost-revenue effect by Cleary et al. (2007) and the life-cycle hypothesis by Hovakimian (2009). The cost-revenue effect is derived from an investment model which assumes endogenous financing costs and relies on the idea that investors (creditors) capture the investment return in default states. The more investors’ returns are sensitive to the investment return, the more pronounced is the negative ICFS. We use this insight to extend the Cleary et al. (2007) model in two directions and derive testable predictions considering the riskiness of revenues and the existence of senior debt. In the theoretical framework, we prove that higher revenue risk makes the negative relation between investment and cash flow steeper when cash flows are low, while the overall investment decreases with risk. For the existence of senior debt, we show theoretically that higher senior debt also increases the negative sensitivity between investment and cash flow, when cash flows are low, i.e. the curve gets steeper.

We take these model predictions to the data. We deliberately choose a rich German dataset covering explicitly the SME segment. Thereby we avoid the drawback of using a standard Compustat sample that only very limited inference can be made with respect to young and small companies – which is crucial for testing the life-cycle hypothesis.

From our empirical results, we are able to verify the steeper relation between investment and cash flow among risky firms. However, the level of investment is dominantly higher among firms with higher revenue risk, thereby contradicting the model. Overall, more risky firms tend to be younger, smaller in size, and have lower asset tangibility in line with the early stage of a firm’s life-cycle.

With respect to the predictions concerning senior debt, we do not find evidence that the U‑shape is more pronounced among firms with a high level of long-term debt. Quite to the contrary, firms with a low level of long-term debt exhibit a more pronounced nonlinear relation between investment and cash flow. Having a closer look at firms with low levels of long-term debt exhibits a negative correlation with age and asset tangibility and a positive one with growth opportunities providing an additional indication that rather the firm’s life-cycle has an impact on the nonlinear relationship between investment and cash flow.

As final piece of evidence, we sort firms according to (a variant of) Altman’s (1968) Z‑score, i.e. bankruptcy risk. This test is directly addressing the implication of Cleary et al. (2007), since the revenue effect implies that the negative ICFS should be more pronounced among firms with high insolvency risk. Again, our results do not support the model’s predictions, as we find that mainly firms with lower bankruptcy risk exhibit strong negative ICFS.

Overall, the contradictions between the theoretical model predictions and the empirical results lead us to the conclusion that the revenue effect – or in other terms, the endogenous financing cost effect – does not appear to be a first-order effect for the emergence of negative ICFS in our data.

In contrast, when checking our evidence against the implications of the corporate life-cycle hypothesis as advanced by Hovakimian (2009), we overall find neat consistency. In the early stage of a firm’s life-cycle, firms tend to have limited cash flows, while investment opportunities are high inducing a negative relation between investment and cash flow. As the firm matures, cash flows increase as well as investments to ensure growth. At a later stage of their life-cycle, cash flows tend to be high, while investment opportunities are low and thus, investments tend to be insensitive to cash flows. Testing these arguments empirically, we find the nonlinear relationship to be more pronounced among firms that are younger, smaller in size, have higher growth opportunities and operate in industries with lower asset specificity.

In sum, our evidence strongly favors life-cycle dynamics as an explanation for negative ICFS, while we do not find support for the idea that they are driven by the endogenous financing cost, i.e. the revenue effect. In this sense, our results are a sign of caution for using the investment model with endogenous financing costs in related settings.