Nothing but good intentions: the search for equity and stock price crash risk

Reichmann, Doron; Möller, Rouven; Hertel, Tobias

doi:10.1007/s11573-022-01085-w

Nothing but good intentions: the search for equity and stock price crash risk

Original Paper
Open access
Published: 30 March 2022

Volume 92, pages 1455–1489, (2022)
Cite this article

Download PDF

You have full access to this open access article

Journal of Business Economics Aims and scope Submit manuscript

Nothing but good intentions: the search for equity and stock price crash risk

Download PDF

3339 Accesses
Explore all metrics

Abstract

In this study, we examine whether and how managers’ intentions to raise equity are associated with future stock price crash risk. Therefore, we apply modern information retrieval techniques to corporate textual disclosures by constructing document embeddings that preserve contextual relationships in managers’ discussions on liquidity and capital resources. Using these document embeddings, we construct a continuous measure of managers’ intentions to raise equity. We document that the search for equity is associated with higher future stock price crash risk. Further analyses suggest that managers with stronger intentions to raise equity are more likely to block negative news flow and that these intentions reinforce the effects of earnings manipulation and textual obfuscation on stock price crash risk. In summary, our results suggest that managers’ search for equity incentivises managerial bad news hoarding.

Bankruptcy prediction and the discriminatory power of annual reports: empirical evidence from financially distressed German companies

Article 22 June 2019

Social Integrity and Stock Price Crash Risk

Article 03 May 2023

Textual tone in corporate financial disclosures: a survey of the literature

Article 27 June 2020

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

Since the seminal work of Jin and Myers (2006) firm-specific stock price crashes are linked to bad news hoarding by firm managers. Due to incentives such as compensation packages, empire building, and reputation concerns (Ball 2009; Graham et al. 2005), managers have the tendency to hide adverse information from outside investors which leads to bad news being stockpiled within the firm. Once the accumulated bad news reaches a tipping point where it can no longer be contained, stock prices are prone to crash. Given the devastating effects of crash risk on investor welfare, literature in the field of accounting and finance has increasingly turned to understanding the determinants of bad news hoarding behaviour. In this study we examine managers’ intentions to offer equity (hereafter: equity intent) as a potential incentive to hoard bad news. In the case of equity offerings, managers are known to time the market with regard to the current market valuation (Graham and Harvey 2001). More specifically, firm managers seeking equity are incentivised to provide more optimistic information prior to the equity offering to raise more capital, even if this contrasts the firm’s true performance (Lang and Lundholm 2010). Given this evidence, we believe that managers who try to persuade investors to invest in their company are more likely to hoard bad news. However, whether managers’ equity intent serves as a driver for bad news hoarding behaviour as reflected in future stock price crash risk remains an open empirical question. To address this issue, we apply modern techniques of information retrieval to firm disclosures to identify cues revealing managers’ equity intent and quantify its effects on future stock price crash risk.

A prominent example of managerial bad news hoarding with severe effects on investor welfare is the case of Marrone Bio Innovations. Former COO Hector M. Absi Jr. allegedly profited from hiding various sales concessions from independent auditors and outside investors.^{Footnote 1} When the SEC uncovered the alleged fraud in August 2014, the company’s share price plunged by more than 40%. Given the severe effects of opportunistic management behaviour, section 954 of the 2010 Dodd-Frank Act mandates that publicly listed companies adopt clawback provisions that require managers to return part of all benefits they received as the result of misstated accounting numbers. However, Bao et al. (2018) provide empirical evidence that clawback adoptions do not help mitigating crash risk as managers are induced to hide bad news through different channels than manipulated accounting numbers. Since stock price crash risk can neither be mitigated by portfolio diversification (Sunder 2010), literature in the field of accounting and finance has focused on identifying determinants of managerial bad news hoarding to help investors detecting crash prone companies via screening.^{Footnote 2} Interestingly, we find that, prior to the stock price crash of Marrone Bio Innovations in 2014, the company openly stated in its 10-K filing for the fiscal year 2013 that it was searching for additional external funding (e.g. equity intent) in order to expand its operational business. Considering our opening arguments, the intention to raise equity could have served as an incentive for the management to provide an overly optimistic outlook and hide bad news from investors. In this case, screening companies for managers’ equity intent could have helped investors to better assess firm-specific crash risk and preserve shareholder welfare.

Although firm fundamentals (e.g., leverage ratio, market-book-ratio) help to explain firm financing choices on average, balance sheet items cannot reveal corporate financing intentions. To grasp managers’ equity intent, we build on a concept proposed by Hoberg and Maksimovic (2014) and search for commonly used phrases in the Liquidity and Capital Resources (LIQ + CAP) section, a mandated disclosure of the MD&A section in 10-K files, that managers use to express their equity intent. However, managers’ equity intent is likely to come in shades. The authors suggest that, even if managers do not actively offer equity eventually, the mere intention to do so should be reflected in their choice of words when discussing the capital resources of their firm. Consequently, we follow Hoberg and Maksimovic (2014) to compile a continuous measure of equity intent that allows us to consider early signals that may mark the start of managers’ tendencies to hoard bad news. Specifically, we estimate cosine similarities between vector representations of LIQ + CAP sections in which managers explicitly state that they are issuing equity and all other LIQ + CAP sections in our sample. Instead of using a standard Bag-of-Words (BOW) model that is frequently used in accounting and finance research to estimate document similarities, we construct neural network embeddings of each LIQ + CAP section using the Doc2Vec model developed by Le and Mikolov (2014) that is proven to outperform standard BOW models as it accounts for semantic relationships.

After the identification of equity intent, we quantify its effect on firm specific future stock price crash risk. We follow the extant literature and analyse the effect of equity intent in conjunction with earnings manipulation—another popular determinant of stock price crash risk related to bad news hoarding (Hutton et al. 2009; Kim et al. 2019). Our results suggest that firms with stronger equity intent are more likely to suffer from stock price crashes in the future. These findings hold for different popular stock price crash risk measures and remain robust after controlling for firms’ earnings manipulation, which is a popular vehicle to hide adverse information as well as an extensive set of control variables. Moreover, the effects have a comparable magnitude like the effect earnings manipulation and textual obfuscation exert on stock price crash risk. To strengthen the case that equity intent is in fact a motive for managers to hoard bad news, we conduct a causal mediation analysis where we use the fraction of negative words in each firm’s MD&A section as a mediator variable. In line with the theory of bad news hoarding, our results show that stronger equity intent incentivises managers to withhold negative information by using less negative language in their MD&A, supporting the view that bad news hoarding serves as an important economic mechanism predicting the likelihood of future price crashes following a sudden release of accumulated negative information.

In additional analyses we examine how managers’ equity intent relates to the usage of earnings manipulation and textual obfuscation. The results show that equity intent reinforces the effects of these two popular vehicles of bad news hoarding. While earnings manipulation and textual obfuscation offer only marginal predictive power for future stock price crash risk for firms with low equity intent, they are strongly significant determinants for future stock price crash risks for high equity intent firms. Furthermore, we conduct a series of analyses to assess the robustness of our main results. Our robustness tests show that equity intent is an early warning indicator for stock price crash risk up to a prediction window of three years while predictive power decreases for wider prediction windows. In the next analysis we evaluate whether equity intent also predicts upward stock price jumps. If equity intent is a motive for bad news hoarding, we should not be able to predict upward stock price jumps in a similar way as stock price crashes. In fact, the results are generally consistent with this notion. We further check whether our findings are driven by model specifications of the applied neural network to estimate the LIQ + CAP embeddings. By exploring an alternative approach to estimate LIQ + CAP embeddings, we find that our main results remain robust for all stock price crash risk measures. Finally, we test the sensitivity of our results by controlling for several additional business risk and textual proxies that are associated with stock price crash risk. Again, our results remain virtually unchanged.

This paper contributes to the literature in the following way: In line with the SEC’s plan to use textual analyses to detect managerial opportunistic behaviour (Eaglesham 2013), we contribute to the literature that uses textual information to complement the prediction of stock price crashes. Consistent with El‐Haj et al. (2019), who state that “mainstream accounting and finance research appears to be behind the curve in terms of computational linguistic sophistication” (p. 266), prior works in the field of stock price crash risk mainly rely on basic content analysis methods, providing evidence that more complex and ambiguous company disclosures obfuscates financial information and thus, facilitates bad news hoarding (Ertugrul et al. 2017; Kim et al. 2019). Instead of identifying a new vehicle that managers use to hide bad news from outside investors, we draw on the bad news hoarding theory of Jin and Myers (2006) which requires the existence of certain incentives that drive managers decision to hoard bad news in the first place. Hence, by raising the question of whether company disclosures contain information about managerial incentives to hide bad news, we add a new perspective on how to use company filings to identify determinants of future crash risk. Furthermore, we argue that, as compared to prior works relying on broad textual characteristics of company files, our approach is superior in detecting managers’ bad news hoarding as it identifies a theoretically grounded incentive to use asymmetric information in efforts to time markets within managers’ capital structure decisions.

The remainder of this paper is structured as follows: Sect. 2 reviews literature in the field of stock price crash risk and provides our research hypothesis. Section 3 describes our data, the crash risk measures, the neural network embeddings we use to identify equity intent and the models to predict future crash risk. In Sect. 4 we present our prediction results before Sect. 5 provides additional analyses and Sect. 6 concludes the paper.

2 Literature review and hypothesis

Since Jensen and Meckling (1976) first introduced agency costs, a large stream of literature analyses how linking the performance measurement of managers to certain budgetary or financial targets exacerbates existing conflicts of interests between them and outside investors that incentivise managers to engage in earnings management (Franco-Santos et al. 2012; Jensen 2005). Likewise, since the current market valuation of a firm can influence its managers' strategic capital structure decisions, in theory, managers are incentivised to withhold information that could negatively affect the firms’ market valuation (Alti 2003; Baker and Wurgler 2002; DeAngelo et al. 2010; Graham and Harvey 2001). Unsurprisingly managers that engage in earnings management in this context, also try to maximize their seasoned equity offerings (SEO) proceeds by increasing SEO pricing (Kim and Park 2005) but at the same time experience a declining operating performance as well as a distinctly post-SEO underperformance as the real consequences of their earnings management decisions reach the market (Cohen and Zarowin 2010; DuCharme et al. 2004; Walker and Yost 2008). However, thus far the literature has neglected that managerial opportunistic behaviour may also have much larger implications for investors welfare due to its impact on stock price crash risk. Hence, we contribute to this literature by analysing how a firms equity intent serves as a motive for opportunistic behaviour in the form of bad news hoarding, which leads to stock price crashes.

In this regard Jin and Myers (2006) set up a model in which the degree of opaqueness affects the division of risk bearing between firm insiders (managers) and outside investors. Managers’ ambitions to capture parts of the firm’s cash flow is limited by outside investors’ perception of the firm’s cash flow and value. If cash flows exceed investors’ expectations, managers are able to capture a larger fraction of it. Conversely, if managers underperform relative to investors’ expectations, they suffer reduced compensation and might even run the danger to lose their job. Thus, managers are incentivised to overstate financial performance and increase opacity by strategically withholding negative information from outside investors. In this sense, managers’ disclosure preferences are not aligned with those of outside investors (Kothari et al. 2009). However, there is a natural limit to the accumulation of bad news, because the probability that outsider investors grasp the firm’s true value continuously increases the more negative news are accumulated. When further obfuscations become too costly or difficult to maintain, all the negative information are released at once, resulting in the firm’s stock price to crash. Given the subsequent destruction of shareholder welfare, managers’ incentives to engage in bad news hoarding and the underlying mechanisms of this practice have received increasing attention from financial research. Previous literature has already established the positive relation between information asymmetries and stock price crash risk. For instance, incentive-based management compensation, CEO overconfidence (Kim et al. 2016), and overall career concerns (Kothari et al. 2009) provide motives for managers to uphold and expand information asymmetries against outside investors, thus, increasing the risk of a stock price crash (Kim et al. 2011).

In search for other motives for managers to obfuscate corporate disclosure, we turn our focus on corporate financing decisions, more precisely managers’ equity intent. Besides the obvious importance for the continuation and success of a firm’s business activities, financial management decisions are subject to agency conflicts between managers and outside investors, especially in the case of equity financing. While we do not plan to extend the comprehensive literature on choosing the optimal financial strategy, we are interested in the possibilities of opportunistic management behaviour (and its financial consequences) that are associated with managers’ equity intent.

In general, managers should be interested in a successful SEO for at least two reasons. Firstly, their future courses of action inside the firm benefits from higher financial resources. Secondly, they may fear the negative resonance following a disappointing SEO from a career perspective. Conversely, a positive stock performance and a successful equity issuance should enhance their job security and increase their value on the labour market (Chi and Gupta 2009). Consistently, the influential survey of corporate financing decisions by Graham and Harvey (2001) showed that managers proactively chose their equity issuances with regard to the current market valuation of the firm. Since firm outsiders are unable to assess the true value of a firm (given semi-informationally efficient markets), managers are able to exploit these information asymmetries to maximize the corresponding capital inflow. Lang and Lundholm (2010) observe changes in corporate disclosure policies around the period of an SEO. They show that firms seeking for equity tend to provide more detailed and more optimistic disclosure prior to the equity offering, leading to an increase of their stock price prior to the offering announcement. While the increase of the stock price may also be the result of improved economic conditions, Lang and Lundholm (2010) provide evidence, that this effect also holds true for firms without an economic improvement which “hype up” their stock. Interestingly, firms dialling up their disclosure suffer from comparably higher price declines after the actual announcement of the equity offering, indicating a market correction for “hyped up” stocks. Bergmann and Roychowdhury (2008) show that managers strategically adapt the level of disclosure (long-horizon earnings forecasts) to mould investor expectations and foster the firm’s valuation. In the case of high investor sentiment, managers are expected to decrease the level of voluntary disclosure to manage future expectations and maintain optimistic firm valuations. Thus, managers are incentivised to uphold high investor sentiment by concealing negative information in the interest of optimizing their SEO. Due to the limits of bad news accumulation, an opportunistic manager should offer equity when the (positive) difference between outside investors’ assessment and firm’s true value is the highest. In this sense, the search for equity could be interpreted as signal that managers do not expect the market’s perception of the firm’s value to further increase in the near future (Krasker 1986). Following the notion of Bergmann and Roychowdhury (2008), the search for equity would mark the beginning of a decrease in a firm’s disclosure level and the starting point of negative news concealment. Given this evidence, managers who intend to issue equity in the foreseeable future might be more likely to withhold negative news from outside investors to secure a successful SEO and maximize their capital inflow. We thus posit the following hypothesis:

Hypothesis

Managers’ equity intent incentivises bad news hoarding behaviour as reflected by increasing future stock price crash risk.

The analysis above suggests that managers’ equity intent may help investors to better grasp potential risks arising from bad news hoarding behaviour. However, managers’ intentions cannot be extracted from simple balance sheet data and thus, require a rigorous identification strategy. Therefore, we construct neural network embeddings from textual disclosures to estimate managers’ equity intent based on linguistic characteristics. Moreover, since firms seeking equity might generally be associated with a riskier business profile, we employ a battery of variables capturing fundamental risk to alleviate the concern that our results are simply driven by business risk.

3 Sample selection, data, and research design

3.1 Sample selection

Table 1 documents the sample selection process. Following Schmidt et al. (2019) in screening data from Refinitiv Datastream and Worldscope, our sample starts with available US firm-years from 1994 to 2019. After removing financial firms (SIC codes 6000–6999) and utilities (SIC codes 4900–4999), we collect weekly returns for each firm-year. To avoid bias arising from thin trading, we remove firms with a fiscal year-end price lower than $1.^{Footnote 3} In addition, we remove observations with less than 26 weeks of stock return data and observations with nonpositive total assets at the beginning of a fiscal year.^{Footnote 4} In order to estimate managers’ equity intent, we next collect all available 10-K files from the SEC’s EDGAR system for our sample of firms and extract the MD&A section using the parsing procedure proposed by Reichmann and Reichmann (2022).^{Footnote 5} Having extracted the MD&A section, we compile Regular Expressions to extract the LIQ + CAP section. Since the following methodology relies on a clean identification of the LIQ + CAP section, we describe the parsing procedure in greater detail in Appendix 1. After dropping observations that have no machine-readable LIQ + CAP section and insufficient financial data to construct control variables, 28,382 observations from 2825 firms remain in the final sample.

Table 1 Sample selection

Full size table

3.2 Measuring managers’ equity intent

While common balance sheet data does not provide any cues on managers’ equity intent, Hoberg and Maksimovic (2014) show that managers address financing needs within the LIQ + CAP section, a mandated disclosure of the MD&A section in 10-K files. Thus, the authors compile a set of phrases managers commonly use in the LIQ + CAP section to express their equity intent. These include the following:

issuing equity securities OR expects equity securities OR through equity financing OR sources equity financing OR seek equity investments OR seek equity financings OR access equity markets OR raised equity arrangements OR undertake equity offerings OR sell common stock OR issuing common stock OR selling common stock OR use equity offerings OR offering equity securities OR planned equity offering OR seek equity offering OR raise equity offering OR equity offering would add OR additional equity offering OR considering equity offering OR seek equity financing OR pursue equity offering OR consummates equity offering OR raises equity capital OR raise equity offering OR sources equity offering

Using the above list of words and phrases, we identify a sample of firms that explicitly express equity intent.^{Footnote 6} However, manager’s equity intent is likely to come in shades. While some managers will explicitly state to seek equity, others may still consider it even though they to do not explicitly state to do so. Based on the idea that managers with equity intent are likely to discuss similar content (i.e. use similar words) within the LIQ + CAP section, Hoberg and Maksimovic (2014) suggest to construct continuous measures of managers’ equity intent. Specifically, the authors estimate cosine similarities to score how proximate the overall LIQ + CAP vocabulary is to that of firms that explicitly express their equity intent. Therefore, the authors employ a BOW approach to convert each LIQ + CAP section to a vector representation of word counts. Formally, if the full sample of LIQ + CAP sections is made of $|V|$ unique words (the total vocabulary), the BOW approach represents a given LIQ + CAP section $j$ as a vector with a length of $|V|$, where each entry is assigned to a word in the vocabulary and its corresponding word count, $coun{t}_{i,j}$. However, research in the field of computational linguistic points out two major drawbacks of this method (Le and Mikolov 2014): firstly, the BOW approach does not account for word order. As long as the same words are used, different documents can have exactly the same vector representations. Secondly, BOW vectors do not account for the semantics of words. For instance, when using the BOW approach, the words “profit”, “revenue”, and “water” are considered equally distant, even though “profit” and “revenue” are semantically much closer than “profit” and “water”. To address these issues, Le and Mikolov (2014) developed Doc2Vec, an unsupervised machine learning algorithm that learns dense vector representations from a sample of documents.

The authors build on the famous Word2Vec model developed by Mikolov et al. (2013a, b) which represents a neural network that attempts to estimate so-called word embeddings of each word found in a sample of documents. Word embeddings can be understood as learned vector representations of words that capture their semantic information, or put differently, their meaning. Therefore, Word2Vec relies on an old concept in linguistics suggesting that words that have similar meaning are more likely to co-occur with similar word-neighbours (Harris 1954). Hence, the neural network “reads” through a sample of documents and thereby learns to predict the neighbouring words of each word it encounters. To improve its predictions, the model continually updates the weights of hidden layers that are used to predict the neighbouring words. When learning is completed after a certain number of iterations, these layer weights are used as effective vector representations of words (i.e. word embeddings). The word embeddings all have a fixed, pre-defined dimension $d$ that captures the semantic information learned from the co-occurrence relationship of each word and its neighbours. As a result, words that co-occur with similar neighbouring words are assigned with more similar word embeddings, that, when plotted into vector space, occupy closer locations to one another.

While Word2Vec was developed to estimate vector representations of words, Doc2Vec has become a widely used technique to create vector representations of whole documents such as our LIQ + CAP sections. Le and Mikolov (2014) propose two different architectures to estimate vector representations of documents. The Distributed Memory (DM) model is based on Word2Vec but also adds a $d\times N$ document matrix to the modelling process, where $d$ denotes a given number of dimensions and $N$ denotes the total number of documents in the sample. Therefore, besides training word embeddings, the model also trains a $d$-dimensional feature vector that is unique to each of the $N$ documents in the sample. In comparison, the Distributed Bag Of Words (DBOW) model is less complex as it ignores word order and only uses the document vector to predict a randomly sampled set of words from the respective document. In both cases, the models run against a loss function and continuously update the document matrix to achieve better predictions. After finalizing a given number of iterations, the document matrix contains $d$-dimensional document vectors for each of the $N$ documents. In their influential paper, Le and Mikolov (2014) test how these so-called document embeddings compare to BOW vectors by estimating cosine similarities between the content of web pages and respective search queries. Their findings show that, compared to a standard BOW model, the usage of Doc2Vec gives a 53% relative improvement in terms of error rate when performing information retrieval tasks.^{Footnote 7} Consequently, we construct document embeddings in the spirit of Le and Mikolov (2014).

We begin by employing extensive text-preprocessing steps to reduce textual noise, improve the generalizability of our Doc2Vec model and form phrases that are specific to the LIQ + CAP sections. Specifically, we closely follow the recommendations of Reichmann and Reichmann (2022) by (i) splitting the document into sentences for further processing, (ii) replacing named entities such as firm names, dates, or money values to predefined tags, (iii) employing lemmatization to reduce feature dimensionality,^{Footnote 8} (iv) forming phrases in the spirit of Mikolov et al. (2013b) and (v) removing stopwords that are defined as words that do not add meaning to a document.^{Footnote 9} In the following, we illustrate the impact on the text preprocessing steps on an exemplary sentence taken from our sample of LIQ + CAP sections^{Footnote 10}:

“Investments in property, plant and equipment were $128 million in 2009 and $154 million in 2008”.

After employing preprocessing steps (ii) to (v), the sentence reads as follows:

“investment property_plant_and_equipment -money- -date- -money- -date-”.

(ii) Named entities such as the money value “$128 million” and dates like “2009” are replaced with the tags “-money-” and “-date-”^{Footnote 11}; (iii) lemmatization reduces the word “Investments” (plural) to its base-form “investment” (singular); (iv) the phrase “property, plant and equipment” is found to significantly co-occur within our sample of LIQ + CAP sections, so that it is concatenated to a single token using underscores (“property_plant_and_equipment”);^{Footnote 12} and (v) stopwords outside of phrases like “and”, “in”, and “were” are removed.^{Footnote 13} As the result shows, the sentence has become shorter and is less likely to reflect linguistic cues that are specific to a certain point in time which helps to reduce model dimensionality and improves the generalizability of the document embeddings.

Next, we use the Doc2Vec module of the genism library in Python to estimate vector representations of each LIQ + CAP section. While the DBOW and the DM model only differ slightly, the simpler DBOW model ignores word order which could reduce its ability to capture semantic information. Therefore, for our baseline analysis, we employ the DM model and later apply the DBOW model to test the robustness of our findings. For the implementation of the Doc2Vec model, we first have to define the dimensionality $d$ of the document embeddings which the model learns during the training process. While larger vector representations may yield better results, they also require more costly computations. However, based on the findings of Pennington et al. (2014) who show that embedding dimensions greater than 300 offer little improvement in quality, we follow the vast majority of the literature that employs textual embedding models and set number of embedding dimensions equal to 300 (e.g. Lau and Baldwin 2016; Li et al. 2021; Matin et al. 2019). For the remaining parameters, we closely follow the recommendations of Lau and Baldwin (2016) who provide optimal Doc2Vec hyper-parameter settings for semantic textual similarity tasks.^{Footnote 14} After 1000 iterations of training over the whole sample of LIQ + CAP sections, we assign each section a 300-dimensional document vector. We then construct an equity focus vector by averaging the vectors of all LIQ + CAP sections in which managers explicitly express their equity intent. Finally, we construct ${EQUITY\_INTENT}_{it}$ as cosine similarities from each LIQ + CAP sections to the equity focus vector. While ${EQUITY\_INTENT}_{it}$ naturally ranges between 0 and 1, higher values indicate a stronger equity intent.

In order to better understand how our main variable of interest is measures, Fig. 1 graphically displays a random sample of 500 LIQ + CAP embeddings and their distant to the equity intent vector by employing t-Distributed Stochastic Neighbour Embeddings (t-SNE) techniques to visualize the 300-dimensional vectors in a 3D scatter plot.^{Footnote 15} Each dot represents the document embedding of a randomly drawn LIQ + CAP section. The redder the dots, the higher ${EQUITY\_INTENT}_{it}$ (i.e. the closer they are to the equity intent vector). Consequently, a red dot that occupies the exact same space as the equity intent vector would have a value of ${EQUITY\_INTENT}_{it}=1$. Put simply, we would expect that redder firms have a higher propensity to experience a one-year-ahead stock price crash, as they exhibit a stronger intent to issue equity.

In order to validate our measure of equity intent, we examine the correlations between a firm’s equity intent and various firm characteristics such as firm size ($LOGM{V}_{it}$), the market-to-book ratio ($MT{B}_{it}$), leverage ($LE{V}_{it})$, operating performance ($RO{A}_{it})$ as well as risk measures including firm-specific return volatility $(SIGM{A}_{it}$) and cash flow volatility ($CFVO{L}_{it}$). In addition, we include proxies for firms’ inherent information asymmetry using an intangibility measure ($ADJROT{A}_{it})$ and R&D expense ($PROP\_COS{T}_{it}$). Finally, we also test correlations with firm age ($AG{E}_{it}$). The results are reported in the Table 2A. We find that smaller, younger, badly performing, and high-growth firms have a higher equity intent. Furthermore, firms that are less capable of using debt financing, firms with higher cash flow volatility, and firms with higher information asymmetry are more likely to search for equity. In summary, we conclude that our measure of equity intent correlates with common firm characteristics that are likely to constrain firms’ financing options to equity financing.

Table 2 Descriptive statistics

Full size table

3.3 Measuring crash risk

To measure firm-specific stock price crash risk, we calculate firm-specific weekly returns ($W$) using the following market-industry index model estimated for each firm and fiscal year:

$$r_{i\tau } = \alpha_{i} + \beta_{1i} r_{m\tau - 1} + \beta_{2i} r_{j\tau - 1} + \beta_{3i} r_{m\tau } + \beta_{4i} r_{j\tau } + \beta_{5i} r_{m\tau + 1} + \beta_{6i} r_{j\tau + 1} + \varepsilon_{i\tau }$$

where ${r}_{i}$, ${r}_{j}$, and ${r}_{m}$ are the returns in week $\tau$ for stock $i$, the value-weighted Fama–French index for industry $j$, and the CRSP value-weighted market index m, respectively. Since 10-K files are usually filed within a three month period after a firm’s official fiscal year-end, we define a fiscal year as the 12-month period ending three month after a firm’s official fiscal year-end to avoid look-ahead bias (Kim et al. 2019).^{Footnote 16} To account for non-synchronous trading in our estimation (Dimson 1979), we augment the index model with lead and lag terms for market and industry returns.^{Footnote 17} The firm-specific weekly return for firm $i$ and week $\tau$, ${W}_{i\tau }$, is the natural logarithm of one plus the residual of the equation above. For the selection of crash risk measures, we follow the existing literature and calculate the following four measures (Al Mamun et al. 2021; Chen et al. 2001; Hong et al. 2017; Jin and Myers 2006; Kim et al. 2011, 2019; Kim and Zhang 2016). First, the negative skewness of weekly stock returns:

$$NCSKEW_{it} = - \left[ {n\left( {n - 1} \right)^{\frac{3}{2}} \mathop \sum \limits_{\tau = 1}^{n} W_{i\tau }^{3} } \right]/\left[ {\left( {n - 1} \right)\left( {n - 2} \right)\left( {\mathop \sum \limits_{\tau = 1}^{n} W_{i\tau }^{2} } \right)^{\frac{3}{2}} } \right]$$

$NCSKEW_{it}$ is calculated by dividing the negative of the third moment of firm-specific weekly returns by the standard deviation of firm-specific weekly returns raised to the third power over all n weeks of the fiscal year. Through scaling the raw third moment by the standard deviation cubed we are able to compare firms with different stock price variances (Chen et al. 2001). By putting a minus sign in front of the nominator, high values of $NCSKEW_{it}$ correspond with a more left-skewed distribution and an increased stock price crash risk for firm i in year t. Hence, we adopt the common convention that more left-skewed return distributions are associated with frequent small gains but bear the risk of extreme negative outliers (i.e. stock price crashes).

As measures based on third moments are potentially overly affected by extreme observations, we apply the “down-to-up” volatility $DUVOL_{it}$ as our second crash risk variable. $DUVOL_{it}$ equals the natural logarithm of the ratio of the standard deviation of firm-specific weekly returns in down weeks to the standard deviation of firm-specific weekly returns in up weeks:

$$DUVOL_{it} = log\left[ {\frac{{\left( {n_{u} - 1} \right)\mathop \sum \nolimits_{DOWN} W_{i\tau }^{2} }}{{\left( {n_{d} - 1} \right)\mathop \sum \nolimits_{UP} W_{i\tau }^{2} }}} \right]$$

where $n_{u}$ and $n_{d}$ represent the number of up and down weeks defined as weeks in which the weekly return $W_{i\tau }$ exceeds or falls below the average return of a fiscal year, respectively. Based on the pattern of incremental gains and larger losses, “crash prone” firms should be characterised by higher standard deviation of firm-specific weekly returns in down weeks compared to up weeks. Thus, high values of $DUVOL_{it}$ should correspond with an increased stock price crash risk for firm i in year t.

Thirdly, we use $COUNT_{it}$ defined as the difference in frequencies between negative stock price crashes and positive upward stock price jumps in firm-specific returns. Stock price crashes (upward jumps) are defined as a firm-specific weekly return that falls (rises) 3.09 standard deviations below (above) the annual mean.^{Footnote 18} Similar to prior works, we calculate $COUNT_{it}$ as the difference between stock price crashes and upward jumps (Jin and Myers 2006). Based on the same argumentation as our previous measures, skewed-return distributions, or put differently, “crash prone” firms should be characterised by more stock price crashes than upward jumps. In conclusion, higher values of $COUNT_{it}$ indicate increased stock price crash risk for firm i in year t. Finally, we include the indicator variable $CRASH_{it}$ that equals one if a firm experiences a stock price crash within a fiscal year and zero otherwise. Therefore, $CRASH_{it}$ captures the realization of a firm-specific stock price crash.

To evaluate our hypothesis that managers’ equity intent increases future stock price crash risk through bad news hoarding behaviour, we regress we our crash risk measures $NCSKEW_{it + 1} , DUVOL_{it + 1} , COUNT_{it + 1}$, and $CRASH_{it + 1}$ on our text-based equity intent variable $EQUITY\_INTENT_{it}$ as well as a set of control variables:

$$CRASH RISK_{it + 1} = \beta_{0} + \beta_{1} EQUITY\_INTENT_{it} + \sum \beta_{k} CONTROLS_{it}^{k} + \varepsilon_{it}$$

The set of control variables includes the non-linear relationship between financial opacity, measured as the three years moving sum of discretionary accruals $(OPAQUE_{it} )$ and its squared term, since Hutton et al. (2009) document a concave relation between financial opacity and crash risk. Firm size, measured as the natural logarithm of a firm’s market capitalization ($LOGMV_{it}$), controls for various aspects of a firm’s operational and business environment. Prior works show that growth firms, captured by the market-to-book ratio ($MT{B}_{it}$), are more crash-prone (Hutton et al. 2009; Kim et al. 2019), whereas firms that are less likely to experience stock price crashes should be more capable of obtaining debt ($LE{V}_{it}$). Since Graham et al. (2005) suggests that poorly performing firms are more likely to withhold bad news, we control for operating performance ($RO{A}_{it}$) and the mean of firm-specific weekly returns during a fiscal year ($RE{T}_{it}$). Chen et al. (2001) find that investor heterogeneity, measured as the detrended level of turnover ($DTUR{N}_{it}$), is positively associated with crash risk. Finally, firm-specific skewness ($NCSKE{W}_{it}$) and volatility ($SIGM{A}_{it}$) is documented to correlate with crash risk (Kim et al. 2019; Wu and Lai 2020). All of our regressions also include year and industry fixed effects (based 2-digit SIC industry classification) to account for year- and industry-wide variation in crash risk patterns. The reported standard errors are robust to heteroscedasticity (MacKinnon and White 1985).

4 Results

4.1 Univariate statistics

Panel A of Table 2 contains descriptive statistics for our main variables ${NCSKEW}_{it+1}$, ${DUVOL}_{it+1}$, ${COUNT}_{it+1}$, $CRAS{H}_{it+1}$, and ${EQUITY\_INTENT}_{it}$, as well as our additional control variables. The average cosine similarity between an individual firms’ equity intent and the equity intent vector equals 0.281. The mean values (medians) of the variables ${NCSKEW}_{it+1}$ and ${DUVOL}_{it+1}$ of 0.128 (0.030) and 0.022 (− 0.035), respectively, indicate that the distribution of weekly returns of the firms in our sample exhibits a pronounced negative skewness. Consistently, we find that, on average, 23.4% of the observations experience a stock price crash as indicated by the mean value of $CRAS{H}_{it+1}$ which is generally consistent with the descriptive statistics reported by Kim et al. (2019). Interestingly for ${COUNT}_{it+1}$ we find a marginally negative mean of − 0.001 which indicates that we observe slightly more stock price jumps than crashes. But the direct comparison with the other two measures, demonstrates that if a crash happens it tends to be larger than a corresponding jump and thus detrimental for investor welfare.^{Footnote 19}

As a first step of our analysis, we use univariate comparisons between equity intent and our crash risk measures. Therefore, we sort our sample into tercile groups by the equity intent variable ${EQUITY\_INTENT}_{it}$ and present the mean values of the one-year-ahead crash risk measures for each group in Panel B of Table 2. The results demonstrate that firms with low equity intent are more likely to experience upward stock price jumps rather than crashes as indicated by the negative means for two of our dependent variables. More importantly, we find that the stock price crash risk increases monotonically from the low-equity intent group to the high-equity intent group when we measure crash risk by ${NCSKEW}_{it+1}$ and ${DUVOL}_{it+1}$. Furthermore, the differences between the high- and low-equity intent group are statistically significant at the 1%-level. For ${COUNT}_{it+1}$, we find that crash risk decreases from low- to medium-equity intent only to peak for the high-equity intent group. The differences in crash risk between the high- and low-equity intent groups are statistically significant at the 5%-level. Similarly, turning to the $CRAS{H}_{it+1}$ measure, we find that 22.6% of the observations in the low-equity intent group experience at least one stock price crash during a fiscal year, whereas for the high-equity intent group, 25.2% of the observations experience at least one stock price crash risk during a fiscal year. The difference in means between the low- and high-equity intent group is statistically significant on the 1%-level. However, to get a deeper understanding how equity intent affects the level of crash risk, we need to consider other factors influencing crash risk within a multivariate analysis.

4.2 Main results

Table 3 shows the results of our regressions to predict one-year-ahead crash risk. The regression models differ only with respect to the dependent variables, so that column 1 shows the results for ${NCSKEW}_{it+1}$, column 2 for ${DUVOL}_{it+1}$ and column 3 for the results for ${COUNT}_{it+1}.$ We find that the coefficient of ${EQUITY\_INTENT}_{it}$ is positive and statistically significant on the 1%-level for all four crash risk measures, ${NCSKEW}_{it+1}, {DUVOL}_{it+1}$, ${COUNT}_{it+1}$, and $CRAS{H}_{it+1}$. These results are in line with our hypotheses that a stronger equity intent helps to predict future stock price crash risk as it may represent a motive for bad news hoarding. Consistent with the extant literature on the relationship between earnings management and crash risk (Hutton et al. 2009) we find the coefficients for ${OPAQUE}_{it}$ (${{OPAQUE}_{it}}^{2})$ to be positive (negative) and statistically significant at 1%-level or 5%-level in all model specifications. Even more importantly these results demonstrate that a firm’s equity intent predicts crash risk in $t+1$ after controlling for the level of earnings management and insofar is a potent predictor for bad news hoarding that transcends the effects of earnings manipulation. In line with expectations, the R-squared of the regression models for ${COUNT}_{it+1}$ and $CRAS{H}_{it+1}$ indicate a weaker model fit due to the relatively small variations in these variables over the whole sample period since both only contain information on tail events of the return distribution. Moreover, this might also contribute to less statistical significance for some of the control variables because of relatively higher standard errors. However, the overall model fit of our models is consistent with results reported in prior literature on stock price crash risk which is reflective of the complex nature of predictions models for rare events such as stock price crashes. We also estimate the economic significance of ${EQUITY\_INTENT}_{it}$ on the realization on a one-year-ahead crash event as captured by ${CRASH}_{it+1}$. Therefore, we use a margins model to estimate partial derivatives of the regression equation with respect to each variable in the model for each unit in the data. We find that a one standard deviation increase in ${EQUITY\_INTENT}_{it}$ is associated with an 0.80% increased probability of a one-year-ahead crash.^{Footnote 20} This effect is comparable to other determinants of crash risk identified by prior research. For instance, Kim et al. (2019) show that a one standard deviation in their modified readability measure is associated with a 0.52% increased crash probability, whereas Hutton et al. (2009) show that a one standard deviation increase in $OPAQU{E}_{it}$ is associated with a 1.73% increase in the probability of a future crash event. In summary, we consider the effect of ${EQUITY\_INTENT}_{it}$ to be economically significant. Therefore, our findings may help investors secure their investment performance against stock price crashes.

Table 3 Impact of equity intent on stock price crash risk

Full size table

Up to this point, our results present a positive association between a firm’s equity intent and its crash risk. To further examine the role of equity intent as a motive for bad news hoarding, we conduct a mediation analysis. In this analysis we assess the extent to which the effect of equity intent could be explained by another mediator variable that is likely to capture bad news hoarding by managers. In this regard, we assume that managers who try to block the flow of negative information to the public, intentionally or unintentionally use less negative language to explain their assessment of their firm. Therefore, we construct the mediation variable ${NEGW}_{it}$ that represents the fraction of negative words used by firm managers within the MD&A section. We estimate the fraction of non-negated negative words within the MD&A section by using the Fin-Neg wordlist proposed by Loughran and McDonald (2011).^{Footnote 21} Following the recommendations of Baron and Kenny (1986), the subsequent mediation analysis proceeds in three steps. Firstly, as documented under “Step 1” in Table 4, we regress the mediator $NEG{W}_{it}$ on our main independent variable $EQUITY\_INTEN{T}_{it}$. The results show that more equity intent of a firm is associate with significantly (at the 1%-level) less negative words within its MD&A section, indicating that managers who are seeking equity allow less negative news flow. Secondly, we regress the crash risk measures on $EQUITY\_INTEN{T}_{it}$ as shown in “Step 2” of Table 4. Hence, the second step simply repeats our baseline analysis and shows that higher equity intent corresponds to higher crash risk. Thirdly. We regress the crash risk measures on both our main independent variable $EQUITY\_INTEN{T}_{it}$ and on the mediator $NEG{W}_{it}$. We find that while the coefficient of $EQUITY\_INTEN{T}_{it}$ remains positive and statistically significant, negative news flow helps mitigating future crash risk as indicated by the negative and statistically significant coefficient of $NEG{W}_{it}$.^{Footnote 22} Finally, we estimate the average causal mediation effect (ACME). The ACME stands for the indirect effect of equity intent on crash risk that goes through negative language of managers assessed negative wording in the MD&A section of a firms 10-K report. For the calculation of the ACME we use 1000 simulations and non-parametric bootstrapped confidence intervals. The results are presented in columns (2)–(4). The ACME is statistically significant at the 1%-level. This finding indicates that, a significant part of the effect of equity intent on crash risk can be attributed to bad news hoarding through less negative language used by firm managers in their MD&A section. In line with the predictions of Jin and Myers (2006) as well as the vast majority of empirical research on crash risk, these findings indicate that managers’ bad news hoarding behavior serves an important economic mechanism through which managers’ equity intent affect future crash risk.

Table 4 Bad news hoarding—mediating effect of negative wording in the MD&A section

Full size table

Our paper illustrates that a firms equity intent acts a motive for bad news hoarding. Given that managers actively engage in earnings manipulation (Hutton et al. 2009) as well as textual obfuscation to hide negative news (Kim et al. 2019) one could naturally ask whether these instruments of bad news hoarding are more likely to increase crash risk when managers are incentivised to hoard bad news due to their equity intent. Consistent with the literature, we use the FOG Index to assess the textual obfuscation based on the readability of the 10-K report. We calculate the FOG Index as (words per sentence + percentage of complex words) × 0.4 with a higher FOG Index indicating that the report is more difficult to read, where complex words are defined as words exceeding three syllables. The FOG Index indicates the number of years of formal education a reader of average intelligence needs to understand a given text on the first reading. We follow the recommendations of Kim et al. (2019) and adjust the FOG Index for more than 2000 multisyllabic words that occur frequently in financial reports but are unlikely to be considered complex from an investor’s perspective. After modifying the FOG Index, we derive $MODFO{G}_{it}$ as proposed by Kim et al. (2019). If a firms equity intent reinforces the effects of earnings manipulation and textual obfuscation on crash risk, we would expect that ${OPAQUE}_{it}$ and ${MODFOG}_{it}$ have a stronger predictive power when managers have equity intent. Hence, we examine the effects of earnings manipulation and textual obfuscation for two different groups based on their equity intent. Similar to the univariate comparison in Table 2, we conduct a split sample analysis by comparing firms within the lower tercile (low equity intent) and the upper tercile (high equity intent) group of ${EQUITY\_INTENT}_{it}$. In Panel A of Table 5 the results for the low-equity intent group indicate that textual obfuscation is solely significant at the 5%-level for our second crash risk measure ${DUVOL}_{it+1}$. Moreover, ${OPAQUE}_{it}$ significantly predicts crash risk in $t+1$ for ${NCSKEW}_{it+1}$(on the 5%-level) and ${DUVOL}_{it+1}$ (on the 1%-level) and $CRAS{H}_{it+1}$ (on the 5%-level only for the squared term of $OPAQU{E}_{it}$). In line with our expectations, the results for the high-equity intent group presented in Panel B of Table 5 demonstrate that the effects of both instruments of bad news hoarding (earnings manipulation and textual obfuscation) increase in effect size and statistical significance. Thus, textual obfuscation increases crash risk in $t+1$ at least at the 5%-level for all crash risk measures. Similarly, earnings manipulation also increases crash risk in $t+1$ at least on the 5%-level for all measures of crash risk.^{Footnote 23} Overall, these findings indicate that managers’ equity intent increases the adverse consequences of common bad news hoarding tool which further strengthens our view that managers equity intent encourages managers to hide adverse information.

Table 5 Impact of equity intent on the association between earnings manipulation, textual obfuscation and stock price crash risk

Full size table

5 Additional analyses

In order to test the sensitivity of our results, we conduct several robustness tests. Firstly, because investors that try to hedge against future crash risk might be interested in the persistence with which equity intent signals bad news hoarding, we expand the prediction window of all four crash risk measures. While our baseline analysis employs a 12-month ahead prediction window, managers may accumulate bad news over several years. Thus, we test the relationship between managers’ equity intent and the 24-, 36-, and 48-month ahead stock price crash risk. The results are displayed in Table 6. We find that the coefficients of ${EQUITY\_INTENT}_{it}$ remain statistically significant at least on the 5%-level for all four crash variables when employing a 24-month window, and even remain significant on at least 5% for $NCSKE{W}_{it+3}$ and $DUVO{L}_{it+3}$ when employing a 36-month window. Even for a 48-month prediction window, the coefficients of ${EQUITY\_INTENT}_{it}$ remain statistically significant on the 10%-level when predicting $DUVO{L}_{it+4}$. Consistent with the notion that bad news hoarding becomes more difficult to maintain over the course of time the coefficients and their respective t-values of ${EQUITY\_INTENT}_{it}$ steadily decreases with an increasing prediction window, similar the models’ R-squared. Given the decreasing importance of $EQUITY\_INTEN{T}_{it}$ over time, our results suggest that investors should screen 10-K filings on a yearly basis to obtain better predictions of future crash risk.

Table 6 Persistence of the impact of equity intent on stock price crash risk

Full size table

Secondly, we examine the association between managers’ equity intent and upward stock price jumps. Our results generally suggest that managers’ equity intent is associated with increasing bad news hoarding behaviour. While a gradual increase of the stock price as a result of managers “hyping up” the firm’s stock price prior to an SEO seems plausible, sudden upward price jumps seem comparably unlikely, as they would indicate positive market reaction following the revelation of unexpected (good) news. In this case, one would expect that ${EQUITY\_INTENT}_{it}$ is associated with future stock price crashes as a consequence of the sudden release of the accumulated negative information, but should either have no, or a negative correlation with upward stock price jumps. Therefore, we decompose the crash risk measure $COUN{T}_{it+1}$ into stock price crashes and upward jumps and estimate both proxies separately. Unreported results show that ${EQUITY\_INTENT}_{it}$ is only positively associated with stock price crashes and shows a negative correlation with upward jumps, that is statistically significant only at the 10%-level. Similar to the conception of a placebo test, the theoretical consistency of these results suggest that our inferences are unlikely to be driven by unobservable forces.

Thirdly, we construct an alternative measure for our main variable of interest ${EQUITY\_INTENT}_{it}$. Given the usage of neural networks, we acknowledge that the variable is subject to a large parameter space and uncountable interactions between neurons. While our baseline analysis employs the DM model, we next estimate word embeddings using the DBOW model proposed by Le and Mikolov (2014). The DBOW model is generally considered to be less complex compared to the DM model as it only uses the document matrix to predict a randomly sampled set of consecutive words from a document. For the implementation of the DBOW model, we follow the recommendations of Lau and Baldwin (2016) and estimate 300-dimensional document embeddings in 400 iterations over the whole sample of LIQ + CAP sections.^{Footnote 24} Similar to our baseline analysis, we use these document embeddings to estimate an equity intent vector and calculate cosine similarities from each LIQ + CAP embeddings to the equity intent vector to derive $EQUITY\_INTENT{\_DBOW}_{it}$. Table 7 shows the results of estimating our crash risk measures using $EQUITY\_INTENT{\_DBOW}_{it}$. The results remain virtually unchanged, indicating that our inferences are not merely driven by model and parameter choices.

Table 7 Alternative measurement of equity intent

Full size table

Fourthly, we include additional control variables to address concerns of omitted variable bias.^{Footnote 25} Specifically, we address concerns that managers’ equity intent is merely a reflection of a firm’s risk profile by controlling for additional variables capturing business risks including a firm’s cash flow volatility $CFVO{L}_{it}$, sales volatility $SALESVO{L}_{it}$, earnings volatility $EARNVO{L}_{it}$ as well as the Herfindahl–Hirschman Index based on 3-digit SIC codes $HH{I}_{it}$ (Giroud and Mueller 2010; Kim et al. 2019). Since our main variable of interest is based on textual data, we also control for textual proxies that prior works found to be significantly associated with crash risk. These include the modified FOG Index $MODFO{G}_{it}$ proposed by Kim et al. (2019), managers’ usage of ambiguous language measured by the fraction of weak modal words within the MD&A $WMO{D}_{it}$ and the natural logarithm of a 10-K’s file size in megabytes $FILESIZ{E}_{it}$ as in Ertugrul et al. (2017).^{Footnote 26} Finally, based on the findings of Wu and Lai (2020), we include two additional measures for firm’s inherent information asymmetry, namely, intangible intensity $ADJROT{A}_{it}$ as suggested by Clausen and Hirth (2016) as well as a firm’s proprietary costs $PROP\_COS{T}_{it}$ measured as a firm’s R&D expense scaled by total assets and firm age ($AG{E}_{it}$). The results displayed in Table 8 show that the coefficient of ${EQUITY\_INTENT}_{it}$ remains statistically significant for all four crash measures, alleviating concerns of omitted variable bias.^{Footnote 27}

Table 8 Additional controls

Full size table

Finally, we test whether earnings manipulation, a common vehicle managers use to hoard bad news (Hutton et al. 2009), serves as another mediator for the effect of equity intent on stock price crash risks. Specifically, one could argue that managers do not only block negative news flow but also engage in more aggressive earnings management to obfuscate financial disclosures. Therefore, in Table 9, we use $OPAQU{E}_{it}$ as a mediator variable for $EQUITY\_INTEN{T}_{it}$. “Step 1” of the mediation analysis shows that higher equity intent is associated with higher levels of earnings management, suggesting that managers who search for equity are more likely to engage in earnings management. Similarly, the results of “Step 2” and “Step 3” of the mediation analysis are largely consistent with our baseline analysis, indicating that earnings management serves as a significant mediator for effects of managers’ equity intent on future crash risk. In summary, the above analyses suggest robustness of our finding that managers equity intent incentivises managers to hoard bad news and hence, increases firm-specific crash risk.

Table 9 Bad News Hoarding – Mediating effect of earnings manipulation

Full size table

6 Conclusion

Previous research has established the notion that managers are incentivised to strategically withhold negative information from outside investors in order to maximize their own capture of the firm’s cash flows (Jin and Myers 2006; Kothari et al. 2009). Since the accumulation of bad news is limited when further obfuscations become too costly or difficult to maintain, all negative information is released at once, resulting in the firm’s stock price to crash. Given the subsequent destruction of shareholder welfare, the motives for managers to conduct this behaviour has received increasing attention from financial research literature. While incentive-based management compensation, CEO overconfidence (Kim et al. 2016), and overall career concerns (Kothari et al. 2009) have already been identified as motives to obfuscate corporate disclosure, we turn our focus on corporate financing decisions, thus drawing the connection to arguably one of the most important areas of corporate decision making. Capital structure decisions, especially in the case of equity financing. are known for providing agency conflicts between managers and outside investors, which should incentivise managers to obfuscate corporate disclosure.

Further extending the analysis of drivers of bad news hoarding behaviour, we analyse whether the intention to issue equity incentivises managers to strategically withhold negative information to maximize corresponding capital inflow and strengthen their position of power inside the firm. We examine the role of managers’ intentions within the capital structure decisions in shaping firm-specific stock price crash risks, by analysing how managers express the financing needs of their firms in the LIQ + CAP sections of their MD&A. Using neural network embeddings of the LIQ + CAP sections, we identify different degrees of equity intent. Differences in the management’s inclination to raise more equity may lead to different levels of bad news hoarding which in turn may impact firm-specific stock price crash risks. We expect that managers expressing equity intent in the firms’ disclosures are more prone to bad news hoarding behaviour which helps to predict future stock price crash risk of firms. In fact, we find that a firm’s current equity intent has strong predictive power for firm-specific stock price crash risk in $t+1$. As a central motive for bad news hoarding, equity intent exerts comparable effects on crash risk as other popular obfuscation vehicles. By conducting a mediation analysis, we find that stronger equity intent affects future stock price crash risk through the channel of bad news hoarding. Consistent with this notion, we find that managers’ equity intent incentivises managers to obfuscate financial disclosure by manipulating earnings and writing overly complex reports. Moreover, our results remain robust for wider prediction windows up to three years, alternative measures of equity intent as well as for a diverse set of additional factors that potentially influence stock price crash risk.

This paper contributes to the stock price crash risk literature by describing the mechanisms between management intentions in capital structure decisions and future stock price crash risks. As financing decisions present one of the most essential business activities, our results apply relevance for both investors and a broad scope of corporate decision makers. In addition, our approach fits in with current SEC efforts to use data analytics to uncover EPS manipulation as part of their current policy agenda requiring stronger corporate disclosures to better protect investors.^{Footnote 28} In contrast to the rather basic textual information complementing prediction models applied in previous research, our approach holds additional prediction power for stock price crashes as it is sensitive towards various textual cues in management statements within firm disclosures indicative for bad news hoarding. More general, it enables to gauge management intentions from written statements and therefore provides an interesting venue for future research analysing the effects of opportunistic management behaviour on financial market outcomes. For example, using our method to identify managers’ intentions with regards to capital structure decisions, future research may provide a deeper analysis of how manager time the release of good news prior to raising capital (Lang and Lundholm 2010). While this paper suggests that using information retrieval techniques that go beyond simple summarization techniques such as word counts and readability measures, future works may further explore textual cues of bad news hoarding by applying more sophisticated approaches such as named entity recognition techniques, topic models, or word embeddings.

Notes

Note that accounting fraud constitutes an extreme and illegal form of bad news hoarding. However, it is documented that managers have significant discretion over both accounting numbers and textual disclosures (Dechow et al. 1995; Hung and Qiao 2017; Lo et al. 2017; Roychowdhury 2006). Hence, bad news hoarding is likely to come in shades. While some managers may engage in extreme forms of bad news hoarding that result in accounting scandals, other managers will rather aim to obfuscate financial reporting by remaining comfortably within the scope of discretion permitted by regulators.
Sunder (2010) differentiates between two perspectives on risk mitigation. The first risk perspective describes the uncertainty of outcomes, which is symmetric in losses and gains and can be reduced through diversification. The second risk perspective focuses on the magnitude and probability of losses and can only by reduced via screening mechanisms. While the overall uncertainty of outcome of a specific firm (stock volatility) can be reduced through diversification, the most appropriate response towards extreme tail-risks (i.e., stock price crashes) are screening mechanisms.
Note that penny stocks are often traded in thin markets that have a high price volatility and low liquidity. Since we seek to examine the relationship between managers’ equity intent and their tendencies to hoard bad news, stock price crashes arising from difficult market conditions could bias our empirical identification. Therefore, we follow the extant literature and remove low-priced firms from our sample (e.g. Kim et al. 2011; Kim et al. 2016; Kim and Zhang 2016; Kim et al. 2019.).
Note that, while some studies in the field of crash risk also exclude firms with negative book values (e.g. Al Mamun et al. 2021; Wu and Lai 2020), these firms have become much more common in recent years, are often financially healthy, and report strong earnings (Luo et al., 2021). Since Luo et al. (2021) show that the extreme leverage ratios of such firms are mostly driven by high investment demand, they also appear susceptible to the usage of equity financing and thus, are clearly of interest for our research question. However, unreported results suggest that our inferences remain unchanged when excluding negative book value firms.
For the textual analysis, we use all available 10-K filings before dropping observations with insufficient data to measure crash risk. This significantly increases our sample size for the textual analysis and consequently, improves the generalizability of our textual proxies.
Note that we test whether either one of the phrases appears in a sentence of the LIQ + CAP section. Therefore, these phrases can slightly vary. For instance, we also identify phrases such as “we seek to undertake a new equity offering” even though the phrase is “undertake equity offering”. In total we identify 6,499 LIQ + CAP sections in which managers explicitly state their equity intent which makes up 12.6% of our total sample. This is generally consistent with Hoberg and Maksimovic (2014) who identify 12.8% of firms within their sample as equity-focused.
Note, that the authors only report the results of a combined measure that uses both the DM and DBOW Doc2Vec architectures. Furthermore, as a baseline, the authors apply a BOW approach that uses term frequency-inverse document frequencies (TF-IDF) weights as the raw count approach used in Hoberg and Maksimovic (2014) performs even worse. Even when using a more sophisticated method such as a weighted Bag-Of-Bigrams with TF-IDF weights, the authors still report a relative improvement of 32% in terms of error rate.
Note that we use spaCy for preprocessing step (ii) and (iii).
We use the generic stopword list proposed by Loughran and McDonald (https://sraf.nd.edu/textual-analysis/resources/#StopWords).
Specifically, the sentence is taken from the Agilent Technologies 10-K filing in 2009 (0001047469–09-010861).
Note that spaCy also detects named entities that consist of multiple words such as “Apple Inc” or “the following year”.
Note that “Property, plant and equipment” refers to a common balance sheet item. Hence, its meaning is not a simple composition of the meanings of its individual words. Since these phrases have a different meaning than their individual words, they should rather be treated as unique tokens (Mikolov et al. 2013b).
Note that, since stopwords can help understanding commonly used phrases, we remove stopwords after forming phrases. Therefore, the phrase “property_plant_and_equipment” still includes the stopword “and”.
Following Lau and Baldwin (2016), we use a vector size of $d=$ 300, a window size of 5, use a down-sampling threshold of ${1e}^{-6}$, draw 5 “noise words” through negative sampling. Given the size of our dataset we also ignore words accruing less than 5 times.
Note that the embedding dimensions are not interpretable by humans. Each dimension attempts to capture semantic information of a given LIQ + CAP section which the Doc2Vec model learned during the training process.
In line with previous research, we rely on weekly returns as it increases variance R²s compared to the utilization of monthly data and thus, should result in more accurate crash risk measures. However, our results remain robust when following Callen and Fang (2013, 2015a, b) in measuring crash risk who use daily instead of weekly returns.
Note that the lead and lag terms barely correlate across time (i.e. < 10%). However, the market (${r}_{m\tau }$) and industry (${r}_{j\tau }$) returns show high correlation coefficients of approximately 79%. Even though these correlations are unlikely to bias the residuals of the market-industry index model, we also estimate firm-specific weekly returns using an extended market model as in Kim et al. (2021) that excludes industry returns and instead controls for additional lead and lag terms of market returns. Our results remain robust.
The threshold of 3.09 standard deviations reflects the critical value for 0.1% of the distribution of weekly returns.
We recognise that ${COUNT}_{it+1}$ exhibits only small variations within our sample since it only represents extreme values of the return distribution. In theory, this may lead to lower reliability and fit of OLS regression models. To account for this, we recode ${COUNT}_{it+1}$ to positive values by adding a constant equal to the minimum sample value of ${COUNT}_{it+1}$ to each observation and re-estimated all our models using Poisson regressions. We find that the results are qualitatively the same. For this reason, and to maintain the comparability with the existing literature, we report the results of the OLS regressions below. All other results are available on request.
We also calculate standardized coefficients for all OLS estimates of ${EQUITY\_INTENT}_{it}$. We find that a one standard deviation increase in equity intent exerts a fairly consistent positive influence on future crash risk for all OLS models (${NCSKEW}_{it+1}: 0.036; \,\,{DUVOL}_{it+1}: 0.043;$ ${COUNT}_{it+1}$: 0.025).
We rely on the level of word counts since Loughran and McDonald (2011) note that differencing methods rely on the assumption that investors can remember the frequency of negative words used in 10-Ks that were filed one year ago. Moreover, their results suggest that much of the variation in differences is likely to be driven by random variation in the frequency of commonly used words.
Note that consistent with the prediction of Baron and Kenny (1986), we find that in “Step 3” of Table 1, the coefficients of $EQUITY\_INTEN{T}_{it}$ are slightly smaller as compared to the regression in “Step 2”. Baron and Kenny (1986) argue that it is critical to examine not only the significance of the coefficients but also their absolute magnitude. Since the independent variable and the mediator should be correlated one would expect reduced power in the test of the coefficients in “Step 3” as shown by our results.
In addition, we find larger and more significant effects of most control variables for firms with high equity intent. Thus, in line with expectations, growth firms are even more crash prone if their managers have a strong incentive to hide adverse information, e.g., hive a high equity intent. For brevity, we do not report the results for the control variables.
Following Lau and Baldwin (2016), we use a vector size of $d=$ 300, a window size of 15, use a down-sampling threshold of ${1e}^{-5}$, draw 5 “noise words” through negative sampling. Given the size of our dataset we also ignore words accruing less than 5 times.
Descriptive statistics for the additional variables are displayed in Table 1A.
While Ertugrul et al. (2017) and Kim et al. (2019) estimate the fraction of ambiguous language as well as the FOG Index based on the full 10-K file, we determinately measure the variables based on the MD&A section as this section is more likely to capture characteristics of firm’s management. However, constructing measures based on the full 10-K does not alter our results.
Note that our results also remain robust when using ${\mathrm{EQUITY}\_\mathrm{INTENT}\_\mathrm{DBOW}}_{\mathrm{it}}$ instead.
See https://www.wsj.com/articles/sec-digs-deeper-into-companies-eps-manipulation-11633870803.
Specifically, we compile the following regex as a raw string:
(\n[^A-Za-z\n\"\ “] +)(FINANCIAL\s + \w + ,\s +|ANALYSIS\s + OF\s +)?LIQUIDITY\s + (AND|\&).
Sometimes table headers are written in caps, so we do not match (\bAT\b|YEAR|FISCAL|DECEMBER). The final end regex is compiled by using the exact same formatting text of the starting regex followed by the following raw string:
(?!(\bAT\b|YEAR|FISCAL|DECEMBER))\b[A-Z] + \b|\n[^A-Za-z\n\"]*?I[Tt][Ee][Mm]\s + \dA.

References

Al Mamun MD, Balachandran B, Duong HN, Gul FA (2021) Are corporate general counsels in top management effective monitors? Evidence from stock price crash risk. Eur Acc Rev 30:405–437. https://doi.org/10.1080/09638180.2020.1763819
Article Google Scholar
Alti A (2003) How persistent is the impact of market timing on capital structure? SSRN J. https://doi.org/10.2139/ssrn.458640
Article Google Scholar
Baker M, Wurgler J (2002) Market timing and capital structure. J Financ 57:1–32. https://doi.org/10.1111/1540-6261.00414
Article Google Scholar
Ball RA (2009) Market and political/regulatory perspectives on the recent accounting scandals. J Account Res 47:277–323. https://doi.org/10.1111/j.1475-679X.2009.00325.x
Article Google Scholar
Bao D, Fung SYK, Su LN (2018) Can shareholders be at rest after adopting clawback provisions? Evidence from stock price crash risk. Contemp Account Res 35:1578–1615. https://doi.org/10.1111/1911-3846.12326
Article Google Scholar
Baron RM, Kenny DA (1986) The moderator–mediator variable distinction in social psychological research: conceptual, strategic, and statistical considerations. J Pers Soc Psychol 51:1173–1182. https://doi.org/10.1037/0022-3514.51.6.1173
Article Google Scholar
Bergmann NK, Roychowdhury S (2008) Investor sentiment and corporate disclosure. J Account Res 46:1057–1083. https://doi.org/10.1111/j.1475-679X.2008.00305.x
Article Google Scholar
Callen JL, Fang X (2013) Institutional investor stability and crash risk: monitoring versus short-termism? J Bank Finance 37:3047–3063. https://doi.org/10.1016/j.jbankfin.2013.02.018
Article Google Scholar
Callen JL, Fang X (2015a) Religion and stock price crash risk. J Financ Quant Anal 50:169–195. https://doi.org/10.1017/S0022109015000046
Article Google Scholar
Callen JL, Fang X (2015b) Short interest and stock price crash risk. J Bank Finance 60:181–194. https://doi.org/10.1016/j.jbankfin.2015.08.009
Article Google Scholar
Chen J, Hong H, Stein J (2001) Forecasting crashes: trading volume, past returns and conditional skewness in stock prices. J Financ Econ 61:345–381. https://doi.org/10.3386/w7687
Article Google Scholar
Chi J, Gupta M (2009) Overvaluation and earnings management. J Bank Finance 33:1652–1663. https://doi.org/10.1016/j.jbankfin.2009.03.014
Article Google Scholar
Clausen S, Hirth S (2016) Measuring the value of intangibles. J Corp Finan 40:110–127. https://doi.org/10.1016/j.jcorpfin.2016.07.012
Article Google Scholar
Cohen DA, Zarowin P (2010) Accrual-based and real earnings management activities around seasoned equity offerings. J Account Econ 50:2–19. https://doi.org/10.1016/j.jacceco.2010.01.002
Article Google Scholar
DeAngelo H, DeAngelo L, Stulz RM (2010) Seasoned equity offerings, market timing, and the corporate lifecycle. J Financ Econ 95:275–295. https://doi.org/10.1016/j.jfineco.2009.11.002
Article Google Scholar
Dechow P, Sloan R, Sweeney A (1995) Detecting earnings management. Account Rev 70:193–225
Google Scholar
Dimson E (1979) Risk measurement when shares are subject to infrequent trading. J Financ Econ 7:197–226
Article Google Scholar
DuCharme LL, Malatesta PH, Sefcik SE (2004) Earnings management, stock issues, and shareholder lawsuits. J Financ Econ 71:27–49. https://doi.org/10.1016/S0304-405X(03)00182-X
Article Google Scholar
Eaglesham J (2013) Accounting fraud targeted. Wall Street J 2013:C1
Google Scholar
El-Haj M, Rayson P, Walker M, Young S, Simaki V (2019) In search of meaning: lessons, resources and next steps for computational analysis of financial discourse. J Bus Financ Acc 46:265–306. https://doi.org/10.1111/jbfa.12378
Article Google Scholar
Ertugrul M, Lei J, Qiu J, Wan C (2017) Annual report readability, tone ambiguity, and the cost of borrowing. J Financ Quant Anal 52:811–836. https://doi.org/10.1017/S0022109017000187
Article Google Scholar
Franco-Santos M, Lucianetti L, Bourne M (2012) Contemporary performance measurement systems: a review of their consequences and a framework for research. Manag Account Res 23:79–119. https://doi.org/10.1016/j.mar.2012.04.001
Article Google Scholar
Giroud X, Mueller HM (2010) Does corporate governance matter in competitive industries? J Financ Econ 95:312–331. https://doi.org/10.1016/j.jfineco.2009.10.008
Article Google Scholar
Graham JR, Harvey CR (2001) The theory and practice of corporate finance—evidence from the field. J Financ Econ 60:187–243. https://doi.org/10.1016/S0304-405X(01)00044-7
Article Google Scholar
Graham JR, Harvey CR, Rajgopal S (2005) The economic implications of corporate financial reporting. J Account Econ 40:3–73. https://doi.org/10.1016/j.jacceco.2005.01.002
Article Google Scholar
Harris ZS (1954) Distributional structure. Word 10:146–162. https://doi.org/10.1080/00437956.1954.11659520
Article Google Scholar
Hoberg G, Maksimovic V (2014) Redefining financial constraints: a text-based analysis. Rev Financ Stud 28:1312–1352. https://doi.org/10.1093/rfs/hhu089
Article Google Scholar
Hong HA, Kim J-B, Welker M (2017) Divergence of cash flow and voting rights, opacity, and stock price crash risk: international evidence. J Account Res 55:1167–1212. https://doi.org/10.1111/1475-679X.12185
Article Google Scholar
Hung S, Qiao Z (2017) Shadows in the sun: crash risk behind earnings transparency. J Bank Finance 83:1–18. https://doi.org/10.1016/j.jbankfin.2017.06.007
Article Google Scholar
Hutton AP, Marcus AJ, Tehranian H (2009) Opaque financial reports, R2, and crash risk. J Financ Econ 94:67–86. https://doi.org/10.1016/j.jfineco.2008.10.003
Article Google Scholar
Jensen MC (2005) Agency costs of overvalued equity. Financ Manage 34:5–19. https://doi.org/10.1111/j.1755-053X.2005.tb00090.x
Article Google Scholar
Jensen MC, Meckling WH (1976) Theory of the firm: managerial behavior, agency costs, and ownership structure. J Financ Econ 3:305–360. https://doi.org/10.1002/9780470752135.ch17
Article Google Scholar
Jin L, Myers S (2006) R2 around the world: New theory and new tests. J Financ Econ 79:257–292. https://doi.org/10.1016/j.jfineco.2004.11.003
Article Google Scholar
Kim Y, Park MS (2005) Pricing of seasoned equity offers and earnings management. J Financ Quant Anal 40:435–463. https://doi.org/10.1017/S0022109000002374
Article Google Scholar
Kim J-B, Zhang L (2016) Accounting conservatism and stock price crash risk: firm-level evidence. Contemp Account Res 33:412–441. https://doi.org/10.1111/1911-3846.12112
Article Google Scholar
Kim J-B, Li Y, Zhang L (2011) Corporate tax avoidance and stock price crash risk: Firm-level analysis. J Financ Econ 100:639–662. https://doi.org/10.1016/j.jfineco.2010.07.007
Article Google Scholar
Kim J-B, Wang Z, Zhang L (2016) CEO overconfidence and stock price crash risk. Contemp Account Res 33:1720–1749. https://doi.org/10.1111/1911-3846.12217
Article Google Scholar
Kim C, Wang K, Zhang L (2019) Readability of 10-K reports and stock price crash risk. Contemp Account Res 36:1184–1216. https://doi.org/10.1111/1911-3846.12452
Article Google Scholar
Kim J-B, Si Y, Xia C, Zhang L (2021) Corporate derivatives usage, information environment, and stock price crash risk. Eur Account Rev. https://doi.org/10.1080/09638180.2021.1918564
Article Google Scholar
Kothari SP, Shu S, Wysocki PD (2009) Do managers withhold bad news? J Account Res 47:241–276. https://doi.org/10.1111/j.1475-679X.2008.00318.x
Article Google Scholar
Krasker WS (1986) Stock price movements in response to stock issues under asymmetric information. J Financ 41:93–105. https://doi.org/10.1111/j.1540-6261.1986.tb04493.x
Article Google Scholar
Lang MH, Lundholm RJ (2010) Voluntary disclosure and equity offerings: reducing information asymmetry or hyping the stock? Contemp Account Res 17:623–662. https://doi.org/10.1506/9N45-F0JX-AXVW-LBWJ
Article Google Scholar
Lau JH, Baldwin T (2016) An empirical evaluation of doc2vec with practical insights into document embedding generation. In: 1st Workshop on Representation Learning for NLP, pp 78–86
Le Q, Mikolov T (2014) Distributed representations of sentences and documents. In: Proceedings of the 31st international conference on machine learning 32, pp 1188–1196
Li K, Mai F, Shen R, Yan X (2021) Measuring corporate culture using machine learning. Rev Financ Stud 34:3265–3315. https://doi.org/10.1093/rfs/hhaa079
Article Google Scholar
Lo K, Ramos F, Rogo R (2017) Earnings management and annual report readability. J Account Econ 63:1–25. https://doi.org/10.1016/j.jacceco.2016.09.002
Article Google Scholar
Loughran T, McDonald B (2011) When is a liability not a liability? Textual analysis, dictionaries, and 10-Ks. J Financ 66:35–65. https://doi.org/10.1111/j.1540-6261.2010.01625.x
Article Google Scholar
Luo H, Liu I, Tripathy N (2021) A study on firms with negative book value of equity. Int Rev Financ 21:145–182. https://doi.org/10.1111/irfi.12260
Article Google Scholar
MacKinnon JG, White H (1985) Some heteroskedasticity-consistent covariance matrix estimators with improved finite sample properties. J Econometr 29:305–325. https://doi.org/10.1016/0304-4076(85)90158-7
Article Google Scholar
Matin R, Hansen C, Hansen C, Mølgaard P (2019) Predicting distresses using deep learning of text segments in annual reports. Expert Syst Appl 132:199–208. https://doi.org/10.1016/j.eswa.2019.04.071
Article Google Scholar
Mikolov T, Sutskever I, Chen K, Corrado G, Dean J (2013b) Distributed representations of words and phrases and their compositionality. In: Jordan MI, LeCun Y, Solla SA (eds) Advances in neural information processing systems. MIT, Cambridge, pp 3111–3119
Google Scholar
Mikolov T, Chen K, Corrado G, Dean J (2013a) Efficient estimation of word representations in vector space. In: 1st International conference on learning representations, ICLR 2013a, Scottsdale, Arizona, USA, May 2–4, 2013a, workshop track proceedings
Pennington J, Socher R, Manning CD (2014) Glove: global vectors for word representation. In: Alessandro Moschitti QCRI, Bo Pang G, Walter Daelemans UoA (eds) Proceedings of the 2014 conference on empirical methods in natural language processing (EMNLP). Association for Computational Linguistics, Stroudsburg, pp 1532–1543
Reichmann D, Reichmann M (2022) Predicting firm-specific stock price crashes. Working Paper
Roychowdhury S (2006) Earnings management through real activities manipulation. J Account Econ 42:335–370. https://doi.org/10.1016/j.jacceco.2006.01.002
Article Google Scholar
Schmidt PS, von Arx U, Schrimpf A, Wagner AF, Ziegler A (2019) Common risk factors in international stock markets. Fin Markets Portfolio Mgmt 33:213–241. https://doi.org/10.1007/s11408-019-00334-3
Article Google Scholar
Sunder S (2010) Riding the accounting train: from crisis to crisis in eighty years. In: Presentation at the conference on financial reporting, auditing and governance
Walker MD, Yost K (2008) Seasoned equity offerings: what firms say, do, and how the market reacts. J Corp Financ 14:376–386. https://doi.org/10.1016/j.jcorpfin.2008.04.001
Article Google Scholar
Wu K, Lai S (2020) Intangible intensity and stock price crash risk. J Corp Finan 64:101682. https://doi.org/10.1016/j.jcorpfin.2020.101682
Article Google Scholar

Download references

Funding

Open Access funding enabled and organized by Projekt DEAL.

Author information

Authors and Affiliations

Department of Banking and Finance, Ruhr University Bochum, Bochum, Germany
Doron Reichmann, Rouven Möller & Tobias Hertel

Authors

Doron Reichmann
View author publications
You can also search for this author in PubMed Google Scholar
Rouven Möller
View author publications
You can also search for this author in PubMed Google Scholar
Tobias Hertel
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Doron Reichmann.

Ethics declarations

Conflict of interest

The authors have no relevant financial or non-financial interests to disclose.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Appendix 1: Parsing the LIQ + CAP section

Since our estimation of document embedding heavily relies on a clean identification of the LIQ + CAP section we provide a detailed description of our parsing procedure below. We start by extracting the MD&A sections using the parser proposed by Reichmann and Reichmann (2022) which outputs raw text MD&A sections that are stripped of HTML code. Next, the parser replaces HTML tags that are commonly used to indicate certain formatting are replaced with predefined tags such as “###” or “***”. Using these raw text files, we next develop an algorithm to parse the LIQ + CAP section that searches the MD&A section using a maximum of eight iterations. Each iteration is built on the notion that an MD&A follows a consistent layout in terms of formatting. Once we match any of the following specifications, the algorithm outputs the LIQ + CAP section. Otherwise, it continues to the next iteration.

1.
We search for formatted section headers written in capital letters: Specifically, we search for any potential formatting tag that directly starts at the beginning of a new line that is followed by a upper case section header that fits the description of the Liquidity and Capital Resources section.^{Footnote 29} If we find a match, we extract the formatting tags and search for a new line starting with the exact same formatting tags followed by either caps-written words or a case-insensitive variation of Item 7a since the LIQ + CAP section is often find at the end of the MD&A.^{Footnote 30} We consider this first-best match as a clear indicator of the starting and ending point of the LIQ + CAP section.
2.
We search for formatted section headers with important words taking an initial capital letter: In titles, all important words often take an initial capital letter. Therefore, in the second iteration, we basically re-run iteration two with the only difference that we require important word to have a capital letter followed by lower case letters (e.g. Liquidity and Capital Resources). Consistently, we also expect the LIQ + CAP section to end with a similar formatted section header.
3.
We search for headers written in capital letters: We find that not all headers start with a predefined tag denoting HTML-formatting. This is especially true for firm-years from earlier years, where HTML was less common. In this case, we start by looking for headers denoting the start of the LIQ + CAP section written in capital letters. Then, we compile a list of common subsections that usually follow the LIQ + CAP section. Specifically, we compile the following regexes using OR operators:
a.
\nCONTRACTUAL\s + (AND\s + OTHER\s +)?OBLIGATION
b.
\n(CRITICAL|RECENT|SIGNIFICANT)\s + ACCOUNTING
c.
\nFORWARD(\s +|-)LOOKING\s + STATEMENT
d.
\nOFF(\s +|-)BALANCE\s + SHEET
e.
\nOPERATING(.*?)?\s + ACTIVITIES
f.
\n(PLAN|RESULTS)\s + OF\s + OPERATIONS
g.
\nPAYMENTS\s + DUE
h.
\nGOVERNMENT\s + REGULATION
i.
\nREPORT\s + OF\s + INDEPENDENT
j.
\nI[Tt][Ee][Mm]\s + \dA
k.
\nI[Tt][Ee][Mm]\s + \d
4.
We search for headers with important words taking an initial capital letter: Finally, if iteration 1) to 3) fail, we follow a similar concept as in iteration 2) by rerunning iteration 3) but instead of looking for fully capitalized headers we require important words taking an initial capital letter.

Finally, we find that the algorithm sometimes identifies very small texts as LIQ + CAP sections that are table of contents within the MD&A. Therefore, we require a LIQ + CAP section to exceed 50 characters. Consequently, once the algorithm identifies a starting point in a table of content, it will fail to pass the 50 characters threshold for the whole section as it usually identifies the ending point immediately in the next line of the table. In this case we re-run iteration 1) to 4) after cutting off the MD&A up to the falsely identified starting point, thus forcing the algorithm to find a new starting point outside of the table of contents (Tables

Table 10 Descriptive statistics—additional variables

Full size table

10,

Table 11 Equity intent and firm characteristics

Full size table

11).

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Reichmann, D., Möller, R. & Hertel, T. Nothing but good intentions: the search for equity and stock price crash risk. J Bus Econ 92, 1455–1489 (2022). https://doi.org/10.1007/s11573-022-01085-w

Download citation

Accepted: 03 March 2022
Published: 30 March 2022
Issue Date: November 2022
DOI: https://doi.org/10.1007/s11573-022-01085-w

Keywords

JEL Classification

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Nothing but good intentions: the search for equity and stock price crash risk

Abstract

Similar content being viewed by others

Bankruptcy prediction and the discriminatory power of annual reports: empirical evidence from financially distressed German companies

Social Integrity and Stock Price Crash Risk

Textual tone in corporate financial disclosures: a survey of the literature

1 Introduction

2 Literature review and hypothesis

Hypothesis

3 Sample selection, data, and research design

3.1 Sample selection

3.2 Measuring managers’ equity intent

3.3 Measuring crash risk

4 Results

4.1 Univariate statistics

4.2 Main results

5 Additional analyses

6 Conclusion

Notes

References

Funding

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Appendix 1: Parsing the LIQ + CAP section

Rights and permissions

About this article

Cite this article

Keywords

JEL Classification

Navigation

Nothing but good intentions: the search for equity and stock price crash risk

Abstract

Similar content being viewed by others

Bankruptcy prediction and the discriminatory power of annual reports: empirical evidence from financially distressed German companies

Social Integrity and Stock Price Crash Risk

Textual tone in corporate financial disclosures: a survey of the literature

1 Introduction

2 Literature review and hypothesis

Hypothesis

3 Sample selection, data, and research design

3.1 Sample selection

3.2 Measuring managers’ equity intent

3.3 Measuring crash risk

4 Results

4.1 Univariate statistics

4.2 Main results

5 Additional analyses

6 Conclusion

Notes

References

Funding

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Appendix 1: Parsing the LIQ + CAP section

Appendix 1: Parsing the LIQ + CAP section

Rights and permissions

About this article

Cite this article

Share this article

Keywords

JEL Classification

Search

Navigation