1 Introduction

Financial analysts act as the key information intermediaries between firms and investors in capital markets (e.g. Lang and Lundholm 1996). To this end, they evaluate the future prospects of firms and provide earnings forecasts for investors (Barth and Hutton 2004). As with board directors (Brown et al. 2019) and institutional investors (Kang et al. 2018), analysts may opt to cover firms in more than one industry for various reasons. For instance, analysts might intend to cover industries which are related to their educational backgrounds (Kini et al. 2009; Cao et al. 2022); on which they have prior work experience (Bradley et al. 2017a) and social networks (Cohen et al. 2010; Chen et al. 2021); or for which their forecast accuracy is expected to be higher (Balashov 2017). In the case when competition among analysts in a specific industry is so fierce that it becomes difficult for them to stand out from the crowd, they might diversify their firm coverage across multiple industries to reduce competitive risk and adversities posed to their career prospects and compensation (Merkley et al. 2017). Despite the analysts’ multi-industry engagements (e.g. Clement 1999; Jacob et al. 1999; Kini et al. 2009), the literature has paid little attention to the phenomenon that analysts put different emphasis on, and allocate varying time, effort, and resources to, the industries they covered, and thus whether this differential by analysts for their covered industries affects their forecast quality remains unknown. We aim to fill this gap in the literature. To this end, we investigate the impact of analysts’ industrial concentration on the quality of their earnings forecasts. We measure the forecast quality by the accuracy, informativeness, and timeliness of analysts’ earnings forecasts.

The industrial concentration of analysts pertains to a structural characteristic of their firm coverage. We define analysts’ industrial concentration as the extent to which an analyst concentrates her/his research firms on a smaller proportion of industries s/he covers. Analysts who provide forecasts for a large number of firms within a small proportion of industries have a relatively higher level of industrial concentration, and vice versa. Accordingly, we use the Herfindahl–Hirschman Index (HHI) to construct two empirical measures of analysts’ industrial concentration. First, for each analyst, we compute the square of the ratio of the number of forecasts of earnings per share (EPS) for each covered industry to the total number of her/his EPS forecasts in a fiscal year, and then take the sum of all the squares as our first measure of analysts’ industrial concentration (HHI). Second, analysts may follow a few firms, with quite a few forecasts made for each covered firm, in each covered industry. In such a case, compared with the number of EPS forecasts for each sector, the number of covered firms for the sector might be better in reflecting analysts’ effort and resources allocated to the analysis of industry information. To account for this case, we compute the square of the ratio of the number of an analyst’s covered firms for each covered industry to the total number of her/his covered firms in a fiscal year, and then take the sum of all the squares as our second measure of analysts’ industrial concentration (HHI1).

The degree of an analyst’s industrial concentration is conceptually determined by the proportion of the number of covered firms in each industry to the total number of covered firms. Yet, the industrial concentration has nothing to do with the number of industries covered by the analyst if s/he covers more than one industry. An analyst can be industrially concentrated even if s/he covers a great deal of industries. That said, analysts with higher industrial concentration usually cover a considerable number of firms in each concentrated industry, and are likely to expend more time, effort, and resources in analysing these firms. As a result, their industry analyses tend to be in more depth and breadth for the concentrated industries.

Because of the information commonality of peer firms in the same industry (Frankel et al. 2006; Liu 2011), industrial-concentrated analysts may acquire and accumulate industry knowledge and experience, and thereby become more sophisticated and specialized in analysing industry information. By following a considerable number of firms in few industries, industrial-concentrated analysts can also benefit from economies of scale in the decreased (increased) costs (efficiency) of their information acquisition and/or processing. Indeed, the extant literature (e.g. Ramnath 2002; Piotroski and Roulstone 2004; Boni and Womack 2006; Brown and Mohammad 2010; Hilary and Shen 2013; Dichev et al. 2013; Chen et al. 2021) documents that industry-specific information is important and conducive to earnings forecasting. For instance, Brown et al. (2015) show that analysts, whose equity research attaches importance to industry information, are more likely to detect stock mispricing, thereby providing more accurate forecasts and rendering stock recommendation opinions more informative to investors. Bradley et al. (2017a) find that analysts, who have industry experience before becoming analysts, are more specialized in processing industry information, which increases their forecast accuracy. To the extent that higher industrial concentration enables analysts to acquire and process industry information better, the quality of their earnings forecasts should be higher. On this basis, the association between analysts’ industrial concentration and forecast quality is expected to be positive.

On the other hand, as analysts with higher industrial concentration follow more firms in the concentrated industries, they might deem the firm-specific information of industry peers to be of high similarity, and thus under-emphasize analysis of firm-specific information for their forecasting. They might be overconfident in, and over-reliant on, industry information, and take less effort to analyse firm-specific information than they should. As a result, their forecast quality decreases. Thus, whether analysts’ industrial concentration is positively or negatively associated with their forecast quality is an open empirical issue to explore in this study.

We also examine how the relation between analysts’ industrial concentration and their earnings forecast quality varies with product market competition, firm-specific risk, and the information opacity of firms. In a highly competitive product market, firms tend to reduce information disclosures due to the concern on high proprietary costs of disclosures (Ellis et al. 2012; Mattei and Platikanova 2017). The lack of corporate disclosures would make industry information more important for analysts’ earnings forecasts. Furthermore, a firm’s prospect in a highly competitive industry depends critically on its competitive advantage over its industrial peer firms rather than on its own performance alone (Gu 2016; Martens and Sextroh 2021). As such, industry information becomes more value-relevant to analysts, who need to make a comparative analysis of the industrial peers to assess the firm’s performance and prospect and make forecasts. As industrial-concentrated analysts have a superior capacity to acquire and process industry-specific information, their industrial concentration should be conducive to improving the quality of their earnings forecasts for firms confronted with higher industrial product market competition.

High firm-specific risk and high opacity of corporate information make it difficult for analysts to evaluate the future prospects of firms (Lang and Lundholm 1996; Hope 2003; Dhaliwal et al. 2012). Consequently, analysts might need to refer to the related common information from industrial peer firms to complement the uncertain or opaque firm-specific information so as to maintain their forecast quality. Industrial-concentrated analysts are likely better at exploiting and analysing the information commonality from industry peers. Therefore, the analysts’ industrial concentration should help improve their forecast quality when the uncertainty or opacity of firm-specific information is high. In sum, we hypothesize that the association between analysts’ industrial concentration and forecast quality is positively moderated by industrial product market competition, firm-specific risk, and firms’ information opacity.

Our regression analysis of the three proxies for analysts’ forecast quality—the accuracy, informativeness, and timeliness of forecasts, is based on 21,396, 17,804, and 24,873 analyst-year observations, respectively, covering 5561, 4860, and 6107 US listed companies for the period 1995–2017. We find that a higher level of analysts’ industrial concentration is associated with higher quality of their earnings forecasts. Furthermore, we find evidence to suggest that analysts tend to compromise their forecast timeliness to provide more accurate and informative forecasts for firms that face intense industrial product market competition, high firm-specific risk, or high information opacity.

To mitigate potential endogeneity concerns, we adopt a two-stage least-squares regression technique. Besides, following Larcker and Rusticus (2010), we analyse the impact threshold for a confounding variable (ITCV) to ensure that our regression estimations are not biased by potential correlated-omitted variable(s). We also follow Oster (2019) to check the sensitivity of our main results to unobservable confounder(s). The results from using these approaches elicit the same inferences in support of our expectation that analysts’ industrial concentration promotes the quality of analyst earnings forecasts.Footnote 1

Prior studies provide some evidence on the impact of industry experience on the performance of stock market participants, such as institutional investors (Kang et al. 2018), board directors (Brown et al. 2019), and auditors (Low 2004; Carson 2009; Bills et al. 2015; Petrov and Stocken 2022). Our paper differs from, and adds to, this line of literature in that we look at industrial concentration and analyse its association with the performance of financial analysts, thereby offering implications for analysts in particular and plausibly for stock market participants in general.

Our study shows that analysts’ industrial concentration promotes the quality of earnings forecasts, which is appreciated and favoured by the market. In specific, to the extent that industrial-concentrated analysts are able to provide more accurate and more timely forecasts based on their industry knowledge and experience, their forecasts would likely be valued more by investors. Consistent with this proposition, we find that analysts’ industrial concentration is associated with greater forecast accuracy, higher forecast timeliness, and stronger stock market reactions to forecasts, suggesting that analysts with higher industrial concentration are perceived by investors as being better able to decipher the industry-level implications for future earnings in an accurate and timely manner. Thereby, our study lends support to the extant view that analysts are more sophisticated in processing information than investors (e.g. Bradshaw 2012; Kadan et al. 2012; Brown et al. 2015), and that analysts’ industry knowledge and expertise are highly valued by investors (Bradley et al. 2017a; Brown et al. 2015).

Last, but not least, our findings hold important implications for practitioners. Specifically, given that analysts with higher industrial concentration tend to make more accurate, more informative, and more timely earnings forecasts, analysts might seek to concentrate on a smaller proportion of covered industries to improve their forecast accuracy, forecast informativeness, and forecast timeliness to investors. When following firms subject to intense product market competition, high risk, and high information opacity, analysts might need to take more effort and resources to analyse industry information as well as peer firms’ information for making high-quality earnings forecasts. Investors who refer to analyst research for their investment decision-making may recognize that analysts who focus their firm coverage on a smaller proportion of covered industries tend to provide more accurate earnings forecasts.

The remainder of our paper proceeds as follows. Section 2 reviews the related literature and develops hypotheses. Section 3 describes our data, sample, and variable measurements. Section 4 expounds the research design. Section 5 discusses our results, and Sect. 6 concludes.

2 Literature review and hypothesis development

2.1 The impact of analysts’ industrial concentration on the quality of their earnings forecasts

Analysts’ industrial concentration helps them acquire and process industry information better for forecasting. Based on the limited-attention argument (e.g. Hirshleifer and Teoh 2003; Dechow and You 2012; Hameed et al. 2015; Harford et al. 2018), analysts who focus on a smaller proportion of covered industries might devote more time, effort, and resources to the analysis of industry information of the concentrated industries. This enables them to foster industry knowledge and experience, thereby becoming more specialized and sophisticated in analysing the value implications of industry information for firms. Peer firms in the same industry share information commonality in corporate fundamentals that drive accounting returns and stock returns (De Franco et al. 2014), and the analysis specific to an industry can be applicable to all companies in that industry (Liu 2011). Hence, industrial-concentrated analysts can benefit from economies of scale in information acquisition and processing (Kini et al. 2009; Kadan et al. 2012; Huang et al. 2022). Moreover, the information spillovers among industrial peer firms make it possible to interpret a firm’s performance by the information from its industrial peers (e.g. Piotroski and Roulstone 2004; Frankel et al. 2006; Hilary and Shen 2013). Therefore, an industrial-concentrated analyst, who usually covers a large number of firms in the same industry, can benefit not only from reduced costs of information acquisition and processing but also from improved efficiency in the information analysis.

Extensive research (e.g. Ramnath 2002; Piotroski and Roulstone 2004; Boni and Womack 2006; Brown and Mohammad 2010; Hilary and Shen 2013; Dichev et al. 2013; Brown et al. 2015; Bradley et al. 2017a) provides evidence that industry-specific information is crucial for evaluating firm performance and making earnings forecasts. For instance, Brown et al. (2015) contend that a good understanding of an industry’s key development trends as well as its supply chains, technologies, management, and distribution models is conducive to forecasting future earnings. Chen et al. (2021) find that industry information becomes more important for earnings forecasting after the implementation of Regulation Fair Disclosure (Reg FD), because corporate managers shift from private disclosures of firm-specific information to those of industry information, which is a practice belonging to a legal grey area post Reg FD. Dichev et al. (2013) document that a firm’s financial reporting choices are largely driven by economic conditions specific to the firm’s industry; thus, relevant industry information and knowledge are essential for making valuation judgements from corporate reporting and disclosures. Also, analysts’ industry expertise is considered as a research trait that is valued substantially by institutional investors (Brown et al. 2015). Taken together, provided that analysts with higher industrial concentration are better able to acquire and process industry information, and given the importance of industry information in evaluating firms’ future prospects, it is reasonable to conjecture that analysts’ industrial concentration helps promote the quality of their earnings forecasts.

On the other hand, industrial-concentrated analysts, who usually cover a large number of firms in the same industry, might deem the information analysis of these firms to be overlapping to a substantial degree due to the high similarity of information from the industrial peer firms under the analysts’ research portfolios. As a result, they might unconsciously or imperceptibly under-emphasize firm-specific information and associated analysis, and be overconfident in, and over-reliant on, industry information for their earnings forecasts. Therefore, the quality of the analysts’ forecasts might be impaired as a result of their high industrial concentration. Based on the above discussion, we propose the following two competing hypotheses for empirical tests:

Hypothesis 1a.

Analysts’ industrial concentration increases the quality of their earnings forecasts.

Hypothesis 1b.

Analysts’ industrial concentration decreases the quality of their earnings forecasts.

2.2 Cross-sectional analysis of industrial product market competition

When firms face high industrial product market competition, industry information becomes more important for the evaluation of corporate performance and prospect. On the one hand, there is often a high degree of product homogeneity in competitive industries. As such, disclosures by a firm are likely to reveal information about the demand-side factors affecting the entire industry (Ali et al. 2007). On the other hand, firms in fiercely competitive product markets are often subject to high proprietary costs of disclosures (Verrecchia 1983). They are more likely to deliberately withhold information about their major customers, product innovation, and/or business expansion plans (Ellis et al. 2012; Mattei and Platikanova 2017), leading to higher information opacity and increased difficulty in forecasting earnings. In this case, analysts would rely more on industry information due to the lack of firm-specific information. Moreover, a firm’s prospect in a highly competitive industry is largely contingent on its competitive position in the whole industry rather than on its own performance alone (Gu 2016; Martens and Sextroh 2021). As such, industry information becomes more value-relevant to analysts as they need to analyse a firm’s competitive advantage over its industrial peer firms to evaluate the firm’s future prospect. To the extent that industrial-concentrated analysts have the superior capability to acquire and process industry information, analysts’ industrial concentration should help improve the quality of their earnings forecasts for firms confronted with fiercer industrial product market competition. Therefore, we put forth our second hypothesis as follows:

Hypothesis 2.

The positive (negative) association between analysts’ industrial concentration and the quality of their earnings forecasts, as predicted in Hypothesis 1a (Hypothesis 1b), is more (less) pronounced for firms that face fiercer industrial product market competition.

2.3 Cross-sectional analysis of firm-specific risk and the information opacity of firms

High firm-specific risk and high firm information opacity make firm-specific information itself of greater uncertainty (Barth et al. 2001; Dichev and Tang 2009; Dichev et al. 2013; Dhaliwal et al. 2012). As a result, ceteris paribus, it is more difficult for analysts to provide quality earnings forecasts. Analysts may refer to the information of industrial peer firms to compensate for the uncertainty or opacity of firm-specific information so as to maintain their forecast quality. To the extent that industrial-concentrated analysts have the superior ability to acquire and process industrial information and to exploit the information commonality among industry peers, analysts’ industrial concentration should be instrumental in improving the quality of their earnings forecasts for the firms that have high risk or high information opacity. Accordingly, we put forward our third hypothesis as follows:

Hypothesis 3.

The positive (negative) association between analysts’ industrial concentration and the quality of their earnings forecasts, as predicted in Hypothesis 1a (Hypothesis 1b), is more (less) pronounced for firms that have higher risk or higher information opacity.

3 Data and variable measurements

3.1 Data sources and sample

Prior to 1995, the release dates of analyst forecasts were often not precisely reported in the Institutional Brokers Estimate System’s (I/B/E/S) detail files (Frankel et al. 2006; Clement et al. 2011), so our sample period starts from 1995. In 2018, I/B/E/S started to anonymize the names of brokers and their analysts in its detail history files, which led to 13.8% of broker identifiers (IDs) and 30.7% of analyst IDs being reshuffled, according to the announcement of Wharton Research Data Services.Footnote 2 To maintain the stability and continuation of broker IDs and analyst IDs for our sample period (Law 2023), we end it in 2017.

While data on analyst earnings forecasts are gathered from I/B/E/S, other data are collected from the Centre for Research in Security Prices (CRSP) and Compustat. We require that firms have the necessary data available on these databases to construct the variables of interest for our regression analysis. The independent variables and dependent variables used for our regression estimates are measured in the same year.Footnote 3 To alleviate the problem of outliers, we winsorize all continuous variables at the 1st and 99th percentiles, respectively. Our multivariate tests of the impact of analysts’ industrial concentration on forecast accuracy, forecast informativeness, and forecast timeliness are based on 21,396, 17,804, and 24,873 analyst-year observations, respectively, corresponding with 5,561, 4,860, and 6,107 US listed firms and with 3,225, 3,326, and 3,333 analysts. Panel A (Panel B) of Table 1 reports the distribution of the mean values of analysts’ industrial concentration, forecast accuracy, forecast informativeness, and forecast timeliness across years (industries). Firms in the industry of health, energy, and finance have the highest level of analysts’ industrial concentration.

Table 1 Analysts’ industrial concentration (HHI & HHI1), forecast accuracy (accuracy), forecast informativeness (car), and forecast timeliness (timeliness) across years and industries

3.2 Measures of analysts’ industrial concentration

We use the Herfindahl–Hirschman Index (HHI), developed by Herfindahl (1950) and Hirschman (1945), to construct two measures of analysts’ industrial concentration. Firstly, for each analyst, we compute the square of the ratio of the number of forecasts of earnings per share (EPS) for each covered industry to the total number of her/his EPS forecasts in a fiscal year, and then take the sum of all the squares as our first measure of analysts’ industrial concentration. In line with Rhoades (1993) and Sonney (2009), we use the first two digits of standard industrial classification (SIC) codes for industry classification, and measure the analysts’ industrial concentration by the following HHI model:

$${HHI}_{i,t}=\sum_{j=1}^{J}{\left(\frac{{Sector}_{i,j,t}}{{Sum}_{i,t}}\right)}^{2}$$
(1)

where \({Sector}_{i,j,t}\) is the number of earnings forecasts issued by analyst \(i\) for all the firms in industry \(j\) in year \(t\); \({Sum}_{i,t}\) is the total number of earnings forecasts issued by analyst \(i\) in year \(t\). The value of \({HHI}_{i,t}\) is greater than 0, and less than or equal to 1. The higher the HHI value, the greater the industrial concentration of an analyst. \({HHI}_{i,t}\) equals 1 in cases when an analyst focuses her/his coverage on firms in a single industry. Secondly, we compute the square of the ratio of the number of an analyst’s covered firms for each covered industry to the total number of her/his covered firms in a fiscal year, and then take the sum of all the squares as another measure of analysts’ industrial concentration (HHI1). For construction of the second measure, \({Sector}_{i,j,t}\) in Model (1) is re-defined as the number of firms covered by analyst \(i\) in industry \(j\) and in year \(t\), and \({Sum}_{i,t}\) is re-defined as the total number of firms covered by analyst \(i\) in year \(t\). An analyst is regarded as having a reasonable degree of industrial concentration if the HHI or HHI1 is more than 50%. The proportion of analysts, for whom the value of HHI (HHI1) is over 0.5, is 51.29% (54.12%). Since the proportion of industrial-concentrated analysts is not significantly higher than the other, it is unclear whether analysts in general appreciate the benefits of industrial concentration to firm coverage.

4 Research design

4.1 Test of Hypothesis 1

Hypothesis 1 concerns the relationship between analysts’ industrial concentration and the quality of their earnings forecasts. We measure the quality by the analysts’ forecast accuracy, forecast informativeness, and forecast timeliness. To test the association between analysts’ industrial concentration and forecast accuracy, we employ the following ordinary least-squares (OLS) regression model:

$$\begin{aligned}accuracy&={\alpha }_{0}+{\alpha }_{1}HHI\left(or HHI1\right)+{\alpha }_{2}bsize+{\alpha }_{3}top10+{\alpha }_{4}gexp+{\alpha }_{5}freq+{\alpha }_{6}portsize+{\alpha }_{7}horizon+{\alpha }_{8}size+{\alpha }_{9}price\\&\quad+{\alpha }_{10}stdearnings+{\alpha }_{11}competition\\&\quad+{\alpha }_{12}insti +{\alpha }_{13}liquidity +{\alpha }_{14}salesgrowth\\&\quad+{\alpha }_{15}intanratio+{\alpha }_{16}firmage +{\alpha }_{17}capx\\&\quad+{\alpha }_{18}abtradvol+YearDummies+IndustryDummies+\varepsilon \end{aligned}$$
(2)

where accuracy is calculated as − 1 times the absolute value of the difference between actual EPS and an analyst’s last forecast of annual EPS for a firm for a fiscal year, divided by the firm’s actual EPS at the end of the fiscal year. If analysts’ industrial concentration increases (decreases) their earnings forecast accuracy, the coefficients for HHI and HHI1 should be positive (negative) and statistically significant at conventional levels.

Following prior literature on the determinants of analyst forecast accuracy, we control for two main categories of variables to alleviate potential correlated-omitted-variable(s) bias. The first category comprises analyst characteristics. Analysts employed by large brokers usually have better access to datasets as well as private information of firm management, which contributes to their forecast accuracy (Stickel 1995; Clement 1999). We therefore use broker size (bsize) and top− 10 brokerage (top10) to control for the research resources available to analysts. Furthermore, Jacob et al. (1999) find that analysts improve their forecast accuracy through learning-by-doing. Clement (1999) shows that analysts’ forecast experience enhances their abilities and skills to make accurate forecasts. Hence, we include analysts’ general forecast experience (gexp) and firm-specific forecast experience (freq) in the regression. Kini et al. (2009) find that the sector diversity in the analysts’ portfolios of covered firms adds research complexity and the costs of information processing, making it more difficult to forecast earnings accurately. Therefore, we control for the size of analysts’ portfolios (portsize). We also control for analyst forecast horizon (horizon), because analysts’ forecast accuracy decreases with the number of days between their forecast issuance date and the earnings announcement date (Jacob et al. 1999). The second category of the determinant variables for analyst forecast accuracy comprises firm characteristics, which include firm size (size), stock price (price), earnings volatility (stdearnings), product market competition (competition), institutional stock ownership (insti), stock liquidity (liquidity), sales growth (salesgrowth), intangible assets (intanratio), firm age (firmage), capital expenditures (capx), and abnormal trading volume (abtradvol) (e.g. Lim 2001; Gu and Wang 2005; Dhaliwal et al. 2012; Bradley et al. 2017b; He et al. 2019a).

Next, we test whether analysts’ industrial concentration improves their earnings forecast informativeness, using the following OLS regression model:

$$\begin{aligned}car&={\alpha }_{0}+{\alpha }_{1}HHI\left(or HHI1\right)+{\alpha }_{2}gexp+{\alpha }_{3}freq+{\alpha }_{4}portsize+{\alpha }_{5}bsize\\&\quad+{\alpha }_{6}top10+{\alpha }_{7}size+{\alpha }_{8}price+{\alpha }_{9}stdearnings+{\alpha }_{10}competition+{\alpha }_{11}insti+{\alpha }_{12}liquidity\\&\quad+{\alpha }_{13}salesgrowth +{\alpha }_{14}intanratio +{\alpha }_{15}beta+{\alpha }_{16}btm+{\alpha }_{17}tradingvol +{\alpha }_{18}changeeps\\&\quad+{\alpha }_{19}xrd+{\alpha }_{20}regulated+{\alpha }_{21}changeroa+{\alpha }_{22}abaccruals+YearDummies+IndustryDummies+\varepsilon \end{aligned}$$
(3)

Following Frankel et al. (2006), we define analyst earnings forecast informativeness (car) as the magnitude of the stock price reaction to analyst earnings forecasts. In particular, \(car\) equals the three-day [− 1, 1] cumulative unsigned abnormal stock returns surrounding an analyst’s last forecast of EPS for a firm for a fiscal year. The abnormal stock returns are estimated using a market model with an estimation period of [− 181, − 2] relative to the forecast date.Footnote 4 If analysts’ industrial concentration makes their earnings forecasts more (less) informative to investors, the coefficients on HHI and HHI1 should be positive (negative) and statistically significant at conventional levels.

In line with prior studies on analyst forecast informativeness (e.g. Frankel et al. 2006; Arand et al. 2015; He and Marginson 2020; He et al. 2020), we control for analysts’ general forecast experience (gexp), analysts’ firm-specific forecast experience (freq), analysts’ portfolio size (portsize), broker size (bsize), top− 10 brokerage (top10), firm size (size), stock price (price), earnings volatility (stdearnings), product market competition (competition), institutional stock ownership (insti), stock liquidity (liquidity), sales growth (salesgrowth), intangible assets (intanratio), firm beta (beta), the book-to-market ratio (btm), stock trading volume (tradingvol), change in EPS (changeeps), research and development expenditures (xrd), industrial regulatory status (regulated), change in pre-tax return on assets (changeroa), and abnormal accruals (abaccruals).

To test the impact of analysts’ industrial concentration on forecast timeliness, we employ the following OLS regression model:

$$\begin{aligned}timeliness&={\alpha }_{0}+{\alpha }_{1}HHI\left(or HHI1\right)+{\alpha }_{2}gexp+{\alpha }_{3}fexq+{\alpha }_{4}freq\\&\quad+{\alpha }_{5}portsize+{\alpha }_{6}bsize+{\alpha }_{7}top10+{\alpha }_{8}size\\&\quad+{\alpha }_{9}price+{\alpha }_{10}stdearnings+{\alpha }_{11}competition\\&\quad+{\alpha }_{12}insti+{\alpha }_{13}liquidity+{\alpha }_{14}beta+{\alpha }_{15}btm\\&\quad+{\alpha }_{16}horizon +{\alpha }_{17}firmage+{\alpha }_{18}capx+YearDummies+IndustryDummies+\varepsilon\end{aligned} $$
(4)

Where timeliness is measured, in line with previous research (e.g. Zhang 2008), as − 1 times the natural logarithm of the number of days between the date on which a firm announces its earnings for the current fiscal year and the date on which an analyst issues her/his first earnings forecast since the earnings announcement. Provided that analysts’ industrial concentration increases (decreases) the timeliness of their forecasts, HHI and HHI1 should take on a positive (negative) coefficient that is statistically significant at a conventional level.

We control for analysts’ general forecast experience (gexp), analysts’ firm-specific forecast experience (fexp and freq), analysts’ portfolio size (portsize), broker size (bsize), top− 10 brokerage (top10), firm size (size), stock price (price), earnings volatility (stdearnings), industrial product market competition (competition), institutional stock ownership (insti), stock liquidity (liquidity), firm beta (beta), the book-to-market ratio (btm), analyst forecast horizon (horizon), firm age (firmage), and capital expenditures (capx). All these variables are documented by prior literature (e.g. Louis et al. 2013; Jackson 2005; Guttman 2010) to be related to the timeliness of analyst forecasts.

For the foregoing Models (2), (3), and (4), we include year- and industry-fixed effects; the standard errors of the coefficients are clustered by firms to control for potential correlations of residuals within firms (Petersen 2009).Footnote 5 Detailed definitions of all the control variables are provided in “Appendix 1”.

To mitigate potential endogeneity concerns, we use a two-stage least-squares regression approach. A brokerage house having more analysts following a specific industry is likely to attach more importance to that industry. As a result, the likelihood of an analyst being industrial-concentrated increases. However, the number of analysts of a brokerage house following a specific industry is unlikely to exert a direct effect on analysts’ individual forecast performance.Footnote 6 On this basis, we use bexp as a plausible instrument for the two-stage least-squares regression. It is calculated as the natural logarithm of the ratio of the number of analysts in a brokerage house, who follow a given industry, to the total number of analysts employed by the same brokerage house in a fiscal year. Except for the instrument, the control variables used in the first-stage regression are the same as those included in the second-stage regression (Wooldridge 2000).

We also perform a test of the impact threshold for a confounding variable (ITCV) as per Larcker and Rusticus (2010) to further check whether or not our baseline regression results are biased by potential correlated-omitted variable(s). Frank (2000) defines ITCV as the threshold of the impact of omitted variable(s), beyond which the regression results would be overturned if the omitted variable(s) were controlled in our regression. The larger the value of ITCV, the less possible that our regression results are driven by correlated-omitted variable(s).

We further test the sensitivity of our baseline regression results to omitted variable(s) bias by using a method developed by Oster (2019). Under Oster’s approach, a coefficient of proportionality δ is proposed to capture the importance of unobservable variable(s) relative to that of observed variables in the regression, and to determine the threshold beyond which the bias induced by unobservable variable(s) would nullify the regression results. A value of δ larger than 1 indicates that the omitted variables need to be more influential than the included variables to explain away the regression results, which is almost impossible provided that the major determinants of the dependent variable are controlled in the regression. In such a case, the regression results are unlikely to be biased by potential omitted variable(s).

4.2 Tests of Hypothesis 2 and Hypothesis 3

Hypothesis 2 concerns whether the association between analysts’ industrial concentration and forecast quality is stronger for firms faced with fiercer industrial product market competition. In line with Karuna (2007), we construct a composite measure of industrial product market competition, which is captured by industry-level capital expenditures, industry-level sales, and industry-level operating costs. A high industry-level of capital expenditures or operating costs indicates that firms need to incur high costs in order to maintain a stable supply of competitive products and/or services to the market, suggesting a high level of product market competition. High industry-level sales indicate a high market demand and thus relatively low industrial competition. Hypothesis 3 predicts that firms’ risk or information opacity positively moderates the association between analysts’ industrial concentration and forecast quality. As with previous studies (e.g. Fink et al. 2010; Cao et al. 2008), we measure firm-specific risk by idiosyncratic return volatility. We follow Hutton et al. (2009) to measure a firm’s information opacity as the three-year moving sum of the absolute value of abnormal accruals for the current and previous two fiscal years.

To test Hypotheses 2 and 3, we construct the moderating variables, dcompetition, dfirmrisk, and dopacity, which equal 1 if the moderator variables (i.e. industrial product market competition, firm-specific risk, and firms’ information opacity) are larger than their sample medians, and 0 otherwise. We interact the moderating variables with the variables as to the analysts’ industrial concentration, and include the interaction terms in Models (2–4) for the regression estimation. If the hypotheses hold, the coefficients on the interaction terms should be positive and statistically significant at conventional levels.

5 Empirical results

5.1 Univariate results

Table 2 reports the descriptive statistics of all variables used for our empirical tests. The mean values of HHI (HHI1) are 0.5081, 0.5460, and 0.5384 (0.4888, 0.5271, and 0.5201) for the samples of analyst forecast accuracy, forecast informativeness, and forecast timeliness, respectively, indicating that analysts have the propensity to concentrate their research firms to a smaller proportion of industries they cover. The mean of car, which measures the average abnormal stock returns to the earnings forecasts issued by industrial-concentrated analysts, amounts to 0.1, suggesting that their forecasts are valued significantly by investors. “Appendix 2” presents the Spearman correlations between the variables involved in our baseline regression analyses. accuracy, car, and timeliness are positively correlated with HHI (or HHI1). The correlations amount to 8.5%, 2.8%, and 5.0% (8.4%, 2.9%, and 5.4%), respectively, and are statistically significant at the 1% level. These results provide preliminary support for the hypothesis that analysts’ industrial concentration is conducive to improving forecast quality.

Table 2 Summary statistics

5.2 Multivariate regression results

Tables 3, 4, and 5 report the OLS regression results for the association between analysts’ industrial concentration and the accuracy, informativeness, and timeliness of their earnings forecasts, respectively. The coefficients on HHI and HHI1 are both positive and statistically significant at the 1% level.Footnote 7 A one-standard-deviation increase in HHI (HHI1) leads to an increase in accuracy, car, and timeliness by 0.0366, 0.0024, and 0.0508 (0.0337, 0.0025, and 0.0531), which account for 141.86%, 2.40%, and 1.79% (130.62%, 2.5%, and 1.87%) of the sample means of accuracy, car, and timeliness, respectively. These findings support our prediction that analysts’ industrial concentration promotes the quality of their earnings forecasts in terms of forecast accuracy, forecast informativeness, and forecast timeliness.

Table 3 Test of Hypothesis 1: Analysts’ industrial concentration and forecast accuracy
Table 4 Test of Hypothesis 1: analysts’ industrial concentration and forecast informativeness
Table 5 Test of Hypothesis 1: analysts’ industrial concentration and forecast timeliness

Table 6 presents the two-stage least-squares (2SLS) regression results for Hypothesis 1. In the first-step regression estimations, bexp, which indicates the proportion of analysts at a brokerage who follow a given industry, has a positive and statistically significant relationship with HHI (HHI1). This result is in line with the notion that when a brokerage house has more analysts following a specific industry, the likelihood of an analyst being industrial-concentrated increases. The partial F-statistics for the instrument are all much larger than the cut-off point of 10 and statistically significant at the 1% level, suggesting that our models are not subject to weak-instrument problems (Stock et al. 2002; Larcker and Rusticus 2010). For the second-stage regression results, the coefficients on accuracy, car, and timeliness are positive and statistically significant at the conventional level, suggesting that our results reported in Tables 3, 4, and 5 are robust to correcting for endogeneity. These results for the 2SLS regressions are also economically significant. Specifically, in Column (1a) (Column (2a)) of Panels A, B, and C, a one-standard-deviation increase in the instrument bexp leads to an increase in HHI (HHI1) by 0.0433, 0.0462, and 0.0506 (0.0442, 0.0485, and 0.0514), respectively, which account for 8.52%, 8.46%, and 9.40% (9.04%, 9.20%, and 9.88%) of the sample mean of HHI (HHI1). In Column (1b) (Column (2b)) of Panels A, B, and C, a one-standard-deviation increase in HHI (HHI1) causes an increase in accuracy, car, and timeliness by 0.2800, 0.0168, and 0.2575 (0.2935, 0.0160, and 0.2545), respectively, which are equivalent to 1085.27%, 16.8%, and 9.05% (1137.60%, 16%, and 8.95%) of the sample mean of accuracy, car, and timeliness. Despite the use of the 2SLS regression to tackle potential endogeneity concerns, we do not claim that we have established the causality between analysts’ industrial concentration and their forecast quality. As with prior research (e.g. Huang et al. 2022; Sonney 2009; Jiang et al. 2016) on the determinants of analyst forecast performance, our study focuses on pointing to the association between the two variables of interest, and provides some inferences based on the association.

Table 6 Two-stage least-squares regression analyses of Hypothesis 1

Table 7 reports the results of the tests of the impact threshold for a confounding variable (ITCV). Panels A, B, and C present the results of ITCV tests for Hypothesis 1, for which the dependent variables in the regressions are accuracy, car, and timeliness, respectively. None of the control variables included in our baseline models has an impact with its absolute value larger than the absolute ITCV values of 0.0084, 0.016, and 0.0083 (0.0067, 0.0176, and 0.0091) for HHI (HHI1), indicating that our baseline results reported in Column (1) (Column (2)) of Tables 3, 4, and 5 are not biased by potential correlated-omitted variable(s).

Table 7 Tests of the impact threshold for a confounding variable (ITCV)

Table 8 presents the results of Oster’s (2019) estimate of the proportional degree of selection, δ. The values of δ for the regressions of accuracy, car, and timeliness on HHI (HHI1) are 6.196, 9.872, and 1.527 (5.668, 15.812, and 1.477), respectively, which are all larger than the recommended threshold of 1. Put differently, to attribute our baseline results alternatively to the omission of unobservable variable(s) in the regressions, the unobservable variable(s) would have to be around 6.2, 9.9, and 1.53 (5.7, 15.8, and 1.5) times as powerful as the observed variables in explaining the dependent variable. This is unlikely to take place given our control of a comprehensive list of variables as per prior literature. Therefore, we may rest assured that our baseline results are free from potential omitted variable(s) bias.

Table 8 Oster’s (2019) estimates of the proportional degree of selection

Table 9 reports the results for the cross-sectional analysis of product market competition. The coefficients on HHI*dcompetition (or HHI1*dcompetition) are positive and statistically significant for both the regressions of accuracy and car, but are significantly negative for the regression of timeliness. These results imply that, in covering firms faced with fiercer industrial product market competition, analysts tend to compromise their forecast timeliness to achieve more accurate and more informative forecasts for investors. Table 10 displays the results for the cross-sectional analysis of firm-specific risk. The coefficients on HHI*dfirmrisk and HHI1*dfirmrisk are positive and statistically significant in the regressions of accuracy and car, but are significantly negative in the regression of timeliness. These results suggest that in forecasting firms that confront higher risk, analysts may prioritize provisions of more accurate and informative forecasts over that of more timely forecasts. Table 11 reports the results for the cross-sectional analysis of firms’ information opacity. HHI*dopacity and HHI1*dopacity have positive (negative) and statistically significant coefficients in the regressions of accuracy and car (the regression of timeliness). Thus, analysts’ industrial concentration has a larger positive impact on forecast accuracy and forecast informativeness for firms that have a high level of information opacity. Nonetheless, it takes longer for industrial-concentrated analysts to make forecasts for these firms, meaning that they compromise their forecast timeliness to ensure the higher level of forecast accuracy and informativeness for investors.

Table 9 Test of Hypothesis 2 as to the cross-sectional analysis of industrial product market competition
Table 10 Test of Hypothesis 3 as to the cross-sectional analysis of firm-specific risk
Table 11 Test of Hypothesis 3 as to the cross-sectional analysis of firms’ information opacity

6 Conclusion

This study examines the association between analysts’ industrial concentration and the quality of their earnings forecasts. Our results reveal that analysts’ industrial concentration increases their forecast accuracy, forecast informativeness, and forecast timeliness, which lends support to the view that allocation of effort and resources to the analysis of industry-specific information is conducive to making quality forecasts. We further find that the positive association between analysts’ industrial concentration and the accuracy and informativeness (the timeliness) of their earnings forecasts is more (less) prominent in cases when firms are subject to fiercer industrial product market competition, higher firm-specific risk, and/or higher information opacity. This suggests that the benefits of industrial concentration are more salient for firms in those cases, but that the analysts need to spend more time making more accurate and more informative forecasts.

Our findings illuminate the benefits and importance of industrial concentration to financial analysts and have important practical implications. In specific, analysts may opt to concentrate their equity research on a smaller proportion of industries they cover in order to make quality forecasts. For firms faced with fiercer industrial product market competition and/or higher firm information opacity, analysts should allocate more effort and resources to the analysis of industry information by industrially concentrating their firm coverage. Also, it is important for analysts to make a great effort to foster industry knowledge and expertise, which are conducive to improving their forecast quality and thus advancing their career developments. For investors who seek analyst reports for their investment decision-making, it would be helpful to refer to the forecasts issued by industrial-concentrated analysts. Their higher-quality forecasts are beneficial not only to the investors but also for the strategy formation process and monitoring of firms (e.g. zu Knyphausen-Aufseß et al. 2011; He et al. 2019b).

Lastly, prior research (e.g. Huang et al. 2022; Sonney 2009; Jiang et al. 2016; Hirshleifer et al. 2019) which focuses on analysing the association between analyst forecast properties and their determinants does not account for potential endogeneity concerns. In contrast, we go to great lengths to address the concerns in our analysis, but we concede that they are not eliminated completely, which remains a limitation of this study.