1 Introduction

Star scientists have long been recognised as catalysts for knowledge creation within academia. There is also a growing policy interest in smaller economies and regions in increasing their connections to academic star scientists as a means to catalyse dynamic local research clusters. Although explicit policy programmes have been implemented, there is limited understanding of the mechanisms through which star scientists have an effect on local scientific productivity. One candidate mechanism is the direct effect of help from star scientists to incumbent local scientists, even where an explicit co-authorship relationship does not exist. Existing studies have generally not focused on the impacts of star scientists in smaller economies and have tended to overlook modes of knowledge transfer beyond co-authorship. Moreover, the importance of network building, both within and across institutions, emerges as a strategy for facilitating access to star help and unlocking its potential for productivity enhancement. This research contributes insights into knowledge and technology transfer by providing new empirical evidence on the knowledge transfer mechanisms facilitated by star scientists, and offering actionable recommendations for improving academic and research management practices in small open economies. This paper addresses these gaps with new empirical evidence on knowledge transfer that occurs through a star-help mechanism in small economies, where star help is indicated by acknowledgements to various forms of star help in the acknowledgement sections of published journal articles. To broaden the evidence base and recognise that developing connections to international star scientists has been a particular policy focus for smaller open economies (SOEs), our data are drawn from academic departments in Ireland, Denmark and New Zealand. As we use the term in this paper, 'star scientist' refers to those individuals who are highly productive in terms of their publications and citations and are located chiefly at a university (Schiller & Diez, 2010; Zucker & Darby, 2006).

Scientific collaboration through formal (e.g., co-authorships) or informal means (e.g., discussions and helpful comments) connects a star with other scientists. These interactions could play an essential role in generating knowledge spillovers. This paper focuses on one particular channel of star impact and aims to investigate the effects of the star's help on their colleagues' productivity. Helpful interaction—either with co-located or non-co-located scientists—is a relevant and usually neglected channel for learning and knowledge exchange in socially interactive environments. Interactions with stars, even if they are informal, influence behaviour, research topics, collaborations, and the productivity of scientists (Bramoullé et al., 2020). More frequent and long-lasting interactions impact the extent of knowledge overlap (Nooteboom, 2000), facilitate learning processes, and enhance knowledge creation through the recombination of existing ideas between stars and scientists (Nelson & Winter, 1982; Weitzman, 1998). The empirical challenge is that evidence of help does not leave a paper trail; however, explicit acknowledgements to help from a star provide a (noisy) indicator of star interaction (Bozeman & Corley, 2004; Oettl, 2012). We broadly interpret such acknowledgements as evidence of helpful interaction. From acknowledgements in the publications, we use natural language processing (NLP) techniques to identify the star scientist names that are being acknowledged. We then draw on recent advancements in econometric techniques to estimate the productivity impacts of star scientists' help on the scientific productivity of the acknowledging authors. We also explore the effects of different kinds of star help by categorising acknowledgements into five types: Conceptual, Technical, Material, Funds & Support, and Other Types.

The basic idea behind the empirical approach we take in the paper is that help (in various forms) from star scientists can raise the productivity of the assisted scientists and that an indicator of the presence and form of such help can be gleaned from the acknowledgement texts of publications. Our core measure of productivity is (forward) citation-weighted publications in a given year. However, a key empirical challenge is that an observed correlation between acknowledgements and productivity might not reflect a causal productivity effect. An alternative possibility is that an acknowledgement leads to a signalling effect, whereby a publication with a star acknowledgement is more likely to be cited due to a perceived endorsement from a star. To distinguish between the productivity and signalling hypotheses, we repeat the analysis using a measure of raw publications as our dependent variable. Assuming blind refereeing, where referees do not have sight of the acknowledgement texts, raw publications should not be contaminated by a signalling channel. We can, therefore, attain some confidence that we are observing a true productivity effect if the results are robust to switching to a citation-insensitive dependent variable. We find similar qualitative dynamic patterns across the two dependent variables.

Our data consists of all Scopus-recorded academic publications by authors affiliated with a university in the three SOEs across 27 subject fields for the period 1990–2017. Scopus provides acknowledgement data under the 'Funding Text' heading. Using this pool of acknowledgement data, we identify star names and their type of helpful interaction for each publication. We classify the type of acknowledgements based on the keywords often used to acknowledge the star's contribution to the publication. This help by the star can take many forms: an author might acknowledge the star for helpful discussions throughout research; there may be an acknowledgement to comments received during a seminar or a conference; there may be acknowledgements to various forms of technical assistance, especially in the fields of biochemistry, pharmaceuticals, and medicine; or there could be theoretical or mathematical help received during the analysis stages of research. It is important to note that these types of help are not formally captured as a recognised co-authorship. The co-authorship channel has been a major focus of the literature exploring the effects of star connections on productivity (see, e.g., Oettl, 2012). In this paper, we explore the effects of star help where a formal co-authorship relationship does not exist. Our paper thus extends the literature by exploring the productivity effects of star help even where the star does not become a co-author. We investigate both the immediate effects of observations of acknowledgements of star help and any enduring effects over time using a dynamic event-study approach.

Using this event-study methodology, where the event is the acknowledgement by the author to a star in the publication, we test a number of hypotheses relating to the effect of star help on the acknowledging scientist’s productivity. We employ a coarsened exact matching (CEM) of star-acknowledging and non-star-acknowledging scientists to create a similar treatment and control group. The findings support the hypothesis of a positive productivity effect and find an interesting dynamic pattern of productivity effects following an acknowledgement to a star. First, the results show a positive star-help effect on the authors who acknowledge a star, with the impact on productivity evident in the year of publication but largely disappearing in the subsequent years. Second, the results indicate that an author who continues to acknowledge a star in the years after the initial interaction has a higher and more persistent productivity effect. The results are robust to using raw publications as the dependent variable. In considering the types of acknowledgements, we find a significant positive but transitory effect on author productivity in the year of acknowledgement across all types, except materials where the effect is shown to be more persistent. We speculate that materials-related acknowledgements may reflect more enduring relationships that are not repeatedly acknowledged in publications over time. Finally, we find that the estimated productivity impact of star help is greater for assisted authors that are in the lower quartiles of the relevant field-specific productivity distribution in the year before the (initial) acknowledgement occurs, suggesting that less productive authors get greater benefit from a star’s help.

This paper contributes to the literature in several ways. First, the paper investigates the impacts of star help on peer productivity. Prior research mainly focuses on the star scientist's proximity with limited consideration of the precise mechanisms involved (Agrawal et al., 2017; Azoulay & Zivin, 2005). Second, in contrast to the existing literature, the paper focuses on the effects of star help in small open economies. This fills a significant gap in the literature, given the importance that policies to improve connections to star scientists (including but not limited to star recruitment policies) play in the policy mix of smaller national and regional economies. Third, the paper identifies non-co-authorship-based star connections using acknowledgements in the publication, where we hypothesise that such non-co-authorship-based interactions have an impact on a scientist's productivity. Moreover, the paper differentiates these informal interactions into Conceptual, Technical, Material, Funds & Support, and Other Types. Finally, the paper employs the growing literature on heterogeneous treatment effects accounting for staggered treatment by using Sun and Abraham's (2021) estimator. This allows us to consistently estimate the effects of interest in the potential presence of heterogeneity across both cohorts and periods.

The remainder of the paper is structured as follows. Section 2 reviews the related literature and describes the framework and hypotheses. Section 3 describes the data and the use of NLP in extracting acknowledgement data from the acknowledgement texts. Section 4 outlines the event-study-based empirical framework used to identify the causal effects of star help. Section 5 reports the results, including various robustness tests for the analysis. Finally, Sect. 6 concludes with a review of the main findings and their implications for science policy.

2 Related literature and hypotheses

2.1 Related literature

A key determinant of the growth of an economy is the accumulation and transmission of knowledge (Romer, 1986, 1990). Romer identifies two processes, human capital spillovers and R&D investments, where knowledge accumulation and spillovers happen. Human capital spillovers benefit others in the economy through increased productivity and innovation. These innovations are often a result of interaction between individuals with a considerable stock of previously accumulated knowledge. As potential key figures in this knowledge spillover process, star scientists may play a central role in the workflow structure due to their unique expertise and social status (Paruchuri, 2010). These spillover agents are often highly skilled pioneers in their fields who induce knowledge flows across organisational and regional boundaries and affect ongoing research projects inside academia and firms (Bergman & Schubert, 2005; Maier et al., 2007).

Waldinger (2012) and Azoulay et al. (2010) discuss the adverse impacts of losing ties with a star on collaborators. They find a decline in productivity and quality-adjusted publications for those affected by a star's externalities. Zucker and Darby (2006) find a positive impact of star scientists in start-up biotech firms. In these studies, researchers focus more on the impact on the productivity distribution rather than the mechanism enabling the knowledge flow. Further studies have provided evidence that the influence of a star might depend upon the collaborative strength and breadth of the star's expertise (Kehoe & Tzabbar, 2015) and have documented significant long-term effects on the performance of junior researchers (Li et al., 2019).

Although the presence of stars could catalyse the development of peers' scientific output, helpful interactions are essential for some knowledge transfer mechanisms. First, human capital externalities generate knowledge flows that must be recognised as valuable, then assimilated and recombined with previous knowledge to generate new scientific knowledge. In other words, to take advantage of stars' helpfulness, scientists have to increase their absorptive capacity and their capacity to apply knowledge to their research projects (Cohen & Levinthal, 1990). Although a certain level of previous related knowledge is required for learning to take place (Cohen & Levinthal, 1990; Torre & Rallet, 2005), recurrent helpful interactions and the resulting knowledge exchange adjust the optimal level of knowledge overlap between scientists over time (Nooteboom, 2000), which is fundamental to facilitate the transfer of knowledge (Boschma, 2005).

Second, these interactions also take advantage of spatial proximity to transmit formal and tacit knowledge forms (Bathelt et al., 2004). Scientists working in the same department go through face-to-face (F2F) communications, allowing efficient transmission of complex and tacit knowledge (Gertler, 2003; Storper & Venables, 2004). However, organised proximity, i.e., belonging to the same community of practice or sharing the same system of representations, could alleviate the requirements of being co-located for knowledge transfer and learning to occur. Information and communication technologies and temporary co-locations such as meetings and academic congresses could solve communication problems (Torre & Rallet, 2005). In the case of star help, both channels (F2F and organised proximity) are helpful for knowledge transfer that affects the productivity of incumbent scientists.

Finally, information sharing is the key to scientific progress in any spillover mechanism. Merton (1973) defines the norms of unconditional knowledge-sharing as one of the critical processes in academic life. At the same time, there is tension among academic individuals to share their valuable information due to incentives associated with the research (Blumenthal et al., 1996; Murray, 2010; Walsh & Cohen, 2008). Haeussler et al. (2014) consider two possibilities in which such information sharing happens: specific sharing, where researchers share their private work on a specific request, which in turn will get recognised for future accomplishments, and general sharing, where such information is shared publically, such as sharing unpublished data, materials, etc.

This study discusses how star help affects scientists' productivity. Through this particular mechanism, the absorptive capacity of scientists is enhanced, the optimal level of knowledge overlap is adjusted through repeated social interactions, and knowledge sharing is facilitated. However, while networks can positively affect productivity, their specific nature can also play a crucial role in determining their impact. For example, a study by Horta et al. (2010) found that academic inbreeding, or hiring PhD graduates as faculty members at the same university where they received their degree, can harm scholarly output. Of course, although understudied, informal interaction is just one example of a network-intermediated productivity effect. More broadly, networks help recombine specialised ideas and knowledge transformation (Groysberg et al., 2008; Rothaermal and Hess, 2007). Another widely studied network effect is co-authorship. Grigoriou and Rothaermel (2014), Agrawal et al. (2017), and Yadav et al. (2023) find increased productivity gains and show very high positive spillovers of co-authorship with a star. Also, the network position of the co-author is crucial for the productivity that facilitates access to nonredundant knowledge (Mohnen, 2022). However, the impact of stars on productivity through non-co-authorship-based network channels remains comparatively understudied.

One important exception to the neglect of the helpful interaction channel is Oettl (2012). He defines a taxonomy of stars based on these social interactions: maven (highly helpful and average productivity), all-star (highly helpful and highly productive), lone wolf (highly productive and less helpful), and non-star. Following Oettl's approach, we use acknowledgement texts in publications to provide a paper trail on helpfulness activities. Other studies have also utilised the information available in acknowledgement texts. Mackintosh (1972) classifies acknowledgements based on the facilities, access to data, and help of individuals. McCain (1991) classifies acknowledgements from experimental papers in genetics into five typologies: access to research-related information, unpublished results or data, peer interactive communication, technical assistance, and manuscript preparation. Cronin (1991) (later modified in Cronin et al. (1993)) introduces six typologies for acknowledgements: access, peer interactive communication, moral support, technical support, clerical support, and financial support. These studies focus on the typology of acknowledgements, while our focus is on how different types of acknowledgements are associated with the observed productivity effect on the acknowledging author. Moreover, we build on this literature by using observed acknowledgements as not just thank-you notes but indicators of important productivity-affecting activities that are distinct from co-authorships and citations (Cronin, 1991; Paul-Hus & Desrochers, 2019).

Nevertheless, the information in the acknowledgements of academic publications should be applied with caution and with sensitivity to context. Using acknowledgements as a sign of helpful interaction with a star scientist could be either a sign of intellectual debt or a signalling effect. For example, along with co-authorship, acknowledgements could also reveal other forms of collaboration (Laudel, 2002). The content of such acknowledgements is intended to repay debt towards formal and informal collaboration. In addition, acknowledgements could reflect personal relationships and individual preferences. Hellqvist (2010) conducted a qualitative analysis of acknowledgements in sociology journals and suggested that personal style, editorial guidelines, cultural norms, and ethical principles influence the pattern of acknowledgements.

The literature also considers the various motivations for acknowledging prior work in a paper. Berg and Faria (2008), for example, argue that the scientist names chosen in acknowledgements are based on the effect they may have on readers. We, therefore, recognise that acknowledgement of star help is a noisy indicator of actual star help but assume there is a sufficient signal in the acknowledgement data to make them useful in the estimation of star-help effects on research productivity. Furthermore, the type of helpful interaction also matters to those who receive such help from star scientists. As argued by Oettl (2012), the type of helpfulness being acknowledged is also relevant to the productivity effects of these acknowledgements, and we also distinguish between different types of star help in our empirical analysis.

2.2 Framework and hypotheses

We conceptualise a star scientist as a scientist who is usually productive and connected within their network. From the point of view of a non-star scientist, we assume, following the literature cited above, that forming a connection with a star scientist changes their network position and potentially their productivity. As an example, a scientist will obtain access to knowledge through their network, with their eigenvalue centrality within their network being one commonly used measure of how easily they can access knowledge (Newman, 2018). Forming a connection with a star will change their centrality and thus their access to knowledge. As another example, forming a collaborative connection with a star can directly increase productivity and may have further indirect effects through access to the star's broader collaborative network.

This paper focuses on connections to a star that deliver potentially productivity-improving star help, separate from any direct effect through co-authorship. We think of such star help as augmenting the scientist's inputs into their production of scientific outputs (as measured by citation-weighted publications). Figure 1 captures schematically the relationship between inputs and outputs, where star help is viewed as augmenting the scientist's inputs. We also allow for the possibility that a positive effect of observed star acknowledgements could reflect a signalling effect, where the positive effect on subsequent output—particularly citations to a publication—reflects a signalling (or reputational spillover) effect from the star.

Fig. 1
figure 1

Star help and observed scientific productivity

Our empirical approach is to look for evidence of the presence and type of star help in the acknowledgement text of publications. By carefully matching each scientist that is “treated" with star help to an observationally similar scientist who is not observed to receive such help, we use an event-study framework to estimate the dynamic effects of treatment.

Our first hypothesis is that an observation of acknowledgement for star help is associated with a contemporaneous increase in star productivity, with productivity measured by citation-weighted publications.

Hypothesis 1a

The observation of an acknowledgement for star help is associated with an increase in citation-weighted publications in the year of acknowledgement.

In addition, an important advantage of our event-study framework is that we can observe the effects of an acknowledgement for star help in a given year on the evolution of productivity over time. This leads to our second hypothesis:

Hypothesis 2a

Repeated observations of acknowledgements for star help are associated with a sustained increase over time in productivity as measured by citation-weighted publications.

A significant challenge is disentangling the input-enhancing role implied by observed star acknowledgements from signalling effects. On the assumption that acknowledgement texts are not observable to referees, raw publication data (i.e., output not weighted by subsequent citations) should not be contaminated by signalling effects. This leads to the following hypotheses based on a simple raw publications measure of output:

Hypothesis 1b

The observation of an acknowledgement for star help is associated with an increase in raw publications in the year of acknowledgement; and.

Hypothesis 2b

Repeated observations of acknowledgements for star help are associated with a sustained increase over time in productivity as measured by raw publications.

Finally, acknowledgements to a star scientist can be classified by type based on the keywords in each publication's acknowledgement text. These keywords assist with identifying the nature of the help between the author and star, leading to our third hypothesis that the effect of star help on an author's productivity will vary across the types of helpful interaction with the author.

Hypothesis 3

The observation of an acknowledgement for star help is associated with an increase in citation-weighted publications in the year of acknowledgement, with the magnitude of the increase potentially varying across the different types of helpful interaction.

3 Data and methodology

Our dataset consists of all the publications and their citations across the 27 subject fields identified by Scopus, where each publication contains at least one author affiliation recorded in Ireland, Denmark, or New Zealand. As discussed previously, Ireland and Denmark are chosen because they are small open economies with nationally funded star recruitment programmes, while New Zealand is chosen because it is also a small open economy with no formal star recruitment programmes. We collect variables at the publication level for the year of publication, authors, affiliations, subject field, citation count to 2019, references, abstracts, keywords, and acknowledgements. We identify authors and publications using unique Scopus identifiers. The dataset contains approximately 1.43 million publications divided over 219,582 unique authors. The unit of observation is at the author level rather than the publication level, and the dependent variable tracks the author's productivity before and after the star help revealed from the acknowledgement texts.

To help capture an author's performance over time, we restrict our analysis to the authors present in one of the three countries for at least four years. For each author who satisfies this condition, we access their catalogue of publications dating back to 1990, including those outside their affiliations in the three focal countries. The final dataset contains an unbalanced panel of 889,479 unique publications divided over 59,122 authors across the years. Using the affiliation data of the author, we determine the department (defined based on the Scopus definition of subject fieldsFootnote 1). If an author publishes in more than one subject field, we assign them to the department in which they have the most publications.

3.1 Identification of a star scientist

Given that this study focuses on the impact of star scientists on their peers through channels of knowledge spillover identified by the acknowledgement text in an author's publication, the first step is to identify the star scientists in our dataset. The dataset contains the citation distribution of each author from 1990 for each subject field in any given year. A star is defined as one who is at or above the 95th percentile of scientists in the cumulative distribution of citations for the relevant subject field received for all publications up to that year. In measuring the citations to any given publication, we measure all citations up to the end of our observation period in 2019, so our publication quality measure depends on the subsequent citations to that publication.

We only identify star scientists from 1996 onward to allow enough time for the accumulation of citations, given that our publication data began in 1990. Similar to the department allocation for each author, we determine the department for each star as the department in which they had the most publications. The star scientist identified is relative to scientists in Denmark, Ireland, and New Zealand. Hence, these stars are highly productive scientists in the context of the overall distribution of scientist citations for these three countries. Through this method, we identify 981 stars from the dataset, 205 in Ireland, 548 in Denmark, and 228 in New Zealand.

3.2 Identification of star names in the acknowledgements

We identify star names and their author interaction type from the pool of available acknowledgement text. Since the study focuses on the impacts of star scientists on their peers through informal collaborations, we think of the influence of these interactions as occurring through intense discussions, critical reviews, data sharing, collaborations, and supervision. Broad forms of funding texts are available in Scopus, which is translated into acknowledgement texts. A limitation of the Scopus database is that the acknowledgement text is not available for every publication. Table 14 in the "Appendix" outlines the yearly distribution of publications and available acknowledgement texts in Scopus.

Table 1 Acknowledgement types that define the types of a star help

Using a Natural Language Processing (NLP) tool, we extract the namesFootnote 2 of scientists from our acknowledgement text data. We use the open-source library, Spacy, an advanced NLP tool in Python designed to process a large volume of text data. More specifically, we utilise Spacy's Named Entity Recognition (NER) feature to extract named entities from each acknowledgement text. Named entities are phrases that contain the names of persons, organisations, and locations. Spacy's NER feature has reported an accuracy of 89.8 (OntoNotesFootnote 3) and 91.6 (CoNLL-2003 corporaFootnote 4) percent and compares well to other packages available for NLP. The tool helps to token the words in an acknowledgement text, and we group them using a unique publication identifier. Using the NER feature, we tag the author entities in each publication's acknowledgement text, and then we match these names to those of the star scientists identified in Sect. 3.1. Overall we identify 331 stars acknowledged over 971 publications by 1815 authors.

3.3 Identification of acknowledgement types

While prior studies primarily focus on the typology of the overall acknowledgement text (Cronin, 1991; Desrochers et al., 2018; Paul-Hus et al., 2016, 2017), we focus on the types of interaction between a star and an author. Using a similar approach as Oettl (2012) we identify the type of star help based on their helpfulness. Moreover, these acknowledgement types are classified in our study based on the available keywords that determine the star help in our data. Here we define five types of acknowledgements to a star who is not a co-author: Conceptual, Technical, Materials, Funds & Support, and Other Types. Each acknowledgement type indicates the interaction between an author and a star during the research project.

Conceptual acknowledgements recognise star scientists for their intellectual contribution to the research project. For example, "We thank P. Di Vecchia, G. Grignani, C. Kristjansen, and N. Obers for useful discussions" (Harmark and Orselli, 2006) and "I would like to thank Prof. P. Sigmund for stimulating and enlightening discussions of the topic and useful comments on the manuscript" (Glazov, 1998). Using keywords, we identify this type of interaction between a star and the author. Here, keywords such as "discussions," "comments," "feedback," "critique," "advice," "suggestions," are used. Keywords that categorise help in the technical form are "technical assistance," "statistical assistance," "excellent assistance," and "expert assistance." For example, "We thank P. Rasmussen, B. Jensen, and D. Bardenfleth for expert technical assistance" (Sorensen et al., 2010). In this example, P. Rasmussen is the star, and the author acknowledges his help through their technical assistance.

Material acknowledgement shows the debt of gratitude for materials and data shared by the star for the research purpose. Keywords used to identify interactions based on this specification are "data sample," "antibodies," and "cells." For example, "Professor Klaus Bendtzen is thanked for providing antibodies for the cytokine measurements" (Theander et al., 1997). Another acknowledgement type captures the author's "funding and supports" through the star interaction. Keywords such as "grants," "funds," "support," and "financial." are used to identify the interaction with the stars. For example, "This research was sponsored through the contestable research fund of the Waikato Management School of the University of Waikato and Professor Chris Ryan, Waikato Management School, for his continued support" (Lockyer, 2005). Finally, acknowledgements using the keywords such as; "committee," "contributions," "permission," "facilities," "director," "supervisor," "founder," "help," "dedication," are classified as "Other Types." For example, "The study was initiated by Torben Jørgensen, DMSc (PI); Knut Borch-Johnsen, DMSc (Co-PI); Troels Thomsen, Ph.D., and Hans Ibsen, DMSc" (Baumann et al., 2015). A similar approach is adopted by Paul-Hus and Desrochers (2019) to classify acknowledgement types as 'vague.'

Table 1 presents the variation across the different types of acknowledgements by 1815 authors to 331 stars over the period 1990 to 2017. Star help classified as the conceptual type constitutes 37% of the overall acknowledgements, where the subject areas: Agricultural and Biological Sciences; Biochemistry; Medicine; Physics; and, Astronomy account for 62% of conceptual acknowledgements to a star scientist. Star help based on technical acknowledgements occur primarily in the subject areas: Agricultural and Biological Sciences, Biochemistry, and Medicine, where the research in these departments can include considerable laboratory work as well as the use of complex instruments. Material acknowledgements account for 14% of the general acknowledgements and are mainly observed in the fields of Medicine and Biochemistry. Only 7% of the acknowledgements to star scientists in our sample show an acknowledgement to a star for the funding the author received.

3.4 Output measure

We use two output measures to test our hypotheses discussed in Sect. 2.2: (i) Field Normalised Total Citations (FNTC)—the sum of an individual's publication citations divided by the average citations to a publication for that subject field in that year for all countries combined; and (ii) total count of normalised raw publications of the individual author in that year. We calculate the dependent variable as follows:

$$\mathrm{Field\,Normalized\,Total\,Citations}:Y_{i,t}^{FNTC}=\sum_{{p}_{i,t}=1}^{{P}_{i,t}}\frac{{c}_{{p}_{i,t}}}{{\overline{c} }_{s,t}};$$
$${{\text{Publications}}: Y}_{i,t}^{P}=\frac{{P}_{i,t}}{{\overline{P} }_{s,t}};$$

where \({P}_{i,t}\) is the total number of publications by individual \(i\) published in year \(t,\) \({c}_{{p}_{i,t}}\) are the subsequent total citations (or "forward" citations recorded in 2019) to a publication \({p}_{i,t}\) that occur for individual \(i\) in year \(t\), \({\overline{c} }_{s,t}\) is the average citations to a publication in the relevant subject field, \(s\), for publications that occur in year \(t\), and \({\overline{P} }_{s,t}\) is the average number of publications in subject field \(s\).

These output measures provide the dependent variables for our regressions. Using both dependent variables, we use the author's publication data from 1996–2017 (our estimation window) to calculate the overall impact of the star's helpful interaction. In our analysis, we split the individuals into two subgroups—Treated: Authors who acknowledge the star for the helpful interaction for their publications and who did not co-author with the star before the acknowledgement; and Never-Treated: Authors who received neither helpful interaction nor a star co-authorship interaction. Section 3.5 below discusses the matching procedure to identify matched pairs of treated and control authors before the treatment event happens.

3.5 Coarsened exact matching procedure (CEM)

Matching treated units to control units in observational data helps to mitigate the confounding influence of pretreatment control variables with the primary goal of improving the balance between the treated and control groups. We, therefore, employ matching to help control for the endogeneity issue that might arise in acknowledging a star. For example, acknowledging a star can be considered random in the cases such as tips and help received from a seminar series or international conferences. At the same time, prior connections with the author also involve the star being a part of the research as an informal contributor. To address this, we create a panel data which has a control group comprising authors who never received any star exposure in terms of star's helpful interaction or co-authorship and a treated group—a similar set of authors in terms of characteristics who received star help that is identified from the acknowledgement texts of the publications.

In our study, we ensure that the treated authors had no star interaction in terms of star co-authors before or during the event of the star’s helpful interaction. To minimize the effect of confounding in our observational causal inference, we employ Coarsened Exact Matching (CEM), which is a monotonic imbalance-reducing matching method (Blackwell et al., 2009; Iacus et al., 2011; Iacus et al., 2012). CEM is a design strategy that involves matching on a set of covariates that have been “coarsened,” meaning that they have fewer possible values for matching, which increases the number of matches (King et al., 2019, 2011). This technique has been shown to improve the balance of covariates between exposure groups and to guarantee balance for each covariate, only limited by the coarseness of the grouping (Fini et al., 2018). In contrast, other matching techniques, such as Propensity Score Matching (PSM), do not guarantee each variable such a balance guarantee. They may require repeated iterations to achieve balance (Fini et al., 2023). Moreover, CEM ensures balance for higher-order terms, such as interactions of covariates, while such a guarantee does not exist in a propensity score approach.

We find an author in the control group at \({\text{t}}-1\) year who matches similar characteristics (based on the observables) with an author from the treated group at \({\text{t}}-1\) who receives star help at year \(t\). The CEM procedure allows us to define the covariates to match a categorical variable rather than a continuous variable. We identify a match based on five covariates: subject field, country, total career age, cumulative publication experience, and cumulative citations received on prior publications. Each covariate uses a categorical variable with course binsFootnote 5 (Iacus et al., 2012). Table 2 shows the difference in the mean value and p-value of each matching variable for both control and treated groups one year before the star interaction occurs. The unbalanced matched panel data set contains 1258 authors, with 629 authors in the treated group who acknowledge a star for the first time. Furthermore, 75 of these authors acknowledge the stars in the years after the first acknowledgement. Similarly, for the five cohorts based on acknowledgement types: Conceptual, Technical, Material, Funds & Support, and Other Types, we use the CEM matching procedure to identify matching pairs of control and treated author. Table 16 in the "Appendix" reports the matching statistics of matched pairs for each cohort.

Table 2 Summary statistics: Control and Treated group (k-to-k matched)
Table 3 Dynamic star help effects for field normalised total citations

4 Econometric methodology

The empirical goal of our econometric analysis is to estimate the effects of helpful interactions with star scientists on the productivity of authors over time, where we capture the interactions from the acknowledgement texts of the author's research publications. As we discussed, a star might impact the productivity of their peers through various channels, and our focus is to identify whether an author's informal non-co-authorship interactions with a star could affect their productivity measured in terms of (forward) citations to their publications (hypothesis 1) & raw publications output (hypothesis 2).

To measure these dynamic effects, we utilise an 'event-study' specification where the event is an acknowledgement to a star scientist that can occur in the past or future. Our event-study design is a staggered adoption design where units (authors) are treated at different times, and some units have never been treated. We estimate the dynamic treatment effects of the helpful interaction with a star on their peers from three years before the published acknowledgement to three years afterwards in the following event study specification:

$$ln{Y}_{it}= \alpha + {\beta }_{\le -4}{staracknw}_{i,-4}+ \sum_{j= -3}^{-2}{\beta }_{j}{staracknw}_{ij}+ \sum_{j= 0}^{3}{\beta }_{j}{staracknw}_{ij}+ {\beta }_{\ge 4}{staracknw}_{i,4}+{\delta }_{t}+ {\mu }_{i }+ {\epsilon }_{it},$$
(1)

where the dependent variable \(ln{Y}_{it}\) is a measure of the citation-weighted/normalised raw publication output of author \(i\) at year \(t\), \(sta{racknw}_{ij}\) is a binary variable equal to 1, if a star is acknowledged by the author \(i\) as of year \(t\),\(j\) years ago, \({\delta }_{t}\) is a year fixed effect, \({\mu }_{i}\) is an author fixed effect, and \({\epsilon }_{it}\) is a zero mean error term. The coefficients of interest, \({\beta }_{j}\), show the proportionate effect of the helpful interaction with the star on productivity from three years before the acknowledgement to three years afterwards. We normalise the effect of the star interaction to zero for the year before the star acknowledgement, and we assume that the cumulative effect is constant at \({\beta }_{\le -4}\) and \({\beta }_{\ge 4}\) by binning at four leads and four lags. The binning variables may not be comparable to the leads and lags of the acknowledgement binary variables in estimating the dynamic effects since they could be correlated with other excluded level variables; however, they act as essential controls in our specification (Schmidheiny and Siegloch 2019). Furthermore, standard errors are clustered at the author level and are robust to arbitrary forms of serial correlation and heteroscedasticity. McHale et al. (2022) adopt a similar econometric approach that estimates the effects of star arrival on the departments' productivity in SOEs.

An important assumption of our event study model is the generalised form of the parallel trends assumption, whereby without a star acknowledgement, the quality-adjusted output in the treatment group would have changed in the same way as it did in the non-treatment group (the authors that did not acknowledge a star scientist). The estimated lead coefficients in our specification allow us to examine indirect evidence to support the parallel trends assumption, with any observed pre-acknowledgement effects considered evidence of a failure of this assumption. On the other hand, an observed pre-trend could indicate anticipation effects of star acknowledgement on productivity; for example, if an author had prior knowledge of possible helpful interaction with a star in the future, they might change their productivity behaviour. However, in our case, this is considered to be unlikely since the author acknowledges a star for his helpfulness which could not be anticipated before the interaction, and therefore, we assume that anticipation effects are zero.

Another critical assumption in this event-study setting is that the star acknowledgement effect on author productivity is homogeneous across the timing of the acknowledgements. However, recent literature has shown that the coefficients of given leads and lags can be contaminated by the effects from other periods in the presence of heterogeneous effects across different treatment timings (Callaway and Sant’Anna, 2021; Sun & Abraham, 2021; Goodman-Bacon, 2021). In our econometric analysis, star acknowledgement has a staggered treatment timing, and heterogeneity in the effects could arise if different cohorts experience different treatment paths. Therefore, we also adopt the approach of Sun and Abraham (2021) to estimate the heterogeneous treatment effects. This approach derives the dynamic effects of star acknowledgement in a three-step estimation that is robust to treatment effect heterogeneity and calculates a weighted average of 'cohort average treatment effects on the treated' (CATT). First, we define the year in which an author \(i\) acknowledges a star scientist as \({e}_{i}\). Second, we estimate the weighted average of cohort effects for a given time relative to the acknowledgement event. To allow the estimated star acknowledgement effects to vary by cohort based on the year that the acknowledgement event occurs, we estimate the following equation:

$$ln{Y}_{i,t}=\sum_{e}\left[{\delta }_{e,-4}(1\left\{{E}_{i}=e\right\}{staracknw}_{i,-4})+\sum_{j=-3}^{-2}{\delta }_{e,j}(1\left\{{E}_{i}=e\right\}{staracknw}_{ij})+\sum_{j=0}^{3}{\delta }_{e,j}(1\left\{{E}_{i}=e\right\}{staracknw}_{ij})+{\delta }_{e,4}(1\left\{{E}_{i}=e\right\}{staracknw}_{i,4})\right]+{\delta }_{t}+ {\mu }_{i}$$
(2)

where 1 \(\left\{{E}_{i}=e\right\}\) is an indicator variable that takes the value 1, if the individual \(i\) receives star help in the year \(e\) and 0 otherwise. \({\delta }_{e,j}\) is the star help effect on productivity \(j\) year after author acknowledges a star in year \(e\). The 1996 treated cohort is dropped from the analysis since it is always treated across the observation window. A further set of weights are estimated \(Pr\left\{{E}_{i}=e|{E}_{i} \epsilon [-j,T-j]\right\}\) that are equal to sample shares of each cohort for the relevant periods \(j\). Finally, to obtain the IW estimator, we take a weighted average of the \({\widehat{\delta }}_{e,j}\) (or \({CATT}_{e,j}\)) and estimate Eq. 2 with relevant weights calculated.

$${\beta }_{j}^{*}=\sum_{e}\left[{\widehat{\delta }}_{e,j}Pr\left\{{E}_{i}=e|{E}_{i}\epsilon [-j, T-j]\right\}\right].$$
(3)

5 Results

In this section, we present the results for our different hypotheses outlined in Sect. 2.2. We use FNTC as the dependent variable for testing hypotheses 1a and 2a and the number of raw publications for testing hypotheses 1b and 2b. Furthermore, we only use FNTC as the dependent variable to test hypothesis 3, where we explore the effect of star help across the types of helpful interaction identified.

5.1 Star help effect on individual output (with FNTC as dependent variable)

We examine hypotheses 1a and 2a in this section. We present the estimated results from the event study specification for both the homogenous and heterogeneous star acknowledgement effects in Table 3. The dependent variable is FNTC to the output published in year \(t\). Column 1 of Table 3 shows the results for the homogeneous model, and the star acknowledgement effects are also shown on the graph in the left panel of Fig. 2. We find evidence that a helpful interaction with a star, as captured by a star acknowledgement, significantly affects an author's productivity supporting hypothesis 1a. A star acknowledgement is associated with an economically and statistically significant contemporaneous increase in the quality-adjusted output of the author of 24.70 log points, which translates to a 28.02% increase in output. In the years after the star acknowledgement, the estimated coefficients remain positive but decrease substantially to between 0.04 and 0.05 log points and are only statistically significant at the 10% level.

Fig. 2
figure 2

Event study model with homogenous (left) vs. heterogeneous (right) star help effects at an individual level. The figure plots the dynamic effect of the star's helpful interactions with an author at 95% confidence intervals. The event and observation window is from 1996–2017 (the 1996 cohort dropped is omitted). The dependent variable is field normalised total citations

Regarding the heterogeneous model using the approach of Sun and Abraham (2021), the event study results are reported in Column 2 of Table 3 and displayed in the right panel of Fig. 2. Under the heterogeneous model, we find a similar contemporaneous effect as in the homogenous model, with a 28% increase in the quality-adjusted output of the author in the year of acknowledgement. Moreover, the results lend further support to hypothesis 1a when the differential timing of treatment is taken into account in our estimation. Furthermore, the estimated coefficients for the years following a star acknowledgement are again similar to the homogenous model (0.05 to 0.06 log points), and these coefficients are statistically significant at the 5% and 10% levels, respectively. In considering the staggered timing of treatment on authors that have a helpful interaction with a star, the homogenous results are robust to the heterogeneous specification in Eq. 3. Overall, the results suggest that the quality-adjusted output of an author increases contemporaneously when a star is acknowledged, but the effect tends to fall in subsequent years. We also dropped the 1996 (always treated) cohort from the homogenous estimation for comparison purposes, which has minimal effects on the final results. Overall, we find supporting evidence for hypothesis (1a) that there is a contemporaneous increase in the citation-weighted publications from star help.

One advantage of an event study setting is that it allows visual evidence to support or contradict the parallel trends assumption. Our results in Fig. 2 depicts that both the homogeneous and heterogeneous models show very little evidence of a pre-trend; therefore, visual inspections support the parallel trends assumption. However, Roth (2022) raises some important concerns about relying on insignificant pre-trends in the event study setting to assess the credibility of parallel trends. In particular, the test may have low power to detect meaningful violations of parallel trends. Roth (2022) suggests a diagnostic approach to determine whether such concerns could be warranted, and we implement these diagnostics in our analysis.

Roth's (2022) technique involves constructing a hypothesised linear violation of parallel trends and then comparing the likelihood of the observed coefficients under the hypothesised trend relative to under-parallel trends. We construct a hypothesised trend with a power of 50%, which would detect a significant pre-trend 50 percent of the time. With a power of 50%, the slope estimate is found to be 0.02 for both the homogenous and heterogeneous effects models. The likelihood ratio reporting the ratio of the likelihood of the observed coefficients under the hypothesised trend relative to under-parallel trends is 0.30 and 0.38, respectively, which supports that the estimated coefficients are more likely observed under a parallel trend. The results of this diagnostic test are also shown in Table 3.

Next, we examine hypothesis 2a, which proposes that the effect of star help has a sustained increase in the quality-adjusted output of the author. Table 4, Column 1, reports the results based on the sample of the author's productivity that continues to acknowledge a star after the first point acknowledgement. After their initial interaction, the authors who maintain this relationship with a star show a substantial increase in their quality-adjusted publications. Their output is found to increase by 41.62% in the year of acknowledgement and is statistically significant at 1%. These results show the importance of maintaining the relationship with a star. The estimated coefficients decrease in the years after the first acknowledgement (20.12, 23.74, 23.94 log points) but are significant and persistent.

Table 4 Dynamic star help effects (multiple and one-time) for field normalised total citations

Additionally, the event study plot in Fig. 3 left panel shows no evidence of a pre-trend as well as a strong help effect on productivity up to 3 years after the first acknowledgement in the case where multiple acknowledgements take place. This is in contrast to an increase in productivity that is broadly limited to the year of acknowledgement in the right hand panel, which limits the sample to cases of a single acknowledgement. Overall, the evidence suggests that once-off help is associated with a time-limited increase in productivity (supporting hypothesis 1a), and that sustained help is associated with a sustained increase in productivity (supporting hypothesis 2a).

Fig. 3
figure 3

Event study model with multiple (left) vs. one-time (right) star help effects at an individual level. The figure plots the dynamic effect of subsequent and first-star helpful interactions with an author at 95% confidence intervals. The event and observation window is from 1996–2017. The dependent variable is field normalised total citations

5.2 Star help effect on individual output (with raw publications as dependent variable)

In our estimation above, we examine the effect of star help on the quality-adjusted productivity of authors, where we use the field-weighted citations as our dependent variable to study this effect. However, there remains a concern that the positive effect of observed star acknowledgement on citation-weighted output found above could reflect a signalling effect, whereby acknowledgement to a star scientist is used as a signal or indicator to convey information about the potential quality of the publication. For the reasons outlined in Sect. 2.2, we augment our analysis using FNTC with an analysis using field-normalised total raw publications to help disentangle the input-enhancing role implied by observed star acknowledgements from a signalling effect. Based on our assumption that the blinded manuscript submitted to a journal for peer review contains no acknowledgement texts and, thus, the decisions by the reviewers cannot be impacted by the acknowledgement to a prominent star scientist, we now present the results from testing hypothesis 1b and 2b using raw publications count.

Column 1 in Table 5 reports the estimated coefficients for the baseline regression using authors' field normalised raw publications as the dependent variable. We find that in the year of the acknowledgement event, the raw publication output of authors increased by 22.38% compared to the authors who never acknowledged any help from a star. We also find a decrease in the magnitude of the star-help effect on the raw publications output of the author. These results support hypothesis 1b, where we propose that there will be an increase in the raw publications output in the year of star help. However, similar to our findings for FNTC, we do not find any evidence of a significant effect on raw publications output in the years after the event.

Table 5 Dynamic star help effects (overall, multiple, one-time) for raw publications

For hypothesis 2b, the results based on the treated sample of authors that acknowledge a star on one occasion and those that acknowledge a star on multiple occasions are presented in Table 5, Columns 2 and 3, respectively. Regarding the authors that acknowledge the star for help in the years after the initial event, the results suggest a sustained effect on raw publications output broadly similar to our analysis with field-weighted citation as the dependent variable. An acknowledgement of star help is associated with a 28.53% increase in raw publications in the event year, and the coefficients are found to be statistically significant in the years after. This provides evidence to support our hypothesis that there is a sustained increase in the raw publications output for authors who maintain an informal collaboration with a star.

In contrast, in considering the effect of star help on the raw publications output for the authors who acknowledge a star for help in a single event, the results are aligned with our prior analysis using field-weighted citations as the dependent variable in that we do not find any evidence for a statistically significant effect in the years after the event. Figure 4 illustrates the event study plots for the two cases. Overall, by separating the input-enhancing role implied by observed star acknowledgements from a signalling effect using the raw publications measure, the results indicate that the positive effect of the observed star acknowledgement on output reflects more than just a signalling effect in both the cases of once-off acknowledgement and multiple acknowledgements.

Fig. 4
figure 4

Event study model with homogenous star help effects at an individual level: Overall Star Help (Row 1); Multiple Star Help (Row 2, Left); One-Time Star Help (Row 2, Right). The figure plots the dynamic effect of star interactions with an author at 95% confidence intervals. The event and observation window is from 1996–2017. The dependent variable is raw publications

5.3 Effects of star help through different channels

In this Section, we consider hypothesis 3, which states that the effect of star help is different in magnitude across the different types of star help identified based on the keywords contained in the acknowledgement text for the publication. We identify five types of acknowledgement to understand the effects of the helpful interaction between the author and the star: Conceptual, Technical, Material, Funds and Support, and Other Types. Our dataset is split into five cohorts, each containing treated and control authors identified from the CEM procedure. This analysis estimates the effect of a star help that happens through five different channels.

For each channel, we modify Eq. (1)

$${Y}_{it}= \alpha + {\beta }_{j \le -4}{staracknwtype}_{i,-4}+ \sum_{j= -3}^{-2}{\beta }_{j}{staracknwtype}_{ij}+ \sum_{j= 0}^{3}{\beta }_{j}{staracknwtype}_{ij}+ {\beta }_{j \ge 4}{staracknwtype}_{i,4}+{\delta }_{t}+ {\mu }_{i }+ {\epsilon }_{it}$$
(4)

where \({staracknwtype}_{ij}\) is a binary variable that indicates if a star is acknowledged for the specific type of help in year \(j\) by an author \(i\).

The results from the estimates are presented in Table 6. They show that a conceptual acknowledgement to a star is associated with an increase in output of 37.16%, and this is the most significant increase in output compared to all other types examined in this analysis. The help from the star through sharing their knowledge positively impacts the author's productivity. In contrast, a technical acknowledgement type is associated with the smallest effects on output, with an increase of 18.77%. This acknowledgement type is observed primarily in subject fields such as medicine and biochemistry, where the research methods involve practical examination and laboratory experiments, and stars can provide their direct expertise. Material acknowledgement to a star is associated with an increase in output of 20.08% in the year of acknowledgement and unlike the other types of help, there was a 21.05% increase in output in the year afterwards. Also, the event study plot for material acknowledgement in Fig. 5 shows a persistent productivity rise in the years after star acknowledgement, which contrasts with the absence of a sustained effect for the other types of acknowledgement. Material acknowledgements to a star are observed mainly in the fields of medicine and biochemistry, where stars can conveniently share unpublished data and loan specimens. This could suggest that the star and the author develop a more persistent connection through the materials-sharing channel than through the other acknowledgement channels.

Table 6 Dynamic star help effects for field normalised total citations for five types of helpful interactions
Fig. 5
figure 5

Event study model with channels of help identified from the acknowledgement texts. Conceptual (row 1, left), Technical (row 1, right), Material (row 2, left), Funds & Support (row 2, right), and Other types (row 3). The figure plots the dynamic effect of star interactions with an author at 95% confidence intervals. The event and observation window is from 1996–2017. The dependent variable is field normalised total citations

A project funded by the star's support also impacts positively on an author's productivity, with a funding acknowledgement to a star found to be associated with a 35.80% increase in output in the year of acknowledgement. These findings suggest that a star who supports the research through financial means also has an effect on the author's productivity. The number of these acknowledgements are fewer when compared to other types of interaction channels in our data. Finally, all the other types of acknowledgement that could not be accurately classified under conceptual, technical, materials, or funds and support are categorised as ‘Other Types’ for the purpose of this study. In the year of these type of acknowledgement events, the author's quality-adjusted output increases by 28.79%, similar to our overall star acknowledgement effect in Sect. 5.1. These results suggest that the 'Other' category proxies for the other more concrete forms of help the stars provide. Alternatively, the relative robustness of the results across acknowledgement types might indicate that the precise form of help is less important than the close interaction with the star. Overall, these results provide evidence to support hypothesis 3 that the effect of star help on the author's quality-adjusted productivityFootnote 6 is present, albeit variable across the types of help. Furthermore, we note that the estimated coefficients are statistically significant in the acknowledgement event year for each type, which can be taken as further support for hypothesis 1a.

5.4 Robustness

5.4.1 Robustness to a balanced panel

Our baseline estimation uses an unbalanced panel of 1258 authors to estimate the effects on the dependent variable. In a balanced panel, we observe units (in this case, authors) every time period, reducing the noise introduced by unit heterogeneity. To analyse whether this variation in the dataset affects our results, we use a panel of 520 authors present in the data throughout our estimation period (1996–2017). The matching statistics of these 520 authors are reported in Table 7a. We present the estimated results of the event study specification based on the balanced panel in Table 8, Column 1. Similar to the star acknowledgement associated with the unbalanced panel data, we again observe a significant contemporaneous effect (27.12%). Though the estimated coefficients in the further years fall (9.46, 7.43, 10.5 log points), they are statistically significant up to three years after the year of acknowledgement. The event study plot in Fig. 6 again shows no evidence of a pre-trend. In addition, Roth's pretest diagnostic analysis reports a likelihood ratio of 0.40, suggesting that the estimated coefficients are likely to follow a parallel trend before the treatment.

Table 7 Summary statistics: Control and Treated group (k-to-k matched) (a) a balanced panel, (b) alternate matching criteria using the availability of publication match at t = 0, (c) alternate matching criteria using acknowledgement text availability in publication at t = 0
Table 8 Robustness test of dynamic star help effects for field normalised total citations
Fig. 6
figure 6

Robustness test on Event study model with homogenous star help effects at an individual level for a balanced panel. The figure plots the dynamic effect of star interactions with an author at 95% confidence intervals. The event and observation window is from 1996–2017. The dependent variable is field normalised total citations

5.4.2 Robustness to alternative matching criteria

As a further robustness check, we examine the sensitivity of the results to alternative matching criteria. One concern is that authors are unlikely to publish their research work annually throughout their careers. It takes time to publish a paper following the review process and editing. As an acknowledgement can only occur in a year a paper is published, acknowledgements may be simply partly picking up the fact that a paper was published in a particular year, thus biasing our estimate of the productivity effect of an acknowledge relative to the control authors. To this end, we identify the authors in the control group that have also published at least one paper in the event year (\(t=0\)) when the authors in the treated group have a publication that acknowledges a star. Table 7b reports the matching statistics of the new set of control and treated authors. Here we find 1208 matching authors that satisfy the matching criteria. In Table 8, Column 2, we present the estimated results from the event study specification. In considering the additional matching criteria, we still find a similar contemporaneous effect from star acknowledgement on output. An increase of 27.76% in the quality-adjusted output is estimated once the author gets exposure from the star, which is similar to our baseline results.

As discussed previously in Sect. 3.2, an additional concern is that the availability of acknowledgement text from the Scopus database is limited. The credibility of the star help effects can be questioned due to the comparison between a treated author who gets a match from the control group and has no acknowledgement text available in that particular year. To check the possible implications for our results, we compare the authors in the control group that have a publication with an available acknowledgement text in the event year (\(t=0\)) when the treated author acknowledges a star. Table 7c reports the matching statistics of 990 authors at year \(t-1\) with these additional criteria. The event study estimation reported in Table 8, Column 3 again shows similar results to our analysis in Sect. 5.1. We find a contemporaneous increase in the quality-adjusted output of the author by 28.27% associated with star help. Also, the plots for the estimation under both alternative matching criteria are presented in Fig. 7, and they indicate no evidence of a pre-trend. Furthermore, the likelihood ratios of observing any pre-trend are 0.36 and 0.34 from the Roth test analysis, which supports the appropriateness of the parallel trend assumption.

Fig. 7
figure 7

Robustness test on Event study model with homogenous star help effects at the individual level using alternate matching criteria; the matched control author should have a publication in year \(t=0\) (left), and the control author should have a publication with acknowledgement text in year \(t=0\) (right), where year \(t=0\) is the year when the treatment starts. The figure plots the dynamic effect of star interactions with an author at 95% confidence intervals. The event and observation window is from 1996–2017. The dependent variable is field normalised total citations

5.4.3 Robustness to country-specific cohorts

The publication data from Scopus comprises those authors that have published in Ireland, Denmark, and New Zealand, and the stars we identified are those individuals who are highly productive relative to their peers in these three countries. Therefore we should be able to see a similar effect of star help on the author's productivity in the year of star acknowledgement. The results reported in Table 9 and presented graphically in Fig. 8 show some differences across countries—with the largest acknowledgement effect observed for Denmark and the smallest for New Zealand—but overall, we find evidence of a significant productivity effect in the year of acknowledgement.

Table 9 Robustness test of dynamic star help effects for field normalised total citations for the three countries separately
Fig. 8
figure 8

Event study model with homogenous star help effects at an individual level by country (Ireland, Denmark, New Zealand). The figure plots the dynamic effect of star interactions with an author at 95% confidence intervals. The event and observation window is from 1996–2017. The dependent variable is field normalised total citations

5.4.4 Robustness to co-authorship relations

While our baseline estimation excludes authors that have co-authored with a star prior to the star acknowledgement event from the treatment group and the control group comprises of matched authors that have never co-authored or acknowledged a star scientist at any time in the sample, there could still be the potential for endogeneity from other types of social connection between the author and the star prior to the acknowledgement event and separate to direct co-authorship. For example, an author might have had an indirect social relationship with the star by co-authoring with a non-star author who may have previously co-authored with the star and thus it would be important to control for social distance in the estimation more generally.

To explore this possibility, we examine the network of co-authors for the treated authors before the acknowledgement of star help (before \(t=0\)) to identify the individuals not directly connected to the star through co-authorship but sharing a mutual co-authorship connection with someone else in the second degree of separation. We identify a total of 56 out of the 629 treated authors reported in Table 2 that share a mutual co-authorship connection with the star. Then in a robustness check, we exclude these 56 treated authors and their matched control pairs from the analysis and re-estimate the baseline regression. Column 1 in Table 10 presents the estimated coefficients after their exclusion. Consistent with the baseline results in Table 3, we find that acknowledging star help is associated with a 28.40% (0.25 log points) increase in the quality-adjusted output of the author in the year of acknowledgement. In addition, Fig. 9 shows the estimated event study plot after the exclusion of the 56 treated authors and their matched pairs, and it also indicates that the baseline results are robust to their exclusion.

Table 10 Robustness test of dynamic Star Help Effects by excluding the authors with previous indirect relations with a star (column1) and excluding the co-authored publications with a star after the initial acknowledgement event (column 2 & 3)
Fig. 9
figure 9

Event study model with homogenous star help effects at an individual level: Excluding the authors with indirect star relationship before the acknowledgement event. The figure plots the dynamic effect of star interactions with an author at 95% confidence intervals. The event and observation window is from 1996–2017. The dependent variable is field normalised total citations

Another concern for our baseline results is the potential emergence of a co-authorship relationship between the author and the star post the acknowledgement event. This could have important implications for our results where the estimated increase in author output associated with the acknowledgement of star help could instead be partially attributed to the later emergence of a direct co-authorship relationship between author and star. Authors who acknowledge a star for their help could also produce more co-authored publications with the same star or indeed any star in the years after the event (\(t=0\)). In a further test of the robustness of our results, we examine the effect of once-off and multiple-star help on author’s output, excluding all co-authorship publications that occur between a star and author after the acknowledgement event from our estimation.Footnote 7

Column 2 in Table 10 reports the results for one-time star help after excluding these publications. Again, the results are similar to the baseline and also have a similar dynamic pattern (see Table 4, Column 2 and Fig. 3 (right) for comparison). Overall, they show a star help effect of 26. 5% for the quality-adjusted productivity of the author in the year \(t=0\) as well as no evidence of a sustained productivity effect in the following years. Furthermore, Column 3 in Table 10 reports the estimated coefficients for multiple star help after excluding the post acknowledgement event star co-authorship publications from the analysis. Here, the star help effect is estimated to be a 39.1% increase in the quality-adjusted output of the author. Additionally, in the years following the initial acknowledgement event, the results show statistically significant coefficients broadly comparable with the baseline findings but with a smaller positive impact at each lag. Figure 10 also shows the event study plots of the star help effect for both once-off and multiple star help after removing these co-authored publications, and these are comparable to the baseline plots in Fig. 3. Therefore, our initial results are largely robust to the exclusion of co-authored publications between an author and star in the years after the acknowledgement event.

Fig. 10
figure 10

Event study model with homogenous star help effects at an individual level: Excluding the future star co-authored publications for authors who acknowledge star help for: One–Time (Left); Multiple Times in the years after the first point of contact (Right). The figure plots the dynamic effect of star interactions with an author at 95% confidence intervals. The event and observation window is from 1996–2017. The dependent variable is field normalised total citations

5.4.5 Robustness to single and multiple acknowledging authors

To test whether the effects of a star acknowledgement are different where the star is acknowledged multiple times, we define a dummy variable \(M\) that takes a value of 0 if the star is acknowledged just once and 1 where if the star is acknowledged more than once. This leads to a revised estimating equation:

$$ln{Y}_{it}= \alpha + {\beta }_{\le -4}{staracknw}_{i,-4}+ \sum_{j= -3}^{-2}{\beta }_{j}{staracknw}_{ij}+ \sum_{j= 0}^{3}{\beta }_{j}{staracknw}_{ij}+ {\beta }_{\ge 4}{staracknw}_{i,4}+{\gamma }_{\le -4}({staracknw}_{i,-4}\times M)+ \sum_{j= -3}^{-2}{\gamma }_{j}\left({staracknw}_{ij}\times M\right)+ \sum_{j= 0}^{3}{\gamma }_{j}({staracknw}_{ij}\times M)+ {\gamma }_{\ge 4}({staracknw}_{i,4}\times M))+{\delta }_{t}+ {\mu }_{i }+ {\epsilon }_{it}.$$
(5)

At any given lead or lag, a simple test of difference in effect for the cases of single or multiple acknowledgements is then a test of the statistical significance of the relevant \(\gamma\) coefficient. A positive and statistically significant coefficient in period 0 (the period in which the acknowledgement occurs) would indicate that even in the period in which the acknowledgement occurs, a scientist that makes multiple acknowledgements receives a greater productivity boost than a scientist who only makes a single acknowledgement. This could reflect unobserved heterogeneity between single and multiple acknowledging scientists (that is not picked up by our controls) or indicate that the quality of initial help received is greater for multiple acknowledging scientists. However, a finding of no statistically different initial productivity effects of acknowledgements between the two groups helps allay concerns of unobserved heterogeneity between the single and multiple citing scientists.

We report the event study estimates of the model in Table 11. In Column 1, we show the estimated beta coefficients from Eq. (5), while in Column 2, we show the estimated gamma coefficients on the interaction between the indicator variable for star help and the multiple acknowledgement dummy. Most importantly, in the year of star help (\(j=0\)), we don’t find any evidence of a significantly different initial productivity effect for the authors who acknowledge a star multiple times. We take this as evidence that the observation of multiple acknowledgements does not imply that the scientist is more productive and/or can make better use of star help, suggesting that the sustained productivity effect associated with multiple acknowledgements reflects a causal effect of the help rather than being due to any selection effect.

Table 11 Robustness test of dynamic Star Help Effects for Field Normalised Total Citations comparing the initial productivity effects of authors who acknowledge star single and multiple times

5.4.6 Robustness to the productivity of acknowledging authors

In a final robustness test, we examine for potential heterogeneous treatment effects across the distribution of the outcome variable. The effect on productivity from acknowledging star help could differ depending on the productivity of the acknowledging authors. To investigate this, we look at the cumulative FNTC of the treated authors one year before treatment, the acknowledgment to a star. This productivity indicator provides a measure of where an author is in the initial productivity distribution before treatment. Table 12 reports summary statistics for each of the four quartiles of treated authors from the distribution of cumulative FNTC at year \(t-1\). We divide our overall sample of treated authors into four sub-samples based on these quartiles and then, using Eq. 1, we estimate the results for each sub-sample of treated authors and their matched pairs from the control group.

Table 12 Summary statistics: by quartile of the cumulative FNTC distribution for the treated group at year t-1

The event study estimates for each quartile are reported in Table 13 and their related event-study plots are presented in Fig. 11. The results show that the contemporaneous effect of star help on author productivity for authors in the first and second quartiles (column 1 and 2) is larger compared to the effect for authors in the third and fourth quartiles (columns 3 and 4) of the cumulative FNTC distribution. Star help is associated with a statistically significant increase of 38.13% for authors in quartile 1 and a statistically insignificant increase of 13.54% for authors in quartile 4. The event study plots also indicate that the effect is sustained in the years after the acknowledgement event for quartile 1 authors on average. In contrast, it is not sustained for the authors in later quartiles. In comparing the low- and high-productivity author clusters, we thus find that star help has a greater effect on authors that have low productivity before the star help is received relative to the authors that have high initial productivity. We interpret this result as indicating that lower productivity authors gain most from star help. We hope to further investigate this finding in future work, as it may have important implications for policies that can be used to support lower-productivity colleagues.

Table 13 Robustness test of dynamic Star Help Effects for Field Normalised Total Citations comparing the low and high productivity clusters of treated authors based on the quartiles of the cumulative FNTC at t-1
Fig. 11
figure 11

Event study model with homogenous star help effects at an individual level comparing low–high productivity clusters by equal four quartiles of cumulative FNTC received for the treated authors one year before the treatment. The figure plots the dynamic effect of star interactions with an author at 95% confidence intervals. The event and observation window is from 1996–2017. The dependent variable is field normalised total citations

6 Summary and policy implications

Although there is a growing body of literature that estimates the effects of connections to stars on the productivity of peers, a star's role in helping their peers through non-co-author-based relationships has been largely neglected. While focusing on the performance gains of co-authoring with a helpful star, Oettl (2012) discusses the possibility of extending the research to colleagues and students who are not co-authoring with the star. Following this suggestion, this paper investigates the effects of star help on the productivity of scientists receiving the help.

To implement this empirically, we identify interactions between a star and an author identified from the acknowledgement texts in a publication. We treat these acknowledgements as an indicator of helpful interaction and examine their impacts on the author's quality-adjusted scientific output.

We analyse the output of authors who publish in three countries: Ireland, Denmark, and New Zealand. Using Natural Language Processing (NLP) techniques, we extract the names of individuals who get acknowledged for their contributions to the research. We match these individuals to previously identified stars' names relative to the peers who published in the three countries. We estimate the effect of star help by comparing scientists who acknowledge the star with a carefully matched control group of scientists who do not.

The overall effect of star help is estimated in an event study setting, where the event is the acknowledgement by an author of an identified star. We observe an increase in the quality-adjusted output of authors in the event year. Although we find evidence of a sharp fall in the productivity effect in subsequent years, the initial effect of star interaction is significant. In addition, analysis of authors who continue to acknowledge a star in the years after the initial acknowledgement shows a higher and sustained helpfulness effect on their output.

In addition, we examine the effects of different types of star help. We classify these interactions from a star based on the keywords in the acknowledgement text. We identify five interaction channels through which the knowledge transfer occurs. A disaggregated analysis of these interactions suggests that all types of help positively affect the author's output. However, authors who acknowledge a star for the conceptual help show the highest increase in the output in the year of acknowledgement. Unlike other forms of star help, authors who acknowledge a star for materials received show consistent productivity effects even without evidence of sustained acknowledgements.

A second contribution of our analysis is to examine star-help effects within an explicitly dynamic framework. Our staggered event-study findings show significant productivity effects in the year that star help is acknowledged. However, the observed effects drop sharply in the years following the acknowledgement unless that help is sustained. Moreover, these findings are robust to recent techniques that explicitly account for heterogeneity across the years in which the star help is observed to occur.

Previous work has established that the occurrence and nature of star help can be an important mediating factor in determining the effects of co-authorship relationships with a star (e.g., Oettl, 2012). Our study focuses on the productivity effects of star help in the absence of a co-author relationship. We find a robust relationship across forms of star help between the acknowledgement of help and the productivity of non-co-authoring researchers. Moreover, the effects on productivity are more significant where there is evidence of the help being sustained over time and where the researcher receiving the help is positioned in the lower quartiles of the relevant field-specific productivity distribution.

These results have important implications for the recruitment and organisational strategies of academic departments. To the extent that stars are co-located with non-star researchers, they should have more opportunities to provide help to the benefit of incumbents at the receiving department, with the beneficial effects being very impactful for less productive researchers. This points to one source of potential value from star recruitment policies in addition to the more widely studied co-authorship and recruitment-quality channels, at least to the extent that co-located stars are better positioned to provide help to their non-star colleagues (Agrawal et al., 2017). Furthermore, as the results are not specific to co-located researchers, organisational policies that help embed researchers in networks that increase the probability of interaction with stars should have productivity benefits. Where the star and non-star are co-located, formal mentoring programmes and more informal means such as regular departmental seminars, workshops and social gatherings should help initiate and develop the relationships that support the provision of star help. Star help can also be provided even where the star and non-star are located in different institutions. In this regard, it is likely beneficial for the non-stars to embed in broader networks that allow for interactions with stars, thus supporting the development of the relationships that facilitate productivity-enhancing star help. This suggests the importance of organisational support for network building, such as providing funding for conference travel and encouraging membership of cross-department societies or international funding consortia. Our results suggest that such policies could be especially beneficial for scientists in the lower parts of their field-specific productivity distributions, especially where this help can be sustained over time.