1 Introduction

When a science-related technology emerges,Footnote 1 firms that aim to develop this technology invest in the relevant research. Such investments are long-term and high-risk; therefore, they are typically made by firms with strong market position (Rosenberg 1990). A considerable proportion of these firms’ research outcomes are accumulated in the form of firm-specific tacit knowledge (Nelson and Winter 1982), and some outcomes are patented for future appropriation (Teece 1986). At the same time, some outcomes are selectively published in the form of research papers (Hicks 1995; Simeth and Raffo 2013; Grassano et al. 2019).

However, publishing by firms is a seemingly counterintuitive behavior in that firms reveal their research outcomes to the public without direct monetary rewards. As firms do not have enough incentives to produce public knowledge with their own money (Arrow 1972), the primary contributors to published knowledge have been universities that are largely supported by the government (Mowery and Rosenberg 1999). Nonetheless, firms in some emerging sectors publish their research outcomes when they expect that they can ultimately benefit by doing so. Although the overall intensity of firm science and publications has decreased since the 1980s (Arora et al. 2018),Footnote 2 firms in some sectors still publish actively (Grassano et al. 2019).

When firms engage deeply in research publications in emerging fields, a question arises concerning the influence of the private sector’s engagement in published knowledge development. If firms possess assets that are critical for developing and commercializing an emerging technology, their research papers will likely attract the attention of researchers intending to contribute to the knowledge frontier (Dasgupta and David 1987) or economic development (Etzkowitz 2002). If the attraction is significantly strong, the development of published knowledge will be naturally focused around the firms’ interests, facilitating progress in the corresponding field and shaping the direction of knowledge evolution. The recent advancement of artificial intelligence (AI) is a relevant case in which firms that have published a substantial number of papers possess key complementary assets required to advance and apply emerging technologies (Anthes 2017; Agrawal et al. 2018; Mansell and Steinmueller 2020). Although it is challenging to intentionally alter the rate and direction of the evolution of an emerging technology, papers published by firms in fields, such as recent AI, can considerably influence the evolution of published knowledge.

Such an impact can be of considerable significance in influencing policies related to science and technology with respect to government support, particularly when firm papers are of high quality. One of the reasons that we can expect high quality from firm papers in the field of AI is the widespread research collaboration between private and public organizations. Firm researchers in the field of AI have increasingly collaborated with researchers in universities (and public research institutions) because of the expected potential synergy of the collaboration (Gibney 2016). Firms can decrease their long-term investment risk by effectively gaining access to the scientific knowledge base of university researchers, while university researchers can acquire the opportunity to conduct better research by leveraging the firms’ resources. In this vein, firms’ engagement in research publications may have high-quality impacts on the published knowledge development of the field in the long run. If this is the case, we can derive important policy implications regarding the direction of public support for relevant innovations.

However, to the best of our knowledge, relatively little scholarly attention has been paid to empirically examining how and to what extent firms’ publications in emerging science-related technologies can influence the evolution of knowledge that is built on research papers (Perkmann and Walsh 2009). Moreover, empirical studies barely shed light on the features of papers produced by firm and non-firm (i.e., university or public research institutions) collaborations. Focusing on the recent development of AI, this study bridges these gaps in the literature with the following research questions.

  • RQ1) To what extent do firms’ publications (relative to other publications) influence the formation of published knowledge trajectory relevant to emerging science-related technology?

  • RQ2) Are there any qualitative differences between publications by firm and non-firm collaborations and other publications on emerging science-related technology in terms of their a) novelty and b) conventionality?

Referring to Uzzi et al. (2013), papers satisfying the conditions of both high novelty and conventionality are likely to be highly important ones in the long run. Therefore, the second research question reflects the long-term quality of published papers, focusing on the qualitative characteristics. To answer the research questions, this study controls for the previous track record of author(s) and affiliation(s) associated with each publication, as well as paper-specific characteristics. We employ statistical models, along with data-mining approaches such as community detection methods, main path analysis, and dynamic topic modeling on the data obtained from AI-related conference and journal papers. The findings of this study are expected to enhance our understanding of firms’ publishing on emerging science-related technologies and thereby provide a more detailed picture of the role of firm and non-firm actors in the evolution of emerging science-related technology.

The rest of this paper is structured as follows. In Section 2, we provide a background of the literature and rationalize why the recent development of AI fits our context. Section 3 describes the research methodologies and data collection, while Section 4 discusses the findings. Lastly, Section 5 presents the study’s contributions and conclusions.

2 Conceptual background

2.1 Firms’ investment in an emerging science-related technology

In the post-war period of the twentieth century, research on high-risk science-related technologies was largely conducted by the corporate laboratories of technological giants such as AT&T, Du Pont, GE, and Xerox (Pisano 2010). These capital-intensive organizations invested heavily in internal research and concentrated their efforts on developing technologies that were expected to create huge long-term value despite the great uncertainties involved. The expected benefits from these risky investments include first-mover advantages such as learning experiences, buyer-switching costs, and barriers to the entry of other firms (Rosenberg 1990). Even when a firm does not benefit much from the first-mover advantage, the absorptive capacity obtained from such an investment helps the firm effectively monitor and understand research conducted by external actors (Cohen and Levinthal 1990).

Differing from this conventional pattern, there have been some cases in which small firms invest in research concerning high-risk technologies. When biotechnology emerged in the 1980s, the major actors that conducted innovation research were young entrepreneurial firms, particularly academic spin-offs (Rothaermel and Thursby 2007). This was possible because compared with research outcomes from other fields, those from biotechnology, in general, are readily appropriable. There were a number of venture capitalists, wealthy individuals, and large firms that financed the research of small firms. Similar patterns have been observed in emerging sectors such as nanotechnology and energy (Henderson 1993; Christensen and Rosenbloom 1995; Arora et al. 2018).

With the resurgence of AI starting around 2006, large firms have again been the major business actors engaged in research (Gibney 2016), as they detect AI’s huge market potential—as an enabling technology (Teece 2018) or even as a general-purpose technology (Klinger et al. 2018)—applicable to various domains. Moreover, the complementary assets necessary to proceed with AI research—big data and computational power—are concentrated in firms, not in universities and public research institutions. The business models that derive value from AI are also likely to be firm-specific because an algorithm performs better when it has a narrow focus (Norton 2016). Such circumstances have incentivized firms to perform research on the emerging technology of AI (Waters 2015).

2.2 Evolutionary perspective on firms’ publishing

Since the knowledge itself is not inherently private or public, the decision to make the research findings public depends on the strategic choices of firms. Firms strategically decide which parts of their research findings should be kept private or made public, and which methods of knowledge protection or disclosure should be employed for future appropriation. Firms can apply patents to protect their knowledge legally or choose to keep their knowledge as a trade secret if it is difficult to obtain a patent for the knowledge or the risk of releasing the knowledge through a patent document is higher than the expected return. By contrast, firms can choose to publish their research outcomes purposefully through channels such as journals or conference proceedings.

Among firm-level actions that are used to manage the boundary between private and public knowledge, research publication is one of several selective revealing strategies through which firms intentionally make knowledge spillovers (Alexy et al. 2013). This firm behavior of revealing knowledge without direct monetary rewards is counterintuitive when we consider the free riding of external actors on the revealed knowledge (Arrow 1972). Due to this counterintuitive nature, many scholars have discussed the motivations underlying firms’ publishing, such as to attract talented researchers by providing them an opportunity to get published (Simeth and Raffo 2013), to maintain links with academia (Rosenberg 1990), or to build the necessary technical reputation to exchange valuable knowledge with academia (Hicks 1995).

One of the main reasons why firms expect their publishing to serve key purposes, such as maintaining technical reputation and academic relationships, is because the published knowledge not only delivers written knowledge to the public but also indicates the unpublished aspects of firm resources (Hicks 1995). These resources can include complementary assets required to commercialize emerging technologies, such as infrastructure, data, and tacit knowledge and skills. Researchers in academic engineering may be interested in solving problems of industrial significance (Vincenti 1990). Institutional changes after the Bayh–Dole Act of 1980 and the subsequent founding of technology transfer offices in universities further encourage university researchers to contribute directly to economic and social development, which is known as the third mission of universities (Etzkowitz 2002). Therefore, papers published by firms that possess resources critical to an emerging technology’s commercialization are likely to draw researchers’ attention to these firms.

Thus, it is expected that when a firm that possesses assets complementary to an emerging technology actively engages in publishing, the influence of its papers on the evolution of published knowledge are considerable. Similar observations have been made in that the average impact of papers published by leading firms is outstanding in fields such as biotechnology, physical science, electrical engineering, and recently, AI (Hicks 1995; Narin et al. 1997; Hartmann and Henkel 2019; McKelvey and Rake 2020; Baruffaldi and Poege 2020). This is an interesting aspect considering that the overall evolution of published knowledge is directed by the collected interests of individual researchers mostly affiliated with universities that are largely financed by the government. Given the conventional understanding that research publications are mostly driven by governmental support (Mowery and Rosenberg 1999), the active engagement of firms in research and subsequent publishing leads to an important policy question: where to direct governmental support?

According to the literature on technological systems for innovation, the components constituting a technological system interact with each other, revealing a variety of problems and sources of knowledge needed to innovate (Carlsson and Stankiewicz 1991). Hughes (1987) emphasized that when technologies become increasingly embedded in a social structure, they stimulate reverse salients that fall behind the overall evolution of the system. Dealing with these sluggish components in the production and diffusion of a new technology requires the involvement of different types of institutions, including private and public actors (Antonelli 2001). The engagement of various types of institutions helps maintain diversity in technological systems, which is necessary to cope with potential obstacles in innovation.

From this viewpoint, firms’ engagement in research and publishing in an emerging field implies that the government might need to direct its support towards areas complementary and nonredundant to the firms’ investment. When leading firms invest heavily in the early stages of a field’s knowledge development, the field can lose its diversity as it can be locked into the direction pursued by the leading firms. Although optimists expect that AI will soon be widely applied, it has been proven that it is not straightforward to move from generic AI models to broad applications owing to factors falling behind, from technical to non-technical ones, in each application domain (Mansell and Steinmueller 2020). Careful maneuvers in the direction of governmental support can increase the diversity of components in the overall technological system, ultimately broadening the scope for selection among competing variants and coping with potential barriers.

To extend the above discussions, this study investigates the influence of firm papers on the overall direction of published knowledge evolution, which can be characterized by the significance of the impact on the formation of a knowledge trajectory or the direction of advancement within a technological paradigm (Dosi 1982). Our approach goes beyond previous observations that leading firms’ publishing can make an outstanding impact in that we evaluate whether firm papers can even lead the direction of knowledge evolution in an emerging field of AI.

2.3 Collaborative knowledge production

While large corporate labs usually conducted in-house research during the post-war period, since the 1980s, there has been an increasing trend of sharing the risks of producing new knowledge among heterogeneous organizations including firms, universities, and public research institutions. This trend is in line with the strong attention given to the open innovation literature (Chesbrough 2003) and the third mission of universities (i.e., contribution to economic and social development; Etzkowitz 2003) over the past few decades. The transition to inter-organizational and transdisciplinary research efforts was also noted by Gibbons et al. (1994) as basic properties of Mode 2 knowledge production, which is a representative way of producing new knowledge in modern society.

Within this trend, the number of university-industry (U–I) collaborations has risen significantly. Firms have enjoyed various benefits from this type of relationship. In particular, theoretical depth necessary to advance science-related technologies often must be obtained from experts in universities. However, just as firms manage the boundary between private and public knowledge, so do university researchers. University researchers can keep some of their capabilities secret to maintain their competitive advantage in the academic community (Hilgartner and Brandt-Rauf 1998). Therefore, even when a firm recognizes the valuable capability of university researchers, it is not necessarily able to exploit that capability (Hicks 1995). U–I collaborations have been extensively employed to resolve this issue, as collaborations allow firm researchers to gain formal access to university researchers’ valuable scientific knowledge. In addition, if a trust-based relationship between a university and a firm evolves over time, other valuable information beyond the boundary of formal collaborative activities can flow between the firm and university researchers. By maintaining connections with universities, firm researchers can also participate in a broader network of academia, which sometimes unexpectedly brings useful information necessary to develop technology that the firm needs.

Meanwhile, since collaboration connotes reciprocity (Katz and Martin 1997), university researchers can also benefit from the U-I collaboration. When high-quality resources necessary to improve research productivity are concentrated in firms, the U-I collaboration can be a channel that allows university researchers access to these firm resources. The resources can be equipment, data, and technological know-how, to name a few. In addition, through the relationships, university researchers can better understand key industrial problems. Such an understanding can lead university researchers to pay more attention to real-world problems when they conduct research. Considering that university researchers have been encouraged to contribute to economic development over the past 40 years (Etzkowitz 2002), they can significantly benefit from awareness of industrial issues.

Moreover, such awareness can even lead to advancement of scientific knowledge, as application-focused research may yield an unintended scientific discovery. The distinction between applied and basic research is often meaningless in the realms of science-related technologies such as computer science (CS) and medicine (Rosenberg 1990). Stokes (1997) argued that the major contributors to the advancement of science-related technology are people who address real-world problems but do not lose sight of the contribution to scientific understanding, such as Louis Pasteur who tried to solve fermentation problems in the wine industry but additionally created the science of bacteriology. Other “star scientists,” as defined by Zucker et al. (2002) have also performed this type of research. Hence, university researchers who are exposed to important industrial issues through the U–I collaborations are likely to be capable of performing salient research at the interface between science and technology.

Based on the reciprocal relationship between firms and university researchers, it can be said that U–I collaborations satisfy various necessary conditions that could lead to important research outcomes in the realms of emerging science-related technology. These conditions include the scientific knowledge base of university researchers, sufficient complementary resources provided by firms, and awareness of crucial industrial issues. Therefore, we expect that papers published by firm and non-firm collaborations are characterized by significantly higher novelty and conventionality, two qualitative characteristics related to the possibility of a paper gaining importance in the long run (Uzzi et al. 2013). High novelty means the extent to which the published knowledge includes novel ideas that can be obtained by allocating resources for enough exploration, while high conventionality characterizes to what extent a paper is well grounded in the body of academic discussions.

2.4 Firms’ influence on the evolution of published knowledge: the case of AI

Since we delve into the phenomenon in a specific sector (i.e., AI), it is difficult to extend our results to all other fields. To mitigate this concern, we suggest boundary conditions to extend the results of this study. This section summarizes the re-emergence of AI and describes some of its general features as a science-related technology and a type of Mode 2 knowledge. Based on the features, we derive a set of hypotheses.

Although knowledge production related to AI has only relatively recently begun to grow rapidly, the beginning of the field dates back to the mid-1950s. From the mid-1950s to the mid-1970s, optimism about humans achieving the ultimate goal of creating intelligent machines was high. However, in the face of unresolved failure in the underlying theoretical model defining AI, funding for AI temporarily stopped (Boden 2016). From the early 1980s, the field started to regain attention because of firms’ adoption of AI-based software called “expert systems,” which provide domain-specific solutions to firm managers by relying on the codified knowledge of experts. However, the technical improvement in expert systems relative to their extremely high price was unsatisfactory, and most investors and governments again decided that AI funding should pause from around the late 1980s until the early 2000s (Nilsson 2009). Although the turning point in the public’s perception of the field occurred in 1997, when IBM’s Deep Blue beat the world chess champion, the field started to regain broad attention with the realization of Moore’s law (Anthes 2017). Since the mid-2000s, there has been a resurgence in AI as the amount of data, computational power, and statistical machine learning tools have improved significantly (Anthes 2017). In particular, the methodological breakthrough by Geoffrey Hinton and his colleagues in 2006 suggested a creative way to deepen the structure of neural network models (i.e., deep-learning) (Hinton et al. 2006). The resurgence of AI also represents a kind of “new science-based technology,” a term that indicates that some technologies emerge through the synergy between older scientific knowledge and rapidly advancing information technology (IT) (Koumpis and Pavitt 1999). Since IT capacity is a key factor supporting the such R&D process, incumbent firms in many fields with superior IT capacities advance the frontier knowledge of an emerging technology despite the high risk.

Besides the re-emerging feature of AI, knowledge production in this field shows characteristics that are in accordance with those of science-related technology or Mode 2 knowledge. First, the field of AI may lack an agreed definition, but broadly, it indicates activities devoted to making intelligent machines (Simon 1996; Nilsson 2009). Such a description shows that the final goal of AI is oriented towards solving practical problems, which is expected to result in useful applications. AI has even been regarded as a type of enabling technology for various industrial purposes including manufacturing, medical equipment, geography, automatic vehicles, and internet services (Teece 2018). Such a nature implies that the problem definitions of AI are specified from real-world issues, which is consistent with the basic property of Mode 2 knowledge that “knowledge [is] produced in the context of application” (Gibbons et al. 1994, p. 3).

Although several disciplines are related, AI addresses some phenomena surrounding computers and is a part of computer science (Simon 1995). However, computer science has long been considered difficult to classify into any particular category (natural science, applied science, and technological R&D; Rosenberg and Nelson 1994). The field not only develops technology to achieve a goal (Bolander 2019) but also produces theoretical knowledge in relevant areas (Boden 2016). Therefore, much of the science and technology content has become indistinguishable in this field. Even within the computer science community, some researchers argue that the field is science, whereas others argue that it is closer to technology (Denning 2005). This corresponds with one of the properties of Mode 2 knowledge production: “Conventional terms—such as applied science, technological research, or research and development—are inadequate” (Gibbons et al. 1994, p. 2).

The approach to conducting AI research is divided into two main groups: one based on theory and logic and the other described as an empirical art that involves learning through apprenticeship and cases rather than rigorous theory (Nilsson 2009). Although recent progress in AI is more associated with the latter categorization (Boden 2016; Parnas 2017; Hutson 2018), which requires big data and computing power, frontier knowledge in the field is still produced by efforts involving both aspects. Empirical AI researchers describe their research method as heuristic search, which is based on rules of thumb generated through experimentation (Parnas 2017). They repeat experiments until they obtain a more intelligent machine by changing its design. In many cases, researchers do not clearly understand why one machine performs better than another. Such a property is consistent with that of engineering and is referred to as a blind search process of trial and error that recombines various technological components until one obtains successful results (Vincenti 1990; Fleming and Sorenson 2004). Along with its rather messy nature, AI researchers need help from science and mathematics to design better machines with reduced uncertainties when searching for new designs. In addition, many researchers still heavily rely on theory rather than trial and error, again connoting the science-related nature of AI technology. Owing to its characteristics, the field requires professional experts not only from computer science but also from cognitive science, psychology, statistics, and mathematics, among others (Boden 2016). Indeed, it is difficult to adequately handle the problems in this field by relying on the knowledge base of a single organization. Heterogeneity in terms of both scientific disciplines and tacit experience must be coordinated to adequately address problems. Such a property of knowledge production corresponds with one of the important dimensions of Mode 2 knowledge production, namely, “transdisciplinarity” (Gibbons et al. 1994, p. 4).

Because of the transdisciplinary nature of the field, R&D collaborations between firms and universities, and even between competing firms, have been actively pursued. From the perspective of firms, such collaborative efforts help them share the high risk of performing frontier R&D as well as mitigating the difficulties in solving complex problems. In the field of AI, there are several forms of U-I collaboration. For example, university and firm researchers often work together on a firm’s project, which is one of the most common forms of U-I collaboration. In addition, prominent professors in this field have taken the role of advising on and even leading firms’ AI research, which is another form of U-I collaboration in this area. Furthermore, a number of academic AI talents have shifted from academia to industry since the early 2010s (Gibney 2016). After Andrew Ng, a pioneering researcher in the field, joined Google from Stanford University, many academic experts followed in the same direction. This recent phenomenon is consistent with the “heterogeneity and organisational diversity” of Mode 2 knowledge production (Gibbons et al. 1994, p. 6).

Lastly, another important dimension of knowledge production in AI is the need for computational power in training and testing the designed machine. Advances in AI knowledge since around 2006 owe more to the rapidly increased capacity of hardware than to any other factor (Anthes 2017). This is consistent with another feature of Mode 2 knowledge production, described as follows: “The feature contributing to innovation under Mode 2 conditions is the role that computers and especially computational modeling have come to play, … [and these] can be used to meet a wide variety of uses and of building more sophisticated techniques and instruments that will enhance the design principle and its range of application” (Gibbons et al. 1994, p. 19).

As Mode 2 knowledge is usually produced in the context of application, we can expect firms performing AI research to play a leading role in suggesting research agendas where they have relevant complementary assets for application. Firms can devise novel technological solutions that are likely to be linked to commercialization, although the suggested ideas can be premature at an early stage. In particular, AI firms’ collaboration with researchers in universities or public research institutions is expected to create knowledge that is valuable in the long run. This is because they can adopt more transdisciplinary approaches based on understanding from both scientific knowledge and problems defined in the context of application. Given the nature of knowledge that firms can contribute to this emerging area, we expect that the knowledge evolution of the field will naturally converge towards a direction preferred by firms. This implies that private investments can shape the knowledge trajectory within the emerging technological paradigm with the emergence of science-related technologies.

  • Hypothesis 1. When science-related technologies emerge, firms’ publications influence the formation of published knowledge trajectory more than other publications.

  • Hypothesis 2. When science-related technologies emerge, publications by firm and non-firm collaborations show higher a) novelty and b) conventionality than other publications.

3 Methodology

3.1 Data

To answer the research questions, we use information on the papers in AI domain. In computer science, top-tier conferences are ranked higher than most journals, excluding a few highly ranked journals (Vardi 2009; Freyne et al. 2010).Footnote 3 The peer review process in conferences is much faster than that in journals, accelerating the progress of knowledge diffusion in the field of AI. We select target venues, including conferences and journals, and collect information about publications in the target venues from the SCOPUS database, which is a bibliographic database that provides comprehensive information about both journal and conference publications. The top 10 journals and top 10 conferencesFootnote 4 are listed by referring to Guide2Research,Footnote 5 a portal providing the rank of authors, journals, and conferences related to sub-fields of computer science.

Given the general-purpose nature of AI, studies in various domains can apply AI algorithms (WIPO 2019). Although the application of AI algorithms is important, setting the outlet boundaries in all potential fields of study makes it difficult to maintain the knowledge quality of the studies used in our analysis. Therefore, we consider papers published in highly reputed outlets that focus heavily on novel AI algorithms. Among the various sub-fields of computer science, we select a category titled “Machine Learning, Data Mining, and Artificial Intelligence.” Table 1 presents a list of highly reputed journals and conferences. The list of selected outlets covers the major outlets mentioned in recent relevant studies, such as Hartmann and Henkel (2019). Based on the final list of journals and conferences, we collect information on the articles published during 2006–2016. As noted in the introduction, “firm paper” indicates a research article in which at least one of the authors is affiliated to a firm (Arora et al. 2018). In this study, firms publishing research include those firms that (i) published at least one article in the field during 2006–2016 and (ii) are listed as a sponsor of the listed conferences (i.e. firms satisfying both (i) and (ii)). We obtain 56,981 papers consisting of 6848 firm papers and 50,133 others (non-firm papers) that include the papers from universities and public research institutions. Among the 6848 firm papers, 5419 are published by firm and non-firm collaboration, while the rest are published by only firm researchers. 120 firms are related to the collected firm papers.Footnote 6 The list of firms covers key industry players that appeared in a recent report by the WIPO (2019).

Table 1 Major conferences and journals

3.2 Variables and methods

3.2.1 Dependent variables and models

The unit of analyses used to answer our research questions is an individual paper. The dependent variable of interest in H1 is each paper’s influence on the formation of published knowledge trajectories. For H2, the dependent variables should reflect the content of the knowledge in each paper with respect to novelty (H2a) and conventionality (H2b).

Influence on the formation of the published knowledge trajectory

To examine the influence of each paper on the formation of the published knowledge trajectory (H1), we adopt a measure that reflects the minimum distance of each paper to the knowledge trajectories. A shorter distance represents larger influence of each paper on the knowledge trajectory formation. Boden (2016) noted that state-of-the-art AI includes an extraordinarily wide range of methodologies and therefore cannot be unified by any single core technique. Hence, we first divide the full citation network constructed from the collected AI papers into several groups to detect more detailed trajectories by sub-area. Using a well-established community detection technique (Blondel et al. 2008)Footnote 7 as implemented in the Gephi program, we find 22 communities that occupy more than 1% of the full citation network. The proportions of the five largest communities among all the communities chosen are as follows: 12.56%, 7.77%, 6.4%, 6.32%, and 5.44%. Modularity is 0.737, which is larger than a widely accepted threshold (i.e., 0.3) for judging whether communities are successfully grouped (Clauset et al. 2004). The identified communities can be understood as major academic groups focusing on different sub-domains within AI.Footnote 8

To identify the knowledge trajectory from each community, we employ the Search Path Node Pair (SPNP) measure, a connectivity measure widely used to detect the main paths in large citation networks (Hummon and Dereian 1989). This represents the key paths of knowledge flow in a network by allocating weights to each link based on how many times the particular link bridges pairs of nodes in the network.Footnote 9 Several prior studies have used this algorithm to identify critical knowledge trajectories from papers or patent citation networks (Verspagen 2007; Fontana et al. 2009; Martinelli and Nomaler 2014). We detect the main path for each community chosen.Footnote 10 For example, Fig. 1 presents the distinguished trajectory within the largest community along with firm papers lying on the trajectory. We compute how far each paper is from the trajectory (i.e., each paper’s minimum distance to one of the papers lying on the distinguished main paths) by regarding a direct citation linkage between two papers as the unit distance. For instance, if paper k can reach one of the trajectory papers by passing through at least two other papers, the minimum distance of paper k to the trajectory is 3. Because the distance value is infinite in the case where a paper does not have any forward citations, we find and set the maximum distance dm in which the number of papers without any forward citations becomes closest to the number of papers with distance dm when we regard distances greater than or equal to dm as dm. The dm satisfying this condition is 9 in our data. We use OLS regression to examine whether the firm papers’ distance to the knowledge trajectory is significantly shorter than that of other papers, which would imply a significant influence of firms’ publications (relative to other publications) on the formation of published knowledge trajectory.

Fig. 1
figure 1

Papers on the trajectory of the largest community. Note: Trajectory papers including at least one firm-affiliated author are in green and the other papers are in orange

Novelty and conventionality of published knowledge

We employ the suggestion in Uzzi et al. (2013), when measuring both the novelty (H2a) and conventionality (H2b) of knowledge represented in a paper. The authors suggested the z-score, which evaluates how often a particular combination of two distinct knowledge bases has been observed, compared with the expected degree in a certain period. The combination of knowledge bases is defined as a pair of different journals in the references of each paper, presuming that a journal addresses similar knowledge. When the knowledge combination appears less often than expected by chance, the combination is interpreted as a more unusual and atypical one. Several z-scores can be assigned to each paper because a paper usually includes numerous references. Uzzi et al. (2013) proposed that two summary statistics, namely, the median of the z-scores and left 10th percentile of the z-scores allocated to each paper, characterize conventionality and novelty, respectively.

However, applying this measure to our research context is somewhat problematic under the basic assumption that a journal (or a conference) is a distinct knowledge base. As previously mentioned, the boundary of AI is extremely broad and several journals and conferences try to accommodate this variety. Considering that a single journal or conference often addresses a broad range of AI-related research topics,Footnote 11 it is difficult to define a venue as the homogeneous unit of knowledge. Therefore, we adjust the measure used in Uzzi et al. (2013) by considering a “research topic” instead of a “journal” as a unit of the knowledge base, following Kaplan and Vakili (2015), which used topic modeling to specify the concept of knowledge recombinations from patent data. This way of adopting z-scores is also in line with Kim et al. (2016), which applied z-scores to patent data by regarding a single patent as a combination of different technology classes instead of using reference information.

To do this, we employ a text-mining technique called dynamic topic modeling (DTM) (Blei and Lafferty 2006). Based on a collection of documents, this algorithm captures not only the per-document topic proportion but also the evolution of detailed keywords composing each topic. The latter part is a differentiated aspect of the DTM from other static topic models. This is useful in our research context because today’s AI evolves rapidly with generating new keywords although most of the broad research themes are maintained.Footnote 12 DTM is conducted on the 56,981 articles collected, with the number of topics mentioned by scholars set to 100 to allow a feasible interpretation of each topic (Blei and Lafferty 2007; Chang et al. 2009; Hall et al. 2008). We obtain the per-paper topic proportion vectors, which consist of some outstandingly high values showing the paper’s strong relevance to some topics. However, most of the other values are close to zero, denoting a lack of relevance between the paper and most other topics. The topics assigned with significantly large proportions can be regarded as the key knowledge components of each paper. For instance, if a vector allocated to a paper has a value of 0.4., 0.2, and 0.3 for three topics (α, β, and γ) and 0.001 for the other 97 topics, one can consider three possible topic combinations (i.e., αβ, αγ, and βγ) in this paper.

Based on the DTM results, we compute the z-score for topics α and β in year t as follows:

$${z}_{\alpha \beta t}=\frac{o_{\alpha \beta t}-{\mu}_{\alpha \beta}}{\sigma_{\alpha \beta}},$$

where oαβt is the observed number of papers belonging to α and β in year t, μαβ is the expected number of papers belonging to α and β in year t based on the papers published in the previous year (i.e., t-1), and σαβ is the corresponding standard deviation. A one-year time lag is employed to reflect the rapidly changing research trend of emerging AI. Following Kim et al. (2016) and Uzzi et al. (2013), median and minimum z-scores are used as the conventionality and novelty of each topic pair, respectively. For both conventionality and novelty, the OLS regression model is applied because the dependent variables are normally distributed. The regression is conducted using the papers published during 2007–2016 (each year is t), with each of those papers’ conventionality and novelty computed based on papers published in the previous year (i.e., 2006–2015; each year is t-1). In addition, we drop papers assigned a single topic because z-scores can be calculated only when at least one topic pair exists. Of the 56,981 papers collected, 50,050 papers remain for the regression analysis.

3.2.2 Independent variables and controls

Independent variables

The main independent variable is a dummy variable indicating whether a paper is a firm paper (1) or not (0) (Firm Paper1 (Firm Involved or Not)). In addition, we further consider a categorical variable with three levels, differentiating among publications by only non-firm researchers (including those in universities or public research institutions) (0), both firm and non-firm researchers (1), and only firm researchers (2) (Firm Paper2 (Firm Only, Non-firm Only, or Firm/Non-firm Collaboration)).

Controls

We include several variables to control for other factors that may influence each paper’s impact on the published knowledge evolution. Garfield (1979) underlined that the average number of references per paper reflects the potential impact of papers in a given field. Hence, we control for the number of references in each paper (Number of References). The year of publication is included to control for the increasing popularity of AI during its recent period of development, and the expected positive relationship between the age of paper and distance to the trajectory (Publication Year (t)). Within the various sub-areas of AI, some research topics garner more attention in a period than other topics (Boden 2016), which may influence the impact of each paper. Therefore, we control for the size of the academic groups with which each paper is associated. We use the community detection results to consider the number of papers in each community as the size of the academic group having interest in a specific sub-area of AI (Size of Community). A dummy variable on whether the paper is published in a conference or journal is used as a control because computer science tends to prefer conferences as a channel for publishing high-quality papers, though some disagreement exists (Vardi 2009) (Conference Journal Dummy).

The number of authors is controlled for to reflect controversial evidence on whether multi-author publications are likely to have a higher impact than single-author publications (Gazni and Didegah 2011; Didegah and Thelwall 2013; Number of Authors). Regarding author-related characteristics, the reputation of authors is also a crucial factor that determines the impact of publications, besides their quality (Bornmann et al. 2012). Merton (1968) similarly mentioned the Matthew effect in science, which is the tendency to give credit to well-known scholars. Moreover, firms that invest in AI try to collaborate with highly reputed researchers (Gibney 2016). To capture this influence, we control for the authors’ average performance that relates to each paper in the two following complementary ways. First is the average number of the authors’ recent publications in computer science (Authors’ Average Number of CS Publications (t-4 ~ t)), which reflects the quantity of the authors’ research output. Second is the average of the authors’ h-index, which denotes the number of publications with a citation number ≥h (Hirsch 2005; Authors’ Average h-index at t).Footnote 13 The h-index reflects not only the quantity of the author’s research outputs but also their academic impact.

In addition, we control for the number of affiliations per paper because heterogeneity in author affiliations positively correlates with the impact of publications (Franceschet and Costantini 2010; Number of Affiliations). Moreover, the institutional characteristics of affiliations can shape researchers’ publishing behavior, potentially influencing the impact of each paper. For instance, since firm-affiliated researchers are less pressured to publish papers than those who are university-affiliated, firm researchers can selectively publish only part of their research outcomes. In addition, firm researchers should often follow the firm-level approach on strategic publishing regarding where, when, and what to publish, leading to a potential bias in firm publications and its subsequent impact (e.g., Li et al. 2015). Therefore, we control for the average academic performance of affiliations related to each paper using the average number of their recent publications in the CS field (Affiliations’ Average Number of CS Publications (t-4 ~ t)) and the average of their h-index (Affiliations’ Average h-index at t).

To compute for the described control variables related to authors and author affiliations, a range of publication information broader than the collected sample of AI papers should be considered. We additionally employ the Microsoft Academic Graph (MAG; Sinha et al. 2015),Footnote 14 which is a widely used database that includes publication records, citation relationships, fields of study,Footnote 15 and author or affiliation-specific identification codes.Footnote 16 Based on this database, we compute the author and affiliation-related control variables introduced above.Footnote 17

Lastly, some sub-fields of AI have been particularly affected by the recent deep-learning revolution. Although there are more relevant areas, computer vision and natural language processing (NLP) are representative cases of these sub-fields (LeCun et al. 2015). These sub-fields have strongly benefited from firms’ resources (big data and computational facilities). Considering firms’ significant investment in recent AI, we can conjecture that the impact of papers in such sub-fields may have been higher than the papers in other fields. Therefore, we control for whether the paper is related to vision or NLP (Firm-resource Sensitive Field). This variable is obtained by using fields of study information in MAG data.Footnote 18

4 Findings

Table 2 presents basic statistics and correlations. Two sub-tables are reported because we lost some observations in the process of computing novelty and conventionality, as described above. The upper and lower sub-tables correspond to the first (H1) and second (H2a and H2b) hypotheses, respectively. Although some observations are lost when we test H2, the two sub-tables show that the correlation and basic statistics are consistent overall. The averages of the number of authors and author affiliations in a single paper are 3.24 and 1.52, respectively. On average, authors publish about 11 CS papers within five years from the publication year t (i.e., t-4 ~ t), and their average h-index at year t is 6.99. On average, affiliations associated with a paper produce about 1288 CS papers during the period t-4 ~ t, and their average h-index at year t is 183.96. The minimum value of the authors’ (and affiliations’) average number of CS publications can be zero even though they have published at least one paper in the selected list of AI journals and conferences. This is because there are few AI papers not involved in the CS field categories that we employ from the MAG database (see Appendix E), owing to the transdisciplinary nature of AI. The Firm-resource Sensitive Field dummy shows that about one-third of the collected papers are related to vision or NLP.

Table 2 Descriptive statistics and correlations

Table 3 compares the following three paper groups—firm only, firm and non-firm collaboration, and non-firm only—in terms of the author and affiliation-related variables. This table shows that the firm and non-firm collaboration group consists of more papers that were produced by high performing researchers (in terms of both publishing quantity and h-index) than the other two groups. This suggests that firms selectively collaborate with outstanding researchers in universities. The table also shows that the average number of publications (in all fields) by affiliations associated with a single paper is highest in the non-firm paper group, which is an intuitive statistic because this group mostly consists of universities. By contrast, the number of CS publications is shown to be higher in the firm-related paper groups than in the non-firm paper group. This implies that compared with universities, a firm investing in basic research of emerging science-related technology can, on average, publish even more papers in that field, which is in line with Hicks’s (1995) observation. In addition, the average h-index of affiliations is shown to be highest in the firm and non-firm collaboration paper group, which reconfirms that firms selectively collaborate with highly reputed affiliations that include outstanding researchers.

Table 3 The averages of the groups’ control variables related to authors and affiliations

Table 4 reports the results of the regression analyses for H1 (OLS models; DV is the distance to the knowledge trajectory). Model 1 shows the OLS results with only the control variables. This baseline model reveals that the coefficients of the control variables are significant in the directions that we expected. Model 2 adds Firm Paper1 (Firm Involved or Not) and shows that the coefficient of this variable is significantly negative. That is, firm papers tend to have shorter distances to the knowledge trajectory (about 0.49 shorter than other papers when a direct citation distance equals 1), indicating their higher influence on the formation of the published knowledge trajectory compared to other papers. Model 3 adds Firm Paper2 (Firm Only, Non-firm Only, or Firm/Non-firm Collaboration) and shows that firm-only papers (2) had the shortest distance to the knowledge trajectory (negative and significant; about 0.74 shorter than non-firm only (0) papers when a direct citation distance equals 1), which confirms the group’s significantly higher influence on the knowledge trajectory formation than other papers. The 95% confidence intervals on the firm and non-firm collaboration (1) and firm-only (2) publications run from −0.49 to −0.35 and − 0.86 to −0.62, respectively, confirming the significantly higher influence of the latter than he former on the formation of knowledge trajectory. Overall, Table 4 shows the outstanding impact of papers that include firm-affiliated authors—particularly those authored by firm researchers only—on the trajectory formation of published knowledge. For a robustness check (see Table 6 in Appendix D), we provide the results of the log-normal regression (see Models 4–6 in Table 6), which is widely used to model the distance. In addition, because a considerable proportion of papers is disconnected or extremely far from the trajectory, we also suggest the results of the models that exclude papers without citations in the five years since its publication (see Models 7–9 in Table 6) and those excluding papers that are not involved in any of the major communities (i.e., community size is 0) (see Models 10–12 in Table 6).

Table 4 Regression results (H1)

Table 5 reports the regression results of H2a (the OLS models for DV, z-min (novelty)) and H2b (the OLS models for DV, z-med (conventionality)). The first set of three columns in Table 5 reports the H2a-related results. Model 1 contains only control variables, and Models 2 and 3 add Firm Paper1 (Firm Involved or Not) and Firm Paper2 (Firm Only, Non-firm Only, or Firm/Non-firm Collaboration), respectively. Model 2 shows that firm papers have significantly higher novelty (i.e., lower value of the minimum z-score) than the non-firm papers. In particular, Model 3 shows that papers produced by firm and non-firm collaboration (1) have significantly higher novelty than non-firm papers (0). The second set of three columns in Table 5 reports H2b-related results. Model 1 includes only control variables, and Models 2 and 3 add Firm Paper1 (Firm Involved or Not) and Firm Paper2 (Firm Only, Non-firm Only, or Firm/Non-firm Collaboration), respectively. Models 2 shows that firm papers have significantly higher conventionality (i.e., higher value of the median z-score), than non-firm papers. Based on Model 3, we find that the 95% confidence intervals on firm and non-firm collaboration (1) and firm-only (2) publications run from −0.02 to 0.26 and − 0.03 to 0.48, respectively. This indicates that there is no significant difference between these two groups in terms of conventionality. Since the ranges of z-min and z-med are fairly large (−20.03 to 114.61 and − 16.65 to 118.91, respectively; see Table 2), we conduct robustness check after removing extreme z-values from the original data. Table 7 in Appendix D shows the robust results after winsorization.

Table 5 Regression results (H2a and H2b)

In summary, Table 5 interestingly shows that during AI’s period of emergence, firm and non-firm collaboration papers satisfied both conditions of higher conventionality and novelty than non-firm papers. According to observations in Uzzi et al. (2013), publications that satisfy the conditions of both outstanding novelty and conventionality are likely to have high impact in the long run. In this regard, our findings imply that during the recent development of AI, collaborations between firms and universities have produced new knowledge that is likely to be critical in the long run.

5 Discussion and conclusion

This study investigated the influence of firms’ research publication on the evolution of published knowledge over the period of the resurgence of a science-related technology, specifically, AI. The empirical results suggest that papers with firm-affiliated authors, particularly those with only firm-affiliated author(s), have had a significantly higher influence on the direction of published knowledge evolution than other types of papers evaluated in this study. In addition, we show that collaborations between firms and non-firms have played a pivotal role in producing more conventional and novel knowledge. This finding suggests that the collaborated papers are likely to include potentially salient knowledge in the long run.

The results of this study are largely attributable to the distinctive characteristics of AI; therefore, we extend them in consideration of the boundary conditions outlined in Section 2.4. In addition to the technological nature of the study’s subject, we emphasize on the industrial context in which the assets necessary to develop and commercialize AI are concentrated in firms. We expect that the findings of this study would be extended to sectors in which the circumstances of the technology itself and the distribution of complementary assets are similar to those of AI. Despite constraints in generalizing the findings of this study, we, nevertheless, link our results to findings in the literature to explain our contributions.

Our findings imply that under high technological uncertainty, researchers can be tempted to focus their efforts on the research areas in which firms have published. The finding extends an argument of Hicks (1995) that firm papers signal the presence of unpublished tacit knowledge and resources of the respective firms. We showed that when firms’ unpublished assets can be used to contribute to the innovation and knowledge frontier of an emerging science-related technology, their publishing can affect the type of knowledge that researchers produce (i.e., where to focus their research efforts).

In terms of the evolution of science-related technology, our findings suggest a case in which published knowledge is significantly driven by private investments instead of public ones. This is an interesting but less studied phenomenon, requiring further investigation to understand its implications. While the support of private money has led to the evolution of published knowledge, we find that both firms and non-firm entities have contributed to knowledge evolution through their own strengths.

Our results have implications for science and technology policies. First, while our main interpretations focus on non-firm researchers who collaborate with firm researchers, the basic statistics in Table 2 show that most papers are still produced solely by non-firm researchers. This implies that many researchers in universities and public research institutions have addressed topics less attended by firms. The diversity of knowledge maintained by non-firm researchers is necessary to create another breakthrough in the future (Nieto and Santamaría 2007; Uzzi et al. 2013). In this vein, to sustain diversity in research topics, which will trigger another breakthrough in the long run, greater public funding for AI research should be allocated to research topics in which firms are less interested in.

Second, innovation can be achieved only when the components relevant to technology commercialization are successfully coordinated. Although the private sector invests in and leads AI research in which they have complementary assets, there may be hurdles firms cannot easily handle in the long process of innovation. For example, although firms interested in autonomous vehicles invest significantly in and impact computer vision and robotics research, infrastructure and ethical issues remain critical challenges to successful commercialization and adoption of autonomous vehicles. This concern is relevant to the concept of “reverse salients” (Hughes 1987), which indicates factors deterring innovation within a system, including technological, institutional, market, infrastructure, and legal aspects (Bijker et al. 1989; Mulder and Knot 2001; Takeishi and Lee 2005; Murmann and Frenken 2006). To make the firms’ investment in knowledge production more meaningful, the government should help mitigate potential obstacles to innovation that private actors cannot handle easily.

The implications of the role of public funding are relevant to recent evidence of China’s emergence in the field of AI. In line with our results, Lundvall and Rikap (2022) suggested the concept of “Corporate Innovation System” in the field of AI, underlining the dominant role of tech giants in leading the direction of the field evolution. In particular, the authors emphasize the co-evolution between the national and corporate innovation systems in China, which offers a new window of opportunity for the country. China’s government has provided a stepping stone for the country to catch up in AI by funding AI research of top universities and public research institutions and investing in AI-related infrastructure, such as markets for data and digital services. The case of China shows the importance of the government’s role in supporting targeted areas that can facilitate innovation, given that firms lead knowledge development in the field.

From the perspective of firms, the results can be interpreted to mean that firms engaged in publishing have had opportunities to drive knowledge evolution in the direction they chose to publish—opportunities that imply such firms’ research investments have gained legitimacy. A desirable scenario for firms investing in AI research is that the emerging knowledge evolves in a direction that enables them to leverage their resources and capabilities efficiently. Therefore, the value of gaining legitimacy for a firm’s research investment in an uncertain technology is significant.

Some of the limitations of this study suggest future research directions. The first limitation of this study is the uncertain scope of generalizing the results. Although we highlight the scope covered by this study, future studies are needed to investigate whether and how firms publishing on areas other than AI can (differently or similarly) influence the evolution of published knowledge. Second, while this study sheds light on the implications of firms’ publications from the policy perspective, another area that must be studied is when firms can benefit from publishing. Although prior studies have shown some of the motivational aspects of publishing for firms (Rosenberg 1990; Hicks 1995; Polidoro and Toh 2011), the consequences of publishing remain an unanswered question. Accordingly, future studies should explore firms’ benefits from publishing from such aspects as, say, knowledge spillovers and incentivizing firm-affiliated researchers. Third, although we used categorical variables to distinguish firm papers, future studies can use more fine-grained measures reflecting the exact share of firm-affiliated authors in paper to get further understandings of firms’ engagement in research publication. Lastly, reflecting on the results of the present study, further consideration of firms that forge deep links with universities is needed. We show that firms not only provide significant funding to university researchers but also help build the main research trajectories for some emerging science-related technologies. This provides evidence of the bidirectional impact between industry and academia. Future studies are needed to investigate the details of such bidirectional linkages in more depth.