Introduction

The roles of public and private sectors in the discovery and development of new drugs are part of a long-running debate. In the past decades, the rise of novel collaborations in the form of industry-academy partnerships have blurred the traditional border between the two sectors (Etzkowitz 2003; Leydesdorff and Etzkowitz 1996; Bozeman et al. 2013; Bozeman 2000). Budgetary concerns and highly priced medicines are yielding increasing pressure on governments and raising new questions about the rates of return from public spending. Moreover, the argument that public sector is responsible for a large share of drug development, have fuelled proposals to recoup profits from government funded drugs and to avoid tax payers to pay twice: first with taxes for publicly funded research, and then through monopoly of prices or restricted access (Sampat and Lichtenberg 2011).

The extent of public sector involvement in basic scientific research is recognized under the major role played by governmental funding agencies which affect the cognitive development of science and ultimately the economic growth and competitiveness (Kearnes and Wienroth 2009). However, the relation between funding input and research outputs is difficult to map due to the number of funding sources and the variety of collaborative relationships.

To probe collaboration patterns, the use of publication data has been prominently focused on authorship, as an indicator of scientific productivity, and citations as a measure of impact (Abramo et al. 2009). More recently, however, the study of the acknowledgments section of published literatures has taken part to so-called “reward triangle” and it is gaining ground as a source of data (Costas and van Leeuwen 2012; Cronin and Weaver 1995). In particular, acknowledgments of financial support are of prominent importance as they indicate the source of funding or economic support which made possible the research and the publication. Moreover, acknowledgments have the potential to map the funding inputs without the need to gain direct access to data via funders or researchers individually (Rigby 2011; Grassano et al. 2017). Accordingly, the study of acknowledgment can also be of interest to highlight research policy purposes (Álvarez-Bornstein et al. 2017; Paul-Hus et al. 2016).

Given the aforementioned potentials for using publications data, in this study we seek to improve our understanding on the financial aspects underlying the basic research which led to the discovery of pharmaceuticals. In particular, we explore the scientific production of scientists involved in the creation of biotechnology companies in United Kingdom (UK) and which originated a drug development project. Our objective is to identify the nature of the support acknowledged by the publications authored by the founders of such enterprises. We seek to shed light on the financial contribution of the public and private organizations to the science which anticipated the company creation. Doing so, we contribute to highlight some the financial levers which characterize the earliest and riskiest stage of pharmaceutical innovation.

The remainder of this paper is organised as follows. First, we present the conceptual motivation behind this study which we use to formulate a set of research questions. Next, we describe the study design and the methodology we applied to conduct our analysis. We subsequently present the results of our study and we discuss the main policy implications.

Previous literature and objectives

Since the biotech revolution, much has changed in the paradigm by which biomedical research and development (R&D) are conducted and theorized. The linear model of innovation is one of the first conceptual frameworks that was developed to understand the relation between science, technology and their application to the economy (Godin 2006). This approach defines innovation as causal process that leads from basic to applied research and development ending with production and diffusion (Godin 2006). Consistently with the literature that followed (Bush 2001) seminal work, much basic biomedical research is conducted in academic/non-profit institutions and is funded by the public sector (Zycher et al. 2010). Conversely, the bulk of applied and development science that concerns the translation of the research findings into products or processes is funded mostly by the private sector. Public basic research is characterized by a high degree of uncertainty in both its scientific maturity and its potential market applicability (Toole 2007). Therefore, the private sector has inadequate incentives to invest in basic research (Galkina Cleary et al. 2018; Arrow 1978).

In the last decades, many criticisms concerning the linearity of the model have been raised and authors have claimed the inclusion of the dynamic interactions between academic and commercial sectors (Cockburn and Henderson 1996, 1998). Since the enactment of the Bayh–Dole Act in the USA in 1980, there has been a substantial rise in the commercialization of science and other forms of university technology transfer (Siegel and Wright 2015). Universities have broadened the traditional mission of educating students to include the patenting and commercialization of research discoveries (Bok 2009). As a result, multiple channels and processes of knowledge interaction have been established between universities and industry which have been featured in the growing literature on university–industry relations (Bekkers and Freitas 2008; Bercovitz and Feldman 2006; Bercovitz et al. 2001).

In this context, an increasing number of university scientists started for-profit companies to develop their scientific breakthroughs (Etzkowitz 1983; Shane and Khurana 2003; Stuart and Ding 2006). In particular, academic scientists have been of critical importance for the emergence of the biotechnology industry. Mansfield (1991) pointed out that the pharmaceutical industry had the highest percentage of new products based on academic research. Many start-ups have been either founded by scientists or heavily rely on work of public science (Zucker et al. 1998; McMillan et al. 2000; Audretsch and Stephan 1996). Stuart and Ding (2006) estimate that half of all biotechnology firms have been founded by university scientists, most of whom maintained academic appointments post-founding. Azoulay et al. (2007) found that 38% of the members of a random sample of 3800 U.S.-based, academic life scientists had, at some point in their careers, co-authored one or more papers with scientists working in the private-sector.

Past research has employed publications data to study company innovative performances. For example, using bibliographic information of researchers affiliated with 190 UK-based biotechnology firms, Jong and Slavova (2014) demonstrated that firms adopting open science strategies were more prominent in the R&D landscape with benefits related to the development of radical products innovations. Some empirical studies on scientific human capital have demonstrated the varying performance impact of a mix of scientists in terms of their orientation towards publishing and patenting (Baba et al. 2009; Stokes 2011; Subramanian et al. 2013; Zucker et al. 2002). For these reasons, published studies by founding scientists contain a rich source of information which can be used to probe probing collaboration patterns, interdisciplinary linkages and other research spill overs (Wang and Shapira 2015). This is confirmed by Sternitzke (2010) who reported that each drug is accompanied by, on average, about 19 journal publications and 23 additional patents.

In the past literature, some attempts to make use of publication data were made either through case studies or by studies which used grant proposals, CV or performance report as sources for funding information (Stuart and Ding 2006; Heinze et al. 2007). The acknowledgment field of publications has been used by a growing number of bibliometric studies. Until recent times, it was very difficult to carry out analyses on acknowledgments since this information was not available in bibliographic databases. However, since 2008, Web of Science (WoS) started including funding acknowledgment data and this opened up new possibilities for investigations. Some studies analysed the acknowledgment sections by disciplines or countries (Díaz-Faes and Bordons 2014; Wang and Shapira 2015; Costas and van Leeuwen 2012). These highlighted the acknowledgment appeared more frequent in Anglo-American journals and in the so-called ‘hard sciences’ such as chemistry and psychology (Cronin et al. 2004). Other contributions measured the role of funded articles as an indicator of research quality (Gillett 1991; Costas and van Leeuwen 2012; Zhao 2010). Also, partnerships between the public and private sectors in multiple disciplinary areas have been studied using funding acknowledgement section of Spanish articles (Morillo 2016) founding that 79% was financed by the public and 18% in combination with private. The accuracy and completeness of the extraction process adopted by the extant literature have been the object of some recent contribution which also questioned and tested the robustness of the bibliographical databases reporting funding information (Rigby 2011; Grassano et al. 2017).

Yet, despite the potentials and the magnitude of publications as a source of data, the existing body of literature have rarely employed these into the analysis of the funding of pharmaceutical innovation. Based on the previous research, this paper is a first attempt to analyse the nature and the scope of the financial contribution acknowledged by the literature which lead to biopharmaceutical discoveries. To do so, we chose to analyse the funding acknowledgements (FA) in scientific publications authored by the founders of biopharmaceutical companies before the incorporation. Our focus is, therefore, the basic and applied research that is undertaken primarily for the advancement of scientific knowledge, with or without a specific practical application in view (Godin 2006). The main objectives addressed in this article are the following:

  • To explore the nature of the financial support acknowledged in the publications by scientists involved in the foundation of biotechnology companies.

  • To highlight the distribution of funding organizations across countries and disciplines.

  • To analyse the extent to which funding information have higher impact in terms of citations.

Moreover, based on the results of the proposed objectives, we aim to critically assess the position of public and private sector towards the funding and of the earliest stage of pharmaceutical innovation and discuss potential research policy implications.

Methods

Sample

To construct our sample, we identified 91 UK-based biotechnology firms founded between 1996 and 2016, which originated at least one drug development projectFootnote 1 and received venture capital (VC) support according to the GlobalData Pharma Intelligence Centre’s database. We assumed that investments from VC represented a promising stage at which discoveries made in basic research may conduct and after which there might be further investments in development research.

Further, we collected the biographical information of the founding scientists (full name, role, previous affiliation). To do so, we accessed Beaurhurst database that contains key information on high-growth companies that have secured equity fundraising in UK. When founders’ or primary scientists’ personal details were not reported, we conducted a web search of companies’ websites. For each company, multiple identities were selected and ranked according to their role. Specifically we gave more relevance to the scientists or those individuals that, accordingly to the information provided, were involved in the identification of the company science and its foundation (see “Appendix” for ranking criteria). The collection was conducted by two independent researchers, results have been compared afterword; any disagreement and risk of bias were solved through a discussion between the authors.

Data collection

A literature search was conducted using two search queries on different bibliographic databases.

First (method 1), we aimed to capture the most relevant publications by the founding scientists before the company creation. Accordingly, using Thomson Reuter’s Web of Science (WoS) database, we collected the ten most cited articles published by the company founder before the year of the incorporation.

Method 1, search query author: “name of founding scientist”_ FILTER < (year of company foundation)

A second research query (method 2) was made associating the founders’ identity with the information regarding the products in the company pipeline. Details of the companies’ pipelines were downloaded from GlobalData database. Specifically, we looked at compounds at the most advanced stage in development in each company pipeline. Next, we collected information on: (a) the disease or condition for which the drug is or is seeking approval (therapy area); (b) the structural, biological, chemical and/or pharmaceutical features of the drug (molecule type); (c) the specific effect on the indication or symptoms that are intended to treat (mechanism of action).

Accordingly we downloaded the first publication appearing on Google Scholar for each of the included founding scientists. Scholar is recognized as one of the few academic search engines that combines several approaches in a single algorithm as a result of weighing the full text, the author, the publication in which the articles appears and how often the piece has been cited in other scholarly literature (Beel and Gipp 2009). This approach allowed the identification of the most relevant article per author according to the company science.

Method 2, search query “Mechanism of action “AND/OR” molecule type “AND/OR “therapy area” AND author: “name of scientific founder(s)”_ FILTER < (year of company foundation)

As a result of the two collecting strategies we aimed to include the most influential publications in the founders’ backgrounds (method 1) including the research made in the area that was then carried on within the enterprise as the base for the drug development (method 2).

Overall, we collected only articles written in English and published in peer-reviewed scientific journals. Meeting abstracts, commentaries and reviews were excluded. This two-steps procedure yielded 714 results (620 with method 1 and 94 using method 2). Duplicates originating from the two research methods (77 publications, 11%) were removed.

Data analysis

Data cleansing was required to extract and harmonize the information contained in our set of publications. A first problem concerned the standardisation and disambiguation of author names. This arises when several authors share the same name, but also one author expresses his/her name in different ways. As pointed out by D’Angelo et al. (2011), manual inspection to disambiguate authors can be very effective for small population of scientists and data integration from external sources can be very helpful to ease the task. In our work, we used the information collected about the founders’ background from companies’ websites to make an efficient use of the “article group” option provided by WoS.

Secondly, our sample was checked to identify funding information contained in the acknowledgement section of each article. Due to the reduced size of our sample, we opted for extracting, coding and interpreting funding data in addition to WoS manually. Accordingly to Grassano et al. (2017), when there was no clear way to detect the financial nature of the support, we adopted a conservative approach and the acknowledged organisation was not included in the list of funders. Also, as the intention of our work was to highlight the financial contribution supporting the research, we included only funding data that were explicitly reported to the founder or acknowledged to the study, with the exclusion of all funding provided explicitly to other authors. Once the relevant funding information for a given publication were identified, we extracted the full name of the organisation and added the code of the country where this was located and labelled according to the scope (public, private). Governmental bodies, charities and non-profit organisations were grouped as ‘public’, whereas private foundations held by commercial organisations were included under ‘private’ (Sussex et al. 2016; Viergever and Hendriks 2016). A thesaurus of the various names and acronyms of funding agencies was created.

Thirdly, affiliation analysis was conducted following the approach applied by Hottenrott and Lawson (2017). Specifically, each author on the selected publications was attributed one or more institution based on the [C1] address field in WoS. All addresses were coded by institution type and assigned to a unique code. This was conducted semi-manually, meaning that a search algorithm containing word elements such as “univ”, “hosp”, “ltd” was undertaken using Excel, and these were marketed as universities, hospitals or companies respectively. The retrieved institutions were finally coded as belonging to the higher education sector (HEI),Footnote 2 public or semi-governmental research institutes (PRO), private companies (PRIV).

The bibliometric analysis of the acknowledgement section as well as authors affiliations was performed in Excel. The Statistical Package for the Social Sciences (SPSS, version 24.0 for Windows) and STATA were used to perform statistical analysis and test relations in our dataset. Figure 1 illustrates the study design, inclusion criteria and data collection strategy.

Fig. 1
figure 1

Study design

Results

A total of 637 publications, made between 1970 and 2015 and authored by 141 founders of 91 biotechnology companies was included in our study. Funding data addressed to the founder/scientist or generically acknowledged to the study was disclosed in 403 (63%) articles (Fig. 2).

Fig. 2
figure 2

Descriptive results

The support from public institutions was reported in 329 (82%) publications whereas that from the industry or a combination of private and public organisations (“mix”) was found in 28 (7%) and 46 (11%) respectively. Authors were affiliated to the higher education sector (HEI) in 72% of our sample, 14% to public or semi-governmental research institutes (PRO) and 10% to private companies (PRIV); 3% showed multiple affiliations (Fig. 3).

Fig. 3
figure 3

Number of publications by type of funding a and authors affiliation b

Table 1 illustrates the number of publications by type of funder and authors’ affiliation. When the publications were privately financed, authors were likely to be affiliated with private organisations (52% vs 6% of publicly financed). A Fisher’s exact test of independence was performed and a significant relation between type of funding and author affiliation was found (p < .001). Overall, these results suggest that the distribution of funding data between affiliations is not casual and authors belonging to universities or to private corporations are more likely to attract public or private funding respectively (p < .001).

Table 1 Funding to publications, affiliations

The study also demonstrates as a myriad of different organizations are involved in the funding of biomedical research that leads to the creation of biotechnology enterprises in UK. As such, 261 different funders (206 public, 55 private) were individuated. Major funders, defined as those appearing in at least 10 publications, appeared in 70% of our sample showing that the majority of biomedical research was funded by a small number of organisations. Among public funders, the Wellcome Trust, Medical Research Council and the National Institute of Health (NIH) were the most acknowledged together with Cancer Research UK. The studies in our sample were supported by 77 public institutions based in UK that compared in 256 publications (63%). Also the contribution of 59 US funding organisations was acknowledged (24%) together with that of organisations from 22 different nationalities.

Funding acknowledgements also enabled the exploration of the extent to which funders are jointly disclosed in publications. In Fig. 4c we examine the co-occurrence of the ten major public funders. The analysis revealed that public funders most frequently jointly acknowledged were the Medical Research Council together with the Wellcome Trust or Cancer Research UK. Moreover, it is worth noting that not in a small proportion (22%), funding organisations were the same as the author affiliations and this value could be higher if we included all other authors’ affiliation in the check in addition to founders.

Fig. 4
figure 4

Top-10 public a and private funders b, co-occurrence matrix c and scope of funders d

The WoS category classification enabled to compare the content of the studies with the reported funding organisation and to analyse the topics that public and private funders mostly support. The areas with most publications were biochemistry, oncology and science and technology. Figure 5a, b profile public and private funders in terms of research domain relative to the number of publications containing funding data. All major funders support was primarily focused on biochemistry and molecular biology. The Wellcome Trust together with the Medical Research Council showed a focus on science and technology and immunology. Cell biology was mostly investigated by Cancer Research UK that also showed a pronounced attention towards scientific production in oncology.

Fig. 5
figure 5

Research areas supported by public a, private or mix b, top-4 major funders c

Further bibliometric indicators were used to gain a deeper understanding of how the interaction between public and private sectors take place within the production of scientific knowledge that leads primary scientists to the creation of biotechnology companies. Assuming the informal sense that the first and last author positions are generally perceived as the ‘key’ positions on a paper, the analysis demonstrated that when authors were publicly affiliated they were more often reported as first or last authors (80%) than private (72%). Overall, founders/scientists appeared first or last in byline position in 57% of our sample demonstrating a substantial contribution to the published work.

The impact of publications as measured by the number of citations per year was also assessed by applying non-parametric tests. Specifically, a Mann–Whitney U test showed that the number of citations per year was greater for publications with funding acknowledgements than for the articles that did not disclosed any funding to the author or the study (p < .001). On the other hand, a Kruskal–Wallis H test (p = .148) indicated that the distribution of citations per year was not statistically different across publications that received public funding compared to private or a mix of both sectors.

Discussion

The contribution of public and private entities to the early stages of pharmaceutical innovation was the object of this paper. We highlighted the public sector plays a key role in supporting the riskiest stage of pharmaceutical innovation that is when basic, blue-sky research is conducted with no practical or economical application in view. The work found that when financial acknowledgments are disclosed, the number of citations is higher and therefore the broader the impact of the publication. As a consequence, we may derive the basic research, if conducted with the support by public or private organisation has better chances to be the ground for future innovations.

The topic has been investigated through a bottom-up approach taking in analysis the publications made by leading scientists figuring as the founders of 91 biotechnology companies headquartered in UK. The discovery of innovative pharmaceuticals was explored assuming the production of scientific knowledge made before the incorporation to play a key role in the identification of the potentials for business. Discoveries made in basic research are significant for the process that leads to innovation. Specifically, the technology or knowledge transfer defined as the “movement of know-how, technical knowledge or technology from one organizational setting to another”(Bozeman 2000), plays an increasingly important role in stimulating economic development (Siegel et al. 2007). In this context, it has been recognized the research support from government or industry could play as catalyst for enabling the innovation process. However, despite the growing emphasis on formalizing the interaction that take place between the higher educational system and industry as a vehicle to enhance innovation, the financial aspects related to this phenomenon were partially unexplored. As such, we analysed the type of support provided to the studies upon which we assumed the authors, once identified the potential of the results achieved, have started a commercial business. Contrarily we did not aim to assess or map the support given to biomedical research overall. Doing so, the current study sheds light on some financial aspects related to the discovery of pharmaceuticals.

Specifically, here we demonstrate the public sector plays a fundamental role in the financing of biopharmaceutical discoveries and so the risks associated. Using bibliographic information as a proxy, we evidenced the major role of the public sector as reported in the acknowledgments sections and through the affiliation of most of the authors. Surprisingly, the study found a tight relation between the type of support provided to publications and authors’ affiliation. In particular, support from private sector was almost 4 times higher when the founders were already privately affiliated and lower (− 38%) when these, before starting their own company, were belonging to public or non-profit organisations. This was also evidenced for those studies financed by public institutions in which, however, the relation was weaker. Also, we demonstrated that the impact of publications that acknowledged financial support either by the public, private or a mix of both sector was higher in terms of annual citations. Contrarily, the type of funding did not seem to influence the number of yearly citations. Furthermore, we have examined the extent to which different organisations are acknowledged as funders of scientific publications. As a result of our investigation, we found a scientific landscape populated by a large number of different institutions. The distribution among publications was highly skewed meaning that few organisations financed the majority (70%) of the sample. Moreover, it is worth to notice that the support given by public institutions was more focused on broader areas of investigation such as biochemistry, cell biology whereas private organisations were contributing to more applied topics such as oncology, immunology and chemistry.

Furthermore, the application of bibliometric analysis to publication data have shown potentials to inform policy making on research funding. Here, the retrieved publication were used to profile the research portfolios of major public funders. As a result, the number of studies in oncology published with the support of Cancer Research UK (CRUK) were three times those of other institutions and constituted 20% of CRUK portfolio. Also, a specific focus on neurosciences and neurology was found in publications that disclosed the Wellcome Trust as supporting institution. In addition, as a result of the manual coding of the retrieved literature, the study measured the precision of WoS in reporting the information contained in the acknowledgement section. Consistently with the previous literature on this topic, we found that WoS did not report funding data for 20 publications (3%) for which our manual check evidenced financial support (false negatives).

In this study, we provide evidence whilst fully aware of the complexity of attributing a single conceptual framework to this topic and the limitations related to the use of bibliometric techniques in this field (Reichert and Milne 2002; Cockburn and Henderson 1996). A first critical aspect is associated with the manual coding of acknowledgements paratexts required to gain the appropriate understanding of the funding information to authors. Certainty the inclusion of the funding text [FU] in WoS since 2008 has enabled the analysis of acknowledgement section as the “third edge” of the reward triangle in bibliometric studies together with authorships and citations (Costas and van Leeuwen 2012).

However, the incompleteness of data provided by WoS and the lack of precision as highlighted by Grassano et al. (2017), have partially limited the use of this source. In addition, as said, our investigation was focused solely on authors that we identified as the founder of the biotechnology companies. Hence, all funding information attributed to other authors were due to be excluded and from this we opted for a manual coding of the acknowledgement sections. Furthermore there were cases in which acknowledged information were not reported as “financial support” or “funding” but simply “supported by” and these were interpreted as financial as well. Consequently to that, the number of financed studies included in the study may be overestimated. Another limitation is represented by the way the identities of authors and founders have been identified and matched. The inclusion, despite it was made through a double-blind process conducted by two independent researchers, involved judgments and was subject to the reliability of available data. Precisely, we relied on the information contained in the accessed databases and companies’ websites to determine whether the name of any of the present or past employer was reported as the founder or the leading scientist. Other names, of other people possibly involved in the creation of such enterprises may have been omitted and thus not included in the analysis. Also, when retrieving publications from the online databases, a number of homonymies were found and therefore limitations in associating names with titles were due to the required process of disambiguation.

In conclusion, although the study covers an extensive period of time, the recent boost in public–private efforts, such as the Innovative Medicine Initiative (IMI) at European level, may not be fully evidenced in our data. The IMI is a public–private partnership between the European Union and the European Federation of Pharmaceutical Industries which have produced more than four thousand publications involving collaboration between researchers across different sectors, institutions and countries (Laverty and Meulien 2019). Our study does however extend the extant literature on academic founders by focusing on scientific founders of biotechnology companies. By including different career backgrounds, our study demonstrates that company founders act as boundary-spanning ‘gatekeepers’ who facilitate access to socially embedded knowledge (Gittelman and Kogut 2003).

Future research may seek to clarify drivers of collaborative research by taking into account multiple processes of knowledge transfer (Fabiano et al. 2020). Research efforts are also encouraged to keep track of the funding given to applied research, for example by including publications made after the company foundation.

Conclusions

In this study, we traced the relationship between knowledge and innovation by taking a bottom up approach to identify the origin of an idea. Rather than observing research budgets top-down, we focused our analysis on scientists who published and started a company. By looking into this highly specific field of knowledge production, our results confirm the critical role played by non-profit research institutions in the financing of innovation.

Given the scale and the complexity of the challenges health systems are facing, the results of our study support the thesis that further polarization may occur if collaborative initiatives at research level are not encouraged. Collaboration at pre-competitive stage is a key aspect required to accelerate the flow of knowledge between university and industry. Therefore, we welcome policy initiatives and research efforts bringing multidisciplinary and multi-stakeholders together.

Furthermore, given the key role governments play a in both promoting innovation and in the generation of new market opportunities, risks and benefits need to be shared amongst actors at industrial level (Mazzucato 2011). From a policy perspective, the outcome of our research advocates a shift towards a mission-oriented approach in which governments set the direction of innovation towards public needs. To support this, the pricing structure of pharmaceuticals should reflect the dual role taxpayers play in funding early innovation and as final users.