1 Introduction and related literature

Over time, an increasingly amount of economic literature has been devoted to embed regional innovation systems (RISs) within the perspective of evolutionary economics.

Several contributions, see for instance Cooke et al. (1997, 1998), Lambooy and Boschma (2001), Iammarino (2005), Morgan (2007), Uyarra (2010) and Gunnarson and Wallin (2011), have therefore attempted to link regional science to evolutionary economics, sharing two main features.

On the one hand, indeed, these papers are concerned with the role of regional policies in shaping the innovative performance of regions.

According to Cooke et al. (1997, 1998), three main policy instruments can be employed by regional authorities to enhance regional innovation.

Specifically, regional policies can enhance innovation by increasing the amount of financial resources devoted to innovative projects, by financing infrastructures-intended as both telecommunications infrastructures and network links-and, finally, by favouring regional cooperation and learning.

Lambooy and Boschma (2001), instead, emphasise the relevance of policies aimed at creating a favourable environment for the creation, the diffusion and the adoption of new technologies, though they warn, in our opinion correctly, that these policies are destined to fail if they deviate from the local context.

Gunnarson and Wallin (2011), on the other hand, suggest that the regional governance can control the development path of innovation by facilitating cooperative and interactive behaviours and by coordinating public investments devoted to the creation of knowledge.

The second, and most important for the purposes of this paper, common element shared by these contributions, is instead represented by the explicit acknowledgment of the relevance of local institutions in explaining the dynamics of regional innovation systems.

In particular, the co-evolution of institutions and innovation is assumed to be a key factor in explaining the trajectories of regional innovation systems.

As notably remarked by Cooke et al. (1998, page 1580) “the systematic dimension of innovation at regional level relies upon a combination of a well-endowed organisational infrastructure and an associative superstructure composed of an embedded civil society capable of activating social capital. Institutionally speaking, embeddedness will reside in the collective social order which evolves according to an informal microconstitution composed of microregulatory conventions, habits, routines, and rules of the game. Systematic innovation is facilitated by the constructive interaction of the institutional order and the organisation infrastructure”.

Morgan (2007), on the other hand, stresses that, given the evolutionary nature of capitalism, the relevance of the quality of institutions lies in their ability to reduce the uncertainty and the instability faced by innovative entrepreneurs.

Haschka and Herwartz (2020), propose a Bayesian stochastic frontier approach to assess the determinants of innovation efficiency in four high-technology sectors in Europe (i.e. chemicals, general electronics, pharmaceutical products and rubber and plastic products). Although their evidence indicates that R&D investments represent the major driver of innovation activities, a key role found for the flows of knowledge from neighbour areas or firms. Specifically, the geographical closeness to universities and competitors are found to stimulate the creation and the development of local networks and, through that, to promote and improve innovativeness. In particular, they show that firm-level innovation increases with the level of market competition, as the latter incentivises firms to undertake innovative activities to avoid other firms to prevail in the market. Further, their evidence points out to a differentiated effect of knowledge spillovers, which crucially depend on the sector examined. Indeed, firms operating in highly competitive sectors, such as pharmaceutical and chemicals, are more incentivised to pursue firm-level efficiency compared to general electronics and rubber and plastic products, where the degree of competition is lower.

Though these contributions explicitly acknowledge that the regional institutional environment can play a key role in affecting regional innovation systems, little is said on how institutional quality affects their efficiency.

In this paper, using regional data from Italy over the 2004–2011 period, we investigate the effects of local institutional quality on the regional innovation system (hereafter RIS) efficiency. In particular, we believe that an analysis based on Italy represents a promising field of analysis for various different reasons. The first, and somewhat more obvious, is represented by the persistent and large regional divides, in terms of both economic outcomes and institutional quality, between the core areas of the Centre-North and the peripheral regions of the South. However, beyond the persistent regional dualism, whose roots can be found in the different effectiveness and efficiency of the institutional environment (Tabellini, 2010; D’Agostino and Scarlato, 2015). there are other reasons which make, in our opinion, the Italian context particularly appealing. Indeed, two peculiar aspects of the Italian economy are respectively represented by the prevalence of Small and Medium firms (SMEs) (Cucculelli and Peruzzi, 2020a; Cisi et al., 2020) and by a concentrated ownership structure, as a large share of firms are owned and managed at the family level (Carletti et al., 2020; Cucculelli and Peruzzi, 2020b).

Moreover, compared to other industrialised economies, Italy has historically exhibited a poor performance in terms of R&D intensity, whose levels did not converge to the ones registered by other OECD countries (Awaworyi Churchill et al. 2020). Further, as shown by Borin et al. (2014), the lower effectiveness and efficiency of the Italian institutional environment significantly constrained its growth, as it halted Foreign Direct Investments (FDI), which have been proven to be a relevant source of technological transfer and productivity. Given these considerations, we therefore believe that assessing the efficiency of the Italian RIS and evaluating how the quality of institutions affects the efficiency with which the RIS converts inputs into output, and hence, knowledge, is of significance importance for both scholars and policy-makers. It must be also clarified that the definition of institutional quality employed in this paper follows the approach of Kaufmann et al. (2010). Accordingly, governance is broadly defined as “the traditions and institutions by which authority in a country is exercised. This includes (a) the process by which governments are selected, monitored and replaced; (b) the capacity of the government to effectively formulate and implement sound policies; and (c) the respect of citizens and the state for the institutions that govern economic and social interactions among them.” (Kaufmann et al., 2010 page 4).

For the empirical purposes of the paper, we propose the application of a knowledge production function, estimated through a Stochastic Frontier Analysis (SFA). In particular, although the previous literature on the Italian economy has proven that the quality of institutions has some role in stimulating both Total Factor Productivity (TFP) (Lasagni et al., 2015; Albanese et al., 2020) and patenting activities (D’Ingiullo and Evangelista, 2020), we believe that the SFA methodology, compared to these approaches, offers various different advantages. Specifically, the application of the SFA approach allows us to evaluate the efficiency with which the RIS converts inputs into output and to appraise how far the RIS is relatively to its best practices, summarised by the stochastic frontier (Triebs and Kumbhakar, 2018), something which cannot be directly captured neither by the TFP, nor by the amount of registered patents. Another advantage of the SFA compared to these alternative approaches lies in the fact that, through its application, it is possible to evaluate whether an improvement in the quality of institutions favours a more efficient use of resources, hence moving the RIS closer to its best practices. Following this discussion, this contribution aims at testing the following hypothesis: H1) the evolution of the institutional environment enhances RIS efficiency.

Beyond providing an empirical analysis of the interactions between the quality of institutions and the RIS efficiency, this paper also employs a Fixed Effects estimator to investigate whether the benchmark results differ once a traditional knowledge production function is considered.

We believe, hence, that our contribution is relevant even once we move away from the evolutionary perspective, where the nexus between the quality of institutions and RIS efficiency has received little attention, both theoretically and empirically.

Indeed, the theoretical literature has been mostly concerned with the impact of institutional quality upon growth and innovation.

Specifically, Rivera-Batiz (2002) proposes an endogenous growth model aimed at assessing the relationship between democracy, governance and innovation. The main theoretical prediction of this model is that improved democracy promotes innovation, provided that such an improvement is accompanied by a corresponding strengthening of governance. The latter, in particular, discourages corruption among public officials and makes innovative projects relatively more profitable, hence stimulating economic growth.

Tebaldi and Elmslie (2008), on the other hand, developed a theoretical setting to investigate the relationships between institutional quality and long-run growth. To theoretically evaluate the impact of institutions upon economic growth, they propose a composite index of institutional quality which includes factors like control of corruption, pro-market regulations, the perceived effectiveness of the judiciary system and of the public administration and, finally, contract and property rights enforcement. According to their findings, countries with weak institutions exhibit low rates of technological change and output growth, a result which is essentially driven by the fact that countries with weak institutions generate an inefficient allocation of the available human capital in the economy.

Aghion et al. (2016), within a Schumpeterian growth model,Footnote 1 assess the relationship between taxation, corruption and innovation. They show that increased corruption decreases the optimal tax revenues which, in turn, diminishes the investment in public infrastructures, hence discouraging innovation. At the same time, they show that increased corruption reduces the entrance in the market of potential innovative entrepreneurs.

From an empirical perspective, though an abundant literature has investigated the impact of the quality of institutions upon the innovative performance of countries and firms, see for instance Giménez and Sanaú (2007), Asongu et al. (2018), Cahn et al. (2019) and Clò et al. (2020),Footnote 2 just a limited amount of papers has instead analysed how institutions interact with efficiency.

Examples of this limited literature are, to the best of our knowledge, the contributions of Méon and Weill (2005), Castiglione et al. (2018) and Aldieri et al. (2020), which show unanimous consensus in ascertaining that increased quality of institutions is associated with higher efficiency.

In particular, Méon and Weill (2005), identify three main channels through which bad governance is translated into lower efficiency.

These channels can be summarised as follows: a) bad governance acts like a tax on innovators and diverts resources from productive activities. This, in turn, has a detrimental effect on both the accumulation of productive resources and the efficiency with which they are employed. From this perspective, typical examples are corruption and political instability. In particular, while corruption discourages productive activities as it acts like a tax on innovators that reduces the expected gains from investments, increased political instability heightens the risks related to productive economic activities, hence preventing them; b) bad governance reduces the incentives to invest in innovation. In particular, a weak rule of law forces innovative entrepreneurs to invest in property right protection, while an inefficient regulatory distorts entrepreneurs’ incentives, as they might find more profitable to take advantage of weak property rights and engage litigation, rather than innovating. An ineffective government, on the other hand, is assumed to stimulate rent-seeking rather than productive activities; c) bad governance hampers the diffusion of knowledge as it promotes the accumulation of generic rather than specific capital and discourages foreign direct investments.

To anticipate our findings, the empirical evidence reported in this paper indicates that most of the institutional factors considered in the analysis play no role in affecting RIS efficiency, as we find a positive and highly significant impact for government effectiveness only, which drives the positive effect detected for one of the two composite indices of institutional quality employed in the paper. This result is robust to different assumptions about the underlying technology, to alternative lag structures between R&D and patenting activities and to the application of different R&D inputs. Moreover, this evidence is confirmed once instead of the RIS efficiency, we appraise the impact of institutional quality on the amount of registered patents, through the application of a canonical knowledge production function.Footnote 3

Two main policy prescriptions can be drawn from the empirical analysis performed in this paper. On the one hand, measures that improve and strengthen the endowment of regional socio-economic structures are highly desirable, as they enhance both RIS efficiency and patenting activities. Further, interventions for Southern regions should be designed to reduce the technological and efficiency gap with the core regions of the country.

The paper is organised as follows: Section 2 introduces the methodology used to estimate RIS efficiency. Section 3 describes the data, the production set and the empirical specifications proposed in the paper. The empirical evidence is described in Section 4, while Section 5 provides a sensitivity analysis. Section 6 concludes and discusses the main policy implications.

2 Measuring the Regional Innovation System (RIS) efficiency

Following Fritsch and Slavtchev (2011) and Barra and Zotti (2018), we propose to measure RIS efficiency through the concept of technical efficiency, as introduced by Farrell (1957), in which a given unit is technically efficient if it is able to produce the possible maximum output from a given amount of input. To empirically deal with the relationship between inputs and outputs of innovation, a KPF,Footnote 4 based on a Cobb–Douglas production function formulation (see Griliches, 1979 and Jaffe, 1989), is employed.

Measuring the performances of given economic unit crucially depends on the possibility that the unit under scrutiny (i.e. regions) is characterised by inefficiencies in production. It turns out that the econometric specification of production and cost functions must take into account the possibility that a given unit is likely to produce using its inputs in a sub-optimal way. One possible approach to incorporate inefficiency into the estimation of a production function is the application of a Stochastic Frontier Analysis (SFA), proposed by Aigner et al. (1977) and Meeusen and Van den Broeck (1977). Empirically, this method assumes that the error term is divided into two components with different distributions (see Kumbhakar and Lovell (2000) for analytical details on stochastic frontier analysis). The first component, the “inefficiency”, is asymmetrically distributed (typically as a semi-normal), while the second, i.e. the “error”, is distributed as a white noise and both these components are assumed to be uncorrelated to avoid distortions in the estimates. Though some contributions, see for instance Broekel (2015), have emphasised that non-parametric approaches offer a number of advantages and avoid critical assumptions (such as homogeneous elasticities), the literature has identified various drawbacks related to the application of this approach and such to justify the empirical methodology employed in this paper. As notably remarked by Méon and Weill (2005) and by Castiglione et al. (2018), two-stages approaches, such as Data Envelopment Analysis (DEA), are inconsistent. Specifically, these procedures are based on the computation of efficiency scores in the first stage and on the regression of these scores on some exogenous variables in the second stage. These methods are therefore inconsistent as they assume that the inefficiency component, in the first stage, is independently and identically distributed (i.i.d.), but this assumption is then violated in the second stage regression. Castiglione et al. (2018) further highlight that another advantage of the SFA, compared to the DEA, is that the it allows to deal with the presence of random shocks which are not controlled by the producer but that can affect the production output. Moreover, various contributions in the literature, see for instance Kneip et al. (1998), Park et al. (1998) and Gijbels et al. (1999), have emphasised that non-parametric approaches lead to bias estimates of the parameters of interest in presence of small samples, as it is the case of the current contribution. Finally, as shown by Koop et al. (1999), Simar (2003) and Simar and Zelenyuk (2011), the performance of non-parametric approaches is particularly poor in the presence of outliers, while the assumptions concerning the distribution of the error term in the parametric approach make it less sensitive to the presence of extreme valuesFootnote 5 (Pereira de Souza et al., 2010). Formally,Footnote 6 taking logarithms, the KPF is described as follows:

$${y}_{it}={\alpha }_{i}+f\left({{\varvec{x}}}_{it},{\varvec{\beta}}\right)\mathrm{exp}\{{v}_{it}-{u}_{it}\}$$
(1)

where \({\alpha }_{i}\) allows for regional-specific time-invariant unobserved heterogeneity, being disentangles from time varying inefficiency (Belotti et al., 2013; Greene 2005), \({y}_{it}\) is the output of region \(i\) at time t; \({{\varvec{x}}}_{it}\) is a vector of input quantities of region \(i\) at time \(t\) (see Section 3.1 for more details on the input–output framework used in the production set); \({\varvec{\beta}}\) is a vector of unknown parameters; \(f\left({{\varvec{x}}}_{it},{\varvec{\beta}}\right)\) is the production function or conventional regression model; \({v}_{it}\) is a vector of random variables related to the idiosyncratic or stochastic error term of region \(i\) at time \(t\), assumed to be independently and identically distributed (i.i.d.) as \(N(0,{\sigma }_{v}^{2})\) and independent of the \({u}_{it}\), while \({u}_{it}\) is a vector of non-negative random variables measuring the inefficiency term of region \(i\) at time \(t\) and assumed to be independently but not identically distributed. They are obtained from the truncation to zero of the distribution \(N({m}_{it},{\sigma }_{u}^{2})\), where \({m}_{it}=\mu +{z}_{it}\delta\), where \(\mu\) denotes the location parameter, \({z}_{it}\) is a vector of determinants of (technical) inefficiency of region \(i\) at time \(t\) (see Section 3.2. for more details on the variables used in the inefficiency component), and \(\delta\) is a vector of unknown coefficients. In sum, as notably argued by Liu and Myers (2009), the SFA model assumes that \({u}_{it}\) and \({v}_{it}\) are independent of each other and that \({u}_{it}\) is independent of the input vector, \({{\varvec{x}}}_{it},\) conditional on \({z}_{it}.\) Furthermore, the assumption of independence between the stochastic term (v) and the inefficiency term (u) prevents that unobserved factors could have an adverse effect on the territorial determinants and then on the components of institutional quality. If this assumption were relaxed, the effect of the unobserved factors could strongly affect the performance (i.e. efficiency scores) and therefore the ranking (in terms of performance) of the regions. Same goes for u, where we assume that inefficiency of region i does not depend on the inefficiency of region j, with i ≠ j. This leads us to assume that the inefficiency component should not be identical between regions, while as regards the unobserved factors (v), we assume that the factors of region i do not influence the factors of region j, but that they are distributed identically.

In addition, time dummies have been included in the model to capture exogenous factors that might affect the production set and to provide a measure of technological change; time trend and macro-area dummies have also been included in the inefficiency component to capture how time and geographical areas affect inefficiency.

We assume that all the regions are initially endowed with the same level of technology but that this technology varies over time. For this reason, we do not consider a single time trend but a set of time-dummies.

All the parameters in Eq. (1) and efficiency scores are estimated through a maximum likelihood estimator (MLE) using the STATA 14 software. Following Kalirajan and Shand (1999), we estimate the technical efficiency assuming that the output elasticity associated to any input (i.e. \({\varvec{\beta}}\)) is identical for all Italian regions (\(i\)=1,.., 20). In other words, the produced output may fall systematically below the maximum, not because of lower output elasticities of the factors of production, but because of a lower level of the function.

3 Data, production set and model specification

3.1 The production set

The empirical analysis proposed in this paper is developed by combining data collected from the Italian National Institute of Statistics (ISTAT)Footnote 7 with information concerning the quality of regional institutions drawn from the Institutional Quality Index (IQI) dataset developed by Nifo and Vecchione (2014), over a 8 years-period (from 2004 to 2011).Footnote 8 The estimated technology is specified using six inputs (both regarding the R&D expenditures and the number of R&D employees in different sectors) and one output (number of disclosed regional patent applications in the years 2004–2011). See Table 1 below for more details on the model specification.Footnote 9 Specifically, the first set of inputs consists of the amount of R&D expenditures in the public sector (RD_EXP_PUBL), in the higher education institutions (RD_EXP_HEI) and in the private sector (RD_EXP_PRIV).Footnote 10 See Bottazzi and Peri (2003), Fritsch and Franke (2004), Buesa et al. (2010) and Tavassoli and Carbonara (2014), for the use of similar innovation inputs. The second set of inputs, instead, is based on the amount of R&D employees in the public sector (RD_EMPL_PUBL), in the higher education institutions (RD_EMPL_HEI) and in the private sector (RD_EMPL_PRIV).Footnote 11 See Fritsch and Slavtchev (2011) and Buesa et al. (2010) for the use of this type of innovation inputs. As highlighted by Buesa et al. (2010, page 724), the choice of these inputs depends on the conclusion that “innovatory outputs depend in the first place on the effort made in allocating resources, regardless of whether the latter is measured via expenditures or staff employed in R&D”. For this reason, the two set of inputs are alternatively used in the knowledge production function to explore potential differences determined by the way in which R&D is measured (i.e. R&D expenditures or employees).

Table 1 Specification of inputs, outputs and exogenous factors in SFA models

With respect to the output, we propose to adopt, as a measure of innovation, a standard variable, represented by the number of disclosed regional patent applications registered at the European Patent Office (PAT) over the 2004–2011 period. In particular, patents are territorially assigned on the basis of the postal code of the inventor’ s residence. If the invention is the work of several authors, the patent is divided equally between all of them and therefore between their respective places of residence. For this reason, the reported data contain decimal values. Although the application of the number of patents as a measure of innovation outputFootnote 12 is not immune from limitations, they can be used as a good approximation for innovative ideas (see Bottazzi and Peri, 2003; Buesa et al. 2010; Fritsch and Slavtchev, 2011; Tavassoli and Carbonara 2014). Following Fritsch and Slavtchev (2011), we assume that a certain amount of time is required before R&D activities will result in a patent. Several months (usually from 12 to 18) are therefore needed for patents’ applications to be publishedFootnote 13 and a time lag between innovation inputs and the output of at least one or two years should be assumed.Footnote 14 The time lag between R&D inputs and patent applications also depends on the reliability of the data and, to deal with this issue, different solutions have been exploited in the literature. Acs et al. (2002) report that innovation records result from inventions made 4 years before. Fritsch and Slavtchev (2007) propose a time lag of three years between patent applications and innovation input, while Fischer and Varga (2003) relied on a two-year lag. Fritsch and Slavtchev (2011), on the other hand, reduced the time lag to a period of one year. Given both the data constraints and the main literature and also in order to take into account the time required to transform competences into concrete innovation, as well as innovation into patents, we follow Fischer and Varga (2003) and assume a time lag of two years between innovation inputs and outputs. Data for Italian regional governance quality are taken from Nifo and Vecchione (2014). Taking inspiration upon the World Governance Indicator (WGI) developed by Kauffman et al. (2010), they provide information for six different measures of regional institutional quality. Specifically, beyond considering five elementary dimensions of institutional quality, namely corruption, government effectiveness, regulatory quality, rule of law, voice and accountability, they further exploit these five dimensions of governance to construct an overall index of institutional quality.Footnote 15 Beyond the application of the full set of indicators developed by Nifo and Vecchione (2014), we propose an additional aggregate index of institutional quality, obtained through the application of the Principal Component Analysis (PCA) to the afore-mentioned five elementary dimensions. This procedure seems to be suitable for our purposes, as the institutional quality indicators employed in the paper exhibit strong linear correlations.Footnote 16 According to Ariu et al. (2016), if the indices of governance exhibit strong linear correlations, it is then hard to empirically quantify their impact. Accordingly, the application of the PCA has the advantage to shrink the dimension of the governance indicators and to deal with the existing variability in the data. It is worth noting that in the literature that has investigated the impact of institutional quality upon efficiency, while Méon and Weill (2005) and Aldieri et al. (2020) have taken advantage of the full set of governance quality indicators identified by Kauffman et al. (2010), hence relying on a set of governance indices which is close to the one employed in this paper, the empirical analysis of Castiglione et al. (2018) only exploits the composite institutional quality indicator of Nifo and Vecchione (2014) and not just explores the potential contribution of each individual measure of institutional quality upon efficiency. All inputs and the output variables are in log-levels so that the overall positive skewness of the variables is reduced.

Descriptive statistics disaggregated by the four main Italian regional macro-areas are reported in Table (2). What clearly emerges is the large dual structure of the Italian economy, which is visible not only considering the production function parameters, but also the variables related to the determinants of efficiency and the quality of institutions. Indeed, the descriptive statistics reveal that North-Western regions show a higher degree of economy activity, as signaled by the level of patents and R&D expenditures, while Southern regionsFootnote 17 are characterised by a lower degree of R&D intensity, implying that regions in the North are the most innovative in the country. A similar picture emerges when considering the descriptive statistics related to the efficiency components, with North-Western regions which outperform all the others. Once again, the Southern part of the country registers the worst performance, displaying statistics which are indicative of an economy that did not achieve high levels of socio-economic development and a catching-up process with the most developed macro-areas of the country.Footnote 18 The same holds once we focus on the quality of regional institutions, with the Southern regions that exhibit weaker governance relatively to Center and Norther regions.

Table 2 Descriptive statistics by macro areas

3.2 Determinants of RIS efficiency

In a stochastic frontier environment characterised by heterogeneity, omission of relevant exogenous factors may determine bias estimates of the production frontier parameters and of the level of technical efficiency, therefore leading to poor policy conclusions (e.g. Caudill and Ford, 1993; Caudill et al., 1995; Hadri 1999; Wang 2003). Differences in the economic environment, indeed, might have a relevant impact upon RIS efficiency. To empirically deal with the issues determined by the omission of relevant exogenous factors, various control variables could be used. These variables are considered exogenous in the sense that they influence the production process but are not themselves either inputs or outputs. They, in fact, influence the efficiency with which inputs are converted into outputs. Allowing efficiency to depend on regional environmental characteristics enables researchers to examine the determinants of efficiency, and to suggest policy interventions to enhance it. In other words, the basic assumption of the model is that institutional and environmental factors influence the degree of technical efficiency. For this reason, the innovation production function must include these variables, as they directly influence the efficiency component. We therefore apply a single stage approach (see, for example Battese and Coelli, 1995) where environmental factors are assumed to directly affect technical efficiency. This procedure, indeed, is preferred to a two-stage approach since the latter may determine inconsistent estimates of the main parameters of interest (Kumbhakar and Lovell, 2000).

Following the empirical procedure proposed by Barra and Zotti (2018), we explore whether the institutional factors, related to quality of government, affect RIS efficiency. We then introduce seven different indicators related to the quality of institutions. Specifically, beyond the five elementary dimensions of government quality identified by Nifo and Vecchione (2014) and their composite measure of institutional quality, we propose an additional composite indicator, computed by applying the Principal Component Analysis (PCA) to the five elementary dimensions discussed below. Accordingly, Nifo and Vecchione (2014), for the Italian context, identify the following five elementary dimensions of institutional quality, namely: a) Corruption (CORR): “summarizes data on a crimes committed against the Public Administration (PA), the number of local administrations overruled by the federal authorities and the Golden-Picci Index, measuring the corruption level on the basis of “the difference between the amounts of physically existing public infrastructure (…) and the amounts of money cumulatively allocated by government to create these public works” (Golden and Picci, 2005, p. 37)” (Nifo and Vecchione, 2014 page 1633); b) Government effectiveness (GOV): “measures the endowment of social and economic structures in Italian provinces and the administrative capacity of provincial and regional governments in relation to policies concerning health, waste management and the environment” (Nifo and Vecchione, 2014 page 1633); c) Rule of law (RULE_LAW): “summarizes data on crime against persons or property, on magistrate productivity, trial times, the degree of tax evasion and the shadow economy” (Nifo and Vecchione, 2014 page 1633); d) Regulatory quality (REG): “comprises information concerning the degree of openness of the economy, business environment and, hence, the ability of local administrators to promote and protect business activity” (Nifo and Vecchione, 2014 page 1633); finally, e) Voice and accountability (VO_ACC): “captures the participation in public elections, the phenomenon of associations, the number of social cooperatives and cultural liveliness measured in terms of books published and purchased in bookshops” (Nifo and Vecchione, 2014 page 1633). Nifo and Vecchione (2014) then combine these five elementary dimensions of governance to obtain an overall measure of institutional quality, namely the Institutional Quality Index (IQI). The IQI is the outcome of 3 different stages, respectively represented by normalisation, weight assignment and aggregation. Specifically, the normalisation process, based on the distance from the ideal point, serves to transform each elementary dimension in an indicator bounded in the [0,1] interval,Footnote 19 while the weighting scheme is based on the analytic hierarchy process (AHP) developed by Saaty (1980, 1992). The last stage instead consists in the aggregation of the elementary dimensions of institutional quality to obtain the IQI index. Nevertheless, beyond the six indices of institutional quality provided by Nifo and Vecchione (2014), we propose an alternative aggregation of the five elementary dimensions, based on the application of the Principal Component Analysis (PCA). As stated before, the PCA allows to shrink the dimensions of governance and to better deal with the existing variability in the data.

Finally, we include in the inefficiency componentFootnote 20 (see Section 2 for more analytical details) the following variables: knowledge spillovers (labelled as KS1 and KS2, respectively), population density (PD); export (EP); unemployment rate (UR); employment in services (SERV) and in industry (IND) sectors.

Relatively to the role of knowledge spillovers, which are expected to be positively correlated with the RIS efficiency, their relevance has been remarked by various contributions in the literature (see for instance Bode, 2004; Cabrer-Borrás and Serrano-Domingo, 2007; Aldieri et al., 2018; Agasisti et al., 2019). As notably emphasised by Bode (2004), the higher is the geographical proximity among regions, the higher is the intensity of the knowledge transferred from one region to another. The measurement of knowledge spillovers requires, however, some considerations. Indeed, a popular approach adopted in the literature to capture knowledge spillovers consists in the application of patent citations, see for instance Jaffe et al. (1993), Jaffe et al. (2000) and Thompson and Fox-Kean (2005). Other contributions, like Acs et al. (2002) and Grafström (2018), have instead proposed the application of the amount of patents, as the latter represent a good proxy of knowledge (Acs et al., 2002), though they warn that the application this measure must be taken with caution. Other possibilities, however, have been exploited in the literature and other variables have been employed to capture knowledge spillovers. These alternative approaches consider, among others, the amount of R&D expenditures (Fritsch, 2000; Griliches, 2007; Autant-Bernard and LeSage, 2011), and the amount of R&D employees (Bloch, 2013; Zemstov and Kotsemir, 2019). At the same time, some contributions have emphasised that knowledge spillovers mostly occur in the private sector (Greunz, 2003; Wenner et al., 2011; Autant-Bernard and LeSage, 2011). For this reason, we measure knowledge spillovers by alternatively considering the amount of R&D expenditures in the private sector (KS1) and the amount of R&D employees in the private sectorFootnote 21 (KS2), depending on whether the inputs of the model consist of R&D expenditures or R&D employees. It must be also specified that knowledge spillovers, in line with other contributions in the literature, see for instance Fritsch and Franke (2004) and Zemstov and Kotseminr (2019), have been treated as an explanatory variable of the RIS efficiency. In line with other studies in the literature devoted to assess the role of different types of spillovers, (see, among others, Pacheco and Tyrrell, 2002; Moreno et al., 2005; Kelejian et al., 2013; George and Sandler, 2018; Agasisti et al., 2019), we rely on the application of an inverse distance weighted matrix.Footnote 22 More specifically, the amount of R&D expenditures and R&D employees in the private sector have been respectively weighted by a spatial matrix, W, with the implication that the interaction between the two different measures of R&D and the spatial matrix allows us to capture the effects of knowledge spillovers of region k on its neighbour i.Footnote 23

With respect to the other determinants of RIS efficiency, population density (PD), measured as the number of inhabitants in the region by squared kilometres, aims at measuring the effects of urbanisation. High population density should boost RIS efficiency as it provides opportunity for intensive contacts and cooperation (for a similar view, see Feldman, 2000 and Fritsch, 2000). The unemployment rate (UR), measured as the number of people actively looking for a job as a percentage of the labour force, is intended to capture, in line with Srholec (2010), the impact of structural factors and socio-economic conditions on RIS efficiency and to appraise, in line with Crescenzi and Rodríguez-Pose (2013) and Hajek et al. (2014), how the lack of labour demand and the lack of adequate skills in the local markets affect the RIS efficiency. Additional channels, however, might operate. In particular, as emphasised by Rodríguez-Pose and Di Cataldo (2015), the unemployment rate reflects the structural rigidities of the local labour markets which, in turn, halt the generation of knowledge. Cunningham et al. (2019), on the other hand, emphasise that dynamic and flexible labour markets represent a fundamental ingredient for innovation and technology transfer to take place.Footnote 24 Although this literature points out to an inverse relationship between the unemployment rate and innovative activities, the expected sign of this variable is a priori ambiguous. Indeed, while on the one hand it can be argued that regions with high socio-economic standards should exhibit more efficient RISs, as they are better able to accumulate and retain human capital, hence favouring the generation of knowledge and to easily absorb the knowledge created elsewhere, on the other it might be argued that higher unemployment could exert a positive effect on innovation (Crescenzi and Rodríguez-Pose, 2013), as it might stimulate R&D expenditures (Ceh and Gatrell, 2006) or encourage entrepreneurship (Lee et al., 2004),Footnote 25 hence implying that unemployment and RIS efficiency could be directly correlated. Taking into account regional differences in the industry structure is crucial since patenting propensity differs across industries, see for instance Griliches (1990) and Broekel et al. (2018). In order to control for the impact of regional specialisation in certain industries, following Bottazzi and Peri (2013), we use two variables, namely the percentages of employment in services (SERV) and in industry (IND) sectors. Specifically, SERV and IND are measured, respectively, as the number of employees in services and industry sectors over the number of total employees in each region. A priori, we expect these variables to be directly correlated with RIS efficiency. In particular, the assumption behind the expected sign of these variables is that that the higher is the degree of regional specialisation in the industrial and in the service sector, the higher is the RIS efficiency. We further use a variable aimed at assessing the impact of export (EP), measured as the value of exports as a percentage of the Gross Domestic Product, on RIS efficiency. As higher export stimulates patent applications (Salomon and Shaver 2005), R&D investments (Baldwin and Gu, 2004; Yang, 2018) and the introduction of new products and technologies (Gorodnichenko et al., 2010), a positive sign is expected for this variable as well, as it seems plausible to assume that the higher is the propensity of the RIS towards export activities, the higher is its efficiency. Finally, a time trend to control for exogenous effects is included in our preferred specifications. See Tables 3 for the definition and the expected sign of these variables.

Table 3 Definition of the variables and expected sign

4 Empirical evidence

4.1 Investigating the endogeneity issues using an IV approach: A preliminary check

Before discussing the evidence of the SFA approach, the issue of endogeneity requires some considerations and needs to be addressed. Indeed, the evidence reported in this paper might be confounded by the presence of endogeneity, which might be driven by reverse causality among the main variables employed the paper. It turns out that that ruling out the possibility of endogeneity is thus necessary not only to appraise the feasibility of the SFA approach, but also to avoid that the estimates might be biased, hence leading to poor policy conclusions.

To appraise the issue of endogeneity, the analysis reported in this section closely mirrors the one performed Haschka et al. (2020). In their contribution aimed at assessing the relationship between health care infrastructure and health outcome, Haschka et al. (2020) propose, prior to the SFA, to appraise the existence of reverse causality by relying on an approach based on the Hausman test (Hausman, 1978). Specifically, to assess whether the SFA approach is suitable or not for their analytical purposes, they propose to estimate their production technology using the Ordinary Least Squares (OLS) and two alternative Instrumental Variable (IV) models. In particular, in the two IV models Haschka et al. (2020) treat their inputs as endogenous and propose to instrument them using the lagged first differences and through the application of heteroscedasticity-based instruments.Footnote 26 For the purposes of our paper, we therefore follow the latter approach and propose a set of specifications, with both one and two-year lag between inputs and output, where the set of inputs of our technology are assumed to be endogenous.

The evidence reported in Table 4 indicates that R&D expenditure and employees in higher education and in the private sector positively and significantly affect innovative activities. Among the main controls, while population density exhibits an ambiguous sign, in this environment the unemployment rate is inversely and significantly correlated with the variable of interest. On the other hand, a higher share of employees in the industrial sector seems to be significantly correlated with higher propensity towards patenting activities. Relatively instead to the main diagnostics of the model, as the Hausman test is largely above the conventional 10% threshold, the non-rejection of the null hypothesis suggests that, for the purposes of the paper, endogeneity does not represent a relevant issue and provides favourable evidence for the application of the SFA approach, hence implying that our results are unlikely to be biased by the presence of endogeneity. Moreover, according to the diagnostic statistics reported in Table 4, there is also favourable evidence for the validity of the instruments’ set, as implied by the p-values of the Hansen J-statistic, which are above the 10% level.

Table 4 Estimates for the knowledge production function using IV approach

4.2 RIS Efficiency: The performance of Italian macro-areas

The main contribution of this paper is to shed light on the nexus between institutional quality and RIS efficiency. To achieve this goal, we start providing estimates of the efficiency scores. Specifically, we provide a comparison of the estimated parameters either when institutional factors are included or not in the proposed specifications.

The efficiency scores distributed by macro-area, based on a Cobb–Douglas KPF specification, are presented in Table 5. Results are shown either when the amount of R&D expenditures is used as input (Model A) and when R&D employees represent the relevant input variable (Model B), while the output variable is always the number of disclosed regional patents.Footnote 27 In order to take into account that a certain amount of time is required before R&D activities will turn into a patent, a time lag between innovation inputs and the output of either one and two years is assumed. We report the estimated scores assuming linear homogeneity in inputs and applying robust standard errors in order to control for heteroskedasticity, whereas standard errors clustered at regional and year level. Cluster-adjusted standard errors correct for the possible correlation in innovative performances in the same region over time. The asymptotic approximation for clustered standard errors relies on a large number of clusters (see Donald and Lang, 2007). We have 160 clusters (8 years* 20 regions) which should be enough to deal with this issue. The null hypothesis that there is no heteroskedasticity in the error term has been tested and rejected at 1% significance level, using a Likelihood Ratio Test (LR), giving credit to the use of some exogenous variables, according to which the inefficiency is allowed to change. In other words, the validity of the heteroskedasticity assumption has been confirmed, leading to the significance of the inefficiency term.

Table 5 RIS efficiency by macro areas according to the stochastic frontier approach

The estimated RIS efficiency scores reported in Table 5 reveal, once again, a large dualistic structure within Italian regions. More precisely, the estimated parameters suggest that North-Western and North-Eastern regions outperform the other relevant macro-areas in terms of RIS efficiency. On the other hand, Southern regions exhibit significantly lower RIS efficiency, especially if compared to the performances registered by North-East regions. Moreover, the general picture that emerges from Table 5 is that RIS efficiency is usually higher once institutions are explicitly included in the econometric estimate, a finding which is robust to both the application of different R&D inputs and an alternative lag structure. All in all, the evidence reported in Table 5 indicates that regions with more effective institutional infrastructures exhibit significantly higher RIS efficiency scores.

4.3 RIS efficiency determinants

The discussion concerning the impact of institutional quality upon RIS efficiency begins by considering the benchmark results reported in Table 6, in which we propose a set of econometric estimates where we assume a two-year lag between R&D expenditures and patents and impose linear homogeneity in inputs. The production function parameters suggest that R&D expenditures in the public sector have a weak positive effect, while expenditures in higher education and in the private sector are found to be, as expected, positive and significant, in line with the findings of Barra and Zotti (2018).

Table 6 Estimates for the knowledge production function and for RIS efficiency components according to the stochastic frontier approach

Estimates of the efficiency component reveal that knowledge spillovers exert, in line with the expectations, a positive effect on RIS efficiency, as the parameter of interest is mostly found to be statistically significant. Both population density and export, as expected, increase RIS efficiency. The unemployment rate, in line with the expectations, is found to be detrimental for RIS efficiency, while the shares of employees in both the services and industry enter with an unexpected sign, as they inversely correlate with the variable of interest.

With respect to the impact of institutional quality, evidence reported in Table 6 indicates that government effectiveness has a positive and highly significant impact on RIS efficiency, while other dimensions of the quality of institutions are instead found to be insignificant. Relatively to the two composite indices of institutional quality employed in the paper, while the IQI of Nifo and Vecchione (2014) is positive and significant, our PCA measures plays no role in affecting the variable of interest. The positive effect found for the IQI seems therefore to suggest that the quality of institutions, as a whole, has a positive effect in stimulating RIS efficiency, though the result is driven by government effectiveness.

5 Sensitivity analysis

5.1 Does a different lag of R&D expenditures affect the estimates?

In Table 7 we provide, as initial robustness check, the results of our knowledge production function assuming, as suggested by the main empirical literature, a one-year lag between R&D expenditures and patent applications.

Table 7 Estimates for the knowledge production function and for RIS efficiency components according to the stochastic frontier approach

Differently from the benchmark specifications, the estimates reported in Table 7, beyond assuming an alternative lag structure between the relevant R&D input and the output, also allow-at least partially-for a different specification of the underlying technology. Specifically, though the benchmark assumption of linear homogeneity in inputs is retained (columns B1-B7), we also provide a set of empirical specifications in which this hypothesis is instead relaxed (columns B8-B14). In particular, relaxing the linear homogeneity assumption allows us to deal with the possibility of economics of scale in the underlying technology.

Under the hypothesis of linear homogeneity in input, there is evidence of a positive and highly significant effect of government effectiveness on RIS efficiency and, to a lesser extent, for regulatory quality, as the estimated coefficient is statically significant only at the 10% level. Once the assumption of linear homogeneity is removed, while the weak positive effect of regulatory quality vanishes, the positive and highly significant impact of government effectiveness is confirmed. Moreover, both the specifications seem to suggest that the overall quality of institutions has some positive effect on RIS efficiency, as implied by the estimated coefficients of the IQI, though this effect is again mediated by government effectiveness.

5.2 Does a different measure of innovation inputs affect the estimates?

In Table 8, we assess the impact of institutional quality on RIS efficiency assuming two lags between R&D employees and patents. Specifically, we provide estimates both under the assumption of linear homogeneity in inputs (columns C1-C7) and once we relax this constraint (columns C8-C14). Again, the robustness of our findings is confirmed, hence suggesting that our results are not affected by a different set of inputs and by the underlying technology. Indeed, regardless the assumptions concerning the technology, we detect a positive and highly significant effect of government effectiveness on RIS efficiency, while all other dimensions of institutional quality are again proven to be insignificant. In line with the estimates reported so far, the direct and significant correlation between the IQI and the left-hand-side variable is confirmed, and channeled by government effectiveness (Table 9).

Table 8 Estimates for the knowledge production function and for RIS efficiency components according to the stochastic frontier approach
Table 9 Estimates for the knowledge production function and for RIS efficiency components according to the stochastic frontier approach

We conclude our robustness analysis by proposing an alternative set of empirical specifications, in which we assume a one lag between R&D employees and patents. Specifically, we investigate the impact of institutional quality under the hypothesis of linear homogeneity (columns D1-D7) and once this hypothesis is removed (columns D8-D14). Among the institutional determinants of RIS efficiency, there is evidence of a positive and highly significant effect of government effectiveness on the variable of interest, while no significant role was detected for all other relevant dimensions of the quality of institutions employed in the paper. Again, the overall quality of institutions exhibits a positive and significant impact on the variable of interest, at least when the latter is measured through the composite indicator of Nifo and Vecchione (2014). In line with the estimates reported so far, the positive impact of the IQI on RIS efficiency is again driven by government effectiveness.

6 Summary and conclusions

Over time, an increasingly amount of economic literature, see for instance Cooke et al. (1997, 1998), Lambooy and Boschma (2001), Iammarino (2005), Morgan (2007), Uyarra (2010) and Gunnarson and Wallin (2011), has been devoted to embed regional innovation systems (RISs) within the perspective of evolutionary economics.

These contributions have therefore tried to reconcile regional science with evolutionary economics. Though these papers argue that the evolution of the institutional environment plays a crucial role in affecting the trajectories of regional innovation systems, little is said on how local institutions affect RIS efficiency.

In this paper, using Italian regional data over the 2004–2011 period, we investigate the effect of regional institutions on RIS efficiency, through the application of a knowledge production function estimated within the framework of a Stochastic Frontier Analysis.

The evidence reported in this paper confirms the dual structure of the Italian economy, with Southern regions being relatively less efficient and less innovative compared to Northern and Central regions. Further, the efficiency score parameters suggest RIS efficiency is larger once institutions are embedded in the estimation and that regions with more effective institutional environment exhibit higher RIS efficiency scores.

The empirical analysis conducted through the application of the Stochastic Frontier Analysis, however, points out to weak impact of the quality of institutions on RIS efficiency, as government effectiveness is the only dimension of governance that exhibits a positive and highly significant impact on the variable of interest. There is also evidence according to which the institutional environment, taken in its entirety, exerts a positive effect on RIS efficiency, as shown by the positive and significant coefficients of the composite measure of institutional quality proposed by Nifo and Vecchione (2014), though this result is essentially mediated by government effectiveness.

Our benchmark results are shown to be robust to different assumptions about the underlying technology, to alternative lag structures between R&D and patenting activities and to the application of different R&D inputs. Further, this evidence is confirmed once instead of the RIS efficiency, we appraise the impact of institutional quality on the amount of patents issued, through the application of a canonical knowledge production function, as government effectiveness is the only stand-alone dimension of institutional quality found to stimulate regional patents. Moreover, an evaluation of regional knowledge spillovers indicate that the latter are relatively more important in explaining RIS efficiency, while their role is significantly weakened once the standard KPF is examined. This finding seems to suggest that, at least in the Italian case, regional spillovers do not generate new innovation but, rather, imitation.

The evidence reported in this paper, however, is not immune from limitations and three drawbacks are worth mentioning, respectively represented by the size of the sample and by the application of the application of patents in absolute values. Relatively to the first drawback, it arises from the fact that the analysis conducted in this paper relies upon a limited amount of information, as only a bit more of one hundred observations are available, with the implication that the evidence reported here does not allow us to draw strong conclusions. The second, on the other hand, is instead driven by the lack of information concerning the value/quality of the patents, such as a citation-weighted patent index, which, in turn, implies that the empirical analysis proposed in this paper is based on the assumption that all patents are equal in their value. However, as a large economic literature has shown (see, among others, Trajtenberg, 1990; Hall et al., 2005; Van Roy et al., 2018), patents significantly differ in terms of their importance and value, hence implying that the application of patents’ counts may not adequately capture the degree of innovativeness. Despite these limitations, we believe that our analysis has some relevance in terms of policy implications.

Though, in accordance with Méon and Weill (2005, page 87), we acknowledge that “Improving governance can arguably be deemed too evasive an objective to be really achievable”, the empirical evidence reported in this paper emphasises that government effectiveness plays a key role in enhancing both RIS efficiency and patenting activities. This result therefore suggests that policy measures aimed at enhancing the regional socio-economic endowment represent, in the Italian case, the priority.

More specifically, policies that strengthen physical infrastructures and facilitate the diffusion of ICT technologies would be highly desirable as they would stimulate both RIS efficiency and patents. Consistently with the regional policies identified by Cooke et al. (1997, 1998), the evidence reported here suggests that regional policies can play a key role in affecting the regional development path, through the financing of infrastructures-intended as both telecommunications infrastructures and network links-and, finally, by favouring regional cooperation and learning. At the same time, we believe, in line with Lambooy and Boschma (2001), that these measures, to be effective, must be tailored to the characteristics and the peculiarities of the local contexts.

In our opinion, these policies are relevant even though efficiency is not directly observable by policy-makers. In particular, it seems plausible to assume that measures that improve the quality of regional infrastructures and that promote the diffusion of ICT technologies would create a favourable environment for the RIS to operate efficiently and encourage the creation of knowledge.

Finally, though the main goal of this paper is to investigate the relevance of institutional quality upon RIS efficiency, the significant economic, technological and efficiency gap between Italian regions requires some State-level policy interventions. In particular, as Southern regions lack of some elementary preconditions for the diffusion of knowledge (Capello, 2016), measures aimed at strengthening the education system in these regions and tailored programs for the promotion of entrepreneurship could spur innovation and efficiency in these peripheral areas. Moreover, as shown by Berman et al. (2020), some dynamic centres in the South, albeit less economically developed, took significant advantage of the presence of Multinational Enterprises, exhibiting a better reaching-in performance compared to the core areas of the country. It turns out that specific measures which incentivise and concentrate foreign investments in the South would be highly recommendable as they would reduce the technological and efficiency gap with the most developed regions in the country.

Moreover, we believe that the findings reported in this paper and the associated policy implications might embedded in the European context as well. Indeed, as various contributions in the literature have proven (see, among others, Wanzenböck and Piribauer, 2018, Mulligan et al., 2019; Muscio and Ciffolilli, 2020), regional R&D subsidies from the European Union foster innovative activities and facilitate technological integration, hence suggesting that the findings reported in this paper may be important for other European regions, most notably those of other Southern or Western European countries, such as Ireland, Greece, Spain and Portugal. Indeed, as pointed out in the contribution of Wagner and Zeileis (2019), the conditional convergence hypothesis in the growth of the real GDP per capita does not hold, in the European landscape, only for 13 non-coastal and non-capital regions in the aforementioned countries. This result, in our opinion, seems to suggest that the proposed interventions for the less-developed regions in the Mezzogiorno might be extended to the peripheral areas of these countries, whose observed growth patterns did not converge to the ones observed for the majority of the European regions. In particular, these policies might facilitate not only the technological development of these areas but would also allow them to be embedded in the European context and to reduce the observed gap with the most developed areas.

The last drawback is instead represented by the structure of our data, which does not allow us to assess whether the impact of institutional quality upon RIS efficiency varies across different industries. As emphasised by Griliches (1990), Arundel and Kabla (1998), and Aldieri et al. (2021), industries significantly differ in the share of innovations being patented. Moreover, Malerba (2002) notably argues that the effects of national institutions on sectoral innovation depend on the sector examined. Indeed, given the presence of sectoral-specific factors, changes in the institutional environment might unevenly affect firms, depending on the sector in which they operate. It turns out that future research should assess whether the impact of institutions upon RIS efficiency varies across different industries and sectors. This empirical approach would shed additional light on the institutions-efficiency nexus and would allow to assess how the quality of local institutions shapes the efficiency of regional innovation systems in different industries and technologies, hence leading to more accurate policy implications. A second complementary line of research could instead be represented by an assessment of the role of spillovers of institutional change upon RIS efficiency. Indeed, as various contributions in the literature have proven, see for instance Seldadyo et al. (2010), de Groot (2011), Kelejian et al. (2013), Krammer (2015) and Grechyna (2018), institutions are spatially correlated, hence implying that improvements in the quality of institutions in a given country/region exert a positive effect on the governance of neighbour countries/regions. It would be relevant, in our opinion, to assess whether these types of spillovers exist in the Italian case and whether an improvement in the quality of institutions in a region is translated into an increase in both the quality of government and in the RIS efficiency of its neighbours.