1 Introduction

During the last two decades, vast amounts of empirical studies have analysed the relationship between firm characteristics and export activity. The different approaches in the empirical literature reflect the evolution of theoretical models that have gradually improved the explanation of the complex interactions between firm heterogeneity and participation in international markets.

One of the issues that has attracted increasing attention is the relationship between firms’ innovative capacity and export activity. As suggested by Melitz and Redding (2014), comprehension of such relationships not only helps explain some empirical regularities observed in trade data (see, among others, Love & Roper, 2015) but also plays a major role in theoretical models linking growth to trade (e.g. Aghion et al. 2018). Based on the literature, the analysis of the export-innovation relationship must consider that productivity-enhancing investments, such as R&D expenditures, are endogenous decisions jointly made with trade participation. In these models, innovation investments can induce productivity improvements that allow firms to afford the costs associated with exporting activity and enable firms to achieve a greater ability to meet international market demand, thus making exporting more profitable. Moreover, participation into international markets can further stimulate innovative activity by inducing higher competitive pressure on firms and favouring technological transfer from destination markets.

This study aims to shed additional light on the effects of firms’ innovative effort on their export intensity (‘intensive margin’ measured as the ratio of firm’s export sales on total sales) by taking into account endogeneity issues and controlling for other firm characteristics that might favour the exporting activity. Particularly, we focus on firms’ export intensity as a continuous measure of exporting activity, allowing us to apply a quantile analysis approach.

We believe that such an issue deserves priority given the role played by exporters in the economy. Indeed, empirical literature suggests that firms active in international markets perform better. Moreover, they strengthen over time, become more resilient to negative shocks, and are more capable of keeping up with external demand fluctuations (Bugamelli et al., 2018, among others). For this reason, investigating the factors favouring firms’ exporting performance is crucial to better understanding the complexity of growth processes and countries’ resilience capacity.

The main novelty of this study is that it investigates how the relationship between innovation and exports varies along the conditional distribution of export intensity. Since innovation is more valuable for exporters as it generates additional profits from export sales, a natural question is to investigate whether innovation rents are different along the export intensity distribution. Indeed, empirical evidence on this issue is scant. A similar research question was analysed by Wagner (2006), who applies quantile regression techniques to a panel of German plants, and by Orts and Martì (2018) on a European firms sample. However, these studies do not address the issue of endogeneity of firm innovative effort and exclude non-exporting firms from the analysis, thereby avoiding censoring issues. In our study, we address these methodological issues and apply the conditional quantile instrumental variable (CQIV) estimator proposed by Chernozhukov et al. (2015) to a sample of Italian manufacturing firms contained in the Unicredit-Capitalia survey and observed in the 1992–2003 period. This approach has never been adopted in the literature. Regarding the choice of the instrument, we follow a widely used approach in applied industrial organisations and use the industry value of R&D expenditures in Spain as instruments for the R&D variable.

All estimated models confirm the positive impact of R&D expenditures on export intensity and, more importantly for the purposes of this study, suggest that such an effect displays significant heterogeneity along the export intensity conditional distribution. Particularly, our estimates show that the effects of R&D increase along the conditional distribution of export intensity, reach a maximum at the 70th quantile, and persist at higher quantiles. Such results imply that firms characterised by an export intensity of about 50% or more can take the highest advantages, in terms of further expansion of their sales in international markets, from investing in R&D activity.

The heterogeneous effect of R&D expenditures along the export intensity distribution might be related to an increase in firm competitiveness, spurred by innovations generated by R&D activity and the resulting possibility of entering new foreign markets. Furthermore, we might argue that firms with low export intensity do not devote large resources, particularly human resources, to export activity and/or are passive exporters, so they have lower returns from innovative effort.

This study contributes to the exporters’ heterogeneity trade literature by providing new empirical evidence for the Italian case. Unlike most of the literature focusing on heterogeneity in firms’ observed characteristics, our study highlights another dimension of heterogeneity, namely, the effect of some covariates. If some differentials are observed, we believe such information might be useful for designing policy interventions aimed at favouring exporting activity and productivity-enhancing policies at the micro level.

A possible drawback of this study is related to the available sample period, which is not so recent and does not reflect important events such as the great recession and recent world trade trends. However, the external validity of our analysis rests on the similarities of our sample (considering sectoral and size distributions and the presence of exporting and innovative firms) with the sample of Italian firms included in the more recent EFIGE dataset. Indeed, as discussed in Sect. 5, our results are consistent with those obtained by Orts and Martì (2018) on the EFIGE data.

The remainder of this paper is organised as follows. The next section synthesises the theoretical background and evolution of the empirical literature addressing the relationship between innovation and exports. Section 3 describes the data, and Sect. 4 illustrates our empirical strategy. We discuss the empirical results in Sect. 5 and we provide our concluding remarks in Sect. 6. An Appendix containing additional results concludes the paper.

2 Related literature

The relationship between innovation and exporting activity has traditionally been analysed within the frameworks of trade theory and growth theory. The recent trade literature (surveyed by Melitz & Redding, 2014) considers empirical regularities observed in trade data and proposes theoretical models based on firm heterogeneity in differentiated product markets, able to explain different mechanisms whereby trade liberalization affects the economy, including an endogenous increase of firm productivity. These models explain observed heterogeneity between exporters and non-exporters relative to performance indicators, different patterns of trade participation across firms and destination markets, and within-industry reallocations of resources following trade liberalization.

Within this literature, some models are particularly relevant for our study because they include the impact of the trading environment and trade participation on firms’ decisions relative to productivity-enhancing investments, such as R&D expenditures (e.g. Aw et al., 2011; Bustos, 2011; Lileeva & Trefler, 2010). In such a framework, trade liberalisation can stimulate innovation and adoption of upgraded technology so that firm-level productivity becomes the outcome of several endogenous decisions jointly taken with trade participation. In these models, trade liberalisation has different impacts on firms; most productive firms will decide to both innovate and export, lower productive firms only export, firms with lower productivity neither innovate nor export, and the least productive firms exit. More recently, Bonfiglioli et al. (2018) developed and empirically validated on US firm-level data, a theoretical model that includes fixed and variable export costs; firm heterogeneity derives from technological choices; particularly, firms can select the size of their investment in projects of unknown quality, thus affecting the variance of the probability distribution from which firms’ productivity is drawn. The model reveals that only the most productive firms export and that export opportunities raise the return to technological heterogeneity by reallocating profits in favour of the most productive firms, so that firms have higher incentives to invest in more ambitious innovation projects (with more dispersed outcomes).

The relationship between innovation and exports is also related to the endogenous growth literature linking trade and growth. A recent survey by Grossman and Helpman (2015) describes the main channels relating trade and growth, namely, knowledge spillovers, market dimension and competitive intensity, relative prices, and technology dissemination.

In such models, trade liberalisation generates an expansion of market size, thus increasing innovation incentives (market size effect). However, this effect can be counterbalanced by a competition effect as exporters are subject to a more intense competition that might reduce incentives to innovate by lowering expected innovation rents. Indeed, participation in international trade might stimulate innovation through different channels: enlargement of relevant markets that entail productivity improvements, requirement of catching the international technological frontier, exploitation of scale economies allowing firms to sustain fixed costs associated with R&D and innovative activities, and technological transfer from destination markets (learning-by-exporting effect).Footnote 1 Particularly, Aghion et al. (2018) suggests that positive demand shock for exporters can induce higher innovation especially for firms closer to the frontier and that it may reduce innovation for firms far from the frontier. Authors test the predictions of their model on a sample of French firm-level data observed over the period 1994–2012.

Impullitti and Licandro (2018) develop a theoretical model with heterogeneous firms operating in an oligopolistic market and innovation-driven productivity growth. They consider three potential sources of gain from trade. First, through higher product market competition, the pro-competitive effect of trade leads to lower markups in the domestic market. Second, the selection effect of trade forces the least productive firms out of the market and prompts firms to enter the export market. Both the pro-competitive and selection effects reallocate resources to most productive firms, thereby raising the average aggregate level of productivity and lowering equilibrium prices. Moreover, both these effects stimulate firms to innovate, thereby increasing productivity growth rate. The model identifies innovation as a key driver of dynamic gains from trade. Using firm-level and aggregate data on the US economy, they calibrate and numerically solve the model and demonstrate how the interaction between selection and innovation-driven growth can generate larger welfare gains from trade.

From an empirical point of view, firms’ innovative ability has been included among those factors that explain observed heterogeneity in firms’ participation in international markets, others being productivity, size, and age. We refer to Bernard et al. (2012) for a survey of the related empirical evidence.Footnote 2

The international evidence supports the ‘self-selection hypothesis’, that is, a positive causal effect of firms’ innovative capacity on exporting activity, both on the extensive (i.e. exporting probability) and intensive (i.e. export intensity) margins. These findings are robust to the adoption of different innovation activity measures, as suggested in the literature. R&D expenditures or R&D employees, the share of workers with technical and scientific backgrounds, or the presence of joint R&D projects with external partners (Braymen et al., 2011) are usually employed to measure innovation inputs. Innovation output indicators include the number of product and/or process innovations and the number of patents (Cassiman & Golovko, 2011; Fernández-Mesa & Alegre, 2015; Monreal-Pérez et al., 2012). A positive impact of innovation on exporting margins has also been found on Italian data (e.g. Altomonte et al., 2013; Benfratello & Razzolini, 2008; D’Angelo, 2012; Dosi et al., 2015; Frazzoni et al., 2011).Footnote 3 Recently, Brancati et al. (2018), using data on Italian manufacturing and services sectors observed over the period 2007–2013 (MET dataset), analysed the determinants of firms’ exporting activity. The authors found that firms’ innovative efforts, as measured by product innovation and R&D expenditures, past trade experience in foreign markets, and productivity, are the main determinants of firms’ decision to export. However, only past trade experience seems to have significant impact on export intensity. Slightly different results are provided by Mancusi et al. (2018), who empirically confirm the self-selection hypothesis for a sample of Italian firms observed over the period 2004–2009 (Unicredit-Capitalia and EFIGE datasets). Particularly, the authors demonstrate that product innovation is one of the main factors determining both extensive and intensive export margins.

From the analysis of the applied literature that investigates firms’ exporting probability determinants, some empirical regularities emerge: firms exhibiting higher productivity and a higher propensity to innovate are more likely to become exporters.

Overall findings on the heterogeneity of (potential) exporters with respect to non-exporters are robust to using different samples and econometric techniques. Most studies demonstrate that size has a nonlinear effect on export probability and that credit constraints do matter for internationalisation choices. Evidence on other control variables commonly included in empirical models, such as firm age, ownership structure, labour cost, workforce composition, and geographical location, is less conclusive and not sufficiently robust. Overall, empirical findings on Italian manufacturing firms are consistent with those obtained in international literature. Furthermore, the applied literature analysing firms’ export intensity determinants has univocally confirmed that factors affecting the extensive margin are key in determining the intensive margin.

A related branch of the empirical literature investigates the impact of the exporting activity on innovative capacity, i.e. the so-called learning-by-exporting hypothesis predicting that firms can improve their performance by participating in international markets. Particularly, some studies have identified a positive effect of exporting activity on introducing product innovations (e.g. Aghion et al., 2018; Damijan et al., 2010; Van Beveren & Vandenbussche, 2010). Other studies show that firms increase their R&D activities and upgrade their technologies when they compete in international markets (e.g. Baum et al., 2016; Criscuolo et al., 2010; Wagner, 2012; Zhou & Song, 2016).

Turning to empirical evidence for Italy, Bratti and Felice (2012) test whether exporting activity improves firm innovativeness on the VIII and IX waves of the Unicredit-Capitalia database. They find that firm internationalisation seems to boost the probability of introducing product innovations after controlling for reverse causality issues. More recently, Fassio (2018) focused on analysing the learning-by-exporting mechanism in a sample of European firms (EFIGE). Stressing the role played by export destinations, the author finds that a higher level of technological knowledge in export destinations causes exporting firms to introduce product innovations (the technological learning effect) while a foreign higher demand growth mainly induces exporters to develop process innovations (the foreign demand effect).Footnote 4

Overall, the results stemming from the surveyed literature suggest that exporting firms with higher productivity levels are more likely to introduce product innovations and produce high-quality products. Such results have been obtained after accounting for the effect of a wide range of firm- and industry-level control variables such as size, workforce composition, ownership structure, age, affiliation to a group, geographical location, sector, and destination country characteristics.

In summary, the empirical literature has highlighted how firms’ innovative capacity is closely related to firm performance in foreign markets. Indeed, innovation is more beneficial for exporters because it will generate additional profits from export sales. However, a dimension that remains almost unexplored is whether the relationship between innovative capacity and export activity is heterogeneous according to different levels of involvement in international activity.

To the best of our knowledge, only two studies have analysed this issue. Wagner (2006), using a sample of German exporting plants, estimates a model wherein export intensity depends on size, its square, the subsidiary branch plant status of the establishment, a dummy variable that identifies whether a firm operates in the crafts sector, the workforce composition, three dummies for different classes of R&D intensity, and a binary indicator for patent registration. The author finds that, for medium levels of R&D intensity (between 3.5% and 8.5%), the impact of R&D is significant and similar across different levels of export intensity distribution. In contrast, for low levels of R&D intensity (less than 3.5%), the impact is significantly positive only for the highest quantiles (75th and 90th) of export intensity. No significant effect of R&D on export intensity was found for high levels of R&D intensity (higher than 8.5%).

Similarly, Orts and Martì (2018) analyse the effect of different firm characteristics on export intensity and market scope (i.e. the number of markets to which the firm exports) on the EFIGE dataset with a quantile approach. The authors estimate a model wherein export performance is modelled as a function of firm productivity, size, capital intensity, R&D intensity, import intensity and a series of binary indicators for foreign affiliates and product and process innovations, respectively.Footnote 5 The authors find that the positive effect of R&D intensity increases along the export intensity distribution, reaches its maximum around the 70th quantile, and is not statistically significant at the highest quantiles.

3 Data

The empirical analysis is based on a dataset of Italian manufacturing firms obtained by merging the VI, VII, VIII, and IX waves of the ‘Indagine sulle imprese manifatturiere italiane’ (Survey on Italian Manufacturing Firms) run every 3 years by the Unicredit-Capitalia Observatory of Medium and Small Firms. The sampling scheme was the same for all four waves. All firms above the 500-employee threshold are included whereas firms with a number of employees between 10 and 500 are included according to a stratified sampling strategy. Strata are then based on size class, geographical area, and industry (according to the Pavitt taxonomy) to represent the population of Italian manufacturing firms significantly.

Pavitt (1984) suggested a classification of industries based on innovation-related characteristics (e.g. product and process innovation, sources of knowledge, appropriability regimes), firm size, and competitive factors. Pavitt identifies four groups of sectors: (i) the supplier-dominated sector, which is the most traditional manufacturing industries relying on sources of innovation external to the firm; (ii) the scale-intensive sector, mainly characterised by large firms producing basic materials and consumer durables for which sources of innovation may be both internal and external to the firm with a medium level of appropriability; (iii) the specialised suppliers sector, composed of small, more specialised firms producing technology to be sold to other firms for which the level of appropriability is high; and (iv) the science-based sector with high-tech firms relying on R&D from both in-house sources and university research and developing new products or processes with a high degree of appropriability.

The four waves cover the periods 1992–1994, 1995–1997, 1998–2000, and 2001–2003, respectively, and provide qualitative and quantitative information concerning several firm-level characteristics such as ownership structure, workforce composition, internationalisation, and innovation activities. Some variables have annual frequency while others refer to the last year of each wave, and still others cover the 3-year time span.

This dataset (or portions of it) has been extensively used by researchers in the internationalisation and innovation fields. For instance, the merged VIII and IX waves have been used, along with the study by Bratti and Felice (2012), by Antonelli and Crespi (2013) to examine the allocation of public subsidies to R&D activities and by Antonietti and Cainelli (2011) to estimate a structural model of R&D, innovation, productivity, and export.

Survey data were integrated with balance sheet information derived from the AIDA repository, a database elaborated by Bureau Van Dijk. Although this dataset admittedly currently covers a somewhat distant period, it still represents an extremely valuable source of information, especially for analysing the relationships between exports and innovation. In fact, it contains several measures of both different innovation input variables (e.g. formal R&D expenditures, number of employees involved in R&D) and innovation outputs (i.e. process and product innovation), and a wide array of firm-level control variables, which are extremely difficult to find in a single dataset.

The combined dataset obtained by merging the four waves of the survey is composed of 12,679 firms; unfortunately, only a very small fraction was observed in all four waves (1.2%), and a major change in the sample occurred between the VII and VIII waves. In turn, most of the firms appear in only one wave (68.85%), and even when firms appear twice, they might appear in two non-contiguous waves. Furthermore, in the VIII wave, the question about the share of exported turnover, our dependent variable, has unfortunately been omitted from the questionnaire.

Because of this peculiar structure of the original dataset, we prefer using the subsample composed of those firms observed either in both the VI and VII waves or in both the VIII and IX waves. The use of firms present in two contiguous waves comes at the cost of not being able to control for time-invariant firm-specific individual effects; however, it allows us to use 3-year lags of the independent variables likely to exert their effect on exports after a sufficiently long period. Therefore, we apply cross-sectional techniques to a sample of 1,029 firms with current variables observed in 1997 or 2003 and lagged variables observed in 1994 or 2000 (or over the–1992–1994 or 1998–2003 periods).Footnote 6

Table 1 presents the sectoral distribution of the original and final datasets across the manufacturing industries (2 digits Nace rev. 1 classification). Firms in the leather, electrical machinery, and medical and precision instruments industries (industry codes 19, 31, and 33) are underrepresented in the final dataset whereas the opposite occurs in the machinery industry (code 29). As for the extensive margin of export, that is, the share of exporting firms, the original and final samples differ only for the chemical and basic metal industries (codes 24 and 27), wherein exporting firms are overrepresented, and the electrical machinery industry (code 31), wherein they are underrepresented. These differences are mostly due to the unavailability of the instrument, constructed at the industry/size level (see later), for some industry/size class combinations and the increasing relationship between propensity to export and size. Indeed, the instrument is unavailable for the entire leather industry; for medium and large firms in the industry codes 22, 24, 27, 31, and 33; and for small firms in industry code 30. To avoid imputation of an instrument retrieved from a contiguous class size or industry, we excluded those observations.Footnote 7 Notwithstanding these differences, the two datasets do not seem to differ dramatically, so the final sample reasonably represents the sectoral composition of the Italian manufacturing sector well.Footnote 8

Table 1 Firms sectoral distribution

Table 2 shows the distribution of the sample concerning both the firms’ Pavitt classification and export status. Such classification groups firms based on their technological competence and has often been adopted in many empirical studies on innovation, trade, and competitiveness at both the macro and micro levels.Footnote 9

Table 2 Number of observations

Supplier-dominated industries include textiles, footwear, food and beverages, paper and printing, and wood while the scale-intensive group includes basic metals, motor vehicles, trailers, and semi-trailers. Specialised suppliers include machinery and equipment; office, accounting, and computing machinery; and medical, precision, and optical instruments. Finally, science-based industries consist of chemicals, pharmaceuticals, and electronics.

Table 2 shows that most firms operate in supply-dominated industries (about 42%); firms operating in science-based industries represent a small fraction of the sample (only about 4%) whereas less than 30% of firms operate in each of the other two sectors. Overall, 75% of firms are engaged in exporting activity. However, the Pavitt sector stratification reveals that such percentage is higher for firms operating in specialised suppliers and science-based industries (about 90% and 80% of firms export, respectively).

Table 3 shows some descriptive statistics for the whole sample for subsamples of exporting and non-exporting firms and for different ranges of the distribution of export intensity. Export intensity (expint), defined as the ratio of firms’ export sales to total sales, is on average about 40% for exporters and displays a high level of dispersion around the mean. For almost half of the exporters (about 41%), sales derived from exporting activity are below 30% of total sales. In contrast, the remaining firms are uniformly distributed across higher values of export intensity. Therefore, the export intensity distribution appears to be skewed to the right and, hence, not to be normally distributed. This shape of the export intensity distribution is confirmed by both graphical methods and statistical tests.Footnote 10

Table 3 Descriptive statistics for different ranges of export intensity

Firms’ innovative efforts are measured by the expenditures on R&D activity (R&D). Unconditional to export status, around 44% of firms in our sample were involved in formal R&D activities. However, striking differences emerge between exporting and non-exporting firms. More than half of the exporting firms (51%) performed R&D activities whereas only about 18% of non-exporting firms did. Furthermore, average expenditures in R&D activity is much higher for exporters than for non-exporters and significantly increases along export intensity distribution.Footnote 11

The average size, as measured by the number of employees (size), is approximately 130, as expected, and size distribution is very asymmetric with about 55% of firms classified as small (i.e. less than 50 employees) and a lower fraction (about 13%) defined as large (i.e. more than 250 employees).

On average, exporting firms are significantly larger than non-exporting firms (162 vs. 40 employees), and size is positively correlated with export intensity, so larger firms are those exhibiting higher shares of sales derived from exporting activity. Area dummies (North, Centre, South) show that most firms (75%) are located in Northern Italy (Emilia-Romagna, Friuli-Venezia Giulia, Liguria, Lombardia, Piemonte, Trentino-Alto Adige, Valle D’Aosta, and Veneto), and 18% are located in the Centre (Abruzzo, Marche, Toscana, Lazio, and Umbria). When considering exporting firms, firms in the North appear to be more export-oriented compared to firms in the rest of the country, both on the intensive and extensive margins. The sectoral and size distribution as well as the firm-level characteristics in our final sample are consistent with the evidence of the Italian section of the more recent EFIGE survey covering 2007–2009 (see Altomonte et al., 2012; Altomonte & Aquilante, 2012).Footnote 12 Particularly, the share of exporters is around 72%, and 54% of firms have workers directly involved in R&D activities. Furthermore, in the EFIGE dataset, the share of exporting firms increases with size. A minor difference emerges in the intensive margin of exports, which is slightly lower for the EFIGE dataset. The overall similarity of our dataset with that of EFIGE is reassuring in terms of the representativeness of our dataset regarding R&D and export activity.

Pavitt industry dummies suggest that firms operating in specialised suppliers and science-based industries show higher export propensity and intensity. Particularly, the share of firms belonging to specialised suppliers and science-based (supply dominated and scale intensive) industries increases (decreases) along the export intensity distribution.

4 Empirical strategy

This study aims to investigate whether innovative activities and other export intensity determinants differently affect export intensity at various points in its distribution by applying quantile regression techniques. As previously mentioned, such an approach has been adopted only by Wagner (2006) and Orts and Martì (2018). However, the standard conditional quantile regression techniques adopted in these studies do not account for either the censored nature of our dependent variable or the possible endogeneity of the regressors. Therefore, we adopted a more comprehensive estimation approach.

To assess the relationship between firms’ innovative effort and export intensity (firms’ export sales on total sales, in percentage), along with the effects of other firm characteristics on their export performance, we consider the following empirical model:

$$Expint_{i} = \beta_{0} + \beta_{1} Innov_{i} + \beta_{2} X_{i} + u_{i}$$
(1)

where subscript i denotes firms.

Firms’ innovative effort (Innovi) is measured by logarithmic transformation of firms’ R&D expenditures, which captures the existence of a system of incentives toward intentional innovative activities and can be considered a proxy for the allocation of resources to research and other information-generating activities in response to perceived profit opportunities (Grossman & Helpman, 1991).

Although the Unicredit-Capitalia survey reports information on the introduction of process and product innovation, we do not use these variables as export activity is often associated with organisational and technical changes that might be erroneously considered product or process innovation. In contrast, these variables represent product differentiation activity or simple process reorganisation and consist of introducing minor (technical or aesthetic) changes. Indeed, the Oslo Manual (2018), which provides guidelines for collecting and using innovation data, specifies that “An innovation is a new or improved product or process (or combination thereof) that differs significantly from the unit’s previous products or processes and that has been made available to potential users (product) or brought into use by the unit (process)”.

Xi is a vector of covariates including firm size and its square, a set of Pavitt dummies to include the omission of sector specific time invariant characteristics that might bias our parameter estimates, a year dummy accounting for common macroeconomic factors, and three area dummy variables indicating whether firms operate in the North, Centre, and South of Italy to control for time-invariant locational effects. Indeed, the main destination markets of Italian exports are located in Europe (e.g. Bugamelli et al., 2018); therefore, being located in an area closer to destination markets might favour exporting performance. Moreover, economic activities are not evenly distributed across different geographical areas. For example, industrial districts are more concentrated in the North and Centre of Italy; in turn, the exploitation of agglomeration economies might differ across areas and affect firms’ export performance. A related issue is the heterogeneity in infrastructure endowments, quality of institutions, and economic growth that characterises the Italian economy (e.g. Banca d’Italia 2019).

We do not include productivity among covariates since R&D may be considered as one of its major drivers.Footnote 13 Size is expected to exert a positive effect on export intensity. Given the existence of relevant sunk costs necessary for entering foreign markets, larger firms might be able to utilise economies of scale in production and might show a higher capacity to take risks and obtain credit at lower costs (Wagner, 1995). Such predictions stem from several theoretical models suggested by different trade theory models (Melitz & Redding, 2014).

As the relationship between size and export intensity might be nonlinear, we also include the square of the size variable. Wakelin (1998) suggests that an inverted U-shaped relationship may be associated with the existence of major firms with monopolistic power, which may show less motivation to export.

Both size and R&D might be endogenous given the possible existence of reverse causality and simultaneity issues, as discussed by recent theoretical models of trade (e.g. Aghion et al., 2018; Impullitti & Licandro, 2018; Melitz & Ottaviano, 2008; Melitz & Redding, 2014). In principle, identifying valid instruments for both variables would be appropriate. However, given that the CQIV estimator does not allow us to apply IV to more than one endogenous variable at a time, we choose to find a valid instrument for the R&D variable. Moreover, to weaken the possible inverse relationship with export intensity, we include 3-year lagged values of R&D (the last year of the previous waves). Indeed, it is reasonable to assume that R&D effort takes time to exert its effect on firms’ export performance.Footnote 14

With regard to the choice of the instrument, we choose to follow a widely used approach in applied Industrial Organization—one that uses the value the endogenous variable in a market different from the one under analysis. For a discussion on this approach, see Berry and Haile (2016). To this end, we retrieve the 1998–2000 values of industry R&D expenditures in Spain from the ‘third Community Innovation Survey’ (CIS III), which has been conducted over a sample of European (mostly EU) countries. Among countries participating in this survey, we select Spain as the country whose industrial structure is most similar to that of Italy. We match Italian and Spanish firms according to both industry and size. The rationale is that, on one hand, R&D activity is obviously strongly related to industry characteristics (technology, product differentiation, etc.) while, on the other hand, it is related to firm size. Thus, by adopting this approach, we are confident in increasing the correlation between the instrument and endogenous variable.

Particularly, the CIS III survey contains information on the industry wherein the firm operates (Nace rev.1 at the 2 digits level) and a discrete measure of size built as a three-class variable (small, medium, large), which is defined according to the number of employees. The instrument for an Italian firm operating in a given industry and size is thus the mean value of the log R&D expenditures for Spanish firms of that size and operating in the same industry observed in 1998–2000.

The adoption of the CIS III survey is determined by the need to have the most similar time span to the one we consider. Unfortunately, the unavailability of the first two releases of the survey prevented us from using the years preceding our sample period. However, the relevance of the chosen instrument can be assessed by observing very high statistical significance of the instrument in first-stage regression (see below) whereas instrument exogeneity relies on the fact that a shock to exporting activity in Italy is very unlikely to have consequences for Spanish R&D expenditures. Moreover, we conduct IV estimates with an additional instrument, namely, the 1998–2000 values of industry R&D expenditures in Germany from the CIS III; thus, the over-identification of the model allows us to run a formal test of exogeneity.

Regarding the possible endogeneity of the size variable, given the impossibility of applying instrumental variable techniques to more than one endogenous variable at a time, we include size and its square in lagged form (3-year lag) to weaken the possible inverse relationship between size and export intensity given that lagged size might be less related to current export intensity values.

Another issue we need to control for is the censored nature of our dependent variable. Indeed, export intensity has a non-negligible mass at 0 due to the large fraction (23%) of non-exporting firms. To account for both endogeneity of R&D and censoring bias issues, we apply the CQIV estimator recently proposed by Chernozhukov et al. (2015). Note that this estimator allows for including only one continuous endogenous explanatory variable in the model. This also motivates the choice of our innovation activity variable. This estimator uses a control function approach by estimating the first stage for the endogenous regressor and the second stage, where the first-stage estimated residual is included as an additional regressor. Moreover, in the second stage, the probability of censoring is estimated for all observations, and standard quantile estimators for uncensored data are iteratively applied to the subset of observations for which the probability of censoring is sufficiently low (for technical details, see Chernozhukov et al., 2015, 2019).

For comparison purposes, we estimate different specifications of Eq. (1) with other quantile and mean estimation techniques that, in contrast to the CQIV estimator, only partially consider the issues of censoring, endogeneity, and heterogeneous effects along the distribution of the dependent variables raised by the nature of the current data.

Particularly, we also use three quantile estimators (the Censored Quantile [CQR], the Instrumental Variable Quantile [QIV] and the simple Quantile [QR]). These estimators address the heterogeneity issue but differ in the way they address the other two issues. The CQR approach treats the censored nature of the data, but it does not consider endogeneity whereas the QIV approach only controls for endogeneity, and standard quantile techniques do not consider both endogeneity and censoring issues. Coefficients obtained by quantile regressions can be interpreted similarly with the coefficients in an OLS regression (i.e. as the partial derivative with respect to a particular regressor). The difference is that the effect is on the conditional quantile of the dependent variable and not on the conditional mean. Moreover, to verify whether the quantile approaches provide a more exhaustive picture of the relationship between firms’ export intensity and their innovation potential, we estimate our models using both instrumental variables TOBIT and standard TOBIT models that have been widely used for analysing export intensity determinants. The first method allows us to include endogeneity issues while both approaches allow us to tackle the censored nature of the dependent variable. However, they rely on specific distributional assumptions about variables and are more sensitive to outliers, and the marginal effects are heterogeneous according to values assumed by covariates and not by values of the dependent variable, such as in quantile regressions. Finally, we also use standard IV and OLS techniques measuring only a constant effect of the regressors at the mean of the dependent variable.Footnote 15

5 Empirical results

Tables 4 and 5 report the estimates of the basic specification of Eq. (1) obtained by applying the different econometric techniques mentioned above. Regarding the quantile approaches, since 23% of observed firms do not export, we present results for percentiles above the 30th; particularly, estimates are performed for each ten percentiles ranging from the 30th to the 90th percentiles.

Table 4 Regression results
Table 5 Regression results

The first-stage coefficients for the instrument (R&D_SP) are always positive and highly statistically significant, thereby confirming the relevance of the instrument. More precisely, the first-stage F-statistics on excluded instruments are very high (above 25, being the square of the t-statistics for the null that the coefficient is equal to 0). This finding suggests our estimates do not suffer from a weak instrument problem and confirms that the instrument is significantly correlated with the endogenous variable. Furthermore, to determine instrument exogeneity, we estimate our basic specification after including an additional instrument, namely, 1998–2000 values of industry R&D expenditures in Germany recovered from the CIS III (see Table 7 in the Appendix). The results clearly show the instruments’ validity; the Hansen J-statistics for over-identifying restrictions is equal to 0.028 with a p-value equal to 0.868.Footnote 16

In addition to the estimated coefficients of R&D and control variables, we also report in Tables 4 and 5 the coefficient of the control function residual (labelled as CF residual), which is significant for all quantiles, thereby confirming the endogeneity concerns about the R&D variable.

Overall, R&D expenditures positively and significantly affect export intensity. However, the magnitude of the effect displays a significant variability associated with different estimation techniques. Figure 1 shows the pattern of the R&D coefficient estimated using the CQIV techniques for different percentiles along with the average partial effect of the Tobit IV estimates.

Fig. 1
figure 1

Censored quantile instrumental variable (CQIV) and Tobit IV estimates. Figure shows estimated R&D coefficients (squares) and their confidence interval (vertical lines) obtained by applying censored quantile instrumental variable estimator (CQIV) at 30th, 40th, 50th, 60th, 70th, 80th and 90th percentiles of the export intensity distribution. The horizontal continuous line indicates the R&D average partial effects estimated by applying instrumental variables Tobit (Tobit IV) model and the horizontal dashed lines display the corresponding 95% confidence interval bounds

CQIV estimates suggest that an increase of 10% in R&D expenditures prompts an increase in export intensity, ranging between about 0.10 and 0.38 percentage points. Particularly, the impact of R&D increases along the export intensity distribution, reaches the highest value at the 70th percentile (where export intensity is about 50%), and decreases slightly at the highest percentiles (80th and 90th), but it is still higher than the lowest percentiles (30th and 40th). Therefore, CQIV regressions suggest that firms characterised by an export intensity of about 50% or higher benefit most from investing in R&D activity in terms of expansion in international markets. As expected, Tobit IV estimates lie in the ranges of the CQIV values and suggest an increase of 10% in R&D expenditures causes an increase in export intensity of about 0.14 percentage points. As shown in Fig. 1, most percentile coefficients lie outside the Tobit IV confidence interval, thereby supporting conditional quantile modelling with respect to a traditional conditional mean approach. A similar pattern of R&D coefficients is observed when applying the QIV estimator, although marginal effects are quantitatively lower, ranging between 0.2 and 3.1.

Consistent with previous international empirical literature, our findings suggest that investments in R&D activity have a positive effect on firms’ export intensity by generating a higher probability of introducing product/process innovations and/or by increasing firm absorptive capacity (see the Sect. 2). Furthermore, by applying quantile regression techniques, we provide a clearer picture of the heterogeneity of such effects, which seems to depend on firms’ export intensity distribution. Several not mutually exclusive considerations might explain our finding that R&D investments have a heterogeneous effect on export intensity distribution. The first, most intuitive explanation is that introducing new products and/or cost savings induced by innovative activities fosters firms’ competitiveness, allowing them to enter new foreign markets and/or export new products, thus enlarging export baskets and the number of destination markets.Footnote 17 Indeed, we do not have information on the evolution of export basket composition; however, our data suggest that higher export intensity is associated with a higher number of export destination markets. The Unicredit-Capitalia database allows us to identify eight different destination markets: UE (15), other European countries, Africa, North America, South America, Asia, China, Australia, and Oceania. Firms included in our sample, on average, export to 3.2 foreign markets. Particularly, the average number of destination markets increases from 1.45 for firms exporting less than 3% of their turnover (30th percentile) to 4.22 for those with export intensity higher than 76% (90th percentile). Hence, the number of markets served increases along with the distribution of export intensity. A higher number of destination markets might have a multiplicative effect on innovation rents and might help diversify business risks by providing cross-subsidization of revenues. A similar argument has been suggested, among others, by Halpern and Muraközy (2012), who analyse the innovation and firm performance relation on Hungarian data. The authors find that innovation has a positive effect on all margins of trade and innovative firms export to more countries, more products, and more in one product–destination combination, on average.

Another possible explanation might be related to the different ways export activities are organised. Instead of directly (or actively) exporting, firms may engage in exports indirectly (or passively) via intermediaries. If firms with different export methods are also characterised by different export intensities, the heterogeneous impact of R&D might be demand-induced (or reduced) by intermediaries. Finally, firms exporting a non-negligible fraction of their sales tend to devote specific resources, such as export departments and export managers, to export activities. In turn, the return on innovative activities might be higher in terms of an increase in export shares; larger human resources are devoted to foreign markets.

The relationship between firms’ innovative effort and exporting activity detected by applying the CQIV estimator is confirmed when Eq. (1) is estimated with the CQR and QR approaches. In both cases, the pattern of estimated coefficients for R&D appears to be similar to the CQIV results as they reach their maximum value around the 70th percentile, and marginal effects are found to be lower with respect to CQIV and QIV values since they range between 0.03 and 0.8. Likewise, the estimates obtained with the Tobit and OLS methods are smaller than those computed using the instrumental variables approach (Tobit IV and IV). Therefore, the results obtained when both endogeneity issues and the censored nature of the dependent variable are neglected provide the lower bound of the parameter values.

To further support our results, properly testing the statistical significance of the difference among parameter estimates obtained at different quantiles is important. Unfortunately, the CQIV estimator provided by Chernozhukov et al. (2015) does not provide a variance–covariance matrix that would allow us to conduct specific tests involving more than one quantile. To overcome this limitation, we can compare parameter estimates at different quantiles by inspecting confidence intervals in Fig. 1. If they do not overlap, we can infer that coefficients differ along export intensity distribution. Particularly, confidence intervals confirm the increasing pattern of the R&D coefficients until the 70th percentile and seem to suggest that coefficients estimated at the highest percentiles (from 70th onwards) do not significantly differ from each other.

Furthermore, for each pair of quantiles, we performed t-tests on the R&D coefficients from CQIV estimates. Particularly, we computed the following statistics:

$$t= \frac{{\upbeta }_{1i}-{\upbeta }_{1j}}{\sqrt{{\mathrm{se}}_{{\upbeta }_{1i}}^{2}+{\mathrm{se}}_{{\upbeta }_{1j}}^{2}}}$$
(2)

where β1i and β1j are the CQIV estimated coefficients of R&D at quantiles i and j (Table 5), and \({\mathrm{se}}_{{\upbeta }_{1i}}\) and \({\mathrm{se}}_{{\upbeta }_{1j}}\) are the standard errors. The null hypothesis is that the coefficients are equal between pairwise quantiles. In Table 6, we report t-statistics, and we highlight in bold those values allowing us to reject the null hypothesis (at the 5% significance level).

Table 6 t-test on pairwise quantiles

Although these tests are not rigorous, their results are consistent with the results of the graphical inspection of Fig. 1. Particularly, the effect of R&D estimated at the 70th percentile is significantly higher than those estimated at other percentiles, both lower and higher. However, in the latter case, the null hypothesis is rejected only at the 20% significance level.

We cannot directly compare our results with those of Wagner (2006) on German data because the way R&D is included as a regressor is different. However, they are strikingly similar to those provided by Orts and Martì (2018) on EFIGE data. Despite this, we believe our results on the heterogeneous impact of R&D on export intensity along its conditional distribution can provide novel evidence on this issue because they are obtained by properly addressing the endogeneity of the R&D variable and the censored nature of the dependent variable.

On the results concerning different control variables included in the basic specification of Eq. 1, the effect of firm size (evaluated at t-3) is not stable along the export intensity distribution and exhibits a significant variability associated with different estimation techniques. Coefficients from the CQIV regressions follow a bell-shaped pattern for lower quantiles while a U-shaped relationship between size and export intensity is observed in the right tail of the distribution.Footnote 18 Given that firm size in our sample increases along the export intensity distribution, these findings are broadly consistent with those obtained by Sterlacchini (2001), who finds an inverted U-shaped relationship for small firms, a U-shaped relationship for large firms, and no impact of size on export intensity for medium-sized firms.Footnote 19 Size coefficients show a similar pattern when the QIV estimator is applied, while Tobit IV and IV estimates do not show any impact of size on export intensity. A bell-shaped relationship between firm size and exporting activity that persists along most of the export intensity distribution is shown by the CQR and QR estimates and by Tobit and OLS regressions. As discussed above for the effect of R&D, the adoption of the most suitable estimation technique helps disentangle more nuanced evidence on the relationship between size and export intensity.

Examining the coefficients of geographical location dummies, heterogeneous results emerge according to the different quantiles analysed. Estimates reveal that being located in Northern Italy positively affects export intensity in most of the estimated models. This result is consistent with the evidence provided by previous literature for Italy showing that northern firms exhibit a higher export intensity relative to firms located in other areas (Brancati et al., 2018; Bugamelli et al., 2000; D’Angelo, 2012; Sterlacchini, 2001).

Furthermore, estimates suggest that firms whose export intensity is lower than 10% and located in the South of Italy seem to have a locational disadvantage relative to those located in the Centre, while such a differential disappears for firms exporting a higher percentage of their sales. This finding might suggest that southern firms may find it difficult to begin participating in export markets, for instance, because of geographical distance and, hence, higher transportation and informational costs. However, once they increase their participation, as measured by export intensity, they overcome these disadvantages. This pattern is not confirmed when we apply other estimation techniques that do not provide conclusive evidence.

As for sectoral dummies, coefficients turn to be at best weakly significant for highest quantiles of the export intensity distribution, once we control for formal R&D activity, size, and location. Hence, CQIV results suggest that the observed highest export intensity/propensity characterizing firms operating in specialised suppliers and science-based industries (as shown in Tables 2 and 3) is likely to be associated to their R&D activity rather than to sector specific characteristics. These findings are broadly confirmed by estimates obtained with Tobit IV, QIV, and IV, while the unconditional rankings of Tables 2 and 3 tend to emerge with estimation methods that do not consider the endogeneity of the innovation activity.

To check the robustness of our results, we follow different strategies. First, we replicate our estimates by using R&D intensity (i.e. R&D over total sales) instead of R&D expenditures, and all our findings are confirmed. Second, we augment our basic specification by including some firm characteristics that have been found to affect export intensity in previous empirical literature. We include firms’ ownership structure to account for the fact that firms belonging to foreign owners might be more skilled to compete in international markets as they might have a larger international network. Alternatively, a binary variable indicating whether the firm has established, through either brownfield or greenfield, a foreign subsidiary in the previous 3-year period is also included. Indeed, FDI and export activity might be correlated since FDI might improve the knowledge of foreign markets, and that of the markets obtained by exporting might favour FDI investments. Most of the results discussed so far are broadly confirmed when we control for these covariates.Footnote 20

As an additional robustness check, we estimate Eq. (1) after splitting the sample according to different criteria. Estimates conducted stratifying the sample according to Pavitt classification suggest that our main results on the relationship between R&D and export intensity are confirmed regardless of the category considered.Footnote 21 This notwithstanding, as expected, the effect of R&D on export intensity is stronger for companies operating in industry that base their innovative activity more on internal R&D expenditures (i.e. scale intensive and science-based industries). Moreover, a stronger heterogeneity on the relationship between R&D expenditures and export intensity emerges when we split the sample according to firms’ geographical location. Indeed, our main results seem to be mostly driven by firms operating in the Centre and South of Italy, where the impact of R&D increases along the export intensity distribution and reaches the highest value between the 70th and 80th percentiles. As for firms located in Northern Italy, instead, a quite stable impact of the R&D on export intensity emerges. This impact is also very strong, being larger (for all quantiles but the first) to the maximum effect estimated for the rest of the country. Finally, the overall findings are also confirmed when we estimate our model on subsamples based on firms’ size (small vs. medium large firms as defined in the Data Section).Footnote 22 Indeed, estimates conducted on separate sub-samples have to be taken with caution given the small size of the different subsamples.

6 Concluding remarks

In this paper, we analyse firms’ export intensity determinants with a particular focus on the role of innovative effort. To better understand the relationship between firms’ innovative effort, measured by R&D expenditures and export intensity, we study whether such a relationship varies along the export intensity distribution by applying quantile regression techniques on a cross-sectional sample of Italian manufacturing firms. We also deal with the censored nature of the export intensity variable and with the endogeneity of the innovation proxy by applying the CQIV estimator recently proposed by Chernozhukov et al. (2015).

The empirical results obtained by applying CQIV and other estimation techniques confirm that R&D expenditures positively affect export intensity. Particularly, this effect increases along the export intensity distribution, reaches the highest value at the 70th percentile (where export intensity is about 50%), and persists at the highest percentiles (80th and 90th). Therefore, our results suggest that firms characterised by an export intensity of about 50% or more can obtain the highest advantages from investing in R&D activity in terms of expansion in international markets. Point estimates suggest that an increase of 10% in R&D expenditures prompts an increase in export intensity, which ranges between about 0.10 and 0.38 percentage points according to different export intensity levels.

We have discussed different possible explanations for these results, grounded on competition effects, the way exporting activities are organised, the number of destination markets, and the amount of resources devoted to the exports. As for external validity, assessing whether our results hold for other countries and/or in recent years is difficult. Particularly, our sample period covers only the beginning of the first decade of this century, thereby neglecting the huge increase in world trade and the great recession. In turn, a natural extension of this study is to replicate our analyses with data covering the years following the great recession. However, the similarity of our results with those obtained by Orts and Martì (2018) using more recent data covering several countries (e.g. the EFIGE dataset) is partially reassuring in this respect. Indeed, our sample shows very similar characteristics relative to the Italian data included in the EFIGE dataset. Particularly, the sectoral and size distribution, percentage of firms investing in R&D, and percentage of exporting firms do not significantly differ. The overall similarity of our dataset with the more recent EFIGE database can provide some intuition on the external validity of our results.

Our findings show the importance of considering the heterogeneous effects of covariates along export intensity distribution, not only to better understand the relationship between export intensity and firms’ innovative effort but also to provide useful insights for the design of policy instruments aimed at favouring export activity. Particularly, R&D incentives should consider their impact on export performance (see Altomonte et al., 2013), and our results suggest that such policy measures are likely to have a heterogeneous effect, being stronger for firms characterised by an already high level of export intensity. Overall, the results point towards the importance of supporting the growth of firms’ export-intensive margin to better exploit the positive impact of innovation incentives. Results of the heterogeneity analysis, to be taken with caution given the small size of different subsamples, suggest that the overall effect is homogenous across firm size but seems to vary in magnitude according to Pavitt sectoral classification and geographical location. Indeed, the relationship between R&D and export performance seems to be stronger for firms that mostly base their innovative activity on internal R&D expenditures (i.e. scale intensive and science-based industries) and for firms operating in the North of Italy. Hence, policies which uniformly affect incentives to perform R&D for all firms will tend to exacerbate existing differences in exporting performance across sectors and regions thereby suggesting the need to adopt other kinds of interventions to boost export performance of firms in South-Central Italy and operating in supply dominated and in specialised suppliers industries. Further research using wider samples might be useful to clarify this issue.