1 Introduction

Innovation is the creation of new knowledge and the successful implementation of a new product, service, or process, which for most activities entails commercial success (Gordon and McCann 2005). In the modern world, innovation has been typically emphasized as a key path to prosperity. Then, how a region breaks with existing knowledge system and introduces new ideas, products, and services is a question of policy significance. So far, many researchers have examined the innovation process by appraising the relative motors of regional innovation output. Most of these studies have focused on economic factors such as R&D and human capital (e.g., Feldman and Florida 1994; Sun 2003; Stephens et al. 2013), the role of agglomeration economics (Lee and Rodríguez-Pose 2014), demographic and institutional context of local economies (Sleuwaegen and Boiardi 2014; Fitjar and Rodríguez-Pose 2015; Crescenzi and Rodríguez Pose 2017), or factors influencing spatial linkages (Song and Zhang 2017).

Yet, one potentially key factor behind the growth of Chinese regional innovation has been largely overlooked: the role of amenities in enabling the development of innovative activities. Being a key factor behind quality of life, amenities are defined as site- or region-specific goods and services that affect regional attractiveness for its workforce (Mulligan and Carruthers 2011). These amenities include climatic features, beauty and accessibility of the natural environment, landscape, public services, safety, or the vibrancy of a region’s cultural milieu (Roback 1982; Gyourko and Tracy 1991; Dalenberg and Partridge 1997; Deller et al. 2001; Rappaport 2007). Conversely, disamenities are factors that make places unattractive such as extreme weather, air pollutants, and crime. Since amenities have been found to be important factors in driving household and firm location in developed countries (e.g., Rappaport 2007; Partridge 2010), it seems reasonable to assume that amenities may play a significant role in shaping the geography of innovation, especially if they are attractive to skill workers.

While China is still considered a developing country, over the past 30 years, its economy has experienced the same type of changes that led to amenity-driven migration and growth in more developed countries. For example, Chinese per-capita GDP has usually grown more than 8 percent per year since 1993 (World Bank). Additionally, internal migration barriers have decreased due to reforms in the “HukouFootnote 1 system and rapid advances in transportation and information technology. Because Chinese households are now more educated and wealthier, they are seeking out locations that offer a higher non-market quality of life (Zheng et al. 2009; Zheng and Kahn 2013a).

Figure 1 maps the spatial distribution of innovation intensity in China in 2007 and 2014, respectively, in which innovation intensity is measured by per-capita patent applications. They show that patenting activities are spatially concentrated in coastal China, especially in the most economically developed areas in the Yangtze River delta and Pearl River delta. Meanwhile, clusters of innovation are expanding over time. For example, from 2007 to 2014, joining coastal cities, some central and eastern cities such as Chengdu, Jinan, and Hefei were also among the most innovative poles, pointing to a decentralized process. Correlation test results also show innovation and air pollution are inversely related. (The correlation coefficient between concentration of PM2.5 and patenting intensity is − 0.31 and significant at the 5% level during the period 2007–2010 when we have data.) As air pollution is a disamenity, it then suggests a positive correlation between innovation and desirable amenities. However, more evidence is needed to assess the positive and significant relationship.

Fig. 1
figure 1

The geographical distribution of innovation intensity (2007, 2014)

Given that high-skilled workers (who are most likely to be high-income) are more strongly attracted to amenities (because amenities are typically normal goods) than less skilled workers, there should also be a positive relation between the output of these high-skilled workers (including innovation) and a given location’s level of amenities. Thus, amenities likely influence where innovation takes place by affecting the geographical distribution of highly educated workers.

Therefore, this paper aims to explore the question of whether amenities play a role in determining the geography of innovation in China. In answering this question, we offer several contributions to the literature. First, this study concentrates on the role of amenities in affecting Chinese regional innovation, which has attracted surprisingly little scholarly attention given that amenities are thought to be increasingly important as China becomes wealthier. Second, the paper is the first to discuss the heterogeneous impact of amenities at the city level, which is an extension of studies exploring the heterogeneous effects of amenities at the individual level. Third, both natural amenities and consumer amenities are considered. Due to concerns regarding endogeneity from consumer amenities, previous studies have rarely included consumer amenities. Yet, by utilizing a matching instrument approach to address the endogeneity problems, this paper considers both the impact of natural amenities and consumer amenities. Finally, because we use city-level data, our conclusions are more applicable than the past literature that primarily used provincial-level data (e.g., Sun 2000, 2003; Li 2009; Crescenzi et al. 2012; Fan et al. 2012; Fan 2014).

In what follows, Sect. 2 overviews the literature on the determinants of regional innovation. Section 3 presents mechanism analysis on the impact of amenities on regional innovation by justifying the relationship between amenities and regional innovation. Section 4 introduces the empirical model, variables, and estimation strategies. The key results are in Sect. 5, and a discussion of the mechanisms is in Sect. 6. Section 7 provides further heterogeneity analysis to investigate whether the impact of amenities is place-based. Finally, Sect. 8 concludes and discusses policy implications.

2 Drivers of regional innovation: literature review

There are many factors thought to drive regional innovation. To begin with, according to Griliches (1979), the innovative-related inputs such as research and development (R&D) and human capital are required for innovative processes (e.g., Co 2002; Gordon and Mccann 2005; Rodríguez-Pose and Crescenzi 2006; Chen and Kenney 2007; Faggian and McCann 2009; Smit et al. 2015; Fitjar and Rodríguez-Pose 2015; Kafouros et al. 2015; Crescenzi and Jaax 2017). Yet, merely R&D efforts and human capital are by no means sufficient to explain regional-level innovative performance. An extensive literature has shed light on the role of other socioeconomic, demographic, and institutional factors.

There is a consensus that innovative processes are more productive in economically aggregated regions. The New Economic Geography emphasizes agglomeration economies as the main motor of economic growth and innovation production (Fujita and Thisse 2003; Gordon and Mccann 2005; Kuckiki and Tsuji 2010; Crescenzi and Jaax 2017). Concentration of people and economic activities is supposedly helpful in terms of accelerating the transmission of information. Also, face-to-face contacts constitute economically efficient means for the transmission of generally considered “sticky” but valuable knowledge (Morgan 2004). The effectiveness of agglomeration economies in creating a supportive context for generating new knowledge has also been documented by ample empirical evidence (e.g., Gordon and McCann 2005; Fitjar and Rodríguez-Pose 2015; Rodríguez-Pose and Crescenzi 2008; Kafouros et al. 2015; Crescenzi and Jaax 2017).

The innovative processes have also been linked to the industrial structure of local economics and institutions. Fitjar and Rodríguez-Pose (2015), for instance, suggest that the structural factors constitute the conditioner or processes of regional innovation. As highlighted by several authors, institutional conditions and demographical and social factors have made the relationship between innovative efforts and the generation of new knowledge far from a linear process (Agrawal et al. 2008; Rodríguez-Pose and Crescenzi 2008; Kafouros et al. 2015; Smit et al. 2015; Rong et al. 2017; Wu et al. 2017).

The openness of a region as reflected by FDI and trade is also assumed to affect local innovation performance. Crescenzi and Jaax (2017), for example, conclude that trade is the “global knowledge pipeline”. Spatial spillovers through geographical proximity are another route that promotes innovation (e.g., Agrawal et al. 2008; Rodríguez-Pose and Crescenzi 2008; Smit et al. 2015; Rong et al. 2017; Wu et al. 2017).

In addition to the above factors, amenities could be another key determinant of regional innovation. However, very few studies have attempted to identify and assess the impact of amenities on regional innovation. To date, a handful of recent papers have provided hints to this question. For example, utilizing 2013–2015 US census block data, Zandiatashbar and Hamidi (2018) find that amenities (related to transit service quality and walkability) along with diversity and tolerance are significant influential factors of local knowledge economy in the USA. Based on data for an older downtown area in a Chinese province, He et al.(2018a, b) find landscape amenities, leisure, and creativity networking opportunities are important factors in explaining the spatial concentration of photography studios in Nanjing, China—though we caution that generalization from case studies can be limited.

3 Amenities and innovation: mechanism analysis

Existing studies of developed countries offers several hints why amenities are important for innovation development, which we classify into two strands. First, amenities affect innovation by increasing the supply of skilled workers; second, amenities may further increase innovation because amenities benefit the health and attitudes of the workforce, which increases their productivity. Following the two strands of literature, we justify the relationship between amenities and innovation output in what follows.

3.1 Amenities attract talent with high levels of human capital

Amenities affect innovation activities by helping to retain and attract skilled workers. According to Florida (2002a), amenities play a crucial role in innovation and economic development in this manner. Florida defines talent as highly educated people, professional and technical workers, scientists, and engineers, all of whom help generate innovations. Holding other local conditions constant, amenities attract talent, which in turn attracts high-technology industries that produce more regional innovation. Yet, such a causal relationship is based on two preconditions: First, high-skilled workers are geographically mobile. Second, there is a limited supply of high-skilled workers, which implies that the labor market for these workers is affected by their location choices. In such a scenario, high-technology firms will be attracted to locations with a greater supply of high-skilled workers.

Many empirical studies support such an amenity growth strategy. For example, Glaeser et al. (2001), Adamson et al. (2004) find a positive association between amenities and city growth. Li et al. (2016) suggest that high-growth firms exist in counties with higher educational attainment and more natural amenities.

More specifically, according to some survey studies, environmental amenities, good schools, and public services are important site-specific factors in location choice for both households and firms. For example, Stafford et al. (1984) found that quality of life is ranked seventh out of ten factors for all plants in choice of region but third out of ten for high-tech facilities. Focusing only on R&D facilities, Lund (1986) found that quality of life is ranked first out of six factors. Thus, innovative firms and research institutes that are actively engaged in innovation are typically attracted to amenable places.

3.2 Amenities affect labor productivities

If high-skilled workers have low mobility across regions and their bargaining power in the labor market is limited, regions are unlikely to be affected by the amenity factors as described above. Even in this case, there is another possible channel through which amenities affect regional innovation, i.e., by affecting worker productivity.

Individual workers’ mood and creativity may be linked to their location’s amenities and culture. Places with open space, nice climate, surrounded by lakes, greenspace, and consumer services will likely have happier workforces and improved productivity—especially if creativity is required (Wu and Dong 2014; He et al. 2018a, b). According to Carlino and Saiz (2019), the package of environmental, esthetic, and recreational amenities is complementary to the enjoyment of leisure that in turn improves psychological welfare—thereby making people more productive. Conversely, the opposite applies when there are disamenities. For example, according to the psychology literature, exposure to ambient air pollution reduces cognitive ability, as well as increases anxiety and boredom (Lavy et al. 2014; Pun et al. 2017). Additionally, other studies reveal that air pollution harms human health, shortens life expectancy, reduces the return of human capital (e.g., Chay and Greenstone 2003; Neidell 2004; Currie and Neidell 2005; Rundell 2012; Chen et al. 2013; Sun et al. 2019). Therefore, all else equal, more amenable environments enhance worker productivity, especially for knowledge workers.

The economics literature further illustrates how labor productivity is affected by air quality. For example, Zivin and Neidell (2012) suggest that ozone concentration in the air affects those who are outdoors. By measuring how labor productivity changes, empirical studies from the USA, India, and China all suggest that the efficiency of piece-rate workers declines during days with higher concentrations of PM2.5 and SO2 air pollutants (Adhvaryu et al. 2014; Chang et al. 2016; He et al. 2018a, b). The adverse effect of air pollutants on athletic performance provides other examples. For example, German baseball athletes perform worse as CO, PM10, and PM2.5 air concentrations increase (Lichter et al. 2017; Archsmith et al. 2018). Using a sample of over 300,000 marathon runners in 55 events in 37 Chinese cities, Guo and Fu (2019) find that a one standard deviation increase in air pollution slows a runner’s time by 2.6 min. Therefore, we infer that air quality likely affects R&D activities and innovation. Likewise, access to good schools, healthcare, restaurants, as well as cultural and historical amenities such as museums could have similar effects on innovation (though this is also an empirical question).

This compelling evidence of the influence of various amenities on local productivity and their attractiveness to high-skilled workers calls for a comprehensive framework to assess the impact of amenities on Chinese innovation. Previous research emphasized the importance of R&D, human capital, agglomeration, industrial factors, and spatial spillovers in explaining regional innovation. This paper extends this literature by showing that increased provision of consumer amenities and a more amenable natural environment can also support innovation growth.

4 Empirical model and data

4.1 Model specification

Several theories have been put forward to explain the geography of innovation. Innovation is the return of investment in human capital and research and development (R&D) spending (Griliches 1979, 1986; Jaffe 1986). The New Economic Geography (NEG) approach highlights the role of agglomeration economies to explain differing innovation performance (e.g., Acs et al. 2002; Carlino et al. 2007). Furthermore, the literature on regional innovation systems emphasizes economic structural factors such as industry composition and the level of economic development in determining innovation (Rodríguez-Pose and Crescenzi 2008; Crescenzi et al. 2012; Crescenzi and Rodríguez Pose 2013). However, Rodríguez-Pose and Wilkie (2016) note that, as significant geographical heterogeneity can exist between any two countries, it is hard to find a general spatial model of innovation applying to any country. Some examples of papers that explore regional-level innovation include Crescenzi et al. (2007, 2012), Rodríguez-Pose and Crescenzi (2008), Crescenzi and Rodríguez Pose (2013), Lee and Rodríguez-Pose (2014), Rodríguez-Pose and Villarreal Peralta (2015), Betz et al. (2016). Following this literature, we specify the model as followsFootnote 2:

$$ {\text{inno}}_{i,t} = \alpha + \alpha_{1} \;{\text{amenity}}\_{\text{natural}}_{i,t - 1} + \beta_{1} \;{\text{amenity}}\_{\text{consumer}}_{i,t - 1} + \alpha_{2} X_{i,t - 1} + \mu_{i} + \nu_{t} + \varepsilon_{it} $$
(1)

where \( {\text{inno}}_{{i,t}} \) represents innovation outcome at time t in city i; \( {\text{amenity\_natural}}_{{i,t - 1}} \) and \( {\text{amenity\_consumer}}_{{i,t - 1}} \) are the natural amenity and consumer amenity variables at time t-1 in city i, respectively. \( X_{i,t - 1} \) is a set of control variables. \( \mu_{i} \) and \( \nu_{t} \) stand for city and year fixed effects, and \( \varepsilon_{it} \) represents the residual. The coefficients of interest are \( \alpha_{1} \) and \( \beta_{1} \) in model (1). To mitigate potential endogeneity problems caused by reverse causality, we lag all the time-varying explanatory variables by one period. Endogeneity concerns will be further discussed below.

4.2 Variables

4.2.1 Dependent variable

The dependent variable proxies for innovation intensity are per-capita patent applications,Footnote 3 which have been accepted as the most appropriate measure of innovation performance (e.g., Mansfield 1986; Griliches 1990; Hagedoorn and Cloodt 2003; Mate-Sanchez-Val and Harris 2014).

4.2.2 Amenity variables

The definition of amenities can vary across studies. Natural amenities were the main focus—e.g., Ullman (1954) defined amenities to include mountains, warm climate, lakes, and oceans. For example, U.S. Department of Agriculture ranks US counties for natural amenities defined as temperature in January and July, hours of sunshine, humidity, topography, and proximity to large bodies of water, in which the index is widely used in the literature (e.g., Betz and Partridge 2013; Betz et al. 2016).

Consumer amenities, also known as manmade amenities or urban amenities, have long interested researchers and policymakers (Rosen 1974; Roback 1982; Gyourko and Tracy 1991; Dalenberg and Partridge 1997; Glaeser and Shapiro 2002; Bayer et al. 2009). For example, Brueckner et al. (1999) refer to consumer amenities to historical sites (e.g., monuments, buildings, parks), and services such as restaurants, theaters, swimming pools, and tennis courts. Glaeser et al. (2001) define consumer amenities to include a rich variety of factors such as restaurants, theaters, an attractive mix of social partners, aesthetics, physical setting, good public services, low crime, and short commutes.

Perception toward what constitutes an amenity varies case by case (He et al. 2018a, b), making each “amenity” a contextualized term that requires a localized and situational understanding. Since the analysis is rooted in Chinese context, the selection of amenity variables is based on preferences of Chinese households during the period. By consulting the literature and taking the institutional environment(as well as the media) into account, this paper selects climate and air quality as natural amenities and includes public transportation, healthcare resources, educational resources, and tourism to indicate consumer amenities.Footnote 4 Specifically, we choose the following natural amenity measures: average temperature, precipitation, humidity, sunshine hours, air quality, and green areas. Because climate data are only available for the provincial capital, we assign those values for non-capital cities in the province.Footnote 5 We also follow Zheng et al. (2010) and construct a composite index of temperature comfort as defined by the following equation:

$$ {\text{temindex}}_{\text{i}} { = }\frac{1}{{\sqrt {\left( {{\text{mintem}}_{i} - {\text{average}}_{i} } \right)^{2} + \left( {{\text{maxtem}}_{i} - {\text{average}}_{i} } \right)^{2} } }} $$
(2)

where, for prefecture i, ‘\( {\text{average}}_{\text{i}} \)’ is the average annual temperature, while ‘\( {\text{mintem}}_{\text{i}}\)’ and ‘\({\text{maxtem}}_{\text{i}} \)’ refers to the average temperatures in January and July, respectively. In general, this index measures whether the city faces extreme temperatures over the year—a greater ‘\( {\text{temindex}}_{\text{i}}\)’ represents smaller temperature extremes.

Taking the institutional context of China into account, a list of factors is chosen to proxy for consumer amenities including public educational resources, healthcare services, transportation, infrastructure, and high-quality tourist attractions. For example, due to the restriction of “Hukou” system, public educational resources for primary and secondary schools and healthcare resources are primarily available for households with local “Hukou” status especially in large cities such as Beijing and Shanghai (Chan 2009; Bosker et al. 2012). Consequently, the availability of public educational resources becomes one of the most important factors that Chinese households consider in their residential location choice (Liu and Shen 2014; Su et al. 2019).

The key features of what we define as amenities are those that are most easily assessed by local residents. In China, a few features stand out as key amenities that affects skilled workers’ location choice including primary and secondary schools, healthcare services, etc. Since the 1980s, rural residents have moved to urban areas for better careers and higher quality of life, while the supply of public services in cities was strained to meet the increased demands of residents and new migrants. Thus, local governments use the Hukou registration system as a way to erect barriers between those who hold local Hukou and non-holders by allowing only the local Hukou holders to fully access public services.Footnote 6 Though Hukou status is no longer a constraint for finding a job in cities, it remains an important tool for rationing access to local public services such as schools, healthcare and social security benefits (Zheng and Kahn 2013b). The scarcity of public service resources and the existence of Hukou system exactly reflect that the amenities related to Hukou system are valuable for workers, especially amenities with regard to good schools and quality healthcare services. By contrast, environmental amenities such as green areas, air quality, and consumer amenities such as public safety, universities, sports, tourism-and-entertainment facilities and services are not exclusive to non-Hukou holders.

Specifically, we adopt per-capita public primary and middle schools to proxy for amenities related to educational resources (ppschool, pmschool). We also count the number of 4A and 5A Grade Tourism and Vacation Zones for each city to reflect the attractiveness of natural, cultural and historical sites. In addition, we will include per-capita school and healthcare resources—i.e., the number of primary and middle schools (mschool, pschool) and healthcare institutions (bed).

To reveal the overall amenities for each prefecture city, we construct two composite indexes for natural amenities and consumer amenities by means of principal component analysis (PCA). Using PCA allow us to expurgate the redundancies while maintaining the key features of the amenity indicators (e.g., Fagerberg 1988; Crescenzi et al. 2007, 2012; Crescenzi and Rodríguez Pose 2013; Rodríguez-Pose and Villarreal Peralta 2015; Rodríguez-Pose and Zhang 2019). The elements used to construct our natural amenity index are the natural amenity variables described above. Similarly, the consumer amenity index is created by combining the corresponding consumer amenity indicators. The final indexes we construct follow expectations. For example, comfortable temperatures and the climate have positive weights, while a negative weight is assigned to air pollution. We therefore adopt the constructed index as a proxy for the overall amenities in each city. The test results for the first principal component analysis are reported in Table 5 in “Appendix.”

4.2.3 Control variables

The control variables include measures for R&D expenditures, human capital, agglomeration economies, population density, manufacturing activity, and economic wealth. Including these factors help control for structural and economic development differences across geographical location. The variable definitions and data sources are given in Table 1. Table 6 in “Appendix” provides descriptive statistics. The data are drawn for the 281 prefecture citiesFootnote 7 and span from 2007 through 2014 for a total of 2248 observations.

Table 1 Definition and source of variables

4.3 Estimation strategy

We first estimate model (1) by ordinary least square (OLS) assuming that all of the amenity variables are exogenous, given that natural aspects are predetermined for each city.Footnote 8 Later, we use an instrumental variable (IV) estimation strategy to re-estimate model (1) to account for any potential endogeneity of the consumer amenity variables. For example, consumer amenities that cater to the affluent may promote innovation or innovation growth and can stimulate the supply of consumer amenities for high-income individuals (Kahn and Zheng 2016, p. 84). To mitigate these endogeneity concerns, we assume innovation is an outcome of returns of past investment in human capital and other innovation inputs. Thus, all the explanatory variables are lagged one period.Footnote 9 Because omitted variables may also bias the results, we account for city fixed effects and year fixed effects to control for time-invariant city effects and national common trends. (Graves and Mueser 1993; Brueckner et al. 1999)

In addition to the fixed-effects specification, we also apply instrumental variables (IVs) to address possible endogeneity caused by omitted variables. Specifically, we follow an approach similar to Autor et al. (2013) and more specifically Partridge et al. (2017) by using a matching process to obtain suitable instruments for consumer amenities. The basic idea of this approach is to construct two “counterfactuals”Footnote 10 for each city. Specifically, the first step is to find the two best city matches for each city based on their initial-period 2007 per-capita GDP (GDPpc), average years of schooling (avedu), agglomeration (popu), and density (popudensi) using the Mahalanobis distance approach for matching.Footnote 11 To reduce the bias from spillover shocks that would invalidate the counterfactual, we set the minimum Euclidean distance between the matches of at least 200 km. Coastal city matches are restricted to other coastal cities and matched cities for inland cities are only chosen from other inland cities. In the second step, we take the consumer amenity variable values from the matched cities to be the IVs (Matchiv1, Matchiv2) for the city in question.

With “matched-city” instrumental variables, we then employ two-stage least squares (2SLS) to estimate model (1). The matching IV approach is used to tackle endogeneity problems caused by missing variables. Partridge et al. (2017) adopted this approach to construct instrumental variables when no proper instrument is available. Theoretically, valid instruments should satisfy two basic conditions: (1) relevance, i.e., the instruments should be highly correlated with the endogenous explanatory variable and (2) exogeneity—i.e., the instruments should be uncorrelated to the error term. In our case, the amenity values of the matched cities can be ideal instruments for amenities in the underlying city because the matched cities should have similar amenities, which fulfill the first condition. Secondly, the matched cities are at least 200 km far away, which means the second condition would be met because amenities for the matched cities are less likely to be correlated with shocks for the underlying city.

This hybrid of matching and IV approach has also been employed by some previous studies. For instance, Autor et al. (2013) use the contemporaneous composition and growth of Chinese imports in eight other developed countries to instrument for growth of Chinese imports to the USA. Likewise, Partridge et al. (2017) and Malgouyres (2017) use an identical approach to ours in deriving international trade instruments for local regions. To assess the robustness of our results, a series of sensitivity analyses will be conducted including the use of different estimation methods and matching instruments.

5 Main results analysis

Table 2 reports the OLS results by regressing model (1). Correspondingly, column (1) presents the results with only amenity variables, while columns (2)–(7) report the results where different control variables are added. Finally, columns (8)–(10) display the results where the individual amenity variables are replaced with the index amenity variables..

Table 2 Amenities and innovation, OLS results

Table 2 shows that sunshine is significantly positively associated with innovation outcomes, while air pollutant is negatively associated with regional innovation. Also, for the natural amenity variables, only the sunshine and air quality variables are significant. There is no significant evidence that the other climate amenities such as precipitation and humidity affect innovation. There is also no evidence that green amenities at the prefecture level have a statistically significant influence on innovation, which somewhat contrasts with Zheng et al.’s (2009) results. This implies that places with more sunshine have advantages for generating innovation. As such, it is reasonable to increasingly expect Chinese migration patterns to become more similar to that of the latter 1900s in the USA, when migration was strongly affected by climate (Graves 1980; Rappaport 2007).

Regarding each individual consumer amenity, only those related to healthcare services, educational resources, and public transportation have a significant effect. However, if there is collinearity among the amenity indicators, we should not exclude the possibility that the statistically insignificant amenity variables shown in columns (1)–(7) could still be significant determinants of regional innovation.Footnote 12 Columns (8)–(10) report the results using principal components’ natural amenity index and consumer amenity index that use all of the amenity variables in their construction. These results show a strong statistically positive relationship between the indexes and regional innovation.

When the control variables are added, the significance level for the main amenity variables (e.g., sunshine, air quality, healthcare resources, primary education resources, and public transportation) remains unchanged, though a few lose some of their statistical significance from 1 to 10% (e.g., primary education resources per capita) and two amenity variables totally lose significance in the last two columns (e.g., healthcare resources per capita, middle school resources per capita). In contrast, when single amenity variables are introduced one at a time along with the two amenity indexes (natural amenity index and consumer amenity index), the coefficients for the amenity indexes are positive and significant at the 1% level. Overall, the evidence suggests a strong positive correlation between amenities and innovation.

Apart from the role of amenities, several economic factors such as R&D, human capital, population density, and economic wealth are also key drivers of Chinese innovation. Notably, while Crescenzi et al. (2012) found that R&D and agglomeration economies are the main factors explaining provincial innovation growth, we highlight two other important engines for innovation growth based on the city-level data: human capital and economic wealth (GDP per capita). In addition, market size, proxied by population, is negatively associated with innovation, suggesting that population density is a better fit than simply using total city population. In general, amenities along with other drivers of innovation such as R&D inputs, human capital, and agglomeration economies are all impetus of regional innovation, and likely all are necessary components for innovation. All these factors work together to shape the geography of innovation in China.

Table 3 reports the 2SLS results in which consumer amenities are treated as endogenous using the matching IV strategy. Generally, we use nine indicators for consumer amenities, but the Durbin–Wu–Hausman test shows that only four are potentially endogenous (primary and middle schools per capita, number of beds, and number of primary schools). Thus, we only report IV results for the regressions that appear to be endogenous. Specifically, columns (1) through (4) list the results where one of the four consumer amenity variables is individually added to the model.Footnote 13 At the very bottom of Table 3, we report two diagnostic tests for the IV regressions including the first-stage F-statistic for the strength of the instruments and the overidentification test. Both tests indicate that the instruments are strong and pass the over identification tests, supporting their validity.

Table 3 Matching IV results, consumer amenities

Local educational resources in primary and middle schools and healthcare resources remain salient factors in determining patenting innovation. This implies that patenting innovation is positively linked to a city’s educational resources for primary and secondary school students. We believe one of the main reasons for this pattern is the type of highly educated workers critical to innovation are also parents who value educational quality for their children. Thus, while good primary and middle school resources will eventually produce more human capital, it appears to immediately attract highly educated workers interested in their children. Moreover, a pleasant natural environment is again found to be positively linked with local innovation, since the natural amenity index has a positive and significant and positive sign in all regressions. Apart from that, the significances and signs for the control variables are consistent with the original findings, which again confirm that the basic engines of innovation are R&D, human capital, density, and a wealthy economy.

To assess the sensitivity of the IV results, we tried several different matching criteria to construct IV matches. Previously, four variables—GDP per-capita (GDPpc), average years of schooling (avedu), agglomeration (popu), and density (density)—from the initial year of analysis, 2007, were chosen to be covariates for identifying the matches for each prefecture city. We changed the matching criteria to using the same covariates but measured in an earlier period in 2003 to further reduce possible endogeneity (or reverse causality) concerns.Footnote 14 At the same time, all the other criteria for the matching process remain the same. The new results are reported in the first four columns in Table 8 in “Appendix.” Again, the primary and secondary school variables as well as healthcare services are all positively linked with innovation performance. Also, the results of corresponding first-stage F-statistics and the over-identification tests suggest the instruments are both strong and pass the exclusion criteria. Therefore, the impact of educational resources is robust to different matching instruments.

In further sensitivity analysis, we replace the matching covariates with GDP per-capita, average years of schooling (avedu), agglomeration (popu), and the population share between 25 and 49 years old (share2549).Footnote 15 Using the instruments generated by the new matching criteria, we repeat the same 2SLS (see the results in the last four columns in Table 8 in “Appendix”). These results once more confirm the significant role of consumer amenities related to the educational resources and healthcare resources in affecting innovation. Overall, the sensitivity tests show that consumer amenities play a robust role in affecting innovation regardless of the estimation strategy.

In summary, the empirical results ascertain that local natural amenities and consumer amenities play a significant role in shaping the geography of regional innovation. Specifically, natural amenities including temperature, sunshine, air quality, and resources for primary and middle school students have a statistically significant impact on local innovation.

6 Mechanism analysis

In Sect. 3, we assume that amenities are associated with local innovation systems through two key mechanisms (1) by attracting more skilled workers and (2) by improving the productivity of local workers. To test the first mechanism, we now regress the stock of human capital on amenities and the other controls and report the results in Table 4. Amenities are significantly positively associated with human capital, regardless whether amenities are measured by a single indicator or the comprehensive consumer amenity index or natural amenity index. This pattern supports the first proposed mechanism.

Table 4 Mechanism analysis; OLS results

Regarding the productivity mechanism, due to the heterogeneity of personal productivity and data limitations, we instead indirectly consider this mechanism to appraise whether the productivity mechanism works in explaining the positive impact of amenities on innovation. Simply put, when human capital and amenities are both included in the innovation model, the human capital variable controls for that effect and if the amenity variable coefficients are significant, then this amenity effect can be attributed to some other mechanism besides human capital. Given that the focus of the literature has been on the productivity mechanism, we believe the most likely mechanism is through productivity.Footnote 16 Unfortunately, we lack micro-level data to further assess the mechanisms, though our results are suggestive.

7 Amenities, city characteristics, and innovation

The initial hedonic literature assumed workers are homogeneous in tastes and skill (Roback 1982). However, valuations for amenities can differ across income and skill levels, as posited by Roback (1988). One reason is the hierarchy of needs theory (Maslow 1943, 1954), which posits that when people can satisfy their basic needs such as food and shelter, they will then strive to meet higher-level needs such as aesthetic needs and higher quality of life. For example, scale-related urban amenities, which is defined as clusters of retail stores and cultural amenities made possible by the urban concentration of cinemas, theaters, museums, etc. (Dalmazzo and de Blasio 2011), are found to be especially consumed by wealthier households and more educated workers (Glaeser et al. 2001; Dalmazzo and de Blasio 2011; Dorfman et al.,2011; Stephens and Partridge 2015). These amenity valuations may also differ across cities because of different income and skill levels, which may lead to some interesting interaction effects. Obviously, this question extends from exploring individual differences to city differences. To provide a framework for answering this question, we assess the following four hypotheses:

  1. (1)

    Income heterogeneity effects—greater impact for more developed economies

People tend to have an increasing willingness to pay for amenities as their income increases. If wealthy people value amenities more (Adamson et al. 2004; Chen and Rosenthal 2008), the aggregated influence of amenities should rise with city income or wealth. For example, using city-level data, Zheng (2016) finds that there is significant income heterogeneity in the estimated marginal willingness to pay for city amenities, consistent with the hypothesis that amenities are expected to be more important as economies develop.

  1. (2)

    Skill heterogeneity effects—larger impacts in cities with more human capital

The impact of amenities may also vary by human capital level. If amenities are more attractive for more educated individuals, cities with higher shares of educated workers should be more influenced by amenities. Indeed, Dorfman et al. (2011) found amenities have a significant effect on high-technology job growth, and the share of university graduates located in high-amenity US coastal areas has also significantly increased since 2000 (Stephens and Partridge 2015). Then, it is reasonable to presume that cities with more human capital are influenced more by amenities.

  1. (3)

    Size heterogeneity effects —greater impact for larger cities

A city’s population in spatial equilibrium may have been greatly shaped by amenities (Tiebout 1956; Rappaport 2007), which means that the city’s population could capture the majority of the variation in cost of living for which amenities have been capitalized. For example, Partridge et al. (2008) argue that higher-ordered consumer services increasingly exist in successively larger cities. Thus, cities with a larger population would also have been affected more by amenity levels.

  1. (4)

    Density heterogeneity effects—greater impact for denser economies

Density itself could be a kind of amenity. For example, Florida (2002b) claims that creative-class workers are attracted to concentrations of creative people. Besides, knowledge for innovation is transferred more easily in denser places (Betz et al. 2016). Consequently, we expect that the impact of amenities is greater in denser economies.

To examine the above hypotheses, we construct interaction terms between the amenity variables and one of the city characteristic variables. The resulting models are specified as follows:

$$ {\text{no}}_{i,t} = \gamma + \beta_{1} {\text{amenity}}_{i,t - 1} + \beta_{2} X_{i,t - 1} +\varvec{\kappa}_{1} {\text{GDPpc}}*{\text{amenity + }}\mu_{i} + \nu_{t} + \varepsilon_{it} . $$
(3)
$$ {\text{inno}}_{i,t} = \gamma + \beta_{1} {\text{amenity}}_{i,t - 1} + \beta_{2} X_{i,t - 1} +\varvec{\kappa}_{2} {\text{humancap}}*{\text{amenity + }}\mu_{i} + \nu_{t} + \varepsilon_{it} . $$
(4)
$$ {\text{inno}}_{i,t} = \gamma + \beta_{1} {\text{amenity}}_{i,t - 1} + \beta_{2} X_{i,t - 1} +\varvec{\kappa}_{3} {\text{popu}}*{\text{amenity + }}\mu_{i} + \nu_{t} + \varepsilon_{it} . $$
(5)
$$ {\text{inno}}_{i,t} = \gamma + \beta_{1} {\text{amenity}}_{i,t - 1} + \beta_{2} X_{i,t - 1} +\varvec{\kappa}_{4} {\text{density}}*{\text{amenity + }}\mu_{i} + \nu_{t} + \varepsilon_{it} . $$
(6)

where \( {\text{GDPpc}}*{\text{amenity}} \),\( {\text{humancap}}*{\text{amenity}} \), \( {\text{popu}}*{\text{amenity}} \), and\( {\text{density}}*{\text{amenity}} \) are the interaction variables described above. Correspondingly, by examining the interaction coefficients, i.e., \( \varvec{\kappa}_{1} \)., \( \varvec{\kappa}_{2} \)., \( \varvec{\kappa}_{3} \), \( \varvec{\kappa}_{4} , \) we appraise whether the marginal influence of amenities varies with specific city characteristics. To simplify the interpretation, we use the amenity index variables described above rather than the individual amenity variables (natural amenity index and consumer amenity index) in these models. All of the interaction coefficients in models (3), (4), and (6) are significant with the exceptions of the interaction term between consumer amenity index and human capital in model (4) and the interaction term in model (5). Hence, these results further support hypotheses (1), (2), and (4)—i.e., the positive impact of amenities on regional innovation is greater in larger, more developed cities and in cities with a critical mass of highly educated workers.

To interpret the interaction effects better, Fig. 2 shows both the baseline and the marginal effects. The positive interaction term between amenity index and GDP per capita indicates that the marginal impact of amenities on innovation increases with GDP per capita (see figures a.1 and a.2). The same applies for population density in that both natural amenities and consumer amenities have a larger marginal influence on innovation as density increases (see figures c.1 and c.2). Likewise, the marginal influence of amenities is positively linked to increases in the city human capital (see figure b).

Fig. 2
figure 2figure 2

Interpreting the interaction terms

Therefore, our findings support the hypotheses that the impact of amenities is place-based but that their influence is also heterogeneous depending on the city’s level of development.

8 Conclusion and policy implications

This paper focused on the role of amenities in affecting innovation, which has been largely under-explored in the literature. The empirical results suggest that particular amenities such as air quality, sunshine, public transportation, and educational resources can contribute to city-level differences in Chinese innovation. The findings are robust to different estimation methods and instrumental variable strategies. Therefore, even though China is a developing country, our findings suggest that quality-of-life factors can play a key role, just as in developed countries. Moreover, our results suggest that amenities are associated with local innovation systems through two key mechanisms—i.e., by attracting more skilled workers and by improving the productivity of local workers. As income grows and workers place more value on quality of life, we predict a further strengthening of the link between amenities and Chinese regional innovation over time. Thus, providing quality amenities appears to be a good policy to promote place-specific innovation (Boschma and Fritsch 2009; Florida 2014; Lehmann and Seitz 2017).

Generally, this paper finds that amenities along with other drivers of innovation such as R&D inputs, stock of human capital, and agglomeration economies are all the impetus of regional innovation, and their effects interact with one another. In particular, while agglomeration economies and amenities are not direct inputs in knowledge creation, they appear to stimulate the input–output efficiency in its creation. All these factors work together to shape the geography of patenting innovation in China. Meanwhile, among all considered amenity factors, only amenities related to clean air, sunshine, public transportation, public educational and healthcare services are found to statistically affect city-level differences in Chinese innovation from 2007 to 2014. The impact of amenities also varies with city characteristics such as the economic wealth, density, and human capital, in which these effects also vary by amenity type. Specifically, the positive impact of amenities on regional innovation is greater in larger, wealthier cities and in cities with a critical mass of highly educated workers.

As China aims to achieve a twenty-first-century innovation-oriented economy, understanding the drivers of local innovation is a key policy priority. This study reveals a significant impetus of regional innovation in China is amenities. Just as Glaeser et al. (2001) suggests the benefits of consumer cities in the USA, our findings suggest the benefits of amenities in terms of regional innovation, which are typically not considered in innovation policies.

Second, differentiated amenity-related strategies or city-based policies are recommended for innovation development. For instance, for relatively developed cities, the role of amenities should receive more emphasis, as the influence of amenities on innovation is larger in these areas. Also, reducing city disamenity factors such as air pollution appears to also promote local innovation. In contrast, the policies for innovation development for poorer, less developed cities should focus on other fundamental drivers of innovation—e.g., R&D and human capital investment. However, even lagging cities that are well endowed with natural amenities can better utilize their assets to attract innovative workers.

Finally, this paper has some limitations that merit further research. First, more proxy measures of innovation should be considered to assess whether the results are robust. For example, using patents to proxy for innovation does not consider innovations that are not patented, and it fails to reflect the extent to which innovations are successfully commercialized (Crescenzi et al. 2012). Second, due to data availability, this paper did not directly test the underlying mechanisms by examining how individual migration decisions are affected by amenities. Instead, the results confirm a positive association between skills and amenities that is consistent with that proposition, though we did incorporate instrumental variable approaches that increase our confidence. Yet, these limitations provide potential directions for future research once more micro-level data become available.