1 Introduction

Most growth economists probably agree on the notion that human capital is one of the decisive factors in explaining why some countries became rich and others remained poor in the past two centuries (Goldin and Katz 1998; Lucas 2002; Glaeser et al. 2004; Hanushek and Woessmann 2008; Jones and Romer 2010; Galor 2011; Acemoglu et al. 2014; Gennaioli et al. 2013). There is still dissent with regard to the channels through which human capital fosters economic growth, though. On the one hand, human capital is interpreted as an independent factor of production which increases productivity for a given level of technology (Lucas 1988; Mankiw et al. 1992). On the other hand, human capital is seen as an input in the innovation process and therefore as a complement to technology (Benhabib and Spiegel 1994; Romer 1990; Nelson and Phelps 1966). According to this view, higher levels of human capital lead to the generation or diffusion of new technologies or to a more efficient adoption of a given technology, thereby shifting the frontier of the production-possibility set outwards.

In this paper we empirically address the role of human capital by analyzing the economic development in Prussian counties in the late nineteenth century. This is the period in which Prussia as a national economy caught up successfully to the more advanced British economy and even took over the technological leadership in some high-tech sectors such as chemicals or electrical engineering (Pierenkemper and Tilly 2004; Broadberry and Burhop 2010). Since the academic debate about the relative importance of different types of human capital during industrialization is not yet settled, we consider the influence of three different measures, namely literacy rates, craftsmanship, and secondary school enrollment rates. Valuable patents are used as a measure for innovation. We first analyze which type of human capital was mostly related to innovation in the Second Industrial Revolution. We find that “useful knowledge” embodied in skilled craftsmen was still an important determinant for innovation activities of private inventors in more traditional industries such as steam boilers or mechanical metal processing. In contrast to the First Industrial Revolution, however, the average stock of basic knowledge measured by the literacy rate was positively related to firms’ innovation in high-tech industries such as chemicals or electrical engineering.

The surprising result that the stock of literate people was associated with high-tech innovations can be explained by three different lines of argument. First, high literacy rates might indicate an above-average general investment in children’s human capital that comprised also higher education, nutrition, and preventive health care (Baten et al. 2014; Kelly et al. 2015; Prados de la Escosura 2015; Mühlhoff 2016). Second, a high literacy rate might have facilitated transregional knowledge spillover and therefore a county’s outstanding innovativeness because reading was the only way to learn about ideas occurred outside the own locality. Third, high-tech innovations and formal basic education might have been complements in highly innovative industries where white-collar occupations increased exponentially during the Second Industrial Revolution (Goldin and Katz 1998).

After having established the relationship between different types of human capital and innovation, we explore the impact of the former on income. Because data on GDP per capita are not available for Prussian counties we use information on per capita income-tax revenues as a second-best measure.Footnote 1 In the late nineteenth century, a Prussian household’s income tax was based on its taxable income from capital, immovable property, trade and commerce, and dependent employment. We therefore argue that Prussian income tax revenues reflect the productivity of the economic activity of the county. Our empirical analysis shows that literacy and patenting activities are independently correlated with income. This evidence lends support to the notion that basic education had an impact on Prussian growth through both the direct production channel and the indirect innovation channel.

The fact that our empirical analysis is based on county-level data within a single country should alleviate the problem of disentangling the effect of human capital from that of institutions (Acemoglu et al. 2001; Glaeser et al. 2004; Acemoglu et al. 2014). Yet we know that, for instance, the process of peasants’ emancipation which started in Prussia in 1807 has been very heterogeneous, which means that land-tenure and labor market relations might still have been different across counties in the late nineteenth century (Cinnirella and Hornung 2016; Cinnirella and Schueler 2016). What is more, Acemoglu et al. (2011) have shown that Napoleonic institutional reforms significantly influenced the economic development in some German states that later became part of Prussia. More recent studies have focused on geography, natural resources, and access to markets to explain the location and the rise of manufacturing in the nineteenth century (Crafts and Wolf 2014; Fernihough and O’Rourke 2014; Gutberlet 2014). That is why we will account for differences in institutions, geography, and market access to ensure that the estimated effect of human capital on income is not mediated by these factors.

Finally, we study another channel through which human capital might have affected Prussian economic growth. An important characteristic of the transition to modern economic growth is the rapid decline in fertility rates. Unified Growth Theory postulates that parents’ increased emphasis on children’s education, triggered by higher levels of technological progress, determined lower fertility outcomes (Galor 2011, 2012). Recent studies have provided empirical evidence of such a mechanism (Becker et al. 2010, 2012; Murphy 2015). We extend this literature by showing that the negative impact of literacy on fertility is larger in more technologically advanced counties, where arguably the returns to education were higher.

The rest of the paper is organized as follows. Section 2 discusses the role of human capital in economic growth on the basis of the theoretical and empirical literature. Section 3 introduces the data and discusses our measures for innovation, human capital, and income. Section 4 analyzes the relationship between different types of human capital and innovation. Section 5 tests whether literacy had a persistent impact on Prussian counties’ income tax revenues net of the effects of master craftsmen and formal secondary education. Section 6 analyzes to what extent literacy and innovation affect income independently of each other. Section 7 establishes that the negative impact of literacy on fertility increased with technological progress. Section 8 concludes.

2 Related literature

In their influential paper, Mankiw et al. (1992) assume that a society’s aggregated stock of human capital can serve as a substitute for technology. That is why they augment Solow’s classical growth model (Solow 1956) with the factor human capital which is assumed to increase the workforce’s productivity. Analyzing cross-country variation of GDP per capita in 1985 for 1998 non-oil producing countries Mankiw et al. (1992) show that the inclusion of the investment in human capital, measured as the fraction of the population aged 15–19 that is enrolled in secondary schools, increases the performance of their regression model considerably. As a result, countries with similar technologies, rates of savings, and population growth might still not converge because of enduring differences in national education policies.

Benhabib and Spiegel (1994) raise serious doubts on whether human capital should enter the macroeconomic production function as a separate factor. Following the seminal approach of Nelson and Phelps (1966) they assume instead that human capital is primarily needed to increase the level of technology by adopting foreign superior technology or by creating own innovations. In other words, human capital is not a substitute for but a necessary component to innovation. In their empirical approach, Benhabib and Spiegel assume that a country’s level of technology is determined by both imitating and innovative activities. A country’s capability to innovate is estimated by its human capital stock measured by average years of schooling in the labor force. Its potential to imitate is approximated by the existing gap between the technological level of the productivity leader and its own inferior technological level. It is important to note that they find a significant impact only of levels of human capital on growth of total factor productivity. In contrast, the growth of human capital does not affect economic performance. Our analysis will concentrate on levels as data on literacy and the proxy for income are only available for a single cross-section.

Vandenbussche et al. (2006) stress the importance of the composition of human capital between basic knowledge and higher education, where the former is relevant for imitation and the latter is important for innovation.Footnote 2 As a result, backward countries which want to catch up by imitating (foreign) technology should invest primarily in primary schooling, whereas countries at the technological frontier should concentrate on increasing the quality and quantity of higher education.Footnote 3 In our analysis we will also distinguish between the basic level of human capital measured by the literacy rate and higher education measured by enrollment rates in secondary education.

In historical perspective, the role of formal education during the First Industrial Revolution is generally considered as minor, especially in the case of the British industrial revolution (Mitch 1993, 1999; Clark 2005) for which relative prices of production inputs (Allen 2003) or cultural factors (Clark 2009; Mokyr 2009) have been regarded as the main determinants. O’Rourke and Williamson (1995) argue that the contribution of education to Sweden’s catch-up to Britain and the United States was also modest. Similarly, Taylor (1999) finds only second order effects of school enrollment rates on the growth performance of seven countries in the period 1870–1914.

Recently, some scholars have started to stress the importance of a few highly skilled craftsmen and entrepreneurs as engines of the First Industrial Revolution, while downplaying the importance of average levels of literacy (Galor and Moav 2002; Mokyr 2005; Mokyr and Voth 2009). Explaining why the Industrial Revolution started in Britain, Meisenzahl and Mokyr (2011) argue that the main advantage was “the supply of highly skilled, mechanically able craftsmen who were able to adapt, implement, improve, and tweak new technologies and who provided the micro inventions necessary to make macro inventions highly productive and remunerative”. If this was true, the institution of apprenticeship, and not the school system, provided the necessary skills for technological progress (Van der Beek 2014; Van der Beek and Feldman 2016).

Squicciarini and Voigtländer (2015) support this “upper-tail knowledge” hypothesis. They show for pre-industrial France that the local presence of a rather tiny “knowledge elite”, proxied by the density of subscriptions to Diderot’s and d’Alembert’s Encyclopédie, had a significant positive effect on city growth between 1750 and 1850, whereas the effect of the average worker’s skills, measured by the literacy rate, was negligible. Their main argument is that the knowledge elite fostered economic growth by transforming “useful knowledge” into profitable innovations. In contrast, better educated workers only raised the productivity of any given technology, but they did not help to adopt new technology.

In a recent paper, Kelly et al. (2015) point out that, if one looks at county level data, literacy rates still had a positive impact on the English industrialization. Showing that high literacy rates were positively associated with firms’ patenting activities in innovative industries such as chemicals and electrical engineering, our analysis of nineteenth-century Prussian counties supports this revisionist view. This latter result is puzzling, however, because it is not in line with the theoretical assumption that successful innovation activities in science-based industries first and foremost require a sufficient amount of higher educated people (Vandenbussche et al. 2006) but are less affected by the quantity of basic education (here measured by literacy rates).

We offer three alternative explanations to resolve this apparent contradiction between theory and history. First, it is conceivable that high literacy rates capture also an above-average investment in children’s human capital which comprised not only primary schooling but also higher education, nutrition, and preventive health care such as smallpox vaccination (Baten et al. 2014; Kelly et al. 2015; Prados de la Escosura 2015; Mühlhoff 2016). If this was true, high literacy rates would signal that a county’s (younger) workforce had comparatively high cognitive skills. Second, it was after all literacy that provided access to a broader knowledge base needed to develop innovations. In a world without modern mass communication techniques the only way to learn more about the ideas, theories and experiences of people outside the own local network was to read books, personal letters, newspapers, patents or encyclopedias. That is why a high literacy rate might have facilitated transregional knowledge spillover and therefore a county’s above-average innovativeness. Baten and van Zanden (2008), for example, have shown that book production, which is also driven by the demand of more and more literate people, was related to growth in real wages in the period 1450–1849. Third, high-tech innovations and basic formal education might have been complements during the Second Industrial Revolution. It is often argued that industrialization replaced skilled craftsmen with machinery and unskilled laborers. Goldin and Katz (1998), however, point out that skilled workers were still needed to install and maintain machinery and to supervise and direct the unskilled workforce. It was especially in innovative industries where more and more white-collar workers were hired to man the newly founded divisions for sales, accounting, and R&D. At the highly innovative chemical firm BASF, for example, the number of chemists, clerks, and technicians increased tenfold between 1876 and 1900 (von Hippel 2002, p. 106). As a result, innovative firms might have settled in counties with high education standards or have lobbied for an improvement of primary education. Both strategies would explain the positive correlation between literacy and patenting.

To identify the exact historical linkages between literacy and patenting activities in Prussian counties is beyond the scope of this study. Based on the data at hand, however, we provide ample evidence that these linkages must have existed. What is more, we explicitly take into account that different types of human capital (literacy, craftsmanship, and secondary schooling) might have had different effects on both innovation and income.

In a long-run perspective, Unified Growth Theory assumes that in the Malthusian period technological progress was mainly affected by the scale of the population, via a larger supply of innovative ideas, a higher demand for innovation, and accelerated technological diffusion. In later stages of development, when the technological progress had become more complex, human capital took a crucial role in advancing the technological frontier (Galor 2011, pp. 147–148). The accumulation of human capital is also seen as a main driver of the demographic transition, which is a necessary precondition for modern economic growth. By combining the neoclassical Solow model with elements of the demographic transition, Dalgaard and Strulik (2013) explain about two thirds of the cross-country variation in GDP per worker in 2000. Their study is of particular interest for our research as they also distinguish between a direct impact of human capital (measured by enrollment rates and school years) on growth and an indirect effect via technological innovation. Controlling for innovation with the help of the proxy R&D expenditures, they still find a sizable direct impact of human capital on growth (Dalgaard and Strulik 2013, Table 5).

3 Data

3.1 Valuable patents

Pure patent counts allocate the same weight to every patent, no matter whether it has a low or a high economic value for the patentee or the society. Using the number of patents as an indicator for innovative activity can therefore lead to a large measurement error.Footnote 4 One way to distinguish patents with a high value from those with a low value is to use a patent’s lifespan as an indicator for its profitability (Schankerman and Pakes 1986; for other methods to identify valuable patents, see Streb 2016). The caveat is that this method works only for patent systems with periodical renewal fees. The German patent law introduced in 1877 met this precondition (Seckelmann 2006, pp. 170–179). It allowed both private persons and firms to apply for patent protection that could last up to 15 years. To keep a patent granted in force the patentee had to pay an annual renewal fee which grew stepwise from 50 Marks in the first and second year to 700 Marks in the fifteenth year. Facing the rising expenditures for holding a particular patent, a patentee was supposed to decide to renew his patent only when the cost of doing this was lower than the expected future returns of the patent. The assumption that this mechanism worked as intended is supported by the finding that, in the late nineteenth century, 70% of all German patents granted had already been given up after just 5 years. Only about 5% of all patents reached the maximum age of 15 years (Streb et al. 2006, p. 352). We therefore assume in the following that only those long-lived Prussian patents that survived at least 10 years can be interpreted as valuable innovations.Footnote 5

Fig. 1
figure 1

The stock of valuable patents, 1877–1890. Note Prussian county borders as in 1849

By merging individual data on valuable patents granted in Prussia from 1877 to 1890 with county-level information our project makes use of two different, recently assembled historical data bases.Footnote 6 The prime source for the patent data is the annual Verzeichnis der im Vorjahre erteilten Patente published by the German patent office since 1877, which lists all patents granted in the preceding year. From these periodical listings, in 2005, Jörg Baten and Jochen Streb selected all patents that were held for at least 10 years and were therefore interpreted as valuable. In this selection, we identified 1740 valuable Prussian patents granted between 1877 and 1890, which will be used as a measure for the level of successful innovation. The information attached to each of these patents comprises the year in which the patent was granted, the technological class of the invention, as well as name and location of the patent holder. The name of the patent holder allows us to distinguish, admittedly with some margins of error, whether a patent was held by an independent inventor (or a small private company) or by a firm (private partnership or incorporated company), because in the latter case, information about the legal form of the companyFootnote 7 (e.g. GmbH, KG or AG) was added to the name of the patent holder.Footnote 8 The information about the location of the patent holder is used to assign the origin of a particular patent to a geographical unit. In the original data base, the patents were allocated to administrative districts (Regierungsbezirk), the middle administrative level of the German states (Streb et al. 2006, p. 365). To get a more disaggregated picture of the geographical distribution of valuable patents we re-assigned them to the rural and urban counties (“Land- und Stadtkreise”), the lower administrative level of the Prussian state. Figure 1 shows the geographic distribution of valuable patents granted in Prussia between 1877 and 1890 in the borders of 1849.Footnote 9

Fig. 2
figure 2

Literacy rates in 1871. Note Quintiles. Prussian county borders as in 1849.

Figure 1 reveals that valuable patents were by no means equally distributed across Prussia. Innovative centers were the Ruhr area, the counties neighboring the rivers Rhine and Main, the Province of Saxony, Greater Berlin and parts of Silesia, whereas large parts of Poznán, Pomerania, West- and East Prussia display very low patenting activity. Firms’ innovative activities were even more geographically concentrated.

3.2 Human capital

As explained above, the first part of our empirical analysis aims at studying which type of human capital is mostly related to innovation.Footnote 10 To capture the different skill categories we use the literacy rate, the enrollment rate in secondary education, and the share of master craftsmen in our analysis. The literacy rate refers to the ability to read and write of people older than 10 in 1871 as reported in the population census.Footnote 11 Following Benhabib and Spiegel (2005) we expect that the levels of literacy have an impact on innovation and economic growth. In Fig. 2 we show the geographic distribution of the literacy rate. As discussed in Becker and Woessmann (2009), the literacy rate is high in the central counties of Prussia near Wittenberg, whereas it is lower in the peripheral counties, especially in the east.

Fig. 3
figure 3

Secondary school enrollment rates in 1864. Note Quintiles. Prussian county borders as in 1849.

Fig. 4
figure 4

Master craftsmen in 1849 (in percent). Note Quintiles. Prussian county borders as in 1849. Master craftsmen include the following trades: soap boiler, lorimer, rope maker, sailmaker, firehosemaker, tailor, hatter, well sinker, cabinetmaker, wheelwright, wainwright, cooper, turner, comb maker, brush maker, basket maker, slater, potter, glazier, blacksmith, locksmith, belt maker, coppersmith, bell founder, tin molder, plumber, wire drawer. We have also added self-employed producers of scientific instruments, musical instruments, clockmakers, goldsmith, and engravers

Since we have no information about the level of secondary education in Prussian counties, we follow the example of Mankiw et al. (1992) and use as a proxy secondary school enrollment rates constructed for the years 1864 and 1873.Footnote 12 Secondary education in Prussia was heterogeneous in terms of curriculum as it included a humanistic track, the Gymnasium, and a more business-related track, the Realschule. Given that the majority of students in Prussia enrolled in the Gymnasium, it is not clear a priori whether we should expect a positive or a negative relationship between enrollment in secondary education and innovative activity.Footnote 13 Figure 3 shows that secondary schooling, which was usually located in urban districts, was more evenly distributed across Prussia than our other measures of human capital.

In the spirit of Meisenzahl and Mokyr (2011) as well as Squicciarini and Voigtländer (2015) we also use information about the relative number of master craftsmen to capture the extent of “useful knowledge” that was locally available. The Prussian census of 1849 reports data on employment by trade and explicitly distinguishes between masters and apprentices. Using this information we can construct the share of master craftsmen in the population.Footnote 14 Figure 4 shows that master craftsmen are concentrated in the early industrial centers such as Silesia or the Ruhr region.

3.3 Income tax revenues

As already explained above, we will rely on information on income tax revenues when estimating the effect of human capital and innovation on income. The Prussian Statistical Office published data on income tax revenues and the number of income taxpayers averaged over the period 1899–1903 for all Prussian counties.Footnote 15 Since the tax reform of 1891, a Prussian household’s income tax had been based on its taxable income (gross income minus income-related expenses) resulting from capital, immovable property, trade and commerce, and dependent employment. All households with an annual taxable income below 900 Marks were exempted from income tax. As a result, in 1900, only about 30% of the Prussian population was required to pay income tax. Therefore, we estimate only income of the prosperous part of Prussian counties’ inhabitants who could arguably make especially good use of their human capital stock. The progression of income tax rates was very moderate: Tax rates ranged from 1.13% for taxable incomes between 900 and 3000 Marks up to 3.95% for taxable incomes beyond 100,000 Marks (Schremmer 1994, pp. 152–155). Consequently, the estimation bias resulting from disproportionately high tax revenues in counties with an above-average number of people with very high income is limited. Using log tax revenues will also decrease the influence of observations with extremely high values. It is also important to note that our tax variable refers only to state income tax revenues excluding local surcharges. This ensures that we are not measuring different propensities to tax across counties. Figure 5 shows the geographic distribution of income tax revenues per 10,000 people in German Marks. Figure 6 displays the cross-sectional distribution of log income tax revenues per capita across Prussian counties which resembles the typical log-normal distribution of income.

Fig. 5
figure 5

Income tax revenues per 10,000 people. Note Quintiles. Average income tax revenues for the period 1899–1903 in German Marks. Prussian county borders as in 1849

Fig. 6
figure 6

Source: see text

The distribution of log income-tax revenues per capita across Prussian counties in the period 1899–1903.

We also considered the problem that income tax revenues were only reported in nominal terms. We collected provincial data on prices of grain, meat, and butter for the period 1899–1903Footnote 16 and used them to construct a province-specific cost of living index based on the weights presented by Allen (2009, Table 2.3). The potential issue of tax evasion was tackled by correcting our information about tax revenues with estimates of tax evasion in Prussian districts (Regierungsbezirk) around 1900 by Meisel (1911, p. 313). Both modifications did not change our main results and are therefore only reported in the online Appendix (see Table A9).

3.4 Control variables

Due to the abundance of information provided by the 1871 (and 1882) Prussian census we can include a rich set of control variables in our regression analysis.Footnote 17 First of all, we will account for scale effects by controlling for log population size (Galor 2011). In addition, we include urbanization rate in the set of control variables.

Table 1 Descriptive statistics.

Religious precepts can influence investments in education. Becker and Woessmann (2009) have shown that counties that were dominated by Protestants invested more in primary education than Catholic counties. Botticini and Eckstein (2005, 2007) argue that religious norms induced Jewish communities to acquire higher levels of human capital. Indeed, data on the number of illiterate people within religious denominations reported in the 1871 census show that illiteracy rates were particularly low for Jews and Protestants: mean illiteracy rate among Jews was 5.6 and 6.8% among Protestants, whereas illiteracy rate among Catholic came to 11.3%.Footnote 18 In our regression analysis we therefore also control for the share of Protestant and Jews in the Prussian counties.Footnote 19

Cinnirella and Hornung (2016) show that large landowners impeded the accumulation of human capital in nineteenth-century Prussia because they were primarily interested in securing a stock of unschooled and cheap farm laborers, and thus opposed the education of the peasants’ children.Footnote 20 The occupational census in 1882 reports information on farm size. To account for the influence of large landowners we constructed the variable “share of large landholdings” as the area of landholdings larger than 100 hectares over the total area.

In the late nineteenth century many Prussians migrated to regions with high demand for labor, for example to the coal fields of the Ruhr area. To control for this inflow of people who might have also brought along new ideas and technologies, we make use of the variable “share of people born in the respective municipality”.

Descriptive statistics of the variables used in our analysis are shown in Table 1.Footnote 21 The average level of literacy rate (84%) is one of the highest in Europe for the period under consideration. However, as also shown in Fig. 2, there is a large variation in literacy as in 10% of the counties one out of three individuals was not able to read and write.

There is also a large variation in the number of valuable patents in our sample. The average number of valuable patents in a county for the period 1877–1890 is about 4, with 65% of the counties having zero valuable patents. The city-county of Berlin reached the outstanding number of 464 valuable patents in the period under consideration, which is due to the fact that very innovative firms such as Siemens and Agfa were located there (Degner 2009). Berlin-based inventors might also have profited from their proximity to the Imperial Patent Office which facilitated communication with the members of this highly-centralized administration. To take this effect into account, we also control for the distance to Berlin. However, since Berlin constitutes a clear outlier we exclude the county from the first part of the analysis where we study the determinants of innovative activity.Footnote 22

4 The relationship between different types of human capital and innovation

We begin our empirical analysis by examining the relationship between the different types of human capital and a county’s innovative activity. We distinguish between basic knowledge measured by the literacy rate, higher formal education in the form of secondary school enrollment rate, and useful practical knowledge measured by the share of master craftsmen.Footnote 23 These three variables for human capital are standardized with zero mean and unit standard deviation to facilitate comparison. Since the share of master craftsmen is computed from the occupational census in 1849 we aggregate the variables at the county borders of 1849 which reduces the number of observations to 335. Because of its very large number of patents (464), we drop the city-county of Berlin when analyzing patenting activities. Thus the final number of observations is 334.

The dependent variable is the absolute number of valuable patents granted in the period between 1877 and 1890, a count variable that takes on nonnegative integer values. Given the particular nature of the dependent variable we will use different estimation methods from the class of count-data models. This will not only be a test of the robustness of our results but will also lead to a better understanding of the role of the different dimensions of human capital on innovation.

In Table 2 we employ different models to study the relationship between the absolute number of patents and the different types of human capital. We begin with estimating a standard OLS model (columns 1–2). Since the OLS model is not appropriate when dealing with a large number of zeros in the dependent variable, we constrain the sample to those counties with a positive number of patents (\(n=115\)). The OLS estimates suggest that both literacy and master craftsmen are significantly and positively related to the number of valuable patents, conditional on having at least one valuable patent. The coefficients for secondary school enrollment rates are insignificant.

In columns 3 and 4 we estimate the zero-inflated negative binomial model which accounts for the excessive number of zeros and for over-dispersion, i.e. the fact that the (conditional) variance exceeds the (conditional) mean. In particular, this model assumes that the excess-zeros are generated by a separate binary process which is independent from the process that leads to the count data. Since most of the patents in our sample are related to manufacturing (even the patent class 45 “agriculture” mostly contained agricultural machinery), the geographical distribution of valuable patents follows rather mechanically the geographical distribution of the industrial centers. That is why we assume that the percentage of people working in manufacturing predicts the excess number of zeros. The hypothesis is that counties with a low level of manufacturing were less dependent on innovation and therefore had zero valuable patents. Hence, in the first step, a logit model is used to estimate whether or not a Prussian county was capable (or willing) to get at least one valuable patent. In the second step, a count model is employed to estimate the absolute number of valuable patents in those counties where the occurrence of a valuable patent is more likely. Note that in the second step of the estimation zero patents are also possible. Yet, by predicting the excess-zeros in this way we reduce the potential impact of human capital on zero innovation, i.e. we implicitly assume that human capital mostly predicts variation in patents in the positive domain.

Table 2 Human capital and patenting activity

Columns 3 and 4 of Table 2 reveal that the percentage of the labor force working in manufacturing is indeed a good predictor of the excess zeros: the higher the percentage of people in manufacturing, the lower is the probability of observing a county with zero valuable patents. The estimates show that literacy and master craftsmen are significantly related to the number of valuable patents.Footnote 24 In columns 5 and 6 we estimate a standard negative binomial (NB) model which does not explicitly model the excess zeros but accounts for over-dispersion. Again, the estimates suggest a positive relationship between both literacy and master craftsmen and a county’s innovation, though the coefficient for master craftsmen in column 6 comes only close to significance with a p value of 0.107. In columns 7 and 8 we show the results of a Poisson model which is a special case of the negative binomial distribution assuming that the dispersion parameter is zero. These estimates also indicate that both literacy and master craftsmen are positively related to patenting. In sharp contrast, enrollment in secondary education has no significant impact on patenting activities regardless of the model we use. Finally, in column 9 we run a “horse-race” between literacy and master craftsmen in the Poisson model. This estimate implies that patenting activity was influenced rather by literacy than by master craftsmen. As the following regression results will show, however, the impact of the different types of human capital on innovation depends on both the type of patentee and the technology we look at.

With respect to the control variables, we find that both a larger population and a higher urbanization rate are associated with more innovation. However, we generally do not find any association of religions such as Protestantism or Judaism with patenting. This suggests that, once accounting for the human capital channel, Protestantism and Judaism had not residual impact on innovation.Footnote 25

So far, the results suggest that both literacy and master craftsmen are positively related to patenting activities. The next subsection will show that we can sharpen our results by explicitly considering the new role of firms in the innovation process of the Second Industrial Revolution.

4.1 Human capital and innovation by type of patent holder

The rise of chemicals and electrical engineering in the late nineteenth century relied much more on the availability and use of scientific knowledge than the innovations of the First Industrial Revolution in the late eighteenth century (Moser 2012, 2013). That is why, during the nineteenth century, the proverbial ingenious inventor lost ground to a more systematic and science-based research carried out in the newly-founded R&D departments of larger firms. This was especially true in Germany where firms like BASF, Bayer or Siemens profited from the advanced national education and research system.Footnote 26

In Table 3 we estimate different count-data models for independent innovations (Panel A) and for firms’ innovations (Panel B). In addition to our usual control variables we take into account the share of people employed in banking, insurance, and law in 1882. As patent fees were relatively expensive in Germany, easy access to credit and patent lawyers might have especially facilitated private inventors’ patenting activity.

Table 3 Human capital and patenting activity by patent holder

Our estimates now lead to a clearer result. Panel A shows that the coefficient for master craftsmen is significantly positive regardless of the model we use (columns 5–8, Panel A) whereas literacy is only weakly related to independent innovation (columns 1–4, Panel A). The horse race in column 9 provides additional support to the notion that master craftsmen and not literacy was related to independent innovation in the German Empire.

Panel B reveals that the opposite is true with respect to firms’ innovation: Literacy is significantly and positively related to firms’ innovation (columns 1–4, Panel B), whereas master craftsmen do not show any significant relationship with the innovation outcome (columns 5–8, Panel B). The horse race regression in column 9 confirms this finding.

Interestingly enough, when controlling for urbanization, the impact of secondary schooling is, in many cases, significantly negative with respect to firms’ innovation. This result might partly be explained by the fact that larger German firms already hired R&D personnel and other white-collar employees on the supra-regional labor market (Baten et al. 2007). Another possible explanation refers to the different types of secondary schools with very different curricula. As already mentioned, our variable for secondary education includes enrollment in both Gymnasium and Realschule, where the former had a more humanistic curriculum, and the latter was more business-oriented.Footnote 27 The fact that the Gymnasium dominated numerically secondary education (on average 60% of the students were enrolled in the Gymnasium) could explain the negative relationship between secondary education and firms’ patenting activity. Counties with a relatively large number of secondary schools might have had comparatively many public offices and private firms that were specialized in administrative or distributive aspects of manufacturing rather than in production activities where the patentable technological innovation occurred.Footnote 28

Considered as a whole, the findings of Table 3 suggest that the practical knowledge of master craftsmen which played such a prominent role during the First Industrial Revolution was still useful for independent innovations in more “traditional” technological fields such as machine-building, but rather unimportant for developing the science-based innovations of the Prussian firms that were engaged in chemicals and electrical engineering.

We can test this sector-related hypothesis by differentiating innovation activities by technological classes (see Streb et al. 2006, p. 359). On the one hand, we look at “traditional” innovations (“machines”) whose valuable patents were assigned to the technological fields “metal working”, “machine parts”, and “steam boilers”. On the other hand, we consider patents granted in the fields of “electrical engineering” and “chemicals” including “synthetic dyes” as science-based innovations (chemistry/electricity).

Table 4 Human capital and innovative activity in traditional (Panel A) and frontier technology (Panel B)

In Panel A of Table 4 we focus on innovation in the “machine” industry. The estimates of both the negative binomial and the Poisson model in columns 1–2 and 4–5 (Panel A) depict a strong pattern: Literacy is not related to “traditional” innovation in the machine industry, whereas a larger share of master craftsmen is strongly related to that type of innovation.Footnote 29 These results also hold when using an alternative outcome variable. Columns 6 shows a significant correlation between master craftsmen in 1849 and the share of people employed in metal working computed from the occupational census in 1882. Consistently, literacy is not related to occupation in metal working (column 3). This result is also corroborated by the horse race regressions presented in columns 7 and 8.Footnote 30

The estimates presented in Panel B of Table 4 focus on innovation and occupation in the scientifically advanced sectors chemistry and electrical engineering.Footnote 31 The results are unambiguous: literacy is strongly related to innovation in chemistry and electrical engineering (columns 1–2, Panel B), whereas master craftsmen do not significantly predict that type of innovation (columns 4–5, Panel B). The results are similar when using as dependent variable the share of people employed in chemistry in 1882 (columns 3 and 6, Panel B).Footnote 32 The horse race regressions in columns 7 and 8 (Panel B) confirm the notion that literacy had a positive impact on innovation and employment in technologically advanced industries.Footnote 33

Note that we have not identified the direction of causality. In line with our explanation presented in Sect. 2, innovations might have flourished in counties with a well-educated population because entrepreneurs decided to locate innovative and high-tech firms in highly productive areas characterized by high levels of literacy. With regard to the decision where to establish a firm, the local availability of literacy might have been as important as the access to near-by natural resources such as water or coal. On the other hand, an above-average economic growth of counties with many firm innovations might have generated the tax revenues that were needed to finance better education at the first place.

In sum, the analysis of the relationship between different types of human capital and innovation lends support to the hypothesis that the Second Industrial Revolution was a transition period when it comes to the role of human capital. As in the First Industrial Revolution, “useful knowledge” embodied in skilled craftsmen was still an important determinant for innovation activities of private inventors in more traditional industries such as steam boilers or mechanical metal processing. Simultaneously, however, basic formal education, here measured by literacy, became more and more important for firms’ patenting activities in high-tech industries such as chemicals or electrical engineering.

5 Different types of human capital and income

After having identified the most important correlations between the different types of human capital and innovation, we now turn our attention to the impact of human capital on income. Before accounting explicitly for patenting activity, we test whether literacy rate, master craftsmen and formal secondary education had a persistent impact on tax revenues. We estimate the following model:

$$\begin{aligned} \hbox {Log}(Y_{1898-1903} )=\beta L_{1871} +\gamma Z_{1849} +\delta S_{1864} +\mathbf{X}^{\prime }\lambda +\varepsilon \end{aligned}$$
(1)

where Y is the mean tax revenues per capita for the period 1898–1903, L is the literacy rate in 1871; Z is the share of master craftsmen in 1849, and S is the enrollment rate in secondary schools in 1864. Due to collinearity between the literacy rate and the share of master craftsmen, however, we avoid horse race specifications and include literacy and master craftsmen only separately in the various specifications.Footnote 34 The vector X again includes log population size (1871), the urbanization rate (1871), the share of large landownership (1882), the shares of Protestants (1871), the share of Jews (1871), the share of people born in the municipality (1871), and a binary variable for the counties west of the river Elbe. Robust standard errors are clustered at the district (Regierungsbezirk) level to allow for correlation within larger administrative units. The OLS estimates are reported in Table 5.

Table 5 Human capital and log income tax revenues

Column 1 reports a strong bivariate positive correlation between log tax revenues and literacy rates. Column 2 shows a similar result for master craftsmen. The coefficients of the two types of human capital are also comparable in magnitude. In column 3 we include secondary school enrollment rates plus our set of controls and the coefficient for literacy remains highly significant and of similar size. Secondary education is positively related to tax revenues. As expected, population size and urbanization are also significantly positively related to tax revenues. Interestingly, once accounting for literacy, the coefficient of the share of Protestantism is negative. A higher percentage of people born in the respective municipality, that is a low immigration rate, is negatively associated with tax revenues. Finally, counties west of the river Elbe have higher levels of income tax revenues. The estimated coefficients can be interpreted as semi-elasticities: a change in the literacy rate by one standard deviation is associated with a 30 % increase in tax revenues. The same change in the secondary school enrollment rate leads to an increase of tax revenues by 12%. In column 4 we run the same specification but for the share of master craftsmen. As one can see, the coefficient is now less than one third in comparison to the unconditional regression (column 2) and significant only at the 10% level. Secondary education keeps being significantly positively associated with income tax revenues.

Bils and Klenow (2000) point to the fact that most empirical studies only document correlations between human capital and economic development but do not establish causal effects. We have already discussed the problem of reverse causality: Economically more advanced counties might have had both the need and the financial means to build up a more efficient education system. In this case, economic growth would have fostered human capital formation and not the other way around.Footnote 35 We have information on income of male elementary school teachers for 1886 and past income-tax revenues for 1877. The inclusion of such variables, although not addressing the issue of reverse causality, allows us to estimate the impact of the human capital variables on the subsequent growth of income.Footnote 36 In column 5 the inclusion of both variables reduces the size of the coefficient for literacy slightly.Footnote 37 As expected, both the income of school teachers and tax revenues in 1877 are positively related to our dependent variable. The inclusion of these variables in column 6 renders the coefficient for the share of master craftsmen smaller and insignificant. It is important to note that secondary education remains always significantly positively related to income tax revenues.

This set of results suggests that improvements in formal education which led to higher literacy and higher enrollment in secondary schooling indeed fostered a Prussian county’s economic growth during the Second Industrial Revolution.

Becker and Woessmann (2009) reanimated the debate about the role of religion for European economic growth. The authors claim that, at least in Prussia, Protestantism had an effect on economic outcomes only through comparatively higher levels of primary schooling (measured by literacy) that resulted from Protestants’ precept that everyone should be able to read the Bible.Footnote 38 To address the problem of reverse causality—Protestants might have preferred richer counties with better schooling—and the omitted variable bias, Becker and Woessmann (2009) employ an instrumental variable approach instrumenting Prussian counties’ literacy rates with the distance to Wittenberg, the supposed center of German Protestantism. We cannot apply an analogous identification strategy in our study because it is verging on the impossible to find instruments for three different types of human capital that are all orthogonal to income tax revenues. What we can do is to test whether the results of Becker and Woessmann (2009) still hold when introducing master craftsmen and enrollment in secondary schooling as additional controls.

Note that we do not make any claim of causality with respect to the share of master craftsmen or secondary education. In the online Appendix we show that the share of master craftsmen and enrollment in secondary schools are not correlated with distance to Wittenberg (Figure A4). This suggests that distance to Wittenberg identifies exogenous variation only in literacy but not in the other two types of human capital we analyze. The results of the extended instrumental variable approach are reported in Tables A7 and A8 in the online Appendix. The first-stage F-statistic indicates the strength of distance to Wittenberg as an instrument for literacy. The instrumental variable estimates still suggest a statistically and economically significant effect of literacy on tax revenues. We find that the effect of literacy on tax revenues is also robust to the introduction of geographic controls such as soil quality, slope, temperatures and precipitation.Footnote 39 For additional robustness checks see the online Appendix.

6 Literacy, innovation, and income

One of the main objectives of this paper is to find out to what extent literacy and innovation affect income independently of each other. The left panel of Fig. 7 shows the relationship between log income tax revenues per capita and literacy rate while the right panel presents the relationship between log income tax revenues and patents per 10,000 people. In both cases, a positive relationship is clearly visible.

Fig. 7
figure 7

Source: see text

The relationship of log income tax revenues p.c. with literacy rates and valuable patents

In the first part of our empirical analysis we have shown that patenting activity in Prussian counties depended on literacy, in particular in the case of large firms and high-tech innovation. Because of this relationship and since we do not have an instrument for patenting activity that would need to be orthogonal to both literacy and income tax revenues, we can analyze this issue only descriptively by estimating the following OLS model:

$$\begin{aligned} \hbox {Log}(Y_{1898-1903} )=\beta L+\gamma P+\mathbf{X}^{\prime }\lambda +\varepsilon \end{aligned}$$
(2)

where the dependent variable is the log of the mean income tax revenues per capita for the period 1898–1903, L is the literacy rate, and P is the number of patents per 10,000 people in the period 1877–1890. The vector X includes our standard set of control variables. Note that we will also include enrollment rates in primary and secondary schools. To facilitate comparison, all human capital variables and the variable for patenting activity are standardized with zero mean and unit standard deviation. Standard errors are clustered at the district (Regierungsbezirk) level. Since in this section we are not interested in assessing the impact of master craftsmen which are observed at county borders as in 1849, we can revert to the county borders in place in 1871. That is why the number of observations increases to 452.

In the regression we include also geographic controls and “historical” controls to account for persistent differences in economic patterns at the county level. The geographic controls comprise variables for soil quality, slope, mean temperature and mean precipitation. The “historical” controls include dummies for having a university or a school in 1517, dummies for having a former Imperial or a Hanseatic city in 1500, respectively, and the level of city population in 1500.Footnote 40

Table 6 Human capital, patenting activity, and log income tax revenues

OLS estimates of Eq. (2) are presented in Table 6. In column 1 we regress log tax revenues on literacy and enrollment rates in primary schools in order to compare directly the effects of the stock and the flow of basic education. In addition, we control for secondary school enrollment. As one can see, literacy is still positively and significantly related to tax revenues while primary school enrollment rate has no significant impact. Enrollment in secondary schools has a significant positive coefficient which might be driven by reverse causality: The high demand of economically well-developed counties for skilled workers might have motivated investment in secondary schooling.

By including quintiles of the distribution of literacy, in column 2, we test for non-linearity in the relationship between literacy and tax revenues. The estimates reveal an interesting pattern: On the one hand, counties in the second, third, or fourth quintile of the distribution of literacy generated between 41 and 48% higher tax revenues than the counties in the bottom 20%. This suggests that Prussian counties needed to exceed a minimum threshold of literacy to prosper economically. On the other hand, counties at the top 20% of the distribution of literacy had significantly higher tax revenues than the preceding quintiles.Footnote 41 The interpretation of this result is consistent with our notion of a complementarity between technology and skills presented in Sect. 2, according to which innovative technologies with a comparatively high economic performance require high levels of human capital. Data on the occupational structure corroborate this interpretation. For example, the share of people employed in high-tech chemistry at the top quintile of the literacy distribution (0.13%) is significantly larger than the respective share employed in the fourth quintile (0.08%). On the contrary, mean employment in the traditional textile sector already peaked in the third quintile of the literacy distribution with 2.5% and decreased afterwards to 0.9% in the top quintile.

In column 3 we focus our attention on the relationship between innovation and log income tax revenues. The coefficient for valuable patents is positively and significantly related to log tax revenues. The magnitude of the coefficient, when compared to literacy, suggests a comparatively smaller impact of valuable patents on income: a standard deviation increase in patents per 10,000 people is associated with an increase in tax revenues by about 6%. In column 4 we analyze the relationship between innovation and income tax revenues both at the extensive and intensive margin. To have at least one valuable patent seems to be more relevant for income tax revenues than the absolute number of valuable patents.

In column 5 we further explore the possibility of non-linearity in the relationship between valuable patents and income tax revenues by including dummy variables for having 1, between 2 and 4, or 5 or more valuable patents, respectively. The results provide again some evidence for non-linearity. Income tax revenues in counties with five or more patents are not higher than those in counties with only a few patents (2–4).Footnote 42 In columns 6 and 7 we exploit our distinction between firms’ and independent patents and estimate their separate relationship with income tax revenues. The coefficients of both types of patents are positive and significant and of similar size.

In columns 8 we run a horse race between different measures for human capital and innovation activities. The estimates suggest that literacy and valuable patents are independently of each other related to income tax revenues. Most importantly, the magnitude of the coefficients is almost identical in size to the coefficients in the regressions that consider both variables in isolation. This finding supports our hypothesis that the impact of human capital on growth involves multiple channels.

In columns 9 and 10 we provide further support to this interpretation by using alternative measures for the scope of innovation activities.Footnote 43 In most Prussian counties the inventors concentrated on a very small variety of technological classes. In fact, the average number of different technological classes in a county is 1.5 (whereas the average quantity of valuable patents comes to 3.8). Column 9 shows that the number of technological classes is positively correlated with tax revenues. The coefficient for literacy (and secondary education) remains virtually unaffected. The data on technological classes can also be used to construct an index of technological fractionalization. This index measures the probability of drawing two different technological classes when randomly selecting two patents from a county’s patent sample.Footnote 44 Column 10 reveals that this variable is also strongly positively related to tax revenues. Moving from a county with zero technological diversity to a county with a fully heterogeneous spectrum of technological classes is associated with an increase in tax revenues of 28%. This result holds when controlling for the number of patents per capita and for having at least one valuable patent.Footnote 45 Again, the coefficients for literacy and secondary education stay the same.

Table 7 Literacy, patenting activity, and fertility outcomes

7 Human capital and the transition to modern economic growth

An important characteristic of the transition to modern economic growth is the rapid decline in fertility rates at the end of the nineteenth century. Theories about the so-called quantity-quality tradeoff argue that parents’ increasing investment in their children’s education during the Second Industrial Revolution was accompanied by decreasing levels of fertility (Galor 2011, 2012). The logic behind this hypothesis is that technological progress demanded higher levels of education which, in the presence of a household’s binding budget constraint, were traded against the quantity of children.

We add to this literature by estimating the impact of literacy on fertility outcomes at different levels of technological progress. According to Unified Growth Theory, the negative effect of education on fertility should be more accentuated when the returns to education are higher (Galor 2011, p. 154). We can test this hypothesis by interacting the literacy rate with the number of patents that were held in a county. Table 3 had shown that literacy is more related to firms’ innovation than to independent innovation. Therefore, it is reasonable to assume that the “returns to literacy” are higher in counties with many firms’ innovations than in counties with many independent innovations. We test this assumption in Table  7.Footnote 46

The fertility outcomes used in this analysis are the crude birth rates (CBR) in 1890 and 1900 and the marital fertility rates (MFR) in the same years.Footnote 47 The estimates show that the interaction between literacy and patents is only significant in the case of firms’ innovation. Interestingly enough, firms’ innovation is also positively correlated with the fertility outcomes whereas independent innovation shows no significant relationship with the CBR or the MFR. In sum, the estimates of Table 7 support the notion that the negative impact of literacy on fertility increases with technological progress that raises the returns to education. In general, we provide substantial evidence in favor of the hypothesis that literacy played a crucial and multifaceted role in Prussian’s transition to modern economic growth.

8 Conclusion

We merged individual data on valuable patents granted in Prussia in the late nineteenth-century with county-level data on literacy, craftsmanship, secondary schooling, and income tax revenues to explore the complex relationship between different types of human capital, innovation, and income.

It is widely believed in economics that improving formal education of broad segments of the population is a necessary precondition for a country’s transition from stagnation and backwardness to sustainable economic growth. In sharp contrast to this view, some scholars have claimed that average levels of literacy played no important role during the First Industrial Revolution in Britain and other European countries. They stress, instead, the importance of a few highly skilled craftsmen and entrepreneurs who were needed to transform useful knowledge into economically successful innovations. With this article we bridge the gap between these two perceptions by showing that the Second Industrial Revolution might have been a transition period when it comes to the role of human capital. As in the preceding First Industrial Revolution, “useful knowledge” embodied in master craftsmen was strongly related to innovation of independent inventors. But at the end of the nineteenth century the time of the famous independent inventor was mostly gone. The newly-founded industrial R&D departments took over the lead in developing more and more science-based innovations. At the same time, the quality of basic education became positively associated with both workers’ productivity and firms’ R&D processes.

To explain the surprising result that literacy was positively correlated with innovation activities we presented three different explanations that are not mutually exclusive. First, high literacy rates might indicate an above-average general investment in children’s human capital that comprised also higher education, nutrition, and preventive health care. Second, a high literacy rate might have facilitated transregional knowledge spillover because reading was the only way to learn about ideas that have occurred outside the own locality. Third, high-tech innovations and formal basic education might have been complements in highly innovative industries.

Our empirical analysis also strongly suggests that literacy and patenting activities are independently correlated with income. This evidence lends support to the notion that basic education had an impact on Prussian growth through both the direct production channel and the indirect innovation channel. What is more, literacy paved the way to modern economic growth also by accelerating structural change and fertility decline. Regarding the latter, our results are consistent with the interpretation that technological advancements, by augmenting the returns to education, provided more incentives to invest in education and amplified the quantity-quality tradeoff.