Broad city-size distributions are a well investigated phenomenon and are often studied by means of rank-size representations. Their discovery is mostly attributed to Zipf and his monumental book “Human behavior and the principle of least effort: An introduction to human ecology” (Zipf 2012). While their origin and some aspects, such as log-normal vs. power-law, upper or lower cut-offs, specific value of the exponent, are contested (Berry and Okulicz-Kozaryn 2012), there is a general agreement about the broad nature of city-size distributions (Batty, 2013, e.g.).

Here we trace back the pioneering works that deal with power-law city size distributions and find that they were first discovered by Auerbach (1913). The rank-size representation was introduced by Lotka (1925) and the topic was popularized by Zipf (1941), Zipf (1942) and Zipf (2012). In the following, we want to review Auerbach’s original work, discuss some bibliometric analysis, and speculate about reasons why Auerbach’s work receives so much less attention than Zipf’s. We also discuss the contribution of other contemporaries, in particular Saibante, Christaller, and Gibrat. We end with biographic notes on Auerbach.Footnote 1

In his paper “Das Gesetz der Bevölkerungskonzentration” (The law of population concentration) (Auerbach 1913) Auerbach defines a quantity which he calls “absolute concentration” (A.K.) given by the product of rank r and city size s. The size of cities is given by their population and the rank is obtained by ranking cities according to their size, where the largest city has rank 1. Plotting AK vs. r for German cities in 1910, he finds that AK reaches, after some deviations at lower ranks, a plateau of approximately constant value. Today the rank-size relation

$$\begin{aligned} s\sim r^{-\xi } \end{aligned}$$
(1)

is studied and reflects a power-law city-size distributionFootnote 2 analogous to the Pareto distribution for income (Pareto 1896, 1897). Combining Auerbach’s AK and Eq. (1) leads to \(\text {AK}=rs\sim r^{1-\xi }\), which is constant for \(\xi \simeq 1\). Accordingly, Auerbach’s finding of an approximately constant AK across German cities is equivalent to what is known today as “Zipf’s law for cities”. As becomes clear from his final paragraph, Auerbach was aware that constant \(r\times s\) characterizes the size distribution and he already called it a law—according to which the count of cities is inversely proportional to their minimum size (Auerbach 2023).

To make comparisons across countries and time periods, Auerbach suggests to divide each country’s AK by its population and thereby to obtain what he calls specific concentration (Sp.K.) as \(\text {SpK}=\frac{AK}{S}\), where S is the total population of the country. He then uses SpK to investigate city size distribution across countries, see Fig. 1a, regions, and time periods.

To our best knowledge, Lotka (1925) was first to show the rank-size plot, see Fig. 1b, and to mention the power-law relation as it is analyzed today (Cottineau 2017, e.g.). Lotka’s empirical rank-size representation is a simple and elegant method given the means that were available at the time. It also draws the attention to estimates of the exponent instead of Auerbach’s AK and SpK. However, in his book “Elements of Physical Biology” (Lotka 1925) Lotka treats a range of ecological problems and the “Law of urban population” seems a bit displaced between “The intensity law in organic and economic systems” and “The biological background of population pressure”.

Fig. 1
figure 1

Original graphical representations. a Auerbach plots the specific concentration (city population times rank divided by country population) as a function of the rank (for various countries). A constant behavior corresponds to Eq. (1) with \(\xi \simeq 1\). b Lotka simply plots the city population vs. rank (for USA cities). The straight line in log-log representation corresponds to Eq. (1). Lotka measures \(\xi \simeq 0.93\)

Fig. 2
figure 2

Original table from Saibante (1928). Please note, Saibante swapped the axes compared to Lotka so that the exponent \(\alpha \) reported here is the inverse of \(\xi \) in Eq. (1). Source: (Saibante 1928, p.58)

A little known but rather early and extensive work is the paper by Saibante published in 1928 (Saibante 1928). Saibante built on Lotka’s formulation and investigated how well it fits city-size distributions across a range of countries, regions, and time periods. He focused on the exponent, see Fig. 2, and thereby laid the foundation of modern empirical investigations. Saibante drew the connection to the Pareto distribution and additionally calculated the Gini coefficient. We speculate that as an Italian he had better access to their work.

Zipf wrote three publications on city-size distributions. In 1941 his book “National unity and disunity” came out and almost exclusively treats cities and rank-size plots but only includes a small number of equation and references. In reaction to a book review by Thorndike (1941), Lotka clarifies that the discovery was first indicated by Auerbach (Lotka 1941). The second publication is Zipf’s 1942 paper (Zipf 1942) in which he admits that the discovery was made by Auerbach and he regrets not knowing the reference before publishing (Zipf 1941). In this treatise we focus on his third publication, his well-known 1949 book (Zipf 2012), because it seems more elaborated and most cited today.

Zipf not only presents empirical analysis but also some sort of explanation for the power-law rank-size relation – he frames it as a manifestation of an optimization (“least effort”). Specifically, he introduces a model of economic geography where the urban system is organized in such a way that costs of production and consumption are minimized in terms of labor and transport (Zipf 2012, Ch.9, Sect.I). The model leads to a set of predictions, e.g. regarding the diversity (of establishments) as a function of city size. He argues that if transport costs are high, then people migrate to production centers; if they are low, then goods are brought to the consumers. Zipf calls these two effects “Force of Diversification” and “Force of Unification”, respectively. In the extreme case, they lead to a large number of small settlements or a small number of large cities, respectively. The power-law rank-size relation is then introduced as a straight line in double-logarithmic representation, connecting these two extremes as supporting points, which leads to \(rs^{\alpha }=\text {const}\) (Zipf 2012, Eq.(9–12)), corresponding to Eq. (1) with \(\alpha =1/\xi \) (Duncan 1957, e.g.). Moreover, Zipf attributes the exponent \(\alpha \) to the ratio of the magnitude of both forces. Empirically, Zipf estimates \(\xi \approx 1\) (implying \(\xi \simeq \alpha \)) which he interprets as balance between diversification and unification (Zipf 2012, Ch.9, Sect. III).

Table 1 provides an overview of early works and the referencing among them. It represents a subjective set of works related to cities and city-size distributions. Auerbach (1913) is cited in the later publications but the economics works (Gibrat 1931; Singer 1936), and Lösch (1940) represent an exception. Lotka (1925) is only cited by the latest publications (Stewart 1947) and Zipf (2012) (the paper by Saibante (1928) got very little attention). Almost all works somehow refer to Pareto (1896, 1897) but to our best knowledge, (Saibante 1928) was the first to recognize the connection between Auerbach’s and Pareto’s works. Singer (1936) was the first to spell out the “complete analogy” in an English publication. There seems to be a parallel evolution of two groups, i.e. works cited by Zipf (2012) and those cited by Lösch (1940).

Table 1 Citation matrix

Gibrat (1931) is off in Table 1 as he describes the log-normal distribution. Therefore, he analyzed the (complementary) cumulative distribution function (i.e. counts of cities that are more populous than certain threshold values x) and from the respective probabilities he obtained corresponding values z from the tabulated probit function. He then found a logarithmic relation \(z=a\log x+b\), where a and b are parameters. A logarithmic argument in the normal distribution corresponds to a log-normal distribution. As z vs. \(\log x\) is a linear function, it has been misquoted by Singer (1936) (“Gibrat’s formula is more general”) and Zipf (2012) (“Gibrat reported the rectilinearity”) but Lösch (1940) points towards “differences between Gibrat’s and Pareto’s methods”.

To assess the impact that Auerbach’s work had and still has, we analyze the number of citations that (Auerbach 1913) and Zipf (2012) receive across the years. Therefore, we use the Publish or Perish software (Harzing 2007) to retrieve the bibliographic information (from google-scholar; QueryDate: 2021-07-15) of those works citing Auerbach (A13) or Zipf (Z49). Since the Zipf-book is also about word frequency and other topics not immediately related to cities, it also receives unrelated citations. Accordingly, we only consider citations where the following keywords appear in the title or in the beginning of the abstract: city, cities, town, towns, village, villages, settlement, settlements, metropolis, metropolises, urban, urbanization, urbanisation, population, populations. To be consistent we apply the same filtering to both works and the number of counts reduces from 868 to 334 for A13 and from 15,339 to 1368 for Z49 (227 publications cite both). In Fig. 3a we plot the number of citations per year for either and both. In Fig. 3b the share of Z49-citations that also cite A13 is plotted. We can make three observations: (i) Works citing Auerbach (1913) to a large extent also cite Zipf (2012)—but less so vice versa. (ii) Typically 20% of the works citing Zipf (2012) also cite Auerbach (1913). (iii) There are two waves, the first peaking around 1980 (possibly related to the Simon and Mandelbrot dispute in the 1960s (Mitzenmacher 2004)) and the second one around 2015. We can conclude that the discovery is mostly attributed to Zipf and only a relatively small share acknowledges Auerbach. Cottineau (2022) reviewed 66 sources and found that most works not citing Zipf were published before 1990.

This leads to the question why Auerbach’s paper (Auerbach 1913) is so much less recognized although it was published more than 30 years before Zipf’s book (Zipf 2012). First, instead of Auerbach focusing on the regularity itself, he uses it as a prerequisite for his measures of absolute and specific concentration which are only meaningful if the regularity holds true. Moreover, his measures of absolute and specific concentration were found to have drawbacks (Maurer 1913) and did not catch on. Second, Auerbach’s paper is written in GermanFootnote 3 and Zipf’s work is written in English which was/is advantageous when English took the role of (global) science language. Third, Zipf’s works are more extensive compared to Auerbach’s 2-page paper (plus Figures). In addition, Zipf also finds a power-law in the word occurrence and discusses city-size distributions as a consequence of “the principle of least effort”. A fourth reason could be that early economics work in the context did not cite Auerbach (1913) and this missing reference could have been reflected in follow-up work in the rather big economics field. Lösch even attributed the discovery to Singer (1936), see Table 1.

Fig. 3
figure 3

Citations to Auerbach (1913) (A13) and Zipf (2012) (Z49). a Citations per year individually and both. Please be aware of the approximately exponentially increasing global publishing activity (Fortunato et al. 2018). b Share of works citing Auerbach that also cite Zipf. In both panels, the dark solid line represents an 11 year moving average. Almost all works citing Auerbach also cite Zipf—but not so vice versa

Interestingly, Christaller (1933), the creator of Central Place Theory, gives reference to Auerbach (1913) but in an ambiguous way. On the one hand, he writes “»Auerbach’s law« ... is not much more than playing with numbers.” (note 19, p. 82). On the other hand, he also states that explaining Auerbach’s finding is one of his major goals. “We come now to the chief problems of our investigations. ... generally and less exactly speaking, besides a great number of the smaller and smallest towns and market places, there are only a small number of greater towns; and the greater a town is, the smaller is the number falling in its respective category. This fact has already led to the statement of a most incredible law. ... Is there a possibility of truly explaining this fact?” At the same time he wants to search for such a law “Are there rules governing the relations of the frequency of a given size-class of city to the frequencies of other size-classes?” (“8. The System of Central Places”, p.58ff). Not surprisingly, it was confirmed that a special case of Christaller’s theory is consistent with the rank-size relation (Berry and Garrison 1958). It seems, Christaller did not recognize Auerbach’s achievement but nevertheless appreciated the importance of his discovery.

While Christaller made a career under the Nazi-regime (Barnes and Minca 2013), Lösch, who published several years after Christaller, was a staunch opponent of National Socialism (Bröcker 2014) and died at young age of scarlet fever shortly after the end of World War II. Although Auerbach (born 1856) accomplished his habilitation at an early age of 22, only in 1889 did he become an extraordinarius professor for theoretical physics at University of Jena. As detailed by Schlote (2011), his assignment was obstructed because of antisemitism in the department (Schröder-Auerbach 1955, Happe and Fischer 2003, see also). Only in 1923, shortly before his retirement in 1927, was he promoted to an ordinarius professor (Schlote 2011). In 1933, after his second stroke and the Nazis rise to power, Auerbach and his wife Anna (née Silbergleit) committed suicide (Happe and Fischer 2003, e.g.).

Fig. 4
figure 4

Portrait of Felix Auerbach by Edvard Munch, 1906, oil on canvas, 85.4 cm \(\times \) 77.1 cm, https://www.vangoghmuseum.nl/en/collection/s0541S2017

Having published more than 20 books, Auerbach was a versatile physicist and scholar (further biographical details can be found elsewhere (Klinckowstroem 1953, Wahl 1992, Schröder-Auerbach 1955, Happe and Fischer 2003, Rybski 2013, e.g.). His work on the measurement of hardness (Auerbach 1891) led to an Auerbach’s law in physics (Langitan and Lawn 1969, Schlote 2011, e.g.). Auerbach’s lectures at University of Jena were attended by the philosopher Rudolf Carnap (Carnap 2021) and his diaries suggest interdisciplinary syllabi. As Auerbach published on core physics topics, including relativity theory, and on musical art and fine art from the point of view of the natural scientist, Müller (2004) argues that he served as mediator between Einstein and the Bauhaus (Bredekamp 2005, see also). He also served as a patron of the arts (Kisch-Arndt 1964; Wahl 1992; Happe and Fischer 2003; Müller 2004). A portrait of him by Edvard Munch (Kisch-Arndt 1964; Happe and Fischer 2003), see Fig. 4, is an example of his engagement in the contemporary art scene. Auerbach also commissioned the Bauhaus architect Walter Gropius (with Adolf Meyer) to build his house in Jena. It was finished in 1924 and turned out to be the first single-family house following the New Building principles (Happe and Fischer 2003).

To date, there is an extensive body of literature, including reviews (Berry and Okulicz-Kozaryn 2012; Arshad et al. 2018), meta studies (Nitsch 2005; Cottineau 2017), international comparisons (Soo 2005; Ribeiro et al. 2021), and possible explanations of the ALZ-law (Berry and Garrison 1958, Gabaix 1999, e.g.). Future work could inspect more closely possible conceptual predecessors, including Reynaud (1841) and Kohl (1850), as well as successors of the second wave (Allen 1954; Duncan 1957; Beckmann 1958; Berry and Garrison 1958; Moore 1959). One theoretical strand are models employing stochastic mechanisms (i.e. proportionate effect resp. preferential attachment) and evolving around Yule (1925; 1944), Gibrat (“Gibrat’s law”) (Gibrat 1931; Sutton 1997), and Simon (1955) (for a recent approach see Beare and Toda (2022). An overview can be found in Mitzenmacher (2004). It is also worth mentioning that power-law distributions are abundant in nature and society (Newman 2005, e.g.) and a different perspective could be gained when approaching the phenomenon from other disciplines, such as linguistics or ecology.