Introduction

Power-law distributions of the form f(x) = a xk have been observed in several physical, biological, financial and social systems, such as the size distribution in cities.1,2,3,4,5 Zipf’s law for word frequency is a well-studied paradigm, for which Zipf coined the term “principle of least effort”,6 exhibiting a power law that was later derived by maximizing Shannon’s information entropy9 under a constraint on the mean-value of the logarithm of a variable. Notably, Zipf's law relates the probability of occurrence of a given word to its rank7,8 rather than its occurrence count. More generally, power laws have been used by Mandelbrot to describe word frequencies (and many other social and physical phenomena9) through the related concept of efficiency in communication. The principle of least effort was first discovered in 1894 by a French philosopher Guillaume Ferrero and he discussed this principle in his article entitled "L'inertie mentale et la loi du moindre effort".10 In 1949, the principle was proposed by George Kingsley Zipf in his book " Human behavior and the principle of least effort: An introduction to human ecology ".11 Zipf theorized that the distribution of word use was due to the tendency to communicate efficiently with least effort. Hence, the principle of least effort is also known as Zipf's law. Based on the principle of least effort, it is human nature to want the greatest outcome at the least amount of work. The principle of least effort has also been observed in animal behavior. The animal behavior scientist L. S. Tsai stated that “Among several alternatives of behavior leading to equivalent satisfaction of some potent organic need, the animal, within the limits of its discriminative ability, tends finally to select that which involves the least expenditure of energy”.12 Zipf’s law has been invoked in mining13,14,15,16,17 and geosciences.18,19,20

According to Mandelbrot,9 for most efficient communication the frequency of words transmitted in languages is inversely proportional to rank, another common invocation of Zipf's law; in general, power laws prevail in phenomena following Zipf’s law. By analogy, we hypothesize that in any given year the total price paid for elemental materials optimizes efficient utilization through least effort. We consider the elemental abundance rank j to have the same form of word occurrence j and the price of the element as similar to the cost of communication transmission.

In this paper, we extend this paradigm to a surprising observation from materials economics, the approximate power-law dependence between element price and abundance. In our conjecture, materials abundance is analogous to word frequency, while efficiency of utilization compares to transmission of information in communication theory. Hence for the most efficient way to price elements, the abundance—price distribution might plausibly obey Zipf's law.

As in word frequency studies, a useful variable is frequency rank. Empirically, we observe a power-law dependence between crustal abundance—expressed in various ways by weight, volume, or moles—and the abundance rank of elements. In particular, we show that the abundances from weight, volume and number of moles exhibit a power-law dependence upon rank. Available reserve data express crustal abundance and accessibility by mining, and we find that the three related parameters, weight, volume and moles, play similar roles in organizing the market price of an element. Departing from the trend, some elements such as Te and He are sold at apparently anomalously low prices, perhaps due to regulatory forces. Our main objective is to call attention to the surprising relationship between price and abundance.

Zipf found that the probability of occurrence pj of a given word of rank j empirically follows the relationship

$$ p_{j} \sim j^{{1/(\alpha - 1)}} . $$
(1)

Here α is a constant. For the most common word the rank j is one, for the second most common word the rank is two, etc. Exploring “efficiency”, Mandelbrot9 hypothesized that languages roughly maximize the information communicated for a given cost of transmission. The important point in his explanation is that the cost of transmission of a given word is proportional to the logarithm of its rank, following Shannon,9 which leads immediately to power-law distributions. The geologic abundances of chemical elements (namely Al, As, Ba, Be, B, Cd, C in coal, C in industrial diamond, Cr, Co, Cu, Dy, Ga, Ge, Au, He, In, Fe, La, Pb, Li, Mn, Hg, Mo, Nd, Ni, Nb, Pd, P, Re, Sm, Sc, Se, Ag, Te, Th, Sn, Ti, W, U, Va, and Zn.) found in the earth's crust is used in this study. Borrowing from analyses of Zipf’s law, the abundance Aj may be conveniently mapped to abundance rank j.

Global aspects of abundance and production

We used crustal abundances, global mining production and price per kg (in US $) tabulated in 2010, a relatively typical year, for our analysis.21,22,23,24,25,26,27,28,29

The dataset from 2010 was chosen for our initial study to test the analysis as it was a recovery year after the 2008–2009 global financial crisis. While there may not be a “typical year”, drawing inferences relating to price with respect to abundances and availability from any year provides a priori values to predict future values. Also, our intent for studying 2010 as a control system is further to understand the effects of new technologies and other device markets as they evolve in time. Most of these elements (Primary elements: Al, As, Ba, Be, B, C in coal, C in industrial diamond, Cr, Co, Cu, Dy, Au, Fe, La, Pb, Li, Mn, Hg, Mo, Nd, Ni, Nb, Pd, P, Re, Sm, Se, Ag, Sn, Ti, W, U, Va, and Zn) are obtained as primary ore, while Cd, Ga, Ge, He, In, Sc, Te and Th are mined as byproducts. The utilization of these elements is vast and diverse, from electronics to nuclear power. Utilizations include (a) Al, Be, Cu, Ag and Au for electrical transmission, (b) As, Cd, Ga, Ge, In, Se and Te for semiconductor solar cells, (c) B, Dy, Nd, rare earths and Sm for making magnets, (d) Cr, Fe, Mn, Ni and Nb for steel, (e) Ln, Pb, Li and V for batteries, (f) Hg, Sc and Sn for light production, (g) Pd, Pt and rare earths for catalysts, (h) Co, Re and Fe for making turbines, (i) Sn and Zn for soldering, (j) Ba and C- industrial diamond for drilling, (k) C—coal for power production, (l) Th for superconductors, He for cryogenic applications, (m)Ti as light weight alloys, and (n) Uranium for nuclear energy production. In 2010, China led production of 25 elements and is among the top 3 producers for 31 elements; Russia led production of 3 elements, while the USA was first in production of only 2 elements (Other significant producers include Argentina, Australia Belarus, Belgium, Bolivia, Botswana, Brazil, Canada, Chile, Gabon, Germany, Guinea, Hungary, India, Iran, Japan, and Kazakhstan). The inhomogeneity of the crust is a worrisome source of noise in market competition to be better understood.

Of particular interest are the energy-critical elements (ECE) which are chemical and isotopic species required for emerging sustainable energy sources.30,31,32,33 Examples of ECEs include cerium, cobalt, dysprosium, europium, gallium, helium, indium, lanthanum, lithium, neodymium, praseodymium, samarium, tellurium, terbium and yttrium. Some elements such as Te and He are sold at very low prices30 perhaps due to non-market forces. In the case of He, regulation in the US prompted a policy study31 intended to avoid depletion of finite resources. It is self-evident that to connect price of elemental materials to their abundance and mining production may help in developing and implementing policies for sustainable utilization of materials, sustained human development, and responsible energy sources. This goal motivated our exploration of the price of elements using rank obtained from the abundances of the elements (from weight, volume and number of moles) from earth's crust and availability of the elements (from weight, volume and number of moles) from global mining production using parsimonious assumptions.

Results and discussion

Here we use “availability” to express produced or mined quantity, while “abundance” fits geologic abundance. We confined our attention to the elements found in the earth's crust and for which there is a well-defined market price.21 Considering annual mining production by raw weight, we assigned rank 1 to the most available element (carbon in coal), rank 2 to the next most available element (Fe) and so on. In this way, we ranked the elements by availability by weight from global mining production in Table 1, which contains (i) availability by weight (kg/ton), (ii) global mining production (metric tons), (iii) weight rank of weight availability from global mining production and (iv) market price ($/kg) during 2010. Figure 1 shows the global mining production (metric tons) and price per kg (in dollars) of the various elements as a function of weight rank (using Table 1). Not surprisingly, as the weight rank increases the price per kg increases, while the global production (in metric tons) decreases. The rank changes when comparisons are made with respect to volume and moles instead of weight, which we next explore.

Table 1 (i) Abundance of the elements in earth's crust (kg/ton), (ii) global mining production of the elements (metric tons), (iii) Rank from availability by weight from global mining production and (iv) Market price ($/kg) of elements during 2010.
Figure 1
figure 1

Global mine production and price per kg of elements as a function of rank from weight during 2010 showing higher price for elements with lower mining production.

Rank with respect to abundance

We use weight, volume, and moles in the earth's crust or in global mining production to express the amount (abundance/availability) of an element. We converted Table 1 using density and atomic weight of elements to these new variables and then calculated rank resulting in Table 2. The elements Al, Fe, In, Mn, Ni, and P have similar rank j expressed in terms of weight, volume and moles; several elements such as carbon in industrial diamond and He have a wider difference of rank. Carbon in coal has rank j = 1 from availability in terms of weight from global mining production making it the most mined element with respect to weight. However, carbon as industrial diamond has rank 38 from availability in terms of weight from global mining production leading to a wider difference in rank (from 8 to 38). Similarly, helium has rank 4 by volume from global mining production and has a larger rank (to 39) by other metrics. Figure 2 shows the 3D plots for (A) ranks from abundances from earth’s crust and (B) ranks from availability from global mining production obtained from Table 1. The black dots represent the three coordinates, while the red and blue dots are their projections upon the XY and XZ planes, respectively. In both Fig. 2a, b it is observed that He and Li (light elements) have greater differences in their ranks (due to low density) when compared to heavier elements such as Re and Sc. These vast differences in rank between heavy and light elements play a crucial role in the specific market price of the material. Also, for some elements their ranks with respect to other elements in the same graph (Fig. 2a, b) are very different. For example, the coordinates for carbon in industrial diamond in graph 2A is (9 8 7), while in graph B is (42 42 40) indicating that though carbon as diamond is abundant in earth’s crust, it is mined less (or is marketed in lesser quantities to maintain high price for it) giving rise to higher values for rank from availability by global mining production. Figure 3 shows the dependence of the logarithm of the abundances and availability upon logarithm of rank for the elements. The profiles have two regions (rank in the range 1–20 and 21–43) that are linear with different slopes. This indicates a broken power-law relationship between abundance/availability (A) and rank (j) of the form A = a jk where a is a constant and k is the exponent. The log–log dependence of abundance/availability (A) upon rank in Fig. 3 can be described piecewise, as summarized in Table 3. As pointed out by Gabaix5 in his study of city growth, small and medium-size cities have higher variance in population leading to an apparently different power-law exponent that large cities; similarly, abundance/availability of elements that are relatively rarer (j > 20) can exhibit a different power-law relationship from abundant elements.

Table 2 (a) Rank from abundance in terms of weight from earth's crust (kg/t), (b) Rank from availability in terms of weight from global mining production (kg), (c) Rank from abundance in terms of volume from earth's crust (m3/t), (d) Rank from availability in terms of volume from global mining production (m3), (e) Rank from abundance in terms of moles from earth's crust (mol/t) and (f) Rank from availability in terms of moles from global mining production (mol).
Figure 2
figure 2

3D plots for (a) ranks from abundances from earth’s crust and (b) ranks from availability from global mining production. The black dots represent the three coordinates, while the red and blue dots are their projections upon the XY and XZ planes, respectively. The points in the projections along the XY plane and XZ plane are connected by solid lines. The coordinates of some elements (rank from volume, rank from weight, rank from moles) are shown for clarity.

Figure 3
figure 3

(a) Abundance in terms of weight from earth's crust, (b) abundance in terms of volume from earth's crust, (c) abundance in terms of moles from earth's crust, (d) availability in terms of weight from global mining production, (e) availability in terms of volume from global mining production and (f) availability in terms of moles from global mining production.

Table 3 Log–log dependence of abundance (or availability) upon rank.

The observation of Zipf’s law is largely empirical. However, based on prior work1,2,3,4,5,6,7,8,9,10,11,12,13,14,15,16,17,18,19,20 and our own analyses, we suggest that the origin of this relationship is related to a least-effort view. Moreover, the trend is notably accurate. It can be observed from Table 3 and Fig. 3 that the log abundance vs log rank fits the Zipf’s law trend. In addition, the standard errors observed are less (less than 5%). The R-square value in the fits were greater than 0.91.

We observed that as rank increases abundance decreases rapidly. Hence elements become rarer as rank increases. The weight of the earth's crust is ~ 2.6 × 1022 kg. Using the formula for abundance in terms of moles from earth's crust in Table 3, we get the relationship for the number of moles in earth's crust (n) for an element of rank j as

$$ n = n_{0} j^{{ - \beta }} $$
(2)

with β = 14.56 ± 0.94 and n0 = 2.98 × 1038. Choosing n = 1 gives the value of j in the range 304–669 indicating that the maximum number of elements that can exist in the earth's crust with at least one mole is ~ 600 (which is, of course, much larger than the number of elements in the periodic Table of Elements). Another way to extract the exponent k is using the Newman’s formula1 for extracting power-law exponents for functions of the type p(x) = C xk which is

$$ k = 1 + n\left[\sum\limits_{{i = 1}}^{n} {\ln \left( {x_{i} /x_{{\min }} } \right)} \right]^{{ - 1}} $$
(3)

This approach is much superior to least squares analyses, especially if applied to cumulative distributions functions. The values of the exponent computed using the above expression for all the abundances are also listed in Table 3. The value of the exponent is more for j ≤ 20 than for j > 20 in all the graphs (from Table 3).

Estimation of price of elements

Drawing parallels between the argument of Mandelbrot and others to the statistical structure of languages and our observation of the power-law relationship between materials abundance and rank, we turn to the proposition that money spent on materials should in some sense maximize utilization efficiency. If such an extremal principle holds, then one would expect Zipf’s law to hold at least piecewise to yield the most efficient way to price elements; indeed, a piecewise power law is observed in Fig. 3 in the abundance dependence upon the rank. Since the dependence of logarithm of the abundances or availability upon logarithm of rank has two regions (for j in the range 1–20 and 21–43) that are linear with different slopes (Fig. 3), within these two regions, the element with j = 12 was taken as reference to calculate the price for elements with j < 20 and element with j = 34 was taken as reference to calculate the price of elements with j > 20. If E1and E2 are two elements, we use the following formula

$$ {\text{PE}}_{1} = \left( {{\text{AE}}_{2} /{\text{AE}}_{1} } \right){\text{MPE}}_{2} $$
(4)

For estimating cost of E1 with respect to reference E2 from their abundances (or availabilities). Here PE1 is the price of 1 unit (kg or m3 or mole) of E1, AE1 and AE2 are their corresponding abundances (or availabilities) and MPE2 is the market price of 1 unit of reference E2 in the year 2010. Table 4 shows the values calculated per unit quantity of the elements using Eq. 3. From the price of unit quantity of weight, volume and moles in Table 4, the price of 1 kg of the element was estimated (using density and molecular weight of the elements) and the results are shown in Table 5. From the entries in Table 5, the value of variation of estimated price of elements from their market price (from weight, volume and moles) was calculated using the following formula

$$ {\text{Variation}}\,{\text{in}}\,{\text{price}}\,{\text{in}}\,{\text{percentile}} = \left( {{\text{estimated price}} - {\text{market price}}} \right) \times 100/{\text{market}}\,{\text{price}} $$
(5)
Table 4 (I) Market price per kg in 2010. (II) Price per kg from abundance in terms of weight from earth's crust (kg/t). For Rank 1–20 price was calculated with respect to rank 12 (zinc) and for rank > 20 price was calculated with respect to rank 32 (molybdenum). (III) Price per kg from availability in terms of weight from global mining production (kg). For Rank 1–20 price was calculated with respect to rank 12 (titanium) and for rank > 20 price was calculated with respect to rank 32 (palladium). (IV) Market price per cubic meter in 2010. (V) Price per cubic meter from abundance in terms of volume from earth's crust (m3/t). For Rank 1–20 price was calculated with respect to rank 12 (vanadium) and for rank > 20 price was calculated with respect to rank 32 (molybdenum). (VI) Price per cubic meter from availability in terms of volume from global mining production (m3). For Rank 1–20 price was calculated with respect to rank 12 (titanium) and for rank > 20 price was calculated with respect to rank 35 (indium). (VII) Market price per mole in 2010. (VIII) Price per mole from abundance in terms of moles from earth's crust (mol/t). For Rank 1–20 price was calculated with respect to rank 12 (nickel) and for rank > 20 price was calculated with respect to rank 32 (indium). (IX) Price per mole from availability in terms of moles from global mining production (mol). For Rank 1–20 price was calculated with respect to rank 12 (barium) and for rank > 20 price was calculated with respect to rank 34 (indium).
Table 5 (A) Market price per kg in 2010. (B) Price per kg from abundance in terms of weight from earth's crust (kg/t). For Rank 1–20 price was calculated with respect to rank 12 (zinc) and for rank > 20 price was calculated with respect to rank 32 (molybdenum). (C) Price per kg from availability in terms of weight from global mining production (kg). For Rank 1–20 price was calculated with respect to rank 12 (titanium) and for rank > 20 price was calculated with respect to rank 32 (palladium). (D) Price per kg from abundance in terms of volume from earth's crust (m3/t). For Rank 1–20 price was calculated with respect to rank 12 (vanadium) and for rank > 20 price was calculated with respect to rank 32 (molybdenum). (E) Price per kg from availability in terms of volume from global mining production (m3). For Rank 1–20 price was calculated with respect to rank 12 (titanium) and for rank > 20 price was calculated with respect to rank 35 (indium). (F) Price per kg from abundance in terms of moles from earth's crust (mol/t). For Rank 1–20 price was calculated with respect to rank 12 (nickel) and for rank > 20 price was calculated with respect to rank 32 (indium). (G) Price per kg from availability in terms of moles from global mining production (mol). For Rank 1–20 price was calculated with respect to rank 12 (barium) and for rank > 20 price was calculated with respect to rank 32 (indium).

Equation 4 for variation in percentile was calculated for entries in Table 5 and the item having the least value of variation of estimated price of elements from their market price (from weight, volume and moles) was listed. The item corresponding to those values along with their percentile in the list is shown in Table 6. The role of the volume from global mining production dominates in materials economics (~ 33% of the entries in the list having the least value in variation in percentile).Using abundance of the elements in earth's crust (Table 1) and prices per kg in 2010 from (Table 5), the prices vs crustal abundance is shown in Figs. 46. Some of the elements are indicated in Fig. 4 to guide the eye. Figure 5 appears have a better fit when compared to Figs. 4 and 6 (as supported by Table 6). From Figs. 4, 5, and 6, it is observed that for several elements such as Al, Fe, Ti, Ga, Sc, Ge, Be, He, Re, and Te the difference between the calculated prices (from abundances from earth's crust and availability from mining production) and the market price can be orders of magnitude in difference.

Table 6 Items having the lowest variation in price (per unit quantity) in percentile in Table 5 along with their percentage of entries.
Figure 4
figure 4

(i) Market price per kg in 2010. (ii) Price per kg from abundance in terms of weight from earth's crust. (iii) Price per kg from availability in terms of weight from global mining production.

Figure 5
figure 5

(i) Market price per kg in 2010, (ii) price per kg from abundance in terms of volume from earth's crust, (iii) price per kg from availability in terms of volume from global mining production.

Figure 6
figure 6

(i) Market price per kg in 2010, (ii) price per kg from abundance in terms of moles from earth's crust, (iii) price per kg from availability in terms of moles from global mining production.

Some elements such as lithium are being sold at prices lower than predicted by our Zipf’s Law analysis. Indeed, a case study of Li could help us better understand the long-term implications of price deviations. For example, the majority of lithium end uses do not require it in metallic form but is required as a component. Initially Lithium metal had little demand and its price was nearly constant at ~ $20 per kg. Increased demand for lithium in batteries and aerospace (aluminum lithium alloys) resulted in the increase in the price of lithium till 1998. It decreased from 1999 to 2006 due to increased production of lithium from lower cost south American brine-based sales. Between 2007 and 2008 the price of lithium increased due to further applications in batteries to ~ $88/kg. Between 2009 and 2010 the price of lithium decreased due to global financial crisis. In 2010 the price of Lithium metal was ~ $73/kg.27 However, these time-dependent studies concerning potential long-term implications of price deviations in this study is complex and is beyond the scope of the present manuscript and will be saved for future comment.

The price of the elements is affected by various factors such as extraction techniques, purification techniques, transportation, and geopolitical forces. The extraction techniques include underground mining, surface mining and placer mining.34 The underground mining is used to extract higher grade metallic ores found in veins deep under the earth’s surface using drilling and blasting. It is an expensive technique. Surface mining is used to obtain lower grade metal ores by drilling and blasting hard rocks. However some minerals can be mined without blasting. It is a profitable techniques. Several industrial minerals are obtained this way and they have low value. Placer mining (another profitable technique) is used to get minerals from sediments.34 The price of the element varies with respect to purity also. For example the average value (in 2010) for gallium of purity less than 99.99% was $307, while for purity greater than 99.9999% was $600.35 Transportation costs also play a role in the price of the elements. For example the price of Mn increased in the period 2004–2008 due to increase in fuel costs for transportation.36 Geopolitical forces are yet another factor that affects the price of the elements. Examples for effect of geopolitical forces include (i) the chief producer of cobalt since 1920s has been The Democratic Republic of the Congo. The political and civil unrest in that country has influenced the price of cobalt (to low values in 1990),37 (ii) price of copper reduced in 1990–1992 due to dissolution of the Soviet Union and political turmoil in Africa,38 and (iii) price of gold decreased in 2001 due to September 11 terrorist attack followed by price surge due to increase in investment in gold stemming from political and economic concerns.39 The global financial crisis in 2008–2009 also resulted in the decline of prices of several elements such as Sb, Cd, Cr, Co, Fe, Mo, platinum group metals, Ni, Nb, Re, Se, Sn, V, and Zn. For all these elements the price reached low values in 2009 and recovered in 2010.40 Incorporation of the effects of factors such as extraction techniques, purification techniques, transportation, and geopolitical forces which are specific to individual or small groups of elements along with our model based on estimation of prices using least-effort principles can result in a more accurate estimate of the price of materials.

Conclusion

In this research, the dependence of price upon abundance and availability is considered and compared with that of market price. This analysis could be considered as a control study with minimal variables (abundance and availability) that is common to all elements. Importantly our analysis is the first of its kind to explore in terms of weight, volume and number of moles (as ranks in Zipf’s distribution) in which we conclude that the volume of the mined element plays a more crucial role than weight and number of moles in materials economics. Elemental price and abundance are related by a power-law relationship, as are abundance and rank. This relationship holds regardless of the way to express abundances by weight, volume or number of moles. While there are many factors to consider in the problem of price distribution of elements, (i) density (especially differences in marketed phase) of the elements, (ii) extraction techniques, (iii) purification techniques, (iv) transportation, (v) geopolitical forces, and so on, the availability of the element in the market is ultimately limited by the abundance of the elements in the earth's crust, a fact that must be faced in the sustainability of materials availability. In that limit, the market price of the element may track with abundance in terms of weight from earth's crust, when hindrances from mining and other factors shall be negligible. We note that current market prices for elements such as Te, Re, He, Hg, Cd, Se, and As are always below the calculated price per kg from abundance in terms of weight from earth's crust; this observation implies that they are sold at lower prices and it will lead to faster depletion of the resource. By this analysis, Te and He should be sold at prices at least ~ 100–1000 times more than their present price for efficient utilization. The calculated price per kg from availability in terms of weight from global mining production for Ge, Be, Li, Sc, and C in industrial diamond is always higher than the market price and calculated price per kg from abundance in terms of weight from earth's crust. This implies that such elements are sold at a lesser cost. Such lessening of the price could be due to lesser market for these elements, thereby compromising on the price. Given its utility and ease of use, rank could be a valuable metric in materials economics. Factors such as energy requirements in production, environmental impacts on human health, and ecosystems, and social disruptions associated with resource production are specific to some elements and are therefore not studied in this article. These factors can certainly contribute to the price of the elements and therefore could act as limitations in our analysis. However, from our observations the analysis using Zipf’s law serves as a good approximation (efficient/sustainable prices are not solely determined by these physical measures of abundance/availability according to Zipf’s law) to the price of the elements. We hope that this analysis will help us better to understand the market price of materials. Perhaps our approach can benefit in materials economics related to mars mission and other space missions where materials extraction from out of earth objects for economic gain can become a part of the mission.