The Nobel Family

Nobel laureates cluster together. 696 of the 727 winners of the Nobel Prize in physics, chemistry, medicine, and economics belong to a single academic family tree. 668 trace their ancestry to Emmanuel Stupanus, 228 to Lord Rayleigh (physics, 1904). Craig Mello (medicine, 2006) counts 51 Nobelists among his ancestors. Chemistry laureates have the most Nobel ancestors and descendants, economics laureates the fewest. Chemistry is the central discipline: its Nobelists have trained, and have been trained by, Nobelists in other fields. Nobelists in physics have trained Nobelists in other fields; Nobelists in medicine have been trained by them. Economics stands apart. Openness to other disciplines is the same in recent and earlier times. The familial concentration of Nobelists is lower now than it used to be.


I. Introduction
The Nobel Prize is the highest accolade in academia. Who are the winners? What made them into what they are? This paper sheds partial light on that last question, mapping the academic ancestry of Nobelists. There are 727 Nobel laureates. There are 25 family trees with a single Nobelist, 4 trees with 2 Nobelists, and 1 tree with 696 Nobelists. This is a remarkable agglomeration of excellence.
The clustering of Nobel Prize winners has been documented before (Zuckerman, 1996; Chan and Torgler, 2015), but not in terms of academic genealogy. The only comparable effort is limited to the winners of the Sveriges Riksbank Prize in Economic Sciences in Memory of Alfred Nobel (Tol, 2022b). The current paper extends that family tree to the Nobel Prizes in physics, chemistry, and medicine or physiology. (The prizes for literature and peace are of an entirely different nature.) A family tree shows more than just clustering. It allows for the identification of key figures in research training, as revealed by the number of and closeness to Nobel descendants. It also distinguishes Nobelists who are insiders from those who are not. The paper also uses a newly defined measure of crosscloseness (Tol, 2023) to identify Nobelists who studied with other Nobelists. I also analyze differences between the four disciplines in terms of their respective concentration of Nobelists and their openness to other disciplines.
The paper proceeds as follows. Section II discusses the data and methods. Section III shows the results for Nobel descendants, ancestors, and peers, as well as differences between disciplines and changes over time. Section IV concludes.

II. Data and methods

A. Data
I constructed the academic ancestry of all Nobel laureates, focusing on PhD advisor-advisee relations in recent times and on wider mentor-mentee relations for earlier periods. The main source of information is AcademicTree. The database was largely complete at the start of this project and updated where needed.
AcademicTree is a wiki. For recent times, its main source of information is ProQuest, a database of all PhD theses completed at a consortium of major research universities. A number of volunteers have added great historical depth to the data. Other volunteers have added data about themselves or people close to them. The result is uneven coverage. Prominent researchers, however, are likely to be included.
I added Nobel laureates and their ancestors who were not already included using Mathematics Genealogy, RePEc Genealogy, Wikipedia and a range of other sources, including biographies, obituaries, and PhD theses. In a few cases, I emailed individuals.
The definition of "advisor" is problematic. Formalities and practice vary strongly over time, between countries, between disciplines, and between institutions. It is not uncommon among prominent emeriti in Western Europe to have only a Master's degree and, in the generations before that, we find people who were home-schooled or self-taught. In other places or recent times, a PhD counts for little; it is the Habilitation that matters, or the second PhD, or the post-doctoral fellowship. In some universities, professors jealously guard their students whereas in other places it takes a village to train a researcher. On top of that, the formal advisor may differ from the actual teacher. These caveats notwithstanding, this is the best data available.
Ancestors were added until the respective Nobelists were connected to the main family. If no connection was possible, four generations of ancestors were added, if known. The resulting tree has 33 generations, with Erasmus as Urahn.

B. Methods
Data were transferred to Matlab and stored as a directed acyclic graph or polytree for analysis and visualization. Representation as a polytree offers a number of standard measures of centrality. I use the harmonic mean distance, where distance is the number of edges between two nodes. The harmonic mean is defined for unconnected polytrees, as is the case here, and emphasizes proximate over distant relations. I define distance as the distance to a Nobel laureate, rather than to any node. Besides the standard outcloseness for academic ancestors and incloseness for descendants, I also define and use crosscloseness to measure the distance to Nobel siblings and cousins. I analyze these measures for all Nobel laureates and separately for Physics, Chemistry, Physiology or Medicine, and Economics.
More precisely, the distance from a node $i$ in a graph to the rest of this graph can be measured by the Hölder mean

$$D_i(h) = \left( \frac{1}{|J|} \sum_{j \in J} D_{j,i}^{h} \right)^{1/h} \tag{1}$$

where $D_{j,i}$ is the distance from node $i$ to any node $j$, that is, the number of edges between node $i$ and node $j$. The set $J$ typically includes all nodes $j \neq i$ but may be restricted to nodes with a particular characteristic. Here, $J$ contains only Nobelists.
For $h = 1$, the Hölder mean is the arithmetic mean. This can be computed using the Matlab function centrality, which is included in the standard release. Note that $D_i(1) = \infty$ unless node $i$ descends from all other nodes in set $J$. This makes the arithmetic mean less suitable for any application to unconnected graphs, as is the case here.
For $h = -1$, the Hölder mean is the harmonic mean, which is bounded if some nodes in the network cannot be reached. In other words, the harmonic mean applies to connected as well as unconnected subgraphs: for unreachable nodes $D_{j,i} = \infty$, so $1/D_{j,i} = 0$. Marchiori and Latora (2000) propose this as a measure of distance, Gil-Mendieta and Schmidt (1996) its inverse as a measure of closeness.
The Hölder mean distance can be used to emphasize proximity at the expense of distal relationships. Close relations are further emphasized as $h$ becomes more negative.
Equation (1) is an outcloseness measure. Outcloseness on a polytree measures ancestry. Replacing $D_{j,i}$ by $D_{i,j}$ in Equation (1) yields an incloseness measure, measuring descent.
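The distance measure of Equation (1) can be illustrated with a short Python sketch. This is not the paper's Matlab code. The function names, the toy graph, and the choice to average over all members of $J$ other than $i$ are assumptions made for illustration. Edges are stored as a map from each person to their advisors, so distances run from a node to its ancestors (outcloseness); passing the reversed map would give the incloseness variant.

```python
from collections import deque
import math


def distances_from(graph, source):
    """Breadth-first search: number of edges from source to each reachable node."""
    dist = {source: 0}
    queue = deque([source])
    while queue:
        node = queue.popleft()
        for nxt in graph.get(node, ()):
            if nxt not in dist:
                dist[nxt] = dist[node] + 1
                queue.append(nxt)
    return dist


def holder_mean_distance(graph, i, targets, h=-1.0):
    """Hoelder mean (h != 0) of the distances from node i to the nodes in targets.

    Unreachable targets have infinite distance: they contribute zero to the
    sum when h < 0 and make the mean infinite when h > 0.
    """
    dist = distances_from(graph, i)
    others = [j for j in targets if j != i]
    if not others:
        return math.inf
    total = 0.0
    for j in others:
        d = dist.get(j, math.inf)
        if math.isinf(d):
            if h > 0:
                return math.inf
            continue  # infinity ** h == 0 for h < 0
        total += d ** h
    if total == 0.0:
        return math.inf  # no target is reachable
    return (total / len(others)) ** (1.0 / h)


# Hypothetical toy data: C studied with B, B and D studied with A.
advisors = {"C": ["B"], "B": ["A"], "D": ["A"]}
nobelists = {"A", "B", "D"}

harmonic = holder_mean_distance(advisors, "C", nobelists)            # 2.0
arithmetic = holder_mean_distance(advisors, "C", nobelists, h=1.0)   # inf, D is unreachable
closeness = 1.0 / harmonic                                           # 0.5
```

In the toy example, C is one edge from B and two from A, while D cannot be reached, so the harmonic mean distance stays finite whereas the arithmetic mean does not.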
Outcloseness and incloseness measure the vertical distance, between parents and children. The horizontal distance, crosscloseness (Tol, 2023), is of interest too: siblings can be just as influential as parents. The horizontal distance $H_{i,j}(n)$ of node $i$ to node $j$ on a polytree is defined as the number of ancestors of generation $n$ that $i$ and $j$ share, divided by the maximum number of ancestors. In biology, $H_{i,j}(1) = 1$ for siblings, $H_{i,j}(1) = 0.5$ for half-siblings, and $H_{i,j}(1) = 0$ for everyone else; $H_{i,j}(2) = 0.5$ for first cousins, $H_{i,j}(3) = 0.25$ for second cousins, and so on.
Having constructed the matrix $H$ of horizontal distances, the inverse of the generalized mean of Equation (1) then defines crosscloseness.
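The horizontal measure can be sketched as follows. The code assumes that "the maximum number of ancestors" means the larger of the two nodes' generation-$n$ ancestor counts; that reading, the function names, and the toy pedigree are illustrative assumptions rather than the paper's implementation. The sketch reproduces the biological values quoted above.

```python
def ancestors_at(advisors, node, n):
    """Set of ancestors exactly n generations above node (n = 1 gives the advisors)."""
    level = {node}
    for _ in range(n):
        level = {a for v in level for a in advisors.get(v, ())}
    return level


def horizontal(advisors, i, j, n):
    """Shared generation-n ancestors over the larger of the two ancestor counts.

    The denominator is an assumed reading of "maximum number of ancestors".
    """
    a_i = ancestors_at(advisors, i, n)
    a_j = ancestors_at(advisors, j, n)
    denom = max(len(a_i), len(a_j))
    return len(a_i & a_j) / denom if denom else 0.0


# Hypothetical biological pedigree: child -> parents.
parents = {
    "x": ["m", "f"], "y": ["m", "f"],        # full siblings
    "z": ["m", "g"],                         # half-sibling of x and y
    "p1": ["gm", "gf"], "p2": ["gm", "gf"],  # siblings p1 and p2
    "q1": ["a", "b"], "q2": ["c", "d"],      # unrelated spouses
    "c1": ["p1", "q1"], "c2": ["p2", "q2"],  # first cousins
}

siblings = horizontal(parents, "x", "y", 1)        # 1.0
half_siblings = horizontal(parents, "x", "z", 1)   # 0.5
first_cousins = horizontal(parents, "c1", "c2", 2) # 0.5
```

Applying the generalized mean of Equation (1) to the resulting values, restricted to Nobelists, would then give crosscloseness; that step is omitted here.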

III. Results
Figure 1 shows the main family tree of 696 Nobel laureates. Figure A1 in the Appendix shows all trees; Table A1 lists the Nobel Prize winners who are not part of the main tree. Nobelists are colour-coded by discipline. Node size is proportional to the sum of out-, in-, and crosscloseness. Figure 1 shows a thick cluster of nodes, with some separation between physics, chemistry, and medicine, and with economics as an outgrowth.
There are 360 professor-student pairs who both won the Nobel Prize, 255 of them in the same discipline. These numbers increase to 863 pairs, 431 of them in the same discipline, if we include grandprofessor-grandstudent pairs and more distant relationships. This highlights just how tightly knit the Nobel tree is.
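Pair counts of this kind can be obtained by walking the advisor graph. The sketch below is a minimal illustration on hypothetical data. The function names are assumptions, and the exact counting convention behind the reported numbers (for example, how far "more distant relationships" extend) is not specified in the text.

```python
def all_ancestors(advisors, node):
    """Every academic ancestor of node, found by depth-first traversal."""
    seen = set()
    stack = list(advisors.get(node, ()))
    while stack:
        a = stack.pop()
        if a not in seen:
            seen.add(a)
            stack.extend(advisors.get(a, ()))
    return seen


def nobel_pairs(advisors, field):
    """Count laureate pairs; field maps each laureate to a discipline.

    Returns (direct pairs, direct same-field, lineage pairs, lineage same-field),
    where a lineage pair is any ancestor-descendant pair of laureates.
    """
    direct = [(s, p) for s in field for p in advisors.get(s, ()) if p in field]
    lineage = [(s, a) for s in field
               for a in all_ancestors(advisors, s) if a in field]

    def same_field(pairs):
        return sum(field[s] == field[a] for s, a in pairs)

    return len(direct), same_field(direct), len(lineage), same_field(lineage)


# Hypothetical toy data: x is not a laureate but links b to a.
advisors = {"c": ["b"], "b": ["x"], "x": ["a"], "d": ["a"]}
field = {"a": "physics", "b": "chemistry", "c": "chemistry", "d": "physics"}

counts = nobel_pairs(advisors, field)  # (2, 2, 4, 2)
```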

A. Nobel descendants
Emmanuel Stupanus is the nearest common ancestor of 668 Nobelists, almost all of the 696 Nobelists in the main tree. Stupanus was a 17th-century professor at the University of Basel, best known for his opposition to empirical evidence in medicine. He trained a few students (Franz de le Boë, Johann Bauhin and Nikolaus Eglinger), but their students were more numerous and influential. See Figures A2, A3 and A4.
The Nobelist with the most Nobel descendants (228) is John Strutt, Lord Rayleigh (physics, 1904), the professor of Joseph Thomson (physics, 1906).

Georg Lichtenberg is the central-most professor in the network. Lichtenberg was an 18th-century physicist at the University of Göttingen, best known for his work on electricity. He also trained a large number of scientists, who in turn trained more. See Figure A5 for the first two generations. In both Lichtenberg and Stupanus, we find a common ancestor who is not renowned for his contributions to science, but who was influential in training young scientists, including in the art of training young researchers.
The central-most Nobel professor, and the 12th-most central professor overall, is John Strutt. Ernest Rutherford is the highest-ranked Nobelist in chemistry (joint 75th), Otto Warburg in medicine (479th), and Wassily Leontief in economics (595th).
The central-most student is Victor Ambros, who was Craig Mello's professor and is therefore closer to Mello's academic ancestors. Mello is the most central Nobel student and the 3rd-most central student overall, after Fritz Melchers, who was one of Georges Kohler's professors. Seven of the top ten Nobelists are in medicine, three in chemistry. Martin Perl (1995) is the highest-ranked physicist at 29, Esther Duflo the highest-ranked economist at 82.
As noted above, 31 of the 727 Nobelists are not connected to the main family. There are 66 Nobelists who have no Nobel ancestry and no Nobel peers. Another 130 Nobelists have fellow students who won the Nobel Prize but no professors who did.
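Membership of the main family can be checked by grouping people into separate family trees, that is, the weakly connected components of the advisor graph. The union-find sketch below illustrates the idea on hypothetical data; it is not the procedure used in the paper, and all names are assumptions.

```python
def family_trees(advisors, laureates):
    """Number of laureates in each separate family tree, largest first."""
    parent = {}

    def find(x):
        # Locate the representative of x's tree, shortening the path as we go.
        parent.setdefault(x, x)
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    # Each advisor-student link merges two trees, ignoring edge direction.
    for student, profs in advisors.items():
        for prof in profs:
            parent[find(student)] = find(prof)

    counts = {}
    for laureate in laureates:
        root = find(laureate)
        counts[root] = counts.get(root, 0) + 1
    return sorted(counts.values(), reverse=True)


# Hypothetical toy data: b and c share advisor a; e studied with d; f has no known links.
advisors = {"b": ["a"], "c": ["a"], "e": ["d"]}
sizes = family_trees(advisors, {"b", "c", "e", "f"})  # [2, 1, 1]
```

A laureate is outside the main family when their tree is not the largest one in the returned list.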

C. Shared ancestry
The central-most fellow student of Nobelists is Emil Fischer (chemistry, 1902) who, with August Kekulé and Adolf von Baeyer as professors, studied with an amazing cast of later Nobelists. Figure 2 shows all grandstudents of Fischer's grandprofessors, that is, his academic siblings and cousins, who either won the Nobel Prize or have descendants who did. This is a remarkable cluster of excellence.
Harold Urey (chemistry, 1934) is the 2nd-most central peer. He studied under Gilbert Lewis and Niels Bohr, together with many other prominent scholars. The top 12 central-most enNobeled fellow students are all chemists. Karl Landsteiner (1930) is the highest-ranked Nobelist in medicine at 13. Julian Schwinger (1965) tops the physics list at 17, Tjalling Koopmans (1975) the economics list at 68, although he has more academic cousins in physics than in economics.

D. Differences between disciplines
Figure 1 and the results above suggest that different disciplines play different roles. This is underlined in Table 1 (proximal descent) and Table 2 (distal descent). Table 1 shows that 96 Nobel laureates in chemistry have students who won the Nobel Prize: 66 in chemistry, 12 in physics, and 18 in medicine. Medicine laureates trained chemistry laureates but no physics ones. Economics laureates neither trained nor were trained by laureates in other disciplines. Table 2 reveals a similar pattern, with chemistry firmly in the centre, training more of the laureates in other disciplines and receiving more training from them. Some physics laureates can trace their ancestry to medicine ones. Some economics laureates have ancestry in physics and chemistry, or in medicine.
Table 3 amplifies this result. The average Nobelist has 4.6 Nobel ancestors; therefore, the average Nobelist also has 4.6 Nobel descendants. These numbers vary between fields. Chemistry Nobelists have the most Nobel ancestors (5.9), economics Nobelists the fewest (1.0). This difference is statistically significant, as are the differences with physics (4.7) and medicine (4.9), which lie in between. On average, physics (3.5) and chemistry (3.5) have the most Nobel ancestors from their own field, followed by medicine (1.9) and economics (0.8). The majority (59%) of Nobel ancestors of Nobel laureates in medicine are from other fields, compared with about a third (34%) for chemistry, a fifth (21%) for physics, and only 6% for economics. These differences are statistically significant.
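The equality of the average number of Nobel ancestors and Nobel descendants follows from double counting. As a sketch, write $J$ for the set of laureates and $a_i$ and $d_j$ for the number of Nobel ancestors of laureate $i$ and Nobel descendants of laureate $j$ (this notation is assumed here for illustration). Every ancestor-descendant pair of laureates is counted once in each sum:

```latex
\sum_{i \in J} a_i
  = \bigl|\{(i,j) \in J \times J : j \text{ is an ancestor of } i\}\bigr|
  = \sum_{j \in J} d_j
\quad\Longrightarrow\quad
\frac{1}{|J|} \sum_{i \in J} a_i = \frac{1}{|J|} \sum_{j \in J} d_j .
```

The identity holds for the pooled averages only. Within a field the two averages can differ, because a laureate's Nobel ancestors and descendants may have won in other fields.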
Table 3 also shows the average number of descendants. Chemistry (7.0) and physics (6.2) Nobelists have the most Nobel descendants, followed by medicine (2.6) and economics (0.8). The number of Nobel descendants by field equals the number of Nobel ancestors by field. Chemistry laureates have the largest share (43%) of Nobel descendants in other fields, statistically significantly more than physics (29%) and medicine (22%). Economics laureates have no Nobel descendants in other fields.
Overall, clustering of Nobel laureates in family trees is strongest in chemistry and physics, and weakest in economics. Chemistry laureates train most laureates in other fields; medicine laureates are trained most by laureates in other fields. Economics is the most isolated of the four fields.
Figure 4 plots the number of Nobel descendants divided by the number of Nobel laureates against the year of the award. There is a clear downward trend. That is, the number of Nobel laureates has grown faster than the number of Nobel descendants of Nobel laureates. The slight upward trend in Figure 3 notwithstanding, the Nobel tree has grown less concentrated over time.
Figures A6 and A7 plot the fraction of Nobel ancestors and descendants, respectively, of Nobel laureates who won in a different field against the year of the award. There has been no significant or substantial change over time. Overall, fields are as open (or closed) to outside influence now as they were in the past.
Figure A8 plots the fraction of Nobel laureates who do not have a Nobel Prize winner among their ancestors. This fraction starts relatively high. The early Nobelists studied with venerable researchers who could not have won a prize that had yet to be instituted. From around 1950 onwards, however, the fraction is roughly stable, even though the number of past Nobelists keeps increasing.
Figure A9 plots the fraction of Nobel laureates for whom neither their professors nor their fellow students won the Nobel Prize. This fraction has increased over the last 40 years or so. As with Figure A8, this suggests that the Nobel Prize has opened up to people from non-Nobel families.
IV. Discussion and conclusion
I construct the academic family tree of all 727 winners of the Nobel Prize in physics, chemistry, and medicine and the Nobel Memorial Prize in economics. 96% of all laureates belong to one family tree; 92% of laureates are related in the sense that their professor's professor's ... professor was Emmanuel Stupanus. 31% of Nobel Prize winners descend from Lord Rayleigh, who won the physics prize in 1904. 7% of Nobel laureates are ancestors of Craig Mello, who won the medicine prize in 2006. Chemistry laureates have the highest number of Nobelists among their ancestors and descendants, economics laureates the lowest. Chemistry Nobelists have trained and are trained by Nobelists in other fields. Physics Nobelists have trained others, and medicine laureates are trained by others. Economics sits largely apart. Openness to other disciplines has not changed over time, but the familial concentration of Nobelists has fallen.
The analysis in this paper is limited to formal teaching relationships. It does not include other forms of scientific collaboration, such as co-authorship (Kademani et al., 2005; Fields, 2015a,b; Bai et al., 2021), informal mentoring, collegiality, and competition. Such relationships are important too, but harder to map. I do not look at the almae matres of the Nobelists or where they did their most important work (Schlagberger, Bornmann and Bauer, 2016). I study neither the methods and flow of ideas (Chan and Torgler, 2015) nor citations (Bjork, Offer and Söderberg, 2014; Sangwal, 2015; Zhang, Zuccala and Ye, 2019; Frey and Gullo, 2020; Kosmulski, 2020). Indeed, Emmanuel Stupanus would be aghast at the empirical research of most of his Nobel descendants.
A key question is not answered in this paper. Does the concentration of Nobelists arise because the best professors select the best students (Athey et al., 2007) and teach them well (Jones and Sloan, 2021), or because Nobelists have a strong voice in later awards and disproportionately nominate their protégés (Zuckerman, 1996)? Examination of the minutes of the awarding committees suggests that the latter explanation is at least partially true (Economist Data Team, 2021; but see Tol, 2022a). Further study would be welcome.
B. Nobel ancestry
Craig Mello (medicine, 2006) has the most Nobel ancestry: 51 of his academic ancestors won the Nobel Prize. Georges Kohler (medicine, 1984) comes second with 42, followed by Robert Horvitz (medicine, 2002) with 31, and Roger Kornberg (chemistry, 2006) and David Julius (medicine, 2021) with 30 each. Four of the top five won in medicine, as did seven of the top 10; the rest won in chemistry. The physics Nobelist with the Noblest ancestry is Eric Cornell (2001) with 23, ranking 14th. Esther Duflo (2019) is the highest-ranked economist, a shared 134th, with 8 Nobel ancestors.

Figure 3 plots the number of Nobel ancestors divided by the number of Nobel laureates against the year of the award. There is a slight upward trend. That is, the number of Nobel ancestors of Nobel laureates has grown faster than the number of Nobel laureates.

Figure 1. The main Nobel network. The colour denotes the discipline: red = medicine, blue = physics, green = chemistry, light blue = economics, grey = not a Nobel laureate. The size denotes proximity to Nobel laureates, measured as the sum of in-, out-, and crosscloseness.

Figure 2. Academic siblings and cousins of Emil Fischer. The colour denotes the discipline: red = medicine, blue = chemistry, grey = not a Nobel laureate, but an ancestor of Nobelists.

Figure 4. The number of Nobel descendants over the number of Nobel laureates over time.

Figure A5. Students and grand-students of Georg Christoph Lichtenberg, the central-most scholar.

Figure A6. The fraction of Nobel ancestors of Nobelists who won their Nobel Prize in a different field, over time.

Figure A7. The fraction of Nobel descendants of Nobelists who won their Nobel Prize in a different field, over time.

Figure A9. The fraction of Nobelists who have no Nobel ancestry or peers over time.
Joseph Thomson (physics, 1906) comes second in the number of Nobel descendants, with 227. Seven other Nobelists have more than 100 Nobel descendants: Adolf von Baeyer (chemistry, 1905), Wilhelm Ostwald (chemistry, 1909), Ernest Rutherford (chemistry, 1908), Emil Fischer (chemistry, 1902), Max Born (physics, 1954), Niels Bohr (physics, 1922), and Walther Nernst (chemistry, 1920). Of the nine Nobelists with more than 100 Nobel descendants, five hold Nobel Prizes in chemistry and four in physics. John Strutt is the Nobelist with the most descendants (126) who won the Nobel Prize in physics. Adolf von Baeyer tops the list in chemistry, with 107 Nobel descendants. Strutt and von Baeyer descend from de le Boë; see Figures A2 and A3. The numbers are much lower in medicine: Otto Warburg (1931) has the largest Nobel descent at 35. The prize in economics is much younger. Wassily Leontief (1973) has the largest number of Nobel descendants (15).

Table 2 - Nobel laureates as academic ancestors.

Table 3 - Average (standard error) number of Nobel ancestors and descendants of Nobelists.