“The language of Dirac’s theory of radiation”: the inception and initial reception of a tool for the quantum field theorist

Ehberger, Markus

doi:10.1007/s00407-022-00293-8

“The language of Dirac’s theory of radiation”: the inception and initial reception of a tool for the quantum field theorist

Open access
Published: 05 July 2022

Volume 76, pages 531–571, (2022)
Cite this article

Download PDF

You have full access to this open access article

Archive for History of Exact Sciences Aims and scope Submit manuscript

“The language of Dirac’s theory of radiation”: the inception and initial reception of a tool for the quantum field theorist

Download PDF

Markus Ehberger ORCID: orcid.org/0000-0002-8080-0782¹

3321 Accesses
1 Citation
2 Altmetric
Explore all metrics

A Correction to this article was published on 31 October 2022

This article has been updated

Abstract

In 1927, Paul Dirac first explicitly introduced the idea that electrodynamical processes can be evaluated by decomposing them into virtual (modern terminology), energy non-conserving subprocesses. This mode of reasoning structured a lot of the perturbative evaluations of quantum electrodynamics during the 1930s. Although the physical picture connected to Feynman diagrams is no longer based on energy non-conserving transitions but on off-shell particles, emission and absorption subprocesses still remain their fundamental constituents. This article will access the introduction and the initial reception of this picture of subsequent transitions (PST) by conceiving of concepts, models, and their representations as tools for the practitioners. I will argue for a multi-factorial explanation of Dirac’s initial, verbally explicit introduction: the mathematical representation he had developed was highly suggestive and already partly conceptualized; Dirac was philosophical flexible enough to talk about transitions when no actual transitions, according to the general interpretation of quantum mechanics of the time, occurred; and, importantly, Dirac eventually used the verbal exposition in the same paper in which he introduced it. The direct impact of PST on the conception of quantum electrodynamical processes will be exemplified by its reflection in diagrammatical representations. The study of the diverging ontological commitments towards PST immediately after its introduction opens up the prehistory of a philosophical debate that stretches out into the present: the dispute about the representational and ontological status of the physical picture connected to the evaluation of the perturbative series of QED and QFT.

Quantum Electrodynamics

Quantum Field Theory Beyond the S-Matrix Formalism

Article 01 July 2020

Thermal Field Theory in real-time formalism: concepts and applications for particle decays

Article Open access 23 February 2021

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

This article will focus on the verbal representation of the model that was, and partly still is, underlying the perturbative evaluation of quantum electrodynamical (QED) phenomena. I will engage with Paul Dirac’s initial introduction of a verbally explicit description of the light-matter interaction in terms of temporally ordered transition subprocesses, each accompanied by an emission or absorption of a photon, even though these subprocesses did not refer to actual but to (in modern terminology) virtual transitions. I will ask why it was Paul Dirac, and not someone else, who introduced this kind of representation. I will discuss its impact on other representational formats and the plethora of stances physicists took towards the verbal explication immediately after Dirac’s introduction. The basic idea behind my exploration is an understanding of theory as practice and of concepts, their representations and the models they figure in as tools for practitioners. As I will use it frequently, I will abbreviate the analytical term “verbal representation of the model” with “verbal model” and, if I refer to the late 1920s and 1930s specifically, I will follow one of the actors and call it “das Bild ‘aufeinander folgender’ Übergänge” (Kockel 1937, 162) or, the “picture of ‘successive’ transitions” (PST for short).

Even though the conception of processes in terms of PST partly differs from the physical picture connected to Feynman diagrams, I want to start out with a quick look at these diagrams as it can serve to introduce the overarching topic of this article: The use of physical reasoning in the evaluation of abstract mathematical procedures within theoretical physics, especially when a direct connection between physical language and real-world process is not warranted or called into doubt by the actors themselves. Feynman diagrams, or more to the point, their interpretation, can exemplify how a processual language and the concepts figuring in it can have a constructive function, how they can be considered tools for the quantum field theorist.

Surely, there is no doubt that Feynman diagrams are an extraordinarily useful tool in the everyday business of quantum field theorists. Even philosophers who deny Feynman diagrams any ontological or representational status acknowledge the tool character of the diagrammatic technique.^{Footnote 1} In the following, however, I do not want to focus on the representational format of diagrams but on the processual language, i.e. the physical reasoning that can be used to construct them.

When Richard Feynman introduced his readers to the first Feynman diagram published by himself (see Fig. 1) he did so with a physical interpretation: The electrons (note the arrow of time on the left) would travel some way, one of them would emit and the other absorb a “virtual quantum,” not obeying the relativistic energy–momentum relation, in principle unobservable and represented by the squiggly line in the middle. The virtual photon itself would travel from the point where it was emitted to the point where it will be absorbed and, thereby, it would mediate the interaction between the two electrons. Then, the electrons would go off to infinity without further interaction.^{Footnote 2}

According to the mainstream of the philosophical literature on Feynman diagrams and virtual particles, such a picturesque description has no realistic or physical content.^{Footnote 3} As the conclusion of the arguments goes, Feynman diagrams (at least a single diagram on its own) should not be conceived of as a representation of a physical process but of a mathematical structure, a term in an infinite perturbative series of contributions to the calculation of one single physical effect. No more, no less.

And actually, starting from their introduction in the late 1940s, Feynman diagrams were not necessarily interrelated with a physical interpretation. They can perform their function as calculational aids through their topological features and an accompanying set of rules for drawing and translating them into mathematical expressions. David Kaiser (2005, especially the first section of chapter 5) has termed the divide between physical interpretation and mere topological evaluation the Feynman (physical interpretation)–Dyson(topological constructs) split in reference to the historical protagonists who first exemplified these stances.

The Feynman–Dyson-split still resonates in the different stances towards the diagrams in perturbative evaluations. While Mandl and Shaw (1984, 56) note in their textbook that “the reader must be warned not to take this pictorial description of the mathematics as a literal description of a process in space and time,” Peskin and Schroeder (1995, 3) encourage their reader “[to] imagine a process that can be carried out by electrons and photons, draw a diagram, and then use the diagram to write down the mathematical form of the quantum-mechanical amplitude for that process to occur.” But these two points of view are not mutually exclusive. For example, Mandl and Shaw (1984, 54–55) use a physical and processual description to introduce the idea of Feynman diagrams right before their warning and thereby point towards the role of the physical reasoning this article is concerned with.

If we take Peskin’s and Schroeder’s above-mentioned encouragement at face value, the constructive process does not start with the diagram but with the physical reasoning underlying it; the diagrams are the representational format in which this reasoning is cast to translate it into mathematical expressions. Feynman himself opened and ended his own physical interpretation by noting that it “will permit us to write down the higher order terms” and that “the correct terms of higher order in $e^2$ or involving larger numbers of electrons (interacting with themselves or in pairs) can be written down by the same kind of reasoning” Feynman (1949, 773). In this sense, the interpretation in terms of processes, the storyline that is connected to a diagram or to the respective term in the perturbative series, is more than a mere interpretation. This story and the method of telling it have a tool character and are used for what a lot of every-day tools are used for: construction.

Certainly, Feynman’s version of QED constituted a digression from QED as it was practiced and conceived of in the 1930s.^{Footnote 4} His space-time diagrams show similarities to Minkowski diagrams or bubble chamber pictures rather than to diagrams that were used in the theoretical evaluation of QED before the late 1940s.^{Footnote 5} Feynman’s application of diagrammatical techniques further had its roots in his struggle with the Dirac equation and Feynman’s main goal was a physical understanding suitable for the elimination of the divergences plaguing QED.^{Footnote 6} Yet part of the physical picture connected to the diagrams, the storyline of subsequent acts of emission and absorption processes, is older than the diagrammatical technique. As is well known, even before the invention of Feynman diagrams there was a mode of physical reasoning applied to evaluate QED and it served the same purpose as the story connected to a Feynman diagram today. PST, as introduced by Paul Dirac in 1927, was used by physicists, albeit knowing of its possibly fictional character, to construct mathematical representations of the phenomena they were investigating. In the following, I will engage with this mode of reasoning. More to the point, I will engage with the verbal representation of it, the language physicists used in their practice.

The approach I will take has been heavily influenced by pragmatic accounts of conceptual development. In the literature on the history and philosophy of science, the understanding of concepts as tools for researchers has proven a valuable angle for understanding scientific practice and its development.^{Footnote 7} Similarly, nearly from the onset of the so-called practice turn in the mid-1980s, it has been argued that representations, whatever form they might take, should not be conceived of as mere depictions but as “means for doing things, tools for intervening” (Soler et al. 2014, 23).^{Footnote 8}

To clarify, I want to point out that a model or a concept and their representations are not equivalent in my understanding. When I refer to a representation, I always mean something concrete, specific and manifest that has been written down in some way by the historical actors. A model or a concept, on the other hand, come in different representations. Each representation enables specific inferences to be drawn, but also entails specific constraints on thinking possibilities. Such constraints might be internal, dictated by the representational format itself.^{Footnote 9} But the constraints can also be externally dictated by other representations that the actors considered to describe the same model.^{Footnote 10}

As in the example of Feynman diagrams, I will conceive of the whole interpretation of the mathematical structures, i.e. the verbal expressions in which they are cast, as a tool for the respective physicists. The concepts, or rather their representations, involved in the interpretation have a tool character themselves due to the role they play. As the notion of a virtual particle is an indispensable ingredient when trying to imagine and express the processes Feynman diagrams purport to portray, the notion of “virtual transitions” and “intermediate states,” unbound by energy conservation, were an integral part of the conception of quantum electrodynamical processes during the 1930s. In this sense, the following will both be a story of the application of the verbal model and the application of the concepts used to spell it out. I believe that one cannot be told without the other.

In the first section of this article, I will engage with the initial proposal of the verbal model of the perturbative evaluation of quantum electrodynamics by Paul Dirac in 1927. To fully apprehend this introduction, I will revisit the technical and conceptual environment Dirac was working in, his general outlook on (the concepts of) quantum theory and the use he made of PST. Since part of the answer to the question why it was specifically Dirac who introduced PST is the high suggestiveness of Dirac’s mathematical framework, I cannot avoid engaging with a few technicalities.

The second section will outline some of the initial reactions to Dirac’s introduction of PST. I will exemplify its impact on diagrammatic representations and the range of stances towards PST: from ascribing the occurring intermediate states a temporal dimension and using their occurrence for inferences on the existence of physical entities to clear denotation as fiction. I will close this article with a conclusion and an outlook on the use of PST during the 1930s.

1 Dirac’s verbal model for the scattering of light

In his second paper on QED, communicated in April 1927, Dirac introduced his reader to the scattering of light from an atom in the following way:

“[...] radiation that has apparently been scattered can appear by a double process in which a third state, n say, with different proper energy from m [final state] and k [initial state] plays a part. If initially all the b’s [the quantum amplitudes of the states] vanish except $b_k$, $b_n$ [the amplitude of the intermediate state] gets excited on account of transitions from state k by an amount proportional to $v_{nk}$ [the matrix element of the interaction between radiation field and matter], and although it must itself always remain small, a calculation shows that it will cause $b_m$ to grow continually with the time at a rate proportional to $v_{mn}v_{nk}$. The scattered radiation thus appears as the result of two processes $k\rightarrow n$ and $n\rightarrow m$, one of which must be an absorption and the other an emission, in neither of which is the total proper energy even approximately conserved.”^{Footnote 11}

Up to that point, such a verbally explicit and temporally ordered account of the scattering of light in terms of subsequent energy non-conserving or “virtual”^{Footnote 12} transitions to and from intermediate or “virtual” states was (nearly) absent from the papers dealing with this effect.^{Footnote 13} This is all the more striking as similar mathematical structures to the ones Dirac presented in his solution were already known at the time.

Shortly before the publication of Heisenberg’s Umdeutungs-paper, Anthony Kramers and Werner Heisenberg (Kramers and Heisenberg 1925) had developed an account of dispersion, the Kramers–Heisenberg [KH]formula, which became, as Lacki et al. (1999, 462) phrased it, “un passage obligé” for all following versions of quantum mechanics: Born et al. (1926), Schrödinger (1926), and Klein (1927) all rederived this formula in the respective theoretical and conceptual framework. Whether a classical perturbative evaluation was performed that was then translated into quantum theory through the correspondence principle (Kramers and Heisenberg) or whether the material part of the system was treated quantum mechanically from the start, the solution for the intensity of the secondary radiation always contained something of the general structure

$$\begin{aligned} \left\| \sum _n \frac{x_{kn}y_{nm}}{E_m-E_n}\right\| ^2, \end{aligned}$$

(1)

where $x_{kn}$ and $y_{nm}$ are the “characteristic amplitudes”^{Footnote 14} in Kramers and Heisenberg’s terminology, or, in the later conception, the dipole matrix elements between the two states k and n.

As the initial (m) and final (k) states of the processes were connected through the third level n and as the squares of the operators $x_{nk}$ were and are interpreted as proportional to the probability of a transition between the states n and k occurring, an interpretation in terms of transitions suggests itself. Kramers and Heisenberg even provided diagrams to visualize the mathematical structures above, but in a rather abstract action-variable space and without an explicit temporal dimension (see Fig. 2).^{Footnote 15} A verbal description of such formulas in terms of transitions from state m over n to k is lacking, not only in Kramers’ and Heisenberg’s paper but also in the later rederivations of the formula.

The most explicit description I came across prior to Dirac’s was given by Wolfgang Pauli in 1925. In his discussion of the KH formula, the atom made a “detour through a third state.”^{Footnote 16} Nevertheless, Pauli would no longer use this interpretation of the KH formula in his Handbuch-article Pauli (1926) published shortly afterwards.^{Footnote 17} The derived mathematical structures were certainly suggestive but for more than 2 years the practitioners did not describe their results in a temporally ordered fashion of subsequent transitions when dispersion was discussed.

And there are good reasons why such a description should be avoided. To some extent, it contradicts the general interpretation of quantum theory in neglecting the differentiation between quantum amplitudes and probabilities.^{Footnote 18} As you can directly see from Formula (1), the sum over different combinations of matrix elements is squared and refers to a probability or the intensity of the emitted radiation. If we were to identify the matrix elements in Formula (1) with quantum jumps, several processes not conserving energy in the intermediate steps would contribute to the observable phenomenon at once. It is the differentiation between the quantum amplitudes and their squares, the probabilities, that makes a simple interpretation impossible. And it is this differentiation, an important and non-trivial one, that will concern us in the following.

And this brings the question guiding this first section into sharper focus: Why did Dirac eventually, and contrary to prior (and some later) descriptions, start to talk in a specific way about the structures he encountered? More to the point: why did Dirac choose not only to name the structures, but to describe them in terms of temporally ordered subprocesses at least suggesting some kind of causal connection between them and thereby setting “the basic language and concepts characteristic of the modern conception [...]” (Lacki et al. 1999, 484). To engage with this question, we need to take a closer look at the conceptual and technical framework Dirac developed prior to his quantum electrodynamical account of dispersion.

1.1 Dirac’s technical and conceptual background

The output of the young Cambridge physicist in the years 1925 through 1928 was, to say the least, outstanding. Dirac made a long lasting contribution to quantum theory in these years: from an alternative formulation of matrix mechanics (his q-number algebra); through his technical foundation of quantum mechanics (his transformation theory) and the first applications of quantum electrodynamics; to his relativistic description of the electron, just to name the most influential of his achievements. By early 1926, he had made a name for himself in the community of quantum physicists and his work had gathered wider attention.^{Footnote 19}

The following contextualization will focus on the aspects of Dirac’s work which I deem important to understand his invocation of PST. One of the most important aspects of his technical framework is his time-dependent perturbation theory. But, as I will argue, the conceptual framework of his radiation theory, kind of the prototype of QED in the 1930s, and the specific way of its perturbative evaluation played an important role for his invocation.

1.1.1 Time-dependent perturbation theory (Dirac 1926)

In mid-1926 and after some persuasion by Werner Heisenberg,^{Footnote 20} Dirac took up Schrödinger’s wave function in his On the Theory of Quantum Mechanics and developed a version of time-dependent perturbation theory. In this mode of evaluating the formulas, the question was not how the energy levels of the system were altered due to the perturbation but how the occupation of unperturbed states changed over time.^{Footnote 21}

As was common for Dirac, he treated the problem first in most general terms.^{Footnote 22} Dirac evaluated an arbitrary system, described by the Hamiltonian $H_0$, and considered the respective problem to be solvable. Then he introduced a time-dependent external perturbation V starting to act at some moment $t=0$. Using the most general solution, a superposition of eigenstates $\psi =\sum _n c_n\psi _n$, Dirac showed that the wave function of the perturbed system could be expressed in the following way:

$$\begin{aligned} \psi (t)=\sum _n b_n(t)\psi _n, \end{aligned}$$

(2)

where the $\psi _n$ are the unperturbed wave functions and the $b_n$, the new expansion coefficients, are time dependent. Their initial values $b_n(0)$ are given by the coefficients of the unperturbed problem and their temporal development is governed by the following equation:

$$\begin{aligned} i\hbar {\dot{b}}_m=\sum _n V_{mn}b_n, \end{aligned}$$

(3)

where the $V_{mn}$ are the matrix elements of the perturbation between the two unperturbed states m and n.

The unperturbed states of the system referred to energy eigenstates of the atoms under study. Yet, as Dirac was not interested in the behaviour of a single atom but an assembly of similar ones, he did not normalize the sum of the squares of the coefficients $c_n$ and $b_n(t)$ to one but rather to the number of atoms in the respective state. Hence, in Dirac’s initial presentation $\Vert b_n(t)\Vert ^2$ corresponded to the number of atoms in the state n after the perturbation had been acting on this assembly for a time t.

Whether one follows Dirac’s conception of an assembly of atoms or interprets the square of the coefficients as the probability of finding one atom or system in a given state,^{Footnote 23} this perturbative scheme came with a particular understanding: In contrast to time-independent perturbation theory, Dirac did not ask about the influence of the perturbation on the energy levels of the system. Rather, according to the mathematical modelling, the focus lies on the temporal development of the coefficients of the unperturbed states. Which states couple depends on the structure of the perturbation, i.e. which matrix elements of the perturbation $V_{mn}$ are non-zero. The perturbation energy is further not directly part of the system’s energy, but causes alterations in its behaviour. In his papers on radiation theory, Dirac would therefore introduce the term “proper energy” to refer to the energy of the unperturbed part of the system (or, equivalently, to the energy of the total system minus the interaction energy).

Dirac applied his new way of performing perturbative calculations directly in the same paper for a derivation of the Einstein coefficients for induced emission and absorption (Dirac 1926, 675–677). The specifics of this semi-classical evaluation are not important for our purpose. But there is a methodological step which Dirac incorporated here that I will refer to later on. He already indicated how a second order perturbative calculation must be carried out: A first approximation is derived by plugging in the initial values $c_n$ into Eq. (3) and then integrating it with respect to t. For a second approximation, these time-dependent values are re-introduced into Eq. (3). Hence, Dirac essentially described an iterative procedure for the construction of higher order terms.

1.1.2 Radiation theory (Dirac 1927b, c)

Shortly after finishing the paper introducing his version of time-dependent perturbation theory, Dirac went to Copenhagen (September 1926 until early February 1927) and subsequently to Göttingen (until June 1927). Although Dirac was ever more strongly integrated into the circles of quantum physicists, he kept his habit of mostly working on his own. In Copenhagen, Dirac developed the so-called transformation theory,^{Footnote 24} which provided rules for quantizing any dynamical system and provided the formal basis for Heisenberg’s uncertainty paper. The transformation theory was also one of the pillars of Dirac’s radiation theory, which he developed in Copenhagen and which we shall address now.^{Footnote 25}

Essentially, when constructing quantum electrodynamics, Dirac applied quantization to the wave functions, or rather to the coefficients of his perturbation theory, again. Dirac started out with an assembly of bosons and showed that the temporal development of the coefficients could be cast in an Hamiltonian form with the canonical variables $b_r$ and $b_r^{\dagger }$. As before, $N_r$, the number of the systems, was given by $N_r=\Vert b_r\Vert ^2=b_rb_r^*$.^{Footnote 26}

Then, to use Dirac’s language, he did not treat these coefficients as c-numbers but as non-commuting q-numbers and imposed commutation relations on them, so that

$$\begin{aligned}{}[b_r,i\hbar b_s^{\dagger }]=i\hbar \delta _{rs}. \end{aligned}$$

(4)

These operators $b_r$ and $b_r^{\dagger }$ are today known, and were interpreted by Dirac, as creation and annihilation operators. Their action was expressed in the following terms (Dirac 1927c, 252):^{Footnote 27}

$$\begin{aligned} b_r^{\dagger }\psi (N_1,\ldots ,N_r,\ldots )&=\sqrt{N_r+1}\psi (N_1,\ldots ,N_r+1,\ldots ) \end{aligned}$$

(5)

$$\begin{aligned} b_r\psi (N_1,\ldots ,N_r,\ldots )&=\sqrt{N_r}\psi (N_1,...,N_r-1,\ldots ) \end{aligned}$$

(6)

Hence, they raised or lowered the occupation number in the respective state r by 1.

Dirac could generalize this procedure: The second quantized bosonic assembly was coupled to an external perturbation, an atom. The Hamiltonian thereby constructed consisted, besides the proper energies, only of one additional term proportional to the product of each one annihilation and one creation operator: $H\propto H_0 + \sum _{r,s} v_{r,s} b_r^{\dagger }b_s$. Hence, it “will contribute only to those matrix elements that refer to transitions in which $N_r$ decreases by unity and $N_s$ increases by unity” (Dirac 1927c, 252). Without any detour through intermediate states, an initial photon was absorbed and the final photon created. In the same paper, Dirac already referred to such processes as “direct scattering processes” (Dirac 1927c, 263).^{Footnote 28}

In the last section, Dirac left his initial conceptualization of a perturbed assembly of bosons and turned towards “the wave point of view” (Dirac 1927c, 262). By resolving the radiation into its Fourier components, the interaction between radiation and matter was, in classical theory, proportional to $\sum _r A_r{\dot{x}}$, where $A_r$ is the rth Fourier component of the vector potential and x the position variable at the location of the atom times the electric charge. After some manipulation, Dirac could describe the vector potential in terms of the number of light quanta $N_r$ and the conjugate phases $\theta _r$, which allowed him to impose the previously developed quantization rules.

The Hamiltonian Dirac derived in this “wave-theoretic” way only contained matrix elements which changed the occupation number by $\pm 1$. As he himself noted, “it would seem that there are no direct scattering processes [from this wave-theoretic point of view], but this may be due to an incompleteness in the present wave theory” (Dirac 1927c, 263). Since the emission and absorption coefficients could now be treated by this Hamiltonian and the two points of view led to the same Hamiltonians, the missing direct scattering term in the wave point of view set aside for the moment, Dirac still concluded that “there is thus a complete harmony between the wave and light-quantum descriptions of the interaction” (Dirac 1927c, 245).

Hence, in his first paper on quantum electrodynamics, Dirac had already established the notions of creation and annihilation operators acting on the occupation number of the wave functions. Furthermore, he had constructed an interaction Hamiltonian that consisted of a product of such operators. It was responsible for the “direct scattering of light.” Before turning to its counterpart, the scattering through intermediate states, I will revisit a technical aspect of the perturbative evaluation of Dirac’s radiation theory as it provides important insights into Dirac’s reasoning process.

1.1.3 “Dirac’s Mogelei” or how to arrive at sensible results in radiation theory

Today, Dirac’s method^{Footnote 29} is known as Fermi’s Golden Rule No. 2.^{Footnote 30} During the late 1920s and 1930s, it did not necessarily have a name. Once, it was discussed in private communications between Werner Heisenberg and Wolfgang Pauli as “Dirac’s Mogelei.”^{Footnote 31} To find some balance between the “Golden Rule” and the “Mogelei,” I will call it Dirac’s trick. This procedure allowed Dirac and the physicists in his follow-up to construct probabilities in radiation theory that were proportional to the square of matrix elements and linear in time, as was expected by the actors.

Dirac’s trick is actually one of many intricacies of the perturbative evaluation of radiation theory,^{Footnote 32} but studying it and its derivation will provide important insight into Dirac’s way of reasoning in connection with dispersion theory.^{Footnote 33} Let us follow Dirac and discuss Eq. (3) in most general terms. We can simply integrate the equation with respect to t and assume that we know that the system was initially in a specific state $b_n(0)$. This results in

$$\begin{aligned} b_m(t)=v_{mn}b_{n}(0)\frac{1-e^{i(W_m-W_n)t/\hbar }}{W_m-W_n}+\delta _{mn}. \end{aligned}$$

(7)

As Dirac noted, as long as the proper energies of the states m and n “differ appreciably,” the amplitude of the state m varies periodically with time and is small, i.e. “these stationary states are not excited to any appreciable extent” (Dirac 1927c, 258).

While it might seem strange at first that states with energies different from the initial state could be excited, the real problem occurred when Dirac tried to simply look at the transition amplitudes to a state whose energy is exactly equal to the energy of the initial state. Then the coefficient $b_m$ becomes proportional to t.^{Footnote 34} But since only the absolute square of this coefficient refers to the probability of finding the system in the respective state, this would render a quadratic time dependence. And this ran counter to any expectation. Dirac’s trick is a solution to exactly this problem.

As Dirac noted, the probability of finding the system in a state with exactly the same energy “is of no importance, being infinitesimal” (Dirac 1927c, 258). Rather, he proposed to multiply the absolute square of this coefficient with the density of states $(\Delta W_m)^{-1}$ around the energy value of the final state and to integrate over this energy range. Expressed in formulas this meant that

$$\begin{aligned} P_m(t)=\int \frac{dW_m}{\Delta W_m}\Vert b_m(t)\Vert ^2=2\int \frac{dW_m}{\Delta W_m}\Vert v_{mn}\Vert ^2\frac{1-cos[(W_m-W_n)t/\hbar ]}{(W_m-W_n)^2}. \end{aligned}$$

(8)

Dirac now substituted $x=\frac{(W_m-W_n)t}{\hbar }$ and pushed t to infinity.^{Footnote 35} The point is, that by pushing t to infinity (which is the formalization of considering “large” t), the energy peaks around the value of the initial state (see Fig. 3 for Dirac’s representations in his calculational notes)^{Footnote 36} and the integral can be evaluated to give $\pi $. This effectively renders

$$\begin{aligned} \lim _{\text {large}~t}P_m(t)\propto \frac{t\Vert v_{mn}\Vert ^2}{\Delta W_m}, \end{aligned}$$

(9)

where $W_m=W_n$, the proper energy of the initial state. Hence, a transition probability between states of the same energy, proportional to t and the square of the respective matrix element is obtained.

In summary, there are essentially three things to note for Dirac’s later conceptualization of the scattering of light when it comes to this procedure. First of all, Dirac first imposed energy conservation directly and found that then the amplitude of the states would grow proportional to t. As we shall see in a moment, although Dirac correctly identified this as unexpected and reconfigured his calculation, he would use exactly this kind of reasoning in his dispersion paper to find the relevant matrix elements. Second, one can simply square Eq. (7), as Dirac did as well, to arrive at something which should be interpreted as a probability of finding the system in a state with different proper energy than the initial state. Hence, even the probability of finding the system in a state with different proper energy was, albeit fluctuating around a small value, non-zero. Only for longer times, this would reduce to energy conserving transitions.^{Footnote 37}

Third, these states of different proper energy had to be incorporated into the procedure developed by Dirac to arrive at energy conservation in total and to arrive at a linear dependence on t. To some extent, Dirac had to use the energy non-conserving transitions already in first-order approximations to arrive at physically sensible results.

1.2 Dirac’s quantum theory of dispersion (Dirac 1927b)

After these preliminaries, it is time to delve into Dirac’s theory of dispersion. Dirac resolved some of the shortcomings of his prior wave theoretic treatment by starting with the relativistic classical Hamiltonian for the interaction of a charged particle with the electromagnetic field.^{Footnote 38} Expanding it and applying his previous quantization procedure, i.e. treating the number operators and their conjugated phases as non-commuting q-numbers, he derived a three part Hamiltonian:

$$\begin{aligned} H=H_0+V+D. \end{aligned}$$

(10)

$H_0$ described the proper energies of the material part of the system and the radiation field (in terms of $h\nu _r$). V and D were treated as perturbations. V was proportional to a sum over creation and annihilation operators which Dirac had already identified as connected, but not equivalent to emission and absorption processes. The last term, called D here, consisted of a product of such operators and, after some further reductions, boiled down to the direct or true scattering term Dirac had already partially discussed from his “bosonic point of view.”

To derive a dispersion formula, Dirac first applied the iterative procedure he had already used when introducing his perturbation theory: The general solution of the first order equation was plugged back into Eq. (3). To second order, this yielded (Dirac 1927b, 721):

$$\begin{aligned} i\hbar {\dot{b}}_m=d_{mk}e^{i(W_m-W_k)t/\hbar }+\sum _n\frac{v_{mn}v_{nk}}{W_n-W_k}(1-e^{i(W_n-W_k)t/\hbar })e^{i(W_m-W_n)t/\hbar }.\qquad \end{aligned}$$

(11)

Next, Dirac expanded the last term and reordered the whole equation with respect to the time-dependence.

After integration, the probability amplitude of the final state m was given by

$$\begin{aligned} b_m= & {} \left( d_{mk}-\sum _n\frac{v_{mn}v_{nk}}{W_n-W_k}\right) \frac{1-e^{i(W_m-W_k)t/\hbar }}{W_m-W_k}+ \nonumber \\&+\sum _n\frac{v_{mn}v_{nk}}{W_n-W_k}\frac{1-e^{i(W_m-W_n)t/\hbar }}{W_m-W_n}. \end{aligned}$$

(12)

Applying the exact same reasoning as he had done when developing and applying his trick in the previous paper, Dirac analysed the behaviour of this equation by simply imposing energy conservation for initial and final state at first. While the second term remained periodic, the time dependence of the first term became linear. Dirac now interpreted:

“The rate of increase [of the first term] consists of a part, proportional to $d_{mk}$, that is due to direct transitions from state k, together with a sum of parts, each of which is proportional to $v_{mn}v_{nk}$, and is due to transitions first from k to n and then from n to m, although the amplitude $b_n$ of the eigenfunction of the intermediate state always remains small.” (Dirac 1927b, 721)

Having thus identified the relevant terms for the description of dispersion,^{Footnote 39} he could simply apply his trick and derive the probability for transitions to the set of final states. Expressed in formulas this led to:^{Footnote 40}

$$\begin{aligned} \sum \Vert b_m\Vert ^2=\frac{4\pi ^2 t}{h\Delta W_m}\left\| d_{mk}-\sum _n\frac{v_{mn}v_{nk}}{W_n-W_k}\right\| ^2. \end{aligned}$$

(13)

So far, this was actually a rather general argument about perturbation theory: the only functional attribute of the perturbing terms was that d could couple initial and final state directly, while v had to do so through the intermediate state. To connect this argument with physical processes, i.e. the scattering of light, and eventually to get into contact with the previously known and accepted KH formula, Dirac had to consider two different sequences of events. I quote in full:

“We can now take the state n [intermediate state] to be either the state $J=J'',~N_s=N_s'-1,~N_t=0~(t\ne s)$ for any $J''$^{Footnote 41}, which would make the process $k\rightarrow n$ an absorption of an s-quantum and $n\rightarrow m$ an emission of an r-quantum, or the state $J=J'',~N_s=N_s',~N_r=1,~N_t=0~(t\ne r,s)$, which would make $k\rightarrow n$ the emission and $n\rightarrow m$ the absorption.” (Dirac 1927b, 722)

With this set of intermediate states and subprocesses, verbally expressed in terms of PST, Dirac was able to rederive the KH formula in his new conceptual framework. By plugging in the specific form of the interaction matrix elements and after some smaller manipulations, the intensity of the scattered radiation was proportional to

$$\begin{aligned} I_r\propto \left\| \sum _{J''}\left( \frac{x_r(J'''J'')x_s(J''J')}{\nu (J''J')-\nu _s}+\frac{x_s(J'''J'')x(J''J')}{\nu (J''J')+\nu _s}\right) \right\| ^2. \end{aligned}$$

(14)

Here, the $J''$ refer to the variables of the atom in the intermediate state, $\nu (J''J')$ to the frequency corresponding to the energy difference between initial and intermediate state and $\nu _s$ to the frequency of the incident radiation. The two part structure, the addition of the two fractions in the brackets, was a direct consequence of Dirac’s invocation of PST as quoted above. The first fraction resulted from the imagined process of first an absorption and then an emission, the second fraction resulted from the inverse order of events.

Obviously, the quote I used to open this first section (see p. 6) and which Dirac provided in the introduction of his dispersion paper was written after he had finished his calculations. This is what all of the reasoning he had applied boiled down to. As should have become clear, Dirac’s description shifted completely to the quantum amplitude level. He noted that the amplitudes would vary periodically with the time if the initial and final states did not have the same energy. The amplitudes were then “changed only by a small extent” (Dirac 1927b, 712). If initial and final state had the same energy, Dirac stated that the amplitude would increase linearly with the time and thus the transitions became “physically recognizable” (Dirac 1927b, 712). The amount of excitation was then, in Dirac’s verbal description, proportional to the matrix element and not its square. While Dirac still applied his trick to impose energy conservation and to get reasonable results in agreement with priorly established formulas, his verbal description was no longer based on the level of probabilities, but on the level of probability amplitudes.

What Dirac did here was non-trivial and, as we shall later see, physicists clearly recognized that speaking about transitions when no actual transitions are concerned was not in harmony with the general interpretation of quantum theory at the time. Dirac had made transitions, emission and absorption processes take place on the quantum amplitude level. To some extent, he redefined what these terms actually refer to and thereby explicitly introduced what will later be called “virtual transitions” and “virtual states.” And now it is time to provide an answer to the question guiding this first section: Why did Dirac invoke such a temporally ordered sequence of events when describing his theoretical procedure? And why was it specifically Dirac who did so?

1.3 So, why was it Dirac who introduced PST?

A huge part of the material underlying my answer to the above question has already been presented. As should have become clear through the foregoing discussion, the technical and conceptual environment Dirac had developed was highly suggestive. But as will be exemplified in Sect. 2, Dirac’s conception was not unanimously accepted and applied. And this certainly raises the question why it was specifically Dirac who introduced PST. To present an answer I will first look more closely at Dirac’s introduction of the dispersion paper and the archival material connected to PST. Second, I will contextualize Dirac’s verbalization in respect to his philosophical stance towards quantum theory and the concepts involved in it. Concluding this section, I will draw together all of these aspects and try to provide a multifactorial answer to the question posed.

1.3.1 The careful introduction of PST

At first we have to note that the direct identification of matrix elements with transitions only occurred in Dirac’s paper on dispersion. In his first paper on radiation theory, mainly dealing with emission and absorption, Dirac spoke of “those matrix elements that refer to transitions” (Dirac 1927c, 252, emphasis added) or “the matrix elements associated with that transition” (Dirac 1927c, 259 emphasis added). As he noted, “the probability of a transition [...] is proportional to the square of the modulus of that matrix element of the Hamiltonian which refers to this transition” (Dirac 1927c, 261). The connection between the matrix elements and transitions was a cornerstone of the interpretation of quantum theory from its beginning. But an identification suggesting a physical process standing behind these structures was not.

When introducing PST in his paper on dispersion, Dirac was also rather careful. Shortly before engaging with the description of the processes on a quantum amplitude level, he observed:

If $V_{mn}$ are the matrix elements of the perturbing energy V [...] then each $V_{mn}$ gives rise to transitions from state n to state m; more accurately, it causes the eigenfunction representing state m to grow if that representing state n is already excited [...].” (Dirac 1927b, 711; emphasis added)

Dirac actually acknowledged himself that he was speaking somewhat loosely when invoking the transition terminology.

Yet, the verbal description and the differentiation between direct scattering and scattering through intermediate states were cornerstones of his exposition. In early notes on the dispersion paper, Dirac wrote (see Fig. 4): “state how coeft. [coefficient] $v_{rs}$ in perturbation terms gives [emphasized and added] direct transitions $r\rightarrow s$ when there is no change of energy. Describe dispersion effect Scattering terms.”^{Footnote 42} When drafting the relevant passage, Dirac would wrestle with the exact formulation for framing the “dispersion effect” (see Fig. 5). He noted that “$a_n$ [the amplitude of the intermediate state] gets excited [scribbled out: at the expense of $a_m$; by transitions from $m'$] on account of the existence of $a_{m'}$ by an amount proportional to $v_{nm'}$ [...].”^{Footnote 43} Dirac certainly chose his words with care at this stage and the version he settled for in the draft (“on account of the existence”) was rather uncontroversial. Since the amplitude of the initial state was set equal to one from the beginning, this is a straightforward extrapolation. In the published paper, contrary to this draft, Dirac would nevertheless choose the transition terminology.

When it comes to the specifics of his route towards PST, the archival material only holds minor indications. Dirac’s notes are scattered over different folders, temporally unordered while obviously stretching over several years. He only took sparse annotations to the calculations and some are on the back of prior drafts. Worst of all, there are bigger gaps in the material which make a coherent reconstruction next to impossible.

Some of the calculations and drafts are nearly equivalent to the published version.^{Footnote 44} Others simply back up what was clearly the case: For example, Dirac explicitly used the wave function in occupation number representation in his drafts while such a representation, although obviously guiding his work, is nearly completely lacking in the dispersion paper.^{Footnote 45} Yet, there are a few pages that actually point towards conceptual issues and exemplify the strong grasp the ideas Dirac had developed already had on his thinking. One of them is reproduced in Fig. 6.

Although the notation is a little bit different,^{Footnote 46} up until the third line of the sheet the calculation is equivalent to the presentation in the publication. The break with his published paper occurs at the second equal sign in line three and is then carried through the whole calculation: Dirac did not split his formulas according to the time-dependence of the corresponding matrix elements, but rather in terms of direct scattering and processes through intermediate states.

There are two ways of temporally locating this calculation in Dirac’s derivation process. On the one hand, we might interpret this calculation as an attempt at treating resonance scattering. Dirac kept all terms of Eq. (12), including the ones he dropped in his calculation of dispersion. In the lower part of the draft, Dirac further applied his trick to the energy range of the intermediate states. Since this procedure was essentially a way of ensuring energy conservation, this points to an interpretation of the sheet as an attempt of treating resonant scattering.

On the other hand, had Dirac already derived the KH formula, so had he finished his dispersion theoretic treatment, which he nearly certainly did before developing an account of resonant scattering, he would have noted that he needed to group the terms differently, namely corresponding to their time dependence. As already noted, this is how he would present his calculations in the published paper.

Although I am not able to temporally locate this calculation exactly, it nevertheless provides an important insight: The ordering of the terms according to the temporal development was not guiding Dirac’s evaluation at this point, although he had already developed a firm grasp on the application of his trick. It was the distinction between direct scattering processes and scattering through intermediate states that led Dirac’s derivation of the probabilities of observable results in his radiation theory. When Dirac performed the above calculation, the conception based on a quantum amplitude level structured his evaluations.

The still existent notes by Dirac himself, taken in the period from early February to early April 1927, thus point towards the important role the verbal exposition of dispersion had for Dirac and to the influence of the conceptualization on early calculations of the effects. In particular, they highlight the role of the differentiation between direct scattering and scattering through intermediate states. Still, they do not provide any clues as to why Dirac chose to connect the calculations with PST. For this reason, as well as to finally have all the arguments needed for a coherent interpretation in place, we need to engage with Dirac’s philosophical stance towards theoretical physics and its concepts.

1.3.2 Dirac’s philosophical stance towards quantum theory

The best-known aspects of Dirac’s approach to theoretical physics are certainly his striving for mathematical beauty^{Footnote 47} and the method of “playing with equations.”^{Footnote 48} Both of these aspects were certainly part of Dirac’s development of radiation theory. Dirac remembered that, in coming up with the quantization procedure, he was simply playing with equations. The archival material suggests that he often simply tried things out to see where they lead when evaluating his radiation theory as well. One aspect of mathematical beauty, the formulation of the problem in most general but equivalently simple terms and then shifting focus to the actual application, is also mirrored in Dirac’s approach to perturbation theory. Yet, neither of these points help in answering the question guiding this section.

However, Dirac’s methodology included a standpoint that is connected to the above two guidelines and which might actually be used for an explanation: Dirac himself has called it “Eddington’s principle of identification.”^{Footnote 49} In most general terms, it states that the mathematics of the theory should be developed and evaluated before any physical interpretation and then, in a second step, the quantities referring to physical properties should be identified. Most explicitly, Dirac used this kind of reasoning in his argument for magnetic monopoles (Dirac 1931). But, as Olivier Darrigol (1993, 331–333) argued, it also guided Dirac’s application and interpretation of Schrödinger’s wave function in his 1926 paper. To understand how such a standpoint might be applied to explain Dirac’s invocation of PST, we need to briefly discuss the results of Dirac’s evaluation of resonant scattering.^{Footnote 50}

In the last section of the paper on dispersion, Dirac demonstrated that in the case of resonance between the incoming frequency and the frequencies of the material system, the most relevant part of the scattered light was due to actual emission and absorption processes, connected to the squares of the respective probability amplitudes. Actual counterparts to Dirac’s virtual transitions existed that described the scattering of light when the energy of the impinging radiation was fitting. As is necessary for the principle of identification to apply, in the case of dispersion there was further no theoretical hindrance in conceiving of such counterintuitive events as energy non-conserving transitions. From this point of view, an “anything that can happen, will happen” attitude seems to fit Dirac’s reasoning process.

And even though the identification with actual processes in the case of resonance might have enhanced the suggestiveness of Dirac’s framework, there is one obstacle for invoking this kind of argument as an explanation of PST. Since the principle of identification aims at associating mathematical structures with physical objects or processes, a close look at the structures Dirac identified with transitions in dispersion theory is important. And this is where a problem occurs: The mathematical elements describing dispersion on the one hand and the ones describing resonant scattering on the other were not equivalent. The starting point of Dirac’s mathematical and verbal description of dispersion was the first part of Eq. (12) while resonant scattering was represented by its second part. Actually, the parts describing dispersion were also part of the final results on resonant scattering in Dirac’s paper, but due to the dominance of the second part of Eq. (12) they could, according to Dirac’s argument, be neglected. The two mathematical elements existed side by side. Although a direct identification of matrix elements with transitions might have become more suggestive, an implementation of the principle of identification seems (at least to me) on shaky grounds.

Rather, I would argue that Dirac’s explicit verbalization is not due to a general philosophical stance but to Dirac’s flexibility when it came to such. Dirac is often portrayed, and described himself, as a philosophically uninclined physicist. In the most prominent discussions about the interpretation of quantum mechanics, as for example the EPR-paradox, Dirac remained silent. The focus on philosophical issues, as it was most distinctly represented by Niels Bohr, and the obsessive care to keep the conceptual framework as clean as possible were to some extent foreign to Dirac’s style of work.

Dirac’s research practice shows a rather instrumentalistic outlook on quantum theory and the concepts involved in it. In most general terms, he did not care much about the ontological implications the handling of the theory had.^{Footnote 51} As long as the theory was internally consistent and its predictions cohered with the empirical results, everything was fine for Dirac. As Max Born, contemporary of Dirac and his host in Göttingen in 1927, once pointedly described Dirac’s stance:

“They [physicists like Dirac] say: the existence of a mathematically consistent theory is all we want. It represents everything that can be said about the empirical world; we can predict with its help unobserved phenomena, and that is all we wish. What you mean by an objective world we don’t know and don’t care.”^{Footnote 52}

Dirac’s disinterest in issues he considered mostly philosophical came with a rather flexible attitude towards the principles guiding the interpretation of quantum mechanics from its start. This can best be exemplified by recourse to a principle that is directly connected to Dirac’s introduction of PST: the observability principle. It stated that only quantities in principle observable could be counted as genuine concepts of quantum theory. It was clearly expressed in Heisenberg’s Umdeutungs paper and was basic pillar of the interpretation of quantum theory in the following years. Dirac himself referred to the principle all throughout his scientific career.^{Footnote 53} An analysis of his actual research practice nevertheless shows that this was not much more than lip service.

As Helge Kragh observed, “he [Dirac] did not hesitate to propose quantities that seemed to have only the slightest connection to observables” (Kragh 1990, 264). Kragh’s prime example was the hypothetical negative energy world proposed by Dirac in 1942 to cure some of the problems QED was facing at the time.^{Footnote 54} Andrea Oldofredi and Michael Esfeld (Oldofredi and Esfeld 2019) also argued that Dirac’s work was not guided by the observability doctrine. One of their examples is the sea of negative energy electrons introduced by Dirac in 1930. As their argument goes, these entities were from a theoretical point of view unobservable by construction.

Some smaller annotations in Dirac’s writings actually indicate that he considered a careful introduction of unobservable concepts tenable in theoretical physics. In his first lecture notes on quantum theory, we find a passage by Dirac which was an obvious attack on Schrödinger’s conception of the wave function actually representing an electrical density. In this passage, Dirac explicitly stated that “one may introduce auxiliary quantities not directly observable for the purpose of calculation; but variables not observable should not be introduced merely because they are required for the description of the phenomena according to ordinary classical notions.”^{Footnote 55} In a strange twist of reasoning, he made a similar statement in his paper on the many-time formalism (Dirac 1932). Therein, he upheld the observability doctrine as one of the basic pillars of the interpretation of quantum theory, yet, as he commented, “strictly speaking, it is not the observable quantities themselves (the Einstein A’s and B’s) that form the building stones of Heisenberg’s algebraic scheme, but rather certain more elementary quantities, the matrix elements, having the observable quantities as the squares of their moduli” (Dirac 1932, 456; emphasis added).

All in all, Dirac was a physicist who was neither hindered in his methodology by some strict application of the observability principle, nor was he concerned with the ontological implications of his theoretical framework. The point is, that others were and I consider this one of the reasons why physicists would shy away from talking about transitions when no “actual transitions” occurred.

1.4 An interim conclusion

Now it is time to put everything together and present my whole argument as to why it was Dirac who introduced PST, i.e. a verbal representation of the scattering of light in terms of temporally ordered and energy non-conserving subprocesses taking place on a quantum amplitude level at least suggesting some kind of causal connection between these processes.

First of all, Dirac developed an extremely suggestive technical and conceptual framework. The amplitudes and probabilities of energy non-conserving transitions were, albeit small, non-zero and fluctuating. Energy was conserved for the whole process by application of Dirac’s trick, but only by pushing time to infinity. The role of the interaction, serving to introduce alteration in the system’s behaviour and not as altering the energy levels of the system, allowed for a possible explanation of energy non-conservation.^{Footnote 56} Shifting the focus on the quantum amplitudes was suggested by the creation and annihilation operators working on this level of theory and by the differentiation between direct scattering and scattering through intermediate states.

Secondly, Dirac did not only interpret the equations in terms of processes and leave it at that. He actually used PST. To reproduce the structure of the KH formula, Dirac had to invoke two different sequences of events: Either first an absorption occurred and then an emission or the other way around. Since energy conservation was not imposed on the virtual transitions, there was no theoretical hindrance in conceiving of the second, quite counter-intuitive sequence of events. The storyline Dirac came up with was used, and through it the relevant matrix elements and their specific combinations could be constructed. From the first time it was explicitly expressed, the picture of subsequent transitions, emission and absorption subprocesses as well as the concepts figuring in it served as tools for the construction of mathematical representations, which were, in a second step, evaluated further. The verbalization was used to bridge the gap between an abstract mathematical representation of perturbation theory and the physical process to be calculated.

Thirdly, this intricate interplay between mathematical techniques, previously established conceptualizations and, finally, the direct application from which, according to my argument, PST emerged, would not have led any physicist to provide such an exposition. Even after the introduction of PST, some physicists would show reluctant in accepting this kind of framing. But Dirac was philosophically flexible enough (and I consider this, from a pragmatic point of view and in the given circumstances, a good thing) not to worry too much about strict philosophical guidelines of interpretation. After all, the reference to a sequential occurrence of transitions on a quantum amplitude level, when no actual transitions occurred, was retrospectively a huge step, specifically in that it provided a language suitable for the theoretical treatment of quantum electrodynamical processes and was consequently used in the majority of the evaluations of QED in the 1930s.

2 The impact of PST: representational alterations and (meta-)physical implications

Dirac’s invocation of PST did not directly cause any open philosophical dispute. But, when discussing specific problems, physicists took stances, either implicitly or explicitly, towards the verbal model Dirac had proposed. All the reflections I am about to discuss were given in the derivation and interpretation of the light–matter interaction and the second order processes that it entails, both relativistically and non-relativistically.

Since all of the formulas and their derivation in radiation theory included intermediate states, I will not order the following section in respect to the physical effects. Rather, two different consequences of the establishment and use of PST will structure the following. The first subsection will engage with the reflection of Dirac’s verbal model in the medium of diagrams. The second subsection is devoted to the different stances towards the reality of the storyline of PST and centres around one of its most important direct consequences: It motivated Dirac to develop the idea of the Dirac sea.

2.1 Alterations in diagrammatical representations: the inclusion of an ordered sequence of events

Both examples of diagrammatical representations I am about to discuss have a common feature: the explicit inclusion of an ordered sequence of events in the representational format. Yet, both of them differ appreciably in the way this was achieved and to the effects they described. The background of the first example was the experimental observation of the so-called Raman effect. This particular instance of the scattering of light from an atom or molecule had been predicted by Adolf Smekal (1923) and was put on firmer theoretical ground by Kramers and Heisenberg in their dispersion paper. In most general terms, the Raman effect consists of the occurrence of components in the secondary radiation which exhibit a discrete frequency shift from the primary radiation. This frequency shift is proportional to the energy difference between two stationary states of the scattering atom or molecule ($\Delta \nu =\frac{1}{h}(E_n-E_m)$). It was observed by C.V. Raman^{Footnote 57} and independently by Landsberg and Mandelstam (1928) in early 1928.

Although the Raman effect was soon celebrated as clear evidence for quantum theoretic predictions, its further empirical evaluation led to some dispute about its conceptualization, mainly amongst experimentalists.^{Footnote 58} An integral part of this discussion was the mounting evidence that the frequency shift did not necessarily correspond to absorption frequencies in the infra-red. Rather, the alteration in frequency was proportional to the difference between two absorption frequencies of the scattering medium: The selection rules, hence, were not equivalent to the ones for regular absorption of energy by the scattering medium.

One of the most decisive experimental proofs of this effect was provided by Franco Rasetti (1929), who was working at CalTech in Pasadena at the time. At Rasetti’s home institute in Rome, Eduardo Amaldi and Emilio Segrè (Amaldi and Segrè 1929) noted the (partial) confusion in the empirical literature and explained the occurrence of the unexpected frequency shifts, i.e. the apparent independence of Raman lines from infrared absorption lines, including a lengthy exposition of PST. In a subsequent paper, Eduardo Amaldi (1929) investigated the problem further and cast his reasoning into a diagrammatical representation depicted in Fig. 7.

At the time, energy-level diagrams were still the most common way of representing atomic phenomena in quantum theory. Other diagrams, such as the ones by Kramers and Heisenberg (see Figure 2) or Ralph Kronig’s term scheme diagrams, as discussed by Martin Jähnert (2019, Section 7.2), were rather static in nature. But Amaldi chose to depict the process in a different fashion. The diagrams in Fig. 7 are read from left to right in the order initial, intermediate and final state. The a’s refer to the probability amplitudes of the respective states of the whole system, atom/molecule and radiation field. The indexes refer to the state of the atom (first index) and to the occupation of the components of the radiation field (all following indices). As Amaldi discussed both the Stokes and the anti-Stokes case of the Raman effect, two initial and final states occur in each diagram. The upper diagram represents first an absorption, note the index $n_{a}-1$ on the amplitude of the intermediate state, and a subsequent emission of a different quantum, represented by the index $1_b$ on the amplitude of the final state. The lower diagram refers to the process in which first an emission and then an absorption occurs.

While Amaldi’s representation digressed from the more traditional version of diagrammatical formats by representing an ordered sequence of events, also in energy-level diagrams an order was indicated in the follow-up of PST. In Göttingen, where Dirac had finished his dispersion paper, Maria Göppert-Mayer, at the time a doctoral student of Max Born, explained the physical effects predicted by the KH formula through the invocation of energy-level diagrams.^{Footnote 59} Kramers and Heisenberg already noted that their formula described, besides dispersion and the later to be called Raman effect, a third mechanism which is today called double emission. There is a certain probability that an atom will irradiate two different radiation components, as long as the sum of the frequencies corresponds to the energy difference between initial and final state of the process. In the course of writing an introduction to radiation theory for the textbook by Pascual Jordan and Max Born (Born and Jordan 1930, Chapter 7), Göppert-Mayer found that the inverse process, the simultaneous absorption of two photons, was also described by the KH formula. Its detailed evaluation became part of her dissertational research.

In her published dissertation as well as in initial communication of the results, Göppert-Mayer used the structural analogy of Raman scattering, double emission and double absorption to explain the “synergy of two light quanta in one elementary act.”^{Footnote 60} The corresponding term scheme diagrams are reproduced in Fig. 8. They represent from left to right: the Stokes and anti-Stokes case of the Raman effect, double emission and double absorption. n and m are the initial/final states of the atom (depending on which process you are looking at). k is an arbitrary other state of the material system, the intermediate state. The solid lines represent photons: pointing upwards means absorption and pointing downwards emission. The dashed lines represent “the behaviour of the atom” (Göppert 1929, 932). The arrow at the end of the dashed lines indicates the sequence of events. The photon lines are not ordered. Any sequence of emission and absorption processes must be included in the theoretical description. Göppert-Mayer’s diagrams abstract from this permutation of the emission and absorption subprocesses and thereby allow her to depict the Raman-effect, double emission and double absorption each within a single diagram.

The impact of Amaldi’s and Göppert-Mayer’s diagrammatical representations on the further development of quantum theory was certainly limited. As a matter of fact, up until 1937 representations resembling Amaldi’s diagrams in structure were not used by physicists, at least I did not find any prior to a paper on nuclear physics by Gregor Wentzel (1937).^{Footnote 61} Furthermore, I could not find any direct connection between Amaldi’s representation and Wentzel’s. But I did not choose to discuss these diagrams for their impact on the further development of diagrammatical techniques. Rather, I chose these two examples to exemplify the impact of the verbal representation on other representational formats. Contrary to previously established diagrammatical representations, they both included an ordered sequence of events. Dirac’s invocation of PST and its use by other physicists led to an alteration in another representational format.

In both cases the diagrams served a didactic purpose: Amaldi chose the specific diagrammatical representation to make the theoretical description of Raman scattering more accessible to his readers; Göppert-Mayer used the diagrams to visualize the analogical structure of the Raman effect and the other double processes. Even though the diagrams were not used for calculations, they served as tools for the respective actors.

All the same, Göppert-Mayer acknowledged that what she represented in the diagrams was not what was actually happening to the atom, but rather a language suitable to describe, compare and evaluate the theoretical modelling. The processes she subsumed under the headline of the “synergy of two light quanta in one elementary act” behave “as if two processes, neither of which satisfies the energy law, occur in one act.”^{Footnote 62} The explanatory function of the physical language and the corresponding diagrammatical representation was in Göppert-Mayer’s own assessment not warranted by its direct description of real-world processes. And thereby we are directly led to the topic of the second subsection: the stances physicists took towards the reality/physicality of PST.

2.2 “A certain degree of reality”^{Footnote 63}: PST between formal and physical

Maria Göppert-Mayer was not the only one who commented on the reality of PST. When John van Vleck (1929) investigated specific cases of selection rules for the Raman effect, he noted that the KH formula “involves the amplitudes connected with transitions to what we shall term the ‘intermediate’ states [...]” (van Vleck 1929, 754). He noted that “it is to be clearly understood that the term ‘intermediate’ relates merely to the position of a state such as b in the products in [the formula] [...]” (van Vleck 1929, 754).^{Footnote 64} To Van Vleck, the intermediate character was a mathematical one. Similarly, C. V. Raman noted that “the introduction of the third level C [the intermediate state] is merely a mathematical device.” Since energy is normally not conserved in a transition to an intermediate state, it “is a purely virtual one which cannot actually occur” (Raman 1929, 790). According to Yakov Illich Frenkel (1929, 758), who proposed to conceive of scattering as a two part process of actual transitions, “the usual assumption [...] does not regard the state n [intermediate state] as ‘really’ occurring [...].”

The above quotes are certainly indicative of a general trend of conceiving of virtual transitions as not corresponding to physical phenomena. Yet, the most decisive impact of PST was connected to the relativistic description of the electron, the Dirac equation, and its interpretation.^{Footnote 65} Both in the inception and the interpretation of what came to be known as the Dirac sea, the theoretical description of scattering played an important role.^{Footnote 66} And its invocation led some actors to see more in PST than a verbal model of mathematical structures.

2.2.1 Virtual negative energy states: beyond the purely formal

In 1928, Dirac (1928) came up with a relativistic wave equation which cured some of the problems of prior relativistic descriptions (foremost the negative probabilities), proved experimentally suitable and naturally included the spin of the electron. For the following discussion, two aspects of the Dirac equation are important. On the one hand, a problem of prior relativistic wave equations remained.^{Footnote 67} The Dirac equation still entailed negative energy solutions. Theoretically, there was no hindrance for positive energy electrons to fall into negative energy states which posed a severe problem: Not only were negative energy states never observed, but electrons in such states would behave in all kinds of physically unexpected ways. On the other hand, Dirac’s starting point was a wave equation which included the operators linearly. Hence, the quadratic terms of the vector potential, which were responsible for the “direct” or “true” scattering, were no longer included in the theoretical description.

Werner Heisenberg was the first one to note that the direct scattering terms no longer occurred and that they were replaced by another mechanism. In a letter to Wolfgang Pauli in late July 1928, he communicated his finding that the combination of matrix elements corresponding to transitions to and from negative energy states would lead, in the respective approximation, to the Thomson scattering formula. Hence, the scattering of light from a free electron was now theoretically connected to such “crazy transitions.”^{Footnote 68} Even though Heisenberg presented his conclusions at a lecture in Copenhagen, and it was therefore known in parts of the community, it did not lead to any further investigations right away. For example, Oskar Klein and Yoshio Nishina still rejected negative energy states as “physically not meaningful”^{Footnote 69} in their semi-classical derivation of the relativistic scattering of light resulting in what is known as the Klein–Nishina formula.^{Footnote 70} Only when the scattering of light from an atom and a free electron, both described relativistically, was reevaluated within the framework of Dirac’s radiation theory, the occurrence of negative energy states as intermediate states spurred further conceptual development. This reevaluation was carried out and communicated to Dirac by the Swedish physicist and expert in the theoretical description of light scattering, Ivar Waller.^{Footnote 71}

Through the study of the correspondence between Waller and Dirac, Karl Grandin (2008, 202–208) showed that it was the necessity of including negative energy states as virtual or intermediate states in radiation theory that would lead Dirac to take these negative energy solutions more seriously. In late 1929, Dirac was obviously still unaware of Werner Heisenberg’s finding. When Ivar Waller first communicated the necessity of including negative energy intermediate states to Dirac, a conclusion Waller had independently arrived at, Dirac still believed that there had to be some kind of mistake in Waller’s calculation.^{Footnote 72} Only after redoing the calculation himself, Dirac came to the conclusion that negative energy states were necessary in the theoretical description.^{Footnote 73} Directly afterwards and apparently within a few days, Dirac came up with the idea of what is today known as the Dirac sea, i.e. filling all the negative energy states with an infinity of electrons. Only deviations from this uniform distribution should be considered to be observable. Through Pauli’s exclusion principle, positive energy electrons could no longer fall into the abyss of negative energy.^{Footnote 74}

Waller’s communication of the calculational results to Dirac motivated the latter to take another look at negative energy solutions, and thereby create the first quantum theoretical conception of anti-particles, namely as holes in this sea.^{Footnote 75} In one letter Waller also indicated the importance of the description in terms of subsequent transitions in connection with radiation theory. He explicitly compared the role of the intermediate states in Dirac’s description and in semi-classical calculations, which Ivar Waller called the “density method”:

“By using the density method these [intermediate states] enter more formally in the calculation, so the difficulty [i.e. the occurrence of negative energy states as intermediate states] is perhaps not so serious there.”^{Footnote 76}

From a draft of the letter, we know what Waller meant by “more formally”: “the corresponding eigenfunctions only play the mathematical role of certain terms in an expansion in eigenfunctions.”^{Footnote 77}

As Dirac did not initially believe in the correctness of Waller’s calculation, Waller, to mitigate the problem, would retract the above statement in a second letter and note that also in radiation theory the intermediate states “play a rather formal role, as long as resonance effects do not occur.”^{Footnote 78} Yet, Ivar Waller as one of the leading experts on the calculation of scattering in quantum theory initially considered the intermediate states of radiation theory as something that went beyond the purely formal or mathematical. But Waller did not go as far as calling them physical. Dirac and Igor Tamm took this step.

2.2.2 The physicality of intermediate states

Dirac communicated his idea of the Dirac sea both to Ivar Waller and, in a well-known correspondence, to Niels Bohr.^{Footnote 79} As Bohr was initially reluctant to accept the relevance of negative energy states, Dirac argued that a “scattering process is really a double transition” and that “the intermediate state [...] lasts only a very short time.”^{Footnote 80} Since intermediate states of negative energy were necessary for a consistent description of scattering, Dirac considered them to be physical: “If one says the states of negative energy have no physical meaning, then one cannot see how the scattering can occur.”^{Footnote 81} Hence, Dirac proposed, in addition to the temporal order of the events of emission and absorption processes, a finite temporal dimension of the intermediate state. Further, from the occurrence of the negative energy states as intermediate states he concluded the physicality of these states. Dirac pushed the virtual states into physical terrain.

Igor Tamm, since a shared stay at Leiden and Leipzig in 1928 well acquainted with Dirac, had independently discovered that negative energy states occurred as intermediate states in the relativistic description of light scattering.^{Footnote 82} Without knowing about Dirac’s statements to Bohr, Tamm came to a similar conclusion. In a letter to Dirac, he noted that the occurrence of negative energy states as intermediate states “proves the physical relevance of these states [of negative energy].”^{Footnote 83} In one of his subsequent papers, Tamm noted that “scattering of a light quantum through matter consists according to the Dirac theory, as is well known, of a sequence of two elementary processes, namely of the absorption of the impinging and the emission of the scattered light quantum.”^{Footnote 84} I emphasized the term “elementary process” in this quote as it, or the equivalent “elementary act,” was conventionally used, see for example Göppert-Mayer’s statements above, to refer to processes which we today call real processes, i.e. something which can directly be connected to a probability (and not to probability amplitudes).

Dirac’s and Tamm’s comments are not only interesting as they suggest a physical interpretation of PST, or at least show how PST spurred physical conclusions. Actually, the argumentative structure they used was further and is still applied. Both, Tamm and Dirac, inferred the physicality of negative energy states from their occurrence as intermediate or virtual states. In the case of Heisenberg, Waller, Dirac and Tamm the occurrence of negative energy states was a consequence of the calculation. During the late 1930s and early 1940s physicists already took the liberty to propose new kinds of states, not yet observed by experiment, and included them as virtual states to make sense of as yet unexplained empirical observations.^{Footnote 85} In contemporary physics the same argumentative structure is applied: hypothetical fields are introduced and their physical existence is probed by the effects virtual particles of these fields would have on the experimental results.^{Footnote 86} Although all of these instances differ in the motivation to introduce new kinds of states or particles,^{Footnote 87} they share a common feature: In each case the step from virtual to physical is taken.^{Footnote 88}

2.2.3 Non-invocation and refutation: physicists close to Copenhagen

As should have become clear through the above examples, some physicists took the intermediate states and the corresponding description by Dirac very seriously, at least more seriously than in semi-classical methods.^{Footnote 89} But there is another side to the story. Some physicists clearly used notions and methods along the lines of Dirac but refused to apply or only reluctantly applied a verbal description (at least in publication). Wolfgang Pauli and Werner Heisenberg’s work on quantum electrodynamics is one such an example (Heisenberg and Pauli 1929, 1930). As Cathryn Carson has noted, they “did not give much of a verbal description, however, largely allowing the calculation to stand on its own” (Carson 1996, 112).^{Footnote 90} Victor Weisskopf applied Dirac’s kind of description until spending half a year in Copenhagen with Niels Bohr and then going on to Zürich to become Pauli’s assistant. In his famous paper on the self-energy of the electron (Weisskopf 1934), there was no verbalization or visualization of any kind. But, when discussing matters with Heisenberg in correspondence, Weisskopf explicitly used the notion of intermediate states to explain the difference between the self-energy of the photon and the electron.^{Footnote 91}

In total, it is noticeable that, on the one hand, physicists who are normally portrayed as rather pragmatic (e.g. Dirac, Heitler, Fermi)^{Footnote 92} applied Dirac’s description explicitly in their papers, while, on the other hand, physicists who were closely associated with the Copenhagen institute and who defined themselves (or were defined by others) as philosophically inclined (e.g. Bohr, Pauli, Heisenberg, Weisskopf, Rosenfeld)^{Footnote 93} did not represent intermediate states and the corresponding processes verbally.^{Footnote 94} The most explicit refutation of the physical interpretation of PST I came across after Dirac had proposed his idea of the Dirac sea, was given by a close acquaintance of Niels Bohr, Léon Rosenfeld. In a lecture held in February 1931 at l’Institut Herni Poincaré, he commented on the description of double transitions as follows:

“First of all, the negative energy states play an important, although purely formal, role in the theory of dispersion. One knows that the dispersion formula [...] includes a summation over all the energy states the diffusing body is capable of. In a pictorial fashion, but which is not justified by the general precepts of the physical interpretation of quantum mechanics, one can say that the phenomenon of diffusion consists of a double transition of the diffusing body [...].”^{Footnote 95}

Stances were obviously split concerning the question whether or not one should use such an explicit description in terms of subsequent transitions, emission and absorption subprocesses, and whether this in some way or another cohered with physical reality.^{Footnote 96} Nevertheless, PST had an impact on how physicists represented scattering phenomena in other representational formats and, eventually, it made its way into the everyday work of the majority of the physicist community. The most popular introductory texts to QED during the 1930s, published by Fermi (1932) and Heitler (1936), explicitly used this kind of verbal model. Today this kind of description is still, just in a modern form, part of the practice of quantum field theorists. The basic building blocks of Feynman diagrams remain emission, absorption, pair annihilation or creation subprocesses. And thereby, PST remains a part of the tool kit of the quantum field theorist.

3 Reflection and outlook

In the foregoing discussion, I used the idea that concepts and the models they figure in come in different representations and these representations are closely related to the actual theoretical practice of the historical actors. Specifically, I focused on the emergence and the initial reception of what I called PST, i.e. the verbally explicit description of the light–matter interaction in terms of subsequent and temporally ordered virtual (modern terminology) transitions.

I argued that this representation was first provided by Paul Dirac because of three interrelated reasons. First of all, he came up with a mathematical modelling that had a highly suggestive character within the priorly established conceptual framework. Secondly, Dirac was philosophical flexible enough to explicitly express what he saw in the formulas. As indicated, in particular physicists who are generally portrayed as philosophically motivated did not express PST (at least at first). Thirdly, Dirac put PST to work in his dispersion paper. To rederive the KH formula, different orders of subprocesses had to be envisioned and superimposed. From its beginning, PST functioned as a tool for the quantum field theorist.

I showed how the verbally explicit description in terms of subprocesses had direct impact on other representational formats: New diagrammatical representations were developed, while traditional ones were modified to exhibit the ordered sequence of events explicitly. The idea of subprocesses did not only exhibit a “less formal” character in radiation theory, its physical interpretation was also closely related to the development of the Dirac sea. Nevertheless, in the immediate aftermath of Dirac’s introduction of PST, stances stretched from the physical reading by Dirac and Tamm over Waller’s “less formal” to Göppert-Mayer’s “as if”, Leon Rosenfeld’s “not warranted” or a complete lack of mention in Heisenberg and Pauli’s QED papers.

In my eyes, we can draw two immediate conclusions from the illustration of these stances. On the one hand, it shows that the Feynman–Dyson split and the philosophical debate about the ontological and representational status of Feynman diagrams, emerging in the 1950s and stretching into the present, has a rather lengthy prehistory. Starting with its introduction, physicists positioned themselves towards the use and the metaphysical implications of the physical picture that still underlies the diagrammatical technique (all the difference set aside for the moment). On the other hand, the ontological commitments of actors did, for the greater part, not interfere with the application of PST. Whether it was conceived of as a representation of mathematical structures or a representation of physical processes, the actors used the verbal model as a tool, not only for didactic purposes but also for calculational ones.

In this practice, physicists followed Dirac’s initial presentation: While the derivation of the abstract perturbative series was mostly a mathematical, albeit non-trivial business, PST was invoked when constructing the specific matrix elements describing the investigated effects. Each sequence of events that would lead from the respective initial to the respective final state through subsequent virtual transitions was thereby envisioned. Which states would couple through which subprocesses was dictated by the matrix elements of the interaction energy. The exposition of perturbation theory and its application in Heitler’s influential textbook (Heitler 1936) is a point in case, as are higher order calculations performed during the 1930s.^{Footnote 97} In all of these cases, the “method [...] of intermediate states”^{Footnote 98} was applied.

During the 1930s, the prime mode of representation of this kind of reasoning was a verbal one. The well-known example of Hans Euler’s term scheme diagrams for light-by-light scattering, as discussed by Adrian Wüthrich (2010, Section 2.1), were accompanied by a tabular display listing all possible sequences of events in verbal terms. Bernhard Kockel, who was deeply involved in Euler’s calculations, did not invoke any diagrams to discuss the third-order effects he was investigating for his dissertation. He presented his reasoning with PST through tables in which each possible sequence of events was portrayed in verbal terms (see Fig. 9). Also in papers dealing with meson physics, the application of PST was often presented verbally to the readers.

Through this observation and the taken focus, my contribution also aims at enriching the collection of representational formats discussed in the historiography of QED. Diagrammatical representations currently occupy a rather prominent place. Although this focus is understandable due to the prominence of Feynman diagrams in post-WWII particle physics, it does not necessarily do justice to the practices of the physicist of the 1930s. A study of PST indicates that the diagrams of the 1930s were often post hoc representations, if the language used was cast into a diagrammatical representation at all.

In conclusion, I want to emphasize that PST as a representational format has a certain degree of independence of other representations: It can neither be reduced to the mathematical representation of the theory nor is it equivalent, although closely related, to the method of Feynman diagrams. On the one hand, PST performed a rather specific function in the theoretical evaluation of the 1930s: It bridged the gap between the abstract perturbative series and the matrix elements describing the effect to be evaluated. Even if this step might have been taken through a purely mathematical representation, the fact remains that actors did not do so. The specific language which Dirac introduced facilitated this task considerably, and as such exhibited its tool character.

On the other hand, although Feynman diagrams perform exactly the same function in perturbative evaluations, they do not allow for the same conclusions. Historical actors constructed processes through the invocation of PST that we can directly discard through the usage of Feynman diagrams.^{Footnote 99} Feynman’s version of QED does not only differ in the physical picture that is underlying its representation (i.e. a space-time approach vs. a state-transition model) and the physical properties of its concepts,^{Footnote 100} but it differs, through the diagrammatical representation, in the conclusions that can be extracted from the physical interpretation of the perturbative series. As such, the approach taken in this paper highlights a well-known aspect of the shift that occurred in QED in the late 1940s, namely the refinement of the theoretical tool kit through the introduction of different representational formats.

Change history

31 October 2022
A Correction to this paper has been published: https://doi.org/10.1007/s00407-022-00300-y

Notes

The articles by James Robert Brown (2018) and Tobias Fox (2008) are cases in point.
See Feynman (1949, 772–773) for the physical interpretation briefly outlined above and its connection to the mathematical structures.
The most important arguments brought forward until 2007 were reviewed by Fox (2008). Further reflections on Feynman diagrams were presented in the recent special issue of Perspectives on Science (2018, Vol. 26, Issue 4) and by Oliver Passon (2019), introducing the argument of topological equivalence to the philosophical discussion. In short, the most important argument against a realistic interpretation, the superposition argument, is based on the fact that Feynman diagrams represent mathematical terms of a perturbative series that figure on a quantum amplitude level, i.e. do not directly refer to probabilities. To compare theoretical predictions and actual experimental results, many different Feynman diagrams seemingly representing different physical processes with different numbers of virtual particles have to be superimposed and the sum of the respective mathematical terms has to be squared. This results in interference terms between them and there is, by definition, no way of physically cashing out the single terms. Hence, we cannot separate the contributions of seemingly different processes to one and the same observable effect. This argument was first proposed by Robert Weingard (1982).
See Schweber (1994, chapter 8) for a detailed and in-depth reconstruction of Feynman’s path towards his space-time approach. The genesis of Feynman diagrams was addressed by Adrian Wüthrich (2010), who also stressed the conceptual differences between the QED of the 1930s and Feynman’s approach. Recently, Alexander Blum (2017) has argued for a paradigmatic shift in terms of exemplars in QED of the late 1940s: Feynman’s work is a central aspect of this development.
See Kaiser (2005, 185ff) for an argument connecting the representational tradition of Minkowski diagrams to Feynman diagrams.
See the reconstruction by Wüthrich (2010, especially chapter 4).
For explicit denotations of concepts as tools see, e.g. Feest (2010), Steinle (2012) or MacLeod (2012).
In recent years, the role of representations and their usage for the historical study of concepts has been stressed by several authors in HPS literature, among them Theodore Arabatzis, Arianna Borrelli, Emily Grosholz, and Marion Vorms. A discussion of this field would certainly exceed the given space.
For example, the two-dimensional format of Feynman diagrams only allows for a limited number of diagrams to be drawn without creating cross-sections in the lines. Thanks goes to Adrian Wüthrich for pointing this out.
E.g., the structure of the basic building blocks of Feynman diagrams in QED, consisting of a vertex with two electron and one photon line, can be considered as arising from the structure of the interaction Hamiltonian. The same holds for PST: Through the mathematical representation of the interaction Hamiltonian, it was clear that the occupation number of the successive states differed by $\pm 1$ for photons and for the material system. Each matrix element described, in verbal terms, a transition of the material system accompanied by an emission or an absorption.
Dirac (1927b, 712). The notation has been changed slightly in comparison to the original.
Although a consistent denotation of these transitions as virtual only occurred in the mid-1930s, I will, to avoid any further confusion, use this terminology throughout the following paper.
This statement refers to the papers that made it into the mainstream of quantum theory. The possibility of dissecting non-resonant scattering of light into two subsequent acts of actual emission and absorption was also discussed before the advent of quantum mechanics, for example, Herzfeld (1924) or Smekal (1925), and mainly within a light quantum point of view. It briefly resurfaced afterwards, as for example in Frenkel (1929). But these attempts are largely independent of what I am about to engage with.
“charakteristische Amplituden,” Kramers and Heisenberg (1925, 10).
As a matter of fact, the numbers on the lines on the two diagrams on the right seem to indicate an implied order. Nevertheless, these diagrams were used to evaluate the formulas by Kramers and Heisenberg. In the construction of the general structure, they used the diagram on the left. Here the numbers obviously were not chosen to imply any order of purported processes. The order of the numbers would read 1–4 and 2–3, and not, as in the other diagrams, 1–2 and 3–4.
“Umweg über einen [sic!] drittes Niveau,” Pauli (1925, 14).
He kept this interpretation only for resonant scattering, for which energy is conserved in the respective transitions (Pauli 1926, 94–95).
The following point was not made explicit in the early papers and is certainly retrospectively informed by the discussion around Feynman diagrams. Yet it is based on the interpretation that matrix elements or quantum amplitudes do not directly refer to probabilities but only their squares. And this connection was already part of Heisenberg’s and Kramers’ paper.
For Dirac’s scientific biography, Kragh (1990) is still the standard reference. Although popular in nature, Farmelo (2009) is also a well-researched and valuable source for biographical details.
See for example Kragh (1990, 32).
The same procedure was independently and further developed by Max Born (1927). John Clarke Slater (1927) came up with this technique as well.
The relevant passages can be found in Dirac (1926, 673–675).
See for example Born (1927) or Dirac (1927a) for such a conceptualization.
From a different perspective, the same theory was independently developed by Pascual Jordan at approximately the same time. For a historical reconstruction of Jordan’s work see Duncan and Janssen (2009).
In the following, I will not go into too much detail about the derivation of what came to be called second quantization but rather pick out the aspects that are, in my opinion, necessary for the comprehension of Dirac’s conceptualization of dispersion in this framework. For detailed studies of Dirac’s development of radiation theory, see Jost (1972, Sections 1–4), Darrigol (1986, 225–229), Lacki et al. (1999, Section 4.1–4.2), Bromberg (1977), Kragh (1990, Chapter 6), Schweber (1994, Section 1.3) or Pais (1986, Section 15c). For a historically sensitive modern reconstruction see Duncan (2012, Section 2.1). My own reflection will most closely follow the paper by Lacki et al. (1999) due to the shared focus on perturbation theory.
Dirac worked with the transformed variables $b_r=a_re^{-iW_rt/\hbar }$ and $b_r^{\dagger }=a_r^{\dagger }e^{iW_rt/\hbar }$. But this is of no importance for the conceptualization I am concerned with.
Dirac actually worked with number and the conjugated phase operators which have the same action on the wave functions. They are connected to the creation and annihilation operators by $b_r=\sqrt{N_r+1}e^{-i\theta _r/\hbar }=e^{-i\theta _r/\hbar }\sqrt{N_r}$ and $b_r^{\dagger }=\sqrt{N_r}e^{i\theta _r/\hbar }=e^{i\theta _r/\hbar }\sqrt{N_r+1}$.
To treat emission and absorption processes with the above Hamiltonian, Dirac envisioned an infinity of photons in an unobservable zero state. For a discussion of this procedure, the interested reader is referred to the papers cited in footnote 25.
It was independently discussed by Gregor Wentzel (1927).
This is most probably due to the prominence of this rule in Fermi’s lecture on nuclear physics from 1949/50 (Fermi 1950). Golden rule No. 1 is the application of second order perturbation theory when the first order matrix elements vanish.
This may roughly be translated as “Dirac’s cheat,” while “mogeln” has in German a childish touch. Judging from Heisenberg’s response to a lost letter from Pauli, it seems as if Pauli suspected some hidden averaging over the phases of the wave functions in Dirac’s methods. Heisenberg came to a different conclusion: “In the course of this letter, in contrast to what I expected, I came to the result, that there is no cheat in Dirac’s paper.” (Original: “Ich [bin] im Laufe dieses Briefes, im Gegensatz zu dem, was ich im Beginn glaubte, zu dem Resultat gekommen, daß bei Dirac gar keine Mogelei steht.” Heisenberg to Pauli, June 13. 1928; as cited in Pauli (1979), 460–462).
For example, Dirac applied time-dependent perturbation theory to an Hamiltonian actually independent of the time; or: Dirac conceptualized the interaction between radiation and matter as a perturbation switched on at some point $t=0$.
I will follow Dirac’s description of the trick as he applied it in the later part of his paper on emission and absorption and his paper on dispersion. For the general derivation see Dirac (1927c, 257–259) from which some of the quotes are taken.
One easily sees this by applying de l’Hôpital’s rule for the value $W_m-W_n\approx 0$ twice on Eq. (7).
This might seem strange as Dirac’s procedure is, from the start, only valid for short times. The restriction Dirac imposed for the method to give reliable results are: (a) the time has to be large in comparison to what Dirac called “the periods of the atom,” i.e. the reciprocal absorption and emission frequencies; (b) the time has to be small in comparison to the life-time of the states considered. According to Dirac “[t]here is no difficulty in T satisfying both these conditions simultaneously provided the perturbing energy is sufficiently small” (Dirac 1930a, 171).
Calculations. General Calculations, 1925–1968(?). FSU Special Collections & Archives, P.A.M. Dirac Papers, Folder 5, Box: 41. Digitized Version under: https://diginole.lib.fsu.edu/islandora/object/fsu:667304, p. 13 [10/25/2021], courtesy of the Florida State University Libraries, Special Collections and Archives. This material will be abbreviated by Dirac. General Calculations in the following. Page references refer to the pages in the PDF file.
It should be noted that this was exactly how Dirac’s procedure was at first connected to the energy–time uncertainty by others, for example, by Landau and Peierls (1931): namely, by referring to the probabilities of energy non-conserving transitions and not their amplitudes. Whether or not Dirac himself had something along the uncertainty relations of Heisenberg in mind at this point is hard to answer. He might have been aware of Heisenberg’s uncertainty relations when finishing his dispersion paper, but Dirac did not directly refer to them.
The Hamiltonian reads: $H=c(mc^2+(p+\frac{e}{c}A)^2)^{\frac{1}{2}}-e\Phi $. In the expansion of this Hamiltonian the vector potential occurs quadratically. The quadratic terms are responsible for what Dirac called “direct scattering,” so exactly for those terms that were previously missing in his wave theoretic point of view.
The second term of Eq. (12) was only important in the case of resonant scattering which Dirac turned to separately in the last section of the paper.
This equation is directly taken from Dirac (1927b, 722). The sum on the left hand side refers to the already performed summation/integration over the energy range of the final state.
Annotation by the author: the J are the variables describing the state of the atom.
Calculations. Early Work, 1926–1930(?). FSU Special Collections & Archives, P.A.M. Dirac Papers, Folder 1, Box: 39. Digitized Version under: https://diginole.lib.fsu.edu/islandora/object/fsu:667285, p. 13 [25.10.2021], courtesy of the Florida State University Libraries, Special Collections and Archives. This material will be abbreviated by Dirac. Early Work in the following.
Dirac. Early Work, p. 11 (see footnote 42).
See for example the calculation of resonance scattering on p. 14 of Dirac. Early Work (Footnote 42).
For example, p. 7 of Dirac. General Calculations (Footnote 36), but essentially to be found at diverse places.
The matrix elements of the perturbation are marked by an $\alpha _{rs}$. The initial state is called 0, the intermediate state s and the final state r. The energies of the state are denoted by E and, from the fifth line onwards, the $\alpha '$ denote the dynamical variables of the system besides the energy.
See for example Kragh (1990, Chapter 14) or Wright (2016).
See for example Pais (1987).
Helge Kragh (1990, 270–274) discussed it under the headlines of a “principle of plenitude”-argument. Olivier Darrigol (1993, 302-304) also identified the principle of identification as an important piece of Dirac’s methodological framework.
I would like to thank Alexander Blum, who pointed me towards the relevance of the resonant case for Dirac’s interpretation in one of the presentations of this paper.
This is certainly exaggerated but captures the general picture that is drawn of Dirac.
Born (1938, 12); as cited in Kragh (1990, 81).
See Kragh (1990, 262–267) for a discussion of the observability doctrine and instrumentalism in Dirac’s work.
In the paper, Dirac would explicitly state that “negative energies and probabilities should not be considered as nonsense. They are well-defined concepts mathematically. [...] [they] should be considered simply as things which do not appear in experimental results” (Dirac 1942, 8).
Taken from lecture notes of Dirac from 1927, as cited in Kragh (1990, 80).
This whole line of argumentation was inverted and connected to the energy–time uncertainty by others in the follow-up of Dirac: as long as small times were considered energy non-conserving transitions were not ruled out as such. Energy was then, strictly speaking, not not-conserved, but provided by the interaction energy. See for example Landau and Peierls (1931, 59).
See Singh (2002) for a concise contextualization and evaluation of Raman’s discovery. It was first communicated before the Indian Society of physics, as published in Raman (1928).
See Ehberger (2020, Section 2) and the literature cited therein.
For a contextualization of her dissertational research, see Masters (2013).
“Zusammenwirken zweier Lichtquanten in einem Elementarakt,” Göppert (1929, 932).
Besides the structural similarity of representing initial, intermediate and final state from left to right, there are certain differences in the representation. Most importantly, Wentzel would no longer represent the respective occupation of the states by indexes of the quantum amplitudes. Rather, the particles occupying the states (abbreviated by letters) were the only element of the representation.
“als ob zwei Vorgänge, von denen jeder nicht dem Energiesatz genügt, in einem Akt geschehen,” Göppert-Mayer (1931, 284; emphasis added). Although Göttingen physicists most probably knew about Hans Vaihinger’s philosophy of “As if” (Vaihinger 1911), Göppert-Mayer did not directly reference to it at this or any other point.
Note that Van Vleck did not even bother to discuss an interpretation of the formulas in a processual way but rather warned the reader not to confound the term intermediate as referring to the energy of the state lying in between the initial and final state.
According to an overview article by Gregory Breit (1932, 530) “the picture of the absorption of $h\nu $ via the intermediate state J” has a certain degree of reality even though in the temporary condition J” the conservation of energy is violated.”
For the derivation of Dirac’s relativistic description, see Kragh (1981), Moyer (1981), or Pais (1987).
For brevity’s sake I will only discuss the emergence of the idea of the Dirac sea. Nevertheless, it should be noted that part of the conceptual criticism on Dirac’s initial interpretation of a hole in the sea as a proton was based on the specific modelling of the scattering phenomena in radiation theory. Just to name one example, the conceptual issues Robert Oppenheimer (1930) discussed in his well-known critique were based on the specific modelling of scattering, while the much too high annihilation rate of protons and electrons was framed as a “numerical discrepancy” (Oppenheimer 1930, 563).
The occurrence of negative energy states was noted by a view authors prior to Dirac but was never considered a serious problem and was mostly ignored, see Kragh (1981, 63–64). Dirac’s paper and, particularly, the theoretical description of scattering in Dirac’s radiation theory changed this status considerably (see below).
“verrückten Übergänge,” Heisenberg to Pauli, July 31. 1928; as cited in Pauli (1979, 468). It is worth noting that Heisenberg calls the transition to negative energy states “crazy” and not the transitions to intermediate states.
“physikalisch nicht sinvollen,” Klein and Nishina (1929, 856).
As is clear from the correspondence of Nishina and Ivar Waller, Nishina was in Copenhagen when Heisenberg gave his presentation. The difference between semi-classical calculations and Dirac’s radiation theory was still prominent at that time as Nishina “did not expect that you [Ivar Waller through a calculation in radiation theory] would obtain the same formula as ours [Klein’s and Nishina’s].” Nishina to Waller, February 22. 1930. Ivar Waller Archives, Center for History of Science, Royal Swedish Academy of Science [IWA].
See Grandin (2008) for a historical study of Ivar Waller’s work up to 1930.
Waller to Dirac, November 2. 1929, PAMDP, Box 23, Folder 3. Dirac to Waller, November 18th 1929, IWA.
Dirac to Waller, November 27. 1929, IWA. Most probably Dirac redid the calculation immediately after sending the letter dated on the 18. November to Waller, since by November 24. Niels Bohr already heard from Gamow that Dirac “had made progress with the mastering of the hitherto unsolved difficulties in your [Dirac’s] theory of the electron.” Bohr to Dirac, November 24., Archive for the history of Quantum Physics, Bohr Scientific Correspondence, MF 9 [BSC].
As at the time intermediate states were still thought of as falling under the reign of the Pauli exclusion principle, Dirac actually had to rethink the exact process which had led him to ponder about negative energy states: The negative energy states were filled and, therefore, first a negative energy electron had to be lifted to a positive energy level and then the initial electron would fill the hole created in the first step. As always, the order of emission and absorption had to be permuted. This replacement of events was communicated by Dirac in correspondence to Bohr and Waller and it was published in his first paper on the Dirac Sea. Subsequently, Igor Tamm (1930a) showed that such a work-around to the Pauli exclusion principle can be found also for more complex scattering processes and therefore proved that the exclusion principle can be neglected for this kind of processes.
At first, Dirac conceived of holes in the distribution of negative energy electrons as protons. Only after severe criticism, which was also partly based on the conception of scattering in radiation theory, would he retract this idea and propose to conceive of a hole as an anti-electron. For brevity’s sake, I will not go into the discussion of this episode.
Waller to Dirac, November 2. 1929, PAMDP, Box 23, Folder 3. Emphasis added.
Draft of the letter from Waller to Dirac November 2. 1929, IWA.
Waller to Dirac, November 26. 1929, PAMDP, Box 23, Folder 3.
See, for example, Kragh (1990, 90–95) for a discussion of the correspondence that started on November 24. 1929 and lasted until late December of the same year.
Dirac to Bohr, December 9. 1929, BSC, MF 9. A nearly equivalent statement can be found in the publication introducing the Dirac sea (Dirac 1930b, 365).
Dirac to Bohr, December 9. 1929, BSC, MF 9. While I did not consider the introduction of the verbal exposition as due to Dirac’s recourse to the “principle of identification” as laid out in Section 1.3, at this point I would explain the relevance of negative energy states by Dirac’s recourse to the said principle Kragh (1990, 273), using a slightly different kind of reasoning, has already indicated that such is the case). From the theoretical necessity of invoking negative energy states in the explanation of an in principle observable effect, the necessity of including these states in the physical framework was concluded.
For the relation between Dirac and Tamm, see the introduction to their correspondence by Kojevnikov (1993).
Tamm to Dirac, February 5. 1930, as cited in Kojevnikov (1993, 30).
“Die Streuung eines Lichtquants durch die Materie besteht nach der Diracschen Theorie bekanntlich aus einer Folge von zwei Elementarprozessen, nämlich aus der Absorption des einfallenden und der Emission des gestreuten Lichtquants.” Tamm (1930a, p. 705, emphasis added).
For example, higher charge and spin states of proton and neutron were proposed by Heitler (1940, first communication) and Heitler and Ma (1940, detailed calculation) to dampen the scattering coefficient of mesons and the divergence of the magnetic moments of nucleons. Both effects were at the time conceived of as due to a sequence of virtual meson emission and absorption processes. Homi Bhabha independently suggested higher charge states of proton and neutron and noted in 1940 that “if these postulated states merely remained as hypothetical intermediate states, they would be uninteresting. However, there are a number of processes by which a proton of charge 2e or $-e$ could be produced in the free state” (Bhabha 1940, 354).
See, for example, https://www.physik.uzh.ch/groups/serra/RareDecays.html [10/25/2021].
The occurrence of new states was forced on Waller, Dirac, Heisenberg and Tamm by the derivation. It was voluntarily introduced by Bhabha and Heitler, but to explain existing discrepancies between experiment and theory (or to solve problems in theoretical consistency). Finally, it is introduced in contemporary physics not to explain existing empirical riddles, but to suggest possible experiments and possible deviations from the known results.
This is not to say that anyone who uses the above kind of argument considers PST or its modern equivalent realistically.
For the semi-classical descriptions of the scattering from a free electron à la Klein and Nishina intermediate states do not occur, as was also noticed by Igor Tamm (1930b, 546). But for dispersion theoretic accounts, as for example in the Kramers–Heisenberg formula, also for semi-classical accounts intermediate states occur. This is the basis of Waller’s comments as quoted in the prior subsection.
For Heisenberg and Pauli, there was the further obstacle that the notion of a longitudinal photon, which we can identify today in the intermediate states of their formulas, was not yet fully formed.
Weisskopf to Heisenberg, October 24. 1934; as cited in Pauli (1985, 350–351). The difference in the self-energies, according to Weisskopf, is due to the difference of the possible occupations of the intermediate states in the two cases. Heisenberg first considered the two energies as equivalent while Weisskopf argued that they are not.
Heitler’s focus on concrete problems and the utilization of theory has both been noted by Cassidy (1981) and Roqué (1997, especially Section 4). For Fermi’s approach towards quantum theory I simply want to quote Telegdi (1997, 42): “Fermi’s way of teaching and thinking about quantum mechanics deserves special mention. His attitude was entirely pragmatic.”
Bohr’s, Pauli’s, and Heisenberg’s taste for philosophy and their strong intellectual connection are well-known and part of nearly any biographical treatise about these scientists. Weisskopf’s road over Copenhagen to become Pauli’s assistant in Zürich has shortly been outlined above. Léon Rosenfeld’s strong connection to Niels Bohr and the Copenhagen institute during the 1930s is discussed in Chapter 2 of Rosenfeld’s appropriately titled biography “Léon Rosenfeld. Physics, Philosophy, and Politics in the Twentieth Century” by Anja Skaar Jacobsen (2012).
At least they did not do so initially. From early 1934 onwards, Werner Heisenberg started to speak about such structures more regularly in correspondence and publication. This coincides with the start of the work on light-by-light scattering by his students, Hans Euler and Bernhard Kockel.
“Tout d’abord, les états d’énergie négative jouent un rôle important, bien que purement formel, dans la théorie de la dispersion. On sait que la formule de dispersion [...] comporte une sommation portant sur tous les états d’énergie dont est susceptible le corps diffusant. One peut dire d’une manière imagée, mais qui n’est pas justifiée par les préceptes généraus de l’interprétation physicque de la méchanique quantique, que le phénomène de la diffusion consiste en une double transition du corps diffusant [...].” Rosenfeld (1932, 56; emphasis added).
Possibly, but this needs further reflection, the different ontological readings of PST might be connected or could be interpreted in the context of an inference to best explanation argument in the sense that the physical reading by Dirac and Tamm was due to the necessity of including intermediate states in the theoretical explanation of scattering. I would like to thank Adrian Wüthrich for suggesting this possibility.
Just to name a view of these works: light-by-light scattering (Euler 1936); the Delbrück effect, as discussed by Akhiezer and Pomeranchuck (1937); third-order effects, as discussed by Kockel (1937); or corrections to the self-energy, discussed by Weisskopf (1939, Section IV) and Mercier (1939). Furry’s theorem (Furry 1937) was first proven through the invocation of PST.
“méthode [...] des états intermédiares,” Mercier (1939, 68).
The process of the scattering of light from a pair of electrons to second order (Tamm 1930a) or the creation of two pairs from three photons (Kockel 1937) are such examples. The fact that Tamm and Kockel found that such processes do not occur only after describing the process in a mathematical language can be conceived of as further proof of the (partial) independence of the mathematical and the verbal representation.
E.g. the exclusion principle can be completely neglected for the intermediate states in Feynman’s version, while for PST as practiced during the 1930s this was the case only for specific processes (see Kockel (1937, 168)). Furthermore, Feynman’s description is based on off-shell particles, not obeying the relativistic energy–momentum relation, while PST is built on energy non-conserving transitions.

References

Akhiezer, A., and I. Pomeranchuck. 1937. Kohaerente Streuung von Gamma-Strahlen durch Kerne. Physikalische Zeitschrift der Sowjetunion 11: 478–498.
Google Scholar
Amaldi, E. 1929. Sulla teoria quantistica dell’effetto Raman. Rendiconti Accademia Nazionale dei Lincei. Classe di Scienze Fisiche, Matematiche e Naturali 9: 876–881.
MATH Google Scholar
Amaldi, E., and E. Segré. 1929. Sulla teoria dell’effetto Raman. Rendiconti Accademia Nazionale dei Lincei. Classe di Scienze Fisiche, Matematiche e Naturali 9: 407–409.
MATH Google Scholar
Bhabha, H. 1940. On elementary heavy particles with any integral charge. Proceedings of the Indian Academy of Science 11: 347–368. https://doi.org/10.1007/BF03051350.
Article MATH Google Scholar
Blum, A. 2017. The state is not abolished, it withers away: How quantum field theory became a theory of scattering. Studies in History and Philosophy of Science Part B: Studies in History and Philosophy of Modern Physics 60: 46–80. https://doi.org/10.1016/j.shpsb.2017.01.004.
Article MathSciNet MATH Google Scholar
Born, M. 1927. Das Adiabatenprinzip in der Quantenmechanik. Zeitschrift für Physik 40 (3–4): 167–192. https://doi.org/10.1007/BF01400360.
Article MATH Google Scholar
Born, M. 1938. Some philosophical aspects of modern physics. Proceedings of the Royal Society of Edinburgh 57: 1–18. https://doi.org/10.1017/S0370164600013614.
Article MATH Google Scholar
Born, M., W. Heisenberg, and P. Jordan. 1926. Zur Quantenmechanik. II. Zeitschrift für Physik 35 (8–9): 557–615. https://doi.org/10.1007/BF01379806.
Article MATH Google Scholar
Born, M., and P. Jordan. 1930. Elementare Quantenmechanik. Berlin: Springer.
Book MATH Google Scholar
Breit, G. 1932. Quantum Theory of Dispersion (I). Reviews of Modern Physics 4 (3): 504–576. https://doi.org/10.1103/RevModPhys.4.504.
Article MATH Google Scholar
Bromberg, J. 1977. Dirac’s quantum electrodynamics and the wave-particle duality. In: History of Twentieth Century Physics, Number 57 in Proceedings of the International School of Physics, ed. C. Weiner, “Enrico Fermi”, 147–157. New York, London: Academic Press.
Brown, J.R. 2018. How Do Feynman Diagrams Work? Perspectives on Science 26 (4): 423–442. https://doi.org/10.1162/posc_a_00281.
Article Google Scholar
Carson, C. 1996. The peculiar notion of exchange forces-II: From nuclear forces to QED, 1929–1950. Studies in History and Philosophy of Science Part B: Studies in History and Philosophy of Modern Physics 27 (2): 99–131. https://doi.org/10.1016/1355-2198(95)00003-8.
Article MathSciNet MATH Google Scholar
Cassidy, D.C. 1981. Comic ray showers, high energy physics, and quantum field theories: Programmatic interactions in the 1930s. Historical Studies in the Physical Sciences 12 (1): 1–39. https://doi.org/10.2307/27757488.
Article Google Scholar
Darrigol, O. 1986. The origin of quantized matter waves. HSPS 16 (2): 197–253. https://doi.org/10.2307/27757565.
Article Google Scholar
Darrigol, O. 1993. From c-Numbers to q-Numbers. The Classical Analogy in the History of Quantum Theory. Berkeley: University of California Press.
Google Scholar
Dirac, P.A.M. 1926. On the theory of quantum mechanics. Proceedings of the Royal Society of London . Series A, Containing Papers of a Mathematical and Physical Character 112 (762): 661–677. https://doi.org/10.1098/rspa.1926.0133.
Article MATH Google Scholar
Dirac, P.A.M. 1927a. The physical interpretation of the quantum dynamics. Proceedings of the Royal Society of London. Series A, Containing Papers of a Mathematical and Physical Character 113 (765): 621–641. https://doi.org/10.1098/rspa.1927.0012.
Dirac, P.A.M. 1927b. The Quantum Theory of Dispersion. Proceedings of the Royal Society of London A: Mathematical, Physical and Engineering Sciences 114 (769): 710–728. https://doi.org/10.1098/rspa.1927.0071.
Dirac, P.A.M. 1927c. The Quantum Theory of the Emission and Absorption of Radiation. Proceedings of the Royal Society of London. Series A, Containing Papers of a Mathematical and Physical Character 114 (767): 243–265. https://doi.org/10.1098/rspa.1927.0039.
Dirac, P.A.M. 1928. The Quantum Theory of the Electron. Proceedings of the Royal Society of London A: Mathematical, Physical and Engineering Sciences 117 (778): 610–624. https://doi.org/10.1098/rspa.1928.0023.
Article MATH Google Scholar
Dirac, P.A.M. 1930a. The Principles of Quantum Mechanics. The International Series of Monographs on Physics. Oxford: Clarendon Press.
Dirac, P.A.M. 1930. A Theory of Electrons and Protons. Proceedings of the Royal Society of London A: Mathematical, Physical and Engineering Sciences 126 (801): 360–365. https://doi.org/10.1098/rspa.1930.0013.
Article MATH Google Scholar
Dirac, P.A.M. 1931. Quantised Singularities in the Electromagnetic Field. Proceedings of the Royal Society A: Mathematical, Physical and Engineering Sciences 133 (821): 60–72. https://doi.org/10.1098/rspa.1931.0130.
Article MATH Google Scholar
Dirac, P.A.M. 1932. Relativistic Quantum Mechanics. Proceedings of the Royal Society of London A: Mathematical, Physical and Engineering Sciences 136 (829): 453–464. https://doi.org/10.1098/rspa.1932.0094.
Article MATH Google Scholar
Dirac, P.A.M. 1942. Bakerian Lecture—The physical interpretation of quantum mechanics. Proceedings of the Royal Society of London. Series A. Mathematical and Physical Sciences 180 (980): 1–40. https://doi.org/10.1098/rspa.1942.0023.
Article MathSciNet MATH Google Scholar
Duncan, A. 2012. The Conceptual Framework of Quantum Field Theory. Oxford: Oxford University Press.
Book MATH Google Scholar
Duncan, A., and M. Janssen. 2009. From Canonical Transformations to Transformation Theory, 1926–1927: The Road to Jordan’s Neue Begründung. Studies in History and Philosophy of Science Part B: Studies in History and Philosophy of Modern Physics 40 (4): 352–362. https://doi.org/10.1016/j.shpsb.2009.07.001.
Article MathSciNet MATH Google Scholar
Ehberger, M. 2020. I’m Not There. Or: Was the Virtual Particle Ever Born? In Biographies in the History of Physics. Actors, Objects, Institutions., eds. Forstner, C. and M. Walker, 261–280. Cham: Springer Nature Switzerland. https://doi.org/10.1007/978-3-030-48509-2_15.
Euler, H. 1936. Über die Streuung von Licht an Licht nach der Diracschen Theorie. Annalen der Physik 418 (5): 398–448. https://doi.org/10.1002/andp.19364180503.
Article MATH Google Scholar
Farmelo, G. 2009. The Strangest Man. The Hidden Life of Paul Dirac, Quantum Genius. London: Faber and Faber.
Feest, U. 2010. Concepts as Tools in the Experimental Generation of Knowledge in Cognitive Neuropsychology. Spontaneous Generations: A Journal for the History and Philosophy of Science 4 (1): 173–190. https://doi.org/10.4245/sponge.v4i1.11938.
Article Google Scholar
Fermi, E. 1932. Quantum Theory of Radiation. Reviews of Modern Physics 4 (1): 87–132. https://doi.org/10.1103/RevModPhys.4.87.
Article MATH Google Scholar
Fermi, E. 1950. Nuclear Physics: A Course Given by Enrico Fermi at the University of Chicago. Chicago: University of Chicago Press.
MATH Google Scholar
Feynman, R.P. 1949. Space-Time Approach to Quantum Electrodynamics. Physical Review 76 (6): 769–789. https://doi.org/10.1103/PhysRev.76.769.
Article MathSciNet MATH Google Scholar
Fox, T. 2008. Haunted by the Spectre of Virtual Particles: A Philosophical Reconsideration. Journal for General Philosophy of Science. https://doi.org/10.1007/s10838-008-9066-9.
Article Google Scholar
Frenkel, J. 1929. The Quantum Theory of the Absorption of Light. Nature 124 (3133): 758–759. https://doi.org/10.1038/124758b0.
Article MATH Google Scholar
Furry, W.H. 1937. A Symmetry Theorem in the Positron Theory. Physical Review 51 (2): 125–129. https://doi.org/10.1103/PhysRev.51.125.
Article MATH Google Scholar
Grandin, K. 2008. Intermediate Theoretical Physics. In Aurora Torealis: Studies in the History of Science and Ideas in Honor of Tore Frängsmyr, ed. M. Beretta, K. Grandin, and S. Lindqvist, 193–214. Sagamore Beach: Science History Pbulications.
Google Scholar
Göppert, M. 1929. Über die Wahrscheinlichkeit des Zusammenwirkens zweier Lichtquanten in einem Elementarakt. Naturwissenschaften 17 (48): 932–932. https://doi.org/10.1007/BF01506585.
Article MATH Google Scholar
Göppert-Mayer, M. 1931. Über Elementarakte mit zwei Quantensprüngen. Annalen der Physik 401 (3): 273–294. https://doi.org/10.1002/andp.19314010303.
Article MATH Google Scholar
Halpern, O. 1933. Scattering Processes Produced by Electrons in Negative Energy States. Physical Review 44 (10): 855–856. https://doi.org/10.1103/PhysRev.44.855.2.
Article Google Scholar
Heisenberg, W., and W. Pauli. 1929. Zur Quantendynamik der Wellenfelder. Zeitschrift für Physik 56 (1–2): 1–61. https://doi.org/10.1007/BF01340129.
Article MATH Google Scholar
Heisenberg, W., and W. Pauli. 1930. Zur Quantentheorie der Wellenfelder. II. Zeitschrift für Physik 59 (3–4): 168–190. https://doi.org/10.1007/BF01341423.
Article MATH Google Scholar
Heitler, W. 1936. The Quantum Theory of Radiation, 1st ed. Oxford: Clarendon Press.
MATH Google Scholar
Heitler, W. 1940. Scattering of Mesons and the Magnetic Moments of Proton and Neutron. Nature 145: 29–30. https://doi.org/10.1038/145029b0.
Article Google Scholar
Heitler, W., and S.T. Ma. 1940. Inner Excited States of the Proton and Neutron. Proceedings of the Royal Society of London A: Mathematical, Physical and Engineering Sciences 176 (966): 368–397. https://doi.org/10.1098/rspa.1940.0095.
Article MATH Google Scholar
Herzfeld, K.F. 1924. Versuch einer quantenhaften Deutung der Dispersion. Zeitschrift för Physik 23 (1): 341–360. https://doi.org/10.1007/BF01327600.
Article Google Scholar
Jacobsen, A.S. 2012. Léon Rosenfeld. Physics, Philosophy, and Politics in the Twentieth Century. Singapore: World Scientific Publishing.
Book Google Scholar
Jost, R. 1972. Foundation of quantum field theory. In Aspects of Quantum Theory, ed. A. Salam and E.P. Wigner, 61–78. Cambridge: Cambridge University Press.
Google Scholar
Jähnert, M. 2019. Practicing the Correspondence Principle in the Old Quantum Theory. A Transformation through Implementation. Cham: Springer Nature Switzerland.
Book MATH Google Scholar
Kaiser, D. 2005. Drawing Theories Apart: The Dispersion of Feynman Diagrams in Postwar Physics. Chicago: The University of Chicago Press.
Book MATH Google Scholar
Klein, O. 1927. Elektrodynamik und Wellenmechanik vom Standpunkt des Korrespondenzprinzips. Zeitschrift für Physik A Hadrons and nuclei 41 (6): 407–442. https://doi.org/10.1007/BF01400205.
Article MATH Google Scholar
Klein, O., and Y. Nishina. 1929. Über die Streuung von Strahlung durch freie Elektronen nach der neuen relativistischen Quantendynamik von Dirac. Zeitschrift für Physik 52 (11): 853–868. https://doi.org/10.1007/BF01366453.
Article MATH Google Scholar
Kockel, B. 1937. Über einige Mehrfachprozesse zwischen Elektronen. Positronen und Lichtquanten. Zeitschrift für Physik 107 (3–4): 153–176. https://doi.org/10.1007/BF01330357.
Article MATH Google Scholar
Kojevnikov, A.B. 1993. Paul Dirac and Igor Tamm Correspondence Part I: 1928–1933. München: Max-Planck-Institut für Physik.
Kragh, H. 1981. The genesis of Dirac’s relativistic theory of electrons. Archive for History of Exact Sciences 24 (1): 31–67. https://doi.org/10.1007/BF00327714.
Article MathSciNet MATH Google Scholar
Kragh, H. 1990. Dirac. A Scientific Biography. Cambridge: Cambridge University Press.
Google Scholar
Kramers, H.A., and W. Heisenberg. 1925. Über die Streuung von Strahlung durch Atome. Zeitschrift für Physik 31 (1): 681–708. https://doi.org/10.1007/BF02980624.
Article MATH Google Scholar
Lacki, J., H. Ruegg, and V. Telegdi. 1999. The Road to Stueckelberg’s Covariant Perturbation Theory as Illustrated by Successive Treatments of Compton Scattering. Studies in History and Philosophy of Science Part B: Studies in History and Philosophy of Modern Physics 30 (4): 457–518.
Article MathSciNet MATH Google Scholar
Landau, L., and R. Peierls. 1931. Erweiterung des Unbestimmtheitsprinzips für die relativistische Quantentheorie. Zeitschrift für Physik 69 (1): 56–69. https://doi.org/10.1007/BF01391513.
Article MATH Google Scholar
Landsberg, G., and L. Mandelstam. 1928. Eine neue Erscheinung bei der Lichtzerstreuung. Die Naturwissenschaften 16 (28): 557–558. https://doi.org/10.1007/BF01506807.
Article Google Scholar
MacLeod, M. 2012. Rethinking Scientific Concepts for Research Contexts: The Case of the Classical Gene. In Scientific Concepts and Investigative Practice, eds. Steinle, F., and U. Feest, Number 3 in Berlin Studides in Knowledge Research, 47–74. Berlin: De Gruyter. https://doi.org/10.1515/9783110253610.47.
Mandl, F., and G. Shaw. 1984. Quantum field theory. Chichester: Wiley.
MATH Google Scholar
Masters, B.R. 2013. The Origins of Maria Göppert’s Dissertation on Two-Photon Quantum Transitions at Göttingen’s Institutes of Physics 1920–1933. In Traditions and Transformations in the History of Quantum Physics. HQ-3: Third International Conference on the History of Quantum Physics, Berlin, June 28- July 2, 2010., eds. Renn, J., S. Katzir, and C. Lehner, 209–230. Berlin: Edition Open Access. https://doi.org/10.34663/9783945561225-10.
Mercier, A. 1939. Sur l’énergie propre de l’électron. Helvetica Physica Acta 12 (1): 55–75. https://doi.org/10.5169/seals-110933.
Article MATH Google Scholar
Moyer, D.F. 1981. Origins of Dirac’s electron, 1925–1928. American Journal of Physics 49 (10): 944–950. https://doi.org/10.1119/1.12595.
Article Google Scholar
Oldofredi, A., and M. Esfeld. 2019. Observability, Unobservability and the Copenhagen Interpretation in Dirac’s Methodology of Physics. Quanta 8: 68–87. https://doi.org/10.12743/quanta.v8i1.93.
Article MathSciNet MATH Google Scholar
Oppenheimer, J.R. 1930. On the Theory of Electrons and Protons. Physical Review 35 (5): 562–563. https://doi.org/10.1103/PhysRev.35.562.
Article Google Scholar
Pais, A. 1986. Inward Bound. Of Matter and Forces in the Physical World. New York: Oxford University Press.
Book Google Scholar
Pais, A. 1987. Playing with equations, the Dirac way. In In Reminiscences about a Great Physicist: Paul Adrien Maurice Dirac, ed. B.N. Kursunoglu and E.P. Wigner, 93–116. Cambridge: Cambridge University Press.
Google Scholar
Passon, O. 2019. On the interpretation of Feynman diagrams, or, did the LHC experiments observe H to Gamma Gamma? European Journal for Philosophy of Science 9 (2): 20. https://doi.org/10.1007/s13194-018-0245-1.
Article MathSciNet MATH Google Scholar
Pauli, W. 1925. Ueber die Intensitäten der im elektrischen Feld erscheinenden Kombinationslinien. Kgl. Danske Videnskabernes Selskab. Mathematisk-fysiske Meddelelser. 7 (3): 1–20.
Google Scholar
Pauli, W. 1926. Quantentheorie, In Quanten, eds. Geiger, H., and K. Scheel, Vol. 13, Handbuch der Physik, 1–278. Berlin: Springer. https://doi.org/10.1007/978-3-642-99593-4_1.
Pauli, W. 1979. Wissenschaftlicher Briefwechsel mit Bohr, Einstein, Heisenberg u.a. Band I: 1920-1929, eds. Hermann, A., K. v. Meyenn, and V.F. Weisskopf. Number 2 in Sources in the History of Mathematics and Physical Sciences. New York: Springer.
Pauli, W. 1985. Wissenschaftlicher Briefwechsel mit Bohr, Einstein, Heisenberg u.a. Band II: 1930–1939, eds. v. Meyenn, K., with the Cooperation of Hermann, A. and V. F. Weisskopf. Number 6 in Sources in the History of Mathematics and Physical Sciences. Berlin: Springer.
Peskin, M.E., and D.V. Schroeder. 1995. An introduction to Quantum Field Theory. Boulder: Westview Press.
Google Scholar
Raman, C.V. 1928. A new radiation. Proceedings of the Indian Academy of Science 2: 387–398.
Google Scholar
Raman, C.V. 1929. Part II.-The Raman effect. Investigation of molecular structure by light scattering. Transactions of the Faraday Society 25: 781–792. https://doi.org/10.1039/TF9292500781.
Article Google Scholar
Rasetti, F. 1929. On the Raman Effect in Diatomic Gases. Proceedings of the National Academy of Sciences of the United States of America 15 (3): 234–237. https://doi.org/10.1038/123205a0.
Article MATH Google Scholar
Roqué, X. 1997. The manufacture of the positron. Studies in History and Philosophy of Science Part B: Studies in History and Philosophy of Modern Physics 28 (1): 79–129. https://doi.org/10.1016/S1355-2198(96)00021-4.
Article MathSciNet MATH Google Scholar
Rosenfeld, L. 1932. La théorie quantique des champs. Annales de l’institut Henri Poincaré 2 (1): 25–91.
MathSciNet MATH Google Scholar
Schrödinger, E. 1926. Quantisierung als Eigenwertproblem. Vierte Mitteilung. Annalen der Physik 386 (18): 109–139. https://doi.org/10.1002/andp.19263861802.
Article MATH Google Scholar
Schweber, S.S. 1994. QED and the men who made it: Dyson, Feynman, Schwinger, and Tomonaga. Princeton: Princeton University Press.
Book MATH Google Scholar
Singh, R. 2002. C. V. Raman and the Discovery of the Raman Effect. Physics in Perspective 4: 399–420. https://doi.org/10.1007/s000160200002.
Article MATH Google Scholar
Slater, J.C. 1927. Radiation and Absorption on Schrödinger’s Theory. Proceedings of the National Academy of Sciences of the United States of America 13 (1): 7–12. https://doi.org/10.1073/pnas.13.1.7.
Article MATH Google Scholar
Smekal, A. 1923. Zur Quantentheorie der Dispersion. Die Naturwissenschaften 11 (43): 873–875. https://doi.org/10.1007/BF01576902.
Article Google Scholar
Smekal, A. 1925. Über metastationäre Atom- und Molekölzustände. Zeitschrift für Physik 34 (1): 81–93. https://doi.org/10.1007/BF01328457.
Article MathSciNet MATH Google Scholar
Soler, L., S. Zwart, V. Israel-Jost, and M. Lynch. 2014. Introduction. In Science After the Practice Turn in the Philosophy, History, and the Social Studies of Science, eds. Soler, L., S. Zwart, M. Lynch, and V. Israel-Jost, Number 14 in Routledge Studies in the Philosophy of Science, 1–43. New York: Routledge. https://doi.org/10.4324/9781315857985.
Steinle, F. 2012. Goals and Fates of Concepts: The Case of Magnetic Poles. In Scientific Concepts and Investigative Practice, eds. Steinle, F., and U. Feest, Number 3 in Berlin Studides in Knowledge Research, 105–126. Berlin: De Gruyter. https://doi.org/10.1515/9783110253610.105.
Tamm, I. 1930a. Eine Bemerkung zur Diracschen Theorie der Lichtzerstreuung und Dispersion. Zeitschrift für Physik 65 (9): 705–708. https://doi.org/10.1007/BF01391168.
Tamm, I. 1930b. Über die Wechselwirkung der freien Elektronen mit der Strahlung nach der Diracsehen Theorie des Elektrons und nach der Quantenelektrodynamik. Zeitschrift für Physik 62 (7–8): 545–568. https://doi.org/10.1007/BF01339679.
Telegdi, V.L. 1997. Enrico Fermi in America. Physics Today 55 (6): 38–43. https://doi.org/10.1063/1.1496374.
Article Google Scholar
Vaihinger, H. 1911. Die Philosophie des Als Ob. System der theoretische, praktischen und religiösen Fiktionen der Menschheit auf Grund eines idealisitschen Positivismus; mit einem Anhang über Kant und Nietzsche. Berlin: Reuther und Reichard.
van Vleck, J.H. 1929. On the Vibrational Selection Principles in the Raman Effect. Proceedings of the National Academy of Sciences 15 (9): 754–764. https://doi.org/10.1073/pnas.15.9.754.
Article MATH Google Scholar
Weingard, R. 1982. Do Virtual Particles Exist? PSA: Proceedings of the Biennial Meeting of the Philosophy of Science Association 1982: 235–242.
Google Scholar
Weisskopf, V. 1934. Über die Selbstenergie des Elektrons. Zeitschrift für Physik 89 (1–2): 27–39. https://doi.org/10.1007/BF01333228.
Article MATH Google Scholar
Weisskopf, V. 1939. On the Self-Energy and the Electromagnetic Field of the Electron. Physical Review 56 (1): 72–85. https://doi.org/10.1103/PhysRev.56.72.
Article MATH Google Scholar
Wentzel, G. 1927. Über strahlungslose Quantensprünge. Zeitschrift für Physik 43 (8): 524–530. https://doi.org/10.1007/BF01397631.
Article MATH Google Scholar
Wentzel, G. 1937. Zur Theorie der Beta-Umwandlung und der Kernkräfte. I. Zeitschrift für Physik 104 (1–2): 34–47. https://doi.org/10.1007/BF01337892.
Article MATH Google Scholar
Wright, A.S. 2016. A beautiful sea: P. A. M. Dirac’s epistemology and ontology of the vacuum. Annals of Science 73 (3): 225–256. https://doi.org/10.1080/00033790.2016.1157731.
Article MATH Google Scholar
Wüthrich, A. 2010. The genesis of Feynman Diagrams. Number 28 in Archimedes. Dordrecht: Springer.

Download references

Acknowledgements

This work was supported by the research unit “The Epistemology of the LHC,” funded by the Deutsche Forschungsgemeinschaft (DFG) under the grant number FOR 2063. I especially want to thank Friedrich Steinle and Adrian Wüthrich for constant and constructive input on the different versions of this article. I want to further thank the participants of the research colloquium of the Final Theory Program MPI-Research group (led by Alexander Blum) and the participants of the workshop Virtual Entities in Science for the fruitful discussion of and comments on presentations of this paper. Finally, thanks goes to Rory Grennan and the team of the Special Collections and Archives at the Florida State University Library as well as Karl Grandin and the team of the Center for History of Science at the Royal Swedish Academy of Science for the uncomplicated provision of digital copies of the archival material on which part of this article is based.

Funding

Open Access funding enabled and organized by Projekt DEAL.

Author information

Authors and Affiliations

Institute of History and Philosophy of Science, Technology, and Literature, Technische Universität Berlin, Straße des 17. Juni 135, Berlin, 10623, Germany
Markus Ehberger

Authors

Markus Ehberger
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Markus Ehberger.

Ethics declarations

Conflict of interest

There is no conflict of interest.

Additional information

Communicated by Olivier Darrigol.

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

The quote in the title is taken from Otto Halpern’s initial proposal of spontaneous light quantum splitting and light-by-light scattering (Halpern 1933, 856)

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Ehberger, M. “The language of Dirac’s theory of radiation”: the inception and initial reception of a tool for the quantum field theorist. Arch. Hist. Exact Sci. 76, 531–571 (2022). https://doi.org/10.1007/s00407-022-00293-8

Download citation

Received: 21 February 2022
Published: 05 July 2022
Issue Date: November 2022
DOI: https://doi.org/10.1007/s00407-022-00293-8

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

“The language of Dirac’s theory of radiation”: the inception and initial reception of a tool for the quantum field theorist

Abstract

Similar content being viewed by others

Quantum Electrodynamics

Quantum Field Theory Beyond the S-Matrix Formalism

Thermal Field Theory in real-time formalism: concepts and applications for particle decays

1 Dirac’s verbal model for the scattering of light

1.1 Dirac’s technical and conceptual background

1.1.1 Time-dependent perturbation theory (Dirac 1926)

1.1.2 Radiation theory (Dirac 1927b, c)

1.1.3 “Dirac’s Mogelei” or how to arrive at sensible results in radiation theory

1.2 Dirac’s quantum theory of dispersion (Dirac 1927b)

1.3 So, why was it Dirac who introduced PST?

1.3.1 The careful introduction of PST

1.3.2 Dirac’s philosophical stance towards quantum theory

1.4 An interim conclusion

2 The impact of PST: representational alterations and (meta-)physical implications

2.1 Alterations in diagrammatical representations: the inclusion of an ordered sequence of events

2.2 “A certain degree of reality”^{Footnote 63}: PST between formal and physical

2.2.1 Virtual negative energy states: beyond the purely formal

2.2.2 The physicality of intermediate states

2.2.3 Non-invocation and refutation: physicists close to Copenhagen

3 Reflection and outlook

Change history

31 October 2022

Notes

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Navigation

“The language of Dirac’s theory of radiation”: the inception and initial reception of a tool for the quantum field theorist

Abstract

Similar content being viewed by others

Quantum Electrodynamics

Quantum Field Theory Beyond the S-Matrix Formalism

Thermal Field Theory in real-time formalism: concepts and applications for particle decays

1 Dirac’s verbal model for the scattering of light

1.1 Dirac’s technical and conceptual background

1.1.1 Time-dependent perturbation theory (Dirac 1926)

1.1.2 Radiation theory (Dirac 1927b, c)

1.1.3 “Dirac’s Mogelei” or how to arrive at sensible results in radiation theory

1.2 Dirac’s quantum theory of dispersion (Dirac 1927b)

1.3 So, why was it Dirac who introduced PST?

1.3.1 The careful introduction of PST

1.3.2 Dirac’s philosophical stance towards quantum theory

1.4 An interim conclusion

2 The impact of PST: representational alterations and (meta-)physical implications

2.1 Alterations in diagrammatical representations: the inclusion of an ordered sequence of events

2.2 “A certain degree of reality”Footnote 63: PST between formal and physical

2.2.1 Virtual negative energy states: beyond the purely formal

2.2.2 The physicality of intermediate states

2.2.3 Non-invocation and refutation: physicists close to Copenhagen

3 Reflection and outlook

Change history

31 October 2022

Notes

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Search

Navigation

2.2 “A certain degree of reality”^{Footnote 63}: PST between formal and physical