We said at the beginning of Sect. 1 that on our account of duality, two dual theories share a common core, which is itself a theory, the bare theory—and that they are isomorphic models of it. So to explain that account, we first need to explain our use of the terms ‘theory’ and ‘model’: which, after all, have various (conflicting!) uses. We do this in Sect. 2.1. This leads to: distinguishing within a model, what we will call the model root (in most cases: a model triple) from the specific structure (Sect. 2.2); and giving examples of this distinction (Sect. 2.3). Finally, we introduce notation for the values of physical quantities on states (Sect. 2.4). Then we will be ready to define duality as an isomorphism of model roots: details in Sect. 3.
‘Theory’ vs. ‘model’: interpretation
For the words ‘theory’ and ‘model’, the first point to make—the main idiosyncrasy in our usage—arises from the fact that since a duality compares two theories, it involves two “levels”: the common core “above” (more general and-or more abstract) and the theories “below” (more specific and-or more concrete). On the other hand, one naturally thinks of a theory as general and-or abstract, “standing above”, and “being common to”, its more specific and-or concrete models. So in order to discuss duality, the question arises: should we allocate the word ‘theory’ to the common core, or to each of the two dual theories?
We opt for the former. And since the two dual theories (so-called!) are indeed specific and-or concrete realizations (versions, formulations) of this common core that we have opted to call a ‘theory’, we also co-opt the word ‘model’ for the duals themselves. So the overall effect of our jargon is to lift the use of the words ‘theory’ and ‘model’, “one level up”.
As a simple example to illustrate our usage, consider position-momentum duality in elementary Schrödinger-picture quantum mechanics. Although the position and momentum representations might have been discovered, in a counterfactual history, as different “theories” (since the Schrödinger equation takes very different forms in the position and momentum bases, as do the operators and wave-functions), the discovery of the Fourier transformation reveals that these are in fact two representations of a common core theory: namely, quantum mechanics formulated in the basis-free language of Hilbert space. Thus what in the counterfactual history is thought to be two distinct “theories”, is seen to be just two models, i.e. representations, of a common structure, which we call elementary quantum mechanics.
‘Model’
So beware! This means that our use of ‘model’ rejects three common connotations of the word.Footnote 3 Namely, the connotations that a model is:
-
(i)
a solution of the theory, or a history of the system concerned (often a trajectory through the state-space), as against ‘all solutions’—which are the purview of the theory; and-or
-
(ii)
an approximation(s) to what the theory says (maybe about a specific regime or system the theory is concerned with); and-or
-
(iii)
a part of the empirical world—a hunk of reality!—that thus gives an interpretation (or part of an interpretation) of the theory.
We will indeed later be concerned with the ideas (i) to (iii): (in fact, with (iii) in this Subsection). But they are not part of our definition of ‘model’.
Instead, a model is for us a specific realization (or version or formulation) of a theory. That is: it ‘models’ (verb!) another theory “above”, which in general also has other such models. Almost always, it is a representation of the theory above, in the sense of mathematical representation theory—‘representation’ being another word with confusingly diverse uses. Recall the example, in this Section’s preamble, of position versus momentum representations.
And as we have announced: the main case for us of “the theory above” will be the common core, the bare theory, of two dual theories (our ‘models’). (It is usually best to think of the bare theory as uninterpreted, or abstract: though it may be interpreted: cf. Sect. 2.1.3). So we say the two given theories are models of the bare theory; and their being isomorphic models of the bare theory (isomorphic as regards the structure of the bare theory) is what makes them duals.
‘Theory’
As to our use of ‘theory’, the main thing to say is that it is mainstream: i.e. typical of the literature, especially the literature on the semantic conception of theories as applied to physics. Agreed: recent philosophy of science has emphasised many aspects of scientific endeavour that hardly invoke the notion of scientific theory, central though this notion was for discussions by both the positivists and their successors. For example, aspects such as experiment (calibration of instruments etc.), causation (mechanistic explanation etc.), and the social dimensions of knowledge (testimony etc.) have recently been discussed with a strong emphasis on models (in a more usual sense than ours!), rather than theories.
We agree that these aspects of scientific endeavour are important for our philosophical understanding of science (De Haro and De Regt 2018a: Sect. 1.1). But even if these aspects do not need the notion of theory, still the notion may well be useful for other aspects. Indeed, we believe it is indispensable for discussion of symmetries and dualities in physics.Footnote 4
More specifically: we can often think of a physical theory as a triple: a state-space \(\mathcal S\) (e.g. a Hilbert space in a quantum theory), a set of physical quantities \(\mathcal Q\) (almost always an algebra), and a dynamics \(\mathcal D\) that describes how the values of quantities (on states) change over time (and-or over space—so as to accommodate Euclidean theories). As we said in Sect. 1, symmetries and dualities will concern the preservation or matching of these values.Footnote 5 So we will sometimes write a theory (and of course: a model in our sense) as a triple, e.g. \(\langle \mathcal{{S}}, \mathcal{{Q}}, \mathcal{{D}} \rangle \). In any specific case, each of \(\mathcal S\), \(\mathcal Q\) and \(\mathcal D\) will of course have further structure. For example:
-
(i)
For a quantum theory, \(\mathcal S\) will usually be a Hilbert space, or a set of density matrices; for a classical theory, it will usually be a manifold, or a set of probability distributions;
-
(ii)
For almost any theory, \(\mathcal Q\) will be an algebra over the real or complex numbers, allowing quantities to be added and multiplied; and
-
(iii)
For almost any theory, the dynamics \(\mathcal D\) can be understood either in Schrödinger picture, with states changing over time and quantities fixed over time, or in Heisenberg picture (vice versa).
Two clarifications about our treatment of a theory as a triple: the first formal, the second interpretative. First: note that we said: ‘we can often think of a physical theory as a triple’, and ‘almost any’ in (ii) and (iii). For we agree that not every theory is presented, or best thought of, in this way. Theories in statistical and quantum physics are often formulated in terms of partition functions and-or path integrals with sources, and related concepts like sets of correlation functions, rather than in terms of states and quantities. And in field theories, the dynamics is often presented as field equations holding at each spacetime point—and so not naturally thought of in terms of the Schrödinger or Heisenberg pictures with their “background time”. But in this paper, we can think of theories (and models, in our sense) as such triples: all our morals will carry over to these other ways of formulating theories.Footnote 6
Second: We admit that of course, a physical theory is almost never presented to us as a tidy triple of state-space, quantities and dynamics. Almost always, the theory appears to us messier than that: more vaguely defined and-or more complicated. The triple needs to be extracted from that “mess”. Indeed, there are two points here.
-
(i)
The complicated appearance is of course in part due to those aspects such as experiment mentioned above. But this complexity, and the need to allow for such aspects (and to assess them philosophically), does not militate against extracting a triple as a concept useful for e.g. understanding symmetries.
-
(ii)
We make no claim that there is always, or even typically, a unique best definition of the triple. So presenting a theory as a triple usually involves: (a) choices about exactly what to take as the state-space etc.; and even (b) judgment about interpretative and perhaps controversial matters. We will see some examples of this variety, already in Sect. 2.2; and we will see that these choices may affect verdicts of interest, e.g. whether there is an isomorphism, or a duality.
Interpretation
This brings us to the interpretation of physical theories. A large subject! But here, we only need to report another part of our overall position, which is again mainstream. Namely, we endorse the endeavour of giving theories (and so: models in our sense) a classical referential semantics. That is: we envisage assigning references in the empirical world to appropriate elements of theories—be they states and quantities (i.e. mathematical objects) for a theory presented as a triple, or words and sentences (i.e. linguistic objects) for a theory presented in a language. This even-handedness between mathematical and linguistic objects—and thus between theories as N-tuples and as sets of sentences—is deliberate: for we endorse recent arguments against the traditional firm distinction between the semantic and syntactic conceptions of theory (Lutz 2017; Glymour 2013; van Fraassen 2014).Footnote 7
This even-handedness is one reason we call our endorsement of referential semantics ‘mainstream’. There are also two other reasons: the first is familiar in philosophy of science, and the second is familiar in philosophy of language.
First: recall that accepting such referential semantics is independent, or at least largely independent, of the debate over scientific realism. For that debate is largely about what is the right attitude to our theories, not about their semantic content. Thus the arch empiricist, van Fraassen, explicitly accepts a literal construal of the theoretical claims of—i.e. a referential semantics for—scientific theories (1980: p. 14): as do other influential positions that reject realism, such as Fine’s ‘natural ontological attitude’ (1984: pp. 96–99, 1986: p. 130).
Second: we should recall the moral of Lewis’ seminal paper, ‘Languages and Language’ (1975). Lewis begins by rehearsing a thesis and an antithesis: the task of the paper, and his moral, is to reconcile them in a synthesis—which indeed he accomplishes. Thus the thesis begins by saying that a language is an assignment ‘of meanings to certain sequences of types of sound or of marks...’ (p. 3). As this quote hints: the thesis is advocacy of a referential semantics; (indeed an intensional semantics—cf. below). The antithesis begins by saying that ‘language is a social phenomenon wherein people utter strings of vocal sounds ... and wherein people respond by thought or action to the sounds they observe to have been produced’ (ibid). As this quote hints: the antithesis is advocacy of an account of language emphasising people’s propositional attitudes (intentions, beliefs, desires etc.): both as what is communicated by language, and as what underpins that communication. Thus Lewis’ synthesis is his account of what it is for a language L, à la the thesis, to be the language used by a human population, à la the antithesis. The main idea is that this is a matter of the population having conventions (in Lewis’ sense) of truthfulness in L and trust in L. Thus he knits the thesis and antithesis together in a detailed way (and replies to various objections). He ends by saying: ‘According to the proposal I have presented, the philosophy of language is a single subject. The thesis and antithesis have been the property of different schools; but in fact they are complementary essential ingredients in any adequate account either of languages or of language’ (p. 35).
To which we say: ‘Hear, hear!’. That is: we claim a similar reconciliation between our advocating a referential semantics for scientific theories, and various lines of philosophical work that downplay, or even do not mention, reference or theories. Thus some work on some of the aspects of scientific endeavour, mentioned above, seems sceptical, not just about ‘theory’ as a useful notion, but also about reference. For example: much philosophical writing about experiment (calibration of instruments etc.) emphasizes non-linguistic skills, practices and norms; and much philosophical writing about the social dimensions of knowledge emphasizes the wider and practical world, e.g. the functioning of scientific communities and institutions in e.g. the maintenance of norms, such as accreditation, etc. Such emphases are entirely appropriate, say we. Study of experiment should of course emphasize non-linguistic skills; and so on. But such emphases in no way militate against developing a referential semantics for scientific language, and so for scientific theories. In short, we think Lewis’ synthesis gives a valuable, because irenic, perspective on this work’s relation to referential semantics.Footnote 8
So much by way of defending ourselves as having mainstream views about interpretation. The upshot is that we envisage a referential semantics using interpretation maps, i. These are maps on states or quantities (\(s \in \mathcal{S}\) etc.), or on linguistic items, assigning as values (outputs of the map) parts of the empirical world (hunks of reality!). But they are, in general, partial maps; i.e. for some arguments, the map yields no value (output). For some details and examples, see De Haro (2018b).Footnote 9
In the next Section we will contrast internal and external interpretations, both of which are interpretations in the above sense. Roughly speaking, an internal interpretation only interprets that “part” of the model that is “the homomorphic copy” of the theory, while an external interpretation can interpret all of the model.
Model roots, specific structure, and their interpretations
As hinted at the end of the previous Section, we need some more jargon and notation about the relation between a bare theory and its models that will allow us to distinguish the parts or aspects of models that are “shadows” of corresponding parts or aspects of the bare theory, from those that are not.
The jargon is clearest for the common case, which we will focus on throughout this paper: when the realization of the bare theory proceeds by (mathematical) representation, and the bare theory is a triple comprising a state-space \(\mathcal S\), a set of quantities \(\mathcal Q\) and a dynamics \(\mathcal D\): (cf. Sect. 2.1.2 and footnote 6). Although the bare theory may be interpreted (like its models, the dual theories, usually are), adding the adjective ‘bare’ signals helpfully that the theory is “above” and can be / is often uninterpreted.Footnote 10
In this common case: by the very definition of ‘representation’, the model gives a homomorphic copy of the bare theory (irrespective of there being a duality). That is: there are appropriate structure-preserving maps from the states, quantities and dynamics of the bare theory to the model’s homomorphic copy. To be precise: there is a pair of structure-preserving maps—from states in the bare theory to states in the model, and from quantities in the bare theory to quantities in the model. And there is a meshing condition on the model’s dynamics that makes it implement that of the bare theory. The details are as follows: though we can mostly take them in our stride, and just say ‘homomorphic copy’.Footnote 11
We can write the bare theory as a triple \(T = \langle \mathcal{S}, \mathcal{Q}, \mathcal{D} \rangle \); and similarly its model \(M = \langle \mathcal{S}_M, \mathcal{Q}_M, \mathcal{D}_M \rangle \), where the subscripts signal that the state-space etc. are different from that of the bare theory. If we think of the dynamics in Schrödinger style as a map on the state-space (more details in Sect. 2.4), and write the homomorphism from \(\mathcal{S}\) to \(\mathcal{S}_M\) as h, then the meshing condition on the model’s dynamics will be the commuting diagram in Fig. 1.
Once we have the distinction of levels, i.e. a bare theory represented by a model, there is an important distinction to be made within the model. (This is important irrespective of there being a duality.) Namely between:
-
(i)
The parts and aspects of the model which together express its realizing the bare theory;
-
(ii)
The parts and aspects which do not express the realization.
We will call (i) the model root. And in the common case where the bare theory, and so also the model, is a triple of states, quantities and dynamics, we will call (i) the model triple. And we will call (ii) the model’s specific structure. We can think of it as the ‘ingredients’ or ‘building blocks’ by which the representation of the bare theory, i.e. (i), is built.
There is a correlative distinction between two kinds of interpretation. Recall from Sect. 2.1.3 that an interpretation is given by interpretation maps, i.e. functions (in general, partial functions) mapping items in the theory to items in the world. Thus we say that an internal interpretation is one that only interprets the model root: it maps all of and only the model root to items in the world, regardless of the specific structure. On the other hand, an external interpretation also maps (some or all of) the specific structure to items in the world.
Of course, ingredients are present in the cooked dish, and building blocks are present in the built house. Similarly here: often, an item of specific structure is in the model root, though (by definition) it is not part of the representation of the bare theory. And in such a case an internal interpretation does not interpret the item of specific structure. (Sect. 2.3 will give examples.)
Thus the distinction between specific structure (‘building blocks’), and model root (‘what gets built’) is physically significant, in that it constrains interpretation. It is formal in that it can be stated without giving an interpretation: but it has consequences for interpretation. For more discussion, cf. De Haro (2016a: Sect. 1.1.2, 2018a: Sect. 2.2.3), De Haro and Butterfield (2018: Sect. 3.2.2), and De Haro (2018b: Sect. 2.1.2).
But we stress that even within a given model, the distinction is not ‘God-given’. It is relative to how exactly we define the bare theory, and thereby the homomorphism from it to the model. And for a physical theory, as usually presented to us informally and even vaguely, there need be no best or most natural way to make this exact definition. For recall comment (ii) at the end of Sect. 2.1.2: how exactly to present a theory as a triple of states, quantities and dynamics is a matter of choice and even judgment.
We will see this flexibility in play later, in Sect. 2.3, where we will have a choice to define the model root either as a single representation of a structure (with a further choice to include or not to include a choice of basis in the model root) or as an equivalence class of representations.
We will also see it in connection with dualities, in Sect. 2.3.2. For as we announced: we say a duality is an isomorphism of models of a bare theory; (details in Sect. 3.1). But this means: an isomorphism with respect to the structure of the bare theory—which is the structure that the models represent. Therefore the isomorphism that is the duality is an isomorphism of model roots. And in the common case of triples of states, quantities and dynamics: it is an isomorphism of model triples.Footnote 12 So the flexibility about the definition of a bare theory, and so about what a model must represent, leads to flexibility about exactly what the duality mapping is, i.e. what is the isomorphism between model roots.
It is worth having a notation distinguishing between the model root (which is usually a model triple of states etc.) and the specific structure. So given a model M, we now write m for its model root. This is usually a model triple, which we now write as \(\langle \mathcal{S}_M,\mathcal{Q}_M,\mathcal{D}_M \rangle \). Again, the subscripts signal that the state-space etc. are different from those of the bare theory. We also write \({\bar{M}}\) for the specific structure. So we write a model M of a bare theory T as
$$\begin{aligned} M =:\langle m ; {\bar{M}}\rangle = \langle \mathcal{S}_M,\mathcal{Q}_M,\mathcal{D}_M ; {\bar{M}}\rangle ~, \end{aligned}$$
(1)
where the semi-colon in the defined angle-bracket signals the contrast between root and specific structure, and the last equation just expresses the usual case of the root being a triple.
But beware: one should not think of M as just the ordered pair of independently given m and \({\bar{M}}\). For m is built by using the specific structure \({\bar{M}}\), and so it is not given independently of \({\bar{M}}\). Rather, m encodes M’s representing the bare theory T. So one might well write \(T_M\) instead of m, since having a subscript M on the right hand side of the equation \(M = \langle T_M; {\bar{M}} \rangle \) signals that M is not an ordered pair of two independently given items. In other words: the notation \(T_M\) emphasises that the model triple: (i) is a representation of T, (ii) is built from M’s specific structure viz. \({\bar{M}}\), and (iii) is itself a theory (hence the letter ‘T’).
Here is an illustration of this notation in use. As announced: we say that a duality is an isomorphism of model roots, with respect to the structure of the bare theory. So if \(M_1, M_2\) are models of a bare theory T, their being duals means: \(m_1 \cong m_2\). And in the usual case of model triples, i.e. \(m_i = \langle \mathcal{S}_{M{_i}},\mathcal{Q}_{M{_i}},\mathcal{D}_{M{_i}} \rangle \,,~i = 1,2\): this isomorphism of roots will be a matter of two appropriately meshing isomorphisms, one between the state-spaces \(\mathcal{S}_{M{_i}}\) and one between the quantity algebras \(\mathcal{Q}_{M{_i}}\). Details in Sect. 3.1.
Examples: matrix representations and Galilean transformations
In this Section, we will illustrate our notions of theory and model, and the contrast of model root vs. specific structure. Our first example (Sect. 2.3.1) is from matrix representations; our second example (Sect. 2.3.2) is about Galilean transformations in Newtonian mechanics. The latter example will also illustrate our notion of interpretation from Sect. 2.1.3, especially our internal versus external contrast from Sect. 2.2.
Matrix representations
Perhaps the simplest illustration of these notions, model root and specific structure, comes from defining the bare theory to be just an abstract group G; and as usual, taking realizations to be representations.Footnote 13 So let us consider matrix representations of G. More specifically: we consider for a finite group G, a set \(\{ M_i \}\) of \(n \times n\) complex matrices with non-zero determinant (i runs from 1 to the order of G). That is: \( M_i \in \text{ GL }(n,\mathbb {C})\). In such a representation, the choices of the size n of the matrices and of the ground-field (\(\mathbb {R}\) or \(\mathbb {C}\)) are of course parts of specific structure: for these are building blocks by which we build the homomorphic copy of G.
But even in this simple illustration, we can see further options about how exactly to define model root and specific structure. These options relate to the fact that the unitary group \(\text{ U }(n)\) acts on \(\text{ GL }(n,\mathbb {C})\), by \(M \mapsto U M U^{-1} \equiv U M U^{\dagger }\). Each \(U \in \text{ U }(n)\) sends a representation \(\{ M_i \}\) to another representation, called ‘equivalent’; and representation theory then of course concentrates on equivalence classes of representations, characterizing them in terms of their invariants, especially characters. The spirit of the enterprise is that a unitary change of basis has no mathematical significance. This means there are two main options, (A) and (B) below, about how to define the model root, and thereby, also the specific structure—and there will be further choices.Footnote 14
(A) Model root as representation We can say that a single representation \(\{ M_i \}\) is the model root. Indeed: multiplication of the matrices \(M_i\) realizes a homomorphic copy of G’s multiplication. And it is no objection that items of specific structure—the size of the matrices n, and the choice of the ground-field—are ‘in’ the matrices \(M_i\). For as we said: that an item appears in a model root does not prevent it appearing in the specific structure. After all, the model root is built from the specific structure.
On this option, two equivalent representations of G, i.e. a set of matrices \(\{ M_i \}\) and another set \(\{ U\,M_i\,U^{-1} \}\), for some fixed \(U \in \text{ U }(n)\), will be isomorphic, as homomorphic copies of G’s multiplication. So on our account of duality, they are duals. (As we will discuss in Sect. 3: this reflects our definition of duality being logically weak i.e. having many instances.)
But there is also a further choice. For the fact that the matrices realize group multiplication is independent of their acting as linear operators on the vector space \(\mathbb {C}^n\). That is: although we always think of a matrix as representing (that word again!) a linear operator, it only does so relative to a choice of basis—and the representation we began with, viz. \(h: G \rightarrow \{ M_i \}\), makes no such choice. But if we wish, we can adjoin such a choice to our model. That is: we can stipulate that the matrix representation \(\{M_i\}\) also has, as part of its specific structure, some specific basis \(\mathbf{e}_k\) (\(k=1,\ldots ,n\)) of \(\mathbb {C}^n\).
Once such a choice is made, the basis vectors \(\mathbf{e}_k\) (and thereby all vectors \(\mathbf{v}=\sum _{k=1}^nv_k\,\mathbf{e}_k\)) are of course acted upon—sent to another basis—by the similarity transformations U that act on the matrices \(M_i\). That is: \(\mathbf{e}_k\mapsto \sum _{l=1}^n U_{kl}\,\mathbf{e}_l\) when \(M_i\mapsto U\,M_i\,U^{-1}\). But the idea of the stipulation is that the first-chosen basis \(\mathbf{e}_k\) labels the representation: it fixes the interpretation of the \(\{M_i\}\) as linear operators. It is just that this labelling basis maps across to the labelling bases of equivalent representations. This illustrates the idea in footnote 12 that—to return to our jargon—specific structure can map across a duality. In this example, the duality maps a labelling basis to another basis: \(\{ \mathbf{e}_k \}\mapsto \{ \sum _{l=1}^n U_{kl}\,\mathbf{e}_l \} \) when \(M_i\mapsto U\,M_i\,U^{-1}\).
(B) Model root as equivalence class We can say that an equivalence class of representations—the entire orbit of a given \(\{ M_i \}\) under the action of \(\text{ U }(n)\)—is the model root. For indeed: equivalent representations realize the very same homomorphic copy of G’s multiplication. Again, the size of the matrices n, and the choice of the ground-field, are ‘in’ the model root (since they are preserved by equivalence). And again, it is no objection that an item in the model root is also in the specific structure: since the model root is built from the specific structure.
On option (B), there are still duals, on our account of duality—despite the option’s having “quotiented” to a more abstract notion of model than that of option (A). For we can vary n; and-or we can vary the ground-field. That is: two different choices of n can provide two model roots—the equivalence class of a representation \(\{M_i\}: M_i \in \text{ GL }(n_1,\mathbb {C})\), and the equivalence class of a representation \(\{M_j\}: M_j \in \text{ GL }(n_2,\mathbb {C})\)—that instantiate the same homomorphic copy of G. And similarly, we can vary the ground-field, and yet instantiate the same homomorphic copy of G. And similarly, we can consider non-linear/non-matrix realizations/representations of G.
And as in option (A), there is the further choice—again because the fact that matrices realize group multiplication is independent of their acting as linear operators on the vector space \(\mathbb {C}^n\). That is: if we wish, we can adjoin to a model root—an entire equivalence class of representations—the choice of some specific basis \(\mathbf{e}_k\) (\(k=1,\ldots ,n\)) of \(\mathbb {C}^n\), which we can then take to be specific structure. Of course, the only natural way to do this is to attach the basis to some arbitrary element of the class, i.e. one set of matrices \(\{ M_i \}\), and then transport the basis around to the other elements of the equivalence class by the action of \(\text{ U }(n)\).
Galilean transformations
Newtonian mechanics provides a simple illustration of the notions of model root, specific structure, and indeed duality. And since it is an example from physics rather than pure mathematics, we also get an illustration of Sect. 2.2’s distinction between internal and external interpretations.
The idea is as follows. The bare theory T is Newtonian mechanics, of say N gravitating point-particles, set in a Galilean (neo-Newtonian) spacetime: i.e. in a spacetime manifold that is ‘globally like \(\mathbb {R}^4\)’, with Euclidean geometry in its instantaneous time-slices, and a flat 4-dimensional connection, but no preferred absolute rest. This bare theory is modelled (in our sense: i.e. realized, represented) by formulations of Newtonian mechanics of N gravitating particles, set in a Newtonian spacetime, i.e. in a spacetime that is ‘globally like \(\mathbb {R}^4\)’ but that does have an absolute rest.
Famously (notoriously!), the difference in such formulations’ specifications of absolute rest is not experimentally detectable, since specifications that are each boosted with respect to the other specify the same flat 4-dimensional connection, and a boost maps a solution of the equations of motion to another solution. Or as it is usually put, in more physical terms: no experiment in Newtonian mechanics can distinguish one specification of rest from the others, because the theory is invariant under boosts (‘has boosts as a symmetry’). Hence, of course, the debate between Newton and Leibniz, as articulated in the Leibniz-Clarke correspondence, and with its long legacy down to the present day (e.g. Earman 1989).
So in this example, it is natural to say that the specific structure of each model includes its specification of absolute rest. Using this specification, the model defines a flat 4-dimensional connection—viz. the same connection as is defined by the other models—and thereby builds a homomorphic copy of T.
We can make this example simpler and precise, and yet still a worthwhile illustration, by taking the bare theory T to be just the abstract Galilean group Gal(3). This is a 10-dimensional Lie group, whose generators are three spatial rotations, four (space and time) translations, and three boosts. That is: its generators are usually thus described, by way of justifying their commutation relations. But of course the abstract group can be defined by the commutation relations, free of a physical interpretation. Thinking of Gal(3) like this will yield a clear analogy with Sect. 2.3.1’s matrix representations of an abstract group G, and with that Section’s option (A), i.e. Model root as representation.
Gal(3) is usually presented in its fundamental representation. Namely, as a concrete group of transformations on (bijections of) \(\mathbb {R}^4\), written in terms of the standard coordinates \((\mathbf{x},t) \in \mathbb {R}^4\), with \(g \in \) Gal(3) represented as the function
$$\begin{aligned} g(\mathbf{x},t):=(R\cdot \mathbf{x}+\mathbf{v}_0\,t+\mathbf{r}_0,t+t_0)~, \end{aligned}$$
(2)
where R is a \(3\times 3\) spatial rotation matrix, \(\mathbf{v}_0\) is the velocity boost, \(\mathbf{r}_0\) is the spatial translation vector, and \(t_0\) is the time translation. This fundamental representation can also be expressed in a coordinate-free way as an action on the affine space of \(\mathbb {R}^4\), i.e. on Euclidean 4-space. But we will not need the details of affine spaces (cf. e.g. Auslander and MacKenzie 1963, Chapter 1).
But just as in Sect. 2.3.1’s option (A), where a model root was a matrix representation of a group G, we could adjoin a choice of a basis \(\mathbf{e}_k\) (\(k=1,\ldots ,n\)) of \(\mathbb {C}^n\) as specific structure: so also here, we can adjoin a choice of an inertial coordinate system as specific structure, and we can take this to give the model’s specification of absolute rest.Footnote 15 The standard coordinate system defined by the components of \(\mathbb {R}^4\) itself is then just one choice among many, determining one specification of absolute rest among many. Natural though we find it for writing down the fundamental action of Gal(3), as we did in Eq. (2), the action can of course be written down in any inertial coordinate system. And any such system can be taken to give a model’s specification of absolute rest.
And just as in (A) of Sect. 2.3.1, each matrix \(M_i\) mapped the adjoined basis \(\mathbf{e}_k\) to another basis: so also here, each Galilean boost maps an adjoined choice of absolute rest, represented mathematically by an inertial coordinate system, into another such choice, i.e. another inertial coordinate system.Footnote 16
But there is also, so far, a disanalogy with (A) of Sect. 2.3.1. For so far, we have just one model root, encapsulated in Eq. (2), and various choices of specific structure; whereas (A) of Sect. 2.3.1 had many different model roots—many different matrix representations \(\{ M_i \}\) of the abstract group G. Correlatively, our fundamental representation of Gal(3) is faithful, i.e. has trivial kernel; while Sect. 2.3.1’s \(\{ M_i \}\) were in general not faithful.
But there are other representations of Gal(3). Indeed, there is a matrix representation that is isomorphic, as a model root, to what we have. So it is faithful—and the isomorphism of model roots is, on our account, a duality. Namely, we use \(5\times 5\) matrices \(\text{ G }\in \text{ GL }(5,\mathbb {R})\) (\(\text{ G }\) for ‘Galileo’ not ‘group’!), that act on vectors \((x_A):=(\mathbf{x},t,1)\in \mathbb {R}^4\times \{1\}\subset \mathbb {R}^5\), as follows (cf. Bargmann 1954: pp. 38–41 and Holm 2011: p 10):
$$\begin{aligned} \tilde{x}_A= & {} \sum _{B=1}^5\text{ G }_{AB}\,x_B\nonumber \\ (\text{ G }_{AB})_{{A,B}=1}^5:= & {} \left( \begin{array}{ccc}R&{}\mathbf{v_0}&{}\mathbf{r}_0\\ \mathbf{0}_{1\times 3}&{}1&{}t_0\\ \mathbf{0}_{1\times 3}&{}0&{}1\end{array}\right) . \end{aligned}$$
(3)
With this representation, a specification of absolute rest is given by a specific choice of vector \(x\in \mathbb {R}^4\times \{1\}\). Namely, \((x_A):=(\mathbf{x},t,1)\in \mathbb {R}^4\times \{1\}\) specifies rest to be the timelike congruence of inertial worldlines parallel to the inertial worldline passing through both the spatiotemporal origin \((\mathbf{0}, 0)\) and the point \((\mathbf{x},t)\). So here, it is a single vector that gets adjoined as specific structure; (not a whole basis, as in Sect. 2.3.1, and not a whole inertial coordinate system, as above). And each Galilean boost maps an adjoined choice of absolute rest, given by a vector \(x\in \mathbb {R}^4\times \{1\}\), into another such choice, i.e. another vector.
So much by way of how Galilean transformations’ illustration of the notions of model root and specific structure is analogous with Sect. 2.3.1’s matrices. We end with two comments, [1] and [2], about the physical interpretation of this example (comments which thereby have no analogues about those matrices). For simplicity and brevity, we will restrict both comments to the simple “vacuum” scenario which we have concentrated on: i.e. \(\mathbb {R}^4\) as a description of either neo-Newtonian or Newtonian spacetime, without regard to the N gravitating point-particles we mentioned at the start of this Section. Thus recall that we concentrated on taking the bare theory T to be just the abstract Galilean group Gal(3), and considered its action on \(\mathbb {R}^4\). But this is only for simplicity: these two comments carry over to the non-vacuum scenario, where there are particles.Footnote 17
[1] Agreement with usual verdicts The first comment looks ahead to Sect. 4.4’s discussion of symmetries in a spacetime theory, i.e. a theory that postulates a spacetime with certain chrono-geometric structures like metrics and connection. (Such theories of course also postulate matter and radiation, particles and fields, in the spacetime; but as just announced, we are setting that aside.)
We will see in Sect. 4.4 that in a spacetime theory, a symmetry is usually defined as a map on the spacetime that (once its domain of definition is extended in the natural way to include chrono-geometric structures and matter fields): (i) fixes, i.e. does not alter, the chrono-geometric structures, and (ii) maps a matter-field solution of the equations of motion to another solution. (We will also see how this relates to the more basic and general notion of symmetry we will use from the start of Sect. 4.) Accordingly, boosts are a symmetry of neo-Newtonian spacetime: for a boost preserves the chrono-geometric structures, i.e. the spatial and temporal metrics and the flat affine connection, and maps solutions to solutions. But boosts are not a symmetry of Newtonian spacetime (i.e. a spacetime that is globably like \(\mathbb {R}^4\), but that has a specification of absolute rest). For a boost does not fix a specification of rest.Footnote 18 These points are, in effect, the modern mathematical expression of the famous (notorious!) point with which this Section began: that no mechanical experiment can discern which is the putatively correct standard of rest.
Our discussion above, and our notions of model root, duality etc., accords with this. In particular, just as in Sect. 2.3.1: an item of specific structure can be “in” the model root, which is, after all, built with specific structure; and so an isomorphism of model roots (on our account, a duality) can map specific structure from one model to another. And this is what Galilean boosts do. In our jargon: they define an isomorphism of model roots that maps one model’s specification of absolute rest into another’s. Think for example of how the vector \((x_A):=(\mathbf{x},t,1)\in \mathbb {R}^4\times \{1\}\)—which specifies rest as parallelism to the inertial worldline through both the origin \((\mathbf{0}, 0)\) and the point \((\mathbf{x},t)\)—is mapped by \(\text{ G }_{AB}\) of Eq. (3) to a vector specifying rest as parallelism to the inertial worldline through both the origin \((\mathbf{0}, 0)\) and the point \((R\cdot \mathbf{x}+\mathbf{v}_0\,t+\mathbf{r}_0, t+t_0)\). [For more detail about how the duality maps in this example are defined by boosts, cf. Butterfield (2018: Sect. 4.1).]
But mapping one model’s specification of absolute rest into another’s is not the same as fixing the given specification, i.e. not the same as a symmetry in spacetime theories’ usual sense. Thus this example illustrates how a bare theory can have a symmetry, viz. boosts, that (some or even all) its models lack. This will later be a main theme (Sects. 5.1 and 5.2).
This example also has a philosophical moral that is not about symmetry. Namely: duality does not imply physical equivalence. Two theories can be duals—in our jargon: models with isomorphic model roots—without their making the very same claims about the world. They can even contradict one another about the world, as do two rival specifications of what is absolute rest. This leads in to the next comment.
[2] Internal and external interpretations The example also illustrates Sect. 2.2’s distinction between internal and external interpretations. As usual, interpretative issues are underdetermined by formal theory: witness the moral just stated at the end of [1]. However, the example clearly allows us to formulate the disagreement between Newton (Clarke) and Leibniz—the question whether absolute rest is physically real—in terms of the internal versus external contrast.
For recall that an internal interpretation interprets only the model root, but not the specific structure. More precisely, we define this as meaning that specific structure which is “in” a model root as a building block, does not get interpreted. Therefore models that are isomorphic, i.e. have isomorphic roots, and so differ only in their specific structure, must receive the same interpretation.Footnote 19 Thus in our example: an internal interpretation of a model simply does not interpret the specification of rest (whether it is given by an inertial coordinate system considered as rest, as for Eq. (2), or by a vector \(x\in \mathbb {R}^4\times \{1\}\), as for Eq. (3)). In short: the specification of rest is not part of what is physical. Thus an internal interpretation articulates Leibniz’ relationist views.
On the other hand, an external interpretation (by definition) does interpret (some or all of) the specific structure. Thus an external interpretation can take any two dual models—any isomorphic model roots with different specifications of rest, x and \(\tilde{x}\) say—to have distinct interpretations. This kind of external interpretation thus articulates the Newton-Clarke view: in short, that giving all material bodies the same boost makes a physical difference.
Values of quantities on states
Before formally defining duality as isomorphism, we need notation for treating states, quantities and dynamics. Suppose we are given a set of states \(\mathcal{S}\), a set of quantities \(\mathcal{Q}\) and a dynamics \(\mathcal{D}\): \(\langle \mathcal{{S}}, \mathcal{{Q}}, \mathcal{{D}} \rangle \). (As stressed in Sect. 1: in any specific case, \(\mathcal{S}\), \(\mathcal{Q}\) and \(\mathcal{D}\) will each have a lot of structure beyond being sets—but we will not need these details in what follows. And as admitted in Sect. 2.1.2: not all theories are presented, or best thought of, as such triples—but what we say will carry over to other formulations using e.g. partition functions; cf. footnote 6.)
Then we write the value of quantity Q in state s as
$$\begin{aligned} \langle Q , s \rangle \end{aligned}$$
(4)
It is these values that are to be preserved, or suitably matched, by the duality, i.e. by the isomorphism of model triples: cf. Eq. (9) below. And in subsequent Sections’ discussion of symmetry, it is these values that are to be preserved by a symmetry map.
For classical physics, one naturally represents (that word again!) a quantity as a real-valued function on states: \(Q: s \mapsto Q(s)\). Given such a function representing the quantity, \(\langle Q , s \rangle := Q(s) \in \mathbb {R}\) is the system’s possessed or intrinsic value, in state s, of the quantity Q. Similarly for quantum physics: one naturally represents quantities as linear operators on a Hilbert space of states, so that \(\langle Q , s \rangle := \langle s |{{\hat{Q}}} | s \rangle \in \mathbb {R}\) is the system’s Born-rule expectation value of the quantity. (But in fact, for quantum dualities, the duality often preserves off-diagonal matrix elements \(\langle s_1 |{{\hat{Q}}} | s_2 \rangle \in \mathbb {C}\): cf. below.)
As to dynamics, here assumed deterministic: This can be usually presented in two ways, for which we adopt the quantum terminology, viz. the ‘Schrödinger’ and ‘Heisenberg’ pictures; (though the ideas occur equally in classical physics). We will adopt for simplicity the Schrödinger picture. So we say: \(d_\mathcal{S}\) is an action of the real line \(\mathbb {R}\) representing time on \(\mathcal{S}\).
There is an equivalent Heisenberg picture of dynamics with \(D_H\), an action of \(\mathbb {R}\) representing time on \(\mathcal{Q}\). The pictures are related by, in an obvious notation:
$$\begin{aligned}&d_\mathcal{S}: \mathbb {R} \times {\mathcal{S}} \ni (t,s) \mapsto d_\mathcal{S}(t,s) =: s(t) \in {\mathcal{S}} \; {\text{ iff }} \; D_H\nonumber \\&\quad : \mathbb {R} \times {\mathcal{Q}} \ni (t,Q) \mapsto D_H(t,Q) =: Q(t) \in {\mathcal{Q}} \end{aligned}$$
(5)
where for all \(s \in {\mathcal{S}}\) considered as the initial state, and all quantities \(Q \in {\mathcal{Q}}\), the values of physical quantities at the later time t agree in the two pictures:
$$\begin{aligned} \langle Q, s(t) \rangle = \langle Q(t), s \rangle \; . \end{aligned}$$
(6)