I first present, in Sect. 2.1, the schema for theories and duality: from De Haro (2016: §1.1–§1.2) and De Haro and Butterfield (2017: §2.2–§2.3). Section 2.2 contains the proposal for theoretical equivalence, based on this schema. Section 2.3 contrasts the notions of theoretical equivalence and physical equivalence.
The schema for theories and duality
In this section, I introduce the schema for duality. I first introduce, in Sect. 2.1.1, the notions of theories and models. In Sect. 2.1.2 I discuss interpretation. I then introduce, in Sect. 2.1.3, duality and the notion of physical equivalence. Finally, in Sect. 2.1.4, I say how the notion of isomorphism in the schema differs from what in Sect. 1 I called ‘naïve’ isomorphism accounts.
Theories and models
Recall that dualities reveal that what we thought were distinct theories are representations of a single theory: so that our talk of ‘theories’ and ‘models’ is pushed “one level up”. Thus, in this section, I discuss the notions of theory, model, and interpretation, as construed by the schema.
The core notion of the schema is that of a bare theory: a physically uninterpreted, but mathematically formulated, structure with a set of rules for forming sentences, i.e. an abstract calculus (we will refer to these rules as the ‘language’ of the theory: see below).Footnote 6 A bare theory could consist of a set of axioms or a set of equations. But, to be specific, the schema considers a bare theory as a triple, \(T:=\langle \mathcal{S},\mathcal{Q},\mathcal{D}\rangle \), of a structured state-space, \(\mathcal{S}\), a structured set of quantities, \(\mathcal{Q}\), and a dynamics, \(\mathcal{D}\), consistent with the relevant structure. ‘Structure’ here refers: first, to symmetries which may act on the states and-or the quantities, e.g. as automorphisms of the state-space, \(a:\mathcal{S}\rightarrow \mathcal{S}\). And second, ‘structure’ also refers to the set of rules for forming sentences, e.g. for assigning values to the quantities.Footnote 7
As an example, consider Maxwell’s electromagnetic theory in vacuum, on \(\mathbb {R}^4\). This theory can be written as a triple, of which the state-space, \(\mathcal{S}\), is the space of square-integrable, smooth 2-forms F (or “Faraday tensors”). A salient quantity in this theory (of course not the only one: for more details, see Sect. 3) is the “stress-energy tensor”,Footnote 8t[F]: namely, a rank-2 smooth, symmetric, conserved tensor, quadratic in F: cf. Eq. (7). (I use the lowercase t here because T already stands for ‘theory’. The tensor components of t are usually written as \(T_{\mu \nu }\)). We can think of quantities \(\mathcal{Q}\) as maps from the state-space to a field of real or complex numbers (so that \(\mathcal{Q}\) is the dual space—in the mathematical, not the schema’s sense!—of \(\mathcal{S}\)): in our example, the stress-energy tensor is a functional \(t:\mathcal{S}\rightarrow \mathbb {R}\), hence the notation t[F]. Finally, the dynamics, \(\mathcal{D}\), is summarised by a set of two equations: namely, the 2-form F is both closed and co-closed, i.e. \(\hbox {d}F=\hbox {d}*F=0\) (hence it is harmonic). The symmetries of this theory are Poincaré transformations, which map F and t to themselves (of course, their components transform covariantly).
This example prompts the following remark about theories thus defined: notice that the state-space of Maxwell’s theory could equally well be presented in terms of the electric and magnetic fields, rather than the Faraday tensor. Thus there is no claim here that there is always a unique best definition of a theory, and choices are required about what to take as the state-space, and even judgment about interpretative matters. We will see another example of this in Sect. 3.1.
Recall that, as I announced at the start of Sect. 2.1.1, a model is in this paper not a particular solution. Rather, a model M of a bare theory, T, is a realization, or mathematical instantiation: i.e. it is a mathematical entity having the same structure as the theory, and usually some specific structure of its own. More specifically, I will use a notion of model as a representation (in the mathematical sense, not the philosophical one!) of the theory, i.e. a homomorphism from the theory to some other known structure—think, for example, how an abstract group can be represented by a set of \(n\times n\) matrices, where the rank, n, is specific structure that goes beyond the definition of (the homomorphic copy of) the group.
But a model must not be a merely mathematical representation, for I need to make the following physical distinction within the homomorphism. A model of a physical theory naturally suggests the notions of:
A model root: the realization of the theory, usually its homomorphic copy (though, in some cases, an isomorphic copy).
The specific structure: that structure which goes into building the model root, which is not part of the theory’s defined structure, and which gives the model its specificity. Specific structure may well be physically significant, depending on the context of application of the theory: and part of it is normally used for calculations within the model—but calculations can of course be done in different ways, using different specific structure. For examples, see: Sect. 3.2.3, De Haro (2016: §2.1), and De Haro and Butterfield (2017: §5.2).
It is helpful to have a schematic notation for models that exhibits how a model augments the homomorph of the structure of a bare theory, T, with its own specific structure:
$$\begin{aligned} M=\langle m;\bar{M}\rangle . \end{aligned}$$
(1)
Here m is the model root, and \(\bar{M}\) is the specific structure which goes into building m. In cases where T is a triple, m must itself be a triple with properties that are homomorphic to those of T. When the model root, m, is itself a triple, then we call it the model triple.Footnote 9
Like bare theories, models (and model triples) are, at this stage, uninterpreted; and an interpretation can again be added as a set of partial maps (see Sect. 2.1.2).
We will sometimes need a notation for the model as a homomorphism, from the theory to the structure m that does the (mathematical!) representing. I will denote it as follows:
$$\begin{aligned} h_m:T\rightarrow m, \end{aligned}$$
(2)
where m labels the representation. As usual, there is a triple of such maps, one for each item of the theory. For examples of models as representations in Maxwell’s theory, see Sect. 3.2.
Interpretation
I will adopt a mainstream position about interpretation: namely, classical referential semantics. Thus we will assign references in the empirical world to the elements of a theory—viz. to the states and the quantities, and to the formulas built from them—and likewise for models.Footnote 10 One advantage of the framework of referential semantics is that both realists and constructive empiricists can agree about the interpretation of a theory or model, in other words about its basic ontology (‘the picture of the world drawn by the theory’, to use van Fraassen’s (1980: pp. 14, 43, 57) words), even though they have different degrees of belief in the entities that the ontology of the theory postulates—and indeed they will have a different conception of what such a map entails. We will see illustrations of this point below, and in Sect. 2.3.1.
Thus I will model referential semantics by interpretation map(s), as follows:
An interpretation is a set of partial maps, preserving appropriate structure, from the theory to the world.Footnote 11 The interpretation fixes the reference of the terms in the theory. More precisely, an intepretation maps the theory, T, to a domain of application, D, in the world, i.e. it maps \(i:T\rightarrow D\). If the theory is a triple of states, quantities, and dynamics, viz. \(T=\langle \mathcal{S},\mathcal{Q},\mathcal{D}\rangle \), then the interpretation maps are also a triple, one for each item in the triple: however, we will not often need to make this explicit. Using different interpretation maps, the same theory can describe different domains of the world, and even different possible worlds. For more details, also about the kinds of maps required, see De Haro (2016: §1.1.2).
Likewise for models, an interpretation is a partial map, \(i:M\rightarrow D\), preserving appropriate structure, from the model to the world.
Notice that the choice of ‘the part of the structure that is common to all of the theory’s models’ goes into the definition of a model, and singles out the core physics that is described by the theory, because it is represented in all its models: namely, it is in the distinction between the model root and the specific structure from the previous section: and this choice constrains the kinds of systems the model is able to describe. The way to determine the relevant structure is interpretative not formal. For example, only experiments can tell us that massless vector fields correctly describe photons. Once that question is experimentally settled, one can take a model root including a massless vector field to describe a photon. But once we have that model root with its massless vector field, we can strip it of its photon interpretation and use the same model root to describe whichever other particle exhibits degrees of freedom of the same kind.
Interpretations thus defined are very general. To specify them further, we endorsed, in De Haro and Butterfield (2018: Section 2.3), De Haro (2016: §1.1.2) and De Haro and De Regt (2018: p. 636), a more specific framework, viz. intensional semantics [cf. Lewis (1970), Carnap (1947: pp. 177–182; 1963: pp. 889–908)]. In this framework, the notion of ‘linguistic meaning’ (for us, in the context of scientific theories: an ‘interpretation’) is taken to be ambiguous between what Frege (1892) called ‘sense’ and what he called ‘reference’, here called ‘intension’ and ‘extension’ respectively. The intension is the linguistic meaning of a term, while an extension is the worldly reference of the term, relative to a single possible world (with all of its contingent details). Thus Lewis (1970: pp. 22–27) defines intensions as maps from n-tuples of sequences of items—he calls such a sequence an ‘index’—to extensions, e.g. the truth-values of sentences. And, as Lewis remarks, the framework also applies if the indices are construed as models consisting of states representing possible worlds (ibid, p. 23): see also Carnap (1947: pp. 177–182; 1963: pp. 889–908). Here, I will discuss a variation of this framework that involves: (a) mapping from scientific theories and models (usually presented as triples, so that there are three such maps) rather than from sequences of general linguistic items; (b) a simplification: namely, modelling both intensions and extensions by interpretation maps, rather than defining an extension as the (set of) objects/truth value(s) and an intension as a map to extensions.
Thus both intensions and extensions are structure-preserving partial maps from a bare theory or model to a domain of application relative to a possible world. The difference between the intension and the extension is in the kind of domain of application: explicitly, for states: an intension maps a state to a generic property (or physical arrangement) of a system mirroring the defining properties of the mapped state (and likewise for quantities and dynamics). Thus the image of the state and the domain of application abstract from contingencies such as the detailed arrangement of the system and how the system is measured (so that the interpretation applies to all possible worlds that are described by the theory or model). By contrast, in the case of an extension, the image and domain of application are a fully concrete physical system: usually including also a specific context of experiment or description, and all the contingent details that are involved in applying a scientific theory to a concrete system [for more details, see De Haro and Butterfield (2018: §2.3) and De Haro and De Regt (2018: §1.1)].
Let me illustrate this notion of interpretation in the simple case of Maxwell’s theory in vacuum, and on \(\mathbb {R}^4\), mentioned in Sect. 2.1.1: where the state-space \(\mathcal{S}\) is a suitable set of 2-forms F, the quantities \(\mathcal{Q}\) contain a distinguished quantity, the stress-energy tensor, with components \(T_{\mu \nu }[F]\), and the dynamics is the condition that the 2-forms F are both closed and co-closed (more details in Sect. 3). Under the standard electromagnetic interpretation, the Faraday tensor F is the coordinate-free presentation of a tensor whose components are interpreted as electric and magnetic fields, so that the intensional interpretation map is:
$$\begin{aligned} i(F)= & {} \hbox {`a coordinate-free specification of an electric and magnetic configuration}\nonumber \\&~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~\hbox {in vacuum'}~. \end{aligned}$$
(3)
The key interpretative parts are here ‘coordinate-free specification’ and ‘electric and magnetic configurations’, which we (uncontroversially) assume already have established meanings, for example as correlating with certain experimental procedures that we use to measure the fields, or as corresponding to certain properties of fields that we are familiar with in our world (such as the polarisation of light waves, and how they interact with other entities).
This meaning has come about through centuries of experimentation, instrumentation, and theorising about electro-magnetic fields. Different historical epochs may of course have different ways of manipulating and measuring these fields, but referential semantics assumes such reference to be clear, the more because the theory is sufficiently well established, as Maxwell’s electromagnetic theory indeed is—and this reference is construed as saying that there are such things as electric and magnetic fields in the world (if one is a realist) or that there are indeed appearances of phenomena of electric and magnetic fields (if one is an empiricist). Likewise for the phrase ‘coordinate-free specification’: it summarises the properties of the fields under changes of the frame of reference, i.e. properties like ‘the electric field is augmented in the directions perpendicular to the motion by an amount given by the Lorentz factor’. Such properties again uncontroversially correlate with particular phenomena in the world.Footnote 12
Notice the following two properties of the interpretation map Eq. (3). First, recall that I defined interpretation maps \(i:M\rightarrow D\) as appropriately structure-preserving. Therefore the phrase ‘coordinate-free specification’ in Eq. (3) implements this appropriately structure-preserving character of the Poincaré symmetries. Namely, Poincaré symmetries, which leave the 2-tensor F invariant and transform its components according to the standard Poincaré transformations, have a “shadow” in the domain of the world, D: this shadow contains the ordinary effects, of Lorentz expansions and contractions of the fields, that I referred to in the previous paragraph. Thus the interpretation map Eq. (3) correctly preserves symmetries. Second, the interpretation Eq. (3) is an intension: for it is valid at any possible world that instantiates electric and magnetic fields whose corresponding Faraday tensor satisfies the definitions of our states (viz. a square-integrable, smooth 2-form). In other words, it does not depend on a specification of F—as being, for example, a collection of travelling waves with certain polarizations. Such “generic” interpretations hold at every possible world that is described by the theory. In order to get an extension, we should specify the particular 2-form F that we are mapping. This would entail giving the details of its functional form: which then corresponds, under the interpretation map, to further specifications in a particular world, like ‘a wave travelling in such and such direction, with polarizations at such and such angles’. Since intensions are less detailed and simpler to write down in words, in the rest of the paper our examples will be mainly intensions. But, as I mentioned above, this way of modelling intensional semantics accommodates for boths kinds of interpretations.
There is also an interpretation map for quantities, which for example maps the 00-component of the stress-energy tensor:
$$\begin{aligned} i(T_{00})= & {} \hbox {`the electromagnetic energy density of the system'}. \end{aligned}$$
(4)
Again, this map is an intension—it holds at all possible worlds at which Maxwell’s theory applies, and in which a frame of reference has been specified, relative to which the 00-components of the tensor are taken. To specify the extension of \(T_{00}\), we should provide additional details such as where the electromagnetic energy is produced from, how it interacts with its environment and is measured, etc. We will return to the example of Maxwell’s electromagnetic theory in Sect. 3.
So much by way of introducing referential semantics and intensional semantics. The idea of theoretical equivalence will be, roughly, that two models are isomorphic relative to their formal structure—as in their model roots, m—and their interpretation. Thus I will now define an interpretation that depends only on the model root, and not the specific structure:
Internal interpretation: an interpretation that maps all of and only the model root, regardless of the specific structure of the model. Since internal interpretations map the model roots (and they may map specific structure only in so far as it appears in the model root), it will often be clearer to restrict the internal interpretation map of models to the model root, and write: \(i:m\rightarrow D\). Internal interpretations obviously also apply to bare theories, T, which do not have any specific structure.
I shall contrast this with other interpretations that also map the specific structure (or that involve coupling to other theories): which I will dub external interpretations [for a fuller exposition, see De Haro (2016: §1.1.2)]. For the reason given above, in the rest of this paper the relevant interpretations will be only the internal interpretations.Footnote 13
I use ‘internal interpretation’ in the singular here because we will normally consider a single interpretation. However, a given model can have multiple internal interpretations. For example, take a model whose dynamics is the heat equation for a real, non-negative function over space and time that we denote by: \(\rho :\mathbb {R}^4\rightarrow \mathbb {R}_{\ge 0}\), and is given by: \((t,\mathbf{x})\mapsto \rho \,(t,\mathbf{x})\). Take the state-space of the model to be the space of configurations of the function \(\rho (t,\mathbf{x})\) (and let the quantities of the model also be constructed from \(\rho \), its powers, and its derivatives and integrals over space and time—so that we now have a triple of states, quantities, and dynamics). This model can be interpreted as describing the spread of heat in time in a given region of space, or as a diffusion equation for Brownian particles in a medium. With these definitions, there is here a single model (in fact, a theory), with no specific structure. Both interpretations are internal, and yet they differ.Footnote 14
I also assume that the model root is rich enough that an internal interpretation allows the specification of a set of possible worlds that instantiate the solutions of the equations of the model (or, at least, the interpretation specifies suitable domains of application within such worlds: see the discussion in Sect. 2.3.3, especially footnote 40). Nevertheless, for simplicity, I will talk about ‘the possible world specified by the model’ rather than about the set of such worlds.
Duality and physical equivalence
In this section, I give brief construals of the notions of duality and of physical equivalence according to our proposed schema for duality (and I shall elaborate on physical equivalence in Sect. 2.3 after I define, in Sect. 2.2, weak theoretical equivalence).
Indeed, having discussed theories and models, we can now easily state the schema’s conception of duality: A duality is an isomorphism between model roots of a single bare theory, where the model roots are taken to be representations (in the mathematical—not semantic!—sense) of the theory. We denote this isomorphism of model roots, \(m_1\cong m_2\), by the map \(d:m_1\rightarrow m_2\).
Theories may have many representations: representations that are isomorphic to the original theory and representations that are not isomorphic to the original theory. But if we have two (or more) representations that are isomorphic to each other (whether they are isomorphic to the original theory or not), we have a duality.
The framework offered so far is mostly formal. Thus it is not surprising that the schema contributes to furthering understanding of formal equivalence—what I will call weak theoretical equivalence. This will be the topic of Sect. 2.2.1. But first, by way of introduction, let me say a few words about physical equivalence, i.e. the specific proposal for theoretical equivalence suggested by recent developments in physics—a fuller exposition is in Sect. 2.3.Footnote 15 Physical equivalence can be defined as sameness of interpretations, i.e. two models are physically equivalent iff they are weakly theoretical equivalent and, in addition, they have the same interpretation (not in the sense of having the same interpretation maps, but in the sense that their interpretation maps have the same codomain, or target, and appropriately matching values in that codomain). Thus physical equivalence, for dual models, is a meshing condition between duality and intepretation: namely, in the notation of Fig. 1, \(\hbox {ran}\,(i_1)=\hbox {ran}\left( i_2\circ d\right) \). Duality and interpretation then commute, in the sense that they form a triangular commuting diagram.
Against naïve appeals to isomorphism
In this section, I elaborate on the notion of isomorphism employed by the schema, and explain how it differs from naïve notions of isomorphism.
Whether a map is an isomorphism of course depends on the structure that the map is supposed to preserve. In the present case, of isomorphism between model roots, it is the structure of the model roots—and so of the bare theory of which each root is a representation—that is preserved. And when the model roots are triples of state-space, quantities and dynamics, then each of these three items—with their corresponding structure—is to be preserved.
This contrasts with a vague, and-or unqualified, notion of isomorphism. The isomorphism criterion of models is usually associated with the semantic view: see, for example, Suppe (2001: p. 526) and Halvorson (2012: p. 187). For example, Halvorson (2012: p. 188) has summarily criticised the isomorphism account of theoretical equivalence on the grounds that it would render Heisenberg’s matrix mechanics and Schrödinger’s wave mechanics theoretically inequivalent, for: ‘a matrix algebra is obviously not isomorphic to a space of wave functions; hence, a simpleminded isomorphism criterion would entail that these theories are inequivalent.’
The first point to make is that Halvorson slips in his formulation of the claim he is targeting. For of course the matrix algebra represents the quantities, while the wave-functions form a space of states—and in that sense I agree that they should not be isomorphic. But, correcting this slip: I of course agree with Halvorson that the representation space of a matrix algebra is in general not isomorphic to a space of wave-functions, and so the isomorphism criterion does not work in this case. But this is only a naïve notion of isomorphism, which attempts to map the state-spaces directly onto each other, so that discrete matrix indices are directly mapped to the state-space of continuous functions (von Neumann (1955: p. 22)). Such an isomorphism requires the introduction of what von Neumann called Dirac’s ‘mathematical “fiction”’, namely the Dirac delta distribution: ‘this cannot be achieved without some violence to the formalism and to mathematics’ (ibid, p. 28).Footnote 16 Setting aside the question whether Dirac’s programme can perhaps be carried out using the theory of distributions: one mathematically correct procedure is to seek an isomorphism, not of configuration spaces, but of functions on these two spaces, i.e. between sequences which are functions on the discrete configuration space, \(l^2(\infty )\), and wave functions on the continuous configuration space. It is this weaker isomorphism that quantum mechanics requires: ‘These functions... are the entities which enter most essentially into the problems of quantum mechanics’ (ibid, p. 28). And those two spaces are isomorphic, as a theorem of Fischer and Riesz secures [cf. e.g. von Neumann (1955: p. 29)]. Indeed, all complex infinite-dimensional separable Hilbert spaces are isomorphic to \(l^2(\infty )\) [e.g. Prugovečki (1981: p. 41)].
It seems likely that the claims, now well-rehearsed, in the literature [see North (2009: p. 84–88), Curiel (2014: pp. 295–304), Barrett (2018: §1)], that Lagrangian and Hamiltonian mechanics are inequivalent according to the isomorphism criterion, can be dealt with in a similar way, i.e. that a more sophisticated notion of isomorphism does render them equivalent, at least in an important class of salient cases. The leading idea of any such isomorphism must of course be the Legendre transformation, which, as textbooks teach us, carries one back and forth between the Lagrangian and Hamiltonian frameworks. For example, Barrett (2018: §4) focusses on the case where the Legendre transform is a diffeomorphism (the Lagrangian then being called ‘hyperregular’). Analogously to the case of matrix vs. wave mechanics, the criterion would obviously not be a direct isomorphism of tangent and cotangent bundles, which are not invariably isomorphic.
Similarly, Lutz (2017: pp. 335–336) proposes to block a charge against the naïve isomorphism criterionFootnote 17 by considering a more sophisticated criterion of isomorphism: namely, he considers indexed structures and their associated isomorphisms.
Without going into further details here: the target of the criticisms of the isomorphism criterion for theoretical equivalence often seems to be a naïve, i.e. vague and-or unqualified, appeal to isomorphism, which either does not specify the structure to be preserved, or identifies the wrong structure. On a more sophisticated treatment of the structures involved—which often means: a more restrictive definition of the relevant structure to be preserved—cases of theoretical equivalence in physics do often seem to be cases of isomorphism.
Therefore, I find most of the reasons in the recent philosophical literature, for abandoning the isomorphism criterion, to be wanting. The isomorphism criterion can be upheld provided we are more careful about the structures that we claim are isomorphic. It is in this spirit that I will propose an isomorphism criterion in the next section.
The schema’s contribution to a conception of theoretical equivalence
This section and the next contain the paper’s main proposal for theoretical equivalence, based on the schema. In Sect. 2.2.1, I give some background on the notion of theoretical equivalence in recent philosophy of science discussions, in terms of two conditions. Section 2.2.2 then goes on to fill in the first condition according to the schema, and Sect. 2.2.3 fills in the second condition.
The general notion of theoretical equivalence
In this section, I discuss the general idea of theoretical equivalence in recent philosophy of science. Theoretical equivalence is, in fact, something of a term of art, without a fixed meaning. It usually combines two conditions:
(A) some sort of “formal or mathematical requirement”; and
(B) some sort of “interpretative requirement”.
Individual authors may of course stress one aspect over the other, or even altogether reject one of the two conditions.Footnote 18
Two influential proposals for theoretical equivalence are due to Quine and Glymour. On Quine’s (1975: p. 320) proposal, two formulations count as formulations of the same theory if, besides being empirically equivalent, the two formulations can be rendered identical by a reconstrual of the predicates in one of them (for example, by switching some of the predicates). And Glymour (1970: p. 279) requires, besides empirical equivalence, inter-translatability as a necessary condition for theoretical equivalence.Footnote 19 Thus Quine and Glymour agree that empirical equivalence is a necessary condition for theoretical equivalence, but construe the formal requirement (reconstrual of predicates vs. inter-translatability) differently. And both do seem to have in mind a notion of theoretical equivalence in which both theories in some sense “say the same thing about the world”, i.e. a notion that is also interpretative.Footnote 20
About ‘empirical equivalence’: there are two main construals of the requirement of this criterion, one of which is syntactic, and the other semantic. Quine (1970: p. 179, 1975: p. 319) saysFootnote 21 that two theories are empirically equivalent if they imply the same observational sentences, also called ‘observational conditionals’, for all possible observations—present, past, and future. Van Fraassen (1980: p. 64) says that two theories, T and \(T'\) are empirically equivalent if for every model (not the schema’s model!), M, of T there is a model, \(M'\), of \(T'\) such that all of M’s empirical substructures are isomorphic to empirical substructures of \(M'\), and vice-versa.Footnote 22 The criterion of theoretical equivalence that I will give in Sect. 2.3 entails empirical equivalence as a special case [for a detailed discussion, see De Haro (2019b)].
The recent discussion of theoretical equivalence by Halvorson (2012), Barrett and Halvorson (2016), Halvorson and Tsementzis (2015), Barrett (2018), and Weatherall (2015, 2016a, b) has emphasised the formal aspects of equivalence over the interpretative. Weatherall mentions empirical equivalence in all of his papers on the topic, but for example Barrett and Halvorson (2016) do not mention it at allFootnote 23—nor do those papers mention matters of meaning or interpretation (other than interpretation using formulas in a formal language). Thus one could easily get the impression that some of these authors “are not interested in interpretation”. However, the situation is more subtle.Footnote 24
I will endorse this recent (though usually unstated) consensus, that there is an interesting part of the project of theoretical equivalence that is a largely, but not solely, formal matter.Footnote 25 Thus I will first develop, in Sect. 2.2.2, a notion of ‘weak theoretical equivalence’ (I will discuss physical equivalence in Sect. 2.3). Indeed I believe (contra Coffey (2014)) that there is an interesting philosophical project of:
(EE) Explicating equivalence in formal terms.
This consensus position in the recent discussion of theoretical equivalence can be qualified as a quietist position. Namely, it is the position of authors engaged with parts of the project of theoretical equivalence, and for whom interpretative equivalence is a minimal requirement that their project needs, but on which they do not wish to focus. And so, such authors, when confronted with dual models, discuss formal equivalence but typically refrain from discussing ontological questions. Quietism is the position that my account of weak theoretical equivalence attempts to save, by including an interpretative requirement that is as minimal as possible—namely, mere matching of interpretations.Footnote 26 The account can be suitably strengthened in the second step, as I will do in Sect. 2.3. Such authors have nothing to say about physical equivalence—for whatever reason: perhaps because they are sceptical about the notion, or not interested in it, or think it is too difficult to articulate. This is, to a large extent, a tenable position:Footnote 27 and in a moment I will add some of my own reasons why (EE) is an interesting philosophical project.
In light of the literature discussed above, the challenge in the first step of defining weak theoretical equivalence lies, for the most part, in how one fills in its being ‘a largely, but not solely, formal project’. That is, the difficulty lies in striking an appropriate balance between the interpretative and the formal: in such a way that, once one takes the interpretative conditions to have been established for a pair of theories, as a kind of “boundary condition”, then the project can concentrate on the formal issues.
In the next section I will propose how the schema suggests this can be done. But it will clarify that proposal if I first give three reasons why I think ‘weak theoretical equivalence’, in contrast with ‘physical equivalence’, should be construed as being mostly formal or mathematical:
(i) The notion of weak theoretical equivalence will easily lead in to a notion of physical equivalence, which can then be further analysed.
(ii) The second reason for letting weak theoretical equivalence be formal and not require sameness of interpretation is that establishing sameness of interpretation is never a simple matter: for it requires a notion of identity of domains of application, as I will discuss in Sect. 2.3, which leads in to issues of metaphysics. And we do not want to be too worried about metaphysics in our project of theoretical equivalence.
I take this second reason to underlie the recent consensus mentioned above: that there is a significant conceptual and technical project of finding criteria for when two theories are equivalent, without the need for all the details about the interpretation to be fleshed out, nor to commit to a metaphysical account of how the terms of the theory refer. And such a project strikes me as sensible.
This then prompts me to break up the overall project of explicating equivalence between theories into two tasks. The first task, namely explicating weak theoretical equivalence, cares as little as possible about interpretation: and it corresponds to (EE) above. The second task, namely explicating physical equivalence, takes interpretation fully on board.
(iii) The third reason for keeping weak theoretical equivalence largely formal is that, as the schema for duality has shown, this is indeed also how physicists think about equivalence (cf. De Haro (2018: §4.1.2)). Physicists are happy to say that the high-temperature Ising model is equivalent to a low-temperature one, without worrying about the ontological status of ‘temperature’ under such a duality map. It suffices that two variables that are interpreted as ‘temperatures’ are mapped to each other, and that the values (low vs. high) of ‘temperature’ are also mapped to each other. Even less do physicists worry whether they are e.g. realists or empiricists about temperature under this duality. And this is of course a legitimate practice. Indeed I of course maintain that the metaphysical questions are important; but they can be addressed in the next step, as part of the project of physical equivalence. Thus a project like (EE), that only takes interpretation minimally into account, agrees with a widespread, and justified, scientific practice.
To sum up: regardless of one’s preferred use of ‘theoretical equivalence’: it is an interesting project to establish criteria of formal equivalence that minimize interpretative issues, and so avoid being committed to the fully-fledged project of showing that the theories have the same interpretation (for they may not have the same interpretation!): and this is what I shall call the project of weak theoretical equivalence.
But if one endorses this recent consensus—that weakly theoretically equivalent models are in some (weak!) sense interpretatively equivalent: but that establishing weak theoretical equivalence does not require one to have a complete understanding of the interpretation, so that the project is largely formal—then it is clear that achieving what the consensus aims for requires a delicate balance.
So, it is now crucial to fill in condition (B)—the interpretative requirement—in such a way that we can strike this balance. I will propose that the models must have matching internal interpretations.
Condition (A’): isomorphism of model roots
I first fill in condition (A) in Sect. 2.1.3: namely, the formal or mathematical requirement for weak theoretical equivalence, which I will illustrate in Sect. 3.3. My proposal here is simple: namely, to read the schema’s notion of duality as a formal condition of theoretical equivalence, applicable to formal theories in the physical sciences:Footnote 28
(A’) Isomorphism of model roots of a single bare theory.
Recall that this was precisely the definition of duality, in Sect. 2.1.3. And it is indeed natural to regard duality as prompting the weak theoretical equivalence of two models (in my sense of ‘model’).
Condition (B’): matching internal interpretations
In this section, I fill out the second condition for weak theoretical equivalence: namely, the interpretative requirement, as prompted by the schema’s analysis of duality.
We have seen that the spirit of weak theoretical equivalence is that it should capture when two interpreted models (in the sense of model roots, i.e. often called ‘theories’) are equivalent, both formally and interpretatively, without being committed to a strong notion of sameness of interpretation. This is the ‘delicate balance’ that I mentioned at the end Sect. 2.2.1.
My proposal, then, for filling in condition (B), namely filling in the “interpretative” part of the definition, is to require “matching interpretations”. So, let me first define this notion: this will lead us to the restriction to internal interpretations (as defined in Sect. 2.1.2).
(B’) Matching interpretations: two models, \(M_1\) and \(M_2\), have matching interpretations when the ranges of their interpretation maps are isomorphic, i.e. when:
$$\begin{aligned} \hbox {ran}\,(i_1)\cong \hbox {ran}\,(i_2)~. \end{aligned}$$
(5)
I use the word ‘matching’ here, in order to avoid the already over-used word ‘equivalence’. Recall, from Sect. 2.1.2, that an interpretation is a structure-preserving partial map that maps a model to a domain of application. For the two models \(M_1\) and \(M_2\), we thus have maps \(i_1:M_1\rightarrow D_1\) and \(i_2:M_2\rightarrow D_2\). Here, the domains of application \(D_1\) and \(D_2\) are structured sets, and the interpretation maps are structure-preserving. Thus, the condition Eq. (5) is an isomorphism with respect to that structure, which is induced from the model’s own structure.
For simplicity in the discussion in the rest of this section, I will take the interpretations to be surjective maps, so that \(\hbox {ran}\,(i_1)=D_1\) and \(\hbox {ran}\,(i_2)=D_2\), i.e. the interpretation maps map “to the whole domain of application”: every element in the domain of application is mapped to by at least some element of the model. (This restriction does not affect the content of the discussion but does simplify the notation.) Thus, we can now restate the condition Eq. (5) as follows: \(D_1\cong D_2\). Let us denote the corresponding isomorphism between domains of application by \(\tilde{d}\), so that:
$$\begin{aligned} \tilde{d}:~D_1\rightarrow D_2~. \end{aligned}$$
(6)
I shall call this map, induced from the duality and interpretation maps, the induced duality map.
Combining the duality map, d (defined in Sect. 2.1.3), with the two interpretation maps, we get the diagram in Fig. 2, where the interpretation maps, \(i_1\) and \(i_2\), are evaluated on the model roots, \(m_1\) and \(m_2\) (see the discussion two paragraphs below). Here, d and \(\tilde{d}\) are both isomorphisms, while the interpretation maps of course need not be. Figure 2 thus contrasts with the diagram in Fig. 1, where the duality commutes with the interpretation—in the sense that the three maps \(d,i_1,i_2\) form a commuting diagram. In Fig. 2, the three maps \(d,i_1,i_2\) do not form a commuting diagram. Rather, we need to co-vary the duality map, thus getting the induced duality map, \(\tilde{d}\), in Eq. (6), as we change the interpretation.
The notion of co-variation of the interpretations with the duality map is natural for dualities.Footnote 29 For example, take Kramers–Wannier duality, i.e. the Ising model with high, respectively low, temperature as its two models [cf. Butterfield (2018: §4.3)]. These two models are isomorphic, and furthermore we can also match their interpretations by everywhere replacing a lattice at high temperature with a lattice at low temperature. Thus the isomorphism, d, between the models induces an isomorphism, \(\tilde{d}\), between the domains of application, that takes ‘high temperature’ to ‘low temperature’ and vice versa, while respecting the syntax. The induced duality map between the domains of application, thus construed, is analogous to translating one language into another. Of course, such a map does not in general preserve meanings—rather, it maps them into each other in a non-trivial manner!
Weak theoretical equivalence, as discussed here, is in the context of duality, i.e. of an isomorphism account of equivalence. Thus in particular, the diagram in Fig. 2 implies that \(i_2\,\circ \, d=\tilde{d}\,\circ \,i_1\). Now, in order for \(i_2\) to be defined on, or “match”, the range of d (i.e. the model root, since the duality map only maps the model roots, see Sect. 2.1.3), \(i_2\) must be restricted to the model root, and likewise for \(i_1\) (as shown in Fig. 2). This suggests that, for the purpose of theoretical equivalence between dual theories, we should restrict our interpretations to the internal interpretations, as I anticipated in Sect. 2.1.2 (see especially footnote 13). For such a commuting diagram does not exist for external interpretations, as I now show.
In principle, weak theoretical equivalence could also be defined for external interpretations. We would then replace the model roots \(m_1\) and \(m_2\) in Fig. 2 by the full models, \(M_1\) and \(M_2\) (cf. Eq. (1)), and \(i_1\) and \(i_2\) would be their external interpretations. We would then still require the map, \(\tilde{d}\), between the domains of application to be an isomorphism, as in Eq. (5). But notice that the diagram is no longer commuting: for the cardinalities of \(M_1\) and \(M_2\) may well differ, and so \(M_1\) and \(M_2\) are in general not isomorphic.Footnote 30 Thus, although weak theoretical equivalence could be defined for external interpretations as the isomorphism condition Eq. (5), there is no commuting diagram criterion for it, so that this sort of weak theoretical equivalence is unrelated to the existence of duality. Thus in this paper we hold on to the connection between weak theoretical equivalence and duality, by restricting to internal interpretations (for the example of Maxwell’s theory, see Sect. 3.3).
To sum up: I believe that making (B) precise as (B’), i.e. matching of internal interpretations gives a minimal requirement of interpretative equivalence—as required for the ‘delicate balance’ of the previous section—that is useful both for dualities and for discussions of theoretical equivalence. In short: meanings are simply co-varied as we map one model to the other, without the requirement that the meanings stay “the same”—the latter requirement is only introduced in the next step. Since the recent literature on theoretical equivalence has, as I mentioned, not given a conception of (B), I have here endeavoured to give a minimal one, based on the schema. In the next section, I will refine this to a criterion of theoretical equivalence.
Notice the difference between duality and weak theoretical equivalence thus construed: namely, condition (B’) is required for weak theoretical equivalence, but not for duality. We follow the physicists in defining duality as isomorphism of model roots—period. Weak theoretical equivalence, on the other hand, is constrained by the philosophical tradition to require, in addition, an interpretative condition that I have here interpreted weakly, in the sense of Eq. (5) and Fig. 2. Thus weak theoretical equivalence is an equivalence—an isomorphism—of interpreted model roots, while duality is an isomorphism of model roots, irrespective of their interpretation.
Theoretical equivalence of duals, or physical equivalence
In this section, I spell out the notion of theoretical equivalence, or—in the context of physics, especially as suggested by dualities—physical equivalence: and discuss the relation between weak theoretical equivalence and physical equivalence. In Sect. 2.3.1, I give sufficient conditions for physical equivalence, in the context of an isomorphism account of duality. In Sect. 2.3.2, I discuss when a judgment of physical equivalence is justified, and draw lessons for theoretical equivalence in physics more generally. Then, in Sect. 2.3.3, I compare my account with other work.
Physical equivalence
Recall that I have explicated the weak theoretical equivalence of models as the matching of their interpretations. I will now define physical equivalence as weak theoretical equivalence plus, in addition, sameness of interpretation: more precisely, as sameness of the domains of application, \(D_1\) and \(D_2\) (cf. Sect. 2.1.3, especially Fig. 1). Thus the induced duality map, \(\tilde{d}\), between the domains of application, is the identity map: so that we have the commuting diagram, in Fig. 2, between the duality and the two interpretation maps.
But recall that, after making explicit the unstated consensus of the project of theoretical equivalence in Sect. 2.2.1, I noticed that this consensus aimed to stay formal and to minimize matters of metaphysics. And I justified this consensus as sensible, by using a notion of interpretation that assigned a basic ontology to the models, but remained quiet about other metaphysical matters. And that quietism is, in its turn, justified by intensional semantics’ appropriately modelling the interpretative practices of both realists and empiricists: as in the interpretation maps that I defined in Sect. 2.1.2. Namely, recall the discussion in that section: realists and constructive empiricists can agree about the interpretation of a theory or model, in other words about its basic ontology (‘the picture of the world drawn by the theory’, to use van Fraassen’s (1980: pp. 14, 43, 57) words), even though they have different degrees of belief in the entities that the ontology of the theory postulates. Also, realists may disagree amongst themselves about a metaphysical construal of those entities: think, for example, of Quine’s (1960: §12) referential indeterminacy, according to which the linguist, upon hearing the native utter the word ‘gavagai’ while pointing at a rabbit, might for simplicity translate it as ‘rabbit’—while still being at a loss whether the objects to which this term applies are rabbits, or stages, i.e. brief temporal parts of rabbits, or mereological fusions of spatial parts of rabbits.
In other words, whatever the formally coinciding domains of application are, one has to impose on both sides the same ontological construal of terms, e.g. ‘energy is a property and not a relation’, etc.
Thus, if numerical identity of the elements and relations in the domains of application—or, in a weaker sense, the lack of a difference between them—is what we want, we need an agreed philosophical conception of the ontology, i.e. of the domains of application: which means that we now need to make our metaphysical commitments explicit, e.g. about realism or empiricism and referential indeterminacy. This then amounts to a deeper explication of what we mean by an ‘interpretation’ than just saying that it is a map. Only then does the requirement, that the induced duality map, \(\tilde{d}\) (Eq. (6)), between the domains of application be the identity, secure this lack of a difference.
Thus I will say that the domains of application are (numerically) the same (in a weaker sense, that they lack a difference) if we have an agreed philosophical conception of the interpretation.Footnote 31
In conclusion: two models are physically equivalent iff (i) they are weakly theoretically equivalent, and (ii) they have the same internal interpretation: where ‘sameness’ requires that an agreed philosophical conception of the interpretation has been supplied. This is the notion of theoretical equivalence that is suggested by dualities.
We can now be more precise about how weak theoretical equivalence can lead to physical equivalence, by comparing the diagrams in Figs. 1 and 2. Evidently, we obtain the diagram for physical equivalence, Fig. 1, if the induced duality map, \(\tilde{d}\), in Fig. 2 happens to be the identity map (and provided the above-mentioned philosophical explication is also given). But this is not the generic situation, i.e. \(\tilde{d}\) is in general not the identity map, as we saw in the example of Kramers–Wannier duality: since it maps high temperature to low temperature, and vice-versa.
There is a second way to obtain a situation of theoretical, or physical, equivalence from weak theoretical equivalence, i.e. Fig. 2. The idea is to define a new pair of interpretations using the induced duality map, as follows: \(i_1':=i_1\) and \(i_2':=\tilde{d}\circ i_2\), where both models now have \(D_1\) as their domain of application, i.e. \(i_1'(m_1)=i_2'(m_2)=D_1\). This means that, in effect, we change \(m_2\)’s domain of application, from \(D_2\) to \(D_1\). (Although this definition treats the two models asymmetrically, there is an alternative definition that maps to \(D_2\) rather than \(D_1\)). While this may at first sight look ad hoc, it is in fact natural if \(i_2\) is an external interpretation, while \(i_2'\) is now an internal interpretationFootnote 32: and it is common practice among the physicists working on dualities.
For example, consider gauge-gravity duality, where \(m_1\) is a gauge theory model and \(m_2\) is a quantum gravity model (more precisely, a version of string theory), and take \(i_2\) be the string theory interpretation of the quantum gravity model, while \(i_1\) is a sophisticated, internal interpretation of the gauge theory model. What we then learn about the interpretation of the quantum gravity model is that an internal interpretation, \(i_2'\), of it exists: and this is natural, given the existence of a duality. This “change of interpretation”, from the external interpretation \(i_2\) to the new internal interpretation \(i_2'\), in fact models the practices of the physicists who work on dualities: and the recent literature on dualities has tried to articulate in what sense such interpretations give natural interpretations of quantum gravity and quantum field theories. This is a large topic than I cannot discuss here, but it has been central to the recent literature on dualities: see, for example, Dieks et al. (2015), De Haro (2016, 2019b), Huggett (2017), De Haro and Butterfield (2018) and Read and Møller-Nielsen (2018).Footnote 33
When is a judgment of physical equivalence justified?
The above analysis of physical equivalence leaves unanswered an important epistemic question, about whether we are justified in adopting an internal interpretation, hence judging two models to be physically equivalent. The question is, roughly speaking, whether the internal interpretation is “detailed enough” and “general enough” that it cannot be expected to change, for a given domain of application D. I call this condition unextendability, and it is a condition on the interpreted theory. De Haro (2016: Section 1.3.3) gives a technical definition of the two conditions for unextendability, the ‘detail’ and ‘generality’ conditions. Roughly speaking, the condition that the interpreted theory is detailed enough means that it describes the entire domain of application D, i.e. it does not leave out any details. And the generality condition means that both the bare theory and the domain of application D are general enough that the theory “cannot be extended” to a theory covering a larger set of phenomena. Thus the theory is as general as it can/should be, and D is an entire possible world, and cannot be extended beyond it.Footnote 34 For a discussion of unextendability in the context of the effective field theory programme, in De Haro (2016: Section 1.4).
Part of the unextendability requirement is a general point about the need for mathematically consistent models (which is not the same as: ‘mathematically rigorous models’), in order to address the question of when dual models may be taken to be physically equivalent. For example, most of the solutions of Newtonian gravitation run into inconsistencies. One can already see this in the most elementary examples: any two massive point particles placed next to each other will attract and run into each other, developing a singularity—an infinitely strong force, with the corresponding infinite potential energy—when the two objects coincide. And while this may be a consequence of idealizing massive bodies as point particles, all of the approaches that derive forces for extended bodies use, to my knowledge, the point particle case as their starting point [see for example Truesdell (1966: Eq. (1.4)), Arnold (1989: pp. 133–135), Goldstein et al. (2002: pp. 134–136, 187, 224)]. Thus the pathologies seem pervasive: and so, the theory—as currently formulated—is inconsistent: it does not make any predictions beyond a certain point.Footnote 35 This may be no problem for ordinary uses of the theory, but it is a major obstacle if one wishes to discuss the putative physical equivalence between Newtonian gravitation and Newton–Cartan theory in a justified manner, as discussed aboveFootnote 36; or between two versions of Newtonian mechanics with different standards of rest. For such discussions assume that the theory describes an entire physically possible world—which it does not!Footnote 37
The point is of course not new, and my critique is not a wholesale critique of possible world-talk in physics, as one might take e.g. Wilson (2006: pp. 201–203) to be (for other criticisms, see Frisch (2005: pp. 4–9) and Vickers (2013: pp. 243)).Footnote 38 Possible world-talk is helpful even when it is not meant literally—and so, it can be applied to physical equivalence if one is not worried about this judgment being justified. But if one wishes to justify the judgment of physical equivalence, for which all agree one needs “a theory of the whole world”, one would expect that the possible world-talk should be taken to be literal. That is: because a justified judgment of physical equivalence (see above) entails unextendability, an entire possible world is required to be associated with the theory.
As I expounded in De Haro (2018: Section 1.4), especially with the example of the effective field theory programme, it seems to me that claims of physical equivalence—but also any judgment of theoretical equivalence in physics—between incomplete or inconsistent theories are not justified. Namely, if one is dealing with an effective theory, one must always be prepared to see its interpretation change, precisely at those points at which the theory becomes inconsistent—and that may call for a major overhaul of the theory’s interpretation. In such cases, talk of the ‘possible worlds’ described by the theory cannot be literal, since there are no such worlds. At best, there are limited domains of application within larger worlds that are described by the theory. Thus general considerations of ‘toy cosmologies’ or “theories of everything”, without regard for the finiteness and consistency of the theory, are insufficient. Unextendability is a natural condition that is sufficient to fix this, cf. De Haro (2016). I consider this to be one major contribution of dualities in string theory and quantum field theory to analyses of theoretical equivalence in physics.
Comparing to other work on theoretical equivalence
In this section, I comment on other recent work relating to theoretical equivalence, especially Butterfield (2018) and Read and Møller-Nielsen (2018). A common theme in the recent literature on dualities—and this is another aspect where dualities bear on discussions of theoretical equivalence in physics—is that “physical equivalence is not automatic”. There is by now indeed a consensus among those working on dualities that duality does not invariably lead to physical equivalence (and I will substantiate the existence of this consensus in a moment).
My comments here aim to: (A) first, point out four points where these three authors went further than I did in De Haro (2015, 2016); and second, point out four specific interpretative points where the current paper adds to previous work: two are interpretative (B) and two are about sufficient conditions for physical equivalence (C).
A. Four contributions of Butterfield (2018) and Read and Møller-Nielsen (2018)
The novelty of Butterfield (2018) and Read and Møller-Nielsen (2018) can be summarised in the following four points:
(1) While all hands agree that “formally equivalent models can disagree”, i.e. they can make different claims about the world even if their claims can be mapped to each other one-to-one, Butterfield (2018) has emphasised this point clearly and vividly, in several examples.
(2) Furthermore, Butterfield (2018) distinguishes two ways in which two models can disagree: namely, by making contradictory assertions about the same subject-matter, or by describing different subject-matters.
(3) Read and Møller-Nielsen (2018) have distinguished two different approaches to dualities, which they dub the ‘interpretational’ and the ‘motivational’ approaches [following a similar distinction, for symmetries, in Møller-Nielsen (2017)]. By ‘approaches to dualities’ I here mean different ways in which, when confronted with a duality, physicists or philosophers may choose to approach the interpretative project. The main difference is as follows: on the interpretational approach, the decision whether to interpret duality-related models as being physically equivalent need not wait upon an explication of their shared ontology, while on the motivational approach this explication is mandatory. (Within each of the two approaches, Read and Møller-Nielsen further distinguish two sub-positions, but I will not go into these details.) Thus theirs is a methodological distinction reflecting two different strategies for interpreting dual models.Footnote 39 Notice that the interpretational vs. motivational contrast does not lead to new interpretative options regarding dual pairs: even though being an interpretationalist or a motivationalist does of course bear on the kinds of interpretations that one is likely to adopt.
(4) Furthermore, Read and Møller-Nielsen (2018) articulate the possibility that dual models might not have physical interpretations, and they give this possibility a place in their overall methodological account—namely, in what they call the ‘cautious motivational view’ (more on this below, especially in footnote 40: and, as should be clear already from Sect. 2.3.1, I agree with these authors about motivationalism being the preferred position).Footnote 40
B. Interpretative attitudes and quietism
Let me now make a friendly mention of two points where I think that, in light of the present work, the two discussed accounts of physical equivalence are not enough. The first leads into the second: the first is about differing conceptions of physical equivalence in the literature; the second is about quietism and realism.
The interpretational vs. motivational contrast of Read and Møller-Nielsen (2018) gives an interesting characterisation of different approaches to interpretation. The main contrast is that motivationalists say we are warranted to pronounce verdicts about physical equivalence only if an explication of the common ontology of the models has been given. But, in so far as it is a contrast based on the actions or attitudes taken by interpreters of dualities, rather than on the reasons leading to those actions or attitudes, or on the notion of physical equivalence itself, they do not seem to amount to fully fleshed-out intepretational stances. In particular, interpretationalism is heterogeneous and does not seem to represent a single interpretative position. Their common trait is that interpretationalists say we are warranted to draw verdicts of physical equivalence even in the absence of an explication of the common ontology.
There is a third approach in the recent literature that, I think, is much more widely represented than interpretationalism, which in Sect. 2.2.1 I introduced as quietism: namely, the stance of the authors that I quoted in Sect. 2.2.1 as the “consensus” regarding theoretical equivalence, who are engaged with the formal project of theoretical equivalence, and for whom interpretative equivalence is a minimal requirement that their project needs, but on which they do not wish to focus.
Interpretationalism seems to be less represented in the literature than one might at first think. (Dewar (2015) seems to be one.) For example, Read and Møller-Nielsen (2018: Section 5.1) quote Rickles as an example, but I do not see that the evidence for his interpretationalism is more compelling than that for motivationalism or, indeed, quietism. One reason for this is, I think, that Rickles (2017: p. 64, my emphasis) is not committed to a realist notion of physical equivalence: ‘the equivalence is at the level of theories first and foremost... We must therefore remain purely in the realm of theoretical descriptions to avoid such ‘external’ [physical] confounders.’ Thus it seems to me that Rickles is best seen as a quietist.
Read and Møller-Nielsen (2018) assume realistic interpretations of scientific theories. Also Butterfield (2018) endorses realism in the sense of referential semantics. But I think that referential semantics does not settle all ontological matters because not every ontology, in the sense of “realist” referential semantics, is realist in the metaphysical sense (see the discussion in Sect. 2.1.2). Therefore, I have kept metaphysics as far from the main substance of my proposal for theoretical equivalence as possible—specific metaphysical discussions only coming in when we discuss physical equivalence. For the theory’s interpretation admits both realist and non-realist explications and different resolutions of referential indeterminacy (cf. Sect. 2.3.1).
C. Sufficient conditions for physical equivalence and consistency
It is worth mentioning two further points where I think that, in light of the present work, the accounts of physical equivalence in the literature are not enough. The first is about sufficient conditions for equivalence; the second is about the mathematical consistency required for physical equivalence.
First, Butterfield (2018) does not list additional requirements on the kinds of bare theories or of interpretations that, given a case of duality, can lead to physical equivalence. For example, Butterfield (2018) is sympathetic to my conditions of unextendability and internal interpretation, and he even uses the question of whether the theories in question are ‘toy cosmologies’ or “theories of everything” (TOEs) as a criterion to determine whether they are about the same subject-matter (2018: §4.1–§4.5, §6.2). The reason seems to be ‘that for two dual TOEs, there is literally ‘no room in the cosmos’ for them to have separate but isomorphic subject-matters’ (§1.2). However, he does not address the question of sufficient conditions for taking duality to give physical equivalence.
Second, so far as I can see, the mathematical inconsistencies of some of the theories that are taken as examples of duality and theoretical equivalence (inconsistencies which I discussed in Sect. 2.3.1) have been largely overlooked in the justification of judgments of theoretical equivalence: for example, in Newtonian gravitation. At least, it has not voiced the problem as clearly as it could have. For example, Read and Møller-Nielsen (2018: Section 3.2) take Newtonian gravitation as their main detailed example.Footnote 41 And, even though this example aims to illustrate symmetry, and only by analogy duality, the authors ‘seek to provide a fully worked out example of what it means to fully explicate symmetry-related models’ underlying ontology’ (ibid.). But, as I discussed in Sect. 2.3.1, I doubt that one can fully explicate the ontology of Newton’s theory of gravitation without addressing its singularities and-or inconsistencies. Also Butterfield (2018: Section 4) discusses various examples of dualities: Newtonian mechanics with different standards of rest, position-momentum duality in quantum mechanics, Kramers–Wannier duality, Heisenberg vs. Schrödinger pictures in quantum mechanics, and Legendre vs. Hamiltonian classical mechanics. Agreed: mathematical consistency is less central to Butterfield’s analysis, since he does not aim at a full explication of the ontology of these theories. But the fact that also Butterfield does not distinguish between theories that are well-defined (like his example of position-momentum duality in quantum mechanics) and those that have singularities (like his example of Newtonian gravitation) confirms my point.
In Sect. 2.3.1, I argued that one must have consistent models in order to regard them as justified cases of theoretical equivalence. This also helps one overcome the rhetoric of “examples of dualities should not depend on advanced theoretical physics”. While the simplicity of the examples is desirable, it cannot be had through inconsistent theories, if what one wishes to discuss is the question of when one is justified in judging cases of theoretical equivalence—which forces us to consider consistent theories.Footnote 42 Indeed, dualities between inconsistent models such as Newtonian gravitation are not justified cases of theoretical equivalence, and so they can at best be useful as pedagogical ‘toy models’—as Butterfield indeed seems to take them. In a slogan: a contemporary discussion of theoretical equivalence must go beyond Leibniz, Clarke, and Galileo’s ship.
It seems that considerations of physical equivalence—and more specifically, the criterion of unextendability—give us an interesting distinction in the types of theories that are subject to duality: those in which there is an interesting question about the justification of the judgment of physical equivalence under all suitable interpretations, and those for which this question is answered in the negative.