Abstract
In a companion article it was shown in a certain precise sense that, for any thermodynamical theory that respects the Kelvin–Planck second law, the Hahn–Banach theorem immediately ensures the existence of a pair of continuous functions of the local material state—a specific entropy (entropy per mass) and a thermodynamic temperature—that together satisfy the Clausius–Duhem inequality for every process. There was no requirement that the local states considered be states of equilibrium. This article addresses questions about properties of the entropy and thermodynamic temperature functions so obtained: To what extent do such temperature functions provide a faithful reflection of “hotness”? In precisely which Kelvin–Planck theories is such a temperature function essentially unique, and, among those theories, for which is the entropy function also essentially unique? What is a thermometer for a Kelvin–Planck theory, and, for the theory, what properties does the existence of a thermometer confer? In all of these questions, the Hahn–Banach Theorem again plays a crucial role.
1 Introduction
In a companion article [8] we showed in a certain precise sense that, for any thermodynamical theory in which all processes comply with the Kelvin–Planck Second Law of Thermodynamics, there must exist a pair of continuous functions of the local material state—a specific entropy (entropy per mass) and a thermodynamic temperature—that together satisfy the Clausius–Duhem inequality for every process. That this is so is an immediate consequence of the Hahn–Banach Theorem. For existence of these functions there is no reliance at all on the presence of special processes, such as very slow reversible ones (for example, Carnot cycles).
In this article the situation will be different. Once again, the Hahn–Banach Theorem will be the central tool (see Footnote 1), but this time in examining, for any given Kelvin–Planck theory, properties of the Clausius–Duhem entropy-temperature pairs that the theory admits. In almost every theorem contained here we show that, in sharp contrast to the existence theorem, the presence of special processes within the theory is not only sufficient to ensure that the temperature and entropy functions have specific properties but also necessary.
The difference resides in the fact that there is an inverse relationship between the supply of processes the theory contains and the supply of entropy-temperature pairs that satisfy the Clausius–Duhem inequality for all such processes. The larger the supply of processes, the smaller the set of Clausius–Duhem entropy-temperature pairs, and vice-versa. Thus, if the set of entropy-temperature pairs for a given Kelvin–Planck theory is to have a particular property (for example, essential uniqueness), then the set of processes extant in the theory must be sufficiently large as to ensure that the theory’s set of entropy-temperature pairs is suitably narrow. Stipulating the required breadth of the process supply will often amount to specifying that it must contain an abundance of processes of a very particular kind (for example, Carnot cycles).
The brilliant nineteenth-century pioneers invoked an abundance of idealized reversible Carnot cycles (traversing only equilibrium states) to deduce, almost simultaneously, both the existence and the uniqueness of entropy and thermodynamic temperature functions (defined on those states). It is understandable, then, that the classical arguments might lead to a conflation of, on one hand, existence and uniqueness and, on the other hand, necessary and sufficient conditions for each of these. Although an abundance of Carnot cycles visiting equilibrium states might be sufficient, as argued by the pioneers, for the existence of entropy and thermodynamic temperature functions, the presence of those cycles is not necessary, as we showed in [8] (under substantially weaker conditions than in [7]), nor is there a necessity to restrict the domains of those functions to equilibrium states. Here, among other things, we will develop far more fully ideas in [6] and [7] to the effect that particular properties of the temperature and entropy functions (for example, uniqueness) actually require, remarkably, the presence of something like those specific processes the pioneers imagined.
1.1 How this Article is Structured
After providing a synopsis of [8] in Sect. 2, we examine in Sect. 3 the relationship between temperature and hotness. In particular, for a Kelvin–Planck theory we posit definitions, stated solely in terms of the processes extant within the theory, of what it means for one local material state to be of the same hotness as another state and for one to be hotter than another. We then show that these relations are precisely reflected in the set of Clausius–Duhem temperature scales the theory admits. Not only does each temperature scale reflect the hotness relation faithfully, but also if one state has a higher temperature than another on each Clausius–Duhem temperature scale, then the set of processes in the theory must be sufficiently structured as to establish that the first state is indeed hotter than the second. In this implication the Hahn–Banach Theorem plays a crucial role.
Sections 4 and 5 are largely devoted to uniqueness questions. Uniqueness of a Clausius–Duhem temperature scale for a Kelvin–Planck theory is taken up in Sect. 4, where we show, among other things, that for such a scale to be essentially unique, not only is it sufficient that the set of processes extant in the theory be abundantly rich in Carnot cycles but also necessary. For a Kelvin–Planck theory having an essentially unique Clausius–Duhem temperature scale, we ask in Sect. 5 about circumstances under which its companion Clausius–Duhem specific-entropy function is also essentially unique. Among other things, we show that for essential uniqueness of entropy on the entire state space domain, it is not only sufficient that any two states be connected by a reversible process but also necessary. Here again, the Hahn–Banach Theorem is crucial.
In Sect. 6 we take cognizance of the fact that two very different bodies—one perhaps a metal rod and the other a liquid solution exhibiting chemical reactions and diffusion, embraced within two very different Kelvin–Planck theories—can exchange heat with each other. With this in mind, we study in Sect. 6 properties of a “conjoined” Kelvin–Planck theory that subsumes smaller distinct ones. Our special focus is on conjunctions of separate Kelvin–Planck theories of two different materials, wherein the first material can serve as a (suitably defined) thermometer for the second. In that case, we study how the larger conjoined theory can impart to the second Kelvin–Planck theory additional “hotter than” relations and uniqueness properties that were not intrinsic to it.
Section 7 contains concluding remarks. With an eye toward clarifying and softening distinctions that are sometimes drawn between “equilibrium” and “non-equilibrium” thermodynamics, we review, among other things, what the theorems in this article tell us about the (sometimes conflated) necessary and sufficient conditions for the very separate questions of existence and uniqueness of Clausius–Duhem entropy-temperature pairs.
2 Synopsis of Part I
A thermodynamical theory in [8] is an abstraction of just those features of a material (or collection of materials) that bear upon statements of the Second Law and, ultimately, upon statements of the Clausius–Duhem inequality. A particular theory is indicated by a pair of sets, \((\Sigma ,\mathscr {P})\), with \(\Sigma \) denoting the theory’s state space and \(\mathscr {P}\) denoting the set of processes experienced by those material bodies the theory is deemed to describe.
Elements of \(\Sigma \) are understood to represent the possible local states that might be exhibited in bodies as they experience physical processes captured in \(\mathscr {P}\). The state space will often take the form of a subset of \(\mathbb {R}^n\). In a theory of a particular gas, for example, the states might consist of pairs \([\,p,v] \in \mathbb {R}^2\), where \(p\) is the local pressure and \(v\) is the local specific volume (the reciprocal of the density). In a theory of a reacting mixture of \(n\) chemical species, the states might consist of vectors \([\,c_1, c_2,\dots , c_n, \theta ] \in \mathbb {R}^{n+1}\), where \(c_i\) is the local molar concentration of the \(i\)th species and \(\theta \) is the local temperature on some empirical temperature scale. For reasons given in [8] we assume that \(\Sigma \) is a compact Hausdorff space.
A process in \(\mathscr {P}\) is specified by a pair of objects, the change of condition for the process, usually denoted \(\Delta \mathcalligra {m}\), and the heating measure for the process, usually denoted \(\mathcalligra {q}\). For the purposes of interpretation, imagine a body experiencing a physical process that initiates at time \(t_i\) and terminates at a final time \(t_f\). We begin by indicating what we mean by the body’s condition at a given instant and then the body’s change of condition over the course of the process.
At each instant t during the course of the process, the condition of the body, \(\mathcalligra {m}_t\), is a positive Borel measure on \(\Sigma \) with the following meaning: for each Borel set \(\Lambda \subset \Sigma \), \(\mathcalligra {m}_t(\Lambda )\) is the mass of that part of the body consisting of material in local states residing in \(\Lambda \). Note that \(\mathcalligra {m}_t(\Sigma )\) is the mass of the entire body at time t. For the process, the change of condition is given by the signed Borel measure \(\Delta \mathcalligra {m}:= \mathcalligra {m}_{t_f} - \mathcalligra {m}_{t_i}\). From mass conservation it follows that \(\Delta \mathcalligra {m}(\Sigma ) = 0\).
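As a simple illustration (the body, its mass \(M\), and the states \(\sigma _i\), \(\sigma _f\) are hypothetical and not taken from [8]), suppose a body of mass \(M\) is uniformly in state \(\sigma _i\) at time \(t_i\) and uniformly in state \(\sigma _f\) at time \(t_f\). Writing \(\delta _{\sigma }\) for the Dirac measure concentrated at \(\sigma \), the condition and change of condition might be sketched as follows:

```latex
\[
\begin{aligned}
\mathcalligra{m}_{t_i} &= M\,\delta_{\sigma_i}, \qquad
\mathcalligra{m}_{t_f} = M\,\delta_{\sigma_f},\\
\Delta\mathcalligra{m} &= \mathcalligra{m}_{t_f} - \mathcalligra{m}_{t_i}
  = M\,(\delta_{\sigma_f} - \delta_{\sigma_i}),\\
\Delta\mathcalligra{m}(\Sigma) &= M - M = 0 .
\end{aligned}
\]
```

The last line is the mass-conservation condition; the measure \(\Delta \mathcalligra {m}\) itself is nonzero whenever \(\sigma _f \ne \sigma _i\).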
During the course of the process the body experiencing it will exchange heat with the body’s exterior. The heating measure \(\mathcalligra {q}\) for the process is again a signed Borel measure on \(\Sigma \) with the following interpretation: for each Borel set \(\Lambda \subset \Sigma \), \(\mathcalligra {q}(\Lambda )\) is the net amount of heat absorbed from the body’s exterior, over the course of the entire process, by material which, at the time of absorption, is in states within \(\Lambda \). Note that \(\mathcalligra {q}(\Sigma )\) is the net amount of heat absorbed by the body between the inception of the process and its end.
A member of the process set \(\mathscr {P}\) for the theory \((\Sigma ,\mathscr {P})\) is then identified with an element of the form \(\mathcalligra {p}:= (\Delta \mathcalligra {m},\mathcalligra {q})\). We denote by \(\mathscr {M}(\Sigma )\) the vector space of regular signed measures on \(\Sigma \), taken with its weak-star topology, and by \(\mathscr {M}^{\circ }(\Sigma )\) the linear subspace of \(\mathscr {M}(\Sigma )\) consisting of all its members which, like \(\Delta \mathcalligra {m}\), take the value 0 on \(\Sigma \); the topology of \(\mathscr {M}^{\circ }(\Sigma )\) is the one it inherits as a subset of \(\mathscr {M}(\Sigma )\).
We hereafter regard \(\mathscr {P}\) to be a subset of the vector space \(\mathscr {V}(\Sigma ):= \mathscr {M}^{\circ }(\Sigma )\oplus \mathscr {M}(\Sigma )\), taken with the product topology. By \(\text {Cone}\,(\mathscr {P})\) we mean the set of all non-negative multiples of members of \(\mathscr {P}\). In the appendix of [8] we gave reasons to presume that, in theories that describe natural physical processes, the closure of \(\text {Cone}\,(\mathscr {P})\) should be convex. The discussion so far is summarized in the following definition:
Definition 2.1
A thermodynamical theory consists of a (compact) Hausdorff set \(\Sigma \), called the state space of the theory, and a set \(\mathscr {P}\subset \mathscr {V}(\Sigma )\) such that
$$\begin{aligned} \hat{\mathscr {P}}:= \text {cl}\,(\text {Cone}\,(\mathscr {P})) \end{aligned}$$(2.1)
is convex. Elements of \(\mathscr {P}\) are the processes of the theory.
The Kelvin–Planck version of the Second Law asserts, in effect, that it is impossible in a cyclic process for the body experiencing the process to merely absorb heat from its exterior; it must also emit heat to the exterior, in a manner that is qualitatively different from the heat absorption.
We say that a thermodynamical theory \((\Sigma ,\mathscr {P})\) is a Kelvin–Planck theory if, in a certain precise sense, it complies with the Kelvin–Planck requirement. We regard a process \(\mathcalligra {p}= (\Delta \mathcalligra {m},\mathcalligra {q})\) in \(\mathscr {P}\) to be cyclic if \(\Delta \mathcalligra {m}= 0\), that is, if the condition of the body experiencing the process is the same at the process’s beginning and end. By \(\mathscr {M}_{+}(\Sigma )\) we mean the subset of \(\mathscr {M}(\Sigma )\) consisting of measures that are non-negative on every Borel set. By \((0,\mathscr {M}_{+}(\Sigma ))\) we mean the set of all members of \(\mathscr {V}(\Sigma )\) of the form \((0,\nu )\), with \(\nu \in \mathscr {M}_{+}(\Sigma )\). We take the Kelvin–Planck stricture to require that \(\mathscr {P}\) meet \((0,\mathscr {M}_{+}(\Sigma ))\) at most in \((0,0)\); that is, if \((0,\mathcalligra {q}) \in \mathscr {P}\) is a cyclic process such that the heating measure \(\mathcalligra {q}\) is positive on some Borel set in \(\Sigma \), then \(\mathcalligra {q}\) should be negative on some other Borel set. In fact, for reasons explained in [8], we also require a little more:
Definition 2.2
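A Kelvin–Planck theory is a thermodynamical theory \((\Sigma ,\mathscr {P})\) such that
$$\begin{aligned} \text {cl}\,(\text {Cone}\,(\mathscr {P}))\ \cap \ (0,\mathscr {M}_{+}(\Sigma )) = \{(0,0)\}. \end{aligned}$$(2.2)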
Equation (2.2) amounts to a requirement that no nonzero element of the forbidden cone \((0,\mathscr {M}_{+}(\Sigma ))\) is approximated by vectors of \(\mathscr {V}(\Sigma )\) that point along directions associated with members of \(\mathscr {P}\). The Hahn–Banach Theorem then leads almost immediately to the existence, for a Kelvin–Planck theory, of continuous specific-entropy and thermodynamic temperature functions of state that together comply with the Clausius–Duhem inequality for all processes the theory contains. The version of the Hahn–Banach Theorem that we employ (see Footnote 2) is given below.
Theorem 2.3
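(Hahn–Banach) Let V be a Hausdorff locally convex topological vector space, and let A and B be non-empty disjoint closed convex subsets of V, with B compact. There is a continuous linear function \(f: V \rightarrow \, \mathbb {R}\) and a number \(\gamma \in \mathbb {R}\) such that
$$\begin{aligned} f(a) < \gamma , \quad \forall \ a \in A \end{aligned}$$
and
$$\begin{aligned} f(b) > \gamma , \quad \forall \ b \in B. \end{aligned}$$
In particular, if A is a cone, then
$$\begin{aligned} f(a) \le 0, \quad \forall \ a \in A \end{aligned}$$
and
$$\begin{aligned} f(b) > 0, \quad \forall \ b \in B. \end{aligned}$$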
Unlike classical arguments for the existence of continuous entropy and thermodynamic temperature functions of state, the following theorem (the most important in this two-part series) requires nothing in the way of special processes or the idea of equilibrium states. The proof [7, 8] follows from the Hahn–Banach Theorem directly. In the theorem statement, \(\mathbb {R}_+\) denotes the strictly positive real numbers.
Theorem 2.4
(Existence of Entropy and Thermodynamic Temperature) For a thermodynamical theory \((\Sigma ,\mathscr {P})\) the following are equivalent:
(i) \((\Sigma ,\mathscr {P})\) is a Kelvin–Planck theory.
(ii) There exist functions \(\eta \in \text {C}(\Sigma ,\mathbb {R})\) and \(T \in \text {C}(\Sigma ,\mathbb {R}_+)\) such that
$$\begin{aligned} \int _{\Sigma }\eta \, d(\Delta \mathcalligra {m})\ \ge \ \int _{\Sigma }\frac{\textrm{d}\mathcalligra {q}}{T}, \quad \forall \ (\Delta \mathcalligra {m},\mathcalligra {q}) \in \mathscr {P}. \end{aligned}$$(2.3)
Definition 2.5
(Entropy, Thermodynamic Temperature) Let \((\Sigma ,\mathscr {P})\) be a Kelvin–Planck theory. An element \((\eta ,T)\) of \(\text {C}(\Sigma ,\mathbb {R}) \times \text {C}(\Sigma ,\mathbb {R}_+)\) that satisfies (2.3) is a Clausius–Duhem pair for the theory. A function \(T \in \text {C}(\Sigma ,\mathbb {R}_+)\) is a Clausius–Duhem temperature scale for the theory if there exists \(\eta \in \text {C}(\Sigma ,\mathbb {R})\) such that \((\eta ,T)\) is a Clausius–Duhem pair. In that case, \(\eta (\cdot )\) is a specific-entropy function for the theory (corresponding to the Clausius–Duhem temperature scale \(T(\cdot ))\). The set of all Clausius–Duhem temperature scales for the Kelvin–Planck theory \((\Sigma ,\mathscr {P})\) is denoted \(\mathscr {T}_{CD}(\Sigma ,\mathscr {P})\) or merely \(\mathscr {T}_{CD}\) when the theory under consideration is apparent.
Remark 2.6
It will be useful to record an observation made in the proof [8] of Theorem 2.4: if \((\eta ,T)\) is a Clausius–Duhem pair for the Kelvin–Planck theory \((\Sigma ,\mathscr {P})\) (that is, if it satisfies the inequality in (2.3) for all members of \(\mathscr {P}\)), then it also satisfies that inequality for all members of \(\hat{\mathscr {P}}\), the closure of \(\text {Cone}\,(\mathscr {P})\).
Remark 2.7
(Reversible members of \(\hat{\mathscr {P}}\)) Note that if \((\Delta \mathcalligra {m},\mathcalligra {q})\) and \(-(\Delta \mathcalligra {m},\mathcalligra {q})\) are both members of \(\mathscr {P}\) (or, more generally, of \(\hat{\mathscr {P}}\)) then for each Clausius–Duhem pair \((\eta ,T)\) we must actually have the equality
$$\begin{aligned} \int _{\Sigma }\eta \, d(\Delta \mathcalligra {m})\ =\ \int _{\Sigma }\frac{\textrm{d}\mathcalligra {q}}{T}. \end{aligned}$$(2.4)
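The equality asserted in Remark 2.7 can be checked by applying the Clausius–Duhem inequality (2.3), extended to \(\hat{\mathscr {P}}\) as in Remark 2.6, to both elements. A short sketch of the argument:

```latex
\[
\begin{aligned}
(\Delta\mathcalligra{m},\mathcalligra{q}):\quad
 & \int_{\Sigma}\eta\, d(\Delta\mathcalligra{m}) \ \ge\ \int_{\Sigma}\frac{d\mathcalligra{q}}{T},\\
-(\Delta\mathcalligra{m},\mathcalligra{q}):\quad
 & -\int_{\Sigma}\eta\, d(\Delta\mathcalligra{m}) \ \ge\ -\int_{\Sigma}\frac{d\mathcalligra{q}}{T}
 \iff \int_{\Sigma}\eta\, d(\Delta\mathcalligra{m}) \ \le\ \int_{\Sigma}\frac{d\mathcalligra{q}}{T},\\
\text{hence}\quad
 & \int_{\Sigma}\eta\, d(\Delta\mathcalligra{m}) \ =\ \int_{\Sigma}\frac{d\mathcalligra{q}}{T}.
\end{aligned}
\]
```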
Remark 2.8
(Essential uniqueness) It is not difficult to see that if, for a thermodynamical theory, \((\eta (\cdot ),T(\cdot ))\) is a Clausius–Duhem pair then, for any choice of \(\alpha \in \mathbb {R}_+\) and \(\beta \in \mathbb {R}\),
$$\begin{aligned} (\alpha \,\eta (\cdot ) + \beta ,\ T(\cdot )/\alpha ) \end{aligned}$$
is also a Clausius–Duhem pair. In particular, if \(T(\cdot )\) is a Clausius–Duhem temperature scale for a thermodynamical theory, then so is any positive multiple of \(T(\cdot )\).
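The invariance can be verified in one line. The sketch below assumes the transformed pair is written as \((\alpha \eta + \beta ,\, T/\alpha )\), one parametrization consistent with the statement that positive multiples of \(T(\cdot )\) are again temperature scales, and uses \(\Delta \mathcalligra {m}(\Sigma ) = 0\):

```latex
\[
\begin{aligned}
\int_{\Sigma}(\alpha\eta + \beta)\, d(\Delta\mathcalligra{m})
 &= \alpha\int_{\Sigma}\eta\, d(\Delta\mathcalligra{m}) + \beta\,\Delta\mathcalligra{m}(\Sigma)
  = \alpha\int_{\Sigma}\eta\, d(\Delta\mathcalligra{m})\\
 &\ge\ \alpha\int_{\Sigma}\frac{d\mathcalligra{q}}{T}
  = \int_{\Sigma}\frac{d\mathcalligra{q}}{T/\alpha}.
\end{aligned}
\]
```

The additive constant \(\beta \) drops out because every change of condition has zero total mass, and the factor \(\alpha \) must be positive for the inequality to keep its direction.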
However, there might be still other Clausius–Duhem temperature scales that are not of this kind. For this reason, we say that, for a thermodynamical theory, a Clausius–Duhem temperature scale \(T(\cdot )\) is essentially unique if every other Clausius–Duhem temperature scale for the theory is a positive multiple of \(T(\cdot )\). Similarly, if \(\eta (\cdot )\) is a Clausius–Duhem specific-entropy function corresponding to a particular Clausius–Duhem temperature scale \(T(\cdot )\), we say that \(\eta (\cdot )\) is essentially unique if any other Clausius–Duhem entropy scale corresponding to \(T(\cdot )\) differs from \(\eta (\cdot )\) by at most a constant.
Among other things, we will take up uniqueness questions in the remainder of this article.
3 Hotness and Its Reflection in Thermodynamic Temperature Scales
Theorem 2.4 asserts the equivalence of a version of the Kelvin–Planck Second Law and the existence of functions-of-state pairs, consisting of a specific entropy and a thermodynamic temperature, which together satisfy the Clausius–Duhem inequality for all processes the theory contains. Note, however, that as yet there has been no notion of “hotness” posited for a particular state (as distinct from its temperature), nor has there been posited a meaning for the idea that one state is “hotter than” another.
Yet, hotness and hotter than are notions generally regarded to be inextricable from thermodynamics itself. Indeed, Clausius’s formulation of the Second Law, unlike the Kelvin–Planck formulation, takes “hotter than” as a primitive idea: Heat can never pass from a colder to a warmer body without some other change, connected therewith, occurring at the same time [3, 10].
Temperature is of course supposed to provide a faithful reflection of hotness (the more fundamental notion), and so it should be with the Clausius–Duhem temperature scales derived from the Kelvin–Planck Second Law. To determine whether this is indeed the case, it will be necessary to first posit for a Kelvin–Planck theory \((\Sigma ,\mathscr {P})\) means by which two states in \(\Sigma \) can be judged to be of the same hotness or by which one can be judged hotter than the other. We take the view that, to the extent that a Kelvin–Planck theory \((\Sigma ,\mathscr {P})\) can be deemed self-contained, such judgments should derive from examination of the processes described by \(\mathscr {P}\) (or of the elements in its extension \(\hat{\mathscr {P}}\) ). This view will broaden somewhat in Sect. 6 when we consider conjoined thermodynamical theories and the idea of a thermometer.
Remark 3.1
(Clausius–Duhem vs. Clausius temperature scales) In the next few sections we will examine the relationship between hotness and its reflection in Clausius–Duhem temperature scales. Similar questions were addressed in [6], where our concern was with what we called Clausius temperature scales. A Clausius temperature scale is one that satisfies the Clausius inequality, which is the Clausius–Duhem inequality (2.3) restricted to cyclic processes. In that case, the left side of (2.3) reduces to zero (\(\Delta \mathcalligra {m}= 0\)). Neither the entropy nor the change of condition plays a role.
For this reason, the set of Clausius temperature scales can, for a Kelvin–Planck theory, be different from its set of Clausius–Duhem temperature scales (Definition 2.5). The delicate relationship between the two sets is discussed in Appendix D. Although theorems in the coming subsections resemble some in [6] about Clausius scales, it should be kept in mind that here they are about sets of thermodynamic temperature scales different from those in [6].
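Written out, the reduction of (2.3) to the Clausius inequality described in Remark 3.1 reads as follows (a sketch for a cyclic process \((0,\mathcalligra {q}) \in \mathscr {P}\) and any Clausius–Duhem pair \((\eta ,T)\)):

```latex
\[
\Delta\mathcalligra{m} = 0
\ \Longrightarrow\
0 = \int_{\Sigma}\eta\, d(\Delta\mathcalligra{m})
\ \ge\ \int_{\Sigma}\frac{d\mathcalligra{q}}{T},
\qquad\text{i.e.}\qquad
\int_{\Sigma}\frac{d\mathcalligra{q}}{T}\ \le\ 0
\quad \forall\ (0,\mathcalligra{q}) \in \mathscr{P}.
\]
```

Every Clausius–Duhem temperature scale is therefore a Clausius temperature scale. Any difference between the two sets must lie in the converse direction, since a scale satisfying the cyclic inequality need not admit an \(\eta \) for which (2.3) holds on non-cyclic processes.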
3.1 Hotness as Revealed by Processes
In deciding for a Kelvin–Planck theory \((\Sigma ,\mathscr {P})\) the relative hotnesses of two states in \(\Sigma \) we will rely heavily on the cyclic processes contained in \(\mathscr {P}\) or, more generally, on elements of the form \((0,\mathcalligra {q})\) in \(\hat{\mathscr {P}}\). This is because, after the cycle, the condition of the body suffering the process is left unchanged and, as a result, the relationship between heat and work is especially simple. This is explained in the following remark.
Remark 3.2
Although the First Law of Thermodynamics plays no formal role here, one aspect of it will help guide our consideration of hotness. If, in a thermodynamical theory \((\Sigma ,\mathscr {P})\), \((\Delta \mathcalligra {m},\mathcalligra {q})\) is a process, then \(\mathcalligra {q}(\Sigma )\) is the net amount of heat absorbed by the body experiencing the process over the entire course of the process. If the process is cyclic (that is, if \(\Delta \mathcalligra {m}=0\)), then the First Law indicates that the work done by that body during the process is identical to the net heat received, \(\mathcalligra {q}(\Sigma )\). Thus, if \(\mathcalligra {q}(\Sigma )\) is positive then the body does work. If \(\mathcalligra {q}(\Sigma ) = 0\) then the body does no work, nor is any work done on it.
The picture usually invoked for a cyclic process is that of a device (for example, an engine or a refrigerator) in which there are nontrivial temporal variations in the condition of the body suffering the process, with its final condition restored to what it was at the beginning. This will be the cyclic-process picture that we will often have in mind.
However, there is another one, more closely connected with common physical experience, that will help motivate the mathematical expression of heat transfer from hot to cold. This other picture is that of a body in a steady condition, such as the one envisioned in time-invariant solutions of the familiar heat conduction equation, with heat flux at the body’s boundary. Because there is no change of condition over time, \(\Delta \mathcalligra {m}= 0\) so that over any fixed time interval the process is, in our sense, cyclic.
Example 3.3
(A Simple Steady Heat Transfer Process) Imagine a very slim cylindrical tube filled with a sample of the material under consideration, a material having state space \(\Sigma \). The tube is insulated along its extent but uninsulated at its ends. The two ends are immersed in different environments that ensure the permanent and laterally uniform presence of state \(\sigma '\) at one end and state \(\sigma \) at the other. In the picture envisioned, the sample exhibits no change over time (i.e., the sample’s condition \(\mathcalligra {m}\in \mathscr {M}_{+}(\Sigma )\) is time-invariant), with steady heat inflow at the \(\sigma '\) end and steady heat outflow at the \(\sigma \) end, the rates of heat flow at both ends being identical. Because the sample has a steady condition and there is no net heat receipt, the sample experiences no work.
In the context of this picture we can associate, with a fixed time interval, a cyclic process \((\Delta \mathcalligra {m},\mathcalligra {q})\in \mathscr {V}(\Sigma )\), with \(\Delta \mathcalligra {m}= 0\) and \(\mathcalligra {q}= \alpha \delta _{\sigma '} - \alpha \delta _{\sigma }\), where \(\delta _{\sigma '}\) and \(\delta _{\sigma }\) are Dirac measures in \(\mathscr {M}(\Sigma )\) concentrated at \(\sigma '\) and \(\sigma \). The positive number \(\alpha \) is the amount of heat absorbed at the \(\sigma '\) end and emitted at the \(\sigma \) end during the stipulated time interval. Note that \(\mathcalligra {q}(\Sigma ) =0\), which is consistent with the absence of work.
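To see what such a process implies for temperature, one can apply the Clausius–Duhem inequality (2.3) to it. The following sketch assumes that the idealized process belongs to \(\mathscr {P}\) (or to \(\hat{\mathscr {P}}\), so that Remark 2.6 applies) and that \((\eta ,T)\) is any Clausius–Duhem pair:

```latex
\[
\begin{aligned}
0 = \int_{\Sigma}\eta\, d(\Delta\mathcalligra{m})
 &\ \ge\ \int_{\Sigma}\frac{d\mathcalligra{q}}{T}
  = \alpha\left(\frac{1}{T(\sigma')} - \frac{1}{T(\sigma)}\right),\\
\text{so that}\quad T(\sigma') &\ \ge\ T(\sigma).
\end{aligned}
\]
```

Under these assumptions, heat absorbed at \(\sigma '\) and emitted at \(\sigma \) in a workless cycle is compatible only with \(\sigma '\) being at least as warm as \(\sigma \) on every Clausius–Duhem temperature scale.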
Remark 3.4
The process considered in Example 3.3 is an idealized one, for it requires, among other things, means to maintain material at the tube ends in permanent and transversely uniform states \(\sigma '\) and \(\sigma \), and without any temporal change in the condition of the material along the tube’s extent. Although, for a thermodynamical theory \((\Sigma ,\mathscr {P})\), such a process might not actually be represented among members of \(\mathscr {P}\) (the true processes), the idealization might well be a member of \(\hat{\mathscr {P}}:= \text {cl}\,(\textrm{Cone} \,(\mathscr {P}))\). That is, if \((0,\alpha (\delta _{\sigma '} - \delta _{\sigma }))\) is not among the true processes, it might be approximated arbitrarily closely by true processes (or multiples of them). In such a case, the presence in \(\textrm{Cone} \,(\mathscr {P})\) of those approximations would give the same sense of passage of heat from \(\sigma '\) to \(\sigma \) as would the idealized example.
To the extent that the direction of heat transfer in a workless cyclic process should guide our conception of relative hotness in a theory \((\Sigma ,\mathscr {P})\), the presence in \(\mathscr {P}\) (or \(\hat{\mathscr {P}}\)) of the process in Example 3.3 would compel us to say that state \(\sigma \) is not hotter than state \(\sigma '\). However, we refrain from asserting that \(\sigma '\) is hotter than \(\sigma \): In view of Remark 3.4, it might happen that \(\hat{\mathscr {P}}\) also contains a (reverse) element of the form \((0,\bar{\mathcalligra {q}})\), with \(\bar{\mathcalligra {q}} = \alpha \delta _{\sigma } - \alpha \delta _{\sigma '}\). (See Remark 3.6 below.) In fact, such a possibility provides a basis for asserting that two different states are of the same hotness.
Definition 3.5
For a thermodynamical theory \((\Sigma ,\mathscr {P})\), two states \(\sigma \in \Sigma \) and \(\sigma ' \in \Sigma \) are of the same hotness (denoted \(\sigma \sim \sigma '\)) if both \((0, \delta _{\sigma } - \delta _{\sigma '})\) and \((0, \delta _{\sigma '} - \delta _{\sigma })\) are members of \(\hat{\mathscr {P}}\). The equivalence relation \(\sim \) induces a partition of \(\Sigma \) into equivalence classes called the hotness levels of the thermodynamical theory \((\Sigma ,\mathscr {P})\). We denote by \(\mathscr {H}\) the set of hotness levels induced in \(\Sigma \) by \(\mathscr {P}\), and we give \(\mathscr {H}\) the quotient topology it inherits from \(\Sigma \).
Remark 3.6
There is no requirement in Definition 3.5 that \((0, \delta _{\sigma }- \delta _{\sigma '})\) and \((0, \delta _{\sigma '}- \delta _{\sigma })\) be members of \(\mathscr {P}\), corresponding to the true processes, only that they be well-approximated by positive multiples of \(\mathscr {P}\)’s members. Such approximations might arise if certain members of \(\mathscr {P}\) correspond to mathematical encodings of physical processes closely resembling the one described in Example 3.3, having small departures from \(\sigma '\) and \(\sigma \) at the two cylinder ends, some giving rise to heat flow in one direction and others inducing heat flow in the opposite direction.
Remark 3.7
(A Dynamical Variant of Example 3.3) The heat transfer process described in Example 3.3 was a simple toy one in which the body suffering the process was in a steady condition (although not in what is usually regarded as a thermodynamic equilibrium). The example was intended to provide motivation for the idea, given in Definition 3.5, that two states are of the same hotness. Lest it be thought that an element of the form \((0,\alpha (\delta _{\sigma '} - \delta _{\sigma }))\), in particular with \(\Delta \mathcalligra {m}= 0\), can arise in \(\hat{\mathscr {P}}\) only from consideration of physical processes having a temporally unchanging condition, we provide in Appendix A a different and more physically robust example in which such steadiness is never present.
The following theorem indicates that, for a Kelvin–Planck theory \((\Sigma ,\mathscr {P})\), not only is it true that two states of the same hotness have the same value on each Clausius–Duhem temperature scale but also that if they are not distinguished by any such scale, then \(\hat{\mathscr {P}}\) must contain the elements stipulated in Definition 3.5. For this, the Hahn–Banach Theorem will play a central role.
Theorem 3.8
Let \(\sigma \in \Sigma \) and \(\sigma ' \in \Sigma \) be two states of the Kelvin–Planck theory \((\Sigma ,\mathscr {P})\). The following are equivalent:
(i) \(\sigma \) and \(\sigma '\) are of the same hotness.
(ii) \(T(\sigma ) = T(\sigma ')\) for every Clausius–Duhem temperature scale \(T \in \mathscr {T}_{CD}\).
Two lemmas will facilitate the proof that (ii) implies (i). \(\mathscr {M}_{+}^1(\Sigma )\) denotes the set of \(\mathcalligra {m}\in \mathscr {M}_{+}(\Sigma )\) such that \(\mathcalligra {m}(\Sigma ) = 1\).
Lemma 3.9
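Let \((\Sigma ,\mathscr {P})\) be a Kelvin–Planck theory, let \((\mathcalligra {v},\mathcalligra {w})\) be an element of \(\mathscr {V}(\Sigma )\), and let \(\mathscr {K}(\mathcalligra {v}, \mathcalligra {w})\) be the convex hull of \(\{(\mathcalligra {v},\mathcalligra {w})\} \cup (0,\mathscr {M}_{+}^1(\Sigma ))\); that is, let
$$\begin{aligned} \mathscr {K}(\mathcalligra {v}, \mathcalligra {w}):= \{\lambda (\mathcalligra {v},\mathcalligra {w}) + (1-\lambda )(0,\mathcalligra {u})\,:\ \lambda \in [0,1],\ \mathcalligra {u}\in \mathscr {M}_{+}^1(\Sigma )\}. \end{aligned}$$(3.1)
If \(\mathscr {K}(\mathcalligra {v}, \mathcalligra {w})\) is disjoint from \(\hat{\mathscr {P}}\) then there is for the theory a Clausius–Duhem pair \((\eta ,T)\) such that
$$\begin{aligned} \int _{\Sigma }\eta \, d\mathcalligra {v}\ <\ \int _{\Sigma }\frac{\textrm{d}\mathcalligra {w}}{T}. \end{aligned}$$(3.2)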
Proof
Because the sets \(\{(\mathcalligra {v},\mathcalligra {w})\}\) and \((0,\mathscr {M}_{+}^1(\Sigma ))\) are both convex and compact, the convex hull of their union, \(\mathscr {K}(\mathcalligra {v}, \mathcalligra {w})\), is also convex and compact ([2], §19.5). Moreover, by hypothesis \(\mathscr {K}(\mathcalligra {v}, \mathcalligra {w})\) is disjoint from \(\hat{\mathscr {P}}\). To get the desired result, we need only repeat the Hahn–Banach separation argument in the proof [8] of Theorem 2.4, \((i) \Rightarrow (ii)\), with \(\mathscr {K}(\mathcalligra {v}, \mathcalligra {w})\) replacing \((0,\mathscr {M}_{+}^1(\Sigma ))\). \(\square \)
Lemma 3.10
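Let \((\Sigma ,\mathscr {P})\) be a Kelvin–Planck theory for which \((\eta ^{\,\circ },T^{\,\circ })\) is a Clausius–Duhem pair, and let \((\mathcalligra {v},\mathcalligra {w}) \in \mathscr {V}(\Sigma )\) be such that
$$\begin{aligned} \int _{\Sigma }\eta ^{\,\circ }\, d\mathcalligra {v}\ =\ \int _{\Sigma }\frac{\textrm{d}\mathcalligra {w}}{T^{\,\circ }}. \end{aligned}$$(3.3)
If \((\mathcalligra {v},\mathcalligra {w})\) is not a member of \(\hat{\mathscr {P}}\), then there is another Clausius–Duhem pair \((\eta ,T)\) such that
$$\begin{aligned} \int _{\Sigma }\eta \, d\mathcalligra {v}\ <\ \int _{\Sigma }\frac{\textrm{d}\mathcalligra {w}}{T}. \end{aligned}$$(3.4)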
Proof
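We first show that \(\mathscr {K}(\mathcalligra {v},\mathcalligra {w})\) intersects \(\hat{\mathscr {P}}\) at most in \((\mathcalligra {v},\mathcalligra {w})\). Note that each element \((\mathcalligra {v}^*,\mathcalligra {w}^*)\) in \(\mathscr {K}(\mathcalligra {v},\mathcalligra {w})\) is of the form
$$\begin{aligned} (\mathcalligra {v}^*,\mathcalligra {w}^*) = \lambda (\mathcalligra {v},\mathcalligra {w}) + (1-\lambda )(0,\mathcalligra {u}), \end{aligned}$$(3.5)
with \(\lambda \in [0,1]\) and \(\mathcalligra {u}\in \mathscr {M}_{+}^1(\Sigma )\). For \(\lambda < 1\), it follows from (3.3), (3.5), the positivity of \(\mathcalligra {u}\), and the positivity of \(T^{\circ }\) that
$$\begin{aligned} \int _{\Sigma }\eta ^{\,\circ }\, d\mathcalligra {v}^*\ <\ \int _{\Sigma }\frac{\textrm{d}\mathcalligra {w}^*}{T^{\,\circ }}. \end{aligned}$$(3.6)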
Because \((\eta ^{\,\circ },T^{\,\circ })\) is a Clausius–Duhem pair for \((\Sigma ,\mathscr {P})\), (3.6) indicates that no member of \(\mathscr {K}(\mathcalligra {v},\mathcalligra {w})\) can be a member of \(\hat{\mathscr {P}}\), except perhaps for \((\mathcalligra {v},\mathcalligra {w})\) itself (that is, corresponding to \(\lambda =1\)). Thus if, as in the hypothesis, \((\mathcalligra {v},\mathcalligra {w})\) is not a member of \(\hat{\mathscr {P}}\), then \(\mathscr {K}(\mathcalligra {v},\mathcalligra {w})\) and \(\hat{\mathscr {P}}\) are disjoint. In this case, Lemma 3.9 ensures the existence of a Clausius–Duhem pair \((\eta ,T)\) such that the (strict) inequality (3.4) is satisfied. \(\square \)
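The strict inequality for \(\lambda < 1\) can be checked term by term. The following is a sketch under two assumptions that are not spelled out in this passage: that the Clausius–Duhem inequality for a pair \((\eta ,T)\) reads \(\int _{\Sigma }\eta \,\textrm{d}(\Delta \mathcalligra {m}) \ge \int _{\Sigma }\textrm{d}\mathcalligra {q}/T\) on \(\hat{\mathscr {P}}\) (the convention suggested by the hyperplane (5.2)), and that hypothesis (3.5) holds as an equality. Writing \((\mathcalligra {v}^*,\mathcalligra {w}^*) = \lambda (\mathcalligra {v},\mathcalligra {w}) + (1-\lambda )(0,\mathcalligra {u})\), linearity of the integrals gives
$$\begin{aligned} \int _{\Sigma }\eta ^{\,\circ }\, \textrm{d}\mathcalligra {v}^* - \int _{\Sigma }\frac{\textrm{d}\mathcalligra {w}^*}{T^{\,\circ }} = \lambda \left( \int _{\Sigma }\eta ^{\,\circ }\, \textrm{d}\mathcalligra {v} - \int _{\Sigma }\frac{\textrm{d}\mathcalligra {w}}{T^{\,\circ }}\right) - (1-\lambda )\int _{\Sigma }\frac{\textrm{d}\mathcalligra {u}}{T^{\,\circ }} = - (1-\lambda )\int _{\Sigma }\frac{\textrm{d}\mathcalligra {u}}{T^{\,\circ }} < 0. \end{aligned}$$
The final inequality holds because \(\mathcalligra {u}\) is a nonzero positive measure and \(1/T^{\,\circ }\) is strictly positive. If the hypothesis is assumed only with \(\le \) in place of equality, the middle equality becomes \(\le \) and the conclusion is unchanged. Either way, every element of \(\mathscr {K}(\mathcalligra {v},\mathcalligra {w})\) with \(\lambda < 1\) violates the Clausius–Duhem inequality for \((\eta ^{\,\circ },T^{\,\circ })\).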
Proof of Theorem 3.8
First we will show that (ii) implies (i). Suppose, then, that \(\sigma \) and \(\sigma '\) are not distinguished by any Clausius–Duhem temperature scale. We want to show that both \((0, \delta _{\sigma } - \delta _{\sigma '})\) and \((0, \delta _{\sigma '} - \delta _{\sigma })\) are members of \(\hat{\mathscr {P}}\). Let \((\eta ^{\,\circ },T^{\,\circ })\) be a Clausius–Duhem pair for \((\Sigma ,\mathscr {P})\), and let \((\mathcalligra {v},\mathcalligra {w}) = (0, \delta _{\sigma } - \delta _{\sigma '})\). Note that
Now suppose that \((\mathcalligra {v},\mathcalligra {w}) = (0, \delta _{\sigma } - \delta _{\sigma '})\) is not a member of \(\hat{\mathscr {P}}\). From Lemma 3.10 it follows that there is another Clausius–Duhem pair \((\eta ,T)\) such that
This, however, contradicts the supposition that no Clausius–Duhem temperature scale distinguishes \(\sigma \) from \(\sigma '\). The proof that \((0, \delta _{\sigma '} - \delta _{\sigma })\) is a member of \(\hat{\mathscr {P}}\) is similar.
To prove that (i) implies (ii) we suppose that both \((0, \delta _{\sigma '} - \delta _{\sigma })\) and \((0, \delta _{\sigma } - \delta _{\sigma '})\) are members of \(\hat{\mathscr {P}}\). From Remark 2.7 it follows that, for every Clausius–Duhem pair, the equality (2.4) obtains, with \((\Delta \mathcalligra {m},\mathcalligra {q})\) taken to be \((0, \delta _{\sigma } - \delta _{\sigma '})\). From this it follows that \(T(\sigma ) = T(\sigma ')\) for all \(T \in \mathscr {T}_{CD}\). \(\square \)
Because, for a particular Clausius–Duhem temperature scale, all states in a hotness level have the same temperature, it makes sense to speak of the “temperature of a hotness level” relative to that scale. In this sense, Theorem 3.8 enables us to think about temperature as a function of hotness rather than as a function of state.
Definition 3.11
Let \(T:\Sigma \rightarrow \mathbb {R}_+\) be a Clausius–Duhem temperature scale for a Kelvin–Planck theory with hotness levels \(\mathscr {H}\). By \(T_{*}: \mathscr {H}\rightarrow \mathbb {R}_+\) we mean the Clausius–Duhem temperature scale on \(\mathscr {H}\) induced by T in the following way: for \(h \in \mathscr {H}\) let \(\sigma \in \Sigma \) be any member of h; then
$$\begin{aligned} T_{*}(h) := T(\sigma ). \end{aligned}$$
The set of all Clausius–Duhem temperature scales induced on \(\mathscr {H}\) by members of \(\mathscr {T}_{CD}\) will be denoted by \(\mathscr {T}_{CD*}\).
Remark 3.12
Before closing this sub-section, we provide for a Kelvin–Planck theory \((\Sigma ,\mathscr {P})\) a few facts, which, among others, were proved (in a slightly different setting) as Lemma 6.3 in [6]:
-
(i)
Every \(T_{*} \in \mathscr {T}_{CD*}\) is continuous.
-
(ii)
\(\mathscr {H}\) is compact and Hausdorff.
-
(iii)
Every hotness level, viewed as a subset of \(\Sigma \), is compact.
3.2 A “Hotter Than” Relation and Its Reflection in Clausius–Duhem Temperature Scales
In this section we will introduce for a Kelvin–Planck theory \((\Sigma ,\mathscr {P})\) a way of making precise the idea that one hotness level is “hotter than” a different one.
We begin by defining a passive heat transfer from one hotness level to another. In physical terms, this is a cyclic process, perhaps resembling the one in Example 3.3, in which there is no work and in which heat is absorbed only by material in states of identical hotness and is emitted, in equal amount, only by material in states having a different common hotness. By the support of a measure \(\nu \in \mathscr {M}_{+}(\Sigma )\), denoted \(\textrm{supp} \,\nu \), we mean the complement in \(\Sigma \) of the largest open set of \(\nu \)-measure zero.
Definition 3.13
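In a thermodynamical theory \((\Sigma ,\mathscr {P})\), a passive heat transfer from hotness level h to hotness level \(h'\) is an element \((\Delta \mathcalligra {m},\mathcalligra {q})\) of \(\hat{\mathscr {P}}\) such that \(\Delta \mathcalligra {m}= 0\) and
$$\begin{aligned} \mathcalligra {q}= \mu - \mu ', \end{aligned}$$(3.10)
where \(\mu \) and \(\mu '\) are members of \(\mathscr {M}_{+}(\Sigma )\) such that
$$\begin{aligned} \textrm{supp}\,\mu \subset h, \quad \textrm{supp}\,\mu ' \subset h', \quad \textrm{and} \quad \mu (\Sigma ) = \mu '(\Sigma ) > 0. \end{aligned}$$(3.11)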
To motivate our definitionFootnote 3 of hotter than, we let \((\Sigma ,\mathscr {P})\) be a Kelvin–Planck theory with distinct hotness levels \(h'\) and h. Consider an inventor who believes that, through ingenious design, the set of processes indicated in \(\mathscr {P}\) can be expanded to a larger set that includes a hitherto unknown passive heat transfer from h to \(h'\). We say that \(h'\) is hotter than h if no such expansion is possible without violating the Kelvin–Planck Second Law.
Definition 3.14
Let \((\Sigma ,\mathscr {P})\) be a Kelvin–Planck theory with hotness levels \(h'\) and h. Then \(h'\) is hotter than h (denoted \(h' \succ h\)) if \(h'\ne h\) and if a thermodynamical theory \((\Sigma ,\mathscr {P}^{\,\dagger })\) violates the Kelvin–Planck Second Law (that is, \((\Sigma ,\mathscr {P}^{\,\dagger })\) is not a Kelvin–Planck theory) whenever \(\widehat{\mathscr {P}^{\,\dagger }}\) contains \(\mathscr {P}\) and also a passive heat transfer from h to \(h'\).
The preceding definition makes no mention of temperature. The following theorem asserts, for a Kelvin–Planck theory with hotness levels \(h'\) and h, that \(h'\) is hotter than h in the sense of Definition 3.14 precisely when \(h'\) has a higher temperature than h on every Clausius–Duhem temperature scale.
Theorem 3.15
Let \((\Sigma ,\mathscr {P})\) be a Kelvin–Planck theory, and let \(\mathscr {T}_{CD*}\) be its set of Clausius–Duhem temperature scales (on \(\mathscr {H}\)). If \(h'\) and h are distinct hotness levels, the following are equivalent:
-
(i)
\(h'\) is hotter than h.
-
(ii)
\(T_{*}(h') > T_{*}(h),\quad \forall T_{*} \in \mathscr {T}_{CD*}\).
Proof
To show the equivalence of (i) and (ii) we will prove the equivalence of their negations:
-
\((i)^\prime \) \(h'\) is not hotter than h.
-
\((ii)^\prime \) For \((\Sigma ,\mathscr {P})\) there is a Clausius–Duhem temperature scale \(\bar{T}_{*}(\cdot )\) on \(\mathscr {H}\) such that \(\bar{T}_{*}(h') \le \bar{T}_{*}(h)\).
To prove that \((i)^\prime \) implies \((ii)^\prime \) we first note that \((i)^\prime \) requires the existence of a Kelvin–Planck theory \((\Sigma ,\mathscr {P}^{\,\dagger })\) in which \(\widehat{\mathscr {P}^{\,\dagger }}\) contains \(\mathscr {P}\) and also a passive heat transfer from h to \(h'\), say \(\mathcalligra {p}^{\dagger } = (0, \mu - \mu ^{\prime })\), where \(\mu \) and \(\mu '\) are measures in \(\mathscr {M}_{+}(\Sigma )\) that satisfy (3.11). From Theorem 2.4 and Remark 2.6 there is a Clausius–Duhem pair \((\bar{\eta },\bar{T})\) for \((\Sigma ,\mathscr {P}^{\,\dagger })\) such that
In particular, the inequality in (3.12) obtains for all members of \(\hat{\mathscr {P}}\), so \((\bar{\eta },\bar{T})\) is also a Clausius–Duhem pair for the original Kelvin–Planck theory \((\Sigma ,\mathscr {P})\). Now let \(\bar{T}_{*}(\cdot )\) be the \((\Sigma ,\mathscr {P})\)-Clausius–Duhem temperature scale on \(\mathscr {H}\) induced by \(\bar{T}(\cdot )\). Because \(\mathcalligra {p}^{\dagger }\) is a member of \(\widehat{\mathscr {P}^{\dagger }}\), (3.12) requires that
This in turn requires that \(\bar{T}_{*}(h') \le \bar{T}_{*}(h)\).
To show that \((ii)^\prime \) implies \((i)^\prime \), we will begin by supposing that \((\bar{\eta }(\cdot ),\bar{T}(\cdot ))\) is, for \((\Sigma ,\mathscr {P})\), a Clausius–Duhem pair on \(\Sigma \) that gives rise to the temperature scale \(\bar{T}_*(\cdot )\) on \(\mathscr {H}\) posited in \(\mathrm {(ii)}^\prime \). Moreover, we will let \(\mathcalligra {p}^{\dagger } = (0, \mu - \mu ^{\prime })\), where \(\mu \) and \(\mu '\) are members of \(\mathscr {M}_{+}(\Sigma )\) satisfying (3.11). Let
Then \((\Sigma ,\mathscr {P}^{\dagger })\) is a Kelvin–Planck theory, with \(\widehat{\mathscr {P}^{\dagger }}\) containing both \(\mathscr {P}\) and \(\mathcalligra {p}^{\dagger }\). From this, \((i)^\prime \) follows. \(\square \)
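The step from the Clausius–Duhem inequality for \(\mathcalligra {p}^{\dagger }\) to the ordering of temperatures can be written out explicitly. This is a sketch, assuming that (3.11) places the support of \(\mu \) in h and the support of \(\mu '\) in \(h'\) with \(\mu (\Sigma ) = \mu '(\Sigma ) > 0\), and that the Clausius–Duhem inequality for a cyclic element \((0,\mathcalligra {q})\) reads \(\int _{\Sigma }\textrm{d}\mathcalligra {q}/\bar{T} \le 0\). Because \(\bar{T}\) is constant on each hotness level,
$$\begin{aligned} \int _{\Sigma }\frac{\textrm{d}(\mu - \mu ')}{\bar{T}} = \frac{\mu (\Sigma )}{\bar{T}_{*}(h)} - \frac{\mu '(\Sigma )}{\bar{T}_{*}(h')} = \mu (\Sigma )\left[ \frac{1}{\bar{T}_{*}(h)} - \frac{1}{\bar{T}_{*}(h')}\right] \le 0, \end{aligned}$$
which holds precisely when \(\bar{T}_{*}(h') \le \bar{T}_{*}(h)\). Read in the other direction, the same computation shows that whenever \(\bar{T}_{*}(h') \le \bar{T}_{*}(h)\) the pair \((\bar{\eta },\bar{T})\) satisfies the Clausius–Duhem inequality for \(\mathcalligra {p}^{\dagger }\) as well as for every member of \(\mathscr {P}\).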
The following is a corollary of Theorem 3.15:
Corollary 3.16
For a Kelvin–Planck theory with hotness levels \(\mathscr {H}\), the hotter than relation \(\succ \) gives a strict partial order on \(\mathscr {H}\).
Proof
Irreflexivity is built into Definition 3.14, which requires \(h' \ne h\). Asymmetry and transitivity are consequences of (ii) in Theorem 3.15. \(\square \)
Of course, it might happen that, for a particular Kelvin–Planck theory, \(\mathscr {H}\) is totally ordered by \(\succ \).
Remark 3.17
(Totally ordered hotness levels) It is not generally true that the elements of a set endowed with a total order can be numbered (with real numbers) in such a way as to reflect that order faithfully. A contrived counter-example with a thermodynamic flavor is given in [6]. However, an argument along the lines of the proof of Theorem 8.3 in [6] gives the following information: If \(\mathscr {H}\) is the set of hotness levels for a Kelvin–Planck theory and if \(\mathscr {H}\) is totally ordered by \(\succ \), then \(\mathscr {H}\) is homeomorphic and order-similar to a subset of the real line. In fact, every \(T_* \in \mathscr {T}_{CD*}\) reflects the order precisely and provides a homeomorphism between \(\mathscr {H}\) and \(T_*(\mathscr {H})\). If \(\Sigma \) is connected, then \(\mathscr {H}\) is homeomorphic and order-similar to a closed and bounded interval of the real line.
For a Kelvin–Planck theory \((\Sigma ,\mathscr {P})\) the ordering of the hotness levels in \(\Sigma \) by “\(\succ \)” can be adapted in an obvious way to make sense of the idea that one state in \(\Sigma \) is hotter than another.
Definition 3.18
Let \((\Sigma ,\mathscr {P})\) be a Kelvin–Planck theory, and let \(\sigma '\) and \(\sigma \) be members of \(\Sigma \). Then state \(\sigma '\) is hotter than state \(\sigma \), denoted \(\sigma ' \succ \sigma \), if the hotness level containing \(\sigma '\) is hotter than the hotness level containing \(\sigma \).
The following is an easy corollary of Theorem 3.15.
Corollary 3.19
Let \((\Sigma ,\mathscr {P})\) be a Kelvin–Planck theory, and let \(\mathscr {T}_{CD}\) be its set of Clausius–Duhem temperature scales (on \(\Sigma \)). If \(\sigma '\) and \(\sigma \) are states in \(\Sigma \), the following are equivalent:
-
(i)
\(\sigma '\) is hotter than \(\sigma \).
-
(ii)
\(T(\sigma ') > T(\sigma ),\quad \forall T \in \mathscr {T}_{CD}\).
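Condition (ii) lends itself to a small numerical illustration. The sketch below is a toy and not part of the theory: a finite state space {0, 1, 2, 3} and three invented temperature scales stand in for \(\mathscr {T}_{CD}\) (which in general is an infinite family), and the names `SCALES`, `hotter`, `same_level` and `is_strict_partial_order` are hypothetical.

```python
from itertools import product

# Hypothetical stand-in for the set of Clausius-Duhem temperature scales:
# each scale assigns a positive temperature to states 0..3 (invented numbers).
SCALES = [
    [1.0, 2.0, 3.0, 3.0],
    [1.0, 3.0, 2.0, 2.0],
    [2.0, 4.0, 5.0, 5.0],
]
STATES = range(4)


def hotter(a, b, scales=SCALES):
    """Condition (ii): a is hotter than b iff every scale ranks a strictly above b."""
    return all(T[a] > T[b] for T in scales)


def same_level(a, b, scales=SCALES):
    """True when no scale distinguishes a from b (same hotness level in this toy)."""
    return all(T[a] == T[b] for T in scales)


def is_strict_partial_order(states=STATES):
    """Check irreflexivity, asymmetry and transitivity of `hotter` by brute force."""
    irreflexive = not any(hotter(s, s) for s in states)
    asymmetric = not any(
        hotter(a, b) and hotter(b, a) for a, b in product(states, repeat=2)
    )
    transitive = all(
        hotter(a, c)
        for a, b, c in product(states, repeat=3)
        if hotter(a, b) and hotter(b, c)
    )
    return irreflexive and asymmetric and transitive
```

With these invented numbers, states 1 and 2 are ranked oppositely by different scales and are therefore not comparable, while states 2 and 3 are never distinguished and so share a hotness level.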
3.3 Remarks on Variants of the “Hotter Than” Relation
To the extent that it is precisely reflected in all Clausius–Duhem temperature scales for a Kelvin–Planck theory (and because it resonates with the Clausius statement of the Second Law), the definition of hotter than (\(\succ \)) in the preceding subsection is an especially satisfying one.
There is, however, a weaker but more tangible notion of hotter than that can sometimes give information when \(\succ \) does not, in particular when two hotness levels are not \(\succ \)-comparable. Although the context is different, a very similar analog of this weaker notion is discussed extensively in [6],Footnote 4 so only a few remarks are provided here.
Definition 3.20
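Let \((\Sigma ,\mathscr {P})\) be a Kelvin–Planck theory with hotness levels \(h'\) and h. Then \(h'\) is weakly hotter than h (denoted \(h'\, _w{\succ }\; h\)) if \(h'\ne h\) and \(\hat{\mathscr {P}}\) contains a member \((0,\mathcalligra {q})\) with \(\mathcalligra {q}\) of the form
$$\begin{aligned} \mathcalligra {q}= \mu ' - \mu + \nu , \end{aligned}$$(3.15)
where \(\mu '\), \(\mu \), and \(\nu \) are members of \(\mathscr {M}_{+}(\Sigma )\) such that
$$\begin{aligned} \textrm{supp}\,\mu ' \subset h', \quad \textrm{supp}\,\mu \subset h, \quad \textrm{and} \quad \mu '(\Sigma ) = \mu (\Sigma ) > 0. \end{aligned}$$(3.16)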
(Here \(\nu \) can be the zero measure.)
Viewed as a process, \((0,\mathcalligra {q})\) is a cyclic one in which, over the course of the process, all heat emitted from the body suffering the process emanates from material of hotness h and in which there is at least as much heat absorbed by material of hotness \(h'\). The work done by the body suffering the process, \(\mathcalligra {q}(\Sigma ) = \nu (\Sigma )\), is not negative. (If \(\nu =0\) and \((0,\mathcalligra {q})\) is a member of \(\hat{\mathscr {P}}\) then the process is a passive heat transfer from \(h'\) to h, and no work is done.)
Proof of the next theorem is essentially the one given in [6].
Theorem 3.21
Let \((\Sigma ,\mathscr {P})\) be a Kelvin–Planck theory, and let \(\mathscr {T}_{CD*}\) be its set of Clausius–Duhem temperature scales (on \(\mathscr {H}\)). If \(h'\) and h are distinct hotness levels, the following are equivalent:
-
(i)
\(h'\) is weakly hotter than h.
-
(ii)
\(T_{*}(h') \ge T_{*}(h),\quad \forall T_{*} \in \mathscr {T}_{CD*}\).
Remark 3.22
When \(h'\) and h constitute a fixed pair of hotness levels with \(h'\, _w{\succ }\; h\), there must be at least one Clausius–Duhem temperature scale \(\bar{T}_*(\cdot )\) on \(\mathscr {H}\) such that \(\bar{T}_*(h') > \bar{T}_*(h)\), for otherwise the two hotness levels would be identical (Theorem 3.8). In fact, the set of all members of \(\mathscr {T}_{CD*}\) that distinguish between \(h'\) and h is dense in \(\mathscr {T}_{CD*}\) (in the sup-norm topology): For if \(T_*(\cdot ) \in \mathscr {T}_{CD*}\) is such that \(T_*(h') = T_*(h)\) then, by choosing \(\varepsilon > 0\) sufficiently small, a distinguishing Clausius–Duhem temperature scale \(T_*(\cdot ) + \varepsilon \bar{T}_*(\cdot )\) can be made to lie in any given neighborhood of \(T_*(\cdot )\).
This, however, leaves open the question of whether there is a single Clausius–Duhem temperature scale \(T^{\circ }_*(\cdot )\) such that \(T^{\circ }_*(h') > T^{\circ }_*(h)\) for every pair of hotness levels with \(h'\, _w{\succ }\; h\). Such a temperature scale will indeed exist so long as \(\Sigma \) has a countable base of open sets, in particular if it is a metric space [6].
Remark 3.23
(An important consequence of Theorems 3.15 and 3.21 taken together) Consider a Kelvin–Planck theory \((\Sigma ,\mathscr {P})\) with Clausius–Duhem temperature scales \(\mathscr {T}_{CD*}\). Moreover, suppose that \(h'\) and h are hotness levels such that \(T_*(h') > T_*(h)\) for all \(T_*(\cdot )\) in \(\mathscr {T}_{CD*}\). Then Theorem 3.15 prohibits the existence of a passive heat transfer from h to \(h'\), while Theorem 3.21 requires the existence in \(\hat{\mathscr {P}}\) of either a passive heat transfer from \(h'\) to h or, more generally, a member \((0, \mathcalligra {q})\) of the form given by equations (3.15) and (3.16) (roughly, a cyclic-process transfer of heat from \(h'\) to h in which the body experiencing the process has no net work done on it).
Remark 3.24
If, in Definition 3.20, \(\mathcalligra {q}\) can be chosen such that \(\nu \ne 0\), then we say that \(h'\) is strongly hotter than h (denoted \(h'\, _s{\succ }\; h\)).Footnote 5 In this case, \(h'\, _s{\succ }\; h\) implies that \(T_{*}(h') > T_{*}(h)\) for all \(T_{*} \in \mathscr {T}_{CD*}\). The converse, however, is not generally true: Even when \(h'\) has a higher temperature than h on every Clausius–Duhem temperature scale, \(h'\) and h might not be \(_s{\succ }\)-comparable. Nevertheless, they will be \(\succ \)-comparable by virtue of Theorem 3.15. (See Appendix D, Example D.1, in which there are only two states.) In any case, we have the implications \(h'\, _s{\succ }\; h \; \Rightarrow \; h'\, {\succ }\; h \; \Rightarrow \; h'\, _w{\succ }\; h\).
4 Thermodynamic Temperature Scale Uniqueness in Kelvin–Planck Theories
In this section our interest is in the precise connection between the supply of processes a Kelvin–Planck theory contains and the essential uniqueness of a Clausius–Duhem temperature scale for the theory, first on the entire state space and then on sub-domains of it. Recall from Remark 2.8 that a Kelvin–Planck theory is said to have an essentially unique Clausius–Duhem temperature scale if every such scale for the theory is a positive multiple of some fixed one.
Classical arguments appearing in standard textbooks indicate that, if a thermodynamical theory is suitably well endowed with Carnot cycles, then essential uniqueness of a thermodynamic temperature scale is ensured. In the context of Kelvin–Planck theories, we will prove not only this but also the converse: Essential uniqueness of a Clausius–Duhem temperature scale for a Kelvin–Planck theory \((\Sigma ,\mathscr {P})\) requires that \(\hat{\mathscr {P}}\) be suitably rich in what we shall call Carnot elements. Here again the proof relies on the Hahn–Banach Theorem in the form of Lemma 3.10.
The following definition, implicit in much that has already been said, will enable us to include in our discussion special “idealized processes” that, although not among the actual processes represented in \(\mathscr {P}\), might nevertheless be members of \(\hat{\mathscr {P}}\), in which case they are well-approximated by the actual processes (or by positive multiples of them).
Definition 4.1
A reversible element of a thermodynamical theory \((\Sigma ,\mathscr {P})\) is a member of \(\hat{\mathscr {P}}\) such that its negative is also a member of \(\hat{\mathscr {P}}\). An irreversible element is a member of \(\hat{\mathscr {P}}\) that is not reversible.Footnote 6 A cyclic element of \((\Sigma ,\mathscr {P})\) is a member of \(\hat{\mathscr {P}}\) of the form \((0,\mathcalligra {q})\).
Remark 4.2
In classical thermodynamics a reversible process is generally regarded to be one for which there is an associated “path” that can be reversed in every detail along the path. In Definition 4.1 there is no such insistence on detailed path reversal; there is only the requirement that both \((\Delta \mathcalligra {m},\mathcalligra {q})\) and its negative be members of \(\hat{\mathscr {P}}\).
Definition 4.3
Let \((\Sigma ,\mathscr {P})\) be a thermodynamical theory. A Carnot element of the theory is a reversible cyclic element \((0, \mathcalligra {q}) \in \hat{\mathscr {P}}\), with \(\mathcalligra {q}\) having a representation of the following kind: There are hotness levels \(h'\) and h such that
$$\begin{aligned} \mathcalligra {q}= \mu ' - \mu , \end{aligned}$$(4.1)
where \(\mu '\) and \(\mu \) are non-zero measures in \(\mathscr {M}_{+}(\Sigma )\) satisfying
$$\begin{aligned} \textrm{supp}\,\mu ' \subset h' \quad \textrm{and} \quad \textrm{supp}\,\mu \subset h. \end{aligned}$$(4.2)
In this case, the Carnot element operates between hotness levels \(h'\) and h. In the special case that \(\mathcalligra {q}= c'\delta _{\sigma '} - c\delta _{\sigma }\), where \(c'\) and c are positive constants and \(\sigma '\) and \(\sigma \) are members of \(\Sigma \), we say that the Carnot element operates between states \(\sigma '\) and \(\sigma \).
Remark 4.4
Regarded in terms of the usual textbook picture, a Carnot element operating between states \(\sigma '\) and \(\sigma \) can be thought of as encoding the limit of extremely narrow Carnot cycles having two minuscule isothermal segments centered on \(\sigma '\) and \(\sigma \).
4.1 Essential Uniqueness of a Thermodynamic Temperature Scale: The Inexorable Role of Carnot Elements
In standard textbook arguments, the (tacitly assumed) presence of a large supply of Carnot cycles ensures not only the existence of a thermodynamic temperature scale but also its essential uniqueness. With respect to uniqueness, Theorem 4.5Footnote 7 also asserts the converse: For a Kelvin–Planck theory \((\Sigma ,\mathscr {P})\) to have an essentially unique Clausius–Duhem temperature scale on \(\Sigma \), it is necessary that the theory be so rich in Carnot elements that there is at least one operating between each pair of hotness levels. For proof of this converse, the Hahn–Banach theorem plays a critical role, again in the guise of Lemma 3.10.
Theorem 4.5
Let \((\Sigma ,\mathscr {P})\) be a Kelvin–Planck theory with hotness levels \(\mathscr {H}\), and let \(T(\cdot )\) be a Clausius–Duhem temperature scale on \(\Sigma \). The following are equivalent:
-
(i)
Every Clausius–Duhem temperature scale on \(\Sigma \) is a positive multiple of \(T(\cdot )\).
-
(ii)
If \(\mathcalligra {q}\) is a member of \(\mathscr {M}(\Sigma )\) that satisfies
$$\begin{aligned} \int _{\Sigma }\frac{\textrm{d}\mathcalligra {q}}{T} = 0 \end{aligned}$$(4.3)
then \((0,\mathcalligra {q})\) is a member of \(\hat{\mathscr {P}}\).
-
(iii)
For each pair of hotness levels \(h' \in \mathscr {H}\) and \(h \in \mathscr {H}\) there is a Carnot element of \((\Sigma ,\mathscr {P})\) operating between \(h'\) and h.
-
(iv)
For each pair of states \(\sigma ' \in \Sigma \) and \(\sigma \in \Sigma \) there is a Carnot element of \((\Sigma ,\mathscr {P})\) operating between them, having the form \((0, \mathcalligra {q})\) with
$$\begin{aligned} \mathcalligra {q}= c' \,\delta _{\sigma '} - c\,\delta _{\sigma } \quad \textrm{and} \quad \frac{c'}{c} = \frac{T(\sigma ')}{T(\sigma )}. \end{aligned}$$(4.4)
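The ratio condition in (4.4) is exactly what makes the heating measure of a two-state Carnot element satisfy (4.3). A minimal sketch of that check on a finite state space, assuming states are labelled by strings, measures are dictionaries of point masses, and the temperatures are invented values; `clausius_integral` and `carnot_heating` are hypothetical helper names.

```python
from fractions import Fraction


def clausius_integral(q, T):
    """Integral of dq/T for a signed measure q on a finite state space.

    q and T are dicts keyed by state; q holds point masses, T positive temperatures.
    """
    return sum(Fraction(weight) / Fraction(T[state]) for state, weight in q.items())


def carnot_heating(sigma_prime, sigma, T, c=1):
    """Build q = c' * delta_{sigma'} - c * delta_{sigma} with c'/c = T(sigma')/T(sigma)."""
    if sigma_prime == sigma:
        raise ValueError("this sketch expects two distinct states")
    c = Fraction(c)
    c_prime = c * Fraction(T[sigma_prime]) / Fraction(T[sigma])
    return {sigma_prime: c_prime, sigma: -c}
```

For the invented scale `T = {"a": 300, "b": 400, "c": 500}`, the call `carnot_heating("c", "a", T)` produces the measure (5/3)δ_c − δ_a. Its Clausius integral vanishes, as (4.3) requires, while its total mass 2/3 is positive.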
Remark 4.6
(Existence vs. Uniqueness, 1) In the companion article [8], it was shown that for any Kelvin–Planck theory there invariably exists a pair of continuous functions on the state space, a specific entropy and a thermodynamic temperature, that complies with the Clausius–Duhem inequality for all processes the theory contains. The existence of such a pair followed directly from the Hahn–Banach Theorem and did not require the presence in the theory of special processes such as Carnot cycles or, more generally, reversible processes.
However, for the essential uniqueness of the thermodynamic temperature function, the equivalence of (i) and (iv) in Theorem 4.5 indicates that every state should be “visited” by a Carnot cycle, in particular by a reversible process. Some readers might infer from this that, for the essential uniqueness of a thermodynamic temperature scale for a given Kelvin–Planck theory, every member of that theory’s state space should, in some sense, be an “equilibrium” state. This is discussed further in Sect. 7.
Proof of Theorem 4.5
We will first show that (i)–(iii) are equivalent, and then we will show that (iv) is equivalent to these.
Proof that (i) implies (ii) is a straightforward application of Lemma 3.10. To prove that (ii) implies (iii), let \(T_*(\cdot )\) be the temperature scale on \(\mathscr {H}\) induced by \(T(\cdot )\), and let \(h'\) and h be hotness levels. Furthermore, let \(\mathcalligra {q}\) be a measureFootnote 8 in \(\mathscr {M}(\Sigma )\) of the form
$$\begin{aligned} \mathcalligra {q}= \mu ' - \mu , \end{aligned}$$(4.5)
where \(\mu '\) and \(\mu \) are nonzero measures in \(\mathscr {M}_+(\Sigma )\) having support in \(h'\) and h respectively and satisfying the equation
$$\begin{aligned} \frac{\mu '(\Sigma )}{T_*(h')} = \frac{\mu (\Sigma )}{T_*(h)}. \end{aligned}$$(4.6)
Note that \(\mathcalligra {q}\), so chosen, satisfies (4.3), as does its negative, so (ii) ensures that \((0,\mathcalligra {q})\) and \((0,-\mathcalligra {q})\) are both members of \(\hat{\mathscr {P}}\). In fact, from its form, \((0,\mathcalligra {q})\) is a Carnot element operating between \(h'\) and h.
To prove that (iii) implies (i) we suppose that \(\bar{T}(\cdot )\) is another Clausius–Duhem temperature scale on \(\Sigma \), different from \(T(\cdot )\), and that \(\sigma _0\) is some fixed state in \(\Sigma \). Our aim will be to show that
For this purpose, let \(\sigma \) be an arbitrary state and let h and \(h_0\) be the hotness levels containing \(\sigma \) and \(\sigma _0\). From (iii) there is a Carnot element operating between h and \(h_0\). This is to say that \(\hat{\mathscr {P}}\) contains a reversible element \((0, \mu - \mu _0)\) where \(\mu \) and \(\mu _0\) are non-zero measures in \(\mathscr {M}_{+}(\Sigma )\) satisfying
If \(\bar{T}_*(\cdot )\) and \(T_*(\cdot )\) are the temperature scales on \(\mathscr {H}\) corresponding to \(\bar{T}(\cdot )\) and \(T(\cdot )\), we can invoke the Clausius–Duhem inequality to write, for the reversible element \((0, \mu - \mu _0) \in \hat{\mathscr {P}}\),
From this it follows that
Since \(\sigma \in \Sigma \) was arbitrary, we actually have
which is what (i) asserts.
Having shown that (i)–(iii) are equivalent, we will now show that these are equivalent to (iv). It is easy to see that (iv) implies (iii). Next we show that (ii) implies (iv). Because \(\mathcalligra {q}\) given by (4.4) satisfies (4.3), (ii) ensures that both \((0,\mathcalligra {q})\) and its negative are members of \(\hat{\mathscr {P}}\). In this case, (iv) is satisfied. \(\square \)
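The argument that (iii) implies (i) reduces to a short computation, sketched here under the assumption that the Carnot element \((0, \mu - \mu _0)\) has \(\textrm{supp}\,\mu \subset h\) and \(\textrm{supp}\,\mu _0 \subset h_0\). Because the element is reversible, the Clausius–Duhem inequality applies both to it and to its negative, so it holds with equality for every Clausius–Duhem temperature scale. Since each scale is constant on a hotness level,
$$\begin{aligned} \frac{\mu (\Sigma )}{T_*(h)} - \frac{\mu _0(\Sigma )}{T_*(h_0)} = 0 = \frac{\mu (\Sigma )}{\bar{T}_*(h)} - \frac{\mu _0(\Sigma )}{\bar{T}_*(h_0)}, \quad \text {whence} \quad \frac{\bar{T}_*(h)}{\bar{T}_*(h_0)} = \frac{\mu (\Sigma )}{\mu _0(\Sigma )} = \frac{T_*(h)}{T_*(h_0)}. \end{aligned}$$
In terms of states this reads \(\bar{T}(\sigma ) = \left[ \bar{T}(\sigma _0)/T(\sigma _0)\right] T(\sigma )\), with a positive factor that does not depend on \(\sigma \).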
We conclude this subsection with statements of two corollaries of Theorem 4.5, proofs of which (omitted here) are very much like the proofs of Corollaries 9.1–9.3 in [6], although the context there is different (Remark 3.1). Moreover, the proof of Corollary 4.7 resembles the proof given in the next section of the substantially broader Corollary 5.3.
Corollary 4.7
Let \((\Sigma ,\mathscr {P})\) be a Kelvin–Planck theory for which all Clausius–Duhem temperature scales on \(\Sigma \) are positive multiples of some fixed one, \(T(\cdot )\). The reversible cyclic elements of \((\Sigma ,\mathscr {P})\) are precisely those elements \((0,\mathcalligra {q}) \in \mathscr {V}(\Sigma )\) that satisfy
In particular, any \((0,\mathcalligra {q}) \in \mathscr {V}(\Sigma )\) that satisfies (4.12) is a member of \(\hat{\mathscr {P}}\). Of those \((0,\mathcalligra {q}) \in \mathscr {V}(\Sigma )\) that satisfy
either all are contained in \(\hat{\mathscr {P}}\) or none are.
A component of standard textbook arguments underlying the foundations of classical thermodynamics, in particular the existence of an entropy as a function of state, relies on an assertion to the effect that any cyclic reversible process can be approximated by combinations of Carnot cycles. (See, for example, page 35 in [4].) The following ensures that, for any Kelvin–Planck theory in which there is an essentially unique Clausius–Duhem temperature scale, the supply of Carnot elements is sufficiently large as to make that assertion true.
Corollary 4.8
(Approximating reversible cyclic elements by combinations of Carnot elements) Let \((\Sigma ,\mathscr {P})\) be a Kelvin–Planck theory for which all Clausius–Duhem temperature scales on \(\Sigma \) are positive multiples of some fixed one. The set of all linear combinations of Carnot elements of \((\Sigma ,\mathscr {P})\) is dense in the set of all reversible cyclic elements of \((\Sigma ,\mathscr {P})\).
4.2 Essential Uniqueness of a Thermodynamic Temperature on a State Space Sub-domain
Although a thermodynamic temperature scale for a Kelvin–Planck theory need not be essentially unique on the entire state space, there might be nontrivial sub-domains on which essential uniqueness is to be found. The connection to Carnot elements (and perhaps to notions of equilibrium states) is recorded in the following proposition:
Proposition 4.9
Let \((\Sigma ,\mathscr {P})\) be a Kelvin–Planck theory having \(\mathscr {T}_{CD}\) as its set of Clausius–Duhem temperature scales. Furthermore, let \(\Sigma ^{\,0}\) be a subset of \(\Sigma \), and let \(\mathscr {T}^{\,0}_{CD}\) be the set of restrictions of members of \(\mathscr {T}_{CD}\) to \(\Sigma ^{\,0}\). The following are equivalent:
-
(i)
All members of \(\mathscr {T}^{\,0}_{CD}\) are positive multiples of some fixed one.
-
(ii)
For each pair of distinct states in \(\Sigma ^{\,0}\) there is a Carnot element operating between them.
Proof
That (ii) implies (i) is a direct consequence of the Clausius–Duhem inequality. Proof that (i) implies (ii) amounts to an application of Lemma 3.10. \(\square \)
Remark 4.10
For a Kelvin–Planck theory \((\Sigma ,\mathscr {P})\) we will say that states \(\sigma \) and \(\sigma '\) in \(\Sigma \) are Carnot-related, denoted \(\sigma \approx _{\,\mathscr {C}}\sigma '\), if \(\sigma = \sigma '\) or if \(\hat{\mathscr {P}}\) contains a Carnot element operating between them. It is not difficult to see that \(\approx _{\,\mathscr {C}}\) is an equivalence relation. Proposition 4.9 tells us that we have essential temperature-scale uniqueness on each nontrivial \(\approx _{\,\mathscr {C}}\) equivalence class.
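On a finite state space the equivalence classes of the Carnot relation can be computed mechanically. The sketch below is only an illustration: it assumes a hypothetical list of state pairs between which Carnot elements are known to exist, and it uses the fact that, for an equivalence relation, the classes are the connected components of the graph those pairs generate. The function name `carnot_classes` is invented for this example.

```python
def carnot_classes(states, carnot_pairs):
    """Group states into classes generated by the given Carnot-related pairs.

    states: iterable of hashable, sortable state labels.
    carnot_pairs: iterable of (state, state) pairs assumed to be Carnot-related.
    Returns a sorted list of sorted classes.
    """
    states = list(states)
    parent = {s: s for s in states}

    def find(s):
        # Follow parent links to the class representative, compressing the path.
        while parent[s] != s:
            parent[s] = parent[parent[s]]
            s = parent[s]
        return s

    for a, b in carnot_pairs:
        parent[find(a)] = find(b)

    classes = {}
    for s in states:
        classes.setdefault(find(s), set()).add(s)
    return sorted(sorted(members) for members in classes.values())
```

In the sense of Proposition 4.9, each class with more than one member is a sub-domain on which the temperature scale is essentially unique; singleton classes carry no such information.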
5 Uniqueness of Entropy-Temperature Functions of State
In this section we ask: For a Kelvin–Planck theory \((\Sigma ,\mathscr {P})\), what must be true of \(\mathscr {P}\) beyond a rich supply of Carnot elements to ensure not only that there is an essentially unique Clausius–Duhem temperature scale but also that there be an essentially unique Clausius–Duhem specific-entropy function?
5.1 Entropy-Temperature-Pair Uniqueness on the Entire State Space
The following theorem describes conditions under which, for a Kelvin–Planck theory, there is an essentially unique Clausius–Duhem pair on the entire state space:
Theorem 5.1
(Clausius–Duhem Pair Uniqueness) Let \((\eta ^{\,0},T^{\,0})\) be a Clausius–Duhem pair for a thermodynamical theory \((\Sigma ,\mathscr {P})\). The following are equivalent:
-
(i)
If \((\eta ,T)\) is any other Clausius–Duhem pair for \((\Sigma ,\mathscr {P})\), there are constants \(\alpha \) and \(\beta \), with \(\alpha > 0\), such that
$$\begin{aligned} T(\cdot ) = \alpha T^{\,0}(\cdot )\quad \textrm{and} \quad \eta (\cdot ) = \frac{1}{\alpha }\eta ^{\,0}(\cdot ) + \beta . \end{aligned}$$(5.1)
-
(ii)
\(\hat{\mathscr {P}}\) contains the hyperplane
$$\begin{aligned} \left\{ \ (\Delta \mathcalligra {m},\mathcalligra {q})\in \mathscr {V}(\Sigma ): \int _{\Sigma }\eta ^{\,0}\, \textrm{d}\,(\Delta \mathcalligra {m}) = \int _{\Sigma }\frac{\textrm{d}\mathcalligra {q}}{ T^{\,0}}\ \right\} . \end{aligned}$$(5.2)
-
(iii)
For each choice of \(\sigma '\) and \(\sigma \) in \(\Sigma \), \(\hat{\mathscr {P}}\) contains a reversible element with change of condition \(\delta _{\sigma '} - \delta _{\sigma }\) and also a Carnot element operating between the hotness levels of \(\sigma '\) and \(\sigma \).
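The transformation in (5.1) always sends a Clausius–Duhem pair to another one, which is why uniqueness can only be expected up to \(\alpha \) and \(\beta \). The following sketch checks this numerically on a two-state toy. It assumes that changes of condition have zero total mass, \(\Delta \mathcalligra {m}(\Sigma ) = 0\) (an assumption about \(\mathscr {V}(\Sigma )\) that is not restated in this section), so that \(\beta \) drops out; all numbers and the names `cd_defect` and `rescale_pair` are invented for illustration.

```python
from fractions import Fraction


def cd_defect(eta, T, delta_m, q):
    """Return sum(eta * delta_m) - sum(q / T) over a finite state space.

    A nonnegative value means the Clausius-Duhem inequality holds for (delta_m, q).
    """
    entropy_change = sum(Fraction(eta[s]) * Fraction(delta_m[s]) for s in delta_m)
    heat_over_T = sum(Fraction(q[s]) / Fraction(T[s]) for s in q)
    return entropy_change - heat_over_T


def rescale_pair(eta0, T0, alpha, beta):
    """Apply (5.1): T = alpha * T0 and eta = eta0 / alpha + beta, with alpha > 0."""
    alpha, beta = Fraction(alpha), Fraction(beta)
    if alpha <= 0:
        raise ValueError("alpha must be positive")
    T = {s: alpha * Fraction(T0[s]) for s in T0}
    eta = {s: Fraction(eta0[s]) / alpha + beta for s in eta0}
    return eta, T
```

For any `delta_m` with zero total mass, the defect of the rescaled pair is the original defect divided by `alpha`, so its sign, and hence compliance with the Clausius–Duhem inequality, is preserved.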
Remark 5.2
(Existence vs. Uniqueness, 2) The comments made in Remark 4.6 pertain here too. Although the existence of a Clausius–Duhem entropy-temperature pair for a Kelvin–Planck theory is ensured immediately by the Hahn–Banach Theorem without the requirement of special reversible processes [8], the equivalence of (i) and (iii) in Theorem 5.1 indicates that, for a Kelvin–Planck theory to have an essentially unique Clausius–Duhem pair, not only must every state be visited by Carnot cycles, each must also be visited by other reversible processes. As in Remark 4.6, some readers might infer that all states in such a Kelvin–Planck theory must be “equilibrium” states. Again, this is discussed in Sect. 7.
Proof of Theorem 5.1
To prove that (i) implies (ii) we suppose that (i) holds and that there exists \( (\Delta \mathcalligra {m},\mathcalligra {q})\in \mathscr {V}(\Sigma )\) that satisfies the equation in (5.2) but does not belong to \(\hat{\mathscr {P}}\). Then Lemma 3.10 ensures the existence of another Clausius–Duhem pair \((\eta , T)\) such that
Because \((\Delta \mathcalligra {m},\mathcalligra {q})\) satisfies the equation in (5.2), it is evident that \(\eta (\cdot )\) and \(T(\cdot )\) could not be of the form given in (i). Thus, we have a contradiction.
To prove that (ii) implies (iii), for an arbitrary choice of \(\sigma '\) and \(\sigma \) in \(\Sigma \) we first let \(\mathcalligra {q}\) be any member of \(\mathscr {M}(\Sigma )\) that satisfies the equation
Then \((\delta _{\sigma '} - \delta _{\sigma },\mathcalligra {q}) \in \mathscr {V}(\Sigma )\) and its negative both satisfy the equation in (5.2), so from (ii) both are members of \(\hat{\mathscr {P}}\). Finally, note that \((0,\mathcalligra {q}^*) \in \mathscr {V}(\Sigma )\), where \(\mathcalligra {q}^*\) is any member of \(\mathscr {M}(\Sigma )\) of the form
satisfies the equation in (5.2), as does its negative. From (ii), then, both are members of \(\hat{\mathscr {P}}\), so \((0,\mathcalligra {q}^*)\) is the desired Carnot element.
We turn next to a proof that (iii) implies (i). When (iii) holds it is evident that for any pair of hotness levels there is a Carnot element operating between them. From Theorem 4.5, if \(T(\cdot )\) is a Clausius–Duhem temperature scale on \(\Sigma \) we already have the existence of a positive \(\alpha \) such that \(T(\cdot ) = \alpha T^{\,0}(\cdot )\).
If \(\eta \) is a specific-entropy function corresponding to \(T\), it remains to be shown that, when (iii) holds, \(\eta \) is of the form given in (i). Let \(\sigma ^* \in \Sigma \) be some fixed state, and let \(\sigma \) be some other arbitrary state. Then from (iii) there is in \(\hat{\mathscr {P}}\) a reversible element of the form \((\delta _{\sigma } - \delta _{\sigma ^*},\mathcalligra {q})\), which must be consistent with the Clausius–Duhem inequality written in terms of both Clausius–Duhem pairs, \((\eta ,T)\) and \((\eta ^{\,0},T^{\,0})\). Thus, we have, for all choices of \(\sigma \),
and
Because \(\sigma \) was arbitrary, it follows from these equations that
which is in the form required by (i). \(\square \)
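The last step of the proof can be written out as a short derivation. This is a sketch under two assumptions: that the Clausius–Duhem inequality has the form \(\int _{\Sigma }\eta \,d(\Delta \mathcalligra {m}) \ge \int _{\Sigma }d\mathcalligra {q}/T\), and that the form required by (i) is \(T = \alpha T^{\,0}\), \(\eta = \frac{1}{\alpha }\eta ^{\,0} + \beta \), with \(\beta \) a constant. For the reversible element \((\delta _{\sigma } - \delta _{\sigma ^*},\mathcalligra {q})\), equality holds for each of the two pairs:

\[ \eta (\sigma ) - \eta (\sigma ^*) = \int _{\Sigma }\frac{d\mathcalligra {q}}{T}, \qquad \eta ^{\,0}(\sigma ) - \eta ^{\,0}(\sigma ^*) = \int _{\Sigma }\frac{d\mathcalligra {q}}{T^{\,0}}. \]

Because \(T = \alpha T^{\,0}\), the first integral is \(\frac{1}{\alpha }\) times the second, so that

\[ \eta (\sigma ) \;=\; \frac{1}{\alpha }\,\eta ^{\,0}(\sigma ) + \beta , \qquad \beta := \eta (\sigma ^*) - \frac{1}{\alpha }\,\eta ^{\,0}(\sigma ^*). \]

The constant \(\beta \) depends only on the fixed reference state \(\sigma ^*\), not on \(\sigma \).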
Corollary 5.3
Consider a Kelvin–Planck theory \((\Sigma ,\mathscr {P})\) having, in the sense of (i), an essentially unique Clausius–Duhem pair, \((\eta ^{\,0},T^{\,0})\). The set of reversible elements in \(\hat{\mathscr {P}}\) coincides with the set of all \((\Delta \mathcalligra {m},\mathcalligra {q})\in \mathscr {V}(\Sigma )\) that satisfy
If even one member of the set
is an element of \(\hat{\mathscr {P}}\) then all are. In particular, if \(\mathscr {P}\) contains even one irreversible process, then \(\mathscr {P}\) is so rich in processes that \(\hat{\mathscr {P}}\) contains all members of \(\mathscr {V}(\Sigma )\) that are consistent with the (not necessarily strict) Clausius–Duhem inequality.
Remark 5.4
In the context of Corollary 5.3, if \(\hat{\mathscr {P}}\) contains even one irreversible element, \(\hat{\mathscr {P}}\) actually coincides with the closed half-space in \(\mathscr {V}(\Sigma )\) given by
Proof of Corollary 5.3
If \((\Delta \mathcalligra {m},\mathcalligra {q})\in \mathscr {V}(\Sigma )\) satisfies (5.9) then so does its negative, in which case Theorem 5.1 requires that both be members of \(\hat{\mathscr {P}}\). Hence, \((\Delta \mathcalligra {m},\mathcalligra {q})\) is a reversible element of \(\hat{\mathscr {P}}\). On the other hand, if \((\Delta \mathcalligra {m},\mathcalligra {q})\) is a reversible element of \(\hat{\mathscr {P}}\) then both it and its negative are members of \(\hat{\mathscr {P}}\). Because each is a member of \(\hat{\mathscr {P}}\), both must satisfy the Clausius–Duhem inequality, so the equality (5.9) must obtain.
To prove the remainder of the corollary, we let \((\Delta \mathcalligra {m},\mathcalligra {q})\) and \((\Delta \mathcalligra {m}^*,\mathcalligra {q}^*)\) be members of \(\mathscr {V}(\Sigma )\) that satisfy
and we suppose that \((\Delta \mathcalligra {m}^*,\mathcalligra {q}^*)\) is a member of \(\hat{\mathscr {P}}\). Our aim is to show that \((\Delta \mathcalligra {m},\mathcalligra {q})\) is also a member of \(\hat{\mathscr {P}}\).
Let \(\gamma \) denote the positive number defined by
Note that
Because \(\hat{\mathscr {P}}\) is a cone, the first term on the right is a member of \(\hat{\mathscr {P}}\). To see that the second term is also a member of \(\hat{\mathscr {P}}\), note that the second term can be rewritten as
and that
From the equivalence of (i) and (ii) in Theorem 5.1, the element (5.15) — and therefore the second term on the right of (5.14) — must be a member of \(\hat{\mathscr {P}}\). Because both terms on the right of (5.14) are members of the convex cone \(\hat{\mathscr {P}}\), their sum \((\Delta \mathcalligra {m},\mathcalligra {q})\) is a member of \(\hat{\mathscr {P}}\).
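The decomposition used in this argument can be sketched explicitly. The following assumes that the Clausius–Duhem inequality has the form \(\int _{\Sigma }\eta \,d(\Delta \mathcalligra {m}) \ge \int _{\Sigma }d\mathcalligra {q}/T\) and that both \((\Delta \mathcalligra {m},\mathcalligra {q})\) and \((\Delta \mathcalligra {m}^*,\mathcalligra {q}^*)\) satisfy it strictly for the pair \((\eta ^{\,0},T^{\,0})\); the displays are illustrative reconstructions, not the numbered equations themselves. A positive \(\gamma \) suited to the argument is

\[ \gamma \;:=\; \frac{\displaystyle \int _{\Sigma }\eta ^{\,0}\,d(\Delta \mathcalligra {m}) - \int _{\Sigma }\frac{d\mathcalligra {q}}{T^{\,0}}}{\displaystyle \int _{\Sigma }\eta ^{\,0}\,d(\Delta \mathcalligra {m}^*) - \int _{\Sigma }\frac{d\mathcalligra {q}^*}{T^{\,0}}}, \]

and the element of interest splits as

\[ (\Delta \mathcalligra {m},\mathcalligra {q}) \;=\; \gamma \,(\Delta \mathcalligra {m}^*,\mathcalligra {q}^*) \;+\; (\Delta \mathcalligra {m} - \gamma \,\Delta \mathcalligra {m}^*,\ \mathcalligra {q} - \gamma \,\mathcalligra {q}^*). \]

By linearity of the integrals and the choice of \(\gamma \), the second term lies on the hyperplane of equality:

\[ \int _{\Sigma }\eta ^{\,0}\,d(\Delta \mathcalligra {m} - \gamma \,\Delta \mathcalligra {m}^*) - \int _{\Sigma }\frac{d(\mathcalligra {q} - \gamma \,\mathcalligra {q}^*)}{T^{\,0}} \;=\; 0. \]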
To prove the last sentence of the corollary, let \((\Delta \mathcalligra {m}^{\dagger },\mathcalligra {q}^{\dagger }) \in \mathscr {P}\) be an irreversible process. By the Clausius–Duhem inequality, we must have
Because \((\Delta \mathcalligra {m}^{\dagger },\mathcalligra {q}^{\dagger })\) is not reversible, strict inequality must hold in (5.17), so \(\hat{\mathscr {P}}\) contains the entire open half-space (5.10). Theorem 5.1 ensures that \(\hat{\mathscr {P}}\) also contains the hyperplane (5.2). \(\square \)
Remark 5.5
(Consequences of a change of condition that cannot be reversed) Suppose that \((\Sigma ,\mathscr {P})\) is a Kelvin–Planck theory having an essentially unique Clausius–Duhem temperature scale, \(T^0\). If there is even one change of condition (as distinct from a process) that is not reversible—that is, if there exists \(\Delta \mathcalligra {m}^0 \in \mathscr {M}^{\circ }(\Sigma )\) such that for no choice of \(\mathcalligra {q}\) is \((\Delta \mathcalligra {m}^0,\mathcalligra {q})\) a reversible element of \(\hat{\mathscr {P}}\)—then \(T^0\) cannot have an essentially unique Clausius–Duhem entropy partner. For if \(\eta ^{\,0}\) were such a partner then, for any \(\mathcalligra {q}\in \mathscr {M}(\Sigma )\) that satisfies
Corollary 5.3 would require that \((\Delta \mathcalligra {m}^0,\mathcalligra {q})\) be a reversible element of \(\hat{\mathscr {P}}\).
Of course the presence of an abundance of irreversible processes in a Kelvin–Planck theory \((\Sigma ,\mathscr {P})\) does not preclude for it an essentially unique Clausius–Duhem entropy-temperature pair, as in Corollary 5.3 when \(\hat{\mathscr {P}}\) is a half-space of \(\mathscr {V}(\Sigma )\).
Remark 5.6
Consider a Kelvin–Planck theory \((\Sigma ,\mathscr {P})\) for which the set of Clausius–Duhem entropy-temperature pairs is not essentially unique. If \((\Delta \mathcalligra {m}^0,\mathcalligra {q}^0)\) is an irreversible element of \(\hat{\mathscr {P}}\) then there must exist at least one Clausius–Duhem pair for which the Clausius–Duhem inequality applied to \((\Delta \mathcalligra {m}^0,\mathcalligra {q}^0)\) is strict, for otherwise \((\Delta \mathcalligra {m}^0,\mathcalligra {q}^0)\) would, by Corollary 5.3, be reversible.
This prompts the following question: Is there a single Clausius–Duhem pair with respect to which the Clausius–Duhem inequality is strict when applied to every irreversible element of \(\hat{\mathscr {P}}\)? When \(\Sigma \) is a metric space the answer is yes. This follows from an argument similar to the one given in the proof of Theorem 7.2 in [6].
5.2 Essential Uniqueness of Entropy on a State-Space Sub-domain
Even when, for a Kelvin–Planck theory \((\Sigma ,\mathscr {P})\), there is an essentially unique Clausius–Duhem temperature scale on a state-space sub-domain \(\Sigma _0 \subset \Sigma \), we cannot expect in general that there will be an essentially unique specific-entropy function on \(\Sigma _0\). The following theorem describes precisely the circumstances under which such entropy-uniqueness on that sub-domain will obtain. Although the implication (ii) \(\Rightarrow \) (i) is straightforward, the less obvious reverse implication follows from the Hahn–Banach Theorem in the guise of Lemma 3.10. As might be expected, the situation is very much like that in Theorem 5.1, but with some subtle differences.
Theorem 5.7
Let \((\Sigma ,\mathscr {P})\) be a Kelvin–Planck theory, and let \(\Sigma _0\) be a subset of \(\Sigma \) consisting of at least two states. Suppose that \(T^{\dagger }_0: \Sigma _0 \rightarrow \mathbb {R}_+\) is the restriction to \(\Sigma _0\) of a Clausius–Duhem temperature scale \(T^{\dagger }: \Sigma \rightarrow \mathbb {R}_+\) and that every other restriction of a Clausius–Duhem temperature scale to \(\Sigma _0\) is a positive multiple of \(T^{\dagger }_0\). The following are equivalent:
(i) If \((\eta ,T^{\dagger })\) and \((\bar{\eta },T^{\dagger })\) are Clausius–Duhem pairs for \((\Sigma ,\mathscr {P})\) then, restricted to \(\Sigma _0\), \(\eta \) and \(\bar{\eta }\) differ by at most a constant.
(ii) For each pair of distinct states \(\sigma ' \in \Sigma _0\) and \(\sigma \in \Sigma _0\), there exists in \(\hat{\mathscr {P}}\) a reversible element \((\delta _{\sigma '} - \delta _{\sigma }, \mathcalligra {q})\), with the support of \(\mathcalligra {q}\) contained in \(\Sigma _0\).Footnote 9
Proof
To prove that (i) implies (ii) suppose, on the contrary, that (i) holds but that \(\sigma '\) and \(\sigma \) are states in \(\Sigma _0\) such that, for no choice of \(\mathcalligra {q}\in \mathscr {M}(\Sigma )\) with \(\textrm{supp} \,\mathcalligra {q}\subset \Sigma _0\), are both \((\delta _{\sigma '} - \delta _{\sigma },\mathcalligra {q})\) and its negative members of \(\hat{\mathscr {P}}\). In particular, \((\delta _{\sigma '} - \delta _{\sigma },\mathcalligra {q}^*)\) and its negative cannot both be members of \(\hat{\mathscr {P}}\), where \(\mathcalligra {q}^*\) is chosen to be a member of \(\mathscr {M}(\Sigma )\) that has support in \(\Sigma _0\) and that satisfies the equation
Here \((\eta ,T^{\dagger })\) is the first Clausius–Duhem pair in (i).
If \((\delta _{\sigma '} - \delta _{\sigma },\mathcalligra {q}^*)\) is not a member of \(\hat{\mathscr {P}}\), then Lemma 3.10 ensures that there is another Clausius–Duhem pair \((\eta ^{\,\#}, T^{\,\#})\) such that
Recall from Remark 2.8 that, for any \(\alpha > 0\), \((\frac{1}{\alpha }\,\eta ^{\,\#},\alpha T^{\,\#})\) is again a Clausius–Duhem pair. In particular, from the hypothesis of the theorem, there is an \(\alpha ^{\,*} > 0\) such that \(\alpha ^{\,*}T^{\,\#}(\cdot )\) and \(T^{\,\dagger }(\cdot )\) are identical on \(\Sigma _0\). Thus, with
\((\bar{\eta },T^{\dagger })\) is a Clausius–Duhem pair for \((\Sigma ,\mathscr {P})\). From (5.20) and the fact that \(\mathcalligra {q}^*\) has support in \(\Sigma _0\) it follows that
Comparison with (5.19) tells us that the specific-entropy functions \(\bar{\eta }\) and \(\eta \), both corresponding to the temperature scale \(T^{\,\dagger }\), do not differ on \(\Sigma _0\) merely by a constant, in contradiction to (i). If \(-\,(\delta _{\sigma '} - \delta _{\sigma },\mathcalligra {q}^*)\) is not a member of \(\hat{\mathscr {P}}\), the proof of contradiction to (i) is similar.
To prove that (ii) implies (i) suppose that \(\bar{\eta }\) and \(\eta \) are specific-entropy functions on \(\Sigma \) corresponding to the same Clausius–Duhem temperature scale \(T^{\dagger }\). Let \(\sigma _0\) be a fixed state in \(\Sigma _0\), and let \(\sigma \in \Sigma _0\) be another state. From (ii) it follows that \(\hat{\mathscr {P}}\) contains a reversible element \((\delta _{\sigma } - \delta _{\sigma _0},\mathcalligra {q})\). Because the element is reversible, the Clausius–Duhem inequality requires that
Thus, for any choice of \(\sigma \in \Sigma _0\) we have
which is to say that \(\bar{\eta }\) and \(\eta \), restricted to \(\Sigma _0\), differ by at most a constant. \(\square \)
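The computation behind this implication is brief. As a sketch, assuming the Clausius–Duhem inequality has the form \(\int _{\Sigma }\eta \,d(\Delta \mathcalligra {m}) \ge \int _{\Sigma }d\mathcalligra {q}/T\), reversibility of \((\delta _{\sigma } - \delta _{\sigma _0},\mathcalligra {q})\) forces equality for both pairs \((\eta ,T^{\dagger })\) and \((\bar{\eta },T^{\dagger })\):

\[ \eta (\sigma ) - \eta (\sigma _0) \;=\; \int _{\Sigma }\frac{d\mathcalligra {q}}{T^{\dagger }} \;=\; \bar{\eta }(\sigma ) - \bar{\eta }(\sigma _0). \]

Rearranging gives

\[ \bar{\eta }(\sigma ) - \eta (\sigma ) \;=\; \bar{\eta }(\sigma _0) - \eta (\sigma _0) \qquad \text{for all } \sigma \in \Sigma _0, \]

and the right-hand side does not depend on \(\sigma \). The middle integral is the same in both equalities whatever the support of \(\mathcalligra {q}\), since the two pairs share the temperature scale \(T^{\dagger }\) on all of \(\Sigma \).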
Remark 5.8
In the proof that (ii) implies (i) there was no need to require that the heating measure have support in \(\Sigma _0\); any heating measure with arbitrary support would do. However, with (i) satisfied, the deeper implication \((i) \Rightarrow (ii)\) indicates that there must also exist in \(\hat{\mathscr {P}}\) a reversible element \((\delta _{\sigma } - \delta _{\sigma _0},\mathcalligra {q})\), with the support of \(\mathcalligra {q}\) contained in \(\Sigma _0\).
Remark 5.9
For a Kelvin–Planck theory \((\Sigma ,\mathscr {P})\) we will say that states \(\sigma \) and \(\sigma '\) in \(\Sigma \) are reversibly-connected, denoted \(\sigma \approx _{\,\mathscr {R}} \sigma '\), if \(\sigma = \sigma '\) or if there is a \(\mathcalligra {q}\in \mathscr {M}(\Sigma )\) such that \(\hat{\mathscr {P}}\) contains both \((\delta _{\sigma } - \delta _{\sigma '}, \mathcalligra {q})\) and its negative. Like the Carnot relation \(\approx _{\,\mathscr {C}}\), the relation \(\approx _{\,\mathscr {R}}\) is an equivalence relation in \(\Sigma \). On any intersection of a \(\approx _{\,\mathscr {C}}\)-equivalence-class and an \(\approx _{\,\mathscr {R}}\)-equivalence-class there is essential uniqueness of Clausius–Duhem entropy-temperature pairs. Because in such an intersection all states are visited by both reversible connections and (reversible) Carnot elements, some readers might infer that these could only be equilibrium states. See, however, Sect. 7.
5.3 A Relationship Between the Supply of Entropy-Temperature Pairs and the Supply of Processes
For a Kelvin–Planck theory \((\Sigma ,\mathscr {P})\) in which there is an essentially unique Clausius–Duhem pair, Corollary 5.3 tells us that knowledge of a Clausius–Duhem pair determines \(\hat{\mathscr {P}}\) completely, so long as there is at least one irreversible process. When for the theory there is not an essentially unique Clausius–Duhem pair, we can still ask about the relationship between the full set of Clausius–Duhem pairs and \(\hat{\mathscr {P}}\). In particular, we can ask about circumstances under which \(\hat{\mathscr {P}}\) coincides with the set of all members of \(\mathscr {V}(\Sigma )\) that comply with the Clausius–Duhem inequality for every choice of Clausius–Duhem pair—that is, circumstances under which \(\hat{\mathscr {P}}\) is identical to the set
where \(CD (\Sigma ,\mathscr {P})\) is the set of all Clausius–Duhem entropy-temperature pairs for \((\Sigma ,\mathscr {P})\).
From the positivity of Clausius–Duhem temperature scales it follows easily that the set
is contained in \(\mathscr {Q}\). Thus, for \(\hat{\mathscr {P}}\) to coincide with \(\mathscr {Q}\) it is necessary that \(\hat{\mathscr {P}}\) contain \((0,-\mathscr {M}_+(\Sigma ))\). Less obvious is the fact that for \(\hat{\mathscr {P}}\) to coincide with \(\mathscr {Q}\) it is both necessary and sufficient that \(\hat{\mathscr {P}}\) contain \((0,-\mathscr {M}_+(\Sigma ))\). (This last assertion is largely a consequence of Lemma 3.9.)
Thus, if \(\hat{\mathscr {P}}\) contains the simplest elements of \(\mathscr {V}(\Sigma )\) that comply with the Clausius–Duhem inequality for every entropy-temperature pair—those elements of the form \((0,- \nu ),\ \nu \in \mathscr {M}_+(\Sigma )\)—then \(\hat{\mathscr {P}}\) must contain all elements of \(\mathscr {V}(\Sigma )\) that comply with the Clausius–Duhem inequality for every entropy-temperature pair.
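The containment of \((0,-\mathscr {M}_+(\Sigma ))\) in \(\mathscr {Q}\) can be checked directly. The following is a sketch that assumes \(\mathscr {Q}\) is defined by the Clausius–Duhem inequality in the form

\[ \mathscr {Q} \;:=\; \Bigl \{ (\Delta \mathcalligra {m},\mathcalligra {q}) \in \mathscr {V}(\Sigma ) \;:\; \int _{\Sigma }\eta \,d(\Delta \mathcalligra {m}) - \int _{\Sigma }\frac{d\mathcalligra {q}}{T} \,\ge \, 0 \ \text{ for all } (\eta ,T) \in CD (\Sigma ,\mathscr {P}) \Bigr \}. \]

For \((\Delta \mathcalligra {m},\mathcalligra {q}) = (0,-\nu )\) with \(\nu \in \mathscr {M}_+(\Sigma )\), the entropy term vanishes and

\[ \int _{\Sigma }\eta \,d(0) - \int _{\Sigma }\frac{d(-\nu )}{T} \;=\; \int _{\Sigma }\frac{d\nu }{T} \;\ge \; 0, \]

because \(T\) is positive and \(\nu \) is a non-negative measure. The inequality holds for every Clausius–Duhem pair, so \((0,-\nu )\) belongs to \(\mathscr {Q}\) under the assumed definition.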
Viewed as a process, \((\Delta \mathcalligra {m},\mathcalligra {q}):= (0,- \nu ),\ \nu \in \mathscr {M}_+(\Sigma )\) represents one that is cyclic and in which for every Borel set of states there is only heat emission. The First Law then indicates that the work done on the body suffering the process, \(\nu (\Sigma )\), is converted entirely into heat emitted to the body’s exterior. It is not unreasonable to suppose that physical processes of this kind, or approximations to them, are naturally abundant.
Remark 5.10
When \(\hat{\mathscr {P}}\) does not contain \((0,-\mathscr {M}_+(\Sigma ))\), it is a consequence of Lemma 3.9 that for any member (v, w) of \(\mathscr {Q}\) that is not a member of \(\hat{\mathscr {P}}\) there will nevertheless exist \(\nu \in \mathscr {M}_+(\Sigma )\) such that \((v,w + \nu )\) is a member of \(\hat{\mathscr {P}}\).
6 Conjoined Thermodynamical Theories and Thermometers
For the sake of simplicity and motivation, a thermodynamical theory \((\Sigma ,\mathscr {P})\) has been mostly viewed as a description of processes that bodies composed of a particular material might experience. From this viewpoint, derived functions of state for a Kelvin–Planck theory, such as a specific-entropy function \(\eta : \Sigma \rightarrow \mathbb {R}\), were deemed to be attributes of the particular material under consideration. In this interpretation of \((\Sigma ,\mathscr {P})\), hotness levels in \(\Sigma \) and their comparability relative to the “hotter than” relation \(\succ \) in \((\Sigma ,\mathscr {P})\) were regarded as intrinsic to the theory, ascertained only by appeals to the set of processes the material itself can or cannot experience. Indeed, we admitted the possibility that, for a particular Kelvin–Planck theory, \(\mathscr {P}\) might not be rich enough to render every pair of hotness levels intrinsically \(\succ \)-comparable or to make all Clausius–Duhem temperature scales essentially identical.
In this section we will expand that interpretation of a thermodynamical theory to accommodate the idea that bodies composed of a particular material inhabit a world containing bodies made of still other materials, and that these various bodies can exchange heat. Indeed, two bodies in contact, composed of different materials, can be viewed as a single compound body that exchanges heat with its exterior. In such a case, heat might be absorbed from the exterior (of the compound body) by the first body, passed to the second body, and emitted to that exterior by the second body.Footnote 10
With such processes in mind, we will introduce the idea of the conjunction of two thermodynamical theories, that is, a broader thermodynamical theory that embraces the two theories and that, in addition, allows for processes of the type just described. We will be especially interested in situations in which one of the theories characterizes a material that has special thermometric properties relative to the other.
We shall see that the presence of the thermometric theory in the conjunction can, in a certain sense, impart to the other theory a total “hotter than” relation or an essentially unique Clausius–Duhem temperature scale where neither existed before.
6.1 Conjoined Thermodynamical Theories
Definition 6.1
Let \((\Sigma _1,\mathscr {P}_1)\) and \((\Sigma _2,\mathscr {P}_2)\) be thermodynamical theories having disjoint state spaces. We say that a thermodynamical theory \((\Sigma _3,\mathscr {P}_3)\) is a conjunction of \((\Sigma _1,\mathscr {P}_1)\) and \((\Sigma _2,\mathscr {P}_2)\) if \(\Sigma _3 = \Sigma _1\cup \Sigma _2\), if both \(\mathscr {P}_1\) and \(\mathscr {P}_2\) are essentially contained in \(\mathscr {P}_3\), and if, for each \((\Delta \mathcalligra {m},\mathcalligra {q})\in \mathscr {P}_3\), \(\Delta \mathcalligra {m}(\Sigma _1) = 0\) and \(\Delta \mathcalligra {m}(\Sigma _2) = 0\).Footnote 11
Remark 6.2
This requires an explanation of what it means to say, for example, that \(\mathscr {P}_1\) is essentially contained in \(\mathscr {P}_3\). Note that if \((\Delta \mathcalligra {m}_1,\mathcalligra {q}_1)\) is an element of \(\mathscr {P}_1\), then \(\Delta \mathcalligra {m}_1\) and \(\mathcalligra {q}_1\) are both signed regular Borel measures on \(\Sigma _1\). On the other hand, if \((\Delta \mathcalligra {m}_3,\mathcalligra {q}_3)\) is an element of \(\mathscr {P}_3\), then \(\Delta \mathcalligra {m}_3\) and \(\mathcalligra {q}_3\) are both signed Borel measures on \(\Sigma _3 = \Sigma _1\cup \Sigma _2\). In formal terms, then, a process in \(\mathscr {P}_1\) cannot be a member of \(\mathscr {P}_3\). Nevertheless, we say that \((\Delta \mathcalligra {m}_1,\mathcalligra {q}_1)\) is essentially contained in \(\mathscr {P}_3\) if there is in \(\mathscr {P}_3\) a process \((\Delta \mathcalligra {m}_3,\mathcalligra {q}_3)\) such that \(\Delta \mathcalligra {m}_3\) and \(\mathcalligra {q}_3\) take the value zero on every Borel set in \(\Sigma _2\) and agree in value with \(\Delta \mathcalligra {m}_1\) and \(\mathcalligra {q}_1\) on every Borel set of \(\Sigma _1\).
6.2 Thermometers for a Kelvin–Planck Theory
Throughout the remainder of Sect. 6, \((\Sigma ,\mathscr {P})\) is a generic Kelvin–Planck theory with hotness levels \(\mathscr {H}\). In particular, we do not presume that the hotness levels of \(\mathscr {H}\) are totally ordered by \(\succ \), the “hotter than” relation in \((\Sigma ,\mathscr {P})\). However, \((\Sigma _{\Theta },\mathscr {P}_{\Theta })\) will hereafter designate a different Kelvin–Planck theory with hotness levels \(\mathscr {H}_{\Theta }\), this time totally ordered, according to Definition 3.14, by the “hotter than” relation \(\succ _{\Theta }\) in \((\Sigma _{\Theta },\mathscr {P}_{\Theta })\). It will be understood that \(\Sigma \) and \(\Sigma _{\Theta }\) are disjoint.
Definition 6.3
The Kelvin–Planck theory \((\Sigma _{\Theta },\mathscr {P}_{\Theta })\) is a thermometer for \((\Sigma ,\mathscr {P})\) if there is a Kelvin–Planck conjunction of \((\Sigma _{\Theta },\mathscr {P}_{\Theta })\) and \((\Sigma ,\mathscr {P})\), say \((\Sigma _C,\mathscr {P}_C)\), having the following property: For each \(\sigma \in \Sigma \) there is a \(\sigma _{\theta } \in \Sigma _{\Theta }\) such that both \((0,\delta _{\sigma } - \delta _{\sigma _{\theta }})\) and its negative are members of \(\widehat{\mathscr {P}_C} := \textrm{cl}\ [\textrm{Cone} \,(\mathscr {P}_C)]\). In this case \((\Sigma _C,\mathscr {P}_C)\) is a thermometric conjunction of \((\Sigma _{\Theta },\mathscr {P}_{\Theta })\) and \((\Sigma ,\mathscr {P})\).
Remark 6.4
The defining property amounts to a requirement that for each \(\sigma \in \Sigma \) there is a \(\sigma _{\theta } \in \Sigma _{\Theta }\) such that, in the conjunction, \(\sigma \) and \(\sigma _{\theta }\) are of the same hotness. See Appendix A, in particular Remark A.1, for a description of how the required passive heat transfers might come about in a natural way.
Remark 6.5
Here it will be helpful to regard \((\Sigma _{\Theta },\mathscr {P}_{\Theta })\) as a mathematical encoding of the thermodynamic properties of a thermometric material—that is, a material which, for the purposes of measuring hotness, can finely probe, by means of heat transfer processes, a different target material, characterized by \((\Sigma ,\mathscr {P})\); the thermometric material assigns to each state of the target material a hotness level in \(\mathscr {H}_{\Theta }\). In turn, each such hotness level is, relative to some chosen Clausius–Duhem temperature scale for \((\Sigma _{\Theta },\mathscr {P}_{\Theta })\), associated with a numerical value of temperature.
Remark 6.6
(Conditions sufficient to ensure that \((\Sigma _{\Theta },\mathscr {P}_{\Theta })\) is a thermometer for \((\Sigma ,\mathscr {P})\)) Suppose that \((\Sigma _C,\mathscr {P}_C)\) is a Kelvin–Planck conjunction of \((\Sigma _{\Theta },\mathscr {P}_{\Theta })\) and \((\Sigma ,\mathscr {P})\). In Proposition 6.7 below we assert that \((\Sigma _{\Theta },\mathscr {P}_{\Theta })\) is a thermometer for \((\Sigma ,\mathscr {P})\) if the conjunction satisfies some very weak and natural requirements: that \(\Sigma _{\Theta }\) is connected and that in the conjunction there is a kind of universal comparability of the states in \(\Sigma \) and those in \(\Sigma _{\Theta }\) with respect to the weakly-hotter-than relation \(_w{\succ _C}\) in the conjunction.Footnote 12
Proposition 6.7
Let \((\Sigma _C,\mathscr {P}_C)\) be a Kelvin–Planck conjunction of the Kelvin–Planck theories \((\Sigma _{\Theta },\mathscr {P}_{\Theta })\) and \((\Sigma ,\mathscr {P})\). Then \((\Sigma _{\Theta },\mathscr {P}_{\Theta })\) is a thermometer for \((\Sigma ,\mathscr {P})\) if the following three conditions are satisfied:
(i) \(\Sigma _{\Theta }\) is connected.
(ii) Any two states, one in \(\Sigma \) and the other in \(\Sigma _{\Theta }\), that are not of the same hotness in \((\Sigma _C,\mathscr {P}_C)\) are \(_w{\succ _C}\)-comparable.
(iii) For each \(\sigma \in \Sigma \) there is a state \(\sigma _{\theta }\in \Sigma _{\Theta }\) such that \(\sigma _{\theta }\) \(_w{\succ _C}\) \(\sigma \) and also a state \(\sigma '_{\theta }\in \Sigma _{\Theta }\) such that \(\sigma \) \(_w{\succ _C}\) \(\sigma '_{\theta }\).
Remark 6.8
(Pervasiveness of \(_w{\succ _C}\)-comparability) It is important to note that in order for two states of different hotness to be \(_w{\succ _C}\)-comparable it is enough that there be a passive heat transfer from one to the other. Appendix A suggests that such a transfer will take place whenever material samples in the two different states are brought into contact, however briefly.
Proof of Proposition 6.7
It must be shown that for each \(\sigma \in \Sigma \) there is a \(\sigma _{\theta } \in \Sigma _{\Theta }\) such that \(\pm (0,\delta _{\sigma } - \delta _{\sigma _{\theta }})\) are members of \(\widehat{\mathscr {P}_C}\). With \(\mathscr {T}_C\, \) denoting the set of all Clausius–Duhem temperature scales for \((\Sigma _C,\mathscr {P}_C)\), this is equivalent by Theorem 3.8 to showing that for each \(\sigma \in \Sigma \) there is a \(\sigma _{\theta } \in \Sigma _{\Theta }\) such that \(T(\sigma ) = T(\sigma _{\theta })\) for all \(T \in \mathscr {T}_C\).
Suppose on the contrary that there is a \(\sigma ^* \in \Sigma \) such that for each \(\sigma _{\theta } \in \Sigma _{\Theta }\) there is a \(\bar{T} \in \mathscr {T}_C\) such that \(\bar{T}(\sigma ^*) \ne \bar{T}(\sigma _{\theta })\). Let
and
By supposition these sets are disjoint. From Theorem 3.21 and (iii) both sets are non-empty. From the same theorem and (ii), the union of the two sets is \(\Sigma _{\Theta }\). Because \(U_{\ge }\) and \(U_{\le }\) are both closed and each is the (open) complement of the other, \(\Sigma _{\Theta }\) is the union of two disjoint non-empty open sets, in violation of (i). \(\square \)
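The two sets in this argument can be given a concrete reading. The following is one reading consistent with the steps of the proof, offered as a sketch; it assumes that Theorem 3.21 characterizes the weakly-hotter-than relation by the requirement that \(T(\sigma ') \ge T(\sigma )\) for every Clausius–Duhem temperature scale \(T\):

\[ U_{\ge } \;:=\; \bigl \{ \sigma _{\theta } \in \Sigma _{\Theta } \;:\; T(\sigma _{\theta }) \ge T(\sigma ^*) \ \text{ for all } T \in \mathscr {T}_C \bigr \}, \]

\[ U_{\le } \;:=\; \bigl \{ \sigma _{\theta } \in \Sigma _{\Theta } \;:\; T(\sigma _{\theta }) \le T(\sigma ^*) \ \text{ for all } T \in \mathscr {T}_C \bigr \}. \]

On this reading, a state in both sets would satisfy \(T(\sigma _{\theta }) = T(\sigma ^*)\) for all \(T \in \mathscr {T}_C\), which the supposition excludes. Each set is an intersection, over \(T \in \mathscr {T}_C\), of preimages of closed half-lines under the continuous function \(T\), and is therefore closed.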
In preparation for the next section we posit the following definition:
Definition 6.9
A thermometer \((\Sigma _{\Theta },\mathscr {P}_{\Theta })\) for a given Kelvin–Planck theory is an ideal thermometer for it if all Clausius–Duhem temperature scales for \((\Sigma _{\Theta },\mathscr {P}_{\Theta })\) are positive multiples of some fixed one.
Remark 6.10
(Approximate realization of ideal thermometers) If \((\Sigma _{\Theta },\mathscr {P}_{\Theta })\) is an ideal thermometer for a Kelvin–Planck theory \((\Sigma ,\mathscr {P})\) then, in addition to its thermometric properties, \((\Sigma _{\Theta },\mathscr {P}_{\Theta })\) must satisfy all of the equivalent conditions stipulated in Theorem 4.5. In particular, \(\hat{\mathscr {P}}_{\Theta }\) must contain a rich supply of Carnot elements. This might be the case, for example, when \((\Sigma _{\Theta },\mathscr {P}_{\Theta })\) describes the thermodynamics of a gas such as nitrogen or helium, with \(\Sigma _{\Theta }\) consisting of pairs of the form \((p, v)\), with \(p\) denoting the local pressure and \(v\) denoting the local specific volume. In this case, Carnot elements in \(\hat{\mathscr {P}}_{\Theta }\) might derive from Carnot cycles specified by paths in \(\Sigma _{\Theta }\), as depicted in standard textbooks.Footnote 13
Of course, the gas described by \((\Sigma _{\Theta },\mathscr {P}_{\Theta })\) must also satisfy the requirements of a thermometer for \((\Sigma ,\mathscr {P})\). The latter might, for example, describe bodies consisting of liquid mixtures in which chemical reactions occur among a collection of several specified molecular species. In that case, most elements of \(\Sigma \) would correspond to local mixture states in which chemical reaction equilibrium does not prevail. Nevertheless, Proposition 6.7, Appendix A, and Remark 6.8 indicate how, for \((\Sigma _{\Theta },\mathscr {P}_{\Theta })\), the thermometric requirements of Definition 6.3 might be satisfied by means of brief contacts between the gas and samples of the reacting liquid mixture.
6.3 Properties Imparted to a Kelvin–Planck Theory by the Existence of a Thermometer
The following theorem describes a sense in which the existence of a thermometer for a Kelvin–Planck theory \((\Sigma ,\mathscr {P})\) can impart to it properties that were not there intrinsically.
Theorem 6.11
Let \((\Sigma ,\mathscr {P})\) be a Kelvin–Planck theory in which the hotness levels in \(\Sigma \) are not necessarily totally ordered by \(\succ \), the hotter than relation in \((\Sigma ,\mathscr {P})\). Moreover, suppose that \((\Sigma _{\Theta },\mathscr {P}_{\Theta })\) is a thermometer for \((\Sigma ,\mathscr {P})\). If \((\Sigma _C,\mathscr {P}_C)\) is a thermometric conjunction of \((\Sigma ,\mathscr {P})\) and \((\Sigma _{\Theta },\mathscr {P}_{\Theta })\), then
(i) The hotness levels in \(\Sigma _C\) are totally ordered by \(\succ _C\), the hotter than relation in \((\Sigma _C,\mathscr {P}_C)\). As a result, any two states of \(\Sigma \) not in the same C-hotness level are, in the sense of Definition 3.18, \(\succ _C\)-comparable.
(ii) If the thermometer is ideal, then all Clausius–Duhem temperature scales for the conjunction are positive multiples of some fixed one. In particular, the restrictions to \(\Sigma \) of all Clausius–Duhem temperature scales for the conjunction differ by at most a positive multiple.
The theorem tells us that, for any two states \(\sigma , \sigma ' \in \Sigma \) that are not of the same hotness in \((\Sigma _C,\mathscr {P}_C)\), we either have \(\sigma ' \succ _C \sigma \) or \(\sigma \succ _C \sigma '\), this despite the fact that the same two states might not be intrinsically \(\succ \)-comparable in \((\Sigma ,\mathscr {P})\). The enhanced comparability results from the presence of the thermometer in the larger conjoined theory \((\Sigma _C,\mathscr {P}_C)\), a presence that provides for more processes with which hotness comparisons can be made.
Similarly, even when the Clausius–Duhem temperature scales for \((\Sigma ,\mathscr {P})\) are not all positive multiples of some fixed one (reflecting the absence of a sufficiently rich supply of Carnot elements in \(\hat{\mathscr {P}}\)), it will nevertheless be the case that, restricted to \(\Sigma \), all Clausius–Duhem temperature scales for the larger conjunction will be positive multiples of some fixed one, so long as the thermometer is ideal, that is, so long as the thermometer itself has an essentially unique Clausius–Duhem temperature scale. The essential uniqueness of Clausius–Duhem temperature scales for the conjunction derives from the richer supply of Carnot elements in \(\hat{\mathscr {P}}_C\). In Appendix B we describe a hypothetical physical scenario in which \(\hat{\mathscr {P}}_C\) contains a Carnot element operating between two states of \(\Sigma \) while \(\hat{\mathscr {P}}\) contains no such Carnot element.
Proof of Theorem 6.11
We begin with some preliminary remarks: Because \((\Sigma _C,\mathscr {P}_C)\) is a Kelvin–Planck theory, there exists for it an entropy-temperature pair, both functions having domain \(\Sigma _C\), that satisfies the Clausius–Duhem inequality for all processes in \(\mathscr {P}_C\). Let \((\eta _{\,C},T_{C})\) be any such pair. Because \(\mathscr {P}_{\Theta }\) is essentially contained in \(\mathscr {P}_C\), it is apparent that the restrictions of \(\eta _{\,C}\) and \(T_C\) to \(\Sigma _{\Theta }\) constitute a Clausius–Duhem pair for \((\Sigma _{\Theta },\mathscr {P}_{\Theta })\). In particular, the restriction of \(T_C\) to \(\Sigma _{\Theta }\) is a Clausius–Duhem temperature scale for \((\Sigma _{\Theta },\mathscr {P}_{\Theta })\). Therefore, whenever \(h'_{\Theta }\) and \(h_{\Theta }\) are hotness levels in \(\Sigma _{\Theta }\) such that \(h'_{\Theta } \succ _{\Theta } h_{\Theta }\) we must have \(T_C(\sigma '_{\theta }) > T_C(\sigma _{\theta })\) for all \(\sigma '_{\theta } \in h'_{\Theta }\) and \(\sigma _{\theta } \in h_{\Theta }\). Moreover, if \(\sigma '_{\theta }\) and \(\sigma _{\theta }\) are of the same hotness in \((\Sigma _{\Theta },\mathscr {P}_{\Theta })\), we must have \(T_C(\sigma '_{\theta }) = T_C(\sigma _{\theta })\).
Proof of (i). We need to show that, if \(h'_C\) and \(h_C\) are distinct hotness levels for \((\Sigma _C,\mathscr {P}_C)\), then \(h'_C\) and \(h_C\) are \(\succ _C\)-comparable in the sense of Definition 3.14. From properties of the thermometer, every state in \(\Sigma \) is of the same \(\succ _C\)-hotness as some state in \(\Sigma _{\Theta }\). From this it follows that every hotness level for \((\Sigma _C,\mathscr {P}_C)\) contains a representative from \(\Sigma _{\Theta }\). Suppose, then, that \(\sigma '_{\theta }\) and \(\sigma _{\theta }\) are such representatives taken from \(h'_C\) and \(h_C\), respectively. Again from properties of \((\Sigma _{\Theta },\mathscr {P}_{\Theta })\), it must be the case that, relative to \((\Sigma _{\Theta },\mathscr {P}_{\Theta })\), the hotness levels \(h'_{\Theta } \subset \Sigma _{\Theta }\) and \(h_{\Theta }\subset \Sigma _{\Theta }\), containing \(\sigma '_{\theta }\) and \(\sigma _{\theta }\), are either \(\succ _{\Theta }\)-comparable or else they coincide.
If \(\sigma '_{\theta }\) and \(\sigma _{\theta }\) are of the same \(\succ _{\Theta }\)-hotness, then from the preliminary remarks above we have \(T_C(\sigma '_{\theta }) = T_C(\sigma _{\theta })\) for every \(T_C(\cdot )\) in the set of Clausius–Duhem temperature scale for \((\Sigma _C,\mathscr {P}_C)\). From Theorem 3.8 it follows that \(\sigma '_{\theta }\) and \(\sigma _{\theta }\) are of the same hotness in \((\Sigma _C,\mathscr {P}_C)\). This, however, contradicts the supposition that \(h'_C\) and \(h_C\) are distinct.
Suppose, then, that \(h'_{\Theta }\) and \(h_{\Theta }\) are \(\succ _{\Theta }\)-comparable, with \(h'_{\Theta }\succ _{\Theta } h_{\Theta }\). From Theorem 3.15 and the preliminary remarks above, we have \(T_C(\sigma '_{\theta }) > T_C(\sigma _{\theta })\) for each choice \(T_C(\cdot )\) of Clausius–Duhem temperature scale for \((\Sigma _C,\mathscr {P}_C)\). From Definition 3.18 and Corollary 3.19 it follows that \(h'_C\) is \(\succ _C\)-comparable to \(h_C\), with \(h'_C\succ _C h_C\).
Proof of (ii). Suppose that all Clausius–Duhem temperature scales for \((\Sigma _{\Theta },\mathscr {P}_{\Theta })\) are positive multiples of some fixed one. We want to show that the same is true of all Clausius–Duhem temperature scales for \((\Sigma _C,\mathscr {P}_C)\). Let \(\bar{T}_C: \Sigma _C \rightarrow \mathbb {R}_+\) and \(T_C: \Sigma _C \rightarrow \mathbb {R}_+\) be Clausius–Duhem temperature scales for \((\Sigma _C,\mathscr {P}_C)\). Moreover, let \(\sigma ^*_{\theta }\) be a fixed state in \(\Sigma _{\Theta }\). It will be enough to show that
From properties of the thermometer \((\Sigma _{\Theta },\mathscr {P}_{\Theta })\), each \(\sigma \in \Sigma _C\) is of the same \((\Sigma _C,\mathscr {P}_C)\)-hotness as a state in \(\Sigma _{\Theta }\), denoted here as \(\sigma _{\theta }\). Because Clausius–Duhem temperature scales for \((\Sigma _C,\mathscr {P}_C)\) assign the same value to all states in \(\Sigma _C\) of the same \((\Sigma _C,\mathscr {P}_C)\)-hotness, (6.3) is equivalent to
From the preliminary remarks at the very beginning of the proof, the restriction to \(\Sigma _{\Theta }\) of any Clausius–Duhem temperature scale for \((\Sigma _C,\mathscr {P}_C)\) is a Clausius–Duhem temperature scale for \((\Sigma _{\Theta },\mathscr {P}_{\Theta })\). That (6.4) holds follows from the fact that all Clausius–Duhem temperature scales for \((\Sigma _{\Theta },\mathscr {P}_{\Theta })\) are identical up to a positive multiple. \(\square \)
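The chain of reasoning in the proof of (ii) can be summarized in symbols. What follows is only a sketch: the ratio form used here is one way of writing the condition that is consistent with the argument above, and it is an assumption rather than a quotation of (6.3) and (6.4). The constant \(\kappa \) is a name introduced here for illustration. For each \(\sigma \in \Sigma _C\), with \(\sigma _{\theta } \in \Sigma _{\Theta }\) of the same \((\Sigma _C,\mathscr {P}_C)\)-hotness as \(\sigma \),

$$\begin{aligned} \frac{\bar{T}_C(\sigma )}{\bar{T}_C(\sigma ^*_{\theta })} = \frac{\bar{T}_C(\sigma _{\theta })}{\bar{T}_C(\sigma ^*_{\theta })} = \frac{T_C(\sigma _{\theta })}{T_C(\sigma ^*_{\theta })} = \frac{T_C(\sigma )}{T_C(\sigma ^*_{\theta })}. \end{aligned}$$

The two outer equalities hold because \(\sigma \) and \(\sigma _{\theta }\) are of the same hotness. The middle equality holds because the restrictions of \(\bar{T}_C\) and \(T_C\) to \(\Sigma _{\Theta }\) are positive multiples of one another. Consequently \(\bar{T}_C = \kappa \, T_C\) on all of \(\Sigma _C\), with \(\kappa := \bar{T}_C(\sigma ^*_{\theta })/T_C(\sigma ^*_{\theta }) > 0\).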
6.4 Ensured Consistency of All Thermometers for a Kelvin–Planck Theory
For the Kelvin–Planck theory \((\Sigma ,\mathscr {P})\), we will suppose throughout this subsection that \((\Sigma _{\Theta 1}, \mathscr {P}_{\Theta 1})\) and \((\Sigma _{\Theta 2}, \mathscr {P}_{\Theta 2})\) are two different thermometers (with \(\Sigma _{\Theta 1} \cap \Sigma _{\Theta 2} = \emptyset \)) and that \((\Sigma _{C1},\mathscr {P}_{C1})\) and \((\Sigma _{C2},\mathscr {P}_{C2})\) are, respectively, thermometric conjunctions of the two thermometers with \((\Sigma ,\mathscr {P})\).
We want to show that, if the coexistence of the two (Kelvin–Planck) thermometric conjunctions does not, by virtue of that coexistence, conflict with the Kelvin–Planck Second Law, then the two thermometric conjunctions, each derived from a different thermometer, will impart to \(\Sigma \) precisely the same hotter-than relations. Moreover, if both thermometers are ideal, then both conjunctions will impart the same (essentially unique) Clausius–Duhem temperature scale on \(\Sigma \).
Definition 6.12
The thermometric conjunctions \((\Sigma _{C1},\mathscr {P}_{C1})\) and \((\Sigma _{C2},\mathscr {P}_{C2})\) are Kelvin–Planck compatible if there is at least one Kelvin–Planck theory \((\Sigma _{C3},\mathscr {P}_{C3})\) in which \(\Sigma _{C3} = \Sigma \cup \Sigma _{\Theta 1}\cup \Sigma _{\Theta 2}\) and \(\mathscr {P}_{C3}\) essentially contains \(\mathscr {P}_{C1}\) and \(\mathscr {P}_{C2}\) (in the sense of Remark 6.2).
Theorem 6.13
(Consistency of Thermometers) Suppose that thermometric conjunctions \((\Sigma _{C1},\mathscr {P}_{C1})\) and \((\Sigma _{C2},\mathscr {P}_{C2})\) for \((\Sigma ,\mathscr {P})\), corresponding to two different thermometers \((\Sigma _{\Theta 1}, \mathscr {P}_{\Theta 1})\) and \((\Sigma _{\Theta 2}, \mathscr {P}_{\Theta 2})\), are Kelvin–Planck compatible.
(i) On \(\Sigma \), the hotter-than relations derived from \((\Sigma _{C1},\mathscr {P}_{C1})\) and \((\Sigma _{C2},\mathscr {P}_{C2})\) are identical. That is, if \(\sigma '\) and \(\sigma \) are states in \(\Sigma \), then
$$\begin{aligned} \sigma ' \succ _{C1} \sigma \,\,\,\Leftrightarrow \,\,\,\sigma ' \succ _{C2} \sigma . \end{aligned} \tag{6.5}$$
(ii) Suppose that for \(j=1,2\) all Clausius–Duhem temperature scales for \((\Sigma _{Cj},\mathscr {P}_{Cj})\) are positive multiples of some fixed one, \(T^*_{Cj}:\Sigma _{Cj} \rightarrow \mathbb {R}_+\). Then, restricted to \(\Sigma \), all Clausius–Duhem temperature scales for the two thermometric conjunctions are essentially identical. In particular, if \(\bar{T}^*_{Cj}:\Sigma \rightarrow \mathbb {R}_+\) is the restriction of \(T^*_{Cj}\) to \(\Sigma \), then there is a positive number \(\alpha \) such that \(\bar{T}^*_{C2}(\cdot ) = \alpha \bar{T}^*_{C1}(\cdot )\).
Proof
Throughout the proof, \((\Sigma _{C3},\mathscr {P}_{C3})\) is a fixed Kelvin–Planck theory satisfying the requirements of Definition 6.12.
To prove (i) we let \(\sigma '\) and \(\sigma \) be states in \(\Sigma \) such that \(\sigma ' \succ _{C1} \sigma \). Corollary 3.19 then ensures that \(T_{C1}(\sigma ') > T_{C1}(\sigma )\) for every \(T_{C1}\) that is a Clausius–Duhem temperature scale for \((\Sigma _{C1},\mathscr {P}_{C1})\). Contrary to what is to be proved, suppose that either \(\sigma ' \prec _{\,C2} \sigma \) or \(\sigma ' \sim _{C2} \sigma \). In these two cases, we have, respectively, \(T_{C2}(\sigma ') < T_{C2}(\sigma )\) and \(T_{C2}(\sigma ') = T_{C2}(\sigma )\) for every \(T_{C2}\) that is a Clausius–Duhem temperature scale for \((\Sigma _{C2},\mathscr {P}_{C2})\).
The Kelvin–Planck theory \((\Sigma _{C3},\mathscr {P}_{C3})\) has at least one Clausius–Duhem temperature scale, say \(T_{C3}\). Because \(\mathscr {P}_{C1}\) is essentially contained in \(\mathscr {P}_{C3}\), it follows that the restriction of \(T_{C3}\) to \(\Sigma _{C1}= \Sigma \cup \Sigma _{\Theta 1}\) is a Clausius–Duhem temperature scale for \((\Sigma _{C1},\mathscr {P}_{C1})\), in which case \(T_{C3}(\sigma ') > T_{C3}(\sigma )\). Because \(\mathscr {P}_{C2}\) is essentially contained in \(\mathscr {P}_{C3}\), it also follows that the restriction of \(T_{C3}\) to \(\Sigma _{C2}= \Sigma \cup \Sigma _{\Theta 2}\) is a Clausius–Duhem temperature scale for \((\Sigma _{C2},\mathscr {P}_{C2})\), in which case \(T_{C3}(\sigma ') < T_{C3}(\sigma )\) or \(T_{C3}(\sigma ')= T_{C3}(\sigma )\). Thus, we have a contradiction. Proof that \(\sigma ' \succ _{C2} \sigma \) implies \(\sigma ' \succ _{C1} \sigma \) is similar.
To prove (ii) we again note, as in the proof of (i), that for \(j=1,2\) the restriction of \(T_{C3}\) to \(\Sigma _{Cj}= \Sigma \cup \Sigma _{\Theta j}\) is a Clausius–Duhem temperature scale for \((\Sigma _{Cj},\mathscr {P}_{Cj})\). Given the hypothesis of (ii), then, \(T^*_{Cj}:\Sigma _{Cj} \rightarrow \mathbb {R}_+\) must, for \(j=1,2\), be a positive multiple of the restriction of \(T_{C3}\) to \(\Sigma _{Cj}\). For this reason, \(\bar{T}^*_{C2}(\cdot )\) must be a positive multiple of \(\bar{T}^*_{C1}(\cdot )\). \(\square \)
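The final step of the proof of (ii) can be spelled out as follows. The constants \(\beta _1\) and \(\beta _2\) are names introduced here for illustration; they do not appear in the theorem statement.

$$\begin{aligned} T^*_{Cj} = \beta _j \, T_{C3}\big |_{\Sigma _{Cj}}, \quad \beta _j > 0, \quad j=1,2 \quad \Longrightarrow \quad \bar{T}^*_{C2}(\sigma ) = \beta _2 \, T_{C3}(\sigma ) = \frac{\beta _2}{\beta _1}\, \bar{T}^*_{C1}(\sigma ), \quad \sigma \in \Sigma . \end{aligned}$$

Thus the positive number \(\alpha \) in the statement of (ii) can be taken to be \(\beta _2/\beta _1\).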
7 Concluding Remarks: Equilibrium vs. Non-equilibrium Thermodynamics
In an attempt to clarify and soften distinctions that are usually drawn between “equilibrium” and “nonequilibrium” thermodynamics, we review here what the theorems in this article and its precursor tell us about (the sometimes conflated) necessary and sufficient conditions for the very separate (also sometimes conflated) questions of existence and uniqueness of Clausius–Duhem entropy-temperature pairs (Footnote 14).
The most important theorem of this two-part series is Theorem 2.4. It asserts that, for any thermodynamical theory consistent with the Kelvin–Planck Second Law, there exists a pair of continuous functions of state—a specific entropy function and a thermodynamic temperature scale—that, taken together, satisfy the Clausius–Duhem inequality for all processes the theory contains. This follows immediately from the Hahn–Banach Theorem. There is no requirement, either tacit or explicit, that the theory contain special processes, in particular reversible ones such as Carnot cycles or reversible processes that transform one state into another. Although brilliant classical textbook arguments do indeed show that a (presumed) abundance of reversible processes is sufficient to arrive at the existence of a Clausius–Duhem pair, Theorem 2.4 tells us that reversible processes are not necessary for that purpose.
However, existence of these functions for a given Kelvin–Planck theory and their uniqueness are very different matters. The larger the supply of processes, the smaller will be the set of Clausius–Duhem entropy-temperature pairs that comply with the Clausius–Duhem inequality for every process the theory contains. Thus, if the set of entropy-temperature pairs for a given Kelvin–Planck theory is to be unique, either with respect to the temperature scale alone or with respect to both functions, then the set of processes extant in the theory must be sufficiently large as to ensure that the theory’s set of entropy-temperature pairs is suitably narrow.
Theorem 4.5 indicates that, if a Kelvin–Planck theory is to have an essentially unique temperature scale on its entire state space domain, it is necessary that the theory contain an abundance of (reversible) Carnot elements; in fact, there must be a Carnot element operating between each pair of distinct states. If, in addition, the theory is to have a specific entropy function that is essentially unique on the entire state space, Theorem 5.1 requires that each pair of states also be connected by a reversible process. In each case, for uniqueness on the entire state-space domain it is necessary that every state be “visited” by a reversible process.
In the classical textbook picture, a reversible process has associated with it a path through a state space that can be traversed in both directions and in every detail. Such processes are usually regarded as ones that proceed so slowly that at each instant the body suffering the process can be regarded to be in a condition of equilibrium (or arbitrarily close to one). From this very classical perspective, an essentially unique Clausius–Duhem pair (or merely an essentially unique temperature scale) on the entire state space of a Kelvin–Planck theory would seem to require that all states in the theory be “equilibrium” states.
In this article, however, there is no notion of equilibrium (Footnote 15). A reversible element of a Kelvin–Planck theory \((\Sigma ,\mathscr {P})\) is a mathematical object specified by Definition 4.1. It carries no requirement of a path through \(\Sigma \) that is traversable in both directions, slowly or otherwise; in particular, there is no requirement of a two-way path through \(\mathscr {M}_{+}(\Sigma )\) traversed by a body’s condition measure. Although the abundance of reversible elements required by the uniqueness Theorems 4.5 or 5.1 might indeed derive in one application or another from consideration of the idealized slow near-equilibrium processes depicted in textbooks, that same abundance might derive from other sources and in other ways.
This is discussed in two appendices, tentatively described in the following remarks.
Remark 7.1
(Temperature scale uniqueness imparted to a Kelvin–Planck theory by the existence of an ideal thermometer) Appendix B is meant as a companion to the more general §§ 6.2 and 6.3. In consideration of a Kelvin–Planck theory \((\Sigma ,\mathscr {P})\) that describes a hypothetical chemically reacting solution, we argue in Appendix B that, even when two (nonequilibrium) states \(\sigma \) and \(\sigma '\) in \(\Sigma \) are unconnected by a (reversible) Carnot element \((0,c'\delta _{\sigma '} - c\delta _{\sigma })\) in \(\hat{\mathscr {P}}\), the existence (Footnote 16) of an ideal thermometer for \((\Sigma ,\mathscr {P})\) invariably gives rise to such a Carnot element in the conjunction of \((\Sigma ,\mathscr {P})\) with the thermometer. In that case, Theorem 6.11 ensures essential Clausius–Duhem temperature-scale uniqueness for the conjunction, in particular on all of \(\Sigma \).
Note that it is in the conjunction that temperature-scale uniqueness on \(\Sigma \) takes on its meaning and it is there that the presence of the Carnot element \((0,c'\delta _{\sigma '} - c\delta _{\sigma })\) is to be found. It is in this broadened sense, involving the presumed availability of an ideal thermometer, that Clausius–Duhem temperature-scale uniqueness becomes more universal in character than Clausius–Duhem entropy-function uniqueness, discussed in Sect. 5.
Indeed, if members of a collection of distinct Kelvin–Planck theories, corresponding perhaps to a great variety of different materials, each had the same ideal thermometer, then for each of the pairwise thermometric conjunctions there would be an essentially unique temperature scale, universally imposed across the collection by a single thermometer, regardless of whether state spaces of the individual Kelvin–Planck theories were restricted solely to states of equilibrium.
Remark 7.2
(A reversible element realized in the limit by hypothetical physical processes that are very fast) Theorems 5.1 and 5.7 indicate that, for a given Kelvin–Planck theory \((\Sigma ,\mathscr {P})\) with an essentially unique Clausius–Duhem temperature scale, essential uniqueness of a corresponding Clausius–Duhem specific-entropy function requires that every pair of states in \(\Sigma \) be connected by a reversible element of \(\hat{\mathscr {P}}\). However, this does not, by itself, require that all members of \(\Sigma \) are, in some sense, equilibrium states.
Appendix C is intended to indicate that reversible elements in \(\hat{\mathscr {P}}\) are not inextricably linked to slow transitions along paths in \(\mathscr {M}_{+}(\Sigma )\) consisting entirely of equilibrium conditions. There we suggest how a reversible element of the form \((\delta _{\sigma '}-\delta _{\sigma }, \mathcalligra {q}) \in \hat{\mathscr {P}}\) might arise in consideration of an idealized chemical reactor, where neither \(\sigma \), \(\sigma '\), nor the support of \(\mathcalligra {q}\), need be restricted to states of chemical equilibrium. Indeed, we indicate how such a reversible element might be the limit of a sequence of processes in \(\mathscr {P}\) corresponding to hypothetical physical realizations that occur at increasingly rapid rates.
What follows will serve as a summary.
(i) If, for a Kelvin–Planck theory, existence of Clausius–Duhem entropy and temperature functions of state is at issue, Theorem 2.4 would seem to provide little support for those who might argue, perhaps based on standard textbook derivations, that the domains of those functions should be limited to equilibrium states.
(ii) If, however, essential uniqueness of those same functions assumes critical importance in particular applications, then Theorems 4.5 and 5.1 might, in some contexts, lend support to the claim that the domain of those functions should indeed be restricted to states of equilibrium. However, Appendices B and C should be kept in mind: reversible processes in the sense of those theorems might have a variety of physical origins, some involving nonequilibrium states.
In any case we suspect that, in most applications, uniqueness of Clausius–Duhem entropy-temperature pairs will be considerably less consequential than their existence.
Notes
This weaker notion of hotter than corresponds to “hotter than in the first sense” (\(_1{\succ }\)) in [6].
Relation \(_s{\succ }\) is the analog of \(_3{\succ }\) in [6].
In particular, an irreversible process is a member of \(\mathscr {P}\) that is not a reversible element of the theory.
An example is given by (4.4), where \(\sigma '\) and \(\sigma \) are states in \(h'\) and h.
The support of the signed measure \(\mathcalligra {q}\) is the union of the supports of its (Hahn-Jordan) positive and negative parts.
Recall that, in a thermodynamical theory, a heating measure for a process suffered by a body (including such a compound one) takes account only of heat exchange of the body with its exterior, not heat flows internal to the (compound) body.
It is understood that \(\Sigma _3\) has the disjoint union topology inherited from \(\Sigma _1\) and \(\Sigma _2\).
In the spirit of Definition 3.20, we say that state \(\sigma '_C\) is weakly hotter than state \(\sigma _C\) if the two states are of different hotnesses and \(\widehat{\mathscr {P}_C}\) contains an element of the form \((0,\delta _{\sigma '_C} - \delta _{\sigma _C} + \nu )\), with \(\nu \in \mathscr {M}_+(\Sigma _C)\). In particular, \(\nu \) can be the zero measure, in which case there is a passive heat transfer from \(\sigma '_C\) to \(\sigma _C\).
For an ideal gas with processes as indicated in textbooks, the empirical ideal gas temperature scale, given by \(T(p,v) := \frac{M}{R}{pv}\), has the properties of a Clausius–Duhem temperature scale. Here M is the molecular weight of the gas and R is the ideal gas constant. Under wide-ranging conditions, helium approximates an ideal gas very well.
Although the word equilibrium is used often in thermodynamics textbooks, it is usually invoked intuitively and left without a precise definition, at least in a dynamical system sense.
Recall Remark 6.10.
When these assumptions are dropped the outcome is essentially the same, but the analysis becomes more cumbersome.
In the sequence, the amount of gas experiencing each cycle needn’t be the same.
In Remark A.1, \(\sigma '\) would be identified with \(\sigma \) here, while \(\sigma \) there would be identified with \(\sigma _{\theta }\) here.
There the Kelvin–Planck Second Law took the form \(\text {cl}\,(\textrm{Cone} \,(\mathscr {C}))\; \cap \; \mathscr {M}_{+}(\Sigma )= \{0\}\).
Note that in this case \(\mathscr {C}:= \text {cl}\,(\textrm{Cone} \,(\mathscr {C}))\).
Were \(\mathscr {C}\) in [6] identified with \(\mathscr {C}^*\) as defined here, all mathematics would remain the same; only the interpretation would be different.
In the theorem statement it is understood that \(C(\Sigma ,\mathbb {R})\) is given the sup norm topology.
References
1. Brézis, H.: Functional Analysis, Sobolev Spaces, and Partial Differential Equations. Springer 2011
2. Choquet, G.: Lectures on Analysis. W. A. Benjamin 1969
3. Clausius, R.: Über eine veränderte Form des zweiten Hauptsatzes der mechanischen Wärmetheorie. Annalen der Physik 169(12), 481–506, 1854
4. Denbigh, K.: The Principles of Chemical Equilibrium: With Applications in Chemistry and Chemical Engineering. Cambridge University Press 1981
5. Feinberg, M.: Foundations of Chemical Reaction Network Theory. Springer 2019
6. Feinberg, M., Lavine, R.: Thermodynamics based on the Hahn–Banach theorem: the Clausius inequality. Arch. Ration. Mech. Anal. 82(3), 203–293, 1983
7. Feinberg, M., Lavine, R.: Foundations of the Clausius–Duhem inequality. New Perspectives in Thermodynamics (Ed. J. Serrin), pp. 49–64. Springer 1986. Also available as Appendix 2A in Truesdell, C., Rational Thermodynamics, Springer 1984.
8. Feinberg, M., Lavine, R.: Entropy and thermodynamic temperature in nonequilibrium classical thermodynamics as immediate consequences of the Hahn–Banach Theorem: I. Existence 2023. arXiv:2308.08636 [math-ph]
9. Kelley, J.L., Namioka, I.: Linear Topological Spaces. Springer 1963
10. Pippard, A.B.: Elements of Classical Thermodynamics for Advanced Students of Physics. Cambridge University Press 1964
11. Robertson, A., Robertson, W.: Topological Vector Spaces, 2nd edn. Cambridge University Press 1973
Additional information
Communicated by C. Dafermos.
Appendices
Appendix A. An Example of Passive Unsteady Heat Transfer
For the purpose of motivation, we provided in Example 3.3 a hypothetical physical situation that, in a thermodynamical theory \((\Sigma ,\mathscr {P})\), gave rise to a cyclic process in \(\hat{\mathscr {P}}\) of the form \((0,\alpha (\delta _{\sigma '} - \delta _{\sigma }))\), with \(\alpha > 0\), where \(\sigma '\) and \(\sigma \) are states in \(\Sigma \). That the change of condition was the zero measure on \(\Sigma \) (that is, \(\Delta \mathcalligra {m}= 0\)) resulted from the fact that, in the example, the body suffering the process was in a temporally steady condition (as distinct from traditional thermodynamic equilibrium), so there was no change in the condition of the body between the process’s inception and its termination.
Again for the purpose of motivation, it is our intent in this appendix to show, by means of a different hypothetical physical situation, that the same element in \(\hat{\mathscr {P}}\) can derive from consideration of dynamic processes in which a steady condition is never present. The example is a simple toy model (for example, one-dimensional, no motion), but it can be generalized to contain more complex and more natural features, suggesting that such processes will appear in \(\hat{\mathscr {P}}\) whenever bodies come into momentary thermal contact.
Consider, then, two samples of material, both samples described by the thermodynamical theory \((\Sigma ,\mathscr {P})\), filling two slender tubes, each of length L and small cross sectional area A, insulated along their extent, but not at their ends. The two samples are aligned along the x-axis, from \(x = -L\) to \(x = L\). The samples abut at \(x=0\), separated by a perfectly heat-conducting barrier of negligible thickness. A continuous function \(r:[-L,L]\times [-t^*,t^*] \rightarrow \mathbb {R}\) describes the heat flux through tube cross-sections; that is, r(x, t) is the rate of heat flow per unit cross-sectional area, in the positive x-direction, through the cross-section at position x and at time t. We will assume that r(0, 0) is positive.
We will also assume that there are two continuous functions, \(\hat{\sigma }':[-L,0]\times [-t^*,t^*] \rightarrow \Sigma \) and \(\hat{\sigma }:[0,L]\times [-t^*,t^*] \rightarrow \Sigma \), that give the point-wise state of the material on the two sides of the barrier at each instant. We denote by \(\sigma '\) and \(\sigma \) material states, assumed to be different, contiguous to the two sides of the barrier at \(t=0\). That is,
$$\begin{aligned} \sigma ' := \hat{\sigma }'(0,0), \qquad \sigma := \hat{\sigma }(0,0). \end{aligned} \tag{A.1}$$
Remark A.1
This picture is especially apt in our consideration of thermometric conjunctions in Sect. 6, in which case \((\Sigma ,\mathscr {P})\), the Kelvin–Planck theory considered here, would be replaced by the thermometric conjunction \((\Sigma _C,\mathscr {P}_C)\). In such a context, \(\sigma \) might represent a state of the thermometric material, while \(\sigma '\) might represent a state of the material sample being probed.
We will argue that, given the physical situation described, consideration of physical processes suffered by a sequence of sub-bodies along the tubes will give rise to a corresponding sequence of elements in \(\hat{\mathscr {P}}\) that, for any \(\alpha > 0\) having units of energy, converges in \(\mathscr {V}(\Sigma )\) to \((0,\alpha (\delta _{\sigma '} - \delta _{\sigma }))\). (It is assumed that physical processes suffered by all such sub-bodies are accounted for separately in \(\mathscr {P}\).) For simplicity (Footnote 17), we suppose that there is no motion and that the local material density on each side of the barrier is independent of spatial position and time, with \(\rho '\) the density in the region \(x \in [-L,0)\) and \(\rho \) the density for \(x \in [0,L]\).
For any \(\xi \) and \(\tau \), with \(L> \xi > 0\) and \(t^*> \tau > 0\), we can calculate the process descriptor \(\mathcalligra {p}(\xi ,\tau ) = (\Delta \mathcalligra {m}(\xi ,\tau ),\mathcalligra {q}(\xi ,\tau )) \in \mathscr {P}\) that derives from consideration of the physical process suffered by the sub-body contained in the spatial interval \(-\xi \le x \le \xi \) over the course of the time interval \([-\tau , \tau ]\).
To specify a measure \(\mu \in \mathscr {M}(\Sigma )\) it is enough to specify how \(\mu \) integrates all continuous functions on \(\Sigma \); that is, it is enough to specify the bounded linear functional \(\Gamma _{\mu }: C(\Sigma ,\mathbb {R}) \rightarrow \mathbb {R}\) given by
$$\begin{aligned} \Gamma _{\mu }(f) := \int _{\Sigma } f \, d\mu , \quad \forall f \in C(\Sigma ,\mathbb {R}). \end{aligned} \tag{A.2}$$
The heating measure, \(\mathcalligra {q}(\xi ,\tau )\), for the process under consideration is given by
The change of condition measure, \(\Delta \mathcalligra {m}(\xi ,\tau )\), is specified by the stipulation that, for all \(g \in C(\Sigma ,\mathbb {R})\),
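Thus, if \(\mathscr {P}\subset \mathscr {V}(\Sigma )\) contains descriptors of all physical processes our toy model admits, then, from consideration of the physical process corresponding to \(\xi >0\) and \(\tau >0\), we can conclude that \(\mathscr {P}\) contains the process descriptor
$$\begin{aligned} \mathcalligra {p}(\xi ,\tau ) = (\Delta \mathcalligra {m}(\xi ,\tau ),\mathcalligra {q}(\xi ,\tau )), \end{aligned} \tag{A.5}$$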
with \(\Delta \mathcalligra {m}(\xi ,\tau )\) and \(\mathcalligra {q}(\xi ,\tau )\) given by (A.4) and (A.3). Therefore, if \(\alpha \) is a positive constant (carrying units of energy)
is a member of \(\textrm{Cone} \,(\mathscr {P})\). Our aim is to show that by judiciously taking a sequence of values of \(\xi \) and \(\tau \), shrinking to zero, (A.6) will converge in \(\mathscr {V}(\Sigma )\) to \((0,\alpha (\delta _{\sigma '} - \delta _{\sigma }))\), which is to say that \((0,\alpha (\delta _{\sigma '} - \delta _{\sigma }))\) is a member of \(\hat{\mathscr {P}}: = \textrm{cl}\,[\textrm{Cone} \,(\mathscr {P})]\).
To show convergence in \(\mathscr {V}(\Sigma )\) of (A.6) to \((0,\alpha (\delta _{\sigma '} - \delta _{\sigma }))\) as \(\xi _n\) and \(\tau _n\) approach 0 in at least certain selected ways, we will argue that, for every choice of f and g in \(C(\Sigma ,\mathbb {R})\),
provided that we take \(\tau _n = \frac{1}{n}\) and \(\xi _n = (\frac{1}{n})^2\). We first note from (A.4) that
where
It is evident that, so long as we take \(\tau _n = \frac{1}{n}\) and \(\xi _n = (\frac{1}{n})^2\), the quantity shown in (A.10) will approach zero as \(n \rightarrow \infty \).
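A rough sketch of why this particular choice of sequences suffices may be helpful. It rests on two assumptions made here for illustration, not taken from the displayed equations: that the normalizing factor applied to the process descriptor is inversely proportional to \(\tau \), and that the change-of-condition integral extends over a spatial interval of length \(2\xi \). Under those assumptions the quantity in question is bounded in absolute value by a constant \(K\) (a name introduced here, depending on \(g\), \(\alpha \), the densities, and \(r(0,0)\)) times the ratio \(\xi _n/\tau _n\), and

$$\begin{aligned} \frac{\xi _n}{\tau _n} = \frac{(1/n)^2}{1/n} = \frac{1}{n} \rightarrow 0 \quad \text {as } n \rightarrow \infty . \end{aligned}$$

The point of letting \(\xi _n\) shrink faster than \(\tau _n\) is that the material contained in the sub-body becomes negligible relative to the heat passing through it, so that the change of condition vanishes in the limit while the normalized heating does not.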
It remains to be argued that, with this same choice for \(\tau _n\) and \(\xi _n\),
From (A.3) it follows that
Therefore, to show that (A.12) holds, with \(\tau _n = \frac{1}{n}\) and \(\xi _n = (\frac{1}{n})^2\), it is enough to show that
and
However, these follow from continuity of the functions r, \(f\circ \hat{\sigma }'\) and \(f\circ \hat{\sigma }\).
Appendix B. New Carnot Elements Arising in a Thermometric Conjunction
As a companion to § 6.3, we provide here a discussion, supplemented by a toy physical picture, to suggest how, for a Kelvin–Planck theory \((\Sigma ,\mathscr {P})\) endowed with an ideal thermometer \((\Sigma _{\Theta },\mathscr {P}_{\Theta })\), their thermometric conjunction \((\Sigma _C,\mathscr {P}_C)\) can contain a Carnot element operating between two specified (perhaps non-equilibrium) states of \(\Sigma \) even when \((\Sigma ,\mathscr {P})\) itself contains no such Carnot element.
For this purpose we suppose that \((\Sigma ,\mathscr {P})\) is a Kelvin–Planck theory of liquid solutions composed of certain molecular species among which chemical reactions occur. We suppose also that \((\Sigma _{\Theta },\mathscr {P}_{\Theta })\) is an ideal thermometer for \((\Sigma ,\mathscr {P})\), encoding the behavior of a thermometric material, which we will presume to be a perfect gas.
As a preamble to the discussion, consider a single physical process involving heat transfer between two bodies—one composed of the reacting liquid solution described by \((\Sigma ,\mathscr {P})\) and the other composed of the thermometric gas described by \((\Sigma _{\Theta },\mathscr {P}_{\Theta })\). In the theory \((\Sigma ,\mathscr {P})\), the process will have associated with it a heating measure \(\mathcalligra {q}\), defined on the Borel subsets of \(\Sigma \). That same physical process, viewed from the perspective of the thermometric conjunction \((\Sigma _C,\mathscr {P}_C)\) will also have a heating measure \(\mathcalligra {q}_C\) defined on the Borel subsets of \(\Sigma _C = \Sigma \cup \Sigma _{\Theta }\). It should be clearly understood that the restriction of \(\mathcalligra {q}_C\) to the Borel sets of \(\Sigma \) can be very different from \(\mathcalligra {q}\). This is because the heating measure in \((\Sigma ,\mathscr {P})\) captures details of heat exchange between the reacting solution and its exterior, an exterior that includes the gas thermometer. For that same physical process, the corresponding heating measure in \((\Sigma _C,\mathscr {P}_C)\) captures the details of heat transfer between a composite body (the solution sample taken with the thermometer) and the exterior of that composite body. That is, in \((\Sigma _C,\mathscr {P}_C)\) the heating measure takes no account of heat transfer between the reacting liquid solution and the thermometric gas.
Now let \(\sigma \in \Sigma \) and \(\sigma ' \in \Sigma \) be states of the reacting solution, not necessarily states of chemical equilibrium. By properties of the thermometer, there are gas states \(\sigma _{\theta }\) and \(\sigma '_{\theta }\) in \(\Sigma _{\Theta }\) (and therefore in \(\Sigma _C\)) such that, in the conjoined theory \((\Sigma _C,\mathscr {P}_C)\), \(\sigma \) and \(\sigma _{\theta }\) are of the same hotness, as are \(\sigma '\) and \(\sigma '_{\theta }\). Therefore, \(\hat{\mathscr {P}}_C\) contains the passive heat transfers required by Definition 6.3 between these liquid solution states and their corresponding gas states.
Because the ideal thermometer \((\Sigma _{\Theta },\mathscr {P}_{\Theta })\) has a unique Clausius–Duhem temperature scale, Theorem 4.5 requires that \(\hat{\mathscr {P}}_{\Theta }\) contain a (reversible) Carnot element, say \((0,c\,\delta _{\sigma _{\theta }}-c'\,\delta _{\sigma '_{\theta }}) \in \mathscr {V}(\Sigma _{\Theta })\), operating between \(\sigma _{\theta }\) and \(\sigma '_{\theta }\). In physical terms, this Carnot element can be regarded as the limit of representations in \(\hat{\mathscr {P}}_{\Theta }\) of a sequence of classical ideal gas Carnot cycles (as usually depicted in pressure-volume space) traversing two (ever smaller) isothermal segments, one centered at \(\sigma _{\theta }\) and the other at \(\sigma '_{\theta }\) (Footnote 18).
Because \(\mathscr {P}_{\Theta }\) is, in the sense of Remark 6.2, essentially contained in \(\mathscr {P}_C\), \((0,c\,\delta _{\sigma _{\theta }}-\,c'\,\delta _{\sigma '_{\theta }})\), viewed as a member of \(\mathscr {V}(\Sigma _C)\), is a (reversible) Carnot element of \(\hat{\mathscr {P}}_C\). As we indicated above, \(\hat{\mathscr {P}}_C\) also contains (reversible) passive-heat-transfer elements of the form \((0,c'\,\delta _{\sigma '_{\theta }}-\,c'\,\delta _{\sigma '})\) and \((0,c\,\delta _{\sigma }-\,c\,\delta _{\sigma _{\theta }})\). Because \(\hat{\mathscr {P}}_C\) is a convex cone, the sum of these three members of \(\hat{\mathscr {P}}_C\), namely \((0,c\,\delta _{\sigma }-\,c'\,\delta _{\sigma '})\), is also a member of \(\hat{\mathscr {P}}_C\); its support lies entirely in \(\Sigma \).
This is to say that in the thermometric conjunction \((\Sigma _C,\mathscr {P}_C)\) there is invariably a Carnot element operating between two (arbitrary) states of the reacting liquid solution, \(\sigma \in \Sigma \) and \(\sigma ' \in \Sigma \), whether or not these be states of chemical equilibrium and even when the theory \((\Sigma ,\mathscr {P})\) of the reacting solution alone contains no such Carnot element.
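The cancellation that produces this Carnot element can be checked mechanically. The following is a minimal sketch, not part of the theory itself: it models only the heating-measure component of each element (the change-of-condition component is zero throughout), represents a finitely supported signed measure as a dictionary from state labels to weights, and uses hypothetical names (`add_measures`, `carnot_sum`, and the string labels for states) chosen here for illustration.

```python
from fractions import Fraction


def add_measures(*measures):
    """Sum finitely supported signed measures given as {state: weight} dicts."""
    total = {}
    for mu in measures:
        for state, weight in mu.items():
            total[state] = total.get(state, 0) + weight
    # Drop states whose weights cancel exactly.
    return {state: w for state, w in total.items() if w != 0}


def carnot_sum(c, c_prime):
    """Add the gas Carnot element and the two passive heat transfers."""
    # Carnot element in the thermometric gas: c*delta_{sigma_theta} - c'*delta_{sigma'_theta}
    carnot_gas = {"sigma_theta": c, "sigma_theta_prime": -c_prime}
    # Passive transfer: c'*delta_{sigma'_theta} - c'*delta_{sigma'}
    transfer_primed = {"sigma_theta_prime": c_prime, "sigma_prime": -c_prime}
    # Passive transfer: c*delta_{sigma} - c*delta_{sigma_theta}
    transfer_unprimed = {"sigma": c, "sigma_theta": -c}
    return add_measures(carnot_gas, transfer_primed, transfer_unprimed)


# Illustrative values only; exact rationals avoid floating-point residue.
result = carnot_sum(Fraction(3), Fraction(2))
```

With these illustrative values the gas-state terms cancel and `result` is supported only on the two solution states, with weight `c` on `sigma` and `-c'` on `sigma_prime`, mirroring \((0,c\,\delta _{\sigma }-\,c'\,\delta _{\sigma '})\).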
This is a consequence of the mathematics, deriving from the suppositions with which we began. To understand in more physical terms how such a Carnot element in \((\Sigma _C,\mathscr {P}_C)\) can emerge, even when absent in \((\Sigma ,\mathscr {P})\), it will be useful to consider a toy physical picture meant to reflect the mathematics. The cartoon, like all cartoons, is imperfect, but it is only meant to be suggestive. At the end of this appendix we will make two remarks about how, in a much more extended exposition, certain of those imperfections might be mitigated. These remarks will draw on Appendix A and the appendix of this article’s companion [8].
In the cartoon, we imagine the Carnot element \((0,c\,\delta _{\sigma }-\,c'\,\delta _{\sigma '})\) in \(\hat{\mathscr {P}}_C\) to derive from a (limit) process of the following kind: a solution sample in state \(\sigma \), contiguous to the thermometric gas, rapidly absorbs a very small amount of heat, say c calories, from an external bath while simultaneously passing that same small amount of heat to the thermometric gas in state \(\sigma _{\theta }\), all without appreciable changes to the solution sample. That heat is used to drive a small isothermal segment of a Carnot cycle in the gas, that segment containing state \(\sigma _{\theta }\). A small amount of heat, in the amount of \(c'\) calories, is removed from the gas during the cycle’s second small isothermal segment, that segment containing gas state \(\sigma '_{\theta }\). The removed heat is rapidly passed to a different sample of the reacting solution, this one in state \(\sigma '\), while an equal amount of heat is simultaneously passed from there to a second external bath.
Note that in this overall hypothetical physical process, viewed as one experienced by a physical conjunction of liquid solution and thermometric gas taken together, the only heat exchange between the conjunction and the conjunction’s exterior is in the form of heat passage from the first external bath to the first solution sample (while in state \(\sigma \)) and then from the second solution sample (while in state \(\sigma '\)) to the second external bath. This is reflected in the process’s codification as \((0,c\,\delta _{\sigma }-\,c'\,\delta _{\sigma '})\) in \(\hat{\mathscr {P}}_C\).
However, viewed from the perspective of the reacting solution alone, described by the Kelvin–Planck theory \((\Sigma ,\mathscr {P})\) (as distinct from \((\Sigma _C,\mathscr {P}_C)\)), the overall physical process indicated does not manifest itself as a Carnot element. If it is kept in mind that the thermometer is part of the solution’s exterior, as are the baths, it becomes apparent that there is no net absorption of heat from the solution’s exterior by solution in either state \(\sigma \) or state \(\sigma '\). This is to say that, in \((\Sigma ,\mathscr {P})\), the heating measure for the overall physical process indicated is the zero measure in \(\mathscr {M}(\Sigma )\).
Remark B.1
(Transient passive heat transfers between the baths and the solution samples) In the cartoon, there is a transfer of a small amount of heat from the reacting-solution sample, while the sample is in a perhaps nonequilibrium state \(\sigma \), to the thermometric material, while the thermometric material is in state \(\sigma _{\theta }\). Because the reacting sample might be in a rapidly changing composition state, there arises the question of how the passive heat transfer \((0,c(\,\delta _{\sigma }-\,\delta _{\sigma _{\theta }})) \in \hat{\mathscr {P}}_C\) could be realized. This was the general subject of Appendix A, with special reference to the thermometric setting in Remark A.1. In rough terms, that element in \(\hat{\mathscr {P}}_C\) is derived (in Appendix A) from consideration of a very narrow material region straddling the sample-thermometer interface during a time interval of vanishingly small duration.
Within the toy picture offered in this appendix, the initial reacting-liquid sample considered might be identified, in the sense of Appendix A (in particular Remark A.1), with a very thin sliver of liquid in the region \([-\varepsilon , 0]\) abutting the liquid-gas boundary, while the heat bath transmitting heat to that sample might be identified with the remaining liquid, residing in the region \([-L, -\varepsilon )\) exterior to the sliver.Footnote 19
Remark B.2
(About the addition of processes) Prior to the introduction of the physical cartoon, the Carnot element \((0,c\,\delta _{\sigma }-\,c'\,\delta _{\sigma '})\) in \(\hat{\mathscr {P}}_C\) derived mathematically as the sum of three other elements in \(\hat{\mathscr {P}}_C\), namely the passive heat transfers \((0,c'\,\delta _{\sigma '_{\theta }}-\,c'\,\delta _{\sigma '})\), \((0,c\,\delta _{\sigma }-\,c\,\delta _{\sigma _{\theta }})\), and the Carnot element in the thermometric gas, \((0,c\,\delta _{\sigma _{\theta }}-c'\,\delta _{\sigma '_{\theta }})\), viewed as a member of \(\hat{\mathscr {P}}_C\).
That, for a natural thermodynamical theory, the closure of the cone of the process set should be closed under addition is a consequence of reasoning given in the appendix of [8]. For the most part—but not entirely—this results from the supposition that two processes occurring in nature, suffered by different bodies, can be run in remote locations simultaneously to give a new natural process, suffered by the union of the two bodies, provided that the durations of the two separate processes are identical. However, this was just one supposition in the appendix of [8]. In light of still other natural suppositions, analysis in the appendix of [8] indicates that, for the purpose of the additivity result, the simultaneity requirement is, in effect, inconsequential.
This is mentioned here because, in the invocation of the physical cartoon, we have been casual about timing related to the two passive heat transfers between liquid and gas and also about timing related to the Carnot cycle in the gas. To be more precise, we have been casual about the timing of the physical processes (corresponding to members of \(\mathscr {P}_C\)) that approximate those three limiting elements of \(\hat{\mathscr {P}}_C\).
Discussion of such considerations would have made invocation of the cartoon significantly more complex than its didactic purpose warrants, but readers might want to keep in mind the appendix of [8].
A Reversible Element in \(\hat{\mathscr {P}}\) Involving Nonequilibrium States
It is the purpose of this appendix to indicate how, in a theory \((\Sigma ,\mathscr {P})\) of reacting mixtures, there might arise in \(\hat{\mathscr {P}}\ := \textrm{cl}\,[\textrm{Cone} \,(\mathscr {P})]\) a reversible element of the form \((\delta _{\sigma '} - \delta _{\sigma }, \mathcalligra {q})\), where neither \(\sigma ' \in \Sigma \) nor \(\sigma \in \Sigma \) is a state of chemical equilibrium.
We suppose that \((\Sigma ,\mathscr {P})\) describes gaseous mixtures of n molecular species \(A_1, A_2, \dots , A_n\) that participate in a perhaps complex network of chemical reactions. The local states will be regarded to be elements of the form \((c, \theta ) \in \mathbb {R}^{n+1}\), where \(c := [c_1, c_2,\dots , c_n] \in \mathbb {R}^n\) is the vector of local molar concentrations of the n species (moles per unit volume) and \(\theta \) is the local temperature (perhaps on an empirical temperature scale).
To describe \(\Sigma \), the full set of states for the theory, we first denote by \(M := [M_1,M_2,\dots ,M_n]\) the vector of molecular weights (mass per mole) of the species. For a fixed chosen positive value of \(\rho ^*\) (having units of mass per volume), the compact set
\[ \Omega := \{\, c \in \mathbb {R}^n : c_i \ge 0,\ i = 1,\dots ,n,\ \ M \cdot c \le \rho ^* \,\} \tag{C.1} \]
is the set of all local molar concentration vectors consistent with a local mass density less than or equal to \(\rho ^*\). Hereafter we take \(\Sigma = \Omega \times I\), where I is a closed (temperature) interval of positive real numbers, perhaps very large. We suppose that, in the theory, \(\rho ^*\) and I are chosen to preclude from \(\Sigma \) density and temperature extremes that are inappropriate to the model gaseous material under consideration.
For the mixture we presume that there are two smooth functions of state, \(\tilde{u}: \Sigma \rightarrow \mathbb {R}\) and \(\tilde{f}:\Sigma \rightarrow \mathbb {R}^n\) with the following interpretations: When \((c,\theta )\) is a local state in the mixture, \(\tilde{u}(c, \theta )\) is the local internal energy per unit volume, and \(\tilde{f}(c, \theta ) = [\tilde{f}_1(c,\theta ),\dots ,\tilde{f}_n(c,\theta )]\) is the vector of the local net molar production rates per unit volume of the n species due to the occurrence of all chemical reactions.
Consider a mixture sample that fills a rigid closed vessel of constant volume, V, and suppose that the mixture remains spatially homogeneous at all times. That is, at each instant the local state is the same everywhere. We presume that the local state is governed by the system of ordinary differential equations (C.2):
\[ \dot{c}_i = \tilde{f}_i(c,\theta ),\quad i = 1,\dots ,n, \qquad \frac{\textrm{d}}{\textrm{d}t}\,\tilde{u}(c(t),\theta (t)) = Q(t). \tag{C.2} \]
The overdot indicates differentiation with respect to time, and Q(t) is the rate per unit volume at time t of heat addition to the mixture within the reactor vessel. The first n equations are molar balances of the species. The last equation reflects the First Law of Thermodynamics applied to the reactor under consideration: The rate of change of internal energy of mixture filling the rigid reactor vessel is equal to the rate at which heat is supplied to it.
Remark C.1
Because the mass of the mixture filling the closed vessel is conserved and because the volume of the vessel is fixed, the density of the mixture remains constant in time, even as reactions cause the concentrations of the various species to change. If \(\rho \) is the fixed density of mixture in the vessel, presumed less than \(\rho ^*\), then the evolving vector of molar concentrations, governed by (C.2), will forever remain in the set
\[ \Gamma := \{\, c \in \mathbb {R}^n : c_i \ge 0,\ i = 1,\dots ,n,\ \ M \cdot c = \rho \,\}, \tag{C.3} \]
which is clearly contained in \(\Omega \).
As a consequence, \(\tilde{f}\) must be such that \(M \cdot \tilde{f}(c,\theta ) = 0\) for all c and \(\theta \). Under very weak assumptions about the kinetics of the various reactions, the function \(\tilde{f}\) also has the property that \(\tilde{f}_i(c,\theta ) \ge 0\) whenever \(c_i = 0\), which is to say that the production rate of an absent species is not negative [5]. Hereafter, we assume that \(\tilde{f}\) has the property that any solution of (C.2) that begins with c initially in \(\Gamma \) will be such that c(t) remains in \(\Gamma \), and therefore in \(\Omega \), for all later times along the solution.
By a solution of (C.2) we will mean a set of \(n+2\) functions of time, \(c_1(\cdot )\), \(c_2(\cdot )\),\(\dots \), \(c_n(\cdot )\), \(\theta (\cdot )\), \(Q(\cdot )\), that satisfy (C.2) in some time interval (particular to that solution). So that we can focus specifically on the toy reactor described, we will confine our attention to solutions such that, at the initial time, \([c(\cdot ),\theta (\cdot )]\) takes values in \(\Gamma \times I\).
Each such solution gives rise to a process \((\Delta \mathcalligra {m},\mathcalligra {q})\in \mathscr {V}(\Sigma )\) in the following way: Let \([t_i,t_f]\) be the time interval of the solution, and let \(\alpha := V\rho \) be the mass of the mixture in the vessel. Then the change of condition induced by the solution is
The heating measure \(\mathcalligra {q}\) induced by the solution is defined by its action on continuous functions: for each continuous \(\varphi \,: \Sigma \rightarrow \mathbb {R}\),
Hereafter, we suppose that \(\mathscr {P}\) contains as a subset all processes corresponding to those solutions of (C.2) that are consistent with the fixed mass \(\alpha \) of the mixture under consideration and initial conditions within \(\Gamma \times I\).
Let \(c^{\,0}\) be a mixture composition in \(\Gamma \) and let \(\theta ^{\,0}\) and \(\theta ^{\,*}\) be temperatures such that \([c^{\,0},\,\theta ^{\,0}]\) and \([c^{\,0},\theta ^{\,*}]\) are both in the interior of \(\Sigma \). Moreover, for small \(\varepsilon > 0\), let \(\theta _{\varepsilon }(\cdot )\) be the temperature history defined by
Consider the first n equations of (C.2), with the temperature given by (C.6) on the time interval \([0,\varepsilon ]\). From Remark C.1 and the smoothness of \(\tilde{f}\), the resulting n equations admit a solution \(c_{\,\varepsilon }(\cdot )\) on \([0,\varepsilon ]\) satisfying the initial condition \(c_{\,\varepsilon }(0) = c^{\,0}\). The full system (C.2) of \(n+1\) differential equations then admits the solution \(c_{\,\varepsilon }(\cdot ),\ \theta _{\,\varepsilon }(\cdot ),\ Q_{\,\varepsilon }(\cdot )\), with \(Q_{\,\varepsilon }(\cdot )\) calculated from \(c_{\,\varepsilon }(\cdot )\), \(\theta _{\,\varepsilon }(\cdot )\), and the last equation of (C.2).
This solution gives rise to the process \(\mathcalligra {p}_{\varepsilon } = (\Delta \mathcalligra {m}_{\varepsilon }, \mathcalligra {q}_{\,\varepsilon })\), where
and \(\mathcalligra {q}_{\varepsilon }\) is given by the requirement that, for every continuous \(\varphi : \Sigma \rightarrow \mathbb {R}\),
Because \(\Sigma \) is compact, there is a number A such that, for all \((c,\theta ) \in \Sigma \), \(\Vert \tilde{f}(c,\theta )\,\Vert \le A\). From the first n equations in (C.2) it follows that \(\Vert \, c_{\varepsilon }(t) - c^0\,\Vert \le A\,\varepsilon \) for all \(t \in [0, \varepsilon ]\). As a result \(\Delta \mathcalligra {m}_{\varepsilon }\) converges to
as \(\varepsilon \) approaches 0. Moreover, compactness of \(\Sigma \) ensures that there is a number B such that, on \(\Sigma \), \(\vert \nabla _c\, \tilde{u} \cdot \tilde{f}\,\vert \le B\). From the last equation in (C.2) and (C.6) it follows that, for all \(s \in [0,1]\),
Note that as \(\varepsilon \) approaches 0 the second term on the left of (C.10) approaches
From this it follows that, as \(\varepsilon \) approaches 0, the heating measure \(\mathcalligra {q}_{\varepsilon }\) given by (C.8) converges to \(\mathcalligra {q}_{0}\) defined by the requirement that, for each continuous \(\varphi : \Sigma \rightarrow \mathbb {R}\),
As \(\varepsilon \) approaches 0, then, the family of processes \(\mathcalligra {p}_{\varepsilon } = (\Delta \mathcalligra {m}_{\varepsilon }, \mathcalligra {q}_{\,\varepsilon })\) in \(\mathscr {P}\) converges to \(\mathcalligra {p}_{\,0} = (\Delta \mathcalligra {m}_{0}, \mathcalligra {q}_{0})\) in \(\hat{\mathscr {P}}\), with \(\Delta \mathcalligra {m}_{0}\) and \(\mathcalligra {q}_{0}\) given by (C.9) and (C.12). To see that \(-\mathcalligra {p}_{\,0}\) is also a member of \(\hat{\mathscr {P}}\) it suffices to reverse the roles of \(\theta ^0\) and \(\theta ^*\).
Thus, we have in \(\hat{\mathscr {P}}\) a reversible element of the form
Note that \(c^{\,0}\), \(\theta ^{\,*}\), and \(\theta ^{\,0}\) were chosen arbitrarily. Neither \([c^{\,0}, \theta ^{\,*}]\) nor \([c^{\,0}, \theta ^{\,0}]\) need be a state of chemical equilibrium — that is, a stationary solution of (C.2) with \(Q = 0\).
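The \(\varepsilon \)-parameterized construction can be explored numerically on a toy model. The sketch below is not the article's example: the isomerization kinetics, the energy function, every numerical constant, and the linear form of the temperature ramp are all assumptions made here for illustration. It integrates the species balances by explicit Euler steps along a prescribed ramp from \(\theta ^{\,0}\) to \(\theta ^{\,*}\) over \([0,\varepsilon ]\), computes \(Q\) from the energy balance, and records the composition drift together with the gap between the total heat supplied per unit volume and its fixed-composition value.

```python
import math

# Hypothetical toy data (not from the article): isomerization A <-> B.
M = (1.0, 1.0)        # equal molecular weights, so M . f = 0 holds
K_F, K_R = 3.0, 1.0   # temperature-independent rate constants (assumption)
CV = (2.0, 3.0)       # heat-capacity-like coefficients in the toy energy
H_B = 5.0             # energy offset of species B

def f(c, theta):
    """Net molar production rates; theta is unused in this toy kinetics."""
    r = K_F * c[0] - K_R * c[1]
    return (-r, r)

def du_dc(c, theta):
    """Gradient in c of the toy energy u = (CV . c) * theta + H_B * c_B."""
    return (CV[0] * theta, CV[1] * theta + H_B)

def du_dtheta(c, theta):
    return CV[0] * c[0] + CV[1] * c[1]

def run(eps, c0, theta0, theta_star, steps=2000):
    """Explicit Euler on [0, eps] with an ASSUMED linear temperature ramp."""
    h = eps / steps
    theta_dot = (theta_star - theta0) / eps
    c, heat = list(c0), 0.0
    for k in range(steps):
        theta = theta0 + theta_dot * k * h
        rate = f(c, theta)
        gc = du_dc(c, theta)
        # Energy balance: Q = grad_c u . c_dot + (du/dtheta) * theta_dot
        q = gc[0] * rate[0] + gc[1] * rate[1] + du_dtheta(c, theta) * theta_dot
        heat += q * h
        c = [c[0] + h * rate[0], c[1] + h * rate[1]]
    return c, heat

c0, theta0, theta_star = (0.6, 0.4), 300.0, 350.0
rho = M[0] * c0[0] + M[1] * c0[1]
A = math.sqrt(2.0) * max(K_F, K_R) * rho  # bound on ||f|| when c_A + c_B = rho
# Heat needed to raise the temperature at fixed composition c0 (toy model).
limit_heat = du_dtheta(c0, theta0) * (theta_star - theta0)

results = []
for eps in (1e-1, 1e-2, 1e-3):
    c_end, heat = run(eps, c0, theta0, theta_star)
    drift = math.hypot(c_end[0] - c0[0], c_end[1] - c0[1])
    mass_density = M[0] * c_end[0] + M[1] * c_end[1]
    results.append((eps, drift, abs(heat - limit_heat), mass_density))
    print(eps, drift, abs(heat - limit_heat))
```

In this toy model the drift stays below \(A\,\varepsilon \) and the heat gap shrinks with \(\varepsilon \), in line with the convergence argument, while the mass density \(M \cdot c\) is preserved as Remark C.1 requires.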
Remark C.2
If \((\Sigma ,\mathscr {P})\) in our example is a Kelvin–Planck theory, and if \(\bar{\eta }\,(\cdot )\) and \(\eta \,(\cdot )\) are Clausius–Duhem specific-entropy functions corresponding to the same Clausius–Duhem temperature scale, then the Clausius–Duhem inequality and the presence of \((\Delta \mathcalligra {m}_{0}, \mathcalligra {q}_{0})\) in \(\hat{\mathscr {P}}\) require that
Because \(c^{\,0}\), \(\theta ^{\,0}\) and \(\theta ^*\) (and \(\rho \)) were chosen arbitrarily, (C.14) indicates that for each \(c \in \Omega \), the functions \(\bar{\eta }\,(c, \cdot )\) and \(\eta \,(c, \cdot )\) differ by at most a constant. In fact, if \(T: \Sigma \rightarrow \mathbb {R}_+\) is the Clausius–Duhem temperature scale to which the specific-entropy function \(\eta (\cdot )\) corresponds, then for each \(c \in \Omega \) the Clausius–Duhem inequality, (C.9), and (C.12) require that \(\eta (c,\cdot )\) be a function of the form
where \(\theta ^{\,0}\) is some fixed value in I.
Remark C.3
Reversible processes in the canonical picture are fictitious ones that proceed so slowly, and with such small changes, that they could never be completed. Nevertheless, they are regarded as processes that, in principle, can be approximated by real ones sufficiently well that a complete theory should embrace them in the limit.
In the context of the reacting-mixture theory considered in this appendix, the limiting reversible process (corresponding to \(\varepsilon = 0\)) is approximated by processes of a very different kind: As \(\varepsilon \) approaches zero, they complete increasingly quickly, with increasingly rapid changes in temperature, and with increasingly higher rates of heat transfer (all sustained over vanishingly small time intervals).
Remark C.4
In the hypothetical \(\varepsilon \)-parameterized processes described, temperature is presumed to be spatially uniform despite very rapid rates of heat transfer. In the case of conductive heat transfer to the mixture at the reactor wall, large values of heat flux are associated with large values of spatial temperature gradients in the mixture at the mixture boundary. However, even in the case of conductive heat transfer from the exterior at the mixture boundary, a large value of \(Q_{\varepsilon }(t)\) (rate of heat receipt per unit reactor volume) does not necessitate a large heat flux (rate of heat receipt per unit area) at the reactor walls. For each \(\varepsilon > 0\) we can imagine the reactor vessel to be a tall narrow circular cylinder of fixed radius \(R_{\varepsilon }\), in which case the heat flux at the cylinder wall would be \(Q_{\varepsilon }(t)R_{\varepsilon }/2\). By choosing \(R_{\varepsilon }\) sufficiently small, the instantaneous heat fluxes at the wall (and presumably the temperature gradients there) can be kept as small as we wish.
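The wall-flux expression follows from dividing the total heating rate by the lateral wall area. A minimal sketch, assuming that heat enters only through the lateral wall of the cylinder (end caps neglected, as befits a tall narrow vessel); the function name and the numerical values are hypothetical.

```python
import math

def wall_heat_flux(q_per_volume, radius, height=1.0):
    """Heat flux at the lateral wall of a cylinder heated at q_per_volume."""
    volume = math.pi * radius ** 2 * height
    lateral_area = 2.0 * math.pi * radius * height
    return q_per_volume * volume / lateral_area

# The flux equals Q * R / 2 regardless of the cylinder height.
flux_short = wall_heat_flux(1000.0, 0.01, height=1.0)
flux_tall = wall_heat_flux(1000.0, 0.01, height=50.0)
print(flux_short, flux_tall)
```

Halving the radius halves the wall flux for the same volumetric heating rate, which is the sense in which a small \(R_{\varepsilon }\) keeps the flux small.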
Clausius Versus Clausius–Duhem Temperature Scales
In a 1983 article [6] we examined the existence and properties of Clausius temperature scales (as distinct from Clausius–Duhem temperature scales) for thermodynamic theories that respect the Kelvin–Planck Second Law. In that context, for a theory with state space \(\Sigma \), a Clausius temperature scale \(T: \Sigma \rightarrow \mathbb {R}_+\) was a continuous function that satisfies, for all cyclic processes, the Clausius inequality. The Clausius inequality is just the form that the Clausius–Duhem inequality takes for cyclic processes. Thus, \(T(\cdot )\) is a Clausius temperature scale if it is continuous and satisfies the Clausius inequality condition
\[ \int _{\Sigma }\frac{\textrm{d}\mathcalligra {q}}{T} \le 0,\quad \forall \ \mathcalligra {q} \in \mathscr {C}, \tag{D.1} \]
where \(\mathscr {C}\subset \mathscr {M}(\Sigma )\) is the set of heating measures associated with the theory’s cyclic processes. In [6], \(\mathscr {C}\) was called the set of cyclic heating measures. Because the focus of [6] was entirely on cyclic processes, there was nothing in [6] corresponding to \(\mathscr {P}\), the full set of processes central to this article (apart from an anticipatory description of \(\mathscr {P}\) in the concluding remarks of [6]). Instead, \(\mathscr {C}\) was taken in [6] as a primitive notion, part of the description of a cyclic heating system, \((\Sigma ,\mathscr {C})\).Footnote 20
In light of the interpretation given to \(\mathscr {C}\) in [6], we hereafter define \(\mathscr {C}\) in this appendix as follows: for a Kelvin–Planck theory \((\Sigma ,\mathscr {P})\),
\[ \mathscr {C} := \{\, \mathcalligra {q} \in \mathscr {M}(\Sigma ) : (0,\mathcalligra {q}) \in \mathscr {P} \,\}. \tag{D.2} \]
By a Clausius temperature scale for a Kelvin–Planck theory \((\Sigma ,\mathscr {P})\), we mean a continuous function \(T: \Sigma \rightarrow \mathbb {R}_+\) that satisfies the condition (D.1), with \(\mathscr {C}\) as in (D.2). For \((\Sigma ,\mathscr {P})\) we denote by \(\mathscr {T}_{Clausius}\) the set of all its Clausius temperature scales.
Our interest is in the relationship between the set of Clausius temperature scales for a Kelvin–Planck theory \((\Sigma ,\mathscr {P})\) and its set \(\mathscr {T}_{CD}\) of Clausius–Duhem temperature scales, described in Definition 2.5. Recall that a continuous function \(T: \Sigma \rightarrow \mathbb {R}_+\) is a Clausius–Duhem temperature scale for \((\Sigma ,\mathscr {P})\) if there is a continuous (specific entropy) function \(\eta : \Sigma \rightarrow \mathbb {R}\) such that
\[ \int _{\Sigma }\eta \;\textrm{d}\Delta \mathcalligra {m} \ \ge \ \int _{\Sigma }\frac{\textrm{d}\mathcalligra {q}}{T},\quad \forall \ (\Delta \mathcalligra {m},\mathcalligra {q}) \in \mathscr {P}. \tag{D.3} \]
Because (D.3) is a more demanding requirement than is (D.1) we always have
\[ \mathscr {T}_{CD} \subset \mathscr {T}_{Clausius}. \tag{D.4} \]
The following example demonstrates that \(\mathscr {T}_{Clausius}\) can in fact be larger than \(\mathscr {T}_{CD}\) even when \(\mathscr {P}\) (as distinct from \(\hat{\mathscr {P}}\)) is a closed convex cone:
Example D.1
Here we consider a Kelvin–Planck theory \((\Sigma ,\mathscr {P})\) with state space \(\Sigma \) consisting of just two states, labeled 1 and 2; \(\Sigma \) is given the discrete topology. The process set \(\mathscr {P}\) is the closed convex cone consisting of all \((\Delta \mathcalligra {m},\mathcalligra {q})\in \mathscr {V}(\Sigma )\) such that, with \(\alpha \) taking all real values,
We will consider first the nature of Clausius–Duhem entropy-temperature pairs for the Kelvin–Planck theory \((\Sigma ,\mathscr {P})\). In this case, a Clausius–Duhem entropy function \(\eta : \{1,2\} \rightarrow \mathbb {R}\) amounts to a specification of two numbers \(\eta _{\,1}\) and \(\eta _{\,2}\). A Clausius–Duhem temperature scale \(T: \{1,2\} \rightarrow \mathbb {R}_+\) amounts to a specification of two positive numbers \(T_1\) and \(T_2\). For \((\eta ,T)\) to constitute a Clausius–Duhem pair, it must satisfy the Clausius–Duhem inequality for all \((\Delta \mathcalligra {m},\mathcalligra {q})\in \mathscr {P}\). If we let \(\beta _1 = 1/T_1\) and \(\beta _2 = 1/T_2\), this amounts to the requirement that
From this it is apparent that we must have \(\eta _{\,2}- \eta _{\,1} > 0\) and
It is not difficult to see that (D.7) can be satisfied only if
In fact, so long as (D.8) is satisfied, the Clausius–Duhem inequality will be satisfied for all members of \(\mathscr {P}\) with \(\eta \) chosen such as to have
Thus, for the Kelvin–Planck theory under consideration the set of Clausius–Duhem temperature scales (Definition 2.5) is given by
We turn next to consideration of the set of Clausius temperature scales for the same Kelvin–Planck theory \((\Sigma ,\mathscr {P})\). In this case, the set of cyclic heating measures (corresponding to \(\xi =0,\ \alpha = 0\)) is given byFootnote 21
Thus, a temperature function satisfies the Clausius requirement (D.1) precisely when
This is to say that the set of Clausius temperature scales is given by
Note that \(\mathscr {T}_{CD}\) is contained in \(\mathscr {T}_{Clausius}\) but is not identical to it.
In Example D.1, \(\mathscr {P}\) was a closed convex cone. In the following example, which is highly similar to the preceding one, \(\mathscr {P}\) is not closed, and the distinction between \(\mathscr {T}_{CD}\) and \(\mathscr {T}_{Clausius}\) becomes substantially more pronounced.
Example D.2
In this example we consider a Kelvin–Planck theory \((\Sigma ,\mathscr {P})\) that is identical to the one in Example D.1 apart from one difference: Whereas in Example D.1 the parameter \(\alpha \) was permitted to take on all real values, here we restrict \(\alpha \) to the nonzero real values. Despite the difference, the set of Clausius–Duhem pairs here remains what it was in Example D.1. In particular, we again have
In this case, however, \(\alpha \ne 0\), so \(\mathscr {P}\) contains no cyclic processes at all, apart from the trivial one in which \((\Delta \mathcalligra {m},\mathcalligra {q})= (0,0)\). Thus, we have \(\mathscr {C}= \{0\}\) and, as a result,
Note that in the definition of \(\mathscr {C}\), given in this appendix by (D.2), the cyclic heating measures derive from the cyclic processes contained only within the true process set \(\mathscr {P}\), as distinct from the sometimes larger set \(\hat{\mathscr {P}}:= \text {cl}\,(\textrm{Cone} \,(\mathscr {P}))\). This definition of \(\mathscr {C}\) was motivated entirely by the physical interpretation given to \(\mathscr {C}\) in [6], where \(\mathscr {C}\) was merely described as the set of heating measures associated with cyclic processes. There was no mention or even a description of the fuller set of all processes (except in the concluding remarks).
In the main body of this article a cyclic element of a thermodynamic theory \((\Sigma ,\mathscr {P})\) is a member \((\Delta \mathcalligra {m},\mathcalligra {q})\) of \(\hat{\mathscr {P}}\) such that \(\Delta \mathcalligra {m}=0\). The set of cyclic elements of \((\Sigma ,\mathscr {P})\) contains not only all the cyclic processes in \(\mathscr {P}\) but also members of \(\mathscr {V}(\Sigma )\) that are approximated arbitrarily closely by “almost cyclic” processes (or positive multiples of them). Leaving the interpretation of \(\mathscr {C}\) in [6] aside, we could just as well have defined a Clausius temperature scale for a Kelvin–Planck theory to be a continuous function \(T: \Sigma \rightarrow \mathbb {R}_+\) such that
\[ \int _{\Sigma }\frac{\textrm{d}\mathcalligra {q}}{T} \le 0,\quad \forall \ \mathcalligra {q} \in \mathscr {C}^{*}, \tag{D.16} \]
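where \(\mathscr {C}^{*} \subset \mathscr {M}(\Sigma )\) is the set of heating measures associated with the theory’s cyclic elements. More precisely,Footnote 22
\[ \mathscr {C}^{*} := \{\, \mathcalligra {q} \in \mathscr {M}(\Sigma ) : (0,\mathcalligra {q}) \in \hat{\mathscr {P}} \,\}. \tag{D.17} \]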
For \((\Sigma ,\mathscr {P})\) we denote the set of Clausius temperature scales defined in this way by \(\mathscr {T}^*_{Clausius}\). Because the condition (D.16) is more demanding than (D.1) we call members of \(\mathscr {T}^*_{Clausius}\) the strong Clausius temperature scales for \((\Sigma ,\mathscr {P})\).
We will now reconsider Examples D.1 and D.2 in light of these ideas.
Because in Example D.1 \(\hat{\mathscr {P}}\) is identical to \(\mathscr {P}\), we have \(\mathscr {C}^* = \mathscr {C}\), so there is no distinction between \(\mathscr {T}^*_{Clausius}\) and \(\mathscr {T}_{Clausius}\). Thus, for Example D.1 we have
Every Clausius temperature scale is also a strong Clausius temperature scale.
In the case of Example D.2 we noted that there are no cyclic processes in \(\mathscr {P}\) apart from the trivial one, so \(\mathscr {C}= \{0\}\), and
However, for the process set \(\mathscr {P}\) described in Example D.2, \(\hat{\mathscr {P}}:= \text {cl}\,(\textrm{Cone} \,(\mathscr {P}))\) is identical to the process set \(\mathscr {P}\) of Example D.1, given by (D.5) with \(\alpha \) taking all real values. Thus, for Example D.2, \(\mathscr {C}^*\) is identical to \(\mathscr {C}\) in Example D.1, and
Example D.2 indicates that for a Kelvin–Planck theory the set of strong Clausius temperature scales can be very different from the set of Clausius temperature scales.
In both examples the sets of Clausius scales and strong Clausius scales are different from the set of Clausius–Duhem temperature scales, which in both examples is given by
Note that in Example D.2 the set of Clausius–Duhem scales is very different from the set of Clausius temperature scales, but in both examples the set of strong Clausius temperature scales resembles very closely the set of Clausius–Duhem temperature scales.
In some ways this last observation is surprising, for the Clausius–Duhem temperature scale requirement (Definition 2.5) must take cognizance of the entire process set \(\mathscr {P}\), not just its cyclic processes, while the strong Clausius temperature scale requirement (D.16) takes cognizance only of the heating measures for the Kelvin–Planck theory’s cyclic elements. In fact, though, Theorem D.3Footnote 23 below indicates that the phenomenon exhibited by the examples is general.
Theorem D.3
For any Kelvin–Planck theory the set of Clausius–Duhem temperature scales is dense in the set of strong Clausius temperature scales.
In the proof of Theorem D.3, a Hahn–Banach separation theorem (but a different version [11]) will again play a central role: Let V be a real locally convex topological vector space. If A and B are disjoint nonempty convex subsets of V and A is open, then there is a continuous linear function \(f:V \rightarrow \mathbb {R}\) and a constant \(\alpha \) such that \(f(x) < \alpha \) for all \(x \in A\) and \(f(x) \ge \alpha \) for all \(x \in B\).
Note that if B is closed under positive multiplication—that is, if \(\lambda b\) is a member of B for every \(b \in B\) and every \(\lambda > 0\)—then \(\alpha \) can be taken to be zero. In the proof below the set labeled CD, which will play the role of B, is closed under positive multiplication.
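For completeness, the reason that \(\alpha \) can be taken to be zero can be spelled out as follows. This is only a short sketch of the standard argument, using the notation of the separation theorem as stated above.

```latex
% Suppose f(b) >= alpha for all b in B, and lambda b is in B whenever b is in B and lambda > 0.
\begin{align*}
  \lambda f(b) = f(\lambda b) &\ge \alpha
      && \text{for all } b \in B,\ \lambda > 0, \\
  \lambda \to 0^{+}: \quad 0 &\ge \alpha, \\
  \lambda \to \infty: \quad f(b) &\ge \lim_{\lambda \to \infty} \frac{\alpha}{\lambda} = 0
      && \text{for all } b \in B, \\
  \text{hence} \quad f(x) < \alpha &\le 0 \le f(b)
      && \text{for all } x \in A,\ b \in B.
\end{align*}
```

Thus the same functional \(f\) separates A and B with the constant 0 in place of \(\alpha \).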
Proof of Theorem D.3
We consider a Kelvin–Planck theory \((\Sigma ,\mathscr {P})\). Let K be the linear subspace of \(C(\Sigma ,\mathbb {R})\) consisting of all constant functions. The equivalence relation \(\sim \) in \(C(\Sigma ,\mathbb {R})\) defined by \(f \sim g\) if and only if \(f-g \in K\) gives rise to the quotient vector space \(C_0(\Sigma ) := C(\Sigma ,\mathbb {R})/K\), with vectors consisting of the equivalence classes and vector space operations inherited from \( C(\Sigma ,\mathbb {R})\) in the usual way. We give \(C_0(\Sigma )\) the usual quotient topology, in which case \(C_0(\Sigma )\) and \(\mathscr {M}^{\circ }(\Sigma )\) are mutually dual spaces.
The equivalence class in \(C_0(\Sigma )\) containing \(f \in C(\Sigma ,\mathbb {R})\) is denoted [f]. Note that for every measure \(\mu \in \mathscr {M}^{\circ }(\Sigma )\) and every \(g\in [f]\) we have
\[ \int _{\Sigma } g\;\textrm{d}\mu = \int _{\Sigma } f\;\textrm{d}\mu , \tag{D.22} \]
so there is no ambiguity in the definition
\[ \int _{\Sigma }\,[f]\;\textrm{d}\mu := \int _{\Sigma } f\;\textrm{d}\mu . \tag{D.23} \]
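With a slight abuse of language, and identifying \(\beta \) with 1/T, we will say that \([\eta ]\in C_0(\Sigma )\) and \(\beta \in C(\Sigma ,\mathbb {R}_+)\) constitute a Clausius–Duhem pair \(([\eta ],\beta )\) for \((\Sigma ,\mathscr {P})\) if
\[ \int _{\Sigma }\,[\eta ]\;\textrm{d}\Delta \mathcalligra {m} - \int _{\Sigma }\beta \;\textrm{d}\mathcalligra {q} \ge 0,\quad \forall \ (\Delta \mathcalligra {m},\mathcalligra {q}) \in \mathscr {P}. \tag{D.24} \]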
In this way we can identify the set of Clausius–Duhem pairs for \((\Sigma ,\mathscr {P})\) with a subset CD of the locally convex topological vector space
\[ \mathscr {V}^*(\Sigma ) := C_0(\Sigma ) \oplus C(\Sigma ,\mathbb {R}). \tag{D.25} \]
What we have called \(\mathscr {V}(\Sigma )\) and \(\mathscr {V}^*(\Sigma )\) are mutually dual.
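The well-definedness of the integral of an equivalence class \([f]\) can be illustrated on a two-state space such as the one in Example D.1. The sketch below assumes that \(\mathscr {M}^{\circ }(\Sigma )\) consists of the signed measures with zero total mass, which is how the main body's notation is read here; the particular numbers are hypothetical.

```python
import math

def integrate(f, mu):
    """Integral of f against a finitely supported signed measure mu."""
    return sum(f[s] * mu[s] for s in mu)

states = (1, 2)
f = {1: 3.0, 2: -1.0}
g = {s: f[s] + 10.0 for s in states}   # same class as f: differs by a constant

mu_zero_mass = {1: -0.7, 2: 0.7}       # total mass 0 (assumed to lie in M°)
mu_unit_mass = {1: 0.3, 2: 0.7}        # total mass 1, for contrast

same = math.isclose(integrate(f, mu_zero_mass), integrate(g, mu_zero_mass))
differ = not math.isclose(integrate(f, mu_unit_mass), integrate(g, mu_unit_mass))
print(same, differ)
```

Adding a constant changes the integral by that constant times the total mass of the measure, so the choice of representative is immaterial exactly when the total mass vanishes.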
Let \(\beta _0\) be the reciprocal of a strong Clausius temperature scale for the Kelvin–Planck theory \((\Sigma ,\mathscr {P})\), and let N be an open convex neighborhood of \(\beta _0\) in \(C(\Sigma ,\mathbb {R}_+)\). We will show that N contains a \(\beta \) such that, for some \([\eta ] \in C_0(\Sigma )\), the pair \(([\eta ],\beta )\) is a member of CD. Suppose on the contrary that the open convex set \(C_0(\Sigma )\oplus N\) is disjoint from the convex set CD, which is invariant under positive multiplication. Then, from the Hahn–Banach separation theorem stated just above, there is a vector \((\Delta \mathcalligra {m}^*,\mathcalligra {q}^*) \in \mathscr {V}(\Sigma )\) such that
(i) \(\int _{\Sigma }\,[\eta ]\,\textrm{d}\Delta \mathcalligra {m}^* - \int _{\Sigma }\beta \;\textrm{d}\mathcalligra {q}^* \ge 0,\quad \forall \ ([\eta ],\beta ) \in CD\).
(ii) \(\int _{\Sigma }\,[f]\,\textrm{d}\Delta \mathcalligra {m}^* - \int _{\Sigma }g\;\textrm{d}\mathcalligra {q}^* <0,\quad \forall \ [f] \in C_0(\Sigma ),\ g \in N\).
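Note that (ii) cannot be satisfied unless \(\Delta \mathcalligra {m}^* = 0\), because \([f]\) ranges over the entire vector space \(C_0(\Sigma )\). Therefore, since \(\beta _0\) is a member of N, we have
\[ \int _{\Sigma }\beta _0\;\textrm{d}\mathcalligra {q}^* > 0. \tag{D.26} \]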
From (i) and Remark 5.10 it follows that there exists \(\nu \in \mathscr {M}_{+}(\Sigma )\) such that \((\Delta \mathcalligra {m}^*,\mathcalligra {q}^* + \nu )= (0,\mathcalligra {q}^* + \nu )\) is a member of \(\hat{\mathscr {P}}\), whereupon \(\mathcalligra {q}^* + \nu \) is a member of \(\mathscr {C}^*\). Because \(\beta _0\) is the reciprocal of a strong Clausius temperature scale for \((\Sigma ,\mathscr {P})\), we must have
\[ \int _{\Sigma }\beta _0\;\textrm{d}(\mathcalligra {q}^* + \nu ) \le 0. \tag{D.27} \]
Since \(\beta _0\) takes positive values and \(\nu \) is a member of \(\mathscr {M}_{+}(\Sigma )\), we have
\[ \int _{\Sigma }\beta _0\;\textrm{d}\mathcalligra {q}^* \ \le \ -\int _{\Sigma }\beta _0\;\textrm{d}\nu \ \le \ 0, \tag{D.28} \]
which contradicts (D.26). Therefore N contains the reciprocal of a Clausius–Duhem temperature scale. \(\square \)
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Feinberg, M., Lavine, R.B. Entropy and Thermodynamic Temperature in Nonequilibrium Classical Thermodynamics as Immediate Consequences of the Hahn–Banach Theorem: II Properties. Arch Rational Mech Anal 248, 43 (2024). https://doi.org/10.1007/s00205-024-01987-9