Abstract
The Onsager relations are discussed and it is suggested that they should be interpreted as there is a frame of reference where all the transport processes are independent. The concepts are illustrated with isobarothermal diffusion in simple metallic phases as well as complex ionic systems. A transformation from lattice-fixed frame of reference to number-fixed frame gives the Kirkendall effect as a cross effect.
Similar content being viewed by others
1 Introduction
In a legendary article from 1931 Onsager1 discussed cases where two or more irreversible transport processes occur simultaneously, e.g. heat and electric conduction. When occurring alone each process is described by a linear relation between the flux J and the gradient in a thermodynamic potential, e.g. Fourier’s law and the Ohm’s law. They can both be written in the form
The flux is the transported amount expressed per unit area and unit time in the direction of the driving force X which is the gradient of the appropriate thermodynamic potential. L is a phenomenological coefficient that represents a conductivity in the medium under consideration. When several processes occur simultaneously it is observed experimentally that one process may also depend on the driving force for another process, i.e. there are cross effects. For n simultaneous processes one would write
Figure 1 summarizes the processes of heat, electric and diffusive transport and their cross effects.
Inspired by Thomson (Kelvin) Onsager then showed that under some conditions
This is referred to as the Onsager reciprocity relations and was a major reason for awarding him with the Nobel prize in chemistry 1968. His mathematical proof is somewhat complex and is based on a number of assumptions and particularly the principle of microscopic reversibility. Onsager introduced the latter concept to represent that at equilibrium a process goes in the forward direction as often as in the backward direction. It is thus usually claimed that the relations hold close to equilibrium. However, the physical nature of this principle and how it relates to the underlying molecular laws is less clear. Over the years the reciprocity relations have been discussed and criticized frequently. The criticism by Truesdell2 is particularly concise and clear. It is also remarkable that Onsager did not consider transport processes at all in his proof but rather a generalized chemical reaction. The purpose of the present article is to demonstrate that the proof can be performed for transport processes and in a much simpler way with a straight forward physical meaning. We shall demonstrate that the relations hold even when the system is far from equilibrium as long as fluxes and forces are represented by the linear relation Eq 2.
In the following we shall use vector–matrix notation and write the flux as a columnar vector J with each element representing the flux of a certain quantity and X as a columnar vector with the conjugated driving forces as elements. L is thus a square matrix with elements \(L_{ij}\) and the same number of rows and columns as n, the number of transport processes considered. Equation 2 thus is written in the same form as Eq 1
and Eq 3 then is
where \(L^{\rm T}\) is the transpose of L, i.e. the matrix obtained by exchanging rows and columns. Equation 5 thus means that L is a symmetric matrix. The L matrix is referred to as the phenomenological matrix.
2 Entropy Production and Driving Forces
From a thermodynamic point of view the negative gradient of a potential \(\phi _i\) yields a force to give a flux \(J_i\) of its corresponding conjugated extensive quantity. Using \(\nabla \) to denote the gradient operator the driving force thus is \(-\nabla \phi _i\). The flux and the driving force are conjugated thermodynamic quantities. Examples of transported quantities and conjugated thermodynamic potentials are given in Table 1.
where T is the temperature, \(\mu _i\) the chemical potential of i, \(\psi \) the electric potential and P the pressure.
2.1 The Local Entropy Production
All irreversible processes produce entropy. Each process contributes to the total entropy production \(\sigma \) in the volume element under consideration by the amount \(-J_i\nabla \phi _i>0\). This relation is fundamental and stems from the combined first and second law of thermodynamics and an entropy balance, i.e. it does not depend on any model. As can be seen \(\sigma \) is expressed per volume and time. In vector–matrix notation we thus have the local entropy production.
where \(J^{\rm T}\) is the transpose of J and the columnar vector \(\nabla \Phi \) has the elements \(\nabla \phi _i\).
The local entropy production is a measure of the irreversibility and is independent of the choice of frame of reference for the fluxes. If fluxes and forces are expressed in an arbitrary frame of reference we would thus write
Consequently if \(\sigma \) is known in one frame of reference and the fluxes are transformed by a matrix A
it is necessary to transform also the driving forces to keep \(\sigma \) the same in order to make Eq 7 generally valid.
2.2 Transformation of Fluxes and Forces
Transform the fluxes with the matrix A according to Eq 8. As shown by several authors, e.g. Truesdell2 and Lorimer,3 the driving forces then must be transformed with a matrix B, i.e. \(\nabla \Phi '=B\nabla \Phi \), in such a way that the entropy production is invariant
thus \(A^{\rm T}B=I\), where I is the unity matrix and it follows that \(B=(A^{\rm T})^{-1}\) and
or
The L matrix must be transformed accordingly
Thus
It should be noted that if the L matrix is symmetric Eq 13 will guarantee that also \(L'\) must be symmetric.
2.3 The Onsager Reciprocity Relations
In accordance with the reciprocity relations1 the matrix L is always symmetric, i.e. \(L=L^{\rm T}\), provided that certain measures are taken. The most important ones are that the fluxes are independent and that fluxes and forces are chosen so that Eq 7 holds because the entropy production does not depend on frame of reference. As mentioned Onsager’s proof is not so easy to follow but from Eq 13 one may conclude that if there is a frame of reference where the processes are independent, i.e. the L matrix is diagonal and symmetric in a trivial sense, it is evident from Eq 13 that also \(L'\) will be symmetric although the off-diagonal elements are non-zero. It should be emphasized that it is well known that all real symmetric matrices can be diagonlized by the appropriate transformation.4
It is now suggested that the physical meaning of the reciprocity relations is that there is a frame of reference in which the true character of the processes as being independent is revealed by the L matrix being diagonal. The reason why we often do not see this is because we are looking at the processes in an inappropriate frame of reference and the only thing we see is that although the matrix has non-zero off-diagonal elements it is symmetric. An alternative proof of the reciprocity relations was presented by Verros and Testempasi in 2006.5 The proof is rather lengthy but seems to be based on the same ideas as discussed here.
3 Application to Isobarothermal Diffusion - Frame of Reference
We shall now demonstrate the preceding discussion with the case of isobarothermal diffusion. The reason for choosing this case is that it is practically important and there exist a large amount of experimental data. We consider a system with C components, i.e. its composition may be varied by adding the components in C different ways. Of course the fractions of the different components can only be varied in \(C-1\) different ways because they have to sum up to unity. In the following we will express the composition in terms of mole fractions \(x_i\) calculated from the molar content \(N_i\) of the different components i.
It this follows directly that \(\sum _{j=1}^Cx_j=1\).
3.1 Lattice-Fixed Frame
In a crystalline material it may seem natural to represent diffusion with the crystalline lattice as reference. One may then imagine “inert” markers that are fixed to the lattice as reference points. We call this frame of reference the lattice-fixed frame of reference. The C fluxes are independent in the lattice-fixed frame of reference. This means that there is no way to calculate one of the fluxes if one knows the \(C-1\) other fluxes. However, often it is more convenient to consider diffusion in other frames, e.g. during mixing and interdiffusion where the interest is the exchange of atoms. In a binary system one may be interested in how the two components mix with each other. A transfer of one component is then always balanced by an opposite transfer of the other component. This mean that the two fluxes are no longer independent when this frame of reference is used.
3.2 Frames of Reference with Dependent Fluxes
In practical assessment of diffusion data other frames of reference are used. These type of frames are defined by introducing a dependence between the fluxes, i.e.
where the amount of diffusing species is expressed in mol. Examples of such frames are given in Table 2, where \(m_i\) is the molar mass, \(\delta _{ik}\), the Kronecker delta that is 1 when \(i=k\) and 0 otherwise and s stands for a substitutional element. The number-fixed frame is identical to the volume-fixed frame when all elements have the same partial molar volume \(V_i\). The substitutional-fixed frame is identical to the volume fixed frame when all substitutional elements have the same volume \(V_S\), and the interstitial elements do not contribute to the molar volume. The molar volume is denoted \(V_\mathrm{m}\).
In all the frames of reference where a dependency is introduced between the C fluxes one flux is superfluous because it can be calculated from the other fluxes. In order to keep the number of independent “fluxes” the same we may replace one arbitrarily chosen flux with another quantity which is of interest. We shall return to this aspect in the next section.
3.3 Transformation Between Frames of Reference
Transformation from one frame of reference to any other frame of reference is achieved by multiplying the fluxes with a transformation matrix. Suppose that the fluxes J are known in one frame of reference and one would like to transform to another frame denoted by a “prime” as \(J'\). The fluxes are then transformed by multiplication with a transformation matrix A.
For example, if the fluxes J are known in the lattice-fixed frame of reference transformation with the matrix A gives the fluxes in any other frame defined by Eq 16.
where \(\displaystyle a_m=\sum \nolimits _{k=1}^Ca_kx_k\).
The transformation matrix thus has the elements \(a_{ik}=\delta _{ik}-x_ia_k/a_m\). As mentioned in the previous section one flux is now superfluous which means that information is lost in the transformation and the A matrix defined by Eq 17 is singular and cannot be inverted. Consequently it is not possible to perform the reverse transformation.
The sum of fluxes expressed in the lattice-fixed frame gives the net flow of atoms and is related to the migration velocity \(v_K\) of Kirkendall markers expressed in the number-fixed frame6.
By the same token \(\sum _{i=1}^CJ_iV_i\) represents the net-flow of volume relative the lattice. We will thus replace an arbitrarily chosen row in A with \(a_{ik}=b_i\) and the final A matrix is, if we chose to replace the C:th row
Transforming to the number-fixed frame and taking all \(b_i=-1\) the C:th flux thus is \(J_C'=-\sum _{i=1}^CJ_i=v_K/V_m\), i.e. the Kirkendall velocity expressed in the number-fixed frame. Consequently no information has been lost by using the A matrix defined in this way and the reverse transformation may be performed, e.g. it is possible to transform from number-fixed to lattice-fixed frame of reference. This of some practical importance because experimental evaluation of diffusion coefficients are often based on measured concentration profiles, e.g. using the so-called Boltzmann-Matano method and if one has also measured marker movement in the number-fixed frame one may calculate the diffusivities in the lattice-fixed frame.
3.4 Transforming from Lattice-Fixed Frame of Reference to Some Other Frames
In this section the transformation from lattice-fixed to other frames of reference is discussed. In Sect. 3.5 transformation between arbitrary frames of reference will be discussed.
3.4.1 Volume-Fixed Frame of Reference
The volume-fixed frame is defined by \(a_i=V_i\) for all species i, see Eq 15 and Table 2. Consequently \(a_m=V_m\) and the transformation matrix is
The last row gives \(v_K=-\sum _{i=1}^CJ_iV_i\), i.e. the Kirkendall velocity in the volume-fixed frame of reference.
3.4.2 Number-Fixed Frame of Reference
The number-fixed frame is defined by \(a_i=1\) for all species i, see Eq 16 and Table 2. Consequently \(a_m=1\) and the transformation matrix is
3.5 Transforming Between Arbitrary Frames of Reference
If we denote \(J^0\) as the flux in the lattice-fixed frame of reference we may transform to any other frame \(J^1\) by \(J^1=A^{0\rightarrow 1}J^0\), where \(J^1\) denotes the fluxes in the new frame of reference. Or we may transform to \(J^2\) by \(J^2=A^{0\rightarrow 2}J^0\). In fact we can transform from 1 to 2 by \(J^2=A^{1\rightarrow 2}J^1\) and we find that \(A^{1\rightarrow 2}=A^{0\rightarrow 2}(A^{0\rightarrow 1})^{-1}\) where \((A^{0\rightarrow 1})^{-1}\) denotes the inverse of \(A^{0\rightarrow 1}\).
4 The Concept of Mobility
As mentioned a negative gradient of a thermodynamic potential \(-\nabla \phi _i\) is a force to move something. The mobility \(M_i\) of a “particle” i is introduced from the assumption that its velocity \(v_i\) is proportional to the force \(-\nabla \phi _i\) acting on it, i.e
Suppose we have generalized “particles”, i.e. atoms, molecules, electrons, energy quanta etc. Except for a very short acceleration time the behaviour represented by Eq 22 is expected when a particle moves in a media with friction. The flux of i particles is obtained by multiplication with their concentration \(c_i\).
Introducing the \(C*C\) matrix L with the diagonal elements \(c_iM_i\) and all other elements zero we have in vector–matrix notation
4.1 Mobility in Isobarothermal Processes Without Electric Gradients
If all temperature, pressure and electric gradients can be neglected the local entropy production expressed in the lattice-fixed frame of reference is
where J is the columnar vector containing the C diffusive fluxes in the lattice-fixed frame of reference. \(\nabla \mu \) is the columnar vector containing the C chemical potential gradients \(\nabla \mu _i\). It should be emphasized that temperature may still vary in time but slow enough to level out any temperature differences in the considered volume. This is often a very good approximation on the micro-structural level. This may be understood from the fact that the thermal diffusivity in solid metals is in the order of \(10^{-5}\) m\(^2\)s\(^{-1}\) and from Einstein’s formula the time needed to achieve a homogeneous temperature over a distance of 1 mm is around \(10^{-1}\) s. One can then consider the local Gibbs energy dissipation, i.e. \(-T\sigma \), and use \(-\nabla \mu _i\) as the driving force, i.e. Eq 23 is usually written.
Introducing the \(C*C\) matrix L with the diagonal elements \(c_iM_i\) and all other elements zero we have in vector–matrix notation
5 Isobarothermal Diffusion in a Binary System
Consider a hypothetical binary system with components A and B. From Eq 26 and the fact that \(c_i=x_i/V_m\) one obtains in the lattice-fixed frame of reference
The L matrix in the lattice-fixed frame of reference thus is
In general \(J_A+J_B\ne 0\), i.e. the fluxes are independent in this frame of reference and the L matrix is diagonal. Transformation to the number-fixed frame of reference is performed with the matrix A given by Eq 21. Taking \(J_A'\) as dependent flux, i.e. \(J_A'=-J_B'\) one obtains
It should be observed that with this A matrix the second flux is not the dependent \(J_A'=-J_B'\) but it is rather taken as \(J_A+J_B\). From Eq 11 one then obtains the new set of forces
Equation 13 yields the new L matrix
As can be seen the \(L'\) matrix has off-diagonal elements but is symmetric and the reciprocity relations are clearly obeyed. In the new frame of reference \(X_2\) vanishes due to the Gibbs–Duhem relation. The fluxes in the new frame of reference thus are
The two fluxes \(J_1'\) and \(J_2'\) are independent and the \(L'\) matrix has off-diagonal elements. The Kirkendall effect, represented by the Kirkendall velocity \(v_K\) is now a cross effect. In a similar way other phenomena, e.g. electro migration, the motion of atoms in a strong electric field, may be regarded as cross effects.
6 Isobarothermal Diffusion in Ionic Systems
Ionic systems present some additional complications because diffusion occurs by migration of charged constituent species rather than electro-neutral components which have been considered so far. For simplicity, we now only consider elemental ions i, each with a charge \(Z_i\), and the electron \(e^{-1}\) with charge \(Z_{e^{-1}}=-1\). The electron is considered as a species and as a component that only contains the electron species.
It is common to assume that the diffusional flux \(J'_i\) of the ionic species i is described by an individual mobility and the gradient of the chemical potential of the ion i i.e. \(\nabla \Lambda _i\). Written as columnar vectors we use \(J'\) and \(\nabla \Lambda \). The L matrix thus is a square matrix and in the lattice-fixed frame of reference it is diagonal and is denoted \(L'\), i.e.
Equation 37 is sometimes referred to as Kohlrausch’s law. The basic parameters are the diagonal elements \(L'_i\) of the \(L'\) matrix which also includes the electron transport, \(L'_{e^{-1}}\). They are all proportional to the respective ionic mobilities. As the ions are charged there is an extra driving force due to the electric field \(\psi \). We arrange the \(Z_i\) values in a columnar vector Z and Eq 37 is modified into
where F is the Faraday constant. In order to transform to component diffusion the matrix B with elements \(B_{ki}\) is introduced. It tells how many moles of component k the ionic species i contains. The ionic species fluxes are thus transformed to component fluxes by
As an example consider diffusion in the hypothetical metal-deficit oxide \(M_{1-x}O_1\) with diffusing species \(M^{+2}\), \(M^{+3}\), \(O^{-2}\) , \(e^{-1}\) and the components M, O and \(e^{-1}\). Clearly \(x\ge 0\) with 0 representing the stoichiometric oxide. The B matrix is
The chemical potential gradients of the ionic species are related to the gradients of the component chemical potentials \(\nabla \mu \) by
where \(B^{\rm T}\) is the transpose of the B matrix.
Using Eq 41 we find that Eq 38 may be written
Wagner7 suggested in 1933 that in the absence of external electric field \(\psi \) would arrange itself in such a way that there would be no charge transfer due to diffusion, i.e. in vector–matrix notation
Thus the fluxes are not independent and one of them may be eliminated. We will thus eliminate the electron flux. Inserting Eq 42 in Eq 43 yields
Inserting Eq 44 in Eq 42 we obtain
where \(L''\) is
Although \(L'\) is diagonal \(L''\) is generally a filled matrix. It should also be observed that \(L''\) is not a square matrix but has the same number of rows as the number of species including the electrons and the same number of columns as the number of components including the electrons.
The next step is to transform from species flux to component flux by Eq 39. The result is
where L is
L is a square matrix with the number of rows and columns equal to the number of components including the electron. It is found that it is symmetric, i.e. Onsager relations are obeyed. It is further found that the row and the column that involve the electron vanish, i.e.
We may thus remove the electron as a component, i.e. \(\nabla \mu \) does not contain \(\nabla \mu _{e^{-1}}\). In practice we can thus use the convenient Eq 48 and simply skip the row and the column corresponding to the electron because they only contain zeros and we obtain our final L. However, it must be emphasized that the electronic mobility represented by \(L'_{e^{-1}}\) will enter in the remaining elements in the matrix because it is included in the scalar quantity \(Z^{\rm T}L'Z\) which is actually the sum \(\sum _iZ_i^2L_i'\) including the electron.
In this section we have thus shown that adopting Kohlrausch’s law we can bring the flux equation for ionic systems to the same form as Eq 27 with a symmetric L matrix. As we have previously shown that all legitimate transformations to other frames of reference will keep the matrix symmetric the Onsager relations are valid and stem from the diagonal matrix in Kohlrausch’s law.
7 Discussion
It is suggested that the Onsager reciprocity relations are a consequence of the fact that there is a frame of reference where all the considered transport processes are independent and the phenomenological matrix is then diagonal. Transformations of fluxes and forces to other frames under the constraint of an invariant entropy production then yields symmetric phenomenological matrices. Of course it remains to be proven that there actually exists a frame of reference where the processes are independent. Using his principle of microscopic reversibility Onsager1 showed that this would always be the case close to equilibrium in a system with chemical reactions. It is easy to show that isobarothermal transport processes based on random jumps will be independent in the lattice-fixed frame and give diagonal L matrices. Accounting for correlation effects will introduce off-diagonal contributions but Manning8 showed that in simple systems the L matrix will remain symmetric.
Data Availibility
Data sharing not applicable—no new data generated
References
L. Onsager, Reciprocal Relations in Irreversible Processes. Phys. Rev. 37, 405-426 (1931)
C. Truesdell, Rational Thermodynamics (McGRAW-HILL, New York, 1969)
J. Lorimer, Phenomenological Coefficients and Frames of Reference for Transport Processes in Liquids and Membranes. J. Chem. Soc. Faraday Trans. 74, 75 (1978)
K. Riley, Mathematical Methods for Physics and Engineering (Cambridge University Press, Cambridge, 2006)
G. Verros, A. Testempasi, An Alternative Proof of the Onsager Reciprocal Relations for Multi-component Diffusion. Lect. Ser. Comput. Comput. Sci. 6, 571-580 (2006)
A. Smigelskas, E. Kirkendall, Zinc Diffusion in Alpha Brass. Trans. AIME 171, 130 (1947)
C. Wagner, Beitrag zur Theorie des Anlaufvorgangs. Z. Phys. Chem. Abt. B 21, 21-41 (1933)
J. Manning, Cross Terms in Thermodynamic Diffusion Equations for Multicomponent Alloys. Metall. Trans. 1, 499-505 (1970)
Acknowledgements
This article is dedicated to the late professor John Morral who was always a great inspiration. In addition, the author would like to thank professor Zi-Kui Liu for fruitful discussions.
Funding
Open access funding provided by Royal Institute of Technology.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
This invited article is part of a special tribute issue of the Journal of Phase Equilibria and Diffusion dedicated to the memory of former JPED Editor-in-Chief John Morral. The special issue was organized by Prof. Yongho Sohn, University of Central Florida; Prof. Ji-Cheng Zhao, University of Maryland; Dr. Carelyn Campbell, National Institute of Standards and Technology; and Dr. Ursula Kattner, National Institute of Standards and Technology.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Ågren, J. The Onsager Reciprocity Relations Revisited. J. Phase Equilib. Diffus. 43, 640–647 (2022). https://doi.org/10.1007/s11669-022-00951-y
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11669-022-00951-y