Thermodynamic Limits of Spatial Resolution in Active Thermography

Thermal waves are caused by pure diffusion: their amplitude is decreased by more than a factor of 500 within a propagation distance of one wavelength. The diffusion equation, which describes the temperature as a function of space and time, is linear. For every linear equation the superposition principle is valid, which is known as Huygens principle for optical or mechanical wave fields. This limits the spatial resolution, like the Abbe diffraction limit in optics. The resolution is the minimal size of a structure which can be detected at a certain depth. If an embedded structure at a certain depth in a sample is suddenly heated, e.g., by eddy current or absorbed light, an image of the structure can be reconstructed from the measured temperature at the sample surface. To get the resolution the image reconstruction can be considered as the time reversal of the thermal wave. This inverse problem is ill-conditioned and therefore regularization methods have to be used taking additional assumptions like smoothness of the solutions into account. In the present work for the first time, methods of non-equilibrium statistical physics are used to solve this inverse problem without the need of such additional assumptions and without the necessity to choose a regularization parameter. For reconstructing such an embedded structure by thermal waves the resolution turns out to be proportional to the depth and inversely proportional to the natural logarithm of the signal-to-noise ratio. This result could be derived from the diffusion equation by using a delta-source at a certain depth and setting the entropy production caused by thermal diffusion equal to the information loss. No specific model about the stochastic process of the fluctuations and about the distribution densities around the mean values was necessary to get this result.


Introduction
In active thermography, subsurface-embedded structures are detected by heating the sample surface or the structures and measuring the time-dependent surface temperature. The temperature evolution as a function of space and time is determined by the heat diffusion equation, and its solution can be described as a composition of plain thermal waves with different frequencies and wavenumbers [1]. The heat diffusion equation is a macroscopic mean-value-equation in the sense that in a microscopic picture the temperature is proportional to the mean value of the kinetic energy of the molecules. Stochastic processes describe the statistical distributions of the fluctuations around those mean values [2].
The detection and location of the embedded structures from the measured temperature evolution at the surface are an inverse problem. In the microscopic picture, the movement of the molecules is invertible, as the kinetic equations are invariant to time reversal. The macroscopic mean-value-equation, for thermography the heat diffusion equation, is not invariant to time reversal any more. Recent results from nonequilibrium thermodynamics show that for macroscopic samples the mean entropy production is equal to the information loss by using the macroscopic mean-valueequation instead of the microscopic description (see Sect. 2 and 'Appendix'). This loss of information for the macroscopic description is the physical reason that the inverse problem gets ill-posed and that subsurface structures cannot be detected any more if they are lying too deep under the surface.
Thermal waves are caused by pure diffusion: their amplitude is decreased by a factor of e −2π ≈ 1/535 within a propagation distance of one wavelength [1]. The diffusion equation, which describes the temperature as a function of space and time ("diffusionwave fields" [3]), is linear. For every linear equation, the superposition principle is valid, which is known as Huygens principle for optical or mechanical wave fields. This allows a composition of the solution of plane waves having different wavenumbers and similar to the wave equation also for the diffusion equation, components with a higher wavenumber are attenuated more than those with a lower wavenumber. This limits the spatial resolution, like the Abbe diffraction limit in optics. The resolution is the minimal size of a structure which can be detected at a certain depth. If an embedded structure at a certain depth in a sample is suddenly heated, e.g., by eddy current or absorbed light, an image of the structure can be reconstructed from the measured temperature at the sample surface. One possibility for image reconstruction is a time reversal of the thermal waves. This inverse problem is ill-conditioned, and therefore regularization methods have to be used taking additional assumptions like smoothness of the solutions into account. In the present work for the first time, methods of nonequilibrium statistical physics are used to solve this inverse problem without the need of such additional assumptions and without the necessity to choose a regularization parameter.
Reconstructing the samples interior structure from the measured signals at the samples surface is a prominent example of an inverse problem. An inverse problem is a general framework that is used to convert observed measurements into information about a physical object or system. Inverse problems are typically ill-posed, as opposed to the well-posed problems more typical when modeling physical situations where the model parameters or material properties are known. Of the three conditions for a well-posed problem suggested by Hadamard [4] (existence, uniqueness, and stability of the solution or solutions) the condition of stability is most often violated [5]. Even if a problem is well-posed, it may still be ill-conditioned, meaning that initially small errors can grow exponentially. An ill-conditioned problem is indicated by a large condition number. It needs to be re-formulated for numerical treatment. Typically, this involves including additional assumptions, such as smoothness of solution [6]. This process is known as regularization, like truncated singular value decomposition (SVD) or Tikhonov regularization [7]. The choice of an adequate regularization parameter, which describes the trade-off between the original ill-conditioned problem and the additional assumptions (e.g., smoothness), is critical and has to be evaluated for every individual problem [6]. In this paper, we propose that by using the entropy production we get a physical background for choosing the regularization parameter for thermographic imaging and no additional assumption is necessary.
To simplify the following calculations, two assumptions have been made. First, heat diffusion is assumed to take place only in one dimension. Physically this happens in layered structures, when the lateral dimension of these structures is big compared to their thickness. Second, the heat source is a thin layer which is instantaneous heated in a thick sample. Mathematically, this can be approximated by a Dirac delta function in infinite space in one dimension. For real thermography, different boundary conditions on the sample surface have to be used. E.g., Mandelis has solved the heat diffusion equation in his book "Diffusion-Wave Fields" [3] in one, two, or three dimensions for different boundary conditions. The method demonstrated here to get a thermodynamic resolution limit for one-dimensional heat diffusion in infinite space can be generalized to such boundary conditions and more dimensions, as discussed in Sect. 4.
To get the spatial resolution usually a fine structure which is small compared to the resolution is imaged. Then the resolution is equal to the size of the imaged structure. For bigger structures, the convolution of the structure with the blurring from image resolution gives the size of the imaged structure. Using the Dirac delta function as the initial temperature distribution ensures that the size of the imaged structure is just the resolution. Of course the evaluated resolution is valid for any initial temperature profile as the heat diffusion equation is a linear one. For the excitation in active thermography often a pulse is used, e.g., a light or eddy current pulse. The pulse duration is chosen to be short compared to the time needed for thermal diffusion along the samples structures. Mathematically, this gives again a Dirac delta function, but in time. Signals for longer pulses can be calculated by a convolution integral using the temporal evolution of the generation pulse. Generalizations for other generation patterns, like a sinusoidal excitation for lock-in thermography, are discussed in Sect. 4. In the present work it should be shown that a general thermodynamic limit of the spatial resolution can be derived from a very recent result from stochastic thermodynamics, which is summarized in Sect. 2 and in the 'Appendix': if a macroscopic system is kicked out of the equilibrium by a short but not necessarily small perturbation, such as a short laser pulse, for the following dissipative process back to equilibrium the information loss about the kick magnitude is equal to the mean entropy production. In active thermography, the kick magnitude is no single number but the vector of the temperature in space just after the excitation pulse (=kick) or it can be the magnitude of all the Fourier components of the temperature wave, as used in Sect. 3.
The information loss can be quantified by the Kullback-Leibler divergence (e.g., [8], also called relative entropy). The Kullback-Leibler divergence D( f ||g) is used in information theory for testing the hypothesis that the two distributions with density f and g are different [8], and is defined as where ln is the natural logarithm. The Chernoff-Stein Lemma states that if n data from g are given, the probability of guessing incorrectly that the probability distribution for describing the data is f is bounded by the type II error ε = exp(−D( f ||g)) n , for n large [8]. In that sense D( f ||g) can describe some "distance" between the distribution densities f and g.
The inverse problem of estimating the kick magnitude from a measurement of an intermediate state, a certain time after the kick is ill-conditioned. Just after the kick its magnitude can be estimated very well. A long time after the kick, the state has nearly evolved back to equilibrium and all the information about the kick magnitude is lost. The information content at an intermediate state a time t after the kick with a distribution density p t in comparison to the equilibrium distribution p eq is D( p t || p eq ). For macroscopic systems, D is equal to the entropy production till time t Sect. 2. It has its maximum just after the kick, when no entropy has been produced yet. Then it decreases monotonously in time and gets zero in the limit of infinite time. But already at some earlier cut-off time t cut , all the information about the kick magnitude is lost: according to the Chernoff-Stein Lemma for a fixed error ε, the distribution p t at time t cannot be distinguished from the equilibrium distribution p eq if D( p t || p eq ) gets smaller than ln (1/ε) /n.
In active thermography, it is assumed that the information about the spatial pattern from the interior structure to the surface of the sample is transferred by heat diffusion. The "thermal wave" can be represented as a superposition of wave trains having a certain wavenumber or frequency in Fourier k-space or ω-space, respectively [9]. Instead of a cut-off time for the whole signal, the decrease of the Kullback-Leibler divergence in Fourier space gives a criterion for a cut-off wavenumber or a cut-off frequency as an upper limit, where all the information about the Fourier-component is lost because it cannot be distinguished from equilibrium according to Chernoff-Stein Lemma. In the past we have modeled heat diffusion by a Gauss-Markov process in Fourier space and found a principle limit for the spatial resolution [9]. Using the information loss and entropy production for a kicked process, it will be shown in Sect. 2 that the spatial resolution depends just on the macroscopic mean-value-equations and is independent of the actual stochastic process, as long as the macroscopic equations describe the mean heat flow and therefore also the mean entropy production as the mean heat flow divided by the temperature. General limits of spatial resolution are derived in Sect. 3 from the diffusion equation by using cut-off wavenumbers or frequencies from stochastic thermodynamics.

Information Loss and Entropy Production for Kicked Processes
In active pulse thermography, embedded structures are detected by heating the sample surface or the structures with a short excitation pulse, which "kicks" the sample out of its equilibrium. The kick is a sudden temperature rise due to, e.g., optical absorption or electromagnetic induction at the sample structures. For the reconstruction very often the temperature distribution just after the kick is the "kick magnitude" which should be reconstructed, like in photothermal depth profiling of the first kind [10]. But the present theory is valid also for other applications, e.g., determining the thickness of sample sheets of opaque materials, like metals. When lightening the surface with a short light pulse the heat is absorbed in a very thin surface layer. The absorbed heat diffuses into the material, is "reflected" at the back plane (or at voids in the sample), and measured as a temperature change of the surface. For the thermodynamic resolution limit for such a case, the depth a as given in (13) is the doubled thickness of the sheet as the thermal wave travels back and forth.
To derive the connection between information loss and entropy production, a generalization of the second law and of Landauers principle for states arbitrarily far from equilibrium given recently by Hasegawa et al. [11,12] and by Esposito and Van den Broeck in [13] is used. The main idea to deal with a non-equilibrium state p t is to perform a sudden transition from the known Hamiltonian H with equilibrium state p eq to a new one H * , such that the original non-equilibrium state becomes a canonical equilibrium with respect to H * . The average amount of irreversible work for this quench turns out to be the Kullback-Leibler divergence D( p t || p eq ) times k B T , where k B is the Boltzmann constant and T is the temperature of the system. The mean irreversible work for this quench is used either to change the entropy S of the kicked system itself or to heat the surrounding by H p t − H eq . For macroscopic samples it is shown in the 'Appendix' that the change in the system entropy S can be neglected compared to ( H p t − H eq )/T as fluctuations in macroscopic systems are small compared to the changes in the mean value. Therefore, the information k B D( p t || p eq ) about the non-equilibrium state any time t after the kick can be approximated by the mean entropy production described by the diffusion equation.
In information , k B D( p t || p eq ) can be identified as the amount of information that needs to be processed to switch from the known equilibrium distribution p eq (no kick) to the distribution p t a time t after the kick [8]. If the logarithm to the base 2 is taken instead of the natural logarithm in (1), D( p t || p eq ) measures the average number of bits needed to describe the kick magnitude, if a coding scheme is used based on the given distribution p eq rather than the "true" distribution p t [8]. The information theoretical interpretation of D( p t || p eq ) according to the Chernoff-Stein Lemma states that p t cannot be distinguished from p eq at the cut-off time t cut if D( p t cut || p eq ) becomes smaller than ln (1/ε) /n (see Sect. 1). In the present work the error level ε is chosen in a way that at the cut-off time the signal amplitude becomes less than the thermodynamic fluctuations (=noise level). In the next section this will be applied in the Fourier space to determine a cut-off wavenumber or a cut-off frequency.

Cut-Off Wavenumber and Frequency in Fourier Space
In this section, two different inverse problems for heat diffusion are presented. For the first inverse problem, the initial temperature profile T (x, 0) at a time t = 0 should be reconstructed from the measured temperature T (x, t m ) at one specific time t m > 0. For the second inverse problem, the temperature T (x m , t) is measured at a specific point x m , usually at the surface of the sample, for all times t > 0 and from that data, the temperature T (0, t) for the point x = 0 is reconstructed. The second inverse problem is more adequate for thermography, where the temperature can be measured only on the surface. Nevertheless, the first inverse problem is very instructive and therefore it is also presented here.
In active thermography, the macroscopic mean-value-equation for the temperature T (x, t) as a function of time t and space x after the kick, which is a short heating pulse, is the diffusion equation, with α is the thermal diffusivity. For simplicity x has only one dimension, but the same procedure can be used for a more dimensional space. The bilateral Fourier transform over space and its inverse iŝ where i = √ −1 and k = 2π/λ is the wavenumber, with the wavelength λ. The wavelength quantifies the spatial resolution of T (x, t) if the wavenumber in the integral of the Fourier transform is not taken till infinity but only to a limited wavenumber. This limitation of the wavenumber is caused by a lower cut-off-time for higher wavenumbers, where the Kullback-Leibler divergence D gets too small, so that the wave trains with a higher wavenumber cannot be distinguished from equilibrium.
For the first inverse problem we could show in [7] that for adiabatic boundary conditions, where the sample is thermally isolated, the eigenfunctions are cosinefunctions and in k-space (Fourier transform over space), the temperature evolution in time is a simple multiplication: The eigenvalues exp −k 2 αt decrease with higher wavenumbers k "exponentially," and for the inverse problem the multiplication with exp +k 2 αt as a huge number makes the reconstruction unstable for higher wavenumbers. Therefore, regularization was used, either truncated singular value decomposition (SVD) or Thikonov regularization. Results for both regularization methods were compared in [9]. The choice of an adequate regularization parameter, that is the cut-off value for the SVD or the trade-off parameter between the original ill-conditioned problem and the smoothness of the solution as an additional assumption for the Thikonov regularization, is critical. Li Voti et al. have described regularization by truncated SVD and genetic algorithms for photothermal depth profiling [10], where the influence of the number of used singular vectors on the reconstructed heat source profile is described. The L-curve method was used to find the optimum value of used singular vectors. Several groups, e.g., the group of Salazar [14], have investigated the reconstruction of thermal conducting depth-profiles from thermography data, e.g., by Thikonov regularization. This so-called photothermal inverse problem of the second kind is for small variations of the sample thermal properties as a function of depth mathematically the same problem as reconstructing the heat source profile [10]. In [7] the L-curve method was used to find the regularization parameter for Thikonov regularization, and in [9] a certain stochastic process, the Ornstein-Uhlenbeck process, was used to derive the cut-off wavenumber for the SVD. In both cases, the cut-off wavenumber k cut gave the same result as we derive in Eq. 7, but now without choosing a regularization parameter or using a specific stochastic process.
As initial condition at a time t = 0, the delta function T (x, 0) = δ (x) is taken, to be sure to get the imaging resolution and not a convolution with the initial structure (see Sect. 1), which results in a constant Fourier transformT (k, t = 0) = 1. All wavenumbers till infinity are present, which gives the best spatial resolution. λ going to zero means that even peaks which are only separated by a very small distance can be still reconstructed as two separate peaks. After a certain time t one gets from thermodynamics the mean entropy in k-space (e.g., [2,9]) proportional toT (k, t) 2 , which shows an exponential decay in time with exp −2k 2 αt , see Eq. 4; with the signal-to-noise ratio SNR. The wavenumber k cut is determined by using the Chernoff-Stein Lemma, If error ε is set to 1/ √ e and n = 1 for one measured temperature one gets that the cut-off wavenumber k cut is just the wavenumber for which at a time t the signal T (k cut , t) gets less than the noise level (Fig. 1). Then Noise level The reconstructed signal T r (x, t) is the Fourier transform of a rectangular function, which gives a sinc-function (see Fig. 1),

Reconstructed signal
The resolution δ r (t) is the "width" of the reconstructed signal and is taken as the distance between the zero points of the reconstructed signal T r (x, t) -to be on the save side -which is the wavelength corresponding to the wavenumber k cut , This is the same result as derived in [9] where we assumed that the thermal diffusion in k-space is an Ornstein-Uhlenbeck process, but in the derivation above the knowledge of the specific stochastic process is not necessary. The resolution given in Eq. 9 depends only on the diffusion equation (Eq. 2) as the equation for the mean value of the stochastic process.
Usually in thermography the temperature is not measured on the whole sample x at a certain time t, but at a certain x-which is usually the sample surface-the temperature is measured at several times t. Instead of a Fourier transform (Eq. 3) to k-space, a bilateral Fourier transform in ω-space is performed: Now for a certain depth a from the Fourier transform of the diffusion equation, which is the Helmholtz equation forT (x, ω), a cut-off frequency ω cut is determined by using the Chernoff-Stein Lemma analog to Eq. 6: ln SNR a 2 (11) This is consistent to a damping with the thermal diffusion length μ = √ 2α/ω cut , where the signal is damped by a factor of exp (−a/μ) to the noise level during propagation along the length a (e.g., [1] or [3]).
The reconstructed signal T r (a) at x = 0 is Like in Eq. 9 the resolution δ r (a) is taken as the distance between the zero points of the reconstructed signal T r (a), which is the wavelength corresponding to the wavenumber k(a), which is proportional to the depth a and independent from the thermal diffusivity α. The resolution is proportional to the thermal diffusion length at the cut-off frequency. This is not only the limit for the depth resolution, but also for the lateral resolution as can be estimated by using the formula for the Abbe diffraction limit δ Abbe = λ/ (2 sin (θ )). The minimal wavelength λ = 2π √ 2α/ω cut of the thermal wave as a certain point at the surface is now a function of the angle θ , as the path d at a certain angle θ is stretched to a = d/cos (θ ) (Fig. 2). This reduces the cut-off frequency ω cut by a factor of cos (θ ) 2 , which gives for the Abbe limit, Fig. 2 Thermal waves which do not go directly to the surface but at an angle θ have a longer path a and therefore their minimal wavelength which can be detected on the surface is increased by a factor 1/cos (θ )

Surface
The term sin (θ ) cos (θ ) is maximal for a value of θ = 45 • , where its value is 1/2: This expression for the lateral resolution is the same as given in (13) for the depth resolution. The detection circle at the surface has a radius of d. At a higher distance from the center (θ > 45 • ), the resolution is cut because of diffusion. Compared to optics, the influence of the minimal wavelength varying with θ on the Abbe resolution limit is not fully clear. This might result in a better resolution as estimated in (15); see also the discussion in Sect. 4.

Discussion, Conclusions and Outlook
Equations 13 and 15 are the main result of this work. For thermographic depth profiling, the axial and the lateral resolution are proportional to the depth and inversely proportional to the natural logarithm of the signal-to-noise ratio. The resolution does not depend on the thermal diffusivity α. This result could be derived from the diffusion equation by using a delta-source at a certain depth and setting the entropy production caused by thermal diffusion equal to the information loss. The delta-source in space as a point source is used to be sure to have a smaller structure than the blurring from image resolution. The derived resolution is valid for any initial temperature profile as the heat diffusion equation is a linear one. No specific model about the stochastic process of the fluctuations and about the distribution densities around the mean values was necessary to get this result. In earlier work [9] we derived Eq. 9 by assuming that the thermal diffusion in k-space is a special Gauss-Markov process (Ornstein-Uhlenbeck process). The same result in k-space was derived in [7] by choosing for the Thikonov regularization the regularization parameter equal to the inverse signal-tonoise ratio, which was justified by the "L-curve" method. In the present publication, e.g., the cut-off wavenumber k cut in Eq. 7 was derived by using a recently gained thermodynamic result: that the mean entropy production is equal to the information loss. In comparison to previous publications on photothermal depth profiling, e.g., by Li Voti [10], the groups of Salazar [14], or Majaron et al. [15], where the L-curve or the Morozov discrepancy principle was used to determine the regularization parameters, we could derive a principle limit from thermodynamics and no additional assumption for regularization or of a specific model for the stochastic process was necessary. This result is consistent with work about interference of thermal waves [16]. The wavenumber for a thermal wave with frequency ω is √ ω/2α, which is the inverse thermal diffusion length (e.g., [1]). Using the frequency ω cut from Eq. 11 the wavelength corresponding to this wavenumber is just the resolution in Eq. 13. Also experimental results indicate the linear relation between depth and resolution, e.g., in [15]. In the derivation of Eq. 15, using the formula for the optical Abbe diffraction limit, the minimal wavelength varies with the aperture-angle. This is different to optics, where the whole aperture has the same wavelength. Therefore, the resolution given in Eq. 15 might be an upper limit. This has to be verified by additional two-or three-dimensional models and simulations.
For future work instead of a short kick also other excitation patterns can be considered to evaluate the resolution limits. Busse used a phase angle measurement with a sinusoidal excitation in lock-in thermography to get a better resolution in depth [17]. Sreekumar and Mandelis proposed a chirped excitation pattern like in radar technology to get a better spatial resolution at a certain depth [18]. Using the mean entropy production it should be possible to give also thermodynamic resolution limits for those excitation patterns and compare them to resolution limits for single short pulse excitation. Different and more realistic boundary conditions for the surface should be implemented (e.g., third kind in [3]) to be comparable to experimental results. First results for such a third kind boundary condition are given in [7], where we have heated a metal foil embedded in epoxy resin by a short eddy current pulse. Instead of the k-space, where the eigenfunctions are, e.g., cosine-functions, new base functions f n were used (mathematically the same functions as in quantum physics for the finite potential well, e.g., Griffiths [19]). t ∞ ( ) Fig. 3 Illustration of the forward process: a system in equilibrium state p eq with mean value at x = 0 is kicked at a time t = 0 with magnitude x 0 to a state p kick far from equilibrium, followed by a dissipative process back to equilibrium. x is a set of reduced variables which captures the information on the work. The arrows connecting p kick at time t = 0, p t at t > 0, and p eq at t → ∞ indicate the tube of trajectories, which is "thin" for macroscopic systems as deviations from the mean values x (t) are small or multiplied by k B and using the definition of the Kullback-Leibler divergence Eq. 1, = mean entropy production (18) In this equation the distribution density p kick is a shifted equilibrium density corresponding to a Hamiltonian H (x − x 0 ). The partition function Z is the same for p kick and p eq , therefore F is zero. More applied work W in Eq. 18 means that the distribution p kick just after the kick is "more distant" from the equilibrium distribution p eq . The equilibrium is the state where all the information about the kick is lost and all the applied work has been dissipated.
A certain time t > 0 after the kick only a part of the applied work has been dissipated and not all the information about the kick magnitude has been lost. To describe a state at a time t, which is usually not an equilibrium state, it is not necessary to know p t for all microscopic variables, but x as a set of reduced variables which captures the information on the work is sufficient. In Fig. 3 the forward process is illustrated and the distributions are sketched. We propose that Eq. 18 can be written in a time-dependent form for all the intermediate non-equilibrium states after the kick with a distribution density p t in a good approximation.
Using the definition of the Kullback-Leibler divergence D and p eq (x) = exp (−β H (x)) /Z , one gets Information loss = k B D := k B (D( p kick || p eq ) − D( p t || p eq )) The entropy production is the total entropy change S minus the entropy flow, which is the negative of the dissipated heat H kick − H p t divided by the temperature. The total entropy change S ≡ S (t) − S kick is the difference in the Shannon entropy of p t and p kick : As p kick is only a "shifted" equilibrium distribution p eq , the two distributions have the same entropy: S kick = S eq . Equation 19 can be also deduced directly from Eq. 12 given by Esposito and Van den Broeck in [13] by taking a time-invariant equilibrium distribution. Then W irr = 0, which is the sum of the entropy production i S = S − Q/T and the information I = −k B D. Q is the heat coming from the heat bath, in our case the negative of the dissipated heat H kick − H p t .
Equation 19 describes that the information I about the kick magnitude decreases and the information loss increases during the evolution of time and is equal to the mean dissipated work divided by the temperature plus the entropy change S. After a long time t the distribution p t converges to the equilibrium distribution p eq and from Eq. 19 one gets Eq. 18 using S = S eq − S kick = 0. This is also true in the linear regime near equilibrium as the shape of the distribution p t and therefore S (t) does not change and is equal to S eq . But also far from equilibrium in a good approximation for all the intermediate states S 1/T H kick − H p t , as for a macroscopic system fluctuations are small compared to the mean value (see Fig. 3 showing a "thin" tube of trajectories). Then the distribution p t has nearly no overlap with p eq and one gets D( p t || p eq ) = p t ln p t dx − p t ln p eq dx ≈ − p t ln p eq dx = β H p t + lnZ .
(21) The entropy term p t ln p t in Eq. 21 can be neglected because for all regions in the phase space where p t is different from zero and which contribute to the integral, p eq is nearly zero and ln p t can be neglected compared to ln p eq . The same approximation in (21) is valid for p t = p kick and therefore the total entropy change S in Eq. 19 can be neglected: Subtracting Eq. 22 from Eq. 18 one gets for the information I (t) about the kick magnitude: I (t) can be identified as the amount of information that needs to be processed to switch from the known equilibrium distribution p eq (no kick) to the distribution p t a time t after the kick [8]. All the information about the kick can be approximated by the mean work, which has not been dissipated yet, divided by the temperature. This is the entropy production according to heat diffusion. After a long time all the energy has been dissipated and no information about the kick magnitude is available. The second approximation in Eq. 23 uses that for a macroscopic system the fluctuations are small and the mean of the Hamiltonian is approximately the Hamiltonian of the mean value x (t) (Fig. 3).