
1 Introduction

One of the most pressing needs in modern financial theory is for more accurate information on the structure and drivers of market dynamics. Previous work on correlations [1] has led to a better understanding of the topological structure of market correlations, and mutual information [2] has been used to extend an earlier notion [3, 4] of a market crash as analogous to the phase transitions studied in physics. These studies are restricted to static market properties in so far as there is no attempt to consider any form of causation. However, one of the goals of econophysics is to gain a better understanding of market dynamics, and the study of the drivers of these dynamics needs to be extended to trying to measure causation. This is extremely difficult: strongly non-linear systems such as financial markets have feedback loops in which the most recent change in the price of equity a influences the price of b, which in turn influences the price of a. This can make extracting causal relationships exceptionally difficult: the empirical distributions need to accurately reflect the temporal order in which price changes in the equities occur, and the time between these changes is itself a stochastic process. The goal of this paper is to introduce a (non-rigorous) heuristic that addresses these concerns using a modification to the conventional definition of the Transfer Entropy (TE), applied to the intraday tick data of the equities that make up the Dow Jones Industrial Average (DJIA) in the tumultuous build-up to the Asian Financial Crisis (AFC) that culminated in the crash of the DJIA on the 27th October 1997. This article is arranged in the following way: Sect. 2 introduces the linear Pearson correlations that I use as a point of comparison with the TE introduced in Sect. 3, and the results are discussed in Sect. 4.

2 Correlations

A statistical process generates a temporal sequence of data \(\mathbf{X}_{t} =\{\ldots,x_{t-1},x_{t}\}\), where X t is a random variable taking possible states S X at time t, x t  ∈ S X , and \(\mathbf{X}_{t}^{k} =\{ x_{t-k},\ldots,x_{t-1}\} \in \{ S_{X}\}^{k}\) is a random variable called the k-lagged history of X t . The marginal probability is p(X t ), the conditional probability of X t given its k-lagged history is \(p(X_{t}\vert \mathbf{X}_{t}^{k})\), and further conditioning on the k-lagged history Y t k of a second process Y t gives \(p(X_{t}\vert \mathbf{X}_{t}^{k},\mathbf{Y}_{t}^{k})\). The Pearson correlation coefficient r between such time series is:

$$\displaystyle\begin{array}{rcl} r_{t}^{k}& =& \frac{\mathrm{cov}(\mathbf{X}_{t}^{k},\mathbf{Y}_{ t}^{k})} {\sigma _{X}\sigma _{Y }} {}\end{array}$$
(2.1)

where cov(⋅ , ⋅ ) is the covariance, σ X and σ Y are the standard deviations, and r t k is calculated over a finite historical window of length k; to capture the dynamics of r t k this window is allowed to slide over the data, updating r t k as t progresses. A key issue when the data arrive at irregular or stochastic time intervals and r t k is desired is what counts as a co-occurrence of new data at time t. The most common method is to bin the data into equally spaced time intervals of length δ t: if two observations x t and y t occur in the interval [t − δ t, t] then x t and y t are said to co-occur at time t. This approach is used for the correlations calculated in this article. Throughout, the change in the log price is the stochastic event of interest: if at time t the price is p t and at a later time t′ it changes to \(p_{t^{{\prime}}}\), then the stochastic observable is \(x_{t^{{\prime}}} =\log (\,p_{t^{{\prime}}}) -\log (\,p_{t})\) [5]. The increment t′ − t may be fixed, in which case it is labelled δ t, or it may vary dynamically; more on this below.
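
To make the binning and sliding-window procedure above concrete, the following is a minimal sketch (not the code used for the results in this article) of how binned log price changes and a sliding r t k might be computed. The synthetic tick data, the bin convention and all variable and function names are illustrative assumptions.

```python
# Sketch: bin log price changes into intervals of width delta_t and compute a
# sliding-window Pearson correlation r_t^k over the binned series.
import numpy as np

def binned_log_returns(ticks, delta_t, day_length=6.5 * 3600):
    """Sum the log price changes falling into each bin of width delta_t.

    `ticks` is a list of (time_in_seconds, price) tuples for one trading day,
    with time measured from the start of trading (an assumed input format).
    """
    n_bins = int(np.ceil(day_length / delta_t))
    returns = np.zeros(n_bins)
    times, prices = zip(*ticks)
    changes = np.diff(np.log(prices))                 # log(p_t') - log(p_t)
    bins = (np.asarray(times[1:]) // delta_t).astype(int)
    np.add.at(returns, np.minimum(bins, n_bins - 1), changes)
    return returns

def sliding_pearson(x, y, k):
    """r_t^k over the k most recent bins, updated as t progresses."""
    r = np.full(len(x), np.nan)
    for t in range(k, len(x)):
        xw, yw = x[t - k:t], y[t - k:t]
        if xw.std() > 0 and yw.std() > 0:
            r[t] = np.corrcoef(xw, yw)[0, 1]
    return r

# Example with synthetic ticks and 30-minute bins (13 bins per 6.5 h day):
rng = np.random.default_rng(0)
def fake_ticks(p0, n):
    t = np.sort(rng.uniform(0, 6.5 * 3600, n))
    p = p0 * np.exp(np.cumsum(0.001 * rng.standard_normal(n)))
    return list(zip(t, p))

x = binned_log_returns(fake_ticks(50.0, 500), delta_t=1800)
y = binned_log_returns(fake_ticks(80.0, 500), delta_t=1800)
print(sliding_pearson(x, y, k=6))
```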

3 Transfer Entropy

Transfer Entropy was developed by Schreiber [6] as a rigorous way of measuring, for arbitrary distributions, the directed transfer of information from one stochastic process to another after accounting for the history of the primary process (see below). It is a natural extension of Granger causality, which is based on covariances rather than information measures and was first introduced by Granger [7] in econometrics; for Gaussian processes, Granger causality and Transfer Entropy are equivalent [8]. Specifically, the entropic measures we are interested in are:

$$\displaystyle\begin{array}{rcl} \mathbf{H}(X_{t})& =& -\mathbf{E}_{p(X_{t})}[\log p(X_{t})],{}\end{array}$$
(2.2)
$$\displaystyle\begin{array}{rcl} \mathbf{H}(X_{t},Y _{t})& =& -\mathbf{E}_{p(X_{t},Y _{t})}[\log p(X_{t},Y _{t})],{}\end{array}$$
(2.3)
$$\displaystyle\begin{array}{rcl} \mathbf{H}(X_{t}\vert \mathbf{X}_{t}^{k})& =& -\mathbf{E}_{p(X_{t},\mathbf{X}_{t}^{k})}[\log p(X_{t}\vert \mathbf{X}_{t}^{k})],{}\end{array}$$
(2.4)
$$\displaystyle\begin{array}{rcl} \mathbf{H}(X_{t}\vert \mathbf{X}_{t}^{k},\mathbf{Y}_{t}^{k})& =& -\mathbf{E}_{p(X_{t},\mathbf{X}_{t}^{k},\mathbf{Y}_{t}^{k})}[\log p(X_{t}\vert \mathbf{X}_{t}^{k},\mathbf{Y}_{t}^{k})],{}\end{array}$$
(2.5)

where \(\mathbf{E}_{p(\cdot )}[\cdot ]\) is the expectation with respect to distribution p(⋅ ). The mutual information between two stochastic time series X t and Y t is:

$$\displaystyle\begin{array}{rcl} \mathbf{I}(\mathbf{X}_{t};\mathbf{Y}_{t})& \equiv & \mathbf{H}(\mathbf{X}_{t}) -\mathbf{H}(\mathbf{X}_{t}\vert \mathbf{Y}_{t})\; =\; \mathbf{H}(\mathbf{Y}_{t}) -\mathbf{H}(\mathbf{Y}_{t}\vert \mathbf{X}_{t}){}\end{array}$$
(2.6)

With a finite data window of length k this is the information-theoretic analogue of r t k. The k-lagged transfer entropy (TE) from the source Y to the target X is:

$$\displaystyle\begin{array}{rcl} \mathbf{T}_{Y \rightarrow X}^{k}& \equiv & \mathbf{H}(X_{ t}\vert \mathbf{X}_{t}^{k}) -\mathbf{H}(X_{ t}\vert \mathbf{X}_{t}^{k},\mathbf{Y}_{ t}^{k}).{}\end{array}$$
(2.7)

T Y → X k measures the degree to which X t is disambiguated by the k-lagged history of Y t beyond the degree to which X t is already disambiguated by its own k-lagged history. This work builds on recent developments in TE [9] and on information-theoretic studies of the ‘critical phenomena’ of markets [2], and adds new results for real systems to the recent success in using TE as a predictive measure of the phase transition in the 2-D Ising model [10]. The TE computations in this work were implemented in Matlab using [11].
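
A minimal plug-in (frequency-count) sketch of the k-lagged TE of Eq. (2.7) for discretised return series is given below; it is not the Matlab implementation of [11], and the sign-quantised synthetic data, the function names and the k = 1 default are assumptions made for the example.

```python
# Sketch: plug-in estimate of T_{Y->X}^k = H(X_t | X_t^k) - H(X_t | X_t^k, Y_t^k)
# for discrete symbol sequences (e.g. binned returns quantised to their sign).
import numpy as np
from collections import Counter

def entropy(counts):
    """Shannon entropy in nits (natural log) of a Counter of observed tuples."""
    n = sum(counts.values())
    p = np.array([c / n for c in counts.values()])
    return float(-np.sum(p * np.log(p)))

def transfer_entropy(target, source, k=1):
    """T_{source -> target}^k via joint frequency counts (plug-in estimator)."""
    triples = [(target[t], tuple(target[t - k:t]), tuple(source[t - k:t]))
               for t in range(k, len(target))]
    xt_xk    = Counter((a, b) for a, b, _ in triples)
    xk       = Counter(b for _, b, _ in triples)
    xt_xk_yk = Counter(triples)
    xk_yk    = Counter((b, c) for _, b, c in triples)
    h_x_given_own  = entropy(xt_xk) - entropy(xk)          # H(X_t | X_t^k)
    h_x_given_both = entropy(xt_xk_yk) - entropy(xk_yk)    # H(X_t | X_t^k, Y_t^k)
    return h_x_given_own - h_x_given_both

# Example: y partially copies the previous value of x, so T_{X->Y} should be
# clearly positive while T_{Y->X} stays near zero (up to estimator bias).
rng = np.random.default_rng(1)
x = np.sign(rng.standard_normal(5000)).astype(int)
y = np.roll(x, 1) * np.sign(rng.standard_normal(5000) + 1.0).astype(int)
print(transfer_entropy(y, x), transfer_entropy(x, y))      # T_{X->Y}, T_{Y->X}
```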

3.1 Transfer Entropy Without Binning

The most common and direct method of calculating any of r t k, I(X t ; Y t ) or T Y → X k is to use discrete time series data. This is made possible either by the nature of the study itself, where discrete time steps are inherent, or through post-processing of the data by binning it into a discrete ordered sequence. However, much interesting data, including intraday financial market data, is inherently unstructured; binning such data loses some of the temporal resolution and obfuscates the relationship between past and future events, making causal relationships difficult to establish. An alternative that addresses these issues is proposed below.

I define a modified form of \(\mathbf{T}_{Y \rightarrow X}^{k}\) by first redefining the stochastic time series in order to capture the continuous nature of the price arrival process. Let \(t,t^{{\prime}} \in \mathbb{R}_{>0}\), where 0 is taken as the start of trading on any given trading day, and let {t i } and {t j ′} be the finite sequences of times at which the (log) price changes for two different equities during that day, with arrival indices \(\{i \leq I\} \in \mathbb{N}\) and \(\{\,j \leq J\} \in \mathbb{N}\) for series of length I and J. This gives two finite sequences of price changes on a single trading day d: {X d(t i )} and \(\{Y ^{d}(t_{j}^{{\prime}})\}\). The entropy of \(\{X^{d}(t_{i})\}\) conditioned on its most recent past value is:

$$\displaystyle\begin{array}{rcl} \mathbf{H}(X^{d}(t_{i})\vert X^{d}(t_{i-1}))& =& -\mathbf{E}_{p(X^{d}(t_{i}),X^{d}(t_{i-1}))}\big[\log p(X^{d}(t_{i})\vert X^{d}(t_{i-1}))\big],\;\;i > 1.{}\end{array}$$
(2.8)

An equivalent definition for the entropy conditioned on the most recent past of both {X d(t i )} and \(\{Y ^{d}(t_{j}^{{\prime}})\}\) is:

$$\displaystyle\begin{array}{rcl} \mathbf{H}(X^{d}(t_{i})\vert X^{d}(t_{i-1}),Y ^{d}(t_{j-1}^{{\prime}}))& =& -\mathbf{E}_{p(X^{d}(t_{i}),X^{d}(t_{i-1}),Y ^{d}(t_{j-1}^{{\prime}}))}\big[\log p(X^{d}(t_{i})\vert X^{d}(t_{i-1}),Y ^{d}(t_{j-1}^{{\prime}}))\big]{}\end{array}$$
(2.9)

where i, j > 1 and, for a given t i , t j−1 ′ is the source arrival time that minimises \((t_{i} - t_{j-1}^{{\prime}})\) subject to \((t_{i} - t_{j-1}^{{\prime}}) > 0\), i.e. the most recent price change in the source series strictly before t i . This modified definition of the TE (for the rest of this article simply referred to as the TE) is:

$$\displaystyle\begin{array}{rcl} \mathbf{\overline{T}}_{Y ^{d}\rightarrow X^{d}}& \equiv & \mathbf{H}(X^{d}(t_{ i})\vert X^{d}(t_{ i-1})) -\mathbf{H}(X^{d}(t_{ i})\vert X^{d}(t_{ i-1}),Y ^{d}(t_{ j-1}^{{\prime}})).{}\end{array}$$
(2.10)

The relationship between this and other measures is illustrated in Fig. 2.1. The first row shows the log price changes for two equities (Alcoa and Boeing) as stochastic time series with an irregular arrival rate; the black arrows indicate the direction and magnitude of the log price changes. The second row shows the price changes binned into time intervals of width δ t, so that changes occurring in the same time interval are considered co-occurring. The third row illustrates the lag-1 Pearson correlation or lag-1 mutual information: the causal direction of correlations is implicit in the time ordering of the bins, hence the arrows point forward in time; this does not account for the shared signal between x t−1 and y t−1. The fourth row shows the lag-1 Granger causality or transfer entropy: the signal driving y t is x t−1 after excluding the common driving factor of y’s past, y t−1. Red arrows indicate the measured signal from the source (Alcoa) to the target (Boeing) and blue arrows indicate y’s own signal that is being conditioned out. The fifth row (with fewer price changes shown for clarity) illustrates an alternative way to calculate the TE: choose the target time series (in this case Boeing), condition out the most recent previous price change in Boeing, and use only the most recent change in Alcoa as the source signal. Note that some Alcoa price signals are missed, some are used more than once, and price changes will rarely co-occur exactly.

Fig. 2.1  A representation of different measures of ‘instantaneous’ and ‘lagged’ relationships between stochastic time series data

The definition of Eq. (2.10) has a number of appealing properties, because it avoids the following shortcomings of the binning approach:

  • Using a fixed interval in which the price at the beginning is compared with the price at the end of the interval conflates signals that may occur before or after another signal but arrive during the same binning interval, thereby mixing future and past events in the measured relationships between bins.

  • Similarly, multiple price changes within δ t may net to zero change and so some price signals are missed.

  • As bin sizes get smaller they are less statistically reliable as fewer events occur within each bin, equally as bin sizes get larger there are fewer bins per day, thereby also reducing the statistical reliability.

  • Over the period of a single (6.5 h) trading day, the total number of bins is 13 bins/day for δ t = 30 min and 390 bins/day for δ t = 1 min, whereas the raw data may have 50–5000+ price changes in a day.

The proposed heuristic for the TE introduced above addresses some of these shortcomings, though not without introducing some issues of its own. First, it always conditions out the most recent price change in the target equity (Boeing in Fig. 2.1) and so uses every piece of relevant information in the target time series. It also uses the most recent price change from the source time series; however, it will sometimes miss some price changes or count the same price change more than once (see the bottom of Fig. 2.1). This is desirable if we are interested in the most recent price signals, as is the case in financial markets. It also reflects the dynamical nature of the time series: because the inter-arrival times may vary from day to day or between equities, no δ t needs to be defined, and the heuristic always uses only the most recent information in both the source and the target time series. The most significant shortcoming is that this TE assumes no information is carried by the inter-arrival time interval, and it is not clear that all of the theoretical foundations on which the original TE is based necessarily hold; from this point of view this method of calculating the TE is currently only a heuristic, and the results presented here are, for the moment, qualitative in nature.
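
A minimal sketch of how the event-based heuristic of Eq. (2.10) might be estimated on one day of tick data is given below. The plug-in entropy estimator, the sign quantisation of the price changes and all names are illustrative assumptions rather than the implementation used for the results that follow.

```python
# Sketch of Eq. (2.10): for every target price change X(t_i), i > 1, take the
# target's own most recent previous change X(t_{i-1}) and the most recent
# source change Y(t') with t' < t_i, then form a plug-in TE estimate from the
# resulting (X(t_i), X(t_{i-1}), Y(t')) triples. No binning interval is needed.
import numpy as np
from collections import Counter

def entropy(counts):
    n = sum(counts.values())
    p = np.array([c / n for c in counts.values()])
    return float(-np.sum(p * np.log(p)))

def event_transfer_entropy(t_x, dx, t_y, dy):
    """Modified TE from source Y to target X for one trading day.

    t_x, t_y : sorted arrival times of the target / source price changes.
    dx, dy   : the corresponding (quantised) log price changes.
    """
    triples = []
    for i in range(1, len(t_x)):
        j = np.searchsorted(t_y, t_x[i]) - 1     # most recent Y change before t_i
        if j < 0:
            continue                             # no source change yet that day
        triples.append((dx[i], dx[i - 1], dy[j]))
    xt_x1   = Counter((a, b) for a, b, _ in triples)
    x1      = Counter(b for _, b, _ in triples)
    xt_x1_y = Counter(triples)
    x1_y    = Counter((b, c) for _, b, c in triples)
    h_own  = entropy(xt_x1) - entropy(x1)        # H(X(t_i) | X(t_{i-1}))
    h_both = entropy(xt_x1_y) - entropy(x1_y)    # H(X(t_i) | X(t_{i-1}), Y(t'))
    return h_own - h_both

# Example with synthetic, irregularly spaced ticks quantised to their sign;
# for unrelated series the result should be small (plug-in bias aside).
rng = np.random.default_rng(2)
t_x, t_y = (np.sort(rng.uniform(0, 6.5 * 3600, n)) for n in (600, 800))
dx, dy = (np.sign(rng.standard_normal(n)).astype(int) for n in (600, 800))
print(event_transfer_entropy(t_x, dx, t_y, dy))
```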

4 Empirical Results

The AFC began in Thailand in July 1997 with the devaluation of the Thai currency (the Baht), and the crisis rapidly spread throughout South East Asia, ultimately resulting in the October 27 “mini-crash” of the DJIA, which lost around 7 % on the day, at the time the largest single-day points drop on record for the DJIA; for a review of the crisis see [12] and the top plot of Fig. 2.2. Note that the entropy measurements shown illustrate that some care needs to be taken when comparing simple systems with data from real ‘complex systems’: the increase in the entropy of the DJIA on the 24th of June looks like what might be described as a ‘first order’ phase transition as studied in complex systems [13], but it is almost certainly caused by the rescaling of price increments on the New York Stock Exchange.

Fig. 2.2  The AFC and its key components. Top plot: the AFC is thought to have begun when the Baht was allowed to float against the US dollar on the 2nd of July 1997. The crisis contagion spread through the Asian markets, ultimately leading to the mini-crash of the DJIA on the 27th October 1997. Bottom plot: on the 24th of June 1997 the New York Stock Exchange changed its minimum incremental buy/sell price from 1∕8th of a dollar to 1∕16th of a dollar, causing the entropy of the price changes to shift suddenly and permanently but not influencing the DJIA index itself. The crash on the 27th October 1997 is seen as the second largest peak in the entropy, the largest being on the 28th of October

This rescaling did, however, have an interesting impact on the TE, as can be seen in Fig. 2.3. Prior to the 24th of June there is considerable structure in the TE measure (warm colours denote high TE values, cooler colours denote lower TE values); immediately after this date all signals drop off significantly, although much of the structured signal eventually returns (not shown). The most notable signals are equities that act as targets of TE for multiple other equities, seen as yellow vertical strips indicating that many equities act as relatively strong sources of TE for a single equity: AT&T (equity 26), Walmart (equity 30) and McDonald’s (equity 31) stand out in this respect. Notable single sources of TE are less obvious, but Coca-Cola and AT&T (equities 19 and 26) show some coherent signals, indicated by multiple red points loosely forming a horizontal line. It is intriguing that the Pearson correlations showed no similar shift on the 24th of June (not shown), while conversely, at the mini-crash on the 27th October 1997 (day 64 in Fig. 2.4) there is a clear signal that the DJIA equities are significantly more correlated with no corresponding increase in the TE on that day (not shown), despite the general turmoil of the markets, as seen in the significant fluctuations of the correlations on nearby days.

Fig. 2.3  Top: the TE from one DJIA equity to another, with equities indexed from 1 to 31; index 1 = the DJIA, the vertical axis is the source equity and the horizontal axis is the target equity. The 24th of June 1997 clearly stands out as the first day of a substantive reduction in the TE between equities. Bottom: the Pearson correlation for the DJIA data binned using δ t = 30 min. The market crash on the 27th October stands out during a turbulent time in the market’s dynamics

Fig. 2.4  Maximum (blue, lower line, measured on the left axis) versus average (orange, upper line, measured on the right axis) TE. Shuffled averages of TE ≃ 0.02 are shown as dashed lines: blue dashed for the left axis and orange dashed for the right axis

Figure 2.4 shows that the average TE across all equities is quite stable except for the drop occurring at the time of the change in minimum price increments on the 24th June. A simple shuffling test [14] estimates the TE for unrelated data to be approximately 0.02 nits on average (see the dashed lines; the shuffled data were randomly sampled before and after the drop on day 61), but note that numerical estimation of the TE is difficult and the estimated TE sometimes drops below zero. This suggests that on average the TE across the DJIA is close to negligible, but some equities clearly have TE values significantly exceeding the 0.02 nit level, as shown by the blue line. The largest peak in the maximum TE plot occurs 6 days after the DJIA crash and is from the Disney equity to the McDonald’s equity.
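
One common form of such a shuffling test is sketched below: the source price changes are randomly permuted, destroying any temporal coupling with the target while preserving the marginal distribution, and the TE is re-estimated to give a baseline for unrelated data. The estimator is passed in as a callable (for example the `event_transfer_entropy` sketch of Sect. 3.1); this is an illustrative assumption and not necessarily the exact test of [14].

```python
# Sketch: estimate the TE expected for unrelated data by shuffling the source
# price changes and re-running the TE estimator many times.
import numpy as np

def shuffled_te_baseline(te_estimator, t_x, dx, t_y, dy, n_shuffles=100, seed=0):
    """Mean and spread of the TE after destroying the source-target coupling."""
    rng = np.random.default_rng(seed)
    samples = [te_estimator(t_x, dx, t_y, rng.permutation(dy))
               for _ in range(n_shuffles)]
    return float(np.mean(samples)), float(np.std(samples))

# Usage (with the event_transfer_entropy sketch from Sect. 3.1):
#   baseline, spread = shuffled_te_baseline(event_transfer_entropy, t_x, dx, t_y, dy)
#   measured = event_transfer_entropy(t_x, dx, t_y, dy)
#   significant = measured > baseline + 2 * spread
```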

Finally, Fig. 2.5 plots two networks of relationships between the equities, one based on Pearson correlations and one on TE. The Pearson correlation network is ordered counterclockwise according to the total link weight of each equity, and a link was included if its correlation was greater than 0.4. The TE network is also ordered counterclockwise by total link weight; the node colour represents the total weight of incoming links, the node size represents the total weight of outgoing links, and a link was included if its TE was greater than 0.05 nits. The thresholds were chosen such that 10 % of all links in each network are included. The most notable difference between these networks is the change in the relative importance of the individual equities. The overall DJIA index (DJI) is significantly correlated with the other equities, whereas it is the least significant node in the TE network. Similarly, Walmart (WMT) is very well connected in the TE network but is the least relevant node in the Pearson correlation network.
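
A sketch of the thresholding step used to build such networks is given below: a link is kept only if its weight exceeds a cutoff chosen so that roughly 10 % of all possible links survive, and node in/out strengths are computed for colouring and sizing. The orientation convention (rows as TE sources, columns as targets) and the `te_matrix` example are assumptions for illustration, not the code behind Fig. 2.5.

```python
# Sketch: build a thresholded network from a matrix of pairwise weights
# (Pearson correlations or TE values), keeping the strongest ~10% of links.
import numpy as np

def threshold_network(weights, keep_fraction=0.10):
    """Boolean adjacency keeping the top `keep_fraction` of off-diagonal links."""
    w = weights.astype(float).copy()
    np.fill_diagonal(w, np.nan)                        # ignore self-links
    cutoff = np.quantile(w[~np.isnan(w)], 1.0 - keep_fraction)
    adjacency = np.nan_to_num(w, nan=-np.inf) > cutoff
    return adjacency, cutoff

def node_strengths(weights, adjacency):
    """Total outgoing / incoming link weight per node (weights[i, j] = i -> j)."""
    kept = np.where(adjacency, weights, 0.0)
    return kept.sum(axis=1), kept.sum(axis=0)          # out-strength, in-strength

# Example with a hypothetical 31 x 31 matrix of TE values:
rng = np.random.default_rng(3)
te_matrix = rng.exponential(0.02, size=(31, 31))
adj, cutoff = threshold_network(te_matrix)
out_s, in_s = node_strengths(te_matrix, adj)
print(cutoff, adj.sum())                               # ~10% of the 930 possible links
```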

Fig. 2.5  The Pearson correlation network (left, undirected links) compared with the TE network (right, directed links) for a typical trading day (16th June 1997)

These are preliminary results using the comparatively small dataset of the 30 equities that make up the DJIA, and they will need to be confirmed on other indices and other crashes. One very significant point comes out of this study: the driver of correlations between equities in financial markets is not necessarily the changes in the prices of other equities. This is true in the sense that changes in transfer entropy may leave correlations unchanged, and changes in correlations are not necessarily driven by changes in transfer entropy. The former is a consequence of the top plot of Fig. 2.3 (the plots showing the lack of change in correlations are omitted due to space limitations); the latter is a consequence of the lower plot of Fig. 2.3 for the Asian crisis crash (the plots showing the lack of change in transfer entropy are not shown). However, in the case of the Asian crash the transfer entropy peaked significantly several days after the crash, although the significance of this is not clear from the data. This result is not peculiar to trading days on which known ‘significant’ events have occurred: Fig. 2.5 shows an ordinary trading day in which the DJIA index plays a significant role in the correlation structure (left plot) but this relationship vanishes in the transfer entropy structure (right plot); compare, for example, the position of Walmart (WMT) in the two plots. In fact there appears to be very little relationship between strongly correlated equities and those that ‘transfer’ high values of entropy.

One of the goals of this work was to explore the analogy between phase transitions in statistical physics and market crashes in finance. Recent work on precursors to phase transitions in physics has shown that a peak in a global measure of TE acts as a precursor [10], so it is interesting that peaks in Pearson correlations are not necessarily coincident with peaks in TE for financial markets, suggesting that it is not the transfer of entropy between equities within the DJIA that is driving the correlations but some signal external to the market. The results in [10] suggest that if the DJIA mini-crash were analogous to the second-order phase transition in the Ising model then peaks in the pairwise TE, mutual information and Pearson correlation [15] would all be observed at the crash. However, in this and earlier studies only peaks in Pearson correlations and mutual information have so far been established during a market crash, a result that requires verification and opens up a number of interesting questions for further work.