1 Introduction

The fact that probabilistic inference methods break down in situations of complexity is well known and has been discussed extensively in the literature (e.g., Weaver, 1948; von Hayek, 1967; Zadeh, 1962, 1969; Malik, 1996). However, these discussions are rather anecdotal (e.g., von Hayek, 1967; Churchman, 1961) or are often interpreted as if the central cause of this inability were a lack of sufficient data and/or computational power (Seising, 2012; Weaver himself pinned his hope on the power of digital computers to solve problems of complexity). Indeed, the literature makes it seem as though a perfect computer exploiting big data could transcend those all-too-human limitations (Zin et al., 2019). In this paper, we argue that the problem is far deeper than a mere lack of power or a paucity of data. Rather, we argue that it stems from a widely discussed philosophical conundrum: the problem of induction. Specifically, we hope both to demonstrate that the failure of probabilistic inference methods in complex situations is a particularly interesting case of this philosophical problem and to argue that this problem belies the hope that more raw computational power and richer or more precise data can save us from states of deep uncertainty, which are particularly present in situations of high complexity. This, we believe, is necessary stage-setting so that the problem of complexity in banking and elsewhere can be thought about properly.

To begin: in philosophy, the problem of induction has been widely debated since at least Hume. Roughly, the problem is that there is no principled philosophical justification that allows one to infer from the fact that x has always y-ed (e.g., the sun has always risen) to the claim that x will y at a future point (e.g., the sun will rise tomorrow). Many potential solutions have been proposed, including: a radical rethinking of the nature of causality so that it is grounded in the inherent features of the objects in question (e.g., Cartwright, 2013); a reliance on nomic regularities that link the initial event and the outcome together (e.g., Davidson, 1967); a pragmatic shrug-of-the-shoulders (Hume's own reply); and so on. However, much of this discussion in the philosophical literature has focused on what should be paradigmatic examples of causality. By contrast, our focus is on the use of probabilistic methods that attempt to "tame chance" (Hacking, 2006). Specifically, our paper follows several theorists who have argued that probabilistic reasoning is neither useful nor successful in the realm of so-called organized complexity (Weaver, 1948). However, it has been shown that these thinkers' discussions are often afflicted with vagueness (e.g., Gershenson, 2008; Byrne & Callaghan, 2014; Hoffmann, 2017). Ergo, the central aim of our paper is to supplement and enrich these discussions by (a) explicitly deducing the limits of probability theory from philosophical considerations and (b) verifying that this deduction holds for the measurement of complex risks in financial systems. Elaborating on financial systems makes sense for three main reasons:

(1) A paradigm from a complexity lens: Modern financial systems are more interconnected and complex than many other (economic) systems (Bookstaber et al., 2015: 152; Acharya & Yorulmazer, 2008; Allen & Gale, 2000).

(2) Special status: Whereas risk management in organizations in other industries usually focuses on the negative—threats and failures rather than opportunities and successes (Kaplan & Mikes, 2012: 50)—it plays a decisively different, namely a broader, more central, and more sophisticated, role in financial institutions: in the financial universe, risk and return are two sides of the same coin. Absorbing, repackaging, and passing on risks to others (with different levels of intensity, to be sure) is the core of their business model (Bessis, 2010: 38). Accordingly, the effective management of risks is central to a financial institution's performance (Saunders & Cornett, 2010: 182).

(3) Alleged expertise: Financial institutions and intermediaries have been regarded as the world's processors of risk par excellence and as leaders in the use of risk assessment and management techniques (Pergler & Freeman, 2008: 13; Mehta et al., 2012). Yet numerous clear examples of financial risks that could not have been prevented due to the problem of induction, such as the financial and economic crisis that erupted in 2008 or the failure of LTCM (Lowenstein, 2002), show that these same institutions have also been the worst hit by such eruptions (or simply by reality). This highlights shortcomings in their methodologies and suggests that too much emphasis has been placed on the deceptively precise outputs of statistical measures and probabilistic models—"the hubris of spurious precision" (Rebonato, 2007: xi). Financial institutions have not managed to act as the specialists in risk measurement and management they are supposed to be (Saunders & Cornett, 2010: 18).

Should our argument prove correct, it will have severe implications for business practice, since probability theory would then lose its claim to be capable of guiding the practical management of complex risks in banking and other financial institutions.

In Sect. 2, we delineate the philosophical problem of induction and briefly review its development in a systematic (not historical) manner. We do this because "[n]owhere is the problem of induction more relevant than in the world of trading [in the branch of financial economics, more generally; C.H.]—and nowhere has it been as ignored" (Taleb, 2007: 116; cf. also Spitznagel, 2013: 98, 178, 243; von Mises, 1949/1998: 31).Footnote 1 In Sect. 3, we introduce an axiomatic and formal version of probability theory. In Sect. 4, we discuss attempts to apply probability to complex systems as well as common criticisms of them. In Sect. 5, we discuss more sophisticated models that are thought to avoid the application problems voiced in Sect. 4. We argue that these, too, fall short. In Sect. 6, we develop our central argument at length. Specifically, we reason that the real source of the breakdown of probability in the face of complexity emerges as a conceptual correlate of the problem of induction. Indeed, we find that in situations of organized and dynamic complexity, especially, Hume et al.'s seemingly 'academic' worries take on a concrete reality. Moreover, we also demonstrate that this way of thinking about the failure of probability is far deeper and more articulate than worries about data or a lack of computational power. Section 7 closes the paper by linking this conceptual critique with objections to the use of probability theory in risk management in banking. In addition, we include an appendix that, on the one hand, clarifies the sorts of systems one might be interested in as well as how these relate to complexity, organization, and predictable behavior. On the other hand, it details the characteristics of financial systems and banking in terms of complexity.

2 Philosophical Roots of the Problem of Induction: Some Preliminaries

Arguably, David Hume's greatest single contribution to the philosophy of science has been the so-called problem of induction (1739-40/1888).Footnote 2 At a first pass, and at a minimum, induction concerns inferences whose conclusions are not validly entailed by the premises (Hájek & Hall, 2002: 149). However, notice that this minimalist definition would imply that all non-deductive inferences—from random guesswork to careful empirical studies—are instances of the same reasoning pattern. Ergo, this net is cast far too wide. To home in on the form of problematic induction that interests us, let us briefly consider some more specific versions of the problem.

To begin from an intuitive place, it is clear that one form of induction is what is referred to as enumerative induction or universal inference. This is, basically, an inference from particular instances:

\(a_1, a_2, \ldots, a_n\) are all Fs (a certain predicate) that are also G (another predicate),


to a general law or principle

All Fs are G.

Notice that this form of enumerative induction is problematic, as it faces many counterexamples. For example, prior to 1697 AD, a common example of induction ran as follows: I observe a swan around here (i.e., in Europe) and it is white; I observe another swan around here and it is white; etc. I then conclude "All swans have the property of whiteness." However, Australian (black) swans clearly refute this inference. Given this problem with induction, a new problem of induction emerges (this has some overlap with Goodman (1955) and his discussion of the new riddle of induction). This new problem of induction is narrow in that it does not focus on induction as such. Rather, it begins by assuming that induction is already in good working order. Given this assumption, the new problem is finding some criterion that distinguishes a good inductive inference (e.g., copper conducts electricity) from a flawed one (e.g., all swans are white).Footnote 3

Though this new problem of induction is certainly interesting, it is not our focus, for two reasons. First, the new problem simply assumes that induction is in order. Pace this, part of what we show in this paper is that in domains of organized complexity, it is most certainly not in good working order. Second, by presupposing induction, the new riddle of induction avoids further specification of what constitutes a specifically inductive inference. In turn, this, again, lends itself to the thought that inductive inferences are simply all and only non-deductive inferences (as Rudolf Carnap suggested).

As suggested, we find this dichotomy insufficient; let us therefore attempt to characterize inductive inferences further. Fortunately, though inductive inferences are not easily characterized, we do have lucid indicators of what marks off an inference as an instance of induction. Specifically, inductive inferences are (a) "contingent, deductive inferences are necessary" (Vickers, 2014)Footnote 4; and induction is (b) ampliative, i.e., it "can amplify and generalize our experience, broaden and deepen our empirical knowledge", whereas deduction is explicative, i.e., it "orders and rearranges our knowledge without adding to its content" (ibid.). Though these features fall well short of a proper definition of induction, they will suffice to guide our analysis.

Accordingly, the original, or old, problem of induction can now be explicated a bit. To wit, the problem of induction concerns "the support or justification of inductive methods" (Vickers, 2014)—i.e., methods that predict or infer, from past experiences, features of the future. Thus, and in Hume's words, the problem of induction concerns what justifies our assumption that "instances of which we have had no experience resemble those of which we have had experience" (Hume, 1739–40/1888: 89), i.e., that nature continues always uniformly the same. In turn, this demand for justification gives rise to a deep dilemma. On the one hand, the claim that nature is uniform, the Uniformity Principle (UP), "cannot be proved deductively, for it is contingent, and only necessary truths can be proved deductively" (Vickers, 2014). On the other, uniformity cannot "be supported inductively—by arguing that it has always or usually been reliable in the past—for that would beg the question" (ibid.). In sum, for this paper, the old problem of induction concerns the lack of justification for assuming uniformity in the behavior of nature. As it were, we have no principled reason to assume the future will be like the past. More formally, Hume's argument can be reconstructed as follows (following Henderson, 2018, where premises are labeled as P, and subconclusions and conclusions as C):

P1. There are only two kinds of arguments: demonstrative and probable (Hume’s fork).

P2. Inductive inferences presuppose the UP.


1st horn:

P3. A demonstrative argument establishes a conclusion whose negation is a contradiction.

P4. The negation of the UP is not a contradiction.

C1. There is no demonstrative argument for the UP (by P3 and P4).


2nd horn:

P5. Any probable argument for UP presupposes UP.

P6. An argument for a principle may not presuppose the same principle (Non-circularity).

C2. There is no probable argument for the UP (by P5 and P6).

C3. There is no argument for the UP (by P1, C1 and C2).


Consequences:

P7. If there is no argument for the UP, there is no chain of reasoning from the premises to the conclusion of any inference that presupposes the UP.

C4. There is no chain of reasoning from the premises to the conclusion of inductive inferences (by P2, C3 and P7).

P8. If there is no chain of reasoning from the premises to the conclusion of inductive inferences, the inferences are not justified.

C5. Inductive inferences are not justified (by C4 and P8).

Many different replies to the different problems of induction have been brought forward in the literature, and it would be beyond the scope of the present study to restate and evaluate them. Two concise comments shall suffice at this point. On the one hand, there is an extended debate revolving around probability and induction, a point we return to briefly in the next paragraph.Footnote 5 On the other hand, one of the most influential and controversial views on the problem of induction has been that of Karl Popper, who argued that induction has no place in the logic of science (Popper, 1959).Footnote 6

In sum, we focus on the justificatory aspect of the old problem of induction. In other words, we focus on what grounds or reasons can be adduced to warrant the inference from "the x's have always y-ed" to "the x's will y tomorrow." Notice, to presage our discussion a bit, that one very likely candidate for doing this justificatory work seems to be probability: e.g., (early) Peirce (1878/1992: 155–169) or, later on, Reichenbach (1949) push in exactly this direction. Indeed, it seems as if the analytic status and precision of probability can offer a powerful justification for inductive inferences. As it were, I can calculate that, given enough time, the relative frequency of even rolls of a fair, six-sided die will converge to ½. And I am justified in this inference exactly because of the mathematical structure of probability theory itself.
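
To make the convergence claim concrete, the following minimal simulation (our own illustration, not drawn from the cited literature) tracks the relative frequency of even outcomes over repeated rolls of a fair six-sided die; the frequency settles near ½, the value the probability calculus assigns in advance.

```python
import random

def even_frequency(n_rolls: int, seed: int = 42) -> float:
    """Relative frequency of even outcomes in n_rolls of a simulated fair six-sided die."""
    rng = random.Random(seed)
    evens = sum(1 for _ in range(n_rolls) if rng.randint(1, 6) % 2 == 0)
    return evens / n_rolls

for n in (10, 100, 10_000, 1_000_000):
    print(f"{n:>9} rolls: frequency of even outcomes = {even_frequency(n):.4f}")
# The printed frequencies approach 0.5, the value dictated by the
# mathematical structure of the probability model itself.
```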

3 Probability Theory in a Nutshell

The Latin root of probability is a combination of probare, which means to test, to prove, or to approve, and the suffix -ilis, which means able to be: something like "worthy of approbation" (Bernstein, 1996: 48). Probability has always carried this double meaning, one looking into the future, the other interpreting the past; one concerned with our opinions and beliefs (the subjective interpretation of probabilities), the other concerned with what we actually know or with frequencies in the real world (the objective concept) (Hájek, 2019; Hacking, 2006: Chpt. 2; Jaynes, 2003: 39). But even though the interpretation of probability is an important foundational problem (ibid.) and even though the inadequacy of frequency interpretations of probability for many situations in risk management deserves appreciation (Rebonato, 2007),Footnote 7 it plays no major role in motivating and developing the central argument given in Sect. 6. Nor is probability theory in some strict sense the object of our criticism. This is exactly because we focus on what could be termed the pure logical syntax of probability theory, without any interpretation per se. In other words, what we refer to as "probability theory" is the branch of mathematics and the standard formalism concerned with probability as axiomatized by Kolmogorov (1933) (cf. Schneider, 1988: Chapter 8). To be sure, Kolmogorov's axiomatization, which we present shortly, has achieved the status of orthodoxy for discourses on risk (McNeil et al., 2005: 1f.), and it is typically what economists, risk experts, and others, including the present authors, have in mind when they think of "probability theory".Footnote 8

Following Kolmogorov (1933) and the outline by Hájek (2019), let Ω be a non-empty set (‘the universal set’). A field (or algebra) on Ω is a set F of subsets of Ω that has Ω as a member, and that is closed under complementation (with respect to Ω) and union. Let P be a function from F to the real numbers obeying:

1. (Non-negativity) P(A) ≥ 0, for all A ∈ F.

2. (Normalization) P(Ω) = 1.

3. (Finite additivity) P(A ∪ B) = P(A) + P(B) for all A, B ∈ F such that A ∩ B = ∅.

Call P a probability function, the bearers of probabilities "events", "sentences (of a formal language)", or "outcomes", and (Ω, F, P) a probability space. The assumption that P is defined on a field guarantees that these axioms are non-vacuously instantiated, as are the various theorems following from them (ibid.).

In the words of Hájek (2019), let us now strengthen our closure assumptions regarding F, requiring it to be closed under complementation and countable union; it is then called a sigma field (or \(\sigma \)-algebra) on Ω:Footnote 9

3′. (Countable additivity) If \(A_1, A_2, A_3, \ldots\) is a countably infinite sequence of (pairwise) disjoint sets, each of which is an element of F, then

$$P \left(\bigcup_{n=1}^{\infty }{A}_{n}\right) = \sum_{n=1}^{\infty }P \left({A}_{n}\right)$$

The non-negativity and normalization axioms establish, by and large, a convention for measurement, although it is non-trivial that probability functions take at least the two values 0 and 1, and that they have a maximal value (unlike various other measures, such as length, volume, and so on, which are unbounded). It is now easy to see that the theory applies to various familiar cases. For example, we may represent the results of throwing a single fair die once by the set Ω = {1, 2, 3, 4, 5, 6}, and let F be the set of all subsets of Ω. Under the natural assignment of probabilities to members of F, we obtain such welcome results as P ({6}) = 1/6, P (even) = P ({2} ∪ {4} ∪ {6}) = 3/6, P (odd or less than 4) = P (odd) + P (less than 4) – P (odd ∩ less than 4) = 1/2 + 1/2 − 2/6 = 4/6, and so on (ibid.).
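
For readers who prefer to see the die example executed, the following minimal sketch (our own illustration, not part of Hájek's exposition) represents Ω, takes F to be its power set, defines the natural uniform probability function, and reproduces the values just cited.

```python
from fractions import Fraction
from itertools import chain, combinations

# Sample space for one throw of a fair die.
omega = frozenset({1, 2, 3, 4, 5, 6})

def power_set(s):
    """All subsets of s; here this plays the role of the field F."""
    items = list(s)
    return [frozenset(c) for c in
            chain.from_iterable(combinations(items, r) for r in range(len(items) + 1))]

F = power_set(omega)

def P(event: frozenset) -> Fraction:
    """Natural (uniform) assignment: |A| / |Omega| for any A in F."""
    return Fraction(len(event), len(omega))

even, odd, less_than_4 = frozenset({2, 4, 6}), frozenset({1, 3, 5}), frozenset({1, 2, 3})

print(P(frozenset({6})))                               # 1/6
print(P(even))                                         # 1/2 (i.e., 3/6)
# P(odd or less than 4) via inclusion-exclusion, as in the text:
print(P(odd) + P(less_than_4) - P(odd & less_than_4))  # 2/3 (i.e., 4/6)

# Spot-check the axioms: normalization and finite additivity on disjoint events.
assert P(omega) == 1
assert P(even | frozenset({1})) == P(even) + P(frozenset({1}))
```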

To the extent that Kolmogorov's first two axioms are simply matters of convention (which is not undisputed),Footnote 10 they are not controversial. In any case, the third postulate ('finite/countable additivity') is seen as the most contentious and the most debated, and justifying it is rightly regarded as one of the main achievements of Bayesian scholarship (Bradley, 2017).Footnote 11 However, none of these axioms is the target of our central argument. Given that our argument does not turn on possible axiomatizations of probability theory, we hereby adopt the Kolmogorovian axioms as an elegantly abstract and logically precise formulation. However, circumspect readers should keep in mind that our scope is broader than this.

4 Probability, Complexity, and Common Problems

With the nuts and bolts of probability in view, we now turn both to attempts to deploy it in complex situations (e.g., Gilboa, 2009) and to standard criticisms of these deployments. First, we lay out a fairly typical view of how one can employ probability in complex situations. Second, we examine several criticisms of this application. We should emphasize that most critics of the use of probability in complex systems (and we ourselves) do not argue that probability-based risk models are invalid (in terms of their deductive reasoning). Rather, we argue that modeling complex risks lies outside the scope of classical probability theory. In a nutshell, the problem lies with the application, not with the theory itself.

To begin, let us bring the problem of complexity into view. Specifically, let us consider the contrast between two categorically different situations. One of these is a dice game with Pascal. Given that the event space for the dice is known, given that we can assign the probabilities correctly (e.g., that the distribution of probabilities is in part guided by the prior uniformity of the event space), etc., the use of probability makes perfect sense. I know, e.g., that Pascal's bet on snake eyes is not likely to win, as the chances of rolling two dice so that each shows a one are 1/36. By contrast, consider a rather different situation. Ted wants to figure out how likely it is that Jane will enjoy the meal he is cooking for her. In marked contrast to our dice with Monsieur Pascal, we simply have no idea how to begin to partition the possible event space, assign values, and so on. Given this fairly obvious difference, trying to make an analogy between these two cases commits what Taleb has called "the ludic fallacy" (Taleb, 2007: 309), wherein we confuse games of chance with complex worldly systems (cf. also Gilboa, 2009). And, clearly, the financial world is more complex than either dice or dinner.

Given these intuitive cases, it must nonetheless be noted that the number of different possible applications of probability theory is, not surprisingly, much larger than the variety of playgrounds where dice, coins, urns, playing cards, etc. are found. Specifically, in addition to games of chance, which constitute a key application of probability theory (Weaver, 1963), there are other fields or types of problems to which probability theory reasonably applies and which have been classified as disorganized and complex: disorganized complexity is represented by systems with very large numbers of variables and high degrees of randomness (Weaver, 1948: 537f.). Interest in systems of this sort began in the late nineteenth century with the investigation of systems representing the motions of gas molecules in a closed space. This is the realm where the methods of statistical mechanics are applicable, emanating from the work of Ludwig Boltzmann and Josiah W. Gibbs (see Table 1 in our Appendix for a perspicuous overview of different sorts of systems).

However, this use of probability, licensed by the theory of complex and disorganized systems, has been extended illegitimately to an application of probability to complex systems altogether, which raises a host of problems (Weaver, 1948). Given our paper's focus, we examine one proposed, and allegedly predestined, application of probability, namely that of risk management in banking, which, at the same time, is a classic example of dynamic or organized complexity (see Table 2 in the Appendix). The purpose of probabilistic risk management is not necessarily to determine "the correct" probability distribution (where it might be unclear what that even means), but to make decision-guiding forecasts of the likely losses that would be incurred for a variety of risks the bank faces (Rebonato, 2007: 43, 107f.): "it is vital to remain mindful that it is the forecast that matters" (Brose et al., 2014: 340). The choice of risk metrics has been called the cornerstone of risk management in this connection (Stulz, 2008). In other words, a goal of risk management via probability theory is to find "better" probability distributions, wherein "better" means leading to more reliable and useful predictions for use in risk management.Footnote 12 The application of probability to complexity in the shape of risk management can be undertaken in several ways, including risk measures such as Value at Risk, Expected Shortfall, etc. (cf. e.g., Jorion, 1997; McNeil et al., 2005). Critically, however, for these applications to count as applications of probability, they must be grounded in probability theory (Malz, 2011). In other words, a risk model must be a probability model. This entails two things:

a) Risk models rely on formal languages that are often drawn from Kolmogorov's (1933) axiomatization of probability. Indeed, this and its accompanying theory provide the lingua franca for describing risks, which often corresponds to the standard view of risk (McNeil et al., 2005: 1f.).

b) Risk models are probabilistic in the sense that they represent financial quantities such as profits and losses with random variables whose values are described by probability distributions (Pergler & Freeman, 2008: 3; Rebonato, 2007: 148). That quantities are expressed by random variables entails that the quantities are governed by (stable) randomness, corresponding to functions on the probability space (see below).Footnote 13

Probability distributions are thus fundamental to risk models. In a purely formal sense, a probability distribution is a function that satisfies Kolmogorov's axioms of probability (Frigg et al., 2014: 5). Thus, there seem, at least at first glance, to be ways to extend probability to cover complex situations. Given this, and the above, it is also clear that one extension of probability models is to risk models, designed explicitly to cope with complex situations—and that modern financial systems are truly complex, and not just disorganized and complex, is unquestioned (Hoffmann, 2017; see also Table 2 in the Appendix). Before finding out whether this is a legitimate application, let us examine how the probability distributions for real-world risk modeling situations are constructed. Intuitively, to apply probability to Ted's question of whether Jane will like his meal, we need to construct a probability distribution of, e.g., Jane's gustatory taste. Similarly, when we want to use a risk model in banking, we need to construct a probability distribution. Broadly speaking, three different ways to do this have been proposed (Mark & Krishna, 2014: 40f.; Rebonato, 2007: 148f., 180; Embrechts, 2000: 449f.):

1) Historical method: The distribution of the losses is simply constructed by 'brute force', i.e., by using the empirical distribution (frequencies) of past returns or losses. For example, and following Rebonato (2007: 148f.), imagine that we have 1000 observations of changes in the S&P index and, therefore, 1000 ranked hypothetical profits and losses. Let us take the tenth-worst hypothetical outcome, and let us assume that it was − $20 million (ibid.). We have 990 better (more favorable) outcomes and 9 worse outcomes (ibid.). By definition, we have just built from our sample an estimate of the true 99th percentile of the population (ibid.). Since the historical method appears to make use of no free parameters, "it is often referred to as a nonparametric approach" (ibid.: 148). Yet an extremely important parameter is used, namely the length of the statistical record, or the length of the so-called 'statistical window' to be considered in the analysis (ibid.). This implicitly determines what constitutes the relevant past, and does so in a binary manner—fully relevant and included in the data set, or totally irrelevant and outside the boundaries of our data set (ibid.).

2) Parametric techniques, including the 'Variance–Covariance Approach' (Mark & Krishna, 2014: 40), the 'Empirical Fitting Approach' (Rebonato, 2007: 150f.), and the 'Fundamental Fitting Approach' (ibid.: 152f.): These techniques assume that the market, or the distribution of profits and losses, obeys a simple statistical formula or is a member of one of the many families of distribution functions one finds in statistics books. For example, we take our 1000 observations of changes in the S&P index again, but this time map them onto the elegant Gaussian distribution, so that only two parameters are then needed to describe the returns: its mean and its variance (or standard deviation).

3) Monte Carlo simulation method: This method randomly generates market factors such as interest rates and calculates the expected return on, e.g., the portfolio. The procedure is repeated many times (potentially hundreds of thousands of times). Ultimately, the resultant set of portfolio values forms a distribution from which risk measures such as Value at Risk can be calculated. Notice that this method is particularly useful for sampling the tails, provided that we have strong ex ante reason to believe that the probability distribution is of a certain type and that we know its parameters. (A minimal code sketch of all three construction routes follows this list.)
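
The following sketch is our own schematic illustration of the three construction routes just listed, using synthetic data and simplifying assumptions (i.i.d. returns; a Gaussian model in the parametric and Monte Carlo variants); it is not the procedure of any particular bank or of the cited authors.

```python
import numpy as np
from statistics import NormalDist

rng = np.random.default_rng(0)
# Hypothetical stand-in for 1000 observed daily index returns (fat-tailed by construction).
returns = rng.standard_t(df=3, size=1000) * 0.01

def var_historical(r, level=0.99):
    """Historical method: read the loss quantile straight off the empirical record."""
    return float(np.quantile(-r, level))

def var_parametric_normal(r, level=0.99):
    """Parametric method: fit a Gaussian (mean, standard deviation) and use its quantile."""
    mu, sigma = r.mean(), r.std(ddof=1)
    return float(-mu + sigma * NormalDist().inv_cdf(level))

def var_monte_carlo(r, level=0.99, n_sims=100_000):
    """Monte Carlo method: resimulate returns from an assumed model (here: normal)."""
    mu, sigma = r.mean(), r.std(ddof=1)
    simulated = rng.normal(mu, sigma, size=n_sims)
    return float(np.quantile(-simulated, level))

print("99% VaR, historical :", round(var_historical(returns), 4))
print("99% VaR, parametric :", round(var_parametric_normal(returns), 4))
print("99% VaR, Monte Carlo:", round(var_monte_carlo(returns), 4))
# The three figures need not agree: each route embeds its own assumptions about
# which past is relevant and which distributional family governs the future.
```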

Given this construction, it may seem as if we can apply risk models to complex systems like banking without further ado. However, each proposal has some critical weaknesses. For example, Rebonato (2007: 157f.) points out that in common situations ex ante information on the type of the probability distribution in question is not available or is of dubious validity (especially in the tails): "What we do have is a finite number of actual data points, painstakingly collected in the real world. On the basis of these data points I can choose an acceptable distribution and its most likely parameters given the data […]. This approach may well do a good job at describing the body of the distribution (where the points I have collected are plentiful); but, when it comes to the rare events that the timorous risk manager is concerned about, the precision in determining the shape of the tails is ultimately dictated, again, by the quantity and quality of my real data [emphasis added]." (Rebonato, 2007: 157f.). But there is worse news. There is a fatal trade-off between the increase in accuracy that comes from learning more precisely what the relevant conditions are and the loss of estimation power that arises from the progressive loss of pertinent data. For example, if we wish to describe the magnitude of bank losses by means of a distribution, then we can take data from the last 100 years, which include long periods of boring and predictable banking—beautifully illustrated by the activities of the Bailey Brothers' Building and Loan in the classic It's a Wonderful Life. Or we can refer to the last 40 years, when modern financial systems came into existence, when in the wake of the Vietnam War and the oil price shocks of 1974 and 1979 inflation and interest rates in the money market rose significantly (Admati & Hellwig, 2013: 53) and deregulation of financial markets set in (e.g., the so-called 'Big Bang' in 1986).Footnote 14 The results would be very different. Moreover, what constitutes the relevant past is not 'contained in the data'. Consequently, there is also an assumption, implicit or explicit, about what constitutes relevance.
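
A minimal, entirely stylized sketch of this window problem (our own illustration with synthetic data): the same historical estimator applied to the full record versus only the most recent, more turbulent stretch of the same series yields very different loss figures, and nothing in the data itself says which window is the 'relevant' one.

```python
import numpy as np

rng = np.random.default_rng(1)
# Synthetic daily losses: a long calm regime followed by a shorter turbulent one.
calm = rng.normal(0.0, 0.5, size=15_000)      # stand-in for the 'boring' decades
turbulent = rng.normal(0.0, 2.5, size=6_000)  # stand-in for the modern era
losses = np.concatenate([calm, turbulent])

def loss_percentile_99(window):
    """Historical 99th-percentile loss over whichever statistical window we choose."""
    return float(np.quantile(window, 0.99))

print("99th percentile loss, full record        :", round(loss_percentile_99(losses), 2))
print("99th percentile loss, recent window only :", round(loss_percentile_99(losses[-6_000:]), 2))
# The two estimates diverge sharply; the (implicit) choice of window, not the
# data themselves, decides which figure we report.
```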

In turn, these weaknesses can engender at least two reactions. One of them is to accept that probability theory cannot be applied to risk in banking. This would entail that one abandons the use of risk models. However, many theorists think that this reaction is both an over-reaction and that it throws the baby out with the bathwater. Another reaction is to try to locate the source of the weaknesses and remove them. It is one such proposal that we now turn to.

5 Fat-Tails and Big Problems

Recall that in the last section we discussed the attempt to apply probability to banking via risk models and argued that this application faces substantial problems. One prima facie way to avoid these problems and continue to use risk models is to abandon the assumption that the probability distribution we construct for such cases should be 'normal' or thin-tailed. In turn, one might contend that more sophisticated risk models can cope with the above worries. And indeed, many scholars (Hagel, 2013; Newman, 2013; Helbing, 2010; Mandelbrot & Taleb, 2010; Sornette, 2009) think that we should instead rely on, e.g., 'power laws', 'Pareto distributions', or 'Zipf's law', or, more generally, so-called 'heavy-tailed distributions', which are allegedly better suited for describing many fundamental quantities such as stock and real asset prices.Footnote 15 First, we lay out how these more sophisticated models work. Second, however, we argue that they still face a fundamental problem. In turn, this means the increase in sophistication has not addressed the underlying issue. This sets the stage for our presentation of the real problem in Sect. 6.

To begin, advocates of non-standard distributions note that a key problem with using standard distributions is that they systematically and dramatically underestimate extreme risks and black swan events (such as those realized in the financial crisis, which crescendoed in 2008; see, e.g., Taleb, 2007). They further believe that, once this problem is solved, the application of risk models to banking is no longer problematic. To see the difference, Fig. 1 juxtaposes the 'old and denounced' normal distributionFootnote 16 with a 'new and powerful' power law distribution.

Fig. 1 Heavy/fat and long versus thin and short tails: Under the normal distribution (the Gaussian distribution or bell curve), the probability of values lying more than a few standard deviations away from the mean is practically zero. Insofar, the choice of a bell curve may not be appropriate when a significant fraction of outliers—values which lie many standard deviations away from the mean—is to be expected. A tricky problem, as Taleb (2013: 9) states, is that fat-tailed probability distributions can masquerade as thin-tailed ones.
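
To give the contrast in Fig. 1 a rough numerical face, the following sketch (our own illustration; the Student-t with three degrees of freedom merely stands in for 'some fat-tailed alternative') compares tail probabilities of a standard normal and a fat-tailed distribution several units from the mean.

```python
from statistics import NormalDist
from scipy.stats import t

normal = NormalDist()   # standard normal (mean 0, standard deviation 1)
fat = t(df=3)           # fat-tailed stand-in: Student-t with 3 degrees of freedom

for k in (2, 4, 6, 8):
    p_normal = 1 - normal.cdf(k)   # P(X > k) under the bell curve
    p_fat = fat.sf(k)              # P(X > k) under the fat-tailed alternative
    print(f"P(X > {k}): normal = {p_normal:.2e}, fat-tailed = {p_fat:.2e}")
# A few units out, the normal tail is practically zero, while the fat-tailed
# alternative still assigns non-negligible probability to such outcomes.
```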

Adding to this contrasting juxtaposition, Fig. 2 shows that actual empirical data, i.e., an empirical return distribution, conform poorly to a prespecified theoretical normal distribution (cf. also Lux, 1998). (Obviously, the exact form of an empirical return distribution depends on the asset type and on the data frequency; Söderlind, 2014.Footnote 17 In other words, different data sets are approximated to different degrees by a given prespecified theoretical distribution.)

Fig. 2 Normal distributions are not the (new) norm(al) (Source: Söderlind, 2014): A QQ plot of daily S&P 500 returns (the American stock market index Standard & Poor's 500), from 1957:Q1–2014:Q3. In general, a QQ plot is a way to assess whether an empirical distribution matches a prespecified theoretical distribution reasonably well; for instance, a normal distribution (or a normally distributed continuous random variable N) whose mean (µ) and variance (σ²) have been estimated from the data. Each point in the QQ plot shows a specific percentile (quantile) according to the empirical as well as according to the theoretical distribution. For instance, if the 0.02 percentile is at − 10 in the empirical distribution, but at only − 3 in the theoretical distribution, then this shows that the two distributions have fairly different left tails. The visual inspection of a histogram can also give useful clues for assessing the normality of returns.

In particular, this so-called Q–Q plot (quantile–quantile plot) indicates that the two distributions have fairly different left tails, which means that the frequency of actual extreme losses is not well described by the tail of a bell curve. This might be seen as grist for the mill of power law proponents.
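
In the spirit of the QQ plot just described, the following sketch (our own illustration, run on synthetic fat-tailed 'returns' rather than the S&P 500 data underlying Fig. 2) compares a few empirical percentiles with those of a normal distribution fitted to the same data; the discrepancy in the far left tail is the telltale sign.

```python
import numpy as np
from statistics import NormalDist

rng = np.random.default_rng(7)
# Synthetic stand-in for daily returns with heavier tails than a Gaussian.
returns = rng.standard_t(df=3, size=20_000) * 0.01

# Fit a normal distribution to the same data (estimate mu and sigma).
fitted = NormalDist(mu=float(returns.mean()), sigma=float(returns.std(ddof=1)))

for q in (0.001, 0.01, 0.05, 0.50, 0.99):
    empirical = float(np.quantile(returns, q))
    theoretical = fitted.inv_cdf(q)
    print(f"quantile {q:>5}: empirical = {empirical:+.4f}, fitted normal = {theoretical:+.4f}")
# The extreme left-tail quantiles of the data sit well below those of the fitted
# normal distribution; this is exactly the mismatch a QQ plot makes visible.
```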

Given this increase in sophistication, one might think that the problem of deploying probability theory in complex systems has been solved. Indeed, in the words of the mathematician Walter Willinger and his colleagues: “[t]he presence of [power law] distributions in data obtained from complex natural or engineered systems should be considered the norm rather than the exception” (Mitchell, 2009: 269) and so a method designed explicitly for such situations should solve, or at least take a step towards solving, the underlying problem.

However, notice that this account simply assumes (rather than establishes) that 'fat tail' analysis leads to better predictions. And we have, as yet, been given no reason to accept this claim. Indeed, the triumphalist account of 'fat tails' voiced above rests on the unjustified assertion (*): namely, that we now have a better (i.e., more reliable or trustworthy) or the right (?) extrapolation rule at hand; or, in the end, even an underlying theory of how bodies in the financial system behave and interact. Again, we simply have not been given a reason to accept this rather bold presupposition.

Indeed, the following statement by Sornette (2009: 5), who embraces the abandonment of 'normal' distributions in favor of wilder ones, feeds the doubt that the search for the one right rule or theory is tantamount to a never-ending task: "[I]n a significant number of complex systems, extreme events are even 'wilder' than predicted by the extrapolation of the power law distributions in their tail" (cf. also Spitznagel, 2013: 244). Moreover, the physicist and network scientist Cosma Shalizi offered a less polite phrasing of his sentiments: "Our tendency to hallucinate power laws is a disgrace" (Mitchell, 2009: 254). Stumpf & Porter (2012) reinforce this stance and contend that most reported power laws lack statistical support and mechanistic backing: "[…] a statistically sound power law is no evidence of universality without a concrete underlying theory to support it" (ibid.: 666). Renn (2008: 16) had already made the fundamental observation that the interactions between human activities and consequences are more complex, sophisticated, or unique than the average probabilities used in technical risk analyses are able to capture. Alan Greenspan conceded "that nobody can predict the future when people are involved. Human behavior hasn't changed, people are unpredictable." (Cited in Sornette, 2003: 321). Churchman (1961: 164) draws the distinction that in the case of the gambler, such a theory [in the context of his discussion: a 'well-substantiated' theory of the manner in which draws of various types occur for a given reference class] may be available, whereas in the case of the executive [metaphorically speaking of real-world businesses and risk management], there is no such theory.Footnote 18 Von Hayek (1967: 30) makes a very similar point.Footnote 19 And, finally, if we attempt to retreat from the 'right' distribution to simply a 'better' one, as defined in Sect. 4, we are faced with an odd epistemic situation. To wit, we may simply have no idea why one distribution is better for some specific case. In turn, this means we have no reliable way to determine if, when, how, and how far the better distribution can be applied to other cases (or even to the same case, if it changes enough).

All of this is to say that the assertion (*), that fat tail analysis and the real behavior of markets align, is, as it stands, unjustified, since it has not yet sufficiently accounted for the challenges posed by the complexity of modern financial systems (see Table 2). To sum up in a more abstract manner, consider the following:

While Mandelbrot (1997) and other scholars have taken a step in the right direction by addressing some

  • non-linearities (e.g., the fatter tail syndrome) and

  • some dependencies (e.g., conspicuous and strong symptoms of non-independence of price changes or other time series),


important aspects of complexity have been neglected:

  • real dynamics or unstable time series—complex systems that evolve or change over time—which do not entitle us to pick a specific probability function, because we have no good reason to prefer one to another, and

  • other dependencies such as feedback in open systems—self-similarity and invariance in distribution with respect to changes of time and space scale do not indicate systems whose parts interact and that exchange material, information, and/or energy with their environment across a boundary.Footnote 20

This finding can be packaged in a resounding, but in this form still under-determined and merely suggestive, research statement (which is why we refer to it as a conjecture, not as a fully-fledged proposition):

There is no good theory of how bodies in the financial system behave and interact and, therefore, a scientific basis for proclaiming the accuracy, suitability, and commonality of power law distributions is missing.

From the perspective of dynamic complexity, it is not power laws per se that are important, but the presence of underlying high variability and dynamics (Alderson & Doyle, 2010: 849). Moreover, the foundational and epistemological problem of identifying and utilizing the 'right' (or else providing an objective sense of 'better') probability distribution is not, and perhaps cannot be, solved (Sect. 6). All in all, 'power law' proponents, like other probabilists, seem to be just poking around in the dark. Trying to predict future (extreme) losses with structurally wrong models is like trying to predict the visibility of a comet with an Aristotelian model. For Aristotle, a comet was an atmospheric event and, because of this, an Aristotelian model would utilize the wrong parameters, idealizations, and so on, as it simply misunderstands what a comet is. And no amount of fancy math, more data (against the background of a flawed theoretical assumption), and so on, could help here. Similarly, the use of such risk models for complex and organized systems simply misunderstands the nature of these systems (see Table 1). Given this misunderstanding, no fancy math, no additional parameters, no amount of data, and so on, can help here either. In short, without prior (and correct) knowledge of the targeted system, the use of fancier math, attempts to add parameters, and so on, are doomed to miss their mark. Comets are not atmospheric events, and risk in the financial system is not akin to risk when betting on dice.

Before moving on, let us take stock. We have discussed the syntactic structure of axiomatic probability theory. We have brought into focus a range of possible applications. We argued that one such application, risk models in banking, suffers from some weaknesses. We have examined attempts to fix this by making the mathematical machinery more complicated. However, we argued that this more complicated machinery does not fix the problem. At this point, the reader might suspect that the problem is not that our math isn't fancy enough or that our data aren't rich enough. And, indeed, the next section tries to systematically articulate this suspicion.

6 The Central Argument Against Using Probability Theory for Financial Risk Management

In this section, we show that the determination of a probability (or loss) distribution for risk management purposes in banking is not possible for either standard or fat-tail style analysis. This is due to a foundational epistemological problem of induction concerning deep doxastic uncertainty, which, in turn, stems from the organized complexity characterizing modern financial systems (see Table 2). Moreover, should this argument prove valid and sound, it will also show that Hume's worries about induction, at least in this instance, are far from mere 'academic' skepticism.

To begin, let us assume that the application of probability theory to a situation requires the partitioning of an event space. This assumption is simply built into the Kolmogorovian axiomatization of probability we sketched in Sect. 3. Given this, the partition can be made on either a priori or a posteriori grounds where “a priori” means that the event space is already given (e.g., dice) and “a posteriori” means that the event space is derived from empirical evidence (e.g., Bernoulli, 1713a, b/1988: 65). Let us examine each in turn.

If the event space can be partitioned a priori, then the use of probability is perfectly licit and the application of probability to the target raises no issues. However, as Taleb has stressed, the assumption that we have access to such a priori partitions for almost any real-world complex and organized system rests on a flawed analogy between these systems and, e.g., games of chance (e.g., Taleb, 2007: 309). Such an analogy is flawed for at least three reasons. First, unlike in simple games, it is impossible to know all the parameters involved, especially given that modern financial systems are organizationally complex, i.e., consist of a plethora of parts and elements that are tightly coupled and in close interrelation (Hoffmann, 2017; Admati & Hellwig, 2013: 89, 76; Williams, 2010). Second, unlike in simple games, every (risk) model developed invariably extrapolates from data of the past to the future (Heinemann, 2014: 194). In particular, there are basically three ways of building a probability distribution for some financial random variable (see Sect. 4), which depend mainly on hindsight and which are thus based on 'empirics' (Stulz, 2008). In other words, the methodology of risk management in banking relies on inductive inferences. Third, even granting that complex financial systems have a set of probability distributions, we have no epistemic access to it, unlike in simple games (Taleb & Pilpel, 2004). This is partly because making sense of and understanding the behavior of complex systems is hardly possible, since they are non-linear and have too many interdependent variables (e.g., Churchman, 1961). Thus, it seems that a priori partitioning is not viable.

If the event space is a posteriori, then the goal should be to derive the proper (or at least a better) partition of an event space from some empirical record. However, this derivation faces at least five different objections. The first and second objections were mentioned in our introduction. To wit, the data we have are woefully incomplete, spotty, sketchy, and so on, and our computational power is deeply inadequate. However, as we also noted, these two objections are contingent, and one may retain the hope that, eventually, we (or perhaps some super AI) can overcome these flaws. In contrast, the next three objections undermine this hope.

The third objection begins with a well-established point from the philosophy of science. To wit, a given hypothesis is always underdetermined by (finite) data (e.g., Duhem, 1991: 185–218; Quine, 1951).Footnote 21 This is partly because the testing of a hypothesis relies on the broader theory's architecture, complete with auxiliary hypotheses, devices for idealization, pre-set parameters, and so on. Given this, any number of these background assumptions can be tweaked so that recalcitrant data can be accommodated. For example, one response to the perturbation of Mercury's orbit was not to alter Newton's theory, but instead to posit a non-existent planet, Vulcan, whose presence was supposed to cause the perturbation, and then to argue that the telescopes simply weren't powerful enough yet to detect Vulcan. Moreover, a structurally similar move had earlier led astronomers who noticed a perturbation in the orbit of Uranus to posit, and later discover, Neptune. It is also partly because one key purpose of a theory and its hypotheses (and indeed of risk management) is to go beyond the extant data and allow for predictions. For us, there are two key lessons to be learned from this. First, a theory only confronts evidence against the background of a host of complex assumptions, idealizations, and so on, and we can always tweak these and save the theory (or increase the theory's power!) with respect to this evidence. Second, a good theory does not simply cope with data but gives us means to go beyond it.

Given this, by analogy, a probability distribution is under-determined by the finite data it is applied to, for similar reasons. For example, how I partition the event space is not something I can ‘read off’ from the data as I can partition it in any number of ways. Returning to Ted’s dinner, and his dining experiences with Jane, he might think Jane’s taste is important. However, he might equally well take atmosphere, conversation, comfort, Jane’s day, liberal amounts of wine, etc., as critical. And nothing in Jane’s past dining behavior determines which is correct. Moreover, it is simply not enough that we come up with a distribution that merely copes with the data. Rather, we need this partition to make reliable predictions. However, without some idea of why some given distribution makes reliable predictions for this case, we have no reason to expect it to work outside the case at issue. Indeed, if the case undergoes fairly drastic changes (as complex organized systems are wont to do), we have no reason to expect it works for a future version of it either.

Following on from the third objection, the fourth objection focuses on the data itself. Let us make the bold assumption that the data we have will always be finite as, if for no other reason, the universe 'banged' into being at a certain point. If this is so, then underdetermination is not a feature we can overcome by accruing more and more data. Indeed, the insistence that 'more data' can overcome the problem is a promissory note that cannot be cashed.

Finally, fifth, and most powerfully, one key assumption behind the attempt to derive a posteriori a probability distribution for complex organized systems like financial markets is that their past behavior gives us some handle on their future behavior. Indeed, the entire hope of such a derivation is that obtaining some probability distribution will enable prediction. However, first, this runs right into Hume's objection against induction. To reiterate, we have been given no reason to grant this assumption—i.e., that the future will be like the past. Second, relatedly but worse, empirical studies of the behavior of complex organized systems give us solid empirical evidence to reject it (see our Appendix, Tables 1 and 2). Indeed, non-linear dynamic systems might be fully deterministic and yet in principle unpredictable, as the number of interdependent variables belies any hope that we can compress the behavior of the system into some simple and tractable form. As is often said, the simplest description of such a complex system is often the system itself. And this undermines the hope that we can develop some way to 'outrun' what it is doing. Here, indeed, we have good reason to expect past behavior not to be like future behavior. Moreover, if this is so, even trying to find a 'better' distribution is misguided, as 'better' also presupposes that what the system did before will be like what it does in the future. And Hume laughs.

6.1 The Ramifications for Risk Models

Thus, we have given arguments that an a priori partition of the event space is not viable for real-world, complex and organized systems like the global financial markets. And we have proffered five objections to the attempt to derive such a partition, three of which we take to be in-principle epistemic objections. If these arguments (or even only some subset of them) are valid and sound, then it is clear that the application of probability to risk management is deeply problematic.

Moreover, objection five, in particular, is an instance of Hume's argument concerning induction adumbrated in Sect. 2. In both cases, what we have established is that inferring from the past behavior of a target to its future behavior is not justified. However, whereas Hume's problem is 'academic' in that it applies to everything from the sun rising tomorrow to sophisticated inferences about copper and electricity, ours is more focused and specific. Precisely, this is partly because the banking system is an organized and complex system, whereas in situations of disorganized complexity we find systems with very large numbers of variables and high degrees of (genuine) randomness, so that the law of large numbers, the Central Limit Theorem, and other so-called "properties of stochasticity" (1966: 604) hold. By contrast, in the organized and complex system of risks in banking, any inference from past behavior is illicit, and Hume's worries are not 'merely academic.' If this is so, our argument has clear ramifications for the use of risk models in the banking system. Specifically:

  • Local: The suitability of a certain probability distribution for describing some past and future profit and loss data as well as for a given risk management purpose cannot be determined.

  • Global: Thus, any choice of a particular probability distribution for a given risk management purpose is necessarily arbitrary and cannot be justified. In other words, no compelling reason or ground can be given that can both (a) assure us that the distribution we claim for the past behavior of a target is the right one and (b) ensure that a projection of the target's past behavior into the future will be 'enough alike' to allow for predictions.

7 Conclusion

In order to draw inferences from data as described by econometric texts, it is necessary to make whimsical assumptions […]. The haphazard way we individually and collectively study the fragility of inferences leaves most of us unconvinced that any inference is believable.

(Edward E. Leamer, 1983)

Our findings and conclusions seem to be in line with the results reported by many mathematicians (in mathematical finance)—in contrast to some applied scientists and practitioners (involved in the broader quantitative finance scene): "[…] The [Financial] Crisis [of 2007–09] saw numerous examples where 'practice' fully misunderstood the conditions under which some mathematical concepts or results could be applied. Or indeed where models were applied to totally insufficient or badly understood data; the typical 'garbage in, garbage out' syndrome." (Das et al., 2013: 702; cf. also Riedel, 2013: 23f., 33, 52, 180; Jaynes, 2003: 65f., 74; Daníelsson et al., 2001: 3). In light of our central argument, however, they unfortunately phrased their concerns in too reluctant a manner.

In fact, our global conclusion is different from, and stronger than, this charge of abusing mathematical tools. The real issue is not that we have 'garbage' data, or that if we had more computational power we could make predictions, etc. Rather, the problem is an inherent feature of complex systems themselves. Given their very nature, we have no reason to assume that their behavior will evolve in such a way that their future will be like (i.e., predictable in some set abstract way from) their past. Moreover, we have no way to establish and justify that our understanding of a system's past behavior is correct.

It is important to see, though, that the correctness of our argument does not depend on problems of complex risk assessment being organized or dynamically complex and escaping statistical analysis—otherwise, the reasoning would be circular. Rather, we have simply emphasized and specified the link between complexity and the inadequacy of probability-based modeling of risks. Our central argument has several repercussions.

First and foremost, it exposes the uselessness of probability-based models of predominantly extreme risks, as the differences between probability distributions are less significant in the body (see Fig. 1). Put differently, when we push the logic to the extreme, we clearly reveal a wrong track of risk modeling in today's world, namely one which cannot succeed by normal statistical means. The output of risk measures like Value at Risk, for example, is a value for a loss of money, the magnitude of which depends on which loss or probability distribution is assumed. Probability distributions are not directly observable, and as long as we have no systematic way of drawing a line between the 'good' and the 'bad' distributions, we should not rely on our calculations of minimum or expected losses, etc., when making provisions for the future (Frigg et al., 2013: 487). We are aware that there exists an established body of literature on precisely this question of where to draw the line, and it is argued in that literature that scoring rules or "epistemic utility" measures (as they are sometimes called in philosophy) give us the answer. Yet we showed that, conditional on the settings depicted by the central argument, and excluding, for example, situations of disorganized complexity, probabilistic forecasts are unreliable and do not provide a good guide for action as such (ibid.: 490). In other words, our central argument contends that the issue is not one of finding better distributions; the problem lies with the very idea that complex organized systems work in the way risk management assumes they do.
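
As a final, purely illustrative sketch (our own, with synthetic data and an arbitrarily chosen fat-tailed alternative): the same loss record yields markedly different extreme Value-at-Risk figures depending on whether a normal or a Student-t distribution is assumed, which is precisely the arbitrariness the central argument points to.

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(3)
# Synthetic daily profit-and-loss record (hypothetical, in millions).
pnl = rng.standard_t(df=3, size=2_000) * 2.0
losses = -pnl
level = 0.999

# Assumption A: losses are normally distributed.
mu, sigma = losses.mean(), losses.std(ddof=1)
var_normal = stats.norm(loc=mu, scale=sigma).ppf(level)

# Assumption B: losses follow a fitted Student-t distribution.
df_, loc_, scale_ = stats.t.fit(losses)
var_t = stats.t(df_, loc=loc_, scale=scale_).ppf(level)

print(f"99.9% VaR under the normal assumption   : {var_normal:.2f}")
print(f"99.9% VaR under the Student-t assumption: {var_t:.2f}")
# Same data, two 'reasonable' distributional assumptions, two very different
# provisions for extreme losses; nothing in the data itself adjudicates.
```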

And second, this entire discussion of probability distributions would have remained completely theoretical if it were not occurring against the background of risk management practices in the financial industry. Any real-life risk calculations which hinge on knowledge about probability distributions are suspect. This covers many aspects of finance in banking, such as trading and asset or portfolio management, and does not exclude recent or future developments of financial markets and banking, such as the development of algorithmic trading, sometimes linked to the promise of making markets more stable (Li et al., 2019; Georges & Pereira, 2019). And since it can be very expensive to use the 'wrong' probability distribution (consider, for example, the collapse of the hedge fund LTCM; cf. Lowenstein, 2002), which according to the central argument we cannot separate from a suitable one, it is finally time to conclude:

There is no point in drawing on probability theory for organizationally complex risk management purposes.

Our central argument is valid, and the rationale behind the premises ought to ensure that they are correct (or at least very plausible) and that, therefore, the whole argument is sound. In that case, the house of modern risk management, in which probability theory and probability distributions are constitutive elements, is not in danger of collapsing, but rather has never been well established, since it lacks a theoretical foundation.

Probabilistic reasoning is not worthless, not even in terms of its application to the real world (as opposed to gambling games of various sorts and textbook cases); but, instead of using it because we are convinced that it gets things right, we (should) use it because it is so very convenient, or maybe even the only tool we currently have. In other words, it is time, at the risk of tiresome repetition, to use it with caution, deliberation, and restraint; the latter especially when facing organized and dynamic complexity (cf. also Peters & Gell-Mann, 2016). In sum, Hume's pragmatic shrug may be the best we can do. However, we must always keep in mind that a shrug is not a justification. Even more, risk management professionals should be shaken out of their 'dogmatic slumber'.