Eliciting decision weights by adapting de Finetti’s betting-odds method to prospect theory
DOI: 10.1007/s11166-007-9011-z
- Cite this article as:
- Diecidue, E., Wakker, P.P. & Zeelenberg, M. J Risk Uncertainty (2007) 34: 179. doi:10.1007/s11166-007-9011-z
Abstract
This paper extends de Finetti’s betting-odds method for assessing subjective beliefs to ambiguous events. Thus, a tractable manner for measuring decision weights under ambiguity is obtained. De Finetti’s method is so transparent that decision makers can evaluate the relevant tradeoffs in complex situations. The resulting data can easily be analyzed, using nonparametric techniques. Our extension is implemented in an experiment on predicting next-day’s performance of the Dow Jones and Nikkei stock indexes, where we test the existence and nature of rank dependence, finding usual patterns. We also find violations of rank dependence.
Keywords
Ambiguity · Prospect theory · Rank-dependent utility · Inverse-S · Pessimism · Optimism
JEL Classification
D81 · C60
The importance of decision making with unknown probabilities has been emphasized by Keynes (1921), Knight (1921), and many after. In most of our decisions we face uncertainties about relevant future events. The probabilities of those events are rarely known, and we usually have to act on our best beliefs and subjective assessments. Keynes and Knight stressed that it is important to distinguish between subjective uncertainties and objective probabilities.
Although Knight called subjective uncertainties unmeasurable, Borel (1924), Ramsey (1931), and de Finetti (1931) demonstrated soon after that subjective uncertainties can be measured after all, at least in principle. De Finetti proposed a betting-odds system for quantifying subjective uncertainties that has been widely used ever since (Winkler 1972). In its simplest form, it implies that the subjective probability of an event is p if any betting odds more favorable than p:1−p are accepted, and any betting odds less favorable are declined.
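The betting-odds rule just described can be illustrated with a minimal sketch. It assumes a risk-neutral agent (linear utility, as in de Finetti's setting) who is offered a ticket that pays 1 if the event occurs and costs a price q, so the odds are q:1−q. The function names `accepts_ticket` and `elicit_probability`, and the bisection search over prices, are illustrative assumptions and not part of the paper's procedure.

```python
def accepts_ticket(subjective_p: float, price: float) -> bool:
    """Hypothetical risk-neutral agent with linear utility.

    The ticket pays 1 if the event occurs and costs `price`,
    so its expected net gain is subjective_p * 1 - price.
    """
    return subjective_p - price > 0


def elicit_probability(accept_fn, tol: float = 1e-6) -> float:
    """Bisection over ticket prices (an illustrative search strategy).

    Prices below the subjective probability are accepted and prices
    above it are declined, so the switching price identifies it.
    """
    lo, hi = 0.0, 1.0
    while hi - lo > tol:
        mid = (lo + hi) / 2
        if accept_fn(mid):
            lo = mid   # still favorable: the probability is above mid
        else:
            hi = mid   # no longer favorable: the probability is at most mid
    return (lo + hi) / 2


# Usage: an agent whose subjective probability is 0.3
estimate = elicit_probability(lambda q: accepts_ticket(0.3, q))
```

The sketch only covers the expected-utility case; the point of the paper is that such elicited numbers become rank-dependent decision weights once expected utility is relaxed.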
De Finetti’s system, and its applications up to now, have been based on the Bayesian model of expected utility. They are distorted by the many violations of expected utility that have been found empirically (Camerer and Weber 1992; Starmer 2000), and that have hampered wider applications. This paper adapts de Finetti’s system to rank-dependent utility (Schmeidler 1989) and prospect theory (Luce and Fishburn 1991; Tversky and Kahneman 1992), which can account for many violations of expected utility.
Another restriction of de Finetti’s betting-odds system is that it assumes linear utility. This assumption is commonly adopted in studies of belief elicitation (Nyarko and Schotter 2002). Under expected utility, also assumed by most modern studies on belief elicitation, linear utility implies risk neutrality. Risk neutrality is very unconvincing for large stakes, where there is pronounced risk aversion and where utility must be concave. It is more plausible, but still problematic, for moderate stakes as considered in our experiment. Empirical studies then still find considerable risk aversion. This may explain why de Finetti’s method has not been used more widely in the economics literature. Our study will maintain de Finetti’s assumption of linear utility, but will relax his assumption of expected utility. Violations of risk neutrality for moderate stakes, also found in our data, can then be explained by factors other than nonlinear utility. These alternative explanations are more convincing (Rabin 2000). Thus, we disentangle linear utility and risk neutrality, and resolve the major restriction of de Finetti’s betting-odds system. For further comments, see the discussion section.
The general estimation of nonadditive decision weights under uncertainty from data, when no probabilities need to be given so that weights are not transforms thereof, is very complex, even if utility is known. It involves many unknown parameters that quickly become intractable for large state spaces. De Finetti’s method, however, can still give tractable measurements for nonadditive decision weights. In his clever design, the resulting equalities are analytically tractable because unknowns conveniently drop from equations, allowing for nonparametric analyses. This will be demonstrated in Section 3.
We will use the new version of prospect theory (Tversky and Kahneman 1992). It corrects some theoretical problems of original prospect theory (Kahneman and Tversky 1979), using Quiggin’s (1981) rank-dependent probability weighting. More importantly, the new version of prospect theory, unlike the original version, can deal with uncertainty (unknown probabilities or ambiguity), using Schmeidler’s (1989) rank-dependent weighting of events. Schmeidler introduced rank dependence for the context of uncertainty, a context which is more important but also more difficult to analyze than risk (known probabilities). The approach of Schmeidler, Kahneman, and Tversky, developed 60 or 70 years after Keynes (1921) and Knight (1921), resulted in the first full-blown and empirically testable theory for uncertainty that reckons with ambiguity attitudes.^{1} Ambiguity attitudes concern situations of uncertainty where no objective probabilities are known but where it is, in deviation from the Bayesian approach, also problematic to assign subjective (additive) probabilities to events (Camerer and Weber 1992). This paper will report on an empirical test of rank dependence for subjective decision weights. It considers only positive outcomes, where Schmeidler’s rank-dependent utility coincides with prospect theory, the term used throughout this paper. Our findings, therefore, apply to both theories.
When restricted to two outcomes, the rank-dependent model had been known long before (Allais 1953, Eq. 19.1; Pfanzagl 1959, p. 288). The novelty of rank dependence only shows up for prospects with three or more outcomes (Gonzalez and Wu 2003). With such prospects, direct measurements can be obtained of decision weights in various middle “ranking positions,” and not just with the best or worst ranks, the only possible ranks for two-outcome prospects, as will be explained in Section 1.
General prospects with three or more outcomes, while prevailing in practice, are hard to implement in experiments. They are usually investigated in special designs that make their characteristics transparent. Studies for decision under risk using such prospects include Chew and Waller (1986), Gonzalez and Wu (2003), Lopes and Oden (1999), and several others. For decision under uncertainty, the domain of our study, there have only been a few studies with prospects yielding more than two outcomes (MacCrimmon and Larsson 1979, pp. 364–365; Tversky and Kahneman 1992, Section 1.3; Wu and Gonzalez 1999).
We developed a design for three-outcome prospects that incorporates rank dependence but at the same time makes de Finetti’s betting odds transparent. Thus, subjects can still relate to the choices in a meaningful manner without major cognitive effort, and direct elicitations of nonlinear decision weights are obtained. Such elicitations are desirable for tractable empirical applications of prospect theory when probabilities are unknown. No direct quantitative elicitations of decision weights when in middle ranking positions have been provided in the literature before. Such elicitations give insights into the novelty of rank dependence relative to preceding theories for uncertainty. In this way, our study is to some extent a counterpart to Gonzalez and Wu (2003). These authors considered decision under risk, and used three-outcome prospects to test the novelty of new prospect theory against the 1979 version of this theory. They found mixed results. Our study concerns unknown probabilities, and gives new insights into the distortions of the widely used elicitations of subjective beliefs through de Finetti’s betting-odds method. Clemen and Lichtendahl (2005) emphasized the importance of quantitative measurements of biases in belief elicitation, so as to develop quantitative corrections.
We implement our method in an experiment on subjective probability estimations for the performance of stocks. Shiller et al. (1996, p. 163) argued for the importance of measuring subjective probability estimates for stock performances. We test the presence of rank dependence and, thereby, the desirability to extend the classical Bayesian methods for eliciting beliefs. We also investigate what the deviations from Bayesianism were, regarding some widely discussed properties of rank dependence. In particular, we investigate whether decision weights are convex (“pessimistic” or “uncertainty averse”), a condition mostly assumed in theoretical studies, and whether decision weights are likelihood-insensitive (“inverse-S” or “boundedly subadditive”), a condition suggested by most empirical studies. The latter condition entails a bias of beliefs and decision weights in the direction of fifty-fifty.
Many studies of rank dependence have considered situations where rank dependence is most prone to appear. Some even deliberately and explicitly manipulated stimuli so as to maximally enhance rank dependence (Weber and Kirsner 1997). Our strategy is the opposite. We consider regular stimuli that are not particularly targeted towards enhancing rank-dependence effects. Our layout and presentation always maximize transparency and cognitive ease for the subjects. Thus, we move subjects in the direction of Bayesianism (which we consider rational), rather than enhancing rank dependence. Our estimations and significance levels for the existence and nature of rank dependence will, therefore, be conservative. We also include a number of situations that are especially prone to generate violations of prospect theory, so that we can critically test this theory. In these ways, we make it hard for prospect theory to perform well.
1 A reformulation of prospect theory through decision weights and rank dependence
This section presents rank-dependent utility and the new version of prospect theory in an elementary manner so as to highlight the central role of rank dependence. Tversky and Kahneman’s (1992) explanation is more complex. Given the importance of prospect theory, a simple explanation, accessible to a wide audience, is desirable.
The three uncertain events in our experiment (U, D, and R) are related to the performance of the Dow Jones industrial average and the Nikkei 225. U denotes the “Up” event that both stock indexes will go up tomorrow, D the “Down” event that both will go down, and R the “Rest” event that either one will go up and the other one will go down or at least one will remain constant. A prospect (u,d,r) yields $u if U obtains, $d if D obtains, and $r if R obtains. The outcomes u, d, and r are always positive in this paper. In applications, outcomes usually depend on stock-index changes in more complex manners. For the sake of exposition, and to be consistent with the experiment described later, we confine our attention to the three-outcome prospects as just described. Generalizations to more outcomes are straightforward. Outcomes x are sometimes equated with constant (riskless) prospects (x,x,x).
Subjective expected utility holds if there exists a utility function v and subjective probabilities π_{U}, π_{D}, and π_{R} that are nonnegative and sum to 1, such that a prospect (u,d,r) is evaluated by \( \pi _{{\text{U}}} {\text{v}}{\left( u \right)} + \pi _{{\text{D}}} {\text{v}}{\left( d \right)} + \pi _{{\text{R}}} {\text{v}}{\left( r \right)} \). Prospect theory generalizes subjective expected utility by allowing the πs to depend not only on the subjective beliefs about the occurrence of the event, but also on the “rank” of the events. Formally, the rank of an event is defined through the event that is ranked better in the sense of yielding better outcomes. The term rank dependence refers to this dependence. We use the term decision weight instead of subjective probability to reflect this dependence. To what extent decision-based quantities such as subjective probabilities and decision weights reflect beliefs or other factors has been a topic of many debates and speculations (Fox and Tversky 1998; Karni 1996; Nau 1995). At any rate, decision weights are relevant to decisions, and are the focus of this paper.
Table 1 Decision weights for a prospect (u,d,r), depending on the ranks of U, D, R
| π_{U} | π_{D} | π_{R} |
---|---|---|---|
u ≥ d ≥ r | \( \pi ^{{\text{b}}}_{{\text{U}}} \) | \( \pi ^{{{\text{m,U}}}}_{{\text{D}}} \) | \( \pi ^{{\text{w}}}_{{\text{R}}} \) |
u ≥ r ≥ d | \( \pi ^{{\text{b}}}_{{\text{U}}} \) | \( \pi ^{{\text{w}}}_{{\text{D}}} \) | \( \pi ^{{{\text{m,U}}}}_{{\text{R}}} \) |
d ≥ u ≥ r | \( \pi ^{{{\text{m,D}}}}_{{\text{U}}} \) | \( \pi ^{{\text{b}}}_{{\text{D}}} \) | \( \pi ^{{\text{w}}}_{{\text{R}}} \) |
d ≥ r ≥ u | \( \pi ^{{\text{w}}}_{{\text{U}}} \) | \( \pi ^{{\text{b}}}_{{\text{D}}} \) | \( \pi ^{{{\text{m,D}}}}_{{\text{R}}} \) |
r ≥ u ≥ d | \( \pi ^{{{\text{m,R}}}}_{{\text{U}}} \) | \( \pi ^{{\text{w}}}_{{\text{D}}} \) | \( \pi ^{{\text{b}}}_{{\text{R}}} \) |
r ≥ d ≥ u | \( \pi ^{{\text{w}}}_{{\text{U}}} \) | \( \pi ^{{{\text{m,R}}}}_{{\text{D}}} \) | \( \pi ^{{\text{b}}}_{{\text{R}}} \) |
Schmeidler (1989) and Tversky and Kahneman (1992) stated their theories in terms of a weighting function. This function assigns to each event E the decision weight \( \pi ^{{\text{b}}}_{{\text{E}}} \) (when E has the best rank). Our presentation in terms of decision weights is equivalent. Decision weights for a single prospect should sum to 1.
Observation 1.1
All rows in Table 1 sum to 1. □
Because of this observation, the new version of prospect theory avoids the violations of stochastic dominance that hampered the developments of original prospect theory.
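Why every row of decision weights sums to 1 can be checked numerically. The sketch below uses the standard rank-dependent construction, in which the weight of an event is the weighting function of that event together with all better-ranked events, minus the weighting function of the better-ranked events alone. The numerical values of `W` are purely hypothetical and are chosen only to be monotone with W(∅) = 0 and W(U∪D∪R) = 1; they are not estimates from the paper.

```python
from itertools import permutations

# Hypothetical weighting function W on subsets of {U, D, R} (illustrative values only).
W = {
    frozenset(): 0.0,
    frozenset("U"): 0.30, frozenset("D"): 0.25, frozenset("R"): 0.40,
    frozenset("UD"): 0.50, frozenset("UR"): 0.65, frozenset("DR"): 0.60,
    frozenset("UDR"): 1.0,
}


def decision_weights(ranking):
    """Return the decision weight of each event for a given ranking.

    `ranking` lists the events from best outcome to worst outcome,
    e.g. ("U", "D", "R") for u >= d >= r.
    """
    weights = {}
    better = frozenset()               # events ranked strictly better so far
    for event in ranking:
        with_event = better | {event}
        weights[event] = W[with_event] - W[better]
        better = with_event
    return weights


# One entry per possible ranking of the three events
row_sums = {r: sum(decision_weights(r).values()) for r in permutations("UDR")}
```

For each ranking the differences telescope to W(U∪D∪R) − W(∅) = 1, whatever nonadditive values W takes on the intermediate events.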
Decision weights for middle ranks only show up for prospects with three or more outcomes. Consequently, such prospects are needed for direct elicitations of such decision weights, and for direct tests of such decision weights. Earlier elicitations of middle decision weights were indirect, deriving them from nonadditive measures elicited from two-outcome prospects (Abdellaoui 2000; Bleichrodt and Pinto 2000; Fox and Tversky 1998; Gonzalez and Wu 1999; Tversky and Kahneman 1992). We will follow the assumption of linear utility underlying de Finetti’s betting-odds system and discussed elsewhere, i.e., we set v(x) = x.
2 Our hypotheses
Theoretical studies mostly assume pessimism (Dow and Werlang 1992), and terms such as uncertainty aversion and ambiguity aversion have been used. Empirical studies have suggested that insensitivity is prevailing (Abdellaoui et al. 2005; Einhorn and Hogarth 1985; Gonzalez and Wu 1999; Tversky and Fox 1995). On the basis of the above, a mix of pessimism and insensitivity can be expected, with strong inequalities \( \pi ^{{\text{w}}}_{{\text{U}}} > {\left\{ {\pi ^{{{\text{m,R}}}}_{{\text{U}}} ,\pi ^{{{\text{m,D}}}}_{{\text{U}}} } \right\}} \) and weaker inequalities \( \pi ^{{\text{b}}}_{{\text{U}}} \geqslant {\left\{ {\pi ^{{{\text{m,R}}}}_{{\text{U}}} ,\pi ^{{{\text{m,D}}}}_{{\text{U}}} } \right\}} \); the latter may be reversed if the effect of pessimism is stronger than that of insensitivity. Ellsberg (2001, pp. 203–206) predicted that such a reversal will not occur.
- Hypothesis 1
[Rank dependence of decision weights]. \( \pi ^{{\text{w}}}_{{\text{U}}} \ne {\left\{ {\pi ^{{{\text{m,R}}}}_{{\text{U}}} ,\pi ^{{{\text{m,D}}}}_{{\text{U}}} } \right\}} \ne \pi ^{{\text{b}}}_{{\text{U}}} . \)
Our second empirical hypothesis concerns the nature of rank dependence.
- Hypothesis 2
[Insensitivity and some pessimism]. \( \pi ^{{\text{w}}}_{{\text{U}}} \geqslant \pi ^{{\text{b}}}_{{\text{U}}} \geqslant {\left\{ {\pi ^{{{\text{m,R}}}}_{{\text{U}}} ,\pi ^{{{\text{m,D}}}}_{{\text{U}}} } \right\}}. \)
To critically test prospect theory, we elicited decision weights in different situations that, according to prospect theory, should give the same results, but where violations of prospect theory are most likely to generate differences. For this purpose, we considered degenerate prospects. These are (“riskless”) prospects for which it is certain beforehand what the outcome will be. People have a special preference for degenerate prospects (the certainty effect). Expected utility explains preference for certainty through concave utility. The Allais paradox, however, showed that this explanation is not sufficient (Allais 1953). There are factors underlying the certainty effect that are beyond expected utility, i.e., beyond utility curvature. Prospect theory uses probability weighting (and loss aversion) as a further factor, besides utility curvature, to explain the special preference for certainty. Prospect theory can, indeed, accommodate the Allais paradox.
The most pronounced violations of classical theories have been found, indeed, when degenerate prospects are present (Birnbaum and Thompson 1996; Humphrey 1995; McCord and de Neufville 1986; Starmer 2000). It is plausible that many psychological irregularities are effective in such situations, and that any theory, also prospect theory, will have difficulties there. We use the term degeneracy effects to designate factors underlying the certainty effect that are beyond prospect theory, i.e., beyond utility curvature and probability weighting (and loss aversion).
We elicited decision weights \( \pi ^{{\text{b}}}_{{\text{U}}} \) of event U when in the best rank both with a degenerate prospect present, denoting the resulting decision weight by \( \pi ^{{{\text{b,d}}}}_{{\text{U}}} \), and with no degenerate prospect present, denoting the resulting decision weight by \( \pi ^{{{\text{b,n}}}}_{{\text{U}}} \). Symbols such as \( \pi ^{{{\text{w,d}}}}_{{\text{U}}} ,\pi ^{{{\text{w,n}}}}_{{\text{U}}} \) and \( \pi ^{{{\text{b,d}}}}_{{\text{D}}} \) are similar, and all these weights were elicited in our experiment. Elicitations of decision weights for middle rank positions are not possible with degenerate prospects. The experimental details will be explained later. Prospect theory predicts equalities π^{d} = π^{n}, but degeneracy effects will generate differences. Such differences entail a violation of prospect theory, and show whether the degeneracy effects, factors beyond prospect theory, reinforce or weaken the certainty effect relative to prospect theory’s predictions.
3 Making decision tradeoffs transparent through de Finetti’s betting-odds system
In modern theories, rank dependence is important. Our design has modified de Finetti’s design by reckoning with this rank dependence. Classical applications have elicited decision weights only for best ranking positions, but these need not reflect subjective beliefs more properly than decision weights in worst or middle ranking positions. We chose a layout, presented in the next section, so as to induce psychological processes that match the preceding algebraic derivation of the decision weight from Eq. 3.2.
4 Experimental stimuli and layout that make de Finetti’s betting-odds system transparent to subjects
The left column, indicated by a single large plus, designates the left side of Eq. 3.2, i.e. a gamble of B (=20) extra on event U. The right column, indicated by three small plusses, designates the right side of Eq. 3.2, yielding s = 3 more than the reference prospect with certainty. For all prospects, U is ranked worst with decision weight \( \pi ^{{\text{w}}}_{{\text{U}}} \), D is ranked middle with decision weight \( \pi ^{{{\text{m,R}}}}_{{\text{D}}} \), and R is ranked best with decision weight \( \pi ^{{\text{b}}}_{{\text{R}}} \).
In the instructions to the subjects, it was explained that the left prospects in the tables always result from the middle prospects through single big increases of the outcome for one event, and the right prospects always result from the middle ones through the same (small) increase of all three events. This layout and presentation of tables should make the tradeoffs transparent, of either getting B extra under event U or s extra for sure, as for de Finetti’s betting odds. At the same time, the initial focus on the reference prospect should make the rank-ordering of the events salient. The layout of the first table in Figure 1 thus makes the relevant tradeoffs in Eqs. 3.1 and 3.2 transparent to the subjects.
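The tradeoff between B extra under event U and s extra for sure can be written out as a short worked derivation. It is a reconstruction under stated assumptions, not a quotation of Eqs. 3.1 and 3.2: the reference prospect is written as \( (r_{1}, r_{2}, r_{3}) \), utility is linear, and adding B to the outcome under U is assumed not to change the rank of U, so that the same decision weights \( \pi_{\text{U}}, \pi_{\text{D}}, \pi_{\text{R}} \) apply to all three prospects.

\[
\begin{aligned}
\text{left prospect } (B + r_{1}, r_{2}, r_{3}):\quad & \pi_{\text{U}}(B + r_{1}) + \pi_{\text{D}} r_{2} + \pi_{\text{R}} r_{3}, \\
\text{right prospect } (s + r_{1}, s + r_{2}, s + r_{3}):\quad & \pi_{\text{U}} r_{1} + \pi_{\text{D}} r_{2} + \pi_{\text{R}} r_{3} + s\,(\pi_{\text{U}} + \pi_{\text{D}} + \pi_{\text{R}}) \\
& = \pi_{\text{U}} r_{1} + \pi_{\text{D}} r_{2} + \pi_{\text{R}} r_{3} + s .
\end{aligned}
\]

Indifference between the two prospects then gives \( \pi_{\text{U}} B = s \), that is, \( \pi_{\text{U}} = s/B \). The weights of D and R and the reference outcomes drop out, because the decision weights of a single prospect sum to 1. With B = 20 and s = 3, indifference would correspond to \( \pi_{\text{U}} = 0.15 \).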
5 Experiment
Participants
N = 186 students, all from Tilburg University, took part. There were 62 psychology students divided into six groups. There also was a group of 124 students in general social sciences who participated in one big session. The average age of the participants was 20.1, and 32.8% were male.
Procedure
The experiment was carried out in classroom sessions. All items were administered using pencil-and-paper questionnaires. The subjects received brief verbal instructions, followed by detailed written instructions that took about 15 min to read (Appendix A). The stimuli and appendices are downloadable from the second author’s homepage, at http://people.few.eur.nl/wakker/pdfspubld/07.2dowjappdices.pdf.
A transparency with a graph depicting the performances of the stock indexes during the last 2 months, up to the day of the experiment, was projected during the task (Appendix B). Such periods are commonly used because for periods further in the past the nature of the stock may be different (Hull 2005, Section 13.4). A brief text in the written instructions discussed the likelihood of the indexes increasing or decreasing, referring explicitly to the last 2 months. As different groups participated on different days, the information about the indexes varied from group to group. Then the participants were asked to fill out the questionnaire at their own pace. This usually took about 30 min.
Stimuli; organization between pages
After the two learning-question pages and before the first experimental-question page, there was a page with three questions about the difficulty of the other questions and about whether the participants paid attention to their perceived likelihoods of the events (Appendix C). The questions about likelihood served to focus subjects’ attention on this aspect. Pilots had demonstrated that subjects were prone to using the heuristic of simply taking the sum of payments as their decision criterion, which ignores the different likelihoods of the events, leading to a loss of statistical power of our design.
The outcomes used in the experiment ranged from Dfl. 10 (approximately €4.50) to Dfl. 99. At the end of the experiment, there were self-assessment questions about age and gender.
Stimuli; organization within one page
All ten tables on one page had the same grey middle column, i.e. the same reference prospect. They also had the same left column, with the same single increase B (B = 20 in Figure 1). The payoffs of the right prospects were increased stepwise with step size x (x = 3 in Figure 1) up to 10x. We always had 10x ≥ B, so that the right prospect (10x + r_{1},10x + r_{2},10x + r_{3}) in the last choice on each page always dominated the left one \( {\left( {10x + r_{1} \geqslant B + r_{1} } \right)} \). In Figure 1, the right prospects dominate the left ones for all k ≥ 7 and, hence, for all three tables displayed on the right.
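The construction of the ten choices on a page can be sketched as follows. The reference prospect (44, 29, 13) with B = 20 and x = 3 is taken from the first row of the stimuli table; the helper names `ten_tuple` and `dominates` are illustrative assumptions.

```python
def ten_tuple(reference, event_index, B, x):
    """Build the left prospect and the ten right prospects of one page.

    reference   : outcomes (r1, r2, r3) under events U, D, R
    event_index : position of the event that receives the single increase B
    x           : step size of the sure increase in the right prospects
    """
    left = list(reference)
    left[event_index] += B
    rights = [tuple(r + x * k for r in reference) for k in range(1, 11)]
    return tuple(left), rights


def dominates(a, b):
    """True if prospect a pays at least as much as b under every event."""
    return all(ai >= bi for ai, bi in zip(a, b))


left, rights = ten_tuple((44, 29, 13), event_index=0, B=20, x=3)

# Smallest k for which the right prospect dominates the left one
first_dominating_k = next(
    k for k, right in enumerate(rights, start=1) if dominates(right, left)
)
```

With B = 20 and x = 3 the right prospect dominates as soon as 3k ≥ 20, so `first_dominating_k` is 7, in line with the statement about k ≥ 7.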
Table 2 Stimuli and results
# | π | Left (+), U: (B+)r_{1} | Left (+), D: (B+)r_{2} | Left (+), R: (B+)r_{3} | Right (+++), U: xk + r_{1} | Right (+++), D: xk + r_{2} | Right (+++), R: xk + r_{3} | mean (st.dev.) of π |
---|---|---|---|---|---|---|---|---|
1 | \( \pi ^{{{\text{b,n}}}}_{{\text{U}}} \) | 20 + 44 | 29 | 13 | 3k + 44 | 3k + 29 | 3k + 13 | 0.465 (0.22) |
2 | \( \pi ^{{{\text{b,d}}}}_{{\text{U}}} \) | 30 + 24 | 24 | 24 | 3k + 24 | 3k + 24 | 3k + 24 | 0.414 (0.18) |
3 | \( \pi ^{{{\text{m,D}}}}_{{\text{U}}} \) | 20 + 31 | 65 | 14 | 3k + 31 | 3k + 65 | 3k + 14 | 0.473 (0.23) |
4 | \( \pi ^{{{\text{m,R}}}}_{{\text{U}}} \) | 20 + 23 | 14 | 59 | 3k + 23 | 3k + 14 | 3k + 59 | 0.485 (0.23) |
5 | \( \pi ^{{{\text{w,n}}}}_{{\text{U}}} \) | 20 + 13 | 46 | 65 | 3k + 13 | 3k + 46 | 3k + 65 | 0.505 (0.23) |
6 | \( \pi ^{{{\text{w,d}}}}_{{\text{U}}} \) | 30 + 16 | 46 | 46 | 3k + 16 | 3k + 46 | 3k + 46 | 0.430 (0.17) |
7 | \( \pi ^{{{\text{b,n}}}}_{{\text{D}}} \) | 18 | 40 + 56 | 35 | 4k + 18 | 4k + 56 | 4k + 35 | 0.334 (0.18) |
8 | \( \pi ^{{{\text{b,d}}}}_{{\text{D}}} \) | 37 | 30 + 37 | 37 | 4k + 37 | 4k + 37 | 4k + 37 | 0.352 (0.19) |
9 | \( \pi ^{{{\text{m,R}}}}_{{\text{D}}} \) | 11 | 40 + 18 | 59 | 4k + 11 | 4k + 18 | 4k + 59 | 0.314 (0.19) |
10 | \( \pi ^{{{\text{m,U}}}}_{{\text{D}}} \) | 59 | 40 + 15 | 10 | 4k + 59 | 4k + 15 | 4k + 10 | 0.310 (0.19) |
11 | \( \pi ^{{{\text{w,n}}}}_{{\text{D}}} \) | 59 | 40 + 10 | 56 | 4k + 59 | 4k + 10 | 4k + 56 | 0.334 (0.19) |
12 | \( \pi ^{{{\text{w,d}}}}_{{\text{D}}} \) | 58 | 30 + 28 | 58 | 4k + 58 | 4k + 28 | 4k + 58 | 0.355 (0.20) |
13 | \( \pi ^{{{\text{b,n}}}}_{{\text{R}}} \) | 42 | 26 | 20 + 63 | 2k + 42 | 2k + 26 | 2k + 63 | 0.525 (0.20) |
14 | \( \pi ^{{{\text{b,d}}}}_{{\text{R}}} \) | 17 | 17 | 20 + 17 | 2k + 17 | 2k + 17 | 2k + 17 | 0.506 (0.20) |
15 | \( \pi ^{{{\text{m,U}}}}_{{\text{R}}} \) | 74 | 12 | 20 + 37 | 2k + 74 | 2k + 12 | 2k + 37 | 0.493 (0.21) |
16 | \( \pi ^{{{\text{m,D}}}}_{{\text{R}}} \) | 16 | 61 | 20 + 27 | 2k + 16 | 2k + 61 | 2k + 27 | 0.513 (0.20) |
17 | \( \pi ^{{{\text{w,n}}}}_{{\text{R}}} \) | 77 | 51 | 20 + 13 | 2k + 77 | 2k + 51 | 2k + 13 | 0.498 (0.20) |
18 | \( \pi ^{{{\text{w,d}}}}_{{\text{R}}} \) | 49 | 49 | 20 + 29 | 2k + 49 | 2k + 49 | 2k + 29 | 0.488 (0.20) |
L1 | | 30 + 50 | 10 | 30 | 3k + 50 | 3k + 10 | 3k + 30 | 0.367 (0.15) |
L2 | | 45 | 20 + 10 | 55 | 4k + 45 | 4k + 10 | 4k + 55 | 0.367 (0.24) |
F1 | | 30 + 35 | 11 | 24 | 2k + 35 | 2k + 11 | 2k + 24 | 0.347 (0.14) |
F2 | | 32 | 30 + 49 | 13 | 2k + 32 | 2k + 49 | 2k + 13 | 0.271 (0.14) |
Elicitations of decision weights with best or worst ranks when there are degenerate prospects, indicated by superscript d, occurred for the 10-tuples number 2, 6, 8, 12, 14, and 18 in Table 2. Then either \( B + r_{1} = r_{2} = r_{3} \) or r_{1} = r_{2} = r_{3} in Figure 3. For example, the second 10-tuple first considers a choice between (54, 24, 24) and (27, 27, 27) (for k = 1), and then a choice between (54, 24, 24) and (30, 30, 30) (for k = 2); etc. For each choice in this 10-tuple the second option is degenerate.
Elicitations of decision weights when there are no degenerate prospects, indicated by superscript n, occurred for the 10-tuples 1, 5, 7, 11, 13, and 17. For example, the first 10-tuple first considers a choice between (64, 29, 13) and (47, 32, 16), and then a choice between (64, 29, 13) and (50, 35, 19); etc. Although there are no degenerate prospects now, the rank of the first outcome (regarding the U event) is best, as it is for the second 10-tuple, and the same decision weight should result for event U from the first and the second 10-tuple according to prospect theory. Degeneracy effects will, however, generate differences between these decision weights.
Motivating the participants
So as to avoid income effects, individual-choice experiments usually pay at most one of the choices made by each participant for real, where this choice is randomly selected from all choices. A theoretical problem, suggested by Holt (1986), was demonstrated not to occur empirically by Starmer and Sugden (1991). The random-lottery incentive system has since become the almost exclusively used incentive system for individual-choice experiments (Holt and Laury 2002; Harrison et al. 2002). We used a variation of the system where only one of every ten participants, randomly selected, played for real. Two studies examined whether there was a difference between this form of the random-lottery incentive system and the original form, and did not find a difference (Armantier 2006, p. 406; Harrison et al. 2007). The incentive system was explained to the participants beforehand. The participants collected the money gained the next morning, when the relevant uncertainties about the stock indexes had been resolved. In addition, the 62 psychology students received course credits, and each student of the large group of 124 received a flat payment of €11.
Analysis
6 Results
The different subject groups exhibited the same patterns, and their data were pooled. Four subjects were dropped because a test question at the beginning of the experiment suggested that they did not understand the stimuli. Ten subjects were dropped because they had more than two incorrect choice-switches (from the right column to the left column), suggesting that they did not understand the stimuli. Dropping these subjects does not affect any of the main results hereafter.
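The exclusion rule based on incorrect choice-switches can be sketched in a few lines. The encoding of a page as a string of "L" and "R" choices for k = 1, …, 10, and the choice to count switches over all pages of a subject rather than per page, are assumptions made for illustration; the text does not specify these details.

```python
def incorrect_switches(choices):
    """Count switches from the right column back to the left column.

    `choices` holds one page's choices in order of increasing k,
    "L" for the left prospect and "R" for the right prospect.
    Because the right prospect improves with k, a switch from
    "R" back to "L" is treated as incorrect.
    """
    return sum(
        1 for prev, cur in zip(choices, choices[1:]) if prev == "R" and cur == "L"
    )


def should_exclude(pages, max_incorrect=2):
    """Assumed rule: exclude if the total over all pages exceeds max_incorrect."""
    return sum(incorrect_switches(page) for page in pages) > max_incorrect


# Usage with made-up response patterns
consistent_subject = ["LLLRRRRRRR", "LLLLLRRRRR"]
erratic_subject = ["LLRLRRLRRR", "RLRRRRRRRR"]
```

A subject who switches once from left to right on every page has no incorrect switches and is retained under this rule.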
We describe the results for the two events with significant rank dependence. For event U we find some pessimism, because \( \pi ^{{\text{b}}}_{{\text{U}}} < \pi ^{{\text{m}}}_{{\text{U}}} {\left( {t_{{165}} = - 4.18,\;p < 0.001} \right)} \) and \( \pi ^{{\text{b}}}_{{\text{U}}} < \pi ^{{\text{w}}}_{{\text{U}}} {\left( {t_{{168}} = - 3.24,\;p = 0.001} \right)} \). Whereas pessimism suggests that \( \pi ^{{\text{m}}}_{{\text{U}}} < \pi ^{{\text{w}}}_{{\text{U}}} \), there is no significant difference in our data (t_{168} = 0.93, ns). Event D exhibits likelihood insensitivity. That is, \( \pi ^{{\text{b}}}_{{\text{D}}} > \pi ^{{\text{m}}}_{{\text{D}}} {\left( {t_{{168}} = 3.35,\;p = 0.001} \right)} \) and \( \pi ^{{\text{w}}}_{{\text{D}}} > \pi ^{{\text{m}}}_{{\text{D}}} {\left( {t_{{169}} = 3.12,\;p = 0.002} \right)} \). There is no significant difference between \( \pi ^{{\text{b}}}_{{\text{D}}} \) and \( \pi ^{{\text{w}}}_{{\text{D}}} {\left( {t_{{168}} = 0.13,\;{\text{ns}}} \right)} \).
Significant inequalities and their support for properties of decision weights
| \( \pi ^{{\text{b}}}_{{\text{U}}} < \pi ^{{\text{m}}}_{{\text{U}}} \) | \( \pi ^{{\text{b}}}_{{\text{U}}} < \pi ^{{\text{w}}}_{{\text{U}}} \) | \( \pi ^{{\text{b}}}_{{\text{D}}} > \pi ^{{\text{m}}}_{{\text{D}}} \) | \( \pi ^{{\text{w}}}_{{\text{D}}} > \pi ^{{\text{m}}}_{{\text{D}}} \) |
---|---|---|---|---|
Insensitivity | − | 0 | + | + |
Pessimism | + | + | − | + |
Optimism | − | − | + | − |
Testing invariance of decision weights for same ranks of events
| \( \pi ^{{{\text{b,n}}}}_{{\text{U}}} > \pi ^{{{\text{b,d}}}}_{{\text{U}}} \) | \( \pi ^{{{\text{w,n}}}}_{{\text{U}}} > \pi ^{{{\text{w,d}}}}_{{\text{U}}} \) | \( \pi ^{{{\text{b,n}}}}_{{\text{D}}} < \pi ^{{{\text{b,d}}}}_{{\text{D}}} \) | \( \pi ^{{{\text{w,n}}}}_{{\text{D}}} < \pi ^{{{\text{w,d}}}}_{{\text{D}}} \) | \( \pi ^{{{\text{b,n}}}}_{{\text{R}}} > \pi ^{{{\text{b,d}}}}_{{\text{R}}} \) | \( \pi ^{{{\text{w,n}}}}_{{\text{R}}} > \pi ^{{{\text{w,d}}}}_{{\text{R}}} \) |
---|---|---|---|---|---|---|
t-statistic | t_{170} = 3.22 | t_{172} = 4.74 | t_{170} = −1.81 | t_{172} = −1.09 | t_{170} = 1.63 | t_{171} = 0.52 |
p-value | p = 0.002 | p < 0.001 | p = 0.07 | ns | ns | ns |
We also analyzed pessimism at the individual level, taking \( \pi ^{{{\text{w,n}}}}_{{\text{U}}} - \pi ^{{{\text{b,n}}}}_{{\text{U}}} \) as an index of pessimism for event U with noncollapsed decision weights, and taking \( \pi ^{{{\text{w,n}}}}_{{\text{D}}} - \pi ^{{{\text{b,n}}}}_{{\text{D}}} \) and \( \pi ^{{{\text{w,n}}}}_{{\text{R}}} - \pi ^{{{\text{b,n}}}}_{{\text{R}}} \) similarly. All pairwise correlations between these indexes are positive. They are significant for events U and D (r = 0.174, p = 0.025) and events U and R (r = 0.214, p = 0.006), and insignificant for events D and R (r = 0.104, p = 0.178). This suggests that individuals who are pessimistic for one event are pessimistic for another too, so that pessimism and optimism are individual traits to some extent. Similar correlational analyses for collapsed decision weights did not give significant results, suggesting that, in the presence of collapsing, effects other than rank dependence are dominant. A similar analysis for insensitivity is not readily possible because middle-ranked events play a different role in our design, without the possibility of collapsing, than events ranked worst or best.
Our final result concerns the prediction of rank dependence that the rows in Table 1 sum to the same value, and that this sum is 1 (Observation 1.1). The average weights elicited did sum to approximately the same value for each row, but this sum exceeded 1 and ranged from 1.24 to 1.34. Risk aversion (total number of right choices) correlated significantly with age (r = 0.22, p = 0.007), but not with gender.
7 Discussion
The analyses of variance detected strong deviations from expected utility, with decision weights affected by ranking. The paired t-tests found no clear overall patterns of rank dependence at the aggregate group level, with some support for pessimism and insensitivity. These findings together show that there are many violations of expected utility at the individual level, but there is also much heterogeneity between individuals. The size of the rank-dependent effects in our data will have been attenuated by some conservative aspects in our tests, discussed later.
Historical data suggest that the probability of the daily Dow Jones index going up is very close to 0.5, and so it is for the Nikkei index. These daily performances have a weak positive correlation (Hamao et al. 1990). Thus, R has a probability slightly below 0.5, and U and D have a probability slightly above 0.25. The decision weights of R obtained in our study are all close to 0.5. The decision weights of U and D exceed 0.25 considerably. These results reflect the general overweighting of decision weights in our data, in combination with the general regressive nature of judged probabilities. The latter implies that people overestimate small probabilities and underestimate moderate and high probabilities (Tversky and Fox 1995). The performances of the indexes in the months preceding the experiment were positive, with more movements up. Given that subjects received information about the two preceding months, it is natural that they weighted U more heavily than D.
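The step from a weak positive correlation to the probabilities of U, D, and R can be checked with a short calculation. The sketch below assumes that U denotes both indexes going up, D both going down, and R the remaining cases, and that each index goes up with probability 0.5. The correlation value 0.1 is a hypothetical placeholder, not an estimate taken from Hamao et al. (1990).

```python
def event_probabilities(rho, p_up=0.5):
    """Probabilities of U (both up), D (both down), and R (rest).

    Treats the two daily movements as Bernoulli(p_up) variables with
    correlation rho, so that P(both up) = p_up**2 + rho * p_up * (1 - p_up).
    """
    cov = rho * p_up * (1 - p_up)  # covariance of the two up-indicators
    p_u = p_up ** 2 + cov
    p_d = (1 - p_up) ** 2 + cov
    p_r = 1.0 - p_u - p_d
    if min(p_u, p_d, p_r) < 0:
        raise ValueError("rho is not feasible for these marginals")
    return {"U": p_u, "D": p_d, "R": p_r}


# Hypothetical weak positive correlation between the two indexes.
probs = event_probabilities(rho=0.1)
print(probs)
```

With independence (rho = 0) the probabilities are 0.25, 0.25, and 0.5. Any positive correlation moves U and D above 0.25 and R below 0.5, in line with the statement above.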
Several decisions in the design of our study served to maintain tractability for the subjects. For example, we did not counterbalance for the following two order effects. First, in all pages, sure payments were increasing from left to right. Given a natural tendency against changes, a bias can be expected of switching from left to right too late, generating a systematic bias upwards in the measured decision weights. This bias is enhanced by the presence of pages where the final choices are governed by dominance, e.g. in Figure 1, so that there is more room for overestimation than for underestimation. Thus, biases upwards in our measurements will have been generated, explaining the violations of unit summation in Table 1. Second, the location of events on pages was always the same, with event U on top, D in the middle, and R at the bottom.
Neither of the two order effects just discussed affects the presence of rank dependence or its direction, to the best of our knowledge. The biases mentioned do, however, affect the exact quantitative elicitations of decision weights. For such elicitations, they could have been avoided by counterbalancing and averaging. We did not carry out such corrective procedures, so as to avoid a lengthy experiment and to reduce the cognitive burden for the subjects. For this paper, we optimized our design for the testing of general hypotheses about existence and direction of rank dependence.
Our design may have encouraged subjects to focus on the big and small changes B and s, and to ignore the reference prospects. This effect reduces rank dependence, and the power of our design to find it, so that our statistical conclusions become conservative.
Different groups of subjects participated in the experiment at different times and, accordingly, received different information about the stock indexes during the preceding 2 months. This difference leads to an additional variation between individuals, and again leads to a loss of power. We, however, do not consider it to be a bias. In real life it is only natural that different individuals have different information about events and, hence, different attitudes towards them. The difference mentioned does not distort our findings on rank dependence because these findings are all based on within-subject differences.
The inequality \( \pi ^{{{\text{b,d}}}}_{{\text{U}}} < \pi ^{{{\text{b,n}}}}_{{\text{U}}} \) that we found, with decision weights lower and more choices for right (= safe = risk averse) columns in matrices with degenerate prospects present, suggests that the degeneracy effects enhanced the certainty effect beyond prospect theory. The inequality \( \pi ^{{{\text{w,d}}}}_{{\text{U}}} < \pi ^{{{\text{w,n}}}}_{{\text{U}}} \) that we found, however, suggests the opposite (Wu et al. 2005, p. 120, also report findings opposite to the certainty effect). There is, therefore, no clear direction in the violation of prospect theory that we found here. The other violation of prospect theory in our data, concerning the sums in Table 1 exceeding 1, was discussed before. Many other violations of prospect theory have been reported, including by Barron and Erev (2003), Birnbaum (2006), Goeree et al. (2002), González-Vallejo et al. (2003), Harbaugh et al. (2002), Lopes and Oden (1999), Neilson and Stowe (2001), and Starmer (1999). However, to date no more successful and tractable theory is available for decision under risk or uncertainty. Many phenomena remain unpredictable in this domain, with only post-hoc heuristic explanations conceivable.
We extensively tested many alternative layouts and framings in pilot studies, where subjects were asked to give feedback. The layout of the stimuli chosen for the experiment was found to be most suited for making the decision-relevant tradeoffs transparent to the subjects. In the pilot studies, we found that grouping the 10-tuples by events, rather than completely randomizing the order of presentation, and some other changes in design, induced participants to resort to heuristics equivalent to expected value maximization instead of expressing subjective preferences. We believe that experimental choices, derived from a transparent design where the relevant tradeoffs are clear to the participants, will be more representative of choices made in significant real-life decisions than choices derived from nontransparent designs.
The most problematic heuristic to be avoided was the one of simply adding the outcomes, completely ignoring the uncertain events and taking them all as if equally likely. This is an extreme case of Viscusi’s (1989) model of biases toward uniform prior distributions that in our setup enhance expected value maximization. Hence, we used events that were very clearly not symmetric or equally likely. We did not want to use events relating to one continuous variable so as to avoid distorting different perceptions of convex versus nonconvex unions. Nor did we want to use events that would arouse emotions, such as events pertaining to soccer, the most popular sport in our country.
Had we assumed concave instead of linear utility, even if utility were assumed known, as in \( v(x) = x^{0.88} \), the calculations would have been considerably more intricate. Decision weights would then no longer cancel from the equations, so that an indifference would yield only an equation with several decision weights as unknowns. Solutions and approximate solutions to complex linear equalities would then have been required, with complex data fittings. Convenient techniques for parametric fitting for uncertainty, when weighting functions need not be transforms of given probabilities, are yet to be developed. Hence we propose our method at present only when it is reasonable to assume linear utility. Many references have argued for linear utility for small stakes (Edwards 1955; Fox et al. 1996; Lopes and Oden 1999 p. 290; Luce 2000 p. 86; Rabin 2000; Ramsey 1931 p. 176; Savage 1954 p. 91). According to modern theories, risk aversion for moderate stakes (such as in Holt and Laury 2002) will be caused by factors other than utility curvature, such as the decision weights studied in this paper. With concave utility, the resulting decision weights would have been higher than in our calculations, so that utility curvature cannot account for the sums in Table 1 exceeding 1.
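The direction of the last claim can be illustrated in a deliberately simplified case. The sketch below assumes a stylized choice that is simpler than the matrices used in the experiment: a constant reference outcome x, a left option that adds a big amount B under event E only, and a right option that adds a sure small amount s under every event. Under rank-dependent utility, indifference then gives \( \pi_E (v(x+B) - v(x)) = v(x+s) - v(x) \), which reduces to \( \pi_E = s/B \) for linear utility. The function name and the numbers x = 4, B = 10, s = 4 are hypothetical.

```python
def implied_weight(x, big, small, power=1.0):
    """Decision weight of event E implied by an indifference.

    Stylized setup (an assumption, not the experimental design):
    left option pays x + big under E and x otherwise; right option
    pays x + small for sure; utility is v(z) = z ** power.
    """
    def v(z):
        return z ** power

    return (v(x + small) - v(x)) / (v(x + big) - v(x))


# Hypothetical indifference: reference outcome 4, big change 10, sure change 4.
w_linear = implied_weight(4, 10, 4, power=1.0)    # equals small / big
w_concave = implied_weight(4, 10, 4, power=0.88)  # concave utility

print(f"linear utility:  {w_linear:.3f}")
print(f"concave utility: {w_concave:.3f}")
```

In this simplified case the same indifference implies a higher decision weight under concave utility than under linear utility, so allowing for curvature would push the sums of weights further above 1 rather than back toward 1.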
To further clarify the disentanglement of utility and risk attitude that is central in our extension of de Finetti’s method, we refer to Chateauneuf and Cohen (1994, Corollary 2 on p. 86) and Abdellaoui et al. (2007). The former demonstrated that it is theoretically possible to have risk aversion with strictly convex utility, and the latter found this phenomenon empirically in an experiment with loss outcomes.
Many empirical studies have found that the local curvature of utility is most nonlinear around zero (Tversky and Kahneman 1992), a phenomenon incorporated in the most commonly used parametric utility family, the CRRA (logpower) family, which usually has infinite derivative at zero. Our outcomes are all remote from zero, with minimal outcome €4, so as to have approximately linear utility. An additional reason for avoiding the zero outcome is that it induces several biases in the evaluation of prospects (Birnbaum et al. 1992). Preference foundations of de Finetti’s betting-odds system for prospect theory with linear utility are in Chateauneuf (1991) and Diecidue and Wakker (2002).
8 Conclusion
Using de Finetti’s betting-odds system, we developed a method for eliciting decision weights for prospect theory and rank-dependent utility. We found evidence for rank dependence of the decision weights. This finding constitutes a deviation from the classical Bayesian model for eliciting subjective beliefs in ambiguous events. Regarding the direction of the deviation, there was much individual variation, with as much evidence for likelihood-insensitivity as for pessimism. Both effects seem to play a role. Given the widespread use of belief elicitations, virtually always based on Bayesian principles, it is important that the deviations from Bayesianism be identified, so that more accurate estimations can be obtained of the beliefs of financial experts, players in games, and so on.
The multiple-priors model existed before (Wald 1950), and became popular after Gilboa and Schmeidler (1989) established its theoretical soundness. While this model has proved to be useful in many theoretical studies, we are not aware of empirical measurements thereof, or a tractable way to obtain those.