1 Introduction: simplifying multi-attribute choice

Baucells and Sarin (2013) argue for guiding decision processes by employing personalized decision rules that “require only a few meaningful inputs from the decision maker” and “have strong prescriptive properties” (p. 30). In other words, the rules should be simple and perform well. For example, such simple rules are deterministic elimination by aspects (Tversky 1972) and equal weighting of attributes (Dawes and Corrigan 1974), which perform surprisingly well in multi-attribute choice (Hogarth and Karelaia 2005a, b; Katsikopoulos 2013).

A condition that facilitates the performance of simple rules in multi-attribute choice is cumulative dominance (Kirkwood and Sarin 1985): Say that there are three attributes and the utility function is U(X) = 3a 1(X) + 2a 2(X) + a 3(X). There are two alternatives, A with attribute values a 1(A) = 1, a 2(A) = 0 and a 3(A) = 1 and B with attribute values a 1(B) = 0, a 2(B) = 1 and a 3(B) = 1. Then, A cumulatively dominates B because a 1(A) ≥ a 1(B), a 1(A) + a 2(A) ≥ a 1(B) + a 2(B) and a 1(A) + a 2(A) + a 3(A) ≥ a 1(B) + a 2(B) + a 3(B). The cumulative-dominant alternative A also maximizes utility since U(A) = 4 and U(B) = 3. Note that A is chosen by deterministic elimination by aspects (Hogarth and Karelaia 2005a, b) which, for two alternatives, orders attributes according to the magnitude of their weight and chooses the alternative with the higher value on the first attribute that has different values on the two alternatives (in the example, this is attribute a 1). In sum, one can make utility-maximizing choices by knowing only the order of attribute weights and not their exact magnitudes.

The result that cumulative-dominant alternatives maximize utility is not peculiar to this example but holds for all linear utility functions (Baucells et al. 2008). Furthermore, some simple rules such as deterministic elimination by aspects, always choose a cumulative-dominant alternative when one exists (Baucells et al. 2008) and cumulative-dominant alternatives exist frequently in the majority of choice problems with binary attributes (Şimşek 2013).

In sum, cumulative dominance seems to legitimate guiding decision processes by employing simple rules. But how robust is this result? What if the utility function is not linear or if the attributes are not binary? These are the questions we ask and answer here.

The remainder of this article is organized as follows: in the next section, we present the previous work that has demonstrated some benefits of cumulative dominance. Then we test, via mathematical analysis and computer simulation, the limits of cumulative dominance by relaxing previous assumptions about the utility function and the attributes. The last section concludes.

2 Cumulative dominance: known benefits

The goal is to choose one out of many alternatives A, B, C, …, so as to maximize a function that maps alternatives to a numerical measure of their quality. This function can be interpreted as the decision maker’s subjective utility function (Keeney and Raiffa 1976).

Each alternative A is described by n (n ≥ 2) attributes, a 1, …, a n . That is, A = (0.8, 0.2, 0.5) means that a 1(A) = 0.8, a 2(A) = 0.2 and a 3(A) = 0.5. When the attributes are continuous, they can take any value in the interval [0, 1]. When the attributes are binary, they equal either 0 or 1. There exist k (k ≥ 2) alternatives in the consideration set.

In Baucells et al. (2008), the true utility function, which is unknown to the decision maker and the decision analyst, is assumed to be a linear function of the attributes:

$$ U(X) = \sum\limits_{i = 1}^{n} {w_{i} a_{i} (X);\;w_{1} \ge w_{2} \ge \ldots > w_{n} \ge 0} . $$
(1)

The utility function described in (1) is a special case of Keeney and Raiffa’s additive utility function where each single-attribute utility function equals the identity function (1976, Section 6.4). Theoretically, sufficient conditions for the existence of an additive utility function are reviewed in (Keeney and Raiffa 1976, Theorem 6.3).

Baucells et al. (2008) assume that the utilities in (1) are used for ranking alternatives. We make the same assumption here.

Alternative A cumulative-dominates alternative B (Kirkwood and Sarin 1985) when the following holds:

$$ \sum\limits_{i = 1}^{j} {a_{i} (A) \ge } \sum\limits_{i = 1}^{j} {a_{i} (B); \quad j = 1,2, \ldots .,n.} $$
(2)

Note that (2) assumes an order of attributes. In what follows we fix this order and assume that it is the same with the order of the weights in (1).

We say that an alternative A is cumulative-dominant in the consideration set if it cumulative-dominates all other k − 1 alternatives.

A cumulative-dominant alternative has maximum utility when the utility function is linear. In the Introduction, we saw an example of this for binary attributes. It is also true for continuous attributes. For example, assume U(X) = 3a 1(X) + 2a 2(X) + a 3(X) and consider two alternatives A and B with a 1(A) = 0.8, a 2(A) = 0.2 and a 3(A) = 0.5, and a 1(B) = 0, a 2(B) = 1 and a 3(B) = 0.4. It holds that A cumulatively dominates B and U(A) = 3.3 > 2.4 = U(B). More generally, (Baucells et al. 2008, Theorem 3) analytically showed the following:

Result 1

For the linear utility function (1), A cumulative-dominates B [see (2)] if and only if U(A) ≥ U(B) for all weights w i .

Note that Result 1, and all results in the present article, assume that the decision maker is properly calibrated: The decision maker, possibly with the help of the decision analyst, knows how to code the attributes a i so that the weights w i are positive and is able to order the w i according to their magnitude. These assumptions are particularly plausible when the decision maker is familiar with the choice. In this sense, the scope of this article is decision making under risk, not uncertainty (Knight 1921).

Result 1 implies that if an alternative is cumulative-dominant in the consideration set, then it has maximum utility. The next question is how frequently do cumulative-dominant alternatives exist.

Baucells et al. (2008) investigated artificial choice problems where the values of the attributes were drawn from independent Bernoulli distributions or from simple generalizations of Bernoulli distributions that had attribute correlations. The number of attributes n and the number of alternatives k were varied from 2 to 10. When n increased, the probability of cumulative dominance decreased. The decrease, however, was not too rapid. For example, when attributes followed independent Bernoulli distributions and it was equally likely that an attribute equalled zero or one, the probability of cumulative dominance ranged from 97 to 68 % as n increased from 2 to 10 when there were k = 2 alternatives. For k = 10 alternatives, this probability ranged from 99 to 42 %. Combinations of n and k for which the probability of cumulative dominance fell below 50 % were rare and in most combinations this probability did not fall below 80 %. Furthermore, Şimşek (2013) investigated natural choice problems in 51 datasets with n ranging from 3 to 21 and k = 2. The probability of cumulative dominance ranged from 100 to 61 % with a mean of 87 %. We summarize these results informally as follows:

Result 2

For binary attributes, a cumulative-dominant alternative exists frequently for the majority of combinations of the number of attributes n and the number of alternatives k.

Based on Results 1 and 2, multi-attribute choice can be simplified if one has a method for choosing cumulative-dominant alternatives when such alternatives exist. Some heuristics, which are simple to apply and are often used spontaneously by people, indeed choose cumulative-dominant alternatives. For example, Hogarth and Karelaia (2005a, b) define a deterministic version of Tversky’s (1972) elimination by aspects. In this heuristic the decision maker first inspects the value of all alternatives on attribute a 1. If all alternatives have the same value on a 1, this attribute is eliminated. If one alternative has a unique maximum value on a 1, this alternative is chosen. Otherwise, all alternatives with a value smaller than the maximum on a 1 are eliminated and the process is repeated using attribute a 2. The process continues in this way until an alternative is chosen. For example, the heuristic chooses A = (0.8, 0.2, 0.5) over B = (0, 1, 0.4) based on attribute a 1. The performance of this and similar heuristics has been empirically found to compare favorably with statistical benchmarks such as linear and logistic regression, Bayesian networks, neural networks and classification and regression trees (Czerlinski et al. 1999; Martignon and Hoffrage 2002; Chater et al. 2003; Hogarth and Karelaia 2005a; Brighton 2006; Martignon et al. 2008; Katsikopoulos et al. 2010, 2011; Şimşek 2013).

3 Cumulative dominance: testing the limits

Utility function with attribute interactions

In what follows, we assume the following utility function:

$$ \begin{gathered} U(X) = \sum\limits_{i = 1}^{n} {w_{i} a_{i} (X)} + \sum\limits_{i = 1}^{n - 1} {\sum\limits_{j = i + 1}^{n} {w_{i,j} a_{i} (X)a_{j} (X)} } + \cdots + w_{{1,2,{ \ldots },n}} a_{1} (X)a_{2} (X) \cdots a_{n} (X); \hfill \\ w_{1} \ge w_{2} \ge \cdots \ge w_{n} \ge 0,w_{1,2} \ge w_{1,3} \ge \cdots \ge w_{n - 1,n} \ge 0,{ \ldots },w_{{1,2,{ \ldots },n - 1}} \ge w_{{1,2,{ \ldots },n - 2,n}} \ge \cdots \ge w_{{2,3,{ \ldots },n}} \ge 0. \hfill \\ \end{gathered} $$
(3)

That is, (3) generalizes (2) by additionally including all multiplicative interactions among attributes (two-way, three-way, etc.). Note that the interaction weights are assumed to be positive and increasing in all subscripts. For example, for n = 3 attributes, the general form of the utility function is U(X) = w 1 a 1(X) + w 2 a 2(X) + w 3 a 3(X) + w 1, 2 a 1(X)a 2(X) + w 1, 3 a 1(X)a 3(X) + w 2, 3 a 2(X)a 3(X) + w 1, 2, 3 a 1(X)a 2(X)a 3(X) where w 1 ≥ w 2 ≥ w 3 ≥ 0, w 1, 2 ≥ w 1, 3 ≥ w 2, 3 ≥ 0 and w 1, 2, 3 ≥ 0 and a specific instance is U(X) = 7a 1(X) + 6a 2(X) + 5a 3(X) + 4a 1(X)a 2(X) + 3a 1(X)a 3(X) + 2a 2(X)a 3(X) + a 1(X)a 2(X)a 3(X).

The utility function described in (3) is a special case of Keeney and Raiffa’s multi-linear utility function where each single-attribute utility function equals the identity function (1976, Section 6.4). Theoretically, sufficient conditions for the existence of a multi-linear utility function are reviewed in (Keeney and Raiffa 1976, Theorem 6.3). We provide empirical evidence that supports the particular assumptions made in (3).

First, in a meta-analysis of 113 datasets, Li et al. (2006) found that including multiplicative interactions in a linear model is crucial to fitting the data well since 11 % of the two-attribute, 7 % of the three-attribute and 3 % of the four-attribute interactions were statistically significant.

Second, there is evidence for the restrictions of (3) on the weights. Note that the assumption w i  ≥ 0 can be made without loss of generality. Given this, it can also be assumed that w i, j , w i, j, k , …, w 1, 2, …, n  ≥ 0 because of the empirical regularity of synergy which says that interaction weights tend to have the same sign with the corresponding single-attribute weights (in Li et al. 2006, this happened in 83 % of the statistically significant two-attribute interactions). Next, the assumption w i  ≥ w i + 1 can be made without loss of generality. Given this, it can also be assumed that w i, j , w i, j, k , etc., are decreasing in all subscripts because of an empirical regularity called inheritance (Wu and Hamada 2000; Li et al. 2006) which says that the magnitude of w i, j tends to depend only on the magnitudes of w i and w j and to increase when either of them increase (Chipman 1996).

Finally, we also tested how well (3) fits the data in 20 decision problems that have been used to evaluate the performance of various choice models (Czerlinski et al. 1999). In each of these problems, the task is to rank a number of alternatives according to their value on a criterion that can be interpreted as utility (e.g., in one problem the alternatives are counties in Minnesota and the criterion is their rent per acre and in another problem the alternatives are the contiguous states in the U.S. and the criterion is their average motor fuel consumption per person).

In each problem, fit was measured by the rank correlation (Kendall 1938) between the order of alternatives according to their utilities as estimated by (3) and the true order of alternatives according to their real criterion values. Specifically, the following procedure was used. First, it was determined which attributes to ignore by fitting (3) to all data in each problem and retaining only the attributes corresponding to the largest 25 % of the weights (when 50, 75 or 100 % were used the fit was essentially equal, so we chose 25 % to minimize the number of parameters). Second, the cross-validation method was used; Initially, a sample that contained 50 % of all alternatives, their attribute and criterion values, was used to estimate the weights that maximized the fit of (3) in this sample, and finally the fit of (3) with the estimated weights was measured in ranking the other 50 % of the alternatives. To average out random variation, this process was repeated 1,000 times. Across the 20 problems the average value of Kendall’s τ was 0.43. And there was a 61 % probability that the alternative with the truly highest criterion value was the alternative with the maximum utility as estimated by (3).

3.1 Binary attributes: cumulative-dominant alternatives have maximum utility

We now ask if Result 1 can be extended for (3). To gain some intuition, consider the following examples. First, assume U(X) = 7a 1(X) + 6a 2(X) + 5a 3(X) + 4a 1(X)a 2(X) + 3a 1(X)a 3(X) + 2a 2(X)a 3(X) + a 1(X)a 2(X)a 3(X). Take two alternatives A and B with a 1(A) = 1, a 2(A) = 1 and a 3(A) = 0, and a 1(B) = 1, a 2(B) = 0 and a 3(B) = 1. A cumulatively dominates B and U(A) = 17 > 15 = U(B), so in this case Result 1 holds. But for the alternatives A and B with a 1(A) = 1, a 2(A) = 0 and a 3(A) = 0, and a 1(B) = 0.8, a 2(B) = 0.1 and a 3(B) = 0.1, Result 1 does not hold: A cumulatively dominates B but U(A) = 7 < 7.28 = U(B).

So, Result 1 cannot be extended for a utility function with interactions (3) when the attributes are continuous. It can, however, be analytically shown that the result can be extended when the attributes are binary (for a proof, see the Appendix):

Result 3

For the utility function with attribute interactions (3) and binary attributes, A cumulative-dominates B [see (2)] if and only if U(A) ≥ U(B) for all weights w i , w i, j , …, w 1, 2, …, n .

3.2 Continuous attributes: cumulative-dominant alternatives essentially always have maximum utility

In the absence of a theoretical guarantee that cumulative-dominant alternatives always maximize utility when the attributes are continuous, we investigated empirically how frequently this is the case. In a computer simulation, we crossed two factors: (i) the probability distribution of the attributes and (ii) the parameterization of the utility function.

For (i), we used two symmetric and two skewed distributions. The symmetric distributions were the uniform distribution over the [0, 1] interval and the normal distribution with mean 0.5 and variance 1, truncated so that all values are in the [0, 1] interval. The skewed distributions were obtained by truncating the normal distribution with mean 0 and variance 1 and the normal distribution with mean 1 and variance 1. In all four cases, attribute values were drawn independently of each other.

For (ii), we used different sets of attribute and interaction weights. We used two types of weights that induce different compensations among the attributes (Hogarth and Karelaia 2005b; Katsikopoulos 2013). First, we used equal weights w i  = w i, j  = \( \cdots \) = w 1, 2, …, n  = 1 where compensation is maximum. Second, we used unequal weights where there is less compensation, with the largest weight being w 1 = 2n − 1 and weights decreasing by 1 so that w 2 = 2n − 2, …, w 2, 3, …, n  = 1. Because the number of weights grows exponentially with n, we used all weights for n from 2 to 5 and only the single-attribute weights and the two-attribute interaction weights for n from 6 to 10.

The results are provided in Table 1. Each cell is the probability that the cumulative-dominant alternative (when one exists) has higher utility in a pair of alternatives (k = 2), for the corresponding combination of the number of attributes and the structure of weights. We repeated the process 10,000 times to average out random variation. Note that the results were always within 1 % for all four probability distributions and so we report the results only for the uniform distribution.

Table 1 The probability of the cumulative-dominant alternative [when one exists, see (2)] having higher utility in a pair of alternatives (k = 2) as a function of the number of continuous attributes n (from 2 to 10) and the parameterization of the utility function (3) (equal or unequal weights; see text for details)

Because a cumulative-dominant alternative essentially always maximizes utility when there are two alternatives, it follows that a cumulative-dominant alternative essentially always maximizes utility for any number of alternatives as well.

3.3 Continuous attributes: a cumulative-dominant alternative exists frequently only in the minority of choice problems

We investigated empirically how frequently a cumulative-dominant alternative exists when attributes are continuous. In a computer simulation, we crossed three factors: (i) the probability distribution of the attributes (same as in the previous simulation), (ii) the number of attributes n (from 2 to 10) and (iii) the number of alternatives k (from 2 to 10). The results are provided in Table 2. Each cell is the probability that there exists a cumulative-dominant alternative for the corresponding combination of the number of attributes and the number of alternatives. We repeated the process 10,000 times to average out random variation. Note that the results were always within 2 % for all four probability distributions and so we report the results only for the uniform distribution.

Table 2 The probability that a cumulative-dominant alternative exists [see (2)] as a function of the number of continuous attributes n and the number of alternatives k

How do the results of Table 2 compare with the results of Baucells et al. (2008) and Şimşek (2013) for binary attributes? A similarity is that in both cases the probability of cumulative dominance decreases with n. But the numbers are clearly lower when the attributes are continuous: for example, recall that for k = 2 and binary attributes, the probability of cumulative dominance ranged from 99 to 68 % (Baucells et al. 2008) and 100–61 % (Şimşek 2013). But in the top row of Table 2 we see that this probability ranges from 75 to 35 % when the attributes are continuous. The mean is 49 % which is comparable to the 58 % that Şimşek (2013) found across 51 natural datasets with continuous attributes (her range was from 94 to 11 %). Furthermore, only for 7 of the 81 combinations of n and k does the probability of cumulative dominance rise above 50 % in Table 2. And, for 44 of the 81 combinations, this probability does not rise above 20 %. We summarize these results informally as follows:

Result 4

For continuous attributes, a cumulative-dominant alternative exists frequently for the minority of combinations of the number of attributes n and the number of alternatives k.

In sum, the analytical and empirical results of the present section suggest that when attributes are binary, cumulative dominance exists frequently and leads to the good performance of simple heuristic rules; whereas when attributes are continuous, cumulative dominance exists less frequently and the performance of simple heuristic rules may be compromised. We illustrate our results on real-world choice problems from Şimşek (2013). That work reports summary statistics across 51 problems, whereas here we report results on two particular problems communicated to us by the author.

First, consider an individual who wants to buy a car that maximizes mileage per gallon of gas. In 1983, the Committee on Statistical Graphics of the American Statistical Association prepared a dataset including 398 cars built between 1970 and 1982, scored on the following continuous attributes: number of cylinders, engine displacement, horsepower, vehicle weight, time to accelerate from 0 to 60 mph, model year and origin. A cumulative-dominant alternative existed in 57 % of all comparisons between two cars but when attributes were binarized using a median split, a cumulative-dominant alternative existed in 89 % of all paired comparisons (for details on how the order of attributes used for checking cumulative dominance was determined, see Şimşek 2013). The cross-validated accuracy of the simple heuristic rule that chooses the cumulative-dominant alternative when one exists and guesses otherwise was 0.77 with continuous attributes and 0.84 with binary attributes.

A second example illustrates that the frequency of cumulative dominance and the performance of simple heuristics is not necessarily compromised when attributes are continuous. Consider an investor or a policy maker who wants to determine which of two anesthesiology departments requires fewer monthly man-hours to be maintained. Myers (1986) scored 12 American naval hospitals on the following attributes: number of surgical cases, eligible population and number of operating rooms (for details on the dataset, see Şimşek 2013). When attributes were binary, the frequency of cumulative dominance was 100 % and the cross-validated accuracy of the rule of choosing the cumulative-dominant alternative when one exists and guessing otherwise was 0.77, whereas when attributes were continuous, the frequency of cumulative dominance was 94 % and the cross-validated accuracy of the rule was 0.97.

4 Conclusions

Baucells and Sarin’s (2013) guided decision processes are a decision-making approach which lies between the extremes of unaided intuition and demanding optimization. The idea is to employ personalized decision rules that are simple and also perform well as, for example deterministic elimination by aspects and equal weighting of attributes. Such heuristic rules are well studied in psychology (Tversky 1972; Dawes and Corrigan 1974; Payne et al. 1993; Gigerenzer et al. 1999) and first results have also been obtained in decision analysis (Martignon and Hoffrage 2002; Hogarth and Karelaia 2005b, 2006a, b; Katsikopoulos and Fasolo 2006; Katsikopoulos 2011, 2013).

The present work takes a general point of view by investigating cumulative dominance—a condition that facilitates the performance of simple rules in multi-attribute choice (Baucells et al. 2008; Carrasco and Baucells 2008; Şimşek 2013)—and delineating its boundary conditions. In sum, we found that when it exists, cumulative dominance legitimizes guiding decision processes by employing simple rules but its existence should not be taken for granted. Our results are consistent with a growing body of work which has shown that simple heuristics are a good choice for making decisions, provided that the right heuristic is chosen for the right problem (Payne et al. 1993; Gigerenzer, Todd and the ABC research group 1999; Gigerenzer et al. 2011).