Motivation

In search of alternative behavioral foundations of choice models, the concept of regret has recently attracted the attention of several scholars in transportation research (Chorus et al. 2008a, b; Chorus 2010; De Moraes Ramos et al. 2011; Hess et al. 2012; Chorus 2012a, 2012b; Kaplan and Prato 2012; Chorus and Bierlaire 2013; Chorus et al. 2013b; Hensher et al. 2013; Boeri and Masiero 2014; Chorus 2014a, b; Prato 2014; Rasouli and Timmermans 2015a). Regret relates to an affective state of mind that individuals may experience or anticipate once they realize that one or more of the non-chosen choice alternatives outperform the chosen alternative on one or more attributes of interest. Regret-based models assume that individuals minimize regret rather than maximize utility when choosing among choice alternatives. The principle of regret minimization may signal bounded rationality in the sense that emotions are assumed to play a role in the decision-making process and that individuals tend to avoid extremes in their decision processes (Rasouli and Timmermans 2015b).

Chorus et al. (2008a) introduced regret-based models into transportation research. They provided an econometric framework extending seminal work (Bell 1982; Loomes and Sugden 1982, 1983, 1987; Quiggin 1994), from binary to multinomial and from single to multi-attribute choices. Originally, they defined regret as a linear function of negative attribute differences between the considered alternative and the best non-chosen alternative. Later, Chorus (2010) suggested replacing this discontinuous function with a continuous logarithmic approximation, primarily for technical reasons. Very recently, Chorus (2014c) introduced the generalized regret model, but we do not include it in the present study because it deviates too much from the pure notion of regret.

This fundamental work has led to an upsurge of applications of regret-based models, and extensions of the model under both certain conditions (Chorus et al. 2008a, b; Chorus and de Jong 2011; Chorus 2012b; Kaplan and Prato 2012; Beck et al. 2013; Chorus et al. 2013; Chorus 2014a; Hensher et al. 2013; Boeri and Masiero 2014, Prato 2014), and uncertain conditions (Chorus et al. 2008a; De Moraes Ramos et al. 2011; Chorus 2014b). Regret-based models seem to be more important when respondents are less familiar with the choice situation (probably because they have less defined preferences) (Boeri et al. 2014). Applications cannot ony be found in transportation research area but also in environmental studies (Thiene et al. 2012) and health economics (de Bekker-Grob and Chorus 2013).

All these regret-based models have in common that regret is a function of attributes differences of the choice alternatives. That is, the physical measurements of attributes are used in the regret function to measure the amount of regret, assuming that regret is based on these differences. In reality, however, regret may be based on the perception of physical attribute differences. Psycho-physical research, although focusing on different kind of stimuli such as noise, has provided a substantial amount of evidence that the relationship between physical attribute values and perceived attribute values is non-linear. If these findings would equally apply to travel-related attributes, current regret-based models would wrongly represent the mental representation of physical attributes levels that are subsequently used to arrive at a choice. All current regret-based choice models implicitly or explicitly ignore the perception of attributes. Compared to utility-based models of transport-related decisions, ignoring perception may have more dramatic repercussions, because regret-based models are based on attribute differences. It is assumed that the same absolute attribute difference is perceived independently of the overall attribute range and the absolute value of attribute itself. Thus, incorporating an improved representation of the perception of attributes into regret-based models may enhance their predictive success.

In this paper, we therefore propose and empirically test such an elaboration of the basic model specifications. In order to investigate this issue, the current paper sets out to formulate elaborations of the basic random regret minimization models and incorporates a non-linear representation of the perception of attributes. Inspired by research on psycho-physical measurement, it is assumed that the perception of stimuli (attributes) is proportional to their absolute values. To allow for slight deviations from this representation, a more general non-linear representation is also formulated and tested. Two data sets (one on stated choice, the other on revealed choices) are used to estimate and assess the proposed models. These newly formulated regret models will be compared against each other and against the original random regret minimization models. In addition to comparing their predictive performance, K-fold validation tests will be conducted.

The remainder of the paper is organized as follows. We will first summarize current regret-based models as they have been formulated in transportation research. Next, we will outline the new regret models that incorporate the non-linear psycho-physical relationships. Then, we will discuss the estimation and validation results for two case studies, respectively pertaining to choice of shopping center and transportation mode choice. The paper is completed with a discussion of the results and avenues of future research.

Conceptual framework

The urban-transportation system can be described in terms of a set of \(i = 1, 2, \ldots , I \in C_{n}\) choice alternatives, characterized by a non-empty set of \(K_{i}\) attributes, defining choice set C of individual n. The set of attributes may be identical for the different choice alternatives (e.g., a set of generic attributes for shopping centres), but may also be (partly) specific for different alternatives (e.g., attributes of transport modes). Let \(X_{ik}\) describe the attribute value of alternative i on attribute k. Finally, let \(Z_{nj} , j = 1,2, \ldots ,J\) describe characteristic j of decision-maker \(n = 1, 2, \ldots , N\).

Random utility and random regret models directly link attribute values of the choice alternatives to the choices, observed in the real world \(\left\{ {X_{ik} } \right\} \to p_{i} , i \in C\) where \(p_{i}\) is the probability of choosing alternative i. In formulating an operational model, a researcher has to select the set of attributes \(\left\{ {K_{i} } \right\},\) assumed to influence the choice behaviour of interest, and specify choice set C, which may be individual specific or universal. If the modelling approach predicts observed probabilities as a function of the objective attribute values (revealed preference), there is no explicit attempt to further detail and breakdown the decision making process.

The choice literature, however, documents several attempts and more detailed ways to elaborate this approach. In this study, we focus our attention to the relationship between the objective values of the physical attributes and their perception by the decision maker. This psycho-physical relationship involves the mapping of the objective, physical space into a cognitive space, measuring individuals’ perception: \(\left\{ {X_{ik} } \right\} \to \left\{ {x_{nik} } \right\} \forall i \in C;\forall n; \forall k \in K_{i}\). This elaboration would acknowledge that individuals hold partial \((k_{i} \subset K_{i} )\) and imperfect knowledge \(x_{nik} \ne X_{ik}\) about the attributes of the choice alternatives in their choice set. Moreover, individuals may not be aware of all choice alternatives \((C_{n} \subset C)\). This elaboration also contends that individuals base their decisions on their cognitive representations of reality as opposed to reality itself. The notion that knowledge is imperfect may be captured by any non-identity function, mapping objective attribute values into cognitive values: \(x_{nik} = f_{nk} \left( {X_{ik} } \right) \forall i \in C_{n} ;\forall n; \forall k \in K_{ni}\) where \(f_{nk}\) is any function applied to attribute k by individual n.

The classic random regret model, suggested by Chorus et al. (2008a, b), assumes that function f is an identity function, indicating that objective attribute values as opposed to cognitive or perceived values are used. The original attribute-level regret function is assumed to be equal to

$$\varphi_{{ii^{'} k}} = \hbox{max} \left[ {0, \beta_{k}^{\text{BO}} \left( {X_{{i^{'} k}} - X_{ik} } \right)} \right]$$
(1)

where \(\varphi_{{ii^{'} k}}^{{}}\)defines the regret for chosen alternative i when compared to foregone alternative \(i^{'}\) on attribute \(k\), operator. Then, the regret of chosen alternative i when compared to foregone alternative \(i^{'}\) is assumed equal to the sum of attribute-level regrets.

$$R_{{ii^{'} }} = \mathop \sum \limits_{k} \varphi_{{ii^{'} k}}$$
(2)

Moreover, to generalize regret to the multi-alternatives choice situation, following Quiggin (1994) and assuming a universal choice set, the regret of chosen alternative i is defined against the best foregone alternative \(i^{'}\):

$$R_{i}^{\text{BO}} = \mathop {\hbox{max} }\limits_{i' \ne i \in C} \{ R_{{ii^{'} }} \}$$
(3)

(BO) stands for the “Best Only” regret based choice model. In this paper, we will refer to this model specification as the original regret model.

Because this discontinuous specification of regret leads to possible estimation problems, Chorus (2010) approximated the discontinuous attribute-level regret function by the following continuous logarithmic function:

$$\varphi_{{ii^{'} k}}^{\text{L}} = \ln \left( {1 + \exp \left[ { \beta_{k}^{\text{PL}} \left( {X_{{i^{'} k}} - X_{ik} } \right)} \right]} \right)$$
(4)

As in the original regret model, the regret associated with chosen alternative i when compared to foregone alternative \(i^{'}\) is defined as the sum of the attribute level regrets.

$$R_{{ii^{'} }}^{\text{L}} = \mathop \sum \limits_{k} \varphi_{{ii^{'} k}}^{\text{L}}$$
(5)

In addition, he assumed that regret was not defined against the best foregone alternative, but rather against all non-chosen alternatives. That is,

$$R_{i}^{\text{PL}} = \mathop \sum \limits_{{i^{'} \ne i \in C}} R_{{ii^{'} }}^{\text{L}}$$
(6)

PL stands for “Paired Logarithmic” regret based model. In this paper, this model specification will be called the logarithmic specification.To map regret into choice probabilities, both model specifications assumed that regret is a stochastic variable. Following classic discrete choice models, assuming that the random components of the regret functions are independently and identically Gumbel distributed, the assumption of regret minimization then leads to the following choice model:

$$P_{i} = \frac{{\exp \left( { - R_{i} } \right)}}{{\mathop \sum \nolimits_{{i^{'} \in C}} \exp \left( { - R_{{i^{'} }} } \right)}}$$
(7)

where \(P_{i}\) is choice probability choosing alternative i, \(R_{i}\) is the regret function.

New regret models

To include perception into these base regret models, we suggest a psycho-physical relationship that can be traced back to Weber (1834). He argued that individuals perceive change in a stimulus as a constant ratio of the original stimulus. This assertion, the so called Weber’s Law, indicates that as the initial stimulus gains more strength, much more strength is required for later stimuli to maintain the same perception difference.

$$\frac{{{{\Delta }}X_{k} }}{{X_{k} }} = {\text{C }}$$
(8)

where \(C\) is a constant (Gescheider 1997), \(X_{k}\) means an initial stimuli magnitude, and \({{\Delta }}X_{k}\) means the variance of the stimuli magnitude.Regret-based choice models have been developed based on the definition that attribute-regret is proportional to attribute-differences when an alternative performs better than others. By definition, the same attribute-difference generates the same regret regardless of the magnitude of the attribute.

However, considering individuals’ perception in real choice situations, the same attribute-difference may be perceived smaller when the size of the attribute is larger. For instance, if someone travelled during 120 min, he would feel a certain amount of regret if the travel time of the alternative, non-chosen route turned out to be 110 min. However, if that person travelled for 20 min, and the travel time of the alternative, non-chosen route would be 10 min, would that person experience the same amount of regret? The answer to this question, according to classic regret models would be yes, because regret only depends on attribute differences, which are the same in this example. However, it is likely that the same 10 min travel time difference is perceived quite differently in these two cases.

Therefore, to avoid this inherent limitation of the current regret formulations, we assume that individuals perceive attribute differences “Based On Weber’s law” (BOW). When it is applied to the original regret model, the attribute-level regret of alternative i when compared to alternative i’ for attribute \(k\) is then defined as

$$R_{i}^{\text{BOW}} = \mathop {\hbox{max} }\limits_{i' \ne i \in C} \left( {\mathop \sum \limits_{k} \hbox{max} \left[ {0, \beta_{k}^{\text{BOW}} \left( {\frac{{X_{{i^{'} k}} - X_{ik} }}{{X_{ik} }}} \right)} \right]} \right)$$
(9)

This new regret model can describe the variance in perception with respect to the size of attribute. It implies the contention that regret is not only a function of the attribute differences of the considered alternative and the best non-chosen one but also of attribute magnitude itself. The larger the attribute size, the smaller the regret with respect to the same attribute-differences. Parameters \(\beta_{k}^{\text{BOW}}\) are to be estimated from observed choices.

Likewise, the Weber’s law can be easily applied to the logarithmic specification. Equation (10) for “Paired Logarithmic Weber’s law” (PLW) gives the result:

$$R_{i}^{\text{PLW}} = \mathop \sum \limits_{i' \ne i \in C} \mathop \sum \limits_{k} \ln \left( {1 + { \exp }\left[ {\beta_{k}^{\text{PLW}} \left( {\frac{{X_{{i^{'} k}} - X_{ik} }}{{X_{ik} }}} \right)} \right]} \right)$$
(10)

This model also incorporates the non-linear mapping of attribute differences into cognitive differences. Later, Guilford (1932) pointed out that the Weber’s law does not hold for the perception of high intensity stimuli. Based on a series of experiments, he concluded that individuals in that case perceive change in a stimulus to be smaller, meaning that the stimulus in the denominator used as a reference should be decreased. He suggested a power coefficient in the denominator which should be less than one. The power coefficient ϑ is expected to lie between 0 and 1.

$$\frac{{{{\Delta }}X_{k} }}{{(X_{k} )^{{\vartheta_{k} }} }} = {\text{C }}$$
(11)

By adding the power coefficient, the suggested model is more flexible in capturing psycho-physical relationships. In this study, we will call this modification the generalized Weber’s law abbreviated as (BOGW) for (Based On Generalized Weber’s law. With this law, the original attribute-level regret of alternative i when compared to the best alternative i’ for attribute \(k\) is defined as

$$R_{i}^{\text{BOGW}} = \mathop {\hbox{max} }\limits_{i' \ne i \in C} \left( {\mathop \sum \limits_{k} \hbox{max} \left[ {0, \beta_{k}^{\text{BOGW}} \left( {\frac{{X_{{i^{'} k}} - X_{ik} }}{{\left( {X_{ik} } \right)^{{\vartheta_{k} }} }}} \right)} \right]} \right)$$
(12)

Similar to the first new regret model (Eq. 10), this second regret model can also describe the variance in perception with respect to the size of the attributes of an alternative. However, the degree of variance in the denominator is smaller as captured by the power function in the denominator. Parameters \(\beta_{k}^{\text{BOGW}}\) should be estimated from observed choices.

Likewise, the modification can be easily applied to the attribute-level regret of the logarithmic specification, as shown in Eq. 13.

$$R_{i}^{\text{PLGW}} = \mathop \sum \limits_{i' \ne i \in C} \mathop \sum \limits_{k} \ln \left( {1 + { \exp }\left[ {\beta_{k}^{\text{PLGW}} \left( {\frac{{X_{{i^{'} k}} - X_{ik} }}{{\left( {X_{ik} } \right)^{{\vartheta_{k} }} }}} \right)} \right]} \right)$$
(13)

Note that if the power coefficient \(\vartheta_{k}\) is estimated to be equal to zero, or not statistically significant from zero, the new regret models expressed in Eqs. 12 and 13 become the original formulations (Chorus et al. 2008a, b; Chorus 2010). On the other hand, if it is equal to one, it means that regret is generated based on Weber’s law (Weber 1834).

Both new regret models represent the psycho-physical relationship by dividing attribute differences by the size of attribute (moderated by some power coefficient). Therefore, individuals perceive different degrees of regret of each alternative, even for same attribute level difference, depending on the size of attribute of the chosen alternative.

To illustrate these models, assume two alternative routes (route 1 and route 2) that define the available choice alternatives between an origin and a destination, characterized by travel time only. We formulate five scenarios with a constant travel time difference between the two modes, assuming the route 1 is the faster route in all scenarios. The traveler values travel time (differences) as −0.5/min, and the power coefficient is assumed to be equal to 0.7. Intuitively, it is likely that most individuals will choose the route 1 due to the shorter travel time when the size of the attribute is small. However, when the size of the attribute increases, the shares of the route 1 and route 2 becomes more similar. Sheffi (1985) pointed out that this expected behavior stems from the variance in perception of travel time differences as a function of absolute travel time.

Table 1 shows how choice behavior is predicted to differ for the three types of regret models (the two newly suggested regret models, and the original regret model) by size of attribute. The probability of choosing the car is always higher, but differences in choice probability between the two alternatives are not the same for the models. Like the multinomial logit model, the original regret formulation does not consider the absolute size of travel time because regret is defined only by attribute-differences between alternatives. Therefore, most individuals are predicted to choose the car in all these scenarios with the same probability, regardless of the size of the attribute. In contrast, the new regret models suggested in this paper do consider the size of the travel time attribute. When applying the original Weber’s law to the specification of regret, the choice probabilities of the two alternatives become closer as the size of the attribute (travel time) become larger. In case of the generalized Weber’s law, the effect of perception on choice probabilities is moderated, because the value of the power coefficient decreases the value of the denominator.

Table 1 Change in choice probability by size of attribute in three regret-based models

Case studies

To examine whether indeed the inclusion of the psycho-physical relationship into the original model specifications improves the predictive ability of the regret choice models, two data sets were used. The first data set concerns a stated choice experiment about the choice of shopping center. The second data set is concerned with the choice of transportation mode in a revealed preference setting. For each data set, three analyses were conducted: (i) a comparison of the goodness-of-fit of the model specification based on Weber’s law versus the original two model specifications (the best only and the logarithmic pairwise comparisons models); (ii) a comparison of the goodness-of-fit of the model specification based on the generalized Weber’s law versus the original two model specifications (the best only and the logarithmic pairwise comparisons models); (iii) as a by-product a comparison of the goodness-of-fit of the original regret model versus the logarithmic specification. Technically, the latter comparison is a replication of an earlier study (Rasouli and Timmermans 2015a), using new data sets. In addition to the comparison of model performance, a K-fold cross-validation test was conducted for each case.

Likelihood values were calculated to compare the goodness-of-fit of the models. Because we need to compare non-nested models with different numbers of estimated parameters (specification based on the generalized Weber’s law), we applied Ben-Akiva and Swait’s test (1986) for non-nested choice models.Footnote 1 Matlab R 2015a was used to develop the code to estimate the various models.

The K-fold cross-validation implies that the data were split into K subsets. K-1 subsets were used to estimate the model, and the remaining subset was used to test the model. This process is repeated K times since each subset should be used as a test set once. The 2-norms statistic, which quantifies the distance between predicted and observed market shares derived from absolute error (Cherchi and Cirillo 2010) was used in this test. It can be expressed as:

$$2 - {\text{norms}} = \sqrt {D_{Alt.1}^{2} + \cdots D_{Alt.i}^{2} }$$
(14)

where \(D_{Alt.i}\) means the absolute error between observed and predicted shares for alternative i. Note that if the predicted and observed market shares are the same, D is equal to zero. It becomes bigger if the difference between predicted and observed market share increases. Thus, by definition, a lower value of the 2-norms statistic indicates a higher predictive power.

Case study 1: Stated choice experiment

Data

The first case study is based on a stated choice experiment constructed to analyze consumer choice of shopping center. The experiment described shopping centers in terms of three generic attributes - travel time, parking fees, and number of stores. Each attribute was categorized into four levels. Travel time was labeled 10, 30, 50 and 70 min, parking costs were varied in terms of 0.5, 1.0, 1.5, 2.0 euros, while levels for the number of stores were 30, 60, 90, and 120 stores respectively. Three imaginary shopping centers were considered as alternatives in choice sets. A \(4^{9}\) orthogonal fractional factorial choice set design consisting of 128 runs was constructed to orthogonally vary the attributes within and between the shopping centers. These choice set were blocked into 16 choice sets. A total of 242 participants indicated for each of these choice sets which of these three centers they would choose.Footnote 2

Estimation results

Original model

The original regret model (Eq. 3) was compared with the logarithmic specification (Eqs. 6) to figure out how well each base regret model represents observed consumer choice behavior. Recall that the regret functions are identical for larger attribute difference, but differ for smaller differences. Moreover, whereas the original regret model is based on a comparison of the best foregone only choice alternative, the logarithmic specification, considered in this study is based on all pairwise comparisons. Thus, the analysis is a partial replication of an earlier study, using a different data set (Rasouli and Timmermans 2015a). Although this comparison is not the focus on attention in this study, we included the results because they add empirical knowledge to question which specification seems behaviorally the best. The findings of the estimation results of each model are listed in Table 2.

Table 2 Summary of estimation results—original models: case study 1

All coefficients are statistically significant at the 95 percent significance level and their sign is in anticipated direction. The absolute values of the attribute parameters are larger in the original regret model than in logarithmic specification, as expected. Parking fees is the most influential attribute in both models. This result may be affected by the attribute range since parking fees has the smallest attribute range, compared to other attributes. The original formulation outperforms Chorus’ later specification, consistent with the result of previous research (Rasouli and Timmermans 2015a). The results of Ben-Akiva and Swait’s test (1986) for non-nested choice models indicate that the probability that the logarithmic specification outperforms the original regret is zero (\(p \le \varPhi ( - 9.224) \cong 0\)). \(\left( \varPhi \right)\) is the standard normal cumulative distribution.

Specification based on Weber’s law

In this section, we discuss the results of the comparison of the model specifications that include Weber’s law with their original, base specifications. Table 3 shows the estimation results and the difference in predictive performance of the models.

Table 3 Summary of estimation results—specification based on Weber’s law: case study 1

The t values indicate the parameters are statistically significant at the 95 % significance level. Their signs are consistent with theoretical anticipations. By applying Weber’s law, the absolute values of the parameters are increased in both regret models. This seems reasonable since the attribute difference is divided by the attribute value, and therefore larger parameters are needed to represent similar attribute level regret. The increment is smaller for parking fees than for other attributes, which may reflect the effect of attribute range. That is, since the original regret model does not consider the attribute range, and parking fees has the smallest range, the absolute value of the parameter for parking fees is relatively high. However, by applying Weber’s law, attribute values in the denominator become normalized. As a result, while parking fees is the most influential attribute in the original regret model, travel time becomes most important when applying Weber’s law.

Based on the results of Ben-Akiva and Swait’s test (1986) for non-nested choice models, we can conclude that the regret model, which includes the non-proportional perception effect, performs better than the original formulations (original regret and logarithmic specification). The probability that the original regret model is superior to its specification based on the Weber’s law is zero (\(p \le \varPhi ( - 5.052) \cong 0,\) with Φ being the standard normal cumulative distribution function). Also, the probability that the logarithmic specification is superior to its specification based on Weber’s law is zero (\(p \le \varPhi ( - 4.125) \cong 0\)).

Compared to the base formulations of the regret models, the specifications based on Weber’s law slightly improve the goodness-of-fit of the models. Thus, it seems that in this case study, the inclusion of the mapping of objectives values into cognitive attributes results in better predictive performance of the regret models.

Specification based on generalized Weber’s law

Focusing on the specification based on the generalized Weber’s law, similar results were obtained for both regret models (original regret and logarithmic specification). Table 4 shows the estimation results. The power coefficient for the number of stores in the modified Weber model specification is not statistically significant at the 95 percent significance level in both regret models. Therefore, they were dropped and replaced by one as in Weber’s Law.

Table 4 Summary of estimation results—specification based on Weber’s law: case study 1

Applying the generalized Weber’s law, the perception effect is somewhat diminished. It turns out that the increment in the absolute values of the estimated parameters is slightly lower than the specification based on Weber’s law. Parking fees is the most influential attribute in both regret models based on the generalized Weber’s law, while travel time is the most influential attribute in the specification based on Weber’s law.

The value of the power coefficient in the specification based on the generalized Weber’s law is about 0.4 in the case of the original regret model, which is slightly smaller than in the case of the logarithmic specification (the value of the power coefficient is around 0.5). These results are congruent with Guilford’s argument (1932) that the power coefficient should have a value between zero and one.

All specifications incorporating the generalized Weber’s law outperform their base model specifications according to Ben-Akiva and Swait (1986)’s test for non-nested choice models. The probability that the original regret model outperforms its elaboration based on the generalized Weber’s law is zero (\(p \le \varPhi ( - 10.005) \cong 0\)), and the probability that the logarithmic specification is superior to its elaboration based on the generalized Weber’s law is also zero (\(p \le \varPhi ( - 9.115) \cong 0\)). The improvement in goodness-of-fit of the models based on the generalized Weber’s law is higher than the improvement based on the classic Weber’s law.

Validation results

A K-fold cross-validation test was conducted to validate the estimation results. In total, 11 subsets (K = 11) were constructed, randomly assigning the respondents to the subsets. Of the 11 subsets, 10 subsets were used to estimate the models, while the one remaining subset was used to validate the estimation results. This process is repeated 11 times using all subsets as validation data.

Original models

First, the cross-validation test is applied to original regret model and the logarithmic specification. Results show that the estimated parameters only slightly differ between the 11 subsets. Likewise, the goodness-of-fit of the models differs only marginally. This implies that the data are very stable. The validation step involves using the estimated model to predict the observations in the 11th subset.

Figure 1 shows the 2-norms values of both regret models, which quantify the predictive success of the estimated model. Akin to the estimation results for the full data set, the results of the 11 fold cross-validation test indicate that the original regret model is superior to the logarithmic specification. A few fluctuations can be observed. Thus, we can rule out that a peculiar subset or peculiar respondent causes the improvement in predictive success of the original regret model.

Fig. 1
figure 1

2-norms values for K-fold cross-validation test: Comparison of original regret and logarithmic specification—case study 1

Specification based on Weber’s law

Second, each regret model (original regret and logarithmic specification) and their variants based on Weber’s law were subjected to the cross-validation test. The estimated parameters and their goodness-of-fit only show small variations between the 11 training data sets. This implies that this data set is suitable to conduct the cross-validation test.

The 2-norms values for the 11-fold cross-validation are shown in Fig. 2. It demonstrates that the variant based on Weber’s law performs better in predicting observed choice behavior. This means that when regret is defined to include the psycho-physical relationship, the model is superior to its original base specification. Compared to the base formulations, the value of 2-norms is generally lower for the models specifications based on weber’s law, because the model fit is better.

Fig. 2
figure 2

2-norms values for K-fold cross-validation test: Comparison of base formulations and their variants based on Weber’s law—Case study 1. a Original regret and its variant based on Weber’s law. b Logarithmic specification and its variant based on Weber’s law

Specification based on the generalized Weber’s law

Finally, the 11-fold cross-validation test was applied to the specification based on the generalized Weber’s law. The parameters are similar and the goodness-of-fit of the models is almost constant across training sets. Figure 3 presents the values of 2-norms. The variants based on the generalized Weber’s law show lower values than their original formulations, implying a higher predictive ability. The difference in predictive ability of the models is much clearer than in case of Weber’s law. The 2-norms values of the variants based on the generalized Weber’s law show lower values than the original formulations.

Fig. 3
figure 3

2-norms values for K-fold cross-validation test: comparison of base formulations and their variants based on the generalized Weber’s law—case study 1. a Original regret and its variant based on the generalized Weber’s law. b Logarithmic specification and its variant based on the generalized Weber’s law

Case study 2: revealed choice experiment

Data

For the second case study, we used the 2009 MON data (Mobiliteit Onderzoek Nederlands—the Dutch National Travel Survey). The survey was administered among representative residents in the Netherlands to obtain daily travel information. From the complete revealed preference (RP) data set, we extracted the data of the Noord Brabant Province, which concerns 354 respondents. The data includes two level of service variables (travel time and travel distance) for three mode choice alternatives (car, bike, walk). The travel time ranges from 1 to 134 min, while travel distance varies from 1 to 91 km. The variance of travel time is 56.48, which is larger than the variance of travel distance (50.89). The estimated models thus predict the probability of mode choice as a function of these level of service variables.

Estimation results

As in the first case study, the original regret model, the logarithmic regret model and their variants based on the Weber’s law and the generalized Weber’s law were estimated. The primary aim of the analysis was to examine and statistically test whether model specifications that including a mapping of the objective attribute values into cognitive values outperform their original specifications. Implicitly, this also allows examining whether the mode based on the generalized Weber specification outperform the model, based on the Weber’s law, and whether the logarithmic model outperforms the original regret model, introduced in the transportation research community by Chorus et al. (2008a, b).

Original models

Table 5 shows the detailed estimation results for the original regret and the logarithmic regret models. It shows that the overall results are similar to those obtained in the first case study. All estimated coefficients are statistically significant at the 95 % significance level and the signs are in anticipated direction. The logarithmic specification has smaller parameters than the original regret model; the same as in the first case study. Travel distance has more influence on choice than travel time in both regret models. Results also clearly indicate again that the original regret model is superior to the logarithmic specification in terms of predictive ability. Based on the non-nested choice model test (Ben-Akiva and Swait 1986), the probability that the logarithmic specification outperforms the original regret model is zero (\(p \le \varPhi ( - 6.178) \cong 0\)).

Table 5 Summary of estimation results: base formulations—case study 2

Specification based on Weber’s law

Next, we compare the performance of the two model specifications that include the Weber’s law against their original model specifications. The results are shown in Table 6. As in the first case study, the predictive performance of the models indicates that the models incorporating the psycho-physical relationship outperform the base models, but the degree of improvement is higher than in the first case study. This may be due to the fact that in the second case study, the attributes of all alternatives have to be profoundly perceived rather than explicitly be shown to the respondents as in the first case study. Therefore, the difference between cognitive representation of attribute values and their objective counterparts are likely to be larger, reflected by the higher improved performance of the model incorporating this phenomena.

Table 6 Summary of estimation results: specification based on Weber’s law—case study 2

All parameters are statistically significant at the 95 % significance level, and the signs of all parameters are in anticipated direction. The absolute value of the parameters is increased compared to the base models. Travel distance is more influential than travel time in all models. The results of the non-nested choice model test (Ben-Akiva and Swait 1986) reveal that the specifications based on Weber’s law better predict observed choice behavior than the base formulations. The probability that the original regret model is superior to its specification based on the Weber’s law is zero (\(p \le \varPhi ( - 4.785) \cong 0\)), and the probability that the logarithmic specification outperforms its specification based on Weber’s law is also zero (\(p \le \varPhi ( - 3.567) \cong 0\)).

Specification based on the generalized Weber’s law

When the regret models (original regret and logarithmic specification) are specified based on the generalized Weber’s law, their predictive performance is improved. Again, this finding is consistent with the results obtained in the first case study. Table 7 shows the results. Similar to the specification based on Weber’s law, the improvement is higher in the second case study. The probability that the original regret model outperforms the specification based on the generalized Weber’s law is zero (\(p \le \varPhi ( - 7.518) \cong 0)\). Similarly the probability that the logarithmic specification is superior to the variant based on the generalized Weber’s law is zero (\(p \le \varPhi ( - 6.222) \cong 0\)).

Table 7 Summary of estimation results: specification based on the generalized Weber’s law—case study 2

The power coefficient for travel time and distance is somewhat higher in the logarithmic specification. It is slightly over 0.7 for the original regret model, and close to 0.8 for the logarithmic specification. These values are closer to 1 than in the first case study meaning that the difference between objective attribute difference and cognitive representation is larger in the second case study incurred by the type of choice mechanism. (between revealed preference and stated choice experiment).

Validation results

The models estimated in this second case study were also validated using the K-fold cross-validation method. To that end, the data were randomly split into 6 different subsets. The models were estimated pooling 5 of these subsets, and the estimated model was used to predict the observations in the 6th subset. By repeating this procedure 6 times, each subset is used for validating the model once.

Original models

First, the original regret and the logarithmic specification were put to the cross-validation test. Estimation results across the 5 subsets are similar to the estimation results of the full data set. The values of the parameters are close and the original specification of the regret model consistently outperforms the later introduced logarithmic specification. The validation results are presented in Fig. 4. It demonstrates that the original regret model is superior to the logarithmic specification.

Fig. 4
figure 4

2-norms values for K-fold cross-validation test: Comparison of original regret and logarithmic specification—case study 2

Specification based on Weber’s law

Second, the cross validation test was applied to the base formulations of each regret model and their variants based on Weber’s law. The estimated models are similar across the training sets. Figure 5 shows the validation result in terms of the 2-norms. The overall results are similar to those obtained for the first case study, but the dominance relationship between the models is clearer. Except for small fluctuations, the specifications based on Weber’s law are superior to the corresponding base formulations of the two regret models.

Fig. 5
figure 5

2-norms values for K-fold cross-validation test: comparison of original formulations and their variant based on Weber’s law—case study 2. a Original regret and its variant based on Weber’s law. b Logarithmic specification and its variant based on Weber’s law

Specification based on the generalized Weber’s law

Lastly, each regret model and their perception-based variants based on the generalized Weber’s law were subjected to the cross-validation test. The full data set seems suitable to estimate the parameters and assess model fit. The validation results using the 2-norms statistic are shown in Fig. 6. Each variant is found to have a lower value, meaning that their predictive power is better than that of the base formulations. The results reinforce the findings of the first case study. Incorporating the psycho-physical relationship into the regret model improves their predictive ability. The difference between the values of 2-norms is larger than for the specification based on Weber’s law.

Fig. 6
figure 6

2-norms values for the K-fold cross-validation test: comparison of original formulations and their variants based on the generalized Weber’s law—case study 2. a Original regret and its variant based on the generalized Weber’s law. b Logarithmic specification and its variant based on the generalized Weber’s law

Conclusions and discussion

This paper has documented the formulation and empirical test of new regret-based choice models. The new models differs from their original, base formulations that mark the evolution of regret-based models in transportation research in that regret is not defined in terms of attributes differences between the considered choice alternative and the (best) non-chosen choice alternatives, but rather as the ratio of attribute differences to the absolute attribute values of the considered alternative. This transformation is assumed to represent the mapping of objective attribute values into cognitive attributes. Two specifications were tested: model variants incorporating the classic Weber’s law and model variants based on the generalized Weber’s law. These specifications are motivated by the results of psycho-physical research which has shown that the perception of change in a stimuli is relative to the absolute intensity of the stimuli. Although this research has traditionally focused on physical stimuli such as noise and sound, the perception of other stimuli may be driven by similar perceptual mechanisms.

The results of two case studies, one based on stated choice data, the other on revealed choice data, convincingly and consistently show that for both the original best only regret model and the paired comparison logarithmic specification of the regret model, incorporating the mapping function significantly outperform the original base specifications. These findings thus support the conclusion that it is crucial to account for the size of attributes of alternatives in addition to the attribute difference in regret-based models (and maybe in utility-based models as well) to better capture choice behavior. The evidence from the two case studies clearly shows that the inclusion of the perception component in the specification of the regret function outperforms the original random regret models. Further evidence is provided by the results of the K-fold cross-validation, which shows that the overall results also hold for the various subsets.

Moreover, results of the model comparisons demonstrate that the most general model, the one based on the generalized Weber’s law, performs best. This is not surprising in the sense that the classic Weber is a special case of the generalized Weber’s law. However, results indicate that the models based on the generalized Weber’s law perform significantly better and that the extra estimated parameter in the generalized function is not equal to one.

Finally, as a by-product of this study, these two case studies replicate the findings of another study (Rasouli and Timmermans 2015a), which concluded that the original regret model, based on the max operator outperforms the later introduced logarithmic specification. Of course, further empirical analyses are needed to increase our body of knowledge regarding the empirical performance of different regret models.

The main contribution of this paper is the attempt to include a perceptual mechanism in the formulation of regret models. It is argued that if perception involves a non-linear mapping of physical attribute values into cognitive values, commonly used regret-minimization models lead to less accurate results, which can be mitigated by considering the size of attributes of each alternative in the regret formulation. It goes without saying that this conclusion is based on these two case studies only. It would be interesting and valuable to replicate the comparison on other data to judge the generalizability of the current findings to different data sets and spatial and cultural settings. For the case of stated choice data, it would be interesting to see to what extent results depend on the size of attribute variation.

The results reported in this paper have been deliberately limited to the two pure random regret models. The evolution of regret-based models in transportation research however also includes attempt to formulate mixed utility/regret models Hess et al. (2012), Hess and Stathopoulos (2013) and Boeri et al. (2014). Further research should analyze whether the inclusion of the psycho-physical relationship will also improve the predictive success of other regret-based or hybrid models (e.g. Chorus 2014b).