Introduction

In order to transform the continuous stream of perceptual information into goal-directed action, people need to make decisions. These decisions may involve any number of alternative options, often more than two. This may be why psychological theorizing and experimentation on perceptual decision-making have recently shifted focus from binary choice (e.g., Britten, Shadlen, Newsome, & Movshon, 1992; Ratcliff, 1978) to multiple choice (e.g., Brown, Steyvers, & Wagenmakers, 2009; Ditterich, 2010; Hawkins, Brown, Steyvers, & Wagenmakers, in press; Ho, Brown, & Serences, 2009; Leite & Ratcliff, 2010). Multiple-choice decision-making is also an important research topic because it allows researchers to validate theories initially developed for binary choices (e.g., Ratcliff, 1978; Smith & Vickers, 1988). If multiple-choice decision-making is a natural generalization of binary choice, then these models should also generalize to multiple-choice contexts in order to remain valid descriptions of the underlying decision process.

Random-dot motion

A task often used in the study of perceptual decision-making is the random-dot motion (RDM) task (Ball & Sekuler, 1982; Britten et al., 1992; Churchland, Kiani, & Shadlen, 2008; Forstmann et al., 2010; Forstmann et al., 2008; Ho et al., 2009; Mulder et al., 2010; Niwa & Ditterich, 2008; Pilly & Seitz, 2009; Roitman & Shadlen, 2002; Salzman & Newsome, 1994; Shadlen & Newsome, 2001; van Maanen, Brown, et al., 2011). In this task, participants are required to indicate the apparent direction of motion of a cloud of dots that is presented on a computer screen. Typically, a percentage of the dots move in a designated direction (the target direction), while the remaining dots move randomly. The percentage of coherently moving dots is often used as a measure of task difficulty (see, e.g., Britten et al., 1992). In many experiments, the potential target directions are indicated on an imaginary circle surrounding the dot cloud (Fig. 1). In particular, the RDM task is an experimental paradigm often used to study behavior associated with multiple-choice (n > 2) decision-making (Ball & Sekuler, 1982; Churchland et al., 2008; Ho et al., 2009; Niwa & Ditterich, 2008; Salzman & Newsome, 1994; Shadlen & Newsome, 2001). One reason for this may be that the paradigm extends rather naturally to more alternatives. However, one complicating factor in applying RDM to multiple-choice decision-making is that as the number of response alternatives changes, so does their similarity.

Fig. 1

Configurations of alternatives in a typical random-dot motion display. Note that as the number of alternatives increases, the angular distance (i.e., similarity) between two adjacent alternatives decreases. These particular configurations were also used in Experiment 1.

Similarity and number of alternatives interact

In spite of the ease with which the RDM paradigm can be extended to multiple-choice tasks, it is difficult to study the effects of increasing the number of alternatives. This is because introducing an extra target inevitably decreases the angular distance between the alternatives, which increases their similarity in terms of location.

Overview

To disentangle the effects of similarity and number of alternatives, we will analyze optimal behavior in the RDM task. First, we introduce a model of an optimal observer in the RDM task. The model applies Bayesian principles to give an account of how changes in the stimulus (such as the similarity between alternatives) influence the decision-making process. We also show how this model can be neurally implemented. The model predictions will be experimentally verified in the following sections. Because the optimal-observer model describes behavioral changes under a rationality assumption, we additionally fit a linear ballistic accumulator (LBA; Brown & Heathcote, 2008) model to the data. The LBA model is a process model of decision-making that can be fit to the response time distributions of correct and error responses (cf. Ratcliff, 1978). This model allows us to study whether the changes that are predicted by the optimal-observer model are reflected by changes in latent variables in the decision process. These should appear as parameter changes of the LBA model.

Optimal behavior in RDM

The RDM stimulus consists of a set of dots, each moving in a particular direction. A proportion of the dots move in the same direction, and the remaining dots move in random directions. To make a decision, an optimal observer could simply count the number of dots that move in each direction and choose the direction with the most dots. However, the perceptual system introduces variance in the perceived motion—for instance, because motion-sensitive neurons in the brain not only respond to their preferred motion direction, but also to similar directions. Thus, in order to make a correct decision, the observer needs to decide whether a certain amount of evidence for a particular response alternative outweighs the evidence for other alternatives. One way to define optimal performance would be to achieve the minimum average response time for a prespecified error rate (cf. Bogacz, Brown, Moehlis, Holmes, & Cohen, 2006). A process that implements this strategy can be described as follows. The optimal observer computes for each response alternative the posterior probability that this alternative is the target, on the basis of the evidence observed so far:

$$ P(H_i \mid D) = \frac{P(D \mid H_i)\,P(H_i)}{\sum_j P(D \mid H_j)\,P(H_j)}, $$
(1)

with \( H_i \) being the hypothesis that motion direction i generated the RDM stimulus, and D being the set of observed dot movements. Here we assume that the prior probabilities for each alternative are equal (as they will be in the subsequent experiments) and hence can be ignored. Equation 1 then simplifies to

$$ P(H_i \mid D) = \frac{P(D \mid H_i)}{\sum_j P(D \mid H_j)}. $$
(2)

On the basis of new incoming evidence, the model continuously recomputes the posterior probability of each response alternative until the probability of one of the alternatives crosses a preset response criterion.

The above scheme implements the multihypothesis sequential probability ratio test (MSPRT; Baum & Veeravalli, 1994), which generalizes the sequential probability ratio test (SPRT; Wald, 1947) to more than two alternatives. Given a particular response criterion, the model stops sampling and initiates a decision as soon as the posterior probability of any of the alternatives crosses that criterion. Because the posterior probability reflects the probability that a particular choice is correct, the model reports the correct alternative on a proportion of trials at or above the response criterion.

In addition to the MSPRT, other definitions of optimal behavior exist. For example, the observer could take the cost of making an error into account and weigh it against the cost of sampling more evidence (and responding later). One such strategy is to maximize the proportion of correct responses per unit time (the reward rate; Bogacz et al., 2006; Gold & Shadlen, 2002; Hawkins et al., in press; Simen et al., 2009). The choice of strategy most likely differs between experimental contexts as well as between individuals (Hawkins et al., in press). In contexts in which the quality of the stimulus is uncertain, such as typical RDM tasks in which stimulus coherence is manipulated on a trial-to-trial basis (e.g., Palmer, Huk, & Shadlen, 2005), the SPRT does not provide an accurate description of the data (cf. Bogacz et al., 2006; Hanks, Mazurek, Kiani, Hopp, & Shadlen, 2011). In the present setup, however, the quality of the stimulus is constant, and the SPRT thus allows for a straightforward analysis of the decision-making process. In the simulation sections below, we provide simulations using both implementations of optimal choice.

Evidence accumulation

The posterior probability of each alternative is computed on the basis of the evidence contained in the stimulus. In the RDM paradigm, this means that the participant perceives moving dots, and the perceived directions of movement are combined into evidence for each alternative.

Each movement is encoded in the brain by motion-sensitive neurons (Britten et al., 1992). Each motion-sensitive neuron is tuned to a particular motion direction; that is, each neuron has a preferred motion direction. The functional form of these tuning curves follows a von Mises function (Swindale, 1998). The von Mises function for an angle ϕ is given by

$$ f(\phi \mid \mu, \kappa) = \frac{e^{\kappa \cos(\phi - \mu)}}{2\pi I_0(\kappa)}, $$
(3)

where \( I_0(x) \) is the modified Bessel function of order 0, μ indicates the preferred motion direction of the neuron, and κ is the neuron’s response specificity, which may be used to express individual differences in motion perception ability.

In response to a stimulus with motion direction ϕ, each neuron i produces a spike train that follows a Poisson process with a firing rate equal to \( f(\phi \mid \mu_i, \kappa) \). That is, within a time step of length ∆t starting at times t = 1, . . . , T, the number of spikes \( n_{it} \) is distributed as a Poisson random variable:

$$ n_{it} \sim \text{Pois}\!\left[ n \mid f(\phi \mid \mu_i, \kappa) \right] = \frac{f(\phi \mid \mu_i, \kappa)^n}{n!}\, e^{-f(\phi \mid \mu_i, \kappa)}. $$
(4)

The likelihood of a particular motion direction ϕ for the spikes in a single time step (i.e., from t to t + ∆t) is then

$$ L_t(\phi) = \prod_{i=1}^{N} \text{Pois}\!\left[ n_{it} \mid f(\phi \mid \mu_i, \kappa) \right]. $$
(5)

Here we assume that the spike trains in each direction-sensitive neuron are independently distributed, which is approximately the case for cortical pyramidal cells (Beck et al., 2008; Jazayeri & Movshon, 2006).
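For concreteness, the tuning curve of Eq. 3 and the Poisson spike model of Eqs. 4 and 5 can be sketched in a few lines of Python. This is a minimal illustration, not the original simulation code: the number of neurons, the gain that converts the tuning curve into an expected spike count per time step, and the random seed are illustrative assumptions.

import numpy as np
from scipy.special import i0  # modified Bessel function of order 0

def von_mises(phi, mu, kappa):
    """Tuning curve of Eq. 3: response of a neuron with preferred direction mu
    and specificity kappa to a motion direction phi (all in radians)."""
    return np.exp(kappa * np.cos(phi - mu)) / (2 * np.pi * i0(kappa))

# Illustrative assumptions: 36 motion-sensitive neurons whose preferred
# directions tile the circle, a common specificity kappa, and a gain that
# scales the tuning curve into an expected spike count per time step.
rng = np.random.default_rng(1)
N_NEURONS = 36
mu_i = np.linspace(-np.pi, np.pi, N_NEURONS, endpoint=False)
kappa = 0.5
gain = 1.0

def simulate_spikes(phi):
    """Poisson spike counts of all neurons for one time step (Eq. 4),
    given a perceived motion direction phi."""
    return rng.poisson(gain * von_mises(phi, mu_i, kappa))

def log_likelihood(phi, counts):
    """Log of Eq. 5 (up to the phi-independent n! terms): log-likelihood of a
    motion direction phi for the spike counts of one time step, assuming
    independent Poisson neurons."""
    rates = gain * von_mises(phi, mu_i, kappa)
    return np.sum(counts * np.log(rates) - rates)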

Over time, the observer perceives a set of motion directions D, a subset of which is coherent and moves in the target direction, whereas the complement is uniformly distributed over random directions. The observer’s challenge is to accumulate evidence for each of the possible directions of movement. The optimal observer applies Bayes’s rule (Eq. 2) to all choice alternatives on the basis of the likelihood of each alternative \( j \) (Footnote 1):

$$ P(D \mid \mu_j, \kappa) = \prod_{t=1}^{T} \prod_{i=1}^{N} \text{Pois}\!\left[ n_{it} \mid f(\mu_j \mid \mu_i, \kappa) \right]. $$
(6)

The posterior probabilities in Eq. 2 can now be expressed as

$$ P(\mu_j \mid D) = \frac{\prod_{t=1}^{T} \prod_{i=1}^{N} \text{Pois}\!\left[ n_{it} \mid f(\mu_j \mid \mu_i, \kappa) \right]}{\sum_{k=1}^{M} \prod_{t=1}^{T} \prod_{i=1}^{N} \text{Pois}\!\left[ n_{it} \mid f(\mu_k \mid \mu_i, \kappa) \right]}, $$
(7)

which simplifies to

$$ P(\mu_j \mid D) = \frac{\exp\!\left[ \sum_{i=1}^{N} \log f(\mu_j \mid \mu_i, \kappa) \sum_{t=1}^{T} n_{it} \right]}{\sum_{k=1}^{M} \exp\!\left[ \sum_{i=1}^{N} \log f(\mu_k \mid \mu_i, \kappa) \sum_{t=1}^{T} n_{it} \right]}. $$
(8)

Equation 8 can be interpreted as follows: Each response alternative \( \mu_j \) is represented by a counter that collects all spikes from the neurons encoding that direction. Thus, the posterior probability of each alternative depends on the ratio of the spike counts across all alternatives (Footnote 2). Beck et al. (2008) showed that this model can be implemented by networks of neuronal populations, giving credence to the idea that the brain computes optimal choice.
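Building on the sketch above (which defines von_mises, mu_i, gain, kappa, and rng), the sequential decision process implied by Eqs. 6–8 can be illustrated as follows. The way the stimulus is generated here (one perceived motion direction per time step, drawn from a von Mises distribution centred on the target) and the default parameter values are assumptions of this sketch, not a specification of the original simulation code.

from scipy.special import logsumexp

def optimal_observer(target, alternatives, kappa=0.5, criterion=0.9,
                     max_steps=5000):
    """One simulated decision. On each time step a perceived motion direction
    is drawn from a von Mises distribution centred on the target (matching the
    sampling distributions of Fig. 2), spikes are generated as in the sketch
    above, and the posterior of Eq. 8 is monitored until one alternative
    crosses the criterion (the MSPRT stopping rule).
    Returns (chosen direction, decision time in steps)."""
    alternatives = np.asarray(alternatives, dtype=float)
    # log f(mu_j | mu_i, kappa): one row per alternative j, one column per neuron i
    log_w = np.log(von_mises(alternatives[:, None], mu_i[None, :], kappa))
    counts = np.zeros(len(mu_i))
    for t in range(1, max_steps + 1):
        phi = rng.vonmises(target, kappa)                           # perceived motion sample
        counts += rng.poisson(gain * von_mises(phi, mu_i, kappa))   # Eq. 4
        log_post = log_w @ counts
        log_post -= logsumexp(log_post)                             # normalization of Eq. 8
        if log_post.max() >= np.log(criterion):                     # MSPRT stopping rule
            break
    return alternatives[int(np.argmax(log_post))], t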

The optimal-observer model is intended to analyze the RDM task in terms of optimal performance. That is, given the noisy nature of the stimulus and a particular response criterion, the model shows what the optimal behavior should be to minimize response times. This is not to say that the model is the definitive account of human behavior. There are a number of ways in which observers can deviate from the model’s predictions. For instance, individual observers may take the cost of sampling into account differently, and therefore weigh the cost of time differently.

Simulations with the optimal-observer model: I. Similarity

As time progresses, more and more observations of moving dots become available. When, for a particular duration T and movement direction \( \mu_j \), the posterior probability reaches the response criterion, the model gives response j with decision time T. This is illustrated in Fig. 2. The top left panel shows a condition with two alternatives (A and B). The mean direction of motion is \( \mu_A = 0 \), and the alternative choice is located at \( \mu_B = \pi \) (note that, due to the circular arrangement of alternatives, we can equivalently say that \( \mu_B = -\pi \)). For example, if we assume that the specificity of the von Mises function is κ = 1, the probability of drawing a sample that is closer to the incorrect alternative is only .11. This means that the majority of evidence samples (i.e., perceived movements) indicate that A is the correct alternative. Therefore, this target configuration represents a relatively easy decision-making task, and the response criterion indicated by the horizontal gray line is reached quickly (Fig. 2, top right panel).

Fig. 2

The left column presents sampling distributions of perceived motion and the locations of the alternatives for three different situations; the right column shows illustrations of evidence accumulation in the model. (Top panels) Two response alternatives with an angular distance of π. (Middle panels) Two response alternatives with an angular distance of \( \frac{1}{2}\pi \). (Bottom panels) Four response alternatives with an angular distance of \( \frac{1}{2}\pi \). In the right panels, the gray lines represent the criterion value (set at 0.9). The dotted vertical lines represent the time step at which one of the posterior probabilities first crosses (or has crossed) the response criterion (that is, the decision time). The meandering lines represent the different choice alternatives

If the angular distance between alternatives changes, so does the probability that a response alternative is correct, given a sample from a distribution with a particular concentration. In the example in the top panels of Fig. 2, an evidence sample with perceived movement x = 1 contributes more to the posterior probability of A than to the posterior probability of B, because the distance between x and \( \mu_A \) is smaller than the distance between x and \( \mu_B \). As the angular distance between A and B decreases, the probability increases that a particular evidence sample contributes more to B than to the target A. This is depicted in the middle left panel of Fig. 2. Here, the foil B with \( \mu_B = \frac{1}{2}\pi \) is the more likely alternative given evidence sample x, since \( |\mu_B - x| < |\mu_A - x| \). Consequently, average decision times are longer, because the target locations are more similar than in the situation depicted in the top panels. Note that because the top and middle panels represent binary choices, every increase in the posterior probability of one alternative is mirrored by a decrease in the posterior probability of the other alternative.

We explored the model’s behavior as a function of angular distance by running a binary choice simulation. That is, the model had two response options, and we manipulated their angular distance. The angles were chosen to be representative of the similarities between alternatives in a typical multiple-choice RDM experiment: \( \frac{2}{3}\pi, \,\frac{2}{5}\pi, \,\frac{2}{7}\pi, \,{\text{and}}\,\frac{2}{9}\pi \). The κ parameter that controls the concentration of the von Mises distribution was kept constant at κ = 0.5. Each data point in Fig. 3 is based on 10,000 simulated decisions.
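As a usage illustration, the snippet below reruns this binary-choice simulation with the optimal_observer function sketched earlier. The number of simulated decisions is reduced for brevity, and the resulting values are only indicative of the qualitative pattern shown in Fig. 3, not a reproduction of it.

# Illustrative rerun of Simulation I: two alternatives, the target at 0 and the
# foil at the given angular distance, kappa = 0.5, criterion = 0.9, and 1,000
# simulated decisions per angle (the original simulation used 10,000).
for frac in (2/3, 2/5, 2/7, 2/9):
    distance = frac * np.pi
    outcomes = [optimal_observer(0.0, [0.0, distance], kappa=0.5, criterion=0.9)
                for _ in range(1000)]
    mdt = np.mean([t for _, t in outcomes])
    accuracy = np.mean([choice == 0.0 for choice, _ in outcomes])
    print(f"angular distance {frac:.3f} pi: MDT = {mdt:6.1f}, accuracy = {accuracy:.2f}")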

Fig. 3

The optimal-observer model predicts a linear decrease of the logarithm of the mean decision time (MDT) with the logarithm of the angular distance between alternatives. (Left) MDT versus angular distance. (Middle) Log(MDT) versus log(angular distance). (Right) Accuracy versus angular distance. Note: In the legend, c represents the response criterion value

As Fig. 3 shows, the optimal-observer model predicts that the mean decision time (MDT) decreases as the angular distance between the alternatives increases. In particular, the model predicts that log MDT will decrease approximately linearly with the logarithm of the angular distance (Fig. 3, middle panel). Because the model strives for a fixed proportion of correct responses, the accuracy rates are constant over the different angular distances.

Simulations with the optimal-observer model: II. Number of alternatives

If the number of alternatives increases, the prior probability of each alternative decreases. As a consequence, the distance between the start point and the response criterion takes longer to cross, and MDT therefore increases. The bottom panels of Fig. 2 illustrate this with a choice between four alternatives. MDT is prolonged both because the prior probabilities are lower and because the increase in the likelihood of each alternative is smaller; the latter occurs because the circular target arrangement in the RDM task necessarily results in an interaction between similarity and the number of alternatives.

In Simulation I, we focused on how decision-making in RDM depends on the similarity between alternatives by manipulating the angular distance between the choice alternatives. Now, we simulate the additional effect of increasing the number of alternatives, as in a typical RDM experiment. We ran the model 10,000 times for three, five, seven, and nine response alternatives in a circular arrangement. This arrangement represents the conventional way of extending the number of alternatives in the RDM paradigm. Because the alternatives are spaced over the full circle, the angular distances between two successive alternatives were \( \frac{2}{3}\pi, \,\frac{2}{5}\pi, \,\frac{2}{7}\pi, \,{\text{and}}\,\frac{2}{9}\pi \), respectively, as in the similarity simulation discussed above. On each trial, the mean of the sampling distribution was randomly chosen to be one of the alternative directions. The concentration of the sampling distribution was again κ = 0.5.

We simulated behavior for two response strategies that may be considered optimal. The first strategy minimizes response time for a given response criterion. This strategy implements the MSPRT (Dragalin, Tartakovsky, & Veeravalli, 1999). Whereas the MSPRT is a straightforward notion of optimal behavior, participants could also take other information into account while making a decision. The second strategy explores optimal choice by assuming that decision-makers maximize the reward rate over the course of an experiment (Bogacz et al., 2006; Gold & Shadlen, 2002; Hawkins et al., in press; Simen et al., 2009). The reward rate is defined as the average proportion of correct responses (P c) per unit time (here defined as the sum of the mean response time MRT and the intertrial interval ITI):

$$ \text{RR} = \frac{P_c}{\text{MRT} + \text{ITI}}. $$
(9)

Maximizing the reward rate entails that the proportion of correct responses be balanced against the time required to provide a correct response. Therefore, as \( P_c \) decreases (for example, due to increased task difficulty), the maximal reward rate is obtained through faster responding, which allows for more responses, and therefore more correct responses, per unit time. In this simulation, the intertrial interval was set at ITI = 50.
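One simple way to implement the reward-rate-based strategy within the sketches above is to estimate Eq. 9 by simulation for a grid of candidate criterion values and to select the criterion with the highest estimated reward rate. The grid, the number of simulated trials, the example of five alternatives, and the omission of a nondecision time are assumptions of this illustration.

def reward_rate(criterion, alternatives, kappa=0.5, iti=50, n_trials=500):
    """Estimate the reward rate of Eq. 9 by simulation: the proportion of
    correct responses divided by the mean response time plus the intertrial
    interval. Ignoring nondecision time is a simplifying assumption."""
    alternatives = np.asarray(alternatives, dtype=float)
    correct, rts = [], []
    for _ in range(n_trials):
        target = rng.choice(alternatives)
        choice, dt = optimal_observer(target, alternatives, kappa=kappa,
                                      criterion=criterion)
        correct.append(choice == target)
        rts.append(dt)
    return np.mean(correct) / (np.mean(rts) + iti)

# Reward-rate-based strategy: pick the criterion with the highest estimated
# reward rate (a coarse grid search over candidate criterion values).
five_alternatives = np.linspace(0, 2 * np.pi, 5, endpoint=False)
grid = np.arange(0.55, 0.96, 0.05)
best_criterion = max(grid, key=lambda c: reward_rate(c, five_alternatives))
print(best_criterion)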

Independently of the choice of optimal strategy (fixed criterion or reward-rate based), the model predicts a linear, or even supralinear, increase of response time with the number of alternatives (see Fig. 4). This result is inconsistent with previous work on multiple-choice behavior, which found that mean response times increase with the log of the number of alternatives (sometimes referred to as Hick’s law; Brown et al., 2009; Hick, 1952; Hyman, 1953; Teichner & Krebs, 1974). The model does not predict Hick’s law in this simulation, because the decision-making process is influenced both by changes in likelihood accumulation (due to changing similarity) and by changes in prior probability (due to changing the number of alternatives).

Fig. 4

The optimal-observer model predicts a linear or supralinear increase in mean decision time (MDT) with increasing numbers of alternatives (N). (Left) MDT versus N. (Middle) MDT versus log N. (Right) Accuracy versus N. RR, reward-rate-based criterion value; MSPRT, fixed criterion value, set at the average of the criteria estimated for the RR strategy (0.72)

With respect to predictions of choice accuracy, the optimal strategies diverge. The fixed-criterion strategy predicts that accuracy will be equal for all N, consistent with the idea that the model strives for a particular proportion of correct responses. The reward-rate-based strategy predicts that accuracy will decrease with increasing N, because this is equivalent to minimizing the total experiment time. This can be appreciated by observing the differences in the predicted MDTs between the models: The sum of MDT over all N is higher for the fixed-criterion than for the reward-rate-based strategy.

From the analysis of the optimal-observer model, two important properties of behavior in the RDM task emerge. First, the change over time in posterior probability is affected by the locations of the choice alternatives, and therefore indirectly by the number of alternatives, because changing the number of alternatives changes their locations. Second, the amount of evidence that is required for a decision is directly affected by the number of alternatives, because more alternatives decrease the initial value of the posterior probabilities that are monitored in the optimal-observer model. These properties were empirically tested in two experiments. Experiment 1 showed, in a conventional multiple-choice RDM task, that participants adjust the required amount of evidence when faced with more alternatives. In addition, Experiment 1 showed that the speed with which the posterior probabilities increase changes with more alternatives, which independently affects behavior. In Experiment 2, we tested the hypothesis that the similarity between alternatives affects the increase in posterior probability, whereas the number of alternatives affects the required amount of evidence.

The linear ballistic accumulator model

To corroborate the predictions for Experiments 1 and 2, we analyzed the data using the linear ballistic accumulator model (LBA; Brown & Heathcote, 2008). While the optimal-observer model can be considered a normative model of decision-making, as it describes optimal choice, the LBA model is a descriptive model and can be successfully fit to experimental data. In particular, the LBA model accounts for response time distributions for both correct and incorrect responses, as well as for the proportion of correct responses.

Similar to the optimal-observer model, the LBA model assumes that a decision is made via the accumulation of evidence for a particular alternative (Fig. 5). The LBA model assumes that each alternative is represented by its own accumulator. During a trial, evidence for each choice alternative j accumulates at a fixed rate (drift rate \( v_j \)) until a critical value (the decision criterion b) has been reached. To account for variability in the data, the LBA model assumes that the drift rates on a trial are drawn from a normal distribution. The mean of that distribution differs per accumulator, while the variance (s) is typically kept constant. In addition, the LBA model assumes that the start point is noisy as well and is drawn from a uniform distribution with bounds [0, A]. These two sources of variability prove sufficient to account for many benchmark phenomena in decision-making tasks (Brown & Heathcote, 2008). The LBA model also estimates the time that cannot be explained by any of the other components (the nondecision time \( T_{er} \)).
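The forward (simulation) side of the LBA model described above can be sketched as follows. The parameter values, the seed, and the handling of non-positive drift rates are purely illustrative and do not correspond to the fits reported below.

import numpy as np

rng = np.random.default_rng(2)

def lba_trial(v_means, b=1.0, A=0.5, s=0.25, t_er=0.2):
    """Simulate a single LBA trial: each alternative has its own accumulator
    with a start point drawn from Uniform(0, A) and a drift rate drawn from
    Normal(v_mean, s); the first accumulator to reach the criterion b wins,
    and the nondecision time t_er is added to its finishing time."""
    v_means = np.asarray(v_means, dtype=float)
    starts = rng.uniform(0.0, A, size=v_means.size)
    drifts = rng.normal(v_means, s)
    # Accumulators with a non-positive drift never reach the criterion.
    times = np.where(drifts > 0,
                     (b - starts) / np.where(drifts > 0, drifts, 1.0),
                     np.inf)
    winner = int(np.argmin(times))
    return winner, t_er + times[winner]

# Example: one accumulator for correct and one for incorrect responses, with
# mean drift rates summing to 1 as in the scaling constraint used for the fits.
choice, rt = lba_trial(v_means=[0.7, 0.3])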

Fig. 5

Schematic representation of the linear ballistic accumulator model. Each response alternative is represented by its own accumulator. The model chooses the response alternative with the fastest decision time on a given trial. See the text for details

Following the prediction from the optimal-observer model, we hypothesized that the drift rate would decrease with an increasing number of alternatives, reflecting the decreased likelihood of the alternatives in the optimal-observer model. In addition, we hypothesized that response caution would increase with the number of alternatives, to reflect the decrease in prior probability predicted by the optimal-observer model (cf. Churchland et al., 2008).

Experiment 1: Similarity and number of alternatives interact

Experiment 1 was based on the prototypical multi-alternative case in RDM: the alternatives were presented as targets on a circle surrounding a moving-dot kinematogram and were maximally spaced, as shown in Fig. 1. Participants were instructed to respond by moving a joystick in the target direction.

Method

Participants

A group of 10 students (8 female, age range 18–49 years) from the University of Amsterdam student pool participated for course credit. All had normal or corrected-to-normal vision.

RDM stimulus

To create the moving-dot kinematogram of Experiment 1 (and subsequent experiments), we used the Variable Coherence Random-Dot Motion (VCRDM) library for Psychtoolbox (Brainard, 1997) (Footnote 3). The appearance of motion in VCRDM is created by controlling the locations of a subset of dots for three frames in a row. That is, when the second frame is drawn, the locations of a subset of dots are recomputed to align with the target direction, and the locations of the remaining dots are randomly assigned. The size of this subset, often referred to as the coherence level, is under the experimenter’s control. We set the coherence at 35%. Pilot studies indicated that this coherence level made the task sufficiently demanding for our participants, especially when choosing between seven or nine alternatives. Each dot consisted of 3 × 3 pixels, and the initial locations of each dot sequence were uniformly distributed in an aperture of 5 visual degrees in diameter.

Design and procedure

Participants were instructed to indicate the apparent direction of motion of the moving-dot kinematogram. To do this, they could move a joystick in the direction of one of several alternatives, which were indicated by yellow circles. The locations of the alternatives were randomized over the full circle. The distance of each alternative from the center of the aperture was 5 visual degrees. There were either three, five, seven, or nine alternatives present, distributed around the circle in such a way that the angular distance between them was maximized (Fig. 1). We used odd numbers of alternatives to ensure that two alternatives were never diametrically opposed. This is important, because the flow of motion is sometimes perceived to be in the opposite direction (Anstis & Mackay, 1980), which would increase the error rates for even configurations of alternatives. The trial order was pseudorandomized such that no more than two consecutive trials had the same number of alternatives. After 32 practice trials (8 for each condition), the experiment was presented in eight blocks of 144 trials (1,152 trials in total). After each block, the participant could take a short break. The participants were instructed to respond as quickly as possible without making any errors. Feedback on their responses was provided by color-coding the alternatives: if participants selected the target, it turned green; if participants selected a foil, the foil turned red and the target turned green. Note that the participants did not receive feedback during a trial (that is, there was no cursor present in the display). Response times were defined as the duration between stimulus onset and the moment at which the joystick passed the imaginary circle on which the alternatives were aligned. The nearest alternative was taken as the participant’s choice. At the beginning of each trial, a red fixation dot was presented together with the alternatives. After 500 ms, the RDM stimulus was presented, which remained on screen until the participant made a response. If the response was faster than 200 ms, a feedback screen appeared that stated “Te snel!” (too fast!).
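The scoring rule described above (take the alternative nearest to the point at which the joystick crossed the imaginary circle) can be made explicit with a small sketch; the coordinate convention (aperture-centred x, y coordinates, alternatives given as angles) is an assumption of this illustration.

import numpy as np

def classify_response(joystick_xy, alternative_angles):
    """Map the joystick position at the moment it crosses the imaginary circle
    onto the nearest response alternative (smallest circular distance)."""
    x, y = joystick_xy
    response_angle = np.arctan2(y, x)
    # Wrapped angular differences in (-pi, pi]
    diffs = np.angle(np.exp(1j * (np.asarray(alternative_angles) - response_angle)))
    return int(np.argmin(np.abs(diffs)))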

Results and discussion

We excluded trials on which participants responded too quickly (the trials on which “too fast” feedback was provided; 0.11% of the trials, five trials in total). We first analyzed whether joystick responses differed between correct and incorrect responses. This was done to rule out the possibility that incorrect trials reflected noise in the motor program required to execute the response rather than errors in the decision-making process. To do this, we first computed the response vector, which we defined as the line through the center of the aperture and the joystick coordinates at the time of the response. We assumed that errors related to motor noise would show a larger angular distance between the response vector and the location of the selected alternative than would correct responses and errors related to the decision process, because the intended movement was toward a different alternative. The right panel of Fig. 6 shows that this was not the case. Using the cosine of the angular distance, we collapsed responses to the left and right of the selected alternative into one measure, with a cosine of 1 representing a response aimed exactly at the chosen alternative. Although the response vector for incorrect responses deviated slightly more than that for correct responses, it is clear that at least the majority of incorrect responses reflected errors in the decision process, and not in the motor execution process.
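The response-vector measure used in this analysis amounts to the following computation (a sketch under the same aperture-centred coordinate assumption as above):

import numpy as np

def response_vector_cosine(joystick_xy, chosen_angle):
    """Cosine of the angular distance between the response vector (the line
    through the aperture centre and the joystick position at the time of the
    response) and the location of the selected alternative; a value of 1 means
    the movement was aimed exactly at the chosen alternative."""
    x, y = joystick_xy
    return np.cos(np.arctan2(y, x) - chosen_angle)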

Fig. 6

Behavioral data of Experiment 1. (Left) Mean response times (MRTs) as a function of the number of response alternatives, for Experiment 1. (Middle) Proportions of correct responses as a function of number of alternatives. Lines in these two panels represent linear model fits (see the text for details). (Right) Angular distance between the response vector and the location of the selected response alternative. The gray area indicates the distance at which another response alternative would have been closer. Error bars represent within-subjects standard errors of the means (Loftus & Masson, 1994)

Visual inspection of the response time data suggests that MRT increased linearly with the number of alternatives, which would not conform to Hick’s law (Fig. 6, left panel). To support this observation, we compared three regression models: a linear model and two log-linear models. The linear model (Model 1 in Table 1) predicted MRT as a function of the number of alternatives (N) and response type (R, correct/incorrect). The log-linear models predicted MRT as a function of the logarithm of the number of alternatives and of response type: the second model included log N as a factor (Model 2 in Table 1), and the third model included log(N + 1) (Model 3 in Table 1). The latter model was tested because Hick’s law is sometimes said to require an extra constant to account for uncertainty about whether a decision will be required at all (in other words, the extra alternative that no stimulus will occur; see, e.g., Hick, 1952). All models included response type (correct or incorrect) and its interaction with the N term. The models were fit to the data using least squares, and the fits were then compared on the basis of their maximum log likelihoods. The linear model had a higher log likelihood than either of the log-linear models (Table 1), supporting the hypothesis that the data of Experiment 1 are better described by a linear model.
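The regression comparison can be sketched as follows, assuming a long-format data frame with hypothetical column names MRT, N, and R; the log likelihood of each least-squares fit is taken from the fitted OLS model.

import numpy as np
import pandas as pd
import statsmodels.formula.api as smf

def compare_hicks_models(data: pd.DataFrame):
    """Fit the three regression models (linear in N, log N, and log(N + 1),
    each including response type and its interaction with the N term) by
    least squares and return their maximum log likelihoods."""
    formulas = {
        "linear": "MRT ~ N * R",
        "log_N": "MRT ~ np.log(N) * R",
        "log_N_plus_1": "MRT ~ np.log(N + 1) * R",
    }
    return {name: smf.ols(formula, data=data).fit().llf
            for name, formula in formulas.items()}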

Table 1 Maximum log likelihood values for three regression models fitted to Experiment 1

The results of Experiment 1 suggest that the effects of increasing the number of choice alternatives in the RDM task are at odds with the typical finding from multiple-choice experiments that the logarithm of the number of alternatives determines MRT. Previous research suggested that in some circumstances Hick’s law is not expected to apply (Kveraga, Boucher, & Hughes, 2002; Lawrence, St John, Abrams, & Snyder, 2008). However, we interpreted the results of Experiment 1 as an interaction between the effects of the number of alternatives and the similarity of the alternatives, which obscured the typical Hick’s law finding. Using the LBA model, we therefore aimed to disambiguate the effects of number and similarity.

LBA model of Experiment 1

Having established that the data were suitable for modeling the decision-making process, we turned to the LBA model fit. To improve the estimate of the response time distribution of incorrect responses, we collapsed all trials on which an incorrect response was given. As a result, the LBA model consisted of only two accumulators: one for correct responses and one for incorrect responses. In addition, we Vincentized the data to obtain group estimates of the deciles used to fit the models (Vincent, 1912) (Footnote 4). The models were fit to the data using SIMPLEX optimization routines (Nelder & Mead, 1965). For scaling purposes, we set the sum of the correct and incorrect drift rates to 1. To obtain the model that best described the data, we systematically varied which parameters were allowed to vary over the numbers of alternatives. In particular, each of the five model parameters was in turn constrained to be equal across the numbers of alternatives, and these models were compared against a model in which all parameters were allowed to vary. Using the Bayesian information criterion (BIC), we assessed which of these models best balanced the fit to the data against the number of degrees of freedom.

Table 2 details how well each model accounted for the data by presenting BIC values and Schwarz weights (Wagenmakers & Farrell, 2004). Schwarz weights quantify the support for a particular model, given the data and the set of candidate models. Even when taking the additional parameters into account, the best model was the default model in which all parameters were allowed to vary. Importantly, the models in which drift rate (v), start point (A), or threshold (b) were constrained scored worse. The best LBA model fit is presented in Fig. 7. Inspecting the parameter values of this model shows that as the number of alternatives increases (and the similarity between the alternatives as well), the drift rate estimate for the correct responses decreases (Fig. 8, middle panel). In addition to drift rate changes, we found that response caution (i.e., criterion) in the LBA model conformed with the optimal-observer predictions. The best-fitting values for the decision criterion b show an increase with increasing N.
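For reference, BIC values and the corresponding Schwarz weights can be computed from the maximized log likelihoods as in the sketch below, following Wagenmakers and Farrell (2004).

import numpy as np

def schwarz_weights(log_likelihoods, n_params, n_obs):
    """BIC values and Schwarz weights for a set of candidate models:
    BIC = -2 * logL + k * log(n); the weight of model i is
    exp(-0.5 * deltaBIC_i), normalized over the candidate set."""
    ll = np.asarray(log_likelihoods, dtype=float)
    k = np.asarray(n_params, dtype=float)
    bic = -2.0 * ll + k * np.log(n_obs)
    delta = bic - bic.min()
    weights = np.exp(-0.5 * delta)
    return bic, weights / weights.sum()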

Table 2 BIC values and Schwarz weights (Wagenmakers & Farrell, 2004) for each of the LBA models fitted to the data of Experiment 1
Fig. 7

The best linear ballistic accumulator model explains the data well. The data are presented as defective cumulative distribution plots, which plot response probability against response time (RT) quantiles for correct and error responses separately. Black crosses, correct RTs; gray crosses, error RTs; circles, model predictions

Fig. 8

Best-fitting linear ballistic accumulator model parameters as a function of the number of alternatives N, for Experiment 1. b, response criterion; A, upper bound of the start point distribution; v, mean drift rate; s, drift rate variance

Analysis of the parameters of the LBA model showed that the effects of similarity and the number of alternatives can be disentangled using a descriptive model of decision-making. Such a model-based approach would be particularly useful if similarity and the number of alternatives were manipulated independently, in which case one would be interested in the effects of the two manipulations separately. Experiment 2 demonstrated that the LBA model accounts for the separate effects of similarity and the number of alternatives in this situation as well. We introduced two similarity conditions: one in which the average similarity between the alternatives increased with more alternatives, as in Experiment 1, and one in which the average similarity between the alternatives decreased with more alternatives.

Experiment 2: A different relation between similarity and number

Participants were asked to perform the RDM task in two conditions. The spaced condition was similar to Experiment 1: either three, five, seven, or nine alternatives were equally spaced over the half circle (Fig. 9, left panel). In the clustered condition, there were nine possible stimulus locations, the same nine used in the N = 9 version of the spaced condition. The N = 3, N = 5, and N = 7 conditions were formed by using the central three, five, or seven of these nine locations. As a consequence, the average distance between alternatives increased with every extra alternative, so that in the clustered condition the average similarity between alternatives decreased with N.

Fig. 9

Configurations of alternatives of the random-dot motion display used in Experiment 2. (Left) Spaced condition. (Right) Clustered condition

The manipulation of the average angular distance between alternatives led to two clear hypotheses. First, the effect of the number of alternatives on MRT would be sublinear in the clustered condition, in line with Hick’s law. This is because the average similarity between alternatives did not increase with more alternatives, as it did in the spaced condition and in Experiment 1; response times would therefore not be lengthened by increased similarity, and the sublinear effect of the number of alternatives would not be obscured by the angular distance manipulation. Second, we predicted that responses in the spaced condition would replicate the findings of Experiment 1.

In terms of how the LBA model parameters should change with the number of alternatives, we hypothesized that the decision criterion parameter (b) would increase with more alternatives, but that it would not differ between the spaced and clustered conditions. Following the optimal-observer logic, the decision criterion should change only with the number of alternatives, and not with the angular distance between the alternatives. In contrast, we hypothesized that the drift rate parameter (v) would differ with both the number of alternatives and the angular distance condition: both increasing the number of alternatives and switching between conditions change the angular distances between alternatives, so drift rate should differ across all conditions to account for the data.

Method

Participants

A group of 5 students (3 female, age range 19–31 years) from the University of Amsterdam participated. All had normal or corrected-to-normal vision.

RDM stimulus

The stimulus was the same as in Experiment 1.

Design and procedure

The procedure was identical to that for Experiment 1. The design was also identical, except for the locations of the alternatives, which were fixed in the top half of the circle (Fig. 9). In the spaced condition, the alternatives were spaced maximally over half a circle. In the clustered condition, the alternatives were located at equal distances of \( \frac{1}{9}\pi \) from each other.

Results and discussion

We excluded trials on which participants responded too quickly (0.07% of the trials, three trials in total). Figure 10 presents the mean response times for each condition. Similar to Experiment 1, the data show an increase in MRT with more alternatives, and a decrease in accuracy. However, in contrast to Experiment 1, the increase is better described by a log-linear relation: We computed maximum log likelihoods for the three different regression models introduced in Experiment 1 and applied these to the spaced and clustered conditions separately. Table 3 shows that the data from both conditions are best described by the log-linear models, which is in accordance with Hick’s law.

Fig. 10

(Left) Mean response times (MRTs) as a function of the number of response alternatives (N) from Experiment 2. (Middle) MRTs as a function of log N. (Right) Proportions of correct responses as a function of N. Error bars represent within-subjects standard errors (Loftus & Masson, 1994), and the lines represent log-linear model fits

Table 3 Maximum log likelihood values for three regression models fitted to Experiment 2

LBA model of Experiment 2

As in Experiment 1, we fitted multiple LBA models to the data and assessed which model best balanced model fit and degrees of freedom. To obtain stable representations of the response time distributions for correct and incorrect responses in each condition, we again Vincentized the data into deciles.
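Vincentizing amounts to computing the deciles separately for each participant and averaging them across participants; a minimal sketch:

import numpy as np

def vincentize(rt_per_participant, probs=np.arange(0.1, 1.0, 0.1)):
    """Vincent-average response time distributions (Vincent, 1912): compute
    the deciles for each participant separately and average them across
    participants to obtain the group-level deciles used for fitting."""
    deciles = [np.quantile(np.asarray(rts, dtype=float), probs)
               for rts in rt_per_participant]
    return np.mean(deciles, axis=0)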

In addition to the default, unconstrained model (Model 1 in Table 4), we fitted models in which one of the parameters was constrained to be equal across the clustered and spaced conditions (Models 2–6). Under the assumption that there would be no difference in motor response between these conditions, we additionally fitted models in which \( T_{er} \) was also constrained (Models 7–10). Finally, we fitted models in which either the two drift-rate-related parameters or the two criterion-related parameters were constrained (Models 11–12), as well as versions of those models with \( T_{er} \) also constrained (Models 13–14).

Table 4 BIC values and Schwarz weights for each of the LBA models fitted to the data of Experiment 2

The LBA model that best balanced the number of free parameters against the model fit was Model 14, whose parameters are presented in Fig. 11. (The model fit is presented in Fig. 12.) As hypothesized, the best LBA model did not require the decision criterion to differ between the clustered and spaced conditions, as the difference between these conditions related only to the similarity of the alternatives. This supports the view that the decision criterion value relates to the number of alternatives only. In contrast, the best model did require different drift rate values. This reflects that, for fewer alternatives, the angular distance in the spaced condition was larger than in the clustered condition, and that this difference decreased as the number of alternatives grew (see also Fig. 9). The model comparison also gave no reason to assume that different nondecision times were required for the clustered and spaced conditions, although the model without that constraint had a very close (but worse) BIC value.

Fig. 11

Best-fitting linear ballistic accumulator model parameters as a function of the number of alternatives N, for Experiment 2. b, response criterion; A, upper bound of the start point distribution; v, mean drift rate; s, drift rate variance

Fig. 12

Best-fitting linear ballistic accumulator model for Experiment 2. The data are presented as defective cumulative distribution plots, which plot response probability against response time (RT) quantiles for correct and error responses separately. Black crosses, correct RTs; gray crosses, error RTs; circles, model predictions

Discussion and conclusion

In two model analyses and two experiments, we studied how similarity and the number of alternatives interact in the random-dot motion task. Using LBA model fits, we found that evidence accumulation for the correct choice decreases as the angular distance between alternatives decreases, reflecting the increased difficulty associated with more similar alternatives. In addition, we found that more evidence was required for each alternative before a response was made, reflecting the decreased prior probability of each alternative when the number of alternatives increases. These results seem to be in agreement with previous findings (Churchland et al., 2008; van Maanen, Grasman, Forstmann, & Wagenmakers, 2012). For example, Churchland et al. reported data for a two-alternative RDM task with two angular distances. Responses in the 180-deg condition were faster than those in the 90-deg condition, as is predicted by our optimal-observer model.

Our results may be interpreted as arguing that the RDM paradigm is not suitable for studying multiple-choice decision-making. We believe, however, that this conclusion is too strong. Rather, the present findings should be taken into account when interpreting experimental data in which the number of alternatives is manipulated, because the number of alternatives in RDM is confounded with the angular distance between the alternatives. One approach to dealing with this confound is to analyze the data using a specific process model of decision-making, such as the LBA model. Our LBA modeling exercise showed that the effects of similarity and the number of alternatives are captured by different parameters of the model. Alternative models that may be used to analyze such data include the leaky competing accumulator model (LCA; Usher & McClelland, 2001; Usher, Olami, & McClelland, 2002) and racing diffusion models (Leite & Ratcliff, 2010). These models make assumptions about the decision-making process similar to those of the LBA model, and are also suited to modeling multiple-choice data.

In fact, one prediction of the LCA model regarding the effect of multiple choices is that the threshold parameter increases with the number of alternatives. Usher et al. (2002) hypothesized that in order to maintain a fixed proportion of correct responses, the LCA threshold parameter would need to increase in proportion to the logarithm of the number of alternatives. The present experiments bear out this prediction, although in both experiments participants were not able to maintain a fixed (high) response accuracy, as Usher et al. had anticipated.

Conclusion

Because the RDM task is a paradigm often used in the study of multiple-choice decision-making, we studied the interaction between similarity and the number of alternatives in RDM. This interaction is a necessary consequence of the experimental paradigm, because adding alternatives necessarily changes the angular distance between the alternatives, and hence their similarity. Using a model that describes optimal behavior in this task, we found that changes in similarity are represented by changes in the rate of evidence accumulation; in the optimal-observer model, this was reflected by changes in the likelihood of each alternative. The effect of the number of alternatives was located in the start-point values of the accumulation process, reflected by the prior probability of each alternative (Churchland et al., 2008). These findings were verified in two experiments and confirmed by a process-model analysis using the LBA model: the normative changes derived from the optimal-observer model were mirrored by the LBA model fitted to the data. In conclusion, our research shows that although the effects of number and similarity are confounded in the RDM task, they can still be studied in isolation with the aid of process models such as the LBA.