A habituation account of change detection in same/different judgments

Davelaar, Eddy J.; Tian, Xing; Weidemann, Christoph T.; Huber, David E.

doi:10.3758/s13415-011-0056-8

Published: 09 September 2011

Volume 11, pages 608–626 (2011)

Cognitive, Affective, & Behavioral Neuroscience

Abstract

We investigated the basis of change detection in a short-term priming task. In two experiments, participants were asked to indicate whether or not a target word was the same as a previously presented cue. Data from an experiment measuring magnetoencephalography failed to find different patterns for “same” and “different” responses, consistent with the claim that both arise from a common neural source, with response magnitude defining the difference between immediate novelty versus familiarity. In a behavioral experiment, we tested and confirmed the predictions of a habituation account of these judgments by comparing conditions in which the target, the cue, or neither was primed by its presentation in the previous trial. As predicted, cue-primed trials had faster response times, and target-primed trials had slower response times relative to the neither-primed baseline. These results were obtained irrespective of response repetition and stimulus–response contingencies. The behavioral and brain activity data support the view that detection of change drives performance in these tasks and that the underlying mechanism is neuronal habituation.

A question of fundamental interest in the study of perception, memory, and other cognitive processes is whether the processes of the brain are generally tuned toward the detection of familiar items and events, or whether the brain detects that which is new in the environment. The study of episodic memory has focused on the process of familiarity, demonstrating that behavioral responses are sensitive to relative degrees of memory strength (e.g., Hintzman & Curran, 1995). Furthermore, the timing and cortical location of these familiarity responses have been evidenced with neural recordings (Curran, 2000; Henson, Cansino, Herron, Robb, & Rugg, 2003). For long-term memory, it is sensible to focus on the detection of old rather than new items. More specifically, there is evidence that familiarity captures long-term statistical regularities that are useful for a wide variety of tasks and judgments (e.g., Hertwig, Herzog, Schooler, & Reimer, 2008). Whereas it is nearly uniformly accepted that long-term memory is based on familiarity detection rather than novelty detection, this distinction is less well studied for shorter time scales, such as in perceptual tasks or in short-term working memory tasks. To investigate the roles of familiarity detection and novelty detection (i.e., change detection) in perceptual tasks, we examined situations that involved the detection of change with respect to stimuli that were presented close to each other in time and space.

Closely related paradigms have demonstrated that even with full attention and awareness, participants may be unable to detect change. For example, in studies of “change blindness,” participants are shown alternations of photos or abstract objects (e.g., Rensink, O’Regan, & Clark, 1997; Simons, 1996), and they experience difficulty identifying the change across alternations. In other work, changes across scenes are frequently missed, irrespective of whether these changes are abrupt or gradual (Rensink et al., 1997; Simons, Franconeri, & Reimer, 2000). Although these change detection paradigms suffice for investigating the limits of the cognitive system, they are not ideal for studying the mechanisms by which change detection occurs. This is because these detection paradigms involve visual displays with a high density of objects. Research has shown that an increase in perceptual load narrows the attentional focus (e.g., Williams, 1989), which draws attention away from the feature dimension along which a change might occur, thus producing inattentional blindness (Most, Scholl, Clifford, & Simons, 2005; Simons, 2000). Therefore, in tasks with a high density of objects, change blindness is attributable to load-induced narrowing of spatial attention, because attention is more likely to be allocated to an area where no change occurs. This sensitivity to the focus of attention renders these paradigms unsuitable for investigating the mechanisms underlying change detection.

To isolate the mechanisms of change detection, we used a paradigm with low attentional load. In this paradigm, a first stimulus, S1, is followed by a second stimulus, S2, to which a response is required indicating how it compares to S1. Such tasks featured in a debate on the architecture underlying same/different judgments (see Farell, 1985, for a review). This debate contrasted single- versus dual-process models (a dual-process model involves the separate processes of sameness detection and change detection), with the evidence favoring single-process models. Single-process models assume that a match value is obtained by contrasting S2 with S1, and that this value, which is along a match/nonmatch dimension, is compared to a criterion in order to determine the appropriate response (Ratcliff, 1985; Van Zandt, Colonius, & Proctor, 2000). However, these models do not specify whether the criterion is placed on the magnitude of a match signal (i.e., making a decision based on familiarity) or on the magnitude of a mismatch signal (i.e., making a decision based on novelty). In this article, we advocate a single-process model of change detection, and we test the claim that the nature of this process is to calculate the degree of mismatch (novelty).

Building on a theory of short-term priming (Huber & O’Reilly, 2003; see also Huber, 2008b), we favor the view that the perceptual system primarily employs mechanisms for detecting differences between stimuli rather than similarities. This theory explained a series of experiments (Huber, Shiffrin, Lyle & Quach, 2002a; Huber, Shiffrin, Lyle, & Ruys, 2001; Huber, Shiffrin, Quach, & Lyle, 2002) that used a two-alternative forced choice (2AFC) paradigm in which a briefly flashed (and masked) target stimulus needed to be identified. On each trial, prime stimuli presented prior to the target matched either the target or the incorrect choice alternative in the 2AFC (i.e., the foil). When comparing target-primed trials to foil-primed trials, the results were equal and opposite, suggesting that priming primarily induced a preference effect. Furthermore, with short prime durations (50 ms), there was a preference to choose the primed stimulus, but with longer prime durations (2,000 ms), there was a preference against choosing the primed stimulus (Huber, 2008b). In other words, if the target choice was primed for a long duration, participants tended to choose the foil instead of the target, reducing accuracy. Thus, following a sufficiently long exposure to a preceding stimulus, performance was better when the target represented a change.

As reviewed in Huber (2008b), this pattern of results is readily explained by proposing that the perceptual system habituates to any stimulus attended (e.g., a prime) and that the amount of habituation becomes greater with longer prime durations (Huber, 2008b; Huber & O’Reilly, 2003). According to this theory, when a prime is very salient, the primed choice word (target or foil) is still in a habituated state when the choice words appear, slowing down perceptual processing for that choice. Assuming that the decision is guided by a sense of perceptual fluency, the participant then chooses the alternative that is less habituated, or conversely, more novel. This theory was first instantiated as a formal model called “Responding Optimally with Unknown Sources of Evidence” (ROUSE; Huber et al., 2001). We focus on the neurocomputational version of ROUSE (nROUSE; Huber & O’Reilly, 2003), which is a neural model for novelty detection. This neural model produces habituation through “synaptic depression,” as we explain next.

In most neurocomputational models of cognitive processes, the activation of internal representations increases with increasing stimulus duration (but see Gotts & Plaut, 2002, and Huber & O’Reilly, 2003). However, these models are at odds with the common observation that neuronal firing rates change nonmonotonically with changes in the duration of relevant stimuli (Tsodyks & Markram, 1997). When excitatory pyramidal cells receive constant input, their firing rates typically increase sharply and reach a peak within 150 ms, after which they decrease toward a lower level that is above the baseline firing rate. This neural fatigue has been termed “synaptic depression,” because a variety of cellular mechanisms serve to produce a temporary reduction in the efficacy of the synapses, with a recovery period needed before synaptic resources are replenished. One such mechanism is the amount of neurotransmitter available within a spiking neuron. If a neuron has been inactive, it will have sufficient neurotransmitter available to effectively transmit its activation to other neurons. However, if that neuron has been active recently, its neurotransmitter supplies may be temporarily depleted, in which case the firing of the neuron is ineffective in terms of transmitting activation to other neurons. To illustrate the patterns of synaptic depression, consider Fig. 1, in which the output activation of a simulated neuron is shown as a function of stimulus duration (see Huber & O’Reilly, 2003, for details). As shown in the top panel, without synaptic depression, output activation increases monotonically with stimulus duration, and this pattern of response is identical when the cell is driven by a second input after a short delay. However, as shown in the bottom panel, which includes synaptic depression, the output activation is nonmonotonically related to stimulus duration. 
Furthermore, repeating the stimulus after a short delay leads to a decreased neuronal response as compared to the first presentation (e.g., compare the heights of the two peaks).
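The qualitative pattern described for Fig. 1 can be reproduced with a very small simulation. The sketch below is not the nROUSE model: it is a deliberately simplified rate unit whose output is the product of an activation term and a pool of synaptic resources, and every parameter value (the 20-ms activation time constant, the recovery and depletion rates, the 500-ms stimulus and 100-ms delay) is an assumption chosen only for illustration.

```python
def simulate(stimulus, depression=True, tau_v=20.0, recovery=0.002, depletion=0.01):
    """Return output activation per 1-ms step for a 0/1 stimulus trace.

    v is the unit's activation and a is the fraction of available synaptic
    resources; output is v * a. All parameter values are illustrative
    assumptions, not values taken from Huber and O'Reilly (2003).
    """
    v, a = 0.0, 1.0
    output = []
    for inp in stimulus:
        v += (inp - v) / tau_v  # leaky integration toward the input
        if depression:
            # resources recover slowly and are depleted in proportion to activity
            a += recovery * (1.0 - a) - depletion * a * v
        output.append(v * a)
    return output


# Hypothetical protocol: 500 ms on, 100 ms off, 500 ms on again.
stimulus = [1] * 500 + [0] * 100 + [1] * 500
plain = simulate(stimulus, depression=False)
depressed = simulate(stimulus, depression=True)

first_peak = max(depressed[:500])
second_peak = max(depressed[600:])
print("peak latency with depression (ms):", depressed.index(first_peak))
print("first vs. second peak:", round(first_peak, 3), round(second_peak, 3))
```

Under these assumptions, the unit without depression rises monotonically and responds identically to the second presentation, whereas the unit with depression peaks early, settles at a lower level that is still above zero, and shows a smaller second peak.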

Using a model of habituation through synaptic depression, Huber and O’Reilly (2003) showed that for the short-term priming paradigm, the time needed to achieve peak activation for each choice (i.e., perceptual fluency) captured both positive priming following short prime durations and negative priming following long prime durations. Similar to the interactive-activation model of McClelland and Rumelhart (1981), this was done with a three-layer (visual, orthographic, and lexical–semantic) network, although the representations at each layer were localist, and synaptic depression was used for all simulated neurons. Thus, habituation was imposed at all levels of processing. For longer prime durations, the peak of the primed response was lower and later. Beyond modeling behavioral results, the same model was used to make detailed predictions regarding the time course of neural responses, which were confirmed in a combined electroencephalogram (EEG)/magnetoencephalogram (MEG) study (Huber, Tian, Curran, O’Reilly, & Woroch, 2008). Using parameters previously constrained by behavioral results, this model produced an accurate account of the ERP waveforms, which provided additional support to the claim that representations beyond the low-level visual stage are subject to habituation. Similar to this approach, in the present study we used MEG to address the earliest signal that reflects novelty detection.

If neural habituation is the mechanism behind behavioral negative priming with immediate repetitions, this may also explain the ubiquitous finding that neural responses experience a “repetition suppression,” which is used as an important marker in neuroimaging studies (Grill-Spector & Malach, 2001). Indeed, the mechanism of synaptic depression has been demonstrated to capture repetition-induced suppression in neural responses (Miller & Desimone, 1994). The neurophysiological mechanisms underlying repetition-induced suppression are under intense investigation (Fuhrmann, Cowan, Segev, Tsodyks, & Stricker, 2004), as is the precise mapping of habituation at the neural level, as compared to the repetition suppression seen in neuroimaging studies (Chouinard, Morrissey, Kohler, & Goodale, 2008; Guo, Lawson, & Jiang, 2007; Henson, Mouchlianitis, Matthews, & Kouider, 2008; Horner & Henson, 2008; James & Gauthier, 2006; Sawamura, Orban, & Vogels, 2006; for a review, see Grill-Spector, Henson, & Martin, 2006). The relation between neural repetition suppression and behavioral priming is still poorly understood, and dissociations have been observed between these phenomena (Dobbins, Schnyer, Verfaellie, & Schacter, 2004; Ganel et al., 2006; Sayres & Grill-Spector, 2006).

It is important to note that a close relation exists between neural habituation and information processing models, in which a prediction is made about what to expect on the basis of the recent past. For instance, Chance, Nelson, and Abbott (1998) implemented synaptic depression in a model of V1 cells, demonstrating increased sensitivity to transient changes. In other words, by resetting baseline activation by adapting to the current stimulus, perception is sensitive to change relative to the current stimulus. Using a hidden Markov model (HMM), Huber (2008a) formalized this relation between perceptual predictions and synaptic depression: A cascaded HMM was found to produce the same dynamics as synaptic depression. The HMM model produced less of a response to the repetition of an input because recent events provided a strong prediction for the perceptual evidence of the repeated item. Thus, repeated items were discounted such that any new perceptual evidence that might suggest a second occurrence of an item was instead attributed to this prediction based on the recent past. From this perspective, the reason for repetition suppression is to provide perceptual prediction, whereas synaptic depression is the mechanism by which repetition suppression is neurally implemented.

In applying this priming model to the present study, we do not claim that any particular level of processing is more important in terms of neural habituation. Because the present study contrasted repetitions (which are the same at all levels) with unrelated words (which are different at all levels), we cannot specify whether the observed effects were due to orthographic/phonological or lexical–semantic habituation (or some other higher-level representation). Furthermore, we do not claim that synaptic depression is the precise mechanism underlying these effects, considering that other neural mechanisms can produce similar habituation dynamics. Instead, our claim is that the dynamics of habituation, as they exist at some or all levels of processing, enable change detection between what just occurred and what appears subsequently.

By testing a habituation account both with behavioral priming and with MEG, we approached this issue with a single comprehensive model that is based on neural dynamics. There are two components to our account: (1) that same/different judgments for an immediate comparison between cue and target words are based on a novelty signal rather than a familiarity signal or different signals for same and different trials, and (2) that habituation is the mechanism that underlies this novelty signal. We tested the first component in Experiment 1 by comparing the topographic similarity and response magnitudes of neural responses for same versus different trials. This was done with words that never repeated across trials, so as to eliminate any priming/habituation effects from previous trials. We tested the second component in Experiment 2 by examining repetition priming between one trial and the next; if habituation underlies novelty detection and if habituation can linger from recent presentations of a word, this account predicts that last-trial priming of a cue word (i.e., even greater habituation to the cue) should facilitate different trials, whereas last-trial priming of a target word should produce the opposite effect (i.e., target habituation, as if the target was just seen as the cue, rather than during the last trial). Critically, this was done with a small number of words that were reused many times across trials, so as to equate any effects of long-term repetition priming. In this manner, we isolated the short-term habituation effect from the last trial. Furthermore, we contrasted this account with an account of repetition-induced suppression that is based on associative learning between a repeated stimulus and its behavioral response (e.g., Logan, 1990). 
In Experiment 2, we differentiated between these accounts using a paradigm in which the same repeated target word from one trial to the next might involve different behavioral responses on different trials (e.g., a response of “same” on trial n–1, but a response of “different” on trial n).

Experiment 1

We propose that change detection when a cue word is immediately followed by a target word is based on habituation and that the detection of sameness between cue and target is the absence of change detection. In other words, the brain does not employ a separate mechanism for detection of immediate repetitions for visually presented words. On this account, habituation to the cue word serves to produce a larger response when the target differs from the cue than when it repeats the cue (because the neural representation for a repeated target is already habituated). We do not claim that sameness is the absence of any signal (cf. Johnson, Spencer, Luck, & Schöner, 2009). Instead, we suggest that a sameness signal is created by a competitive comparison between a single source of novelty and a referent (or bias). Such a view is compatible with the literature showing that some populations of cells respond more strongly for repeated stimuli, while others respond less strongly (Miller & Desimone, 1994).

In this same/different judgment paradigm, the obvious alternative account of performance is that the mechanism that triggers change detection may be the absence of familiarity rather than the presence of novelty. For this alternative to explain performance, it needs to be assumed that presentation of an item temporarily boosts the familiarity response for that item. This boost serves as the signal behind accurate performance, because a target that is the same as the cue is potentiated by its presentation as the cue, which ensures that the familiarity response to that target is above the familiarity detection criterion and thus no change is detected. These accounts are not mutually exclusive, and it is possible that both potentiation of familiarity and depression of novelty play a role in change detection.

To differentiate between these accounts, Experiment 1 used MEG to determine whether the neural response pattern differs between same and different trials. MEG is a noninvasive measure of neural activity that has the same millisecond temporal resolution as EEG, but better spatial resolution. This increased spatial resolution makes MEG ideally suited to measure the pattern of neural activity in response to the target word. Additionally, we investigated whether the magnitude of the neural signal differs between same and different targets. A relative increase for same trials would suggest that repetitions produce potentiation, whereas a decrease would support the habituation account. A number of MEG studies have shown that habituation occurs in the time window of 100–350 ms after onset, with nonlexical and lexical habituation effects appearing in the early versus late parts of this window, respectively (see Monahan, Fiorentino, & Poeppel, 2008, for a review). Although these studies evidenced neural habituation with word stimuli, their focus was mainly on repetition priming effects instead of on determining mechanisms of change detection. To investigate this mechanism in Experiment 1, we used a multivariate sensor analysis to determine whether different brain networks are used for change detection versus detection of sameness. For each individual, we compared the patterns across all 157 sensors for same and different trials to ascertain whether patterns differed for the two types of trials, which would suggest the involvement of different processes. However, if the pattern remained the same but varied in response magnitude, this would suggest a single underlying process.

Because we were interested in the mechanism underlying change detection rather than the task-related decision process, we focused our analyses on the earliest MEG component that shows a difference for a target word that differs from a word presented immediately prior to the target. Previous work with repetition priming in a perceptual identification task revealed that the M170 is affected by immediate repetitions of visually presented words. Furthermore, this component is modulated by prime duration (Huber et al., 2008), as would be expected if this component were affected by habituation. Therefore, we focused on the M170 as the critical signal in the present experiment. The presentation sequence used in the present experiment follows closely from the MEG experiment of Huber et al. (2008). However, the task in that experiment was identification of a briefly flashed and then masked target word presented immediately after a prime word. In contrast, we now call the first word a “cue” because the task in the present experiment was a speeded same/different judgment of a target word that remained visible until a response was given. Similar to the prime durations used in the Huber et al. experiment, the present experiment used cue durations of 150 versus 2,000 ms to induce different levels of habituation.

Method

Participants

A group of 17 paid participants were recruited from the University of Maryland for Experiment 1. All participants were right-handed native English speakers with no history of neurological disorder.

Materials

We constructed a pool of 1,500 five-letter words. These words had a minimum written language frequency of 2 per million (Kučera & Francis, 1967). All words were displayed in yellow uppercase Times New Roman font against a black background and subtended less than 3° of visual angle. Stimulus materials were projected on a screen inside the MEG chamber with presentation times synchronized to the vertical refresh. The refresh rate was 75 Hz, providing display increments of 13.3 ms.

Design

Experiment 1 conformed to a 2 × 2 within-subjects design, crossing the variables cue type (same as vs. different from target) and cue duration (short vs. long: 150 vs. 2,000 ms).

Behavioral procedure

Experimental sessions lasted about half an hour and included 48 blocks of eight experimental trials. Participants were instructed to refrain from blinking or moving during each block. Between blocks, participants received feedback about the total number of correct answers in the previous block. Each block contained two repetitions of each of the four unique combinations of experimental conditions (short–same, short–different, long–same, long–different), randomly intermixed. Words were selected randomly without replacement, such that each word was seen on only one trial. This was necessary to eliminate any priming effects from previous trials. Participants were fully informed as to the nature of the conditions and were told that there were equal numbers of same and different trials. Two blocks of eight practice trials were presented prior to the 48 blocks of experimental trials.
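The block structure described above can be sketched in a few lines. This is only an illustration of the stated design (48 blocks, two repetitions of each of the four conditions per block, words drawn without replacement); the function name, the placeholder word pool, and the fixed random seed are assumptions, not details of the original experiment software.

```python
import random


def build_session(pool, n_blocks=48, reps_per_condition=2, seed=0):
    """Return a list of blocks; each block is a list of trial dictionaries."""
    rng = random.Random(seed)
    words = pool[:]
    rng.shuffle(words)  # popping from a shuffled copy samples without replacement
    conditions = [(d, t) for d in ("short", "long") for t in ("same", "different")]
    blocks = []
    for _ in range(n_blocks):
        order = conditions * reps_per_condition
        rng.shuffle(order)  # conditions are randomly intermixed within a block
        trials = []
        for duration, cue_type in order:
            target = words.pop()
            cue = target if cue_type == "same" else words.pop()
            trials.append({
                "cue_ms": 150 if duration == "short" else 2000,
                "type": cue_type,
                "cue": cue,
                "target": target,
            })
        blocks.append(trials)
    return blocks


# Placeholder pool standing in for the 1,500 five-letter words.
pool = [f"WORD{i:04d}" for i in range(1500)]
session = build_session(pool)
print(len(session), "blocks of", len(session[0]), "trials")
```

With this scheme, each of the four conditions contributes 96 trials, and the session consumes 576 words (one per same trial, two per different trial), well within the 1,500-word pool.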

At the beginning of each trial, a fixation line consisting of five short, dashed lines appeared for 1,000 ms in the center of the screen, followed by two instances of a cue word, one presented immediately above and one immediately below the fixation line. After 2,000 ms (long) or 150 ms (short), the cue words were followed by the target, presented in the center of the screen. Participants were asked to judge whether the target word was the same as or different from the cue word by pressing the corresponding button. They were encouraged to respond as quickly and accurately as possible. The target word remained on the screen until participants responded.

MEG procedure

Magnetic signals were recorded using a 160-channel (157 data channels and 3 reference channels), whole-head axial gradiometer system (KIT, Kanazawa, Japan). In order to check head position within the MEG scanner, five electromagnetic coils were attached to the head of a participant before the MEG recording. The locations of the coils were determined with respect to anatomical landmarks (nasion, left and right preauricular points) on the scalp using 3-D digitizer software (Source Signal Imaging, Inc.) and digitizing hardware (Polhemus, Inc.). The coils were localized with respect to the MEG sensors, both at the beginning and at the end of the experiment.

During the experiment, participants were supine inside a magnetically shielded room. Before the visual word experiment, they listened to 200 combined repetitions of 250-Hz and 1-kHz versions of a 50-ms sinusoidal tone (ISI randomized between 750 and 1,550 ms), with 100 repetitions for each frequency. Auditory-evoked responses to the onset of the pure tones were examined, and the auditory M100 was identified. The auditory M100 is a prominent and robust (across listeners and stimuli) deflection at 100 ms after onset and has been the most investigated auditory MEG response (for a review, see Roberts, Ferrari, Stufflebeam, & Poeppel, 2000). A dipole-like pattern (i.e., a source–sink pair) in the magnetic topographic map distributed over the temporal region of each hemisphere was identified for each participant. These responses were used to verify that the participant was positioned properly in the MEG scanner.

The MEG data were acquired with a sampling rate of 500 Hz, filtered online between 1 Hz and 200 Hz, with a notch at 60 Hz. Raw data were noise-reduced using CALM (Adachi, Shimogawara, Higuchi, Haruta, & Ochiai, 2001). A 1,000-ms time period that was time-locked to the long-duration cue word was extracted and averaged over the 192 long-cue-duration trials. This average served as the standard M170 response to a singly presented word (see the Analysis of MEG Data section below). There was no baseline correction (i.e., subtraction of a prestimulus interval) of the data, because statistical comparisons were only between same and different trials rather than between short- versus long-duration cues. In other words, the prestimulus interval was functionally identical for the conditions that were compared, so baseline correction would only serve to reduce statistical power by introducing an additional source of variance. A second 1,000-ms time period that was time-locked to the target onset was extracted and averaged over the 96 trials for each of the four conditions. Trials with amplitudes > 3 pT (~5%) were considered artifacts and were discarded. The averages were low-pass filtered with a cutoff frequency of 20 Hz.
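The epoch-averaging step with amplitude-based artifact rejection can be illustrated as follows. This is a minimal sketch rather than the pipeline actually used: it assumes that a trial is rejected when the absolute amplitude at any sensor and any sample exceeds the threshold, it omits the noise reduction and the 20-Hz low-pass filter, and the function name and toy data are hypothetical.

```python
def average_epochs(epochs, threshold_pt=3.0):
    """Average epochs after discarding trials that exceed an amplitude threshold.

    epochs[trial][sensor][sample] holds field strength in picotesla.
    Assumption: a trial is an artifact if any |value| exceeds threshold_pt.
    Returns (average[sensor][sample], number_of_rejected_trials).
    """
    kept = [
        ep for ep in epochs
        if max(abs(x) for sensor in ep for x in sensor) <= threshold_pt
    ]
    if not kept:
        raise ValueError("all trials were rejected")
    n_sensors, n_samples = len(kept[0]), len(kept[0][0])
    average = [
        [sum(ep[s][t] for ep in kept) / len(kept) for t in range(n_samples)]
        for s in range(n_sensors)
    ]
    return average, len(epochs) - len(kept)


# Toy data: 3 trials, 2 sensors, 3 samples (a real 1,000-ms epoch sampled
# at 500 Hz would have 500 samples for each of the 157 data sensors).
toy_epochs = [
    [[1.0, 2.0, 0.0], [0.0, -1.0, 0.5]],
    [[3.0, 0.0, 1.0], [1.0, -1.0, 0.5]],
    [[0.0, 5.0, 0.0], [0.0, 0.0, 0.0]],  # contains a 5-pT sample, so it is rejected
]
avg, rejected = average_epochs(toy_epochs)
print(rejected, avg)
```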

Analysis of MEG data

Unlike EEG, analysis of MEG is typically done separately for each individual, because MEG provides a much more spatially localized response, and different individuals position their heads differently in relation to the sensors with MEG. Therefore, the same sensor may produce very different, or even opposite, responses (i.e., opposite sides of a local electromagnetic dipole) for different individuals. To address this issue, Tian and Huber (2008) developed multivariate measurement techniques that use the pattern of responses across all sensors by normalizing these patterns in relation to a “standard response,” which is the pattern observed in some other condition for that individual. We briefly describe these techniques here and point the interested reader to the discussion by Tian, Poeppel, and Huber (2011) that accompanies the release of the TopoToolbox suite of MATLAB routines that implement these techniques. These files, as well as a tutorial, can be downloaded from Dr. Tian’s website (https://files.nyu.edu/xt235/public/). In these techniques, rather than examining separate sink (magnetic influx)/source (magnetic outflux) patterns for the M170 of each individual, all 157 sensors are used to ascertain whether a difference in the M170 between conditions is due to a different pattern across the sensors or whether the M170 in different conditions corresponds to the same pattern, but with different response magnitudes of that pattern. These measurements are achieved through a vector dot product over sensors that is either normalized (similarity between patterns) or not normalized (response magnitude). This is the same mathematical distinction as the difference between a correlation (which is normalized for variance) versus a regression slope. However, in the present case, these numbers can be geometrically interpreted in terms of the 157-dimensional sensor space. Thus, these measures are interpreted as similarity or magnitude rather than correlation or regression. 
The similarity measure produces the cosine of the angle between patterns in sensor space (“angle test of response similarity”), whereas the response magnitude measure produces the projection of one pattern onto the standard response (“projection test of response magnitude”). The magnitude measure was obtained by determining the peak latency using the root-mean square and taking the average magnitude of the 40-ms time window around the peak latency.
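The two measures can be written down directly from this description. The sketch below treats each topographic pattern as a plain list of sensor values and is not the TopoToolbox implementation; in particular, the exact normalization used by the toolbox for the projection test may differ from the scalar projection assumed here, and the function names and four-sensor toy patterns are hypothetical.

```python
import math


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def norm(a):
    return math.sqrt(dot(a, a))


def angle_test(pattern, standard):
    """Cosine of the angle between two sensor patterns (similarity of shape)."""
    return dot(pattern, standard) / (norm(pattern) * norm(standard))


def projection_test(pattern, standard):
    """Scalar projection of a pattern onto the standard (response magnitude)."""
    return dot(pattern, standard) / norm(standard)


# Toy patterns over four sensors (a real pattern would have 157 values).
standard = [1.0, -2.0, 0.5, 0.0]
weaker = [0.5, -1.0, 0.25, 0.0]   # same shape as the standard, half the size
other = [2.0, 1.0, 0.0, 0.0]      # orthogonal to the standard

print(angle_test(weaker, standard), projection_test(weaker, standard))
print(angle_test(other, standard))
```

A pattern that is a scaled copy of the standard yields a cosine of 1 but a smaller projection, which is the signature of a magnitude change without a change in topography; a pattern with a different shape yields a lower cosine regardless of its overall size.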

It is important to note that the angle test seeks the same measurement goal as the vector normalization technique of McCarthy and Wood (1985), although the two techniques are mathematically different. The angle test first takes the dot product of the two sensor vectors and then normalizes against vector length, whereas the technique of McCarthy and Wood first normalizes against vector length and then uses ANOVA to compare the two normalized vectors. This distinction is important, and unlike McCarthy and Wood’s technique, which can produce an apparent topographic difference as an artifact of using vectors of sensor values that are not zero-centered (Urbach & Kutas, 2002), the angle test is similar to a correlation analysis and does not suffer from this problem.

Results

Behavioral results

Collection of behavioral responses from individuals in the MEG chamber was constrained by the recording setup. More specifically, although participants were told to respond quickly and accurately, they were also strongly urged not to move (particularly their head and shoulders) and not to blink, as this would cause artifacts. They responded by pressing a mouse button, which was out of sight down by their side (they were lying supine). As a result, response times were slow and variable. However, accuracy was nearly perfect, indicating that they were indeed performing the requested task. Given these limitations, the behavioral responses are of limited value, but we report them for completeness. Accuracy and mean response times are shown in Table 1. Participants responded more accurately with long cue durations than with short cue durations [F(1, 16) = 6.31, MSE = 0.001, p < .05]. We found no significant effects for the response times. Given the relatively long response times (> 600 ms), it was unlikely that the MEG response of interest, the M170, was contaminated by motor processes.

Table 1 Results of Experiment 1, showing means of behavioral (accuracy, correct response time) measures (with standard deviations)

Angle test of response similarity

We first compared the target M170 responses in the four conditions (same/different targets following short/long cue durations) using the angle test to determine whether reliable differences existed between these conditions in the topographic pattern across the 157 sensors. Split-half reliability was determined by comparing trials from the first half of the experiment with trials from the second half (first/second). This comparison was done separately for first- versus second-half topographic patterns from the same condition as well as first- versus second-half patterns from different conditions. Thus, the within-condition comparisons provided a measure of the null hypothesis variance between the first versus second halves of the experiment, against which differences between conditions were compared for hypothesis testing.

These between/within condition comparisons were calculated for each participant. The two within-condition angles (same-first vs. same-second as well as different-first vs. different-second) were averaged for each participant to yield an overall within-condition value (for short duration, M = 0.527, SD = 0.254; for long duration, M = 0.476, SD = 0.199). The two between-condition angles (same-first vs. different-second as well as different-first vs. same-second) were also averaged for each participant, to yield an overall between-condition value (for short duration, M = 0.535, SD = 0.222; for long duration, M = 0.477, SD = 0.201). In a dependent-samples test across the 17 participants, there was no significant difference in the similarity (angle test) of the between-condition comparisons versus the within-condition comparisons at either cue duration [short duration, t(16) = 0.193, p = .849; long duration, t(16) = 0.017, p = .987]. Thus, the topographic patterns across the target responses were not found to be different. This suggests that the same distribution of cortical responses was involved in the same and different conditions. Therefore, any magnitude differences were taken to indicate change in the magnitude of the underlying cortical responses rather than differential recruitment of cortical areas in some conditions as compared to others.
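The between/within logic can be sketched as follows, assuming that the four split-half cosine values have already been computed for each participant. The dictionary keys, the helper names, and the three toy participants are hypothetical; the study's own values are those reported above.

```python
import math
import statistics

def condition_scores(angles):
    """Average the two within- and the two between-condition cosines."""
    within = (angles["same1_same2"] + angles["diff1_diff2"]) / 2
    between = (angles["same1_diff2"] + angles["diff1_same2"]) / 2
    return within, between

def paired_t(xs, ys):
    """Dependent-samples t statistic and degrees of freedom for xs - ys."""
    diffs = [x - y for x, y in zip(xs, ys)]
    n = len(diffs)
    # Assumes at least two participants and nonzero variance of differences.
    se = statistics.stdev(diffs) / math.sqrt(n)
    return statistics.mean(diffs) / se, n - 1

# Three hypothetical participants (illustrative values only).
participants = [
    {"same1_same2": 0.5, "diff1_diff2": 0.5, "same1_diff2": 0.6, "diff1_same2": 0.6},
    {"same1_same2": 0.7, "diff1_diff2": 0.5, "same1_diff2": 0.4, "diff1_same2": 0.6},
    {"same1_same2": 0.3, "diff1_diff2": 0.5, "same1_diff2": 0.5, "diff1_same2": 0.5},
]
scores = [condition_scores(p) for p in participants]
within_values = [w for w, _ in scores]
between_values = [b for _, b in scores]
t_value, df = paired_t(between_values, within_values)
```

If the topographies truly differed between same and different trials, the between-condition cosines would be reliably lower than the within-condition cosines; a t value near zero indicates that they are indistinguishable given first-half versus second-half variability.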

Projection test of response magnitude

Because there did not appear to be differences in the M170 response patterns between same versus different targets, we next tested whether there was a difference in the M170 response magnitude. This was achieved through the geometric projection of the target word response onto a “standard response” pattern that is specific to each individual. The standard response should be a situation that involves the same kind of presentation and attentional state as the experimental condition. Therefore, for the present experiment, we used the M170 response to the cue in the long-duration conditions. Because half of all trials used a long-duration cue, this provided a fairly reliable M170 response pattern for each individual that was evoked by the onset of a single word in this same/different paradigm. The M170 response to the target for each of the four experimental conditions was then projected onto this standard response to yield four M170 response magnitude values for each individual.
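A minimal sketch of the projection step is given below. It assumes a scalar projection (the dot product divided by the length of the standard response); the exact normalization used in the study is not stated here, so this choice, the function name `projection_magnitude`, and the two-sensor toy vectors are assumptions made for illustration.

```python
import math

def projection_magnitude(response, standard):
    """Scalar projection of a response pattern onto a standard pattern."""
    if len(response) != len(standard):
        raise ValueError("patterns must have the same number of sensors")
    norm = math.sqrt(sum(s * s for s in standard))
    if norm == 0.0:
        raise ValueError("standard response must not be all zeros")
    dot = sum(r * s for r, s in zip(response, standard))
    return dot / norm

# Hypothetical 2-sensor example (illustrative values only).
standard = [3.0, 4.0]        # e.g., a participant's cue-evoked pattern
doubled = [6.0, 8.0]         # same topography at twice the strength
mixed = [7.0, 1.0]           # standard plus an orthogonal component [4, -3]

doubled_magnitude = projection_magnitude(doubled, standard)
mixed_magnitude = projection_magnitude(mixed, standard)
```

The second case shows why the projection is useful when signals overlap: activity that is orthogonal to the standard pattern contributes nothing to the estimate, so only the portion of the response that resembles the standard is counted.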

Figure 2 highlights the need for a measure of response magnitude that disentangles the neural response to the target word from the neural response to the cue word. This figure shows global field power at each millisecond from the onset of the target word. Global field power is the square root of the average squared value across the 157 sensors (root-mean square). For comparison, this figure also shows the response to the cue word in the long conditions, which gives an indication of the response profile to a word presented in isolation (i.e., without a prior cue word). As can be seen in the figure, the grand average waveforms for the two short-duration conditions overlap with the MEG response to the short-duration cue, whereas the response to the long cue is similar to the response to a target following a long cue (i.e., a relative lack of overlap for long conditions). The projection test estimates the magnitude of the M170 signal that is present within this overlapping signal, which should help isolate the component of the short conditions that is due to the presentation of the target. Furthermore, because the level of overlap is equal between same and different trials that test the same cue duration, same and different trial response magnitudes can be meaningfully compared.
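Global field power and the 40-ms window around its peak can be sketched as follows. The sketch assumes one sample per millisecond, a window of 20 samples on either side of the peak, and a peak search over the whole epoch rather than a restricted M170 range; these choices, the function names, and the three-sensor toy epoch are assumptions made for illustration.

```python
import math

def global_field_power(sensor_values):
    """Root-mean square across sensors at one time point."""
    return math.sqrt(sum(v * v for v in sensor_values) / len(sensor_values))

def peak_window(epoch, half_width=20):
    """Return (gfp, peak, start, stop) for a list of per-sample sensor lists."""
    gfp = [global_field_power(sample) for sample in epoch]
    peak = max(range(len(gfp)), key=gfp.__getitem__)   # index of largest GFP
    start = max(0, peak - half_width)                   # clip to epoch start
    stop = min(len(gfp), peak + half_width)             # clip to epoch end
    return gfp, peak, start, stop

# Toy epoch: 3 hypothetical sensors, 100 samples, a bump peaking at sample 50.
epoch = []
for t in range(100):
    amp = max(0.0, 10.0 - 0.5 * abs(t - 50))
    epoch.append([amp, -amp, 0.0])

gfp, peak, start, stop = peak_window(epoch)
window_mean = sum(gfp[start:stop]) / (stop - start)     # average over window
```

In an actual analysis, the quantity averaged over the window would be the projected magnitude rather than the global field power itself; the field power serves only to locate the peak latency.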

Table 2 presents the M170 response magnitudes across trial types and cue durations. The magnitude of the MEG signal was higher following a long-duration cue [F(1, 16) = 6.11, MSE = 0.45, p < .05], but this was qualified by a marginal interaction between trial type and cue duration [F(1, 16) = 3.12, MSE = 0.01, p < .1] due to an effect of trial type in the long-cue-duration condition [t(16) = 2.463, p < .05], but not in the short-cue-duration condition (t < 1). No significant effects were obtained for peak latency (all ps > .3). Response times did not correlate (within each condition) with MEG magnitude (ps > .5) or peak latency (ps > .6).

Table 2 Results of Experiment 1, showing means of MEG (magnitude, latency) measures (with standard deviations)

Discussion

In Experiment 1, we tested the hypothesis that there is a single change detection process underlying same/different judgments to target words presented immediately after cue words. We found that in the temporal window around the M170 in response to the target word, the response patterns across the 157 sensors were similar, regardless of whether the target was the same as or different than the cue word, which suggests that the same identification processes were involved for both trial types. Critically, as predicted, there was a main effect such that the magnitude of response was larger for different trials (novelty detection) than for same trials (lack of novelty detection). Additionally, there was an interaction with cue duration such that the effect of trial type was larger following a long-duration cue, which demonstrates that the signal underlying change detection with words is sensitive to the timing of the comparison word. This difference in the magnitudes of the target words’ M170s as a function of cue duration replicates previous findings with a perceptual identification task (Huber et al., 2008), except that the present experiment used target words that were clearly visible. Furthermore, this result was predicted by a model of change detection based on habituation. Specifically, because habituation is greater following a long-duration cue, a target that differs from the cue was expected to produce a larger boost of additional activation (i.e., novelty). This result favors an account based on habituation and novelty detection rather than familiarity.

The habituation model supposes that a novelty response in the different condition is the signal that underlies accurate performance. Therefore, the model predicts that there should be a magnitude difference between the same and different trials for both the short- and long-cue-duration conditions, considering that even though accuracy was lower in the short cue duration, it was still very good. Thus, even though the novelty response was predicted to be weaker in the short-cue condition, it should still be present. However, the MEG projection test failed to find a reliable difference in the short-cue condition (although the trend was in the expected direction). Nevertheless, the failure to detect such a difference depends on statistical power. More specifically, even if a small but highly reliable novelty response underlies performance, MEG measurements may fail to detect this response with sufficient reliability.^{Footnote 1}

From our proposal, one might expect that the amplitude or latency of the M170 would predict response time. In other words, a stronger or a quicker novelty response should serve to promote faster accurate behavioral responses. For example, Noguchi, Inui, and Kakigi (2004) used a repetition priming paradigm with letters while recording MEG, and they found that the peak latency, but not the peak amplitude, correlated with RT. In our experiment, we found no such correlation (indeed, response times were relatively uniform across conditions). However, Noguchi et al. examined a completely different neural signal, which occurred at 252 ms, almost 100 ms later than the M170 we focused on. A recent study has shown that event-related components that occur with longer delays after stimulus onset are more strongly associated with task difficulty and task-relevant perceptual processes, whereas earlier components are more associated with low-level perceptual categorization processes (Philiastides, Ratcliff, & Sajda, 2006; Philiastides & Sajda, 2006; Ratcliff, Philiastides, & Sajda, 2009). Our results suggest that the M170 is an early neural signal that discriminates between same and different trials. Furthermore, the consistency between the present results and the results of Huber et al. (2008) with a different task demonstrates that the M170 to words is not closely involved in decision processes.

Outline of Experiment 2

Experiment 1 revealed that the first neural response to a target word that distinguished same and different trials was (a) similar in its topographic distribution between the two trial types, thus implicating the same neural signal for both, and (b) larger in magnitude for different trials, suggesting that the signal driving performance is detection of novelty rather than familiarity. These results fit well with the neurocomputational model of short-term priming (Huber & O’Reilly, 2003), for which the dynamics are shown in Fig. 1. We now turn to a counterintuitive, but necessary, prediction from this model, and test this prediction by examining response times in a same/different judgment task.

We used the proposal that habituation underlies change detection to make testable predictions for performance across subsequent pairs of cue/target trials (see Fig. 3). The task was the same as in Experiment 1, but now with a structure to the trials so that we could examine the effect of repetitions from one trial to the next, and thereby the effect of habituation on change detection. Because we were interested in change detection, we focused on trials in which the cue and target differed, and looked at the effects of priming the cue versus priming the target, with priming provided by the preceding cue–target trial. As a result of this focus, Experiment 2A used a go/no-go task, with go responses when the target differed from the cue, so for this experiment the only data available were for different trials. Next, we considered the predictions of a habituation account in this paradigm for cue and target priming.

We used the following four-letter notation to indicate cue and target identity for a pair of trials in the critical conditions. Each letter symbolizes a word, and the order of the letters reflects the order of the presentation of the corresponding words (i.e., cue and target for the previous trial and cue and target for the present trial, in that order). Repeated letters indicate words that repeated within or across the two trials (the case of the letters reflects the case in which the words were presented: lower for cues vs. upper for targets). For trials that were preceded by a same or a different trial, the first two letters were aA or aB, respectively. To refer to the neither-primed conditions, the first two letters would differ from the last two letters (i.e., aAbC or aBcD). For cue-primed conditions, the second and third letters are the same (i.e., aAaB or aBbC). For target-primed conditions, the second and fourth letters are identical (i.e., aAbA and aBcB).
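The notation can be made concrete with a small classifier that follows the definitions above: the cue is primed when the second and third letters match, and the target is primed when the second and fourth letters match. The function name `classify` and the returned labels are hypothetical conveniences.

```python
def classify(pattern):
    """Classify a four-letter trial-pair code such as 'aBcB'.

    Returns (previous_trial_type, priming_condition).
    """
    if len(pattern) != 4:
        raise ValueError("expected exactly four letters")
    # Case only marks cue (lower) versus target (upper); identity ignores it.
    prev_cue, prev_target, cue, target = (ch.lower() for ch in pattern)
    previous = "same" if prev_cue == prev_target else "different"
    cue_primed = cue == prev_target        # second and third letters match
    target_primed = target == prev_target  # second and fourth letters match
    if cue_primed and target_primed:
        condition = "both-primed"
    elif cue_primed:
        condition = "cue-primed"
    elif target_primed:
        condition = "target-primed"
    else:
        condition = "neither-primed"
    return previous, condition

labels = {code: classify(code)
          for code in ["aAbC", "aBcD", "aAaB", "aBbC", "aAbA", "aBcB"]}
```

For example, `classify("aBcB")` marks a target-primed trial that follows a different trial, because the previous target B reappears as the current target.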

Figure 3 provides an illustration of the predictions of the habituation account, showing neural output activation levels as a function of time during the sequential presentation of a cue–target pair on trial n – 1 followed by trial n. On each trial, a cue is presented for 1 s, followed by a target for 1 s, and then a break of 2.5 s between trials. On presentation of any stimulus, the neural activation in the model follows a nonmonotonic function due to habituation. When neither the cue nor the target of trial n is primed (upper panel of Fig. 3), the neural responses are similar for both cue and target. However, if the cue appeared on the preceding trial (middle panel), the neural response for the cue word is habituated on trial n. Based on the results from Experiment 1, we assume that the novelty response is calculated by monitoring the degree of additional activation with the onset of the target word. Thus, performance is related to the difference between the degree of activation at the offset of the cue and the peak activation value in response to the target, as shown by the brackets on each panel. For any given trial, this difference is affected by priming from the previous trial, because habituation tends to linger for seconds or tens of seconds. In the case of cue priming (middle panel of Fig. 3), habituation increases the difference between cue and target as compared to the neither-primed condition (top panel). However, in the case of target priming (bottom panel), this difference is smaller, because the habituated target representation does not achieve as high a peak value. If this difference in activation is the signal supporting change detection, the increased signal in the cue-primed condition should lead to better performance (i.e., faster responses) relative to the neither-primed condition. Likewise, the decreased signal in the target-primed condition should reduce performance (i.e., slower responses). 
Thus, an account based on change detection and habituation predicts a pattern of costs and benefits, with last-trial priming depending on which word is primed.
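The ordinal pattern described above can be reproduced with a deliberately simplified toy simulation. This is not the Huber and O'Reilly (2003) model: it assumes only a fast activation variable and a slowly recovering resource variable per word, with output equal to their product, and all time constants are invented for illustration. The function name `novelty_signal` is likewise hypothetical.

```python
def novelty_signal(pattern, tau_act=0.05, tau_dep=0.5, tau_rec=5.0):
    """Toy change-detection signal for a four-letter trial pair such as 'aBcB'.

    Returns peak target output minus cue output at cue offset on trial n.
    """
    dt = 0.001                               # 1-ms Euler step, in seconds
    words = [ch.lower() for ch in pattern]   # same letter = same word
    onsets = [0, 1000, 4500, 5500]           # word onsets in ms (2.5-s break)
    duration = 1000                          # each word is shown for 1 s
    act = {w: 0.0 for w in words}            # fast activation per word
    res = {w: 1.0 for w in words}            # depletable resources per word
    cue, target = words[2], words[3]
    cue_at_offset, target_peak = 0.0, 0.0
    for step in range(6500):
        shown = None
        for w, onset in zip(words, onsets):
            if onset <= step < onset + duration:
                shown = w
        for w in act:
            drive = 1.0 if w == shown else 0.0
            out = act[w] * res[w]            # habituating output
            act[w] += dt * (drive - act[w]) / tau_act
            # Resources recover slowly and are depleted by ongoing output.
            res[w] += dt * ((1.0 - res[w]) / tau_rec - out / tau_dep)
        if step == 5499:                     # last sample of the current cue
            cue_at_offset = act[cue] * res[cue]
        if step >= 5500:                     # during the current target
            target_peak = max(target_peak, act[target] * res[target])
    return target_peak - cue_at_offset

neither = novelty_signal("aBcD")
cue_primed = novelty_signal("aBbC")
target_primed = novelty_signal("aBcB")
```

Under these assumptions, the signal is largest when the cue is primed (the cue is more habituated at its offset), intermediate when neither word is primed, and smallest when the target is primed (the target cannot reach as high a peak). Only this ordering, not the particular values, is meant to correspond to the predictions.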

Although Fig. 3 shows specific predictions, these predictions reflect the neural response rather than behavior. Therefore, our predictions regarding response time behavior are qualitative and only address the ordinal pattern in the data. The amount of novelty is assumed to feed into a decision process, which is needed to make quantitative predictions about accuracy and response time. Therefore, our account is focused on the process that takes place before the actual decision stage. In order to contrast this account with several other alternative explanations that have appealed to mechanisms within the decision stage, we refer to this model as a nondecision model (e.g., something prior to the decision stage, such as the speed or the quality of the perceptual response).

We can contrast these predictions to those formulated based on simple response repetition (i.e., a bias in favor of the response given on the last trial). Unlike the habituation model, response repetition predicts no difference between priming the cue versus priming the target, each as compared to the baseline neither-primed condition (see Table 3). Next, consider predictions from theories that explain priming effects through associative learning of stimulus–response contingencies (e.g., Hommel, 2004; Logan, 1988, 1990; Rothermund, Wentura, & De Houwer, 2005). These theories predict that if a word is repeated as a target on two consecutive trials and the same response is required (either “same” or “different”) on both trials, then performance should be better than in the neither-primed baseline and better than when different responses are required on the two trials (see Table 3). Note that this is exactly the opposite prediction from the habituation account (cf. Fig. 3). In addition, the habituation account predicts that “different” responses to primed targets will be slowed, regardless of the response given on the previous trial to that same target word. Whereas the habituation model is focused on processes that occur before the decision stage, the response repetition account and the stimulus–response account are examples of decision-stage models. As such, these types of models compete directly with the theoretical accounts of same/different judgments reviewed in the introduction. A third decision-stage model, an interference model, is one in which the target might be mistakenly compared with the preceding cue and the target of the previous trial. When these comparisons are congruent, the corresponding response time is faster than when the comparison is incongruent. 
As can be seen in Table 3, this alternative model makes the same prediction as the habituation model when the targets on successive trials repeat, but not when the cue was a target in the previous trial.^{Footnote 2} More specifically, the interference account does not predict an effect for the cue-primed condition, because in both the cue-primed and neither-primed conditions the target was not seen on the previous trial, and thus mistakenly comparing the target to the words of the previous trial will suggest the correct response of “different” for both conditions. This same logic explains why the stimulus–response theory also predicts no effect in the cue-primed condition—because the target was not seen on the previous trial for both the cue-primed and neither-primed conditions, it has no associated response.

Table 3 Predictions of three decision-based models and the habituation model for the change in response times between a primed trial (cue, target, or both) versus the corresponding neither-primed trial in Experiment 2

We tested the predictions of the habituation account in an experiment in two parts that involved priming from previous trials by repeating the target word across trials. In Experiment 2A, we investigated change detection by asking participants to make a response whenever a change occurred. This was achieved through a go/no-go task in which a response was required when the target word was different than the preceding cue word. To directly contrast the habituation account with the stimulus–response learning account, we conducted Experiment 2B, which was identical to Experiment 2A in terms of the actual sequence of presentations, but which measured change detection with same/different rather than go/no-go responses. Note that both of these tasks concern change detection within a trial rather than between trials. Thus, stimuli presented in the previous trial provided no information about the required response on the current trial.

Beyond priming of the cue versus priming of the target, which contrasts the predictions of the habituation account with the three decision-stage accounts, we also tested the specific predictions of the habituation model regarding the duration and timing of events. Specifically, we investigated the role of cue duration. On trials in which the cue duration is brief, the activation in response to the cue will be high at the onset of the target, so a target that is different from the cue will not provide much additional activation (i.e., change detection will be difficult). However, if the cue duration is long, the response to the cue will be strongly habituated, and it will be easy to detect a target that is different. Therefore, we manipulated the duration of the cue word (200 vs. 1,000 ms). This was done in a blocked design to allow anticipation of the target onset, which would otherwise provide a source of noise in the response times. Provided that last-trial priming was equivalent in all conditions, the habituation model predicted that longer cue durations should make it easier to detect changes, regardless of whether the cue or target was primed. Thus, the difference between the cue-primed and target-primed conditions should diminish with increasing cue duration. However, because the experiment was run with cue duration in a blocked fashion (i.e., if the cue duration on trial n was long, the cue duration on trial n – 1 was also long), and because a previous same trial introduces greater priming than a previous different trial, this introduced some complexities into the predictions that are considered in greater detail in the Discussion section.

The durations differed somewhat from those used in Experiment 1. Experiment 1 needed to include a very long cue duration to eliminate overlap in the neural response between cue and target, so 2,000 ms was used for the long-cue condition. In contrast, in Experiment 2 we used only 1,000 ms for the long-cue condition to reduce the total time of the experiment. A more important difference between Experiment 1 versus Experiment 2 was how often words repeated across trials. In Experiment 1, words never appeared on more than one trial. However, in Experiment 2, the same words not only repeated across contiguous trials, but also on many other trials throughout the experiment. This was a crucial element of the design. As discussed in the introduction, habituation is a short-term effect that biases against attended stimuli, leading to slow responses. In addition to the deleterious effect of habituation, there are also positive long-term priming effects (e.g., Hertwig et al., 2008). Thus, each word on trial n – 1 would produce a beneficial long-term priming effect together with a short-lived habituation effect, both of which might affect performance on trial n. In order to separate these two opposing forces, we used a small pool of words and drew with replacement from this pool in order to equate the words in terms of the beneficial long-term priming effect. In other words, regardless of last-trial priming, all of the words on every trial (after practice) were subject to the benefits of long-term priming. This allowed us to isolate the short-term habituation effect by comparing cue- or target-primed trials against the baseline neither-primed trials.