Introduction

When experiencing events, our cognitive system routinely makes use of expectations to anticipate upcoming information and to guide action (e.g., Rao & Ballard, 1999; Friston, 2010; Wacongne et al., 2012). The violation of expectations, and its relation to stimulus plausibility, has been explored in areas ranging from language comprehension to visual scene perception (e.g., Kutas & Hillyard, 1980; Van Berkum et al., 1999; Henderson et al., 1999; Ganis & Kutas, 2003; Hagoort et al., 2004; Mudrik et al., 2010; Võ & Wolfe, 2013; Coco et al., 2014). An open challenge remains to understand how the cognitive system uses expectancy mechanisms to hold information from multiple points in time simultaneously and to integrate it to produce action responses (Bar, 2007). The growing attention towards this challenge can be traced to current proposals in the cognitive sciences that aim to bridge low-level perceptual processes, high-level expectancy mechanisms, and motor control within the same predictive processing framework (e.g., Clark, 2013; Pickering & Clark, 2014). Lupyan and Clark (2015), for example, suggest that perception is an inferential process, whereby prior beliefs are combined with incoming sensory data to optimize in-the-moment processing and to improve future predictions. In the current study, we draw from this framework to explore how different types of expectations interactively mediate comprehension, and how these comprehension processes can be captured over time through a fine-grained analysis of response behavior.

Previous research on expectancy mechanisms has largely employed electroencephalography (EEG) measures. A common finding in this research is that a mismatch between predictions and incoming linguistic or non-linguistic stimuli triggers negative shifts in event-related brain potentials (e.g., Ganis & Kutas, 2003; Kutas & Hillyard, 1980; DeLong et al., 2005; Mudrik et al., 2010). Such shifts are typically interpreted as evidence that a prediction had occurred and that prior knowledge has been updated. Moreover, the magnitude of these negative shifts is modulated by the context in which unexpected stimuli are placed (e.g., a sentential context; Marslen-Wilson & Tyler, 1980; Kutas, 1993), as well as by additional information preceding them, such as a narrative sequence of visual scenes (Sitnikova et al., 2008; Cohn et al., 2012).

Although EEG studies have provided invaluable insights into contextual effects and processing costs at the moment unexpected stimuli are encountered, there may also be additional costs that persist after the initial encounter. This is particularly true when expectations must be maintained in memory to perform an explicit decision response, such as in verification tasks where participants are asked to assess the congruency of a pair of sequentially presented stimuli (i.e., whether a sentence and a picture convey the same message or not; e.g., Clark & Chase, 1972; Carpenter & Just, 1975). We hypothesize, in line with a predictive processing framework, that when incongruent stimuli are encountered, activation elicited by the initial stimulus will compete with bottom-up sensory activation from the second stimulus, taking time to resolve as the error signal is adjusted. Moreover, we expect multiple sources of expectancy to contribute to the strength of prediction activation: not only the congruence between consecutive stimuli, but also their plausibility (i.e., whether the stimuli depict something plausible or implausible). The interaction between these sources has been only marginally investigated, and there have been no studies, as far as we are aware, that have examined how the costs driven by congruency and plausibility activate, compete, and are resolved throughout an extended decision response.

The examination of this interaction requires an extension of typical verification tasks, along with novel ways to track competition costs. Beginning with the task, participants in our study were presented with two consecutive stimuli in different modalities (a sentence and a scene) and asked to verify the congruency of their content. As an extension, the content of the stimuli was manipulated to be consistent or inconsistent with prior knowledge expectations (i.e., the stimuli convey plausible or implausible content). For example, participants might be asked to read a sentence that is plausible or not (e.g., the boy is eating a hamburger vs. eating a brick), and then be shown a visual scene that does or does not match the earlier content (this order is reversed for another set of participants in a “scene-first” version; see Fig. 1 for the example design). In this way, expectations generated when processing the first stimulus are allowed to interact with the subsequent second stimulus in terms of both plausibility and mutual congruence (and potentially to be modulated by modality).

Fig. 1

Experimental design and example of experimental stimuli for Order of presentation: Sentence-First (top row) and Scene-First (bottom row), crossing Plausibility and Congruency. For each Order, a sentence or a scene is presented either as the first or as the second stimulus. In Sentence-First, a sentence is read in a self-paced manner, then a scene is presented for 1 second. In Scene-First, a scene is presented for 1 second, then a sentence is read. After being exposed to the pair of stimuli, participants are asked to use the computer mouse to evaluate whether the messages conveyed by the two stimuli were congruent or not; see also Fig. 2 for an example of a trial run. The sentence in Portuguese is o rapaz está a comer ..., and the four versions of the final noun phrase are: um hamburger (Congruent/Plausible), um tijolo (Congruent/Implausible), um peixe (Incongruent/Plausible), and uma alça (Incongruent/Implausible)

The above experimental setup provides a fully crossed design of congruency and plausibility expectations, which we employ to explore how systematic mismatches of expectations result in different competition costs. We implemented this design under an action dynamics approach that involved tracking participants’ computer-mouse cursor movements during verification, specifically as participants navigated from the bottom of the screen to the top, where the “yes” and “no” options were located in opposite corners. In previous work, the resulting movement trajectories have been shown to reveal moment-by-moment competition between response options as decisions unfold, typically as trajectories that deviate toward a competitor response en route to a target selection (Spivey & Dale, 2004; Magnuson, 2005; Spivey et al., 2005; Farmer et al., 2007; Dale et al., 2007; Duran et al., 2010; Papesh & Goldinger, 2012).

In the current paradigm, for example, we may observe response conflicts of this type when the first stimulus conveys plausible information, activating a “yes” response (i.e., initial movement toward this option on the screen), that must be corrected when a subsequent mismatching stimulus appears (toward a “no” response). This result would corroborate previous mouse-tracking research that has found response conflicts to be associated with congruency mismatches in verification tasks (e.g., van Vugt & Cavanagh, 2012). Critically, however, our study makes it possible to examine whether conflicts due to violation of congruency expectations are modulated by the plausibility of the stimuli. In particular, we expect that when an initial stimulus conveys information violating prior knowledge, i.e., it is implausible, a “no” response is activated. This response must be corrected to a “yes” response if the subsequent stimulus is congruent with the initial stimulus, despite the subsequent stimulus also conveying the same implausible content. This is the case, for example, when the participant reads a sentence such as the boy is eating a brick, and is then presented with a scene congruently depicting a boy eating a brick. Specifically, this response conflict should emerge very early, in the initial angle of the movement toward the incorrect response (the “no” response), and should be observed throughout the trajectory as a consistent deviation toward the incorrect choice, or as a higher number of directional changes. Such evidence would help refine the predictive processing account by showing that the matching of congruency expectations and bottom-up sensory information is not alone sufficient to facilitate cognitive processing; much depends on the plausibility of the information being integrated.

Method

Previous literature has adopted the notion of congruency to indicate both stimulus implausibility (e.g., Mudrik et al., 2010) and mismatch between consecutive stimuli (e.g., West & Holcomb, 2002; Sitnikova et al., 2008). In our 2 × 2 experimental design, Congruency indicates whether the content matched between stimuli (Congruent, Incongruent), and Plausibility indicates whether the content was expected or unexpected within a stimulus (Plausible, Implausible). Moreover, the cross-modal Order of presentation (Sentence-First, Scene-First) was manipulated as two separate experiments (between-participants design). In Fig. 1, we provide a schematic description of the experimental conditions for both studies, together with a full set of crossed pairs of stimuli.

Participants

Sixty-four students at the University of Lisbon, all native speakers of Portuguese, participated in the study for course credit. The experiment was approved by the Ethics Committee of the Department of Psychology, in accordance with the University’s Ethics Code of Practice.

Materials

We used 125 photorealistic scenes, originally published in Mudrik et al. (2010), and added another 100 scenes based on open-access material from the Internet (e.g., Flickr). Each of these 225 unique scenes (size = 550 × 550 px) was produced in the two conditions of Plausibility (Plausible, Implausible), for a total of 450 scenes across these two conditions (e.g., the picture of the boy either eating a brick or eating a hamburger). In order to generate the Congruency conditions, we constructed two types of sentences for each condition, for a total of 900 sentences, which is the total number of items (Footnote 1). As exemplified in Fig. 1, top row: Congruent/Plausible is the scene of the boy eating a hamburger paired with the sentence the boy is eating a hamburger; Congruent/Implausible is the scene of the boy eating a brick paired with the sentence the boy is eating a brick; Incongruent/Plausible is the scene of the boy eating a hamburger and the sentence the boy is eating a fish; Incongruent/Implausible is the scene of the boy eating a brick and the sentence the boy is eating a handle.

The sentences were written in Portuguese and checked for grammaticality by two independent native-speaking annotators. The target word (e.g., hamburger vs. brick) was always positioned at the end of the sentence. The annotators also ensured that the target object depicted in the scene would be recognized as the referent of the target word used in the sentence.

We divided these 900 stimuli into four Latin-square lists of 225 items each, such that no item was repeated within a list. From each list, we selected 100 items to present to an individual participant, and we made sure that, across participants, all 900 stimuli distributed over the four lists were presented an equal number of times.
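As an illustration of this rotation, a minimal sketch follows; the item, list, and condition identifiers are hypothetical and only meant to show how each base item can appear once per list, with its condition shifted across the four lists.

```r
# Minimal sketch of a Latin-square rotation (illustrative names only):
# each of the 225 base items appears once per list, and its condition
# is shifted by one position from one list to the next.
conditions <- c("Congruent/Plausible", "Congruent/Implausible",
                "Incongruent/Plausible", "Incongruent/Implausible")

make_lists <- function(n_items = 225, n_lists = 4) {
  lapply(seq_len(n_lists), function(l) {
    data.frame(
      item      = seq_len(n_items),
      list      = l,
      condition = conditions[((seq_len(n_items) + l - 2) %% n_lists) + 1]
    )
  })
}

lists <- make_lists()
# e.g., sample 100 of the 225 items in a list for one participant
one_participant <- lists[[1]][sample(nrow(lists[[1]]), 100), ]
```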

In order to assess how plausibility, congruency, and other possible task covariates (such as the grammaticality of the sentence) were perceived across trials, we asked the participant at the end of each trial to rate, on a scale from 1 to 6 (i.e., from very strongly disagree to very strongly agree): 1) the plausibility of the scene, 2) the visual saliency of the target object, 3) the congruency between the scene and the sentence, and 4) the grammaticality of the sentence. For these ratings, the previously shown stimuli were displayed and there was no time limit to answer.

In the Supplementary Material, we present analyses of these ratings, in terms of both accuracy and response time, which confirm the validity of our experimental manipulations (see ‘Question answer confidence scores’).

Apparatus and procedure

The experiment was designed using Adobe Flash 13.0 (sampling at 60 Hz) and run in the computer laboratory of the Department of Psychology at the University of Lisbon. The stimuli were presented on a 21” plasma screen at a resolution of 1024 × 768 pixels. Participants sat between 60 and 70 cm from the computer screen. Calibration of the mouse position was ensured by requiring participants to click on a black target circle (36 pixels across) located precisely at the bottom-center of the screen at the start of each trial and throughout its different phases. The optical computer mouse was placed directly on the table, rather than on a mouse pad, and participants had enough space around them to produce natural responses.

Participants first read a sentence, using a self-paced, word-by-word presentation method, by clicking on the calibration button located at the bottom of the screen. After the last word was read, a visual scene was displayed for 1 second. This preview time is based on previous work using the same stimuli (Mudrik et al., 2010), and gives enough time to extract scene information and identify the critical target object. The scene then disappeared, and the response options (yes, no) were displayed at the top of the screen, counterbalanced (left/right) between participants. Once the participant clicked on a response, the four rating questions were presented one at a time on separate screens, after which a new trial started (Fig. 2 illustrates an example trial). The Scene-First Order condition was created simply by presenting the scene for 1 second prior to the reading of the sentence.

Fig. 2

An example of a trial run. A target circle is shown at the beginning of every trial. The target ensures that all participants are calibrated to the same starting position. The target is then clicked to display the sentence one word at a time. When the last word is reached, it triggers the presentation of the scene, which is displayed for 1000 ms. After the display, the yes/no verification buttons, equally spaced from the center of the screen, are shown at the top of the screen. After the decision is made, the participant is asked to answer four questions on a Likert scale gauging: 1) the plausibility of the scene, 2) the visual saliency of the target object, 3) the congruency between the scene and the sentence, and 4) the grammaticality of the sentence. For all ratings, the previously viewed scene and sentence remain visible, which removes the need to recall the stimuli from memory

At the beginning of the session, participants performed three practice trials to familiarize themselves with the task. Each participant then completed 100 randomized trials, and the full session took between 45 and 60 minutes.

Analysis

The dataset we analyzed contained a total of 5,851 unique trials, after removing 8% (549 trials) from the full dataset because verification times were greater than 4 standard deviations from the mean or because of machine error.
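A minimal sketch of this exclusion step, assuming a data frame with one row per trial and a hypothetical `rt` column holding verification times:

```r
# Drop trials whose verification time exceeds 4 SDs above the mean;
# column names are illustrative, not those of the original scripts.
exclude_slow_trials <- function(trials, sd_cutoff = 4) {
  cutoff <- mean(trials$rt, na.rm = TRUE) + sd_cutoff * sd(trials$rt, na.rm = TRUE)
  trials[!is.na(trials$rt) & trials$rt <= cutoff, ]
}
```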

We first analyzed the accuracy of performance during the verification task. This resulted in 5,085 accurate trials (≈87%). Based on this set, we analyzed the time taken to make a decision and the dynamics of the decision process itself (i.e., the associated mouse trajectory). In particular, from the verification trajectory we extracted the following measures: (a) initial degree (the degree of deviation from vertical after the mouse trajectory leaves a 50-pixel radius around the starting point, as in Buetti and Kerzel (2009)), where positive values indicate angles towards the incorrect target; (b) latency (the time taken to move outside the initial 50-pixel region around the starting point); (c) x-flips (the number of directional changes along the x-axis); and (d) area under the curve (AUC; the trapezoidal area between the trajectory and an imaginary straight line drawn from the calibration button to the correct response button). Each measure captures complementary information about the decision process. Initial degree captures the earliest point in the response at which conflict can be observed. Latency reflects the initial hesitancy to commit to a decision, whereas x-flips gauge uncertainty and changes of mind as the decision unfolds. Finally, AUC is a summary measure of the overall strength of competition toward the incorrect response, where a greater area suggests stronger competition. These measures have been detailed in previously published research (e.g., Buetti & Kerzel, 2009; Freeman & Ambady, 2010; Dale & Duran, 2011).
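To make these definitions concrete, the sketch below computes the four measures for a single trial, assuming vectors of x/y cursor samples recorded at 60 Hz with the origin at the calibration point. All names and conventions are illustrative, not the original analysis code, and the sign of the initial angle would still need to be flipped according to the counterbalanced side of the incorrect response.

```r
# Sketch of the four trajectory measures for one trial (illustrative only).
# xs, ys: cursor coordinates relative to the calibration point; hz: sampling rate.
trajectory_measures <- function(xs, ys, hz = 60, radius = 50) {
  dist <- sqrt(xs^2 + ys^2)
  out  <- which(dist > radius)[1]          # first sample outside the 50-px region

  # (a) initial degree: deviation from vertical when leaving the 50-px radius
  #     (sign relative to the incorrect response depends on counterbalancing)
  initial_degree <- atan2(xs[out], ys[out]) * 180 / pi

  # (b) latency: time (in seconds) taken to leave the 50-px region
  latency <- (out - 1) / hz

  # (c) x-flips: number of sign changes in horizontal movement
  dx      <- diff(xs)
  dx      <- dx[dx != 0]
  x_flips <- sum(diff(sign(dx)) != 0)

  # (d) AUC: trapezoidal area between the trajectory and the straight line
  #     from the starting point (0, 0) to the trajectory endpoint
  n      <- length(xs)
  line_x <- ys * xs[n] / ys[n]
  dev    <- xs - line_x
  auc    <- abs(sum(diff(ys) * (dev[-n] + dev[-1]) / 2))

  c(initial_degree = initial_degree, latency = latency,
    x_flips = x_flips, auc = auc)
}
```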

To perform our analysis, we employed linear mixed-effects models implemented in the R package lme4 (Bates et al., 2011). We built full models with all main effects and possible interactions, and with a maximal random structure, where each random variable of the design (e.g., Participants) is introduced as an intercept and as uncorrelated slopes on the predictors of interest (e.g., Plausibility); see Barr et al. (2013). We adopted this approach also to account for the large variance observed across participants in mouse-tracking experiments (Bruhn et al., 2014).
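As a sketch, such a model for one dependent measure could be written in lme4 syntax roughly as follows; the variable and data-frame names are illustrative, and the exact set of slopes retained in the paper's models may differ.

```r
library(lme4)

# Maximal-style model with uncorrelated random intercepts and slopes;
# (0 + X | g) adds a slope for X on grouping factor g without the
# intercept-slope correlation.
m_auc <- lmer(
  AUC ~ Congruency * Plausibility * Order +
    (1 | Participant) +
    (0 + Congruency   | Participant) +
    (0 + Plausibility | Participant) +
    (1 | Scene) +
    (1 | Counterbalancing),
  data = trials
)
summary(m_auc)
```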

The dependent measures examined are: Accuracy (Footnote 2; probability), Response Time (in seconds), Initial Degree (in degrees), Latency (in seconds), x-flips (count), and AUC. The fixed variables of our design are Congruency (Congruent, Incongruent), Plausibility (Plausible, Implausible), and Order (Scene-First, Sentence-First). The random variables of our design are Participants (64), Scenes (450, as we have 225 scenes in the two conditions of Plausibility), and Counterbalancing (2, left and right). We report tables with the coefficients of the predictors, their t-values, and their p-value significance.
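For illustration, the centred contrast coding reported in Table 2, together with a binomial mixed model for the binary accuracy measure, might look roughly as follows; the data frame, column names, and the choice of a logistic link for accuracy are assumptions, not the paper's exact specification.

```r
library(lme4)

# Centred contrast coding as in Table 2 (column names are illustrative)
trials$Congruency   <- ifelse(trials$congruency   == "Congruent",   0.5, -0.5)
trials$Plausibility <- ifelse(trials$plausibility == "Plausible",   0.5, -0.5)
trials$Order        <- ifelse(trials$order        == "Scene-First", 0.5, -0.5)

# One plausible way to model trial accuracy (0/1): a logistic mixed model
m_acc <- glmer(
  accurate ~ Congruency * Plausibility * Order +
    (1 | Participant) + (0 + Plausibility | Participant) +
    (1 | Scene) + (1 | Counterbalancing),
  data = trials, family = binomial
)
```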

Results

We start by examining performance accuracy and response time across all experimental conditions. We then examine how the response conflict develops along the decision trajectory by looking at the measures of initial degree, latency, x-flips, and area under the curve, which characterize its underlying dynamics. We report both the raw observed data (means and SDs) and the coefficients of the maximal mixed-effects model.

Accuracy and reaction time

Looking at response accuracy, we find it to be quite high overall (≈87%), indicating that participants are able to perform the verification task correctly (refer to Table 1). Accuracy is strongly modulated by Plausibility and Congruency, as well as by the Order in which the sentence/scene pairs are presented. In particular, participants are more accurate when plausible stimuli are presented, and when the sentence is presented prior to the scene (refer to Table 2 for the model coefficients and their significance). Congruency is not significant as a main effect, but only in interaction with Plausibility, whereby implausible and congruently matching stimuli are verified less accurately, independently of the Order of presentation.

Table 1 Observed data (mean and standard deviation) of all summary measures reported in the study, which are organized in the table by order of presentation (Sentence-First, Scene-First), Congruency (Congruent, Incongruent) and Plausibility (Plausible, Implausible)
Table 2 Coefficients of mixed-effects models with maximal random structure (intercept and slopes on Participants, Scenes and Counterbalancing). Each dependent measure, organised across columns, is modelled as a function of the centred and contrast coded predictors: Congruency (Congruent = 0.5, Incongruent = -0.5), Plausibility (Plausible = 0.5, Implausible = -0.5), Order (Sentence-First = -0.5, Scene-First = 0.5). We report the β with the associated p-value, and the t-value from which it was derived

For the response times to reach a decision (correct trials only), we again observe a main effect of Plausibility, whereby implausible information is verified more slowly than plausible information. Moreover, we confirm the interaction between Plausibility and Congruency, whereby implausible but congruent stimuli take longer to verify.

Initial degree, latency, x-flip, and area under the curve

Table 2 reports the coefficients for the mixed-model analyses of the initial degree, the latency to start the movement, the flips along the x-axis during the trajectory, and the area subtended by it.

From the very first moments of the trajectory (initial degree), participants display a larger conflict in their response when the stimuli are implausible and congruently matched (two-way interaction between Plausibility and Congruency). This can also be seen in the observed data in Table 1, where we find a more positive initial angle (positive values indexing angular movement towards the incorrect response) when implausible stimuli are congruent. Conversely, the least deviated initial angle is observed when implausible stimuli are incongruent. No other main effects or interactions are observed for the initial angle.

For latency, participants hesitate more before starting their verification when the stimuli are implausible, a pattern similar to that observed for overall response time. Interesting results for latency are obtained in interaction with Order of presentation. We find that when the scene is presented first and the stimuli are implausible, participants are faster to initiate the movement. We also find a significant interaction between Congruency and Order, such that when the scene is presented first and the stimuli are incongruent, participants take longer to initiate the movement.

The measures of x-flips and area under the curve converge on the same significant result: implausible but congruent pairs generate more complex trajectories (interaction between Plausibility and Congruency). These results corroborate all the other analyses reported above: greater response conflicts are generated when implausible stimuli have to be accepted as congruent.

In the Appendix, we also model the angular trajectories using growth-curve analysis, corroborating the results reported above at an even finer-grained resolution. Furthermore, in the Supplementary Material, we re-analyze all summary measures presented in this study, but instead of using the two-level categorical distinctions for Plausibility (Plausible, Implausible) and Congruency (Congruent, Incongruent) as predictors, we use the ratings provided by the participants during question answering (refer to Fig. 2 for an example of an experimental trial with these ratings). This additional analysis largely confirms the results presented in the main paper, and serves to demonstrate the ecological validity of our experimental manipulations and verification paradigm.
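For readers unfamiliar with the technique, a growth-curve analysis of the angular trajectories could be sketched roughly as below, regressing the angle measured in normalized time bins on orthogonal polynomial time terms; this is a generic sketch with assumed variable names, not the exact specification used in the Appendix.

```r
library(lme4)

# Generic growth-curve sketch: 'traj' holds one row per time bin per trial,
# with a time_bin index and the trajectory angle at that bin.
ot <- poly(traj$time_bin, 2)      # orthogonal linear and quadratic time terms
traj$ot1 <- ot[, 1]
traj$ot2 <- ot[, 2]

m_gca <- lmer(
  angle ~ (ot1 + ot2) * Congruency * Plausibility +
    (1 + ot1 + ot2 | Participant),
  data = traj
)
```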

Discussion

There has been a recent and growing interest in understanding the cognitive system along central principles of predictive processing (e.g., Clark, 2013; Pickering & Clark, 2014; Lupyan & Clark, 2015). The current study adds to these attempts by employing a task that required participants to verify whether two sequentially presented stimuli, which also varied in the plausibility of their content, conveyed the same message. In doing so, two sources of expectancy (stimulus congruency and stimulus plausibility) were allowed to converge and compete during verification. Moreover, in a departure from the majority of previous research, we examined this competition in the continuous updating of the motor system using an action dynamics approach. This allowed a novel view into the temporally extended activation and resolution of competition. Whereas reaction time primarily indexes the time it takes to make a response, and EEG the neural resources recruited at a specific time, we were able to examine competition from the earliest moments of processing and throughout the decision process.

Our analysis of the motor movements focused on key summary measures, including initial degree of movement, latency of movement, x-flips, and area under the curve (AUC). We interpreted greater latencies, increased complexity, and deviations toward an incorrect response as signaling greater processing costs and the application of error-signal updating. All measures consistently point to the same core result: implausible but congruently matching stimuli generated greater response competition than incongruent and implausible stimuli. A match between expectations elicited by an initial stimulus and incoming sensory information from the second stimulus would normally facilitate processing. However, in our scenario, such facilitation was actually disrupted because the incoming information violated other expectations based on prior knowledge.

As evidenced by the computer mouse movements, when the implausible stimulus is encountered, there is a very early hesitation and bias to respond “no” that is activated and persists during verification, competing with the congruency expectation to respond “yes.”

This result is in line with current proposals of predictive processing, underscoring how the cognitive system actively engages generative mechanisms of expectation (i.e., predictive coding) and error correction (e.g., Rao & Ballard, 1999; Bar, 2007; Hinton, 2007; Kok et al., 2012; Wacongne et al., 2012), drawing on multiple sources of expectation from the local context (congruency expectations) and from prior knowledge (plausibility expectations). Moreover, this research helps refine current proposals of predictive processing by showing that congruency expectations can be immediately disrupted and temporarily overridden when incoming stimuli violate prior knowledge.

The cross-modal paradigm also tested whether the directionality of the response costs is sensitive to the modality order of presentation (i.e., sentence-first vs. scene-first) (e.g., Clark & Chase, 1972; Federmeier & Kutas, 2001; Pecher et al., 2003). We found hints of an asymmetry in the latency to start the movement, and especially in the angular trajectory (see Appendix), whereby stronger conflicts are observed when an implausible sentence is reinforced by a subsequent congruently matching scene (the sentence-first scenario). Based on the literature on simulation models in sentence-picture verification tasks (e.g., Zwaan et al., 2002; Ferguson et al., 2013), we speculate that when a sentence is presented first, the resulting simulated imagery, as opposed to the mere viewing of a picture, relies on deeper connections to prior knowledge to generate its content, and thus this condition shows a greater sensitivity to stimulus plausibility. Future research will be needed to better assess these asymmetries triggered by modality order, for example by providing greater initial linguistic context to make the simulations even richer.

Another natural avenue for investigating the effects of plausibility and congruency on predictive processing is to combine a fully crossed design, as in the current study, with EEG measures. Intriguing results about the impact of these sources on neural responses have been reported, for example, by Dikker and Pylkkanen (2011), who used a picture-name matching task and observed a very early signature of prediction violation, at around 100 ms, when words did not accurately describe the pictures. This result has recently been corroborated by Coco et al. (2015), where congruency and plausibility were found to be associated with distinct temporal latencies, such that incongruent pairs generated revision costs as early as 100 ms after stimulus onset, whereas implausibility effects began at around 250 ms (the same latency reported by Mudrik et al. (2010), and more recently by Mudrik et al. (2014)). Of great interest would be to compare neural responses with the behavioral patterns reported here and to draw joint conclusions about on-line stimulus processing and later integrative verification dynamics. Other key questions relate to whether participants adapt to implausible verification, and to the amount of exposure needed to modify prior knowledge so that implausible information is accepted without additional costs.

In conclusion, the current study provides new evidence for how response conflicts arise in the motor system as decision responses are acted out. We take this as demonstrating that the plausibility of stimuli mediates congruency expectations in a verification task. More generally, these results provide compelling evidence in support of a predictive processing account by showing that the matching of congruency expectations is not sufficient for facilitating cognitive processing, but depends greatly on the plausibility of information that is being integrated.