Introduction

The processes underlying visual word identification are tolerant to noise and variations in shape of the words’ constituent letters (see Grainger, 2018; Grainger & Dufau, 2012, for reviews). One of the most striking demonstrations was provided by Hannagan, Ktori, Chanceaux, and Grainger (2012) with briefly presented text-based CAPTCHAs (Completely Automated Public Turing test to tell Computers and Humans Apart; e.g., ). They conducted a lexical decision experiment (“does the stimulus form a word?”) in which an uppercase printed target was preceded by a forwardly masked lowercase CAPTCHA prime for 50 ms. The prime was nominally identical or unrelated to the target. Results showed substantially faster responses in the identity than in the unrelated condition. Hannagan et al. (2012) concluded that the word recognition system is tolerant to “global continuous input transforms and small letter rotations” (p. 2).

Clearly, the presence of masked identity priming effects with CAPTCHAs or other degraded stimuli (e.g., handwritten words; see Perea, Marcet, Uixera, & Vergara-Martínez, 2018b, for review) is quite impressive. However, it is difficult to pinpoint the locus of this effect because the letters in degraded stimuli are distorted across several dimensions. To avoid this interpretive issue, here we focused on a distortion along a single dimension: letter rotation. Importantly, the most influential neural model of word recognition makes a strong prediction in the context of visual word processing: “letter detectors should be disrupted by rotation (> 40°)” (Local Combination Detectors [LCD] model: Dehaene, Cohen, Sigman, & Vinckier, 2005, p. 340). Similarly, Cohen, Dehaene, Vinckier, Jobert, and Montavont (2008, p. 364) stated: “the ventral visual word form system, alone, is limited in its invariance” and would be hindered by rotations above 45°. Consistent with this prediction, a number of single-presentation experiments have reported shorter response times to isolated letters and letter strings when presented horizontally than when rotated 45° or 90° (Risko, Medimorec, Chisholm, & Kingstone, 2014; see Kim & Straková, 2012, for electrophysiological evidence; see Blythe, Juhasz, Tbaily, Rayner, & Liversedge, 2019, for evidence during sentence reading; see also Jolicoeur & Landau, 1984; Jolicoeur, Snow, & Murray, 1987, for early research with isolated letters). These effects are greater for letter strings than for isolated letters – this has been interpreted in terms of the small time needed to encode a simple pattern (e.g., a letter; see Risko et al., 2014, for discussion).

Critically, this prediction from the LCD model has recently been put into question in several experiments that tapped the initial moments of processing using Forster and Davis’ (1984) masked priming procedure (see Grainger, 2008, for a review of the advantages of masked priming over single-presentation techniques). In a lexical decision experiment, Perea, Marcet, and Fernández-López (2018a) found that the magnitude of masked repetition priming (e.g., social-SOCIAL vs. energy-SOCIAL) and masked transposed-letter priming (e.g., soical-SOCIAL vs. soaral-SOCIAL) was sizeable and similar for words in marquee format (i.e., a vertical format with horizontal letters; 40-ms identity priming effect; 19-ms transposed-letter priming effect) and for words rotated 90° (37-ms identity priming effect; 20-ms transposed-letter priming effect). Clearly, if letter detectors had been disrupted by rotations above 40° in the first moments of word processing, one would have expected a sizeable reduction of masked repetition/transposed-letter priming effects for 90° rotated words. Furthermore, Yang and Lupker (2019) provided an even more extreme demonstration. They conducted two masked priming lexical decision experiments in which the words were presented horizontally or rotated 90° or 180°. While, unsurprisingly, overall latencies were longer for rotated stimuli (i.e., 180° > 90° > 0°), the size of the transposed-letter priming effect was similar in the three formats (33, 29, and 35 ms for 0°, 90°, and 180° words, respectively). Taken together, these two series of experiments suggest that, in the first moments of processing, letter detectors do not seem to be severely hindered by rotation. However, there are two potential limitations in these experiments. First, one might argue that top-down lexical processes might have contributed to cancelling out the effect of rotation angle (i.e., prime-target integration can be enhanced when the stimuli tap onto lexical representations, over-riding physical features (e.g., altar-ALTAR = ALTAR-ALTAR); see Vergara-Martínez, Gomez, Jiménez, & Perea, 2015, for electrophysiological evidence). Second, participants could have spontaneously used strategies when processing the rotated stimuli to offload cognitive work such as tilting their heads (see Risko et al., 2014, for discussion).

In the present set of experiments, we examined the effects of letter rotation in the first moments of letter processing while avoiding lexical or strategic processes. To do that, we combined a letter match task (i.e., a task that does not involve lexical processing) with a masked priming procedure (see Kinoshita & Kaplan, 2008; Kinoshita, Robidoux, Mills, & Norris, 2013). Importantly, both probe and target letters – always in different case – were presented in the canonical horizontal format (i.e., only the masked prime was rotated). In this manner, participants could not realize that the prime stimuli were rotated, thus minimizing any potential conscious strategies. The setup of a given trial was the following. A probe letter was presented above a pattern mask for 750 ms. The prime stimulus replaced the mask for 50 ms, which was in turn replaced by the target letter. To effectively mask the rotated primes, the size of both mask and target was greater than the size of the prime (see Fig. 1, for illustration of the technique). The key comparison was between identity primes versus unrelated primes (e.g., probe r; prime R; target R vs. probe r; prime T; target R).

Fig. 1
figure 1

Schematic representation of a “same” trial and a “different” trial with a masked priming letter match task with a 45° prime rotation

As indicated earlier, previous parametric manipulations of rotation angle with isolated letters have shown a processing cost. In a letter-identification task in which letters were presented briefly and masked, Jolicoeur and Landau (1984) found a higher percent of errors as a function of rotation angle ([0° vs. 30° vs. 60° vs. 90°] Experiment 1: 21.9 vs. 40.6 vs. 57.3 vs. 52.1%, respectively; Experiment 2: 25.0 vs. 25.0 vs. 43.8 vs. 50.0%, respectively). More recently, in a naming task in which the letters were presented at several orientations, Risko et al. (2014) found that response times were modulated by rotation angle ([0° vs. 45° vs. 90°] Experiment 1: 823 vs. 831 vs. 870 ms, respectively; Experiment 2: 963 vs. 1,045 vs. 1,068 ms, respectively). However, none of these experiments can disentangle whether the effect of rotation angle occurred in the initial moments of processing or whether the processing cost occurred later in processing. This is the reason why we opted for a masked priming paradigm with isolated letters: if rotation slows down the initial stages of letter processing, the size of masked identity priming should be reduced for rotated primes beyond 40–45°.

In each of the three experiments reported here, the primes were presented with one of two rotation angles, the difference always being 45°. In Experiment 1, we compared 0° versus 45° rotation angles. In Experiment 2, we compared 22.5° versus 67.5° rotation angles. In Experiment 3, we compared 45° versus 90° rotation angles. If letter detectors are hindered by rotations above 40° or 45°, as can be inferred from the LCD architecture (Dehaene et al., 2005), masked identity priming should be substantially reduced when the rotation angle of the prime is 45° or above. That is, one would expect an interaction between rotation and identity priming in Experiments 1 and 2, and a null effect of priming in Experiment 3. Alternatively, if letter detectors are resilient to rotation in the initial moments of processing, as claimed by Perea et al. (2018a, b) and Yang and Lupker (2019), masked identity priming should be approximately similar in magnitude in the two rotation angles in the three experiments.

Method

Participants

The sample was composed of 60 students from the University of Valencia with no hearing or reading disorders (20 in each experiment). All of them had normal or corrected-to-normal vision and signed a consent form before starting the experiment. For each experiment, the number of stimuli in each condition (identity condition, unrelated condition) for both “same” and “different” trials was 128 (i.e., 20 * 128 = 2,560 observations).

Materials

We employed eight consonants (B, C, F, G, R, S, T, and V). Probe and targets were always in different case (e.g., r-R, s-T) and primes were presented in the same case as the targets – note that Kinoshita and Kaplan (2008) found that identity priming occurred to a similar degree for visually similar pairs (e.g., t-T) and for visually dissimilar pairs (e.g., r-R). For “same” trials, the prime could be the same letter as the target (identity condition) or a different letter (unrelated condition) (e.g., [probe-prime-target] r-R-R vs. r-T-R). For “different” trials, the prime could be the same as the probe or not (e.g., f-F-R vs. f-T-R) – this way, the probe-prime relationship was not predictive of the responses (see Fig. 1, for a scheme of a trial). The rotation angles for the primes were: 0° versus 45° (Experiment 1), 22.5° versus 67.5° (Experiment 2), and 45° versus 90° (Experiment 3).Footnote 1

Procedure

The experiments were conducted individually or in groups of up to three participants in a quiet room. DMDX software on Windows computer (Forster & Forster, 2003) was used to present the stimuli and collect the responses. Participants were told that, on each trial, they would see a sequence of two letters, and they had to decide – as quickly and accurately as possible – whether these two letters were the same or not. On each trial, a probe letter was presented above a pattern mask (#) on the CRT screen for 750 ms. Then, the probe disappeared and the prime replaced the mask for 50 ms and, subsequently, the target letter replaced the prime. To effectively mask the primes, the mask was presented in 48-pt Arial font, the prime was presented in 12-pt Arial font, and probes and targets were presented in 22-pt Arial font. The target remained on the screen until the response – there was a 2-s deadline. There were 16 practice trials before then 512 experimental trials in each experiment. The session took approximately 18–22 min.

Results

As is usual with this procedure, we focused on “same” trials (i.e., trials in which the probe and the target were the same). In the response time (RT) analyses, we excluded the incorrect responses and the correct responses briefer than 250 ms. The mean RTs and error rates in each condition are displayed in Table 1. For the inferential analyses, we employed the lme4 (Bates, Maechler, Bolker, & Walker, 2015) and lmerTest packages (Kuznetsova, Brockhoff, & Christensen, 2016) in R (R Core Team, 2019). In each experiment, the two fixed factors (prime-target relationship [identity, unrelated] and rotation angle) were zero-centered. For the latency data, we employed generalized mixed-models analyses with the Gamma function because they do not require a non-linear transformation of RTs – note that non-linear transformations might (in some scenarios) alter the nature of the interaction between factors (see Yang & Lupker, 2019, for discussion). Nonetheless, inverse transformations (i.e., -1,000/RT) with linear mixed-effects models and analyses of variance on the participants’ and items’ means produced the same pattern of findings as that reported here. In the analyses of accuracy (i.e., categorical responses: 1 = correct; 0 = error), we employed generalized mixed-models analyses with the binomial function. For each test, we chose the most complex random effect structure that converged. The specific models are indicated in Appendix 1.Footnote 2

Table 1. Mean response times (in ms) and percentage of errors (in parentheses) in each of the conditions in the experiment

Experiment 1 (0° vs. 45°)

Latency data

We found faster responses in the identity condition than in the unrelated condition, b = 17.215, SE = 4.794, z = 3.591, p < .001. In addition, responses were faster when the prime was in horizontal format than in the 45° format, b = -9.358, SE = 3.866, z = -2.421, p = .015. The interaction between the two factors did not approach significance (z < 1, p > .45).

Accuracy data

The data showed a small numerical advantage of the identity priming condition, but neither the main effects nor their interaction was significant (priming: b = -0.381, SE = 0.204, z = -1.869, p = .062; rotation: b = 0.387, SE = 0.201, z = 1.93, p = .054; interaction: b = -0.480, SE = 0.257, z = -1.869, p = .062)

Experiment 2 (22.5° vs. 67.5°)

Latency data

The results showed an advantage of the identity over the unrelated condition, b = 14.675, SE = 3.678, z = 3.990, p < .001, whereas the other factors did not approach significance (both zs < 1, ps > .61).

Accuracy data

None of the effects approached significance (all zs < 1.4, all ps > .15).

Experiment 3 (45° vs. 90°)

Latency data

Response times were faster in the identity condition than in the unrelated condition, b = 15.785, SE = 4.707, z = 3.354, p < .001. The other factors did not approach significance (both zs < 1, both ps > .45).

Accuracy data

None of the effects approached significance (all zs < 1.6, all ps > .11).

Discussion

While there is some agreement that rotated letters are identified more slowly than their canonical upright counterparts (see Risko et al., 2014, for review), it is unclear whether this effect has its locus in the initial stages of processing. In the current set of experiments, we examined whether letter detectors are disrupted by rotation angle up to 90° during the first stages of processing. To that end, we manipulated the rotation angle of letter primes in three masked identity priming letter match experiments. We found a sizeable masked identity priming effect that was comparable in magnitude regardless of the rotation angle of the prime (Experiment 1: 0° vs. 45°; Experiment 2: 22.5° vs. 67.5°; Experiment 3: 45° vs. 90°). Critically, this pattern occurred in a scenario in which participants could not anticipate that some of the stimuli were rotated: all probe and target stimuli were presented in the standard upright orientation. While identity priming was not affected by letter rotation, we did find a small overall cost of letter rotation in Experiment 1 (0° vs. 45°). However, this effect must be interpreted with some caution, as it may reflect an effect of congruency between the rotation angle of the prime and the rotation angle of the probe/target – consistent with this interpretation, this difference was absent in Experiments 2 and 3.

The pattern of priming effects in these letter-match experiments is consistent with the robust masked identity and form priming effects in the lexical decision task reported by Perea et al. (2018a, b) and Yang and Lupker (2019) with 90° rotated words. The most parsimonious explanation of these findings is that, in the early stages of letter and word processing, letter detectors are resilient to rotation angles of 90° or below. Thus, the processing cost that occurs when identifying isolated rotated letters (Risko et al., 2014) or rotated words (see Yu, Gerold, Park, & Legge, 2010) would not be due to a disruption during the initial stages of processing but instead it would occur at a later processing stage (see Forster, 1998, for a similar argument concerning mixed-case stimuli).

Thus, the current experiments, together with the Perea et al. (2018a, b) and Yang and Lupker (2019) experiments, pose some problems for the LCD model of visual word recognition (Dehaene et al., 2005). This model establishes clear predictions regarding the tolerance of the visual system to different types of stimulus degradation during word reading. Specifically, in the context of fast word recognition, letter detectors should be disrupted by rotation angles larger than 45o. But why did Dehaene et al. (2005) make the claim that a rotation of around 40–45° would mark the limit of resilience in letter detectors? The key basis of their claim was based on an influential experiment conducted by Logothetis and Pauls (1995) with monkeys. These primates had been trained with novel wire-like and spheroidal 3D objects that were always presented with a specific viewpoint. In a subsequent phase, the monkeys were presented with these objects, but in various angles. The monkeys’ ability in identifying these objects decreased dramatically in rotation angles of around 40° or 45°. However, these findings, which were obtained after presenting various 3D objects with a single viewpoint to a non-human species, may not be generalizable to the letter detectors of the reading system in humans. Leaving aside that letters are 2D objects, compared to the artificial 3D objects used in the Logothetis and Pauls (1995) experiment, the tuning of letter detectors (i.e., the groups of neurons that respond to a given letter in humans) is fed by experience with multiple different printed versions of the same letter (e.g., A, A, A). In order to rapidly recognize each of a word’s constituent letters, the visual system encodes basic-level differentiating features while over-riding subordinate-level differences (e.g., font, size, etc.). As a result, letter detectors may be more resistant to variations in the visual form of the letters than the groups of neurons that respond to a trained 3D object in a single viewpoint in monkeys (see Hannagan et al., 2012, for discussion).

To sum up, the present masked priming experiments demonstrated that the visual form processing system is resilient to rotation angles of 90° or below in the initial moments of letter processing. This pattern supports the claims that, as a result of extensive experience, the human brain has developed a perceptual expertise for processing letters and words independently of the form (i.e., the abstract unit “e” would be activated with the visual input e, e, or ). As Koriat and Normal (1985) anticipated, the extensive practice in letter/word identification results in the “establishment of a broadly tuned memory representation that allows direct stimulus recognition over a relatively wide range of stimulus orientations” (p. 438). What the current set of experiments revealed is that the “internal rectifying operations” to reconstruct a canonical version of a rotated letter seem to occur very rapidly. Further experiments should examine in depth the time course of letter rotation (e.g., using electrophysiological measures) not only with isolated letters but also with words composed of rotated letters.

Open practice statement

All stimuli, data, and scripts are available at https://osf.io/qru9k/.