Effects of intraword and interword spacing on eye movements during reading: Exploring the optimal use of space in a line of text

Slattery, Timothy J.; Rayner, Keith

doi:10.3758/s13414-013-0463-8

Effects of intraword and interword spacing on eye movements during reading: Exploring the optimal use of space in a line of text

Published: 25 May 2013

Volume 75, pages 1275–1292, (2013)
Cite this article

Download PDF

Attention, Perception, & Psychophysics Aims and scope Submit manuscript

Effects of intraword and interword spacing on eye movements during reading: Exploring the optimal use of space in a line of text

Download PDF

Timothy J. Slattery¹ &
Keith Rayner²

4727 Accesses
34 Citations
5 Altmetric
Explore all metrics

Abstract

Two eye movement experiments investigated intraword spacing (the space between letters within words) and interword spacing (the space between words) to explore the influence these variables have on eye movement control during reading. Both variables are important factors in determining the optimal use of space in a line of text, and fonts differ widely in how they employ these spaces. Prior research suggests that the proximity of flanking letters influences the identification of a central letter via lateral inhibition or crowding. If so, decrements in intraword spacing may produce inhibition in word processing. Still other research suggests that increases in intraword spacing can disrupt the integrity of word units. In English, interword spacing has a large influence on word segmentation and is important for saccade target selection. The results indicate an interplay between intra- and interword spacing that influences a font’s readability. Additionally, these studies highlight the importance of word segmentation processes and have implications for the nature of lexical processing (serial vs. parallel).

Reconsidering the Evidence That Systematic Phonics Is More Effective Than Alternative Methods of Reading Instruction

Article Open access 08 January 2020

The simple view of reading and its broad types of reading difficulties

Article Open access 12 August 2023

RETRACTED ARTICLE: Eye tracking: empirical foundations for a minimal reporting guideline

Article Open access 06 April 2022

While there have been a considerable number of experiments (see Rayner, 1998, 2009, for reviews) devoted to understanding how various lexical variables influence eye movements during reading, there have been far fewer studies examining the influence of typographical and font variables. It is quite clear that very difficult to encode fonts will lead to slower reading and, concomitantly, to longer eye fixations, shorter saccades, and more regressions (Rayner & Pollatsek, 1989; Rayner, Pollatsek, Ashby, & Clifton, 2012; Rayner, Reichle, Stroud, Williams, & Pollatsek, 2006). In general, however, the consensus view seems to be that as long as type font, type size, and length of lines are at all reasonable, reading will proceed quite normally, because lexical processing of the words in the text drives the eyes (Morrison & Inhoff, 1981; Rayner & Pollatsek, 1989). Because of this general view, until recently, the number of studies examining typographical variables has been quite sparse. However, recently a number of studies dealing with the effect of typographical variables on eye movements during reading have appeared. Indeed, Slattery and Rayner (2010) demonstrated that even subtle font differences lead to effects on eye movements and that these effects can interact with higher level cognitive variables like word frequency. In the present article, we examined how a different type of typographical variable, the spacing between letters, influences reading.

Calculating the number of characters (N _C) on a line of text is a trivial matter. The relevant variables for the calculation are the length of the line (L) and the width of the individual characters (W _C). Assuming a fixed width font for simplicity results in Eq. 1:

$$ {N_{\mathrm{C}}}=L/{W_{\mathrm{C}}}. $$

(1)

However, for our discussion, it is also necessary to differentiate between characters and letters. Here, we will refer to a letter as the colored area of a (nonspace) character that is distinct from the background. Therefore, a character contains the letter and the space surrounding this letter, which is indistinguishable from the background. Two letters (e.g., xy) within a word will be separated by intraword space: the sum of the space to the right of the leftmost letter and the space to the left of the rightmost letter. The importance of this intraword space (S ₁) can be seen through the application of kerning. Kerning is the process of adjusting the intraword space between certain letters so that the letters within a word all appear uniformly spaced. For instance, in the uppercase word VAST, the letters V and A are placed closer to each other than the other letters are. In fact, in x, y coordinate space, the x value of the rightmost point of the letter V is greater than the x value for the leftmost point of the letter A.

Of course, not every character contains a letter. The interword space (S ₂) is a character that is completely indistinguishable from the background. These interword spaces are far more distinct in English (and other alphabetic) text than the intraword spaces between letters within a word are and so play a crucial role in the number of letters that can fit on a line of text. For a fixed width font, this results in Eq. 2, where W _L is the width of the letter and N _W is the number of words on the line:

$$ {{N}_{{\rm{L}}}} = \left( {L - {{S}_{2}}*\left( {{{N}_{{\rm{W}}}} - 1} \right)} \right)/\left( {{{W}_{{\rm{L}}}} + {{S}_{1}}} \right). $$

(2)

While it is a trivial matter to calculate the number of characters or letters that can fit on a line of text, it is far less trivial to determine how to optimize the variables in Eq. 2 for the purpose of reading efficiency. The present work outlines the relevant factors involved in determining such optimal values for two of the variables involved in this equation: intraword spacing and interword spacing.

Intraword spacing effects

Intraword spacing can influence reading processes in a number of ways. First, it is well-known that crowding from flanker letters influences how quickly and accurately a central letter can be identified (Bouma, 1970; Chung, Levi, & Legge, 2001; Eriksen & Eriksen, 1974). If letter trigrams are pushed closer together, masking from the exterior flanker letters makes it harder to identify the central letter, whereas increasing the space between these letters reduces the amount of crowding, making it easier to identify the central letter. In fact, increased spacing between letters also results in increases in perceived letter size (Skottun & Freeman, 1983). There are three main characteristics of visual crowding. The first is that effects of crowding increase with increasing distance of the target from the fovea (Bouma, 1970). The second is that the effect of flankers is asymmetric, with the outer or more eccentric flanker exerting a greater crowding effect than the less eccentric inner flanker (Petrov, Popple, & McKee, 2007). Finally, the zone of crowding is not circular but, instead, exhibits a radial-tangential anisotropy, such that flankers positioned along the radial axis from the fovea to the target will produce more crowding than those placed tangentially to this axis (Toet & Levi, 1992). Recently, Nandy and Tjan (2012) showed that all of these characteristics of crowding can be explained as a consequence of saccades confounding the statistics of natural images. However, identifying a central letter is very different from identifying a word. For instance, with central letter identification tasks, performance improves to an asymptote as the flankers are moved further from the central letter. However, with word identification tasks like lexical decision and categorization, inhibition occurs both with reduced intraword spacing and with intraword spacing that is increased beyond some critical point (Chung, 2002; McLeish, 2007; Paterson & Jordan 2010; Pelli et al., 2007; Risko, Lanthier, & Besner, 2011; Vinckier, Qiao, Pallier, Dehaene, & Cohen, 2011). While it is clear that increasing intraword spacing beyond some critical value (two or three character spaces) will disrupt reading, it is far less clear what effect will occur for more subtle increases. There are reports of facilitation in lexical decision tasks with subtle increases to intraword space (Perea & Gomez, 2012). However, as Perea, Moret-Tatay, and Gomez (2010) noted, the results of studies that use subtle manipulations of increased intraword spacing are somewhat inconsistent, probably due to the fact that the amount of space added between letters varied across the studies, as did the fonts used in the studies. Fonts can differ quite a bit in their default intraword spacing. For instance, Times New Roman has less spacing than Courier New. Therefore, if there exists some optimal value for intraword spacing, one would expect that studies using different fonts might yield inconsistent results.

Changes in intraword spacing, unless compensated for with changes in interword spacing, will also lead to changes in the number of letters that can fall within high-acuity foveal vision during a single fixation. For single-word presentation tasks like lexical decision, naming, and categorization, this is only a minor issue. However, for normal reading, which involves a considerable amount of parafoveal preprocessing of text (Rayner, 1998, 2009; Schotter, Angele, & Rayner, 2012), such changes could add up to large effects as upcoming words are pushed further and further from fixation.

Some studies have explored the effect of adding or deleting spaces within text during normal reading by examining eye movements. In a clever experiment, McDonald (2006) varied the letter width and intraword space such that all the words in a sentence would subtend the same visual angle. He found clear differences between target words that differed in number of letters (so either a six-letter word or an eight-letter word occupied the same amount of space in the sentence). Specifically, the more letters in the word, the greater the number of fixations that were made on the word, and the longer the fixation times on the word were. Of course, this manipulation confounds the number of letters in a word with letter width and spacing, and McDonald noted that the most plausible explanation for the findings was that the longer words were subject to a greater degree of visual crowding. Going in the other direction, Paterson and Jordan (2010) found a detrimental effect of intraword spacing on eye movements. However, in their experiment, the smallest addition to intraword spacing added an extra space b e t w e e n e a c h l e t t e r (as in the prior three words), and this most likely disrupted the overall integrity of the words in the sentences. In fact, Paterson and Jordan also reported that the effect of word frequency was larger for all increased spacing conditions relative to the standard spacing control condition. From this result, they argued that the increased spacing interfered with normal word processing.

Interword spacing effects

Word identification is paramount during reading. As such, it is crucial that when we read a line of text, we are able to identify the beginnings and endings of individual lexical items, a process referred to as word segmentation. A number of studies have reported substantial reductions in reading rate for English text when interword spaces are removed (Morris, Rayner, & Pollatsek, 1990; Perea & Acha, 2009; Pollatsek & Rayner, 1982; Rayner, Fischer, & Pollatsek, 1998; Rayner, Yang, Schuett, & Slattery, 2013; Sheridan, Rayner, & Reingold, 2013; Spragins, Lefton, & Fisher, 1976). However, at least one study has reported a more modest reduction in reading rate for text without spaces (Epelboim, Booth, & Steinman, 1994). This reduction in reading rate is greater for lower frequency words than it is for higher frequency words and greater for contextually constraining text than for less constraining text, suggesting that the lack of interword spacing interferes with normal word identification processes. It is interesting to note that not all written languages use interword spaces. For instance, neither Thai nor Chinese text has interword spaces. However, despite this lack of interword spacing, word segmentation is just as important in these languages (see Li, Rayner, & Cave, 2009). For instance, text with added interword spaces has been found to increase reading rate for both Thai (Kohsom & Gobet, 1997; Winskel, Radach, & Luksaneeyanawin, 2009) and Chinese (Hsu & Huang, 2000a, 2000b), as compared with traditional text without such word spaces. Additionally, novel Chinese words are learned more efficiently when presented in sentences with interword spaces (Blythe et al., 2012). However, other studies have reported faster reading of text with added interword spaces only relative to a condition with spaces added at nonword boundaries (Bai, Yan, Liversedge, Zang, & Rayner, 2008), with no difference in reading rate between traditional nonspaced text and text with interword spaces. More recently, it has been shown that people learning Chinese as a second language benefit from added interword spaces (Shen et al., 2012). Thus, readers of Thai and Chinese appear to segment characters into words, similar to readers of alphabetic languages, but normally make use of cues other than interword spaces for these segmenting processes.

Interword spaces also have the effect of reducing lateral inhibition of the first and last letters of words. This may be largely responsible for the important role that first and last letters of words play during word recognition (Davis, 2010; Gomez et al., 2008; Jordan, 1990, 1995). Thus, we might expect that increasing intraword spacing would reduce this lateral interference, leading to faster reading rates, especially for fonts with small default interword spacing.

Interword spaces may play a role beyond just word segmentation and lateral inhibition of word-beginning and -ending letters. They may also influence the targeting and or accuracy of saccades within the oculomotor system. Interword space helps to break up the line of text into distinct light and dark patches. This low-frequency spatial information can be used even in parafoveal vision to help target saccades to areas that are more optimal for word identification. With normally spaced text, a reader’s first fixation on a word tends to be just left of word center (Rayner, 1979). This location is referred to as the preferred viewing location. However, with unspaced text, readers’ initial fixation on a word tends to be shifted more toward the beginnings of words (Rayner et al., 1998). However, there are, of course, errors in saccade planning and execution. Often, these errors are large enough to result in mislocated fixations—fixations that land on unintended words. Such mislocated fixations have been estimated to occur on as many as 15% to 20% of all reading saccades (Drieghe, Rayner, & Pollatsek, 2008; Engbert & Nuthmann 2008). These mislocated fixations would slow the reading process by placing the fovea in suboptimal locations. Increased interword spacing may serve to reduce the number of mislocated fixations, yielding more efficient reading. Recent work by Engbert and Krügel (2010) suggests that readers use Bayesian estimation of word centers when targeting saccades. From such a Bayesian framework, increasing interword spacing may aid in the accurate targeting of saccades toward word centers by reducing observational error in the estimation of target distance.

There is, however, at least one potential inhibitory effect that we expect from increasing interword spaces. Adding additional space between words, unless offset by decreases in intraword spacing, will push upcoming words further from the current fixation (i.e., further into the parafovea or periphery, where visual acuity drops sharply and crowding effects increase). This may reduce the ability to gain useful previews of upcoming words (Rayner, 1998, 2009; Schotter et al., 2012). Thus, finding an optimal amount of interword space will be a balancing act similar to finding an optimal amount of intraword space.

In the experiments reported here, we explored how the use of space on a line of text influences eye movements during reading. In Experiment 1, we systematically varied the amount of intraword spacing (by increasing and decreasing the space between letters). In Experiment 2, we pitted intraword and interword spacing against each other in a unique manipulation that allowed us to test the balance of these factors, as well as some controversial assumptions about the nature of lexical processing during reading.

Experiment 1

In Experiment 1, we investigated the role that intraword spacing played with regards to eye movements during reading. We explored the influence of letter spacing by adjusting the tracking between characters within a font. We employed four levels of spacing: reduced by half a pixel, normal, increased by half a pixel, and increased by a full pixel. Figure 1 shows a sentence across these four spacing conditions for each font. This manipulation is far more subtle than the one used by Paterson and Jordan (2010) and similar to the one used by Perea et al. (2010). Note that this manipulation applied to all characters, including the interword space. Thus, the relation between intraword and interword spacing was the same across the four levels of spacing.

Different fonts, even when rendered at the same point size, vary on a multitude of dimensions, including intraword and interword spacing. Therefore, in addition to the above-mentioned spacing manipulation, we also explored the influence of this spacing manipulation across two different fonts (Times New Roman and Cambria). Both of these fonts are proportional width, both have serifs, and both are highly familiar to readers. However, at 10 points, Cambria has more intraword spacing than does Times New Roman. Therefore, it is possible that the spacing manipulation we employed in Experiment 1 would affect these two fonts differently. An added benefit of using Times New Roman is that this is the font used by Perea et al. (2010) and Perea and Gomez (2012), who found facilitation with increased intraword spacing in single-word recognition.

Finally, previous studies that have manipulated frequency and spacing and that have reported inhibition from increased intraword spacing have also reported interactions between spacing and frequency, with increased spacing interfering with low-frequency words more than with high-frequency ones. However, the studies that have reported facilitation have not found interactions between frequency and spacing. Therefore, in order to explore how the bottom-up spacing manipulation was influenced by top-down processing, we embedded either a low- or a high-frequency word in each sentence. To the extent that the intraword spacing manipulation interferes with normal word processing, we would expect an interaction between spacing and word frequency.

Method

Subjects

Thirty-two undergraduate students at the University of Massachusetts at Amherst received course credit or were paid $7.00 for their participation. All subjects were naïve concerning the purpose of the experiment, were native speakers of English, and had either normal or corrected-to-normal vision.

Apparatus

An SR Research Eyelink 1000 eyetracker was used to record subjects’ eye movements, with a sampling rate of 1000 Hz. Subjects read sentences on a 19-in. Viewsonic VX 924 LCD monitor at its native resolution of 1,280 × 1,024 pixels. Viewing was binocular, but only the movements of the right eye were recorded. Viewing distance was approximately 50 cm.

Materials

Ninety-six experimental sentence frames were adapted from Sereno and Rayner (2000) and Slattery, Pollatsek, and Rayner (2007). Each frame contained one of a pair of frequency-manipulated target words, thereby creating 192 unique experimental sentences. The high-frequency members of these target word pairs averaged approximately 138 occurrences per million, and the low-frequency members averaged approximately 17 occurrences per million in the HAL database (Burgess, 1998; Burgess & Livesay, 1998) according to the English Lexicon Project Web site (Balota et al., 2007).^{Footnote 1} The average length of the target words was 5.8 characters (range: 3–11) and was matched between the high- and low-frequency words. An example of a sentence with its high- and low-frequency versions appears below (1, high frequency; 2, low frequency), with the target word appearing in italics^{Footnote 2}:

1.
They shouted at the driver who wildly cut them off.
2.
They shouted at the cabby who wildly cut them off.

The sentences were presented as black letters on a white background in either 10-point Cambria or Times New Roman font with Microsoft ClearType subpixel rendering (for more on ClearType, see Larson, 2007; Slattery & Rayner, 2010). The subpixel rendering allowed us to adjust the letter spacing of characters in small increments. It is perhaps easiest to explain the ClearType subpixel rendering with an analogy to grayscale rendering. Imagine that we rendered a letter I in grayscale and that the width of this letter was 1.5 pixels. To make the letter appear that it was more than 1 pixel but less than 2 pixels wide, we would adjust the level of gray of the second pixel (for which there are 256 levels). The darker gray this second pixel was, the wider the letter would appear. With ClearType subpixel rendering, we can adjust the level of each of the three colored subpixels of an LCD monitor (each with 256 levels of color), giving us more precision in the appearance of the rendered letters. Figure 1 above shows the four levels of character spacing we employed for Experiment 1: reduced by half a pixel, normal, increased by half a pixel, and increased by a full pixel (for reference, 1 pixel subtended 0.032° of visual angle). The distance between levels of this spacing variable was therefore constant, allowing us to examine trend analyses for our data (see the Results section below).

On average, target words subtended 1.42° of visual angle in the normal spacing condition for both the Cambria and Times New Roman fonts. However, due to various differences between these fonts related to proportional character widths and interword spacing, there were slight differences in the visual angle subtended by the entire sentences. The average sentence length in the normal condition was 10.95° of visual angle for the Cambria sentences and 11.14° for Times New Roman. This difference was approximately the size of a single character; however, it was statistically significant, p < .05.

Procedure

At the start of the experiment, subjects were familiarized with the experimental apparatus. Next, a calibration procedure was initiated that required subjects to look at a random sequence of fixation points presented horizontally across the middle of the computer screen. This procedure was repeated during a validation process, and the average error between calibration and validation was calculated. If this error was greater than 0.4° of visual angle, the entire procedure was repeated. At the start of each trial, a black square (0.8° of visual angle) appeared on the left side of the computer screen, which coincided with the left side of the first letter in the sentence. Once a stable fixation was detected within this area, the sentence replaced it on the screen. All sentences were presented vertically centered on the computer monitor. Subjects were instructed to read silently for comprehension and to press a button on a keypad when they finished reading the sentence. Comprehension questions appeared on the screen after a third of all the items. These yes/no questions required the subjects to respond via buttonpress. Latin square counterbalancing ensured that each subject saw an equal number of sentences in each experimental condition; no subject saw any sentence frame more than once, and over all subjects, each sentence was seen equally often in each experimental condition. Sentence order was randomized for each subject.

Results

We analyzed a number of dependent measures and will break up our results into two main sections. The first of these will consist of global measures of sentence reading: mean fixation duration, number of fixations, total sentence reading time, and comprehension question accuracy. For the calculation of the global-reading-dependent measures, we averaged over the independent variable of target word frequency. Each of these global reading measures was submitted to two 2 (font: Cambria vs. Times New Roman) × 4 (spacing: −1/2 pixel, normal, +1/2 pixel, +1 pixel) ANOVAs, one with subjects as a random effect variable and one with items as a random effect variable. We also report F tests for the trend analyses of the spacing variable. These analyses test whether the data over the spacing variable fit linear, quadratic, or cubic trends. This is important given the subtle nature of our manipulation. For instance, there may be no significant difference between consecutive levels of the spacing variable, but there may be a highly significant linear trend (slope significantly different than 0) in the spacing data when performance over levels is examined. Such trends are of paramount importance to the present research. Counterbalance list was added as a dummy variable (Pollatsek & Well, 1995).

The second section will consist of eye movement measures for target word processing: first-fixation duration (the duration of the first fixation on the target word), gaze duration (the sum of all first-pass fixations on the target word), skipping rate, and the length of the critical saccade that landed on (or beyond) the target word (see Table 1). Each of these target-word-dependent measures was submitted to two 2 (font: Cambria vs. Times New Roman) × 4 (spacing: −1/2 pixel, normal, +1/2 pixel, +1 pixel) × 2 (word frequency: high vs. low) ANOVAs, one with subjects as a random effect variable and one with items as a random effect variable. As with the global measures, we again examine the trend analyses for spacing. Counterbalance list was added as a dummy variable.

Table 1 Target word processing measures in Experiment 1

Full size table

Prior to analysis, fixation durations less than 80 ms were removed from the record (fewer than 1% of fixations). Trials with blinks on or near the target word or fixations longer than 1,000 ms on the target word were excluded from analysis, as were trials with more than two blinks during sentence reading. These trials accounted for 2.6% of the total trials and were evenly distributed across experimental conditions. Additionally, trials with fewer than 4 or more than 20 fixations were also excluded from analysis (0.8% of trials).

Global measures Accuracy for the comprehension questions was very high (mean of 92%) and was unaffected by experimental condition, ps > .20. Therefore, any effects seen in the fixation time measures cannot be explained by a speed–accuracy trade-off.

Arguably the most diagnostic measure of font readability in the present study is total sentence reading time (see Fig. 2), since it encompasses all the potential costs of the various manipulations. This measure indicated that sentences presented in Cambria (1,884 ms) were read faster than those presented in Times New Roman (1,938 ms), F ₁(1, 16) = 9.91, MSE = 27,153, p < .01; F ₂(1, 80) = 17.55, MSE = 47,897, p < .001. The effect of spacing was also significant (−1/2, 1,923 ms; 0, 1,862 ms; +1/2, 1,911 ms; +1, 1,909 ms), F ₁(3, 48) = 2.857, MSE = 16,911, p < .05; F ₂(3, 240) = 2.80, MSE = 57,598, p < .05, but more importantly, there was a significant quadratic trend of spacing, F ₁(1, 16) = 6.71, MSE = 12,388, p < .05; F ₂(1, 80) = 5.10, MSE = 47,656, p < .05. This trend indicated that the normal, unadjusted spacing was optimal for the fonts and spacing levels chosen in the study. The font × spacing interaction was not significant, Fs < 1.

The average fixation durations while the sentences were read were significantly influenced by both spacing and font (see Fig. 3). Mean fixation duration was shorter for sentences presented in Cambria (243 ms) than for those in Times New Roman (247 ms), F ₁(1, 16) = 11.20, MSE = 68, p < .005; F ₂(1, 80) = 10.43, MSE = 230, p < .005. Mean fixation duration was also influenced by spacing, F ₁(3, 48) = 22.59, MSE = 109, p < .001; F ₂(3, 240) = 26.07, MSE = 281, p < .001. Trend analyses indicated that the spacing effect was highly linear (−1/2, 253 ms; 0, 247 ms; +1/2, 241 ms; +1, 240 ms), F ₁(1, 16) = 41.98, MSE = 167, p < .001; F ₂(1, 80) = 76.48, MSE = 268, p < .001, since mean fixation duration decreased with increased spacing. There was no interaction between font and spacing, Fs < 1.

On average, readers fixated sentences presented in Cambria 7.68 times and fixated those presented in Times New Roman 7.89 times, F ₁(1, 16) = 10.08, MSE = 0.22, p < .01; F ₂(1, 80) = 7.44, MSE = 0.78, p < .01 (see Fig. 4). Spacing also significantly influenced the number of fixations that sentences received, F ₁(3, 48) = 9.17, MSE = 0.23, p < .001; F ₂(3, 240) = 6.95, MSE = 0.79, p < .001. For the number of fixations (−1/2, 7.64; 0, 7.58; +1/2, 7.93; +1, 7.99), there was a significant linear trend of spacing, F ₁(1, 16) = 19.91, MSE = 0.22, p < .001; F ₂(1, 80) = 14.82, MSE = 0.77, p < .001, as well as a cubic trend, F ₁(1, 16) = 8.20, MSE = 0.15, p < .05; F ₂(1, 80) = 3.65, MSE = 1.76, p = .060. Again, the interaction between font and spacing did not approach significance, Fs < 1.

Target word analyses

In order to examine how the experimental variables of font and spacing influenced word processing, we analyzed fixation measures on the high- and low-frequency target words that were embedded in the sentence frames. On average, these target words were fixated during first-pass reading 84.2% of the time. On the remaining 15.8% of the time, the eyes fixated beyond the target word without having directly fixated on the target itself. These cases are classified as skips of the target word whether or not the target word is later fixated as the result of regressive eye movements. Word frequency significantly influenced this skipping behavior, F ₁(1, 16) = 5.21, MSE = 1.9, p < .05; F ₂(1, 80) = 4.16, MSE = 6.3, p < .05, since high-frequency target words were skipped 17% of the time and low-frequency targets were skipped 14% of the time. There was also an effect of font that was fully significant only in the subjects analysis, F ₁(1, 16) = 9.13, MSE = 1.0, p < .01; F ₂(1, 80) = 3.65, MSE = 6.4, p = .06, since target word skipping rate was higher with Cambria (17%) than with Times New Roman (14%). However, there was no effect of spacing, Fs < 1, nor was there a significant linear, quadratic, or cubic trend of spacing on skipping rates, Fs < 1. There were also no significant interactions between any of these variables, Fs < 1.

To further examine the effect of spacing on eye movements, we calculated the mean landing position for the initial fixations on these targets as a percentage of target word length. This measure indicated that, on average, subjects fixated these target words slightly to the left of word center (0.45), replicating prior research (McConkie, Kerr, Reddix, & Zola, 1988; Rayner, 1979). However, there were no significant effects of any of the experimental variables on this measure (all ps > .10). The fact that the spacing manipulation did not influence word skipping behavior or initial fixation landing site illustrates that the saccadic system is capable of rapidly adjusting to serve the goals of reading. Unsurprisingly, the length (in visual angle) of the first saccade into or beyond the target word was highly influenced by spacing (−1/2, 2.05°; 0, 2.16°; +1/2, 2.25°; +1, 2.41°), F ₁(3, 48) = 29.67, MSE = 0.10, p < .001; F ₂(3, 240) = 26.81, MSE = 0.34, p < .001. Trend analyses show that this effect was highly linear in nature, F ₁(1, 16) = 53.34, MSE = 0.17, p < .001; F ₂(1, 80) = 56.69, MSE = 0.50, p < .001. The distribution of these critical saccade lengths is displayed in Fig. 5. There was also an effect of font on the length of these critical saccades, F ₁(1, 16) = 4.52, MSE = 0.15, p < .05; F ₂(1, 80) = 6.37, MSE = 0.35, p < .05, with these critical saccades being .07° larger, on average, with Cambria than with Times New Roman. Recall that there was a slight difference in the horizontal extent of the two fonts used in this study, with Cambria being slightly narrower than Times New Roman. Therefore, this effect is in the direction opposite to that predicted by the difference in the size of the fonts, suggesting that Cambria was easier to process than Times New Roman.

The duration of the initial fixation on the target words was influenced by spacing, F ₁(3, 42) = 3.57, MSE = 1,325, p < .05; F ₂(3, 123) = 2.95, MSE = 4,018, p < .05, since these initial fixations tended to decrease in duration with increased spacing^{Footnote 3} (−1/2, 263 ms; 0, 259 ms; +1/2, 249 ms; +1, 251 ms). These initial fixations were also influenced by target word frequency, F ₁(1, 14) = 8.65, MSE = 1,191, p < .05; F ₂(1, 41) = 7.25, MSE = 4,995, p < .05, with longer durations occurring on low-frequency (261 ms) than on high-frequency (251 ms) words. There was a font × word frequency interaction, but only in the items analysis, F ₁ < 1; F ₂(1, 41) = 3.92, MSE = 6.3, p < .05. This interaction appears to be due to a smaller frequency effect with the Cambria font. However, we don’t place much weight in this interaction, due to the nonsignificant subjects analysis (see also footnote 3). No other interactions approached significance, ps > .12.

Unlike first-fixation durations, gaze durations were not influenced by spacing, ps > .25. However, there was still a highly robust effect of word frequency, F ₁(1, 14) = 22.56, MSE = 1,688, p < .001; F ₂(1, 41) = 20.64, MSE = 6,933, p < .001, since gaze durations were longer on low-frequency (297 ms) than on high-frequency (278 ms) target words. Gaze durations did not significantly differ between the two fonts, F ₁(1, 14) = 1.40, MSE = 4,758, p > .25; F ₂(1, 41) = 2.99,MSE = 7,069, p > .09, nor were there any significant interactions between any of the three variables, ps > .16.

Discussion

There were a number of important findings from Experiment 1 with regard to the optimal use of space in a line of text. First, these results reconfirm that subtle low-level font characteristics do influence eye movement behavior during reading (Rayner et al., 2006; Rayner, Slattery, & Bélanger, 2010; Slattery & Rayner, 2010). We found that wider spacing results in shorter average fixation durations, consistent with the linear facilitative effects reported by Perea et al. (2010) and Perea and Gomez (2012) using the lexical decision task. While not statistically significant with regard tothe other spacing conditions, gaze durations on target words presented in Times New Roman were shortest in the +1/2 pixel condition, which also agrees with Perea et al. and Perea and Gomez. Also similar to Perea et al., we failed to find any interaction between word frequency and intraword spacing. Additionally, this effect of spacing did not interact with word frequency in any of our dependent measures, indicating that more subtle adjustments to intraword spacing do not disrupt the integrity of word units the way that larger adjustments do. However, this facilitative effect on fixation durations was offset by the trends in the number of fixations. Total reading time, which is a direct combination of average fixation duration and number of fixations, was shortest in the unmodified spacing condition, replicating RSVP reading results (Chung, 2002), suggesting that font designers are doing a relatively good job at selecting these default intraword spacing values. The increase in total sentence reading time associated with changes from default intraword spacing was asymmetrical, with the largest increase coming from the reduced intraword spacing condition, which caused an increase in both average fixation duration and number of fixations.

The present results also highlight the flexibility of the oculomotor system in rapidly adjusting to the spacing manipulation employed in Experiment 1 for the purpose of reading. Target word skipping, which is highly influenced by the number of letters in a word (Brysbaert, Drieghe, & Vitu, 2005; Rayner & McConkie, 1976), was uninfluenced by the spacing variable. This argues that word-skipping behavior is influenced more by word processing than by the horizontal extent of the skipped word. Spacing influenced initial fixation duration on target words, with shorter fixations for larger spacing, but did not influence gaze durations, since the refixation probability associated with spacing mitigated the effect that had been present in initial fixation durations. This further highlights the higher level cognitive impact upon oculomotor behavior during reading. That is, despite the undeniable and rapid low-level influences of font spacing on fixation durations, higher level cognitive influences help to ensure that the eyes remain on words long enough to accomplish the goal of successful reading.

Other effects of interest were that Cambria consistently outperformed Times New Roman in metrics of readability. It resulted in shorter fixation durations, fewer fixations, and shorter total reading times than Times New Roman, with no decrement in comprehension. Since Cambria is a newer font created for use on computer monitors, this finding should be welcomed by font designers and taken as an indicator of their relative success.

Experiment 2

In Experiment 1, the relative space between letters and words remained constant over the spacing conditions. One drawback of that manipulation is that words will be closer to each other in the smaller spacing conditions than in the larger spacing conditions. Thus, it is possible that parafoveal processing of the upcoming word was influenced by its proximity to the currently fixated word. In Experiment 2, we employed a modified spacing manipulation in which the space between word beginnings was held constant over the intraword spacing conditions (see also Rayner et al., 2010). This manipulation removed space between letters within a word (reduced intraword spacing) and placed that space after the word (increased interword spacing). Therefore, each word of a sentence began at the same location regardless of spacing condition (see Fig. 6). This manipulation has the added benefit of allowing us to directly test aspects of visual crowding on reading. Visual crowding occurs when objects are closer together than the critical spacing, which depends on eccentricity of the objects from fixation (Levi, 2008; Pelli & Tillman, 2008). The further the eccentricity of the objects, the greater the critical spacing will be. However, in Experiment 1, intraword spacing (the space between letters within a word) of parafoveal letters was confounded with the eccentricity of these letters (see Fig. 1). This confound with eccentricity should have acted to reduce the letter crowding effect within words in the reduced intraword spacing condition. In Experiment 2, we controlled for eccentricity over the letter-spacing conditions.

This novel manipulation has a few important implications for reading and font development. First, if letter perception, which is known to be influenced by visual crowding, is driving the eyes during reading, we should see a marked increase in fixation durations and reading times for the reduced-intraword/increased-interword spacing condition (from here on referred to as the adjusted spacing condition) in Experiment 2, as compared with the normal spacing condition. However, for the purpose of reading, we suspect that words are more important objects than letters. This may seem like an impossible stance, since words are built from a combination of letters. We are not advocating that letters are unimportant. As Pelli, Farell, and Moore (2003) convincingly demonstrated, word recognition cannot occur under conditions in which the word’s letters are not separately identifiable. However, as long as the letters are identifiable, we would argue that it is the properties of words and their recognition that influence eye movements during reading. For instance, the words slide and idles both contain the same letters but arranged in different orders, thereby making two different words. These two words differ in their frequency of usage (slide is roughly 120 times more frequent than idles), their phonological structure (slide has one syllable while idles has two), and morphological structure, as well as in the manner in which they can be used in the English language. We would argue, therefore, that while successful letter perception is a necessary step in reading, the bottleneck in reading performance is with word recognition. If, as we suspect, words are the important processing unit for reading, we might expect that in Experiment 2, the adjusted spacing condition should result in improved reading performance, as compared with the normal spacing condition. The reason for this counterintuitive prediction is that the adjusted spacing condition not only will have reduced intraword spacing, but also will have increased interword spacing. This increased interword spacing should help with word segmentation processes, result in less lateral inhibition of word-initial and -final letters, and improve oculomotor targeting.

Second, a major current controversy in reading is centered on whether lexical processing of words occurs in serial or is parallel in nature, with multiple words being accessed at the same time (Reichle, Liversedge, Pollatsek, & Rayner, 2009). It has now been shown repeatedly that reducing intraword spacing reduces a word’s readability and that this effect of crowding is a function of a words eccentricity from fixation. Thus, we can be confident that crowding will hamper the lexical processing of a word in the parafovea. If normal reading involves parallel lexical processing of the fixated word and words in the parafovea, reading should be greatly disrupted under the adjusted spacing conditions of Experiment 2, in which the parafoveal words are presented with reduced intraword spacing while controlling for word eccentricity. However, if normal reading involves the serial lexical identification of words with a limited role of lexical processing in the parafovea, we would expect little to no difficulty with this reduced intraword spacing condition. Note that the serial lexical processing prediction does not suggest that parafoveal processing is unimportant, only that there is a limited role for lexical processing of parafoveal words.