Do we remember learning materials better when they are presented in a format that is more difficult to read or when they are presented in a format that is more easy to read? Memory for study materials is clearly affected by difficulty of the content, learning strategy, and cognitive ability, but is it also affected by extraneous factors such as font size or type? Currently, there is no clear answer to these questions, because the evidence is mixed. One line of research that focused on the font size of to-be-remembered words consistently suggested that font size does not affect recall (e.g., Rhodes & Castel, 2008), but a recent meta-analysis that was based on these studies suggested that there is nevertheless a subtle memory advantage for the larger font words (Luna, Martin-Luengo, & Albuquerque, in press). In contrast, a second line of research that focused on other perceptual features of learning materials such as font type or clarity suggested that, in some cases, presenting materials in a perceptually degraded format can enhance rather than impair learning (e.g., Diemand-Yauman, Oppenheimer, & Vaughan, 2011). Can small font size similarly enhance memory? The current research investigated whether, under hitherto unexamined conditions, presenting words in a small font enhances memory.
Font size and memory for words
In a series of experiments, Rhodes and Castel (2008) presented words on a computer screen in 18-point or 48-point font. Participants were asked to memorize the words and to provide a judgment of learning (JOLs) for each studied word by estimating how confident they were that they would later be able to recall that word. They expected font size to have relatively little impact on free recall because, they argued, memory is predominately influenced by processing of the meaning of the stimuli (e.g., Craik & Lockhart, 1972). Their main interest was in whether learners' JOLs would reflect that. Rhodes and Castel found that larger words were predicted to be remembered better than smaller ones, but that this was a metacognitive illusion because, as expected, recall was not affected by font size.
Over the last decade, these findings have been replicated in several similar studies (Hu, Li, Zheng, Si, Liu, & Luo, 2015; Kornell, Rhodes, Castel, & Tauber, 2011; Luna, et al., in press McDonough & Gallo, 2012; Miele, Finn, & Molden, 2011; Mueller, Dunlosky, Tauber, & Rhodes, 2014; Susser, Mulligen, & Besken, 2013). These studies used the same basic paradigm as the original study, with only minor variations in materials and test format (e.g., word pairs and a cued-recall test in Rhodes & Castel, 2008, Experiment 3), study time (between 2 and 5 s), procedure (delayed JOLs, Luna et al., in press) and font sizes (125-point vs. 25-point, McDonough & Gallo, 2012; 70-point vs. 9-point Chinese characters, Hu et al., 2015; various font sizes depending on the participants' personal screen settings, Kornell et al., 2011). While no individual study showed a significant difference, a recent meta-analysis by Luna et al. (in press) revealed a small memory advantage for items presented in large font over small font. This advantage was much smaller than the one predicted by learners, reflecting a mismatch between the effect of font size on memory and metamemory.
Perceptual degradation and learning
There is, however, evidence that presentation of learning materials in a perceptually degraded format can sometimes improve learning. This evidence comes mainly from studies that used perceptual manipulation other than of font size.
One line of evidence comes from studies of perceptual interference in which a brief (i.e., 100-ms) presentation of a word is followed by a presentation of a pattern, which causes backward masking. Perceptual interference was found to enhance subsequent recognition and recall relative to longer presentations (i.e., 2.5 s) without interference (Besken & Mulligan, 2013; Hirshman & Mulligan, 1991; Hirshman, Trembath, & Mulligan, 1994; Mulligan, 1996).
More recent evidence comes from studies that used complex, educationally relevant learning materials. In a noteworthy study by Diemand-Yauman et al. (2011), university and high-school students studied textual materials that were presented in an easy-to-read font (standard black font) or in a difficult-to-read font (small, non-standard, gray font). Performance on subsequent tests suggested that studying the materials in a difficult-to-read font produced better learning outcomes than studying in an easy-to-read font. This finding was replicated in a number of subsequent studies (Eitel, Kühl, Scheiter, & Gerjets, 2014, Experiment 1; French et al. 2013; Lehmann, Goussios, & Seufert, 2016; Seufert, Wagner, & Westphal, 2017; Weissgerber & Reinhard, 2017; Weltman & Eakin, 2014).
These findings are consistent with the idea that difficulties can be desirable for learning (Bjork, 1994; Bjork & Bjork, 2011). Often, encoding manipulations that make learning slower and more difficult actually enhance long-term retention and transfer of learning. These manipulations include, for example, spacing, variation, and interleaving, and have been termed “desirable difficulties” (Bjork, 1994). The recent evidence suggests that presenting information in a perceptually degraded format can also be a desirable difficulty.
However, other studies failed to replicate the beneficial effect of difficult fonts, finding either no effect (e.g., Eitel et al., 2014, Experiments 2–4; Eitel, & Kühl, 2016; Pieger, Mengelkamp, & Bannert, 2016; Rummer, Schweppe, & Schwede, 2016; Strukelj, Scheiter, Nyström, & Holmqvist, 2016; Yue, Castel, & Bjork, 2013), or the opposite effect, impaired learning when studying in difficult fonts (e.g., Lonsdale, Dyson, & Reynolds, 2006; Miele & Molden, 2010; Yue et al., 2013). Nevertheless, the findings of Diemand-Yauman et al. (2011) and others suggested that presenting materials in a perceptually degraded format can, under some conditions, serve as a desirable difficulty.
More recent studies have therefore investigated the conditions under which perceptually degraded materials enhance learning (Dunlosky & Mueller, 2016; Oppenheim & Alter, 2014; Kühl, Eitel, Scheiter, & Gerjets, 2014; Weissgerber & Reinhard, 2017). For example, Lehmann et al. (2016) observed that perceptually degraded fonts improved learning only for learners with a high working memory capacity; Weissgerber and Reinhard (2017) observed that perceptually degraded fonts enhanced long-term, but not short-term memory; Halin, Marsh, Hellman, Hellstrom, and Sorqvist (2014) observed that perceptually degraded fonts improved performance only when there was distracting background noise; and Katzir, Hershko, and Halamish (2013) observed that smaller-than-standard fonts enhanced fifth graders’ reading comprehension, but impaired that of second graders.
Of most relevance to the current research are two other studies that found, using a procedure broadly similar to that of Rhodes and Castel (2008), that perceptually degraded presentation can be a desirable difficulty for the learning of single words. Sungkhasettee, Friedman, and Castel (2011) presented single words upright or upside down. Participants predicted equivalent memory for upright and inverted words, but free recall was higher for the inverted words, suggesting that inverted presentation is a desirable difficulty. Rosner, Davis, and Milliken (2015) examined the effect on memory of perceptual blurring of words, and observed better recognition memory for blurred words than for clear words, although an earlier study (Yue et al., 2013) failed to obtain a benefit from blurring. This study is discussed in more detail below.
Why does presenting learning materials in perceptually degraded format enhance memory? A common suggestion is that perceptually degraded formats function as a metacognitive cue to allocate more cognitive resources or to enhance cognitive engagement in the learning task. As a result, processing of degraded material is more effortful or deeper and this enhances memory (Alter, Oppenheimer, Epley, & Eyre, 2007; Diemand-Yauman et al., 2011; Hirshman & Mulligan, 1991; Mulligan, 1996). This explanation is rather general and suggests that different perceptual manipulations should have similar effects on learning outcomes. In contrast, Weissgerber and Reinhard (2017) proposed, based on the concept of transfer-appropriate-processing (McDaniel & Butler, 2011), that different perceptual manipulations might invoke different processes during encoding and that their effect on subsequent memory would depend on the match between the processes invoked and the demands of the memory test. Clearly, more research is needed to fully understand the mechanisms underlying the potential effects of perceptually degraded presentation formats on memory.
Font size and memory for words: potential moderators
The literature reviewed above provides ample evidence that presenting materials in perceptually degraded formats can enhance memory and learning outcomes and act as a desirable difficulty, but also provides clear evidence that this effect is far from robust. Considering this evidence, the lack of evidence for a beneficial effect of small font size is surprising. Why does presenting words in a small font not enhance memory in the same way as inverted (Sungkhasettee et al., 2011) or blurred (Rosner et al., 2015) presentations?
The current research examined the possibility that small font size can enhance memory, but that the conditions under which it does so had simply not yet been examined. It focused on four potential moderators of the effect of font size on memory: strength of the font size manipulation, whether JOLs are solicited or not, the test format, and study time.
The strength of the font size manipulation
The small fonts used in earlier studies may have been ineffective simply because they were not sufficiently small to trigger the processes that led to enhancement of memory. Rhodes and Castel (2008) and most follow-up studies used 48-point and 18-point fonts as the large and small font sizes, respectively. The current research was driven by the conjecture that whereas 48-point font can indeed be described as relatively large and easy to process, 18-point font is best described as a standard or medium size, rather than small, and is not difficult to process. Critically, 18-point font might not be small enough to induce the cognitive engagement and effortful processing that render other forms of perceptual degradation (e.g., word inversion or unusual font type) desirable difficulties. Consistent with this argument, Rosner et al. (2015) observed that the level of blurring moderated its effect on recognition memory. A benefit of blurred (over clear) fonts was obtained only for relatively higher levels of blurring. Similarly, using textual materials, Seufert et al. (2017, Experiment 2) demonstrated that increasing the level of font difficulty, up to the point where the text became illegible, enhanced recall and transfer performance.
Solicitation of JOLs
In the study by Rhodes and Castel (2008) and follow-up studies, participants were asked to predict the chance that they would remember the words they were studying (i.e., provide a JOL) during the learning phase, on an item-by-item basis. However, recent studies suggest that soliciting memory judgments during learning alters the encoding processes that would otherwise occur (e.g., Mitchum, Kelley, & Fox, 2016; Nguyen & McDaniel, 2016; Schmidt & Schmidt, 2017; Schnaubert & Bodemer, 2017; Soderstrom, Clark, Halamish, & Bjork, 2015; Witherby & Tauber, 2017; Zechmeister & Shaughnessy, 1980; for similar effects of judgments made during the test see Double & Birney, 2017; Naveh-Benjamin & Kilb, 2012). These studies usually suggested that JOLs might improve memory (e.g., Soderstrom et al., 2015; Witherby & Tauber, 2017), although several other studies reported no effect (Benjamin, Bjork, & Schwartz, 1998; Tauber & Rhodes, 2012) or a detrimental effect (Mitchum et al., 2016) of JOLs. Moreover, some studies suggested that JOLs eliminate differences between conditions that are observed when JOLs are not solicited (Begg Vinski, Frankovich, & Holgate, 1991; Besken & Mulligan, 2013; Matvey, Dunlosky, & Guttentag, 2001; Rosner et al., 2015; Soderstrom et al., 2015). For example, Besken and Mulligan (2013) demonstrated that the mnemonic benefit of perceptual interference was eliminated when item-by-item JOLs were solicited, and Rosner et al. (2015) similarly demonstrated that the benefit of high levels of blurring for recognition memory was eliminated when JOLs were solicited. It is possible that the processes involved in making item-by-item JOLs encourage additional processing, which overlaps with the processes that are responsible for mnemonic benefits of perceptual interference or blurring. As Rosner et al. (2015, p. 20) concluded, “desirable difficulty effects in remembering may be difficult to observe when item-by-item JOLs are made at the time of encoding.” This evidence is consistent with the possibility that the solicitation of JOLs in previous studies hindered a potential effect of font size on memory.
Test format
Rhodes and Castel (2008), and most follow-up studies, examined the effect of font size on cued or free recall performance. It is possible that recall tests are not sensitive to the beneficial effects of small font size. Recognition tests might be more sensitive to such effects, if they exist. Nairne (1988) suggested that perceptual manipulations affect the processing of surface-level aspects of the word that aids subsequent recognition, but not recall, which relies more on item elaboration. Indeed, perceptual degradation and interference manipulations were demonstrated to have benefits for recognition more than for recall (Hirshman & Mulligan, 1991; Mulligan, 1996; Nairne, 1988; Rosner et al., 2015). McDonough and Gallo (2012) did examine the effect of font size on recognition and found no effect, consistent with the results for recall. However, they used relatively large font sizes and solicited item-by-item JOLs, which might have hindered the effect of font size on recognition memory.
Study time
Earlier studies that examined the effect of perceptual degradation of words (small font or blurring) on memory varied in study time, which ranged from 0.5 to 5 s per word (between studies). The study in which a benefit of perceptual degradation was observed (Rosner et al., 2015) presented words for 1 s each. Perceptual interference effects were also observed with very brief presentation times (e.g., Mulligan, 1996). These findings suggest that the benefits of perceptual degradation may only be apparent when the study time is relatively short. It might be that relatively elaborative processing takes place spontaneously with longer but not with shorter study time, unless information is presented in a perceptually degraded format.
The current research
The current research was designed to investigate the effect of font size on memory for words and whether it depends on the strength of the font size manipulation, whether JOLs are solicited, the format of the test, and study time.
First, a series of 11 experiments were conducted to meet this goal. Table 1 presents an overview of these experiments. In all experiments, participants studied single words that were presented one-by-one on a computer, and were later tested on these words. The effect of the strength of the font size manipulation was examined by using three font sizes within-participants in all eleven experiments, the 48-point and 18-point sizes used in the earlier studies (e.g., Rhodes & Castel, 2008), and 5-point, a much smaller size. Pretesting suggested that the 5-point font was the smallest generally legible font size. Hereafter, the 5-point font is referred to as a very small font size. Solicitation of JOLs, test format (free recall vs. recognition), and study time (5 s vs. 0.5 s per word) were systematically manipulated across Experiments 1–8. To preview, in only two of these experiments was an effect of font size observed. Experiments 9 and 10 were attempts to replicate these effects. Experiment 11 directly examined the moderating role of solicitation of JOLs by manipulating this factor within the experiment, in addition to font size.
Table 1 Overview of the methods, descriptive statistics, and summary of main results of Experiments 1-11 Next, a set of small-scale meta-analyses that included data from all 11 experiments are reported. The purpose of the meta-analyses was two-fold: first, to examine whether there are any font size effects that emerge when experiments are combined even though they might not be observed, or might not be consistently observed, in single experiments (following Luna et al., in press); and secondly, to assess the potential moderating role of solicitation of JOLs, study time, and test format, that were usually manipulated between experiments (with the exception of the direct manipulation of JOLs solicitation in Experiment 11).