In the main experiment, participants performed a speeded brightness-classification task in which they were required to decide whether a visual stimulus was brighter or darker than the midgray background against which it appeared (cf. Marks, 1987, for confirmation that manipulating brightness in this way isolates surface brightness from illuminant brightness). They indicated their decision by pressing one of two keys that differed in size. It was assumed that a cross-sensory correspondence between size and brightness exists and that, like other correspondences, it can have an impact on performance as an incidental feature of a task situation. Accordingly, it was expected that participants would respond more quickly and with fewer errors when the size of the response key needing to be pressed was congruent with the brightness classification of the visual stimulus (i.e., bright–small, dark–big), rather than when it was incongruent with it (i.e., bright–big, dark–small).
Method
Participants
Seventy-six Lancaster University students (62 females and 14 males) between the ages of 18 and 34 (mean age = 19.75 years), none of whom were tested for synaesthesia, volunteered to participate for course credit or payment. All but four of the participants were right-handed by self-report.
Materials
The visual stimuli were six solid circles that were 4.5 cm in diameter, varying in brightness from white through to black. The circles were presented individually at the center of a 20-in. computer screen (Apple A1038, 1,680 × 1,050 cinema back-lit LCD display; response time [RT] = 14 ms, controlled by a Dual 2 GHz, PowerMac G5) and were brighter (340, 230, & 150 cd/m2) or darker (42, 17, and 2 cd/m2) than the midgray (90 cd/m2) background against which they appeared. The response keys comprised the smallest (2.5-cm diameter) and largest (7.5-cm diameter) of the wooden balls whose attributes were assessed in the preliminary observations. These were mounted on microswitches, whose physical resistances were adjusted until the two authors judged that equal force was needed to close each of them. This required a higher level of resistance to be set for the big key (i.e., 1,000 gm) than for the small key (250 gm). The small key was also raised 3.75 cm from the table to ensure that the two balls were perceived (haptically) to be equally high spatially (see the Preliminary Observations section). The response keys were covered with a thick black cloth throughout the experiment, ensuring that participants were never able to see them.
Design and procedure
Participants were instructed to grasp the top half of each key with the thumb and first two fingers of each hand, and to continue grasping them until told to take a break between blocks of trials. Immediately a circle appeared, participants classified it as bright or dark (relative to the midgray background) by pressing one of the two keys. Half of the participants (30 females and eight males) were asked to press the left-hand key when the circle was brighter than the background and the right-hand key when the circle was darker than the background. The other participants (32 females and six males) were assigned to the opposite brightness-hand mapping (i.e., right for bright, left for dark). The difference in the sizes of the two keys was incidental to the speeded-classification task and was never mentioned by the experimenter.
Participants completed four blocks of trials, each separated by a 1-min break. In a block of 48 trials, each of the six circles appeared eight times, in a randomly determined order that was generated afresh for each participant. Each circle remained visible until participants made their brightness decision and was followed by a blank interval of 3 s before the next circle was presented. Participants did not receive feedback about the speed or correctness of any of their responses.
At the end of each trial block, the experimenter surreptitiously switched the left–right positions of the two response keys so that participants performed the proceeding block using the opposite mapping of brightness on to key size. Thus, across the four experimental blocks, participants alternately pressed the small key for bright and the big key for dark (congruent condition), or the small key for dark and the big key for bright (incongruent condition). We counterbalanced across participants which of these two mappings (small vs. big for bright) was assigned to the first block of trials.
Results
Probability correct
Although the overall level of correct responding was consistently high, it was significantly higher for congruent trials, p(correct) = .99, than for incongruent trials, p(correct) = .98, Wilcoxon Signed Ranks p < .001.
Reaction time
RTs from incorrect trials were excluded from the analysis, and RTs greater than 2.5 SDs above a participant’s overall mean correct RT were replaced with the cut-off value. The resulting overall mean correct RT was 733 ms. Mean correct RTs for the various conditions are given in Table 2.
Table 2 Experiment: Mean correct RTs (SEMs in parentheses) and p (correct) according to the between-category and within-category brightness levels, and key size, with associated differences in RT attributable to key size
Two aspects of the variation in surface brightness were treated as separate factors in an ANOVA. The first, between-category brightness, refers to the brightness feature on which response selection was based and relates to whether a circle was brighter or darker than the midgray background. The second, within-category brightness, refers to the variation in surface brightness within each level of categorical brightness. Thus, for each level of categorical brightness, within-category brightness was low, medium, or high. Where a circle was darker than the background, these levels of within-category brightness were 2 (black), 17, and 42 cd/m2, respectively. Where a circle was brighter than the background, the low, medium, and high values for within-category brightness were 150, 230, and 340 (white) cd/m2, respectively.
Mean correct RTs (i.e., means of eight individual RTs) were submitted to a 2 × 2 × 3 × 2 mixed ANOVA, with key size (small vs. big), between-category brightness (darker vs. brighter), and within-category brightness (low, medium, or high) as within-participant factors. Hand-brightness assignment (left- vs. right-hand key for bright) was a between-participants factor.Footnote 9
Although there was no main effect of key size, F(1, 74) = 2.67, p = .11, η
p
2 = .04, (mean RTs for the small and big key being 724 ms and 743 ms, respectively), there was a significant main effect of between-category brightness, F(1, 74) = 17.32, p < .001, η
p
2 = .19, with participants classifying darker circles more quickly (mean RT = 714 ms) than brighter circles (mean RT = 752 ms). There was a significant between-category brightness × key size interaction, F(1, 74) = 21.56, p < .001, η
p
2 = .23, the nature of which confirmed the predicted congruity effect. The mean RTs for congruent and incongruent trials were 710 and 756 ms, respectively.
There was a main effect of within-category brightness, F(2, 148) = 27.72, p < .001, η
p
2 = .27, largely attributable to a significant quadratic component, F(1, 74) = 47.64, p < .001, η
p
2 = .39. Within-category brightness interacted significantly with between-category brightness, F(2, 148) = 140.36, p < .001, η
p
2 = .66, and the nature of this interaction confirmed that participants responded more quickly the higher the brightness contrast (in either direction) between the circle and the midgray background. Figure 1 shows that, despite the very pronounced effects of within-category brightness, and the marked changes in the effect of between-category brightness across different levels of within-category brightness (see Table 2), the difference in response speed between congruent and incongruent trials remained within 40–50 ms across successive levels of local brightness (i.e., 45.5, 49.5, and 44.5 ms, across low, medium, and high levels of within-category brightness, respectively). The size–brightness congruity effect appears here to be very robust.
There was not a significant interaction between within-category brightness and key size, F < 1, indicating that variations in this aspect of brightness (i.e., variations within a level of categorical brightness) did not combine with key size to generate a congruity effect. The lack of a significant three-way interaction between within-category brightness, between-category brightness, and key size, F < 1, indicates that the level of brightness contrast did not interact with key size.
There was not a significant main effect of hand-brightness assignment, F < 1, and no significant interaction between this and any other factor, or combination of factors (all ps > .05). No other interactions were significant.
Discussion
When a small and a large ball were used as response keys in a speeded brightness-classification task, a congruity effect was observed confirming the predicted correspondence between size and brightness. Participants responded more quickly and with fewer errors when the relative size of the key needing to be pressed was congruent with the relative brightness of the visual stimulus being classified. That is, they responded relatively more quickly and accurately when pressing the small (big) key to confirm that a circle was bright (dark), than when pressing the small (big) key to confirm that a circle was dark (bright). This happened even though key size was task irrelevant. Before accepting the results as evidence for the predicted size–brightness correspondence, however, two factors supporting alternative explanations need to be assessed.
Brightness contrast
The potential for brightness contrast to interact with key size to create a congruity effect must be considered because brightness contrast is known to enter into a generic correspondence with other features according to their magnitude (cf. Walsh, 2003). For example, people find it easier to encode numerically bigger (smaller) digits when they are presented at higher (lower) levels of visual contrast relative to the background on which they appear (R. Cohen Kadosh & Henik, 2006; Cohen Kadosh, Cohen Kadosh & Henik, 2008).Footnote 10 However, although the level of brightness contrast in the present experiment had a very significant effect on performance, with participants generally responding more quickly with higher levels of contrast, it did not interact with key size to produce a congruity effect. Thus, participants did not respond quicker when more contrast was paired with the big key (more size) than when it was paired with the small key (less size). Neither did they respond quicker when less contrast was paired with the small key, than when it was paired with the big key. Interestingly, in a situation in which there is no background to take into account, digit-color synaethetes associate brighter colors with numerically smaller digits (Cohen Kadosh, Henik, & Walsh, 2007), reinforcing the alignment on which the present congruity manipulation was based.
Valence as a basis for the size–brightness correspondence
Spence (2011) acknowledged that his framework might need to be extended to embrace a generic correspondence based on the valence of stimuli (i.e., evaluation or affective potential). Karwoski et al. (1942) also considered valence to be a possible basis for cross-sensory correspondences, and Proctor and Cho (2006) discussed the pervasiveness of valence-based congruity effects in speeded binary classification tasks. With size and brightness both being regarded as valenced dimensions, so that big (Meier, Robinson, & Caven, 2008; Silvera, Josephs, & Giesler, 2002) and bright (Meier, Robinson & Clore, 2004; Meier, Robinson, Crawford, & Ahlvers, 2007) are both deemed, by some, to be positively valenced, a valence-based explanation of the size–brightness congruity effect needs to be assessed.
According to one valence account, participants should find it easier to respond when the brightness of a circle and the size of the key being pressed have the same valence. However, if big and bright are both positive, this valence account predicts the reverse of the congruity effect observed in the present study. Thus, participants should have responded more quickly and accurately when the big (+) key was assigned to bright (+) and the small (−) key to dark (−). This was not the case.
According to a second valence account, it is the valence associated with hand dominance that plays the critical role in providing an alternative explanation for the congruity effect observed presently. According to Casasanto (2009), the dominant and nondominant hands are positively and negatively valenced, respectively. If the valence correspondence between brightness and hand contributed to the congruity effect, however, participants should have shown better overall performance when bright (+) was assigned to the dominant (+) right hand, than to the nondominant (−) left hand. In addition, the impact of size–brightness correspondence (i.e., the congruity effect) should have been more pronounced in the former case. Neither of these effects was observed.
These arguments are based on the assumption that big is always positively valenced. However, the valence associated with size is highly context sensitive. Although a big apple, relative to a small apple, is likely to be positive, a big tumor is undoubtedly more negative than a small tumor. It is clear, therefore, that context is important and that it is the valence of the keys used in the main experiment that is crucial. With this in mind, it will be noted that the preliminary observations reported above failed to reveal an association between key size and valence (see Table 1). In addition, and as a final step toward eliminating valence correspondence as an explanation of the size–brightness congruity effect, one of the authors (L.W.) completed a pilot study in which she repeated the speeded classification experiment, with 24 participants classifying each of six words (from the three antonym pairs good–bad, nice–nasty, amusing–boring) according to their valence. There was no congruity effect involving the valence of a test word and key size, F < 1.
Surface brightness or illuminant brightness?
Simple images on a computer screen, such as the solid circles presented in the brightness classification task, can have a rather ambiguous status. On the one hand, they are sources of illumination, with high levels of illuminance when white, and low levels of illuminance when black. As sources of illumination, the circles would be perceived as holes in a screen (in the present study, a midgray screen), with the source of light being located in the space behind the screen. On the other hand, they are depictions of objects varying in surface brightness, and this is how they have been conceptualized in the discussion thus far. However, is there anything to confirm this interpretation, apart from the fact that the experimenters and participants seem to have perceived them as objects, and the fact that much other research in visual perception would need to be revisited if, instead, they were perceived as holes?
Brightness as illuminance behaves as a magnitude dimension, thereby entering into correspondence with a whole range of other magnitude-based dimensions, including loudness and hand-grip force (see Stevens, 1975, and Marks, 1978, for reviews). And, with demonstrations that a brighter back-illuminated aperture is perceived to be bigger than a darker aperture (Robinson, 1954), these correspondences appear to include size. It is clear, therefore, that brightness as illuminance should, and does, align itself with big, rather than with small. But this is not what was observed in the present study. Here, brightness was aligned with small, which agrees with other findings in which surface brightness has been isolated from illuminant brightness. For example, when participants in an additional rating study (Walker & Walker, 2012) indicated what cross-sensory features were possessed by circles similar to those presented in the classification task reported presently, they reported brighter circles to be smaller, weaker, quieter, and lighter in weight than darker circles, which is the opposite of what would be expected for illuminant brightness. In addition, when Marks (1987) explored congruity effects in speeded classification that involved brightness, he presented contrasting levels of brightness in each of two ways. First, he presented small rectangles varying in brightness in the center of a computer screen that was otherwise dark (i.e., black), and that remained so throughout the experiment. Second, he presented similar rectangles on a midgray background and arranged for them to be brighter or darker than the background, which is the same type of manipulation that was employed in the present study. Marks (1987) regarded the first manipulation as varying illuminant brightness (with the stimuli appearing as “luminous spots,” p. 385), but regarded the second manipulation as varying surface brightness (with the stimuli appearing as “dark or light surfaces”, p. 385). His interpretation was confirmed by the different patterns of congruity effect he observed for the two types of brightness manipulation. Only illuminant brightness interacted with loudness to yield a magnitude-based congruity effect (i.e., people responded more quickly and accurately when brighter visual stimuli appeared with louder sounds) (see the Appendix, below, for further information regarding Marks, 1987 study). In summary, therefore, the weight of evidence indicates that the circles presented in the brightness classification task were perceived as objects, rather than as holes, and that the variation in brightness was encoded as variation in their surface properties.
General discussion
When participants (adult nonsynaesthetes) explored, by touch alone, three wooden balls varying in size, they judged smaller to be bright, and bigger to be dark. This correspondence between size and brightness was anticipated because both features share the same relationship with auditory pitch (i.e., higher pitched sounds are both smaller and brighter than lower pitched sounds), and because it was predicted that a kind of transitivity would hold among correspondences in terms of the directions in which dimensions align themselves (see Stevens, 1975, p. 105). Observing the predicted size–brightness correspondence is consistent with this prediction.
Participants’ judgements confirmed the functional bidirectionality of the correspondence between pitch and size (i.e., high pitch is small, and small is high pitched) and, by implication, the bidirectionality of all correspondences (cf. Martino & Marks, 2001). As was pointed out already, transitivity and bidirectionality together imply that the same set of correspondences should emerge regardless of the sensory channel used to probe it. For example, with auditory pitch and haptic size accessing the same amodal connotative features, the same set of correspondences should be observed regardless of which of these features is used to probe the correspondences. Results from the preliminary study confirmed this, showing that smaller haptic objects are like higher pitched sounds (see the introduction) in being more active, brighter, faster, lighter in weight, quieter, sharper, and weaker than their opposites. This result supports the claim that there exists a core set of correspondences that is accessed, whichever sensory channel is used to probe it.
When the smallest and largest balls were used as response keys in a speeded brightness-classification task, the same size–brightness correspondence gave rise to a congruity effect. This effect occurred even though key size was task irrelevant, and one possible interpretation of this is that the size–brightness correspondence, like other correspondences, can have an impact on behavior automatically.
What kind of correspondence is that between size and brightness?
If the size–brightness correspondence does not qualify as a correspondence having a structural, statistical, or linguistic (lexical) basis (see the introduction), and does not reflect a generic correspondence based on valence, what kind of correspondence is it? That is, what new type of correspondence needs to be added to Spence’s (2011) classification?
Following the lead provided by Karwoski, Osgood, and their colleagues (e.g., Karwoski et al., 1942), it is proposed that the correspondence between size and brightness reflects the manner in which the two dimensions interact at the level of connotative meaning (see Fig. 2), without linguistic mediation. Thus, haptic smallness connotes an amodal sense of smallness that then induces connotations of brightness. Similarly, visual surface brightness connotes an amodal sense of brightness that then induces connotations of smallness. It is the mutual induction of connotative meanings across dimensions (here, between connotations of size and brightness) that is the most direct basis for the size–brightness correspondence and, presumably, for some other correspondences.
Because connotative meanings are amodal in nature, they are well placed to explain the cross-sensory nature of correspondences: The same connotative meanings are induced by stimulation of any sensory channel, including the channel of most obvious relevance (e.g., connotations of brightness can be induced by visual stimulation as well as by sound). This also applies to haptic size which, as well as supporting the direct perception of size, induces connotations of size which, like other connotative meanings, are context sensitive: The same value for haptic size will induce connotations of either bigness or smallness, depending on the context in which it is encountered. The connotations of brightness that this, in turn, induces will vary accordingly.
The convergence of size and brightness at the level of connotative meaning appears to occur in the absence of common lexical terms marking contrasting values of size and brightness. If so, then this absence of lexical overlap distinguishes the size–brightness correspondence from the linguistic correspondences identified by Spence (2011). Perhaps, however, the latter type of correspondence also is essentially semantic in nature, with the presence of shared linguistic terms arising secondarily from this. The cross-cultural universality of the connotative meanings of elementary stimulus features (e.g., Osgood, 1960), and the correspondences they give rise to, would explain why similar linguistic terms are used to reflect the same correspondences across different languages. For example, cultures that do not use their equivalent of high and low to mark contrasting levels of auditory pitch instead use their terms for light/heavy, sharp/blunt, small/big, thin/thick, or weak/strong (cf. Eitan & Timmers, 2010). These are, of course, just the terms that the proposed set of core correspondences would predict, and their use suggests that the set of core correspondences provisionally and partially identified in the present study have cross-cultural relevance.
Semantic differential theory
The work of Karwoski et al. (1942) became situated in studies employing the semantic differential technique. A consistent finding from these studies is that connotative feature associations revolve around three independent factors identified as evaluation, potency, and activity. On this basis, therefore, at least three distinct sets of core correspondences are to be expected, encouraging a little reflection on the suggestion that only a single core set of correspondences exists.
The core set of correspondences identified in the present study do not have, as a common theme, either valence or potency. Valence was specifically ruled out as a factor in the correspondence between size and brightness. Similarly, common coding based on potency (i.e., a strength-based structural account) also was ruled out. The tripartite scheme at the heart of the semantic differential would require, therefore, that the set of correspondences revealed in the present study revolve around the concept of activity. This seems unlikely, however, not least because activity is another magnitude dimension, and yet some of the key feature dimensions within the core set of correspondences revealed here are not magnitude dimensions. Most notable among these is auditory pitch (e.g., Smith & Sera, 1992). Furthermore, Eitan and Timmers (2010) submitted their cross-sensory ratings of auditory pitch to principal component analysis, in the manner of the semantic differential technique, and although they confirmed evaluation, potency, and activity as three important underlying factors, they observed them not to be the strongest predictors of the variability among cross-sensory feature associations. This status went to a factor, which included brightness, which Eitan and Timmers found difficult to conceptualize. Just how many core sets of correspondences there are, and how they mesh with the tripartite scheme associated with studies of the semantic differential, remains for further research to clarify.
How might the congruity effect be explained?
It can be assumed that two conditions need to be in place for a particular correspondence to give rise to a congruity effect in speeded classification. First, the classification decision (and response selection) needs to refer to the same type of representation on which the correspondence is based. Second, the criterial and incidental stimulus features need to converge on this type of representation. If the size–brightness correspondence is based on connotative meanings, then the congruity effect observed in the brightness classification task should reflect interactions at relatively late (nonsensory) levels of information processing. Two features of the classification task confirm this. First, because participants grasped the two response keys continuously throughout each block of trials, one of the two keys, and its relative size, became salient only after the visual stimulus had been classified as being bright or dark. Prior to the classification of the stimulus, both keys, and both values for key size, were equally salient, and congruity was not yet an issue. Second, when a test circle was brighter or darker than the midgray background, its surface brightness could still take on any one of three values. This noticeable variation in surface brightness, which had no implications for stimulus classification (and, therefore, for response selection), did not interact with key size to yield a congruity effect. That is, within the conditions associated with a particular task-defined category of brightness (i.e., the brighter and darker conditions), participants did not respond more quickly when higher (lower) levels of surface brightness were paired with the smaller (bigger) response key. The absence of a congruity effect arising from these within-category variations in brightness is entirely consistent with the claim that the main congruity effect originated at levels of processing subsequent to the brightness classification of the stimulus.
Figure 3 illustrates the functional components determining the nature of the correspondence between size and brightness, and the basis of the congruity effect arising from this. Sensory-perceptual levels of representation are distinguished from connotative meanings, and different dimensions of connotative meaning are shown as being aligned in a way that is consistent with the core set of cross-sensory correspondences provisionally identified here. The six different levels for visual brightness are shown as mapping onto two contrasting levels of connotative brightness, as specified in the task instructions (i.e., in a context-sensitive manner). Similarly, the two levels for haptic size are shown as mapping onto contrasting levels of connotative size, even though this mapping was not mentioned in the instructions. Although five dimensions of connotative meaning are depicted, these are not intended to be exhaustive, and the two most relevant dimensions for present purposes are emphasised. In addition, although each and every connotative dimension is assumed to have reciprocal connections with every other, only connections from connotative size to connotative brightness are illustrated. Finally, but importantly, response selection is assumed to be based on connotative brightness.
Correct classification of the visual stimulus is assumed to develop over time. On a congruent trial, such as where the classification of a visual stimulus as bright requires the smaller key to be pressed, this key becomes progressively the more salient of the two. The increasing salience of this key’s relative haptic size then translates into evidence for connotative smallness, which, in turn, translates into connotative brightness. In this way, the correct classification of the visual stimulus as being bright is reinforced, and selection of the correct response is facilitated. Conversely, on an incongruent trial, such as where the connotative brightness of the visual stimulus needs to be confirmed by pressing the bigger of the two keys, the increasing salience of the relative haptic size of this key provides evidence for connotative bigness, which then translates into evidence for connotative darkness. However, this evidence for connotative darkness contradicts the connotations of brightness originating with the visual stimulus, thereby impeding correct stimulus classification and creating conflicting response tendencies.
Further research
The proposal that response selection in the brightness classification task is based, at least in part, on the connotations of brightness and darkness indicated by the visual stimuli is amenable to experimental validation. So also is the proposal that the mapping of sensory-perceptual features onto dimensions of connotative meaning is context sensitive. Although in the present study, we provided evidence in support of this proposal in relation to visual brightness, the same also should apply to haptic size, despite this being a task irrelevant feature. The categorization of a response key as big or small will, to some extent at least, reflect its size relative to the key with which it is paired, rather than its absolute size. We have evidence for this from an additional study in which the wooden ball of intermediate size was used as one of the response keys. It was observed to behave as a small key (in terms of how it contributed to a congruity effect) when paired with the biggest of the three wooden balls, but as a big key when paired with the smallest of the three balls.
Finally, with regard to the possibility that the size–brightness correspondence reflects a natural co-occurrence between the two features, the absence of evidence one way or the other means that contrasting claims can be made. In the present discussion, it is the lack of evidence that has been emphasized. However, a real need exists for a comprehensive programe of research to reveal the existence of natural co-occurrences, even those that are generally assumed to exist (e.g., between size and pitch), and to assess the consistency with which they support a particular correspondence. For example, although several natural co-occurrences might link size and brightness, they might do so in contradictory ways, some aligning small with bright, others aligning it with dark. In the context of a study in which a correspondence between surface brightness and weight was demonstrated, Walker, Francis, and Walker (2010) described a very tentative assessment of the existence of relevant co-occurrences. They reported having obtained samples of rocks (i.e., pebbles), and of different types of wood, and measuring both the surface brightness and density of each specimen. In the case of pebbles, they reported no association between surface brightness and weight, and a geologist colleague indicated that, in principle, an association would not be expected. With regard to different types of wood (taken from Edlin, 1969), Walker et al. reported a very modest, although significant, association between surface brightness and weight, with darker timber tending to be heavier than lighter colored timber. However, this association arose because of the exceptional heaviness and darkness of ebony. Furthermore, as Edlin pointed out, different types of wood do not vary in density when trees are freshly cut, only after they have been thoroughly air dried. Studies such as these, along with an examination of what, in principle, should be expected, have the potential to provide a sounder basis from which claims can be made regarding whether cross-sensory correspondences can, or cannot, have their origins in natural co-occurrences.