Database of synesthetic color associations for Japanese kanji

Hamada, Daisuke; Yamamoto, Hiroki; Saiki, Jun

doi:10.3758/s13428-015-0691-z

Database of synesthetic color associations for Japanese kanji

Published: 07 January 2016

Volume 49, pages 242–257, (2017)
Cite this article

Download PDF

Behavior Research Methods Aims and scope Submit manuscript

Database of synesthetic color associations for Japanese kanji

Download PDF

Daisuke Hamada¹,
Hiroki Yamamoto¹ &
Jun Saiki¹

3853 Accesses
3 Citations
4 Altmetric
Explore all metrics

Abstract

Synesthesia is a neurological phenomenon in which certain types of stimuli elicit involuntary perceptions in an unrelated pathway. A common type of synesthesia is grapheme–color synesthesia, in which the visual perception of letters and numbers stimulates the perception of a specific color. Previous studies have often collected relatively small numbers of grapheme–color associations per synesthete, but the accumulation of a large quantity of data has greater promise for uncovering the mechanisms underlying synesthetic association. In this study, we therefore collected large samples of data from a total of eight synesthetes. All told, we obtained over 1000 synesthetic colors associated with Japanese kanji characters from each of two synesthetes, over 100 synesthetic colors form each of three synesthetes, and about 80 synesthetic colors associated with Japanese hiragana, Latin letters, and Arabic numerals from each of three synesthetes. We then compiled the data into a database, called the KANJI-Synesthetic Colors Database (K-SCD), which has a total of 5122 colors for 483, 46, and 46 Japanese kanji, hiragana, and katakana characters, respectively, as well as for 26 Latin letters and ten Arabic numerals. In addition to introducing the K-SCD, this article demonstrates the database’s merits by using two examples, in which two new rules for synesthetic association, “shape similarity” and “synesthetic color clustering,” were found. The K-SCD is publicly accessible (www.cv.jinkan.kyoto-u.ac.jp/site/uploads/K-SCD.xlsm) and will be a valuable resource for those who wish to conduct statistical analyses using a rich dataset in order to uncover the rules governing synesthetic association and to understand its mechanisms.

Grapheme-color associations can transfer to novel graphemes when synesthetic colors function as grapheme “discriminating markers”

Article 06 April 2020

Kyuto Uno, Michiko Asano, … Kazuhiko Yokosawa

Do Synesthetic Colors Grab Attention in Visual Search?

Article 21 July 2015

Berit Brogaard, Kristian Marlow & Kevin Rice

Apparent physical brightness of graphemes is altered by their synaesthetic colour in grapheme-colour synaesthetes

Article Open access 18 November 2020

Kyuto Uno & Kazuhiko Yokosawa

Synesthesia is a neurological phenomenon in which ordinary stimuli elicit vivid individual perceptions in unrelated pathways, without a corresponding physical stimulus. For example, grapheme–color synesthetes experience subjective colors upon viewing black alphanumeric characters (Dixon, Smilek, & Merikle, 2004; Ramachandran & Hubbard, 2001). Hearing–color synesthetes similarly “see” colors upon hearing particular sounds (Baron-Cohen, Wyke, & Binnie, 1987). Lexical–gustatory synesthetes “taste” particular flavors and experience particular oral textures upon hearing/reading particular words (Ward & Simner, 2003). These synesthetic perceptions arise without a corresponding sensory input. Therefore, studies on these forms of synesthesia must isolate the processes of consciousness from sensory processing, and in doing so may provide a clue to understanding perceptual consciousness. Indeed, brain-imaging studies conducted among synesthetic participants have provided evidence identifying brain regions important for color perception (Hubbard, Arman, Ramachandran, & Boynton, 2005; Nunn et al., 2002; Sperling, Prvulovic, Linden, Singer, & Stirn, 2006). Synesthesia may also provide insight for understanding various aspects of cognition, including mental imaging (Barnett & Newell, 2008), memory (Luria, 1968; Tammet, 2006), art and creativity (Rich, Bradshaw, & Mattingley, 2005; Ramachandran & Hubbard, 2001), and numeracy (Rich et al., 2005).

Most of the knowledge of synesthesia has come from studies of grapheme–color synesthetes, who experience the most common form of synesthesia. In this form, the visual perception of letters and numbers (graphemes) induces simultaneous perception of a given color (e.g., the letter “F” may be green, and the number “2” may be red; Cytowic & Eagleman, 2009). Two important characteristics of this synesthetic experience should be noted. First, the grapheme–color association is surprisingly strong. Rich et al. (2005) showed that grapheme–color pairs were consistent in individuals, with almost no change since childhood. Asano and Yokosawa (2012) showed that kanji–color pairs also have consistency over time. Although a synesthete occasionally acquires different synesthetic color associations from his or her original ones, these new color associations are temporary and are lost over time (Mills et al., 2002). This consistency of associations has been used as a diagnostic criterion for synesthesia. Using this criterion, the prevalence of grapheme–color synesthesia has been estimated at anywhere from 1 in 2000 (Baron-Cohen, Burt, Smith-Laittan, Harrison, & Bolton, 1996) to 1 in 200 or more (Ramachandran & Hubbard, 2001). The second important characteristic is that the phenomenological aspect of color sensation in grapheme–color synesthesia is very heterogeneous across individuals. Dixon et al. (2004) showed that some grapheme–color synesthetes, termed “projectors,” perceived their associated colors visually in external space, characterizing them as existing “out there on the page.” Others, termed “associators,” perceived their colors in internal space, characterizing them as existing “in my mind’s eye” or “in my head.” Researchers often use this categorization to classify synesthetes. But currently there are controversies regarding the validity of the “associator” versus “projector” categorization (see Chiou & Rich, 2014).

What determines the associations between a grapheme and a color? There is some evidence of commonality between individuals. English-speaking synesthetes often associate synesthetic colors with the initial letter of common color words, such as “R” being red and “G” being green (Rich et al., 2005; Simner et al., 2005), and with phonological information, such as “I” [aɪ] being white [waɪt] (Rich et al., 2005). However, several studies have shown that associations are idiosyncratic between individuals (Laeng, Svartdal, & Oelmann, 2004; Ward, Li, Salih, & Sagiv, 2007). For example, when shown the letter “B,” one individual may report blue, another green, and others yellow. Even synesthetic monozygotic twins report different colors stimulated by the same letter (Rich et al., 2005). These aspects of synesthetic associations suggest that direct correspondences between a grapheme and a color (a first-order relation) are too elusive to guide the investigation of possible associative mechanisms.

Given this difficulty with establishing first-order relations, recent studies have begun to explore correspondence between the relationships among graphemes, on the one hand, and the relationships among colors, on the other (a second-order relation; Watson et al., 2012). Brang, Rouw, Ramachandran, and Coulson (2011) showed that similarity between synesthetic colors depended on similarity in the shapes of letters. Eagleman (2010) argued that letters early in their respective alphabets (e.g., A, B, C, D) tended to be associated with colors that are more distinct from each other, whereas letters that come later (e.g., V, W, X, Y) tended to be associated with colors that were quite similar to each other. This ordinal relationship has also been found for Japanese hiragana characters (Asano & Yokosawa, 2013).

Accumulating a drastically larger dataset of first- and second-order relationships between graphemes and colors would be a promising way, if not the only effective way, to understand the mechanisms underlying synesthetic associations. Such a dataset would allow us to more thoroughly analyze synesthetic mechanisms from various angles. There have been forays into this approach; for instance, Rich et al. (2005) examined 192 adult synesthetes and showed that certain letters and numerals are likely to induce specific colors (i.e., R → red, Y → yellow, D → brown). This study also revealed that synesthetic colors could be derived from parts of conventional sequences such as days of the week, letters, or numerals. Eagleman (2010) launched the Web-based Synesthesia Battery, and this project has verified almost 10,000 synesthetes across 15 types of synesthesia and eight languages. In fact, the ordinal relationship described above was found by using this battery. Although these databases have great utility, they have one fundamental limitation: The number of synesthetic colors per synesthete is relatively small. Previous studies have often been limited to color associations with only 26 Latin letters (A–Z) and ten Arabic numerals (0–9), and this relatively small sample per synesthete confines the analysis to the global structure of synesthesia common across synesthetes. If samples per synesthete could be greatly enlarged, we might uncover novel rules governing the fine structure of synesthetic associations.

Therefore, in this study we took a new approach, complementary to existing databases, in which thousands of synesthetic colors were collected from each of two synesthetes, hundreds of synesthetic colors from each of three synesthetes, and about 80 synesthetic colors from each of three synesthetes. This study had two major goals. The first goal was to create a database that includes synesthetic colors for 483 kanji, 46 hiragana, and 46 katakana. Hiragana and katakana are phonetic characters, whereas kanji are Japanese logographic characters, each of which has its own meaning. These characters originated from Chinese and are mainly used to write content words. Kanji have at least two important characteristics, as they relate to synesthetic color associations. The first is that kanji are visually composed of multiple orthographic components, called radicals. Most kanji have two or more radicals, and many kanji characters share common radicals. For example, the characters 決, 河, 活, and 流 share the common sanzui radical, 氵, which signifies water. Radicals may be linked to colors, and radical commonality may underlie the generation of synesthetic colors (Asano & Yokosawa, 2012). The second important characteristic is that most kanji have multiple readings, termed the on-yomi and kun-yomi. On-yomi are derived from the original Chinese pronunciation of the character, whereas kun-yomi come from the Japanese pronunciation (Tamaoka, 1991). For example, the kanji 青 (meaning “blue”) is pronounced /sho/ or /sei/ in its on-yomi reading, and /ao/ in its kun-yomi reading. By taking advantage of these specific characteristics of kanji, we succeeded in collecting a number of synesthetic colors associated with kanji characters from each of five synesthetes.

The second goal of this study was to propose new research directions using the developed database. The new database should allow more thorough analyses of the structure of the association between a grapheme and a color. We illustrate two promising directions here. The first is the analysis of the first- and second-order relationships between a given kanji character and synesthetic color, taking advantage of kanji’s multifaceted properties for the study of regularities. Asano and Yokosawa (2011, 2012) previously reported that synesthetic kanji–color associations depend on both the semantic and phonological information of the character. The larger samples of kanji–color synesthetic pairs in our database will extend these findings regarding the structure of synesthetic associations. The second direction concerns a fundamental yet overlooked issue in the literature: Can any color be a synesthetic color for a synesthete? More explicitly, are synesthetic colors distributed randomly in the color space? Notably, this question cannot be answered without collecting a large amount of data for a single synesthete, which our study provides. We will illustrate techniques of spatial statistics to explore the pattern of distribution of synesthetic colors in the color space.

Method

We collected a large number of synesthetic colors for Japanese kanji, hiragana, katakana, Latin letters, and Arabic numerals from each of eight synesthetes. Five of the synesthetes had colors associated with all kinds of characters (kanji–color synesthetes), and others had synesthetic colors associated only with hiragana, Latin letters, and Arabic numerals (hiragana–color synesthetes). The five kanji–color synesthetes participated in two experiments. The first employed a color selection task to determine the synesthetic colors that they experienced when stimulated with printed graphemes comprising kanji, hiragana, katakana, Latin letters, and Arabic numerals. The second experiment employed a color-matching task to determine their synesthetic color associations for Latin letters and Arabic numerals displayed on a CRT monitor. The three hiragana–color synesthetes participated only in the color-matching task.

Participants

Eight Japanese synesthetes (five females [S.H., A.H., M.H., A.M., and Y.H.] and three males [S.O., H.S., and Y.K.]; age range = 18–21 years) received payment (dependent on the experiment time) for their participation in the experiments. S.H. and S.O. were self-reported projector-type synesthetes, and the others were self-reported associator-type synesthetes. Their scores on the color match consistency test were below 1.0, indicating a consistent and authentic synesthetic experience (Eagleman, Kagan, Nelson, Sagaram, & Sarma, 2007).

Color selection experiment

Visual stimuli

The test stimuli comprised individual black graphemes printed centrally on a white card, called a “character card” (127 × 89 mm). The character cards, used in elementary education, included 482 kanji, 46 hiragana (あ–ん), 46 katakana (ア–ン), 26 Latin letters (A–Z), and ten Arabic numerals (0–9). Each kanji, hiragana, and katakana character was printed in 42-point HG textbook font (about 15 × 15 mm), and each Latin letter and Arabic numeral was printed in 72-point Arial font (about 25 × 33 mm). Character cards were placed on a desk covered by a black cloth and illuminated by a D65 daylight fluorescent lamp, which has a high color-rendering index of 98 (FL20S D-EDL-D65; Toshiba Inc., Tokyo, Japan). The viewing distance was approximately 40 cm.

Procedure

The participant was asked to view a character card and select the color(s) that best matched their perceived synesthetic color by using color chips from the Munsell Book of Color, Matte Finish Collection (Munsell Color, 1976). More specifically, in each trial they picked up a character card, placed it on the center of the desk, observed the character, and selected one of the five basic colors (5R, 5G, 5B, 5Y, 5P) that most resembled their synesthetic color. Next, they examined all four sheets for the selected basic color from the Munsell Book and selected the color chip that best matched the synesthetic color. Finally, they wrote the color chip’s Munsell code in a blank space on the character card. If they perceived multiple synesthetic colors for a character, the procedure was repeated, and the part of the character for which they perceived each synesthetic color was marked by a box. Trials were blocked for kanji, hiragana, and katakana (40 blocks of 12 or 13 cards for kanji; four blocks of 12 or 10 cards for hiragana or katakana, respectively), and the order of the trials was quasi-randomized.

Two of the participants, S.H. and A.H., performed a longer-term experiment (480 kanji characters) than did the others (240 kanji characters). Since the experiment was expected to be very time-consuming, we decided to allow S.H. and A.H. to perform the experiment in their home in their free time, after practice sessions that were performed in the laboratory. Care was taken to ensure that the experiment was performed in a constant visual environment. The other six synesthetes performed the experiment in our laboratory. All participants were instructed to view the cards under the high color-rendering lamp, to do the experiment regularly, and to keep an experimental diary. The diary confirmed that it took a long time to complete the experiment. Of the synesthetes who had kanji–color associations, S.H. spent 16 h 45 min, 1 h 4 min, and 54 min for the kanji, hiragana, and katakana sets, respectively, over a total of 17 days. A.H. spent 53 h 30 min and 2 h 4 min for the kanji and hiragana sets, respectively, over a total of 23 days. Because of the especially time-consuming nature of the experiment for this participant, the experimenter did not employ the katakana color selection task with A.H. S.O. spent 14 h 41 min, 54 min, and 40 min for the kanji, hiragana, and katakana sets, respectively, over a total of 7 days. M.H. spent 10 h 11 min, 49 min, and 57 min for the kanji, hiragana, and katakana sets, respectively, over a total of 7 days. H.S. spent 9 h 25 min, 1 h 4 min, and 55 min for the kanji, hiragana, and katakana sets, respectively, over a total of 5 days. The experimenters checked for mistakes in recording of the Munsell codes, and found a total of 13 typos of the Munsell codes (two typos for S.H.; six typos for A.H.; one typo for S.O.; two typos for M.H.; and two typos for H.S.) and excluded these selections.

Color matching using a CRT monitor

Visual stimuli

For the 26 Latin letters and ten Arabic numerals, the color coordinates (CIE L*a*b*) of each synesthetic color were determined by a color-matching task using a CRT monitor (1280 × 1024 pixels, 100 Hz, NANAO, FlexScan F980). The test character, displayed in Arial font (3° of visual angle), was located on the left side (3° from center) of the screen, and a reference color patch was located on the right side (3° from center). The test character was black, and the background color was gray (CIE x = .284, y = .341, Y = 30.0). The monitor was calibrated with a chromameter (CS-100A; Konica Minolta, Tokyo, Japan) using the Mcalibrator2 software (Ban & Yamamoto, 2013).

Procedure

In the laboratory setting, the participant was asked to adjust the color of the displayed reference patch so that it best matched their perceived synesthetic color for the test character. More specifically, after 3 min of dark adaptation followed by 2 min of light adaptation, participants started the color-matching trials. First, the character and the reference patch were displayed on the CRT monitor. The initial color of the reference patch was not shown by cordinates (L*a*b*) determined in the color selection experiment. Based on the cordinates determined in the color selection task, the initial color changed form –5, –5, –10 to +5, +5, +10 randomly for each trials. Next, the participant adjusted the color patch with a digital input device until he or she felt that it matched the perceived synesthetic color. The input device allowed the participant to alter the L*a*b* color coordinates of the color patch independently in steps of ±0.3, ±1, and ±1. The trial order was randomized.

Analysis

We used the CIE L*a*b* color space to analyze the distribution of synesthetic colors. The CIE L*a*b* coordinates specify colors using values along three axes: light and dark (L*), red and green (a*), and blue and yellow (b*). An L*a*b* color space is a uniform space in which a given spatial distance corresponds to an equivalent perceptual difference; that is, one unit corresponds approximately to one just-noticeable difference. This uniformity would be suitable for interpreting the distribution of synesthetic colors when analyzing them on the basis of a distance metric, as we describe below.

The distribution of synesthetic colors was investigated using spatial statistics analysis, performed with the spatstat (Baddeley, 2010) software, which is an R package for analyzing a spatial point pattern. Such point patterns fall into three general categories: random, clustered, and uniform (Fig. 1 top) (Bailey & Gatrell, 1995). In a random pattern, any point is equally likely to occur at any location, and points are not affected by other points. In a clustered pattern, many points are concentrated in some regions and are sparse in others. Finally, in a uniform pattern, every point is as far as possible from all nearby points. We considered the distribution of synesthetic colors in the a*b* chromaticity plane to be a 2-D point pattern, and then explored which distribution patterns the synesthetic colors exhibited.

More specifically, our exploratory analysis had three steps. First, the synesthetic color’s point pattern was visualized in the a*b* chromaticity plane. Second, the density of the point pattern was estimated by kernel estimation using the density() function. Kernel estimation calculates the density of points within a specified search radius (σ = 8.46) around each point. Third, the type of the synesthetic color’s point pattern was judged by the L function, computed from Ripley’s K function, which is defined as

$$ K(r)=E(r)/\lambda . $$

(1)

In the K function, which is a function of search radius r, E(r) is the expected number of points within radius r of an arbitrary point. λ is the density of points (synesthetic colors per unit region), estimated by λ = n/a, with n points (synesthetic colors) in a given region a. If a point pattern is random, the K function becomes

$$ K(r)=\pi {r}^2. $$

(2)

The L function can be computed from the K function as

$$ L(r)=\sqrt{K(r)/\pi }\ \hbox{--}\ r. $$

(3)

Using this L function, we could classify the synesthetic color’s point pattern as being one of the three types: random when L(r) = 0, clustered when L(r) > 0, or uniform when L($ r $) < 0, as is shown in Fig. 2a.

For further analysis, we referred to an extension of the L function, called the cross-L function, which can evaluate the dependency of two point patterns. This function is helpful for investigating the interrelation between the synesthetic colors stimulated by different groups of graphemes, such as by grapheme types (e.g., kanji vs. hiragana) or by the learning grade, and therefore complexity, of kanji (e.g., the simple一 and 二 vs. the more complex 雨 and 雲). The dependency of point pattern pairs was divided into three categories (Bailey & Gatrell, 1995): independent, attractive, or repulsive, shown in Fig. 1b. An independent pair would show no dependency between the two patterns, an attractive one would have two patterns that bunched together at specific places, and a repulsive one would have two mutually exclusive patterns. These types of synesthetic color point patterns were determined by the cross-L function, which is computed from Ripley’s cross-K function, defined as

$$ {K}_{AB}(r)=E(r)\ /\ {\lambda}_B. $$

(4)

Here, $ E(r) $ is the expected number of members of point pattern B within a given radius r of any arbitrary member of the point pattern A. $ {\lambda}_B $ is the density of the point pattern B, which can be estimated as $ {n}_B $/a, with a total of $ {n}_B $ members of the point pattern B in a given region a. The cross-L function can be computed from the cross-K function as

$$ {L}_{AB}(r) = \sqrt{K_{AB}(r)/\ \pi}\hbox{--}\ r. $$

(5)

Using this cross-L function, we could classify the synesthetic color point patterns as one of the three types: independent when $ {L}_{AB}(r) $ = 0, attractive when $ {L}_{AB}(r) $ > 0, or repulsive when $ {L}_{AB}(r) $ < 0, as is shown Fig. 2b.

Results

Synesthetic colors associated with kanji characters

We investigated the synesthetic colors associated with 482 kanji characters for S.H. and A.H., and 240 kanji characters for S.O., M.H., and H.S. We obtained a total of 4874 colors from five synesthetes, as specified using the Munsell color system. Additionally, about 80 synesthetic colors from each of three synesthetes were associated with hiragana characters. Notably, the number of synesthetic colors was larger than the number of kanji characters investigated. On average, a kanji character was associated with 2.26 colors for S.H., 4.61 colors for A.H., 1.92 colors for S.O., 1.55 colors for M.H., and 0.80 colors for H.S. Below, the thousands of synesthetic colors for kanji characters were analyzed.

The first question we asked was “How is a kanji character associated with multiple colors?” There seem to be at least four ways for S.H. The first was that synesthetic colors depended on the component radicals rather than on the character as a whole, and kanji characters are often composed of multiple radicals. For example, the character 決 was perceived in two parts, the aforementioned sanzui radical 氵, perceived as yellow, and the remaining 夬 component, perceived as red (Fig. 3a). This radical dependency was also seen in synesthetes S.O., M.H., and H.S. The second way was that a synesthetic color could depend on pronunciation, and kanji characters often have multiple readings. For example, the character 千 was perceived as yellow by S.H. when read as its on-yomi (/sen/), but it was perceived as red when read as its kun-yomi (/chi/) (Fig. 3b). The third way was that a kanji character as a whole sometimes had an overall color that differed from the local colors usually associated with its radicals. For example, the character 聞 is composed of two radicals (門 and 耳), which were perceived to have their own local colors (yellow-red and red-purple, respectively), but the character as a whole was perceived to be green (Fig. 3c). The fourth way was that a kanji was sometimes simply perceived to be two different colors. For example, the upper radical (ハ) of the 谷 character was sometimes perceived as purple-blue and sometimes as yellow (Fig. 3d).

For synesthete A.H., most synesthetic colors depended on finer component parts of the kanji characters, beyond the radicals. For example, the character 決 was divided into the radicals 氵 and 夬 by S.H., but A.H. further divided the夬 into the ユ and the remaining 人. The sanzui 氵 was perceived as red, the ユ component as a different red, and the 人 component as yellow (Fig. 4a). The character “識” was associated with the largest number of synesthetic colors (12 colors), and these synesthetic colors depended on detailed components of the kanji character (Fig. 4b).

We also investigated 46 hiragana characters, 46 katakana characters, 26 Latin letters, and ten Arabic numerals for S.H., S.O., M.H., and H.S., and all of these characters except the katakana for A.H. Some of these characters were associated with two or more colors for S.H., A.H., and M.H., and thus there were a slightly greater number of synesthetic colors than characters (S.H.: 144 colors for 128 graphemes; A.H.: 192 colors for 82 graphemes; S.O.: 122 colors for 128 graphemes; M.H.: 132 colors for 128 graphemes; H.S.: 83 colors for 128 graphemes). When these were added to the results from the kanji, a total of 1,229 (S.H.), 2,349 (A.H.), 551 (S.O.), 460 (M.H.), and 285 (H.S.) synesthetic colors were collected.

Figure 5 shows all of these colors in the L*a*b* color space for each of the five kanji–color synesthetes. The synesthetic colors seemed to be widely distributed across the three axes. To take a closer look, we projected these distributions in three 2-D planes: a* versus b* (Fig. 5b, f, j, n, and r), L* versus a* (Fig. 5c, g, k, o, and s), and L* versus b* (Fig. 5d, h, l, p, and t). Figure 6 shows all of synesthetic colors for each of the three hiragana–color synesthetes, in a similar fashion to Fig. 5. For both types of synesthetes, the point patterns in the chromatic plane seemed to show some regions where the points were concentrated and others where there were few points (a* vs. b*), suggesting that synesthetic colors were neither randomly nor uniformly distributed in the color space. This type of bias was also present for the luminance dimension. Focusing on the point pattern along the luminance axis for kanji–color synesthetes (L* vs. a* and L* vs. b*), we see a small number of dark colors, with L* less than about 30 (Fig. 5c, d, g, h, k, l, o, p, s, and t). For hiragana–color synesthete A.M., few dark colors can be seen with L* less than about 30 (Fig. 6c and d). In contrast, For Y.K. and Y.H., more dark colors can be seen with L* less than about 30 (Fig. 6g, h, k, and l). All together, it appears that the synesthetic colors are distributed more heavily in some regions in the color space. This tendency is investigated at greater length using spatial statistics in the Example of Applications section below.

We compiled these diverse synesthetic colors and the associations between kanji characters and colors into a database that could be helpful for the study of grapheme–color synesthesia. In the following sections, we describe the contents of the database and introduce new research directions that this database opens up.

Database contents

Figure 7 shows a section of the new database. The database denotes not just the synesthetic colors associated with each kanji character, but also gives additional information for each character, such as the radicals and pronunciations that affected the synesthetic colors, as we have described above (see also Table 1). This auxiliary information will potentially be useful for analyzing the first- and second-order relations of the synesthetic associations. The database was implemented as a Microsoft Excel file that comprised multiple sheets, including the kanji, hiragana, katakana, English graphemes (letters and numerals), and search sheets. The details of the contents of each sheet will be presented below.

Table 1 Synesthetic color information and auxiliary information for characters in the database

Full size table

Kanji sheet

The kanji sheet has each kanji character in a row and the various auxiliary information associated with each character in columns, hereafter called “classes.” The classes comprise three types: color class, elemental class, and supplemental class. The color class includes colors, defined using three types of chromaticity coordinates—the Munsell color system, CIE xyY, and CIE L*a*b* color coordinates—in conjunction with a visualization of the color by coloring the cell background.

The elemental class is the category of the various components in the kanji character. This had subclasses of “radical,” “constituent” “on-yomi,” “kun-yomi,” and “grade.” “Radical” describes the common subelements found in different kanji characters. The “constituent” subclass shows the component parts of the kanji character, allowing researchers to investigate the influences of all component shapes, not limited solely to radicals. The “on-yomi” and “kun-yomi” subclasses list the various readings of the kanji character, as we previously described. “Grade” shows the year in which the given kanji character is taught in Japanese elementary school (first through sixth grade), as stated by the Ministry of Education, Culture, Sports, Science and Technology of Japan’s appendix, “List of Kanji by School Year” in the Course of Study for Elementary School. Since the school year in which kanji characters are taught is standardized by the Ministry, this item will be potentially useful for research on the longitudinal tendencies of synesthetic colors.

The supplemental class is a category containing complementary information about the kanji characters or synesthetic colors. This is divided into five subclasses: “strokes,” “number of colors,” “color appearance,” “subjective report,” and “overall colors.” “Strokes” is the number of strokes in the kanji character. “Number of colors” shows the number of synesthetic colors associated with the character. “Color appearance” categorizes the various ways that the synesthete perceived the colors, divided into nine categories: (A) colors were divided into each part (“component”) of the kanji; (B) colors switched within the same part; (C) colors were divided into each part, but there was a different overall color when the character was seen as a whole; (D) there was an overall color for the character as a whole, but there was also a local color; (E) the color changed depending on the reading or meaning; (F) colors were divided into each part, but colors also switched in some parts; (G) indicates a synesthete can not show positions of synesthetic colors but can perceived colors somewhere; (H) there was a part with no color; and (I) there were colors in the mind’s eye. Finally, “subjective report” contains additional information on the synesthete’s subjective thoughts about the color appearance. “Overall colors” was the overall synesthetic color of a kanji character.

Hiragana, katakana, and English grapheme sheets

These sheets had each character in a row and select classes or subclasses associated with the character in the columns. The hiragana and katakana sheets had the color class and two supplemental subclasses (only “number of colors” and “subjective report”). The English graphemes sheet (with letters and numerals) only had the color class, determined as described in the Method section, containing the CIE xyY and L*a*b* coordinates measured in the CRT monitor color-matching task.

Search sheet

The search sheet can take arbitrary data from the kanji sheet. Figure 8 shows a screenshot of the search sheet, which has two functions for searching the database. The first is an OR search, in which a user can assign a maximum of two factors per subclass. The second is an AND search, for which a user can assign a maximum of two conditions. For these searches, the terms may be fully or partially matched. If a user would, for example, like to search for data on kanji that are first-grade level and were associated with red-purple synesthetic colors, as is shown in Fig. 8a, the user would input “1” for “grade,” with “full match” selected, as Condition 1, and input “RP” for color, with “partial match” selected, as Condition 2. Figure 8b shows the result of this example search, containing all returned entries fulfilling the condition.

Example of applications

Exploring the relationship between the multidimensional natures of kanji and synesthetic colors

Our database was useful for exploring regularity in the associations between kanji characters and synesthetic colors. Here, we illustrate this with an example in which the first- and second-order relationships of the association were examined, focusing on S.H., who had four types of grapheme–color associations.

At first we focused on the first-order relation; namely, which radicals were most frequently associated with what colors? To answer this, we determined the most frequent color among the five basic Munsell colors. In the database search sheet, we assigned each basic color (“R,” “Y,” “G,” “B,” and “P”) as the “color,” with “partial match” selected, as Condition 1. The most frequent color was yellow: 332 of 482 kanji characters had a yellowish color. We then explored the relationship between these yellowish colors and various radicals, to check for common radicals or constituents in the 332 detected characters. We found that yellowish colors were associated with the sanzui 氵, tsukanmuri ツ, and ninben イ radicals, in spite of the fact that sanzui means water. We then considered a possible second-order relationship: Do characters with shapes similar to these radicals have similar colors? To answer this, we examined three katakana characters with similar shapes: シ, ツ, and イ. Interestingly, these katakana characters also had a yellowish color. Taken together, for the most frequent color (yellowish), radicals not with similar meanings but similar shapes had similar colors.

Analysis of the global structure of synesthetic colors in the chromaticity space

Here we wanted to provide an answer for the question presented in the introduction: Are synesthetic colors distributed randomly in the color space? As we mentioned above, the distribution of synesthetic colors suggested the possibility that the point patterns were not random or uniform, but rather were clustered (see Fig. 5b, f, j, n, and r). To confirm this possibility statistically, we analyzed the point patterns of synesthetic colors using several techniques of spatial statics, which had three steps, noted in the Method section. At first, we projected the colors to the a*b* chromaticity plane, visualizing the point patterns of synesthetic colors (Fig. 9a, c, e, g, and i). Second, we estimated the density of each point pattern by kernel estimation (Fig. 9b, d, f, h, and j). To clarify the distributions of synesthetic colors for kanji–color synesthetes, the density of chromatic colors was also estimated after excluding the achromatic colors (S.H., 211 samples; A.H., 145 samples; S.O., 78 samples; M.H., 61 samples; H.S., 75 samples). As is clearly shown in Fig. 9b, d, f, h, and j, we found clusters of chromatic synesthetic colors for the kanji–color synesthetes. For the hiragana–color synesthetes, as is clearly shown in Fig. 10b, d, and f, there were also many clusters of synesthetic color. The high-density and low-density regions for each synesthete are summarized in Table 2. For two kanji–color synesthetes, S.H. and A.H., over 1,000 synesthetic colors were concentrated in multiple regions in the a*b* chromaticity plane. For the other three kanji–color synesthetes, S.O., M.H., and H.S., over 100 synesthetic colors were concentrated around an origin point.

Table 2 High-density and low-density regions of synesthetic colors in the a*b* chromaticity

Full size table

Finally, to confirm these synesthetic color clusters, we analyzed the types of point patterns using the L function (Fig. 11 for kanji–color synesthetes; Fig. 12 for hiragana–color synesthetes). As is shown in Figs. 11 and 12, the L functions returned positive values greater than the confidence intervals. This suggests that synesthetic colors are concentrated in multiple regions in the color space; that is, they form “synesthetic color clusters.”

Explanations of the script for spatial statistics of synesthetic colors

Since the database was made in the widely used Excel format, it can be easily imported to many kinds of statistical software. For example, the spatial statistics analysis above was done using R, as is shown in Fig. 13. The details of the analysis steps are explained below, to help researchers understand and modify the script for their own use.

Step 1: Prepare a spatstat package and input the row data into R. Then, the user can input a portion of the database as a text file—for example, the L*a*b* coordinates of synesthetic colors.
Step 2: Create a point pattern object using a class ppp object. The ppp(x, y, window = w, marks = mf) function will create the point pattern inside a rectangle. x (a*) and y (b*) are vectors of each of the coordinates of data points. The input window = w specifies a range of the polygon. Each coordinate value of the polygon was defined by the most achromatic colors (red: [a*, b*] = [82.28, 22.42]; green: [–139.00, 20.36]; yellow: [11.91, 134.92]; blue: [–14.69, –67.01]; purple: [116.06, –94.45]) from the Munsell renotation data (www.rit.edu/cos/colorscience/rc_munsell_renotation.php). Next, marks is defined, which is an additional attribute of each point in the pattern (Baddeley, 2010). For example, in addition to the color coordinates of synesthetic colors, a user could also record the kanji, hiragana, katakana, letter, and numeral. This marked point pattern is constructed by the mf in the script.
Step 3: Plot the point patterns and their density maps. First, the user can plot the point pattern using the plot(x, window = w) function, where x is a point pattern. Next, a user can plot a map of the kernel density estimation by a plot(density(x, σ)) call, where x is a point pattern and σ is the standard deviation of the isotropic Gaussian smoothing kernel used for the kernel method.
Step 4: Analyze the point patterns using the L function. The user can compute the K function for a point pattern x using the Kest function. Next, the K function can be used to plot the computed L function using plot(K, sqrt(iso/pi) ~ r).
Step 5: Analyze the two types of point patterns with the cross-L function. The user can compute the cross-type between all pairs of types simultaneously using the command alltypes(x, fun = “K”), where x is the marked point pattern and K is the K function. An example of the function for two types of point patterns (katakana vs. hiragana) for synesthete S.H. is shown in Fig. 14.
Fig. 14
Cross-L function of two types of point patterns for the chromatic synesthetic colors of synesthete S.H. The interval between the two blue lines indicates the 95 % confidence interval for complete spatial randomness
Full size image

Discussion

All previous studies on grapheme–color synesthesia have considered dozens of samples per synesthete (often 26 Latin letters and ten Arabic numerals). In this study, we created a new database, which we call the Japanese KANJI-Synesthetic Colors Database (K-SCD). The K-SCD has 1,229 and 2,349 synesthetic colors for each of two synesthetes, associated with 483 kanji, 46 hiragana, 46 katakana, 26 Latin letters, and ten Arabic numerals; 551, 460, and 285 synesthetic colors for each of three other synesthetes; and about 80 synesthetic colors for each of three synesthetes, associated with hiragana, Latin letters, and Arabic numerals. To our knowledge, no study has collected such a large number of synesthetic colors per synesthete. The large samples in the K-SCD should allow investigation of the intrapersonal structure of the grapheme–color synesthetic experience, and possibly lead to promising new ways of studying synesthesia.

We have illustrated two such ways here. Although these are preliminary studies, in the sense that only partial information from the database was utilized, they nevertheless provide important findings. The first example, taking advantage of the multidimensional properties of kanji, analyzed the regularity in the first- and second-order relationships between the radicals of kanji characters and their associated synesthetic colors. This analysis showed that radicals with shapes similar to katakana characters have similar colors to those katakana characters. This second-order relationship suggests that the colors associated with certain kanji may originate from those associated with katakana. This possibility is consistent with a previous finding (Asano & Yokosawa, 2012) showing that synesthetic colors for Japanese graphemes acquired at an early age (i.e., numerals, hiragana, and katakana) affect those acquired later in life (i.e., kanji) via their phonological and/or semantic content. The present results suggest the possibility that visual shape also contributes as a key factor to the process of synesthetic learning. Future studies may reveal other factors by analyzing auxiliary information such as the pronunciations, meanings, and grade levels of kanji.

In the second application of K-SCD, we investigated the distribution of synesthetic colors in a*b* chromaticity space, to understand the global structure of synesthesia. The spatial statistical analysis revealed that the synesthetic color distribution was neither random nor uniform. Instead, synesthetic colors formed multiple clusters that were separated from each other in a*b* chromaticity space. This result corroborates a previous similar finding (Yamamoto, 2009) based on hiragana, Latin letters, and Arabic numerals. Why did these synesthetic color clusters emerge? We conjecture that some kinds of first-order relationships determine the core colors of clusters in early synesthetic learning, and then second-order relationships, such as those based on similarity, determine the synesthetic colors of newly learned graphemes. In the second process, a new character may be tinged with a color similar to the core color of the character to which it is most similar, and the color distance from the core may be proportional to the distance in similarity between the two characters in a given grapheme space, leading to clustering. In this scenario, the synesthetic color clusters may reflect both between- and within-category differences in the grapheme similarity space.

The emergence of the synesthetic color clusters may be further analyzed using the cross-L function to investigate the interdependency of two point patterns. Point patterns can be divided into three categories: independent, attractive, or repulsive (see Fig. 1, bottom). Our hypothesis above predicts attractive dependencies between kanji and hiragana characters and between kanji characters across learning grades. This is supported by our preliminary analysis for the interrelation between hiragana characters and Latin letters (Yamamoto, 2009). A preliminary analysis for the interrelation between hiragana characters and katakana characters. (sample codes are shown in Fig. 13) also supports this hypothesis.

In conclusion, the K-SCD provides researchers with a variety of information regarding synesthetic color associations and enables novel research directions. The database, taking advantage of the multidimensionality of kanji characters and the abundance of possible samples, will be beneficial in developing our understanding of synesthesia. The database will be updated continually, as new data from new synesthetes and synesthetic color associations are obtained, or as room for improvement is found.

References

Asano, M., & Yokosawa, K. (2011). Synesthetic colors are elicited by sound quality in Japanese synesthetes. Consciousness and Cognition, 20, 1816–1823. doi:10.1016/j.concog.2011.05.012
Article PubMed Google Scholar
Asano, M., & Yokosawa, K. (2012). Synesthetic colors for Japanese late acquired graphemes. Consciousness and Cognition, 21, 983–993. doi:10.1016/j.concog.2012.02.005
Article PubMed Google Scholar
Asano, M., & Yokosawa, K. (2013). Grapheme learning and grapheme–color synesthesia: Toward a comprehensive model of grapheme–color association. Frontiers in Human Neuroscience, 7, 757. doi:10.3389/fnhum.2013.00757
Article PubMed PubMed Central Google Scholar
Baddeley, A. (2010). Analysing spatial point patterns in R [Workshop notes]. Retrieved from https://research.csiro.au/software/wp-content/uploads/sites/6/2015/02/Rspatialcourse_CMIS_PDF-Standard.pdf
Bailey, T. C., & Gatrell, A. C. (1995). Interactive spatial data analysis. Harlow, UK: Longman Higher Education.
Google Scholar
Ban, H., & Yamamoto, H. (2013). A non-device-specific approach to display characterization based on linear, nonlinear, and hybrid search algorithms. Journal of Vision, 13(6), 20. doi:10.1167/13.6.20. 1–26.
Article PubMed Google Scholar
Barnett, K. J., & Newell, F. N. (2008). Synaesthesia is associated with enhanced, self-rated visual imagery. Consciousness and Cognition, 17, 1032–1039. doi:10.1016/j.concog.2007.05.011
Article PubMed Google Scholar
Baron-Cohen, S., Burt, L., Smith-Laittan, F., Harrison, J., & Bolton, P. (1996). Synaesthesia: Prevalence and familiarity. Perception, 25, 1073–1079.
Article PubMed Google Scholar
Baron-Cohen, S., Wyke, M. A., & Binnie, C. (1987). Hearing words and seeing colours: An experimental investigation of a case of synaesthesia. Perception, 16, 761–767.
Article PubMed Google Scholar
Brang, D., Rouw, R., Ramachandran, V. S., & Coulson, S. (2011). Similarly shaped letters evoke similar colors in grapheme–color synesthesia. Neuropsychologia, 49, 1355–1358. doi:10.1016/j.neuropsychologia.2011.01.002
Article PubMed Google Scholar
Chiou, R., & Rich, A. N. (2014). The role of conceptual knowledge in understanding synesthesia: Evaluating contemporary findings from a “hub-and-spokes” perspective. Frontiers in Psychogy, 5, 105. doi:10.3389/fpsyg.2014.00105
Google Scholar
Cytowic, R. E., & Eagleman, D. M. (2009). Wednesday is indigo blue: Discovering the brain of synesthesia. Cambridge, MA: MIT Press.
Google Scholar
Dixon, M. J., Smilek, D., & Merikle, P. M. (2004). Not all synaesthetes are created equal: Projector versus associator synaesthetes. Cognitive, Affective, & Behavioral Neuroscience, 4, 335–343. doi:10.3758/CABN.4.3.335
Article Google Scholar
Eagleman, D. M. (2010). What has large-scale analysis taught us? Article presented at the 2010 Meeting of the UK Synaesthesia Association, Brighton, U.K.
Eagleman, D. M., Kagan, A. D., Nelson, S. S., Sagaram, D., & Sarma, A. K. (2007). A standardized test battery for the study of synesthesia. Journal of Neuroscience Methods, 159, 139–145.
Article PubMed Google Scholar
Hubbard, E. M., Arman, A. C., Ramachandran, V. S., & Boynton, G. M. (2005). Individual differences among grapheme–color synesthetes: Brain–behavior correlations. Neuron, 45, 975–985. doi:10.1016/j.neuron.2005.02.008
Article PubMed Google Scholar
Laeng, B., Svartdal, F., & Oelmann, H. (2004). Does color synesthesia pose a paradox for early-selection theories of attention? Psychological Science, 15, 277–281.
Article PubMed Google Scholar
Luria, A. (1968). The mind of a mnemonist. New York, NY: Basic Books.
Google Scholar
Mills, C. B., Viguers, M. L., Edelson, S. K., Thomas, A. T., Simon-Dack, S. L., & Innis, J. A. (2002). The color of two alphabets for a multilingual synesthete. Perception, 31, 1371–1394. doi:10.1068/p3429
Article PubMed Google Scholar
Munsell Color. (1976). Munsell book of color: Matte finish collection. Baltimore, MD: Author.
Google Scholar
Nunn, J. A., Gregory, L. J., Brammer, M., Williams, S. C., Parslow, D. M., Morgan, M. J., . . . Gray, J. A. (2002). Functional magnetic resonance imaging of synesthesia: Activation of V4/V8 by spoken words. Nature Neuroscience, 5, 371–375. doi:10.1038/nn818
Ramachandran, V. S., & Hubbard, E. M. (2001). Synesthesia: A window into perception, thought and language. Journal of Consciousness Studies, 8, 3–34.
Google Scholar
Rich, A. N., Bradshaw, J. L., & Mattingley, J. B. (2005). A systematic, large-scale study of synaesthesia: Implications for the role of early experience in lexical–colour associations. Cognition, 98, 53–84. doi:10.1016/j.cognition.2004.11.003
Article PubMed Google Scholar
Simner, J., Ward, J., Lanz, M., Jansari, A., Noonan, K., Glover, L., & Oakley, D. A. (2005). Non-random associations of graphemes to colours in synaesthetic and non-synaesthetic populations. Cognitive Neuropsychology, 22, 1069–1085. doi:10.1080/02643290500200122
Article PubMed Google Scholar
Sperling, J. M., Prvulovic, D., Linden, D. E., Singer, W., & Stirn, A. (2006). Neuronal correlates of colour–graphemic synaesthesia: A fMRI study. Cortex, 42, 295–303.
Article PubMed Google Scholar
Tamaoka, K. (1991). Psycholinguistic Nature of the Japanese orthography. Studies in Language and Literature, 11, 49–82.
Google Scholar
Tammet, D. (2006). Born on a blue day. London: Hodder & Stoughton.
Google Scholar
Ward, J., Li, R., Salih, S., & Sagiv, N. (2007). Varieties of grapheme–colour synaesthesia: A new theory of phenomenological and behavioural differences. Consciousness and Cognition, 16, 913–931. doi:10.1016/j.concog.2006.09.012
Article PubMed Google Scholar
Ward, J., & Simner, J. (2003). Lexical–gustatory synaesthesia: Linguistic and conceptual factors. Cognition, 89, 237–261. doi:10.1016/S0010-0277(03)00122-7
Article PubMed Google Scholar
Watson, M. R., Akins, K. A., & Enns, J. T. (2012). Second-order mappings in grapheme–color synesthesia. Psychonomic Bulletin & Review, 19, 211–217. doi:10.3758/s13423-011-0208-4
Article Google Scholar
Yamamoto, H. (2009). An exploratory analysis of the synesthetic color distribution in chromaticity space [in Japanese]. Paper presented at the 73rd Annual Convention of the Japanese Psychological Association, Japan, Kyoto.

Download references

Author note

We thank Maho Taniguchi for collecting the data, and Haruna Kawasaki for pilot analyses.

Author information

Authors and Affiliations

Graduate School of Human and Environmental Studies, Kyoto University, Yoshida, Nihonmatsu-cho, Sakyo-ku, Kyoto, 606-8501, Japan
Daisuke Hamada, Hiroki Yamamoto & Jun Saiki

Authors

Daisuke Hamada
View author publications
You can also search for this author in PubMed Google Scholar
Hiroki Yamamoto
View author publications
You can also search for this author in PubMed Google Scholar
Jun Saiki
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Hiroki Yamamoto.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Hamada, D., Yamamoto, H. & Saiki, J. Database of synesthetic color associations for Japanese kanji. Behav Res 49, 242–257 (2017). https://doi.org/10.3758/s13428-015-0691-z

Download citation

Published: 07 January 2016
Issue Date: February 2017
DOI: https://doi.org/10.3758/s13428-015-0691-z

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Database of synesthetic color associations for Japanese kanji

Abstract

Similar content being viewed by others

Grapheme-color associations can transfer to novel graphemes when synesthetic colors function as grapheme “discriminating markers”

Do Synesthetic Colors Grab Attention in Visual Search?

Apparent physical brightness of graphemes is altered by their synaesthetic colour in grapheme-colour synaesthetes