In Chinese, characters are monosyllabic morphemes that are often combined to form a two-character word. According to the Institute of Language Teaching and Research (1986), 73.6% of modern Chinese words are two-character words (see Li et al., 2015, for more discussion), suggesting that the majority of Chinese words consist of two characters (Myers, 2006; Packard, 2000). Given this important feature of Chinese, Tse et al.’s (2017) megastudy focused on two-character words when developing a Chinese database of normed lexical decision (i.e., deciding whether a two-character string forms a Chinese word, e.g., 朋友friend, or a nonword, e.g., 形忌). In the present study, we expanded this database by norming speeded naming (i.e., reading aloud a Chinese word) performance, which has been considered one of the gold standards for assessing skilled readers’ word recognition abilities (e.g., Yap & Rickard Liow, 2016) and for developing computational models of lexical processing (e.g., Coltheart et al., 2001). Moreover, we compiled more lexical variables (e.g., phonological consistency) and evaluated the role of orthographic, phonological, and semantic variables in naming versus Tse et al.’s lexical decision performance. In our speeded naming task, Cantonese, rather than Mandarin, was used, as it is the native dialect of our participant population (i.e., Hong Kong, as also in our previous megastudy, Tse et al., 2017). Before elaborating on the details of our study, we first briefly introduce the megastudy approach and then selectively review naming studies of Chinese characters/words to motivate the use of that approach to test the predictions presented at the end of the Introduction section.

Megastudy approach in psycholinguistics

Using the megastudy approach in psycholinguistic studies, researchers compile the values of lexical variables for a very large pool of words and collect the normative reaction time (RT) and accuracy rates for these words in a lexical processing task. In contrast to the conventional factorial-design approach in which stimuli are selected based on a small set of criteria, the megastudy approach allows the language to define the stimuli (see Balota et al., 2013, and Tse et al., 2017, for reviews of problems associated with factorial-design experiments and how they can be addressed by the megastudy approach). After this, researchers conduct item-level multiple regression analyses on the normed data to test for the influence of lexical variables on word processing. The databases developed with this megastudy approach, often termed “Lexicon Project,” have been reported in American English (Balota et al., 2007), British English (Keuleers et al., 2012), single-character Chinese (Chang et al., 2016; Liu et al., 2007; Sze et al., 2014), two-character Chinese (Tsang et al., 2018; Tse et al., 2017), French (Ferrand et al., 2010), Dutch (Keuleers et al., 2010), Italian (Barca et al., 2002), and Malay (Yap et al., 2010). The item-level analyses help identify the proportion of unique variance of performance that lexical variables, whether continuous or categorical, explain after statistically controlling for the effect of other variables, and reveal their relative contribution in lexical processing (e.g., Sze et al., 2015). These analyses can also address other research questions, such as replicating theoretically important lexico-semantic effects (Balota et al., 2013), evaluating the impact of novel lexical variables (Kang et al., 2011; Yarkoni et al., 2008), assessing computational models (Perry et al., 2007), and revealing the role of individual differences in word recognition (Lim et al., 2020; Yap et al., 2012).

Tse et al. (2017) used the megastudy approach to establish a lexical decision database of RT and accuracy rates for more than 25,000 Chinese two-character words. Each of 594 native Cantonese-speaking participants made lexical decisions for about 1400 words and 1400 nonwords, presented over three sessions. Each word was normed based on about 33 responses. Item-level regression analyses were conducted to determine the influence of theoretically important lexical characteristics on lexical decision performance. Making use of these data, Tse and Yap (2018) compared the role of orthographic, phonological, and semantic variables in predicting lexical decision performance and found some theoretically driven interaction effects (e.g., word frequency × character frequency) that are not readily explainable by any model of Chinese word recognition. Lim et al. (2020) further demonstrated that the effects of some lexical variables (e.g., word frequency) on lexical decision could be moderated by participants’ lexical processing fluency. All these demonstrate the usefulness of a megastudy database for investigating the effects of lexical variables on performance in a lexical processing task.

Why investigate speeded naming in Chinese?

In the speeded naming task, participants read aloud a visually presented word as quickly and as accurately as they can. In the present study, we extended Tse et al.’s (2017) database by collecting participants’ normed naming responses for their large pool of 25,000+ two-character Chinese words. We also conducted analyses on both the normed lexical decision data and the naming data in Tse et al.’s database to test whether their reported lexical effects might reflect general word recognition processes or merely idiosyncratic task demands.

Lexical effect on speeded naming performance

As few studies have examined the speeded naming of two-character Chinese words, in the following review we also refer to the work reported on the speeded naming of single Chinese characters. For orthographic variables, Yu and Cao (1992, see also Liu et al., 2007) found that high-frequency characters (e.g., 花 flower) were named faster than low-frequency characters (e.g., 堃 compliance). Gao et al. (2016) reported that high-frequency words (e.g., 多謝 thank you) were named faster than low-frequency words (e.g., 睿哲 divinely wise). Number of strokes is often used to indicate visual complexity (e.g., Xing et al., 2004), wherein characters with more strokes are considered visually more complex (e.g., 人 human vs. 鬱 depressed). Shen and Zhu (1994, see also Leong et al., 1987; Liu et al., 2007) demonstrated that characters with fewer strokes were named faster than those with more strokes.

For phonological variables, Hue et al. (1992, see also Liu et al., 2007) showed that phonologically regular characters (a character is pronounced the same as its phonetic radical, e.g., 叮ding1Footnote 1 and 丁ding1) were named faster than irregular characters (a character and its phonetic radical do not share the pronunciation, e.g., 模mou4 and 莫mok6). Orthography-to-phonology mapping consistency refers to whether the mapping is systematic between a character’s phonetic radical and its pronunciation. For example, 換wun6 is a highly consistent character, as all other characters sharing 奐 as their phonetic radical are also pronounced as wun6, e.g., 喚 and 渙. In contrast, 板baan2 is less consistent, because characters with 反 as their phonetic radicals are pronounced differently, from 皈gwai1 to 飯faan6. Yang et al. (2009) showed that characters consistent with orthography-to-phonology mapping (e.g., 渙thaw out) were named faster than inconsistent characters (e.g., 飯rice). Phonological consistency refers to whether a character has one (consistent, e.g., 骨gwat1) or more than one (inconsistent, e.g., 重zung6 and cung5) pronunciation (Tan & Perfetti, 1999). Leong and Cheng (2003) found that words with consistent characters (e.g., 骨 in 骨頭 bone) were named faster than those with inconsistent characters (e.g., 重 in 體重 weight), whether the character was the first or second in the word. Finally, characters that share pronunciations with more other characters (i.e., high homophone density, e.g., 支) were named faster than those that share pronunciations with fewer other characters (i.e., low homophone density, e.g., 鏡) (e.g., Ziegler et al., 2000, but see the null effect of homophone density in Chang et al., 2016).

For semantic variables, characters that combine more readily with another character to form a word (i.e., larger neighborhood size, e.g., 花園 garden, 花盆 flowerpot, 花店 florist, 花叢 flower bush, 花市 flower market, 花椒 wild pepper, 花蜜 honey, 開花 to blossom, 花錢 to spend money, 花粉 pollen, 鮮花 fresh flower, etc., for 花) were named faster than those not as likely (i.e., smaller neighborhood size, e.g., only 鳳凰 phoenix for 凰) (e.g., Chang et al., 2016; Liu et al., 2007, see Baayen et al., 2006 for similar findings in English). Peng et al. (2003) reported that characters with more meanings (e.g., 行) were named faster than those with fewer meanings (e.g., 廚). Semantic transparency refers to whether words are semantically related (transparent, e.g., 黑板black-board [blackboard]) or unrelated to their characters (opaque, e.g., 東西east–west [thing]). It can be quantified by participants’ ratings for the semantic relatedness between a word and its characters (e.g., Tse et al., 2017). Although words with more semantically transparent characters yielded faster lexical decision performance than those with less semantically transparent characters (e.g., Myers et al., 2004; Tse et al., 2017), to our knowledge this variable has not been examined in the two-character Chinese word naming literature. In other languages like English, transparent words were reported to yield better lexical decision and naming performance than opaque words (e.g., Kim et al., 2019).

Most of the previous findings in Chinese character/word speeded naming have been based on factorial-design experiments, wherein a relatively small set of words constrained to be matched on various lexical characteristics were used. It is important to test whether these effects of lexical variables occur when a much larger pool of words is included in the megastudy approach. Assuming that the findings of naming single characters can be generalized to naming two-character words, we expect to replicate most, if not all, of the above findings.

The role of character variables in word processing

There has been debate on whether Chinese words are accessed holistically or analytically (by combining the characters) in the literature. Packard (1999) has argued that the access is via the whole word, as the large number of homophones and extent of semantic ambiguity in Chinese might make character processing too costly in terms of time and effort. However, this holistic view might be incompatible with findings that character variables, such as character frequency, in addition to word variables (e.g., word frequency), account for variance in lexical decision performance (e.g., Tse et al., 2017, see also the evidence of priming studies, e.g., Tsang & Chen, 2014). Moreover, Tse et al. showed that the effects of lexical variables on lexical decision performance were quite similar whether the characteristic was associated with the first or second character, suggesting that word processing may be parallel, with both characters being processed simultaneously. That said, it is not clear whether this parallel processing also occurs in naming. Specifically, the speeded naming task requires participants to read aloud the first and then second characters, and the voice key is triggered to record naming RT once they begin to pronounce the first character (i.e., speech onset). Unlike in English, where participants have to process the entire letter string to correctly pronounce the word, participants may not need to completely process the whole Chinese word before naming the first character of the word.Footnote 2 If so, we expect that character variables, especially those of the first character, would be more predictive of naming performance than word variables, suggesting that the holistic versus analytic nature of Chinese word processing depends on task demands. To our knowledge, this issue has not been addressed in factorial-design experiments. The megastudy approach with multiple regression analyses could compare the predictive power of the lexical variables of the first and second characters so as to shed light on this issue.

The role of orthographic, phonological, and semantic variables in word processing

In contrast to the English language, an alphabetic writing system, Chinese language possesses a logographic system with far more opaque orthography; that is, there is no regular mapping between spelling and sound. While the phonetic radical of about 80% of Chinese characters provides pronunciation hints (Hoosain, 1992), this depends on orthography-to-phonology mapping consistency (i.e., whether a character’s pronunciation is shared with other Chinese characters possessing the same phonetic radical). Hence, it is possible that phonology plays a less influential role in Chinese, compared with English, word naming. On the other hand, Perfetti and Tan (1999) argued that Chinese word recognition results from a convergence of orthography, phonology, and semantics. Because orthography-to-phonology mapping (often in one-to-one relationship, e.g., 表 pronounced as biu2) is more reliable than orthography-to-semantics mapping (often in one-to-many relationship, e.g., 表 can mean watch, express, surface, or meter), phonology is the privileged constituent in word recognition. Inconsistent with this view, based on Tse et al.’s (2017) normed lexical decision data, Tse and Yap (2018, see also Sze et al., 2015) reported that orthographic and semantic variables, respectively, accounted for more variance than phonological variables. This suggests that phonological activation, compared with semantic activation, is less influential in a task which does not explicitly require the generation of word phonology and instead emphasizes the processing of stimulus familiarity (e.g., Balota & Chumbley, 1984). Evidence supporting the role of phonological information in Chinese word recognition has often been based on the speeded naming task (e.g., Perfetti & Tan, 1999), which places more emphasis on the generation of phonology. By identifying the proportion of variance in naming performance accounted for by orthographic, phonological, and semantic variables, we expect that phonological variables play a much larger role than semantic variables in naming performance.

In the few studies that have explored task-specific differences between Chinese lexical decision and naming (e.g., Gao et al., 2016; Li et al., 2017), word frequency was reported to produce a larger effect in the lexical decision task. Given the demands of speeded naming, when activating the whole word, participants might also sequentially activate the phonological information for the first and then the second character (e.g., Li et al.). This contrasts with the lexical decision task, in which participants can respond only by processing the whole word. The first-character-focused naming process might somehow play a role in reducing the word frequency effect in Chinese lexical processing. This is consistent with the view that, relative to naming, lexical decision emphasizes more frequency-based information in discriminating between word and nonword (e.g., Balota & Chumbley, 1984). In English, studies reported task-specific effects in lexical decision and naming (e.g., Andrews & Heathcote, 2001; Balota & Chumbley, 1985; Balota et al., 2004). For instance, there was an influence of semantic variables in lexical decision (e.g., Yap & Balota, 2009), but semantic effects in naming were smaller (as compared with lexical decision, e.g., Balota et al., 2004) or restricted to low-frequency irregular words (e.g., Cortese et al., 1997), so semantic variables might have a greater influence on lexical decision than on naming.

Most of the findings in Chinese have been based on factorial-design experiments and should thus be tested for generalizability using the megastudy approach (see, e.g., Balota et al., 2004, for similar analyses in English). By conducting the same analyses on the current naming data and Tse et al.’s lexical decision data, we compare the influences of these variables between the two tasks. We expect phonological variables to play a larger role in naming than in lexical decision, and semantic variables to play a larger role in lexical decision than in naming. For example, the first-character frequency effect would be larger in naming than in lexical decision, whereas the word frequency effect would be larger in lexical decision than in naming.

Present study

To recapitulate, we expanded Tse et al.’s (2017) database by norming speeded naming measures for more than 25,000 two-character Chinese words and compiling more lexical variables to test the role of orthographic, phonological, and semantic variables (see Table 1) in naming performance and to compare those with lexical decision performance in the same item-level multiple regression analyses. We have the following predictions.

  1. (1)

    We replicate the lexical effects (e.g., word frequency and phonological consistency) of previous speeded naming studies in Chinese, even though most of them were based on single-character naming factorial-design experiments.

  2. (2)

    Given that participants serially activate individual character phonology when they begin to pronounce the two-character words, character variables (e.g., first-character frequency) will be more predictive of naming performance than word variables (e.g., word frequency). The first-character lexical variables will also be more predictive of naming performance than the second-character lexical variables.

  3. (3)

    By comparing lexical decision (Tse et al., 2017) and naming based on the proportion of variance explained in multiple regression analyses, we expect that phonological variables play a larger role than semantic variables in naming, and semantic variables play a larger role than phonological variables in lexical decision (i.e., similar to Tse & Yap’s, 2018, findings, based on Tse et al.’s, 2017, normed lexical decision data).

Table 1 Variables listed in the Excel .xlsx and Unicode .csv files

Method

To ensure comparability, the participants, material, design, and procedure followed those in our lexical decision megastudy (Tse et al., 2017).

Participants

Two hundred and ninety-seven native Cantonese-speaking students from the Chinese University of Hong Kong (CUHK), with self-reported normal or corrected-to-normal vision, participated in our study. Data from 19 additional participants were replaced due to their failure to participate in all experimental sessions. Participants were asked to report their gender (191 female), age (M = 19.73, SD = 1.43), amount of time they read Chinese materials per week (M = 210.43 minutes, SD = 460.82), cumulative grade-point average (M = 3.23, SD = .34),Footnote 3 self-rated knowledge in seven-point scales for traditional characters (M = 5.63, SD = .88) and spoken Cantonese (M = 5.99, SD = .90), and overall grades and sub-grades in Chinese language university entrance exam (overall: M = 4.64, SD = 1.22; reading comprehension: M = 4.82, SD = 1.52; writing: M = 4.28, SD = 1.56; listening and integrated skills: M = 5.19, SD = 1.41; speaking: M = 3.88, SD = 1.39).Footnote 4 We did not impose any restriction on participants’ Chinese proficiency, as substantial variability in proficiency may allow an exploration of individual differences in language processing (e.g., Lim et al., 2020, see also Yap et al., 2012). Each participant was paid 300 HKD (~38 USD) for their voluntary participation.

Materials and design

We include the words that were normed for lexical decision performance in the Chinese Lexicon Project (see Tse et al., 2017, for the selection procedure). Stimuli were all traditional Chinese characters typically used in Hong Kong and should thus be familiar to participants. The lexical variables are listed in Table 1 (see Tse et al., 2017s, for how semantic transparency was normed). All values of these variables are listed in the .xlsx/.csv file available at: https://osf.io/vwnps. Using Tse et al.’s procedure (see also the English Lexicon Project of Balota et al., 2007), we divided 25,281 two-character Chinese words into nine lists of 2809 words each, with a restriction that the proportion of the words with a specific first character in each subset roughly reflected those in the whole word pool. We collected data from 297 participants (i.e., 9 lists × 33 participants) to obtain 33 naming responses for each word. This number of observations per word is comparable to the mean of other normed naming databases of other languages (e.g., 25 in Balota et al.; > 25 in Ferrand et al., 2010; 31 in Goh et al., 2020; 39 in Keuleers et al., 2010; 39 in Keuleers et al., 2012) and allows us to obtain stable estimates of each word’s mean RTs. Each participant read aloud 936–937 words in each of the three sessions and thus in total 2809 words. The presentation order of word sets across the three sessions were counterbalanced across participants.

Procedure

Informed consent was obtained at the beginning of the study. Each participant was tested in a quiet cubicle in three sessions separated by no more than a week. PC-compatible computers with E-Prime 2.0 (Schneider et al., 2001) were used to display the stimuli and collect RT and accuracy data. (The participants’ naming duration was also recorded, but we do not consider them in the current work.) Stimuli were in white and visually presented on a black background, one at a time, at the center of the screen. In each session, 936–937 words were divided into four blocks of 234–235 words. At the end of each block, participants received a self-paced rest break, so there were three rest breaks in each session. Participants took about 55 min to complete each session. The presentation order of the four blocks within each session were randomized for each participant. Within each block, the presentation order of the words was also randomized for each participant.

Each trial started with a 500-ms fixation point *** at the center of the screen, followed by a 120-ms blank-screen interstimulus interval. A two-character Chinese word in font size 36 and font type 標楷體 DFKai-SB was presented at the same location as the fixation point until the participant made a response. The physical size of the word was about 2.5 cm in height by 5 cm in width. Given that participants sat 55 cm away from the screen, the visual angle was about 1.56°. To ensure that the current data reflected the young adults’ naming responses to the script that was most familiar to them, we presented all stimuli in traditional characters, following Tse et al. (2017). The participant was instructed to read aloud the word in Cantonese (i.e., their native language) as quickly and as accurately as possible to trigger the voice key connected via the Chronos response box to the computer. The Chronset algorithm (Roux, Armstrong, & Carreiras, 2016) was used to ensure the precision of onset detection. The word remained on the screen for an additional 500 ms after voice onset and then disappeared. If no response was detected in 4000 ms, the word disappeared and then a verbal signal 太慢!Too Slow! appeared for 500 ms. At the end of each trial, a blank screen was displayed for 1000 ms, to serve as an intertrial interval. Following Balota et al.’s (2007) procedure, before the 1000-ms intertrial interval, participants were instructed to manually code, in a self-paced manner, their own responses as correct pronunciation, mispronunciation, or microphone error. Participants were informed about the importance and use of these coding options. At the start of a session, participants were presented 10 practice words to become familiar with the task. These words did not appear as experimental words in that session.

Results

A phonetic bias might occur for an articulatory reason (some phonemes take more time to initiate) or acoustic reason (some phonemes take more time for the voice key to detect) (e.g., Liu et al., 2007). Following previous studies (e.g., Balota et al., 2004), we controlled for these variables in our analyses by coding the following 13 characteristics of the initial phoneme of the first characters as 1 (presence) or 0 (absence): affricative, alveolar, bilabial, labiodentals, dental, fricative, glottal, liquid, stop, nasal, palatal, velar, and voiced.

Only responses for experimental words were analyzed. Based on 297 participants, the mean accuracy rate (i.e., percentage of correct responses after excluding trials with microphone error) was 96.40% (SD = 2.41%). Following previous studies (e.g., Sze et al., 2014; Tse et al., 2017), remaining responses faster than 200 ms or slower than 3000 ms were excluded (about 0.41% of the trials). The mean and SD were then computed for each participant’s word responses. Any correct word response 2.5 SD above or below their mean was regarded as an outlier and excluded (about 3.31% of the trials). The mean RT of the trimmed correct word trials was 659.53 ms (SD = 186.71 ms). As suggested by Faust et al. (1999) and following previous megastudies (e.g., Balota et al., 2007; Ferrand et al., 2010; Ferrand et al., 2018; Keuleers et al., 2012; Sze et al., 2014; Tse et al., 2017; Yap et al., 2010), these raw RTs were transformed into z scores for each participant, before averaging across participants for each word to yield individual word zRT values. This standardization controls for differences in overall RT and variability between participants, which then helps to reduce noise in the data. Previous works reported that zRT was more reliable than raw RT (e.g., Ferrand et al., 2010). The level of significance was set at .05.

To determine the reliability of our dependent measures (RT, zRT, and accuracy rates), we followed Tse et al. (2017) and computed the split-half correlation, with a correction for length (i.e., about 33 observations per word) with the Spearman-Brown formula (2 × r)/(1 + r). This reflects the proportion of variance in a variable that can be explained. The corrected correlations (rcorr) between the dependent measures computed on the first half (N = 16) of participants who saw the word and the dependent measures computed on the second half (N = 17) of participants who saw the word were .82, .88, and .86 for RT, zRT, and accuracy, respectively. The finding that the reliability of zRTs is higher than the reliability of the raw RTs is in line with Faust et al.’s (1999) view that taking away differences in overall RT and variability between participants may remove noise from the data and does not artificially reduce the variability of the words.

Following Tse et al. (2017), we excluded words that yielded lower than 70% accuracy rates (about 2.31%). We ran item-level multiple regression analyses analogous to those that Tse et al. did for the lexical decision data, with mean zRT, averaged across participants, as the dependent variable. For the accuracy data,Footnote 5 since the values are bounded (from 0 to 100) and violate the assumptions of linear regression models (i.e., without boundaries), we first transformed each word’s mean accuracy rates to logit (i.e., unbounded score) and then ran the same item-level multiple regression analysis as that for mean zRT, using the binomial family under the generalized linear modeling function in R 4.1.3. Note that McFadden’s pseudo R2 values were computed for these accuracy analyses, which could not be directly compared with the R2 values for zRT. The values of the lexical variables as predictors were all mean-centered in all regression analyses (Jaccard & Turrisi, 2003). Table 1 presents all variables listed in the Excel .xlsx and Unicode .csv files, which are available at: https://osf.io/vwnps (Tse_et_al.xlsx and Tse_et_al.csv). Following previous megastudies (e.g., Tse et al., 2017), we entered the lexical variables in the following order: characteristics of initial phoneme of the first character (step 1), orthographic variables (number of strokes, the frequency of the first and second characters, and word frequency) (step 2), phonological variables (number of phonemes, homophone density, phonological consistency, phonological regularity,Footnote 6 and orthography-to-phonology mapping consistency of the first and second characters) (step 3), and semantic variables (neighborhood size, number of meanings, and semantic transparency of the first and second characters) (step 4). After taking into account the missing values of some words, the following analyses were based on 19,888 words (about 78.67% of 25,281 words in total).

Analyses of naming performance

Tables 2 and 3 report the descriptive statistics of and inter-correlational matrices among these lexical variables and the two behavioral measures (zRT and accuracy). There was no multicollinearity problem for orthographic, phonological, and semantic variables, as indicated by their overall moderate- to low inter-correlations (see Table 3). The variance inflation factors were also generally low (< 1.94) in all these variables.Footnote 7 The betas (standardized regression coefficients) of these lexical variables when they were first entered in the regression models and the R2 change (i.e., change in proportion of variance accounted for in zRT) or pseudo R2 change in accuracy at each step are reported in Table 4. The total proportions of variance accounted for were 51.6% and 19.1% for zRT and accuracy, respectively.

Table 2 Descriptive statistics of the lexical variables and two behavioral measures (zRT and accuracy) involved in item-level regression analyses
Table 3 Inter-correlational matrices among these non-centered lexical variables and the two behavioral measures (zRT and accuracy) involved in item-level regression analyses (N = 19,888)
Table 4 Standardized regression coefficients (beta values) on naming zRT and accuracy rate (N = 19,888)

Variance accounted for in naming zRT and accuracy

First, characteristics of the initial phoneme of the first character accounted for 10.3% and 0.2% of the variance in naming zRT and accuracy, respectively, suggesting that phonological coding at the first character’s phoneme level is a significant predictor of naming zRT, but much less so naming accuracy. This reflects that the coding of onsets may primarily influence response speed due to voice key sensitivity and articulation, rather than response accuracy. Thus, they were statistically controlled in the following analyses. Second, phonological variables accounted for much less variance in naming zRT than orthographic variables (2.1% vs. 37.9%) and slightly more variance in naming zRT than semantic variables (2.1% vs. 1.3%). A similar pattern was observed in naming accuracy. The modest amount of variance accounted for by phonological variables in Chinese naming relative to English naming (e.g., Balota et al., 2004) could be due to the deeper orthography in the Chinese language relative to the English language.

Predictions 1 and 2: Effect of lexical variables on naming zRT and accuracy

We test whether our findings replicate the lexical effects of previous speeded naming studies in Chinese, most of which were based on single-character naming factorial-design experiments. Our findings were generally consistent with those reported in previous works, although there are some exceptions, thereby being partially congruent with our Prediction 1.

For orthographic variables, naming responses were faster for words comprising characters with fewer strokes (e.g., 人) than those with more strokes (e.g., 體), consistent with previous findings (e.g., Shen & Zhu, 1994). The naming responses were faster and more accurate for higher-frequency words (e.g., 多謝 thank you), relative to lower-frequency words (e.g., 睿哲 divinely wise), and for those with higher-frequency characters (e.g., 花 flower), relative to those with lower-frequency characters (e.g., 堃 compliance). These findings are also consistent with earlier studies (e.g., Gao et al., 2016; Yu & Cao, 1992). Consistent with our Prediction 2, character variables (e.g., first-character frequency) were more predictive of naming performance, as reflected by larger regression coefficients (see Table 4), than word variables (e.g., word frequency). In addition, the effects of number of strokes and character frequency were larger for the first character than the second character.

For phonological variables, naming responses were faster and more accurate when the first or second character was phonologically consistent (e.g., 骨 is pronounced as gwat1 in all words with it, such as骨頭gwat1 tau4 [bone]), relative to phonologically inconsistent (e.g., 重 is pronounced as cung5 in 體重tai2 cung5 [weight], zung6 in 重要zung6 jiu3 [important], or cung4 in 重申cung4 san1 [reiterate]). This effect was in line with previous findings (e.g., Leong & Cheng, 2003), although the effect was similarFootnote 8 for the first and second characters. The naming responses were faster when the first character contained more phonemes (e.g., 框 kwaang1), relative to fewer phonemes (e.g., 之 zi1), and when the first character was phonologically regular (i.e., pronounced the same as its phonetic radical) than when it shared the same syllable but not the same tone with its phonetic radical (i.e., change of tone) or was phonologically irregular (i.e., pronounced differently from its phonetic radical). The naming responses were slower when the first character was phonologically irregular (vs. change of tone). The phonological regularity effect was in line with Hue et al. (1992) and it occurred in the first, but not the second, character. The naming responses were slower when the first character was higher in homophone density (e.g., 支zi1, which is also the pronunciation of 袛, 諮, 孜, 蜘, 吱, 芝, 茲, 肢, 滋, 脂, 姿, 枝, 資, 知, 之, and 恣...) than lower in homophone density (e.g., 鏡 geng3, a pronunciation is not shared with other characters), contradicting the positive or null effect of homophone density reported in previous studies (e.g., Chang et al., 2016; Ziegler et al., 2000). The null effect of orthography-to-phonology mapping consistency was inconsistent with the facilitatory effect reported by Yang et al. (2009). Other than phonological consistency, all significant effects of phonological variables on naming were larger for the first character than the second character, partially consistent with our Prediction 2.

For semantic variables, naming responses were faster when the first character was associated with larger neighborhood size (e.g., 花 flower, relative to smaller neighborhood size, e.g., 凰 phoenix) or more meanings (e.g., 行, relative to fewer meaning, e.g., 廚). These are consistent with previous findings (e.g., Chang et al., 2016; Peng et al., 2003). In contrast, naming responses were slower when the first character was more transparent in meaning. All these effects occurred only for the first character or were larger for the first character than the second character, consistent with our Prediction 2.

Prediction 3: Comparison between lexical decision and naming

To test our Prediction 3, we compare the effect of lexical variables on naming versus lexical decision by combining Tse et al.’s (2017) normed lexical decision data with the current naming data. After excluding words that yielded lower than 70% accuracy in lexical decision or naming data, we ran analyses similar to those we did for naming data based on 18,736 words (about 74.11% of 25,281 words in total). The betas of these lexical variables when they were first entered in the regression models and the R2 change for zRT and pseudo R2 change for accuracy at each step are reported in Table 5. The total proportions of variance accounted for were 36.5% and 20.1% for lexical decision zRT and accuracy, respectively, and 51.8% and 19.1% for naming zRT and accuracy, respectively.

Table 5 Standardized regression coefficients (beta values) on lexical decision and naming zRT and accuracy rate (N = 18,736)

Variance accounted for in lexical decision and naming zRT and accuracy

Consistent with previous naming studies in other languages (e.g., Yap & Balota, 2009), the characteristics of the initial phoneme of the first character accounted for more variance in naming zRT than in lexical decision zRT (10.9% vs. 0.2%, respectively) but were quite similar in naming accuracy (0.3%) and lexical decision accuracy (0.1%).

To compare the variance explained by orthographic, phonological, and semantic variables in lexical decision and naming, orthographic variables accounted for slightly more variance in naming zRT (37.6%) than lexical decision zRT (34.9%), but the difference was the opposite for accuracy (16.9% vs. 18.7%, respectively). Consistent with our Prediction 3, phonological variables accounted for slightly more variance in naming zRT/accuracy than lexical decision zRT/accuracy (2.1%/1.6% vs. 0.3%/0.1%, respectively). Semantic variables accounted for a similar proportion of variance in naming and lexical decision zRT (about 1.1–1.2%), although they accounted for slightly more variance in lexical decision accuracy than naming accuracy (1.2% vs. 0.3%, respectively). The latter finding was in line with our Prediction 3 and consistent with the emphasis of lexical decision on semantic information for word/nonword discrimination. Nevertheless, it is important to note that the differences in the contribution of orthographic, phonological, and semantic variables between lexical decision and naming were smaller in Chinese word processing than English word processing (e.g., Yap & Balota, 2009), all in all suggesting that similar word recognition processes/mechanisms might be used to process two-characters Chinese words in the lexical decision and naming tasks. This was attributed to the deep orthography of Chinese language. The overall variance patterns of naming and lexical decision findings based on this subset (N = 18,736) were the same as those reported above and in Tse and Yap (2018), respectively, although the word pools were not perfectly overlapping across studies, and transformed accuracy rates (in logits), instead of raw accuracy rate as in Tse and Yap, were used in the current study.

Effect of lexical variables on lexical decision and naming zRT and accuracy

The overall pattern of naming results was similar to those reported above, except that now both semantically transparent first and second characters, not just the semantically transparent first character, predicted slower naming zRT. Still consistent with our Prediction 2, the effect of semantic transparency was larger for the first character than for the second character. For the lexical variables shared between the two studies, the pattern of lexical decision findings was similar to that reported in Tse and Yap (2018), except that the number of strokes, character frequency, and phonological consistency no longer predicted lexical decision accuracy. It is noteworthy that, unlike Tse and Yap’s results, there was no speed–accuracy trade-off for the effect of number of strokes on lexical decision performance in the current study.Footnote 9

The comparison of the effects of lexical variables on lexical decision and naming showed the following findings (see Table 5).

First, we found that the character with more strokes slowed both lexical decision and naming zRT, although the effect was larger on naming than on lexical decision. Similarly, the facilitatory effect of character frequency was also larger on naming than on lexical decision. This effect was also larger in the first character than in the second character for naming, but the effect was of similar magnitude between characters for lexical decision. In contrast, the facilitatory effect of word frequency was larger on lexical decision than on naming. Similar results regarding word frequency were obtained in the English Lexicon Project (Balota et al., 2004).

Second, the facilitatory effect of the first character with more phonemes was larger for naming zRT than for lexical decision zRT. While the homophone density of the first and second characters sped up lexical decision responses, the homophone density of the first character, but not second character, slowed naming responses. The facilitatory effect of phonological consistency on accuracy was larger in naming than in lexical decision, although the effect was similar in size for lexical decision and naming zRT. There was a facilitatory effect of first-character phonological regularity on naming, but not on lexical decision. For the character that shared the same syllable but different tone with its phonetic radical (i.e., “change of tone”), we obtained faster naming zRT for these “change-of-tone” first characters (e.g., 胖bun6, which shares the same syllable but not the same tone with its phonetic radical 半bun3) than phonologically irregular first characters (e.g., 耀jiu6, which does not share pronunciation with its phonetic radical 翟zaak6). The lexical decision zRT was also faster for “change-of-tone” second character than phonologically irregular second character.

Finally, the facilitatory effect of first and second character’s neighborhood size was larger in naming zRT than in lexical decision zRT.Footnote 10 The words with more first-character meaning sped up naming responses, whereas those with more second-character meaning slowed down lexical decision responses. Whereas the first and second character’s semantic transparency facilitated lexical decision zRT and accuracy, they slowed naming zRT (especially the first character’s one).

Discussion

The purpose of the current study was twofold. First, using a megastudy approach, we expanded Tse et al.’s (2017) database, which was developed for lexical decision performance, by norming speeded naming RT and accuracy rates for more than 25,000 traditional Chinese two-character words. We also compiled more lexical variables (e.g., phonological consistency and semantic neighborhood size). Second, we performed item-level multiple regression analyses to test the relative predictive power of orthographic variables (e.g., stroke count), phonological variables (e.g., phonological consistency), and semantic variables (e.g., semantic transparency) in naming and compared these with those in lexical decision, based on Tse et al.’s normed data, to determine whether specific lexical effects are task-specific or task-general. We had three major predictions. Prediction 1: We expected to replicate the lexical effects of previous Chinese speeded naming studies. Prediction 2: Given the serial nature of the word pronunciation, we expected that the character variables, especially the first-character variable, would be more predictive of naming performance than word variables. Prediction 3: In the comparison for the proportion variance explained by phonological and semantic variables between lexical decision and naming, we expected phonological variables to account for larger variance than semantic variables in naming, but expected the opposite pattern to occur in lexical decision. In the following, we discuss the findings in response to the above predictions as well as their general implications for Chinese lexical processing.

Prediction 1: Replication of benchmark findings in speeded naming

The current naming data generally replicate the standard lexical effects in Chinese naming literature. Naming performance, as indicated by faster zRT and/or higher accuracy, was facilitated as a function of word frequency (e.g., Gao et al., 2016), both characters’ character frequency (e.g., Yu & Cao, 1992), phonological consistency (e.g., Leong & Cheng, 2003), and neighborhood size (e.g., Chang et al., 2016), and the first character’s number of phonemes, number of meanings, and phonological regularity (e.g., Hue et al., 1992). In contrast, naming performance was inhibited as a function of number of strokes for both characters (e.g., Shen & Zhu, 1994) and as a function of homophone density and semantic transparency for the first character. The inhibitory effect of homophone density was inconsistent with that reported in previous studies (e.g., Chang et al., 2016; Ziegler et al., 2000). Also, unlike Yang et al. (2009), orthography-to-phonology mapping consistency did not predict naming performance.

It is noteworthy that most of the previous findings on Chinese speeded naming were based on single characters; that is, participants read aloud the single characters shown on the screen, rather than two-character words as in the present study. Psycholinguistic effects might not necessarily be generalizable from single-character studies to multiple-character studies. When the character appears in a word context, the lexical characteristics of word and/or another character might moderate the processing of the character being pronounced. For example, naming a character in a high-frequency word might be less affected by its lexical characteristics than when the same character appears in a low-frequency word, as word familiarity might facilitate the activation of the whole word and in turn its characters. On the other hand, given the sequential (i.e., first, then second character) processing implicated in word naming, the influence of the lexical characteristics of the second character might be preempted by that of the lexical characteristics of the first character and the whole word. These possibilities could be further investigated by comparing the lexical effects on lexical decision and naming in the isolated context versus word context. Moreover, in the regression analyses, the interaction terms could be added to test the moderating role of a specific word’s/character’s variables. Nevertheless, the results of the lexical effects in our normed naming data provide useful empirical benchmarks for the Chinese word processing literature.

Prediction 2: Predictive power of character versus word variables and the first-character versus second-character variables in speeded naming

By comparing the differences in the contribution of the lexical effect of the words and/or the first and second characters on speeded naming, we may determine whether the pronunciation of the Chinese words is activated holistically or through the combination of the characters, and whether the influence of the first character is stronger than that of the second character. For the effects of the character’s versus the word’s lexical variable, we compare the character and word variables at the same (orthographic) level, that is, the effect of character frequency versus the effect of word frequency. Consistent with our Prediction 2, the effect of word frequency was smaller than the effect of character frequency, especially the first character’s frequency, in both zRT and accuracy (see Table 4). For the effects of the first character’s versus second character’s lexical variable, we found that the significant lexical effects, with the exception of phonological consistency, of the first character on naming performance were stronger than those of the second character (see the regression coefficients in Table 4), again in line with our Prediction 2. These suggest that when participants read aloud a word, they first processed the first character, followed by the second character, rather than processing them simultaneously. Given that the voice key was triggered to record naming RT once participants began to pronounce the first character (i.e., speech onset), their naming responses may be more influenced by the lexical characteristics of the first character than those of the second character and the word. This result is consistent with the analytic view (e.g., Tse et al., 2017), but not the holistic view (e.g., Packard, 1999) of Chinese word processing. At the very least, these findings show that the holistic versus analytic nature of word processing is flexible and depends on task demands. This view is similar to word processing in English in that the meaning of a word is simultaneously and independently accessed via direct retrieval of a whole-word representation, which is more sensitive to word frequency, and a decomposition-then-composition process of constituent (character) representations, which is more sensitive to constituent variables (e.g., Baayen et al., 1997; Libben & Jarema, 2007). Other character-level and word-level lexical variables (e.g., word imageability) should be considered in future research to further test the holistic versus analytic nature of Chinese word processing.

Prediction 3: Predictive power of phonological versus semantic variables in speeded naming versus lexical decision

In the Chinese language, the orthography-to-phonology mapping is nearly deterministic (i.e., often in a one-to-one relationship, e.g., 表pronounced as biu2), whereas the orthography-to-semantics mapping is under-deterministic (i.e., often in a one-to-many relationship, e.g., 表can mean watch, express, surface, or meter). Perfetti and Tan (1998, 1999) argued that the one-to-one relation can be more quickly established in Chinese word recognition, so phonology should play a stronger role than semantics in Chinese word processing (see also Tan & Perfetti, 1999, for a model of visual recognition of two-character Chinese words). Tse and Yap (2018) analyzed Tse et al.’s (2017) normed lexical decision data and showed that orthographic and semantic variables, respectively, accounted for more variance than phonological variables, although some could point out that the lexical decision task does not require the generation of word phonology. Based on the R2 or pseudo R2 changes in each step (see Tables 4 and 5), orthographic variables (e.g., character and word frequency) were associated with the strongest predictive power on naming performance, followed by phonological variables; semantic variables accounted for the least variance in our analyses. Phonological variables did account for more variance than semantic variables in predicting naming performance, consistent with Tan and Perfetti’s view. However, the proportion of variance accounted for was still low for phonological variables (smaller than 2.5%) even when the naming task explicitly tapped character phonology, suggesting that the contribution of word phonology might not be as large as what Tan and Perfetti proposed.

Few studies have directly compared the effect of lexical variables on lexical decision versus naming for two-character Chinese words. Consistent with these previous findings (e.g., Gao et al., 2016; Li et al., 2017), we found that word frequency was a stronger predictor for lexical decision performance (−.545) than naming performance (−.151, see Table 5). This is compatible with the view that lexical decision emphasizes more frequency information, an index of whole-word familiarity, when participants discriminate words from nonwords (e.g., Balota & Chumbley, 1984). On the other hand, because participants read aloud the two-character word beginning from the first character, the first-character-focused naming process might reduce the influence of the word frequency effect in Chinese lexical processing. In fact, when comparing the effects of the first versus second character’s lexical variables on lexical decision and naming performance (see Table 5), we did find that the predictive power was stronger for the first character’s variable than the second character’s variable in naming zRT (e.g., −.405 vs. −.210 in character frequency, −.144 vs. .005 in the number of phonemes, and .034 vs. .017 in semantic transparency). In contrast, the difference in predictive power of the first versus second character’s variable was not as large in lexical decision (e.g., −.067 vs. −.065 in character frequency, −.021 vs. −.002 in the number of phonemes, and −.065 vs. −.063 in semantic transparency). The significant predictive power of character variables showed that words are processed not merely holistically (as reflected by word variables), but also analytically (as reflected by character variables). The different patterns of predictive power of the first versus second character’s orthographic, phonological, and semantic variables in the lexical decision and naming tasks suggest that the influence of first versus second character’s variables depends heavily on the task demand, regardless of the lexical characteristics of the characters/words.

The importance of task demands could be observed in the relative contribution of the phonological and semantic variables on naming and lexical decision performance. While orthographic variables accounted for much more variance than semantic and phonological variables in both lexical decision and naming performance, semantic variables accounted for more variance than phonological variables in lexical decision, and phonological variables accounted for more variance than semantic variables in naming. While these are consistent with our Prediction 3, semantic variables accounted for just slightly more variance in lexical decision accuracy (.012) than in naming accuracy (.003) but similar variance when the zRT measure was considered (.011 vs. .012) (see Table 5). This suggests that phonological variables clearly contributed more to naming performance than to lexical decision performance, but the contribution of semantic variables was quite similar in the performance of the two tasks. This was in contrast to those obtained in English, where the influence of semantic variables was more salient in lexical decision than in naming (e.g., Chang et al., 2019; Cortese & Khanna, 2007; Cortese et al., 1997; Yap & Balota, 2009). Semantic variables might have more influence on Chinese naming than on English naming because more than half of Chinese characters have no systematic orthography-to-phonology mappings (Chang et al., 2016), such that readers might rely on semantics when pronouncing Chinese characters (see also Chang & Lee, 2018).

Finally, it is noteworthy that there was a marked contrast in the role of semantic transparency in lexical decision versus naming performance. While semantic transparency facilitated lexical decision, the same variable, especially for the first character, had an inhibitory effect on naming. Following the general dual-route model of compound word processing (e.g., Libben & Jarema, 2007), the meaning of a word can be accessed via direct retrieval from semantic memory or via decomposition, with the meanings of its characters first activated and then combined to obtain the word meaning. The outputs from direct retrieval and decomposition routes are similar for transparent words (e.g., 花園garden). In contrast, for opaque words at least one of the characters is unrelated to the whole word, so there is a conflict between the two routes (e.g., combined meaning of the two unrelated characters, 花flower and 生grow versus meaning of the word, 花生peanut). In lexical decision, this conflict may trigger participants to do post-lexical checking to confirm the word lexicality (see, e.g., Balota & Chumbley, 1984), thereby slowing the word recognition process for opaque, relative to transparent, words and producing the semantic transparency effect (see, e.g., Kim et al., 2019). In naming, there was no need for post-lexical checking, as participants read aloud the word character-by-character, so the conflict in the meaning of whole word and its characters likely did not influence the naming responses. That being said, participants named opaque words faster than transparent words. This negative semantic transparency effect was unexpected and should be further examined in future research.

Comparison with Chang and Lee’s (2020) and Sun et al.’s (2018) findings

Before concluding the current study, two recent studies using the megastudy approach to examine the influence of lexical variables on lexical decision and naming of Chinese characters/words are worth discussing in the context of our current findings.

Using 3314 traditional Chinese single characters as their stimuli, Chang and Lee (2020) examined the predictive power of age of acquisition, as well as semantic variables, in naming and lexical decision of traditional Chinese characters. Contrary to the findings in English (e.g., Yap & Balota, 2009), they found that semantic variables had higher predictive power in naming than in lexical decision. These findings were also incongruent with our current findings that semantic variables had similar predictive power in naming zRT and lexical decision zRT. In fact, semantic variables even accounted for slightly more variance in lexical decision accuracy than in naming accuracy. However, it is noteworthy that phonological and semantic variables were defined differently between the two studies. Indeed, there were only two lexical variables in common in Chang and Lee and the current study.Footnote 11 Character frequency, defined as an orthographic variable in the current study but a semantic variable in Chang and Lee, was more predictive of naming than lexical decision in both studies. Moreover, the facilitatory effect of character frequency was also consistently obtained in the two studies. On the other hand, we found that number of strokes was more predictive of naming than lexical decision, contrary to the absence of such task differences reported by Chang and Lee.

Apart from little overlap in the lexical variables in the analyses, there were a number of stimulus and procedural differences that might complicate the direct comparison of the findings of the two studies. First, whereas Chang and Lee (2020) used single characters as stimuli, we used two-character words. Second, their participants read aloud the characters in Mandarin, instead of Cantonese as in the current study. Third, phonological variables were quantified based on different dialects (Mandarin and Cantonese) in the two studies. Fourth, as mentioned above, most of the lexical variables were not shared between the two studies. For instance, concreteness, imageability, and age of acquisition were included in Chang and Lee but not in the current study, whereas semantic transparency and neighborhood size were included in the current study but not in Chang and Lee. Hence, future research can be directed at determining whether some of these differences contributed to the above discrepancies in the findings in the two studies.

Unlike Chang and Lee (2020), which focuses on the comparison between lexical decision and naming of single traditional characters, Sun, Hendrix, Ma, and Baayen (2018) included single-character, two-character, three-character, and four-character words in their Chinese lexical database for simplified Chinese and performed analyses on existing lexical decision and naming data in other megastudies (e.g., naming data in Sun, 2016, as cited in Sun et al., 2018 and lexical decision data in Tse et al., 2017 and Tsang et al., 2018). It is noteworthy that the lexical variables in Sun et al. were not the same as those in the current study, as they were based on Mandarin dialect and simplified-script characters, in contrast to Cantonese dialect and traditional-script characters in the current study. Given the difference in script, Sun et al. restricted the evaluation of their measures to the subset of the two-character words in Tse et al. (2017), for which the written form is identical in simplified and traditional Chinese, such that the number of words was reduced from 25,281 to 8005. This limits the number of words in their analyses. Sun et al.’s naming data were provided by a single participant and based on 25,935 two-character words. In the following, we compare the current findings with Sun et al.’s results on single characters and two-character words. We also consider their single-character data, as we did with Chang and Lee and other studies that did not use the megastudy approach.

For single-character lexical decision data, in line with the current study, Sun et al. (2018) reported the facilitatory effect of character frequency and inhibitory effect of number of strokes. For two-character lexical decision data, consistent with the current study, Sun et al. also found the facilitatory effect of word frequency and character frequency, with the former effect being larger than the latter. The effects of first-character and second-character frequency were also similar in magnitude. Although Sun et al. did not obtain a significant effect of stroke count, this could be due to the restricted set of stimuli that was used in their study and/or that the effect size of this variable was rather small, as indeed reflected by their relatively small regression coefficients in our analyses (see Table 5).

For single-character naming data, in line with the current findings, Sun et al. (2018) found a facilitatory effect of character frequency and inhibitory effect of number of strokes. For two-character naming data, Sun et al. also found a facilitatory effect of character frequency and word frequency. However, the effects of character frequency (beta = −.025 for the first character and −.012 for the second character) were similar to or even smaller than the effect of word frequency (beta = −.021), in contrast to the current study in which the effect of first-character or second-character frequency was larger than the effect of word frequency (see Tables 4 and 5). However, the finding that the effect of first-character frequency was stronger than that of second-character frequency was in line with our current findings. Consistent with the current study, Sun et al. also found that the naming responses were slowed with the number of strokes, with the effect being larger for the first characters than the second characters.Footnote 12

Unlike other studies (including the current one), Sun et al. (2018) used information-theoretic measures to examine how the uncertainty of characters at the word level (e.g., entropy and conditional probability) influenced lexical decision and naming performance. They obtained a facilitatory effect of first-character entropy, suggesting that the greater uncertainty about the second character given the first character could trigger faster lexical decision and naming responses. Although this lexical decision finding was in contrast to the findings in the English language (e.g., Schmidtke et al., 2016; Hendrix et al., 2017), it highlights the importance of taking the combinatorial properties of characters into account when investigating lexical processing above the character level in future studies.

Conclusion and future directions

Motivated by the megastudy approach (e.g., Balota et al., 2013), the current extension of the Chinese Lexicon Project to speeded naming data represents an important addition to the different existing lexicon projects (e.g., English, Dutch, French, and Malay) being rapidly developed across the world and in line with the current research zeitgeist. Given that there are more native speakers of Chinese than any other language and that two-character words are the most common type of word encountered in Chinese reading, a two-character Chinese word speeded naming database can significantly contribute to research in psycholinguistics and other research domains using Chinese word stimuli. In the .xlsx/.csv files available at: https://osf.io/vwnps, we make the item-level data freely accessible to the research community, which allows researchers to search for naming RT, zRT, and accuracy rate for words and descriptive statistics of lexical variables (Table 1) for characters and words. This database serves as a critical resource of lexical characteristics and behavioral measures for future research. Researchers could perform item-level regression analyses to test the higher-order interactions among lexical variables (e.g., word frequency × character frequency interaction, to determine whether character frequency effect might be particularly salient for certain levels of word frequency), the evidence of which might be mixed in previous factorial-design experiments. This may address some theoretical issues in Chinese word processing, such as whether words and their characters are represented at the same level in the Chinese mental lexicon, whether words are accessed as a whole or via their characters, and the extent to which phonological and semantic variables interact to influence Chinese naming, which will have implications on the models of Chinese word recognition (see, e.g., Tse & Yap, 2018, for an example based on Tse et al.’s, 2017, normed lexical decision data).

Researchers could add other variables to our dataset to make it more comprehensive. For instance, previous studies showed that characters with higher phonological frequency (i.e., those with higher cumulative character frequency for their homophone mates) were named more slowly than those with lower phonological frequency (e.g., Ziegler et al., 2000, but see Chen et al., 2009). Chang and Lee (2020) showed that, after controlling for semantic variables, age of acquisition accounted for more variance in naming than in lexical decision for single characters. In addition, this variable interacted with phonological consistency in predicting character naming. Huang et al. (2006) showed that characters with higher summed frequency of all words in which a character occurred were named more slowly than those with lower summed frequency. It is important to test whether all these findings could be generalized in the current dataset (i.e., a large pool of two-character words with traditional script and Cantonese-speaking participants). The present large-scale data for various lexical variables and speeded naming performance could also facilitate the development of computational models of Chinese lexical processing (e.g., Shuai & Malins, 2017; Yang et al., 2009), with their computer simulation providing greater clarity and transparency than traditional descriptive models. Apart from addressing theoretical questions related to word recognition models, the current data, together with those developed by Tse et al. (2017) for lexical decision, could be regarded as normative data for traditional Chinese word recognition by native Cantonese-speaking university students and utilized to build Chinese proficiency tests. Using the findings from the French Lexicon Project (Ferrand et al., 2010), Brysbaert (2013) selected French words and nonwords varying in difficulty level and constructed a short French language proficiency test (LEXTALE-FR), which yielded good psychometric properties and can be used to assess French proficiency for both native and second language learners. Researchers could explore the possibility of developing a Chinese proficiency test with receptive (word recognition) and production (word pronunciation) components, based on lexical decision and speeded naming data, respectively. The large word pool in these databases would make it possible to construct multiple forms of tests that involved different sets of words and nonwords with similar difficulty levels, such that readers, be they first or second language learners, could be assessed without repeating the same set of words.

Open Practices Statement

The data are available at: https://osf.io/vwnps, and none of the experiment was preregistered.