Advertisement

Psychological Research

, Volume 81, Issue 3, pp 696–708 | Cite as

www.kanjidatabase.com: a new interactive online database for psychological and linguistic research on Japanese kanji and their compound words

  • Katsuo Tamaoka
  • Shogo Makioka
  • Sander Sanders
  • Rinus G. VerdonschotEmail author
Original Article

Abstract

Most experimental research making use of the Japanese language has involved the 1945 officially standardized kanji (Japanese logographic characters) in the Jōyō kanji list (originally announced by the Japanese government in 1981). However, this list was extensively modified in 2010: five kanji were removed and 196 kanji were added; the latest revision of the list now has a total of 2136 kanji. Using an up-to-date corpus consisting of 11 years’ worth of articles printed in the Mainichi Newspaper (2000–2010), we have constructed two novel databases that can be used in psychological research using the Japanese language: (1) a database containing a wide variety of properties on the latest 2136 Jōyō kanji, and (2) a novel database containing 27,950 two-kanji compound words (or jukugo). Based on these two databases, we have created an interactive website (www.kanjidatabase.com) to retrieve and store linguistic information to be used in psychological and linguistic experiments. The present paper reports the most important characteristics for the new databases, as well as their value for experimental psychological and linguistic research.

Keywords

Lexical Item Linguistic Research Compound Word Proper Noun Japanese Language 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Notes

Acknowledgments

The present work was supported by the Grant-in-Aid for Challenging Exploratory Research, JSPS Grant number 25580112 (principal researcher: Katsuo Tamaoka), by the Grant-in-Aid for Grant-in-Aid for Scientific Research (C), JSPS Grant Number 15K02656 (principal researcher: Kazuko Komori), and a Grand-In-Aid for JSPS postdoctoral fellows (12F02315) and a JSPS Research Activity Start-Up Grant (15H06687) to Rinus G. Verdonschot.

References

  1. Amano, S., & Kondo, T. (1999). NTT deeta beesu siriizu: Nihongo no goi tokusei—Dai 1-ki [NTT database series: Lexical properties in Japanese, the first period]. Tokyo: Sanseido.Google Scholar
  2. Amano, S., & Kondo, T. (2000). NTT deeta beesu siriizu: Nihongo no goi tokusei—Dai 2-ki [NTT database series: Lexical properties in Japanese, the second period]. Tokyo: Sanseido.Google Scholar
  3. Atsuji, T. (1988). Kanji-no bunrui: Rikusho-o chuushin toshite [Kanji classification: focusing on six classifications]. In K. Sato (Ed.), Kanji kooza 1: Kanji towa [Kanji lecture series 1: what is kanji?] (pp. 49–69). Tokyo: Meiji Shoin.Google Scholar
  4. Balota, D. A., & Spieler, D. H. (1999). Word-frequency, repetition, and lexicality effects in word recognition tasks: beyond measures of central tendency. Journal of Experimental Psychology: General, 128, 32–55.CrossRefGoogle Scholar
  5. Barry, C., Hirsh, K. W., Johnston, R. A., & Williams, C. L. (2001). Age of acquisition, word frequency, and the locus of repetition priming of picture naming. Journal of Memory and Language, 44, 350–375.CrossRefGoogle Scholar
  6. Barry, C., Morrison, C. M., & Ellis, A. W. (1997). Naming the Snodgrass and Vanderwart pictures: effects of age of acquisition, frequency and name agreement. Quarterly Journal of Experimental Psychology, 50A, 560–585.CrossRefGoogle Scholar
  7. Brown, H., & Rubenstein, C. R. (1961). Test of response bias explanation of word-frequency effect. Science, 133, 280–281.CrossRefPubMedGoogle Scholar
  8. Chen, H. C., Cheung, H., & Lau, S. (1997). Examining and reexamining the structure of Chinese–English bilingual memory. Psychological Research, 60(4), 270–283.CrossRefPubMedGoogle Scholar
  9. Chikamatsu, N. (2005). L2 Japanese kanji memory and retrieval: An experimental on the tip-of-the-pen (TOP) phenomenon. In V. Cook & B. Bassetti (Eds.), Second language writing (pp. 71–96). New York: Multilingual Matters Ltd.Google Scholar
  10. Flores d’Arcais, G. B., & Saito, H. (1993). Lexical decomposition of complex Kanji characters in Japanese readers. Psychological Research, 55, 52–63.CrossRefGoogle Scholar
  11. Flores d’Arcais, G. B., Saito, H., & Kawakami, M. (1995). Phonological and semantic activation in reading kanji characters. Journal of Experimental Psychology Learning Memory and Cognition, 21, 34–42.CrossRefGoogle Scholar
  12. Frith, U. (1981). Experimental approaches to developmental dyslexia: an introduction. Psychological Research, 43(2), 97–109.CrossRefPubMedGoogle Scholar
  13. Gordon, B. (1983). Lexical access and lexical decision: mechanisms of frequency sensitivity. Journal of Verbal Learning and Verbal Behavior, 22, 24–44.CrossRefGoogle Scholar
  14. Haig, J. H. (1997). The new Nelson Japanese–English character dictionary: based on the classic edition by Andrew N. Nelson. Tokyo: Tuttle Publishing.Google Scholar
  15. Higuchi, H., Moriguchi, Y., Murakami, H., Katsunuma, R., Mishima, K., & Uno, A. (2016). Neural basis of hierarchical visual form processing of Japanese Kanji characters. Brain and Behavior,. doi: 10.1002/brb3.413.Google Scholar
  16. Hino, Y., & Lupker, S. J. (1998). The effects of word frequency for Japanese Kana and Kanji words in naming and lexical decision: can the dual-route model save the lexical-selection account? Journal of Experimental Psychology Human Perception and Performance, 24, 1431–1453.CrossRefGoogle Scholar
  17. Hino, Y., Miyamura, S., & Lupker, S. J. (2011). The nature of orthographic–phonological and orthographic–semantic relationships for Japanese kana and kanji words. Behavior Research Methods, 43, 1110–1151.CrossRefPubMedGoogle Scholar
  18. Horiguchi, J. (1989). Kanji no hitsujun [Stroke order of kanji]. In Y. Takebe (Ed.), Nihongoto nihongo kyooiku: Dai-8-kan. Nihongono moji hyooki (Joo) [Japanese and Japanese education: Vol. 8. Japanese writing system, No. 1] (pp. 97–124). Tokyo: Meiji Shoin.Google Scholar
  19. Jescheniak, J. D., & Levelt, W. J. M. (1994). Word frequency effects in speech production: retrieval of syntactic information and of phonological form. Journal of Experimental Psychology Language Memory and Cognition, 20, 824–843.CrossRefGoogle Scholar
  20. Jincho, N., Feng, G., & Mazuka, R. (2014). Development of text reading in Japanese: an eye movement study. Reading and Writing, 27(8), 1437–1465.CrossRefGoogle Scholar
  21. Kaiho, H., & Nomura, Y. (1983). Kanji joohoo shori no shinrigaku [Psychology of kanji information processing]. Tokyo: Kyoiku Shuppan.Google Scholar
  22. Kess, J. F., & Miyamoto, T. (1999). The Japanese mental lexicon: psycholinguistic studies of kana and kanji processing. Amsterdam: John Benjamins.Google Scholar
  23. Komori, K., Tamaoka, K., Saito, N., & Miyaoka, Y. (2014). Dai-2-gengo tosite Nihongo-o manabu chuugokugo wasya no nihongo no kanjigo no shuutoku ni kansuru koosatsu. Acquisition of Japanese kanji compound words by Chinese native speakers learning Japanese as a second language. Chuugoku-go washa no tameno nihongo kyooiku kenkyuu [Studies on Japanese language education for native Chinese speakers], 5, 1–16.Google Scholar
  24. Kudo, T., Yamamoto, K., & Matsumoto, Y. (2004). Applying conditional random fields to Japanese morphological analysis. In: Proceedings of the 2004 conference on empirical methods in natural language processing (EMNLP-2004) (pp. 230–237).Google Scholar
  25. Le Bigot, N., Passerault, J. M., & Olive, T. (2009). Memory for words location in writing. Psychological Research, 73(1), 89–97.CrossRefPubMedGoogle Scholar
  26. Leong, C. K., & Tamaoka, K. (1995). Use of phonological information in processing kanji and katakana by skilled and less skilled Japanese readers. Reading and Writing, 7, 377–393.CrossRefGoogle Scholar
  27. Leong, C. K., Cheng, P.-W., & Mulcahy, R. (1987). Automatic processing of morphemic orthography. Language and Speech, 30, 181–196.PubMedGoogle Scholar
  28. Luo, C., & Proctor, R. W. (2013). Asymmetry of congruency effects in spatial stroop tasks can be eliminated. Acta Psychologica, 143(1), 7–13.CrossRefPubMedGoogle Scholar
  29. Maekawa, K., Yamazaki, M., Ogiso, T., Maruyama, T., Ogura, H., Kashino, W., … Den, Y. (2014). Balanced corpus of contemporary written Japanese. Language Resources and Evaluation, 48, 345–371.CrossRefGoogle Scholar
  30. Miwa, K., Libben, G., & Baayen, R. H. (2012). Semantic radicals in Japanese two-character word recognition. Language and Cognitive Processes, 27(1), 142–158.CrossRefGoogle Scholar
  31. Miwa, K., Libben, G., Dijkstra, T., & Baayen, R. H. (2014). The time-course of lexical activation in Japanese morphographic word recognition: evidence for a character-driven processing model. Quarterly Journal of Experimental Psychology, 67, 79–113.CrossRefGoogle Scholar
  32. Morohashi, T. (2000). Dai Kanwa Jiten [The great Japanese kanji dictionary]. Tokyo: Taishukan.Google Scholar
  33. Morrison, C. M., & Ellis, A. W. (2000). Real age of acquisition effects in word naming and lexical decision. British Journal of Psychology, 91, 167–180.CrossRefPubMedGoogle Scholar
  34. Müller, H. M. (2010). Neurolinguistic findings on the language lexicon: the special role of proper names. Chinese Journal of Physiology, 53(6), 351–358.CrossRefPubMedGoogle Scholar
  35. Nelson, A. N. (1962). The original modern reader’s Japanese–English character dictionary (Classic ed.). Tokyo: Tuttle Publishing. (the former Charles E. Tuttle Company).Google Scholar
  36. Ono, F., & Kawahara, J. I. (2008). The effect of false memory on temporal perception. Psychological Research, 72(1), 61–64.CrossRefPubMedGoogle Scholar
  37. Proverbio, A. M., Mariani, S., Zani, A., & Adorni, R. (2009). How are ‘Barack Obama’ and ‘President Elect’ differentially stored in the brain? An ERP investigation on the processing of proper and common noun Pairs. PLoS One, 4(9), e7126.CrossRefPubMedPubMedCentralGoogle Scholar
  38. Saito, H., Masuda, K., & Kawakami, M. (1998). Form and sound similarity effects in kanji recognition. In C. K. Leong & K. Tamaoka (Eds.), Cognitive processing of the Chinese and Japanese languages (pp. 169–203). London: Kluwer Academic Publishers.CrossRefGoogle Scholar
  39. Saito, H., Masuda, K., & Kawakami, M. (1999). Subword activation in reading Japanese single kanji character words. Brain and Language, 68, 75–81.CrossRefPubMedGoogle Scholar
  40. Saito, H., Yamazaki, O., & Masuda, H. (2002). The effect of number of Kanji radical companions in character activation with a multi-radical-display task. Brain and Language, 81, 501–508.CrossRefPubMedGoogle Scholar
  41. Segui, J., Mehler, J., Frauenfelder, U., & Morton, J. (1982). The word frequency effect and lexical access. Neuropsychologia, 20, 615–627.CrossRefPubMedGoogle Scholar
  42. Shirakawa, S. (1994). Jitoo [Kanji etymology]. Tokyo: Heibonsha.Google Scholar
  43. Starreveld, P. A., La Heij, W., & Verdonschot, R. G. (2013). Time course analysis of the effects of distractor frequency and categorical relatedness in picture naming: an evaluation of the response exclusion account. Language and Cognitive Processes, 28, 633–654.CrossRefGoogle Scholar
  44. Taft, M. (1979). Recognition of affixed words and the word frequency effect. Memory and Cognition, 7, 263–272.CrossRefPubMedGoogle Scholar
  45. Taft, M., Huang, J., & Zhu, X. P. (1994). The influence of character frequency on word recognition responses in Chinese. In H.-W. Chang, J.-T. Huang, C.-W. Hue, & O. J. L. Tzeng (Eds.), Advances in the study of Chinese language processing (Vol. 1, pp. 59–73). Taipei: Department of Psychology, National Taiwan University.Google Scholar
  46. Taft, M., & Zhu, X. P. (1995). The representation of bound morphemes in the lexicon: a Chinese study. In L. B. Feldman (Ed.), Morphological aspects of language processing (pp. 293–316). Hillsdale: Lawrence Erlbaum Associates.Google Scholar
  47. Taft, M., & Zhu, X. P. (1997). Submorphemic processing in reading Chinese. Journal of Experimental Psychology Learning Memory and Cognition, 23, 761–775.CrossRefGoogle Scholar
  48. Tamaoka, K., & Altmann, G. (2004). Symmetry of Japanese kanji lexical productivity on the left- and right-hand sides. Glottometrics, 7, 68–88.Google Scholar
  49. Tamaoka, K., & Hatsuzuka, M. (1995). Kanzi niji jyukugo no shori niokeru kanji siyoohindo no eikyoo [The effects of kanji printed-frequency on processing Japanese two-morpheme compound words]. Dokusho Kagaku [The Science of Reading], 39, 121–137.Google Scholar
  50. Tamaoka, K., Kirsner, K., Yanase, Y., Miyaoka, Y., & Kawakami, M. (2002). A Web-accessible database of characteristics of the 1945 basic Japanese kanji. Behavior Research Methods Instruments and Computers, 34, 260–275.CrossRefGoogle Scholar
  51. Tamaoka, K., & Kiyama, S. (2013). The effects of visual complexity for Japanese kanji processing with high and low frequencies. Reading and Writing, 26(2), 205–223.CrossRefGoogle Scholar
  52. Tamaoka, K., & Makioka, S. (2004). New figures for a Web-accessible database of the 1945 basic Japanese kanji, fourth edition. Behavior Research Methods, Instruments and Computers, 36, 548–558.CrossRefGoogle Scholar
  53. Tamaoka, K., & Taft, M. (2010). The sensitivity of native Japanese speakers to On and Kun kanji readings. Reading and Writing, 23, 957–968.CrossRefGoogle Scholar
  54. Tamaoka, K., & Takahashi, N. (1999). Kanji niji jyukugo no shoji koodoo niokeru goi siyoo hindo oyobi shojiteki hukuzatsusei no eikyoo [The effects of word frequency and orthographic complexity on the writing process of Japanese two-morpheme compound words]. Sinrigaku Kenkyuu [The Japanese Journal of Psychology], 70, 45–50.CrossRefGoogle Scholar
  55. Tanaka, M. (2015). Japanese Kanji word processing for Chinese Learners of Japanese: a study of homophonic and semantic primed lexical decision tasks. Theory and Practice in Language Studies, 5(5), 900–905.CrossRefGoogle Scholar
  56. Todo, A. (2010). Kanji-gen Kaitei Dai-5-ban [Kanji Sources Revised Fifth Version]. Tokyo: Gakken.Google Scholar
  57. Toyoda, E. (2009). An analysis of L2 readers’ comments on kanji recognition. Electronic Journal of Foreign Language Teaching, 6, 5–20.Google Scholar
  58. Uno, A., Wydell, T. N., Haruhara, N., Kaneko, M., & Shinya, N. (2009). Relationship between reading/writing skills and cognitive abilities among Japanese Primary-School Children: normal readers versus poor Readers (dyslexics). Reading and Writing, 22, 755–789.CrossRefGoogle Scholar
  59. Valentine, T., Moore, V., & Brédart, S. (1995). Priming production of people’s names. The Quarterly Journal of Experimental Psychology Human Experimental Psychology, 48, 513–535.CrossRefGoogle Scholar
  60. Verdonschot, R. G., La Heij, W., Tamaoka, K., Kiyama, S., You, W.-P., & Schiller, N. O. (2013). The multiple pronunciations of Japanese kanji: a masked priming investigation. Quarterly Journal of Experimental Psychology, 66, 2023–2038.CrossRefGoogle Scholar
  61. Wang, L., Verdonschot, R. G., & Yang, Y. (2016). The processing difference between person names and common nouns in sentence contexts: an ERP study. Psychological Research, 80, 94–108.CrossRefPubMedGoogle Scholar
  62. Wu, J.-T., Chou, T.-L., & Liu, I.-M. (1994). The locus of the character/word frequency effect. In H.-W. Chang, J.-T. Huang, C.-W. Hue, & O. J. L. Tzeng (Eds.), Advances in the study of Chinese language processing (Vol. 1, pp. 31–58). Taipei: Department of Psychology, National Taiwan University.Google Scholar
  63. Yamada, J., Mitarai, Y., & Yoshida, T. (1991). Kanji words are easier to identify than katakana words. Psychological Research, 53(2), 136–141.CrossRefPubMedGoogle Scholar
  64. Yamato, Y., & Tamaoka, K. (2013). Chuugokujin nihongo gakushuusha niyoru gairaigo shori eno eigo rekisikon no eikyoo [Effects of English knowledge on the reading of Japanese texts via Japanese loanwords performed by native Chinese speakers learning Japanese]. Lexicon Forum, 6, 229–267.Google Scholar
  65. Yokosawa, K., & Umeda, M. (1988). Processes in human Kanji-word recognition. In: Proceedings of the 1988 IEEE international conference on systems, man, and cybernetics, August 8–12, 1988, Beijing and Shenyang, China, pp. 377–380.Google Scholar
  66. Yokoyama, S., Sasahara, H., Nozaki, H., & Long, E. (1998). Shinbun denshi media-no kanji: Asahi shinbun CD-ROM-ni yoru kanji hindo hyoo [Japanese kanji in the newspaper media: Kanji frequency index from the Asashi Newspaper on CD-ROM]. Tokyo: Sanseido.Google Scholar
  67. Yu, H., Gong, L., Qiu, Y., & Zhou, X. (2011). Seeing Chinese characters in action: an fMRI study of the perception of writing sequences. Brain and Language, 119(2), 60–67.CrossRefPubMedGoogle Scholar
  68. Zhou, X., & Marslen-Wilson, W. (1994). Words, morphemes and syllables in the Chinese mental lexicon. Language and Cognitive Processes, 9, 393–422.CrossRefGoogle Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2016

Authors and Affiliations

  • Katsuo Tamaoka
    • 1
  • Shogo Makioka
    • 2
  • Sander Sanders
    • 3
  • Rinus G. Verdonschot
    • 4
    Email author
  1. 1.Graduate School of Languages and CulturesNagoya UniversityNagoyaJapan
  2. 2.Osaka Prefecture UniversitySakaiJapan
  3. 3.Kumulus CentreMaastrichtThe Netherlands
  4. 4.Waseda Institute for Advanced Study (WIAS)Waseda UniversityTokyoJapan

Personalised recommendations