Journal of East Asian Linguistics, Volume 21, Issue 1, pp 89–114

Word-length preferences in Chinese: a corpus study



Are there preferred word-length combinations in Chinese? If there are, are they motivated by semantics, syntax, prosody, or a combination of these? While the issue has been discussed for some time, opinions remain divided. This study offers a quantitative analysis of word-length patterns in Chinese [N N] and [V O] sequences, using the Lancaster Corpus of Mandarin Chinese. It is found that 1+2 is overwhelmingly disfavored in [N N] and 2+1 is overwhelmingly disfavored in [V O]. In addition, it is found that apparent exceptions, ranging between 1 and 2%, are limited to certain specific structures, and when these are factored out, both 1+2 [N N] and 2+1 [V O] are well below 1% in either token count or type count. The result bears on several theoretical debates, such as the validity of word-length preferences in Chinese, the motivation of the preferences, the extent and the nature of exceptions, and the interaction among syntax, semantics, and phonology.
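The token and type counts mentioned above can be illustrated with a minimal sketch. It is not the study's actual procedure: the word pairs are invented toy data rather than items drawn from the Lancaster Corpus of Mandarin Chinese, the function names `length_pattern` and `pattern_shares` are hypothetical, and syllable count is approximated by the number of characters in each word.

```python
from collections import Counter


def length_pattern(first: str, second: str) -> str:
    """Return a pattern such as '2+1'.

    Assumption: each Chinese character counts as one syllable.
    """
    return f"{len(first)}+{len(second)}"


def pattern_shares(pairs):
    """Return (token_shares, type_shares) for a list of two-word pairs.

    Token shares count every occurrence of a pair; type shares count
    each distinct pair only once.
    """
    token_counts = Counter(length_pattern(a, b) for a, b in pairs)
    type_counts = Counter(length_pattern(a, b) for a, b in set(pairs))
    n_tokens = sum(token_counts.values())
    n_types = sum(type_counts.values())
    token_shares = {p: c / n_tokens for p, c in token_counts.items()}
    type_shares = {p: c / n_types for p, c in type_counts.items()}
    return token_shares, type_shares


# Invented toy [N N] pairs for illustration only (not corpus data).
toy_nn = [
    ("煤炭", "商店"),  # 2+2
    ("煤炭", "商店"),  # 2+2, a repeated token of the same type
    ("煤炭", "店"),    # 2+1
    ("煤", "店"),      # 1+1
    ("煤", "商店"),    # 1+2
]

token_shares, type_shares = pattern_shares(toy_nn)
for pattern in sorted(token_shares):
    print(pattern, token_shares[pattern], type_shares[pattern])
```

Reporting both measures matters because a single frequent expression can inflate the token share of a pattern without raising its type share.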


Keywords: Word-length preferences; Elastic word length; Corpus analysis; Syntax–phonology interaction; Prosody; Metrical structure





Copyright information

© Springer Science+Business Media B.V. 2011

Authors and Affiliations

  1. Department of Linguistics, University of Michigan, Ann Arbor, USA
