Abstract
Are there preferred word-length combinations in Chinese? If there are, are they motivated by semantics, syntax, prosody, or a combination of these? While the issue has been discussed for some time, opinions remain divided. This study offers a quantitative analysis of word-length patterns in Chinese [N N] and [V O] sequences, using the Lancaster Corpus of Mandarin Chinese. It is found that 1+2 is overwhelmingly disfavored in [N N] and 2+1 is overwhelmingly disfavored in [V O]. In addition, it is found that apparent exceptions, ranging between 1 and 2%, are limited to certain specific structures, and when these are factored out, both 1+2 [N N] and 2+1 [V O] are well below 1% in either token count or type count. The result bears on several theoretical debates, such as the validity of word-length preferences in Chinese, the motivation of the preferences, the extent and the nature of exceptions, and the interaction among syntax, semantics, and phonology.
Similar content being viewed by others
References
Anttila Arto, Matthew Adams, Michael Speriosu. (2010) The role of prosody in the English dative alternation. Language and Cognitive Processes 25(7–9): 946–981
Baayen, R. Harald, Richard Piepenbrock, and L. Gulikers. 1995. The CELEX lexical database: release 2 (CD-ROM). Philadelphia: Linguistic Data Consortium, University of Pennsylvania.
Bloomfield Leonard. (1926) A set of postulates for the science of language. Language 2(3): 153–164
Bybee Joan. (2006) Frequency of use and the organization of language. Oxford University Press, New York
Chen Matthew Y. (2000) Tone sandhi: Patterns across Chinese dialects. Cambridge, UK
Chomsky Noam, Morris Halle. (1968) The sound pattern of English. Harper and Row, New York
Chomsky, Noam, Morris Halle, and Fred Lukoff. 1956. On accent and juncture in English. In For Roman Jakobson, ed. Morris Halle, Horace Lunt, Hugh MacLean, and Cornelis van Schooneveld, 65–80. The Hague: Mouton.
Cinque Guglielmo. (1993) A null theory of phrase and compound stress. Linguistic Inquiry 24(2): 239–297
Davis, Stuart. 2005. Capitalistic v. militaristic: The paradigm uniformity effect reconsidered. In Paradigms in phonological theory, ed. Laura J. Downing, T.A. Hall, and Renate Raffelsiefen, 107–121. Oxford: Oxford University Press.
Duanmu, San. 1990. A formal study of syllable, tone, stress and domain in Chinese languages. PhD diss., MIT. Cambridge MA. [Distributed by MIT Working Papers in Linguistics.]
Duanmu San. (2007) The phonology of Standard Chinese. Oxford University Press, Oxford
Duanmu, San. 2012. Wordhood in Chinese. To appear in Encyclopedia of Chinese language and linguistics, ed. Wolfgang Behr, Gu Yueguo, Zev Handel, C.-T. James Huang, and Rint Sybesma. Leiden: Brill.
Feng, Shengli. 1997. Hanyu de yunlu, cifa yu jufa [Interactions between morphology, syntax and prosody in Chinese]. Beijing: Peking University Press.
Feng Shengli. (1998) Lun Hanyu de “ziran yinbu” [On “natural feet” in Chinese]. Zhongguo Yuwen 1((262): 40–47
Feng Shengli. (2000) Hanyu yunlu jufaxue [Prosodic syntax in Chinese]. Shanghai, Shanghai Jiaoyu Chubanshe
Feng Shengli. (2003) Prosodically constrained postverbal PPs in Mandarin Chinese. Linguistics 41(6): 1085–1122
Frisch Stefan A., Nathan R. Large, David B. Pisoni. (2000) Perception of wordlikeness: Effects of segment probability and length on the processing of nonwords. Journal of Memory and Language 42((4): 481–496
Golston Chris. (1995) Syntax outranks phonology: Evidence from Ancient Greek. Phonology 12(3): 343–368
Guo Shaoyu. (1938) Zhongguo yuci zhi tanxing zuoyong [The function of elastic word length in Chinese]. Yen Ching Hsueh Pao 24: 1–34
Gussenhoven Carlos. (1983) Focus, mode and the nucleus. Journal of Linguistics 19(2): 377–417
Halle Morris. (1962) Phonology in generative grammar. Word 18: 54–72
Halle Morris, Jean-Roger Vergnaud. (1987) An essay on stress. MIT Press, Cambridge, MA
Hayes Bruce. (1995) Metrical stress theory: Principles and case studies. University of Chicago Press, Chicago
Hockett Charles F. (1958) A course in modern linguistics. Macmillan, New York
Ito, Junko. 1986. Syllable theory in prosodic phonology. PhD diss., University of Massachusetts, Amherst.
Ke, Hang. 2007. Xiandai hanyu dan shuang yinjie dapei yanjiu [A study of monosyllabic and disyllabic usage in modern Chinese]. PhD diss., Institute of Linguistics, Chinese Academy of Social Sciences, Beijing.
Kuang Laying. (2006) ‘V shuang + N dan’ de xingzhi ji biaoshi pianzheng guanxi de youshi [The property of ‘disyllabic V + monosyllabic N’ and its preference for modifier-noun relation]. Journal of South China Agricultural University (Social Science Edition) 3(5): 95–98
Liu, Feng-hsi. 1992. Verb and syllable in Chinese. Paper presented at the 25th international conference on Sino-Tibetan Languages and Linguistics, Berkeley.
Lu, Bingfu. 1990. The structure of Chinese nominal phrases. M.A. thesis, University of Connecticut, Storrs.
Lu Bingfu, San Duanmu (2002) Rhythm and syntax in Chinese: A case study. Journal of the Chinese Language Teachers Association 37(2): 123–136
Lü Shuxiang. (1962) Shuo ‘ziyou’ he ‘zhanzhao’ [On ‘free’ and ‘bound’]. Zhongguo Yuwen 1: 1–6
Lü Shuxiang. (1963) Xiandai Hanyu dan shuang yinjie wenti chu tan [A preliminary study of the problem of monosyllabism and disyllabism in modern Chinese]. Zhongguo Yuwen 1: 11–23
Lü Shuxiang. (1990) Lü Shu-Xiang wen ji 2 [Collected papers by Lü Shu-Xiang, volume 2]. Beijing, Shangwu Yinshuguan
McEnery, Tony, and Richard Xiao. 2004. The Lancaster Corpus of Mandarin Chinese: A corpus for monolingual and contrastive language study. In Proceedings of the fourth international conference on language resources and evaluation (LREC) 2004, ed. M.T. Lino, M.F. Xavier, F. Ferreire, R. Costa, and R. Silva, 1175–1178. Lisbon, May 24–30, 2004.
Palmer, Martha, Fu-Dong Chiou, Nianwen Xue, Tsan-Kuang Lee, and Jeremy LaCivita. 2004. Chinese Treebank 4.0. Philadelphia: Linguistic Data Consortium, University of Pennsylvania.
Pan, Wenguo. 1997. Han Ying yu duibi gangyao [An outline of comparisons between Chinese and English]. Beijing: Beijing University of Languages and Cultures Press.
Pirani, Laura. 2008. Bound roots in Mandarin Chinese and comparison with European “semi-words”. In Proceedings of the 20th North American conference on Chinese Linguistics (NACCL-20), 2008, ed. K.M. Marjorie Chan and Hana Kang, Vol. 1, 261–277. Columbus: The Ohio State University.
Prince Alan. (1980) A metrical theory for Estonian quantity. Linguistic Inquiry 11: 511–562
Richtsmeier Peter T. (2011) Word-types, not word-tokens, facilitate extraction of phonotactic sequences by adults. Laboratory Phonology 2(1): 157–183
Shannon, Claude E. 1948. The mathematical theory of communication. Bell System Technical Journal 27: 379–423 and 623–656, July and October. (Reprinted in Shannon and Weaver 1949 with minor revisions.)
Shih, Chi-lin. 1986. The prosodic domain of tone sandhi in Chinese. PhD diss., University of California, San Diego.
Sproat Richard, Chilin Shih. (1996) A corpus-based analysis of Mandarin nominal root compound. Journal of East Asian Linguistics 5(1): 49–71
Truckenbrodt, Hubert. 2005. Phrasal stress. In Encyclopedia of language & linguistics, ed. Keith Brown, 2nd ed., Vol. 9, 572–579. Oxford: Elsevier.
Wang, Hongjun. 2001a. Yinjie danshuang, yinyu zhanlian (zhongyin) yu yufa jiegou leixing he chengfen cixu [The relations between the number of syllables, the tonal range of pitch (stress) and the grammatical structure in Chinese]. Dangdai Yuyanxue [Contemporary Linguistics] 3(4): 241–252.
Wang, Hongjun. 2001b. Xinxi chuli yong xiandai hanyu fenci cibiao de neibu gouzhao he hanyu de jiegou tedian [The internal structure of A Modern Chinese Lexicon for Information Processing and structural properties of Chinese]. Yuyanwenzi Yingyong [Applied Linguistics] 4: 90–97.
Wang Hongjun, Li Fu. (2005) Shilun xiandai hanyu de lei cizhui [On semi-affixes in modern Chinese]. Yuyan Kexue 4(5): 3–17
Wu Weishan. (1986) Xiandai Hanyu san yinjie zuhe guilü chutan [Preliminary discussion on trisyllabic structures in modern Chinese]. Hanyu Xuexi 5: 3–4
Xia, Fei. 2000a. The part-of-speech tagging guidelines for the Penn Chinese Treebank (3.0). http://www.cis.upenn.edu/~chinese/ctb.html. Accessed 19 Apr 2011.
Xia, Fei. 2000b. The segmentation guidelines for the Penn Chinese Treebank (3.0). http://www.cis.upenn.edu/~chinese/ctb.html. Accessed 19 Apr 2011.
Xiandai Hanyu Changyong Cibiao Ketizu [Common Lexicon in Modern Chinese Task Group]. 2008. Xiandai Hanyu changyong cibiao (cao an) [Common lexicon in Modern Chinese (Draft)]. Beijing: Shangwu Yinshuguan.
Yang, Shujun. 2005. Xiandai hanyu sanyinjie ciyu yanjiu [A study of trisyllabic words in modern Chinese]. PhD diss., Each China Normal University, Shanghai.
Zhou Ren. (2007) Xingxi liang yuanze yu hanyu jufa zuhe de yunlu moshi [The principle of information load and the prosodic model of syntactic combinations in Chinese]. Zhongguo Yuwen 3(318): 208–222
Zubizarreta, Maria Luisa. 1998. Prosody, focus, and word order. [Linguistic Inquiry Monograph 33] Cambridge, MA: MIT Press.
Zubizarreta, Maria Luisa, and Jean-Roger Vergnaud. 2006. Phrasal stress, focus, and syntax. In The Blackwell companion to syntax, ed. Martin Everaert and Henk Van Riemsdijk, Vol. III, 522–568. Malden, MA: Blackwell.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Duanmu, S. Word-length preferences in Chinese: a corpus study. J East Asian Linguist 21, 89–114 (2012). https://doi.org/10.1007/s10831-011-9087-y
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10831-011-9087-y