Skip to main content

Visualizing Stylistic Differences in Chinese Synonyms

  • Chapter
  • First Online:
Computational and Corpus Approaches to Chinese Language Learning

Part of the book series: Chinese Language Learning Sciences ((CLLS))

Abstract

Synonyms and near-synonyms are a major source of difficulty in the acquisition of Chinese vocabulary, possibly due to the formal similarities between them. It is also difficult to describe their stylistic differences in a clear and objective manner. The observations found in reference works such as dictionaries can be vague, equivocal, and limited in explanatory power. The present paper demonstrates how the corpus-based, multi-feature, multi-dimensional framework for studying register variation (Biber in Variation across speech and writing. Cambridge University Press, New York, 1988) and Correspondence Analysis, a particular implementation of factor analysis, can be used to show stylistic differences in (near) synonyms by way of two-dimensional bi-plots (“stylistic maps” in a sense). After presenting a two-dimensional analysis for Chinese, five sets of synonyms will be used to demonstrate the approach, together with comparisons with previous observations. Not only can the present approach provide a clearer and more nuanced picture than what introspection allows, it also enables us to go beyond the spoken versus written dichotomy and gain a broader perspective on stylistic variation.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

eBook
USD 16.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Hardcover Book
USD 109.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    For the purpose of this paper, the terms ‘style’ and ‘register’ will be used interchangeably.

  2. 2.

    The total number of tokens is one million.

  3. 3.

    The total number of tokens is over 10 billion.

  4. 4.

    Corpus of Contemporary American English at https://corpus.byu.edu/coca/.

References

  • Benzérci, J. P. (1973). Analyse des donnees. Paris: Dunod.

    Google Scholar 

  • Biber, D. (1988). Variation across speech and writing. New York: Cambridge University Press.

    Book  Google Scholar 

  • Biber, D., Johansson, S., Leech, G., Conrad, S., & Finegan, E. (1999). Longman grammar of spoken and written English. Harlow: Pearson Education.

    Google Scholar 

  • Bolinger, D. (1977). Meaning and form. London/New York: Longman.

    Google Scholar 

  • Carroll, J. B. (1960). Vectors of prose style. In T. A. Sebeok (Ed.), Style in language (pp. 283–292). Cambridge, MA: MIT Press.

    Google Scholar 

  • Chi, C. H. (池昌海) (1999). 对汉语同义词研究重要分歧的再认识. 《浙江大学学报: 人文社科版》1999 年第 01 期 第 77–84 页

    Google Scholar 

  • Feng, S. (冯胜利) (2010). On the mechanism of style and its grammatical properties (论语体的机制及其语法属性). Zhongguo Yuwen (《中国语文》), 5, 400–412.

    Google Scholar 

  • Green, G. (1982). Colloquial and literary uses of inversions. In D. Tannen (Ed.), Spoken and written language: Exploring orality and literacy. Norwood, NJ: ABLEX Publishing Corporation.

    Google Scholar 

  • Greenacre, M. (1984). Theory and applications of correspondence analysis. London: Academic Press.

    Google Scholar 

  • Gries, S. T. (2015). Quantitative designs and statistical techniques. In D. Biber & R. Reppen (Eds.), The Cambridge handbook of English corpus linguistics (pp. 50–71). Cambridge: Cambridge University Press.

    Chapter  Google Scholar 

  • Halliday, M. A. K., & Hasan, R. (1985). Language, context, and text: Aspects of language in a social-semiotic perspective. Geelong: Deakin University Press.

    Google Scholar 

  • Jang, S. (1998). Dimensions of spoken and written Taiwanese: A corpus-based register study. Unpublished doctoral dissertation. University of Hawaii, Manoa.

    Google Scholar 

  • Lü, S. (吕叔湘) (1980). Eight hundred words in contemporary Chinese (现代汉语八百词). Beijing: Commercial Press (商务印书馆).

    Google Scholar 

  • McEnery, A., & Xiao, Z. (2004). The Lancaster corpus of Mandarin Chinese: A corpus for monolingual and contrastive language study. In Proceedings of the Fourth International Conference on Language Resources and Evaluation (pp. 1175–1178). Lisbon, Portugal.

    Google Scholar 

  • McGillivray, B., Johansson, C., & Apollon, D. (2008). Semantic structure from correspondence analysis. In Proceedings of the third text graphs workshop on graph-based algorithms in natural language processing (pp. 49–52). Manchester, UK.

    Google Scholar 

  • Tabata, T. (2002). Investigating stylistic variation in Dickens through correspondence analysis of word-class distribution. In T. Saito, J. Nakamura, & S. Yamazaki (Eds.), English corpus linguistics in Japan (pp. 165–182). Amsterdam/New York: Rodopi.

    Google Scholar 

  • Tabata, T. (2007). A statistical study of superlatives in Dickens and Smollett: A case study in corpus stylistics. Paper Presented at the Digital Humanities 2007 Conference. Urbana-Champaign, IL.

    Google Scholar 

  • Tao, H. (陶红印) (1999). On the grammatical significance of register distinctions (试论语体分类的语法学意义). Contemporary Linguistics (《当代语言学》), 3, 15–24.

    Google Scholar 

  • Teng, S.-H. (鄧守信) (1996) Chinese synonyms usage dictionary (汉英汉语常用近义词用法词典). Beijing: Beijing Language Institute Press (北京语言学院出版社).

    Google Scholar 

  • Wang, H. (王还) (2005). A dictionary of Chinese synonyms (汉语近义词典). Beijing: Beijing Language and Culture University Press (北京语言大学出版社).

    Google Scholar 

  • Wells, R. (1960). Nominal and verbal style. In T. A. Sebeok (Ed.), Style in language (pp. 213–220). Cambridge, MA: MIT Press.

    Google Scholar 

  • Yang, J. (杨寄洲), & Jia Y. (贾永芬) (2005). 1700 groups of frequently used Chinese synonyms (1700对近义词语用法对比). Beijing: Beijing Language and Culture University Press (北京语言大学出版社).

    Google Scholar 

  • Zhang, G. Q. (2010). Using Chinese synonyms. Cambridge: Cambridge University Press.

    Google Scholar 

  • Zhang, Z.-S. (2012). A corpus study of variation in written Chinese. Corpus Linguistics and Linguistic Theory, 8(1), 209–240.

    Google Scholar 

  • Zhang, Z.-S. (2013) The classical elements in written Chinese: A multidimensional quantitative study, Chinese Language and Discourse, 4(2),157–180.

    Article  Google Scholar 

  • Zhang, Z.-S. (2017). Dimensions of variation in written Chinese. London/New York: Routledge.

    Chapter  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Zheng-Sheng Zhang .

Editor information

Editors and Affiliations

Appendices

Appendix A: 50 Features for the LCMC Study

  • 我, 我们, 你, 你们, 他, 她, 他们, 了, 着, 过, 被, 使, 名, 动, 形

  • 副, 地, 得, 的, 量, 名习, 介, 动习, 进行, 拟声, 将, 把, 之, 为, 所

  • 于, 以, 与, 无, 名素, 动素, 形素, 副素, 连, 语, 形名, 非谓, 动名

  • “?”, “!”, “:”, “‘‘”, “;”, “,”, “(”

Appendix B: 95 Features for the BCC Study

  • “!”, “?”, “:”, ““”, “(”, “《”, “;”, 吗, 呢,

  • 了1, 了2, 着, 过, 动名, 形名, 拟声, 的, 得, 地, 把, 将, 非谓,

  • 被动, 使, 让, 我, 你, 他, 她, 我们, 你们, 他们, 她们, 购买, 具有

  • 在于, 寻找, 获得, 巨大, 询问, 进入, 进行, 加以, 予以, 所, 以,

  • 于, 与, 之, 之a, 为, 无, 未, 其, 非, 此, 该, 均, 皆, 自, 至, 及,

  • 人, 人们, 人民, 没, 很, 是不是, 是否, 还, 仍, 仍然, 仍旧, 等候,

  • 依然, 大大, 纷纷, 有加, 似乎, 仿佛, 不时, 一番, 女人, 女士, 女性,

  • 女子, 妇女, 男人, 男子, 男士, 男性, 伟大, 欢喜, 晓得

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Singapore Pte Ltd.

About this chapter

Check for updates. Verify currency and authenticity via CrossMark

Cite this chapter

Zhang, ZS. (2019). Visualizing Stylistic Differences in Chinese Synonyms. In: Lu, X., Chen, B. (eds) Computational and Corpus Approaches to Chinese Language Learning. Chinese Language Learning Sciences. Springer, Singapore. https://doi.org/10.1007/978-981-13-3570-9_8

Download citation

  • DOI: https://doi.org/10.1007/978-981-13-3570-9_8

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-13-3569-3

  • Online ISBN: 978-981-13-3570-9

  • eBook Packages: EducationEducation (R0)

Publish with us

Policies and ethics