Research on Chinese Animal Words Extraction Based on Children’s Literature Corpus

  • Conference paper
Chinese Lexical Semantics (CLSW 2019)

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 11831)

Abstract

Categorized and graded vocabularies are an important part of children’s graded reading. Taking the animal words in the Thesaurus of Modern Chinese as seed words, this paper studies a method for extracting animal words from a children’s literature corpus and attempts to construct a word sequencing model. The method matches the output of automatic word segmentation against the seed words. It extracts 786 animal nouns from the corpus, an increase of 39.36% over the 564 seed words, together with 780 derivative animal words. The animal word sequencing model is based on word-work-popularity and word-writer-popularity, which addresses the imbalance in the number of characters and in the number of works per writer.
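The matching step and the two popularity measures can be illustrated with a minimal sketch. The abstract does not define word-work-popularity or word-writer-popularity, so the code below assumes they are the number of distinct works and the number of distinct writers in which a matched word appears. The toy corpus, the seed list, the ranking order and the function names `popularity` and `growth_rate` are all hypothetical and are not taken from the paper. Extraction of derivative animal words is not covered.

```python
from collections import defaultdict

# Hypothetical toy data: (writer, work, tokens produced by a word segmenter).
corpus = [
    ("writer_a", "work_1", ["小", "兔子", "看见", "老虎"]),
    ("writer_a", "work_2", ["兔子", "和", "小鸟"]),
    ("writer_b", "work_3", ["老虎", "追", "兔子"]),
]

# Stand-in for the animal seed words taken from the thesaurus.
seed_words = {"兔子", "老虎"}


def popularity(corpus, seed_words):
    """Map each matched seed word to (distinct works, distinct writers).

    Assumption: these two counts stand in for word-work-popularity and
    word-writer-popularity, which the abstract names but does not define.
    """
    works = defaultdict(set)
    writers = defaultdict(set)
    for writer, work, tokens in corpus:
        for token in tokens:
            if token in seed_words:  # match segmentation output to seeds
                works[token].add(work)
                writers[token].add(writer)
    return {w: (len(works[w]), len(writers[w])) for w in works}


def growth_rate(extracted, seeds):
    """Relative increase of the extracted word count over the seeds, in percent."""
    return round((extracted - seeds) / seeds * 100, 2)


scores = popularity(corpus, seed_words)
# Assumed ordering: more works first, ties broken by more writers.
ranking = sorted(scores, key=lambda w: scores[w], reverse=True)

print(ranking)
print(growth_rate(786, 564))  # figures reported in the abstract
```

Counting distinct works and writers rather than raw token frequency keeps one long text or one prolific writer from dominating the ranking, which is one plausible reading of the imbalance the abstract mentions.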

Acknowledgments

This research is supported by the Science Foundation of Beijing Language and Culture University (supported by “the Fundamental Research Funds for the Central Universities”) (19YJ040005); the Major Program of the National Social Science Foundation of China (18ZDA295); the MOOC Project of Beijing Language and Culture University (FZ201911); the Top-ranking Discipline Team Support Program of Beijing Language and Culture University (JC201902); and the Beijing College Student Innovation and Entrepreneurship Training Program (Nos. 18XKGJ05, 201910032045, 201910032046).

Author information

Corresponding author

Correspondence to Huizhou Zhao.

Copyright information

© 2020 Springer Nature Switzerland AG

About this paper

Cite this paper

Zhao, H., Wang, Z., Wang, S., Zhang, L. (2020). Research on Chinese Animal Words Extraction Based on Children’s Literature Corpus. In: Hong, JF., Zhang, Y., Liu, P. (eds) Chinese Lexical Semantics. CLSW 2019. Lecture Notes in Computer Science, vol 11831. Springer, Cham. https://doi.org/10.1007/978-3-030-38189-9_63

  • DOI: https://doi.org/10.1007/978-3-030-38189-9_63

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-38188-2

  • Online ISBN: 978-3-030-38189-9

  • eBook Packages: Computer Science, Computer Science (R0)
