Skip to main content

Machine Learning Models for Measuring Syntax Complexity of English Text

  • Conference paper
  • First Online:
Biologically Inspired Cognitive Architectures 2019 (BICA 2019)

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 948))

Included in the following conference series:

Abstract

In this paper we propose a methodology to assess the syntax complexity of a sentence representing it as sequence of parts-of-speech and comparing Recurrent Neural Networks and Support Vector Machine. We have carried out experiments in English language which are compared with previous results obtained for the Italian one.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 139.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 179.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    http://www.natcorp.ox.ac.uk/docs/c5spec.html.

  2. 2.

    https://scikit-learn.org.

References

  1. Schicchi D, Pilato G (2018) WORDY: a semi-automatic methodology aimed at the creation of neologisms based on a semantic network and blending devices. In: Barolli L, Terzo O (eds) Complex, intelligent, and software intensive systems. Springer, Cham, pp 236–248

    Chapter  Google Scholar 

  2. Schicchi D, Pilato G (2018) A social humanoid robot as a playfellow for vocabulary enhancement. In: 2018 second IEEE international conference on robotic computing (IRC). IEEE Computer Society, Los Alamitos, pp 205–208

    Google Scholar 

  3. Di Gangi MA, Federico M (2018) Deep neural machine translation with weakly-recurrent units. In: 21st annual conference of the European association for machine translation, pp 119–128

    Google Scholar 

  4. Alfano M, Lenzitti B, Lo Bosco G, Perticone V (2015) An automatic system for helping health consumers to understand medical texts, pp 622–627

    Google Scholar 

  5. Kincaid J (1975) Derivation of new readability formulas: (automated readability index, fog count and Flesch reading ease formula) for navy enlisted personnel. Research branch report. Chief of naval technical training, Naval Air Station Memphis

    Google Scholar 

  6. Dell’Orletta F, Montemagni S, Venturi G (2011) Read-it: assessing readability of Italian texts with a view to text simplification. In: Proceedings of the second workshop on speech and language processing for assistive technologies. Association for Computational Linguistics, pp 73–83

    Google Scholar 

  7. Xu W, Napoles C, Pavlick E, Chen Q, Callison-Burch C (2016) Optimizing statistical machine translation for text simplification. Trans Assoc Comput Linguist 4:401–415. https://doi.org/10.1162/tacl_a_00107

    Article  Google Scholar 

  8. Lo Bosco G, Pilato G, Schicchi D (2018) A recurrent deep neural network model to measure sentence complexity for the Italian language. In: Proceedings of the sixth international workshop on artificial intelligence and cognition

    Google Scholar 

  9. Cortes C, Vapnik V (1995) Support-vector networks. Mach Learn 20(3):273–297

    MATH  Google Scholar 

  10. Schmid H (2013) Probabilistic part-of-speech tagging using decision trees. In: New methods in language processing, p 154

    Google Scholar 

  11. Hochreiter S, Schmidhuber J (1997) Long short-term memory. Neural Comput 9(8):1735–1780

    Article  Google Scholar 

  12. Goodfellow I, Bengio Y, Courville A (2016) Deep learning. MIT Press, Cambridge

    MATH  Google Scholar 

  13. Xu W, Callison-Burch C, Napoles C (2015) Problems in current text simplification research: new data can help. Trans Assoc Comput Linguist 3:283–297. https://doi.org/10.1162/tacl_a_00139

    Article  Google Scholar 

  14. Lo Bosco G, Pilato G, Schicchi D (2018) A sentence based system for measuring syntax complexity using a recurrent deep neural network. In: 2nd workshop on natural language for artificial intelligence, NL4AI 2018, vol 2244. CEUR-WS, pp 95–101

    Google Scholar 

  15. Bosco GL, Pilato G, Schicchi D (2018) A neural network model for the evaluation of text complexity in Italian language: a representation point of view. Procedia Comput Sci 145:464–470

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Daniele Schicchi .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Schicchi, D., Lo Bosco, G., Pilato, G. (2020). Machine Learning Models for Measuring Syntax Complexity of English Text. In: Samsonovich, A. (eds) Biologically Inspired Cognitive Architectures 2019. BICA 2019. Advances in Intelligent Systems and Computing, vol 948. Springer, Cham. https://doi.org/10.1007/978-3-030-25719-4_59

Download citation

Publish with us

Policies and ethics