Machine Learning Models for Measuring Syntax Complexity of English Text

Schicchi, Daniele; Lo Bosco, Giosué; Pilato, Giovanni

doi:10.1007/978-3-030-25719-4_59

Daniele Schicchi¹⁵,
Giosué Lo Bosco¹⁵ &
Giovanni Pilato¹⁶

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 948))

Included in the following conference series:

Biologically Inspired Cognitive Architectures Meeting

905 Accesses
7 Citations

Abstract

In this paper we propose a methodology to assess the syntax complexity of a sentence representing it as sequence of parts-of-speech and comparing Recurrent Neural Networks and Support Vector Machine. We have carried out experiments in English language which are compared with previous results obtained for the Italian one.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 139.00; Price excludes VAT (USA)

Softcover Book: USD 179.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

References

Schicchi D, Pilato G (2018) WORDY: a semi-automatic methodology aimed at the creation of neologisms based on a semantic network and blending devices. In: Barolli L, Terzo O (eds) Complex, intelligent, and software intensive systems. Springer, Cham, pp 236–248
Chapter Google Scholar
Schicchi D, Pilato G (2018) A social humanoid robot as a playfellow for vocabulary enhancement. In: 2018 second IEEE international conference on robotic computing (IRC). IEEE Computer Society, Los Alamitos, pp 205–208
Google Scholar
Di Gangi MA, Federico M (2018) Deep neural machine translation with weakly-recurrent units. In: 21st annual conference of the European association for machine translation, pp 119–128
Google Scholar
Alfano M, Lenzitti B, Lo Bosco G, Perticone V (2015) An automatic system for helping health consumers to understand medical texts, pp 622–627
Google Scholar
Kincaid J (1975) Derivation of new readability formulas: (automated readability index, fog count and Flesch reading ease formula) for navy enlisted personnel. Research branch report. Chief of naval technical training, Naval Air Station Memphis
Google Scholar
Dell’Orletta F, Montemagni S, Venturi G (2011) Read-it: assessing readability of Italian texts with a view to text simplification. In: Proceedings of the second workshop on speech and language processing for assistive technologies. Association for Computational Linguistics, pp 73–83
Google Scholar
Xu W, Napoles C, Pavlick E, Chen Q, Callison-Burch C (2016) Optimizing statistical machine translation for text simplification. Trans Assoc Comput Linguist 4:401–415. https://doi.org/10.1162/tacl_a_00107
Article Google Scholar
Lo Bosco G, Pilato G, Schicchi D (2018) A recurrent deep neural network model to measure sentence complexity for the Italian language. In: Proceedings of the sixth international workshop on artificial intelligence and cognition
Google Scholar
Cortes C, Vapnik V (1995) Support-vector networks. Mach Learn 20(3):273–297
MATH Google Scholar
Schmid H (2013) Probabilistic part-of-speech tagging using decision trees. In: New methods in language processing, p 154
Google Scholar
Hochreiter S, Schmidhuber J (1997) Long short-term memory. Neural Comput 9(8):1735–1780
Article Google Scholar
Goodfellow I, Bengio Y, Courville A (2016) Deep learning. MIT Press, Cambridge
MATH Google Scholar
Xu W, Callison-Burch C, Napoles C (2015) Problems in current text simplification research: new data can help. Trans Assoc Comput Linguist 3:283–297. https://doi.org/10.1162/tacl_a_00139
Article Google Scholar
Lo Bosco G, Pilato G, Schicchi D (2018) A sentence based system for measuring syntax complexity using a recurrent deep neural network. In: 2nd workshop on natural language for artificial intelligence, NL4AI 2018, vol 2244. CEUR-WS, pp 95–101
Google Scholar
Bosco GL, Pilato G, Schicchi D (2018) A neural network model for the evaluation of text complexity in Italian language: a representation point of view. Procedia Comput Sci 145:464–470
Article Google Scholar

Download references

Author information

Authors and Affiliations

Dipartimento di Matematica e Informatica, Univerisitá degli Studi di Palermo, Palermo, Italy
Daniele Schicchi & Giosué Lo Bosco
ICAR-CNR - National Research Council of Italy, Palermo, Italy
Giovanni Pilato

Authors

Daniele Schicchi
View author publications
You can also search for this author in PubMed Google Scholar
Giosué Lo Bosco
View author publications
You can also search for this author in PubMed Google Scholar
Giovanni Pilato
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Daniele Schicchi .

Editor information

Editors and Affiliations

Moscow Engineering Physics Institute (MEPhI), Department of Cybernetics, National Research Nuclear University (NRNU), Moscow, Russia
Alexei V. Samsonovich

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Schicchi, D., Lo Bosco, G., Pilato, G. (2020). Machine Learning Models for Measuring Syntax Complexity of English Text. In: Samsonovich, A. (eds) Biologically Inspired Cognitive Architectures 2019. BICA 2019. Advances in Intelligent Systems and Computing, vol 948. Springer, Cham. https://doi.org/10.1007/978-3-030-25719-4_59

Download citation

DOI: https://doi.org/10.1007/978-3-030-25719-4_59
Published: 17 July 2019
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-25718-7
Online ISBN: 978-3-030-25719-4
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)

Publish with us

Policies and ethics