ReaderBench: A Multi-lingual Framework for Analyzing Text Complexity
Assessing textual complexity is a difficult, but important endeavor, especially for adapting learning materials to students’ and readers’ levels of understanding. With the continuous growth of information technologies spanning through various research fields, automated assessment tools have become reliable solutions to automatically assessing textual complexity. ReaderBench is a text processing framework relying on advanced Natural Language Processing techniques that encompass a wide range of text analysis modules available in a variety of languages, including English, French, Romanian, and Dutch. To our knowledge, ReaderBench is the only open-source multilingual textual analysis solution that provides unified access to more than 200 textual complexity indices including: surface, syntactic, morphological, semantic, and discourse specific factors, alongside cohesion metrics derived from specific lexicalized ontologies and semantic models.
KeywordsMulti-lingual text analysis Textual complexity Comprehension prediction Natural Language Processing Textual cohesion Writing style
This research was partially supported by the FP7 2008-212578 LTfLL project, by the 644187 EC H2020 RAGE project, by the ANR-10-blan-1907-01 DEVCOMP project, as well as by University Politehnica of Bucharest through the “Excellence Research Grants” Program UPB–GEX 12/26.09.2016.
- 1.Dascalu, M.: Analyzing Discourse and Text Complexity for Learning and Collaborating. SCI, vol. 534. Springer, Cham (2014)Google Scholar
- 2.Allen, L.K., Dascalu, M., McNamara, D.S., Crossley, S., Trausan-Matu, S.: Modeling individual differences among writers using ReaderBench. In: EduLearn 2016, Barcelona, Spain, pp. 5269–5279. IATED (2016)Google Scholar
- 3.Dascalu, M., Stavarache, L.L., Trausan-Matu, S., Dessus, P., Bianco, M.: Reflecting comprehension through French textual complexity factors. In: 26th International Conference on Tools with Artificial Intelligence, ICTAI 2014, Limassol, Cyprus, pp. 615–619. IEEE (2014)Google Scholar
- 4.Gifu, D., Dascalu, M., Trausan-Matu, S., Allen, L.K.: Time evolution of writing styles in Romanian language. In: ICTAI 2016, San Jose, CA, pp. 1048–1054. IEEE (2016)Google Scholar
- 5.Dascalu, M., Westera, W., Ruseti, S., Trausan-Matu, S., Kurvers, H.: ReaderBench learns Dutch: building a comprehensive automated essay scoring system for Dutch. In: André, E., Baker, R., Hu, X., Rodrigo, M., du Boulay, B. (eds.) AIED 2017. LNCS, vol. 10331, pp. 52–63. Springer, Cham (2017)CrossRefGoogle Scholar