Automatic Assessment of Open Ended Questions with a Bleu-Inspired Algorithm and Shallow NLP

Alfonseca, Enrique; Pérez, Diana

doi:10.1007/978-3-540-30228-5_3

Enrique Alfonseca⁵ &
Diana Pérez⁵

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 3230))

Included in the following conference series:

International Conference on Natural Language Processing (in Spain)

720 Accesses
13 Citations

Abstract

This paper compares the accuracy of several variations of the Bleu algorithm when applied to automatically evaluating student essays. The different configurations include closed-class word removal, stemming, two baseline word-sense disambiguation procedures, and translating the texts into a simple semantic representation. We also prove empirically that the accuracy is kept when the student answers are translated automatically. Although none of the representations clearly outperform the others, some conclusions are drawn from the results.

This work has been sponsored by CICYT, project number TIC2001-0685-C02-01.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Page, E.B.: The use of computer in analyzing student essays. International review of education 14 (1968)
Google Scholar
Whittington, D., Hunt, H.: Approaches to the computerized assessment of free text responses. In: Proceedings of the Int. CAA Conference (1999)
Google Scholar
Mitchell, T., Russell, T., Broomhead, P., Aldridge, N.: Towards robust computerised marking of free-text responses. In: Proceedings of the 6th International Computer Assisted Assessment (CAA) Conference, Loughborough, UK (2002)
Google Scholar
Laham, D.: Automated content assessment of text using Latent Semantic Analysis to simulate human cognition. Ph.D. thesis, University of Colorado, Boulder (2000)
Google Scholar
Burstein, J., Kukich, K., Wolff, S., Chi, L., Chodorow, M.: Enriching automated essay scoring using discourse marking. In: Proceedings of the Workshop on Discourse Relations and Discourse Marking, ACL, Montreal, Canada (1998)
Google Scholar
Burstein, J., Leacock, C., Swartz, R.: Automated evaluation of essay and short answers. In: Proceedings of the Int. CAA Conference (2001)
Google Scholar
Valenti, S., Neri, F., Cucchiarelli, A.: An overview of current research on automated essay grading. Journal of I.T. Education 2, 319–330 (2003)
Google Scholar
Papineni, K., Roukos, S., Ward, T., Zhu, W.: Bleu: a method for automatic evaluation of machine translation (2001)
Google Scholar
Pérez, D., Alfonseca, E., Rodríguez, P.: Application of the BLEU method for evaluating free-text answers in an e-learning environment. In: Proceedings of the Language Resources and Evaluation Conference (LREC 2004) (2004)
Google Scholar
Pérez, D., Alfonseca, E., Rodríguez, P.: Upper bounds and extension of the Bleu algorithm applied to assessing student essays. In: IAEA 2004 Conference (2004)
Google Scholar
Lin, C.Y., Hovy, E.H.: Automatic evaluation of summaries using n-gram co-occurrence statistics. In: Proceedings of 2003 Language Technology Conference (HLT-NAACL 2003) (2003)
Google Scholar
Lin, C.Y.: Rouge working note v. 1.3.1 (2004)
Google Scholar
Fellbaum, C.: Analysis of a handtagging task. In: Proceedings of ANLP 1997 Workshop on Tagging Text with Lexical Semantics: Why, What, and How? Washington D.C., USA (1997)
Google Scholar
Vossen, P.: EuroWordNet - A Multilingual Database with Lexical Semantic Networks. Kluwer Academic Publishers, Dordrecht (1998)
Book Google Scholar
Alfonseca, E.: Wraetlic user guide version 1.0 (2003)
Google Scholar
Foltz, P., Laham, D., Landauer, T.: The intelligent essay assessor: Applications to educational technology. Interactive Multimedia Electronic Journal of Computer-Enhanced Learning (1999)
Google Scholar
Rudner, L., Liang, T.: Automated essay scoring using bayes’ theorem. In: Proceedings of the annual meeting of the National Council on Measurement in Education (2002)
Google Scholar
Carro, R.M., Pulido, E., Rodríguez, P.: Dynamic generation of adaptive internet-based courses. Journal of Network and Computer Applications 22, 249–257 (1999)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science, Universidad Autónoma, de Madrid, 28049, Madrid, Spain
Enrique Alfonseca & Diana Pérez

Authors

Enrique Alfonseca
View author publications
You can also search for this author in PubMed Google Scholar
Diana Pérez
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Software and Computing Systems, University of Alicante, Spain
José Luis Vicedo
Natural Language Processing and Information Systems Group, Department of Software and Computing Systems, University of Alicante, Spain
Patricio Martínez-Barco
Grupo de investigación del Procesamiento del Lenguaje y Sistemas de Información, Departamento de Lenguajes y Sistemas Informáticos, Universidad de Alicante, Alicante, Spain
Rafael Muńoz
Departamento de Lenguajes y Sistemas Informáticos, Carretera de San Vicente del Raspeig, Universidad de Alicante, 03690 San Vicente del Raspeig, Alicante, Spain
Maximiliano Saiz Noeda

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Alfonseca, E., Pérez, D. (2004). Automatic Assessment of Open Ended Questions with a Bleu-Inspired Algorithm and Shallow NLP. In: Vicedo, J.L., Martínez-Barco, P., Muńoz, R., Saiz Noeda, M. (eds) Advances in Natural Language Processing. EsTAL 2004. Lecture Notes in Computer Science(), vol 3230. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30228-5_3

Download citation

DOI: https://doi.org/10.1007/978-3-540-30228-5_3
Published: 20 October 2004
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-23498-2
Online ISBN: 978-3-540-30228-5
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics