Automatic Spelling Error Classification in Malayalam

Dhanya, S.; Kaimal, M. R.; Nedungadi, Prema

doi:10.1007/978-981-97-0744-7_25

S. Dhanya^13,14,
M. R. Kaimal¹³ &
Prema Nedungadi^13,14

Part of the book series: Lecture Notes in Networks and Systems ((LNNS,volume 916))

Included in the following conference series:

International Conference on Information and Communication Technology for Competitive Strategies

14 Accesses

Abstract

Spelling errors, a commonplace phenomenon among students dominated by those with learning disabilities, call for effective systems that automatically and accurately identify and classify spelling mistakes. This research focused on developing an automatic spelling error classification tool to identify the Malayalam writing system’s phonological and orthographic spelling error categories. For the analysis of spelling errors in Malayalam, the study considered the input pair of real words and spelled words for extraction of six linguistic features: the difference in word length, edit distance, character overlapping percentage, the number of common phonemes, the number of common bigrams, and the number of common trigrams. After analysing spelling errors, machine learning techniques classify them. The research appraised the effectiveness of feature engineering apropos automatic spelling error classification in Malayalam. Results indicate that the fine-tuned Random Forest classifier achieved the highest classification accuracy of 71%, effectively distinguishing between the two error categories. The findings pinpoint the impact of linguistic features on accuracy and afford insights into the imperative of language technology tools for educational purposes.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 149.00; Price excludes VAT (USA)

Softcover Book: USD 199.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Bright W (1999) A matter of typology: alphasyllabaries and abugidas. In: WRIT—1133ten Lang. literacy, vol 2, no 1, pp. 45–55, 1999. Available 1134. https://www.jbeplatform.com/content/journals/10.1075/wll.2.1.03bri
Haridas M, Vasudevan N, Nair GJ, Gutjahr G, Raman R, Nedungadi P (2018) Spelling errors by normal and poor readers in a bilingual Malayalam-English dyslexia screening test. In: 2018 IEEE 18th international conference on advanced learning technologies (ICALT), pp 340–344. https://doi.org/10.1109/ICALT.2018.00085
Gafoor K, Remia K (2013) Spelling difficulties in Malayalam among lower primary students, vol 5, 31–40
Google Scholar
Haridas M, Vasudevan N, Iyer, A, Menon R, Nedungadi P (2017) Analyzing the responses of primary school children in dyslexia screening tests. In: Proceedings of the IEEE 5th international conference on MOOCs, innovation and technology in education (MITE), pp 89–94. https://doi.org/10.1109/MITE.2017.00022
Haridas M, Vasudevan N, Gutjahr G, Raman R, Nedungadi P (2020) Comparing English and Malayalam spelling errors of children using a bilingual screening tool. In: Advances in intelligent systems and computing, vol 1027. Springer, pp 427–436. https://doi.org/10.1007/978-981-32-9343-4_34
Sooraj SK, Manjusha, Kumar M, Kp S (2018) Deep learning based spell checker for Malayalam language. J Intell Fuzzy Syst 34:1427–1434. https://doi.org/10.3233/JIFS-169438
Premjith B, Soman KP, Kumar MA (2018) A deep learning approach for Malayalam morphological analysis at character level. Procedia Comput Sci 132:47–54. https://doi.org/10.1016/j.procs.2018.05.058
Berkling J, Lavalley R (2018) Automatic orthographic error tagging and classification for German texts. Compu Speech Lang 52:56–78. ISSN 0885-2308. https://doi.org/10.1016/j.csl.2017.11.002
Protopapas A, Fakou A, Drakopoulou S, Skaloumbakas C, Mouzaki A (2012) What do spelling errors tell us? Classification and analysis of errors made by Greek schoolchildren with and without dyslexia. Reading Writing 26. https://doi.org/10.1007/s11145-012-9378-3
Vasudevan N, Haridas M, Nedungadi P, Raman R, Daniels PT, Share DL(2023) A multi-dimensional framework for characterising the role of writing system variation in literacy learning: a case study in Malayalam. Reading Writing https://doi.org/10.1007/s11145-022-10374-3
Renjit S, Idicula SM (2022) Feature based entailment recognition for Malayalam language texts. Int J Adv Comput Sci Appl 13(2). https://doi.org/10.14569/IJACSA.2022.0130283 (The Science and Information Organization)
García-Díaz JA, Vivancos-Vicente PJ, Almela Á, Valencia-García R (2022) UMUTextStats: a linguistic feature extraction tool for Spanish. In: Proceedings of the thirteenth language resources and evaluation conference, June 2022, Marseille, France. European Language Resources Association, pp 6035–6044. https://aclanthology.org/2022.lrec-1.649
Pennebaker J, Francis M, Booth R (1999) Linguistic inquiry and word count (LIWC)
Google Scholar
Spoon K, Crandall D, Siek K (2010) Towards detecting dyslexia in children’s handwriting using neural networks. In: Proceedings of the international conference on machine learning AI for social good workshop, pp 1–5
Google Scholar
Spoon K, Siek K, Crandall D, Fillmore M (2019) Can we (and should we) use AI to detect dyslexia in children’s handwriting?. In: Proceedings of the international conference on machine learning AI for social good workshop, Long Beach, CA, USA, 9–15 June 2019, pp 1–6
Google Scholar
Christopher B, Mariano F, Ted B (2017) Automatic annotation and evaluation of error types for grammatical error correction. 1:793–805. https://doi.org/10.18653/V1/P17-1074
Korre K, Chatzipanagiotou M, Pavlopoulos J (2021) ELERRANT: automatic grammatical error type classification for greek. 708–717. https://doi.org/10.26615/978-954-452-072-4_081
Suhaidi M, Abdul Kadir R, Tiun S (2021) A review of feature extraction methods on machine learning. J Inf Syst Technol Manage
Google Scholar
Manohar K, Jayan A, Rajan R (2022) Mlphon: a multifunctional grapheme-phoneme conversion tool using finite state transducers. IEEE Access 1–1. https://doi.org/10.1109/ACCESS.2022.3204403

Download references

Author information

Authors and Affiliations

Department of Computer Science and Engineering, Amrita School of Computing, Coimbatore, India
S. Dhanya, M. R. Kaimal & Prema Nedungadi
Amrita CREATE, Amrita Vishwa Vidyapeetham, Amritapuri, India
S. Dhanya & Prema Nedungadi

Authors

S. Dhanya
View author publications
You can also search for this author in PubMed Google Scholar
M. R. Kaimal
View author publications
You can also search for this author in PubMed Google Scholar
Prema Nedungadi
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to S. Dhanya .

Editor information

Editors and Affiliations

Global Knowledge Research Foundation, Ahmedabad, Gujarat, India
Amit Joshi
Nottingham Trent University, Nottingham, UK
Mufti Mahmud
University of Peradeniya, Delthota, Sri Lanka
Roshan G. Ragel
Department of CSE, SNS College of Technology, Coimbatore, Tamil Nadu, India
S. Kartik

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Dhanya, S., Kaimal, M.R., Nedungadi, P. (2024). Automatic Spelling Error Classification in Malayalam. In: Joshi, A., Mahmud, M., Ragel, R.G., Kartik, S. (eds) ICT: Cyber Security and Applications. ICTCS 2022. Lecture Notes in Networks and Systems, vol 916. Springer, Singapore. https://doi.org/10.1007/978-981-97-0744-7_25

Download citation

DOI: https://doi.org/10.1007/978-981-97-0744-7_25
Published: 14 May 2024
Publisher Name: Springer, Singapore
Print ISBN: 978-981-97-0743-0
Online ISBN: 978-981-97-0744-7
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics