Detecting Article Errors Based on the Mass Count Distinction

  • Conference paper
Natural Language Processing – IJCNLP 2005 (IJCNLP 2005)

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 3651)

Abstract

This paper proposes a method for detecting errors in article usage and singular/plural usage based on the mass count distinction. Although the mass count distinction is particularly important in detecting these errors, it has been pointed out that heuristic rules for distinguishing mass and count nouns are hard to construct. To solve this problem, the proposed method first automatically collects instances of mass and count nouns from a corpus by exploiting surface information. Then, the words surrounding the mass (count) instances are weighted based on their frequencies. Finally, the weighted words are used to distinguish mass and count nouns. Once mass and count nouns have been distinguished, the above errors can be detected by heuristic rules. Experiments show that the proposed method distinguishes mass and count nouns in the writing of Japanese learners of English with an accuracy of 93% and that 65% of article errors are detected with a precision of 70%.
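
The three steps in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the cue word lists, the smoothed log-ratio weighting, the summed score, the toy corpus, and all function names (`context_words`, `train`, `classify`, `article_error`) are assumptions made for illustration, since the abstract only states that surrounding words are "weighted based on their frequencies".

```python
import math
from collections import Counter

# Hypothetical surface cues; the paper's actual cue list is not given in the abstract.
COUNT_CUES = {"a", "an", "many", "several", "each", "every"}
MASS_CUES = {"much", "little"}
CUES = COUNT_CUES | MASS_CUES


def context_words(tokens, i, window=3):
    """Lowercased words within `window` positions of tokens[i], minus cue words."""
    lo, hi = max(0, i - window), i + window + 1
    return [t.lower() for j, t in enumerate(tokens[lo:hi], start=lo)
            if j != i and t.lower() not in CUES]


def train(sentences, noun, alpha=0.5):
    """Collect cue-labelled instances of `noun` and weight their context words."""
    freq = {"mass": Counter(), "count": Counter()}
    for tokens in sentences:
        for i, tok in enumerate(tokens):
            if tok.lower() != noun or i == 0:
                continue
            prev = tokens[i - 1].lower()
            label = ("count" if prev in COUNT_CUES
                     else "mass" if prev in MASS_CUES else None)
            if label:
                freq[label].update(context_words(tokens, i))
    vocab = set(freq["mass"]) | set(freq["count"])
    n_mass = sum(freq["mass"].values()) + alpha * len(vocab)
    n_count = sum(freq["count"].values()) + alpha * len(vocab)
    # Assumed weighting: smoothed log ratio of relative frequencies.
    # Positive means the word is relatively more frequent around mass instances.
    return {w: math.log((freq["mass"][w] + alpha) / n_mass)
               - math.log((freq["count"][w] + alpha) / n_count)
            for w in vocab}


def classify(tokens, i, weights):
    """Sum the weights of the context words; unseen words contribute nothing."""
    score = sum(weights.get(w, 0.0) for w in context_words(tokens, i))
    return "mass" if score > 0 else "count"


def article_error(tokens, i, weights):
    """Hypothetical rule: flag 'a'/'an' directly before a noun classified as mass."""
    prev = tokens[i - 1].lower() if i > 0 else ""
    return classify(tokens, i, weights) == "mass" and prev in {"a", "an"}


# Toy corpus, invented for this example.
corpus = [s.split() for s in [
    "she bought a chicken at the farm",
    "he ate much chicken for dinner",
    "they saw a chicken on the farm",
    "we cooked much chicken for lunch",
]]
weights = train(corpus, "chicken")
print(classify("I ate chicken for dinner".split(), 2, weights))
print(article_error("I ate a chicken for dinner".split(), 3, weights))
```

The cue words themselves are excluded from the context so that the classifier does not rely on the article a learner chose, which may be the very error being detected. A tie (score of zero) falls back to "count" here; that default is arbitrary.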

Copyright information

© 2005 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Nagata, R., Wakana, T., Masui, F., Kawai, A., Isu, N. (2005). Detecting Article Errors Based on the Mass Count Distinction. In: Dale, R., Wong, KF., Su, J., Kwong, O.Y. (eds) Natural Language Processing – IJCNLP 2005. IJCNLP 2005. Lecture Notes in Computer Science, vol 3651. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11562214_71

  • DOI: https://doi.org/10.1007/11562214_71

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-29172-5

  • Online ISBN: 978-3-540-31724-1

  • eBook Packages: Computer Science, Computer Science (R0)
