A General Method for Transforming Standard Parsers into Error-Repair Parsers

  • Carlos Gómez-Rodríguez
  • Miguel A. Alonso
  • Manuel Vilares
Part of the Lecture Notes in Computer Science book series (LNCS, volume 5449)

Abstract

A desirable property for any system dealing with unrestricted natural language text is robustness, the ability to analyze any input regardless of its grammaticality. In this paper we present a novel, general transformation technique to automatically obtain robust, error-repair parsers from standard non-robust parsers. The resulting error-repair parsing schema is guaranteed to be correct when our method is applied to a correct parsing schema verifying certain conditions that are weak enough to be fulfilled by a wide variety of parsers used in natural language processing.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Cerecke, C.: Repairing syntax errors in LR-based parsers. Australian Computer Science Communications 24(1), 17–22 (2002)Google Scholar
  2. 2.
    Corchuelo, R., Pérez, J.A., Ruiz, A., Toro, M.: Repairing Syntax Errors in LR Parsers. ACM Transactions on Programming Languages and Systems 24(6), 698–710 (2002)CrossRefGoogle Scholar
  3. 3.
    Earley, J.: An efficient context-free parsing algorithm. Communications of the ACM 13(2), 94–102 (1970)CrossRefMATHGoogle Scholar
  4. 4.
    Gómez-Rodríguez, C., Vilares, J., Alonso, M.A.: A compiler for parsing schemata. Software: Practice and Experience, doi:10.1002/spe.904 (forthcoming)Google Scholar
  5. 5.
    Grune, D., Jacobs, C.J.H.: Parsing Techniques. A Practical Guide, 2nd edn. Springer Science+Business Media, Heidelberg (2008)MATHGoogle Scholar
  6. 6.
    Kasper, W., Kiefer, B., Krieger, H.U., Rupp, C.J., Worm, K.L.: Charting the depths of robust speech parsing. In: Proc. of ACL 1999, Morristown, NJ, USA, pp. 405–412 (1999)Google Scholar
  7. 7.
    Kim, I.-S., Choe, K.-M.: Error Repair with Validation in LR-based Parsing. ACM Transactions on Programming Languages and Systems 23(4), 451–471 (2001)CrossRefGoogle Scholar
  8. 8.
    Levenshtein, V.I.: Binary codes capable of correcting deletions, insertions, and reversals. Soviet Physics Doklady 10(8), 707–710 (1966)MathSciNetMATHGoogle Scholar
  9. 9.
    Lyon, G.: Syntax-directed least-errors analysis for context-free languages: a practical approach. Commun. ACM 17(1), 3–14 (1974)CrossRefMATHGoogle Scholar
  10. 10.
    Moore, R.C.: Improved left-corner chart parsing for large context-free grammars. In: Proc. of the 6th IWPT, Trento, Italy, pp. 171–182 (2000)Google Scholar
  11. 11.
    Perez-Cortes, J.C., Amengual, J.C., Arlandis, J., Llobet, R.: Stochastic error-correcting parsing for OCR post-processing. In: ICPR 2000: Proceedings of the International Conference on Pattern Recognition, Washington, DC, USA, p. 4405. IEEE Computer Society, Los Alamitos (2000)Google Scholar
  12. 12.
    Shieber, S.M., Schabes, Y., Pereira, F.C.N.: Principles and implementation of deductive parsing. Journal of Logic Programming 24(1–2), 3–36 (1995)MathSciNetCrossRefMATHGoogle Scholar
  13. 13.
    Sikkel, K.: Parsing Schemata — A Framework for Specification and Analysis of Parsing Algorithms. Springer, Heidelberg (1997)MATHGoogle Scholar
  14. 14.
    Sikkel, K.: Parsing schemata and correctness of parsing algorithms. Theoretical Computer Science 199(1–2), 87–103 (1998)MathSciNetCrossRefMATHGoogle Scholar
  15. 15.
    van der Spek, P., Plat, N., Pronk, N.: Syntax Error Repair for a Java-based Parser Generator. ACM SIGPLAN Notices 40(4), 47–50 (2005)CrossRefGoogle Scholar
  16. 16.
    Vilares, M., Darriba, V.M., Ribadas, F.J.: Regional least-cost error repair. In: Yu, S., Păun, A. (eds.) CIAA 2000. LNCS, vol. 2088, pp. 293–301. Springer, Heidelberg (2001)CrossRefGoogle Scholar
  17. 17.
    Vilares, M., Darriba, V.M., Vilares, J., Ribadas, F.J.: A formal frame for robust parsing. Theoretical Computer Science 328, 171–186 (2004)MathSciNetCrossRefMATHGoogle Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2009

Authors and Affiliations

  • Carlos Gómez-Rodríguez
    • 1
  • Miguel A. Alonso
    • 1
  • Manuel Vilares
    • 2
  1. 1.Departamento de ComputaciónUniversidade da CoruñaSpain
  2. 2.Escuela Superior de Ingeniería InformáticaUniversidade de VigoSpain

Personalised recommendations