Evaluation of Automated Theorem Proving on the Mizar Mathematical Library

  • Josef Urban
  • Krystof Hoder
  • Andrei Voronkov
Part of the Lecture Notes in Computer Science book series (LNCS, volume 6327)

Abstract

This paper investigates the strength of first-order automatic theorem provers (ATPs) in proving theorems and lemmas from the Mizar proof assistant’s formal mathematical library. Several Mizar use-cases are described and evaluated, as well as various ATP systems and strategies. The new version of the leading Vampire ATP system is included in the evaluation, experiments with Mizar-specific strategy-selection are performed with E the prover, and the SInE axiom selection is evaluated on large Mizar problems with both E and Vampire. A rough mathematical division of the Mizar library is introduced, and the ATP performance is evaluated on it.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. [Dav81]
    Davis, M.: Obvious logical inferences. In: Hayes, P.J. (ed.) IJCAI, pp. 530–531. William Kaufmann, San Francisco (1981)Google Scholar
  2. [Din07]
    Ding, Y.: Several classes of BCI-algebras and their properties. Formalized Mathematics 15(1), 1–9 (2007)CrossRefGoogle Scholar
  3. [KRS90]
    Kotowicz, J., Raczkowski, K., Sadowski, P.: Average value theorems for real functions of one variable. Formalized Mathematics 1(4), 803–805 (1990)Google Scholar
  4. [MJWD06]
    Matuszek, C., Cabral, J., Witbrock, M., DeOliveira, J.: An Introduction to the Syntax and Content of Cyc. In: Baral, C. (ed.) Proceedings of the 2006 AAAI Spring Symposium on Formalizing and Compiling Background Knowledge and Its Applications to Knowledge Representation and Question Answering, pp. 44–49 (2006)Google Scholar
  5. [MP09]
    Meng, J., Paulson, L.C.: Lightweight relevance filtering for machine-generated resolution problems. J. Applied Logic 7(1), 41–57 (2009)MATHCrossRefMathSciNetGoogle Scholar
  6. [PS07]
    Pease, A., Sutcliffe, G.: First Order Reasoning on a Large Ontology. In: Urban, J., Sutcliffe, G., Schulz, S. (eds.) Proceedings of the CADE-21 Workshop on Empirically Successful Automated Reasoning in Large Theories (2007)Google Scholar
  7. [Ric07]
    Riccardi, M.: The Sylow theorems. Formalized Mathematics 15(3), 159–165 (2007)CrossRefGoogle Scholar
  8. [Rud87]
    Rudnicki, P.: Obvious inferences. J. Autom. Reasoning 3(4), 383–393 (1987)MATHCrossRefMathSciNetGoogle Scholar
  9. [RV02]
    Riazanov, A., Voronkov, A.: The design and implementation of VAMPIRE. Journal of AI Communications 15(2-3), 91–110 (2002)MATHGoogle Scholar
  10. [Sch02]
    Schulz, S.: E – a brainiac theorem prover. Journal of AI Communications 15(2-3), 111–126 (2002)MATHGoogle Scholar
  11. [SP07]
    Sutcliffe, G., Puzis, Y.: SRASS - a semantic relevance axiom selection system. In: Pfenning, F. (ed.) CADE 2007. LNCS (LNAI), vol. 4603, pp. 295–310. Springer, Heidelberg (2007)CrossRefGoogle Scholar
  12. [Sut06]
    Sutcliffe, G.: Semantic Derivation Verification. International Journal on Artificial Intelligence Tools 15(6), 1053–1070 (2006)CrossRefGoogle Scholar
  13. [Urb06]
    Urban, J.: MPTP 0.2: Design, implementation, and initial experiments. J. Autom. Reasoning 37(1-2), 21–43 (2006)MATHCrossRefGoogle Scholar
  14. [US08]
    Urban, J., Sutcliffe, G.: ATP-based cross-verification of Mizar proofs: Method, systems, and first experiments. Mathematics in Computer Science 2(2), 231–251 (2008)MATHCrossRefMathSciNetGoogle Scholar
  15. [US10]
    Urban, J., Sutcliffe, G.: Automated reasoning and presentation support for formalizing mathematics in Mizar. In: Autexier, S., Calmet, J., Delahaye, D., Ion, P.D.F., Rideau, L., Rioboo, R., Sexton, A.P. (eds.) AISC 2010. LNCS (LNAI), vol. 6167, pp. 132–146. Springer, Heidelberg (2010)Google Scholar
  16. [USPV08]
    Urban, J., Sutcliffe, G., Pudlak, P., Vyskocil, J.: MaLARea SG1: Machine Learner for Automated Reasoning with Semantic Guidance. In: Armando, A., Baumgartner, P., Dowek, G. (eds.) IJCAR 2008. LNCS (LNAI), vol. 5195, pp. 441–456. Springer, Heidelberg (2008)CrossRefGoogle Scholar
  17. [VSU10]
    Vyskocil, J., Stanovsky, D., Urban, J.: Automated proof shortening by invention of new definitions. In: LPAR 2010. LNCS (LNAI). Springer, Heidelberg (to appear 2010)Google Scholar
  18. [WDF+09]
    Weidenbach, C., Dimova, D., Fietzke, A., Kumar, R., Suda, M., Wischnewski, P.: SPASS version 3.5. In: Schmidt, R.A. (ed.) Automated Deduction – CADE-22. LNCS, vol. 5663, pp. 140–145. Springer, Heidelberg (2009)CrossRefGoogle Scholar
  19. [Wie00]
    Wiedijk, F.: CHECKER - notes on the basic inference step in Mizar (2000), http://www.cs.kun.nl/~freek/mizar/by.dvi
  20. [Wie07]
    Wiedijk, F.: Arrow’s impossibility theorem. Formalized Mathematics 15(4), 171–174 (2007)CrossRefGoogle Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2010

Authors and Affiliations

  • Josef Urban
    • 1
  • Krystof Hoder
    • 2
  • Andrei Voronkov
    • 2
  1. 1.Radboud UniversityNijmegen
  2. 2.University of Manchester 

Personalised recommendations