Skip to main content
Log in

Automated generation of machine verifiable and readable proofs: A case study of Tarski’s geometry

  • Published:
Annals of Mathematics and Artificial Intelligence Aims and scope Submit manuscript

Abstract

The power of state-of-the-art automated and interactive theorem provers has reached the level at which a significant portion of non-trivial mathematical contents can be formalized almost fully automatically. In this paper we present our framework for the formalization of mathematical knowledge that can produce machine verifiable proofs (for different proof assistants) but also human-readable (nearly textbook-like) proofs. As a case study, we focus on one of the twentieth century classics – a book on Tarski’s geometry. We tried to automatically generate such proofs for the theorems from this book using resolution theorem provers and a coherent logic theorem prover. In the first experiment, we used only theorems from the book, in the second we used additional lemmas from the existing Coq formalization of the book, and in the third we used specific dependency lists from the Coq formalization for each theorem. The results show that 37 % of the theorems from the book can be automatically proven (with readable and machine verifiable proofs generated) without any guidance, and with additional lemmas this percentage rises to 42 %. These results give hope that the described framework and other forms of automation can significantly aid mathematicians in developing formal and informal mathematical knowledge.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. Avigad, J., Dean, E., Mumma, J.: A formal system for Euclid’s elements. Rev. Symb. Log. 2(4), 700–768 (2009)

    Article  MathSciNet  MATH  Google Scholar 

  2. Beeson, M.: Proof and computation in geometry. In: Automated Deduction in Geometry – ADG 2012, volume 7993 of Lecture Notes in Computer Science, pp. 1–30. Springer (2013)

  3. Beeson, M., Wos, L.: OTTER Proofs in Tarskian geometry. In: Automated Reasoning - 7th International Joint Conference, IJCAR 2014, volume 8562 of Lecture Notes in Computer Science, pp. 495–510. Springer (2014)

  4. Blanchette, J. C. : Redirecting proofs by Contradiction. In: Third International Workshop on Proof Exchange for Theorem Proving, PxTP 2013, Lake Placid, NY, USA, June 9-10, 2013, volume 14 of EPiC Series, pp. 11–26. EasyChair (2013)

  5. Bezem, M., Coquand, T.: Automating coherent logic. In: 12th International Conference on Logic for Programming, Artificial Intelligence, and Reasoning — LPAR 2005, volume 3835 of Lecture Notes in Computer Science. Springer-Verlag (2005)

  6. Bezem, M, Hendriks, D. : On the mechanization of the proof of Hessenberg’s theorem in coherent logic. J. Autom. Reason. 40(1) (2008)

  7. Blanchette, J.C., Böhme, S., Paulson, L.C.: Extending sledgehammer with SMT solvers. J. Autom. Reason. 51 (1), 109–128 (2013)

    Article  Google Scholar 

  8. Blanchette, J.C., Bulwahn, L., Nipkow, T.: Automatic proof and disproof in Isabelle/HOL. In: Frontiers of Combining Systems, 8th International Symposium, Proceedings, volume 6989 of Lecture Notes in Computer Science, pp. 12–27. Springer (2011)

  9. Boutry, P., Narboux, J., Schreck, P., Braun, G.: A short note about case distinctions in Tarski’s geometry. 10th International Workshop on Automated Deduction in Geometry (ADG 2014), pp. 51-66. TR 2014/01, University of Coimbra (2014)

  10. Boutry, P., Narboux, J., Schreck, P., Braun, G.: Using small scale automation to improve both accessibility and readability of formal proofs in geometry. 10th International Workshop on Automated Deduction in Geometry (ADG 2014), pp. 31-50. TR 2014/01, University of Coimbra (2014)

  11. Braun, G., Narboux, J.: From Tarski to Hilbert. In: Automated Deduction in Geometry – ADG 2012 volume 7993 of Lecture Notes in Computer Science, pp. 89–109. Springer (2013)

  12. Fisher, J., Bezem, M.: Skolem machines and geometric logic. In: 4th International Colloquium on Theoretical Aspects of Computing — ICTAC 2007, volume 4711 of Lecture Notes in Computer Science. Springer-Verlag (2007)

  13. Ganesalingam, M., Gowers, W.T.: A fully automatic problem solver with human-style output. CoRR, abs/1309.4501 (2013)

  14. Gonthier, G., Asperti, A., Avigad, J., Bertot, Y., Cohen, C., Garillot, F., Roux, S.L., Mahboubi, A., O’Connor, R., Biha, S.O., Pasca, I., Rideau, L., Solovyev, A., Tassi, E., Théry, L.: A machine-checked proof of the Odd Order theorem. In: 4th Conference on Interactive Theorem Proving – ITP 2013 volume 7998 of Lecture Notes in Computer Science, pp. 163–179. Springer (2013)

  15. Haragauri N.G.: Contributions to the axiomatic foundations of geometry. PhD thesis. University of California, Berkley (1965)

    MATH  Google Scholar 

  16. Hales T.C.: Introduction to the Flyspeck project. In: Mathematics, Algorithms, Proofs, volume 05021 of Dagstuhl Seminar Proceedings. Internationales Begegnungs- und Forschungszentrum für Informatik (IBFI), Schloss Dagstuhl, Germany, p 2006

  17. Hilbert, D.: Grundlagen der Geometrie. Baedeker, Leipzig (1899)

    Google Scholar 

  18. Kaliszyk, C., Urban, J.: Learning-assisted automated reasoning with Flyspeck. CoRR, abs/1211.7012 (2012)

  19. Makarios, T.: A further simplification of Tarski’s axioms of geometry. CoRR, abs/1306.0066 (2013)

  20. Meikle, L., Fleuriot, J.: Formalizing Hilbert’s Grundlagen in Isabelle/Isar. In: Theorem Proving in Higher Order Logics, pp. 319–334 (2003)

  21. Meikle, L., Fleuriot, J.: Mechanical theorem proving in computation geometry. In: Automated Deduction in Geometry – ADG 04, volume 3763 of Lecture Notes in Computer Science, pp. 1–18. Springer-Verlag, November (2005)

  22. Narboux, J.: Mechanical theorem proving in Tarski’s geometry. In: Proceedings of Automatic Deduction in Geometry 06, volume 4869 of Lecture Notes in Artificial Intelligence, pp. 139–156. Springer-Verlag (2007)

  23. Polonsky, A.: Proofs, Types and Lambda Calculus. PhD thesis, University of Bergen (2011)

  24. Quaife, A.: Automated development of Tarski’s geometry. J. Autom. Reason. 5(1), 97–118 (1989)

    Article  MathSciNet  Google Scholar 

  25. Riazanov, A., Voronkov, A.: The design and implementation of Vampire. AI Commun. 15(2-3), 91–110 (2002)

    Google Scholar 

  26. Robinson, J.A.: A machine oriented logic based on the resolution principle. J. ACM 12, 23–41 (1965)

    Article  Google Scholar 

  27. Stephan Schulz: E - a brainiac theorem prover. AI Commun 15(2-3), 111–126 (2002)

    MATH  Google Scholar 

  28. Schwabhuser, W., Szmielew, W., Tarski, A.: Metamathematische Methoden in der Geometrie. Springer-Verlag, Berlin (1983)

    Book  MATH  Google Scholar 

  29. Stojanović, S., Narboux, J., Bezem, M., Janičić, P.: A vernacular for coherent logic. In: Conferences on Intelligent Computer Mathematics, volume 8543 of Lecture Notes in Computer Science, pp. 388–403. Springer (2014)

  30. Stojanović, S., Pavlović, V., Janičić, P.: A coherent logic based geometry theorem prover capable of producing formal and readable proofs. In: Automated Deduction in Geometry - ADG 2010, volume 6877 of Lecture Notes in Computer Science. Springer (2011)

  31. Sutcliffe, G.: The TPTP problem library and associated infrastructure: The FOF and CNF parts, v3.5.0. J. Autom. Reason. 43(4), 337–362 (2009)

    Article  Google Scholar 

  32. Tankink, C., Kaliszyk, C., Urban, J., Geuvers, H.: Communicating formal proofs: The case of Flyspeck. In: Interactive Theorem Proving - 4th International Conference, Proceedings, volume 7998 of Lecture Notes in Computer Science, pp. 451–456. Springer (2013)

  33. Tarski, A. : What is elementary geometry? . In: P. Suppes , L. Henkin, A. Tarski (eds.) The axiomatic Method, with special reference to Geometry and Physics, pp 16–29. Amsterdam , North-Holland (1959)

  34. Tarski, A., Givant, S.: Tarski’s system of geometry. Bull. Symb. Log. 5(2) (1999)

  35. Weidenbach, C., Dimova, D., Fietzke, A., Kumar, R., Suda, M., Wischnewski, P.: Spass version 3.5. In: Automated Deduction - CADE-22 Proceedings, volume 5663 of Lecture Notes in Computer Science, pp. 140–145. Springer (2009)

  36. Markus, W.: Isar - a generic interpretative approach to readable formal proof documents. In: Theorem Proving in Higher Order Logics (TPHOLs’99), volume 1690 of Lecture Notes in Computer Science, pp. 167–184. Springer (1999)

  37. Wiedijk, F. (ed.): The seventeen provers of the World, volume 3600 of Lecture Notes in Computer Science Springer (2006)

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Sana Stojanović Ðurđević.

Additional information

The research presented in this paper was partly funded by the Serbian-French Technology Co-Operation grant EGIDE/“Pavle Savi´c” 680-00-132/2012-09/12. The first and the third author are partly supported by the grant ON174021 of the Ministry of Science of Serbia.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Ðurđević, S.S., Narboux, J. & Janičić, P. Automated generation of machine verifiable and readable proofs: A case study of Tarski’s geometry. Ann Math Artif Intell 74, 249–269 (2015). https://doi.org/10.1007/s10472-014-9443-5

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10472-014-9443-5

Keywords

Mathematics Subject Classification (2010)

Navigation