Abstract
The power of state-of-the-art automated and interactive theorem provers has reached the level at which a significant portion of non-trivial mathematical contents can be formalized almost fully automatically. In this paper we present our framework for the formalization of mathematical knowledge that can produce machine verifiable proofs (for different proof assistants) but also human-readable (nearly textbook-like) proofs. As a case study, we focus on one of the twentieth century classics – a book on Tarski’s geometry. We tried to automatically generate such proofs for the theorems from this book using resolution theorem provers and a coherent logic theorem prover. In the first experiment, we used only theorems from the book, in the second we used additional lemmas from the existing Coq formalization of the book, and in the third we used specific dependency lists from the Coq formalization for each theorem. The results show that 37 % of the theorems from the book can be automatically proven (with readable and machine verifiable proofs generated) without any guidance, and with additional lemmas this percentage rises to 42 %. These results give hope that the described framework and other forms of automation can significantly aid mathematicians in developing formal and informal mathematical knowledge.
Similar content being viewed by others
References
Avigad, J., Dean, E., Mumma, J.: A formal system for Euclid’s elements. Rev. Symb. Log. 2(4), 700–768 (2009)
Beeson, M.: Proof and computation in geometry. In: Automated Deduction in Geometry – ADG 2012, volume 7993 of Lecture Notes in Computer Science, pp. 1–30. Springer (2013)
Beeson, M., Wos, L.: OTTER Proofs in Tarskian geometry. In: Automated Reasoning - 7th International Joint Conference, IJCAR 2014, volume 8562 of Lecture Notes in Computer Science, pp. 495–510. Springer (2014)
Blanchette, J. C. : Redirecting proofs by Contradiction. In: Third International Workshop on Proof Exchange for Theorem Proving, PxTP 2013, Lake Placid, NY, USA, June 9-10, 2013, volume 14 of EPiC Series, pp. 11–26. EasyChair (2013)
Bezem, M., Coquand, T.: Automating coherent logic. In: 12th International Conference on Logic for Programming, Artificial Intelligence, and Reasoning — LPAR 2005, volume 3835 of Lecture Notes in Computer Science. Springer-Verlag (2005)
Bezem, M, Hendriks, D. : On the mechanization of the proof of Hessenberg’s theorem in coherent logic. J. Autom. Reason. 40(1) (2008)
Blanchette, J.C., Böhme, S., Paulson, L.C.: Extending sledgehammer with SMT solvers. J. Autom. Reason. 51 (1), 109–128 (2013)
Blanchette, J.C., Bulwahn, L., Nipkow, T.: Automatic proof and disproof in Isabelle/HOL. In: Frontiers of Combining Systems, 8th International Symposium, Proceedings, volume 6989 of Lecture Notes in Computer Science, pp. 12–27. Springer (2011)
Boutry, P., Narboux, J., Schreck, P., Braun, G.: A short note about case distinctions in Tarski’s geometry. 10th International Workshop on Automated Deduction in Geometry (ADG 2014), pp. 51-66. TR 2014/01, University of Coimbra (2014)
Boutry, P., Narboux, J., Schreck, P., Braun, G.: Using small scale automation to improve both accessibility and readability of formal proofs in geometry. 10th International Workshop on Automated Deduction in Geometry (ADG 2014), pp. 31-50. TR 2014/01, University of Coimbra (2014)
Braun, G., Narboux, J.: From Tarski to Hilbert. In: Automated Deduction in Geometry – ADG 2012 volume 7993 of Lecture Notes in Computer Science, pp. 89–109. Springer (2013)
Fisher, J., Bezem, M.: Skolem machines and geometric logic. In: 4th International Colloquium on Theoretical Aspects of Computing — ICTAC 2007, volume 4711 of Lecture Notes in Computer Science. Springer-Verlag (2007)
Ganesalingam, M., Gowers, W.T.: A fully automatic problem solver with human-style output. CoRR, abs/1309.4501 (2013)
Gonthier, G., Asperti, A., Avigad, J., Bertot, Y., Cohen, C., Garillot, F., Roux, S.L., Mahboubi, A., O’Connor, R., Biha, S.O., Pasca, I., Rideau, L., Solovyev, A., Tassi, E., Théry, L.: A machine-checked proof of the Odd Order theorem. In: 4th Conference on Interactive Theorem Proving – ITP 2013 volume 7998 of Lecture Notes in Computer Science, pp. 163–179. Springer (2013)
Haragauri N.G.: Contributions to the axiomatic foundations of geometry. PhD thesis. University of California, Berkley (1965)
Hales T.C.: Introduction to the Flyspeck project. In: Mathematics, Algorithms, Proofs, volume 05021 of Dagstuhl Seminar Proceedings. Internationales Begegnungs- und Forschungszentrum für Informatik (IBFI), Schloss Dagstuhl, Germany, p 2006
Hilbert, D.: Grundlagen der Geometrie. Baedeker, Leipzig (1899)
Kaliszyk, C., Urban, J.: Learning-assisted automated reasoning with Flyspeck. CoRR, abs/1211.7012 (2012)
Makarios, T.: A further simplification of Tarski’s axioms of geometry. CoRR, abs/1306.0066 (2013)
Meikle, L., Fleuriot, J.: Formalizing Hilbert’s Grundlagen in Isabelle/Isar. In: Theorem Proving in Higher Order Logics, pp. 319–334 (2003)
Meikle, L., Fleuriot, J.: Mechanical theorem proving in computation geometry. In: Automated Deduction in Geometry – ADG 04, volume 3763 of Lecture Notes in Computer Science, pp. 1–18. Springer-Verlag, November (2005)
Narboux, J.: Mechanical theorem proving in Tarski’s geometry. In: Proceedings of Automatic Deduction in Geometry 06, volume 4869 of Lecture Notes in Artificial Intelligence, pp. 139–156. Springer-Verlag (2007)
Polonsky, A.: Proofs, Types and Lambda Calculus. PhD thesis, University of Bergen (2011)
Quaife, A.: Automated development of Tarski’s geometry. J. Autom. Reason. 5(1), 97–118 (1989)
Riazanov, A., Voronkov, A.: The design and implementation of Vampire. AI Commun. 15(2-3), 91–110 (2002)
Robinson, J.A.: A machine oriented logic based on the resolution principle. J. ACM 12, 23–41 (1965)
Stephan Schulz: E - a brainiac theorem prover. AI Commun 15(2-3), 111–126 (2002)
Schwabhuser, W., Szmielew, W., Tarski, A.: Metamathematische Methoden in der Geometrie. Springer-Verlag, Berlin (1983)
Stojanović, S., Narboux, J., Bezem, M., Janičić, P.: A vernacular for coherent logic. In: Conferences on Intelligent Computer Mathematics, volume 8543 of Lecture Notes in Computer Science, pp. 388–403. Springer (2014)
Stojanović, S., Pavlović, V., Janičić, P.: A coherent logic based geometry theorem prover capable of producing formal and readable proofs. In: Automated Deduction in Geometry - ADG 2010, volume 6877 of Lecture Notes in Computer Science. Springer (2011)
Sutcliffe, G.: The TPTP problem library and associated infrastructure: The FOF and CNF parts, v3.5.0. J. Autom. Reason. 43(4), 337–362 (2009)
Tankink, C., Kaliszyk, C., Urban, J., Geuvers, H.: Communicating formal proofs: The case of Flyspeck. In: Interactive Theorem Proving - 4th International Conference, Proceedings, volume 7998 of Lecture Notes in Computer Science, pp. 451–456. Springer (2013)
Tarski, A. : What is elementary geometry? . In: P. Suppes , L. Henkin, A. Tarski (eds.) The axiomatic Method, with special reference to Geometry and Physics, pp 16–29. Amsterdam , North-Holland (1959)
Tarski, A., Givant, S.: Tarski’s system of geometry. Bull. Symb. Log. 5(2) (1999)
Weidenbach, C., Dimova, D., Fietzke, A., Kumar, R., Suda, M., Wischnewski, P.: Spass version 3.5. In: Automated Deduction - CADE-22 Proceedings, volume 5663 of Lecture Notes in Computer Science, pp. 140–145. Springer (2009)
Markus, W.: Isar - a generic interpretative approach to readable formal proof documents. In: Theorem Proving in Higher Order Logics (TPHOLs’99), volume 1690 of Lecture Notes in Computer Science, pp. 167–184. Springer (1999)
Wiedijk, F. (ed.): The seventeen provers of the World, volume 3600 of Lecture Notes in Computer Science Springer (2006)
Author information
Authors and Affiliations
Corresponding author
Additional information
The research presented in this paper was partly funded by the Serbian-French Technology Co-Operation grant EGIDE/“Pavle Savi´c” 680-00-132/2012-09/12. The first and the third author are partly supported by the grant ON174021 of the Ministry of Science of Serbia.
Rights and permissions
About this article
Cite this article
Ðurđević, S.S., Narboux, J. & Janičić, P. Automated generation of machine verifiable and readable proofs: A case study of Tarski’s geometry. Ann Math Artif Intell 74, 249–269 (2015). https://doi.org/10.1007/s10472-014-9443-5
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10472-014-9443-5