Mechanical certification of FOLID cyclic proofs

Stratulat, Sorin

doi:10.1007/s10472-023-09832-7

Mechanical certification of FOL_ID cyclic proofs

Published: 16 February 2023

Volume 91, pages 651–673, (2023)
Cite this article

Annals of Mathematics and Artificial Intelligence Aims and scope Submit manuscript

Sorin Stratulat ORCID: orcid.org/0000-0002-1670-9474¹

61 Accesses
Explore all metrics

Abstract

Cyclic induction is a powerful reasoning technique that consists in blocking the proof development of certain subgoals already encountered during the proof process. In the setting of first-order logic with inductive definitions and equality (FOL_ID), cyclic proofs can be built automatically by the Cyclist prover, but their implementations are error-prone and the human validation may be tedious. On the other hand, cyclic induction is not yet integrated into certifying proof environments that support first-order logic and inductive definitions, such as Isabelle and Coq. We propose a solution to check, using Coq, the cyclic proofs produced by E-Cyclist, an extension of Cyclist that implements a more efficient soundness validation method, by using the general Noetherian induction principle integrated into Coq. Our work is based on a methodology for certifying first-order formula-based Noetherian induction proofs, such as those based on implicit induction. The advantages of our approach are threefold: - I) The certification of cyclic FOL_ID proofs is mechanical. Coq can validate every single step from the E-Cyclist proofs, as well as the induction arguments; also, it helps to identify errors in a very precise way. - II) There is a great potential for automation. The methodology has already been used to automatically convert to Coq scripts implicit induction proofs. - III) Cyclic induction can be directly performed in Coq. Coq functions are provided to manage the induction part.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Translating Between Implicit and Explicit Versions of Proof

Verification of Certifying Computations through AutoCorres and Simpl

Efficient Certified RAT Verification

Data Availability

The datasets generated during and/or analyzed during the current study, in particular, the full Coq specifications and proof scripts, are archived and made available at https://members.loria.fr/SStratulat/files/ECyclist-coq-certification.zip

References

Brotherston, J.: Sequent calculus proof systems for inductive definitions. PhD thesis, University of Edinburgh (2006)
Brotherston, J., Simpson, A.: Sequent calculi for induction and infinite descent. J. Log. Comput. 21(6), 1177–1216 (2011). https://doi.org/10.1093/logcom/exq052
Article MathSciNet MATH Google Scholar
Gentzen, G.: Untersuchungen über das logische schließen. I. Mathematische Zeitschrift 39, 176–210 (1935). https://doi.org/10.1007/BF01201353
Article MATH Google Scholar
Brotherston, J., Gorogiannis, N., Petersen, R.L.: A generic cyclic theorem prover. In: APLAS-10 (10th Asian Symposium on Programming Languages and Systems). LNCS. https://doi.org/10.1007/978-3-642-35182-2_25, vol. 7705, pp 350–367. Springer (2012)
Michel, M.: Complementation is more difficult with automata on infinite words. Technical report CNET (1988)
Stratulat, S.: Cyclic proofs with ordering constraints. In: Schmidt, R.A., Nalon, C. (eds.) TABLEAUX 2017 (26th International Conference on Automated Reasoning with Analytic Tableaux and Related Methods). LNAI. https://doi.org/10.1007/978-3-319-66902-1_19, vol. 10501, pp 311–327. Springer (2017)
Stratulat, S.: Validating back-links of FOL_ID cyclic pre-proofs. In: Berardi, S., Van Bakel, S. (eds.) CL&C’18 (Seventh International Workshop on Classical Logic and Computation). EPTCS, pp. 39–53. https://doi.org/10.4204/EPTCS.281.4 (2018)
Stratulat, S.: E-Cyclist: Implementation of an efficient validation of FOL_ID cyclic induction reasoning. In: Kutsia, T. (ed.) 9th International Symposium on Symbolic Computation in Software Science. Electronic Proceedings in Theoretical Computer Science, vol. 342, pp. 129–135. https://doi.org/10.4204/EPTCS.342.11 (2021)
The Coq development team: The Coq Reference Manual. INRIA. INRIA. http://coq.inria.fr/doc (2020)
Stratulat, S.: Mechanically certifying formula-based Noetherian induction reasoning. J. Symb. Comput. 80 Part 1, 209–249 (2017). https://doi.org/10.1016/j.jsc.2016.07.014
Article MathSciNet MATH Google Scholar
Stratulat, S.: SPIKE, An automatic theorem prover – revisited. In: SYNASC 2020: Proceedings of the 22nd International Symposium on Symbolic and Numeric Algorithms for Scientific Computing. https://doi.org/10.1109/SYNASC51798.2020.00025, pp 93–96. IEEE Computer Society (2020)
Berardi, S., Tatsuta, M.: Classical system of Martin-Lof’s inductive definitions is not equivalent to cyclic proofs. Logical Methods in Computer Science 15(3). https://doi.org/10.23638/LMCS-15(3:10)2019 (2019)
Henaien, A., Stratulat, S.: Performing implicit induction reasoning with certifying proof environments. In: Bouhoula, A., Ida, T., Kamareddine, F. (eds.) Proceedings Fourth International Symposium on Symbolic Computation in Software Science, Gammarth, Tunisia, 15-17 December 2012. Electronic Proceedings in Theoretical Computer Science, vol. 122, pp. 97–108. https://doi.org/10.4204/EPTCS.122.9 (2013)
Lee, C.S., Jones, N.D., Ben-Amram, A.M.: The size-change principle for program termination. In: POPL ’01: Proceedings of the 28th ACM SIGPLAN-SIGACT Symposium on Principles of Programming Languages. https://doi.org/10.1145/360204.360210, vol. 36, pp 81–92. ACM Press (2001)
Fogarty, S., Vardi, M.Y.: Büchi complementation and size-change termination. In: Kowalewski, S., Philippou, A. (eds.) Tools and Algorithms for the Construction and Analysis of Systems, 15th International Conference, TACAS 2009, Held as Part of the Joint European Conferences on Theory and Practice of Software, ETAPS 2009, York, UK, March 22-29, 2009. Proceedings. Lecture Notes in Computer Science. https://doi.org/10.1007/978-3-642-00768-2_2, vol. 5505, pp 16–30. Springer (2009)
Jones, E., Ong, C. -L., Ramsay, S.J.: Cycleq: an efficient basis for cyclic equational reasoning. In: Jhala, R., Dillig, I. (eds.) PLDI ’22: 43Rd ACM SIGPLAN International Conference on Programming Language Design and Implementation, San Diego, CA, USA, June 13 - 17, 2022, pp. 395–409. ACM. https://doi.org/10.1145/3519939.3523731 (2022)
Wirth, C. -P.: Descente infinie + Deduction. Logic Journal of the IGPL 12(1), 1–96 (2004). https://doi.org/10.1093/jigpal/12.1.1
Article MathSciNet MATH Google Scholar
Stratulat, S.: A Unified View of Induction Reasoning for First-Order Logic. In: Voronkov, A. (ed.) Turing-100 (The Alan Turing Centenary Conference). EPic Series, vol. 10, pp. 326–352. Easychair. https://doi.org/10.29007/nsx4 (2012)
Baader, F., Nipkow, T.: Term Rewriting and All That. Cambridge University Press, Cambridge (1998). https://doi.org/10.1017/CBO9781139172752
Book MATH Google Scholar
Contejean, E., Courtieu, P., Forest, J., Pons, O., Urbain, X.: Certification of automated termination proofs. Frontiers of Combining Systems, 148–162. https://doi.org/10.1007/978-3-540-74621-8_10 (2007)
Contejean, E., Paskevich, A., Urbain, X., Courtieu, P., Pons, O., Forest, J.: A3PAT, an approach for certified automated termination proofs. In: Gallagher, J.P., Voigtländer, J. (eds.) PEPM - Proceedings of the 2010 ACM SIGPLAN Workshop on Partial Evaluation and Program Manipulation, PEPM 2010, Madrid, Spain, pp. 63–72. ACM. https://doi.org/10.1145/1706356.1706370 (2010)
Nipkow, T., Paulson, L.C., Wenzel, M.: Isabelle/HOL — A Proof Assistant for Higher-Order Logic. Lecture Notes in Computer Science, vol. 2283. Springer, New York (2002). https://doi.org/10.1007/3-540-45949-9
MATH Google Scholar

Download references

Author information

Authors and Affiliations

Université de Lorraine, CNRS, LORIA, Metz, F-57000, France
Sorin Stratulat

Authors

Sorin Stratulat
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Sorin Stratulat.

Ethics declarations

Conflict of Interests

The author declares that there are no funding and/or conflicts of interests/competing interests that are relevant to the content of this article.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Appendices

Appendix : A: The Coq proof of F_0 in the main lemma

intros. simpl. intros. rename H into Hind.

(* instantiate y from Q(x,y) *) inversionH1 as [|z]. - (* Q(x,0) *) rewrite \({\leftarrow }\) H. apply q1. - (* Q(x,s(z)) *) apply q2. split.

(* induction step for Q(x,z) *) -- apply (Hind F_0).

--- simpl. left. trivial.

--- rewrite \({\leftarrow }\) H2. unfold snd. unfold F_4. unfold F_0. rewrite_model. abstract solve_rpo_mul.

--- trivial.

--- trivial. (* instantiate x from P(x) *) --rewrite\({\leftarrow }\) H2 inHind. rewrite \({\leftarrow }\) H2 inH1. clearH2 y. inversionH0 as [|y].

---rewrite\({\leftarrow }\) H2. apply p1.

---applyp2. rewrite\({\leftarrow }\) H3in H0. (* instantiate P(s(y)) *)

inversion H0.

+ apply zero_different_from_succ in H4. contradiction.

+ rewrite H4.

++ assert (Q y y ∧ Py). (* induction step for Q y y /∖ P y *)

apply (Hind F_4).

+++ simpl. right. left. trivial.

+++ unfold snd. unfold F_4. unfold F_0. rewrite \({\leftarrow }\) H3. rewrite_model. abstract solve_rpo_mul.

+++ trivial.

+++ destruct H6. split; trivial. apply q2; trivial. split; trivial.

Appendix : B: The missing proofs from Section ??

1.1 B.1: Proof of Lemma 1

Proof

Let us assume two adjacent rb-paths \(r_{1} \rightarrow b_{1}\) and \(r_{2} \rightarrow b_{2}\) in a cycle such that r₂ is the companion of b₁, W(r₁) >_mulW(c(b₁)), and W(r₂) >_mulW(c(b₂)). We will try to prove that W(r₁) >_mulW(c(b₂)).

Since the roots/buds are repeated infinitely along the cycle, none of their weights should be empty. If, by contradiction, we assume that a weight is empty, there exists an rb-path \(r\rightarrow b\) in the cycle such that W(r) is empty. But there is no <_mul such that W(r) > W(c(b)).

We show that there should be a trace along the path [r₁,…,b₁,r₂,…,b₂]. Since W(c(b₂)) is not empty, let l₃ ∈ W(c(b₂)). By the definition of <_mul, there exists an IAA l₂ ∈ W(r₂) such that l₂ > l₃ or l₂ = l₃. Similar reasoning can be applied to l₂ as we have done for l₃ to conclude that there is l₁ ∈ W(r₁) such that l₁ > l₂ or l₁ = l₂. Hence, the trace is l₁,…,l₂,l₂,…,l₃.

Finally, we check whether W(r₁) >_mulW(c(b₂)). As previously, for each IAA l₃ ∈ W(c(b₂)) there is an IAA l₂ ∈ W(r₂) such that l₂ > l₃ or l₂ = l₃, and an IAA l₁ ∈ W(r₁) such that l₁ > l₂ or l₁ = l₂. We perform a case analysis on the comparison results between l₂ and l₃, as well as l₁ and l₂:

if l₂ > l₃ and l₁ > l₂, then l₁ > l₃;
if l₂ > l₃ and l₁ = l₂, then l₁ > l₃;
if l₂ = l₃ and l₁ > l₂, then l₁ > l₃;
if l₂ = l₃ and l₁ = l₂, then l₁ = l₃.

We show that W(r₁) >_mulW(c(b₂)) when there is at least one l₃ ∈ W(c(b₂)) for which there exists l₁ ∈ W(r₁) such that l₁ > l₃. After the pairwise deletion of equal IAAs from W(r₁) and W(c(b₂)), resulting W(r₁)^′ and W(c(b₂))^′, we have that l₃ ∈ W(c(b₂))^′, l₁ ∈ W(r₁)^′ and l₁ > l₃. By the definition of >_mul, we have that W(r₁) >_mulW(c(b₂)), because the same reasoning can be done for any other IAA from W(c(b₂))^′ as for l₃.

If for all l₃ ∈ W(c(b₂)), there exists l₁ ∈ W(r₁) such that l₃ = l₁, then there exists l₂ ∈ W(r₂) such that l₁ = l₂ and l₂ = l₃. Since W(r₂) >_mulW(c(b₂)), after the pairwise deletion of equal IAAs from W(r₂) and W(c(b₂)), resulting W(r₂)^′ and W(c(b₂))^′, we have that W(c(b₂))^′ is empty and W(r₂)^′ is not empty. Similarly, since W(r₁) >_mulW(c(b₁)), we have that W(c(b₁))^′ is empty and W(r₁)^′ is not empty. We conclude that after the pairwise deletion of equal IAAs from W(r₁) and W(c(b₂)), resulting W(r₁)^′ and W(c(b₂))^′, we have that W(c(b₂))^′ is empty and W(r₁)^′ is not empty. Hence W(r₁) >_mulW(c(b₂)), as required. □

1.2 B.2: Proof of Theorem 1

Proof

Let be a pre-proof whose normalized pre-proof is denoted by \(\mathcal {P}\) and for which every rb-path \(r\rightarrow b\) belonging to a cycle satisfies W(r) >_mulW(c(b)). Let also p₀ be an infinite path in \(\mathcal {P}\). Let p be the infinite path from \(\mathcal {P}\) built from p₀ by duplicating the nodes as shown during the normalization process from Section ??. By construction, the path p is built starting from some point only from the concatenations of rb-paths from cycles from \(\mathcal {P}\). Since the number of roots is finite in \(\mathcal {P}\), there is a root r in p that occurs infinitely often in p. Hence, there is an infinite sub-path p^′ of p of the form [r,…,r,…] which can be represented as the infinite concatenation of finite sub-paths [r,…,b], where r is the companion of b and r occurs only once in each sub-path. Each such sub-path is built from a finite number of concatenations of rb-paths of the form [r₁,…,b₁] for which W(r₁) >_mulW(c(b₁)). Since <_mul is transitive, by Lemma 1, we have that W(r) >_mulW(c(b)) for each sub-path [r,…,r,…,b]. Since W(r) is not empty, for the same reasons as presented in the proof of Lemma 1, there is a trace following \(p^{\prime }\).

By contradiction, we assume that the number of progress points in all the traces along \(p^{\prime }\) is finite. Since the cardinality of the weights for each root of the trace along \(p^{\prime }\) is finite, there is a sub-path [r,…,b] defined as above whose traces have no progress points and satisfies W(r) >_mulW(c(b)). After the pairwise deletion of equal IAAs from W(r) and W(c(b)), to get W(r)^′ and W(c(b))^′, we perform a case analysis by considering whether W(c(b))^′ is empty or not.

W(c(b))^′ is not empty. By the definition of >_mul, there is an IAA l ∈ W(c(b))^′ for which there is an IAA \(l^{\prime }\in W(r)'\) such that \(l^{\prime }>l\). Hence, the trace leading \(l^{\prime }\) to l has at least one progress point. Contradiction.
W(c(b))^′ is empty. By similar reasoning as given in the proof of Lemma 1, the cardinality of W(r) is greater than that of W(c(b)). But W(r) = W(c(b)) because r is the companion of b. Contradiction, again.

Since there is a trace along \(p^{\prime }\) that has an infinite number of progress points, p has also an infinitely progressing trace starting from some point. On the other hand, p₀ can be built from p by deleting the extra nodes added during the normalization process, so it has an infinitely progressing trace starting from some point.

We conclude that \(\mathcal {P}\) satisfies the global trace condition. □

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Stratulat, S. Mechanical certification of FOL_ID cyclic proofs. Ann Math Artif Intell 91, 651–673 (2023). https://doi.org/10.1007/s10472-023-09832-7

Download citation

Accepted: 13 January 2023
Published: 16 February 2023
Issue Date: October 2023
DOI: https://doi.org/10.1007/s10472-023-09832-7

Keywords

Mathematics Subject Classification (2010)

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Mechanical certification of FOL_ID cyclic proofs

Abstract

Access this article

Similar content being viewed by others

Translating Between Implicit and Explicit Versions of Proof

Verification of Certifying Computations through AutoCorres and Simpl

Efficient Certified RAT Verification

Data Availability

References

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of Interests

Additional information

Publisher’s note

Appendices

Appendix : A: The Coq proof of F_0 in the main lemma

Appendix : B: The missing proofs from Section ??

1.1 B.1: Proof of Lemma 1

Proof

1.2 B.2: Proof of Theorem 1

Proof

Rights and permissions

About this article

Cite this article

Keywords

Mathematics Subject Classification (2010)

Navigation

Mechanical certification of FOLID cyclic proofs

Abstract

Access this article

Similar content being viewed by others

Translating Between Implicit and Explicit Versions of Proof

Verification of Certifying Computations through AutoCorres and Simpl

Efficient Certified RAT Verification

Data Availability

References

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of Interests

Additional information

Publisher’s note

Appendices

Appendix : A: The Coq proof of F_0 in the main lemma

Appendix : B: The missing proofs from Section ??

1.1 B.1: Proof of Lemma 1

Proof

1.2 B.2: Proof of Theorem 1

Proof

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Mathematics Subject Classification (2010)

Search

Navigation

Mechanical certification of FOL_ID cyclic proofs