
1 Introduction

Background 1: Lexicographic RFs with Different Non-negativity Conditions. The ranking function (RF) is one of the most well-studied tools for verifying program termination. An RF is typically a real-valued function over program states that satisfies: (a) the ranking condition, which requires an RF to decrease its value by a constant through each transition; and (b) the non-negativity condition, which imposes a lower bound on the value of the RF so that its infinite descent through transitions is prohibited. The existence of such a function implies termination of the underlying program, and therefore, one can automate verification of program termination via RF synthesis algorithms.

Improving the applicability of RF synthesis algorithms, i.e., making them able to prove termination of a wider variety of programs, is one of the core interests in the study of RFs. A lexicographic extension of RF (LexRF) [8, 10] is known as a simple but effective approach to the problem. Here, a LexRF is a function to real-valued vectors instead of the reals, and its ranking condition is imposed with respect to the lexicographic order. For example, the value of a LexRF may change from (1, 1, 1) to (1, 0, 2) through a state transition; here, the value “lexicographically decreases by 1” through the transition, that is, it decreases by 1 in some dimension while being non-increasing in every dimension to the left of it. LexRF is particularly good at handling nested structures of programs, as vectors can measure the progress of different “phases” of programs separately. LexRF is also used in top-performing termination provers (e.g., [1]).
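The lexicographic decrease described above can be sketched as a small check. The following is a minimal illustration (the function name and encoding are ours, not from the paper): a vector transition is "ranked" if some dimension decreases by at least 1 while all dimensions to its left are non-increasing.

```python
def lex_decreases(before, after):
    """Return True iff `after` lexicographically decreases from `before`
    by at least 1, in the sense used for LexRFs: some dimension k drops
    by >= 1 while every dimension left of k is non-increasing."""
    for b, a in zip(before, after):
        if a <= b - 1:   # strict decrease by 1 in this dimension;
            return True  # all dimensions to its left were non-increasing
        if a > b:        # an increase before any ranking dimension: reject
            return False
    return False         # no dimension ranked the transition
```

For instance, `lex_decreases((1, 1, 1), (1, 0, 2))` returns `True`, matching the example in the text: the second dimension drops by 1 while the first is unchanged, and the increase in the third dimension is irrelevant.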

There are several known ways to impose non-negativity on LexRFs (see also Fig. 1): (a) Strong non-negativity, which requires non-negativity in every dimension of the LexRF; (b) leftward non-negativity, which requires non-negativity on the left of the ranking dimension of each transition, i.e., the dimension where the value of the LexRF should strictly decrease through the transition; and (c) single-component non-negativity, which requires non-negativity only in the ranking dimensions. It is known that any of these non-negativity conditions makes the resulting LexRF sound [8, 10], i.e., a program indeed terminates whenever it admits a LexRF with any of these non-negativity conditions. For better applicability, single-component non-negativity is the most preferred, as it is the weakest constraint among the three.
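At a single state, the three conditions can be sketched as follows; this is an illustrative encoding of ours, where `eta` is the tuple of component values at the state and `lv` is the (1-indexed) ranking dimension of the transition under consideration.

```python
def st_nonneg(eta):
    # strong: non-negative in every dimension
    return all(v >= 0 for v in eta)

def lw_nonneg(eta, lv):
    # leftward: non-negative in dimensions 1..lv
    # (the ranking dimension and everything to its left)
    return all(v >= 0 for v in eta[:lv])

def sc_nonneg(eta, lv):
    # single-component: non-negative only in the ranking dimension
    return eta[lv - 1] >= 0
```

Note the strict ordering of strength: strong implies leftward, which implies single-component; e.g., `eta = (2, -1, 3)` with `lv = 3` fails the first two checks but passes the third.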

Fig. 1. A demo of different non-negativity conditions for LexRFs. The ranking dimensions of the LexRF \(\boldsymbol{\eta }\) are indicated by underlines, and the last column of the table shows where each condition requires \(\boldsymbol{\eta }\) to be non-negative.

Background 2: Probabilistic Programs and Lexicographic RSMs. One can naturally consider a probabilistic counterpart of the above argument, namely over probabilistic programs that admit randomization in conditional branching and variable updates. The notion of RF is then generalized to the Ranking SuperMartingale (RSM), a function similar to an RF except that the ranking condition requires an RSM to decrease its value in expectation. The existence of an RSM typically implies almost-sure termination of the underlying program, i.e., termination of the program with probability 1.

Such a probabilistic extension has been actively studied; indeed, probabilistic programs are used in, e.g., stochastic network protocols [33], randomized algorithms [18, 26], security [6, 7, 27], and planning [11], and there is a rich body of studies on RSMs as tools for automated verification of probabilistic programs (see Sect. 8). Similar to the RF case, a lexicographic extension of RSM (LexRSM, [2, 15]) is an effective approach to improving its applicability. In addition to its advantages over nested structures, LexRSM can also witness almost-sure termination of certain probabilistic programs with infinite expected runtime [2, Fig. 2]; certifying such programs is known as a major challenge for RSMs.

Problem: Sound Probabilistic Extension of LexRF with Weaker Non-negativity. Strongly non-negative LexRF soundly extends to LexRSM in a canonical way [2], i.e., basically by changing the clause “decrease by a constant” in the ranking condition of LexRF to “decrease by a constant in expectation”. In contrast, the similar extension of leftward or single-component non-negative LexRF yields an unsound LexRSM notion [15, 20]. To date, the sound LexRSM with the weakest non-negativity in the literature is the Generalized LexRSM (GLexRSM) [15], which demands leftward non-negativity and an additional condition, so-called expected leftward non-negativity. Roughly speaking, the latter requires LexRSMs to be non-negative in each dimension (in expectation) upon “exiting” the left of the ranking dimension. For example, in Fig. 1, it requires \(b_2\) to be non-negative, as the second dimension of \(\boldsymbol{\eta }\) “exits” the left of the ranking dimension upon the transition \(\ell _1 \rightarrow \ell _2\). GLexRSM does not generalize either leftward or single-component non-negative LexRF, in the sense that the former is strictly more restrictive than the latter two when considered over non-probabilistic programs.

These results do not mean that leftward or single-component non-negative LexRF can never be extended to LexRSM, however. More concretely, the following problem is valid (see the last paragraph of Sect. 3 for a formal argument):

figure c


We are motivated to study this problem for a couple of reasons. First, it is a paraphrase of the following fundamental question: when do negative values of a (Lex)RSM cause trouble, say, to its soundness? This is a typical example of a question in the study of RSMs that becomes challenging due to their probabilistic nature. The question also appears in other topics in RSM theory; for example, it is known that the classical variant rule of Floyd-Hoare logic does not extend to almost-sure termination of probabilistic programs in a canonical way [24], due to the complicated treatment of negativity in RSMs. To our knowledge, this question has only been considered in an ad-hoc manner through counterexamples (e.g., [15, 20, 24]), and we do not yet have a systematic approach to answering it.

Fig. 2. A probabilistic modification of speedDis1 [4], where \( Unif [a,b]\) is a uniform sampling from the (continuous) interval \([a,b]\). Inequalities on the right represent invariants. While \(\boldsymbol{\eta }\) is not a GLexRSM, it is an LLexRSM we propose; thus it witnesses almost-sure termination of the program.

Second, relaxing the non-negativity condition of LexRSM is highly desirable if we wish to fully unlock the benefit of the lexicographic extension in automated verification. A motivating example is given in Fig. 2. The probabilistic program in Fig. 2 terminates almost-surely, but it does not admit any linear GLexRSM (and hence, the GLexRSM synthesis algorithms in [15] cannot witness its almost-sure termination); for example, the function \(\boldsymbol{\eta }\) ranks every transition of the program, but violates both leftward and expected leftward non-negativity at the transition \(\ell _1 \rightarrow \ell _2\) (note \(\boldsymbol{\eta }\) ranks this transition in the third dimension; to check the violation of expected leftward non-negativity, also note \(\boldsymbol{\eta }\) ranks \(\ell _2 \rightarrow \ell _4\) in the first dimension). Here, the source of the problem is that the program has two variables whose progress must be measured (i.e., increment y to 10 in \(\ell _3\); and increment x to 5 in \(\ell _4\)), but one of their progress measures can be arbitrarily small during the program execution (y can be initialized with any value). Not only is this structure rather fundamental, but it is also expected that our desired LexRSM, if it exists, could handle it. Indeed, modify the probabilistic program in Fig. 2 into a non-probabilistic one by changing “\( Unif [1,2]\)” to “1”; then the program admits \(\boldsymbol{\eta }\) as a single-component non-negative LexRF.

Contributions. In this paper, we are the first to introduce sound LexRSM notions that instantiate single-component non-negative LexRF. Our contributions are threefold, as we state below.

  • First, in response to the first motivation we stated above, we devise a novel notion of fixability as a theoretical tool to analyze whether negative values of a LexRSM “cause trouble”. Roughly speaking, we identify the source of the trouble as “ill” exploitation of unbounded negativity of LexRSM; our \(\varepsilon \)-fixing operation prohibits such exploitation by basically setting all the negative values of a LexRSM to the same negative value \(-\varepsilon \), and we say a LexRSM is \(\varepsilon \)-fixable if it retains the ranking condition through such a transformation. We give more details about its concept and key ideas in Sect. 2. The soundness of \(\varepsilon \)-fixable LexRSM immediately follows from that of strongly non-negative one [2] because any LexRSM becomes strongly non-negative through the \(\varepsilon \)-fixing operation (after globally adding \(\varepsilon \)). Fixable LexRSM instantiates single-component non-negative LexRF for general stochastic processes (Theorem 4.3), while also serving as a technical basis for proving the soundness of other LexRSMs. Meanwhile, fixable LexRSM cannot be directly applied to automated verification algorithms due to the inherent non-linearity of \(\varepsilon \)-fixing; this observation leads us to our second contribution.

  • Second, in response to the second motivation we stated above, we introduce Lazy LexRSM (LLexRSM) as another LexRSM notion that instantiates single-component non-negative LexRF. LLexRSM does not involve the \(\varepsilon \)-fixing operation in its definition; thanks to this property, we have a subclass of LLexRSM that is amenable to automated synthesis via linear programming (see Sect. 6). The LLexRSM condition consists of the single-component non-negative LexRSM condition and stability at negativity we propose (Definition 5.1), which roughly requires the following: Once the value of a LexRSM gets negative in some dimension, it must stay negative until that dimension exits the left of the ranking one. For example, \(\boldsymbol{\eta }\) in Fig. 2 is an LLexRSM; indeed, \(\ell _2 \rightarrow \ell _4\) and \(\ell _1 \rightarrow \ell _5\) are the only transitions where \(\boldsymbol{\eta }\) possibly changes its value from negative to non-negative in some dimension (namely, the second one), which is, however, to the right of the ranking dimension (the first one). We prove that linear LLexRSM is sound for probabilistic programs over linear arithmetics (see Theorem 5.4 for the exact assumption). The proof is highly nontrivial; it is realized through subtle use of a refined variant of fixability, whose core idea we explain in Sect. 2. Furthermore, Theorem 5.4 shows that expected leftward non-negativity in GLexRSM [15] is actually redundant under the assumption in Theorem 5.4. This is surprising, as expected leftward non-negativity was invented to restore the soundness of leftward non-negative LexRSM, which is generally unsound.
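The "stay negative until exit" requirement above can be sketched per transition. The following is a rough, non-probabilistic reading of ours (the actual Definition 5.1 constrains successor values in a more refined, expectation-aware way; the function name is hypothetical): a dimension may flip from negative to non-negative only if it lies strictly to the right of the transition's ranking dimension.

```python
def stable_at_negativity_step(eta_before, eta_after, lv):
    """Rough per-transition check: dimension k may switch from negative to
    non-negative only when k > lv, i.e., k is strictly to the right of the
    (1-indexed) ranking dimension lv of the transition taken."""
    for k, (b, a) in enumerate(zip(eta_before, eta_after), start=1):
        if k <= lv and b < 0 and a >= 0:
            return False
    return True
```

This mirrors the discussion of Fig. 2: a negative-to-non-negative change in the second dimension is acceptable on a transition ranked in the first dimension, since the second lies to the right of the first.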

  • Third, we present a synthesis algorithm for the subclass of LLexRSM we mentioned above, and conduct experiments; there, our algorithm verified almost-sure termination of various programs that could not be handled by (a better proxy of) the GLexRSM-based one. The details can be found in Sect. 7.

2 Key Observations with Examples

Here we demonstrate by examples how intricate the treatment of negative values of LexRSM is, and how we handle it by our proposed notion of fixability.

Blocking “Ill” Exploitation of Unbounded Negativity. Figure 3 is a counterexample that shows leftward non-negative LexRSM is generally unsound (conceptually the same as [15, Ex. 1]). The probabilistic program in Fig. 3 does not terminate almost-surely because the chance of entering \(\ell _4\) from \(\ell _3\) quickly decreases as t increases. Meanwhile, \(\boldsymbol{\eta }=(\eta _1,\eta _2,\eta _3)\) in Fig. 3 is a leftward non-negative LexRSM over a global invariant \([0 \le x \le 1]\); in particular, observe \(\eta _2\) decreases by 1 in expectation from \(\ell _3\), whose successor location is either \(\ell _4\) or \(\ell _1\).

Fig. 3. An example of “ill” exploitation.

This example reveals an inconsistency between how the single-component non-negativity condition and the ranking condition evaluate the value of a LexRSM, say \(\boldsymbol{\eta } = (\eta _1,\ldots ,\eta _n)\). The single-component non-negativity claims that \(\boldsymbol{\eta }\) cannot rank a transition in a given dimension k whenever \(\eta _k\) is negative; intuitively, this means that any negative value in the ranking domain \(\mathbb {R}\) should be understood as one and the same element, namely the “bottom” of the domain. Meanwhile, the ranking condition evaluates different negative values differently; a smaller negative value of \(\eta _k\) can contribute more to satisfying the ranking condition, as one can see from the behavior of \(\eta _2\) in Fig. 3 at \(\ell _3\). The function \(\boldsymbol{\eta }\) in Fig. 3 satisfies the ranking condition over a possibly non-terminating program through “ill” exploitation of this inconsistency; as t becomes larger, the value of \(\eta _2\) potentially drops more significantly through the transition from \(\ell _3\), but with a smaller probability.

The first variant of our fixability notion, called \(\varepsilon \)-fixability, enables us to ensure that such exploitation is not happening. We simply set every negative value in a LexRSM \(\boldsymbol{\eta }\) to a negative constant \(-\varepsilon \), and say \(\boldsymbol{\eta }\) is \(\varepsilon \)-fixable if it retains the ranking condition through the modification. For example, the \(\varepsilon \)-fixing operation changes the value of \(\eta _2\) in Fig. 3 at \(\ell _4\) from \(-2^{t}\) to \(-\varepsilon \), and \(\boldsymbol{\eta }\) does not satisfy the ranking condition after that. Therefore, \(\boldsymbol{\eta }\) in Fig. 3 is not \(\varepsilon \)-fixable for any \(\varepsilon >0\) (i.e., we successfully reject this \(\boldsymbol{\eta }\) through the fixability check). Meanwhile, an \(\varepsilon \)-fixable LexRSM witnesses almost-sure termination of the underlying program; indeed, the fixed LexRSM is a strongly non-negative LexRSM (by globally adding \(\varepsilon \) to the fixed \(\boldsymbol{\eta }\)), which is known to be sound [2].
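The mechanism can be made concrete with a small numeric sketch. The successor distribution below is illustrative, in the spirit of Fig. 3 but not its exact program: with small probability \(2^{-t}\) the value drops hugely to \(-2^t\), which makes the expectation decrease "for free"; after ε-fixing, this trick no longer works.

```python
def pre_expectation(dist):
    """Expected next value of a 1-dimensional map; dist = [(prob, value)]."""
    return sum(p * v for p, v in dist)

def eps_fix(dist, eps):
    """The ε-fixing operation on successor values: negatives become -eps."""
    return [(p, v if v >= 0 else -eps) for p, v in dist]

t, current = 5, 1.0
succ = [(2**-t, -(2**t)), (1 - 2**-t, current)]  # illustrative numbers
# ranking condition at this state: expected next value <= current - 1
assert pre_expectation(succ) <= current - 1               # holds, via huge negativity
assert not pre_expectation(eps_fix(succ, 1.0)) <= current - 1  # broken after ε-fixing
```

The unbounded negative successor value carries the whole expected decrease; capping it at \(-\varepsilon \) exposes that no genuine progress is being made.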

The notion of \(\varepsilon \)-fixability is operationally so simple that one might even feel it is a boring idea; nevertheless, its contribution to revealing the nature of possibly negative LexRSM is already significant in our paper. Indeed, (a) \(\varepsilon \)-fixable LexRSM instantiates single-component non-negative LexRF with an appropriate \(\varepsilon \) (Theorem 4.3); (b) \(\varepsilon \)-fixable LexRSM generalizes GLexRSM [15], and the proof offers an alternative proof of soundness of GLexRSM that is significantly simpler than the original one (Theorem 4.4); and (c) its refined variant plays the crucial role in proving soundness of our second LexRSM variant, lazy LexRSM.

Allowing “Harmless” Unbounded Negativity. While \(\varepsilon \)-fixable LexRSM already instantiates single-component non-negative LexRF, we go one step further to obtain a LexRSM notion that is amenable to automated synthesis, in particular via Linear Programming (LP). The major obstacle to this end is the case distinction introduced by \(\varepsilon \)-fixability, which makes the fixed LexRSM nonlinear. Lazy LexRSM (LLexRSM), our second proposed LexRSM, resolves this problem while it also instantiates single-component non-negative LexRF.

Linear LLexRSM is sound over probabilistic programs with linear arithmetics (Theorem 5.4). The key to the proof is, informally, the following observation: Restrict our attention to probabilistic programs and functions \(\boldsymbol{\eta }\) that are allowed in the LP-based synthesis. Then “ill” exploitation in Fig. 3 never occurs, and therefore, a weaker condition than \(\varepsilon \)-fixability (namely, the LLexRSM one) suffices for witnessing program termination. In fact, Fig. 3 involves (a) non-linear arithmetics in the program, (b) parametrized if-branch in the program (i.e., the grammar “if prob(p) then P else Q fi” with p being a variable), and (c) non-linearity of \(\boldsymbol{\eta }\). None of them are allowed in the LP-based synthesis (at least, in the standard LP-based synthesis via Farkas’ Lemma [2, 12, 15]). Our informal statement above is formalized as Theorem 5.3, which roughly says: Under such a restriction to probabilistic programs and \(\boldsymbol{\eta }\), any LLexRSM is \((\varepsilon , \gamma )\)-fixable. Here, \((\varepsilon , \gamma )\)-fixability is a refined version of \(\varepsilon \)-fixability; while it also ensures that “ill” exploitation is not happening in \(\boldsymbol{\eta }\), it is less restrictive than \(\varepsilon \)-fixability by allowing “harmless” unbounded negative values of \(\boldsymbol{\eta }\).

Fig. 4. An example of “harmless” unbounded negativity.

Figure 4 gives an example of such a harmless behavior of \(\boldsymbol{\eta }\) rejected by \(\varepsilon \)-fixability. It also shows why we cannot simply use \(\varepsilon \)-fixability to check that an LLexRSM does not perform “ill” exploitation. The function \(\boldsymbol{\eta }= (\eta _1, \eta _2)\) in Fig. 4 is leftward non-negative over the global invariant \([0 \le x \le 1 \wedge t\ge 1]\), so it is an LLexRSM for the probabilistic program there; the program and \(\boldsymbol{\eta }\) are also in the scope of LP-based synthesis; but \(\boldsymbol{\eta }\) is not \(\varepsilon \)-fixable for any \(\varepsilon >0\). Indeed, the \(\varepsilon \)-fixing operation changes the value of \(\eta _2\) at \(\ell _4\) from \(-2t-4\) to \(-\varepsilon \), and \(\boldsymbol{\eta }\) does not satisfy the ranking condition at \(\ell _2\) after the change. Notice, however, that the unbounded negative values of \(\eta _2\) are “harmless”; that is, the “ill-gotten gains” from the unbounded negative values of \(\eta _2\) at \(\ell _4\) are only “wasted” to unnecessarily increase \(\eta _2\) at \(\ell _3\). In fact, \(\boldsymbol{\eta }\) still satisfies the ranking condition if we change the value of \(\eta _2\) at \(\ell _1,\ell _2,\ell _3\) to 2, 1, and 0, respectively.

We resolve this issue by partially waiving the ranking condition of \(\boldsymbol{\eta }\) after the \(\varepsilon \)-fixing operation. It is intuitively clear that the program in Fig. 4 almost-surely terminates, and the intuition here is that the program essentially repeats an unbiased coin toss until tails is observed (here, “observing tails” corresponds to “observing \(\textbf{prob}(0.5) = \textbf{false}\) at \(\ell _2\)”). This example tells us that, to witness the almost-sure termination of this program, we only need to guarantee the program (almost-surely) visits either the terminal location \(\ell _5\) or the “coin-tossing location” \(\ell _2\) from anywhere else. The \(\varepsilon \)-fixed \(\boldsymbol{\eta }\) in Fig. 4 does witness such a property of the program, as it ranks every transition except those that are from a coin-tossing location, namely \(\ell _2\).

We generalize this idea as follows: Fix \(\gamma \in (0,1)\), and say a program state is a “coin-tossing state” for \(\boldsymbol{\eta } = (\eta _1, \ldots , \eta _n)\) in the k-th dimension if \(\eta _k\) drops from non-negative to negative (i.e., the ranking is “done” in the k-th dimension) with probability \(\gamma \) or higher. Then we say \(\boldsymbol{\eta }\) is \((\varepsilon , \gamma )\)-fixable (Definition 4.6) if the \(\varepsilon \)-fixed \(\boldsymbol{\eta }\) is a strongly non-negative LexRSM (after adding \(\varepsilon \)) except that, at each coin-tossing state, we waive the ranking condition of \(\boldsymbol{\eta }\) in the corresponding dimension. For example, \(\boldsymbol{\eta }\) in Fig. 4 is \((\varepsilon , \gamma )\)-fixable for any \(\gamma \in (0, 0.5]\). As expected, \((\varepsilon , \gamma )\)-fixable LexRSM is sound for any \(\varepsilon >0\) and \(\gamma \in (0,1)\) (Corollary 4.7).
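The "coin-tossing state" test above can be sketched directly. This is an illustrative encoding of ours (the paper's Definition 4.6 is stated measure-theoretically): given the current value of \(\eta _k\) and the distribution of its successor values, check whether the probability of dropping below zero reaches the threshold \(\gamma \).

```python
def is_coin_tossing(current_k, succ_dist_k, gamma):
    """A state is 'coin-tossing' in dimension k if eta_k is currently
    non-negative and drops to a negative value with probability >= gamma.
    succ_dist_k: list of (probability, next value of eta_k)."""
    if current_k < 0:
        return False  # no drop from NON-negative can happen here
    drop_prob = sum(p for p, v in succ_dist_k if v < 0)
    return drop_prob >= gamma
```

For an unbiased coin as in Fig. 4, a state whose dimension drops below zero with probability 0.5 is coin-tossing for any \(\gamma \le 0.5\), which is exactly where the ranking condition is waived.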

3 Preliminaries

We recall the technical preliminaries. Omitted details are in [36, Appendix A].

Notations. We assume the readers are familiar with the basic notions of measure theory, see e.g. [5, 9]. The sets of non-negative integers and reals are denoted by \(\mathbb {N}\) and \(\mathbb {R}\), respectively. The collection of all Borel sets of a topological space \(\mathcal {X}\) is denoted by \(\mathcal {B}(\mathcal {X})\). The set of all probability distributions over the measurable space \((\varOmega ,\mathcal {B}(\varOmega ))\) is denoted by \(\mathcal {D}(\varOmega )\). The value of a vector \(\boldsymbol{x}\) at the i-th index is denoted by \(\boldsymbol{x}[i]\) or \(x_i\). A subset \(D \subseteq \mathbb {R}\) of the reals is bounded if \(D \subseteq [-x,x]\) for some \(x>0\).

For a finite variable set V and the set \(val^V\) of its valuations, we form predicates as first-order formulas with atomic predicates of the form \(f \le g\), where \(f,g:val^V \rightarrow R\) and R is linearly ordered. Often, we are only interested in the value of a predicate \(\varphi \) over a certain subset \(\mathcal {X} \subseteq val^V\), in which case, we call \(\varphi \) a predicate over \(\mathcal {X}\). We identify a predicate \(\varphi \) over \(\mathcal {X}\) with a function \(\tilde{\varphi }:\mathcal {X} \rightarrow \{ 0,1 \}\) such that \(\tilde{\varphi }(x)=1\) if and only if \(\varphi (x)\) is true. The semantics of \(\varphi \), i.e., the set \(\{x\in \mathcal {X} \mid \varphi (x) \text{ is true}\}\), is denoted by \(\llbracket \varphi \rrbracket \). The characteristic function \(\textbf{1}_A: \mathcal {X} \rightarrow \{0,1\}\) of a subset A of \(\mathcal {X}\) is a function such that \(\llbracket \textbf{1}_A =1 \rrbracket = A\). For a probability space \((\varOmega ,\mathcal {F},\mathbb {P})\), we say \(\varphi \) over \(\varOmega \) is (\(\mathcal {F}\)-)measurable when \(\llbracket \varphi \rrbracket \in \mathcal {F}\). For such a \(\varphi \), the satisfaction probability of \(\varphi \) w.r.t. \(\mathbb {P}\), i.e., the value \(\mathbb {P}(\llbracket \varphi \rrbracket )\), is also denoted by \(\mathbb {P}(\varphi )\); we say \(\varphi \) holds \(\mathbb {P}\)-almost surely (\(\mathbb {P}\)-a.s.) if \(\mathbb {P}(\varphi ) = 1\).

3.1 Syntax and Semantics of Probabilistic Programs

Syntax. We define the syntax of Probabilistic Programs (PPs) similarly to e.g., [2, 35]. More concretely, PPs have the standard control structure in imperative languages such as if-branches and while-loops, while the if-branching and variable assignments can also be done in either nondeterministic or probabilistic ways. Namely, ‘\({{\textbf {if}} \, \star }\)’ describes a nondeterministic branching; ‘\({{\textbf {ndet}}(D)}\)’ describes a nondeterministic assignment chosen from a bounded domain \(D \subseteq \mathcal {B}(\mathbb {R})\); ‘\({{\textbf {if}}} \, {{\textbf {prob}}(p)}\)’ with a constant \(p \in [0,1]\) describes a probabilistic branching that executes the ‘\({{\textbf {then}}}\)’ branch with probability p, or the ‘\({{\textbf {else}}}\)’ branch with probability \(1-p\); and ‘\({{\textbf {sample}}(d)}\)’ describes a probabilistic assignment sampled from a distribution \(d\in \mathcal {D}(\mathbb {R})\). We consider PPs without conditioning, which are also called randomized programs [35]; PPs with conditioning are considered in e.g. [32]. The exact grammar is given in [36, Appendix A].

In this paper, we focus our attention on PPs with linear arithmetics; we say a PP is linear if each arithmetic expression in it is linear, i.e., of the form \(b+\sum _{i=1}^n a_i \cdot v_i\) for constants \(a_1,\ldots , a_n, b\) and program variables \(v_1,\ldots , v_n\).
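A linear arithmetic expression in the above sense is easy to represent and evaluate; the following tiny sketch (names are ours) fixes the shape \(b+\sum _{i} a_i \cdot v_i\).

```python
def eval_linear(b, coeffs, valuation):
    """Evaluate the linear expression b + sum_i a_i * v_i, where `coeffs`
    maps variable names to the constants a_i and `valuation` gives the
    current values v_i of the program variables."""
    return b + sum(a * valuation[v] for v, a in coeffs.items())
```

For example, `eval_linear(1.0, {"x": 2.0, "y": -1.0}, {"x": 3.0, "y": 4.0})` evaluates \(1 + 2x - y\) at \(x=3, y=4\), giving 3.0.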

Semantics. We adopt probabilistic control flow graph (pCFG) as the semantics of PPs, which is standard in existing RSM works (e.g., [12, 15, 35]). Informally, it is a labeled directed graph whose vertices are program locations, and whose edges represent possible one-step executions in the program. Edges are labeled with the necessary information so that one can reconstruct the PP represented by the pCFG; for example, an edge e can be labeled with the assignment commands executed through e (e.g., ‘\(x:=x+1\)’), the probability \(p\in [0,1]\) that e will be chosen (through ‘\({{\textbf {if}}} \, {{\textbf {prob}}(p)}\)’), the guard condition, and so on. Below we give its formal definition for completeness; see [36, Appendix A] for how to translate PPs into pCFGs.

Definition 3.1 (pCFG). A pCFG is a tuple \((L, V, \varDelta , Up , G)\), where

  1. L is a finite set of locations.

  2. \(V = \{x_1, \ldots , x_{|V|}\}\) is a finite set of program variables.

  3. \(\varDelta \) is a finite set of (generalized) transitions, i.e., tuples \(\tau = (\ell ,\delta )\) of a location \(\ell \in L\) and a distribution \(\delta \in \mathcal {D}(L)\) over successor locations.

  4. \( Up \) is a function that receives a transition \(\tau \in \varDelta \) and returns a tuple (i, u) of a target variable index \(i \in \{ 1,\ldots ,|V| \}\) and an update element u. Here, u is either (a) a Borel measurable function \(u: \mathbb {R}^{|V|} \rightarrow \mathbb {R}\), (b) a distribution \(d \in \mathcal {D}(\mathbb {R})\), or (c) a bounded measurable set \(R \in \mathcal {B}(\mathbb {R})\). In each case, we say \(\tau \) is deterministic, probabilistic, and non-deterministic, respectively; the collections of these transitions are denoted by \(\varDelta _d\), \(\varDelta _p\), and \(\varDelta _n\), respectively.

  5. G is a guard function that assigns a \(G(\tau ):\mathbb {R}^{|V|} \rightarrow \{ 0,1 \}\) to each \(\tau \in \varDelta \).

Below we fix a pCFG \(\mathcal {C}= (L, V, \varDelta , Up , G)\). A state of \(\mathcal {C}\) is a tuple \(s = (\ell ,\boldsymbol{x})\) of a location \(\ell \in L\) and a variable assignment vector \(\boldsymbol{x} \in \mathbb {R}^{|V|}\). We write \(\mathcal {S}\) to denote the state set \(L \times \mathbb {R}^{|V|}\). Slightly abusing the notation, for \(\tau = (\ell , \delta )\), we identify the set \(\llbracket G(\tau ) \rrbracket \subseteq \mathbb {R}^{|V|}\) with the set \(\{\ell \}\times \llbracket G(\tau ) \rrbracket \subseteq \mathcal {S}\); in particular, we write \(s\in \llbracket G(\tau ) \rrbracket \) when \(\tau \) is enabled at s, i.e., \(s=(\ell ,\boldsymbol{x})\), \(\tau =(\ell , \delta )\) and \(\boldsymbol{x}\in \llbracket G(\tau ) \rrbracket \).
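The structure of Definition 3.1 and the "enabled" notion can be sketched as a minimal Python skeleton; field names are ours, and the update element is left abstract since only locations, guards, and states matter for this discussion.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple

@dataclass
class Transition:
    source: str                           # location ell of tau = (ell, delta)
    delta: List[Tuple[float, str]]        # distribution over successor locations
    guard: Callable[[List[float]], bool]  # G(tau) over variable valuations

@dataclass
class PCFG:
    locations: List[str]
    variables: List[str]
    transitions: List[Transition]

def enabled(tau: Transition, state: Tuple[str, List[float]]) -> bool:
    """s in [[G(tau)]] in the identified sense: the state's location matches
    the transition's source and its valuation satisfies the guard."""
    ell, x = state
    return ell == tau.source and tau.guard(x)
```

For instance, a transition from `"l1"` guarded by \(x \ge 0\) is enabled at `("l1", [0.5])` but not at `("l1", [-1.0])` or at any state located elsewhere.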

A pCFG \(\mathcal {C}\) with its state set \(\mathcal {S}\) can be understood as a transition system over \(\mathcal {S}\) with probabilistic transitions and nondeterminism (or, more specifically, a Markov decision process with its states \(\mathcal {S}\)). Standard notions such as successors of a state \(s \in \mathcal {S}\), finite paths, and (infinite) runs of \(\mathcal {C}\) are defined as the ones over such a transition system. The set of all successors of \(s \in \llbracket G(\tau ) \rrbracket \) via \(\tau \) is denoted by \(\textrm{succ}_\tau (s)\). The set of runs of \(\mathcal {C}\) is denoted by \(\varPi _\mathcal {C}\).

Schedulers resolve nondeterminism in pCFGs. Observe there are two types of nondeterminism: (a) nondeterministic choice of \(\tau \in \varDelta \) at a given state (corresponds to ‘\({{\textbf {if}}} \, \star \)’), and (b) nondeterministic variable update in a nondeterministic transition \(\tau \in \varDelta _n\) (corresponds to ‘\( {x_i:={\textbf {ndet}}(D)}\)’). We say a scheduler is \(\varDelta \)-deterministic if its choice is non-probabilistic in Case (a).

We assume pCFGs are deadlock-free; we also assume that there are designated locations \(\ell _{\textrm{in}}\) and \(\ell _{\textrm{out}}\) that represent program initiation and termination, respectively. An initial state is a state of the form \((\ell _{\textrm{in}}, \boldsymbol{x})\). We assume that the transition from \(\ell _{\textrm{out}}\) is unique, denoted by \(\tau _{\textrm{out}}\); this transition does not update anything.

By fixing a scheduler \(\sigma \) and an initial state \(s_I\), the infinite-horizon behavior of \(\mathcal {C}\) is determined as a distribution \(\mathbb {P}_{s_I}^\sigma \) over \(\varPi _\mathcal {C}\); that is, for a measurable \(A\subseteq \varPi _\mathcal {C}\), the value \(\mathbb {P}_{s_I}^\sigma (A)\) is the probability that a run of \(\mathcal {C}\) from \(s_I\) is in A under \(\sigma \). We call the probability space \((\varPi _\mathcal {C}, \mathcal {B}(\varPi _\mathcal {C}), \mathbb {P}_{s_I}^\sigma )\) the dynamics of \(\mathcal {C}\) under \(\sigma \) and \(s_I\). See [9] for the formal construction; a brief explanation is in [36, Appendix A].

We define the termination time of a pCFG \(\mathcal {C}\) as the function \(T_{\textrm{term}}^\mathcal {C}:\varPi _\mathcal {C}\rightarrow \mathbb {N}\cup \{+\infty \}\) such that \(T_{\textrm{term}}^\mathcal {C}(s_0s_1\ldots ) = \inf \{ t \in \mathbb {N}\mid \exists \boldsymbol{x}. s_t = (\ell _{\textrm{out}}, \boldsymbol{x}) \}\). Now we formalize our objective, i.e., almost-sure termination of pCFG, as follows.
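The termination time can be sketched operationally on a run given as a sequence of states (an illustrative encoding of ours; for an infinite run, a finite prefix containing \(\ell _{\textrm{out}}\) suffices to determine the value).

```python
import math

def termination_time(run, l_out="l_out"):
    """T_term of a run: the first index t at which the location is l_out,
    and +infinity if l_out never occurs in the given sequence."""
    for t, (ell, _x) in enumerate(run):
        if ell == l_out:
            return t
    return math.inf
```

On the run \((\ell _{\textrm{in}}, \boldsymbol{x}_0)(\ell _1, \boldsymbol{x}_1)(\ell _{\textrm{out}}, \boldsymbol{x}_1)\ldots \) this returns 2, the index of the first visit to \(\ell _{\textrm{out}}\).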

Definition 3.2 (AST of pCFG). A run \(\omega \in \varPi _\mathcal {C}\) terminates if \(T_{\textrm{term}}^\mathcal {C}(\omega ) <\infty \). A pCFG \(\mathcal {C}\) is a.s. terminating (AST) under a scheduler \(\sigma \) and an initial state \(s_I\) if a run of \(\mathcal {C}\) terminates \(\mathbb {P}_{s_I}^\sigma \)-a.s. We say \(\mathcal {C}\) is AST if it is AST for any \(\sigma \) and \(s_I\).

3.2 Lexicographic Ranking Supermartingales

Here we recall mathematical preliminaries of the LexRSM theory. A (Lex)RSM typically comes in two different forms: one is a vector-valued function \(\boldsymbol{\eta }: \mathcal {S}\rightarrow \mathbb {R}^n\) over states \(\mathcal {S}\) of a pCFG \(\mathcal {C}\), and another is a stochastic process over the runs \(\varPi _\mathcal {C}\) of \(\mathcal {C}\). We recall relevant notions in these formulations, which are frequently used in existing RSM works [12, 15]. We also recall the formal definition of LexRSMs with three different non-negativity conditions in Fig. 1.

LexRSM as a Quantitative Predicate. Fix a pCFG \(\mathcal {C}\). An (n-dimensional) measurable map (MM) is a Borel measurable function \(\boldsymbol{\eta }:\mathcal {S}\rightarrow \mathbb {R}^n\). For a given 1-dimensional MM \(\eta \) and a transition \(\tau \), the (maximal) pre-expectation of \(\eta \) under \(\tau \) is a function that formalizes “the value of \(\eta \) after the transition \(\tau \)”. More concretely, it is a function \(\overline{\mathbb {X}}_\tau \eta :\llbracket G(\tau ) \rrbracket \rightarrow \mathbb {R}\) that returns, for a given state s, the maximal expected value of \(\eta \) at the successor state of s via \(\tau \). Here, the maximality refers to the set of all possible nondeterministic choices at s.
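At a fixed state, the maximal pre-expectation reduces to a max-over-choices of expectations; the following sketch (encoding is ours) enumerates the nondeterministic options as explicit successor distributions.

```python
def max_pre_expectation(eta, choices):
    """Maximal pre-expectation of a 1-dimensional MM eta at a fixed state:
    `choices` lists the nondeterministic options, each as a distribution
    [(probability, successor state)]; we take the expected eta-value of
    each option and return the maximum over all options."""
    return max(sum(p * eta(s) for p, s in dist) for dist in choices)
```

For example, with `eta = lambda s: float(s)` and two choices, a fair split between successors 0 and 2 (expectation 1.0) versus a sure move to 3 (expectation 3.0), the maximal pre-expectation is 3.0.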

A level map \(\textsf{Lv}:\varDelta \rightarrow \{ 0,\ldots , n \}\) designates the ranking dimension of the associated LexRSM \(\boldsymbol{\eta }: \mathcal {S}\rightarrow \mathbb {R}^n\). We require \(\textsf{Lv}(\tau ) = 0\) if and only if \(\tau = \tau _{\textrm{out}}\). We say an MM \(\boldsymbol{\eta }\) ranks a transition \(\tau \) in the dimension k (under \(\textsf{Lv}\)) when \(k = \textsf{Lv}(\tau )\). An invariant is a measurable predicate \(I: \mathcal {S}\rightarrow \{ 0,1 \}\) such that \(\llbracket I \rrbracket \) is closed under transitions and \(\{\ell _{\textrm{in}}\} \times \mathbb {R}^{|V|} \subseteq \llbracket I \rrbracket \). The set \(\llbracket I \rrbracket \) over-approximates the reachable states in \(\mathcal {C}\).

Suppose an n-dimensional MM \(\boldsymbol{\eta }\) and an associated level map \(\textsf{Lv}\) are given. We say \(\boldsymbol{\eta }\) satisfies the ranking condition (under \(\textsf{Lv}\) and I) if the following holds for each \(\tau \ne \tau _{\textrm{out}}\), \( s \in \llbracket I \wedge G(\tau ) \rrbracket \), and \(k \in \{ 1,\ldots , \textsf{Lv}(\tau ) \}\):

$$\begin{aligned} \overline{\mathbb {X}}_\tau \boldsymbol{\eta }[k](s)\le {\left\{ \begin{array}{ll} \boldsymbol{\eta }[k](s) &{} \text {if } k < \textsf{Lv}(\tau ),\\ \boldsymbol{\eta }[k](s) -1 &{} \text {if } k = \textsf{Lv}(\tau ). \end{array}\right. } \end{aligned}$$
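As a concrete reading of the condition, the sketch below checks it pointwise over finitely many sample states. It is illustrative only and not part of the paper's development: `preexp(tau, s, k)` is a hypothetical helper standing for the maximal pre-expectation \(\overline{\mathbb {X}}_\tau \boldsymbol{\eta }[k](s)\), and `guard(tau, s)` stands for \(s \in \llbracket I \wedge G(\tau ) \rrbracket \).

```python
def satisfies_ranking(eta, lv, preexp, states, transitions, guard):
    """Ranking condition: for every non-terminal transition tau and every
    state satisfying the invariant and the guard, eta must be
    non-increasing in expectation strictly left of the ranking dimension
    and must decrease by at least 1 at the ranking dimension itself."""
    for tau in transitions:
        d = lv[tau]                       # ranking dimension of tau (1-based)
        for s in states:
            if not guard(tau, s):
                continue
            for k in range(1, d + 1):
                dec = 1 if k == d else 0  # required decrease at dimension k
                if preexp(tau, s, k) > eta(s)[k - 1] - dec:
                    return False
    return True
```

For a deterministic countdown loop with \(\eta (s) = (s)\), the pre-expectation of the loop transition is \(s - 1\), and the check succeeds.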

We also define the three different non-negativity conditions in Fig. 1, i.e., STrong (ST), LeftWard (LW), and Single-Component (SC) non-negativity, as follows:

$$\begin{aligned} \text{(ST } \text{ non-neg.) } & \forall s \in \llbracket I \rrbracket . \forall k \in \{ 1,\ldots , n \} .{} & {} \boldsymbol{\eta }[k](s) \ge 0,\\ \text{(LW } \text{ non-neg.) } & \forall \tau \ne \tau _{\textrm{out}}. \forall s \in \llbracket I \wedge G(\tau ) \rrbracket . \forall k \in \{ 1,\ldots , \textsf{Lv}(\tau ) \}.{} & {} \boldsymbol{\eta }[k](s) \ge 0,\\ \text{(SC } \text{ non-neg.) } & \forall \tau \ne \tau _{\textrm{out}}. \forall s \in \llbracket I \wedge G(\tau ) \rrbracket . {} & {} \boldsymbol{\eta }[\textsf{Lv}(\tau ) ](s) \ge 0. \end{aligned}$$
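The three conditions differ only in which dimensions and states they quantify over. The following sketch (ours, not the paper's tooling) makes the difference explicit on a finite sample of invariant states; all parameter names are hypothetical.

```python
def check_nonneg(eta, lv, states, transitions, guard, mode):
    """Check ST/LW/SC non-negativity on finitely many sample states.

    eta(s)        -> tuple of n reals (the measurable map at state s)
    lv[tau]       -> ranking dimension of transition tau (1-based)
    guard(tau, s) -> True iff s satisfies the invariant and G(tau)
    states        -> sample states assumed to satisfy the invariant
    """
    n = len(eta(states[0]))
    if mode == "ST":                      # non-negative in every dimension
        return all(eta(s)[k] >= 0 for s in states for k in range(n))
    ok = True
    for tau in transitions:
        for s in states:
            if not guard(tau, s):
                continue
            v = eta(s)
            if mode == "LW":              # non-negative up to the ranking dim
                ok &= all(v[k] >= 0 for k in range(lv[tau]))
            elif mode == "SC":            # non-negative at the ranking dim only
                ok &= v[lv[tau] - 1] >= 0
    return ok
```

A map whose second component goes negative fails ST but may still satisfy LW and SC when the ranking dimension is the first one.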

All the materials above are wrapped up in the following definition.

Definition 3.3

((ST/LW/SC)-LexRSM map). Fix a pCFG \(\mathcal {C}\) with an invariant I. Let \(\boldsymbol{\eta }\) be an MM associated with a level map \(\textsf{Lv}\). The MM \(\boldsymbol{\eta }\) is called a STrongly non-negative LexRSM map (ST-LexRSM map) over \(\mathcal {C}\) supported by I if it satisfies the ranking condition and the strong non-negativity under \(\textsf{Lv}\) and I. If it satisfies the leftward or single-component non-negativity instead of the strong one, then we call it an LW-LexRSM map or an SC-LexRSM map, respectively.

LexRSM as a Stochastic Process. When it comes to automated synthesis, a (Lex)RSM is usually a function \(\boldsymbol{\eta }\) over program states, as defined in Definition 3.3. Meanwhile, when we prove properties of (Lex)RSMs themselves (e.g., soundness), it is often necessary to inspect the behavior of \(\boldsymbol{\eta }\) during program execution under a given scheduler \(\sigma \) and initial state \(s_I\). Such a behavior of \(\boldsymbol{\eta }\) is formalized as a sequence \((\textbf{X}_t)_{t=0}^\infty \) of random variables over the dynamics of the underlying pCFG, which forms a stochastic process.

A (discrete-time) stochastic process in a probability space \((\varOmega , \mathcal {F}, \mathbb {P})\) is a sequence \((\textbf{X}_t)_{t=0}^\infty \) of \(\mathcal {F}\)-measurable random variables \(\textbf{X}_t: \varOmega \rightarrow \mathbb {R}^n\) for \(t \in \mathbb {N}\). In our context, it is typically associated with another random variable \(T:\varOmega \rightarrow \mathbb {N}\cup \{+\infty \}\) that describes the termination time of \(\omega \in \varOmega \). We say T is AST (w.r.t. \(\mathbb {P}\)) if \(\mathbb {P}(T <\infty ) = 1\); observe that, if \((\varOmega , \mathcal {F}, \mathbb {P})\) is the dynamics of a pCFG \(\mathcal {C}\) under \(\sigma \) and \(s_I\), then \(\mathcal {C}\) is AST under \(\sigma \) and \(s_I\) if and only if \(T_{\textrm{term}}^\mathcal {C}\) is AST w.r.t. \(\mathbb {P}\). As standard technical requirements, we assume there is a filtration \((\mathcal {F}_t)_{t=0}^\infty \) in \((\varOmega , \mathcal {F}, \mathbb {P})\) such that \((\textbf{X}_t)_{t=0}^\infty \) is adapted to \((\mathcal {F}_t)_{t=0}^\infty \), T is a stopping time w.r.t. \((\mathcal {F}_t)_{t=0}^\infty \), and \((\textbf{X}_t)_{t=0}^\infty \) is stopped at T; see [36, Appendix A] for their definitions.

For a stopping time T w.r.t. \((\mathcal {F}_t)_{t=0}^\infty \), we define a level map \((\textsf{Lv}_t)_{t=0}^\infty \) as a sequence of \(\mathcal {F}_t\)-measurable functions \(\textsf{Lv}_t: \varOmega \rightarrow \{ 0,\ldots , n \}\) such that \(\llbracket \textsf{Lv}_t=0 \rrbracket = \llbracket T\le t \rrbracket \) for each t. We call a pair of a stochastic process and a level map an instance for T; just as we construct an MM \(\boldsymbol{\eta }\) and a level map \(\textsf{Lv}\) as an AST certificate of a pCFG \(\mathcal {C}\), we construct an instance for a stopping time T as its AST certificate. We say an instance \(((\textbf{X}_t)_{t=0}^\infty ,(\textsf{Lv}_t)_{t=0}^\infty )\) for T ranks \(\omega \in \varOmega \) in the dimension k at time t when \(T(\omega ) > t\) and \(k = \textsf{Lv}_t(\omega )\).

For \(c>0\), we say an instance \(((\textbf{X}_t)_{t=0}^\infty ,(\textsf{Lv}_t)_{t=0}^\infty )\) satisfies the c-ranking condition if, for each \(t \in \mathbb {N}\), \(\omega \in \llbracket \textsf{Lv}_t \ne 0 \rrbracket \), and \(k \in \{ 1, \ldots , \textsf{Lv}_t(\omega ) \}\), we have:

$$\begin{aligned} \mathbb {E}[\textbf{X}_{t+1}[k] \mid \mathcal {F}_t] (\omega ) \le \textbf{X}_t[k](\omega ) -c \cdot \textbf{1}_{\llbracket k=\textsf{Lv}_t \rrbracket }(\omega ) \quad (\mathbb {P}\text{-a.s. }) \end{aligned}$$
(1)

Here, the function \(\mathbb {E}[\textbf{X}_{t+1}[k] \mid \mathcal {F}_t]\) denotes the conditional expectation of \(\textbf{X}_{t+1}[k]\) given \(\mathcal {F}_t\), which takes the role of pre-expectation. We mostly let \(c=1\) and simply call it the ranking condition; the only result sensitive to c is Theorem 4.3.

We also define the three different non-negativity conditions for an instance as follows. Here, instead of strong non-negativity, we adopt a slightly more general (but essentially equivalent) variant that we call uniform well-foundedness: we simply allow the uniform lower bound to be any constant \(\bot \in \mathbb {R}\) instead of fixing it to zero. This makes the later argument simpler.

$$\begin{aligned} \text{(UN } \text{ well-fnd.) } & \exists \bot \in \mathbb {R}. \forall t \in \mathbb {N}. \forall \omega \in \varOmega . \forall k \in \{ 1, \ldots , n \}.{} & {} \textbf{X}_t[k](\omega ) \ge \bot , \\ \text{(LW } \text{ non-neg.) } & \forall t \in \mathbb {N}. \forall \omega \in \llbracket \textsf{Lv}_t \ne 0 \rrbracket . \forall k \in \{ 1, \ldots , \textsf{Lv}_t(\omega ) \}.{} & {} \textbf{X}_t[k](\omega ) \ge 0, \\ \text{(SC } \text{ non-neg.) } & \forall t \in \mathbb {N}. \forall \omega \in \llbracket \textsf{Lv}_t \ne 0 \rrbracket . {} & {} \textbf{X}_t[\textsf{Lv}_t(\omega )](\omega ) \ge 0. \end{aligned}$$

Definition 3.4

((UN/LW/SC)-LexRSM). Suppose the following are given: a probability space \((\varOmega , \mathcal {F}, \mathbb {P})\); a filtration \((\mathcal {F}_t)_{t=0}^\infty \) on \(\mathcal {F}\); and a stopping time T w.r.t. \((\mathcal {F}_t)_{t=0}^\infty \). An instance \(\mathcal {I}= ((\textbf{X}_t)_{t=0}^\infty ,(\textsf{Lv}_t)_{t=0}^\infty )\) is called a UNiformly well-founded LexRSM (UN-LexRSM) for T with the bottom \(\bot \in \mathbb {R}\) and a constant \(c \in \mathbb {R}\) if (a) \((\textbf{X}_t)_{t=0}^\infty \) is adapted to \((\mathcal {F}_t)_{t=0}^\infty \); (b) for each \(t \in \mathbb {N}\) and \(1 \le k \le n\), the expectation of \(\textbf{X}_t[k]\) exists; (c) \(\mathcal {I}\) satisfies the c-ranking condition; and (d) \(\mathcal {I}\) is uniformly well-founded with the bottom \(\bot \). We define LW-LexRSM and SC-LexRSM by changing (d) with LW and SC non-negativity, respectively.

We mostly assume \(c=1\) and do not mention the constant explicitly. UN-LexRSM is known to be sound [2]; meanwhile, LW- and SC-LexRSM are generally unsound [15, 20]. We still refer to the latter two, as they serve as building blocks of sound LexRSM notions.

From RSM Maps to RSMs. Let \(\boldsymbol{\eta }\) be an MM over a pCFG \(\mathcal {C}\) with a level map \(\textsf{Lv}\). Together with a \(\varDelta \)-deterministic scheduler \(\sigma \) and an initial state \(s_I\), it induces an instance \(((\textbf{X}_t)_{t=0}^\infty ,(\textsf{Lv}_t)_{t=0}^\infty )\) over the dynamics of \(\mathcal {C}\), by letting \(\textbf{X}_t(s_0s_1\ldots ) = \boldsymbol{\eta }(s_t)\); it describes the behavior of \(\boldsymbol{\eta }\) and \(\textsf{Lv}\) through executing \(\mathcal {C}\) from \(s_I\) under \(\sigma \). Properties of \(\boldsymbol{\eta }\), such as the ranking condition or non-negativity, are inherited by the induced instance (provided the expectation of \(\textbf{X}_t[k]\) exists for each t and k). For example, an instance induced by an ST-LexRSM map is an UN-LexRSM with \(\bot = 0\).

Non-probabilistic Settings, and Instantiation of SC-LexRF. The key question in this paper is to find a LexRSM notion that instantiates SC non-negative LexRF (or SC-LexRF for short); that is, we would like to find a LexRSM notion whose conditions are satisfied by SC-LexRF in the non-probabilistic setting, which we formalize as follows. We say a pCFG is a (non-probabilistic) CFG if (a) \(\delta \) is Dirac for each \((\ell , \delta ) \in \varDelta \), and (b) \(\varDelta _p = \emptyset \); this roughly means that a CFG is a model of a PP without ‘\({{\textbf {if}}} \, {{\textbf {prob}}(p)}\)’ and ‘\({{\textbf {sample}}(d)}\)’. We say a probability space \((\varOmega , \mathcal {F}, \mathbb {P})\) is trivial if \(\varOmega \) is a singleton, say \(\{ \omega \}\).

4 Fixable LexRSMs

In Sects. 4–6 we give our novel technical notions and results. In this section, we introduce the notion of fixability and related results. Here we focus on technical rigor and conciseness; see Sect. 2 for the underlying intuition. Proofs are given in the appendices of [36]. We begin with the formal definition of \(\varepsilon \)-fixability.

Remark 4.1

As in Footnote 2, our formal definitions of fixability in this section slightly differ from the informal explanation in Sect. 2. One difference is that the \(\varepsilon \)-fixing in Definition 4.2 changes the value of a LexRSM at dimension k whenever it is negative or k is strictly to the right of the ranking dimension. This modification is necessary to prove Theorem 4.4. Another is that we define fixability as a notion for an instance \(\mathcal {I}\), rather than for an MM \(\boldsymbol{\eta }\). While the latter can also be done in an obvious way (as informally done in Sect. 2), we do not formally do so because it is not necessary for our technical development. One can “fix” the argument in Sect. 2 into one over instances by translating “fixability of \(\boldsymbol{\eta }\)” to “fixability of an instance induced by \(\boldsymbol{\eta }\)”.

Definition 4.2

(\(\varepsilon \) -fixing of an instance). Let \(\mathcal {I}=((\textbf{X}_t)_{t=0}^\infty ,(\textsf{Lv}_t)_{t=0}^\infty )\) be an instance for a stopping time T, and let \(\varepsilon > 0\). The \(\varepsilon \)-fixing of \(\mathcal {I}\) is another instance \(\tilde{\mathcal {I}} = ((\tilde{\textbf{X}}_t)_{t=0}^\infty ,(\textsf{Lv}_t)_{t=0}^\infty )\) for T, where

$$\begin{aligned} \tilde{\textbf{X}}_t[k](\omega ) = {\left\{ \begin{array}{ll} -\varepsilon &{} \text {if } \textbf{X}_t[k](\omega ) < 0 \text { or } k > \textsf{Lv}_t(\omega ),\\ \textbf{X}_t[k](\omega ) &{} \text {otherwise}. \end{array}\right. } \end{aligned}$$

We say an SC-LexRSM \(\mathcal {I}\) is \(\varepsilon \)-fixable, or call it an \(\varepsilon \)-fixable LexRSM, if its \(\varepsilon \)-fixing \(\tilde{\mathcal {I}}\) is an UN-LexRSM with the bottom \(\bot = -\varepsilon \).

Observe that the \(\varepsilon \)-fixing of any instance is uniformly well-founded with the bottom \(\bot = -\varepsilon \), so \(\varepsilon \)-fixability only asks whether the ranking condition is preserved through \(\varepsilon \)-fixing. Also observe that the soundness of \(\varepsilon \)-fixable LexRSM immediately follows from that of UN-LexRSM [2].
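The case distinction in Definition 4.2 is mechanical, as the following sketch shows for a single sampled run. This is our illustration under simplifying assumptions: the run is represented as a finite prefix, `X` is the list of vector values \(\textbf{X}_t(\omega )\), and `Lv` is the list of values \(\textsf{Lv}_t(\omega )\) (with 0 after termination).

```python
def eps_fixing(X, Lv, eps):
    """epsilon-fixing of an instance (Def. 4.2), on one run prefix:
    replace X_t[k] by -eps whenever it is negative or dimension k lies
    strictly right of the current ranking dimension; keep it otherwise."""
    fixed = []
    for x, lv in zip(X, Lv):
        fixed.append([-eps if (v < 0 or k + 1 > lv) else v
                      for k, v in enumerate(x)])
    return fixed
```

For instance, with ranking dimension 2, a value \((2, -1, 3)\) becomes \((2, -\varepsilon , -\varepsilon )\): the second component is negative and the third is right of the ranking dimension.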

While we do not directly use \(\varepsilon \)-fixability as a technical tool, the two theorems below show its conceptual value. The first one answers our key problem: \(\varepsilon \)-fixable LexRSM instantiates SC-LexRF with sufficiently large \(\varepsilon \).

Theorem 4.3

(fixable LexRSM instantiates SC-LexRF). Suppose \(\mathcal {I}= ((\boldsymbol{x}_t)_{t=0}^\infty , (\textsf{Lv}_t)_{t=0}^\infty )\) is an SC-LexRSM for a stopping time T over the trivial probability space with a constant c, and let \(\varepsilon \ge c\). Then \(\mathcal {I}\) is \(\varepsilon \)-fixable.    \(\square \)

The second theorem offers a formal comparison between \(\varepsilon \)-fixable LexRSM and the state-of-the-art LexRSM variant in the literature, namely GLexRSM [15]. We show the former subsumes the latter. In our terminology, GLexRSM is LW-LexRSM that also satisfies the following expected leftward non-negativity:

$$\begin{aligned} \forall t \in \mathbb {N}. \forall \omega \in \llbracket \textsf{Lv}_t \ne 0 \rrbracket . \forall k \in \{ 1, \ldots , \textsf{Lv}_t(\omega ) \}. \ \mathbb {E}[ \textbf{1}_{\llbracket k > \textsf{Lv}_{t+1} \rrbracket }\cdot \textbf{X}_{t+1}[k] \mid \mathcal {F}_t](\omega ) \ge 0. \end{aligned}$$

We note that our result can also be seen as an alternative proof of the soundness of GLexRSM [15, Thm. 1]. Our proof is also significantly simpler than the original one, as the former utilizes the soundness of UN-LexRSM as a lemma, while the latter does the proof “from scratch”.

Theorem 4.4

(fixable LexRSM generalizes GLexRSM). Suppose \(\mathcal {I}\) is a GLexRSM for a stopping time T. Then \(\mathcal {I}\) is \(\varepsilon \)-fixable for any \(\varepsilon >0\).    \(\square \)

Now we move on to a refined variant, \((\varepsilon ,\gamma )\)-fixability. Before its formal definition, we give a theorem that justifies the partial waiving of the ranking condition described in Sect. 2. Below, \(\overset{\infty }{\exists }t.\varphi _t\) stands for \(\forall k\in \mathbb {N}. \exists t\in \mathbb {N}.[t > k \wedge \varphi _t]\).

Theorem 4.5

(relaxation of the UN-LexRSM condition). Suppose the following are given: a probability space \((\varOmega , \mathcal {F}, \mathbb {P})\); a filtration \((\mathcal {F}_t)_{t=0}^\infty \) on \(\mathcal {F}\); and a stopping time T w.r.t. \((\mathcal {F}_t)_{t=0}^\infty \). Let \(\mathcal {I}=((\textbf{X}_t)_{t=0}^\infty ,(\textsf{Lv}_t)_{t=0}^\infty )\) be an instance for T, and let \(\bot \in \mathbb {R}\). For each \(k \in \{ 1,\ldots , n \}\), let \((\varphi _{t,k})_{t=0}^\infty \) be a sequence of predicates over \(\varOmega \) such that

$$\begin{aligned} \overset{\infty }{\exists }t.\varphi _{t,k}(\omega ) \Rightarrow \overset{\infty }{\exists }t. [\textbf{X}_t[k](\omega ) = \bot \vee k > \textsf{Lv}_t(\omega )] \quad \text{( }\mathbb {P}\text{-a.s.) } \end{aligned}$$
(2)

Suppose \(\mathcal {I}\) is an UN-LexRSM with the bottom \(\bot \) except that, instead of the ranking condition, \(\mathcal {I}\) satisfies the inequality (1) only for \(t\in \mathbb {N}\), \(k\in \{ 1,\ldots ,n \}\), and \(\omega \in \llbracket k \le \textsf{Lv}_t \wedge \lnot (\textbf{X}_t[k]> \bot \wedge \varphi _{t,k}) \rrbracket \) (with \(c=1\)). Then T is AST w.r.t. \(\mathbb {P}\).    \(\square \)

The correspondence between the argument in Sect. 2 and Theorem 4.5 is as follows. The predicate \(\varphi _{t,k}\) is an abstraction of the situation “we are at a coin-tossing state at time t in the k-th dimension”; and the condition (2) corresponds to the infinite coin-tossing argument (for a given k, if \(\varphi _{t,k}\) is satisfied at infinitely many t, then the ranking in the k-th dimension is “done” infinitely often, with probability 1). Given these, Theorem 4.5 says that the ranking condition of UN-LexRSM can be waived over \(\llbracket \textbf{X}_t[k]> \bot \wedge \varphi _{t,k} \rrbracket \). In particular, the theorem amounts to the soundness of UN-LexRSM when \(\varphi _{t,k} \equiv false \) for each t and k.

Based on Theorem 4.5, we introduce \((\varepsilon ,\gamma )\)-fixability as follows. There, \(\mathbb {P}[\varphi \mid \mathcal {F}'] := \mathbb {E}[\textbf{1}_{\llbracket \varphi \rrbracket }\mid \mathcal {F}']\) is the conditional probability of satisfying \(\varphi \) given \(\mathcal {F}'\).

Definition 4.6

(\((\varepsilon ,\gamma )\) -fixability). Let \(\mathcal {I}=((\textbf{X}_t)_{t=0}^\infty ,(\textsf{Lv}_t)_{t=0}^\infty )\) be an instance for T, and let \(\gamma \in (0,1)\). We call \(\mathcal {I}\) a \(\gamma \)-relaxed UN-LexRSM for T if \(\mathcal {I}\) satisfies the properties in Theorem 4.5, where \(\varphi _{t,k}\) is as follows:

$$\begin{aligned} \varphi _{t,k}(\omega ) \equiv \mathbb {P}[\textbf{X}_{t+1}[k]=\bot \mid \mathcal {F}_t](\omega ) \ge \gamma . \end{aligned}$$
(3)

We say \(\mathcal {I}\) is \((\varepsilon ,\gamma )\)-fixable if its \(\varepsilon \)-fixing \(\tilde{\mathcal {I}}\) is a \(\gamma \)-relaxed UN-LexRSM.

The predicate \(\varphi _{t,k}(\omega )\) in (3) is roughly read “the ranking by \((\textbf{X}_t)_{t=0}^\infty \) is done at time \(t+1\) in dimension k with probability \(\gamma \) or higher, given the information about \(\omega \) at t”. This predicate satisfies Condition (2); hence we have the following corollary, which is the key to the soundness of lazy LexRSM in Sect. 5.

Corollary 4.7

(soundness of \((\varepsilon ,\gamma )\) -fixable instances). Suppose there exists an instance \(\mathcal {I}\) over \((\varOmega , \mathcal {F}, \mathbb {P})\) for a stopping time T that is \((\varepsilon ,\gamma )\)-fixable for some \(\varepsilon >0\) and \(\gamma \in (0,1)\). Then T is AST w.r.t. \(\mathbb {P}\).    \(\square \)

5 Lazy LexRSM and Its Soundness

Here we introduce another LexRSM variant, Lazy LexRSM (LLexRSM). We need this variant for our LexRSM synthesis algorithm; while \(\varepsilon \)-fixable LexRSM theoretically answers our key question, it is not amenable to LP-based synthesis algorithms because the case distinction in its definition makes the resulting constraints nonlinear.

We define LLexRSM map as follows; see Contributions in Sect. 1 for its intuitive meaning with an example. The definition for an instance is in [36, Appendix C].

Definition 5.1

(LLexRSM map). Fix a pCFG \(\mathcal {C}\) with an invariant I. Let \(\boldsymbol{\eta }\) be an MM associated with a level map \(\textsf{Lv}\). The MM \(\boldsymbol{\eta }\) is called a Lazy LexRSM map (LLexRSM map) over \(\mathcal {C}\) supported by I if it is an SC-LexRSM map over \(\mathcal {C}\) supported by I, and satisfies stability at negativity defined as follows:

$$\begin{aligned} & \forall \tau \ne \tau _{\textrm{out}}. \forall s \in \llbracket I \wedge G(\tau ) \rrbracket . \forall k \in \{ 1,\ldots , \textsf{Lv}(\tau )-1 \}. \\ &\quad \quad \quad \quad \boldsymbol{\eta }[k](s) < 0 \Rightarrow \forall s' \in \textrm{succ}_\tau (s). \biggl [ \boldsymbol{\eta }[k](s') < 0 \vee k >\max _{\tau ': s'\in \llbracket G(\tau ') \rrbracket } \textsf{Lv}(\tau ') \biggr ]. \end{aligned}$$
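Stability at negativity can again be checked pointwise on a finite abstraction. The sketch below is ours and purely illustrative; `succ(tau, s)` is a hypothetical helper enumerating the possible successors of s via \(\tau \), `enabled(s2)` returns the transitions whose guards s2 satisfies, and `guard(tau, s)` stands for \(s \in \llbracket I \wedge G(\tau ) \rrbracket \).

```python
def stable_at_negativity(eta, lv, succ, enabled, states, transitions, guard):
    """Stability at negativity (Def. 5.1): once eta[k] is negative strictly
    left of the ranking dimension, every successor keeps it negative,
    unless dimension k exceeds every ranking dimension enabled there."""
    for tau in transitions:
        for s in states:
            if not guard(tau, s):        # s |= I /\ G(tau)?
                continue
            for k in range(1, lv[tau]):  # k = 1, ..., Lv(tau) - 1
                if eta(s)[k - 1] >= 0:
                    continue
                for s2 in succ(tau, s):
                    max_lv = max((lv[t2] for t2 in enabled(s2)), default=0)
                    if not (eta(s2)[k - 1] < 0 or k > max_lv):
                        return False
    return True
```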

We first observe LLexRSM also answers our key question.

Theorem 5.2

(LLexRSM instantiates SC-LexRF). Suppose \(\boldsymbol{\eta }\) is an SC-LexRSM map over a non-probabilistic CFG \(\mathcal {C}\) supported by an invariant I, with a level map \(\textsf{Lv}\). Then \(\boldsymbol{\eta }\) is stable at negativity under I and \(\textsf{Lv}\); hence \(\boldsymbol{\eta }\) is an LLexRSM map over \(\mathcal {C}\) supported by I, with \(\textsf{Lv}\).    \(\square \)

Below we give the soundness result for LLexRSM maps. We first give the necessary assumptions on pCFGs and MMs, namely linearity and well-behavedness. We say a pCFG is linear if the update element of each \(\tau \in \varDelta _d\) is a linear function (this corresponds to restricting PPs to linear ones); an MM \(\boldsymbol{\eta }\) is linear if \(\lambda \boldsymbol{x}.\boldsymbol{\eta }(\ell ,\boldsymbol{x})\) is linear for each \(\ell \in L\). We say a pCFG is well-behaved if its variable samplings are done via well-behaved distributions, which roughly means that their tail probabilities vanish quickly enough toward infinity. The formal definition, which is somewhat complex, is given in [36, Def. C.4]; an important fact from the application perspective is that the class of such distributions covers all distributions with bounded supports, as well as some distributions with unbounded supports such as normal distributions [36, Prop. C.6]. Possibly negative (Lex)RSMs typically require some restriction on the variable samplings of a pCFG (e.g., the integrability in [15]) so that the pre-expectation is well-defined.

The crucial part of the soundness proof is the following theorem, where \((\varepsilon ,\gamma )\)-fixability takes the key role. Its full proof is given in [36, Appendix C].

Theorem 5.3

Let \(\boldsymbol{\eta }:\mathcal {S}\rightarrow \mathbb {R}^n\) be a linear LLexRSM map for a linear, well-behaved pCFG \(\mathcal {C}\). Then for any \(\varDelta \)-deterministic scheduler \(\sigma \) and initial state \(s_I\) of \(\mathcal {C}\), the induced instance is \((\varepsilon ,\gamma )\)-fixable for some \(\varepsilon >0\) and \(\gamma \in (0,1)\).

Proof (sketch). We can show that the \(\varepsilon \)-fixing \(\tilde{\mathcal {I}} = ((\tilde{\textbf{X}}_t)_{t=0}^\infty ,(\textsf{Lv}_t)_{t=0}^\infty )\) of an induced instance \({\mathcal {I}} = (({\textbf{X}}_t)_{t=0}^\infty ,(\textsf{Lv}_t)_{t=0}^\infty )\) almost-surely satisfies the inequality (1) of the ranking condition for each t, \(\omega \), and k such that \(\tilde{\textbf{X}}_t[k](\omega )=-\varepsilon \) and \(1 \le k \le \textsf{Lv}_t(\omega )\) [36, Prop. C.2]. Thus it suffices to show, for each \(\omega \), k, and t such that \(\tilde{\textbf{X}}_t[k](\omega )\ge 0\) and \(1 \le k \le \textsf{Lv}_t(\omega )\), either \(\tilde{\mathcal {I}}\) satisfies the inequality (1) or (1) as a requirement on \(\tilde{\mathcal {I}}\) is waived due to the \(\gamma \)-relaxation.

Now take any such \(t,\omega \), and k, and suppose the run \(\omega \) reads the program line prog at time t. Then we can show the desired property by a case distinction over prog as follows. Here, recall \(\omega \) is a sequence \(s_0s_1\ldots s_ts_{t+1}\ldots \) of program states; we defined \({\textbf{X}}_t\) by \({\textbf{X}}_t[k](\omega ) = \boldsymbol{\eta }[k](s_t)\); and \(\mathbb {E}[{\textbf{X}}_{t+1}[k] \mid \mathcal {F}_t] (\omega )\) is the expectation of \(\boldsymbol{\eta }[k](s')\), where \(s'\) is the successor state of \(s_0\ldots s_t\) under \(\sigma \) (which is not necessarily \(s_{t+1}\)). Also observe the requirement (1) on \(\tilde{\mathcal {I}}\) is waived for given \(t,\omega \), and k when the value of \(\boldsymbol{\eta }[k](s')\) is negative with the probability \(\gamma \) or higher.

  1.

    Suppose prog is a non-probabilistic program line, e.g., ‘\(x_i := f(\boldsymbol{x})\)’ or ‘while \(\varphi \) do’. Then the successor state \(s'\) of \(s_t\) is unique. If \(\boldsymbol{\eta }[k](s')\) is non-negative, then we have \(\mathbb {E}[\tilde{\textbf{X}}_{t+1}[k] \mid \mathcal {F}_t] (\omega ) = \mathbb {E}[{\textbf{X}}_{t+1}[k] \mid \mathcal {F}_t] (\omega )\), so the inequality (1) is inherited from \(\mathcal {I}\) to \(\tilde{\mathcal {I}}\); if negative, then the requirement (1) on \(\tilde{\mathcal {I}}\) is waived. The same argument applies to ‘\({{\textbf {if}} \star {\textbf {then}}}\)’ (recall \({\mathcal {I}}\) is induced from a \(\varDelta \)-deterministic scheduler).

  2.

    Suppose \({ prog} \equiv \) ‘\({\textbf {if}}\ {\textbf {prob}}(p)\ {\textbf {then}}\)’. By letting \(\gamma \) be strictly smaller than p, we see that either \(\boldsymbol{\eta }[k](s')\) is never negative, or it is negative with probability more than \(\gamma \). Thus we have the desired property for a similar reason to Case 1 (we note this argument requires p to be a constant).

  3.

    Suppose \({ prog} \equiv \) ‘\(x_i := \textbf{sample}(d)\)’. We can show the desired property by taking a sufficiently small \(\gamma \); roughly speaking, the requirement (1) on \(\tilde{\mathcal {I}}\) is waived unless the chance of \(\boldsymbol{\eta }[k](s')\) being negative is very small, in which case the room for “ill” exploitation is so small that the inequality (1) is inherited from \(\mathcal {I}\) to \(\tilde{\mathcal {I}}\). Almost the same argument applies to ‘\(x_i := \textbf{ndet}(D)\)’.

We note that, by the finiteness of the program locations L and transitions \(\varDelta \), we can take a \(\gamma \in (0,1)\) that satisfies all the requirements above simultaneously.    \(\square \)

Now we have soundness of LLexRSM as the following theorem, which is almost an immediate consequence of Theorem 5.3 and Corollary 4.7.

Theorem 5.4

(soundness of linear LLexRSM map over linear, well-behaved pCFG). Let \(\mathcal {C}\) be a linear, well-behaved pCFG, and suppose there is a linear LLexRSM map over \(\mathcal {C}\) (supported by any invariant). Then \(\mathcal {C}\) is AST.    \(\square \)

6 Automated Synthesis Algorithm of LexRSM

In this section, we introduce a synthesis algorithm of LLexRSM for automated AST verification of linear PPs. It synthesizes a linear MM in a certain subclass of LLexRSMs. We first define the subclass, and then introduce our algorithm.

Our algorithm is a variant of linear template-based synthesis. There, we fix a linear MM \(\boldsymbol{\eta }\) with unknown coefficients (i.e., the linear template) and consider the assertion “\(\boldsymbol{\eta }\) is a certificate of AST”; for example, in standard 1-dimensional RSM synthesis, the assertion is “\(\eta \) is an RSM map”. We then reduce this assertion to a set of linear constraints via Farkas’ Lemma [34]. These constraints constitute an LP problem with an appropriate objective function; if it is feasible, a certificate is synthesized by solving it. The reduction is standard, so we omit the details; see e.g. [35].
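The certificate direction of (the affine form of) Farkas' Lemma underlying this reduction is easy to check directly: a multiplier vector \(\lambda \ge 0\) with \(\lambda ^\top A = c^\top \) and \(\lambda ^\top b \le d\) witnesses the entailment \(Ax \le b \Rightarrow c^\top x \le d\). The sketch below is our illustration, not the paper's implementation.

```python
def farkas_certifies(A, b, lam, c, d):
    """Check that lam >= 0, lam^T A = c^T, and lam^T b <= d hold, which
    certifies the entailment  Ax <= b  =>  c^T x <= d  (Farkas)."""
    m, n = len(A), len(A[0])
    if any(l < 0 for l in lam):
        return False
    combo = [sum(lam[i] * A[i][j] for i in range(m)) for j in range(n)]
    return combo == c and sum(lam[i] * b[i] for i in range(m)) <= d

# Example: under the guard x >= 1 (encoded as -x <= -1), the template
# value eta(x) = x is non-negative (encoded as -x <= 0); the multiplier
# lambda = [1] is a Farkas certificate for this entailment.
```

In the actual synthesis, the template coefficients and the multipliers are all unknowns, and the bilinear products are linearized in the standard way before handing the system to an LP solver.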

Subclass of LLexRSM for Automated Synthesis. While LLexRSM resolves the major issue that fixable LexRSM faces in automated synthesis, we still need to tweak the notion a bit more, as the stability at negativity condition involves the value of an MM \(\boldsymbol{\eta }\) in its antecedent part (i.e., it says “whenever \(\boldsymbol{\eta }[k]\) is negative for some k...”); this makes the constraints obtained via Farkas’ Lemma nonlinear. Therefore, we strengthen the condition as follows.

Definition 6.1

(MCLC). Let \(\boldsymbol{\eta }:\mathcal {S}\rightarrow \mathbb {R}^n\) be an MM supported by an invariant I, with a level map \(\textsf{Lv}\). We say \(\boldsymbol{\eta }\) satisfies the multiple-choice leftward condition (MCLC) if, for each \(k\in \{ 1,\ldots ,n \}\), it satisfies either (4) or (5) below:

$$\begin{aligned} & \forall \tau \in \llbracket k <\textsf{Lv} \rrbracket . \forall s \in \llbracket I \wedge G(\tau ) \rrbracket . {} & {} \quad \boldsymbol{\eta }[k](s) \ge 0 , \end{aligned}$$
(4)
$$\begin{aligned} & \forall \tau \in \llbracket k <\textsf{Lv} \rrbracket . \forall s \in \llbracket I \wedge G(\tau ) \rrbracket . \forall s'\in \textrm{succ}_\tau (s). {} & {} \quad \boldsymbol{\eta }[k](s') \le \boldsymbol{\eta }[k](s). \end{aligned}$$
(5)

Condition (4) is nothing but the non-negativity condition in dimension k. Condition (5) strengthens the ranking condition strictly to the left of the ranking dimension (a.k.a. the unaffecting condition) so that the value of \(\boldsymbol{\eta }[k]\) is non-increasing in the worst case. MCLC implies stability at negativity; hence, by Theorem 5.4, linear SC-LexRSM maps with MCLC certify AST of linear, well-behaved pCFGs. They also instantiate SC-LexRFs, as follows.

Theorem 6.2

(SC-LexRSM maps with MCLC instantiate SC-LexRFs). Suppose \(\boldsymbol{\eta }\) is an SC-LexRSM map over a non-probabilistic CFG \(\mathcal {C}\) supported by I, with \(\textsf{Lv}\). Then \(\boldsymbol{\eta }\) satisfies MCLC under I and \(\textsf{Lv}\).    \(\square \)

The Algorithm. Our LexRSM synthesis algorithm mostly resembles the existing ones [2, 15], so we are brief here; a line-by-line explanation with pseudocode is in [36, Appendix D]. The algorithm receives a pCFG \(\mathcal {C}\) and an invariant I, and attempts to construct an SC-LexRSM map with MCLC over \(\mathcal {C}\) supported by I. The construction is iterative: at the k-th iteration, the algorithm attempts to construct a one-dimensional MM \(\eta _k\) that ranks the transitions of \(\mathcal {C}\) not ranked by the current construction \(\boldsymbol{\eta } = (\eta _1, \ldots , \eta _{k-1})\), while respecting MCLC. If the algorithm finds an \(\eta _k\) that ranks at least one new transition, it appends \(\eta _k\) to \(\boldsymbol{\eta }\) and goes to the next iteration; otherwise, it reports failure. Once \(\boldsymbol{\eta }\) ranks all transitions, the algorithm reports success, returning \(\boldsymbol{\eta }\) as an AST certificate of \(\mathcal {C}\).
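The iteration can be sketched as follows. This skeleton is ours: `try_rank(U, T)` is a hypothetical stand-in for the LP query (build the Farkas constraints, solve, and return a one-dimensional MM on success or `None` on failure), and `class_of` abstracts the candidate sets tried in each round.

```python
def synthesize_lexrsm(transitions, class_of, try_rank):
    """Iterative construction skeleton: in each round, append a
    one-dimensional MM that ranks at least one still-unranked transition,
    or report failure (None) when no candidate set succeeds."""
    eta, unranked = [], set(transitions)
    while unranked:
        for T in class_of(unranked):
            eta_k = try_rank(unranked, T)
            if eta_k is not None:
                eta.append(eta_k)        # fix eta_k as the next dimension
                unranked -= T
                break
        else:
            return None                  # no eta_k ranks a new transition
    return eta                           # ranks everything: AST certificate
```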

Our algorithm attempts to construct \(\eta _k\) in two ways, adopting either (4) or (5) as the leftward condition at dimension k. The attempt with Condition (4) is done in the same manner as in existing algorithms [2, 15]: we require \(\eta _k\) to rank as many of the unranked transitions as possible. The attempt with Condition (5) is slightly nontrivial; the algorithm demands a user-defined parameter \(\text{ Class }(U) \subseteq 2^U\) for each \(U \subseteq \varDelta \setminus \{ \tau _{\textrm{out}} \}\). The parameter \(\text{ Class }(U)\) specifies which sets of transitions the algorithm should try to rank, given the set U of currently unranked transitions; that is, for each \(\mathcal {T}\in \text{ Class }(U)\), the algorithm attempts to find an \(\eta _k\) that ranks exactly the transitions in \(\mathcal {T}\).

There are two canonical choices of \(\text{ Class }(U)\). One is \(2^U \setminus \{ \emptyset \}\), the brute-force trial; the resulting algorithm does not terminate in polynomial time, but it ranks the maximal number of transitions (by trying each \(\mathcal {T}\) in descending order w.r.t. \(|\mathcal {T}|\)). This property makes the algorithm complete. The other choice is the singletons of U, i.e., \(\{ \{ \tau \} \mid \tau \in U \}\); while the resulting algorithm terminates in polynomial time, it lacks the maximality property. It is future work to verify whether there is a complete instance of our proposed algorithm that runs in polynomial time. Still, any instance of it is complete over yet another class of LLexRSMs, namely linear LW-LexRSMs. For a formal statement and its proof, see [36, Thm. D.1].
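The two canonical choices can be enumerated as below. The function names are ours, and the ordering of candidates of equal size is arbitrary; only the descending-size order of the brute-force variant matters for maximality.

```python
from itertools import combinations

def brute_force_class(U):
    """Class(U) = 2^U minus the empty set: every nonempty subset, largest
    first, so the first feasible candidate ranks a maximal number of
    transitions."""
    U = list(U)
    return [set(T) for r in range(len(U), 0, -1)
            for T in combinations(U, r)]

def singleton_class(U):
    """Class(U) = {{tau} | tau in U}: only polynomially many candidates."""
    return [{tau} for tau in U]
```

For a set U of size m, the brute-force class has \(2^m - 1\) candidates, while the singleton class has only m.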

7 Experiments

We performed experiments to evaluate the performance of our proposed algorithm. The implementation is publicly available.

Our evaluation criteria are twofold. The first is how the relaxed non-negativity condition of our LexRSM (SC non-negativity and MCLC) improves the applicability of the algorithm compared to other existing non-negativity conditions. To this end, we consider two baseline algorithms.

  1. (a)

    The algorithm STR: This is the one proposed in [2], which synthesizes an ST-LexRSM. We use the implementation provided by the authors [3].

  2. (b)

    The algorithm LWN: This synthesizes an LW-LexRSM. LWN is realized as an instance of our algorithm with \(\textrm{Class}(U) = \emptyset \). We use LWN as a proxy for the synthesis algorithm of GLexRSM [16, Alg. 2], for which no implementation seems to exist. We note that [16, Alg. 2] synthesizes an LW-LexRSM with some additional conditions; therefore, it is no less restrictive than LWN.

Another criterion is how the choice of \(\textrm{Class}(U)\) affects the performance of our algorithm. To this end, we consider two instances of it: (a) Singleton Multiple Choice (SMC), given by \(\textrm{Class}(U) = \{ \{ \tau \} \mid \tau \in U \}\); and (b) Exhaustive Multiple Choice (EMC), given by \(\textrm{Class}(U) = 2^U \setminus \{ \emptyset \}\). SMC runs in PTIME, but we do not know if it is complete; EMC does not run in PTIME, but is complete.

We use benchmarks from [2], which consist of non-probabilistic programs collected in [4] and their probabilistic modifications. The modification is done in two different ways: (a) while loops “\({\textbf {while }} \varphi {\textbf { do }} P {\textbf { od}}\)” are replaced with probabilistic ones “while \(\varphi \) do (if prob(0.5) then P else skip fi) od”; (b) in addition to (a), variable assignments “\(x := f(\boldsymbol{x}) + a\)” are replaced with “\(x := f(\boldsymbol{x}) + Unif [a-1, a+1]\)”. We include non-probabilistic programs in our benchmark set because the “problematic program structure” that hinders automated LexRSM synthesis already exists in non-probabilistic programs (cf. our explanation of Fig. 2). We also tried two PPs from [15, Fig. 1], which we call counterexStr1 and counterexStr2.

We implemented our algorithm on top of the implementation of [2], which is available at [3]. Similar to [2], our implementation works as follows: (1) it receives a linear PP as input and translates it into a pCFG \(\mathcal {C}\); (2) it generates an invariant for \(\mathcal {C}\); (3) via our algorithm, it synthesizes an SC-LexRSM map with MCLC. Invariants are generated by ASPIC [19], and all LP problems are solved by CPLEX [25].
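Schematically, the constraints that each LP instance enforces can be illustrated on a toy non-probabilistic loop. The sketch below is a hand-worked illustration under simplified assumptions, not the tool's implementation (the tool derives such constraints symbolically, over invariant-supported states, and solves them with CPLEX); here a linear template \(f(x) = ax + b\) is merely checked on sample guard states:

```python
def satisfies_rf_conditions(a, b, samples):
    """For the toy loop "while x >= 1 do x := x - 1 od", check the
    1-dimensional conditions on the linear template f(x) = a*x + b:
      (rank)    f(x - 1) <= f(x) - 1   whenever the guard x >= 1 holds
      (non-neg) f(x) >= 0              whenever the guard x >= 1 holds
    Checked here on sample guard states only; the LP enforces them
    symbolically for all reachable states at once."""
    f = lambda x: a * x + b
    return all(f(x - 1) <= f(x) - 1 and f(x) >= 0 for x in samples)

# f(x) = x - 1 is a valid linear RF for this loop:
assert satisfies_rf_conditions(1, -1, range(1, 100))
# f(x) = x - 5 violates non-negativity near the guard boundary:
assert not satisfies_rf_conditions(1, -5, range(1, 100))
```

The synthesis problem is then to find coefficients \((a, b)\) making all such constraints hold, which is what the generated LP encodes.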

Table 1. The list of benchmarks in which a feasibility difference is observed between the baseline and proposed algorithms. Ticks in “p.l.” and “p.a.” indicate that the benchmark has a probabilistic loop or a probabilistic assignment, respectively. Numbers in the result columns give the dimension of the LexRSM found by the algorithm; crosses indicate failures; “N/A” means we did not run the experiment.

Results. Over 135 benchmarks from 55 models, STR succeeds in 98 cases and LWN in 105 cases, while SMC and EMC succeed in 119 cases (we did not run STR on counterexStr1 because it involves sampling from a distribution with unbounded support, which STR does not support). Table 1 summarizes the cases where we observe differences in the feasibility of the algorithms. As theoretically anticipated, LWN succeeds in finding a LexRSM whenever STR does; the same relation is observed for SMC vs. LWN and for EMC vs. SMC. In most cases, STR, LWN, and SMC return an output within a second (see Footnote 7), while EMC suffers from an exponential blowup when it attempts to rank transitions with Condition (5) in Definition 6.1. The full results are in [36, Appendix E].

On the first evaluation criterion, the advantage of the relaxed non-negativity is evident: SMC/EMC succeed where STR fails on 21 programs (21/135, a 15.6% higher success rate) from 16 different models; SMC/EMC also succeed where LWN fails on 14 programs (14/135, a 10.4% higher success rate) from 12 models. This result shows that the program structure observed in Fig. 2 arises in a variety of real-world programs.

On the second criterion, EMC has no unique success compared with SMC. This result suggests that SMC can be the first choice as a concrete instance of our proposed algorithm; indeed, we suspect that SMC is in fact complete, and verifying its (in)completeness is left for future work. For some programs, EMC found a LexRSM of smaller dimension than SMC did.

Interestingly, LWN fails to find a LexRSM for counterexStr2, even though it is given in [15] as a PP for which a GLexRSM (and hence an LW non-negative LexRSM) exists. This happens because the implementation in [3] translates the PP into a pCFG of a different shape from the one in [15] (for the latter, a GLexRSM indeed exists); the former possesses a structure similar to that in Fig. 2 because distinct locations are assigned to the while loop and the if branch. This demonstrates the advantage of our algorithm from another point of view, namely robustness against different translations of PPs.

8 Related Work

There is a rich body of studies on 1-dimensional RSMs [12,13,14, 17, 20,21,22,23, 28,29,30], while the lexicographic RSM is relatively new [2, 15]. Our paper generalizes the latest work [15] on LexRSM as follows: (a) soundness of LexRSM as a stochastic process: soundness of \(\varepsilon \)-fixable LexRSMs (Definition 4.2) generalizes [15, Thm. 1] in the sense that every GLexRSM is \(\varepsilon \)-fixable for any \(\varepsilon >0\) (Theorem 4.4); (b) soundness of LexRSM as a function on program states: our result (Theorem 5.4) generalizes [15, Thm. 2] under the linearity and well-behavedness assumptions; (c) soundness and completeness of LexRSM synthesis algorithms: our result generalizes the result for the one of the two algorithms in [15] that assumes boundedness of assignment distributions [15, Thm. 3].

The work [24] also considers a relaxed non-negativity of RSMs. Their descent supermartingale, which acts on while loops, requires well-foundedness only at every entry into the loop body. A major difference from our LexRSM is that they only consider 1-dimensional RSMs; therefore, the problem of relaxing the LW non-negativity does not arise in their setting. Compared with their RSM, our LexRSM has an advantage in verifying PPs with the structure shown in Fig. 2, where the value of our LexRSM can be arbitrarily small upon loop entrance (at some dimension; see \(\eta _2\) at \(\ell _1\) in Fig. 2).

The work [29] extends the applicability of standard RSMs in a different direction from LexRSM: the main feature of their RSM is that it can verify AST of the symmetric random walk. While our LexRSM cannot verify AST of this process, the RSM of [29] is 1-dimensional and thus typically struggles on PPs with nested structures. This difference can be observed in the experimental results of [31] (compare [31, Table 2] with nested_loops and sequential_loops in [31, Table 1]).

9 Conclusion

We proposed the first variants of LexRSM that instantiate SC-LexRF. We presented an algorithm that synthesizes such a LexRSM, and experiments have shown that the relaxed non-negativity contributes to the applicability of the resulting LexRSM. Two problems remain open: whether the class of well-behaved distributions coincides with that of integrable ones, and whether the SMC variant of our algorithm (see Sect. 7) is complete.