1 Introduction

The (symmetric) traveling salesman problem asks: given an graph \(G = (V,E)\) with edge-lengths \(c_e \ge 0\), find the shortest tour that visits all vertices at least once. The Christofides-Serdyukov algorithm [4, 13] gives a \(\nicefrac {3}{2}\)-approximation to this APX-hard problem; this was recently improved to a \((\nicefrac {3}{2}-\varepsilon )\)-approximation by the breakthrough work of Karlin, Klein, and Oveis Gharan, where \(\varepsilon > 0\) [10]. A related question is: what is the integrality gap of the subtour-elimination polytope relaxation for the TSP? Wolsey had adapted the Christofides-Serdyukov analysis to show an upper bound of \(\nicefrac {3}{2}\) [16] (also [14]), and there exists a lower bound of \(\nicefrac {4}{3}\). Building on their above-mentioned work, Karlin, Klein, and Oveis Gharan gave an integrality gap of \(1.5 - \varepsilon '\) for another small constant \(\varepsilon ' > 0\) [11], thereby making the first progress towards the conjectured optimal value of \(\nicefrac {4}{3}\) in nearly half a century.

Both these recent results are based on a randomized version of the Christofides-Serdyukov algorithm proposed by Oveis Gharan, Saberi, and Singh [12]. This algorithm first samples a spanning tree (plus perhaps one edge) from the max-entropy distribution with marginals matching the LP solution, and adds an O-join on the odd-degree vertices O in it, thereby getting an Eulerian spanning subgraph. Since the first step has expected cost equal to that of the LP solution, these works then bound the cost of this O-join by strictly less than half the optimal value, or the LP value. The proof uses a cactus-like decomposition of the near min-cuts of the graph with respect to the values \(x_e\), like in [12].

Given the \(\nicefrac {3}{2}\) barrier has been broken, we can ask: what other techniques can be effective here? How can we make further progress? These questions are interesting even for cases where the LP has additional structure. The half-integral cases (i.e., points for which \(x_e \in \{0, \nicefrac {1}{2}, 1\}\) for all e) are particularly interesting due to the Schalekamp, Williamson, and van Zuylen conjecture, which says that the integrality gap is achieved on instances where the LP has optimal half-integral solutions [15]. The team of Karlin, Klein, and Oveis Gharan first used their max-entropy approach to get an integrality gap of 1.49993 for half-integral LP solutions [9], before they moved on to the general case in [10] and obtained an integrality gap of \(1.5-\varepsilon \); the latter improvement is considerably smaller than in the half-integral case. It is natural to ask: can we do better for half-integral instances?

In this paper, we answer this question affirmatively. We show how to get tours of expected cost at most 1.4983 times the linear program value using an algorithm based on matroid intersection. Moreover, some of these ideas can be used to strengthen the max-entropy sampling approach in the half-integral case. The matroid intersection approach and the strengthened max-entropy approach each yield improvements over the bound in [9]. Combining the techniques gives our final quantitative improvement:

Theorem 1.1

Let x be a half-integral solution to the subtour elimination polytope with cost c(x). There is a randomized algorithm that rounds x to an integral solution whose cost is at most \({(1.5-\varepsilon )\cdot c(x)}\), where \(\varepsilon = 0.001695\).

We view our work as showing a proof-of-concept of the efficacy of combinatorial techniques (matroid intersection, and flow-based charging arguments) in getting an improvement for the half-integral case. We hope that these techniques, ideally combined with max-entropy sampling techniques, can give further progress on this central problem. Our randomized algorithm does run in polynomial time, if we allow exponentially small error in the marginal x.

Our Techniques The algorithm is again in the Christofides-Serdyukov framework. It is easiest to explain for the case where the graph (a) has an even number of vertices, and (b) has no (non-trivial) proper min-cuts with respect to the LP solution values \(x_e\)—specifically, the only sets for which \(x(\partial S) = 2\) correspond to the singleton cuts. Here, our goal is that each edge is “even” with some probability: i.e., both of its endpoints have even degree with probability \(p > 0\). In this case we use an idea due to Haddadan and Newman [7]: we shift and get a \(\{\nicefrac {1}{3},1\}\)-valued solution y to the subtour elimination polytope \(K_{TSP}\). Specifically, we find a random perfect matching M in the support of x, and set \(y_e = 1\) for \(e\in M\), and \(\nicefrac {1}{3}\) otherwise, thereby ensuring \({\mathbb {E}}[y] = x\). To pick a random tree from this shifted distribution y, we do one of the following:

  1. 1.

    We pick a random “independent” set \(M'\) of matching edges (so that no edge in E is incident to two edges of \(M'\)). For each \(e' \in M'\), we place partition matroid constraints enforcing that exactly one edge is picked at each endpoint—which, along with \(e'\) itself, gives degree 2 and thereby makes the edge even as desired. Finding spanning trees subject to another matroid constraint can be implemented using matroid intersection. (We refer the interested reader to [3] for a similar application of the matroid intersection technique.)

  2. 2.

    Or, instead we sample a random spanning tree from the max-entropy distribution, with marginals being the shifted value y. (In contrast, [9] sample trees from x itself; our shifting allows us to get stronger notions of evenness than they do: e.g., we can show that every edge is “even-at-last” with constant probability, as opposed to having at least one even-at-last edge in each tight cut with some probability.)

(Our algorithm randomizes between the two samplers to achieve the best guarantees.) For the O-join step, it suffices to give fractional values \(z_e\) to edges so that for every odd cut in T, the z-mass leaving the cut is at least 1. In the special case we consider, each edge only participates in two min-cuts—those corresponding to its two endpoints. So set \(z_e = \nicefrac {x_e}{3}\) if e is even, and \(\nicefrac {x_e}{2}\) if not; the only cuts with \(z(\partial S) < 1\) are minimum cuts, and these cuts will not show up as O-join constraints, due to evenness. For this setting, if an edge is even with probability p, we get a \((\nicefrac {3}{2} - \nicefrac {p}{6})\)-approximation!

It remains to get rid of the two simplifying assumptions. To sample trees when |V| is odd (an open question from [7]), we add a new vertex to fix the parity, and perform local surgery on the solution to get a new TSP solution and reduce to the even case. The challenge here is to show that the losses incurred are small, and hence each edge is still even with constant probability.

Finally, what if there are proper tight sets S, i.e., where \(x(\partial S) = 2\)? We use the cactus decomposition of a graph (also used in [9, 12]) to sample spanning trees from pieces of G with no proper min-cuts, and union these trees together. These pieces are formed by contracting sets of vertices in G, and have a hierachical structure. Moreover, each such piece is either of the form above (a graph with no proper min-cuts) for which we have already seen samplers, or else it is a double-edged cycle (which is easily sampled from). Since each edge may now lie in many min-cuts, we no longer just want an edge to have both endpoints be even. Instead, we use an idea from [9] that uses the hierarchical structure on the pieces considered above. Every edge of the graph is “settled” at exactly one of these pieces, and we ask for both of its endpoints to have even degree in the piece at which it is settled. The \(z_e\) value of such an edge may be lowered from an initial value of \(\nicefrac {x_e}{2}\) in the O-join without affecting constraints corresponding to cuts in the piece at which it is settled.

Since cuts at other levels of the hierarchy may now be deficient because of the lower values of \(z_e\), we may need to increase the \(z_f\) values for other “lower” edges f to satisfy these deficient cuts. This last part requires a charging argument, showing that each edge e has \(z_e\) that is strictly smaller than \(\nicefrac {x_e}{2}\) in expectation. For our samplers, the naïve approach of distributing charge uniformly as in [9] does not work, so we instead formulate this charging as a flow problem.

2 Notation and preliminaries

Given a multigraph \(G = (V,E)\), and a set \(S \subseteq V\), let \(\partial S\) denote the cut consisting of the edges connecting S to \(V\setminus S\); S and \({\bar{S}} {:}{=}V \setminus S\) are called shores of the cut. (For a singleton set \(\{v\}\), we write \(\partial v\) instead of \(\partial \{v\}\).) A subset \(S \subseteq V\) is proper if \(1< |S| < |V|-1\); a cut \(\partial S\) is called proper if the set S is a proper subset. A set S is tight if \(|\partial S|\) equals the size of the minimum edge-cut in G. Two sets S and \(S'\) are crossing if \(S \cap S'\), \(S {\setminus } S'\), \(S' {\setminus } S\), and \(V {\setminus } (S \cup S')\) are all non-empty.

Define the subtour elimination polytope \(K_{TSP}(G) \subseteq {\mathbb {R}}^{|E|}\):

$$\begin{aligned} x(\partial v)&= 2 \qquad \qquad \forall v \in V \\ x(\partial S)&\ge 2 \qquad \qquad \forall \;\text {proper}\,\, S \\ x \ge 0. \end{aligned}$$
(LP-TSP)

Let x be half-integral and feasible for (LP-TSP). W.l.o.g. we can focus on solutions with \(x_e = \nicefrac 12\) for each \(e \in E\), doubling edges if necessary. The support graph G is then a 4-regular 4-edge-connected (henceforth 4EC) multigraph.

The spanning tree polytope \(K_{spT}(G) \subseteq {\mathbb {R}}^{|E(G)|}\) for a multigraph G is:

$$\begin{aligned} x(E(S))&\le |S| - 1 \qquad \qquad \forall S \subseteq V(G) \\ x(E(V(G)))&= |V(G)|-1 \\ x&\ge 0. \end{aligned}$$
(LP-spT)

where E(S) is the set of edges with both endpoints in S. This is the graphic matroid polytope, and the convex hull of the spanning trees.

Definition 2.1

(r-Trees) Given a multigraph G and “root” vertex \(r \in V(G)\), an r-tree T is a connected subgraph with n edges: the vertex r has degree exactly 2, and the subgraph restricted to the other vertices \(V(G) \setminus \{r\}\) is a spanning tree on them. This is a matroid, though we do not use this fact.

The (integral) perfect matching polytope \(K_{PM}\subseteq {\mathbb {R}}^{|E(G)|}\) is defined as follows:

$$\begin{aligned} x(\partial v)&= 1 \qquad \qquad \forall v \in V \\ x(\partial S)&\ge 1 \qquad \qquad \forall S \text {with} |S| \text {odd} \\ x \ge 0. \end{aligned}$$
(LP-PM)

The (integral) O-join dominator polytope \(K_{join}(G,O)\) is defined as follows. Let \(O \subseteq V\), |O| even.

$$\begin{aligned} z(\partial (S))&\ge 1 \qquad \qquad \forall S \subseteq V, |S \cap O| \text { odd} \\ z&\ge 0 \end{aligned}$$

Fact 2.2

For any solution \(x \in K_{TSP}(G)\), it holds that \(x|_{E(V(G)\setminus \{r\})} \in K_{spT}(G[V(G) \setminus \{r\}])\), \(\nicefrac {x}{2} \in K_{PM}(G)\) (when |V(G)| is even), and \(\nicefrac {x}{2} \in K_{join}(G,O)\) for \(O \subseteq V(G)\), |O| even.

Lemma 2.3

Consider a sub-partition \({\mathcal {P}}= \{P_1, P_2, \ldots , P_t\}\) of the edge set of G. Let x be a fractional solution to (LP-spT) that satisfies \(x(P_i) \le 1\) for all \(i \in [t]\). Then we can efficiently sample from a probability distribution \({\mathcal {D}}\) over spanning trees which contain at most one edge from each of the parts \(P_i\), such that \( \Pr _{T \leftarrow {\mathcal {D}}}[e \in T] = x_e\).

Proof

This follows immediately from the integrality of the matroid intersection polytope and Carathéodory’s Theorem. (See, for e.g., [5], for how to do this sampling in poly-time.) \(\square \)

2.1 The max-entropy distribution over spanning trees

Definition 2.4

(Strong Rayleigh Distributions, [2]) Let \(\mu \) be a probability distribution on spanning trees. The generating polynomial for \(\mu \) is \(p(z) = \sum _{T \in {\mathcal {T}}} {\mathbb {P}}(e \in T) \prod _{e \in T}z_e\). We say \(\mu \) is strongly Rayleigh (SR) if p is a real stable polynomial (i.e., \(p(z) \ne 0\) if \(Im(z_e) > 0\) for all \(e \in T\), i.e., if z lies in the upper half plane).

A distribution \(\mu \) over spanning trees is called \(\lambda \textit{-uniform}\) or \(\textit{weighted uniform}\) if there exist non-negative weights \(\lambda : E \rightarrow {\mathbb {R}}\) such that \({\mathbb {P}}(T) \propto \prod _{e \in T} \lambda (e)\). Borcea et al. [2] showed that \(\lambda \)-uniform spanning tree distributions are SR.

Theorem 2.5

([1], Theorem 5.1) There exists a \(\lambda \)-uniform distribution \(\mu \) over spanning trees such that given z in the relative interior of the spanning tree polytope of \(G = (V, E)\),

$$\begin{aligned} \sum _{T \ni e} {\mathbb {P}}_\mu (T) = z_e. \end{aligned}$$

The max-entropy distribution, which is the distribution \(\mu \) on spanning trees maximizing the entropy of \(\mu \) subject to preserving the marginals \(z_e\) (that is, \({\mathbb {P}}_{\mu }(e \in T)=z_e\)), is one such distribution.

Asadpour et al [1] showed that we can find weights \(\tilde{\lambda }: E \rightarrow {\mathbb {R}}^+\) which approximately respect the marginals given by a vector z in the spanning tree polytope in polynomial time.

Theorem 2.6

([1]) Given z in the relative interior of the spanning tree polytope of \(G = (V, E)\) and some \(\varepsilon > 0\), values \(\tilde{\lambda }_e\) for \(e \in E\) can be found such that, for all \(e \in E\), the \(\tilde{\lambda }\)-uniform distribution \(\mu \) satisfies

$$\begin{aligned} \sum _{T \ni e} {\mathbb {P}}_{\mu }(T) \le (1 + \varepsilon ) z_e. \end{aligned}$$

The running time is polynomial in and .

Fact 2.7

([2]) Let \(\mu \) be an SR distribution. Let \(F \subset E\) be a subset of edges. Then the projection of \(\mu \) onto F, i.e., \(\mu _{|F}(A) = \sum _{S: S \cap F = A} \mu (S)\), is also SR. Moreover, for any edge \(e \in E\), conditioning on \(e \in T\) or on \(e \not \in T\) preserves the SR property.

Theorem 2.8

(Negative Correlation, [12]) Let \(\mu \) be an SR distribution on spanning trees.

  1. 1.

    Let S be a set of edges and \(X_S = |S \cap T|\), where \(T \sim \mu \). Then, \(X_S \sim \sum _{i = 1}^{|S|} Y_i\), where the \(Y_i\) are independent Bernoulli random variables with success probabilities \(p_i\) and \(\sum _i p_i = {\mathbb {E}}[X_S]\).

  2. 2.

    For any set of edges S and \(e \not \in S\),

    1. (i)

      \({\mathbb {E}}_{\mu }[X_S] \le {\mathbb {E}}_{\mu }[X_S \mid X_e = 0] \le {\mathbb {E}}_{\mu }[X_S] + {\mathbb {P}}_{\mu }(e \in T)\), and

    2. (ii)

      \({\mathbb {E}}_{\mu }[X_S]-1+{\mathbb {P}}_{\mu }(e \in T) \le {\mathbb {E}}_{\mu }[X_S \mid X_e =1] \le {\mathbb {E}}_{\mu }[X_S]\).

Theorem 2.9

([8], Corollary 2.1) Let \(g: \{ 1, \hdots , m\} \rightarrow {\mathbb {R}}\) and let \(0 \le p \le m\). Let \(B_1, \hdots , B_m\) be Bernoulli random variables with probabilities \(p_1^*, \hdots , p_m^*\) that maximize (or minimize) \({{\mathbb {E}}[g(B_1+ \cdots + B_m)]}\) over all possible success probabilities \(p_i\) for \(B_i\) for which \(p_1 + \cdots + p_m = p\). Then \(\{p_1^*, \hdots , p_m^*\} \in \{0, x, 1\}\) for some \(x \in (0, 1)\).

3 Samplers

In this section, we describe the MaxEnt and MatInt samplers for graphs with an even number of vertices that contain no proper min-cuts. We give bounds on certain correlations between edges that will be used in Sect. 5 to prove that every edge is “even” with constant probability. The samplers for the case where the graph has an odd number of vertices are more technical and are deferred to Appendix B.

Suppose the graph \(H = (V,E)\) is 4-regular and 4-edge-connected (4EC), contains at least four vertices, and has no proper min-cuts.Footnote 1 This means all proper cuts have six or more edges. We are given a dedicated external vertex \(r \in V(H)\); the vertices \(I:= V {\setminus } \{r\}\) that are not external are called internal. (In future sections, this vertex r will be given by a cut hierarchy.) Call the edges in \(\partial r\) external edges; all other edges are internal. An internal vertex is called a boundary vertex if it is adjacent to r. An edge is said to be special if both of its endpoints are non-boundary vertices.

We show two ways to sample a spanning tree on H[I], the graph induced on the internal vertices, being faithful to the marginals \(x_e\), i.e., \({\mathbb {P}}_T(e \in T)= x_e\) for all \(e \in E(H) {\setminus } \partial r\). Moreover, we want that for each internal edge, both its endpoints have even degree in T with constant probability. This property will allow us to lower the cost of the O-join in Sect. 6. While both samplers will satisfy this property, each will do better in certain cases. The MatInt sampler targets special edges; it allows us to “hand-pick” edges of this form and enforce that both endpoints of such edges have degree 2 in the tree. We enforce this property via partition matroid constraints, so the conclusion that the endpoints have degree 2 will be immediate. The MaxEnt sampler, on the other hand, relies on maximizing the randomness of the spanning tree sampled (subject to being faithful to the marginals); negative correlation properties allow us to obtain the evenness property, and in particular, better probabilities than MatInt for non-special edges, and a worse probability for the special edges.

Our samplers will depend on the parity of |V|: when |V| is even, the MatInt sampler is the one given by [7, Theorem 13], which we describe in Sect. 3.1. They left the case of odd |V| as an open problem, and in Appendix B we extend their procedure to the odd case.

3.1 Samplers for Even |V(H)|

Since H is 4-regular and 4EC and |V(H)| is even, setting a value of \(\nicefrac {1}{4} = \nicefrac {x_e}{2}\) on each edge gives a solution to the perfect matching polytope \(K_{PM}(H)\) by Fact 2.2.

  1. 1.

    Sample a perfect matching M such that \({\mathbb {P}}(e \in M) = \nicefrac {1}{4} = x_e/2\) for all \(e \in E(H)\).

  2. 2.

    (Shift) Define a new fractional solution y (that depends on M): set \(y_e = 1\) for \(e \in M\), and \(y_e = \nicefrac {1}{3}\) otherwise. We have \(y \in K_{TSP}(H)\), and hence \(y|_I \in K_{spT}(H[I])\) by Fact 2.2: indeed, each vertex has \(y(\partial v) = 1 + 3\cdot \nicefrac {1}{3} = 2\) because M is a perfect matching. Moreover, every proper cut U in H has at least six edges, so \(y(\partial U) \ge |\partial U| \cdot \nicefrac {1}{3} \ge 2\). Furthermore,

    $$\begin{aligned} {}{\mathbb {E}}_M[ y_e ] = \nicefrac {1}{4}\cdot 1 + \nicefrac {3}{4}\cdot \nicefrac {1}{3} = \nicefrac {1}{2} = x_e. \end{aligned}$$
    (1)
  3. 3.

    Sample a spanning tree faithful to the marginals y, using one of two samplers:

    1. (a)

      MaxEnt Sampler: Sample from the max-entropy distribution on spanning trees with marginals y. (Since y may not be in the relative interior of the spanning tree polytope, contract the 1-valued edges to obtain a 6-regular, \(\nicefrac {1}{3}\)-uniform solution. This may have nontrivial min-cuts, so once again use a cactus hierarchy to decompose the graph into pieces (see Sect. 4.2); the induced solution on each piece is in the relative interior of the spanning tree polytope. For each piece, sample a \(\lambda \)-uniform spanning tree that preserves marginals, and then take the union of these trees.)

    2. (b)

      MatInt Sampler:

      1. i.

        Color the edges of M using 7 colors such that no edge of H is adjacent to two edges of M having the same color; e.g., by greedily 7-coloring the 6-regular graph H/M (Fig. 1) . Let \(M'\) be one of these color classes picked uniformly at random. Hence, \({\mathbb {P}}(e \in M') = \nicefrac {1}{28}\), and \({\mathbb {P}}(\partial v \cap M' \ne \varnothing ) = \nicefrac {1}{7}\).

      2. ii.

        For each edge \(e = uv \in M'\), let \(L_{uv}\) and \(R_{uv}\) be the sets of edges incident at u and v other than e. Note that \(|L_{uv}| = |R_{uv}| = 3\). Place partition matroid constraints \(y(L_{uv}) \le 1\) and \(y(R_{uv}) \le 1\) on each of these sets. Finally, restrict the partition constraints to the internal edges of H; this means some of these constraints are no longer tight for the solution y.

    3. (c)

      Given the sub-matching \(M' \subseteq M\), and the partition matroid \({\mathcal {M}}\) on the internal edges defined using \(M'\), use Lemma 2.3 to sample a tree on H[I] (i.e., on the internal vertices and edges of H) with marginals \(y_e\), subject to this partition matroid \({\mathcal {M}}\). (Recall this is valid as \(y|_I \in K_{spT}(H[I])\).)

Conditioned on the matching M, we have \({\mathbb {P}}(e \in T \mid M) = y_e\); now using (1), we have \({\mathbb {P}}(e \in T) = x_e\) for all \(e \in (E {\setminus } \partial r)\).

Remark 3.1

The issue when |V(H)| is odd is that x/2 is not a point in the perfect matching polytope. To fix this, we split the external vertex r into two vertices \(r_1, r_2\), but this action introduces a proper min-cut. So, we can no longer directly sample a spanning tree on the interval vertices I . To fix this, we make careful local changes to the fractional solution y to get a new solution \({\widehat{y}}|_I\) that has the correct marginals and is still in the spanning tree polytope. Finally, we show that, with some small loss in constants, we can reduce the analysis to that of when |V(H)| is even. See Appendix B.

Fig. 1
figure 1

The matching M consists of all highlighted edges (both brown and green), one possible choice of \(M'\) has edges \(\{a,c\}\) highlighted in brown, and the constraints are placed on the edges adjacent to those in \(M'\) (marked in gray) (color figure online)

3.2 Correlation properties of samplers

Let T be a tree sampled using either the MatInt or the MaxEnt sampler. The following claims will be used to prove the evenness property in Sect. 5. Each table gives lower bounds on the corresponding probabilities for each sampler. The proofs for |V(H)| odd are in Appendix B.

Lemma 3.2

If fg are internal edges incident to a vertex v, then

Probability statement

MatInt

MaxEnt

\({\mathbb {P}}( |T \cap \{f, g\}| = 2)\)

\(\nicefrac {1}{9}\)

\(\nicefrac {1}{9}\)

\({\mathbb {P}}( T \cap \{f, g\} = \{f\})\)

\(\nicefrac {1}{9}\)

\(\nicefrac {12}{72}\)

Lemma 3.3

If edges efgh incident to a vertex v are all internal, then

Probability statement

MatInt

MaxEnt

\({\mathbb {P}}( |T \cap \{e,f,g,h\} | = 2 )\)

\(\nicefrac {2}{21}\)

\(\nicefrac {8}{27}\)

\({\mathbb {P}}(|T \cap \{e, f\}| = |T \cap \{g, h\}| = 1)\)

\(\nicefrac {4}{63}\)

\(\nicefrac {16}{81}\)

Lemma 3.4

For an internal edge \(e = uv\):

(a) if both endpoints are non-boundary vertices, then

Probability statement

MatInt

MaxEnt

\({\mathbb {P}}( |\partial _T(u)| = |\partial _T(v)| = 2)\)

\(\nicefrac {1}{36}\)

\(\nicefrac {128}{6561}\)

(b) if both uv are boundary vertices, then

Probability statement

MatInt

MaxEnt

\({\mathbb {P}}(\)exactly one of uv has odd degree in T)

\(\nicefrac {1}{9}\)

\(\nicefrac {5}{18}\)

3.2.1 Correlation properties: |V(H)| even

We now prove the correlation properties for the even case: the numerical bounds for the even case are better than those claimed above (which will be dictated by the proofs of the odd case; see Appendix B).

Proof of Lemma 3.2, Even Case

To prove \({\mathbb {P}}(|T\cap \{f,g\}|=2) \ge \nicefrac {1}{9}\), we need only knowledge of the marginals and not the specific sampler. If one of fg lies in M (which happens w.p. \(\nicefrac {1}{2}\)), then its y-value equals 1 and it belongs to T w.p. 1, and the other edge is chosen w.p. \(\nicefrac 13\), making the unconditional probability \(\nicefrac 12 \cdot \nicefrac 13 = \nicefrac 16 \ge \nicefrac 19\). Similarly, conditioned on f lying in M and hence belonging to T, edge g is not chosen w.p. \(1 - y_e = \nicefrac 23\), so \({\mathbb {P}}(T\cap \{f,g\} = \{f\}) \ge \nicefrac 14 \cdot \nicefrac 23 = \nicefrac 16 \ge \nicefrac 19\).

The MaxEnt claim: It remains to show that \({\mathbb {P}}(T\cap \{f,g\}=\{f\})\ge \nicefrac {12}{72}\). Conditioned on \(f \in M\), we have \(g \not \in T\) w.p. \(\nicefrac {2}{3}\). Now condition on neither f nor g in M (happens w.p. \(\nicefrac {1}{2}\)). By Theorem 2.8,

$$\begin{aligned} \nicefrac {1}{3} \le {\mathbb {E}}[f \in T \mid g \not \in T] \end{aligned}$$

By Theorem 2.9, \({\mathbb {P}}(f \in T \mid g \not \in T) \ge \nicefrac {1}{3}\), so \({\mathbb {P}}(f \in T \wedge g \not \in T) \ge \nicefrac {1}{3} \cdot \nicefrac {2}{3} = \nicefrac {2}{9}\). Putting all of this together, we get

$$\begin{aligned} {\mathbb {P}}(f \in T \wedge g \not \in T) \ge \nicefrac {1}{4} \cdot \nicefrac {2}{3} + \nicefrac {1}{2} \cdot \nicefrac {2}{9} = \nicefrac {5}{18} \ge \nicefrac {12}{72}. \end{aligned}$$

\(\square \)

Proof of Lemma 3.3, Even Case

The MatInt claims: Each perfect matching M contains one of these four edges in \(\partial v\). Say that edge is e. If e also belongs to \(M'\) (w.p. \(\nicefrac 17\)), then T contains exactly one of \(\{f,g,h\}\). Since \(\nicefrac 17 \ge \nicefrac 2{21}\), this gives us the first bound in Lemma 3.3. Moreover, the probability of this edge in T belonging to the other pair (in this case, \(\{g,h\}\)) is \(\nicefrac 23\). Hence \({\mathbb {P}}( |T \cap \{e,f\}| = 1 \wedge |T \cap \{g,h\}| = 1) \ge \nicefrac 2{21} > \nicefrac 4{63}\), giving us the other bound in Lemma 3.3 for the MatInt sampler.

The MaxEnt claims: For the first bound in Lemma 3.3, we have that one of the four edges must be in M and have y-value 1. W.l.o.g., call that edge e. The other three, fgh, will be \(\nicefrac {1}{3}\)-valued edges. Since \({\mathbb {E}}[X_{f, g, h}] = 1\), we may apply Theorem 2.9 and obtain a lower bound of

$$\begin{aligned} {\mathbb {P}}(X_{f, g, h} = 1) \ge 3 \cdot \nicefrac {1}{3} \cdot (\nicefrac {2}{3})^2 = \nicefrac {4}{9} \ge \nicefrac {8}{27}. \end{aligned}$$

For the second bound in Lemma 3.3, call \(S_1 = \{e, f\}\) and \(S_2 = \{g, h\}\). W.l.o.g., we once again label \(e \in M\). Then, \({\mathbb {P}}(f \not \in T) = \nicefrac {2}{3}\). So for \(S_2\), we condition on \(f \not \in T\). Then by Theorem 2.8,

$$\begin{aligned} \nicefrac {2}{3} \le {\mathbb {E}}[X_{S_2} \mid f \not \in T] \le \nicefrac {2}{3} + \nicefrac {1}{3} = 1. \end{aligned}$$

Hence, we use Theorem 2.9 again and obtain \({\mathbb {P}}(X_{S_2} = 1 \mid f \not \in T) \ge 2 \cdot \nicefrac {1}{3} \cdot \nicefrac {2}{3} = \nicefrac {4}{9}\). In total, we obtain

$$\begin{aligned} {\mathbb {P}}(|T \cap \{e, f\}| = |T \cap \{g, h\}| = 1) \ge \nicefrac {2}{3} \cdot \nicefrac {4}{9} = \nicefrac {8}{27} \ge \nicefrac {16}{81}. \end{aligned}$$

\(\square \)

Proof of Lemma 3.4a, Even Case

The MatInt claims: The event happens when \(e \in M'\), which happens w.p. \(\nicefrac 1{28}\), which is at least \(\nicefrac 1{36}\).

The MaxEnt claims: Condition on \(e \in M\). Let \(S_1 = \partial (u) {\setminus } e\) and \(S_2 = \partial (v) {\setminus } e\). Denote \(S_1 = \{a, b, c\}\). Lower bound \({\mathbb {P}}(|S_1 \cap T|=1)\) using Theorem 2.9: \({\mathbb {E}}[|S_1 \cap T|]= 3 \cdot \nicefrac 13 = 1\), so \( {\mathbb {P}}(|S_1 \cap T| = 1) \ge 3 \cdot \nicefrac {1}{3} \cdot \left( \nicefrac {2}{3}\right) ^{2/3} = \nicefrac {4}{9}.\) Consider the distribution over the edges in \(S_2\) conditioned on \(a \in T\); this distribution is also SR. By Theorem 2.8, \( \nicefrac {1}{3} \le {\mathbb {E}}[X_{S_2} \mid X_a = 1] \le 1.\) Applying Theorem 2.8 twice more,

$$\begin{aligned} \nicefrac {1}{3} \le {\mathbb {E}}[X_{S_2} \mid X_a = 1, X_{b,c}=0] \le 1 + \nicefrac {1}{3} + \nicefrac {1}{3} = \nicefrac {5}{3}. \end{aligned}$$

By Theorem 2.9, \({\mathbb {P}}(X_{S_2} = 1 \mid X_a = 1, X_{b,c}=0) \ge 3 \cdot \nicefrac {1}{9} \cdot \left( \nicefrac {8}{9}\right) ^2 = \nicefrac {64}{243}\). Using symmetry, we obtain \({\mathbb {P}}(X_{S_2} = 1 \wedge X_{S_1} = 1 \wedge e \in M) \ge \nicefrac {64}{243} \cdot \nicefrac {4}{9} \cdot \nicefrac 14 \ge \nicefrac {128}{6561}.\) \(\square \)

Proof of Lemma 3.4b, Even Case

The MatInt claims: For part (b), suppose \(e \in M\) (w.p. \(\nicefrac 14\)), then each of uv have two other internal edges, each with y-value \(\nicefrac 13\). Let us say the good cases are when exactly one of these four is chosen; exactly one of uv has degree 2 and the other has degree 1 in these cases. We cannot choose zero of these four edges, because of the connectivity of T, so all bad cases choose at least two of these four. Given the y-values of \(\nicefrac 13\) on all four edges, the expected number of these edges chosen are \(\nicefrac 43\), so the the probability of a bad case at most \(\nicefrac 13\). This means that with probability at least \(\nicefrac 14 \cdot (1- \nicefrac 13) = \nicefrac 1{6}\), exactly one of uv has odd degree in T. Since \(\nicefrac 16 \ge \nicefrac 19\), we have proven (b).

The MaxEnt claims: Observe that the edge e will always contribute either: 0 to both the degree of u in T and the degree of v in T OR 1 to both of these degrees. Let ab be the two internal edges incident to u and cd be the two edges incident to v.

  1. 1.

    Case 1: \(e \in M\). This implies abcd are all \(\nicefrac {1}{3}\)-valued edges, so \({\mathbb {E}}[X_{a,b,c,d}] = \nicefrac {4}{3}\). Note that we must have \(X_{a,b,c,d} \ge 1\). Therefore, \({\mathbb {P}}(X_{a, b, c, d} = 1) \ge \nicefrac {2}{3}\).

  2. 2.

    Case 2: \(e \not \in M\), exactly two of abcd are in M. W.l.o.g., say \(a,c \in M\), so a and c are in T. \({\mathbb {E}}[X_{b, d}] = \nicefrac {2}{3}\), which implies (by Theorem 2.9) that \({\mathbb {P}}(X_{b, d} = 1) \ge \nicefrac {4}{9}\).

  3. 3.

    Case 3: \(e \not \in M\), exactly one of abcd is in M. W.l.o.g., say \(a \in M\), so we have \({\mathbb {E}}[X_{b, c, d}] = 1\). We condition on \(e \not \in T\), which happens w.p. \(\nicefrac {2}{3}\). Since we sample a spanning tree on the internal vertices, this means that at least one of cd is in T. W.l.o.g., say \(c \in T\). So in order to bound the probability that the internal parities of uv are different, equivalently we would like to bound \({\mathbb {P}}(X_{b,d}=1 \mid e \not \in T)\). Since \(\nicefrac {2}{3} \le {\mathbb {E}}[X_{b,d} \mid e \not \in T] \le \nicefrac {2}{3} + \nicefrac {1}{3}\), Theorem 2.9 implies that \({\mathbb {P}}(X_{b,d}=1 \mid e \not \in T) \ge \nicefrac {4}{9}\). So removing the conditioning on \(e \not \in T\) gives a lower bound of \(\nicefrac {2}{3} \cdot \nicefrac {4}{9} = \nicefrac {8}{27}\).

Taking the minimum of the bounds \(\nicefrac 23, \nicefrac 49\), and \(\nicefrac 8{27}\) in the three cases gives

$$\begin{aligned} {\mathbb {P}}(\text {exactly one of }u,v \text { has odd degree in }T) \ge \nicefrac {8}{27} \ge \nicefrac {5}{18}. \end{aligned}$$

\(\square \)

4 Sampling algorithm for general solutions; and cut Hierarchy

Now that we can sample a spanning tree from a graph with no proper min-cuts, we introduce the algorithm to sample an \(r_0\)-tree from a 4-regular, 4EC graph, perhaps with proper min-cuts.

4.1 The algorithm

Assume that the graph \(G = (V,E)\) has a set of three distinguished vertices \(\{r_0, u_0, v_0\}\), with each pair \(r_0, u_0\) and \(r_0, v_0\) having a pair of edges between them. This is without loss of generality (used in line 18). (We can introduce dummy nodes to ensure this property, which is for simplicity—it guarantees that the top set in the cut hierarchy is a cycle set.) Define a double cycle to be a cycle graph in which each edge is replaced by a pair of parallel edges, and call each such pair partner edges. For \(A \subseteq V\), we use G/A to denote the graph that results from contracting the vertex set A.

Algorithm 1
figure a

Sampling an \(r_0\)-tree for a half-integral solution

As in [9], we refer to the sets in line 4 as critical sets. The algorithm samples from the same pieces as in [9], with the key differences being randomizing between the MatInt and MaxEnt samplers; the MaxEnt sampler defined with respect to marginals y; and the critical optimization for \(K_5\)’s. The \(r_0\)-tree sampled is the union of the spanning trees, edges, and paths sampled during the algorithm.

Since G is a 4-regular, 4EC graph at every stage of the algorithm, if \(|S| = 2\) or 3, then S must be a cycle set, whereas if \(|S| \ge 4\), then S may be a degree or cycle set. The following two claims are proved in Appendix A.1, and show that Algorithm 1 is well-defined.

Claim 4.1

The graph remaining at the end of the algorithm (line 18) is a double cycle.

Claim 4.2

In every iteration in Algorithm 1, \(G'\) is either a double cycle or a graph with no proper min-cuts.

We will prove the following theorem in Sect. 6. This in turn gives Theorem 1.1.

Theorem 4.3

Let T be the \(r_0\)-tree chosen from Algorithm 1, and O be the set of odd degree vertices in T. The expected cost of the minimum cost O-join for T is at most \((\nicefrac {1}{2} - \varepsilon )\cdot c(x)\).

4.2 The cut Hierarchy

Recall our ultimate goal is to create a low cost, feasible solution in the O-join polytope, where O is the set of odd degree vertices in our sampled tree T. We start with the fractional solution \(z = x/2\) and then reduce the \(z_e\) value of some edges. In the process, we may violate some constraints corresponding to min-cuts. To fix these cuts, we need a complete description of the min-cuts of a graph. This is achieved by the implicit hierarchy of critical sets that Algorithm 1 induces.

The hierarchy is given by a rooted tree \({\mathcal {T}}= (V_{{\mathcal {T}}}, E_{{\mathcal {T}}})\).Footnote 2 The vertex set \(V_{{\mathcal {T}}}\) corresponds to all critical sets found by the algorithm, along with a root node and leaf nodes representing the vertices in \(V_G \setminus \{r_0\}\). If S is a critical set, we label the node in \(V_{{\mathcal {T}}}\) with S, where we view \(S \subseteq V_G\) and not \(V_{G'}\). The root node is labelled \(V_G \setminus \{r_0\}\) and each leaf nodes is labelled by the vertex in \(V_G \setminus \{r_0\}\) corresponding to it. Now we define the edge set \(E_{{\mathcal {T}}}\). A node S is a child of \(S'\) if \(S \subset S'\) and \(S'\) is the first superset of S contracted after S in the algorithm. In addition, the root node is a parent of all nodes corresponding to critical sets that are not strictly contained in any other critical set (i.e., the critical sets corresponding to the vertices in the graph G from line 17 when the while loop terminates). Finally, each leaf node is a child of the smallest critical set that contains it (or if no critical set contains it, is a child of the root node). Thus by construction, vertex sets labelling the children of a node are a partition of the vertex set labelling that node. A node in \(V_{{\mathcal {T}}}\) is a cycle or degree node if the corresponding critical set labelling it is a cycle or degree set, respectively. We say the root node is a cycle node (since the graph G in line 18 is a double cycle), and accordingly call \(V_G \setminus \{r_0\}\) a cycle set. (The leaf nodes are not labelled as degree or cycle nodes.)

Fig. 2
figure 2

A portion of the cut hierarchy \({\mathcal {T}}\) and the local multigraph \(G{\langle \!\langle S \rangle \!\rangle }\)

Definition 4.4

(Local multigraph) Let \(S \subseteq V_G\) be a set labelling a node in \({\mathcal {T}}\). Define the local multigraph \(G{\langle \!\langle S \rangle \!\rangle }\) to be the following graph: take G and contract the subsets of \(V_G\) labelling the children of S in \({\mathcal {T}}\) down to single vertices and contract \({\bar{S}}\) to a single vertex \(v_{{\bar{S}}}\). Remove any self-loops. The vertex \(v_{{\bar{S}}}\) is called the external vertex; all other vertices are called internal vertices. An internal vertex is called a boundary vertex if it is adjacent to the external vertex. The edges in \(G{\langle \!\langle S \rangle \!\rangle } \setminus v_{{\bar{S}}}\) are called internal edges. Observe that \(G{\langle \!\langle S \rangle \!\rangle }\) is precisely the graph \(G'\) in line 5 of Algorithm 1 when S is a critical set, and is a double cycle when \(S = V_G {\setminus } \{r_0\}\).

Properties of \({\mathcal {T}}\):

  1. 1.

    Let G be a 4-regular, 4EC graph with associated hierarchy \({\mathcal {T}}\). Let \(S \subseteq V_G\) be a set labelling a node in \({\mathcal {T}}\). If S is a degree node in \({\mathcal {T}}\), then \(G{\langle \!\langle S \rangle \!\rangle }\) has at least five vertices and no proper min-cuts, and hence every proper cut in \(G{\langle \!\langle S \rangle \!\rangle }\) has at least 6 edges. If S is a cycle node in \({\mathcal {T}}\), then \(G{\langle \!\langle S \rangle \!\rangle }\) is exactly a double cycle. These follow from Claim 4.2 and the equivalence between \(G'\) and \(G{\langle \!\langle S \rangle \!\rangle }\).

  2. 2.

    Algorithm 1 can be restated as follows: For each non-leaf and non-root node S in \({\mathcal {T}}\), sample a random path on \(G{\langle \!\langle S \rangle \!\rangle } \setminus v_{{\bar{S}}}\) if it is a double cycle or \(K_5\), and otherwise use the \(\textsc {MaxEnt} \) or \(\textsc {MatInt} \) samplers w.p. \(\lambda \) and \(1-\lambda \), respectively, on \(G{\langle \!\langle S \rangle \!\rangle } \setminus v_{{\bar{S}}}\). Sample a uniformly random cycle on the double cycle in line 18.

  3. 3.

    For a degree set S, the graph \(G{\langle \!\langle S \rangle \!\rangle }\) having no proper min-cuts implies that it has no parallel edges. In particular, no vertex has parallel edges to the external vertex in \(G{\langle \!\langle S \rangle \!\rangle }\). Hence we get the following:

Corollary 4.5

For a set S labeling a non-leaf node in \({\mathcal {T}}\) and any internal vertex \(v \in G{\langle \!\langle S \rangle \!\rangle }\): if S is a cycle set then \(|\partial v \cap \partial S| \in \{0,2\}\), and if S is a degree set then \(|\partial v \cap \partial S| \in \{0,1\}\).

Finally, we show how the hierarchy \({\mathcal {T}}\) allows us to characterize the min-cut structure of G. The cactus representation of min-cuts ([6]) is a compact representation of the min-cuts of a graph, and it can be constructed from the cut hierarchy; we defer the details to Appendix A.2. In turn we obtain the following complete characterization of the min-cuts of G in terms of local multigraphs.

Claim 4.6

Any min-cut in G is either (a) \(\partial S\) for some node S in \({\mathcal {T}}\), or (b) \(\partial X\) where X is obtained as follows: for some cycle set S in \({\mathcal {T}}\), X is the union of vertices corresponding to some contiguous segment of the cycle \(G{\langle \!\langle S \rangle \!\rangle }\).

5 Analysis part I: the even-at-last property

The proof of Theorem 4.3 proceeds in two parts:

  1. (1)

    In this section, we show that each edge e is “even-at-last” with constant probability. (This is an extension of the property that both of its endpoints have even degree.)

  2. (2)

    Then we construct the fractional O-join. As always, \(z = x/2\) is a feasible join, but we show how to save a constant fraction of the LP value for an edge when it is even-at-last. This savings causes other cuts to be deficient, so other edges raise their z values in response. However, a charging argument shows that the z-value for an edge does decrease by a constant factor, in expectation. This argument appears in Sect. 6.

To address part (1), let us define a notion of evenness for every edge in G. In the case where G has no proper min-cuts, we called an edge even if both of its endpoints were even in T. Now, the general definition of evenness will depend on where an edge belongs in the hierarchy \({\mathcal {T}}\).

Definition 5.1

We say an edge \(e \in E(G)\) is settled at S if S is the (unique) set such that e is an internal edge of \(G{\langle \!\langle S \rangle \!\rangle }\); call S the last set of e. If S is a degree or cycle set, we call e a degree edge or cycle edge, respectively.

Definition 5.2

(Even-at-Last) Let S be the last set of e, and \(T{\langle \!\langle S \rangle \!\rangle }\) be the restriction of T to \(G{\langle \!\langle S \rangle \!\rangle }\).

  1. 1.

    A degree edge e is called even-at-last (EAL) if both of its endpoints have even degree in \(T{\langle \!\langle S \rangle \!\rangle }\).

  2. 2.

    For a cycle edge \(e = uv\), the graph \(G{\langle \!\langle S \rangle \!\rangle } {\setminus } \{v_{{{\bar{S}}}}\}\) is a chain of vertices \(v_\ell , \ldots , u, v, \ldots , v_r\), with consecutive vertices connected by two parallel edges. Let \(C:= \{v_\ell , \ldots , u\}\), and \(C':= \{v, \ldots , v_r\}\) be the partition of this chain. The cuts \(\partial C\) and \(\partial C'\) are called the canonical cuts for e. Cycle edge e is called even-at-last (EAL) if both canonical cuts are crossed an even number of times by \(T{\langle \!\langle S \rangle \!\rangle }\); in other words, if there is exactly one edge in \(T{\langle \!\langle S \rangle \!\rangle }\) from each of the two pairs of external partner edges leaving \(v_{\ell }\) and \(v_r\).

Informally, a degree edge is EAL in the general case if it is even in the tree at the level at which it is settled. Also note that cycle edges settled at the same (cycle) set are either all EAL or none are EAL.

Definition 5.3

(Special and Half-Special Edges) Let e be settled at a degree set S. We say that e is special if both of its endpoints are non-boundary vertices in \(G\langle \!\langle S \rangle \!\rangle \) and half-special if exactly one of its endpoints is a boundary vertex in \(G\langle \!\langle S \rangle \!\rangle \).

We now prove a key property used in Sect. 6 to reduce the z-values of edges in the fractional O-join.

Theorem 5.4

(The Even-at-Last Property) The table below gives lower bounds on the probability that special, half-special, and all other types of degree edges are EAL in each of the two samplers.

 

Special

Half-special

Other degree edges

MatInt

\(\nicefrac {1}{36}\)

\(\nicefrac {1}{21}\)

\(\nicefrac {1}{18}\)

MaxEnt

\(\nicefrac {128}{6561}\)

\(\nicefrac {4}{27}\)

\(\nicefrac {1}{12}\)

Moreover, a cycle edge is EAL w.p. at least \(\lambda \cdot \nicefrac {1}{12} + (1-\lambda ) \cdot \nicefrac {1}{18}\).

Proof

Let e be settled at S. Let \(T_S\) be the spanning tree sampled on the internal vertices of \(G{\langle \!\langle S \rangle \!\rangle }\) (in the notation of Algorithm 1, the spanning tree sampled on G[S]).

First, assume that S is a degree set:

  1. 1.

    If none of the endpoints of e are boundary vertices in \(G{\langle \!\langle S \rangle \!\rangle }\) (i.e., e is special), then it is EAL exactly when both its endpoints have even degree in \(T_S\). By Lemma 3.4(a), this happens w.p. \(\nicefrac {1}{36}\) for the MatInt sampler and w.p. \(\nicefrac {128}{6561}\) for the MaxEnt sampler.

  2. 2.

    Now suppose that e is half-special, so that exactly one of the endpoints of \(e = uv\) (say u) is a boundary vertex in \(G{\langle \!\langle S \rangle \!\rangle }\), with edge f incident to u leaving S. By Lemma 3.3, the other endpoint v is even in \(T_S\) w.p.  \(\nicefrac 2{21}\) for the MatInt sampler and \(\nicefrac {8}{27}\) for the MaxEnt sampler. Moreover, the edge f is chosen at a higher level than S and is therefore independent of \(T_S\), and hence can make the degree of u even w.p. \(\nicefrac 12\). Thus e is EAL w.p. \(\nicefrac {1}{21}\) for the MatInt sampler and \(\nicefrac {4}{27}\) for the MaxEnt sampler.

  3. 3.

    Suppose both endpoints of e are boundary vertices of S, with edges fg leaving S. Let \(q_=\) be the probability that the degrees of vertices uv in the tree \(T_S\) chosen within S have the same parity, and \(q_{\ne } = 1 - q_=\). Now, when S is contracted and we choose a \(r_0\)-tree \(T'\) on the graph G/S consistent with the marginals, let \(p_=\) be the probability that either both or neither of fg are chosen in \(T'\), and \(p_{\ne } = 1 - p_=\). Hence

    $$\begin{aligned} \Pr [e \; EAL] = q_{oo}p_{11} + q_{oe}p_{10} + q_{eo}p_{01} + q_{ee}p_{00} = \nicefrac 12 (p_= q_= + p_{\ne } q_{\ne }), \end{aligned}$$
    (2)

    where \(q_{oo}, q_{oe}, q_{eo}\) and \(q_{ee}\) correspond to different parity combinations of u and v in \(T_S\) and \(p_{00}, p_{01}, p_{10}, p_{11}\) correspond to whether f and g are chosen in \(T'\). The second inequality follows from Claim C.1 applied to the random \(r_0\)-tree \(T'\).

    1. (a)

      If fg are settled at different levels, then they are independent. This gives \(p_= = p_{\ne } = \nicefrac 12\), and hence \(\Pr [ e \; EAL ] = \nicefrac 14\) regardless of the sampler.

    2. (b)

      If fg have the same last set which is a degree set, then by Lemma 3.2 each of the quantities \(p_{11}, p_{01}, p_{10} \ge \nicefrac 19\). By Claim C.1, \(p_{00}\ge \nicefrac 19\) as well. Hence, (2) gives \(\Pr [ e \; EAL] \ge \nicefrac 19\).

    3. (c)

      If fg have the same last set which is a cycle set, then first consider the case where fg are partners, in which case \(p_{\ne } = 1\). Now (2) implies that \(\Pr [e \; EAL] = \nicefrac {q_{\ne }}{2}\), which by Lemma 3.4(b) is \(\ge \nicefrac 12 \cdot \nicefrac 1{9} = \nicefrac {1}{18}\) in the MatInt sampler, and \(\nicefrac 12 \cdot \nicefrac 5{18} = \nicefrac 5{36}\) in the MaxEnt sampler. If fg are not partners, then they are chosen independently, in which case again \(p_= = p_{\ne } = \nicefrac 12\), and hence \(\Pr [ e \; EAL ] = \nicefrac 14\).

Next, let S be a cycle set. Let \(e = uv\) be an edge inside the cycle, and let \(\{a,b\},\{c,d\}\) be the four edges crossing \(\partial S\). Let \(v_S\) be the vertex obtained by contracting down S (whose incident edges are then \(\{a,b,c,d\}\)). Now in order for e to be EAL, one each of \(\{a,b\}\) and \(\{c,d\}\) must belong to T; call this event \({\mathcal {E}}\). We again consider cases based on where these edges are settled. Let node P be the parent of node S, and let \(v_S\) be the vertex in \(G{\langle \!\langle P \rangle \!\rangle }\) obtained from contracting S.

  1. 1.

    If all four edges are settled at P, and P is a degree set, then Lemma 3.3 says that for the MatInt sampler, we have \(\Pr [{\mathcal {E}}] \ge \nicefrac 4{63}\). In contrast, the MaxEnt sampler gives us \(\Pr [{\mathcal {E}}] \ge \nicefrac {16}{81}\).

  2. 2.

    If all four edges are settled at P, and P is a cycle set, then no matter how these four edges are distributed, \(\Pr [{\mathcal {E}}] \ge \frac{1}{2}\).

  3. 3.

    If three of them \(\{a,b,c\}\) are settled at P, and the fourth (say d) at a higher level, then since P has a vertex \(v_S\) with a single edge leaving it, P must be a degree set. Now exactly one of \(\{a,b\}\) is chosen in T w.p. at least \(\nicefrac 19\) in the MatInt sampler and \(\nicefrac {12}{72}\) in the MaxEnt sampler, by Lemma 3.2. And since d is independently picked at a different level, exactly one of \(\{c,d\}\) is chosen in T w.p. \(\nicefrac 12\), giving an overall probability of \(\nicefrac 1{18}\) in the MatInt sampler and \(\nicefrac {1}{12}\) in the MaxEnt sampler.

  4. 4.

    Finally, if only two edges are settled at P, and two others go to higher levels, then P is a cycle set by Corollary 4.5. In this case, exactly one of the two edges that are settled in P is chosen. Now we want a specific one of the edges going to a higher level to be chosen into T (and the other to not be chosen), which in the worst case happens w.p. at least \(\nicefrac 19\), due to Lemma 3.2.

\(\square \)

6 Analysis part II: the O-join and charging

To prove Theorem 4.3 and thereby finish the proof, we construct an O-join for the random tree T, and bound its expected cost via a charging argument. The structure here is similar to [9]; however, we use a flow-based argument to perform the charging instead of the naive one, and also use our stronger property that every edge is EAL with constant probability (versus the weaker property obtained in [9] that every tight cut contains an EAL edge with constant probability).

Let O denote the (random set of) odd-degree vertices in T. The dominant of the O-join polytope \(K_{join}(G,O)\) is given by

$$\begin{aligned} x(\partial S) \ge 1 \qquad \qquad \forall S \subseteq V, |S \cap O| \text { odd}. \end{aligned}$$

This polytope is integral, so it suffices to exhibit a fractional O-join solution \(z \in K_{join}(G,O)\), with low expected cost. (The expectation is taken over O.) Note that \(|S \cap O| \text { odd}\) if and only if \(|\partial S \cap T| \text { odd}\).

Now Theorem 4.3 follows from the claim below, which we will prove in this section.

Lemma 6.1

There is an \(\varepsilon > 0\) such that if the \(r_0\)-tree T is sampled using the procedure described in Algorithm 1, and O is the set of odd-degree vertices in T, then there is a fractional solution \(z \in K_{join}(G,O)\) where \({\mathbb {E}}[z_e] \le (\nicefrac 12 -\varepsilon ) x_e\) for all edges \(e \in E(G)\).

6.1 Construction of the fractional O-join

The construction of the fractional O-join z goes as follows: We start with the solution \(z=\nicefrac {x}2\). Notice that \(z(\partial S) \ge 1\) is a tight constraint in this initial solution when S is a min-cut. Now we describe how to reduce the \(z_e\) values.

Define

$$\begin{aligned}{} & {} p_{sp}^{MI} {:}{=}\nicefrac {1}{36}, p_{hs}^{MI} {:}{=}\nicefrac {1}{21}, p^{MI} {:}{=}\nicefrac {1}{18}, p_{sp}^{ME} {:}{=}\nicefrac {128}{6561}, p^{ME} {:}{=}\nicefrac {1}{12}\\{} & {} p_{sp} {:}{=}\lambda p_{sp}^{ME} + (1-\lambda )p_{sp}^{MI}, p_{hs} {:}{=}\lambda p^{ME} + (1-\lambda )p_{hs}^{MI}, p {:}{=}\lambda p^{ME} + (1-\lambda )p^{MI}. \end{aligned}$$

Let \(p_{sp}(e), p_{hs}(e),\) and \(p_d(e)\) denote the probabilities that e is EAL if e is a special degree edge, half-special degree edge, or other degree edge, respectively. Let \(p_c(e)\) denote the probability that a cycle edge e is EAL. These constants are just lower bounds on the probability an edge is EAL. Indeed, by Theorem 5.4,

$$\begin{aligned} p_{sp}(e) \ge p_{sp},\ p_{hs}(e) \ge p_{hs},\ p_d(e) \ge p,\ p_c(e) \ge p. \end{aligned}$$

(We do not distinguish the half-special case in the MaxEnt sampler, as the half-special bound of \(\nicefrac {4}{27}\) in Theorem 5.4 is greater than \(p^{ME}\).) Call an edge e a \(K_5\) degree edge if e is settled at a degree set S where \(G{\langle \!\langle S \rangle \!\rangle }\) is a \(K_5\). By the inequalities above, the random variables below are well-defined.

  1. 1.

    Define a Bernoulli random variable for each edge e:

    1. (a)

      If e is a special degree edge, set \(B_e \sim {{\,\textrm{Ber}\,}}(\nicefrac {p_{sp}}{p_{sp}(e)})\).

    2. (b)

      If e is a half-special degree edge, set \(B_e \sim {{\,\textrm{Ber}\,}}(\nicefrac {p_{hs}}{p_{hs}(e)})\).

    3. (c)

      If e is any other degree edge, set \(B_e \sim {{\,\textrm{Ber}\,}}(\nicefrac {p}{p_d(e)})\).

    4. (d)

      If e is a cycle edge, set \(B_e \sim {{\,\textrm{Ber}\,}}(\nicefrac {p}{p_c(e)})\). Further, if e and f are partners, perfectly correlate their coin flips, i.e., set \(B_e = B_f\).

  2. 2.

    If e is EAL and \(B_e = 1\), reduce \(z_e\) by

    1. (a)

      \(\tau \) if e is a non-\(K_5\) degree edge.

    2. (b)

      \(\gamma \) if e is a \(K_5\) degree edge.

    3. (c)

      \(\beta \) if e is a cycle edge.

  3. 3.

    We enforce that \(\tau \le \gamma \le \beta \le \nicefrac {1}{12}\), \(\beta \ge 2\tau \), and \(\beta \ge 2\gamma \). We will optimize \(\tau , \gamma ,\) and \(\beta \) via a linear program later, in Sect. 6.7.

This reduction scheme may make z infeasible for the O-join polytope. We now discuss how to maintain feasibility.

The purpose of the Bernoulli coin flips is to flatten the probability that an edge is reduced down to the lower bound on the probability that it is EAL from Theorem 5.4:

Observation 6.2

If e is a special, half-special, other degree, or cycle edge, then \(z_e\) is reduced with probability exactly \(p_{sp}\), \(p_{hs}\), p, or p, respectively.

6.2 Maintaining feasibility of the fractional O-join via charging

Suppose f is EAL and that we reduce edge \(z_f\) (per its coin flip \(B_f\)). Say f is settled at S. If S is a degree set, then the only min-cuts of \(G\langle \!\langle S \rangle \!\rangle \) are the degree cuts. So the only min-cuts that the edge f is part of in \(G\langle \!\langle S \rangle \!\rangle \) are the degree cuts of its endpoints, call them UV, in \(G\langle \!\langle S \rangle \!\rangle \) (U and V are vertices in \(G\langle \!\langle S \rangle \!\rangle \) representing sets U and V in G). But since \(|\partial U \cap T|\) and \(|\partial V \cap T|\) are both even by definition of EAL, we need not worry that reducing \(z_f\) causes \(z(\partial U) \ge 1\) and \(z(\partial V) \ge 1\) to be violated. Likewise, if S is a cycle set, then by definition of EAL all min-cuts \(S'\) in \(G\langle \!\langle S \rangle \!\rangle \) containing e have \(|\partial S' \cap T|\) even, so again we need not worry.

Since f is only an internal edge for its last set S, only certain O-join constraints (recall O-join constraints are of the form \(z(\partial S') \ge 1\), \(|\partial S' \cap T|\) odd) may be violated by reducing \(z_f\). In particular, only the cuts \(S'\) represented in lower levels of the hierarchy (that is, cuts \(S'\) in multigraphs \(G{\langle \!\langle X \rangle \!\rangle }\) for \(X \subset S\)) may be violated, since these are the only multigraphs in which f can be an external edge. Specifically, let f be an external edge for some \(G{\langle \!\langle X \rangle \!\rangle }\) and \(S'\) be a min-cut of \(G{\langle \!\langle X \rangle \!\rangle }\) (either a degree cut or a canonical cut). (For instance, in Fig. 2, let f be an edge between \(S_1\) and \(S_2\). We have that f is settled at S. Take \(X=S_2\). Now f is an external edge in \(G{\langle \!\langle X \rangle \!\rangle }\). Reducing \(z_f\) may now cause a min-cut \(S'\) in \(G{\langle \!\langle X \rangle \!\rangle }\) to become violated.) By Claim 4.6, cuts of the form \(S'\) (i.e., min-cuts in \(G{\langle \!\langle X \rangle \!\rangle }\) for \(X \subset S\)) are the only cuts that may be violated in the O-join polytope as a result of reducing \(z_f\). Call the internal edges (in \(G{\langle \!\langle X \rangle \!\rangle }\)) of \(\partial S'\) lower edges. When \(z_f\) is reduced and \(|\partial S' \cap T|\) is odd, we must distribute an increase (charge) over the lower edges totalling the amount by which \(z_f\) is reduced, so that \(z(\partial S') = 1\). We say these lower edges receive a charge from f. How the charge is distributed will depend on whether the lower edges are degree edges or cycle edges (see Sect. 6.3 and Sect. 6.4 below). (For instance, in the example from Fig. 2, letting \(f = (u,v)\) where \(v \in S_2\), if \(z_f\) is reduced by \(\beta \), we could increase each internal edge in \(G{\langle \!\langle X \rangle \!\rangle }\) incident to v by \(\beta /2\), i.e, charge these edges.)

We claim that this procedure maintains feasibility of the O-join solution z. Indeed, by Claim 4.6, no constraint corresponding to a min-cut in the O-join polytope is violated. Further, by capping the amount an edge can be reduced at \(\nicefrac {1}{4} - \nicefrac {1}{6} = \nicefrac {1}{12}\), we will ensure that none of the constraints are violated, since every other cut has at least 6 edges.

To prove Lemma 6.1, we will lower bound the expected decrease to \(z_e\), using different strategies for charging e when e is a cycle edge versus a degree edge. In particular, when e is a cycle edge, we distribute charge from an external edge evenly between e and its partner. When e is a degree edge, charge will be distributed according to a maximum-flow solution.

6.3 Charging of cycle edges

We now analyze the expected net decrease for each edge, starting with cycle edges. The expected decrease for a cycle edge starts off as exactly \(p \cdot \beta \) by Observation 6.2. However, we need to control the amount of charge the edge receives from edges settled at higher levels. Note that in the calculations, we assume the worst-case scenario that such edges settled at higher levels are reduced w.p. exactly p. The special/half-special edges are reduced with lower probability, which is only better for us because the expected amount of charge will be lower in these cases. Note also that throughout the calculations we may assume that an edge settled at a higher level is a cycle edge. This will follow from the assumption that \(\beta \ge 2\tau , 2\gamma \).

Consider a cycle edge e that is settled at set S. Let its partner on the cycle be \(e'\), and let the four external edges for S be \(\{a,b\}\) (leaving the vertex u) and \(\{c,d\}\) (leaving vertex v). When S is contracted, vertex \(v_S\) has these four edges \(\{a,b,c,d\}\) incident to it. Let P be S’s parent node in \({\mathcal {T}}\). Moreover, let C and \(C'\) be the canonical cuts, as in the figure below (Figs. 3 and 4).

Fig. 3
figure 3

Case (i)

Fig. 4
figure 4

Case (ii)

We start with a small lemma which will be invoked in our casework.

Lemma 6.3

If P has two edges leaving it and P is a cycle set, one must be from \(\{a,b\}\) and one from \(\{c,d\}\).

Proof

To see this, say both c and d (leaving node v) both did not leave P. Since P is a cycle set, c and d are partner edges. Consider the min cut-set \(S'\) which c and d are adjacent to in \(G{\langle \!\langle P \rangle \!\rangle }\). \(S' \cup v\) is a proper min cut which crosses S; however, this contradicts the fact that S is a critical set (namely S is not supposed to be crossed by another proper min cut). \(\square \)

We consider the following cases as in [9].

  1. (i)

    Two of the four edges incident to \(v_S\) also leave P—and therefore P is a cycle set by Corollary 4.5. By Lemma 6.3, of the two edges leaving P, one must be from \(\{a,b\}\) and one from \(\{c,d\}\). Without loss of generality, let \(\{a,d\}\) are the external edges for P, and \(\{b,c\}\) are internal (and hence partners in this cycle set). Now, consider the event that b is reduced; the important observation is that c is also reduced, because both are EAL at the same time and their coins are perfectly correlated. Not only that, the event that b is reduced implies that \(|\delta (S) \cap T|\) is even. This means that both cuts \(C, C'\) are deficient by the same amount and at the same time, due to this event of reducing bc, and raising e helps both of them. This means the net expected charge to e is at most

    $$\begin{aligned} p\frac{\beta }{2} \big ( \Pr [ C \text { odd}\mid b \text { reduced}] + \Pr [ C \text { odd}\mid a \text { reduced}] + \Pr [ C' \text { odd}\mid d \text { reduced}] \big ). \end{aligned}$$

    (Note that there is no term for c here because of the discussion above; moreover, the \(\nicefrac \beta 2\) term reflects that the charge of \(\beta \) is split between \(e,e'\).) Since a and b are settled at different levels, the parity of cut C is unbiased, even either conditioned on choices within P, or else conditioned on choices outside P. The same holds for the parity of \(C'\), so each of the probabilities above is at most \(\nicefrac 12\), making the net expected decrease

    $$\begin{aligned} p(\beta - \nicefrac \beta 2 \cdot 3 \cdot \nicefrac {1}2) = \boxed {p \nicefrac \beta 4}. \end{aligned}$$
    (3)
  2. (ii)

    Only one of the edges incident to \(v_S\), say edge a, is external for P. By Corollary 4.5, P is a degree set, and in the worst case, a \(K_5\) degree set since \(\gamma \ge \tau \). The charge is maximized when a is a cycle edge, in which case the expected charge to e is at most

    $$\begin{aligned} p \big ( \nicefrac \beta 2 \Pr [ C \text { odd}\mid a \text { reduced}] + \nicefrac \gamma 2 \Pr [ C \text { odd}\mid b \text { reduced}] + 2\cdot \nicefrac \gamma 2 \big ). \end{aligned}$$

    Since a and b are settled at different levels, the parity of the cut C is unbiased, either conditioning on the process inside P or else outside P, which makes the above expression \(p(\nicefrac \beta 4 + \nicefrac {5\gamma }{4})\), and the net expected decrease to be

    $$\begin{aligned} p(\beta - \nicefrac \beta 4 - \nicefrac {5\gamma }{4}) = \boxed {p(\nicefrac {3\beta }4 - \nicefrac {5\gamma }{4})}. \end{aligned}$$
    (4)
  3. (iii)

    All four of the edges are internal to P, which is a cycle set. The partners must now be one edge from each of the two pairs \(\{a,b\}\) and \(\{c,d\}\)—let’s say \(\{a,d\}\) and \(\{b,c\}\) are partners (due to the proof of Lemma 6.3). The cut C is odd in S if and only if \(C'\) is odd; indeed, C is odd means we choose both or neither of \(\{a,b\}\), and since \(\{c,d\}\) are the partners of \(\{a,b\}\), we choose both or neither of them. Hence, increasing e fixes both cuts at the same time, similar to the argument in (i), so we focus only on the increase due to cut C. Since a and b belong to different pairs and are chosen independently into the tree T, \(\Pr [C \text { odd}] = \nicefrac 12\). This means both a and b can give a expected charge of \(p(\nicefrac \beta 2)\cdot \nicefrac 12\) to e. The net expected decrease is \(p(\beta - 2\cdot \nicefrac \beta 2\cdot \nicefrac 12 ) = p\nicefrac \beta 2\), which is no worse than (3).

  4. (iv)

    All four of the edges are internal to P, and P is a degree set (Figs. 5 and 6. Note that \(G{\langle \!\langle P \rangle \!\rangle }\) cannot be a \(K_5\) since the vertex \(v_S\) has four internal edges incident to it. We simply bound the charge due to each edge by \(\tau /2\), and hence get net expected decrease of

    $$\begin{aligned} p(\beta - 4\cdot \nicefrac \tau 2) = \boxed {p(\beta - 2\tau )}. \end{aligned}$$
    (5)
Fig. 5
figure 5

Case (iii)

Fig. 6
figure 6

Case (iv)

6.4 Charging degree edges: max-flow formulation

So far we have shown that no cycle edge will receive too much charge. We did this by considering four different configurations for the external edges of S, and distributing charge evenly between a cycle edge and its partner. Before showing that no degree edge receives too much charge, we will define a charging scheme for degree edges that achieves better bounds than distributing charge uniformly. A similar flow-based charging argument was independently used in [10].

Let \(G\langle \!\langle S \rangle \!\rangle \) be the local multigraph for some degree set S, so \(G\langle \!\langle S \rangle \!\rangle \) is 4-regular and has no proper min-cuts. For an external edge e, let \(u_e\) denote the internal boundary vertex to which e is incident. We create a bipartite graph \(H=(B,F,E)\) from \(G\langle \!\langle S \rangle \!\rangle \) as follows. The vertices of B are labelled with the external edges of \(G\langle \!\langle S \rangle \!\rangle \). The vertices of F are labelled with the internal edges of \(G\langle \!\langle S \rangle \!\rangle \). So \(|B|=4\). Place an edge between \(e \in B\) and \(f \in F\) in the edge set E if e and f share \(u_e\) as an endpoint. We call H the bipartization of \(G\langle \!\langle S \rangle \!\rangle \).

The following lemma follows from the Max-Flow Min-Cut theorem:

Lemma 6.4

Given a bipartite graph \(G=(B,F,E)\) and \(c \ge 0\), there exists \(m: B \times F \rightarrow {\mathbb {R}}_{+}\) satisfying

$$\begin{aligned} \textstyle \sum _{f \in \partial u} m(u, f)&= 1 \text { for all } u \in B, \end{aligned}$$
(6)
$$\begin{aligned} \textstyle \sum _{u \in \partial f} m(u, f)&\le c \text { for all } f \in F \end{aligned}$$
(7)

if and only if

$$\begin{aligned} \textstyle |N(R)| \ge \frac{|R|}{c} \text { for all } R \subseteq B \end{aligned}$$
(8)

Lemma 6.4 now gives the following result, which follows from straightforward casework.

Lemma 6.5

Let \(H=(B,F,E)\) be the bipartization of \(G\langle \!\langle S \rangle \!\rangle \), as described above. If \(G\langle \!\langle S \rangle \!\rangle \) is not a \(K_5\), then the smallest c satisfying (8) is given by \(c=\nicefrac {1}{2}\). If \(G\langle \!\langle S \rangle \!\rangle \) is a \(K_5\), then the smallest c satisfying (8) is given by \(c=\nicefrac {2}{3}\).

The fact that we may achieve a value of \(c = \nicefrac 12\) instead of \(c = \nicefrac 23\) as long as \(G\langle \!\langle S \rangle \!\rangle \) is not a \(K_5\) motivates why we utilize a different sampler for \(K_5\). In fact, \(c = \nicefrac 23\) on \(K_5\) is achieved by distributing uniform charge over internal edges.

6.5 Charging of non-\(K_5\) degree edges

We now analyze the expected net decrease for degree edges. By Observation 6.2, the expected decrease for a non-special degree edge starts off as at least \(p_{hs} \cdot \tau \) or \(p_{hs} \cdot \gamma \), since \(p_{hs} \le p\). We will analyze the charge on non-\(K_5\) degree and \(K_5\) degree edges separately, as a different sampler is used when \(G{\langle \!\langle S \rangle \!\rangle } = K_5\). We begin with the former.

6.5.1 Charging of non-special edges

  1. 1.

    Case 1: All external edges are non-\(K_5\) degree edges. Let \(m: B \times F \rightarrow {\mathbb {R}}_+\) be any function satisfying the constraints in Lemma 6.4 for \(c=\nicefrac {1}{2}\) (by Lemma 6.5 such an m exists, and can be found efficiently using max-flow min-cut). The charging scheme is as follows: for each external edge e and boundary vertex \(u_e\), if the cut \(\delta (u)\) is odd and the edge e is reduced, charge each internal edge f incident to \(u_e\) by \(\tau \cdot m(e,f)\). The constraint (6) ensures that the charge neutralizes the reduction of e by \(\tau \). For any internal edge f, let \(\delta (f)\) denote the external edges with which f shares an endpoint, i.e., the neighbors of f in the bipartization \(H = (B,F,E)\). We have

    $$\begin{aligned} {\mathbb {E}}[\text {charge to } f]&= \sum _{e \in \delta (f)} \tau \cdot m(e, f) \cdot {\mathbb {P}}(u_e \text { odd}\wedge e \text { reduced}) \nonumber \\&\le \tau \cdot c \cdot {\mathbb {P}}(u_e \text { odd}\mid e \text { reduced}) \cdot {\mathbb {P}}(e \text { reduced}) \end{aligned}$$
    (9)

    Using the naive bound that \({\mathbb {P}}(u_e \text { odd}\mid e \text { reduced}) \le 1\), we obtain that the the expected decrease is at least \(\boxed {p_{hs}\tau - \frac{\tau }{2} \cdot p.}\)

  2. 2.

    Case 2: All external edges are \(K_5\) degree edges. An analogous argument to (9) (with \(\gamma \) instead of \(\tau \)) shows that the expected decrease is at least \(\boxed {p_{hs}\tau - \frac{\gamma }{2} \cdot p.}\)

  3. 3.

    Case 3: All external edges are cycle edges. We have that \({\mathbb {P}}(u_e \text { odd}\mid e \text { reduced}) = \frac{1}{2}\) in this case, since \(f \in T\) is independent of whether f is reduced since f has a partner edge. So an analogous argument to (9) (with \(\nicefrac {\beta }{2}\) instead of \(\tau \)) shows that the expected decrease is at least \(\boxed {p_{hs}\tau - \frac{\beta }{2} \cdot \frac{1}{2} \cdot p.}\)

  4. 4.

    Case 4: The external edges are a combination of non-\(K_5\) degree edges, \(K_5\) degree edges, and cycle edges. In this case, An analogous argument to (9) (with \(\max \{\frac{\tau }{2}, \frac{\gamma }{2}, \frac{\beta }{4}\}\) instead of \(\tau \)) shows that the expected decrease is at least \(p_{hs}\tau - p \cdot \max \{\frac{\tau }{2}, \frac{\gamma }{2}, \frac{\beta }{4}\}\).

6.5.2 Charging of special edges

These edges are never charged, as their endpoints are both non-boundary. Also, these cannot be \(K_5\) degree edges by definition. So the expected decrease is equal to the expected reduction, which is at least \( \boxed {p_{sp}\tau = \frac{\tau }{36}.}\)

6.6 Charging of \(K_5\) degree edges

\(K_5\) degree edges have an initial decrease of exactly \(p\gamma \), as they are neither special nor half-special. We will use the following lemma, whose proof is straightforward.

Lemma 6.6

Let \(G{\langle \!\langle S \rangle \!\rangle }=K_5\). Let f be an external edge and u be the boundary vertex that is one of its endpoints. Then

$$\begin{aligned} Pr(u \text { odd}\mid f \text { reduced}) = \frac{1}{2}. \end{aligned}$$

We will factor in this probability into the charging of \(K_5\) degree edges below.

  1. 1.

    All external edges are non-\(K_5\) degree edges. Since the edge being charged is a \(K_5\) edge, Lemma 6.5 gives us a worst-case expected charge of \(p\cdot \frac{2\tau }{3}\). However, factoring in \(Pr(u \text { odd}\mid f \text { reduced})=\frac{1}{2}\) for every external edge f, we have a worst-case expected charge of \(p \cdot \frac{2\tau }{3} \cdot \frac{1}{2}\), which by (9) gives us an expected net decrease of at least

    $$\begin{aligned} \boxed {p\gamma - p\frac{\tau }{3}.} \end{aligned}$$
  2. 2.

    All external edges are \(K_5\) degree edges. Using similar reasoning as above, we have a worst-case expected charge of \(p \cdot \frac{2\gamma }{3} \cdot \frac{1}{2}\), which gives us an expected net decrease of at least

    $$\begin{aligned} \boxed {p\gamma - p\frac{\gamma }{3}.} \end{aligned}$$
  3. 3.

    All external edges are cycle edges. Using the same reasoning as above, we get an expected net decrease of at least

    $$\begin{aligned} \boxed {p\gamma - p \frac{\beta }{3}.} \end{aligned}$$
  4. 4.

    The external edges are a combination of non-\(K_5\) degree edges, \(K_5\) degree edges, and cycle edges. An argument as in the previous section shows that the three cases above are the only possible worst cases.

6.7 Setting the parameters

For fixed \(\lambda \), finding the optimal values of \(\beta , \gamma , \tau \) can be done with a linear program. In particular, we maximize the minimum expected decrease \(\delta \), which is the minimum of the boxed expressions in §6.3 and Sect. 6.4. The additional constraints are that \(\tau \le \gamma \le \beta \), \(\beta \le \nicefrac {1}{12}\), \(\beta \ge 2\tau \), and \(\beta \ge 2\gamma \). We then optimize over \(\lambda \). This gives \(\lambda = 0.4715\), \(\beta = \nicefrac {1}{12}\), \(\gamma = 0.0401\), \(\tau = 0.0355\), and \(\delta = 0.0008475\). We have \(\varepsilon = 2\delta = 0.001695\), proving Theorem 4.3.