1 Introduction

Our work on quadrangulation edge flips places itself in the midst of a developing area of research whose origin can be partly traced back to a question of Aldous about triangulations of the n-gon [2]. The question concerns a discrete time edge flip Markov chain analogous to the one we will introduce, defined on the state space of triangulations of the regular n-gon (i.e. on the possible sets of diagonals which partition the n-gon into triangular regions). A single step of the Markov chain, given a triangulation, consists of picking a diagonal uniformly at random, deleting it and replacing it with the opposite diagonal in the quadrilateral created by its absence. One would wish to analyse the growth of the mixing time of this chain (which is sometimes referred to as the triangulation walk) as a function of the size n of the triangulation (or, equivalently, of the size of the state space, which is exponential in n). Aldous conjectures an upper bound of \(n^{3/2}\) (up to logarithmic factors in n) for the order of the relaxation time of this chain. In connection with this problem, he conjectures the same upper bound for a chain defined on n-cladograms, a type of binary tree structure with labelled leaves whose relevance also lies in its role as a formalisation of phylogenetic trees from systematic biology, which model evolutionary relationships between species [3].

An important feature of triangulations of the n-gon is the fact that they are counted by Catalan numbers: more precisely, there are \(C_{n-2}\) triangulations of the n-gon, where \(C_n=\frac{1}{n+1}{2n\atopwithdelims ()n}\). In fact, there is an extreme abundance of combinatorial structures which are counted by Catalan numbers, from Dyck paths to strings of matched parentheses to plane trees and beyond, with a thriving net of explicit bijections woven between them, which often highlight surprising connections between the geometric features of different objects.
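As a small sanity check of the count just stated, the following Python sketch compares the closed formula for \(C_{n-2}\) with a direct recursive count of triangulations of the n-gon. The function names are purely illustrative, and the recursion (splitting along the triangle that contains a fixed side) is one standard way of counting, chosen here for brevity.

```python
from functools import lru_cache
from math import comb


def catalan(n):
    """Closed formula C_n = binom(2n, n) / (n + 1)."""
    return comb(2 * n, n) // (n + 1)


@lru_cache(maxsize=None)
def count_triangulations(n):
    """Number of triangulations of a convex n-gon with vertices 1..n.

    A 2-gon (a single segment) is counted as having one empty triangulation,
    which makes the recursion below uniform.
    """
    if n <= 3:
        return 1
    # The side (1, n) lies in exactly one triangle (1, k, n) with 2 <= k <= n-1;
    # this triangle splits the polygon into a k-gon and an (n-k+1)-gon.
    return sum(
        count_triangulations(k) * count_triangulations(n - k + 1)
        for k in range(2, n)
    )


for n in range(3, 12):
    assert count_triangulations(n) == catalan(n - 2)
```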

It seems therefore natural to attempt a systematic study of Markov chains defined on Catalan structures, but this task has proved very hard. For one thing, the natural notion of adjacency for different Catalan structures does not always translate well via sensible bijections, which gives rise to a rich panorama of different chains one might consider. But even concentrating on a single Markov chain has proved challenging so far, as attested by the relative scarceness of tight bounds for their mixing times, one notable exception being Wilson’s result [29] for adjacent transpositions on Dyck paths.

Twenty years after a serious effort was started on problems of this kind, we still do not have tight bounds for the mixing of the triangulation walk proposed by Aldous. Molloy, Reed and Steiger showed an \(\Omega (n^\frac{3}{2})\) lower bound for its mixing time [24], while the best upper bound to date is McShine and Tetali’s \(O(n^5\log n)\) obtained in [21], where they analyse Markov chains on a number of other Catalan structures. As for n-cladograms, while the conjecture Aldous made in conjunction with the triangulation walk remains open, some chains have proved easier to analyse: Aldous himself showed an upper bound of \(O(n^3)\) for the relaxation time of a particular chain [3], improved to \(O(n^2)\), which is tight, by Schweinsberg [27]; also note Löhr, Mytnik and Winter’s work on the chain in the diffusion limit [20] as well as Forman, Pal, Rizzolo and Winkel’s in a similar vein [17]. Furthermore, recent results for the mixing of a very natural chain on Dyck paths were obtained by Cohen, Tetali and Yeliussizov [12] by rephrasing it as a basis exchange walk on a balanced matroid.

On the other hand, for many natural chains on Catalan structures, triangulations and related objects not even a polynomial upper bound for the mixing time is known. One such example is that of lattice triangulations, where polynomial bounds are only known for biased versions of the chain [9, 10, 28]; see also works on rectangular dissections, for which polynomial bounds were obtained very recently [7, 8].

One may also consider edge flip Markov chains on planar maps, and in particular on the set of p-angulations of the sphere of size n (with \(p\ge 3\)), that is the set of spherically embedded connected planar multigraphs with n faces of degree p (considered equivalent under orientation-preserving homeomorphisms of the sphere). An edge flip Markov chain on this state space can be defined as follows: at each step, an edge is selected uniformly at random, erased and replaced with one of the edges that can be drawn within the face of perimeter \(2p-2\) left behind in order to form two faces of degree p. The only result shown so far for this chain pertains to the case of triangulations of the sphere (\(p=3\)), where the mixing time is known to be of order at least \(n^{\frac{5}{4}}\) [6]. No polynomial upper bound on the mixing time was known, prior to this work, for any \(p\ge 3\).

In this paper we consider the case of rooted quadrangulations with n faces (i.e. the case \(p=4\), where maps are endowed with a distinguished oriented edge) and derive the first polynomial upper bound on the mixing time.

Note that quadrangulations in particular occupy a privileged position within the panorama of planar maps, mainly thanks to the famous bijections first developed by Cori, Vauquelin and Schaeffer [11, 13], which encode them with (different classes of) labelled plane trees, thus placing them within the framework of (generalised) Catalan structures.

The relation with trees has been exploited to obtain both scaling and local limit results which have led to the definition and subsequent investigation of objects such as the Brownian map [18, 23] and the UIPQ [14, 22], providing very rich insights into the geometric structure of uniform random large quadrangulations. In fact, it has been shown that a number of classes of uniform random planar maps converge to the Brownian map, whose universality makes quadrangulations of the sphere a very useful model for a random surface. Quadrangulations and, in general, planar maps are also very much studied in physics, in the context of quantum gravity, where the edge flip Markov chain is extensively applied in simulations.

Our contribution within this paper will consist in estimating the mixing time of the edge flip Markov chain \({\mathcal {F}}_n\) on the set of rooted quadrangulations of the sphere with n faces, as described above and much more thoroughly in Sect. 2. In particular, we shall prove the following.

Theorem 1

Let \(\nu _n\) be the spectral gap of the edge flip Markov chain \({\mathcal {F}}_n\) on the set \({\mathsf {Q}}_n\) of rooted quadrangulations with n faces. There are positive constants \(C_1,C_2\) independent of n such that

$$\begin{aligned} C_1n^{-\frac{11}{2}}\le \nu _n\le C_2n^{-\frac{5}{4}}. \end{aligned}$$

Consequently, the mixing time for \({\mathcal {F}}_n\) is \(O(n^{13/2})\).
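To see how the mixing time bound follows from the lower bound on the spectral gap, one may use the standard estimate for reversible chains in terms of the relaxation time and the minimal stationary mass. The following is only a sketch, under the assumptions that \(t_{\mathrm {mix}}\) denotes the total variation mixing time at accuracy 1/4, that the chain stays put with probability bounded away from zero (so that negative eigenvalues play no role for large n), that the stationary distribution is uniform, and that \(|{\mathsf {Q}}_n|\) grows at most exponentially in n:

$$\begin{aligned} t_{\mathrm {mix}}\le \frac{1}{\nu _n}\log \left( 4|{\mathsf {Q}}_n|\right) \le \frac{n^{\frac{11}{2}}}{C_1}\cdot O(n)=O(n^{\frac{13}{2}}). \end{aligned}$$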

The upper bound for the spectral gap is the same as Budzinski’s lower bound for the mixing time of flips on triangulations (which indeed it implies for our case \(p=4\)); the strategy by which we obtain it is quite general and would apply in a much broader context (cf. Remark 4.1).

As for the lower bound, we obtain it through a comparison (achieved with techniques developed by Diaconis and Saloff-Coste [16]) to a chain on labelled trees which arises via the aforementioned Schaeffer bijection. This chain, which has very natural interpretations on a number of Catalan structures, is a coloured generalisation of a chain on plane trees with n edges whose steps are as follows. Given a tree, pick an edge uniformly at random, and if it is a leaf, then choose one among the following three options with equal probabilities: leave the leaf intact, slide it one step to the left or slide it one step to the right (see Fig. 6). If the chosen edge is not a leaf, then do nothing. This chain is also natural in the context of Dyck paths: it is essentially equivalent to picking a vertex of the path uniformly at random and, if the vertex is a peak, translating the peak one position to the right or to the left, with equal probabilities (Fig. 7).
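The Dyck path version of this chain can be sketched in a few lines of Python. This is only an illustration under stated assumptions: a Dyck path is encoded as a list of steps +1 (up) and -1 (down), a vertex is chosen among all 2n+1 vertices of the path, the three options (stay, move left, move right) are taken with equal probabilities as in the tree description, and a peak that has no neighbouring step on the chosen side is simply left in place. The names `is_dyck` and `peak_translation_step` are hypothetical.

```python
import random


def is_dyck(steps):
    """True if the +1/-1 sequence never goes below 0 and ends at height 0."""
    height = 0
    for s in steps:
        height += s
        if height < 0:
            return False
    return height == 0


def peak_translation_step(steps, rng=random):
    """One step of the peak translation chain; returns a new list of steps."""
    m = len(steps)
    i = rng.randrange(m + 1)          # vertex i lies between steps[i-1] and steps[i]
    move = rng.choice((-1, 0, 1))     # left, stay, right (assumed equally likely)
    is_peak = 0 < i < m and steps[i - 1] == 1 and steps[i] == -1
    new = list(steps)
    if not is_peak or move == 0:
        return new
    if move == 1 and i + 1 < m:
        # pattern (U, D, x) becomes (x, U, D): the peak moves one position right
        new[i - 1], new[i], new[i + 1] = steps[i + 1], 1, -1
    elif move == -1 and i - 2 >= 0:
        # pattern (x, U, D) becomes (U, D, x): the peak moves one position left
        new[i - 2], new[i - 1], new[i] = 1, -1, steps[i - 2]
    return new


rng = random.Random(0)
path = [1, -1] * 4
for _ in range(200):
    path = peak_translation_step(path, rng)
    assert is_dyck(path)
```

Note that, in this encoding, translating a peak never takes the path below height 0, so no rejection step is needed apart from the boundary cases.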

Though apparently not yet analysed within the scope of existing mathematical research about chains on Catalan structures, this ‘leaf translation’ chain is mentioned in the physics literature under the name of Fredkin spin chain, and a first lower bound of order \(n^{-\frac{11}{2}}\) for its spectral gap is given by Movassagh [25], based on work by himself and Shor [26]. We shall partially follow their argument, which is based on the method of building canonical paths to estimate the conductance, to produce an improved lower bound of order \(n^{-\frac{9}{2}}\) (see Theorem 4), which will be instrumental to obtain our result for flips on quadrangulations.

The paper is organised as follows. Sections 2 and 3 will provide the reader with all relevant definitions and recall some details of the Schaeffer bijection, since they will be relevant to our subsequent constructions. In Sect. 4 we give an upper bound of order \(n^{-\frac{5}{4}}\) for the spectral gap of \({\mathcal {F}}_n\) by considering the Dirichlet form evaluated at the function that gives the radius of a quadrangulation. Section 5 will acquaint the reader with the leaf translation Markov chain on plane trees (and a “leaf replanting” variant) and prove our lower bound for its spectral gap. Finally, a large portion of the paper—namely, Sect. 6—will be devoted to showing our lower bound for the spectral gap of \({\mathcal {F}}_n\) via a comparison with a chain on pointed rooted quadrangulations, which bridges the gap between \({\mathcal {F}}_n\) and the leaf translation Markov chain on labelled plane trees.

2 Edge flips on quadrangulations

Throughout this paper we shall be dealing with certain Markov chains whose configuration space is the set of quadrangulations with a fixed number of faces; in order to introduce them, let us first discuss some notation.

First and foremost, we shall be adopting some of the language of planar maps, with which we assume some familiarity: we will be referring to maps and their vertices, edges, faces, as well as corners and face contours; we shall not review any definitions but the basic one, that is:

Definition 2.1

A planar map is a connected, locally finite planar multigraph endowed with a cellular embedding in the sphere \(S^2\), considered up to orientation-preserving homeomorphisms of the sphere itself. We will call a rooted planar map of size n a planar map with n faces, endowed with one distinguished oriented edge.

One can now define a quadrangulation of size n, or of area n, as a rooted planar map of size n all of whose faces have four corners (see Fig. 1); we shall denote the set of all quadrangulations of size n by \({\mathsf {Q}}_n\). It follows from Euler’s polyhedral formula that a quadrangulation \(q\in {\mathsf {Q}}_n\) has 2n edges and \(n+2\) vertices; it is also worth noting that a quadrangulation is automatically bipartite, which implies that it has no loops (that is, it has no edges with only one endpoint). It may, however, have multiple edges between the same two endpoints, and edges which are adjacent to a single face; we call the latter double edges of the face they belong to, and the face which contains them a degenerate face. We shall often refer to the vertex that the root edge is issued from as the origin of the quadrangulation.
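For the reader's convenience, the count of edges and vertices can be spelled out as follows: each of the n faces has four corners and each edge accounts for exactly two corners (one on each of its sides), so that Euler's formula \(|V(q)|-|E(q)|+n=2\) yields

$$\begin{aligned} 4n=2|E(q)|\ \Rightarrow \ |E(q)|=2n,\qquad |V(q)|=2+|E(q)|-n=n+2. \end{aligned}$$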

Given a quadrangulation \(q\in {\mathsf {Q}}_n\) and an edge e of q, we will denote by \(q^{e,+}\) (resp. \(q^{e,-}\)), the quadrangulation obtained from q by flipping edge e clockwise (resp. counterclockwise); more formally, we mean the quadrangulation given by the following procedure:

  • if e is adjacent to two distinct faces of q, erase e from q (thus obtaining a new face with exactly 6 corners) and replace it with the edge obtained by rotating e clockwise (resp. counterclockwise) by one corner (see Fig. 2).

  • if e is a double edge within a degenerate face, let v be the vertex of that face that is not an endpoint of e and let w be the endpoint of e having degree 1; erase e and replace it with an edge within the same face having endpoints v and w. If e is the root edge of q, let the newly drawn edge be the root of \(q^{e,+}\) (resp. \(q^{e,-}\)), oriented in the same way as before (with respect to w).

Fig. 1

On the left, a quadrangulation q in \({\mathsf {Q}}_8\); notice that we may choose to embed it in the plane (rather than the sphere) in a canonical way by having the external face be the one lying directly to the right of the root edge. The origin of q is marked in red. To the right, the two kinds of faces in a quadrangulation—non-degenerate and degenerate—with marked corners in clockwise order. The double edge in the degenerate face is the one adjacent to \(c_3\), which is drawn with a thicker line (color figure online)

Fig. 2

Clockwise and counterclockwise flips for a simple and a double edge in a quadrangulation; if the edge is the root, its orientation is “preserved” as in the figure

Throughout this paper, we will consider the Markov chain \({\mathcal {F}}^n\) of quadrangulation edge flips on the state space \({\mathsf {Q}}_n\), whose transition probabilities are of the form

$$\begin{aligned} p(q,q')=\frac{1}{6n}\sum _{e\in E(q)}\left( 1_{q'=q^{e,+}}+1_{q'=q^{e,-}}+1_{q=q'}\right) ; \end{aligned}$$

in other words, conditionally on \({\mathcal {F}}^n_k=q\), \({\mathcal {F}}^n_{k+1}\) can be determined by choosing an edge e of q uniformly at random and setting either \({\mathcal {F}}^n_{k+1}=q\), or \({\mathcal {F}}^n_{k+1}=q^{e,+}\), or \({\mathcal {F}}^n_{k+1}=q^{e,-}\), with equal probabilities.

Notice that, given a pair \((q,q')\) of distinct quadrangulations in \({\mathsf {Q}}_n\), there are at most four distinct pairs \((e,s)\) in \(E(q)\times \{+,-\}\) such that \(q'=q^{e,s}\). In fact, assuming e is not the root edge of q then e is uniquely determined by the pair \((q,q')\), and either s is determined as well or, in the case where e is a double edge, we have \(q'=q^{e,+}=q^{e,-}\). In addition, if e is not the root edge, it is possible that flipping the root edge might transform q into \(q'\); in other words, that \(q'=q^{\eta ,+}\) or \(q'=q^{\eta ,-}\), where \(\eta \ne e\) is the root edge of q. Consequently, we have

$$\begin{aligned} \frac{1}{3 |E(q)|}=\frac{1}{6n}\le p(q,q')\le \frac{2}{3n} \end{aligned}$$

for all \(q,q'\in {\mathsf {Q}}_n\) such that \(p(q,q')\ne 0\) and \(q\ne q'\).
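The two bounds can be read off directly from the definition of the transition probabilities: each pair \((e,s)\) with \(q'=q^{e,s}\) is selected with probability \(\frac{1}{2n}\cdot \frac{1}{3}\), there is at least one such pair whenever \(p(q,q')\ne 0\) and \(q\ne q'\), and there are at most four, so that

$$\begin{aligned} \frac{1}{6n}=1\cdot \frac{1}{2n}\cdot \frac{1}{3}\le p(q,q')\le 4\cdot \frac{1}{2n}\cdot \frac{1}{3}=\frac{2}{3n}. \end{aligned}$$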

Notice that, given \(q\in {\mathsf {Q}}_n\), \(e\in E(q)\), \(s\in \{+,-\}\), we can naturally identify vertices of q with vertices of \(q^{e,s}\), and edges of q with edges of \(q^{e,s}\) (where the edge e corresponds to the edge redrawn by the flip procedure in \(q^{e,s}\)); we will therefore often refer to vertices or edges using the same notation in q and \(q^{e,s}\), when we wish to implicitly exploit such a correspondence. This, of course, will need to be done with some care, since the correspondence is not necessarily unique when the quadrangulations q and \(q^{e,s}\) are given, but e and s are not known.

Remark 2.1

The Markov chain \({\mathcal {F}}^n\) is reversible and aperiodic: indeed, we have \(q'=q^{e,+}\) if and only if \(q=(q')^{e,-}\), so \(p(q,q')=p(q',q)\); furthermore, we have trivially that \(p(q,q)\ge \frac{1}{3}\).

Lemma 2.1

The Markov chain \({\mathcal {F}}^n\) is irreducible.

Proof

Let \(q_0\) be the quadrangulation with n degenerate faces and such that the origin has the maximum possible degree (that is 2n, all edges being incident to it)—see Fig. 3. We show that, given any quadrangulation \(q\in {\mathsf {Q}}_n\), one can obtain \(q_0\) from q with a sequence of edge flips.

Indeed, given any quadrangulation q, unless the degree of the origin \(\rho \) is 2n, one can increase it via an edge flip. Suppose not all edges have \(\rho \) as an endpoint and let v be a neighbour of \(\rho \) that has at least one neighbour different from \(\rho \); then, if you consider edges issued from v in clockwise order around v, there must be an edge e with second endpoint \(w\ne \rho \), followed by one with endpoints v and \(\rho \). Remark that \(q^{e,-}\) has an origin with degree increased by one with respect to q.

We may therefore suppose that q is a quadrangulation in \({\mathsf {Q}}_n\) whose origin \(\rho \) has degree 2n. Let v be the second endpoint of the root edge in q; we will show that, unless \(\deg v=n\), there is an edge flip of q increasing the degree of v and not decreasing the degree of \(\rho \). Indeed, notice that flipping any edge which is not a double edge inside a degenerate face does not change the degree of \(\rho \); this is because every quadrangulation is bipartite, and in particular the bipartition of q’s vertices has one class consisting of \(\rho \) only, and one consisting of \(V(q){\setminus }\{\rho \}\). The bipartition can be changed only by flips of degenerate edges, so any other flip will transform an edge having \(\rho \) as an endpoint to another edge having \(\rho \) as an endpoint. Consider now all edges adjacent to v; if v has strictly fewer than n adjacent edges, then it must be part of a face that is not degenerate (if it is only adjacent to degenerate faces, then the fact that all edges have \(\rho \) as an endpoint implies \(q=q_0\)). Consider any edge e of such a face not having v as an endpoint: then either \(q^{e,-}\) or \(q^{e,+}\) has the degree of v increased by one, and the degree of \(\rho \) unchanged.

Now, if \(q\in {\mathsf {Q}}_n\) has root edge \((\rho ,v)\) with \(\deg \rho =2n\) and \(\deg v=n\), then \(q=q_0\), as desired. Then reversibility (cf. Remark 2.1) implies that \({\mathcal {F}}^n\) is irreducible. \(\square \)

Fig. 3

The quadrangulation \(q_0\) in \({\mathsf {Q}}_6\), with 6 degenerate faces arranged so that the degree of the origin is 12; to the right, the pointed version from Lemma 2.2

As a consequence of Lemma 2.1 and Remark 2.1, \({\mathcal {F}}^n\) admits the uniform measure on \({\mathsf {Q}}_n\) as its (unique) stationary distribution.
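Indeed, stationarity of the uniform measure is a direct consequence of the symmetry \(p(q,q')=p(q',q)\) noted in Remark 2.1, while uniqueness follows from irreducibility; the one-line verification reads

$$\begin{aligned} \sum _{q\in {\mathsf {Q}}_n}\frac{1}{|{\mathsf {Q}}_n|}\,p(q,q')=\frac{1}{|{\mathsf {Q}}_n|}\sum _{q\in {\mathsf {Q}}_n}p(q',q)=\frac{1}{|{\mathsf {Q}}_n|}\qquad \text {for all } q'\in {\mathsf {Q}}_n. \end{aligned}$$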

We will see later how, rather than the set \({\mathsf {Q}}_n\), it will be convenient to consider the set \({\mathsf {Q}}_n^\bullet \) of all pointed quadrangulations with n faces, that is the set \(\{(q,v) : q\in {\mathsf {Q}}_n, v\in V(q)\}\). The Markov chain \({\mathcal {F}}^n\) can be easily extended to a Markov chain \({\mathcal {F}}^{\bullet ,n}\) with state space \({\mathsf {Q}}_n^\bullet \), by redefining the (clockwise and counterclockwise) flips so that the distinguished vertex is preserved, thanks to the natural identification between \(V(q^{e,s})\) and V(q). Notice that, if \(F:{\mathsf {Q}}_n^\bullet \rightarrow {\mathsf {Q}}_n\) is the forgetful map that rids quadrangulations of the pointing, for a quadrangulation \(q_\bullet \) in \({\mathsf {Q}}_n^\bullet \) we have \(F(q_\bullet ^{e,s})=F(q_\bullet )^{e,s}\), where we are treating e both as an edge of \(q_\bullet \) and as an edge of \(F(q_\bullet )\), since F does induce a natural identification for both vertices and edges.

Reversibility is of course still true, but one has to go a little further to prove irreducibility of \({\mathcal {F}}^{\bullet ,n}\).

Lemma 2.2

Let \(q_0\in {\mathsf {Q}}_n^{\bullet }\) be the quadrangulation with n degenerate faces, rooted in an oriented edge \((\rho ,v)\) such that \(\rho \) has degree 2n and v has degree n, pointed in \(\rho \) (see Fig. 3). Then any quadrangulation \(q\in {\mathsf {Q}}_n^\bullet \) can be turned into \(q_0\) with a sequence of flips. In particular, \({\mathcal {F}}^{\bullet ,n}\) is irreducible.

Proof

Turning q into \(q_0\) can be done with a very similar procedure to Lemma 2.1. First, if \(\delta \) is the one distinguished vertex of q, one can apply flips until one obtains a quadrangulation \(q'\), similar to \(q_0\) but where \(\delta \) has degree 2n. If \(\delta \) turns out to be the origin, then we are done. Otherwise, if the root edge e of \(q'\) is not a double edge within a degenerate face, all we need to do is reverse its orientation by taking \((((q')^{e,+})^{e,+})^{e,+}\): this will make \(\delta \) the origin and preserve its degree \(\deg \delta =2n\). However, if e is a double edge of \(q'\), one only needs to flip clockwise the edge \(e'\) that comes before e in the clockwise contour of the degenerate face containing e. Then one can flip e clockwise three times, then \(e'\) counterclockwise, to have \(\delta \) as the origin and preserve its degree. Note that, at this point, the root edge is a double edge. We can then proceed as in Lemma 2.1 to increase the degree of the second endpoint \(v\ne \delta \) of the root edge until it is n. Notice that this only entails flipping edges that do not already have v as an endpoint, so the root edge will not be flipped and the final quadrangulation will be correctly rooted in an edge issued from \(\delta \). \(\square \)

As a consequence of the lemma above, the stationary distribution for \({\mathcal {F}}^{\bullet ,n}\) is the uniform measure on \({\mathsf {Q}}_n^\bullet \).

Our aim in this paper will be to prove upper and lower bounds for the spectral gap \(\nu _n\) of the Markov chain \({\mathcal {F}}^n\); we will rely on the Markov chain \({\mathcal {F}}^{\bullet ,n}\) for the known bijections available between the set \({\mathsf {Q}}_n^{\bullet }\) and certain sets of labelled trees, which we will briefly discuss in the next section. Dealing with \({\mathcal {F}}^{\bullet ,n}\) will still provide information about \({\mathcal {F}}^n\): any lower bound for \(\nu _n^\bullet \) will serve as a lower bound for \(\nu _n\), as per the following lemma.

Lemma 2.3

For the spectral gap \(\nu _n^\bullet \) of \({\mathcal {F}}^{\bullet ,n}\) and the spectral gap \(\nu _n\) of \({\mathcal {F}}^n\), we have \(\nu _n^\bullet \le \nu _n\).

Proof

The proof is quite immediate, since we can write

$$\begin{aligned} \nu _n={\mathcal {E}}_{{\mathcal {F}}^n}(f,f)=\frac{1}{2}\sum _{\begin{array}{c} q\in {\mathsf {Q}}_n\\ e\in E(q)\\ s\in \{+,-\} \end{array}}(f(q)-f(q^{e,s}))^2\frac{1}{|{\mathsf {Q}}_n|6n} \end{aligned}$$

for some function \(f:{\mathsf {Q}}_n\rightarrow {\mathbb {R}}\) such that \({\mathbb {E}}_\pi (f)=0\) and \({\mathbb {V}}_\pi (f)=1\) (where \(\pi \) is the uniform measure on \({\mathsf {Q}}_n\)).

Now, setting \(F:{\mathsf {Q}}^\bullet _n\rightarrow {\mathsf {Q}}_n\) to be the forgetful function which rids a quadrangulation of its distinguished vertex, consider the function \(f\circ F:{\mathsf {Q}}^\bullet _n\rightarrow {\mathbb {R}}\). Since each element of \({\mathsf {Q}}_n\) has exactly \(n+2\) preimages under F, we have \({\mathbb {E}}_{\pi ^\bullet }(f\circ F)=0\) and \({\mathbb {V}}_{\pi ^{\bullet }}(f\circ F)=1\), where \(\pi ^\bullet \) is the uniform measure on \({\mathsf {Q}}_n^\bullet \). For the spectral gap \(\nu _n^\bullet \) of \({\mathcal {F}}^{\bullet ,n}\), we have

$$\begin{aligned} \nu ^\bullet _n&\le {\mathcal {E}}_{{\mathcal {F}}^{\bullet ,n}}(f\circ F,f\circ F)=\frac{1}{2}\sum _{\begin{array}{c} q\in {\mathsf {Q}}_n^\bullet \\ e\in E(q)\\ s\in \{+,-\} \end{array}}\left( f(F(q))-f(F(q^{e,s}))\right) ^2\cdot \frac{1}{(n+2)|{\mathsf {Q}}_n|}\cdot \frac{1}{6n} \\ &=\frac{1}{2}\sum _{\begin{array}{c} q\in {\mathsf {Q}}_n^\bullet \\ e\in E(F(q))\\ s\in \{+,-\} \end{array}}\left( f(F(q))-f(F(q)^{e,s})\right) ^2\cdot \frac{1}{(n+2)|{\mathsf {Q}}_n|}\cdot \frac{1}{6n}={\mathcal {E}}_{{\mathcal {F}}^n}(f,f)=\nu _n, \end{aligned}$$

as claimed. \(\square \)

3 The Schaeffer bijection

Fig. 4

A plane tree with 9 edges, whose 18 corners are labelled according to their order in the clockwise contour; the tree is rooted in the marked oriented edge, or equivalently has the corner labelled 1 as a distinguished corner. Leaves are marked in green and are defined as vertices other than the origin having degree 1, i.e. only one corner (color figure online)

In order to obtain lower bounds for the spectral gap of \({\mathcal {F}}^n\), we will find it convenient to compare it to the spectral gap of a certain Markov chain on the state space of (labelled) trees. A key ingredient to set up this comparison will be a well-known bijection often referred to as the Schaeffer correspondence [11, 13].

Although this bijection and its variants have been described in a number of papers, we shall still give a very brief presentation of the construction of labelled trees from rooted, pointed quadrangulations and vice-versa, since part of it will be heavily relied upon in the rest of the paper.

Definition 3.1

A plane tree is a rooted planar map with a single face.

We will often find it convenient to see the root of a plane tree as a distinguished corner rather than a distinguished oriented edge; in what follows, we shall refer to the clockwise contour of a tree (see Fig. 4) as the cyclic sequence \((c_i)_{i=1}^{2n}\) of its corners (where n is the number of its edges); we number the corners in such a way that \(c_1\) is the root corner, that is the corner of the origin lying immediately to the left of the root edge. Given a vertex v of a plane tree other than the origin, we shall write p(v) for its parent; notice that each edge of a tree may be uniquely written in the form \((v,p(v))\), where v is a vertex of the tree other than the origin. Vertices of degree 1, with the exception of the origin, will be called leaves.
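To make the notion of contour concrete, here is a minimal Python sketch that lists, for each corner \(c_1,\ldots ,c_{2n}\) in contour order, the vertex it is adjacent to. The encoding is an assumption made for illustration only: a tree is a pair `(name, children)`, with the children of each vertex listed in the order in which the contour meets them, starting from the root corner; the tree is assumed to have at least one edge, and the function name `contour` is hypothetical.

```python
def contour(tree):
    """Vertices adjacent to the corners c_1, ..., c_{2n} of a plane tree.

    `tree` is a nested pair (name, [subtrees]); the tree must have at least
    one edge. A vertex of degree d appears exactly d times in the output.
    """
    corners = []

    def visit(node, is_root):
        name, children = node
        corners.append(name)               # corner met when arriving at the vertex
        for idx, child in enumerate(children):
            visit(child, False)
            # Coming back from a child opens a new corner, except when the
            # contour returns to the origin for the last time: that corner
            # is c_1 again, since the sequence is cyclic.
            if not (is_root and idx == len(children) - 1):
                corners.append(name)

    visit(tree, True)
    return corners


# A tree with 3 edges: origin r with children a and c, where a has one child b.
example = ("r", [("a", [("b", [])]), ("c", [])])
assert contour(example) == ["r", "a", "b", "a", "r", "c"]
```

In the example, the leaves b and c each appear once, in agreement with the fact that a leaf has a single corner.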

We shall call \({\mathsf {T}}_n\) the set of all plane trees with n edges; trees with zero edges do not conform to the definition above, but we will still find it convenient to define \({\mathsf {T}}_0\) to be \(\{\bullet \}\) by convention, where “\(\bullet \)” is the graph with one vertex and no edges.

Definition 3.2

A labelled tree is a plane tree t endowed with a labelling \(l:V(t)\rightarrow {\mathbb {Z}}\) such that

  • if \(\rho \) is the origin of t, \(l(\rho )=0\);

  • for any vertex \(v\in V(t){\setminus }\{\rho \}\), \(l(v)-l(p(v))\in \{1,-1,0\}\).

We shall call \(\mathsf {LT}_n\) the set of all labelled trees with n edges, and set \(\mathsf {LT}_0=\{\bullet \}\).

Notice that, equivalently, a labelled tree could simply be presented as a plane tree whose edges are three-coloured (the colours being \(\{1,-1,0\}\)); if c(e) is the colour of the edge e, labels of vertices could be recovered by setting \(l(v)=\sum _{e\in P(v)}c(e)\), where P(v) is the one simple path leading from v to the origin (or the empty path if v is the origin itself). Throughout the paper, we will use both points of view; it will therefore be useful to introduce a more general notation for plane trees whose edges are r-coloured (r being some fixed positive integer); we will write \({\mathsf {T}}^{(r)}_n\) for the set

$$\begin{aligned} \left\{ (t,C): t\in {\mathsf {T}}_n, C\in \{1,\ldots ,r\}^{E(t)}\right\} . \end{aligned}$$

For convenience, we will often refer to a labelled or r-coloured tree with a single symbol such as t, and consider the labelling or colouring to be implicit; in the case of labelled trees, we will usually call the labelling l without further comment, and sometimes naturally extend it to corners, thus writing l(c) when we mean l(v), where v is the vertex of t that c is adjacent to.
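The passage from edge colours to vertex labels described above can be sketched as follows. The encoding is hypothetical and chosen only for illustration: `parent[v]` is the parent p(v) of a non-origin vertex v and `colour[v]` is the colour in \(\{1,-1,0\}\) of the edge \((v,p(v))\); the label of v is then the sum of the colours along the path from v to the origin, computed here through the equivalent recursion \(l(v)=l(p(v))+c((v,p(v)))\).

```python
def labels_from_colours(parent, colour, origin):
    """Recover the vertex labelling of a labelled tree from its edge colours.

    parent: dict mapping each non-origin vertex to its parent
    colour: dict mapping each non-origin vertex v to the colour of (v, p(v))
    origin: the origin of the tree, which receives label 0
    """
    labels = {origin: 0}

    def label(v):
        if v not in labels:
            # sum of colours on the path to the origin, computed recursively
            labels[v] = label(parent[v]) + colour[v]
        return labels[v]

    for v in parent:
        label(v)
    return labels


parent = {"a": "r", "b": "a", "c": "r"}
colour = {"a": 1, "b": -1, "c": 0}
assert labels_from_colours(parent, colour, "r") == {"r": 0, "a": 1, "b": 0, "c": 0}
```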

The reason for our definition of \(\mathsf {LT}_n\) is the fact that the sets \({\mathsf {Q}}_n^\bullet \) and \(\mathsf {LT}_n\times \{-1,1\}\) have the same cardinality; moreover, pointed quadrangulations can be interpreted as pairs \((t,\varepsilon )\), where t is a labelled tree and \(\varepsilon \in \{-1,1\}\), in a rather natural way. As promised, we give here a description of how to construct an element of \({\mathsf {Q}}_n^\bullet \) from an element of \(\mathsf {LT}_n\) and a sign \(\varepsilon \in \{-1,1\}\) via the (unconstrained) Schaeffer correspondence; we include a brief description of the inverse construction for completeness and clarity, but this will not be explicitly used in the proofs to come.
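The equality of cardinalities can be checked numerically for small n. The sketch below relies on two facts that are not stated in the text above and are taken here as assumptions: the classical closed formula \(\frac{2\cdot 3^n}{(n+1)(n+2)}{2n\atopwithdelims ()n}\) for the number of rooted quadrangulations with n faces, and the fact that plane trees with n edges are counted by the Catalan number \(C_n\) (so that there are \(3^nC_n\) labelled trees). The function names are illustrative.

```python
from math import comb


def catalan(n):
    return comb(2 * n, n) // (n + 1)


def count_rooted_quadrangulations(n):
    # classical closed formula, assumed here rather than derived
    return 2 * 3**n * comb(2 * n, n) // ((n + 1) * (n + 2))


def count_labelled_trees(n):
    # one of three colours on each of the n edges of a plane tree
    return 3**n * catalan(n)


for n in range(1, 10):
    # each quadrangulation with n faces has n + 2 vertices to point at
    pointed = (n + 2) * count_rooted_quadrangulations(n)
    assert pointed == 2 * count_labelled_trees(n)
```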

Fig. 5

The (unconstrained) Schaeffer bijection. On the left, the map from the labelled tree (in red) to the pointed quadrangulation (in black); the distinguished vertex is marked in red, and the numbers represent the labels on the tree. On the right, the map from the pointed quadrangulation (in black) to the labelled tree (in red); the distinguished vertex is marked in red, and numbers represent distances to the distinguished vertex in the quadrangulation. The two quadrangulations above are the same, although the unbounded face in the embedding on the right corresponds to the white inner face on the left (color figure online)

Construction of a mapping \(\phi \) from \(\mathsf {LT}_n\times \{-1,1\}\) to \({\mathsf {Q}}_n^\bullet \) (\(\phi :(\tau ,\varepsilon )\mapsto q\))

  • consider the clockwise (cyclic) contour \((c_i)_{i=1}^{2n}\) of \(\tau \), started at the distinguished corner, and let \(\ell \) be the minimal label appearing on vertices of \(\tau \);

  • for each corner \(c_i\) labelled at least \(\ell +1\), set \(k=\min \{j>0: l(c_{i+j})=l(c_i)-1\}\); join \(c_i\) to \(c_{i+k}\) with an edge (so that edges being drawn do not cross, see Fig. 5);

  • draw a new vertex \(\delta \) within the unbounded face of the tree and join each corner labelled \(\ell \) to \(\delta \) with a new edge (again, so as not to cross any previously drawn edges);

  • root the map thus obtained in the newly drawn edge issued from the distinguished corner of \(\tau \), oriented away from the origin of \(\tau \) if \(\varepsilon =-1\), towards it if \(\varepsilon =1\); make \(\delta \) the distinguished vertex;

  • erase all edges of \(\tau \) and forget all labels.
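The combinatorial core of the construction above (the second and third steps) can be sketched in Python as follows. The input is assumed to be the list of labels \(l(c_1),\ldots ,l(c_{2n})\) read along the clockwise contour of a valid labelled tree, with indices shifted to start at 0; the output records, for each corner, the index of the corner it is joined to, or `None` when it is joined to the new vertex \(\delta \). The planar embedding and the rooting are not modelled, and the name `schaeffer_targets` is hypothetical.

```python
def schaeffer_targets(corner_labels):
    """For each corner, the corner it is joined to (None stands for delta)."""
    m = len(corner_labels)
    lowest = min(corner_labels)          # the minimal label, called ell in the text
    targets = []
    for i, lab in enumerate(corner_labels):
        if lab == lowest:
            targets.append(None)         # corners labelled ell are joined to delta
            continue
        # first later corner in the cyclic contour whose label is exactly lab - 1;
        # it exists because labels of consecutive corners differ by at most 1
        j = 1
        while corner_labels[(i + j) % m] != lab - 1:
            j += 1
        targets.append((i + j) % m)
    return targets


# Path with two edges and vertex labels 0, 1, 0: the contour reads 0, 1, 0, 1.
assert schaeffer_targets([0, 1, 0, 1]) == [None, 2, None, 0]
```

Each corner gives rise to exactly one new edge, so the resulting map has 2n edges, as a quadrangulation with n faces should.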

Construction of a mapping \(\phi ^{-1}\) from \({\mathsf {Q}}_n^\bullet \) to \(\mathsf {LT}_n\times \{-1,1\}\) (\(\phi ^{-1}:q\mapsto (\tau ,\varepsilon )\))

  • label all vertices in q with their graph distance to the distinguished vertex \(\delta \), thus defining a labelling \(l:V(q)\rightarrow {\mathbb {N}}\); for each face of q, read the labels of the vertices adjacent to its four corners cyclically according to a clockwise contour. Given two successive corners \(c_i\) and \(c_{i+1}\) in a clockwise contour of a face f, we say \(c_i\) is a down-step corner of f if the label of \(c_{i+1}\) is strictly smaller than that of \(c_i\) (notice that, since the map is bipartite, the label of \(c_{i+1}\) is either one more or one less than that of \(c_i\), hence each face has exactly 2 down-step corners);

  • draw a new edge within each face of q, joining its two down-step corners;

  • consider the root edge \((e_{-},e_+)\) of q, and let \(f_l\) and \(f_r\) be the faces lying left and right of \((e_{-},e_+)\) respectively (of course, the two may coincide); if \(l(e_-)<l(e_+)\), set \(\varepsilon =1\) and choose as new root the edge being drawn between a corner adjacent to \(e_+\) and the other down-step corner of \(f_l\), oriented away from \(e_+\); if \(l(e_-)>l(e_+)\), set \(\varepsilon =-1\) and root in the edge drawn between a corner of \(e_-\) and a down-step corner of \(f_r\), oriented away from \(e_-\);

  • subtract \(l(e_-)\) (if \(\varepsilon =-1\)) or \(l(e_+)\) (if \(\varepsilon =1\)) from all labels: this way the label of the new origin is 0;

  • erase all original edges and the distinguished vertex \(\delta \).
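As a small illustration of the first step of the inverse construction, the following Python sketch finds the down-step corners of a single face, given the labels of its four corners read along a clockwise contour (indices start at 0; the function name is hypothetical). In agreement with the remark above, it returns exactly two corners for any admissible labelling of a face.

```python
def down_step_corners(face_labels):
    """Indices i such that the corner after c_i (cyclically) has a smaller label."""
    k = len(face_labels)
    return [i for i in range(k) if face_labels[(i + 1) % k] < face_labels[i]]


# The two possible label patterns around a face, up to rotation:
assert down_step_corners([0, 1, 2, 1]) == [2, 3]
assert down_step_corners([0, 1, 0, 1]) == [1, 3]
```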

Theorem 2

(Schaeffer correspondence) The construction \(\phi :\mathsf {LT}_n\times \{-1,1\}\rightarrow {\mathsf {Q}}_n^\bullet \) described above is a bijection and \(\phi ^{-1}\) is its inverse; given \(t\in \mathsf {LT}_n\) and \(\varepsilon \in \{-1,1\}\), the mapping \(\phi \) naturally induces an identification between vertices of t and vertices of \(\phi (t,\varepsilon )\) such that, if l is the labelling of t, we have \(l(v)=d_{\mathrm {gr}}(v,\delta )-d_{\mathrm {gr}}(\delta ,\rho )\), where v is interpreted as a vertex of t in the left hand side of the equation and as a vertex of \(\phi (t,\varepsilon )\) in the right hand side, \(\rho \) is the origin of \(\phi (t,\varepsilon )\) and \(\delta \) its distinguished vertex.

4 An upper bound for the spectral gap of \({\mathcal {F}}^n\)

We first prove our upper bound for the spectral gap of \({\mathcal {F}}^n\), which is obtained by evaluating the Dirichlet form for \({\mathcal {F}}^n\) at a function related to the radius of a quadrangulation. The same bound arises from many other natural functions relating to the metric structure of quadrangulations, constructed from graph distances, volumes of balls, lengths of separating cycles, etc.

Note that our proof will essentially rely on the fact that edge flips change distances by at most a constant and that the scaling limit of the radius of random quadrangulations is a known random variable (i.e. the radius of the Brownian map). The same upper bound would thus extend to analogous edge flip chains for other classes of random planar maps which converge to the Brownian map when rescaled by \(n^{1/4}\); in particular, it implies the lower bound given by Budzinski in [6] for the mixing time of random triangulations.

Proposition 4.1

For the spectral gap \(\nu _n\) of the Markov chain \({\mathcal {F}}^n\) of flips on quadrangulations of size n we have

$$\begin{aligned} \nu _n\le Cn^{-\frac{5}{4}}, \end{aligned}$$

where C is some positive constant independent of n.

Proof

Let \({\text {r}}:{\mathsf {Q}}_n\rightarrow {\mathbb {N}}\) be the mapping sending a quadrangulation q to its radius, that is the maximum possible distance of a vertex of q to the origin. Consider the function \(f_n:{\mathsf {Q}}_n\rightarrow {\mathbb {R}}\) defined as

$$\begin{aligned} f_n(q)=\frac{{\text {r}}(q)}{n^{\frac{1}{4}}}. \end{aligned}$$

Notice that, given \(q\in {\mathsf {Q}}_n\), \(e\in E(q)\), \(v\in V(q)\) and \(s\in \{+,-\}\), we have

$$\begin{aligned} \left| d_{\mathrm {gr}}^q(v,\rho _q)-d_{\mathrm {gr}}^{q^{e,s}}(v,\rho _{q^{e,s}})\right| \le 3, \end{aligned}\tag{1}$$

where \(d_{\mathrm {gr}}^q(v,\rho )\) is the distance of v to the origin of q and \(d_{\mathrm {gr}}^{q^{e,s}}(v,\rho _{q^{e,s}})\) is the distance to the origin of \(q^{e,s}\) of the vertex that corresponds to v via the natural identification induced by flipping the edge e. Indeed, removing e can only increase the distance of v to \(\rho \) by at most 2, while reintroducing a rotated edge can only decrease it by at most 2; if e is not the root edge of q, then \(\rho \) is still the origin in \(q^{e,s}\); otherwise, \(\rho _{q^{e,s}}\) is a vertex adjacent to the previous origin \(\rho \). As a consequence, we have \(|{\text {r}}(q)-{\text {r}}(q^{e,s})|\le 3\).

Let us now evaluate the Dirichlet form \({\mathcal {E}}_{{\mathcal {F}}^n}(f_n,f_n)\); we have

$$\begin{aligned} {\mathcal {E}}_{{\mathcal {F}}^n}(f_n,f_n)= & {} \frac{1}{2}\sum _{\begin{array}{c} q\in {\mathsf {Q}}_n\\ e\in E(q)\\ s\in \{+,-\} \end{array}}(f_n(q)-f_n(q^{e,s}))^2\frac{1}{6n|{\mathsf {Q}}_n|}\\= & {} \sum _{\begin{array}{c} q\in {\mathsf {Q}}_n\\ e\in E(q),\ s\in \{+,-\}\\ f_n(q)<f_n(q^{e,s}) \end{array}}(f_n(q)-f_n(q^{e,s}))^2\frac{1}{6n|{\mathsf {Q}}_n|}, \end{aligned}$$

and therefore

$$\begin{aligned} {\mathcal {E}}_{{\mathcal {F}}^n}(f_n,f_n)\le \frac{3}{2n^{\frac{1}{2}+1}}\sum _{\begin{array}{c} q\in {\mathsf {Q}}_n\\ e\in E(q),\ s\in \{+,-\}\\ f_n(q)<f_n(q^{e,s}) \end{array}}\frac{1}{|{\mathsf {Q}}_n|}\le \frac{3}{2n^{\frac{5}{4}}}{\mathbb {E}}_\pi (n^{-\frac{1}{4}}2X), \end{aligned}$$

where \(X:{\mathsf {Q}}_n\rightarrow {\mathbb {N}}\) maps q to the number of edges e in E(q) such that \({\text {r}}(q)<{\text {r}}(q^{e,s})\) for some \(s\in \{+,-\}\), and \(\pi \) is the uniform probability measure on \({\mathsf {Q}}_n\).

We intend to show that \({\mathbb {E}}_\pi (n^{-\frac{1}{4}}X)\) is bounded above by a constant independent of n.

Given \(q\in {\mathsf {Q}}_n\), consider the set \(S(q)=(B_{{\text {r}}(q)-2})^c\) of all vertices v of q such that \(d_{\mathrm {gr}}(v,\rho )\ge {\text {r}}(q)-1\), where \(\rho \) is the origin of q. Also, for each v in S(q), consider a simple path \(P_v\) in q with endpoints \(\rho \) and v and length \(d_{\mathrm {gr}}(\rho , v)\).

Flipping an edge e that is not the root of q and does not belong to \(\bigcup _{v\in S(q)}P_v\) cannot increase the radius of the quadrangulation; in fact, since all paths \(P_v\) and the origin are preserved, the distance to the root of vertices in S(q) cannot increase, and the distance to the root of any vertex outside of S(q) becomes at most \({\text {r}}(q)-2+2={\text {r}}(q)\). We thus have, for all \(q\in {\mathsf {Q}}_n\), \(X(q)\le |S(q)|{\text {r}}(q)+1\).

We shall now show that the law of |S(q)| under \(\pi \) is the same as the law of \(|B_2(q)|-1\) under \(\pi \), where \(|B_2(q)|\) is the number of vertices having distance at most 2 from the origin in q.

Fig. 6: Left and right leaf translation

This can be shown by a rerooting argument on the labelled tree \(\phi ^{-1}(P(q))\), where P(q) is the quadrangulation q, pointed in its origin \(\rho \). The mapping \(\phi ^{-1}\circ P\) is a well known variant of the Schaeffer construction, and is a bijection between the set \({\mathsf {Q}}_n\) and the set \(\mathsf {LT}_n^+\times \{1\}\), where \(\mathsf {LT}_n^+\) is the set of all labelled trees with n edges such that no negative labels appear on them. The quantity |S(q)| represents the number of vertices labelled \({\text {r}}(q)-1\) or \({\text {r}}(q)-2\) (\({\text {r}}(q)-1\) being the maximum label appearing on vertices of \(\phi ^{-1}(P(q))\)).

Consider the following bijection of the set \(\mathsf {LT}^+_n\) to itself: given a tree \(t\in \mathsf {LT}^+_n\) and, among the corners of vertices bearing the maximum appearing label (say, \(M=\max _vl(v)\)), consider the leftmost one according to the clockwise contour; reroot t in that corner and relabel each vertex v of t with the label \(M-l(v)\). This correspondence is clearly a bijection, since its inverse consists in rerooting a tree from \(\mathsf {LT}^+_n\) in its rightmost corner labelled with its maximum label and performing the same relabelling as the one described.

Looking at vertices labelled M or \(M-1\), the above correspondence relabels them with 0 or 1; since \(\pi \) corresponds to the uniform measure on \(\mathsf {LT}_n^+\), the law of |S(q)| under \(\pi \) is the same as the law of the number of vertices at distance 1 or 2 from the origin in a uniform random quadrangulation from \({\mathsf {Q}}_n\), that is the law of \(|B_2(q)|-1\) under \(\pi \), which has exponential tails (see for example the proof of Proposition 9 in [4]).

Thanks to the Cauchy–Schwarz inequality, we can write

$$\begin{aligned} {\mathcal {E}}(f_n,f_n)\le & {} \frac{3}{n^{\frac{5}{4}}}\left( {\mathbb {E}}_\pi (|S(q)|^2){\mathbb {E}}_\pi (f_n(q)^2)\right) ^{1/2}+\frac{3}{n^{\frac{3}{2}}}\\\le & {} \frac{C'}{n^{\frac{5}{4}}}\left( {\mathbb {E}}_\pi (|B_2(q)|^2){\mathbb {E}}_\pi (f_n(q)^2)\right) ^{1/2}. \end{aligned}$$

But now \({\mathbb {E}}_\pi (|B_2(q)|^2)\) has a finite limit as \(n\rightarrow \infty \). Furthermore, the random variable \(f_n(q)\) (considered under \(\pi \)) converges weakly to the range of a Brownian snake driven by a Brownian excursion, whose variance is positive, and all of its moments converge (see [11, Corollary 3]); thus the right hand side of

$$\begin{aligned} n^{\frac{5}{4}}{\nu _n}\le C'\frac{\left( {\mathbb {E}}_\pi (|B_2(q)|^2){\mathbb {E}}_\pi (f_n(q)^2)\right) ^{1/2}}{{\mathbb {V}}_\pi [f_n(q)]} \end{aligned}$$

is bounded by a constant independent of n, which proves the proposition. \(\square \)

Remark 4.1

Note that the above proof essentially relies on the fact that each edge flip changes the radius of the quadrangulation by a constant, as in (1), and that \(n^{-\frac{1}{4}}X\) converges to a non-trivial random variable (for which we needed both the convergence properties of the radius in the scaling limit and some kind of control over the quantity S(q)). The proof above then yields a lower bound of \(n^{\frac{5}{4}}\) on the mixing time of the edge flip chains on any p-angulations provided the above two properties hold.

5 A Markov chain on labelled trees

Our main results about the Markov chain \({\mathcal {F}}^n\) will be achieved via a comparison to a very natural Markov chain on labelled plane trees, which we will introduce presently.

Given a plane tree \(t\in {\mathsf {T}}_n\) with contour \(c_1,\ldots ,c_{2n}\) and a leaf \(v\in V(t)\), suppose the corner of v is \(c_l\) (i.e. \(c_l\) is the one corner adjacent to the vertex v), with \(l<2n\); build a new tree \(t^{v,\rightarrow }\in {\mathsf {T}}_n\) as follows: draw a new leaf \(v'\) attached to the tree via the corner \(c_{l+2}\) (the corner \(c_1\) if \(l=2n-1\)); then erase v; if (p(v), v) is the root edge of t (i.e. if \(l=2\)), then root \(t^{v,\rightarrow }\) in \((p(p(v')),p(v'))\). Notice that, given a pair of trees \(t,t'\in {\mathsf {T}}_n\), there is at most one leaf v of t such that \(t'=t^{v,\rightarrow }\); this induces (when there is such a leaf) a natural identification between vertices of t and vertices of \(t'\) which sends v to the “shifted” leaf \(v'\) in \(t'\) and is a tree isomorphism between the trees \(\tau \) and \(\tau '\) obtained from t and \(t'\) by erasing v and \(v'\). This is why, given two trees of the form \(t,t^{v,x}\), we will automatically identify their vertices and denote them in the same way, including vertices v and \(v'\), thus taking “vertex v in \(t^{v,x}\)” to mean the newly drawn leaf \(v'\).

Given \(t\in {\mathsf {T}}_n\), we can define analogously a tree \(t^{v,\leftarrow }\in {\mathsf {T}}_n\) as the one tree such that \((t^{v,\leftarrow })^{v,\rightarrow }\) is t, if it exists. Additionally, we set \(t^{v,\leftarrow }=t\) if \((v,p(v))\) is the root edge of t (that is, in the one case where t is not of the form \(t'^{v,\rightarrow }\)) and \(t^{v,\rightarrow }=t\) if the corner of v is number 2n in the contour (so that now \(t^{v,\rightarrow }\) is defined for all leaves v of t).

When \(t^{v,\rightarrow }\ne t\), we say that the tree \(t^{v,\rightarrow }\) has been constructed from t by translating the leaf v to the right (and \(t^{v,\leftarrow }\) differs from t by a leaf translation to the left, see Fig. 6); notice that, given two trees \(t,t'\) which differ by a leaf translation, there is a unique leaf v of t and a unique direction d, either \(\rightarrow \) or \(\leftarrow \), such that \(t'\) can be expressed as \(t^{v,d}\).

One could define a Markov chain X on the set of plane trees with n edges so that, given \(X_k=t\), \(X_{k+1}\) is determined by selecting an edge \((v,p(v))\) of t uniformly at random and, if v is a leaf of t, setting \(X_{k+1}=t\) or \(X_{k+1}=t^{v,\rightarrow }\) or \(X_{k+1}=t^{v,\leftarrow }\) with equal probabilities, while \(X_{k+1}=t\) if v is not a leaf. We shall need a coloured variant of this chain, which can easily be defined on the set of plane trees with coloured edges \({\mathsf {T}}^{(r)}_n\), where \(C=\{1,\ldots ,r\}\) is the set of possible edge colours. The trees \(t^{v,\rightarrow }\) and \(t^{v,\leftarrow }\) are defined from \(t\in {\mathsf {T}}^{(r)}_n\) exactly as before, by additionally ensuring that all edge colours are preserved. We can also introduce appropriate “recolouring” moves: given \(x\in C\) and a leaf v in V(t), we set \(t^{v,x}\) to be the tree t, where the edge \((v,p(v))\) is recoloured with colour x. One can now define \(X_{k+1}\), given \(X_k=t\), by selecting an edge \((v,p(v))\) of t uniformly at random: if v is a leaf, we set \(X_{k+1}\) to be one of \(t^{v,\rightarrow },t^{v,\leftarrow },t^{v,1},t^{v,2},\ldots ,t^{v,r}\), each with probability \(\frac{1}{r+2}\); if v is not a leaf, \(X_{k+1}=t\). In other words, given \(t\ne t^{v,x}\), where v is a leaf of t and \(x\in \{1,\ldots ,r,\rightarrow ,\leftarrow \}\), the Markov chain X on \({\mathsf {T}}^{(r)}_n\) has transition probability

$$\begin{aligned} p_X(t,t^{v,x})=\frac{1}{n(r+2)}; \end{aligned}$$

if \(t,t'\) do not differ by a leaf translation or recolouring we have \(p_X(t,t')=0\), and \(p_X(t,t) \ge \frac{1}{r+2}\), since selecting a non-leaf edge leaves t unchanged and, for each leaf v, if c is the colour of the edge \((v,p(v))\) in t, we have \(t^{v,c}=t\).

Notice that the case \(r=3\) corresponds to a Markov chain on the state space \(\mathsf {LT}_n\) of labelled trees with n edges. We shall call this the leaf translation Markov chain and will be estimating its spectral gap as well as comparing it to the spectral gap of \({\mathcal {F}}^{\bullet ,n}\).
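For small parameters the chain X can be written down explicitly and its basic properties checked by brute force. The sketch below is an illustration under assumptions that are not part of the text: trees are encoded as balanced parenthesis strings together with a tuple of colours (one per opening parenthesis), a leaf is a substring `()`, and a translation moves that substring one position to the right or left. All function names are hypothetical.

```python
from fractions import Fraction
from itertools import product


def dyck_words(n):
    """All balanced parenthesis strings with n pairs (plane trees with n edges)."""
    if n == 0:
        return [""]
    return ["(" + a + ")" + b
            for k in range(n)
            for a in dyck_words(k)
            for b in dyck_words(n - 1 - k)]


def move_leaf(tree, j, pos, colour):
    """Remove the leaf whose '()' starts at index j; reinsert it at index pos."""
    word, cols = tree
    e = word[:j].count("(")            # rank of the leaf's edge among up-steps
    rest, rest_cols = word[:j] + word[j + 2:], cols[:e] + cols[e + 1:]
    e2 = rest[:pos].count("(")
    return (rest[:pos] + "()" + rest[pos:],
            rest_cols[:e2] + (colour,) + rest_cols[e2:])


def transition_matrix_X(n, r):
    states = [(w, c) for w in dyck_words(n)
              for c in product(range(1, r + 1), repeat=n)]
    index = {s: a for a, s in enumerate(states)}
    P = [[Fraction(0)] * len(states) for _ in states]
    step = Fraction(1, n * (r + 2))
    for s in states:
        word, cols = s
        leaves = [j for j in range(2 * n - 1) if word[j:j + 2] == "()"]
        # Selecting a non-leaf edge leaves the tree unchanged.
        P[index[s]][index[s]] += Fraction(n - len(leaves), n)
        for j in leaves:
            col = cols[word[:j].count("(")]
            right = move_leaf(s, j, j + 1, col) if j < 2 * n - 2 else s
            left = move_leaf(s, j, j - 1, col) if j > 0 else s
            recoloured = [move_leaf(s, j, j, x) for x in range(1, r + 1)]
            # Entries accumulate: in this encoding two moves may reach the same tree.
            for t in [right, left] + recoloured:
                P[index[s]][index[t]] += step
    return states, P


def is_irreducible(P):
    """Depth-first search along positive transitions, starting from state 0."""
    seen, stack = {0}, [0]
    while stack:
        a = stack.pop()
        for b, p in enumerate(P[a]):
            if p > 0 and b not in seen:
                seen.add(b)
                stack.append(b)
    return len(seen) == len(P)


states, P = transition_matrix_X(n=3, r=2)
print(len(states))                                   # 40
print(all(sum(row) == 1 for row in P))               # True
print(min(P[a][a] for a in range(len(P))) >= Fraction(1, 4))
```

Exact rational arithmetic is used so that row sums and the bound on the diagonal can be compared without rounding.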

Fig. 7: Some of the natural correspondences between the set \({\mathsf {T}}_n\) of plane trees with n edges and other Catalan structures result in interesting alternative interpretations of the leaf translation Markov chain. For example, given a Dyck path \(D:\{0,\ldots ,2n\}\rightarrow {\mathbb {N}}\) of length 2n, say that \(i\in \{1,\ldots ,2n-1\}\) is an upward point for D if \(D(i)=D(i-1)+1\) and say that i is a peak if it is a local maximum for D. Then, the leaf translation chain corresponds to selecting an upward point i uniformly at random (there are exactly n of them), and, if i is a peak, then to either leaving it untouched or shifting it to the right (if possible) or shifting it to the left (if possible), each with probability 1/3. Shifting a peak i of D to the right, for example, can be done if \(i\le 2n-2\) and consists in constructing a Dyck path \(D'\) such that \(D'(i)=D'(i+2)=D(i+2)\), \(D'(i+1)=D(i+2)+1\) and \(D'(j)=D(j)\) for all \(j\in \{1,\ldots ,2n\}{\setminus }\{i,i+1,i+2\}\); the above picture shows an example with \(i=5\). An analogous interpretation for the leaf translations can be given on strings of balanced parentheses, where one selects an open parenthesis at random and, if it is immediately followed by a closed parenthesis, then the pair “()” may remain untouched or move one place to the right or to the left.
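The right shift of a peak described above can be checked on a small example. The following sketch applies the height formula for \(D'\) directly and compares it with the corresponding move on parentheses; the function names and the list-of-heights representation are assumptions made for this illustration.

```python
def heights(word):
    """Dyck path D(0), ..., D(2n) of a balanced parenthesis string."""
    D = [0]
    for ch in word:
        D.append(D[-1] + (1 if ch == "(" else -1))
    return D


def upward_points(D):
    """Indices i in {1, ..., 2n-1} with D(i) = D(i-1) + 1."""
    return [i for i in range(1, len(D) - 1) if D[i] == D[i - 1] + 1]


def is_peak(D, i):
    return 0 < i < len(D) - 1 and D[i - 1] < D[i] > D[i + 1]


def shift_peak_right(D, i):
    """Apply the formula for D'; return a copy of D if the shift is impossible."""
    if not is_peak(D, i):
        raise ValueError("i is not a peak")
    two_n = len(D) - 1
    if i > two_n - 2:
        return list(D)
    new = list(D)
    new[i] = D[i + 2]
    new[i + 1] = D[i + 2] + 1
    return new


def move_pair_right(word, i):
    """Same move on parentheses: the pair '()' forming the peak at i moves one place right."""
    j = i - 1                       # 0-based index of the '(' of that pair
    rest = word[:j] + word[j + 2:]
    return rest[:j + 1] + "()" + rest[j + 1:]


word = "(()())"
D = heights(word)                   # [0, 1, 2, 1, 2, 1, 0]
print(shift_peak_right(D, 2))       # [0, 1, 2, 3, 2, 1, 0]
print(move_pair_right(word, 2))     # ((()))
```

The two printed results describe the same path, since `heights("((()))")` is `[0, 1, 2, 3, 2, 1, 0]`.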

The leaf translation Markov chain on \({\mathsf {T}}^{(r)}_n\) (including the simpler variant introduced at the beginning of this section for \(r=1\)) is a very natural chain, worthy in fact of study independently of our efforts with regard to \({\mathcal {F}}^n\). We have chosen to present it as a chain on \({\mathsf {T}}^{(r)}_n\), but its transitions appear very natural for a number of different interpretations of the state space via classical bijections between Catalan structures (see Fig. 7). Indeed, variants of this chain have been discussed in the physics literature under the name of Fredkin spin models, and have been investigated by Movassagh and Shor [25, 26], relying on work by Bravyi et al. [5].

In particular, Movassagh and Shor prove a lower bound of \(Cn^{-\frac{11}{2}}\) for the spectral gap of a chain on \({\mathsf {T}}^{(r)}_n\) closely related to X, from which a bound for the spectral gap of X can be gleaned; we shall partially follow their argument for estimating the spectral gap, but will improve their results and will therefore provide a complete proof of our lower bounds in the next section. In order to do this, we will now introduce a variant of the leaf translation Markov chain which is closer to the one originally considered by Movassagh and Shor in their proofs; even though bounding its spectral gap is not strictly speaking necessary for achieving our results for X and therefore \({\mathcal {F}}^n\), we believe our improved bound to be of independent interest, and the proof—which is somewhat simpler than the one for X—to provide a handy way to more naturally introduce some of the necessary notation and showcase the basic argument.

We shall introduce this chain on the set \({\mathsf {T}}^{(r)}_n\) and refer to [25] for a presentation as a chain on the set of (coloured) Dyck paths. Given \(t\in {\mathsf {T}}^{(r)}_n\), a leaf \(v\in V(t)\), an integer \(k\in \{1,\ldots ,2n-1\}\) and a colour \(c\in \{1,\ldots ,r\}\), we define \(t^{v,k,c}\in {\mathsf {T}}^{(r)}_n\) by the following procedure (Fig. 8):

  • erase \((v,p(v))\) from t, thus obtaining \(t'\in {\mathsf {T}}^{(r)}_{n-1}\);

  • consider the clockwise contour \(c_1,\ldots ,c_{2n-2}\) of \(t'\). If \(1<k<2n-1\), add a leaf \(v'\) to \(t'\) via its corner \(c_k\); if \(k=1\) or \(k=2n-1\), add a leaf \(v'\) to \(t'\) via the root corner \(c_1\): if \(k=1\), let the new root corner be the one right before the added leaf, i.e. reroot \(t'\) so that the corner of \(v'\) becomes the second corner of the contour; if \(k=2n-1\), let the root corner be the one right after \(v'\), so that the corner of \(v'\) is the last one in the clockwise contour of the new tree;

  • colour the edge \((v',p(v'))\) with the colour c.

Fig. 8: The leaf replanting move of a leaf v performed on a tree \(t\in {\mathsf {T}}^{(3)}_4\): above, the tree \(t' \in {\mathsf {T}}^{(3)}_3\) and its contour; below, the trees \(t^{v,k,1}\) for \(k=1,\ldots ,7\)

Notice that, if the corner of v is the k-th corner in the clockwise contour of t with \(1<k\le 2n-1\) and c is the colour of \((v,p(v))\) in t, then \(t^{v,\rightarrow }=t^{v,k,c}\). Similarly, if \(k>2\), we have \(t^{v,{k-2},c}=t^{v,\leftarrow }\), and if \(1 < k \le 2n\) we have \(t^{v,k-1,c}=t\).

We define the leaf replanting Markov chain Y on the state space \({\mathsf {T}}^{(r)}_n\) by choosing, if \(Y_m=t\), a uniformly random edge \((v,p(v))\) of t; if v is not a leaf, then we set \(Y_{m+1}=t\); if v is a leaf, we set \(Y_{m+1}\) to be \(t^{v,k,c}\), where k and c are chosen independently and uniformly at random in \(\{1,\ldots ,2n-1\}\) and \(\{1,\ldots ,r\}\) respectively.

While we were able to identify the leaf being moved between two trees that differ by a leaf translation, notice that this is not the case when we are dealing with a leaf replanting. In general, we have

$$\begin{aligned} p_Y(t,t')=\sum _{v,k,c}\frac{1}{n(2n-1)r}1_{t'=t^{v,k,c}}. \end{aligned}$$
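The formula for \(p_Y\) can be turned into an explicit transition matrix for small n and r. The sketch below does this under an encoding that is an assumption of this example rather than part of the text: a tree is a balanced parenthesis string with one colour per opening parenthesis, a leaf is a substring `()`, and replanting at corner k of \(t'\) is modelled as inserting `()` at string index \(k-1\). The diagonal also receives the mass of the non-leaf edges, as in the definition of Y. Function names are hypothetical.

```python
from fractions import Fraction
from itertools import product


def dyck_words(n):
    """All balanced parenthesis strings with n pairs."""
    if n == 0:
        return [""]
    return ["(" + a + ")" + b
            for k in range(n)
            for a in dyck_words(k)
            for b in dyck_words(n - 1 - k)]


def replant(tree, j, k, c):
    """Remove the leaf whose '()' starts at index j; replant it at corner k with colour c."""
    word, cols = tree
    e = word[:j].count("(")                  # rank of the leaf's edge among up-steps
    rest = word[:j] + word[j + 2:]           # the tree t' with n - 1 edges
    rest_cols = cols[:e] + cols[e + 1:]
    pos = k - 1                              # corner k of t' <-> string index k - 1
    e2 = rest[:pos].count("(")
    return (rest[:pos] + "()" + rest[pos:],
            rest_cols[:e2] + (c,) + rest_cols[e2:])


def leaf_positions(word):
    return [j for j in range(len(word) - 1) if word[j:j + 2] == "()"]


def transition_matrix_Y(n, r):
    states = [(w, c) for w in dyck_words(n)
              for c in product(range(1, r + 1), repeat=n)]
    index = {s: a for a, s in enumerate(states)}
    weight = Fraction(1, n * (2 * n - 1) * r)
    P = [[Fraction(0)] * len(states) for _ in states]
    for s in states:
        leaves = leaf_positions(s[0])
        # A non-leaf edge is selected with probability (n - #leaves) / n.
        P[index[s]][index[s]] += Fraction(n - len(leaves), n)
        for j in leaves:
            for k in range(1, 2 * n):
                for c in range(1, r + 1):
                    P[index[s]][index[replant(s, j, k, c)]] += weight
    return states, P


states, P = transition_matrix_Y(n=3, r=2)
print(len(states))                           # 40
print(all(sum(row) == 1 for row in P))       # True
```

With this encoding the matrix comes out symmetric, which is consistent with reversibility with respect to the uniform measure.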

Remark 5.1

Notice that both X and Y are reversible, irreducible and aperiodic. Reversibility and aperiodicity are clear from the definition; irreducibility is also clear: given any tree in \({\mathsf {T}}^{(r)}_n\), one can turn it into the tree of height 1 whose edges are all coloured 1 with at most n transitions from Y (indeed, it suffices to apply the replanting \(\cdot \mapsto \cdot ^{v,1,1}\) on the rightmost leaf v of the tree several times). Since each leaf replanting can actually be obtained by concatenating at most \(2n-1\) transitions from X (\(2n-2\) translations and one recolouring), the same height one tree can be obtained with at most \(n(2n-1)\) transitions from X. The two Markov chains X and Y therefore both admit the uniform measure on \({\mathsf {T}}^{(r)}_n\) as their unique stationary distribution.

5.1 A lower bound for the spectral gaps of leaf replanting and leaf translation Markov chains

In order to prove the desired lower bounds, we first need to set up some machinery; the first part of this section will be devoted to constructing a family of probability measures on sequences of transitions for the Markov chain \(Y\). This will be done via a family of functions \(f_n:{\mathsf {T}}^{(r)}_n\times {\mathsf {T}}^{(r)}_{n-1}\rightarrow {\mathbb {R}}\) and a function \(F:({\mathsf {T}}^{(r)}_n)^2\rightarrow {\mathsf {T}}^{(r)}_{n-1}\) with some specific properties, which we now state. Throughout this section, we will make extensive use of the fact that plane trees are counted by Catalan numbers, hence

$$\begin{aligned} \left| {\mathsf {T}}^{(r)}_n\right| =\frac{r^n}{n+1}{2n \atopwithdelims ()n}\sim \frac{(4r)^n}{\sqrt{\pi }n^{\frac{3}{2}}} \end{aligned}\tag{2}$$

and in particular \(|{\mathsf {T}}^{(r)}_{n+1}|< 4 r\cdot |{\mathsf {T}}^{(r)}_n|\).
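The counting formula (2) and the inequality that follows from it are easy to check numerically. The sketch below is only an illustration; the function names are chosen for this example.

```python
from math import comb, pi, sqrt


def num_coloured_trees(n, r):
    """Exact value of |T_n^(r)| = r^n * Catalan(n)."""
    return r ** n * comb(2 * n, n) // (n + 1)


def asymptotic(n, r):
    """Right-hand side of the asymptotic equivalence in (2)."""
    return (4 * r) ** n / (sqrt(pi) * n ** 1.5)


r = 2
for n in (5, 50, 200):
    exact = num_coloured_trees(n, r)
    # The ratio approaches 1 as n grows.
    print(n, exact / asymptotic(n, r))

# |T_{n+1}| < 4r |T_n| for a range of small n.
print(all(num_coloured_trees(n + 1, r) < 4 * r * num_coloured_trees(n, r)
          for n in range(0, 50)))
```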

Proposition 5.1

For \(n\ge 1\), there exists a mapping \(f_n:{\mathsf {T}}^{(r)}_n\times {\mathsf {T}}^{(r)}_{n-1}\rightarrow {\mathbb {R}}\) such that

  (i) \(f_n(t,t')=0\) if \(t'\) cannot be obtained from t by deleting a leaf;

  (ii) \(\displaystyle \sum \nolimits _{t'\in {\mathsf {T}}^{(r)}_{n-1}}f_n(t,t')=1\) for all t in \({\mathsf {T}}^{(r)}_n\);

  (iii) \(\displaystyle \sum \nolimits _{t\in {\mathsf {T}}^{(r)}_n}f_n(t,t')=\frac{|{\mathsf {T}}^{(r)}_n|}{|{\mathsf {T}}^{(r)}_{n-1}|}\) for all \(t'\) in \({\mathsf {T}}^{(r)}_{n-1}\).

Proof

We can recursively construct a mapping \(f_n\) with the required properties.

Indeed, \({\mathsf {T}}^{(r)}_1\times {\mathsf {T}}^{(r)}_0=\{P_1^{(1)},\ldots ,P_1^{(r)}\}\times \{\bullet \}\), where \(P_1^{(i)}\) is the tree with one edge which is coloured i and \(\bullet \) is the single vertex, and we can set \(f_1(P_1^{(i)},\bullet )=1\).

Notice that, for \(n>1\), we can define two functions \(L,R:{\mathsf {T}}^{(r)}_n\rightarrow \bigcup _{k=0}^{n-1}{\mathsf {T}}^{(r)}_k\) (see Fig. 9) by setting L(t) to be the tree of descendants of u in t, where \((\rho ,u)\) is the root edge of t (including u and with the natural rooting induced by that of t, unless u is a leaf, in which case \(L(t)=\bullet \)) and R(t) to be the tree obtained from t by erasing L(t) (and the edge \((\rho ,u)\)), rooted in the corner that contains the original root corner of t (unless \(\rho \) has degree 1, in which case \(R(t)=\bullet \)). We have \(0\le |L(t)|\le n-1\) and \(|L(t)|+|R(t)|=n-1\), and \(t \mapsto (L(t),R(t),c)\), where \(c\in \{1,\ldots ,r\}\) is the colour of the root edge in t, is a bijection between \({\mathsf {T}}^{(r)}_n\) and \(\bigcup _{k=0}^{n-1}({\mathsf {T}}^{(r)}_k\times {\mathsf {T}}^{(r)}_{n-1-k})\times \{1,\ldots ,r\}\).

We will now construct \(f_n:{\mathsf {T}}^{(r)}_n\times {\mathsf {T}}^{(r)}_{n-1}\rightarrow {\mathbb {R}}\), for \(n>1\), from the mappings \(f_1,\ldots ,f_{n-1}\), by using the projections L, R and n constants \(C^{(n)}_0,\ldots ,C^{(n)}_{n-1}\) which will be explicitly worked out below. Consider \((t,t')\in {\mathsf {T}}^{(r)}_n\times {\mathsf {T}}^{(r)}_{n-1}\). If \(|L(t)|>0\) and \(t'\) can be obtained from t by deleting a leaf contained in L(t), set \(f_n(t,t')=C^{(n)}_{|L(t)|}f_{|L(t)|}(L(t),L(t'))\); if \(t'\) can be obtained from t by deleting a leaf contained in R(t), set \(f_n(t,t')=C^{(n)}_{|R(t)|}f_{|R(t)|}(R(t),R(t'))\). Notice that the two conditions are mutually exclusive; if neither is satisfied, set \(f_n(t,t')=0\).

Fig. 9: The decomposition of a tree \(t\in {\mathsf {T}}_{13}\) into its left and right components \(L(t)\in {\mathsf {T}}_5\) and \(R(t)\in {\mathsf {T}}_7\)

We now have, for all t in \({\mathsf {T}}^{(r)}_n\) such that \(|L(t)|=k\) (where \(1\le k\le n-2\)),

$$\begin{aligned} \sum _{t'\in {\mathsf {T}}^{(r)}_{n-1}}f_n(t,t')= & {} C^{(n)}_{k}\sum _{t_l\in {\mathsf {T}}^{(r)}_{k-1}}f_{k}(L(t),t_l)+C^{(n)}_{n-k-1}\sum _{t_r\in {\mathsf {T}}^{(r)}_{n-k-2}}f_{n-k-1}(R(t),t_r)\\= & {} C^{(n)}_{k}+C^{(n)}_{n-k-1}, \end{aligned}$$

as well as \(\sum _{t'\in {\mathsf {T}}^{(r)}_{n-1}}f_n(t,t')=C^{(n)}_{n-1}\) if \(|L(t)|=0\) or \(|L(t)|=n-1\).

Furthermore, for all \(t'\in {\mathsf {T}}^{(r)}_{n-1}\) such that \(|L(t')|=k\) (where \(0\le k\le n-2\)),

$$\begin{aligned} \sum _{t\in {\mathsf {T}}^{(r)}_n}f_n(t,t')= & {} C^{(n)}_{k+1}\sum _{t_l\in {\mathsf {T}}^{(r)}_{k+1}}f_{k+1}(t_l,L(t'))+C^{(n)}_{n-k-1}\sum _{t_r\in {\mathsf {T}}^{(r)}_{n-k-1}}f_{n-k-1}(t_r,R(t'))\\= & {} C^{(n)}_{k+1}\frac{|{\mathsf {T}}^{(r)}_{k+1}|}{|{\mathsf {T}}^{(r)}_k|}+C^{(n)}_{n-k-1}\frac{|{\mathsf {T}}^{(r)}_{n-k-1}|}{|{\mathsf {T}}^{(r)}_{n-k-2}|}. \end{aligned}$$

Notice that \(f_n\) has property (i) by construction; to enforce properties (ii) and (iii), it is sufficient to choose

$$\begin{aligned} C^{(n)}_i=\frac{i(i+1)(3n-2i-1)}{(n-1)n(n+1)} \end{aligned}$$

for \(i=0,\ldots ,n-1\), since one then has \(C^{(n)}_0=0\) and \(C^{(n)}_i+C^{(n)}_{n-i-1}=1\) for \(0\le i\le n-1\), as well as

$$\begin{aligned}&C^{(n)}_i\frac{|{\mathsf {T}}^{(r)}_{i}|}{|{\mathsf {T}}^{(r)}_{i-1}|}+C^{(n)}_{n-i}\frac{|{\mathsf {T}}^{(r)}_{n-i}|}{|{\mathsf {T}}^{(r)}_{n-i-1}|}=C^{(n)}_i\frac{2r(2i-1)}{i+1}+C^{(n)}_{n-i}\frac{2r(2n-2i-1)}{n-i+1}\\&\quad =\frac{2r(2n-1)}{n+1}=\frac{|{\mathsf {T}}^{(r)}_{n}|}{|{\mathsf {T}}^{(r)}_{n-1}|} \end{aligned}$$

for all \(1\le i\le n-1\), by (2). \(\square \)
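The recursive construction in the proof can be carried out by machine for small n, which gives a direct check of properties (i) to (iii). The sketch below treats the uncoloured case \(r=1\) only and encodes a tree as a balanced parenthesis string \((A)B\) with \(A=L(t)\) and \(B=R(t)\); this encoding and all function names are assumptions made for the illustration.

```python
from fractions import Fraction
from functools import lru_cache


def dyck_words(n):
    """All balanced parenthesis strings with n pairs."""
    if n == 0:
        return [""]
    return ["(" + a + ")" + b
            for k in range(n)
            for a in dyck_words(k)
            for b in dyck_words(n - 1 - k)]


def C(n, i):
    """The constant C^(n)_i chosen in the proof (defined for n >= 2)."""
    return Fraction(i * (i + 1) * (3 * n - 2 * i - 1), (n - 1) * n * (n + 1))


def split(word):
    """Decompose a nonempty word '(' + A + ')' + B into (A, B) = (L(t), R(t))."""
    depth = 0
    for j, ch in enumerate(word):
        depth += 1 if ch == "(" else -1
        if depth == 0:
            return word[1:j], word[j + 1:]
    raise ValueError("not a balanced word")


@lru_cache(maxsize=None)
def f(t, t_prime):
    """f_n(t, t') for r = 1, following the recursion in the proof."""
    n = len(t) // 2
    if len(t_prime) != len(t) - 2:
        return Fraction(0)
    if n == 1:
        return Fraction(1)
    A, B = split(t)
    A2, B2 = split(t_prime)
    if B == B2:                      # the deleted leaf lies in L(t)
        return C(n, len(A) // 2) * f(A, A2)
    if A == A2:                      # the deleted leaf lies in R(t)
        return C(n, len(B) // 2) * f(B, B2)
    return Fraction(0)


def row_sums(n):
    """Sums over t' of f_n(t, t'), one per tree t; property (ii) says they equal 1."""
    return [sum(f(t, s) for s in dyck_words(n - 1)) for t in dyck_words(n)]


def column_sums(n):
    """Sums over t of f_n(t, t'); property (iii) says they equal |T_n| / |T_{n-1}|."""
    return [sum(f(t, s) for t in dyck_words(n)) for s in dyck_words(n - 1)]


print(row_sums(3))       # five entries, each equal to 1
print(column_sums(3))    # two entries, each equal to 5/2
```

For \(n=3\) the column sums equal \(5/2=|{\mathsf {T}}_3|/|{\mathsf {T}}_2|\), in agreement with (iii).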

Lemma 5.2

There is a mapping \(F:({\mathsf {T}}^{(r)}_n)^2\rightarrow {\mathsf {T}}^{(r)}_{n-1}\) such that for all \(t\in {\mathsf {T}}^{(r)}_n\) and \(\tau \in {\mathsf {T}}^{(r)}_{n-1}\) we have \(|\{t'|F(t,t')=\tau \}|\le 8r\) and \(|\{t'|F(t',t)=\tau \}|\le 8r\).

Proof

Enumerate the elements of \({\mathsf {T}}^{(r)}_n\) as \(t_1,\ldots ,t_{|{\mathsf {T}}^{(r)}_n|}\) and the elements of \({\mathsf {T}}^{(r)}_{n-1}\) as \(\tau _0,\ldots ,\tau _{|{\mathsf {T}}^{(r)}_{n-1}|-1}\), in some order. The function that sends \((t_a,t_b)\) to \(\tau _S\), where \(S=(a+b) \bmod |{\mathsf {T}}^{(r)}_{n-1}|\), satisfies the requirements. Indeed, we have \(|{\mathsf {T}}^{(r)}_n|=2r\frac{2n-1}{n+1}|{\mathsf {T}}^{(r)}_{n-1}|<4r|{\mathsf {T}}^{(r)}_{n-1}|\) and therefore \(a+b\le 2|{\mathsf {T}}^{(r)}_n|<8r|{\mathsf {T}}^{(r)}_{n-1}|\). Given \(\tau =\tau _S\) and \(t=t_a\), there are then at most 8r possibilities for \(a+b\) and therefore at most 8r possibilities for b. \(\square \)
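The map used in this proof only depends on the two cardinalities, so its preimage bound can be checked by exhaustive counting for small n and r. The sketch below works with indices instead of trees; the function names are assumptions made for this example.

```python
from math import comb


def num_coloured_trees(n, r):
    """|T_n^(r)| = r^n * Catalan(n)."""
    return r ** n * comb(2 * n, n) // (n + 1)


def make_F(n, r):
    """Index version of F: (a, b) -> (a + b) mod |T_{n-1}^(r)|."""
    size_prev = num_coloured_trees(n - 1, r)

    def F(a, b):
        return (a + b) % size_prev

    return F


def max_preimages(n, r):
    """Largest number of b with F(a, b) = S, over all a and all targets S."""
    size_n = num_coloured_trees(n, r)
    size_prev = num_coloured_trees(n - 1, r)
    F = make_F(n, r)
    worst = 0
    for a in range(1, size_n + 1):
        counts = [0] * size_prev
        for b in range(1, size_n + 1):
            counts[F(a, b)] += 1
        worst = max(worst, max(counts))
    return worst


for n in (2, 3, 4):
    for r in (1, 2):
        # Compare the observed maximum with the bound 8r of the lemma.
        print(n, r, max_preimages(n, r), 8 * r)
```

Since \(F\) is symmetric in its two arguments, the same count covers both sets in the statement of the lemma.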

We intend to prove lower bounds for the spectral gap of the leaf replanting Markov chain \(Y\) by assigning each pair of trees \(x,y\in {\mathsf {T}}^{(r)}_n\) a canonical path of leaf replanting moves turning x into y—or rather a probability measure on the set of possible paths from x to y. Such a probability measure will be constructed by using a set of functions \(f_i:{\mathsf {T}}^{(r)}_i\times {\mathsf {T}}^{(r)}_{i-1}\rightarrow {\mathbb {R}}\) with the requirements of Proposition 5.1 and a function \(F:({\mathsf {T}}^{(r)}_n)^2\rightarrow {\mathsf {T}}^{(r)}_{n-1}\) as in Lemma 5.2.

Let \(L,R:{\mathsf {T}}^{(r)}_n\rightarrow \bigcup _{i=0}^{n-1}{\mathsf {T}}^{(r)}_i\) be the mappings defined within the proof of Proposition 5.1 and depicted in Fig. 9.

First, given a tree \(x\in {\mathsf {T}}^{(r)}_n\) and a tree \(x'\in {\mathsf {T}}^{(r)}_n\) such that \(L(x')\in {\mathsf {T}}^{(r)}_{n-1}\), we will define a probability measure on paths from x to \(x'\). Then, given generic trees \(x,y\in {\mathsf {T}}^{(r)}_n\), we will construct random paths from x to y by concatenating paths from x to \(z\in {\mathsf {T}}^{(r)}_n\), where \(L(z)=F(x,y)\in {\mathsf {T}}^{(r)}_{n-1}\), and from z to y.

Given \(t\in {\mathsf {T}}^{(r)}_n\), consider the set \(\Gamma ^t\) of sequences \(t_0,\ldots ,t_n\) such that \(t_0=t\), \(t_i\in {\mathsf {T}}^{(r)}_{n-i}\) and \(t_{i+1}\) is obtained from \(t_i\) by erasing a leaf; define the probability measure \(Q^t\) on \(\Gamma ^t\) as

$$\begin{aligned} Q^t(t_0,\ldots ,t_n)=\prod _{i=0}^{n-1}f_{n-i}(t_i,t_{i+1}), \end{aligned}$$

that is the law of a sequence of random trees \(\theta _0,\ldots ,\theta _n\) such that \(\theta _0=t\) and that, given \(\theta _i=t_i\in {\mathsf {T}}^{(r)}_{n-i}\), the tree \(\theta _{i+1}\) is chosen in \({\mathsf {T}}^{(r)}_{n-i-1}\) according to the probability measure \(f_{n-i}(t_i,\cdot )\).

Given \(x,y\in {\mathsf {T}}^{(r)}_n\), consider now the set \(\Gamma _{x\rightarrow y}\) of all paths \((t_i,t_{i+1})_{i=0}^{2n-1}\) such that

  • for all i between 0 and 2n, the tree \(t_i\) belongs to \({\mathsf {T}}^{(r)}_n\);

  • \(t_0=x\), \(t_{2n}=y\) and \(L(t_n)=F(x,y)\) (which, since \(|F(x,y)|=n-1\), determines \(t_n\));

  • the tree \(t_1\) is obtained by replanting a leaf of \(t_0\) onto corner 1; for \(0<i<n\), the tree \(t_{i+1}\) is obtained by removing a leaf from \(R(t_i)\) and replanting it onto a corner of \(L(t_i)\);

  • similarly, the tree \(t_{2n-1}\) can be obtained by replanting a leaf of \(t_{2n}\) onto corner 1; for \(n<i<2n\), the tree \(t_{i-1}\) can be obtained by removing a leaf from \(R(t_i)\) and replanting it onto a corner of \(L(t_i)\).

In other words, for \(\gamma =(t_i,t_{i+1})_{i=0}^{2n-1}\in \Gamma _{x\rightarrow y}\), we have that the two sequences \(L_1(\gamma )=(L(t_n),\ldots ,L(t_1))\) and \(L_2(\gamma )=(L(t_n),\ldots ,L(t_{2n-1}))\) belong to \(\Gamma ^{F(x,y)}\), while \(R_1(\gamma )=(t_0, R(t_1),\ldots ,R(t_n))\in \Gamma ^{x}\) and \(R_2(\gamma )=(t_{2n},R(t_{2n-1}),\ldots ,R(t_n)) \in \Gamma ^{y}\) (Fig. 10). Vice-versa, any quadruple of sequences \(L_1,L_2\in \Gamma ^{F(x,y)}, R_1\in \Gamma ^x,R_2\in \Gamma ^y\) can be assembled into a path \(\gamma \in \Gamma _{x\rightarrow y}\).

We can thus construct a probability measure \(P_{x\rightarrow y}\) on \(\Gamma _{x\rightarrow y}\) by setting

$$\begin{aligned} P_{x\rightarrow y}(\gamma )=Q^x(R_1(\gamma ))Q^y(R_2(\gamma ))Q^{F(x,y)}(L_1(\gamma ))Q^{F(x,y)}(L_2(\gamma )). \end{aligned}$$

Fig. 10: The form of a path in \(\Gamma _{t_0\rightarrow t_{10}}\), where \(t_0,t_{10}\in {\mathsf {T}}_5\); notice that \(|L(t_5)|=4\)

Before we prove a lower bound for the spectral gap of the leaf replanting Markov chain, it is useful to establish the following estimate:

Lemma 5.3

For all \(t\in {\mathsf {T}}^{(r)}_n\) and \(i\in \{0,\ldots ,2n\}\) we have

$$\begin{aligned} \sum _{x,y\in {\mathsf {T}}^{(r)}_n}P_{x\rightarrow y}(\{\gamma \in \Gamma _{x\rightarrow y}: \gamma (i)=t\})\le 2(4r)^{n+1}, \end{aligned}$$

where, if \(\gamma =(t_i,t_{i+1})_{i=0}^{2n-1}\), we write \(\gamma (i)\) to indicate \(t_i\).

Proof

As above, given \(\gamma \in \Gamma _{x\rightarrow y}\), let us define sequences \(R_i(\gamma )=(R_i^0,\ldots ,R_i^n)\) and \(L_i(\gamma )=(L_i^1,\ldots ,L_i^n)\), for \(i=1,2\).

If \(0<i\le n\), we have \(\gamma (i)=t\) if and only if \(R_1^i=R(t)\) and \(L_1^{n+1-i}=L(t)\); therefore, we have

$$\begin{aligned}&\sum _{x,y\in {\mathsf {T}}^{(r)}_n}P_{x\rightarrow y}(\{\gamma \in \Gamma _{x\rightarrow y}: \gamma (i)=t\})\\&\quad = \sum _{x\in {\mathsf {T}}^{(r)}_n}Q^x(\{(R_j)_{j=0}^n\in \Gamma ^x: R_{i}=R(t)\}) \\&\qquad \times \sum _{z\in {\mathsf {T}}^{(r)}_{n-1}}Q^z(\{(L_j)_{j=1}^n\in \Gamma ^z: L_{n+1-i}=L(t)\}) \sum _{\begin{array}{c} y\in {\mathsf {T}}^{(r)}_n\\ F(x,y)=z \end{array}}1 \end{aligned}$$

Since we have chosen F as in Lemma 5.2, the internal sum (having fixed x and z) is at most 8r; as for the other sums, we wish to show that, for any given \(i\in \{0,\ldots ,n\}\) and \(t\in {\mathsf {T}}^{(r)}_{n-i}\), we have

$$\begin{aligned} \sum _{x\in {\mathsf {T}}^{(r)}_n}Q^x(\{(R_j)_{j=0}^n\in \Gamma ^x: R_i=t\})\le (4r)^i; \end{aligned}\tag{3}$$

but, indeed,

$$\begin{aligned} \sum _{x\in {\mathsf {T}}^{(r)}_n}Q^x(\{(R_j)_{j=0}^n\in \Gamma ^x: R_i=t\})&=\sum _{\begin{array}{c} R_0\in {\mathsf {T}}^{(r)}_n,\ldots ,R_{i-1}\in {\mathsf {T}}^{(r)}_{n-i+1}\\ R_i=t \end{array}}\prod _{j=0}^{i-1}f_{n-j}(R_j,R_{j+1})\\&= \prod _{j=0}^{i-1}\frac{|{\mathsf {T}}^{(r)}_{n-j}|}{|{\mathsf {T}}^{(r)}_{n-j-1}|}\le (4r)^i, \end{aligned}$$

which we obtain by separating \(R_{i-1},R_{i-2},\ldots ,R_0\) from the sum, one after the other, and using the fact that \(f_{n-i+1},\ldots ,f_n\) satisfy requirement (iii) of Proposition 5.1.

Using (3) (where one needs to be wary of the fact that, in order to keep notation consistent with previous definitions, \(R_1\) is indexed from 0 and \(L_1\) is indexed from 1) we get

$$\begin{aligned} \sum _{x,y\in {\mathsf {T}}^{(r)}_n}P_{x\rightarrow y}(\{\gamma \in \Gamma _{x\rightarrow y}: \gamma (i)=t\})\le 8r\cdot (4r)^i\cdot (4r)^{n-i}\le 2 (4r)^{n+1} \end{aligned}$$

for \(0<i\le n\); the same estimate is true for \(i=0\), since we have

$$\begin{aligned} \sum _{y\in {\mathsf {T}}^{(r)}_n}P_{t\rightarrow y}(\{\gamma \in \Gamma _{t\rightarrow y}\})= \sum _{z\in {\mathsf {T}}^{(r)}_{n-1}} \sum _{\begin{array}{c} y\in {\mathsf {T}}^{(r)}_n\\ F(t,y)=z \end{array}}1\le 8r\cdot |{\mathsf {T}}^{(r)}_{n-1}|\le 8r\cdot (4r)^n. \end{aligned}$$

The case of \(n<i\le 2n\) is perfectly symmetric. \(\square \)

All necessary notation is now in place to prove lower bounds for the spectral gap of both the leaf replanting and leaf translation Markov chains.

Theorem 3

If \(\gamma _{Y}\) is the spectral gap of the leaf replanting Markov chain \(Y\) on \({\mathsf {T}}^{(r)}_n\), we have \(\gamma _{Y}\ge C_r n^{-\frac{9}{2}}\) for an appropriate constant \(C_r\) independent of n.

Proof

By the canonical paths method (see for example [19, Section 13.4]), we have

$$\begin{aligned} \frac{1}{\gamma _{Y}}\le \max _{t,t'\in {\mathsf {T}}^{(r)}_n:p_Y(t,t')>0}\frac{1}{\pi (t)p_{Y}(t,t')}\sum _{x,y\in {\mathsf {T}}^{(r)}_n}\sum _{\begin{array}{c} \gamma \in \Gamma _{x\rightarrow y}:\\ (t,t')\in \gamma \end{array}}|\gamma |P_{x\rightarrow y}(\gamma )\pi (x)\pi (y), \end{aligned}$$

where, if \(\gamma =(t_i,t_{i+1})_{i=0}^{N-1}\), we are writing \(|\gamma |\) to mean the length N of the sequence, and we say that \((t,t')\in \gamma \) if \(t=t_i\) and \(t'=t_{i+1}\) for some \(i\in \{0,\ldots ,N-1\}\).

By the description of the leaf replanting Markov chain, we know that (assuming \(t,t'\) differ by the replanting of a leaf) \(p_{Y}(t,t')\ge \frac{1}{2rn^2}\); furthermore, every path \(\gamma \in \Gamma _{x\rightarrow y}\) (for \(x,y\in {\mathsf {T}}^{(r)}_n\)) has length exactly 2n. Using the fact that \(\pi \) is the uniform measure on \({\mathsf {T}}^{(r)}_n\) and setting \((t,t')\) to be a pair of trees achieving the maximum above, one obtains

$$\begin{aligned} \frac{1}{\gamma _{Y}}\le \frac{2n\cdot 2rn^2}{|{\mathsf {T}}^{(r)}_n|}\sum _{x,y\in {\mathsf {T}}^{(r)}_n}\sum _{(t,t')\in \gamma \in \Gamma _{x\rightarrow y}}P_{x\rightarrow y}(\gamma ). \end{aligned}$$

All that is left to do is to estimate the sum of all probabilities \(P_{x\rightarrow y}(\gamma )\), where \(\gamma \) is a path in some \(\Gamma _{x\rightarrow y}\) involving the transition \((t,t')\). Notice that, if \((t,t')\) appears in \(\gamma \), then the fact that for \(0<i\le n\) we have \(|L(\gamma (i))|=i-1\) and for \(n\le i<2n\) we have \(|L(\gamma (i))|=2n-1-i\) implies that either \(t=\gamma (0)\), or \(t=\gamma (|L(t)|+1)\), or \(t=\gamma (2n-1-|L(t)|)\); we therefore have, if we set \(S=\{0,|L(t)|+1,2n-1-|L(t)|\}\),

$$\begin{aligned} \frac{1}{\gamma _{Y}}\le \frac{2n\cdot 2rn^2}{|{\mathsf {T}}^{(r)}_n|}\sum _{\begin{array}{c} i\in S \end{array}}\sum _{x,y\in {\mathsf {T}}^{(r)}_n} P_{x\rightarrow y}(\{\gamma \in \Gamma _{x\rightarrow y}: \gamma (i)=t\}); \end{aligned}$$

by Lemma 5.3, this yields

$$\begin{aligned} \frac{1}{\gamma _{Y}}\le \frac{2n\cdot 2rn^2}{|{\mathsf {T}}^{(r)}_n|}3\cdot 2(4r)^{n+1}\le C_r\cdot n^{3+\frac{3}{2}}\le C_r\cdot n^{\frac{9}{2}}, \end{aligned}$$

as wanted. \(\square \)
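The order \(n^{\frac{9}{2}}\) in the last display can be illustrated numerically. The sketch below is only a sanity check under the assumption (not restated in this section) that \(|{\mathsf {T}}^{(r)}_n|=r^n C_n\); the function names are hypothetical. It evaluates the right-hand side \(\frac{2n\cdot 2rn^2}{|{\mathsf {T}}^{(r)}_n|}\cdot 3\cdot 2(4r)^{n+1}\) exactly and divides it by \(n^{\frac{9}{2}}\).

```python
from fractions import Fraction
from math import comb


def num_coloured_trees(n: int, r: int) -> int:
    """Assumed size of T_n^(r): r^n times the n-th Catalan number."""
    return r ** n * (comb(2 * n, n) // (n + 1))


def inverse_gap_bound(n: int, r: int) -> Fraction:
    """Exact value of 2n * 2rn^2 * 3 * 2(4r)^(n+1) / |T_n^(r)|."""
    numerator = 2 * n * 2 * r * n ** 2 * 3 * 2 * (4 * r) ** (n + 1)
    return Fraction(numerator, num_coloured_trees(n, r))


def normalised_bound(n: int, r: int) -> float:
    """The bound divided by n^(9/2); it should stay bounded as n grows."""
    return float(inverse_gap_bound(n, r)) / n ** 4.5
```

With this count the bound simplifies to \(96r^2n^3\cdot 4^n/C_n\), and since \(4^n/C_n\) grows like a constant times \(n^{\frac{3}{2}}\), the normalised quantity remains bounded by a constant depending only on r.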

The proof of a lower bound for the spectral gap of the leaf translation Markov chain is analogous, if a little more fiddly.

Theorem 4

If \(X\) is the leaf translation Markov chain on the state space of r-coloured plane trees with n edges \({\mathsf {T}}^{(r)}_n\) and \(\gamma _{X}\) is its spectral gap, we have

$$\begin{aligned} \gamma _X\ge C_r n^{-\frac{9}{2}} \end{aligned}$$

for some constant \(C_r\) independent of n.

Proof

Suppose \(t,t'\in {\mathsf {T}}^{(r)}_n\) differ by the replanting and recolouring of a leaf (i.e. are such that \(p_Y(t,t')>0\)); the leaf being replanted and recoloured may not be uniquely determined, but let v be the leftmost leaf and k be the minimum integer, given v, such that \(t'=t^{v,k,c}\). If k is greater than or equal to the number of the corner of v in t, we construct a “leaf translation path” \(\gamma (t,t')=(t_i,t_{i+1})_{i=0}^{N-1}\) from t to \(t'\) by setting \(t_0=t\), \(t_{i+1}=t_i^{v,\rightarrow }\) and choosing N to be as small as possible and such that \(t_{N-1}\) is \(t'\) up to the recolouring of the replanted leaf; finally, we set \(t_N=t_{N-1}^{v,c}\). Similarly, if the corner of v in t is indexed by a number strictly greater than k, we construct \(\gamma (t,t')\) as a (minimal) sequence of leftward translations of v, followed by a recolouring.

Given \(t_0,t_{2n}\in {\mathsf {T}}^{(r)}_n\) and \(\gamma =(t_i,t_{i+1})_{i=0}^{2n-1}\in \Gamma _{t_0\rightarrow t_{2n}}\), we can now define a “leaf translation” path \(\gamma ^X\) by concatenating \(\gamma _0,\ldots ,\gamma _{2n-1}\), where \(\gamma _i=\gamma (t_i,t_{i+1})\). We call \(\Gamma ^X_{t_0\rightarrow t_{2n}}\) the set \(\{\gamma ^X: \gamma \in \Gamma _{t_0\rightarrow t_{2n}}\}\). Notice that \(\gamma \mapsto \gamma ^X\) is a bijection between \(\Gamma _{t_0\rightarrow t_{2n}}\) and \(\Gamma ^X_{t_0\rightarrow t_{2n}}\), since the sequence \(\gamma \) can be reconstructed from \(\gamma ^{X}\) by setting \(\gamma (0)=\gamma ^X(0)\) and \(\gamma (i+1)=\gamma ^X(x)\), where

$$\begin{aligned} x=\min \{j>i : \gamma ^X(j) \text{ is } \text{ of } \text{ the } \text{ form } \gamma ^X(j-1)^{v,c} \text{ with } c\notin \{\leftarrow ,\rightarrow \}\}. \end{aligned}$$

We can therefore define a probability measure on \(\Gamma ^X_{t_0\rightarrow t_{2n}}\), which we still call \(P_{t_0\rightarrow t_{2n}}\), by simply setting \(P_{t_0\rightarrow t_{2n}}(\gamma ^X)=P_{t_0\rightarrow t_{2n}}(\gamma )\).

By the canonical paths method, we have

$$\begin{aligned} \frac{1}{\gamma _{X}}\le \max _{t,t'\in {\mathsf {T}}^{(r)}_n:p_X(t,t')>0}\frac{1}{\pi (t)p_{X}(t,t')}\sum _{x,y\in {\mathsf {T}}^{(r)}_n}\sum _{\begin{array}{c} \gamma \in \Gamma _{x\rightarrow y}:\\ (t,t')\in \gamma ^X \end{array}}|\gamma ^X|P_{x\rightarrow y}(\gamma )\pi (x)\pi (y). \end{aligned}$$

Notice that for all \(\gamma \) we have \(|\gamma ^X|\le 4n^2\) (by Remark 5.1) and, for all \(t,t'\in {\mathsf {T}}^{(r)}_n\) such that \(p_X(t,t')>0\), we have \(p_X(t,t')\ge \frac{1}{n(r+2)}\). We therefore have, if we also replace occurrences of \(\pi (\cdot )\) with \(|{\mathsf {T}}^{(r)}_n|^{-1}\),

$$\begin{aligned} \frac{1}{\gamma _{X}}\le \frac{4n^3(r+2)}{|{\mathsf {T}}^{(r)}_n|}\max _{(t,t^{v,c})}\sum _{x,y\in {\mathsf {T}}^{(r)}_n}\sum _{\begin{array}{c} \gamma \in \Gamma _{x\rightarrow y}:\\ (t,t^{v,c})\in \gamma ^X \end{array}}P_{x\rightarrow y}(\gamma ). \end{aligned}$$

Given a leaf translation or recolouring \((t,t^{v,c})\), we now wish to determine which paths \(\gamma =(t_i,t_{i+1})_{i=0}^{2n-1}\in \Gamma _{t_0\rightarrow t_{2n}}\) are such that \((t,t^{v,c})\) appears in \(\gamma ^X\).

If \(c\notin \{\rightarrow ,\leftarrow \}\), then \(t^{v,c}=t_i\) for some i which is almost uniquely determined by \(|L(t^{v,c})|\): we necessarily have \(i=2n\) or \(i=|L(t^{v,c})|+1\) or \(i=2n-|L(t^{v,c})|-1\). We therefore find that

$$\begin{aligned}&\sum _{x,y\in {\mathsf {T}}^{(r)}_n}\sum _{\begin{array}{c} \gamma \in \Gamma _{x\rightarrow y}:\\ (t,t^{v,c})\in \gamma ^X \end{array}}P_{x\rightarrow y}(\gamma )\le \sum _{x,y\in {\mathsf {T}}^{(r)}_n}\sum _{\begin{array}{c} i\in \{2n,|L(t)|+1,\\ 2n-1-|L(t)|\} \end{array}}P_{x\rightarrow y}(\{\gamma \in \Gamma _{x\rightarrow y}: \gamma (i)=t\})\\&\quad \le 6\cdot (4r)^{n+1} \end{aligned}$$

If \(c=\leftarrow \), consider the tree \(\tau \) obtained by removing v from \(t^{v,c}\). We must have either \(\tau =R(t_1)\), or \(R(\tau )=R(t_i)\) and \(L(\tau )=L(t_{i-1})\), with \(i=|L(\tau )|+1\). Similarly for \(c=\rightarrow \): either \(\tau =R(t_{2n-1})\), or \(R(\tau )=R(t_{i-1})\) and \(L(\tau )=L(t_i)\), with \(i=2n-1-|L(\tau )|\). The proof of Lemma 5.3 can be modified slightly to yield that, for any fixed \(i\in \{1,\ldots ,n\}\) and \(a,b\in \bigcup _{j=0}^{n}{\mathsf {T}}^{(r)}_j\), we have

$$\begin{aligned} \sum _{x,y\in {\mathsf {T}}^{(r)}_n}P_{x\rightarrow y}(\{\gamma \in \Gamma _{x\rightarrow y}: R(\gamma (i))=a, L(\gamma (i-1))=b\})\le 2(4r)^{n+1}. \end{aligned}$$

Applying both the original inequality from Lemma 5.3 and this variant yields that, when \(c\in \{\rightarrow ,\leftarrow \}\),

$$\begin{aligned} \sum _{x,y\in {\mathsf {T}}^{(r)}_n}\sum _{\begin{array}{c} \gamma \in \Gamma _{x\rightarrow y}:\\ (t,t^{v,c})\in \gamma ^X \end{array}}P_{x\rightarrow y}(\gamma )\le 6\cdot (4r)^{n+1}. \end{aligned}$$

Finally, this entails

$$\begin{aligned} \frac{1}{\gamma _{X}}\le \frac{24n^3(r+2)}{|{\mathsf {T}}^{(r)}_n|}(4r)^{n+1}\le C_r\cdot n^{\frac{9}{2}} \end{aligned}$$

for some constant \(C_r\) independent of n. \(\square \)

Remark 5.2

In order to obtain results for the flip chain \({\mathcal {F}}^n\), we are only interested in lower bounds for the spectral gap of the chain \(X\). It is not difficult, however, to obtain upper bounds proportional to \(n^{-2}\) for the spectral gaps \(\gamma _X\) and \(\gamma _Y\) of \(X\) and \(Y\) by an argument even simpler than that of Sect. 4.

Indeed, let \(H_n:{\mathsf {T}}^{(r)}_n\rightarrow {\mathbb {N}}\) be the function giving the height of a tree, and consider the Dirichlet forms \({\mathcal {E}}_X(n^{-\frac{1}{2}}H_n,n^{-\frac{1}{2}}H_n)\) and \({\mathcal {E}}_Y(n^{-\frac{1}{2}}H_n,n^{-\frac{1}{2}}H_n)\). Both can be bounded above by (a constant times) \(n^{-2}\), using the fact that \(H_n\) changes by at most 1 when a leaf replanting/translation/recolouring is performed, and that moreover, given \(t\in {\mathsf {T}}^{(r)}_n\), its height decreases with probability at most \(\frac{1}{n}\) when taking a step of either chain (since one has to remove the “top leaf”, which even needs to be unique). The bound on the spectral gap is then established thanks to the fact that the random variable \(n^{-\frac{1}{2}}H_n(t)\), where t is a uniform random element of \({\mathsf {T}}^{(r)}_n\), converges to a nontrivial random variable as \(n\rightarrow \infty \), and in fact its variance converges to a positive constant (cf. [1, Section 3.1]).

6 A lower bound for the spectral gap of \({\mathcal {F}}^n\)

6.1 Edge flips and the leaf translation Markov chain

We will now set up a comparison à la Diaconis–Saloff-Coste [15] between the Markov chain \({\mathcal {F}}^{n,\bullet }\) and a variant of the leaf translation Markov chain \(X\) on \(\mathsf {LT}_n\); as per Theorem 2, we have an explicit bijection \(\phi \) between the state space \({\mathsf {Q}}_n^\bullet \) of \({\mathcal {F}}^{n,\bullet }\) and the set \(\mathsf {LT}_n\times \{-1,1\}\).

A (reversible, irreducible and aperiodic) variant \({\widetilde{X}}\) of the leaf translation Markov chain can be defined on \(\mathsf {LT}_n\times \{-1,1\}={\mathsf {T}}^{(3)}_n\times \{-1,1\}\), where we consider the set of edge colours to be \(\{+,-,=\}\) rather than \(\{1,2,3\}\), as follows: conditionally on \({\widetilde{X}}_k=(t,\varepsilon )\), where \(t\in \mathsf {LT}_n\) and \(\varepsilon \in \{-1,1\}\), we set

  • \({\widetilde{X}}_{k+1}=(t,-\varepsilon )\) with probability \(\frac{1}{n+1}\);

  • with probability \(\frac{n}{n+1}\), we select a random edge \((v,p(v))\) of t; if v is not a leaf, we set \({\widetilde{X}}_{k+1}=(t,\varepsilon )\); if v is a leaf, we set \({\widetilde{X}}_{k+1}\) to be one of \((t^{v,\rightarrow },\varepsilon )\), \((t^{v,\leftarrow },\varepsilon )\), \((t^{v,+},\varepsilon )\), \((t^{v,-},\varepsilon )\), \((t^{v,=},\varepsilon )\), each with probability \(\frac{1}{5}\) given the choice of v.
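The two rules above can be turned into a table of proposal probabilities. The following sketch assumes that the random edge is chosen uniformly among the n edges of t; the tree itself is not modelled, leaves are represented by arbitrary labels, and all names are illustrative.

```python
from fractions import Fraction

# The five moves available at a leaf: two translations and three recolourings.
LEAF_MOVES = ("->", "<-", "+", "-", "=")


def proposal_weights(n, leaves):
    """Probability of each proposal from a state (t, eps), for a tree t with n edges.

    Distinct proposals may lead to the same state, for instance when t^{v,x} = t,
    so these are proposal weights rather than entries of the transition matrix.
    """
    weights = {("flip_sign",): Fraction(1, n + 1)}
    for v in leaves:
        for x in LEAF_MOVES:
            # Branch with probability n/(n+1), then edge (v, p(v)), then move x.
            weights[("leaf", v, x)] = Fraction(n, n + 1) * Fraction(1, n) * Fraction(1, 5)
    # Selecting an edge whose lower endpoint is not a leaf leaves the state unchanged.
    weights[("non_leaf_edge",)] = Fraction(n, n + 1) * Fraction(n - len(leaves), n)
    return weights
```

Each leaf move receives weight \(\frac{1}{5(n+1)}\), which is the factor appearing in the Dirichlet form of \({\widetilde{X}}\), and the weights sum to 1.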

From Theorem 4 we can deduce the following analogous estimate for the spectral gap of this chain.

Corollary 6.1

If \({\widetilde{\gamma }}\) is the spectral gap of the Markov chain \({\widetilde{X}}\) on the state space \(\mathsf {LT}_n\times \{-1,1\}\) as defined above, we have \({\widetilde{\gamma }}\ge C n^{-\frac{9}{2}}\) for some constant C.

Proof

Let \(f:\mathsf {LT}_n\times \{-1,1\}\rightarrow {\mathbb {R}}\) be a function such that \({\mathbb {E}}_\pi (f)=0\), \({\mathbb {V}}_\pi (f)=1\) and \({\widetilde{\gamma }}={\mathcal {E}}_{\widetilde{X}}(f,f)\), where \(\pi \) is the uniform measure on \(\mathsf {LT}_n\times \{-1,1\}\) and \({\mathcal {E}}_{\widetilde{X}}\) is the Dirichlet form for the Markov chain \({\widetilde{X}}\). Then

$$\begin{aligned} {\widetilde{\gamma }}= & {} \frac{1}{2}\sum _{\begin{array}{c} t\in \mathsf {LT}_n\\ v \text{ leaf } \text{ of } t\\ x\in \{\rightarrow ,\leftarrow ,+,-,=\}\\ \varepsilon \in \{-1,1\} \end{array}}(f(t,\varepsilon )-f(t^{v,x},\varepsilon ))^2\frac{1}{2|\mathsf {LT}_n|}\frac{1}{5(n+1)}\\&+\frac{1}{2} \sum _{\begin{array}{c} t\in \mathsf {LT}_n\\ \varepsilon \in \{-1,1\} \end{array}}(f(t,\varepsilon )-f(t,-\varepsilon ))^2\frac{1}{2|\mathsf {LT}_n|}\frac{1}{n+1}. \end{aligned}$$

Consider now the maps \(f_1,f_{-1}:\mathsf {LT}_n\rightarrow {\mathbb {R}}\) defined so that \(f_\varepsilon (t)=f(t,\varepsilon )\) and the leaf translation Markov chain \(X\) on \(\mathsf {LT}_n\). We can immediately identify the first of the two sums above as

$$\begin{aligned} \frac{n}{2(n+1)}\left( {\mathcal {E}}_X(f_1,f_1)+{\mathcal {E}}_X(f_{-1},f_{-1})\right) . \end{aligned}$$

On the other hand, a lower bound for the second sum is given by \(\frac{2}{n+1}{\mathbb {V}}({\mathbb {E}}(f|\varepsilon ))\), where \(\varepsilon :\mathsf {LT}_n\times \{-1,1\}\rightarrow \{-1,1\}\) is the projection on the second component, since a simple application of the Cauchy–Schwarz inequality gives

$$\begin{aligned}&\sum _{\begin{array}{c} t\in \mathsf {LT}_n\\ \varepsilon \in \{-1,1\} \end{array}}(f(t,\varepsilon )-f(t,-\varepsilon ))^2\frac{1}{2|\mathsf {LT}_n|}\frac{1}{n+1}\\&\quad = \frac{1}{(n+1)4|\mathsf {LT}_n|^2}\left( \sum _{\begin{array}{c} t\in \mathsf {LT}_n\\ \varepsilon \in \{-1,1\} \end{array}}(f(t,\varepsilon )-f(t,-\varepsilon ))^2\right) \left( \sum _{\begin{array}{c} t\in \mathsf {LT}_n\\ \varepsilon \in \{-1,1\} \end{array}}1^2\right) \\&\quad \ge \frac{1}{4(n+1)}\left( \frac{2}{|\mathsf {LT}_n|}\sum _{\begin{array}{c} t\in \mathsf {LT}_n \end{array}}f_1(t)-\frac{2}{|\mathsf {LT}_n|}\sum _{\begin{array}{c} t\in \mathsf {LT}_n \end{array}}f_{-1}(t)\right) ^2=\frac{4}{n+1}{\mathbb {V}}({\mathbb {E}}(f|\varepsilon )). \end{aligned}$$

We thus have

$$\begin{aligned}&{\widetilde{\gamma }}\ge \frac{n}{2(n+1)}\left( {\mathcal {E}}_X(f_1,f_1)+{\mathcal {E}}_X(f_{-1},f_{-1})\right) +\frac{2}{n+1}{\mathbb {V}}({\mathbb {E}}(f|\varepsilon ))\\&\quad \ge \frac{n\gamma }{2(n+1)}({\mathbb {V}}(f|\varepsilon =1)+{\mathbb {V}}(f|\varepsilon =-1))+\frac{2}{n+1}{\mathbb {V}}({\mathbb {E}}(f|\varepsilon ))\\&\quad = \frac{n\gamma }{n+1}{\mathbb {E}}({\mathbb {V}}(f|\varepsilon ))+\frac{2}{n+1}{\mathbb {V}}({\mathbb {E}}(f|\varepsilon )), \end{aligned}$$

where \(\gamma \) is the spectral gap of the leaf translation Markov chain \(X\) on \(\mathsf {LT}_n\). Finally, using Theorem 4, the variance decomposition formula and the fact that \({\mathbb {V}}_\pi (f)=1\), we obtain that, for some constants \(C'\) and C,

$$\begin{aligned} {\widetilde{\gamma }}\ge \frac{n}{(n+1)n^{\frac{9}{2}}}C'({\mathbb {E}}({\mathbb {V}}_\pi (f|\varepsilon ))+{\mathbb {V}}_\pi ({\mathbb {E}}_\pi (f|\varepsilon )))\ge \frac{C}{n^{\frac{9}{2}}}. \end{aligned}$$

\(\square \)
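Two ingredients of this proof, the variance decomposition \({\mathbb {V}}(f)={\mathbb {E}}({\mathbb {V}}(f|\varepsilon ))+{\mathbb {V}}({\mathbb {E}}(f|\varepsilon ))\) and the Cauchy–Schwarz lower bound for the sign-flip sum, can be checked on arbitrary data. In the sketch below the lists `f_plus` and `f_minus` are hypothetical stand-ins for \(f_1\) and \(f_{-1}\) on a finite set playing the role of \(\mathsf {LT}_n\); no actual trees are involved.

```python
import random


def mean(xs):
    return sum(xs) / len(xs)


def variance(xs):
    m = mean(xs)
    return sum((x - m) ** 2 for x in xs) / len(xs)


def decomposition_terms(f_plus, f_minus, n):
    """Return V(f), E(V(f|eps)), V(E(f|eps)) and the sign-flip sum.

    f is uniform on the disjoint union of the two lists, which must have equal
    length N; the sign-flip sum is
    sum over (t, eps) of (f(t,eps) - f(t,-eps))^2 / (2N(n+1)).
    """
    size = len(f_plus)
    total_var = variance(f_plus + f_minus)
    exp_cond_var = (variance(f_plus) + variance(f_minus)) / 2
    var_cond_exp = variance([mean(f_plus), mean(f_minus)])
    flip_sum = sum(2 * (x - y) ** 2 for x, y in zip(f_plus, f_minus)) / (2 * size * (n + 1))
    return total_var, exp_cond_var, var_cond_exp, flip_sum


# Example on seeded random data (values are arbitrary, chosen only for illustration).
rng = random.Random(0)
f_plus = [rng.gauss(0.0, 1.0) for _ in range(50)]
f_minus = [rng.gauss(0.3, 2.0) for _ in range(50)]
total, ecv, vce, flip = decomposition_terms(f_plus, f_minus, n=7)
```

In this example one expects `total` to agree with `ecv + vce` up to rounding, and `flip` to be at least `4 / (n + 1) * vce`.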

We are now ready to set up a comparison between the chains \({\widetilde{X}}\) and \({\mathcal {F}}^{n,\bullet }\). In order to do this, we will devote the next section to explicitly building sequences of quadrangulation edge flips that turn \(\phi (t,\varepsilon )\) into \(\phi (t',\varepsilon ')\), where \((t,\varepsilon )\) and \((t',\varepsilon ')\) are elements of \(\mathsf {LT}_n\times \{-1,1\}\) that are adjacent according to the graph of the Markov chain \({\widetilde{X}}\).

In particular, for each \(t,v,x,\varepsilon \) such that \(t\in \mathsf {LT}_n\), v is a leaf of t, \(x\in \{\rightarrow ,\leftarrow ,+,=,-\}\) and \(\varepsilon =\pm 1\), we shall build a sequence of quadrangulation edge flips \(P_\varepsilon (t,t^{v,x})=(q_i,e_i,s_i)_{i=1}^N\), such that

  • \(q_i\in {\mathsf {Q}}_n^\bullet \), \(e_i\in E(q_i)\), \(s_i\in \{+,-\}\);

  • \(q_1=\phi (t,\varepsilon )\);

  • \(q_{i+1}=q_i^{e_i,s_i}\), for \(i=1,2,\ldots ,N\);

  • \(q_{N+1}=\phi (t^{v,x},\varepsilon )\).

Notice that (as we remarked in Sect. 2) we can naturally identify vertices of \(q_i\) with vertices of \(q_{i+1}\) and edges of \(q_i\) with edges of \(q_{i+1}\) by building \(q_{i+1}\) via the procedure described for flipping \(e_i\). We will therefore often define edges \(e_1,\ldots ,e_N\) as edges of \(q_1\), since edges in \(E(q_1)\) have a natural interpretation in \(E(q_2),\ldots ,E(q_N)\).

Similarly, we will also build sequences \(P(t)=(q_i,e_i,s_i)_{i=1}^N\) such that \(q_1=\phi (t,1)\), \(q_{N+1}=\phi (t,-1)\) and \(q_{i+1}=q_i^{e_i,s_i}\).

Having constructed these in an appropriate way, a comparison of the Markov chains \({\mathcal {F}}^{n,\bullet }\) and \({\widetilde{X}}\) (cf. [15]) will yield Theorem 1, provided that we can bound the maximum length of a flip sequence with (a constant times) n and show that each triple \((q,e,s)\) (where \(q\in {\mathsf {Q}}^\bullet _n\), \(e\in E(q)\), \(s\in \{+,-\}\)) only appears in at most a constant number of sequences P(t) and \(P_\varepsilon (t,t^{v,x})\), independent of n. Constructing our flip sequences and proving such bounds will be the aim of the next three subsections; Sect. 6.5 will conclude by deriving Theorem 1.

6.2 The sequence P(t)

As a matter of fact, we have already discussed the sequence P(t) in Lemma 2.2.

From the Schaeffer construction within Sect. 3 one can immediately see that \(\phi (t,-1)\) can be obtained from \(\phi (t,1)\) by simply giving the root edge the opposite orientation. If the root edge is not a double edge, this can be achieved via flips by just flipping it three times in the same direction, so we can set \(P(t)=(q_i,e_i,s_i)_{i=1}^3\), with \(q_1=\phi (t,1)\), \(s_1=s_2=s_3=+\) and \(e_i\) being the root edge of \(q_i\), for \(i=1,2,3\).

If the root edge of \(\phi (t,1)\) is a double edge within a degenerate face, one need only perform an extra flip on one of the boundary edges of the degenerate face before and after flipping the root edge three times. Setting \(q_1=\phi (t,1)\) and assuming \(e'\) is the edge before the root edge in the clockwise contour of its degenerate face, we set \(P(t)=(q_i,e_i,s_i)_{i=1}^5\), with \(e_1=e'\), \(s_1=s_2=s_3=s_4=+\), \(e_i\) is the root edge of \(q_i\) for \(i=2,3,4\), and \(e_5=e'\), \(s_5=-\) (see Fig. 11).
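The two cases of the construction of P(t) can be summarised as follows. This is only a bookkeeping sketch: the quadrangulations \(q_i\) are not modelled, and the labels `"root"` (the root edge of the current quadrangulation) and `"e_prime"` (the edge \(e'\)) are hypothetical placeholders.

```python
def root_reversal_path(root_is_double_edge):
    """List of (edge label, flip direction) pairs making up P(t)."""
    # Three flips of the root edge in the same direction reverse its orientation.
    root_flips = [("root", "+")] * 3
    if not root_is_double_edge:
        return root_flips
    # Degenerate face: flip e' first, reverse the root, then flip e' back.
    return [("e_prime", "+")] + root_flips + [("e_prime", "-")]
```

The resulting lists have length 3 and 5 respectively, in line with the bound \(|P(t)|\le 5\).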

Lemma 6.2

For all \(t\in \mathsf {LT}_n\), we have \(|P(t)|\le 5\). Moreover, given a triple \((q,e,s)\) where \(q\in {\mathsf {Q}}_n^\bullet \), \(e\in E(q)\), \(s\in \{+,-\}\), we have

$$\begin{aligned} \left| \{t: (q,e,s)\hbox { appears in }P(t)\}\right| \le 9. \end{aligned}$$

Proof

The first part of the statement is clear by definition.

As for the second part, if t is a tree such that \((q,e,s)\) appears in P(t) and the root of \(\phi (t,1)\) is not a double edge, then \(\phi (t,-1)\) is obtained by flipping the root edge of q one, two or three times, so there are at most three possibilities for t.

If \((q,e,s)\) appears in P(t), where e is the root edge of q and the root edge of \(\phi (t,1)\) is a double edge, then one or more among q, \(q^{e,-}\) and \((q^{e,-})^{e,-}\) have one or both endpoints of the root edge of degree 2; setting \(e'\) to be such that \(e,e'\) share an endpoint of degree 2 in \(q'\in \{q,q^{e,-},(q^{e,-})^{e,-}\}\), \(\phi (t,1)\) must be of the form \(q'^{e',-}\), and therefore t must be one of at most 6 possible labelled trees.

If e is not the root edge of q and \(s=-\), then \(\phi (t,-1)=q^{e,s}\); if \(s=+\), \(\phi (t,1)=q\). \(\square \)

Fig. 11

A sequence of flips \(P(t)=(q_i,e_i,s_i)_{i=1}^5\), where the root edge of \(q_1=\phi (t,1)\) is a double edge within a degenerate face. Notice that the sequence \((q_2,e_2,s_2),(q_3,e_3,s_3),(q_4,e_4,s_4)\) is an example of \(P(t')\), for some \(t'\) such that the root of \(\phi (t',1)\) is not a double edge within a degenerate face

6.3 The colour change sequences \(P_\varepsilon (t,t^{v,-})\), \(P_\varepsilon (t,t^{v,=})\), \(P_\varepsilon (t,t^{v,+})\)

This section will be devoted to constructing the sequences \(P_\varepsilon (t, t^{v,c})\), where \(t\in \mathsf {LT}_n\), \(\varepsilon \in \{-1,1\}\), v is a leaf of t and \(c\in \{-,=,+\}\), i.e. sequences of quadrangulation edge flips whose aim is to achieve a “colour change”, or equivalently a “label change”, from the leaf translation Markov chain \({\widetilde{X}}\) on \(\mathsf {LT}_n\times \{-1,1\}\).

Given a leaf label change \((t, t^{v,c})\), where \(c\in \{=,+,-\}\), we need to construct a sequence \(P_\varepsilon (t, t^{v,c})=(q_i,e_i,s_i)_{i=1,\ldots ,N}\) such that \(q_1=\phi (t,\varepsilon )\), \(q_{i+1}=q_i^{e_i,s_i}\) and \(q_{N+1}=\phi (t^{v,c},\varepsilon )\). Our aim will then consist in estimating the maximum length N of such sequences in terms of n, as well as the number of quadruples \((t,v,c,\varepsilon )\), where \(t\in \mathsf {LT}_n\), v is a leaf in t, \(c\in \{=,+,-\}\) and \(\varepsilon \in \{-1,1\}\), such that a fixed triple \((q,e,s)\) appears in the sequence \(P_\varepsilon (t, t^{v,c})\).

We shall describe explicitly all sequences of the form \(P_\varepsilon (t, t^{v,+})\), where \(l(v)=l(p(v))\) in t, and \(P_\varepsilon (t, t^{v,-})\), where \(l(v)-l(p(v))=1\) in t, that is, “colour changes” from \(=\) to \(+\) and from \(+\) to −. All other sequences will be built from these in the natural way, by concatenating them and/or running them backwards. If \(t^{v,x}=t\), we set \(P_\varepsilon (t,t^{v,x})\) to be empty.

Let us first consider the case of a colour change from \(=\) to \(+\), which is easily dealt with.

Lemma 6.3

Given \((t,\varepsilon )\in \mathsf {LT}_n\times \{-1,1\}\) and a leaf v of t such that \(l(v)=l(p(v))\), the quadrangulations \(q=\phi (t,\varepsilon )\) and \(q'=\phi (t^{v,+},\varepsilon )\) differ by an edge flip.

Proof

Consider all corners of t but the one corner around the leaf v; their target corners as determined by the Schaeffer bijection are unaffected by increasing the label of v, by definition. In particular, the two corners immediately before and after the corner of v in the contour, which are corners of p(v), share the same target corner before and after the label change, making the quadrangulation face which corresponds to the tree edge \((v,p(v))\) a degenerate face. The only effect of the label increase is that of changing the target of the v corner to the appropriate corner of p(v), i.e. flipping the double edge e within the aforementioned degenerate face of \(\phi (t,\varepsilon )\). Also notice that the edge issued from the root corner of t (which cannot be the corner of the leaf v) is unaffected both by the label change and by the flip, so we do have \(\phi (t^{v,+},\varepsilon )=\phi (t,\varepsilon )^{e,+}\). \(\square \)

It is therefore natural, when \(l(v)=l(p(v))\), to set \(P_\varepsilon (t,t^{v,+})\) to \((q,e,+)\), where e is the double edge of \(\phi (t,\varepsilon )\) incident to the vertex v. Similarly, we will set \(P_\varepsilon (t^{v,+},(t^{v,+})^{v,=})=P_\varepsilon (t^{v,+},t)=(q^{e,+},e,-)\), thus covering all the cases of a colour change from \(+\) to \(=\) and vice-versa with flip sequences of length 1.

The construction of sequences of the type \(P_\varepsilon (t,t^{v,-})\) is less immediate. First, let us give a “static” description of how the label change affects the corresponding quadrangulation. Recall the mapping \(\phi \) sending \((t,\varepsilon )\in \mathsf {LT}_n \times \{-1,1\}\) to \(q\in {\mathsf {Q}}_n^\bullet \) as described in Sect. 3. The quadrangulation q is constructed via the map \(\phi \) by considering each corner c of t and drawing an edge from c to another corner which we refer to as the target corner of c, henceforth denoted by t(c). Recall also that \(\delta \) denotes the distinguished vertex of q.

Lemma 6.4

Consider a pair \((t,\varepsilon )\in \mathsf {LT}_n\times \{-1,1\}\) and a leaf v of t such that \(l(v)=l(p(v))+1\); let c be the corner of p(v) right after the corner of v in the clockwise contour of t. Suppose the target corner t(c) of c is adjacent to a vertex \(w\in V(t)\) and consider the quadrangulation edges e and \(e'\) drawn by the Schaeffer correspondence between c and t(c), and between t(c) and t(t(c)), respectively. Let \(e_1,\ldots ,e_k\) be the edges incident to w lying strictly between e and \(e'\), in clockwise order around w. If t(c) is instead the corner around \(\delta \), let \(e_1,\ldots ,e_k\) be all quadrangulation edges incident to \(\delta \).

The quadrangulation \(q'=\phi (t^{v,-},\varepsilon )\) can be obtained from \(q=\phi (t,\varepsilon )\) by “rerouting” the edges \(e_1,\ldots ,e_k\) to v—which is done by erasing their intersection with a suitably small neighbourhood of w (or \(\delta \)) and replacing it with paths to v drawn in the natural planar way—then replacing the quadrangulation edge issued from v with one joining v to t(t(c)) (or to \(\delta \), if t(c) is the corner around \(\delta \)): see Fig. 12. Notice that, if the root edge of \(q=\phi (t,\varepsilon )\) is one of \(e_1,\ldots ,e_k\), then it is “rerouted” to v and maintains the same orientation in \(q'\) (and, if it is not, then it is “preserved”).

Fig. 12

The quadrangulations \(\phi (t,1)\) and \(\phi (t^{v,-},1)\) for a tree \(t\in \mathsf {LT}_5\) such that \(l(v)=l(p(v))+1\)

Proof

Let \(c_1,\ldots ,c_{2n}\) be the corners in the clockwise contour of t, and suppose \(c_l\) is the corner around the leaf v. Let us first suppose that \(l(c_{l+1})\) is not the minimal label in the tree (i.e. that its target is not the corner around \(\delta \)). Notice that decreasing the label of v (by 2) does not affect any edges drawn by the Schaeffer correspondence other than

  • the edge drawn from \(c_l\) to \(c_{l+1}\), which is replaced by an edge between \(c_l\) and \(t(t(c_{l+1}))\), that is the first corner labelled \(l(p(v))-2\) in the clockwise contour after \(c_l\);

  • all edges drawn from \(c_i\) to \(t(c_i)\), where \(t(c_i)=t(c_{l+1})\) and \(c_i\) does not lie between \(c_l\) and \(t(c_{l+1})\), since the target of \(c_i\) becomes \(c_l\) if the label of \(c_l\) is decreased by 2; indeed, those are replaced by edges joining \(c_i\) to \(c_l\).

The fact that all other targets remain the same should be clear: if only the label of \(c_l\) is changed, the target of a corner \(c\ne c_l\) may change only by becoming \(c_l\) or by no longer being \(c_l\). Since \(c_l\) is not the target of any other corner in t, the edges affected are those for which \(c_l\) lies between their origin corner and their target corner, labelled \(l(c_{l+1})-1\), in the clockwise contour, as described above.

All that is left to show is that those edges are \(e_1,\ldots ,e_k\); indeed they are edges whose target is a corner of w and whose origin corner comes after \(t(c_{l+1})\) and before \(c_l\) (or, equivalently, strictly before \(c_{l+1}\)), in the (cyclic) clockwise contour; that is, they lie strictly between \(e=(c_{l+1},t(c_{l+1}))\) and \(e'=(t(c_{l+1}),t(t(c_{l+1})))\).

If \(t(c_{l+1})\) is the corner around \(\delta \) in t, then what happens when the label of v is changed is even simpler, since v becomes the unique vertex with minimal label: all corners whose target in t is the corner around \(\delta \) change their target to \(c_l\) in \(t^{v,-}\), including \(c_{l-1}\) and \(c_{l+1}\), while the edge \((c_l,c_{l+1})\) is replaced by one joining v to \(\delta \), which will have degree 1 in \(\phi (t^{v,-},\varepsilon )\). \(\square \)

Before we give an actual description of a sequence of quadrangulation flips that achieves exactly the changes described by Lemma 6.4, we shall construct a sequence of flips that will be useful in what follows and whose only aim is to change the root edge of a quadrangulation by “exchanging” two edges.

Lemma 6.5

(Rerooting) Consider a quadrangulation \(q\in {\mathsf {Q}}_n^\bullet \); let \(e=(v,w)\) be its root edge (with either v or w being the origin) and let \(\eta \) be the edge after e in clockwise order around v, which we suppose distinct from e. Let \(q'\) be the same quadrangulation (with the same distinguished vertex), rerooted in \(\eta \) and with v as the origin if and only if v was the origin in q. Define the sequence of quadrangulation flips \(P(q,q')=(q_i,e_i,s_i)_{i=1}^5\) so that \(q_{i+1}=q_i^{e_i,s_i}\), \(q_1=q\), \(e_1=e_3=e_5=\eta \), \(e_2=e_4=e\), \(s_1=s_2=s_3=+\), \(s_4=s_5=-\) (see Fig. 13). Then \(q_6:=q_5^{e_5,s_5}=q'\).

Proof

Consider the union of the faces of q that are adjacent to e or to \(\eta \); unless either e or \(\eta \) is a double edge within a degenerate face, this is a (generalised) octagon, in the sense that its boundary has an inner contour with exactly 8 corners, which we can cyclically number as \(c_0,\ldots ,c_7\). We can suppose \(\eta \) joins \(c_0\) to \(c_5\) and e joins \(c_0\) to \(c_3\); it is then immediate to verify, as in Fig. 13, that the given sequence of flips ultimately results in \(\eta \) joining \(c_0\) to \(c_3\) and e joining \(c_0\) to \(c_5\).

An analogous check can be performed for the case where e or \(\eta \) is a double edge, where one deals with a hexagon rather than an octagon (lower part of Fig. 13). \(\square \)

Fig. 13

The flip path \(P(q,q')=(q_i,e_i,s_i)_{i=1}^5\) reroots q, rooted in e, in the edge \(\eta \); above, the case where the union of faces adjacent to e and \(\eta \) has 8 corners. Below, the case where \(\eta \) is a double edge within a degenerate face; the case where e is a double edge is analogous

We can now present the construction of the flip path corresponding to a colour change from \(+\) to − on an edge \((v,p(v))\), where v is a leaf of a labelled tree t:

Lemma 6.6

(Construction of the path, \(+\) to −) With the same notation as in Lemma 6.4, consider \(t,v,\varepsilon \) and the edges \(e_1,\ldots , e_k\) of \(\phi (t,\varepsilon )\), all incident to a vertex w (possibly equal to \(\delta \)). If the root edge of \(\phi (t,\varepsilon )\) does not belong to \(\{e_1,\ldots ,e_k\}\), then set \(P_\varepsilon (t,t^{v,-})=(q_i,e_i,-)_{i=1}^k\), where \(q_1=\phi (t,\varepsilon )\) and \(q_{i+1}=q_i^{e_i,-}\). If some \(e_j\) is the root of \(\phi (t,\varepsilon )\), then set \(P_\varepsilon (t,t^{v,-})\) to be the path described above, with the sequence of flips described in Lemma 6.5 for the rotation of the root edge \(e_j\) around its endpoint that is not w, injected right before the flip \((q_j,e_j,-)\). For simplicity, we shall not renumber the quadrangulations \(q_1,\ldots ,q_k\) in this case, but \(q_j\) and \(q_{j+1}\) will be 6 flips rather than one flip apart.

Then \(q_k^{e_k,-}=\phi (t^{v,-},\varepsilon )\).

Proof

Consider the edges adjacent to v in the quadrangulation \(\phi (t^{v,-},\varepsilon )\) and number them as \(\eta _0,\ldots ,\eta _k\), in clockwise order around v, so that \(\eta _0\) is the (unique) edge of \(\phi (t^{v,-},\varepsilon )\) joining v to p(v).

We can show by induction that the quadrangulation \(q_i\) is the quadrangulation \(q_1\), where edges \(e_1,\ldots ,e_{i-1}\) have been replaced by edges \(\eta _1,\ldots ,\eta _{i-1}\) (and the natural edge identification after the flip sequence pairs \(e_1\) with \(\eta _1\), \(e_2\) with \(\eta _2\), and so on)—see Fig. 14. Indeed, if this is true for \(q_i\), then the edge \(e_i\) as seen in \(q_i\) is adjacent to two faces, one in whose clockwise contour it’s preceded by \(\eta _{i-1}\), one in whose clockwise contour it’s preceded by \(e_{i+1}\) (where we set \(e_{k+1}\) to be the edge between t(c) and t(t(c))). It follows that flipping \(e_i\) counterclockwise results in an edge joining the appropriate endpoint of \(e_{i+1}\) to v, which does correspond to creating \(\eta _i\). Since \(\eta _0\) can already be identified with an edge of \(q_1\), the final quadrangulation \(q_k^{e_k,-}\) is \(\phi (t^{v,-},\varepsilon )\), up to rerooting.

If some \(e_j\) is the root edge of \(q_1=\phi (t,\varepsilon )\), then, as described in Lemma 6.4, \(\eta _{j-1}\) is the root edge of \(\phi (t^{v,-},\varepsilon )\), oriented towards v if and only if the root of \(q_1\) is oriented towards w. We have shown that in \(q_j\) edges \(e_j\) and \(\eta _{j-1}\) do belong to the same face, \(e_j\) coming right after \(\eta _{j-1}\) in its clockwise contour; performing the appropriate root rotation sequence of flips on \(q_j\) has exactly the effect of rerooting \(q_j\) in \(\eta _{j-1}\) (with the desired orientation). After that is done, one can proceed with the “normal” flip sequence to obtain \(q_k^{e_k,-}\), which is now rooted correctly. \(\square \)

This concludes the description of our canonical flip paths corresponding to label changes of leaves: we simply set \(P_\varepsilon (t,t^{v,-})\), when \(l(v)=l(p(v))\) in t, to be the concatenation of \(P_\varepsilon (t,t^{v,+})\) and \(P_\varepsilon (t^{v,+},(t^{v,+})^{v,-})\), while in general \(P_\varepsilon (t^{v,x},t)\) is the reverse path of \(P_\varepsilon (t,t^{v,x})\) (for \(x\in \{+,-,=\}\)).

In order to compare spectral gaps as we did in Sect. 6.1, we need to estimate the maximum length of a flip path of the form \(P_\varepsilon (t,t^{v,x})\) and the number of paths involving any fixed quadrangulation edge flip; we do this via the following two lemmas.

Lemma 6.7

Given \(t\in \mathsf {LT}_n\), a leaf v of t and \(x\in \{+,-,=\}\), the length of the flip path \(P_\varepsilon (t,t^{v,x})\), for \(\varepsilon \in \{-1,1\}\), is at most \(2n+6\).

Proof

The flip sequence \(P_\varepsilon (t,t^{v,-})\), if we ignore the possible root rotating subsequence, does not flip “the same edge” twice, so it has length at most 2n (in fact, it has length at most the maximum degree of a vertex in \(\phi (t,\varepsilon )\), since all flipped edges are adjacent to the same vertex). The root rotating subsequence has length 5 and the path \(P_\varepsilon (t,t^{v,+})\) has length 1, hence the above estimate. \(\square \)
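As a sanity check on the constant, the three contributions named in the proof can be added up explicitly; the grouping below is our reading of the argument, with the longest case being \(x=-\) when \(l(v)=l(p(v))\):

$$\begin{aligned} \left| P_\varepsilon (t,t^{v,-})\right| \le \underbrace{1}_{P_\varepsilon (t,t^{v,+})}+\underbrace{2n}_{\text {flips of distinct edges}}+\underbrace{5}_{\text {root rotation}}=2n+6. \end{aligned}$$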

Fig. 14

The flip path \(P_\varepsilon (t,t^{v,-})\). Since \(q_1\) is rooted in \(e_4\), a root rotating sequence is inserted right before flipping \(e_4\), thus correctly rooting the final quadrangulation in \(\eta _3\), i.e. the flipped image of \(e_3\)

Lemma 6.8

Let \((q,e,s)\) be a triple with \(q\in {\mathsf {Q}}_n^\bullet \), \(e\in E(q)\), \(s\in \{+,-\}\); then there is a constant C such that there are at most C quadruples \((t,v,x,\varepsilon )\), where \(t\in \mathsf {LT}_n\), v is a leaf in t, \(x\in \{+,-,=\}\) and \(\varepsilon =\pm 1\), for which \((q,e,s)\) appears in the flip path \(P_\varepsilon (t,t^{v,x})\).

Proof

Let us first consider quadruples of the form \((t,v,-,\varepsilon )\).

Suppose t, v are such that \(l(v)-l(p(v))=1\) and \((q,e,s)\) appears in \(P_\varepsilon (t,t^{v,-})=(q_i,e_i,s_i)_{i=1}^k\) (that is, \(q=q_l\), \(e=e_l\) and \(s=s_l\) for some \(l\in \{1,\ldots ,k\}\)); we first consider the case where no root rotation sequence appears in \(P_\varepsilon (t,t^{v,-})\). Let \(v_1\) and \(v_2\) be the endpoints of e in \(q^{e,s}\); then v corresponds to one of these two vertices in \(q_1=\phi (t,\varepsilon )\), since every edge \(e_i\) is adjacent to v in \(q_i^{e_i,-}\) and subsequent quadrangulations in the flip path. In particular, since the degree of v in \(q_1\) is 1 and it increases by 1 with each flip in \(P_\varepsilon (t, t^{v,-})\), if \(d_1\) and \(d_2\) are the respective degrees of \(v_1\) and \(v_2\) in q, then \(l=d_1\) or \(l=d_2\), according to whether v is \(v_1\) or \(v_2\). Suppose v is \(v_1\); let \(\eta _0,\ldots ,\eta _{d_1-1}\) be the edges incident to \(v_1\) in q, in clockwise order around \(v_1\), numbered so that e will end up in between \(\eta _{d_1-1}\) and \(\eta _0\) in \(q^{e,s}\). Then \(q_1=\phi (t,\varepsilon )=(\cdots ((q^{\eta _{d_1-1},+})^{\eta _{d_1-2},+})\cdots )^{\eta _1,+}\), since \(e_i\) corresponds to the edge \(\eta _{d_{i-1}}\) in q. We therefore have only 2 possibilities for \((t,v)\).

Now suppose a root rotation sequence does appear in \(P_\varepsilon (t,t^{v,-})\); if it appears strictly after \((q,e,s)\), then the reasoning above is still valid. If it appears strictly before, then the root of q is some \(\eta _j\), and, reasoning as before, the quadrangulation \(q_1\) is recovered by inserting the reverse of a root rotation sequence right before applying the clockwise flip of \(\eta _j\) (or right at the end if \(j=0\)). We are left to deal with the case where \((q,e,s)\) actually belongs to a root rotation sequence. Notice that the number of possibilities for the quadrangulation \(q'\) obtained at the end of the root rotation sequence is bounded by a constant independent of n, since the sequence only acts within (at most) three adjacent faces, two of which are adjacent to e. Having established some \(q'\) to be the quadrangulation in question, the vertex v must be one of the endpoints of the root edge \(e'\) of \(q'\). As before, we can now reconstruct \(q_1\) by labelling \(\eta _0,\ldots ,\eta _{d_{i-1}}\) the edges incident to this endpoint and performing the appropriate reverse root rotation sequence, followed by clockwise flips on \(\eta _{d_{i-1}},\ldots ,\eta _1\).

Similarly, suppose t, v are such that \(l(v)=l(p(v))\) and \((q,e,s)\) appears in \(P_\varepsilon (t,t^{v,-})\); if \((q,e,s)\) appears in \(P_\varepsilon (t^{v,+},t^{v,-})\), then \(\phi (t^{v,+},\varepsilon )\) can be reconstructed as above, hence \(t=(t^{v,+})^{v,=}\). Otherwise we have \((q,e,s)=P_\varepsilon (t,t^{v,+})\), hence \(q=\phi (t,\varepsilon )\).

Now, since \((q,e,-)\) appears in \(P_\varepsilon (t,t^{v,-})\) if and only if \((q^{e,-}, e, +)\) appears in \(P_\varepsilon (t^{v,-}, t)\), we have thus also covered the cases where \((t,v,x)\) is such that \(l(v)=l(p(v))-1\), which correspond to another 4 possibilities.

Finally, the missing cases (\(x\in \{+,=\}\) and \(l(v)-l(p(v))\in \{1,0\}\)) are completely straightforward, since they correspond to a flip path of length 1, and therefore imply that \((t,\varepsilon )=\phi ^{-1}(q)\) and v is the one vertex whose label is changed by the flip.

This results, indeed, in a number of possibilities for the quadruple \((t,v,x,\varepsilon )\) that is bounded independently of n. \(\square \)

6.4 The leaf translation sequences \(P_\varepsilon (t,t^{v,\rightarrow })\), \(P_\varepsilon (t,t^{v,\leftarrow })\)

The other type of “move” we wish to emulate via quadrangulation flips is the translation of a leaf left or right in the contour of the tree. In doing this, we may suppose the leaf has the same label as its parent, and deal with all other cases by prefixing and appending flip paths of the type \(P_\varepsilon (t,t^{v,=})\) and \(P_\varepsilon (t^{v,=},t)\), which we have constructed in the previous section.

Fig. 15

The quadrangulations \(\phi (t,\varepsilon )\) and \(\phi (t^{v,\rightarrow },\varepsilon )\), drawn in a case where \(l(c_{l+2})=l(v)-1\)

The description of \(\phi (t^{v,\rightarrow },\varepsilon )\) in terms of \(\phi (t,\varepsilon )\) is rather simple and depicted in Fig. 15:

Lemma 6.9

Consider \((t,\varepsilon )\in \mathsf {LT}_n\times \{-1,1\}\) and let v be a leaf of t such that \(l(v)=l(p(v))\). Let \(c_1,\ldots ,c_{2n}\) be the clockwise contour of t and suppose \(c_l\), with \(2\le l\le 2n-1\), is the corner of v (notice that, if we had \(l=2n\), we would have \(t^{v,\rightarrow }=t\)). Then \(\phi (t^{v,\rightarrow },\varepsilon )\) can be obtained from \(\phi (t,\varepsilon )\) by

  • identifying the two edges \((c_{l-1}, t(c_{l-1}))\) and \((c_{l+1},t(c_{l+1}))\) and erasing the double edge \((c_l,c_{l+1})\) (that is, eliminating the one degenerate face of \(\phi (t,\varepsilon )\) which corresponds to the tree edge (p(v), v));

  • replacing the edge \((c_{l+2},t(c_{l+2}))\) by a degenerate face whose internal vertex is adjacent to the vertex of \(t(c_{l+2})\).

Notice that it is possible that either the edge \((c_{l-1},t(c_{l-1}))\) (if \(l=2\)) or the edge \((c_{l+2},t(c_{l+2}))\) (if \(l=2n-1\)) is the root edge of \(\phi (t,\varepsilon )\). In the former case, the root of \(\phi (t^{v,\rightarrow },\varepsilon )\) is the edge obtained from identifying \((c_{l-1}, t(c_{l-1}))\) and \((c_{l+1},t(c_{l+1}))\), oriented as before; in the latter, it is the second edge of the new degenerate face in clockwise order around the vertex of \(c_{l+2}\), oriented as \((c_{l+2},t(c_{l+2}))\) was.

Proof

Given \(t\in \mathsf {LT}_n\) and \(\varepsilon \in \{-1,1\}\), a leaf v such that \(l(v)=l(p(v))\) is the internal vertex of a degenerate face in \(\phi (t,\varepsilon )\); removing the leaf (and the tree edge joining it to its parent) results in a tree \(t'\in \mathsf {LT}_{n-1}\) and a quadrangulation \(\phi (t',\varepsilon )\) in which the face is eliminated by identifying the two edges of its boundary (into a root edge with the same orientation as before in the case where one of them was root edge in \(\phi (t,\varepsilon )\)).

Notice that erasing the leaf v from t and from \(t^{v,\rightarrow }\) yields the same tree \(t'\). The quadrangulation \(\phi (t^{v,\rightarrow },\varepsilon )\) can thus be obtained by first performing the operation described above to build \(\phi (t',\varepsilon )\) and then performing it “in reverse” by replacing the appropriate edge (which is the one drawn from the corner of \(t'\) that contains the edge joining v to p(v) in \(t^{v,\rightarrow }\)) with a degenerate face (see Fig. 15). \(\square \)

The quadrangulation flip path \(P_\varepsilon (t,t^{v,\rightarrow })\) will depend on the label of the vertex w of corner \(c_{l+2}\) in t; since the cases where \(l(w)=l(v)\) and \(l(w)=l(v)+1\) are simpler (we can construct a path of length 1 in the first case and 3 in the second!), we refer to Figs. 16 and 17 for its construction, which only involves flips within two adjacent faces of \(\phi (t,\varepsilon )\). Notice that, furthermore, the construction preserves the edges issued from \(c_{l-1}\) and \(c_{l+2}\), so that the root edge is automatically the correct one in the quadrangulations \(q_2\) from Fig. 16 and \(q_4\) from Fig. 17.

The case where \(l(w)=l(v)-1\) is more complex, and will be treated in the following lemma.

Fig. 16

The flip path \(P_\varepsilon (t,t^{v,\rightarrow })\) has length 1 when the edge \((v,p(v))\) is replanted onto a vertex which has the same label as v and p(v). Notice that the edge that needs to be flipped in order to change \(\phi (t,\varepsilon )\) into \(\phi (t^{v,\rightarrow },\varepsilon )\) cannot be the root edge of \(\phi (t,\varepsilon )\), since it is not issued from the first corner of t

Fig. 17

The flip path \(P_\varepsilon (t,t^{v,\rightarrow })\) when \(l(w)=l(v)+1\), where w is the vertex that becomes the new parent of v in \(t^{v,\rightarrow }\)

Lemma 6.10

(Flip path construction, leaf translation onto smaller label) Consider a tree \(t\in \mathsf {LT}_n\) with contour \(c_1,\ldots ,c_{2n}\) and let v be a leaf of t such that \(l(v)=l(p(v))\), adjacent to a corner \(c_l\) with \(2\le l\le 2n-1\). Let w be the vertex of the corner \(c_{l+2}\) and further suppose that \(l(w)=l(v)-1\).

For \(\varepsilon \in \{-1,1\}\), let \(e_1,\ldots ,e_{k-1}\) be the edges of \(\phi (t,\varepsilon )\) that are adjacent to w and lie strictly between the edge \(e_k=(c_l,c_{l+2})\) and the edge \((c_{l+2},t(c_{l+2}))\), in clockwise order around w. If none of them is the root edge of \(\phi (t,\varepsilon )\), then we can set \(P_\varepsilon (t,t^{v,\rightarrow })=(q_i,e_i,-)_{i=1}^k\), and we have \(q_{k+1}:=q_k^{e_k,-}=\phi (t^{v,\rightarrow },\varepsilon )\).

If some \(e_j\) is the root edge of \(\phi (t,\varepsilon )\), then we can set \(P_\varepsilon (t,t^{v,\rightarrow })\) to be the same flip path as above, with the root rotation flip sequence from Lemma 6.5, performed around the endpoint of the root edge that is not w, inserted right before the flip \((q_j,e_j,-)\).

See Fig. 18 for the construction.

Fig. 18

The flip path \(P_\varepsilon (t,t^{v,\rightarrow })\) in the case where the edge \((v,p(v))\) is followed by a corner labelled \(l(p(v))-1\) in the clockwise contour of t

Proof

One can show inductively that, for \(i=2,\ldots ,k-1\), the quadrangulation \(q_i\) is obtained from \(q_1\) by collapsing the face that contains the edge \(e_k\) in \(q_1\) and replacing the edge \(e_i\) with a degenerate face whose internal vertex is adjacent to w; furthermore, the natural edge identification between \(q_1\) and \(q_i\) has \(e_k\) correspond to the internal edge of the degenerate face, while \(e_i\) corresponds to the ‘rightmost’ boundary edge of the newly created degenerate face in clockwise order around w. Then, \(q_k\) is obtained by flipping \(e_{k-1}\) counterclockwise. By Lemma 6.9, the only difference between \(q_k\) and \(\phi (t^{v,\rightarrow },\varepsilon )\) is the fact that the internal edge of the new degenerate face is incident to w in \(q_k\) and, potentially, the choice of the root edge (in the case where the root edge of \(q_1\) is among the flipped edges \(e_1,\ldots ,e_{k-1}\)). Flipping \(e_k\) in \(q_k\)—thus obtaining \(q_{k+1}\)—is enough to fix the first issue, and yields \(\phi (t^{v,\rightarrow },\varepsilon )\) up to rerooting. Suppose now that the root edge of \(q_1\) is some \(e_j\) (with \(1\le j\le k-1\)); the edge issued from the same corner in \(q_{k+1}\) is actually the flipped version of edge \(e_{j-1}\) (or the edge out of \(c_{l+1}\), which never gets flipped, in the case where \(l=2\): for ease of notation we will call it \(e_0\)). In the quadrangulation \(q_j\), \(e_j\) (not yet flipped and still the root edge) and \(e_{j-1}\) (already flipped, unless \(j=1\)) are consecutive in clockwise order around their endpoint that is not w. Performing the root rotation sequence before \((q_j,e_j,-)\) thus simply results in rerooting \(q_j\) in its edge \(e_{j-1}\) (with the correct orientation), which will not be flipped again and will end up being the correct root edge in \(q_{k+1}\) once the rest of the flips are performed. \(\square \)

As mentioned before, we construct \(P_\varepsilon (t,t^{v,\rightarrow })\) in general as the concatenation of \(P_\varepsilon (t,t^{v,=})\), \(P_\varepsilon (t^{v,=},(t^{v,=})^{v,\rightarrow })\) and \(P_\varepsilon ((t^{v,=})^{v,\rightarrow },((t^{v,=})^{v,\rightarrow })^{v,c})\), where c is chosen so as to “restore” the colour of \((v,p(v))\) to the original one from t. We can further set \(P_\varepsilon (t^{v,\rightarrow },(t^{v,\rightarrow })^{v,\leftarrow })\) to be the reverse sequence of \(P_\varepsilon (t,t^{v,\rightarrow })\) (keeping in mind that \(P_\varepsilon (t,t)\) is already set to be empty). Notice that, using Lemma 6.7 and the fact that the construction from Lemma 6.10, excluding root rotation sequences, does not flip the same edge twice, we immediately have

$$\begin{aligned} \left| P_\varepsilon (t^{v,d},t)\right| \le 6n+17 \end{aligned}$$
(4)

for all \(t\in \mathsf {LT}_n\), v leaf of t, \(d\in \{\rightarrow ,\leftarrow \}\), \(\varepsilon \in \{-1,1\}\).
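One way to account for the constant in (4), assuming that the two outer segments of the concatenation are bounded by Lemma 6.7 and that the middle segment consists of at most 2n flips of distinct edges plus one root rotation sequence of length 5, is the following sum (the grouping is our reading and is not spelled out in the text):

$$\begin{aligned} \underbrace{(2n+6)}_{P_\varepsilon (t,t^{v,=})}+\underbrace{(2n+5)}_{\text {translation of the leaf}}+\underbrace{(2n+6)}_{\text {restoring the colour}}=6n+17. \end{aligned}$$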

Additionally, we have the following lemma.

Lemma 6.11

Let \((q,e,s)\) be a triple with \(q\in {\mathsf {Q}}^\bullet _n\), \(e\in E(q)\), \(s\in \{+,-\}\); then there is a constant C such that there are at most C quadruples \((t,v,d,\varepsilon )\), where \(t\in \mathsf {LT}_n\), v is a leaf in t, \(d\in \{\rightarrow , \leftarrow \}\) and \(\varepsilon =\pm 1\), for which \((q,e,s)\) appears in the flip path \(P_\varepsilon (t,t^{v,d})\).

Proof

By Lemma 6.8, the number of such quadruples is at most a constant times the number of those where \(l(v)=l(p(v))\), so we shall restrict ourselves to the latter case; since \(P_\varepsilon (t,t^{v,\rightarrow })\) is the reverse of \(P_\varepsilon (t^{v,\rightarrow },t)\), we may also suppose \(d=\rightarrow \).

Suppose \((t,v,\rightarrow ,\varepsilon )\) is a quadruple such that \(l(v)=l(p(v))\) and \((q,e,s)\) appears in \(P_\varepsilon (t,t^{v,\rightarrow })\), and let \(c_1,\ldots ,c_{2n}\) be the clockwise contour of t, with \(c_l\) being the corner of v. If \(l(c_{l+2})=l(v)\), then \(q=\phi (t,\varepsilon )\) and v is uniquely determined from the flip \((q,e,s)\).

If \(l(c_{l+2})=l(v)+1\), then there are a few possibilities (refer again to Fig. 17). If q has one more degree one vertex than \(q^{e,s}\), then \(q=\phi (t,\varepsilon )\) and v is that degree one vertex. If \(q^{e,s}\) has one more degree one vertex than q, let \(e'\) be the edge issued from that vertex; then v is the vertex in question and \(\phi (t^{v,\rightarrow },\varepsilon )=(q^{e,s})^{e',-}\). Otherwise, e is a double edge within a degenerate face, \(q^{e,s}=\phi (t^{v,\rightarrow },\varepsilon )\) and v is its degree one endpoint.

Finally, if \(l(c_{l+2})=l(v)-1\), then consider the edge e in \(q^{e,s}\). First, let’s suppose that \((q,e,s)\) does not belong to a root rotating sequence. If e is the interior edge of a degenerate face, then \(\phi (t^{v,\rightarrow },\varepsilon )=q^{e,s}\) and v is its endpoint of degree 1. Otherwise, consider the two endpoints \(w_1\), \(w_2\) of e in \(q^{e,s}\); one of them must play the role of the vertex w from Lemma 6.10. If it is \(w_1\), then to the right of the oriented edge \((w_1,w_2)\) in \(q^{e,s}\) lies a degenerate face with an internal vertex connected to \(w_1\). If \((w_1,u)\) is the first oriented edge in clockwise order around \(w_1\) starting with \((w_1,w_2)\) such that u is strictly nearer to \(\delta \) than \(w_1\), the construction of the flip path implies that \(\phi (t^{v,\rightarrow },\varepsilon )\) can be obtained from \(q^{e,s}\) by collapsing the face lying right of \((w_1,w_2)\) and replacing \((w_1,u)\) with a new degenerate face whose internal edge is issued from u and has v as the other endpoint. An analogous argument holds for \(w_2\), giving rise to only two possibilities for \((t,v,\rightarrow )\). If \((q,e,s)\) actually belongs to a root rotation sequence, then the number of possibilities for the final quadrangulation \(q'\) obtained by completing the sequence is bounded by a constant; we can then use \(q'\) and its root edge in place of \(q^{e,s}\) and e to reconstruct the final quadrangulation \(\phi (t^{v,\rightarrow },\varepsilon )\) and its vertex v (from which one determines t as \((t^{v,\rightarrow })^{v,\leftarrow }\)) in (at most) two ways. \(\square \)

6.5 The final comparison between \({\mathcal {F}}^{n,\bullet }\) and \({\widetilde{X}}\)

Proof of Theorem 1

The upper bound for the spectral gap of \({\mathcal {F}}^n\) is Proposition 4.1; we set out to prove the lower bound.

Consider the chains \({\widetilde{X}}\) and \({\mathcal {F}}^{n,\bullet }\) from Sect. 6.1 and their respective spectral gaps \({\widetilde{\gamma }}\) and \(\nu _n^\bullet \), and let \(\nu _n\) be the spectral gap of \({\mathcal {F}}^n\). Also recall the Schaeffer correspondence \(\phi :\mathsf {LT}_n\times \{-1,1\}\rightarrow {\mathsf {Q}}_n^\bullet \) from Sect. 3 and the flip paths P(t) and \(P_\varepsilon (t,t^{v,x})\) constructed throughout Sects. 6.2, 6.3 and 6.4.

Let \(f:{\mathsf {Q}}_n^\bullet \rightarrow {\mathbb {R}}\) be a function such that \({\mathbb {V}}_{\pi ^\bullet }(f)=1\) (where \(\pi ^\bullet \) is the uniform measure on \({\mathsf {Q}}_n^\bullet \)) and \({\mathcal {E}}_{{\mathcal {F}}^{n,\bullet }}(f,f)=\nu _n^\bullet \). Then thanks to Corollary 6.1 we have

$$\begin{aligned}&Cn^{-\frac{9}{2}}\le {\widetilde{\gamma }}\le {\mathcal {E}}_{\widetilde{X}}(f\circ \phi ,f\circ \phi )\\&\quad =\frac{1}{2}\sum _{\begin{array}{c} t,t'\in \mathsf {LT}_n\\ \varepsilon ,\varepsilon '\in \{-1,1\} \end{array}}\left( f(\phi (t,\varepsilon ))-f(\phi (t',\varepsilon '))\right) ^2\frac{1}{2|\mathsf {LT}_n|}p_{\widetilde{X}}(t,t')\\&\quad =\frac{1}{2}\sum _{\begin{array}{c} t\in \mathsf {LT}_n\\ v\hbox { leaf of }t\\ x\in \{\rightarrow ,\leftarrow ,+,-,=\}\\ \varepsilon \in \{1,-1\} \end{array}}\left( f(\phi (t,\varepsilon ))-f(\phi (t^{v,x},\varepsilon ))\right) ^2\frac{1}{2|\mathsf {LT}_n|}\frac{1}{5(n+1)}\\&\qquad + \sum _{\begin{array}{c} t\in \mathsf {LT}_n \end{array}}\left( f(\phi (t,1))-f(\phi (t,-1))\right) ^2\frac{1}{2|\mathsf {LT}_n|}\frac{1}{n+1}. \end{aligned}$$

Now we may rewrite each difference within the sums above in terms of the images of subsequent quadrangulations appearing in the paths \(P_\varepsilon (t,t^{v,x})\) and P(t), apply the Cauchy–Schwarz inequality and tweak the constants in order to recover factors of the form \(p_{{\mathcal {F}}^{n,\bullet }}(q,q')\). We obtain that the expression above is at most

$$\begin{aligned}&\frac{1}{2|\mathsf {LT}_n|}\frac{1}{n}\left( \sum _{\begin{array}{c} t\in \mathsf {LT}_n\\ v\hbox { leaf of }t\\ x\in \{\rightarrow ,\leftarrow ,+,-,=\}\\ \varepsilon \in \{1,-1\} \end{array}} \left( \sum _{\begin{array}{c} i=0,\ldots ,|P_\varepsilon (t,t^{v,x})|\\ P_\varepsilon (t,t^{v,x})=(q_i,e_i,s_i)_{i=1}^N \end{array}} f(q_i)-f(q_i^{e_i,s_i})\right) ^2\right. \\&\qquad \left. + \sum _{\begin{array}{c} t\in \mathsf {LT}_n \end{array}} \left( \sum _{\begin{array}{c} i=0,\ldots ,|P(t)|\\ P(t)=(q_i,e_i,s_i)_{i=1}^N \end{array}}f(q_i)-f(q_i^{e_i,s_i})\right) ^2\right) \\&\quad \le 6\sum _{\begin{array}{c} t\in \mathsf {LT}_n\\ v \text{ leaf } \text{ of } t\\ x\in \{\rightarrow ,\leftarrow ,+,-,=\}\\ \varepsilon \in \{1,-1\} \end{array}} |P_\varepsilon (t,t^{v,x})| \sum _{\begin{array}{c} i=0,\ldots ,|P_\varepsilon (t,t^{v,x})|\\ P_\varepsilon (t,t^{v,x})=(q_i,e_i,s_i)_{i=1}^N \end{array}} (f(q_i)-f(q_i^{e_i,s_i}))^2\frac{1}{|{\mathsf {Q}}^\bullet _n|}\frac{1}{6n}\\&\qquad +6\sum _{\begin{array}{c} t\in \mathsf {LT}_n \end{array}}|P(t)| \sum _{\begin{array}{c} i=0,\ldots ,|P(t)|\\ P(t)=(q_i,e_i,s_i)_{i=1}^N \end{array}}(f(q_i)-f(q_i^{e_i,s_i}))^2\frac{1}{|{\mathsf {Q}}^\bullet _n|}\frac{1}{6n}. \end{aligned}$$
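The mechanism behind this estimate (telescoping each difference along a path, then using the Cauchy–Schwarz bound \((\sum _{i=1}^N a_i)^2\le N\sum _{i=1}^N a_i^2\)) can be checked numerically on a toy example. The sketch below does not involve quadrangulations: it assumes two hypothetical chains on the cycle \({\mathbb {Z}}_m\), a "fast" one jumping by \(\pm 2\) and a "slow" one jumping by \(\pm 1\), each move having probability 1/4, and routes every fast move through two slow moves. All function names are ours.

```python
import random


def slow_kernel(x, m):
    """Nearest-neighbour moves on Z_m, probability 1/4 each (otherwise stay put)."""
    return {(x + 1) % m: 0.25, (x - 1) % m: 0.25}


def fast_kernel(x, m):
    """Jumps of size 2 on Z_m, probability 1/4 each (otherwise stay put)."""
    return {(x + 2) % m: 0.25, (x - 2) % m: 0.25}


def dirichlet_form(f, kernel, m):
    """E(f,f) = 1/2 * sum_{x,y} (f(x)-f(y))^2 * pi(x) * p(x,y), with pi uniform."""
    total = 0.0
    for x in range(m):
        for y, p in kernel(x, m).items():
            total += (f[x] - f[y]) ** 2 * (1.0 / m) * p
    return total / 2.0


def canonical_path(x, y, m):
    """Two slow moves realising the fast move x -> y = x +/- 2 (assumes m >= 5)."""
    step = 1 if (y - x) % m == 2 else -1
    mid = (x + step) % m
    return [(x, mid), (mid, y)]


def comparison_constant(m):
    """(max path length) * (max number of paths through a slow move) * weight ratio."""
    load = {}
    max_len = 0
    for x in range(m):
        for y in fast_kernel(x, m):
            path = canonical_path(x, y, m)
            max_len = max(max_len, len(path))
            for edge in path:
                load[edge] = load.get(edge, 0) + 1
    # pi(x) p_fast(x, y) / (pi(z) p_slow(z, w)); pi is uniform, so only 1/4 / 1/4 remains
    weight_ratio = 0.25 / 0.25
    return max_len * max(load.values()) * weight_ratio


if __name__ == "__main__":
    m = 12
    rng = random.Random(0)
    f = [rng.uniform(-1.0, 1.0) for _ in range(m)]
    A = comparison_constant(m)
    # The fast Dirichlet form is dominated by A times the slow one.
    print(dirichlet_form(f, fast_kernel, m) <= A * dirichlet_form(f, slow_kernel, m))
```

In this toy setting the maximal path length is 2 and every slow move lies on exactly 2 paths. In the proof, the analogous roles are played by the maximal length of the flip paths and by the number of paths through a fixed flip.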

Given \(q\in {\mathsf {Q}}_n^\bullet \), \(e\in E(q)\), \(s\in \{+,-\}\), write \(C(q,e,s)\) for

$$\begin{aligned} \left| \{t,v,x,\varepsilon : (q,e,s) \text{ appears } \text{ in } P_\varepsilon (t,t^{v,x})\}\right| +\left| \{t: (q,e,s) \text{ appears } \text{ in } P(t)\}\right| . \end{aligned}$$

Thanks to Lemmas 6.2, 6.8 and 6.11, there is a constant M independent of n such that \(C(q,e,s)\le M\) for all \((q,e,s)\). Furthermore, Lemmas 6.2, 6.7 and (4) imply that \(\max \{|P(t)|,|P_\varepsilon (t,t^{v,x})|\}\le 7n\) for all \(n\ge 17\), \(t\in \mathsf {LT}_n\), v leaf of t, \(x\in \{\rightarrow ,\leftarrow ,+,=,-\}\), \(\varepsilon \in \{-1,1\}\). From this we obtain that, for some constant \(C'\),

$$\begin{aligned}&Cn^{-\frac{9}{2}}\le \frac{C'n}{2} \sum _{\begin{array}{c} q\in {\mathsf {Q}}^\bullet _n\\ e\in E(q)\\ s\in \{+,-\} \end{array}}C(q,e,s)(f(q)-f(q^{e,s}))^2\frac{1}{|{\mathsf {Q}}^\bullet _n|}\frac{1}{6n}\\&\quad \le C'n M\cdot {\mathcal {E}}_{{\mathcal {F}}^{n,\bullet }}(f,f)=C'n M\nu _n^\bullet \le C'M n \nu _n, \end{aligned}$$

where the last inequality follows from Lemma 2.3. Following the chain of inequalities, we have indeed shown that \(\nu _n\ge C_1n^{-\frac{11}{2}}\) for some constant \(C_1\) independent of n. \(\square \)
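For the reader's convenience, the two elementary steps used at the end of the proof can be written out. The threshold \(n\ge 17\) is exactly what makes the bound (4) fit under 7n, and the final exponent comes from dividing the chain of inequalities by \(C'Mn\), so that one may take \(C_1=C/(C'M)\):

$$\begin{aligned} 6n+17\le 7n&\iff n\ge 17,\\ Cn^{-\frac{9}{2}}\le C'Mn\,\nu _n&\implies \nu _n\ge \frac{C}{C'M}\,n^{-\frac{11}{2}}. \end{aligned}$$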