Complexity of Token Swapping and Its Variants

Bonnet, Édouard; Miltzow, Tillmann; Rzążewski, Paweł

doi:10.1007/s00453-017-0387-0

Complexity of Token Swapping and Its Variants

Open access
Published: 20 October 2017

Volume 80, pages 2656–2682, (2018)
Cite this article

Download PDF

You have full access to this open access article

Algorithmica Aims and scope Submit manuscript

Complexity of Token Swapping and Its Variants

Download PDF

4260 Accesses
13 Citations
9 Altmetric
Explore all metrics

Abstract

In the Token Swapping problem we are given a graph with a token placed on each vertex. Each token has exactly one destination vertex, and we try to move all the tokens to their destinations, using the minimum number of swaps, i.e., operations of exchanging the tokens on two adjacent vertices. As the main result of this paper, we show that Token Swapping is $W[1]$-hard parameterized by the length k of a shortest sequence of swaps. In fact, we prove that, for any computable function f, it cannot be solved in time $f(k)n^{o(k / \log k)}$ where n is the number of vertices of the input graph, unless the ETH fails. This lower bound almost matches the trivial $n^{O(k)}$-time algorithm. We also consider two generalizations of the Token Swapping, namely Colored Token Swapping (where the tokens have colors and tokens of the same color are indistinguishable), and Subset Token Swapping (where each token has a set of possible destinations). To complement the hardness result, we prove that even the most general variant, Subset Token Swapping, is FPT in nowhere-dense graph classes. Finally, we consider the complexities of all three problems in very restricted classes of graphs: graphs of bounded treewidth and diameter, stars, cliques, and paths, trying to identify the borderlines between polynomial and NP-hard cases.

Swapping Labeled Tokens on Graphs

Sequentially Swapping Colored Tokens on Graphs

The Time Complexity of the Token Swapping Problem and Its Parallel Variants

1 Introduction

In reconfiguration problems, we are interested to transform a combinatorial or geometric object from one state to another, by performing a sequence of simple operations. An important example is motion planning, where we want to move an object from one configuration to another. Elementary operations are usually translations and rotations. It turns out that motion planning can be reduced to the shortest path problem in some higher dimensional Euclidean space with obstacles [8].

Finding the shortest flip sequence between any two triangulations of a convex polygon is a major open problem in computational geometry. Interestingly it is equivalent to a myriad of other reconfiguration problems of so-called Catalan structures [4]. Examples include: binary trees, perfect matchings of points in convex position, Dyck words, monotonic lattice paths, and many more. Reconfiguring permutations under various constraints is heavily studied and usually called sorting.

An important class of reconfiguration problems is a big family of problems in graph theory that involves moving tokens, pebbles, cops or robbers along the edges of a given graph, in order to reach some final configuration [1, 5, 9, 11, 14, 16, 23, 31, 35]. In this paper, we study one of them.

The Token Swapping problem, introduced by Yamanaka et al. [36], fits nicely into this long history of reconfiguration problems and can be regarded as a sorting problem with special constraints.

The problem is defined as follows, see also Fig. 1. We are given an undirected connected graph with n vertices $v_1,\dots ,v_n$, a set of tokens $T = \{t_1,\ldots ,t_n\}$ and two permutations $\pi _{\text {start}}$ and $\pi _{\text {target}}$. These permutations are called start permutation and target permutation, respectively. Initially vertex $v_i$ holds token $t_{\pi _{\text {start}}(i)}$. In one step, we are allowed to swap tokens on a pair of adjacent vertices, that is, if v and w are adjacent, v holds the token s, and w holds the token t, then the swap between v and w results in the configuration where v holds t, w holds s, and all the other tokens stay in place. The Token Swapping problem asks if the target configuration can be reached in at most k swaps. Thus, a solution for Token Swapping is a sequence of edges, where the swaps take place. The solution is optimal if its length is shortest possible. To see the correspondence to sorting note that every placement of tokens can be regarded as a permutation and the target permutation can be regarded as the sorted state.

Yamanaka et al. [36] observed that every instance of Token Swapping has a solution, and its length is $O(n^2)$. Moreover, $\Omega (n^2)$ swaps are sometimes necessary.

It is interesting to note that although the problem in its full generality was introduced only recently [36], some special cases were studied before in the context of sorting permutations with additional restrictions (see Knuth [24, Section 5.2.2] for paths, Pak [30] for stars, Cayley [6] for cliques, and Heath and Vergara [19] for squares of a path).

Recently the problem was also solved for a special case of complete split graphs (see Gaku et. al. [38]). It is also worth mentioning that a very closely related concept of sorting permutations using cost-constrained transitions was considered by Farnoud, Chen, and Milenkovic [13], and Farnoud and Milenkovic[12].

The computational complexity of Token Swapping was investigated by Miltzow et al. [28]. They show that the problem is NP-complete and APX-complete. Moreover, they show that any algorithm solving Token Swapping in time $2^{o(n)}$ would refute the Exponential Time Hypothesis (ETH) [21]. The results of Miltzow et al. [28] carry over also to a generalization of Token Swapping, called Colored Token Swapping, first introduced by Yamanaka et al. [37]. In this problem, vertices and tokens are partitioned into color classes. For each color c, the number of tokens colored c equals the number of vertices colored c. The question is whether k swaps are enough to reach a configuration in which each vertex contains a token of its own color. Token Swapping corresponds to the special case where each color class comprises exactly one token and one vertex. NP-hardness of Colored Token Swapping was first shown by Yamanaka et al. [37], even in the case that only 3 colors exist.

We introduce Subset Token Swapping, which is an even further generalization of Token Swapping. Here a function $D: T \rightarrow 2^V$ specifies the set D(t) of possible destinations for the token t. We ask if k swaps are enough to reach a configuration, when each token t is placed on a vertex from D(t). Observe that Subset Token Swapping also generalizes Colored Token Swapping. It might happen that there is no satisfying swapping sequence at all to this new problem. Though, this can be checked in polynomial time by deciding if there is a perfect matching in the bipartite token-destination graph. Thus we shall always assume that we have a satisfiable instance.

In this paper we continue and extend the work of Miltzow et al. [28]. They presented a very simple algorithm which solves the instance of Token Swapping in $n^{O(k)}$ time and space, where k denotes the number of allowed swaps. In Sect. 3 we show that this algorithm can be easily generalized to Colored Token Swapping and Subset Token Swapping. Next, we present a slightly slower exact algorithm, whose advantage is only polynomial (in fact, only slightly super-linear) space complexity.

The algorithm by Miltzow et al. [28] shows that Token Swapping is in XP. A natural next step is to investigate whether the problem can be solved in FPT time (i.e., $f(k) \cdot n^{O(1)}$, for some function f). There is some evidence indicating that this could be possible. First, observe that if more than 2k tokens are misplaced, then one can immediately answer that we deal with a No-instance, as each swap involves exactly two tokens. Further, one can safely remove all vertices from the graph that are at distance more than k from all misplaced tokens. This preprocessing yields an equivalent instance, where every connected component has diameter $O(k^2)$. Thus for bounded maximum degree $\Delta $ each component has size f(k), for some function f. The connected components of f(k) size can be solved separately by exhaustively guessing (still in FPT time) the number of swaps to perform in each of them. Moreover, even the generalized Subset Token Swapping problem is FPT in $k + \Delta $ (see Proposition 3). For those reasons, one could have hoped for an FPT algorithm for general graphs. However, as the main result of this paper, we show in Sect. 4 that this is not possible.

Theorem 1

(Parameterized hardness) Token Swapping is $W[1]$-hard, parameterized by the number k of allowed swaps. Moreover, assuming the ETH, for any computable function f, Token Swapping cannot be solved in time $f(k)(n+m)^{o(k / \log k)}$ where n and m are respectively the number of vertices and edges of the input graph.

Observe that this lower bound shows that the simple $n^{O(k)}$-time algorithm is almost best possible. It is worth mentioning that the parameter for which we show hardness is in fact number of swaps $+$ number of initially misplaced tokens $+$ diameter of the graph, which matches the reasoning presented in the previous paragraph.

To show the lower bound, we introduce handy gadgets called linkers. They are simple and can be used to give a significantly simpler proof of the lower bounds given by Miltzow et al. [28]. One might also use them to establish a simpler and potentially stronger inapproximability result.

Since there is no FPT algorithm for Token Swapping (parameterized by the number k of swaps), unless FPT $=$ $W[1]$, a natural approach is to try to restrict the input graph classes, in hope to obtain some positive results. Indeed, in Sect. 5 we show that FPT algorithms exist, if we restrict our input to the so-called nowhere-dense graph classes.

Theorem 2

(FPT in nowhere dense graphs) Subset Token Swapping is FPT parameterized by k on nowhere-dense graph classes.

The notion of nowhere-dense graph classes has been introduced as a common generalization of several previously known notions of sparsity in graphs such as planar graphs, graphs with forbidden (topological) minors, graphs with (locally) bounded treewidth or graphs with bounded maximum degree.

Grohe et al. [17] proved that every property definable as a first-order formula $\varphi $ is solvable in $O(f(|\varphi |,\varepsilon )\, n^{1+\varepsilon })$ time on nowhere-dense classes of graphs, for every $\varepsilon >0$. We use this meta-theorem to show the existence of an FPT time algorithm for Subset Token Swapping, restricted to nowhere-dense graph classes. In particular, this implies the following results.

Corollary 1

Subset Token Swapping is FPT

(a)
parameterized by $k + {{\mathrm{tw}}}(G)$,
(b)
parameterized by k in planar graphs.

It is often observed that NP-hard graph problems become tractable on classes of graphs with bounded treewidth (or, at least, with bounded tree-depth; see Nešetřil and Ossona de Mendez [29, Chapter 10] for the definition and some background of tree-depth and related parameters). It is not uncommon to see FPT algorithms running in time $f({{\mathrm{tw}}})n^{O(1)}$ (or $f({{\mathrm{td}}})n^{O(1)}$) or XP algorithms running in time $n^{f({{\mathrm{tw}}})}$ (or $n^{f({{\mathrm{td}}})}$), for some computable functions f. Especially, in light of Corollary 1(a), we want to know if there exists an algorithm that runs in polynomial time for constant treewidth. In Sect. 6 we rule out the existence of such algorithms by showing that Token Swapping remains NP-hard when restricted to graphs with tree-depth 4 (treewidth and pathwidth 2; diameter 6; distance 1 to a forest).

Theorem 3

(Hard on almost trees) Token Swapping remains NP-hard even when both the treewidth and the diameter of the input graph are constant, and cannot be solved in time $2^{o(n)}$, unless the ETH fails.

Table 1 shows the current state of our knowledge about the parameterized complexity of Token Swapping (TS), Colored Token Swapping (CTS), and Subset Token Swapping (STS) problems, for different choices of parameters.

Table 1 The parameterized complexity of Token Swapping, Colored Token Swapping, and Subset Token Swapping

Full size table

While we think that our results give a fairly detailed view on the complexity landscape of Token Swapping, we also want to point out that our reductions are significantly simpler than those by Miltzow et al. [28].

Since the investigated problems seem to be immensely intractable, in Sect. 7 we investigate their complexities in very restricted classes of graphs, namely cliques, stars, and paths. We focus on finding the borderlines between easy (polynomially solvable) and hard (NP-hard) cases. The summary of these results is given in Table 2. Observe that on cliques Token Swapping is in P, while Colored Token Swapping (and thus also Subset Token Swapping) is NP-hard. On the other hand, on stars Colored Token Swapping (and thus also Token Swapping) is in P and Subset Token Swapping is NP-hard.

The paper is concluded with several open problems in Sect. 8.

2 Preliminaries

Yamanaka et al. [36] showed that in every instance of Token Swapping, the length of the optimal solution is $O(n^2)$ and this bound is asymptotically tight for paths. Here we show that long induced paths are the only structures forcing solutions of superlinear length.

Proposition 1

The length of the optimal solution for Token Swapping in an n-vertex $P_{r+1}$-free graph G is at most $r \cdot n$.

Table 2 The complexity of Token Swapping (TS), Colored Token Swapping (CTS), and Subset Token Swapping (STS) on very restricted classes of graphs

Full size table

Proof

We can assume that G is connected, since otherwise we can solve the problem on connected components separately. Let P be the longest path in G and let v be its end-vertex. Observe that $G - v$ is connected (otherwise P is not longest) and $P_{r+1}$-free. First, we move the token with destination v to this vertex, which requires at most ${{\mathrm{diam}}}(G) \leqslant r$ swaps. Then we can recursively continue with the graph $G-v$ (we never touch v again). Such a solution has length at most $r \cdot n$. $\square $

Note that this bound is asymptotically tight—to see this, consider a graph, whose every connected component is isomorphic to $P_r$ and has the reverse permutation of tokens (if we want to have our instance connected, we can add one additional vertex, adjacent to one of the end-vertices of each path, and put a well-placed token on it). Moreover, we observe that the bound from Proposition 1 holds also for Colored Token Swapping and Subset Token Swapping problems. Indeed, we can fix one destination for each of the tokens (by choosing a perfect matching in the token-destination graph) to obtain an instance of Token Swapping, whose solution is also the solution for the original problem.

For a token t, let ${{\mathrm{dist}}}(t)$ denote the distance from the position of t to its destination. For an instance I of Token Swapping, we define $L(I) := \sum _{t} {{\mathrm{dist}}}(t)$, i.e., the sum of distances to the destination over all the tokens. Clearly, after performing a single swap, ${{\mathrm{dist}}}(t)$ may change by at most 1. We shall also use the following classification of swaps: for $x,y \in \{-1,0,1\}$, $x \le y$, by a ($x/y$)-swap we mean a swap, in which one token changes its distance by x, and the other one by y. Intuitively, ($-1/-1$)-swaps are the most “efficient” ones, thus we will call them happy swaps. Since each swap involves two tokens, we get the following lower bound.

Proposition 2

[28]] The length of the optimal solution for an instance I of Token Swapping is at least L(I) / 2. Besides, it is exactly L(I) / 2 if and only if there is a solution using happy swaps only.

When designing algorithms, especially for computationally hard problems, it is natural to ask about lower bounds. However, the standard complexity assumption used for distinguishing easy and hard problems, i.e., P $\ne $ NP, is too weak to tell us something meaningful about possible complexities of algorithms. The stronger assumption that is typically used for this purpose is the so-called Exponential Time Hypothesis (usually referred to as the ETH), formulated by Impagliazzo and Paturi [21]. We refer the reader to the survey by Lokshtanov, Marx, and Saurabh for more information about ETH and conditional lower bounds [25]. The version we present below (and is most commonly used) is not the original statement of this hypothesis, but its weaker version (see also Impagliazzo et al. [22]).

Exponential Time Hypothesis

(Impagliazzo and Paturi [21]) There is no algorithm solving every instance of 3-Sat with N variables and M clauses in time $2^{o(N+M)}$.

3 Algorithms

First, we prove that Subset Token Swapping (and therefore also Colored Token Swapping as its restriction) is FPT in $k + \Delta $, where k is the number of allowed swaps, and $\Delta $ is the maximum degree of the input graph. This generalizes the observation of Miltzow et al. [28] for Token Swapping. Furthermore, we show that the simple algorithm for Token Swapping, presented by Miltzow et al. [28], carries over to the generalized problems, i.e., Colored Token Swapping and Subset Token Swapping. At last, we will present an algorithm that has polynomial space complexity.

Proposition 3

Subset Token Swapping is FPT in $k + \Delta $ and admits a kernel of size $2k + 2k^2 \cdot \Delta ^k$.

Proof

Let I be an instance of Subset Token Swapping on a graph G with maximum degree $\Delta $ and suppose I has a solution $\varvec{s}$ of length at most k.

Let $V_m$ be the set of such vertices v of G, that the token initially placed on v does not accept v as its destination. First, observe that every vertex from $V_m$ has to be involved in some swap in $\varvec{s}$. Thus we can assume that $|V_m|\le 2k$ (otherwise we immediately report a No-instance).

Let $E'$ be the set of edges that appear in $\varvec{s}$ and let $G'$ be the subgraph of G induced by $E'$. Consider a connected component C of $G'$. Suppose first that the vertex set of C does not contain any vertex from $V_m$. Observe that the sequence $\varvec{s}'$ obtained from $\varvec{s}$ by removing all edges from C is also a solution for I of length at most k. So, without loss of generality, every connected component C of $G'$ contains a vertex from $V_m$, and has at most k edges. Let $G''$ be the subgraph of G induced by the vertices at distance at most k from $V_m$ (we find it by running a breadth-first search, starting from $V_m$). We observe that every vertex incident to an edge in $E'$ is in $G''$. Thus the instance $I'$ of Subset Token Swapping, restricted to $G''$, is equivalent to I. Note that the maximum degree of $G''$ is at most $\Delta $, and the number of vertices in $G''$ is at most $2k + 2k^2\Delta ^{k}$. Thus $I'$ is a kernel for I. $\square $

Miltzow et al. [28] show that an optimal solution for Token Swapping can be found by performing a breath-first-search on the configuration graph, i.e. a graph, whose vertices are all possible configurations of tokens on vertices, and two configurations are adjacent when we can obtain one from another with a single swap. We observe that the same approach works for Colored Token Swapping and Subset Token Swapping, the only difference is that we terminate on any feasible target configuration.

Proposition 4

Let G be a graph with n vertices, and let k be the maximum number of allowed swaps. The Colored Token Swapping and the Subset Token Swapping problems on G can be solved in time:

$O(n! \cdot n^2) = 2^{O(n \log n)}$,
$n^{O(k)}=2^{O(k \log n)}$,

using exponential space. $\square $

The main drawback of such an approach is an exponential space complexity. Here we show the following complementary result, inspired by the ideas of Savitch [34].

Theorem 4

Let G be a graph with n vertices, and let k be the maximum number of allowed swaps. Subset Token Swapping on G can be solved in time $2^{O(n \log n \log k)}=2^{O(n \log ^2 n)}$ and space $O(n \log n \log k)=O(n \log ^2 n)$.

Proof

Consider the algorithm Reach (see Algorithm 1). It is easy to verify that it returns true if the configuration $\pi _s$ can be reached from the configuration $\pi _0$ with exactly k swaps, and false otherwise.

The depth of the recursion is $O(\log k)$. The configurations can be generated with polynomial delay, using only linear (in n) memory. Thus the time complexity of the algorithm is $n!^{\log k} \cdot n^{O(1)} = 2^{O(n \log n \log k)}$. The space needed to keep track of the recursive stack is $O(n \log n \log k)$. Recall that $k = O(n^2)$—otherwise we immediately report a Yes-instance.

To use the algorithm for Subset Token Swapping, we can enumerate all possible target configurations in $n! \cdot n^{O(1)} = 2^{O(n \log n)}$ time and polynomial space, and then solve the instance of Token Swapping for each of them. $\square $

4 Lower Bounds on Parameterized Token Swapping

Let us start by defining an auxiliary problem, called Multicolored Subgraph Isomorphism (also known as Partitioned Subgraph Isomorphism; see Fig. 2).

In Multicolored Subgraph Isomorphism, one is given a host graph H whose vertex set is partitioned into k color classes $V_1 \uplus V_2 \uplus \ldots \uplus V_k$ and a pattern graph P with k vertices: $V(P) = \{u_1,\ldots ,u_k\}$. The goal is to find an injection $\varphi : V(P) \rightarrow V(H)$ such that $u_iu_j\in E(P)$ implies that $\varphi (u_i)\varphi (u_j) \in E(H)$ and $\varphi (u_i) \in V_i$ for all i, j. Thus we can assume that each $V_i$ forms an independent set. Further, we assume without loss of generality that $E(V_i,V_j) := \{ ab \in E(H) :a \in V_i, b \in V_j\}$ is non-empty if and only if $u_iu_j \in E(P)$. In other words, we try to find k vertices $v_1 \in V_1$, $v_2 \in V_2$, $\ldots $, $v_k \in V_k$ such that, for any $i < j \in [k]$,^{Footnote 1} there is an edge between $v_i$ and $v_j$ if and only if $E(V_i,V_j)$ is non-empty.

The $W[1]$-hardness of Multicolored Subgraph Isomorphism problem follows from the $W[1]$-hardness of the Multicolored Clique. Marx [26] showed that assuming the ETH, Multicolored Subgraph Isomorphism cannot be solved in time $f(k)(|V(H)|+|E(H)|)^{o(k / \log k)}$, for any computable function f, even when the pattern graph P is 3-regular and bipartite (see also Marx and Pilipczuk [27]). In particular, k has to be an even integer since |E(P)| is exactly 3k / 2. We finally assume that for every $i \in [k]$ it holds that $|V_i|=t$, by padding potentially smaller classes with isolated vertices. This can only increase the size of the host graph by a factor of k, and does not create any new solution nor destroy any existing one.

Now we are ready to prove the following theorem.