Characterizations and Directed Path-Width of Sequence Digraphs

Gurski, Frank; Rehs, Carolin; Rethmann, Jochen

doi:10.1007/s00224-022-10104-w

Characterizations and Directed Path-Width of Sequence Digraphs

Open access
Published: 23 November 2022

Volume 67, pages 310–347, (2023)
Cite this article

Download PDF

You have full access to this open access article

Theory of Computing Systems Aims and scope Submit manuscript

Characterizations and Directed Path-Width of Sequence Digraphs

Download PDF

1542 Accesses
Explore all metrics

Abstract

Computing the directed path-width of a directed graph is an NP-hard problem. Even for digraphs of maximum semi-degree 3 the problem remains hard. We propose a decomposition of an input digraph G = (V,A) by a number k of sequences with entries from V, such that (u,v) ∈ A if and only if in one of the sequences there is an occurrence of u appearing before an occurrence of v. We present several graph theoretical properties of these digraphs. Among these we give forbidden subdigraphs of digraphs which can be defined by k = 1 sequence, which is a subclass of semicomplete digraphs. Given the decomposition of digraph G, we show an algorithm which computes the directed path-width of G in time $\mathcal {O}(k\cdot (1+N)^{k})$, where N denotes the maximum sequence length. This leads to an XP-algorithm w.r.t. k for the directed path-width problem. Our result improves the algorithms of Kitsunai et al. for digraphs of large directed path-width which can be decomposed by a small number of sequences and confirm their conjecture that semicompleteness is a useful restriction when considering digraphs.

Directed Path-Width of Sequence Digraphs

Quasi-Transitive Digraphs and Their Extensions

On the Pathwidth of Almost Semicomplete Digraphs

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

Let Q = {q₁,…,q_k} be a set of k sequences $q_{i}=(b_{i,1},\ldots ,b_{i,n_{i}})$, 1 ≤ i ≤ k. All n₁ + … + n_k items b_i,j are pairwise distinct. Further there is a function t which assigns every item b_i,j with a type t(b_i,j). The sequence digraph g(Q) = (V,A) for a set Q = {q₁,…,q_k} has a vertex for every type and an arc (u,v) ∈ A if and only if there is some sequence q_i in Q where an item of type u is on the left of some item of type v.^{Footnote 1} The contributions of this paper concern graph theoretic properties of sequence digraphs and an algorithm to compute from a given set Q on k sequences the directed path-width of g(Q). In Section 2 we give preliminaries for graphs and digraphs. In Section 3 we show how to define digraphs from sets of sequences, and vice versa. Further we give methods in order to compute the sequence digraph and also its complement digraph. In Section 4 we consider graph theoretic properties of sequence digraphs. Therefore in Section 4.1 we introduce the class S_k,ℓ as the set of all sequence digraphs defined by sets Q on at most k sequences that together contain at most ℓ items of each type. We show inclusions of these classes and necessary conditions for digraphs to be in S_k,ℓ. In Section 4.2 we give finite sets of forbidden induced subgraphs for all classes S_1,1 and S_1,2. It turns out that S_1,1 is equal to the well known class of transitive tournaments. Since only the first and the last item of each type in every q_i ∈ Q are important for the arcs in the corresponding digraph all classes S_1,ℓ, ℓ ≥ 2 are equal and can be characterized by one set of four forbidden induced subdigraphs. These characterizations lead to polynomial time recognition algorithms for the corresponding graph classes. Furthermore we give characterizations in terms of special tournaments and conditions for the complement digraph. In Section 4.3 we consider the closure of the classes S_k,ℓ with respect to the operations taking the converse digraph and taking the complement digraph. In Section 4.4 we characterize sequences whose defined graphs can be obtained by the union of the graphs for the subsequences which consider only the first or last item of each type.

In Section 5 we consider the directed path-width problem on sequence digraphs. We show that for digraphs defined by k = 1 sequence the directed path-width can be computed in polynomial time. Further we show that for sets Q of sequences of bounded length, of bounded distribution of the items of every type onto the sequences, or bounded number of items of every type computing the directed path-width of g(Q) is NP-hard. We also introduce a method which computes from a given set Q on k sequences the directed path-width of g(Q), as well as a directed path-decomposition can be computed in time $\mathcal {O}(k\cdot (1+\max \limits _{1\leq i \leq k}n_{i})^{k})$. The main idea is to discover an optimal directed path-decomposition by scanning the k sequences left-to-right and keeping in a state the numbers of scanned items of every sequence and the number of active types.

From a parameterized point of view our method leads to an XP-algorithm w.r.t. parameter k. While the existence of FPT-algorithms for computing directed path-width is open up to now, there are XP-algorithms for the directed path-width problem for some digraph G = (V,A). The directed path-width can be computed in time ${\mathcal {O}}\left (|A|\cdot |V|^{2\text {d-pw}(G)}/(\text {d-pw}(G)-1)!\right )$ by [3] and in time ${\mathcal {O}}(\text {d-pw}(G)\cdot |A|\cdot |V|^{2\text {d-pw}(G)})$ by [4]. Further in [5] it is shown how to decide whether the directed path-width of an ℓ-semicomplete digraph is at most w in time $(\ell +2w+1)^{2w}\cdot |V|^{{\mathcal {O}}(1)}$. All these algorithms are exponential in the directed path-width of the input digraph while our algorithm is exponential within the number of sequences. Thus our result improves theses algorithms for digraphs of large directed path-width which can be decomposed by a small number of sequences (see Table 1 for examples). Furthermore the directed path-width can be computed in time $3^{\tau (\textit {und}(G))} \cdot |V|^{\mathcal {O}(1)}$, where τ(und(G)) denotes the vertex cover number of the underlying undirected graph of G, by [6]. Thus our result improves this algorithm for digraphs of large τ(und(G)) which can be decomposed by a small number of sequences (see Table 1 for examples).

Table 1 Values of parameters within XP-algorithms for directed path-width

Full size table

Since every single sequence q_i defines a semicomplete $\{\text {co-}(2\overrightarrow {P_{2}}),\overrightarrow {C_{3}},D_{4}\}$-free digraph (cf. Table 2 for the digraphs), we consider digraphs which can be obtained by the union of k special semicomplete digraphs which confirms the conjecture of [5] that semicompleteness is a useful restriction when considering digraphs.^{Footnote 2} In Section 6 we give conclusions and point out open questions.

Table 2 Special digraphs

Full size table

2 Preliminaries

We use the notations of Bang-Jensen and Gutin [7] for graphs and digraphs.

Undirected Graphs

We work with finite undirected graphsG = (V,E), where V is a finite set of vertices and $E \subseteq \{ \{u,v\} \mid u,v \in V,~u \not = v\}$ is a finite set of edges. For a vertex v ∈ V we denote by N_G(v) the set of all vertices which are adjacent to v in G, i.e. N_G(v) = {w ∈ V ∣{v,w}∈ E}. Set N_G(v) is called the set of all neighbors of v in G or neighborhood of v in G. The degree of a vertex v ∈ V, denoted by $\deg _{G}(v)$, is the number of neighbors of vertex v in G, i.e. $\deg _{G}(v)=|N_{G}(v)|$. The maximum vertex degree is ${\Delta }(G)=\max \limits _{v\in V} \deg _{G}(v)$. A graph $G^{\prime }=(V^{\prime },E^{\prime })$ is a subgraph of graph G = (V,E) if $V^{\prime }\subseteq V$ and $E^{\prime }\subseteq E$. If every edge of E with both end vertices in $V^{\prime }$ is in $E^{\prime }$, we say that $G^{\prime }$ is an induced subgraph of digraph G and we write $G^{\prime }=G[V^{\prime }]$. For some undirected graph G = (V,E) its complement graph is defined by co-G = (V,{{u,v}∣{u,v}∉E,u,v ∈ V,u≠v}).

Special Undirected Graphs

By P_n = ({v₁,…,v_n},{{v₁,v₂},…,{v_n− 1,v_n}}), n ≥ 2, we denote a path on n vertices and by C_n = ({v₁,…,v_n},{{v₁,v₂},…, {v_n− 1,v_n},{v_n,v₁}}), n ≥ 3, we denote a cycle on n vertices. Further by K_n = ({v₁,…,v_n},{{v_i,v_j}∣1 ≤ i < j ≤ n}), n ≥ 1, we denote a complete graph on n vertices and by K_n,m = ({v₁,…,v_n,w₁,…,w_m},{{v_i,w_j}∣1 ≤ i ≤ n,1 ≤ j ≤ m}) a complete bipartite graph on n + m vertices.

Directed Graphs

A directed graph or digraph is a pair D = (V,A), where V is a finite set of vertices and $A\subseteq \{(u,v) \mid u,v \in V,~u \not = v\}$ is a finite set of ordered pairs of distinct^{Footnote 3} vertices called arcs. For a vertex v ∈ V, the sets $N_{D}^{+}(v)=\{u\in V \mid (v,u)\in A\}$ and $N_{D}^{-}(v)=\{u\in V \mid (u,v)\in A\}$ are called the set of all out-neighbours and the set of all in-neighbours of v. The outdegree of v, outdegree_D(v) for short, is the number of out-neighbours of v and the indegree of v, indegree_D(v) for short, is the number of in-neighbours of v in D. The maximum out-degree is ${\Delta }^{+}(D)=\max \limits _{v\in V} \text {outdegree}_{D}(v)$ and the maximum in-degree is ${\Delta }^{-}(D)=\max \limits _{v\in V} \text {indegree}_{D}(v)$. The maximum vertex degree is ${\Delta }(G)=\max \limits _{v\in V} \text {outdegree}_{G}(v)+\text {indegree}_{G}(v)$ and the maximum semi-degree is ${\Delta }^{0}(G)=\max \limits \{{\Delta }^{-}(D),{\Delta }^{+}(D)\}$.

A vertex v ∈ V is out-dominating (in-dominated) if it is adjacent to every other vertex in V and is a source (a sink, respectively). A digraph $D^{\prime }=(V^{\prime },A^{\prime })$ is a subdigraph of digraph D = (V,A) if $V^{\prime }\subseteq V$ and $A^{\prime }\subseteq A$. If every arc of A with both end vertices in $V^{\prime }$ is in $A^{\prime }$, we say that $D^{\prime }$ is an induced subdigraph of digraph D and we write $D^{\prime }=D[V^{\prime }]$. For some digraph D = (V,A) its complement digraph is defined by co-D = (V,{(u,v)∣(u,v)∉A,u,v ∈ V,u≠v}) and its converse digraph is defined by con-D = (V,{(u,v)∣(v,u) ∈ A,u,v ∈ V,u≠v}).

Let G = (V,A) be a digraph.

G is edgeless if for all u,v ∈ V, u≠v, none of the two pairs (u,v) and (v,u) belongs to A.
G is a tournament if for all u,v ∈ V, u≠v, exactly one of the two pairs (u,v) and (v,u) belongs to A.
G is semicomplete if for all u,v ∈ V, u≠v, at least one of the two pairs (u,v) and (v,u) belongs to A.
G is (bidirectional) complete if for all u,v ∈ V, u≠v, both of the two pairs (u,v) and (v,u) belong to A.

Omitting the Directions

For some given digraph D = (V,A), we define its underlying undirected graph by ignoring the directions of the edges, i.e. und(D) = (V,{{u,v}∣(u,v) ∈ A,u,v ∈ V }).

Orientations

There are several ways to define a digraph D = (V,A) from an undirected graph G = (V,E), see [7]. If we replace every edge {u,v} of G by

one of the arcs (u,v) and (v,u), we denote D as an orientation of G. Every digraph D which can be obtained by an orientation of some undirected graph G is called an oriented graph.
one or both of the arcs (u,v) and (v,u), we denote D as a biorientation of G. Every digraph D which can be obtained by a biorientation of some undirected graph G is called a bioriented graph.
both arcs (u,v) and (v,u), we denote D as a complete biorientation of G. Since in this case D is well defined by G we also denote it by G⃡. Every digraph D which can be obtained by a complete biorientation of some undirected graph G is called a complete bioriented graph.

Special Directed Graphs

We denote by K_n⃡ = ({v₁,…,v_n},{(v_i,v_j)∣1 ≤ i≠j ≤ n}), n ≥ 1 a bidirectional complete digraph on n vertices. By $\overrightarrow {P_{n}}=(\{v_{1},\ldots ,v_{n}\},\{ (v_{1},v_{2}),\ldots , (v_{n-1},v_{n})\})$, n ≥ 2 we denote a directed path on n vertices and by $\overrightarrow {C_{n}}=(\{v_{1},\ldots ,v_{n}\},\{(v_{1},v_{2}),\ldots , (v_{n-1},v_{n}),(v_{n},v_{1})\})$, n ≥ 2 we denote a directed cycle on n vertices.

An oriented forest (tree) is the orientation of a forest (tree). A digraph is an out-tree (in-tree) if it is an oriented tree in which there is exactly one vertex of indegree (outdegree) zero.

A directed acyclic digraph (DAG for short) is a digraph without any $\overrightarrow {C_{n}}$, n ≥ 2 as subdigraph.

3 Sequence Digraphs and Sequence Systems

3.1 From Sequences to Digraphs

Let Q = {q₁,…,q_k} be a set of k sequences. Every sequence $q_{i}=(b_{i,1},\ldots ,b_{i,n_{i}})$ consists of a number n_i of items, such that all $n={\sum }_{i=1}^{k} n_{i}$ items are pairwise distinct. Further there is a function t which assigns every item b_i,j with a type t(b_i,j). The set of all types of the items in some sequence q_i is denoted by types(q_i) = {t(b)∣b ∈ q_i}. For a set of sequences Q = {q₁,…,q_k} we denote types(Q) = types(q₁) ∪⋯ ∪types(q_k). For some sequence $q_{\ell }=(b_{\ell ,1}, \ldots ,b_{\ell ,n_{\ell }})$ we say item b_ℓ,i is on the left of item b_ℓ,j in sequence q_ℓ if i < j. Item b_ℓ,i is on the position i in sequence q_ℓ, since there are i − 1 items on the left of b_ℓ,i in sequence q_ℓ.

In order to insert a new item b on a position j in sequence q_i we first move all items on positions $j^{\prime }\geq j$ to position $j^{\prime }+1$ starting at the rightmost position n_i and then we insert b at position j. In order to remove an existing item b at a position j in sequence q_i we move all items from positions $j^{\prime }\geq j+1$ to position $j^{\prime }-1$ starting at position j + 1.

For hardness results in Section 5.2 we consider the distribution of the items of a type t onto the sequences by

$$ d_{Q}(t) = |\{ q\in Q \mid t \in \textit{types}(q) \}| \qquad \text{ and } \qquad d_{Q} = \underset{t \in \textit{types}(Q)}{\max} d_{Q}(t). $$

For the number of items for type t within the sequences we define

$$ c_{Q}(t) = \sum\limits_{q\in Q} |\{ b \in q \mid t(b)=t\}| \qquad \text{ and } \qquad c_{Q} = \underset{t \in \textit{types}(Q)}{\max} c_{Q}(t). $$

Obviously it holds d_Q ≤ k and 1 ≤ d_Q ≤ c_Q ≤ n.

Definition 1 (Sequence Digraph)

The sequence digraph g(Q) = (V,A) for a set Q = {q₁,…,q_k} has a vertex for every type, i.e. V = types(Q) and an arc (u,v) ∈ A if and only if there is some sequence q_i in Q where an item of type u is on the left of an item of type v. More formally, there is an arc (u,v) ∈ A if and only if there is some sequence q_i in Q, such that there are two items b_i,j and $b_{i,j^{\prime }}$ such that

(1)
$1 \leq j < j^{\prime } \leq n_{i}$,
(2)
t(b_i,j) = u,
(3)
$t(b_{i,j^{\prime }})=v$, and
(4)
u≠v.

Sequence digraphs have successfully been applied in order to model the stacking process of bins from conveyor belts onto pallets with respect to customer orders, which is an important task in palletizing systems used in centralized distribution centers [8]. In this paper we show graph theoretic properties of sequence digraphs and their relation to special digraph classes.

In our examples we will use type identifications instead of item identifications to represent a sequence q_i ∈ Q. For (not necessarily distinct) r types t₁,…,t_r let [t₁,…,t_r] denote some sequence q_i = (b_i,1,…,b_i,r) of r pairwise distinct items, such that t(b_i,j) = t_j for j = 1,…,r. We use this notation for sets of sequences as well.

Example 1 (Sequence Digraph)

Figure 1 shows the sequence digraph g(Q) for Q = {q₁,q₂,q₃} with sequences q₁ = [a,a,d,e,d], q₂ = [c,b,b,d], and q₃ = [c,c,d,e,d].

In Example 1 all sequences of Q contain consecutive items of the same type. This is obviously not necessary to obtain digraph g(Q). Let C(q_i) be the subsequence of q_i which is obtained from q_i by removing all but one of consecutive items of the same type for each type and C(Q) = {C(q₁),…,C(q_k)}.

Observation 1

Let Q = {q₁,…,q_k} be a set of k sequences, then g(Q) = g(C(Q)).

Next we give methods in order to compute the sequence digraph and also its complement digraph. Therefore we define the position of the first item in some sequence q_i ∈ Q of some type t ∈types(Q) by first(q_i,t) and the position of the last item of type t in sequence q_i by last(q_i,t). For technical reasons, if there is no item for type t contained in sequence q_i, then we define first(q_i,t) = n_i + 1, and last(q_i,t) = 0.

Lemma 1

Let Q = {q₁} be some set of one sequence, g(Q) = (V,A) the defined sequence digraph, co-(g(Q)) = (V,A^c) its complement digraph, and u≠v two vertices of V.

1.
There is an arc (u,v) ∈ A, if and only if first(q₁,u) < last(q₁,v).
2.
There is an arc (u,v) ∈ A^c, if and only if last(q₁,v) < first(q₁,u).
3.
If (u,v) ∈ A^c, then (v,u) ∈ A.
4.
There is an arc (u,v) ∈ A and an arc (v,u) ∈ A^c, if and only if last(q₁,u) < first(q₁,v).

Proof 1

1.
By the definition of the arcs of sequence digraphs.
2.
It holds (u,v) ∈ A^c if and only if (u,v)∉A. Further by (1) and u≠v it holds (u,v)∉A if and only if first(q₁,u) > last(q₁,v).
3.
If (u,v) ∈ A^c, then our result (2) implies last(q₁,v) < first(q₁,u) which implies that (v,u) ∈ A.
4.
Follows by (2) and (3).

□

Lemma 2

Let Q = {q₁,…,q_k} be some set of k sequences, g(Q) = (V,A) the defined sequence digraph, co-(g(Q)) = (V,A^c) its complement digraph, and u≠v two vertices of V.

1.
There is an arc (u,v) ∈ A, if and only if there is some q_i ∈ Q such that first(q_i,u) < last(q_i,v).
2.
There is an arc (u,v) ∈ A^c, if and only if for every q_i ∈ Q it holds last(q_i,v) < first(q_i,u).

Proof 2

1.
By the definition of the arcs of sequence digraphs.
2.
It holds (u,v) ∈ A^c if and only if (u,v)∉A. Further by (1) and u≠v we have (u,v)∉A if and only if for every q_i ∈ Q it holds first(q_i,u) > last(q_i,v).

□

Properties (3) and (4) of Lemma 1 can not be shown for k > 1 sequences, since an arc (u,v) ∈ A^c can be obtained by two types u and v from two different sequences q_i and q_j such that u ∈types(q_i) −types(q_j) and v ∈types(q_j) −types(q_i). In such situations it is not necessary to have (v,u) ∈ A.

By Lemma 2 (1) only the first and the last item of each type in every q_i ∈ Q are important for the arcs in the corresponding digraph. Let M(q_i) be the subsequence of q_i which is obtained from q_i by removing all except the first and last item for each type and M(Q) = {M(q₁),…,M(q_k)}.

Observation 2

Let Q = {q₁,…,q_k} be a set of k sequences, then g(Q) = g(M(Q)).

Using Lemma 2 and Observation 2 the running time to obtain the sequence digraph can be bounded as follows.

Proposition 1

Given a set Q of sequences, the sequence digraph g(Q) = (V,A) can be computed in time $\mathcal {O}(n + k\cdot |\textit {types}(Q)|^{2})={\mathcal {O}}(n + k\cdot |V|^{2})$.

Proof 3

First, we initialize the values first(q_i,t) := |q_i| and last(q_i,t) := 0 for each sequence q_i and type t. Then, we compute the values first(q_i,t) and last(q_i,t) by scanning each sequence q_i from left to right and at position j where t(b_i,j) = t set the values last(q_i,t) := j and first(q_i,t) := j. The last assignment is made only if first(q_i,t) = |q_i| has its initial value. Finally, g(Q) can be computed in ${\mathcal {O}}(k \cdot |V|^{2})$ time by considering each pair $t,t^{\prime }$ of types for every sequence q_i and add the arc $(t,t^{\prime })$ to A if $\textit {first}(q_{i},t) < \textit {last}(q_{i},t^{\prime })$. □

3.2 From Digraphs to Sequences

Definition 2 (Sequence System)

Let G = (V,A) be some digraph and A = {a₁,…,a_ℓ} its arc set. The sequence system q(G) = {q₁,…,q_ℓ} for G is defined as follows.

(1)
There are 2ℓ items b_1,1,b_1,2,…,b_ℓ,1,b_ℓ,2.
(2)
Sequence q_i = (b_i,1,b_i,2) for 1 ≤ i ≤ ℓ.
(3)
The type of item b_i,1 is the first vertex of arc a_i and the type of item b_i,2 is the second vertex of arc a_i for 1 ≤ i ≤ ℓ. Thus types(q(G)) = V.

Example 2 (Sequence System)

For the digraph G of Fig. 2 the corresponding sequence system is given by q(G) = {q₁,q₂,q₃,q₄,q₅,q₆,q₇}, where q₁ = [a,b], q₂ = [b,c], q₃ = [c,d], q₄ = [d,e], q₅ = [e,a], q₆ = [e,f], q₇ = [f,a]. The sequence digraph of q(G) is G.

The definitions of the sequence system q(G) and the sequence digraph g(Q), defined in Section 3.1, imply the following results.

Observation 3

For every digraph G it holds G = g(q(G)).

The related relation Q = q(g(Q)) is not true in general (e.g. not for Q from Example 1). But if the reduction to different consecutive items C(Q) contains exactly two items of different types the following equivalence holds true.

Lemma 3

For every set Q of sequences it holds C(Q) = q(g(Q)) if and only if each sequence q_i ∈ C(Q) contains exactly two items of different types.

Since q(g(Q)) = q(g(C(Q))) holds for every set of sequences Q, in Lemma 3 we also can replace condition C(Q) = q(g(Q)) by condition C(Q) = q(g(C(Q))).

Lemma 4

For every digraph G = (V,A) with und(G) = (V,E) there is a set Q of at most |E| sequences such that G = g(Q).

Proof 4

For every digraph G = (V,A) by Observation 3 the sequence system Q = q(G) leads to a set of at most |A| sequences such that G = g(Q). If we have two arcs (u,v) and (v,u) between two vertices u and v these can be represented by one sequence [u,v,u]. □

There are digraphs which even can be defined by one sequence (see Example 3 (1) and Theorem 1) and there are digraphs for which |E| sequences are really necessary (see Lemmas 10 and 9).

For digraphs of bounded vertex degree the sequence system Q = q(G) leads to sets whose distribution and number of items of each type can be bounded as follows.

Lemma 5

For every digraph G = (V,A) where $\max \limits ({\Delta }^{-}(G),{\Delta }^{+}(G))\leq d$ there is a set Q with d_Q ≤ 2d and c_Q ≤ 2d such that G = g(Q).

In case of complete bioriented digraphs we can improve the latter bounds as follows.

Lemma 6

For every complete bioriented digraph G = (V,A) where $\max \limits ({\Delta }^{-}(G),{\Delta }^{+}(G))\leq d$ there is a set Q with d_Q ≤ d and c_Q ≤ 2d (for d ≥ 2 even c_Q ≤ 2d − 1) such that G = g(Q).

Proof 5

By Lemma 5 we obtain d_Q ≤ 2d and c_Q ≤ 2d. Since we consider complete bioriented digraphs for every two vertices u and v we now have two sequences [u,v] and [v,u], which can be combined to only one sequence [u,v,u]. This leads to d_Q ≤ d and c_Q ≤ 2d. For d ≥ 2 we can show the stated improvement as follows. If there is one type tintypes(Q) such that there are exactly 2d items of type t, then there are less than d types t^′intypes(Q) such that there are exactly 2d − 1 items of type $t^{\prime }$. Thus there is one sequence [t,t”,t] such that there are at most 2d − 2 items of type t”. If we substitute [t,t”,t] by [t”,t,t”] we reduced the number of items of type t to 2d − 1. If we repeat this substitution until there is no type tintypes(Q) such that there are exactly 2d items of type t, we achieve c_Q ≤ 2d − 1. □

4 Properties of Sequence Digraphs

4.1 Graph Classes and their Relations

In order to represent some digraph G = (V,A) as a sequence digraph we need exactly |V | types. By Lemma 4 every digraph is a sequence digraph using a suitable set Q of sequences. Thus we want to consider digraphs which can be defined by a given upper bound for the number of sequences. Furthermore the first and the last item of each type within a sequence is the most important one by Observation 2, thus we want to analyze the digraphs which can be defined by a given upper bound for the number of items of each type. We define S_k,ℓ to be the set of all sequence digraphs defined by sets Q on at most k sequences that together contain at most ℓ items of each type in types(Q). By the definition we know for every two integers k ≥ 1 and ℓ ≥ 1 the following inclusions between these graph classes.

$$ \begin{array}{@{}rcl@{}} S_{k,\ell} & \subseteq & S_{k+1,\ell} \end{array} $$

(1)

$$ \begin{array}{@{}rcl@{}} S_{k,\ell} & \subseteq& S_{k,\ell+1} \end{array} $$

(2)

By Lemma 4 and Observation 2 we obtain the following bounds.

Corollary 1

Let Q be a set on k sequences and g(Q) = (V,A) ∈ S_k,ℓ the defined graph with und(g(Q)) = (V,E). Then, $g(Q) in S_{k, ell^{\prime }} $ for all $ell^{\prime } geq 2k$ and $g(Q) in S_{k^{\prime }, ell}$ for all $k^{\prime } ge |E|$.

Lemma 7

Let ℓ ≥ 1 and G ∈ S_1,ℓ be defined by Q = {q₁}, then digraph g(Q) is semicomplete and thus graph und(g(Q)) is the complete graph on |types(Q)| vertices.

Proof 6

Let Q = {q₁} be some set of one sequence which defines the sequence digraph g(Q). For every vertex v in g(Q) there is exactly one corresponding type t_v ∈types(q₁). Since G ∈ S_1,ℓ for every two vertices v and w in g(Q) there are two items b_1,i and b_1,j of type t_v and t_w, respectively, which define the arc (v,w) or (w,v). □

Corollary 2

Let G = (V,A) be a digraph, such that graph und(G) has k connected components. Then for every $k^{\prime }<k$ and ℓ ≥ 1 it holds $G\not \in S_{k^{\prime },\ell }$.

Next we consider the relations of the defined classes for k = 1 sequence. Since S_1,1 contains only digraphs with exactly one arc between every pair of vertices (cf. Theorem 1 for a more precise characterization) and S_1,ℓ for ℓ ≥ 2 contains all bidirectional complete graphs we know that $S_{1,1}\subsetneq S_{1,\ell }$ for every ℓ ≥ 2. Further by (2) and by Observation 2 it follows that all classes S_1,ℓ for ℓ ≥ 2 are equal.

Lemma 8

For every two integers $\ell ,\ell ^{\prime } \geq 2$ it holds $S_{1,\ell }=S_{1, \ell ^{\prime }}$.

Corollary 3

For ℓ ≥ 2 the following inclusions hold.

$$ S_{1, 1}\subsetneq S_{1, 2}= {\ldots} = S_{1, \ell} $$

The equalities of Lemma 8 can not be generalized for k > 1 sequences and ℓ ≤ k since removing items from different sequences can change the sequence digraph. In order to give examples for digraphs which do not belong to some of the classes S_k,ℓ we next show some useful results.

For a set of digraphs ${\mathcal F}$ we denote by ${\mathcal F}$-free digraphs the set of all digraphs G such that no induced subdigraph of G is isomorphic to a member of ${\mathcal F}$. If ${\mathcal F}$ consists of only one digraph F, we write F-free instead of {F}-free. For undirected graphs we use this notation as well.

Lemma 9

Let G = (V,E) be a triangle free graph, i.e. a C₃-free graph, with |E|≥ 2, such that Δ(G) = ℓ and $G^{\prime }=(V,A)$ be an orientation of G. Then for k = |E| it holds $G^{\prime }\in S_{k,\ell }$ but for $k^{\prime }<k$ or $\ell ^{\prime }<\ell $ it holds $G^{\prime }\not \in S_{k^{\prime },\ell ^{\prime }}$.

Proof 7

Let G and $G^{\prime }$ be as in the statement of the lemma. Further let Q be a set of sequences such that $G^{\prime }=\textit {g}(Q)$. If some sequence q_i ∈ Q defines more than one arc of $G^{\prime }$, then q_i has to contain at least three items of pairwise distinct types. This would induce a C₃ in G, which is not possible by our assumption. Thus we have to represent every arc (u,v) of $G^{\prime }$ by one sequence [u,v]. Thus $G^{\prime }$ can not be defined by less than k = |E| sequences or less than ℓ = Δ(G) items for every type. □

Since for every k ≥ 2 and every ℓ = 2,…,k there is a tree T on k edges and Δ(T) = ℓ we know by Lemma 9 that for k ≥ 2 and ℓ = 2,…,k we have $S_{k,\ell -1} \subsetneq S_{k, \ell }$. Further by Observation 2 we know that for k ≥ 2 and ℓ ≥ 2k it holds S_k,ℓ = S_k,ℓ+ 1.

Corollary 4

For k ≥ 2 the following inclusions hold.

$$ S_{k, 1} \subsetneq S_{k, 2} \subsetneq {\ldots} \subsetneq S_{k,k} \subseteq S_{k,k+1}\subseteq {\ldots} \subseteq S_{k,2k} = S_{k,2k+1} = \ldots $$

Proposition 2

Let G ∈ S_k,ℓ, then for every induced subdigraph H of G it holds H ∈ S_k,ℓ.

Proof 8

A set of sequences $Q^{\prime }$ for an induced subdigraph H can be obtained from a set Q of the original digraph G by restricting $Q^{\prime }$ to the items which are destinated for types corresponding to vertices of H. □

Graph classes which are closed under taking induced subgraphs are called hereditary. Hereditary graph classes are exactly those classes which can be defined by a set of forbidden induced subgraphs. On the other hand, the classes S_k,ℓ are not closed under taking arbitrary subgraphs by the following example.

Example 3

1.
For every n ≥ 1 and ℓ ≥ 2 it holds K_n⃡ ∈ S_1,ℓ, which can be verified by set Q = {q₁}, where
$$ q_{1}=[v_{1},v_{2},\ldots,v_{n},v_{1},v_{2},\ldots,v_{n}]. $$
2.
Every orientation $T^{\prime }_{n,d}$ of a tree T_n,d on n vertices and Δ(T) = d ≤ n − 1 can be obtained as a subdigraph of K_n⃡. Since by Lemma 9 for n ≥ 3 it holds $T^{\prime }_{n,d}\in S_{n-1,d}$ but $T^{\prime }_{n,d}\not \in S_{n-2,d}$ and $T^{\prime }_{n,d}\not \in S_{n-1,d-1}$ the classes S_k,ℓ for each k ≥ 1 and ℓ ≥ 2 are not closed under taking arbitrary subgraphs.
3.
By Theorem 2 digraph TT_k which consists of the disjoint union of k transitive tournaments is in S_k,1 − S_k− 1,1.
4.
If we remove exactly one edge from TT_k we obtain a digraph $TT^{\prime }_{k}$ such that $TT^{\prime }_{k}\not \in S_{k,1}$. Thus the classes S_k,1 for every k ≥ 1 are not closed under taking arbitrary subgraphs.

For orientations of paths and cycles Lemma 9 leads to the following bounds.

Lemma 10

1.
Let $P^{\prime }_{n}$ be an orientation of a path P_n on n ≥ 2 vertices, e.g. $P^{\prime }_{n}=\overrightarrow {P_{n}}$ for n ≥ 2. Then for every ℓ ≥ 2 it holds $P^{\prime }_{n}\in S_{n-1, \ell }$. But for every ℓ ≥ 1 it holds $P^{\prime }_{n}\not \in S_{n-2, \ell }$.
2.
Let $C^{\prime }_{n}$ be an orientation of a cycle C_n on n ≥ 4 vertices, e.g. $C^{\prime }_{n}=\overrightarrow {C_{n}}$ for n ≥ 4. Then for every ℓ ≥ 2 it holds $C^{\prime }_{n}\in S_{n, \ell }$. But for every ℓ ≥ 1 it holds $C^{\prime }_{n}\not \in S_{n-1, \ell }$.

By Lemma 10 and Proposition 2 we obtain the following results.

Proposition 3

1.
Let ℓ ≥ 1, k ≥ 1, and G ∈ S_k,ℓ, then G has no induced path $\overrightarrow {P_{k^{\prime }}}$ for $k^{\prime }\geq k+2$.
2.
Let ℓ ≥ 1, k ≥ 3, and G ∈ S_k,ℓ, then G has no induced cycle $\overrightarrow {C_{k^{\prime }}}$ for $k^{\prime }\geq k+1$.

4.2 Characterizations of Sequence Digraphs for k = 1 or ℓ = 1

By Proposition 2 all the classes S_k,ℓ are hereditary and thus can be characterized by a set of forbidden induced subgraphs. In this section we even show a finite set of forbidden induced subgraphs for all classes S_k,ℓ where ℓ = 1 and for all classes where k = 1. These characterizations lead to polynomial time recognition algorithms for the corresponding graph classes. Furthermore we give characterizations in terms of special tournaments and conditions for the complement digraph.

4.2.1 Sequence Digraphs for k = 1 and ℓ = 1

Regarding the algorithmic use of special digraphs it is often helpful to know whether they are transitive. A digraph G = (V,A) is called transitive if for every pair (u,v) ∈ A and (v,w) ∈ A of arcs with u≠w the arc (u,w) also belongs to A.

Lemma 11

Every digraph in S_1,1 is transitive.

Proof 9

Let Q = {q₁} be some set of one sequence. If for every type there is only one item in q₁, then two arcs (u,v) and (v,w) can only be defined in g(Q) if q₁ contains some item b_1,i of type u, some item $b_{1,i^{\prime }}$ of type v, and some item $b_{1,i^{\prime \prime }}$ of type w such that $i<i^{\prime }$ and $i^{\prime }<i^{\prime \prime }$. By the definition of the sequence digraph g(Q) also has the arc (u,w). □

For some digraph G and some integer d let dG be the disjoint union of d copies of G.

Theorem 1

For every digraph G the following statements are equivalent.

1.
G ∈ S_1,1
2.
G is a transitive tournament.
3.
G is an acyclic tournament.
4.
G is a $\overrightarrow {C_{3}}$-free tournament.
5.
G is a tournament with exactly one Hamiltonian path.
6.
G is a tournament and every vertex in G has a different outdegree, i.e. {outdegree(v)∣v ∈ V } = {0,…,|V |− 1}.
7.
G is $\{2K_{1\&\#x20E1;},K_{2\&\#x20E1;},\overrightarrow {C_{3}}\}$-free.
8.
G can be constructed from the one-vertex graph K₁ by repeatedly adding an out-dominating vertex or by repeatedly adding an in-dominated vertex.

Proof 10

The equivalence of (2)–(6) is known from [9, Chapter 9].

(1) ⇒ (2) By Lemma 11 digraph in S_1,1 is transitive and by definition of S_1,1 digraph G is a tournament.

(3) ⇒ (1) Every acyclic digraph G has a source, i.e. a vertex v₁ of indegree 0, see [7]. Since G is a tournament there is an arc (v₁,v) for every vertex v of G, i.e. v₁ is an out-dominating vertex. By removing v₁ from G, we obtain a transitive tournament G¹ which leads to an out-dominating vertex v₂. By removing v₂ from G¹, we obtain a transitive tournament G² which leads to an out-dominating vertex v₃ and so on. The sequence [v₁,v₂,…,v_n] shows that G ∈ S_1,1.

(4) ⇔ (7) Obviously.

(1) ⇔ (8) holds by definition. □

By the method is given in part (3) ⇒ (1) of the proof of Theorem 1 we obtain the following result.

Proposition 4

Let G = (V,A) ∈ S_1,1, then a set Q of one sequence q, such that G = g(Q) can be found in time $\mathcal {O}(|V|+|A|)$.

4.2.2 Sequence Digraphs for ℓ = 1

The sequence digraph g(Q) = (V,A) for a set Q = {q₁,…,q_k} can be obtained by the union of g({q_i}) = (V_i,A_i), 1 ≤ i ≤ k by $V=\cup _{i=1}^{k} V_{i}$ and $A=\cup _{i=1}^{k} A_{i}$. Since for digraphs in S_k,1 the vertex sets V_i = types(q_i) are disjoint, we can follow the next lemma.

Lemma 12

For every integer k every digraph in S_k,1 is the disjoint union of k digraphs in S_1,1.

This lemma allows us to generalize Lemma 11 to k sequences.

Corollary 5

For every integer k every digraph in S_k,1 is transitive.

On the other hand none of the sets S_k,1 contains all transitive digraphs. Therefore let $T^{\prime }$ be an orientation of a tree T with k + 1 edges, such that for every vertex v in $T^{\prime }$ either the indegree or outdegree is 0. Such an orientation is possible for every tree T and leads to a transitive graph $T^{\prime }$. By Lemma 9 we know that $T^{\prime }\not \in S_{k,1}$.

Theorem 1 can be generalized to k ≥ 1 sequences as follows.

Theorem 2

For every digraph G and every integer k ≥ 1 the following statements are equivalent.

1.
G ∈ S_k,1.
2.
G is the disjoint union of k digraphs from S_1,1.
3.
G is the disjoint union of k transitive tournaments.
4.
G is the disjoint union of k acyclic tournaments.
5.
G is the disjoint union of k $\overrightarrow {C_{3}}$-free tournaments.
6.
G is $\{(k+1)K_{1\&\#x20E1;},K_{2\&\#x20E1;},\overrightarrow {C_{3}}\}$-free.

Proposition 5

Let G = (V,A) ∈ S_k,1, then a set Q on k sequences, such that G = g(Q) can be found in time $\mathcal {O}(|V|+|A|)$.

Proof 11

If G ∈ S_k,1 then G is the disjoint union of k digraphs from S_1,1. For every of them we can define a sequence q_i by Proposition 4. □

4.2.3 Sequence Digraphs for k = 1

The next examples show that for ℓ ≥ 2 items for each type even one sequence can define digraphs which are not transitive.

Example 4

1.
The digraph D₀ in Table 2 is not transitive, since it has among others the arcs (b,c) and (c,a) but not the arc (b,a). Further D₀ belongs to the set S_1,2, since it can be defined by set Q = {q₁} of one sequence q₁ = [c,a,b,c].
2.
The digraph D₆ in Table 2 is not transitive, since it has among others the arcs (c,b) and (b,a) but not the arc (c,a). Further D₆ belongs to the set S_1,2, since it can be defined by set Q = {q₁} of one sequence q₁ = [a,b,a,c,b,d,c,d].

In order to show a characterization for the class S_1,2 we next prove some lemmas.

Lemma 13

Let ℓ ≥ 1 and G ∈ S_1,ℓ, then the complement digraph co-G is transitive.

Proof 12

Let Q = {q₁} be some sequence which defines the sequence digraph g(Q) and (u,v) and (v,w) with u≠w be two arcs of co-(g(Q)). By Lemma 1 (2) we conclude that last(q₁,v) < first(q₁,u) and last(q₁,w) < first(q₁,v) thus we have last(q₁,w) < first(q₁,v) ≤last(q₁,v) < first(q₁,u) and last(q₁,w) < first(q₁,u) implies by Lemma 1 (2) that arc (u,w) is in digraph co-(g(Q)). □

Lemma 14

Let ℓ ≥ 1 and G ∈ S_1,ℓ, then the complement digraph co-G is $2\overrightarrow {P_{2}}$-free.

Proof 13

If we assume that co-G contains $2\overrightarrow {P_{2}} =(\{a,b,c,d\},\{(c,a),(d,b)\})$, see Table 2, as an induced subdigraph, then G has (among others) the arcs (c,b) and (d,a). By applying Lemma 1 (1) and (2) we obtain last(q₁,a) < first(q₁,c) < last(q₁,b) < first(q₁,d) < last(q₁,a), which leads to a contradiction. □

Lemma 15

Let G be a semicomplete $\{\overrightarrow {C_{3}},D_{4}\}$-free digraph on n vertices, then G has a vertex v such that outdegree(v) = n − 1 and a vertex $v^{\prime }$ such that $\text {indegree}(v^{\prime })=n-1$. (D₄ is shown in Table 2.)

Proof 14

We show the result by induction on the number of vertices n in digraph G. For digraphs on n ≤ 3 vertices the statement is true.^{Footnote 4}

Let n ≥ 3 and G_n+ 1 = (V_n+ 1,A_n+ 1) be a semicomplete $\{\overrightarrow {C_{3}},D_{4}\}$-free digraph on n + 1 vertices. Further let w ∈ V_n+ 1 and G_n = G_n+ 1[V_n+ 1 −{w}]. Then G_n = (V_n,A_n) is a semicomplete $\{\overrightarrow {C_{3}},D_{4}\}$-free digraph on n vertices. By the induction hypothesis there is some v ∈ V_n such that outdegree(v) = n − 1 in G_n.

If (v,w) ∈ A_n+ 1 then outdegree(v) = n in G_n+ 1 and the proof is done. If (v,w)∉A_n+ 1 then the semicompleteness of G_n+ 1 implies that (w,v) ∈ A_n+ 1. Then for every (v,u) ∈ E_n it holds (w,u) ∈ E_n+ 1. Otherwise the semicompleteness of G_n+ 1 implies that (u,w) ∈ E_n+ 1 and thus G_n+ 1[{u,v,w}] leads to a $\overrightarrow {C_{3}}$ or to a D₄. Thus in G_n+ 1 it holds outdegree(w) = outdegree(v) + 1 = n − 1 + 1 = n.

The proof for the existence of a vertex $v^{\prime }$ such that $\text {indegree}(v^{\prime })=n-1$ is similar. □

Lemma 16

Let G be a semicomplete $\{\overrightarrow {C_{3}},D_{4}\}$-free digraph, then its complement digraph co-G is transitive. (D₄ is shown in Table 2.)

Proof 15

Let (u,v),(v,w) ∈ A^c be two arcs of co-G = (V,A^c). Since G = (V,A) is semicomplete we know that (v,u),(w,v)∉A^c. If G is a $\{\overrightarrow {C_{3}},D_{4}\}$-free digraph, then co-G is a $\{\overrightarrow {C_{3}},\overrightarrow {P_{3}}\}$-free digraph. Thus u and w are connected either only by (u,w) ∈ A^c or by (u,w) ∈ A^c and (w,u) ∈ A^c, which implies that co-G is transitive. □

A spanning subdigraph is a subdigraph obtained by the deletion of arcs.

Lemma 17

Every semicomplete $\{\overrightarrow {C_{3}},D_{4}\}$-free digraph has a spanning transitive tournament subdigraph. (D₄ is shown in Table 2.)

Proof 16

By Lemma 15 every semicomplete $\{\overrightarrow {C_{3}},D_{4}\}$-free digraph G on n vertices has a vertex v₁ such that outdegree(v₁) = n − 1. By removing v₁ from G, we obtain a semicomplete $\{\overrightarrow {C_{3}},D_{4}\}$-free digraph G¹ on n − 1 vertices, which leads to a vertex v₂ such that outdegree(v₂) = n − 2. By removing v₂ from G¹, we obtain a semicomplete $\{\overrightarrow {C_{3}},D_{4}\}$-free digraph G² on n − 2 vertices, which leads to a vertex v₃ such that outdegree(v₃) = n − 3 and so on. The sequence q = [v₁,v₂,…,v_n] defines a digraph g({q}) ∈ S_1,1 which is a spanning transitive tournament subdigraph of G. □

These results allow us to show the following characterizations for the class S_1,2. Since we use several forbidden induced subdigraphs the semicompleteness is expressed by excluding 2K_1⃡.

Theorem 3

For every digraph G the following statements are equivalent.

1.
G ∈ S_1,2
2.
G ∈ S_1,ℓ for some ℓ ≥ 2
3.
co-G is transitive, co-G is $2\overrightarrow {P_{2}}$-free, and G has a spanning transitive tournament subdigraph.
4.
G is $\{\text {co-}(2\overrightarrow {P_{2}}),2K_{1\&\#x20E1;},\overrightarrow {C_{3}},D_{4}\}$-free. (D₄ is shown in Table 2.)

Proof 17

(1) ⇒ (2) By definition.

(2) ⇒ (1) By Lemma 8.

(3) ⇒ (4) If G has a spanning transitive tournament subdigraph then it is semicomplete and thus 2K_1⃡-free. If co-G is transitive it has no induced $\overrightarrow {C_{3}}$ and no $\overrightarrow {P_{3}}$. Thus G has no induced $\overrightarrow {C_{3}}$ and D₄. (4) ⇒ (3) Follows by Lemmas 16 and 17.

(1) ⇒ (3) Digraph co-G is transitive by Lemma 13 and $2\overrightarrow {P_{2}}$-free by Lemma 14. Further if G is defined by sequence q then the subsequence F(q) which is obtained from q by removing all except the first item for each type leads to a subdigraph $(V,A^{\prime })$ which is a transitive tournament.

(3) ⇒ (2) Let $G^{\prime }=(V,A^{\prime })$ be a subdigraph of G = (V,A) which is a transitive tournament. By Theorem 1 we know that $G^{\prime }\in S_{1,1}$ and thus there is some sequence

$$ q^{\prime}=[v_{1},\ldots,v_{n}] $$

such that $\textit {g}(\{q^{\prime }\})=G^{\prime }$. If $A^{\prime }=A$ we know that $G\in S_{1,1}\subseteq S_{1,\ell }$ for every ℓ ≥ 2. So we can assume that $A^{\prime }\subsetneq A$. Obviously for every arc $(v_{i},v_{j})\in A-A^{\prime }$ there are two positions j < i in

$$ q^{\prime}=[v_{1},\ldots,v_{j},\ldots, v_{i},\ldots, v_{n}]. $$

In order to define a subdigraph of G which contains all arcs of $G^{\prime }$ and arc (v_i,v_j) we can insert (cf. Section 3.1 for the definition of inserting an item) an additional item for type v_i on position k ≤ j, or an additional item for type v_j on position k > i, or first an additional item for type v_j and then an additional item for type v_i on a position k, j < k ≤ i, into $q^{\prime }$ without creating an arc which is not in A. This is possible if and only if there is some position k, j ≤ k ≤ i, in

$$ q^{\prime}=[v_{1},\ldots,v_{j},\ldots, v_{m^{\prime}}, \ldots, v_{k}, \ldots, v_{m^{\prime\prime}}, \ldots, v_{i}, \ldots, v_{n}] $$

such that for every $m^{\prime }$, j < m^′≤ k, it holds $(v_{m^{\prime }},v_{j})\in A$ and for every $m^{\prime \prime }$, $k\leq m^{\prime \prime }< i$, it holds $(v_{i},v_{m^{\prime \prime }})\in A$.

If it is possible to insert all arcs of $A-A^{\prime }$ by adding a set of additional items into sequence $q^{\prime }$ resulting in a sequence q such that G = g(q), then we have G ∈ S_1,ℓ for some ℓ ≥ 2. Next we show a condition using the new items of every single arc of $A-A^{\prime }$ independently from each other.

Claim 1

If for every arc $(v_{i},v_{j})\in A-A^{\prime }$ there is a position k, j < k ≤ i such that first inserting an additional item for type v_j and then an additional item for type v_i at position k into $q^{\prime }$ defines a subdigraph of G which contains all arcs of $G^{\prime }$ and arc (v_i,v_j), then G ∈ S_1,ℓ for some ℓ ≥ 2.

Proof 18

(of Claim 1) Every single arc $(v_{i},v_{j})\in A-A^{\prime }$ can be inserted by first inserting first an additional item for type v_j and then an additional item for type v_i at some position k, j < k ≤ i. Next we show that the new inserted items for two missing arcs $(v_{i},v_{j})\in A-A^{\prime }$ at position k and $(v_{i^{\prime }},v_{j^{\prime }})\in A-A^{\prime }$ at position $k^{\prime }$ do not create an arc which is not in A. This is done by a case distinction w.r.t. the 34 possible positions of $i,i^{\prime },j,j^{\prime },k,k^{\prime }$ shown in Table 3.

1.
If $j^{\prime }<k^{\prime }<i^{\prime }<j<k<i$, then $(v_{j^{\prime }},v_{i})\in A$, $(v_{i^{\prime }},v_{i})\in A$, $(v_{j^{\prime }},v_{j})\in A$, and $(v_{i^{\prime }},v_{j})\in A$.
2.
If $j^{\prime }<k^{\prime }<i^{\prime }=j<k<i$, then $(v_{j^{\prime }},v_{i})\in A$, $(v_{i^{\prime }},v_{i})\in A$, and $(v_{j^{\prime }},v_{j})\in A$.
3.
If $j^{\prime }<k^{\prime }<j<i^{\prime }<k<i$, then $(v_{j^{\prime }},v_{i})\in A$, $(v_{i^{\prime }},v_{i})\in A$, $(v_{j^{\prime }},v_{j})\in A$, and $(v_{i^{\prime }},v_{j})\in A$.
4.
If $j^{\prime }<k^{\prime }<j<k<i^{\prime }<i$, then $(v_{j^{\prime }},v_{i})\in A$, $(v_{i^{\prime }},v_{i})\in A$, $(v_{j^{\prime }},v_{j})\in A$, and $(v_{i^{\prime }},v_{j})\in A$.
5.
If $j^{\prime }<k^{\prime }<j<k<i^{\prime }=i$, then $(v_{j^{\prime }},v_{i})\in A$, $(v_{i^{\prime }},v_{i})\in A$, $(v_{j^{\prime }},v_{j})\in A$, and $(v_{i^{\prime }},v_{j})\in A$.
6.
If $j^{\prime }<k^{\prime }<j<k<i<i^{\prime }$, then $(v_{j^{\prime }},v_{i})\in A$, $(v_{i^{\prime }},v_{i})\in A$, $(v_{j^{\prime }},v_{j})\in A$, and $(v_{i^{\prime }},v_{j})\in A$.
7.
If $j^{\prime }<j<k^{\prime }<i^{\prime }<k<i$, then $(v_{j^{\prime }},v_{i})\in A$, $(v_{i^{\prime }},v_{i})\in A$, $(v_{j^{\prime }},v_{j})\in A$, and $(v_{i^{\prime }},v_{j})\in A$.
8.
If $j^{\prime }<j<k^{\prime }<k<i^{\prime }<i$, then $(v_{j^{\prime }},v_{i})\in A$, $(v_{i^{\prime }},v_{i})\in A$, $(v_{j^{\prime }},v_{j})\in A$. But arc $(v_{i^{\prime }},v_{j})$ does not belong to A because of the positions of $i,i^{\prime },j,j^{\prime }$. Next we show that $(v_{i^{\prime }},v_{j})\in A$ using $(v_{i},v_{j^{\prime }})$ which also does not belong to A because of the positions of $i,i^{\prime },j,j^{\prime }$. If $(v_{i^{\prime }},v_{j})\not \in A$ and $(v_{i},v_{j^{\prime }})\not \in A$ then $j^{\prime }<j<k^{\prime }<k<i^{\prime }<i$ implies that co-G has an induced $2\overrightarrow {P_{2}}$ which is not possible by our assumption. Thus we know that $(v_{i^{\prime }},v_{j})\in A$ or $(v_{i},v_{j^{\prime }})\in A$. If $(v_{i},v_{j^{\prime }})\in A$ we can exchange the positions of $k^{\prime }$ and k (which implies that we are in case 11) or we also have $(v_{i^{\prime }},v_{j})\in A$.
9.
If $j^{\prime }<j<k^{\prime }<k<i^{\prime }=i$, then $(v_{j^{\prime }},v_{i})\in A$, $(v_{j^{\prime }},v_{j})\in A$, and $(v_{i^{\prime }},v_{j})=(v_{i},v_{j})\in A$.
10.
If $j^{\prime }<j<k^{\prime }<k<i<i^{\prime }$, then $(v_{j^{\prime }},v_{i})\in A$, $(v_{i^{\prime }},v_{i})\in A$, $(v_{j^{\prime }},v_{j})\in A$, and $(v_{i^{\prime }},v_{j})\in A$ (same situation as 8).
11.
If $j^{\prime }<j<k<k^{\prime }<i^{\prime }<i$, then $(v_{j},v_{i^{\prime }})\in A$, $(v_{i},v_{i^{\prime }})\in A$, $(v_{j},v_{j^{\prime }})\in A$. But arc $(v_{i},v_{j^{\prime }})$ does not belong to A because of the positions of $i,i^{\prime },j,j^{\prime }$. Next we show that $(v_{i},v_{j^{\prime }})\in A$ using $(v_{i^{\prime }},v_{j})$ which also does not belong to A because of the positions of $i,i^{\prime },j,j^{\prime }$. If $(v_{i},v_{j^{\prime }})\not \in A$ and $(v_{i^{\prime }},v_{j})\not \in A$ then $j^{\prime }<j<k<k^{\prime }<i^{\prime }<i$ implies that co-G has an induced $2\overrightarrow {P_{2}}$ which is not possible by our assumption. Thus we know that $(v_{i},v_{j^{\prime }})\in A$ or $(v_{i^{\prime }},v_{j})\in A$. If $(v_{i^{\prime }},v_{j})\in A$ we can exchange the positions of $k^{\prime }$ and k (which implies that we are in case 8) or we also have $(v_{i},v_{j^{\prime }})\in A$.
12.
If $j^{\prime }<j<k<k^{\prime }<i^{\prime }=i$, then $(v_{j},v_{i^{\prime }})\in A$, $(v_{j},v_{j^{\prime }})\in A$, and $(v_{i},v_{j^{\prime }})=(v_{i^{\prime }},v_{j^{\prime }})\in A$.
13.
If $j^{\prime }<j<k<k^{\prime }<i<i^{\prime }$, then $(v_{j},v_{i^{\prime }})\in A$, $(v_{i},v_{i^{\prime }})\in A$, $(v_{j},v_{j^{\prime }})\in A$, and $(v_{i},v_{j^{\prime }})\in A$ (same situation as 11).
14.
If $j^{\prime }<j<k<i<k^{\prime }<i^{\prime }$, then $(v_{j},v_{i^{\prime }})\in A$, $(v_{i},v_{i^{\prime }})\in A$, $(v_{j},v_{j^{\prime }})\in A$, and $(v_{i},v_{j^{\prime }})\in A$.
15.
If $j^{\prime }=j<k^{\prime }<i^{\prime }<k<i$, then $(v_{j^{\prime }},v_{i})\in A$, $(v_{i^{\prime }},v_{i})\in A$, and $(v_{i^{\prime }},v_{j})\in A$.
16.
If $j^{\prime }=j<k^{\prime }<k<i^{\prime }<i$, then $(v_{j^{\prime }},v_{i})\in A$, $(v_{i^{\prime }},v_{i})\in A$, and $(v_{i^{\prime }},v_{j})=(v_{i^{\prime }},v_{j^{\prime }})\in A$.
17.
If $j^{\prime }=j<k^{\prime }<k<i<i^{\prime }$, then $(v_{j^{\prime }},v_{i})\in A$, $(v_{i^{\prime }},v_{i})\in A$, and $(v_{i^{\prime }},v_{j})=(v_{i^{\prime }},v_{j^{\prime }})\in A$.

Table 3 Cases within the proof of Claim 1

Full size table

Every remaining case i ∈{18,…,34} is symmetric to shown case 35 − i. Thus for every two missing arcs $(v_{i},v_{j})\in A-A^{\prime }$ and $(v_{i^{\prime }},v_{j^{\prime }})\in A-A^{\prime }$ which are inserted by additionally items all arcs between these items belong to A.

Now we can show the claim by induction on the number $m=|A-A^{\prime }|$ of missing arcs. Let a₁,…,a_m be an arbitrary order of the missing arcs. For m = 1 we can insert the missing arc by the assumption of the claim. Let m ≥ 1, q_m be the sequence obtained from $q^{\prime }$ by inserting m missing arcs, and G_m = g(q_m) = (V,A_m). In order to insert arc a_m+ 1 = (v_i,v_j) we neglect the 2m new items in q_m and consider only the items of $q^{\prime }$ for determining a position $k_{a_{m+1}}$, $j<k_{a_{m+1}}\leq i$, such that a_m+ 1 can be inserted into $G^{\prime }$ by first inserting an additional item for type v_j and then an additional item for type v_i at position $k_{a_{m+1}}$. The existence of this position follows by the assumption of the claim. The insertion of the two new items at position $k_{a_{m+1}}$ for arc a_m+ 1 = (v_i,v_j) defines the following sets of arcs.

1.
A_1,m+ 1 = {(v_i,v_j)}
2.
Set A_2,m+ 1 of all arcs from vertices corresponding to items of $q^{\prime }$ on position $\leq k_{a_{m+1}}$ to v_i and to v_j and all arcs from v_i and from v_j to all vertices corresponding to items of $q^{\prime }$ on position $> k_{a_{m+1}}$.
3.
Set A_3,m+ 1 of all arcs from vertices corresponding to new items of q_m on positions $k_{a_{j}}$, j ≤ m, to v_i and to v_j and all arcs from v_i and from v_j to all vertices corresponding to new items of q_m on positions $k_{a_{j}}$, j ≥ m.

It remains to show that $A_{1,m+1}\cup A_{2,m+1} \cup A_{3,m+1} \subseteq A$. $A_{1,m+1} \subseteq A$ holds by definition and $A_{2,m+1} \subseteq A$ holds by the assumption of the claim. Set A_3,m+ 1 is equal to the union of all sets of arcs between the vertices corresponding to the two new inserted items for a_m+ 1 and the vertices corresponding to the two new inserted items for every a_j, j ≤ m following the order of the items. As our case distinction above implies that all arcs between the new items inserted at any a_j, j ≤ m and the new items inserted at a_m+ 1 are in A the union of all these arcs A_3,m+ 1 is also in A.

So we can insert a_m+ 1 by first inserting an additional item for type v_j and then an additional item for type v_i at position $k^{\prime }_{a_{m+1}}$, where $k^{\prime }_{a_{m+1}}$ is the position in q_m which corresponds to position $k_{a_{m+1}}$ in $q^{\prime }$.

Thus we have shown that g({q_m}) = G ∈ S_1,ℓ for some ℓ ≥ 2. □

Assume that G∉S_1,ℓ for every ℓ ≥ 2. By Claim 1 there is some arc $(v_{i},v_{j})\in A-A^{\prime }$ such that for every position k, j < k ≤ i inserting an additional item for type v_i and an additional item for type v_j at position k defines an arc which is not in A. That is, for every position k, j < k ≤ i, in $q^{\prime }$ there exists some $m^{\prime }$, j < m^′≤ k, such that it holds $(v_{m^{\prime }},v_{j})\not \in A$ or there exists some $m^{\prime \prime }$, $k\leq m^{\prime \prime }< i$, such that it holds $(v_{i},v_{m^{\prime \prime }})\not \in A$. By the transitivity of co-G it follows that there is one position k, j < k ≤ i, in $q^{\prime }$ such that there exists some $m^{\prime }$, j < m^′≤ k, such that it holds $(v_{m^{\prime }},v_{j})\not \in A$ and there exists some $m^{\prime \prime }$, $k\leq m^{\prime \prime }< i$, such that it holds $(v_{i},v_{m^{\prime \prime }})\not \in A$.

If co-G = (V,A^c) is the complement digraph of G we know that

$$ (v_{m^{\prime}},v_{j})\in A^{c} \text{ and } (v_{i},v_{m^{\prime\prime}})\in A^{c}. $$

(3)

Since $m^{\prime }\leq m^{\prime \prime }$ we know that $(v_{m^{\prime }},v_{m^{\prime \prime }})\in A$. Further we know that $(v_{m^{\prime \prime }},v_{m^{\prime }})\in A$, since otherwise $(v_{m^{\prime \prime }},v_{m^{\prime }})\in A^{c}$, property (3), and the transitivity of co-G would imply that (v_i,v_j) ∈ A^c which is not possible. Thus we know that

$$ (v_{m^{\prime}},v_{m^{\prime\prime}})\not \in A^{c} \text{ and } (v_{m^{\prime\prime}},v_{m^{\prime}})\not\in A^{c}. $$

(4)

Further the arcs $(v_{j},v_{m^{\prime }}),(v_{j},v_{m^{\prime \prime }}),(v_{m^{\prime }},v_{i}),(v_{m^{\prime \prime }},v_{i})$ belong to $A^{\prime }\subseteq A$ and thus

$$ (v_{j},v_{m^{\prime}})\not \in A^{c}, \text{ } (v_{j},v_{m^{\prime\prime}})\not\in A^{c}, \text{ } (v_{m^{\prime}},v_{i})\not\in A^{c}\text{ and } (v_{m^{\prime\prime}},v_{i})\not\in A^{c}. $$

(5)

If $(v_{i},v_{m^{\prime }})\in A^{c}$ or $(v_{m^{\prime \prime }},v_{j})\in A^{c}$ then (3) and the transitivity of co-G would imply that (v_i,v_j) ∈ A^c, thus we know

$$ (v_{i},v_{m^{\prime}})\not \in A^{c} \text{ and } (v_{m^{\prime\prime}},v_{j})\not\in A^{c}. $$

(6)

Properties (3)–(6) imply that the subdigraph $(\{v_{i},v_{j},v_{m^{\prime }},v_{m^{\prime \prime }}\},\{(v_{i},v_{m^{\prime \prime }}),$ $(v_{m^{\prime }},v_{j})\})$ of co-G leads to a $2\overrightarrow {P_{2}}$, which implies that G contains a $\text {co-}(2\overrightarrow {P_{2}})$. □

Corollary 6

Every digraph in S_k,ℓ can be obtained by the union of at most k many digraphs from S_1,2 and thus by the union of at most k many $\{\text {co-}(2\overrightarrow {P_{2}}),2K_{1\&\#x20E1;},\overrightarrow {C_{3}},D_{4}\}$-free digraphs.

Proposition 6

Let G ∈ S_1,2, then a set Q of one sequence q, such that G = g(Q) can be found in time ${\mathcal {O}}(|V|+|A|)$.

Proof 19

Let G = (V,A) ∈ S_1,2 and q = []. We perform the following steps until G = (∅,∅).

Choose v ∈ V such that (v,u) ∈ A for all u ∈ V −{v} and append v to q.
Remove all arcs (v,u) from A.
If indegree(v) = outdegree(v) = 0, remove v from V.
If there are vertices u such that indegree(u) = outdegree(u) = 0, remove u from V and append u to q.

Next we show that for every G = (V,A) ∈ S_1,2 this method leads to a sequence q, such that G = g({q}) and for every type in types(q) there are at most two items in q.

In order to perform the algorithm there has to be an ordering v₁,…,v_n of V such that for 1 ≤ i < n vertex v_i has maximum possible outdegree in subdigraph obtained by removing the outgoing arcs of v₁,…,v_i− 1 and thereby created isolated vertices from G. Since G ∈ S_1,2 there is a sequence $q^{\prime }$ such that $G=\textit {g}(\{q^{\prime }\})$. The order in which the types corresponding to the vertices of V appear in subsequence $F(q^{\prime })$, defined in the proof of Theorem 3, ensures the existence of such an ordering.

By performing the operations of the algorithm along such a vertex ordering we observe that in every iteration only the outdegree of v is changed and only vertices with defined incoming and outgoing arcs are removed. Thus by the definition of the sequence digraph it holds G = g({q}). Since every vertex which has only outgoing or only incoming arcs will be inserted once into q and every vertex which has outgoing and incoming arcs will be inserted at most twice into q this sequence fullfils the properties stated in the theorem. □

Example 5

We apply the method given in the proof of Proposition 6 on the digraph D₆ in Table 2 and obtain the sequence

$$ q=[b,a,c,b,d,c]. $$

This even leads to a shorter sequence than we used in Example 4 (2).

4.3 Converse Sequence Digraphs and Complements of Sequence Digraphs

For some sequence $q_{i}=(b_{i,1},\ldots ,b_{i,n_{i}})$ we define by $q^{-1}_{i}=(b_{i,n_{i}},\ldots ,b_{i,1})$ its converse sequence and for some set Q we define by $Q^{-1}=\{q^{-1}_{i} \mid q_{i}\in Q\}$ the set of converse sequences. Next we state a close relation between the sequence digraph of some set Q and of the converse digraph of the sequence digraph of Q^− 1.

Lemma 18

Let Q be a set of sequences. Then it holds G = g(Q) if and only if con-G = g(Q^− 1).

This relation implies that the classes S_k,ℓ are closed under taking converse digraphs.

Proposition 7

Let G ∈ S_k,ℓ, then con-G ∈ S_k,ℓ.

Next we want to address the relationship between the complement digraph of a sequence digraph g(Q) for some set Q and the sequence digraph g(Q^− 1) of the converse sequences Q^− 1. In general these digraphs are different. This can be verified by the two sequence digraphs considered in Example 4. Under certain conditions for some set of sequences Q the complement digraph of g(Q) can be obtained by the sequence digraph of the converse sequences Q^− 1, which is useful since Q^− 1 has the same number of sequences and the same number of items per types as Q.

Lemma 19

Digraph G is a tournament if and only if co-G = con-G.

Proof 20

Digraph G is a tournament, if and only if between every two vertices of G there is exactly one arc, if and only if the converse digraph con-G corresponds to the complement digraph co-G. □

Lemma 20

Let G ∈ S_k,ℓ defined by some set of sequences Q, such that graph g(Q) is a tournament, then co-(g(Q)) = g(Q^− 1).

Proof 21

Since g(Q) is a tournament by Lemma 19 it holds co-(g(Q)) = con-(g(Q)) and by Lemma 18 it holds con-(g(Q)) = g(Q^− 1). □

Since Q and Q^− 1 are defined on the same number of sequences and items of each type the following result holds.

Corollary 7

Let G ∈ S_k,ℓ be a tournament, then co-G ∈ S_k,ℓ.

Since the class S_1,1 is equivalent to the class of transitive tournaments (Theorem 1), the last two results hold for digraphs in S_1,1.

Lemma 21

Let G ∈ S_1,1 defined by some set Q = {q₁} of one sequence, then co-(g(Q)) = g(Q^− 1).

Proposition 8

Let G ∈ S_1,1, then co-G ∈ S_1,1.

The results can not be generalized to graphs from S_1,2 and also not sets Q on more than one sequence since these are not tournaments in general.

Beside transitivity shown in Lemma 13 we observe that the complements digraphs of sequence digraphs are special directed acyclic graphs.

Lemma 22

Let ℓ ≥ 1 and G ∈ S_1,ℓ, then the complement digraph co-G is acyclic.

Proof 22

Let Q = {q₁} be some sequence which defines the sequence digraph g(Q). If the complement digraph co-(g(Q)) contains a cycle ({v₁,…,v_n},{(v₁,v₂),…, (v_n− 1,v_n),(v_n,v₁)}), n ≥ 2, then by Lemma 13 vertex set {v₁,…,v_n} induces a bidirectional complete subdigraph in digraph co-(g(Q)). Thus the vertex set {v₁,…,v_n} induces an edgeless subdigraph in g(Q) which contradicts Lemma 7. □

It is well known that every DAG has at least one sink, i.e. a vertex v₁ of outdegree 0, and one source, i.e. a vertex v₂ of indegree 0 by [7]. This means by Lemma 22 that for every ℓ ≥ 1 every digraph G = (V,A) ∈ S_1,ℓ has a vertex v₁ of outdegree |V |− 1 and a vertex v₂ of indegree |V |− 1. For ℓ = 1 this is known by Theorem 1.

Lemma 23

Let ℓ ≥ 1 and G = (V,A) ∈ S_1,ℓ, u and v be two vertices from V, then the complement digraph co-G has at most one of the two arcs (u,v) and (v,u), i.e. co-G is an oriented graph.

Proof 23

Let G = (V,A) ∈ S_1,ℓ, u and v be two distinct vertices from V. Then G contains at least one of the two arcs (u,v) and (v,u) which implies that co-G has at most one of the two arcs (u,v) and (v,u). □

Proposition 9

Let ℓ ≥ 1 and G ∈ S_1,ℓ, then the complement digraph co-G has at most one non-trivial component which contains arcs.

Proof 24

If we assume that co-G has more than one non-trivial component which contains arcs, then by Lemma 23 digraph co-G contains $2\overrightarrow {P_{2}} =(\{a,b,c,d\},$ {(c,a),(d,b)}), see Table 2, as an induced subdigraph. Lemma 14 leads to a contradiction. □

Digraphs in S_2,1 consist of the disjoint union of two digraphs in S_1,1 (see Lemma 12), thus the corresponding complements have exactly one component. The complements of digraphs in S_2,2 can have more than one non-trivial component which contains arcs. This can be verified by digraph co-G ∈ S_2,2 in the following example, whose complement G has two non-trivial components which contains arcs.

Example 6

Let G be the disjoint union of two digraphs $G^{\prime }=(V^{\prime },A^{\prime })$ and $G^{\prime \prime }=(V^{\prime \prime },A^{\prime \prime })$ from S_1,1. Then G can be defined by Q = {q₁,q₂}, where $q_{1}=[v^{\prime }_{1},v^{\prime }_{2},\ldots ,v^{\prime }_{n}]$ and $q_{2}=[v^{\prime \prime }_{1},v^{\prime \prime }_{2},\ldots ,v^{\prime \prime }_{m}]$. W.l.o.g. we assume the vertices in $V^{\prime }$ and $V^{\prime \prime }$ to be enumerated in the order in which the graphs can be defined by inserting in-dominated vertices (cf. Theorem 1. (8)). Thus it holds G ∈ S_2,1.

The complement digraph co-G can be defined as follows. The digraph complement of a digraph in S_1,1 also belongs to S_1,1 and the additional arcs between $\text {co-} G^{\prime }$ and $\text {co-} G^{\prime \prime }$ can be defined by two sequences, one for the arcs from $\text {co-} G^{\prime }$ to $\text {co-} G^{\prime \prime }$ and one for the arcs from $\text {co-} G^{\prime \prime }$ to $\text {co-} G^{\prime }$. The arcs within $\text {co-} G^{\prime }$ and $\text {co-} G^{\prime \prime }$ can be defined in both sequences. Formally co-G can be defined by $Q^{\prime }=\{q^{\prime }_{1},q^{\prime }_{2}\}$, where

$$ q^{\prime}_{1}=\left[v^{\prime}_{n},v^{\prime}_{n-1},\ldots,v^{\prime}_{1}, v^{\prime\prime}_{m},v^{\prime\prime}_{m-1}\ldots,v^{\prime\prime}_{1}\right] $$

and

$$ q^{\prime}_{2}=\left[v^{\prime\prime}_{m},v^{\prime\prime}_{m-1}\ldots,v^{\prime\prime}_{1},v^{\prime}_{n},v^{\prime}_{n-1},\ldots,v^{\prime}_{1}\right]. $$

Thus it holds co-G ∈ S_2,2.

Next we want to study for some digraph G ∈ S_k,ℓ whether it holds co-G ∈ S_k,ℓ. By Proposition 8 the class S_1,1 is closed under complementations. The next example shows that the class S_1,2 is not closed under complementations and that the values of k and ℓ can grow arbitrary.

Example 7

We consider

$$ \overrightarrow{K_{n,m}}=(\{v_{1},\ldots,v_{n},w_{1},\ldots,w_{m}\}, \{(v_{i},w_{j}) \mid 1\leq i\leq n, 1\leq j \leq m\}) $$

as an orientation of a complete bipartite graph K_n,m. The complement digraph of $\overrightarrow {K_{n,m}}$ can be defined by one sequence

$$ q=[w_{1},w_{2},\ldots,w_{m},w_{1},w_{2},\ldots,w_{m},v_{1},v_{2},\ldots,v_{n},v_{1},v_{2},\ldots,v_{n}] $$

which implies that the complement digraph of $\overrightarrow {K_{n,m}}$ is in S_1,2. By Lemma 9 it holds $\overrightarrow {K_{n,m}}\in S_{n\cdot m,\max \limits (n,m)}$ but $\overrightarrow {K_{n,m}}\not \in S_{k,\ell }$ for k < n ⋅ m or $\ell <\max \limits (n,m)$.

Since $S_{1,2}\subseteq S_{k,\ell }$ for k ≥ 1 and ℓ ≥ 2 we know that the classes S_k,ℓ for every k ≥ 1 and ℓ ≥ 2 are not closed under complementations. The next examples show that S_2,1 is not closed under complementations.

Example 8

1.
Let G₁ be the disjoint union of two 1-vertex digraphs in S_1,1. Then G₁ ∈ S_2,1. Further co-G₁ = K_2⃡ ∈ S_1,2 − S_2,1.
2.
Let G_n for n ≥ 2 be the disjoint union of two n-vertex digraphs in S_1,1. Then G_n ∈ S_2,1 and co-G_n ∈ S_2,2 − (S_2,1 ∪ S_1,2), see Example 6.

Lemma 24

Let G ∈ S_k,1 for k ≥ 1, then co-G ∈ S_2,2.

Proof 25

By Lemma 12 every digraph G ∈ S_k,1 is the disjoint union of k digraphs G₁,…,G_k from S_1,1. Let q_i be the sequence for G_i. For every i it holds co-G_i ∈ S_1,1 (Proposition 8). A sequence for co-G_i can be obtained by the reverse sequence $q^{-1}_{i}$ of that for G_i (Lemma 21). Thus digraph co-G can be obtained by k digraphs from S_1,1 and all possible arcs between two of these k graphs. The arcs between these k digraphs can be created by two sequences each consisting of concatenations of $q^{-1}_{i}$ as follows.

$$ q^{\prime}=q^{-1}_{1} \circ q^{-1}_{2} \circ {\ldots} \circ q^{-1}_{k} $$

$$ q^{\prime\prime}=q^{-1}_{k} \circ q^{-1}_{k-1} \circ {\ldots} \circ q^{-1}_{1} $$

Since $\text {co-} G=\textit {g}(\{q^{\prime },q^{\prime \prime }\})$ it holds that co-G ∈ S_2,2. □

4.4 Simple Sequences

Let Q = {q₁,…,q_k} be a set of k sequences. By Observation 2 only the first and the last item of each type in every q_i ∈ Q are important for the arcs in the corresponding digraph. Next we want to analyze how we even can use subsequences defined by the first or last item of each type. Let F(q_i) be the subsequence of q_i which is obtained from q_i by removing all except the first item for each type and L(q_i) be the subsequence of q_i which is obtained from q_i by removing all except the last item for each type.^{Footnote 5}

Example 9 (F(q _i), L(q _i))

For the sequence q₁ = [b,a,c,b,a] we obtain the two subsequences F(q₁) = [b,a,c] and L(q₁) = [c,b,a].

The given example shows that sequence digraph g({q₁}) differs from the union of the two sequence digraphs g({F(q₁)}) and g({L(q₁)}) only by arc (a,b). In order to formalize this difference we define special sequences. Sequence q_i ∈ Q is simple, if there do not exist $t,t^{\prime }\in \textit {types}(q_{i})$ such that

$$ \textit{first}(q_{i},t)<\textit{first}(q_{i},t^{\prime})<\textit{last}(q_{i},t)<\textit{last}(q_{i},t^{\prime}). $$

(7)

Set Q is simple, if every sequence q_i ∈ Q is simple. We define $S^{\prime }_{k,\ell }$ to be the set of all sequence digraphs defined by simple sets Q on at most k sequences that contain at most ℓ items of each type in types(Q). By the definition we know for every two integers k ≥ 1 and ℓ ≥ 1 the following inclusions between these graph classes.

$$ \begin{array}{@{}rcl@{}} S^{\prime}_{k,\ell} & \subseteq & S_{k,\ell} \end{array} $$

(8)

$$ \begin{array}{@{}rcl@{}} S^{\prime}_{k,1} & = & S_{k,1} \end{array} $$

(9)

The sequence q₁ = [b,a,c,c,b,a] given in Example 9 is not simple and sequence digraph g({q₁}) differs from the union of the two sequence digraphs g({F(q₁)}) and g({L(q₁)}). The next theorem says that this is no coincidence.

Theorem 4

A sequence q₁ is simple if and only if g({q₁}) can be obtained by the union of g({F(q₁)}) and g({L(q₁)}).

Proof 26

(⇒) Let q₁ be a simple sequence and g({q₁}) = (V,A) its sequence digraph. Further let $\textit {g}(\{F(q_{1})\})=(V,A^{\prime })$ and $\textit {g}(\{L(q_{1})\})=(V,A^{\prime \prime })$ be the sequence digraphs of F(q₁) and L(q₁). We have to show that $A=A^{\prime }\cup A^{\prime \prime }$.

To show $A\subseteq A^{\prime }\cup A^{\prime \prime }$ we consider subsequence $q^{\prime }_{1}=M(q_{1})$ (defined in Section 3.1), which is also simple, defines the same digraph as q₁, and contains at most two items for every type. This implies that for every arc (u,v) ∈ A vertices u and v correspond to the first or the last item of its type.

If (u,v) ∈ A, $\textit {first}(q^{\prime }_{1},u)=b_{i}$, and $\textit {first}(q^{\prime }_{1},v)=b_{j}$ for some i < j, then $(u,v)\in A^{\prime }$.
If (u,v) ∈ A, $\textit {last}(q^{\prime }_{1},u)=b_{i}$, and $\textit {last}(q^{\prime }_{1},v)=b_{j}$ for some i < j, then $(u,v)\in A^{\prime \prime }$.
If (u,v) ∈ A, $\textit {last}(q^{\prime }_{1},u)=b_{i}$, and $\textit {first}(q^{\prime }_{1},v)=b_{j}$ for some i < j, then there is some $i^{\prime }\leq i$ such that $\textit {first}(q_{1},u)=b_{i^{\prime }}$, which implies that $(u,v)\in A^{\prime }$. Further there is also some $j^{\prime }\geq j$ such that $\textit {last}(q_{1},v)=b_{j^{\prime }}$, which implies that $(u,v)\in A^{\prime \prime }$.
If (u,v) ∈ A, $\textit {first}(q^{\prime }_{1},u)=b_{i}$, and $\textit {last}(q^{\prime }_{1},v)=b_{j}$ for some i < j we consider the following cases. If there is also some $i^{\prime }$, $i\leq i^{\prime } <j$, such that $\textit {last}(q^{\prime }_{1},u)=b_{i^{\prime }}$ then $(u,v)\in A^{\prime \prime }$. If there is also some $j^{\prime }$, $i < j^{\prime } \leq j$, such that $\textit {first}(q^{\prime }_{1},v)=b_{j^{\prime }}$ then $(u,v)\in A^{\prime }$. If none of the two cases is fulfilled, then $\textit {first}(q^{\prime }_{1},v)<\textit {first}(q^{\prime }_{1},u)<\textit {last}(q^{\prime }_{1},v)< \textit {last}(q^{\prime }_{1},u)$, which is not possible within a simple sequence.

Further by the definition g({F(q₁)}) and g({L(q₁)}) are subgraphs of G and thus it holds that $A^{\prime }\cup A^{\prime \prime }\subseteq A$.

(⇐) If q₁ is not simple, there exist $t,t^{\prime }\in \textit {types}(q_{1})$ such that (7) holds for i = 1. Thus arc $(t^{\prime },t)$ is in g({q₁}) but not in g({F(q₁)}) or g({L(q₁)}), which implies that g({q₁}) can not be obtained by the union of g({F(q₁)}) and g({L(q₁)}). □

By the definition for every sequence q_i the digraphs g({F(q_i)}) and g({L(q_i)}) both are in S_1,1 (see Theorem 1 for a precise characterization).

Corollary 8

Let $G\in S^{\prime }_{k,\ell }$, then G can be obtained by the union of 2k digraphs from S_1,1.

In order to generalize this result to arbitrary sequences it remains to add the missing arcs $(t^{\prime },t)$ for every $t,t^{\prime }\in \textit {types}(q_{1})$ such that (7) holds by using further sequences. This is not possible within one additionally sequence for each non-simple sequence.

5 Directed Path-width of Sequence Digraphs

5.1 Directed Path-Width

According to Barát [11], the notion of directed path-width was introduced by Reed, Seymour, and Thomas around 1995 and relates to directed tree-width introduced by Johnson, Robertson, Seymour, and Thomas in [12]. A directed path-decomposition of a digraph G = (V,A) is a sequence (X₁,…,X_r) of subsets of V, called bags, such that the following three conditions hold true.

(dpw-1)
X₁ ∪… ∪ X_r = V.
(dpw-2)
For each (u,v) ∈ A there is a pair i ≤ j such that u ∈ X_i and v ∈ X_j.
(dpw-3)
If u ∈ X_i and u ∈ X_j for some u ∈ V and two indices i,j with i ≤ j, then u ∈ X_ℓ for all indices ℓ with i ≤ ℓ ≤ j.

The width of a directed path-decomposition ${\mathcal {X}}=(X_{1}, \ldots , X_{r})$ is

$$ \underset{1 \leq i \leq r}{\max} |X_{i}|-1. $$

The directed path-width of G, d-pw(G) for short, is the smallest integer w such that there is a directed path-decomposition for G of width w.

There is a close relation between the directed path-width of some digraph and the (undirected) path-width (see [13]) of its underlying undirected graph.

Lemma 25 (Lemma 1 of 11)

Let G be some complete bioriented digraph, then d-pw(G) = pw(und(G)).

Determining whether the (undirected) path-width of some given (undirected) graph is at most some given value w is NP-complete [14] even for bipartite graphs, complements of bipartite graphs [15], chordal graphs [16], bipartite distance hereditary graphs [17], and planar graphs with maximum vertex degree 3 [18]. Lemma 25 implies that determining whether the directed path-width of some given digraph is at most some given value w is NP-complete even for digraphs whose underlying graphs lie in the mentioned classes. For example determining whether the directed path-width of some given digraph with maximum semi-degree ${\Delta }^{0}(G)=\max \limits \{{\Delta }^{-}(D),{\Delta }^{+}(D)\}\leq 3$ is at most some given value w is NP-complete, which will be useful in Proposition 11. While undirected path-width can be solved by an FPT-algorithm [19], the existence of such an algorithm for directed path-width is still open. The directed path-width of a digraph G = (V,A) can be computed in time $\mathcal {O}\left ({|A|\cdot |V|^{2\text {d-pw}(G)}}/{(\text {d-pw}(G)-1)!}\right )$ by [3] and in time $\mathcal {O}(\text {d-pw}(G)\cdot |A|\cdot |V|^{2\text {d-pw}(G)})$ by [4]. This leads to XP-algorithms for directed path-width w.r.t. the standard parameter and implies that for each constant w, it is decidable in polynomial time whether a given digraph has directed path-width at most w. Further in [5] it is shown how to decide whether the directed path-width of an ℓ-semicomplete digraph is at most w in time $(\ell +2w+1)^{2w}\cdot |V|^{{\mathcal {O}}(1)}$.

5.2 Hardness of Directed Path-width on Sequence Digraphs

Next we give some conditions on the sequences in Q such that for the corresponding digraph g(Q) computing its directed path-width is NP-hard.

Proposition 10

Given some set Q on k sequences such that n_i = 2 for 1 ≤ i ≤ k and some integer p, then the problem of deciding whether d-pw(g(Q)) ≤ p is NP-complete.

Proof 27

The stated problem is in NP since we can restrict to directed path-decompositions having a polynomial number of bags. To show the NP-hardness by a reduction from the directed path-width problem we transform instance (G,p) in linear time into instance (q(G),p) for the stated problem. The correctness follows by Observation 3. □

Proposition 11

Given some set Q such that d_Q = 3 or c_Q = 5 and some integer p, then the problem of deciding whether d-pw(g(Q)) ≤ p is NP-complete.

Proof 28

We show the NP-hardness by a reduction from the path-width problem for undirected graphs G where Δ(G) ≤ 3. Then the complete biorientation G⃡ fulfills $\max \limits ({\Delta }^{-}(G\&\#x20E1;),{\Delta }^{+}(G\&\#x20E1;))\leq 3$ and by Lemma 25 it holds d-pw(G⃡) = pw(G). The instance for the stated problem is (Q,p), where Q can be obtained from q(G⃡) by Lemma 6. The correctness follows by Observation 3. □

Similar results can be shown for related parameters such as directed cut-width [20].

5.3 Polynomial Cases of Directed Path-Width on Sequence Digraphs

Next we consider the directed path-width of sequence digraphs for k = 1 or ℓ = 1.

Proposition 12

Let G ∈ S_k,1, then d-pw(G) = 0.

Proof 29

By Theorem 2 every digraph in S_k,1 is the disjoint union of k digraphs in S_1,1. By Theorem 1 every digraph in S_1,1 is acyclic and thus has directed path-width 0. □

On the other hand, there are no constants k,ℓ such that every digraph of directed path-width 0 is in S_k,ℓ. This can be verified by an orientation $T^{\prime }$ of a tree T on k edges and Δ(T) = ℓ. Then $\text {d-pw}(T^{\prime })=0$ and by Lemma 9 it holds $T^{\prime }\in S_{k,\ell }$ but $T^{\prime }\not \in S_{k^{\prime },\ell ^{\prime }}$ for $k^{\prime }<k$ or $\ell ^{\prime }<\ell $.

For digraphs in S_1,2 the directed path-width can be arbitrary large, since this class includes all bidirectional complete digraphs. We can compute this value as follows. Let Q = {q}. For type t ∈types(q) let I_t = [first(q,t),last(q,t)] be the interval representing t, and let I_q = {I_t∣t ∈types(q)} be the set of all intervals for sequence q. Let I(q) = (V,E) be the interval graph where V = types(q) and E = {{u,v}∣u≠v, I_u ∩ I_v≠∅,I_u,I_v ∈ I_q}, see Fig. 3.

Proposition 13

Let G ∈ S_1,2 defined by some set Q = {q₁} of one sequence, then d-pw(G) = ω(I(q)) − 1 = pw(I(q)).

Proof 30

It holds d-pw(G) ≤ ω(I(q)) − 1 by an obvious directed path-decomposition along I(q). Further for every integer r the set I(r) = {I_t∣r ∈ I_t} defines a complete subgraph K_|I(r)| in I(q) and also a bidirectional complete subdigraph K_{|I(r)|⃡} in G. Thus it holds d-pw(G) ≥ ω(I(q)) − 1. The second equality holds since the (undirected) path-width of an interval graph is equal to the size of a maximum clique [21]. □

In contrast to Proposition 11 sets Q where d_Q = 1 can be handled in polynomial time.

Proposition 14

Given some set Q such that d_Q = 1 and some integer p, then the problem of deciding whether d-pw(g(Q)) ≤ p can be solved in time $\mathcal {O}(|\textit {types}(Q)|^{2}+n)$.

Proof 31

Let Q = {q₁,…,q_k}. If d_Q = 1 the vertex sets V_i = types(q_i) are disjoint. That is, g(Q) is the disjoint union of digraphs in S_1,2 for which the directed path-width can be computed in time ${\mathcal {O}}({\sum }_{i=1}^{k}|\textit {types}(\{q_{i}\})|^{2}+n_{i})\subseteq \mathcal {O}(|\textit {types}(Q)|^{2}+n)$ by Proposition 13. □

5.4 An XP-Algorithm for Directed Path-Width

We next give an XP-algorithm for directed path-width w.r.t. the parameter k, which implies that for every constant k for a given set Q on at most k sequences the value d-pw(g(Q)) can be computed in polynomial time. The main idea is to discover an optimal directed path-decomposition by scanning the k sequences left-to-right and keeping in a state the numbers of scanned items of every sequence and a certain number of active types.

Since d-pw(G) = 0 for G ∈ S_k,1 by Proposition 12, we assume that there are at least two items for each type in Q. If this is not the case, we insert another item directly after the only item of some type for each type and get $Q^{\prime }$. Obviously, this does not change the sequence digraph, so it holds $g(Q) = g(Q^{\prime })$. Such an insertion of additional items does not change the asymptotic runtime, since $|Q^{\prime }| \leq 2 \cdot |Q|$. If algorithm A has runtime $T_{A}(|I|) \in {\mathcal {O}}(|I|^{c})$ for some instance I and constant c, then after the modification of the instance A has runtime $T_{A}(|I^{\prime }|) \leq T_{A}(2 \cdot |I|) \in \mathcal {O}((2 \cdot |I|)^{c}) = {\mathcal {O}}(2^{c} \cdot |I|^{c}) = \mathcal {O}(|I|^{c})$ since 2^c is constant.

Example 10

Let Q = {q₁,q₂} be the set of sequences q₁ = [a,b,c,b,d,a] and q₂ = [e,f,e,c]. Then there is only one item for type d and one item for f contained in the sequences, so we add two further items, one for type d in q₁ and one for type f in q₂ and we get the modified set of sequences $Q^{\prime }=\{q^{\prime }_{1}, q^{\prime }_{2}\}$ and $q^{\prime }_{1} = [a,b,c,b,d,d,a]$ and $q^{\prime }_{2} = [e,f,f,e,c]$. The sequence digraph $\textit {g}(Q) = g(Q^{\prime })$ is given in Fig. 4.

Let Q = {q₁,…,q_k} be a set of k sequences. Every k-tuple (i₁,…,i_k) where 0 ≤ i_j ≤ n_j for 1 ≤ j ≤ k is a state of Q. State (0,0,…,0) is the initial state and (n₁,…,n_k) is the final state. The state digraph s(Q) for a set Q has a vertex for each possible state. There is an arc from vertex u labeled by (u₁,…,u_k) to vertex v labeled by (v₁,…,v_k) if and only if u_i = v_i − 1 for exactly one element of the vector and for all other elements of the vector u_j = v_j.

Let s = (i₁,…,i_k) be a state of Q. We define L(s) to be the set of all items on the positions 1,…,i_j for 1 ≤ j ≤ k and R(s) is the set of all items on the remaining positions i_j + 1,…,n_j for 1 ≤ j ≤ k. Furthermore, we define M(s) to be the set of items on the positions i_j + 1 for 1 ≤ j ≤ k and front(s) = {t(b)∣b ∈ M(s)} to be the set of types of the items in M(s). Obviously, for every state s = (i₁,…,i_k) it holds that L(s) ∪ R(s) leads to a disjoint partition of the items in Q and $M(s) \subseteq R(s)$.

Further each vertex v of the state digraph is labeled by the value f(v). This value is the number of types t such that there is at least one item of type t in L(v) and at least one item of type t in R(v). Formally we define $\textit {active}(v)=\{t\in \textit {types}(Q) \mid b\in L(v), t(b)=t, b^{\prime }\in R(v), t(b^{\prime })=t\}$ and f(v) = |active(v)|. Obviously, for the initial state u and the final state v it holds |active(u)| = |active(v)| = 0. Since the state digraph s(Q) is a directed acyclic graph we can compute all values |active(v)| using a topological ordering topol of the vertices. Every arc (u,v) in s(Q) represents one item b_i,j, thus

$$ \begin{array}{@{}rcl@{}} &|& \textit{active}((i_{1}, \ldots, i_{j-1}, i_{j}+1, i_{j+1}, \ldots, i_{k}))|\\ &=&|\textit{active}((i_{1}, \ldots, i_{j-1}, i_{j}, i_{j+1}, \ldots, i_{k}))| + c_{j} \end{array} $$

(10)

where

$$ c_{j} = \left\{ \begin{array}{lll} {\kern8pt}1, \text{if } \textit{first}(q_{j},t) = i_{j}+1 \text{ and } \textit{first}(q_{\ell},t) > i_{\ell} ~~\forall ~\ell \neq j \\ -1, \text{if } \textit{last}(q_{j},t) = i_{j}+1 \text{ and } \textit{last}(q_{\ell},t) \leq i_{\ell} ~~\forall ~\ell \neq j \\ {\kern8pt}0, {\text{otherwise}.} \end{array} \right. $$

Please remember our technical definition of first(q,t) and last(q,t) from Section 3.1 for the case that t∉types(q). Thus, the calculation of value |active(i₁,…,i_k)| for the vertex labeled (i₁,…,i_k) depends only on already calculated values, which is necessary in order to use dynamic programming.

Example 11 (State digraph)

Let Q = {q₁,q₂} be the set of sequences q₁ = [a,b,c,b,d,d,a] and q₂ = [e,f,f,e,c]. The state digraph s(Q) is given in Fig. 5. For every vertex v of s(Q) the vector h(v) in the vertex represents the state of v and every arc is labeled with the type of the item, that is considered in order to obtain the next state, and the number f(v) is given on the left of the vertex that represents state h(v).

Let $\mathcal {P}(Q)$ be the set of all paths from the initial state to the final state in s(Q). Every path $P\in \mathcal {P}(Q)$ has $r=1+{\sum }_{i=1}^{k}n_{i}$ vertices, i.e. P = (v₀,…,v_r). First we observe that every path in $\mathcal {P}(Q)$ leads to a directed path-decomposition for g(Q).

Lemma 26

Let Q be some set of k sequences and $(v_{0},\ldots ,v_{r})\in \mathcal {P}(Q)$. Then the sequence (active(v₁),…,active(v_r− 1)) is a directed path-decomposition for g(Q).

Proof 32

Let g(Q) = (V,A) be the sequence digraph, s(Q) = (S,B) be the state digraph, and P = (v₀,…,v_r) be an arbitrary path from the initial state to the final state in s(Q). We next will verify the three conditions of a directed path-decomposition. Since types(Q) = V we identify the vertices with their types.

For any type t ∈ types(Q), let (v_i− 1,v_i) be the first arc on the path P labeled by t. Then t ∈active(v_i) is valid, because later there is another arc (v_j− 1,v_j) with i < j, which is also marked with t, because there are always at least two items per type. It follows that V = active(v₁) ∪… ∪active(v_r− 1).
Let $(t,t^{\prime })\in A$. By the definition of g(Q) there is one sequence q_ℓ such that there is an item b_ℓ,m of type t and an item $b_{\ell ,m^{\prime }}$ of type $t^{\prime }$ such that $m<m^{\prime }$. Thus there is a vertex v_i in P such that b_ℓ,m ∈ L(v_i) − L(v_i− 1) and a vertex v_j in P such that $b_{\ell ,m^{\prime }} \in L(v_{j}) - L(v_{j-1})$, and i < j. If (v_i− 1,v_i) is the first arc on the path P labeled by t then t ∈active(v_i), else t ∈active(v_i− 1). If (v_j− 1,v_j) is the first arc on the path P labeled by $t^{\prime }$ then $t^{\prime } \in \textit {active}(v_{j})$, else $t^{\prime } \in \textit {active}(v_{j-1})$. Thus $t \in \textit {active}(v_{i^{\prime }})$ and $t^{\prime } \in \textit {active}(v_{j^{\prime }})$ for some $i^{\prime } \leq j^{\prime }$.
Let (v_i− 1,v_i) be the first arc on path P labeled by t and let (v_k,v_k+ 1) be the last arc on path P labeled by t. Then t ∈active(v_j) is valid for i ≤ j ≤ k, and t∉active(v_j) is valid for j < i or j > k. It follows that for each t ∈active(v_i) ∩active(v_k) it also holds t ∈active(v_j) for i ≤ j ≤ k.

Thus, sequence (active(v₁),…,active(v_r− 1)) is a directed path-decomposition for g(Q). □

Example 12 (Paths in s(Q))

We consider two paths in s(Q) of Example 11.

1.
The path
$$ \begin{array}{@{}rcl@{}} P_{1} & = & ((0,0),(0,1),(0,2),(1,2),(2,2),(3,2),(3,3), \\ & & (3,4),(3,5),(4,5),(5,5),(6,5),(7,5) ~) \end{array} $$

leads to the directed path-decomposition
$$ \begin{array}{@{}rcl@{}} & & (\{e\},\{e,f\},\{a,e,f\},\{a,b,e,f\},\{a,b,c,e,f\},\{a,b,c,e\}, \\ & & \{a,b,c\},\{a,b\},\{a\},\{a,d\},\{a\} ~) \end{array} $$

of width 4 for g(Q). The path is shown in red in Fig. 5.
2.
The path
$$ \begin{array}{@{}rcl@{}} P_{2} & =& ((0,0),(1,0),(1,1),(1,2),(1,3),(1,4),(2,4), \\ & & (3,4),(4,4), (5,4),(6,4),(7,4),(7,5) ~) \end{array} $$

leads to the directed path-decomposition
$$ (\{a\},\{a,e\},\{a,e,f\},\{a,e\},\{a\},\{a,b\}, \{a,b,c\},\{a,c\},\{a,c,d\},\{a,c\},\{c\}) $$
of width 2 for g(Q). The path is shown in blue in Fig. 5.

Example 12 shows that $P\in \mathcal {P}(Q)$ can lead to directed path-decompositions of non-optimal width. Lemma 26 leads to an upper bound on the directed path-width of g(Q) using the state digraph. The reverse direction is considered next.

Lemma 27

Let Q be some set of k sequences. If there is a directed path-decomposition ${\mathcal {X}}=(X_{1}, \ldots , X_{m})$ of width p − 1 for g(Q), then there is a path $(v_{0},\ldots ,v_{r})\in \mathcal {P}(Q)$ such that for every 1 ≤ i ≤ r it holds |active(v_i)|≤ p.

Proof 33

For a type t let $\alpha ({\mathcal {X}},t)$ be the smallest i such that t ∈ X_i and $\beta ({\mathcal {X}},t)$ be the largest i such that t ∈ X_i, see Example 13. Then t ∈ X_i if and only if $\alpha ({\mathcal {X}},t)\le i \le \beta ({\mathcal {X}},t)$. If (t₁,t₂) is an arc of g(Q), then $\alpha ({\mathcal {X}},t_{1}) \leq \beta ({\mathcal {X}},t_{2})$. This follows by (dpw-2) of the definition of a directed path-decomposition.

We define the path P = (u₀,…,u_r) inductive in the following way. Consider the last vertex u on the already constructed path. Let v₁,…,v_k be the neighbors of u in state digraph s(Q) and let t_j be the type of the item that is considered in order to obtain vertex v_j.

If there is a type t ∈front(u) such that t ∈active(u) then let v_j be the next vertex on path P, if t = t_j.
Otherwise, if there is no type t ∈front(u) such that t ∈active(u), then let t ∈front(u) be the type such that $\alpha (\mathcal {X},t)$ is minimal.

We next show that for every vertex u of path P there is a bag X_i in the directed path-decomposition ${\mathcal {X}}=(X_{1}, \ldots , X_{m})$, such that $\textit {active}(u) \subseteq X_{i}$. This implies that |active(u)|≤ p holds for each vertex u on path P.

First, let v be a vertex we get by case (b) from above, let u be the preceding vertex of v on path P, and let arc (u,v) be labeled with t in state digraph s(Q). Then $\alpha ({\mathcal {X}},t)$ is minimal, and let $t^{\prime } \in \textit {active}(u)$. We show that the intervals $[\alpha ({\mathcal {X}},t),\beta ({\mathcal {X}},t)]$ and $[\alpha ({\mathcal {X}},t^{\prime }),\beta ({\mathcal {X}},t^{\prime })]$ overlap, because $\alpha ({\mathcal {X}},t) \leq \beta ({\mathcal {X}},t^{\prime })$ and $\alpha ({\mathcal {X}},t^{\prime }) \leq \beta ({\mathcal {X}},t)$.

(1)
To show $\alpha ({\mathcal {X}},t) \leq \beta ({\mathcal {X}},t^{\prime })$ we observe the following:

Since $t^{\prime } \notin \textit {front}(u)$, there has to be a type $t^{\prime \prime } \in \textit {front}(u)$ such that $(t^{\prime \prime },t^{\prime })$ is an arc of g(Q). This implies that $\alpha ({\mathcal {X}},t^{\prime \prime }) \leq \beta ({\mathcal {X}},t^{\prime })$. Since t is a type of front(u) for which $\alpha ({\mathcal {X}},t)$ is minimal, we have $\alpha ({\mathcal {X}},t) \leq \alpha ({\mathcal {X}},t^{\prime \prime })$ and thus $\alpha ({\mathcal {X}},t) \leq \beta ({\mathcal {X}},t^{\prime })$.
(2)
To show $\alpha ({\mathcal {X}},t^{\prime }) \leq \beta ({\mathcal {X}},t)$ we observe the following:

Let $w^{\prime }$ be the first vertex such that $t^{\prime } \in \textit {active}(w^{\prime })$, and let w be the preceding vertex of $w^{\prime }$ on path P. We distinguish two cases:
1. (a)
  t ∈front(w). Since $t^{\prime }$ is a type of front(w) for which $\alpha ({\mathcal {X}},t^{\prime })$ is minimal, we have $\alpha ({\mathcal {X}},t^{\prime }) \leq \alpha ({\mathcal {X}},t)$ and thus $\alpha ({\mathcal {X}},t^{\prime }) \leq \beta ({\mathcal {X}},t)$.
2. (b)
  t∉front(w). Then there is a type $t^{\prime \prime } \in \textit {front}(w)$ such that $(t^{\prime \prime },t)$ is an arc of g(Q). This implies that $\alpha ({\mathcal {X}},t^{\prime \prime }) \leq \beta ({\mathcal {X}},t)$. Since $t^{\prime }$ is a type of front(w) for which $\alpha ({\mathcal {X}},t^{\prime })$ is minimal, we have $\alpha ({\mathcal {X}},t^{\prime }) \leq \alpha ({\mathcal {X}},t^{\prime \prime }) \leq \beta ({\mathcal {X}},t)$.

Finally, let v be an arbitrary vertex on the path P. By the discussion above, we can conclude that for every pair t_i,t_j ∈active(v) the intervals $[\alpha ({\mathcal {X}},t_{i}),\beta ({\mathcal {X}},t_{i})]$ and $[\alpha ({\mathcal {X}},t_{j}),\beta ({\mathcal {X}},t_{j})]$ overlap, because t_i first appears on path P or t_j first appears on path P. Since the cut of all intervals $[\alpha ({\mathcal {X}},t_{i}),\beta ({\mathcal {X}},t_{i})]$, t_i ∈active(v) mutually overlap, there is a bag X_j in the directed path-decomposition ${\mathcal {X}}$ such that $\textit {active}(v) \subseteq X_{j}$. □

Example 13

Let Q = {q₁,q₂} be the set of sequences q₁ = [a,b,c,b,d,d,a] and q₂ = [e,f,f,e,c] from Example 11 and let $\mathcal {X} = (\{a,e,f\}, \{a,b,c\},\{d\})$ be a directed path-decomposition. Then we get the following values for $\alpha ({\mathcal {X}},t)$ and $\beta ({\mathcal {X}},t)$.

The defined path in state digraph s(Q) is shown in Table 4.

Table 4 Construction of a path P = (v₀,…,v_r) for a given set Q of sequences and a directed path-decomposition of g(Q) of Example 13 to visualize the idea of the proof of Lemma 27

Full size table

By Lemmas 26 and Lemma 27 we obtain the following result.

Corollary 9

Given some set Q of k sequences, then

$$ \text{d-pw}(\textit{g}(Q)) = \underset{(v_{0},\ldots,v_{r}) \in \mathcal{P}(Q)}{\min} \underset{1 \leq i \leq r-1}{\max} |\textit{active}(v_{i})| - 1. $$

In order to apply Corollary 9 we consider some general digraph problem. Let G = (V,A,f) be a directed acyclic vertex-labeled graph. Function $f: V \to {\mathbb {Z}}$ assigns to every vertex v ∈ V a value f(v). Let s ∈ V and t ∈ V be two vertices. For some vertex v ∈ V and some path P = (v₁,…,v_ℓ) with v₁ = s, v_ℓ = v and (v_i,v_i+ 1) ∈ A we define $val_{P}(v) := \max \limits _{u \in P} (f(u))$. Let ${\mathcal {P}}_{s}(v)$ denote the set of all paths from vertex s to vertex v. We define $val(v) := \min \limits _{P \in {\mathcal {P}}_{s}(v)} (val_{P}(v))$. Then it holds:

$$ val(v) = \max\{ f(v), \underset{u \in N^{-}(v)}{\min}(val(u)) \}. $$

In Fig. 6 we give a dynamic programming solution to compute all the values of val(v), v ∈ V in time $\mathcal {O}(|V|+|A|)$. This is possible, since the graph is directed and acyclic.

Corollary 9 allows us to apply the algorithm given in Fig. 6 on s(Q) to compute the directed path-width of g(Q).

Theorem 5

Given some set Q, such that g(Q) ∈ S_k,ℓ for some ℓ ≥ 1, then the directed path-width of g(Q) and a directed path-decomposition can be computed in time $\mathcal {O}(k\cdot (1+\max \limits _{1\leq i \leq k}n_{i})^{k})$.

Proof 34

Let Q be some set, such that g(Q) ∈ S_k,ℓ. The state digraph s(Q) has at most $(1+\max \limits _{1\leq i \leq k}n_{i})^{k}$ vertices and can be found in time $\mathcal {O}(k \cdot (1+\max \limits _{1\leq i \leq k}n_{i})^{k})$ from Q. By Corollary 9 the directed path-width of g(Q) can be computed by considering all paths from the initial state to the final state in s(Q). This can be done by the algorithm given in Fig. 6 on s(Q) = (V,A) using f(v) = |active(v)|, v ∈ V, s as the initial state, and t as the final state. Since every vertex of the state digraph has at most k outgoing arcs we have $\mathcal {O}(|V|+|A|)\subseteq \mathcal {O}(k\cdot (1+\max \limits _{1\leq i \leq k}n_{i})^{k})$. Thus we can compute an optimal path in s(Q) in time $\mathcal {O}(k\cdot (1+\max \limits _{1\leq i \leq k}n_{i})^{k})$. □

These results lead to an XP-algorithm for directed path-width w.r.t. number of sequences k needed to define the input graph. This implies that for each constant k, it is decidable in polynomial time whether for a given set Q on at most k sequences the digraph g(Q) has directed path-width at most w. If we know that some graph can be defined by one sequence, we can find this in linear time (Proposition 6). This implies that for each constant k, it is decidable in polynomial time whether for a digraph G, which is given by the union of at most k many $\{\text {co-}(2\overrightarrow {P_{2}}),2K_{1\&\#x20E1;},\overrightarrow {C_{3}},D_{4}\}$-free digraphs, digraph G has directed path-width at most w.

6 Conclusions and Outlook

In this paper, we have considered digraphs which can be defined by a set of k sequences. We have shown an XP-algorithm for directed path-width w.r.t. number of sequences k needed to define the input graph. For special digraphs our solution improves known solution w.r.t. the standard parameter as shown in Table 1. This implies that for each constant k, it is decidable in polynomial time whether for a given set Q on at most k sequences the digraph g(Q) has directed path-width at most w. If we know that some digraph can be defined by one sequence, we can find this in linear time (Proposition 6). This implies that for each constant k, it is decidable in polynomial time whether for a digraph G, which is given by the union of at most k many semicomplete $\{\text {co-}(2\overrightarrow {P_{2}}),\overrightarrow {C_{3}},D_{4}\}$-free digraphs, digraph G has directed path-width at most w.

There are several interesting open questions. (a) In Section 5.4 we have shown XP-algorithms for the directed path-width problem w.r.t. parameter k (number of sequences). It remains open whether there are also FPT-algorithms for this parameter. The same question is open for the standard parameter. (b) Does the hardness of Proposition 11 also hold for c_Q ∈{2,3,4} and for d_Q = 2? (c) By Theorem 1 one can decide in polynomial time whether a given digraph belongs to the class S_1,1. By Lemma 12 this is also possible for every class S_k,1. Furthermore Theorem 3 allows to recognize the classes S_1,2 in polynomial time. It remains to consider this problem for the classes S_k,ℓ for k ≥ 2 and 1 < ℓ ≤ 2k. (d) In order to apply Corollary 9 for some given digraph G we need a set Q such that g(Q) = G. Can we find a set Q with a smallest number of sequences in polynomial time? (e) Can we find a simple set Q with a smallest number of sequences in polynomial time? (f) By its equivalence to directed vertex separation number [22] directed path-width is a vertex ordering problem (layout problem). This motivates to solve similar problems such as directed cut-width or directed feedback arc set (see [23] for a survey) on sequence digraphs. Is there even a general approach to solve vertex ordering problem on sequence digraphs (see [23] for a general approach on undirected graphs)?

Notes

A related but different concept for undirected graphs is the notation of word-representable graphs. A graph G = (V,E) is word-representable if there exists a word w over the alphabet V such that letters x and y alternate in w if and only if {x,y}∈ E., see [2] for a survey.
When considering the directed path-width of almost semicomplete digraphs in [5] the class of semicomplete digraphs was suggested to be “a promising stage for pursuing digraph analogues of the splendid outcomes, direct and indirect, from the Graph Minors project”.
Thus we do not consider directed graphs with loops.
A list of all digraphs on at most four vertices can be found in Appendix 2 of [10].
If for some type there is only one item in q_i, then this item remains in F(q_i) and in L(q_i).

References

Gurski, F., Rehs, C., Rethmann, J.: Directed pathwidth of sequence digraphs. In: Proceedings of the international conference on combinatorial optimization and applications (COCOA), vol. 11346 of LNCS, Springer, 2018, pp. 79–93 (2018)
Kitaev, S., Pyatkin, A.: Word-representable graphs: a survey. J. Appl. Industr. Math. 12(2), 278–296 (2018)
Article MATH Google Scholar
Kitsunai, K., Kobayashi, Y., Komuro, K., Tamaki, H., Tano, T.: Computing directed pathwidth in O(1.89ⁿ) time. Algorithmica 75(2016), 138–157 (2016)
Article MathSciNet MATH Google Scholar
Nagamochi, H.: Linear layouts in submodular systems. In: Proceedings of the international symposium on algorithms and computation, vol. 7676 of LNCS, Springer, 2012, pp. 475–484 (2012)
Kitsunai, K., Kobayashi, Y., Tamaki, H.: On the pathwidth of almost semicomplete digraphs. In: Proceedings of the annual european symposium on algorithms (ESA), vol. 9294 of LNCS, Springer, 2015, pp. 816–827 (2015)
Kobayashi, Y.: Computing the pathwidth of directed graphs with small vertex cover. Inf. Process. Lett. 115(2), 310–312 (2015)
Article MathSciNet MATH Google Scholar
Bang-Jensen, J., Gutin, G.: Digraphs. Theory, Algorithms and Applications. Springer, Berlin (2009)
MATH Google Scholar
Gurski, F., Rethmann, J., Wanke, E.: On the complexity of the FIFO stack-up problem. Math. Methods Oper. Res. 83(1), 33–52 (2016)
Article MathSciNet MATH Google Scholar
Gould, R.: Graph Theory. Dover Publications Inc, New York (2012)
MATH Google Scholar
Harary, F.: Graph Theory. Addison-Wesley Publishing Company, Massachusetts (1969)
Book MATH Google Scholar
Barát, J.: Directed pathwidth and monotonicity in digraph searching. Graphs Combinatorics 22(2006), 161–172 (2006)
Article MathSciNet MATH Google Scholar
Johnson, T., Robertson, N., Seymour, P., Thomas, R.: Directed tree-width. J. Combinatorial Theory Series B 82(2001), 138–155 (2001)
Article MathSciNet MATH Google Scholar
Robertson, N., Seymour, P., Graph Minors, I.: Excluding a forest. J. Combinatorial Theory Series B 35(1983), 39–61 (1983)
Article MathSciNet MATH Google Scholar
Kashiwabara, T., Fujisawa, T.: NP-completeness of the problem of finding a minimum-clique-number interval graph containing a given graph as a subgraph. In: Proceedings of the international symposium on circuits and systems, 1979, pp. 657–660 (1979)
Arnborg, S., Corneil, D., Proskurowski, A.: Complexity of finding embeddings in a k-tree. SIAM J. Algeb. Discrete Methods 8(2), 277–284 (1987)
Article MathSciNet MATH Google Scholar
Gusted, J.: On the pathwidth of chordal graphs. Discrete Appl. Math. 45(3), 233–248 (1993)
Article MathSciNet Google Scholar
Kloks, T., Bodlaender, H., Müller, H., Kratsch, D.: Computing treewidth and minimum fill-in: all you need are the minimal separators. In: Proceedings of the annual european symposium on algorithms (ESA), vol. 726 of LNCS, Springer, 1993, pp. 260–271 (1993)
Monien, B., Sudborough, I.: Min cut is NP-complete for edge weighted trees. Theor. Comput. Sci. 58(1988), 209–229 (1988)
Article MathSciNet MATH Google Scholar
Bodlaender, H.: A linear-time algorithm for finding tree-decompositions of small treewidth. SIAM J. Comput. 25(6), 1305–1317 (1996)
Article MathSciNet MATH Google Scholar
Chudnovsky, M., Fradkin, A., Seymour, P.: Tournament immersion and cutwidth. J. Combinatorial Theory Series B 102(1), 93–101 (2012)
Article MathSciNet MATH Google Scholar
Bodlaender, H.: A partial k-arboretum of graphs with bounded treewidth. Theor. Comput. Sci. 209(1998), 1–45 (1998)
Article MathSciNet MATH Google Scholar
Yang, B., Cao, Y.: Digraph searching, directed vertex separation and directed pathwidth. Discrete Appl. Math. 156(10), 1822–1837 (2008)
Article MathSciNet MATH Google Scholar
Bodlaender, H., Fomin, F., Koster, A., Kratsch, D., Thilikos, D.: A note on exact algorithms for vertex ordering problems on graphs. Theory Comput. Syst. 50(3), 420–432 (2012)
Article MathSciNet MATH Google Scholar

Download references

Acknowledgements

This work was funded in part by the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation) – 388221852

Funding

Open Access funding enabled and organized by Projekt DEAL.

Author information

Authors and Affiliations

University of Düsseldorf, Institute of Computer Science, Algorithmics for Hard Problems Group, 40225, Düsseldorf, Germany
Frank Gurski & Carolin Rehs
Faculty of Electrical Engineering and Computer Science, Niederrhein University of Applied Sciences, 47805, Krefeld, Germany
Jochen Rethmann

Authors

Frank Gurski
View author publications
You can also search for this author in PubMed Google Scholar
Carolin Rehs
View author publications
You can also search for this author in PubMed Google Scholar
Jochen Rethmann
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Frank Gurski.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

A short version of this paper appeared in the Proceedings of the International Conference on Combinatorial Optimization and Applications (COCOA 2018) [1].

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Gurski, F., Rehs, C. & Rethmann, J. Characterizations and Directed Path-Width of Sequence Digraphs. Theory Comput Syst 67, 310–347 (2023). https://doi.org/10.1007/s00224-022-10104-w

Download citation

Accepted: 03 October 2022
Published: 23 November 2022
Issue Date: April 2023
DOI: https://doi.org/10.1007/s00224-022-10104-w

Keywords

Mathematics Subject Classification (2010)

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Characterizations and Directed Path-Width of Sequence Digraphs

Abstract

Similar content being viewed by others

Directed Path-Width of Sequence Digraphs

Quasi-Transitive Digraphs and Their Extensions

On the Pathwidth of Almost Semicomplete Digraphs

1 Introduction

2 Preliminaries

Undirected Graphs

Special Undirected Graphs

Directed Graphs

Omitting the Directions

Orientations

Special Directed Graphs

3 Sequence Digraphs and Sequence Systems

3.1 From Sequences to Digraphs

Definition 1 (Sequence Digraph)

Example 1 (Sequence Digraph)

Observation 1

Lemma 1

Proof 1

Lemma 2

Proof 2

Observation 2

Proposition 1

Proof 3

3.2 From Digraphs to Sequences

Definition 2 (Sequence System)

Example 2 (Sequence System)

Observation 3

Lemma 3

Lemma 4

Proof 4

Lemma 5

Lemma 6

Proof 5

4 Properties of Sequence Digraphs

4.1 Graph Classes and their Relations

Corollary 1

Lemma 7

Proof 6

Corollary 2

Lemma 8

Corollary 3

Lemma 9

Proof 7

Corollary 4

Proposition 2

Proof 8

Example 3

Lemma 10

Proposition 3

4.2 Characterizations of Sequence Digraphs for k = 1 or ℓ = 1

4.2.1 Sequence Digraphs for k = 1 and ℓ = 1

Lemma 11

Proof 9

Theorem 1

Proof 10

Proposition 4

4.2.2 Sequence Digraphs for ℓ = 1

Lemma 12

Corollary 5

Theorem 2

Proposition 5

Proof 11

4.2.3 Sequence Digraphs for k = 1

Example 4

Lemma 13

Proof 12

Lemma 14

Proof 13

Lemma 15

Proof 14

Lemma 16

Proof 15

Lemma 17

Proof 16

Theorem 3

Proof 17

Claim 1

Example 9 (F(q _i), L(q _i))