1 Introduction

The Minimum Hitting Set of Bundles Problem (Mhsb) was introduced by Angel et al. [1]. It is defined in the following way: Let \(\varOmega \) be a finite set of elements and let \(\mathcal F\) be a family of sets. Every \(F\in \mathcal F\) is a set of bundles, where a bundle \(U\) is a subset \(U\subseteq \varOmega \). A bundle \(U\) is covered by a set of elements \(S\subseteq \varOmega \) if \(U\subseteq S\). We say that a set \(F\) is hit if at least one bundle in \(F\) is covered. We want to find a collection of elements \(S\subseteq \varOmega \) of minimum size such that every set \(F\in \mathcal F\) is hit. We refer to S as a hitting set of bundles. In the following, let \(\mathcal {U}:=\bigcup _{F\in \mathcal F} F\) be the set of all bundles of an instance.

The minimum hitting set problem is the special case of Mhsb, where every bundle contains exactly one element [1]. It corresponds to the optimization version of one of Karp’s 21 NP-complete problems, the minimum set cover problem [14]. This immediately implies NP-hardness of Mhsb.

Mhsb has many applications. In particular, it provides an abstract framework for several scheduling problems. In this paper, we want to highlight two particular applications. The first one is scheduling jobs (non-preemptively) on a single machine with the objective of minimizing active time. It can be framed in the following way. Here, \(\varOmega \) is a set of time slots and \(\mathcal F\) represents a set of jobs. The bundles of a job refer to its feasible operation times. The goal is to schedule all jobs such that the number of occupied time slots is minimized. If all bundles of a job are time intervals of the same length and refer to possible execution times of the job, the problem is referred to as active (or busy) time minimization problem with unlimited capacities [5, 15].

The second application of Mhsb we want to point out is in the area of railway maintenance scheduling. Consider a railway corridor with bidirectional traffic and maintenance jobs that need to be carried out. Here, a train path is a movement over time along the railway track. Whenever such a train path interferes with a particular maintenance job, the train needs to be canceled. To minimize the impact of the mandatory maintenance on rail traffic, the goal is to schedule all maintenance jobs such that the number of canceled trains is as small as possible. Here, \(\varOmega \) represents the set of train paths and \(\mathcal F\) is the family of maintenance jobs. The bundles of a maintenance job refer to the corresponding sets of train paths that interfere with the feasible execution times of a job. Eskandarzadeh et al. [9] study a variant of this problem in which bundles are determined by job-specific release dates, deadlines, and processing times. For unidirectional train traffic, this agrees with active time minimization with unlimited capacities. In this paper, we also consider the railway maintenance scheduling problem with bidirectional traffic.

Mhsb allows more general bundle structures than those studied in the context of active time minimization or railway maintenance scheduling. Still, we often use the scheduling terminology to give an intuition and a better understanding of the particular cases of Mhsb that we consider in this paper.

1.1 Related Work

As mentioned before, if every bundle contains only one element, Mhsb corresponds to the minimum hitting set problem and its counterpart, the minimum set cover problem. Both problems have been studied extensively. There exists a polynomial-time \(\alpha _{\max }\)-approximation algorithm, where \(\alpha _{\max }\) refers to the maximum number of elements (in Mhsb this is the maximum number of bundles) a set may contain and a polynomial-time \((\ln (m)+1)\)-approximation algorithm, where m denotes the number of elements in the minimum set cover instance. For more details on the algorithms, see, e.g., [19]. Dinur and Steurer [8] proved that no polynomial-time \((1-o(1))\ln (m)\)-approximation algorithm exists for minimum set cover, unless \(\text {P}=\text {NP}\). Assuming the Unique Games Conjecture, the approximation factor \(\alpha _{\max }\) is best possible, due to a result by Bansal and Khot [2].

The minimum hitting set of bundles problem was introduced by Angel et al. [1]. They presented a polynomial-time \(F_{\max }\)-factor approximation algorithm, where \(F_{\max }\) refers to the maximum number of bundles a set may contain. This approximation guarantee is achieved by considering a relaxation of an integer linear programming (ILP) formulation and a simple rounding strategy. By using randomized rounding they were able to improve the approximation guarantee to \(F_{\max }(1-(1-\tfrac{1}{F_{\max }})^{M})\). Here, M denotes the maximum number of bundles an element is contained in. Note that if the same bundle is contained in different sets, it is accounted for several times. Angel et al. [1] highlighted two applications of Mhsb, the multiple-query optimization problem in database systems [18] and the min k-SAT problem [3].

Wan et al. [20] studied the minimum submodular cover problem with submodular weights. In this problem, we are given a submodular, increasing function f. A set \(S \subseteq \varOmega \) is a submodular cover, if \(f(S)=f(\varOmega )\). The objective is to find a submodular cover of minimum weight with respect to a submodular, increasing weight function w. Mhsb can be formulated as submodular cover problem with submodular weights in the following way. Choose the ground set to be the set of bundles \({\mathcal {U}}\). Define f to be the function that maps a collection of bundles on the number of sets that are hit by at least one bundle of the collection. The weight of a collection of bundles is simply the cardinality of their union. The main result in [20] implies a \(H(\gamma )\)-approximation for Mhsb, where H(i) denotes the i-th Harmonic number and \(\gamma \) is the maximum number of sets a bundle U in \({\mathcal {U}}\) hits. Iwata and Nagano [13] studied the set cover problem with submodular weights. They derived a polynomial-time \(\alpha _{\max }\)-approximation algorithm, where \(\alpha _{\max }\) is the maximum number of sets an element appears in.

Being a generalization of the minimum hitting set problem, Mhsb is W[2]-hard parameterized by the solution size \(\vert S \vert \). Damaschke [7] proved that Mhsb, parameterized by \(\vert \mathcal F\vert \) and the solution size \(\vert S \vert \), is W[1]-complete.

The active time minimization problem with capacity B was introduced by Chang et al. [4]. It is a scheduling problem with job-specific release dates, deadlines, and processing times and a bound B on the number of jobs that may be executed simultaneously. Chang et al. gave a polynomial-time algorithm for \(B=2\) and proved that the problem is NP-hard for \(B=3\). More closely related to our problem is the version with unlimited capacity, i.e. \(B=\infty \), studied, e.g., by Fong et al. [11]. They presented a polynomial-time algorithm for agreeable deadlines, i.e., instances where any job’s deadline is prior to every other job’s deadline that has a later release time. Also, Fang et al. [10] studied the problem in context of wireless sensoring and presented a polynomial-time 2-approximation algorithm for this special case. Online variants of the active time minimization problem also exist and have been studied, e.g., by Koehler and Khuller in [16].

Eskandarzadeh et al. [9] studied the maintenance scheduling in a railway corridor problem, which is an application of Mhsb. They presented a polynomial-time algorithm for active time minimization with unlimited capacities if all jobs have the same processing time. More specifically, they presented an ILP formulation, tailored to their restricted set of instances, for which they proved total unimodularity of the constraint matrix. In their computational experiments, they compared different ILP formulations for the bidirectional version.

Finally, Chekuri and Kumar [6] introduced a maximization variant of Mhsb, the maximum coverage problem with group budgets. In this setting, we are given costs on bundles and a budget per set. The goal is to choose bundles, respecting the budget constraints, such that the size of the union of chosen bundles is maximized. Chekuri and Kumar presented a polynomial-time constant-factor approximation algorithm.

1.2 Our Results

In this paper we study the minimum hitting set of bundles problem on instances with interval bundles and 2-dimensional interval bundles.

Minimum hitting set of interval bundles: The connection to the active time minimization problem leads to a number of applications that can be modeled by Mhsb. In Sect. 2, we take advantage of the structure that many of these applications have in common, in order to obtain polynomial-time algorithms. Assuming that jobs have to be executed without preemption, we obtain bundles of consecutive elements. We refer to these bundles as interval bundles and call this special case of Mhsb the Minimum Hitting Set of Interval Bundles Problem (Mhsib). Motivated by applications, we define the following properties of special cases of Mhsib. We say that \(\mathcal F\) is convex if for every job \(F\in \mathcal F\), the union of all possible operating times forms an interval. We say that \(\mathcal F\) is a-simple \(a \in {\mathbb {N}}\) if all bundles have size a. In the scheduling terminology this corresponds to equal processing times. In applications it also seems reasonable to assume that, for example, the number of starting times is bounded or the time horizon of feasible operating times for a job is bounded.

We present polynomial-time algorithms for several classes of interval bundle instances. These algorithms use a graph construction and solve the problem by computing a shortest path. However, the minimum hitting set of interval bundles problem is NP-hard in general. We explore the boundary of polynomial-time solvable instances and NP-hardness that arise from the aforementioned properties and parameters.

Minimum hitting set of 2-dimensional interval bundles: In Sect. 3, motivated by the application of Mhsb to railway maintenance scheduling, we study another special class of instances of Mhsb. In the maintenance scheduling in a railway corridor problem presented by Eskandarzadeh et al. [9], \(\varOmega \) is the disjoint union of sets of train paths in opposite directions on a single railway track. More generally, we can think of \(\varOmega \) as being the disjoint union of two totally ordered sets. Assuming that every job has to be executed without preemption implies that every bundle is a set of consecutive elements in each of the two totally ordered sets. We call this special structure 2-dimensional interval and denote the special case of Mhsb where all bundles are 2-dimensional interval by Minimum Hitting Set of 2-dimensional Interval Bundles Problem (2-dim Mhsib). We show that 2-dim Mhsib remains NP-hard on convex and 1-simple instances with \(F_{\max }=2\). Furthermore, we present a polynomial-time approximation algorithm for all 1-simple instances with \(F_{\max }=2\) using a result by Hochbaum [12] with an approximation guarantee slightly better than 2. For another restricted class of instances we give a polynomial-time algorithm by making use of a problem decomposition.

Remark 1

The introduction of a weight function \(w: \varOmega \rightarrow {\mathbb {R_+}}\) on the elements may be of interest. For the ease of presentation, we only consider the unweighted Mhsb. Techniques used throughout the paper also work for weighted instances, thus all of our results can be easily transferred to weighted Mhsb. These results can either be transferred directly with minor modifications of inequalities, or by simple modifications of the constructed graph’s edge weights.

Remark 2

Due to the structure of the problem, we assume, w.l.o.g., that

  1. (i)

    all bundles are non-trivial, that is, for all \(F\in \mathcal F\) we have that \(\emptyset \notin F\);

  2. (ii)

    bundles of a set are not subsets of each other, that is, for \(U\in F\), if \(U'\subset U\) we have that \(U'\notin F\); and

  3. (iii)

    for all distinct \(F,F'\in \mathcal F\) there exists a bundle \(U\in F\) such that for all \(U'\in F'\) we have \(U'\not \subseteq U\). If there was no such bundle \(U\in F\), any set of elements hitting \(F\) also hits \(F'\).

2 Minimum Hitting Set of Interval Bundles

In this section, we focus on bundle structures that arise in many applications. As mentioned in the introduction, \(\varOmega \) often corresponds to time slots if viewed as a scheduling problem. In this context, one may assume that jobs have to be executed without preemption. In the following, we make use of this particular structure and identify polynomial-time solvable special cases of Mhsb.

Throughout this section we look at instances in which we are given an ordering \(\prec \) of \(\varOmega \). To simplify notation, we may assume that \(\varOmega =[n]\) for some \(n \in {\mathbb {N}}\) with the natural ordering. In this context, we consider a special case of Mhsb, the Minimum Hitting Set of Interval Bundles Problem (Mhsib), where every bundle \(U\) corresponds to an interval. More specifically, if \(i,j\in U\), then \(k \in U\) for all \(i\le k\le j\). In addition to the interval property, we define two other properties that a family \(\mathcal F\) may have.

Definition 1

Let \(([n], \mathcal F)\) be an instance of Mhsib. We call the family \(\mathcal F\)

  1. (i)

    convex, if all \(F\in \mathcal F\) are convex, i.e. the following holds. Let \(i \in U', j \in U''\), for some \(U',U''\in F\). For all \(k \in [n]\) with \(i\le k\le j\) there exists a bundle \(U\) in \(F\) such that \(k \in U\). In other words, the union of all interval bundles \(U\) in \(F\) is again an interval.

  2. (ii)

    a-simple for some \(a \in {\mathbb {N}}\) if all bundles are a-simple, i.e. \(\vert U\vert =a\) for all \(U\in \mathcal {U}\), where \(\mathcal {U}:=\bigcup _{F\in \mathcal F} F\).

Fig. 1
figure 1

Example of a family \(\mathcal F\) to visualize the different sets and bundles properties: All bundles but \(U_{9}\) have the property that they are interval. The sets \(F_{1}, F_{2}\) and \(F_{3}\) are convex, and \(F_{1}\) and \(F_{2}\) are in addition to that a-simple with \(a=4\). The bundle structure of the set \(F_1\) is determined by the release date 6, deadline 11 and processing time 4. This overlapping bundle structure occurs for example in active time minimization

Figure 1 is an example of an instance of the general Mhsb problem and visualizes the properties we defined above. The next theorem shows that Mhsib remains NP-hard even on restricted instances in terms of Definition 1.

Theorem 1

Let \(([n], \mathcal F)\) be an instance of Mhsib and let \(F_{\max }\) be the maximum number of bundles a set may contain. The problem remains NP-hard, if \(\mathcal F\)

  1. (i)

    is 1-simple and \(F_{\max } =2\); or

  2. (ii)

    is convex and \(F_{\max } =3\); or

  3. (iii)

    is convex and a-simple for some a, where a is some function in n.

Proof

  1. (i)

    An easy reduction from vertex cover implies the statement. Given a graph \(G=(V,E)\), we let \(\varOmega :=V\) and \(\mathcal F:=\{\{\{u\},\{v\}\}:\{u,v\}\in E\}\). Observe that a minimum vertex cover is an optimal solution to Mhsib and vice versa.

  2. (ii)

    The statement follows again from a reduction of vertex cover to Mhsib. Let \(G=(V,E)\) be a graph with an arbitrary ordering \(\prec _V\) of the vertices. The main idea is to use the same construction as in i), i.e. introducing a set for every edge \(\{u,v\}\) of the graph. The challenge is to guarantee that our constructed family is convex, while respecting the structure assumed in Remark 2. We will do so by constructing sets that additionally contain a large dummy bundle. Because of its size, the dummy bundle is never entirely covered by an optimal solution. A large number of dummy elements serves to ensure a one-to-one correspondence between a minimum hitting set of bundles and a minimum vertex cover.

We begin by constructing the ground set of the corresponding instance of Mhsib. We introduce \(d\) (any \(d\ge \vert V\vert +2\) will work) elements for each vertex of the graph, that is, \(\varOmega _v:=\{\omega ^{v}_i: i\in [d]\}\) for all \(v\in V\). A vertex \(v\in V\) is represented by the element \(\omega ^v_1\) and we refer to the \(d-1\) elements \(\omega _2^v\dots \omega _d^v\) as dummy elements. The ground set is now given by \(\varOmega :=\bigcup _{v\in V} \varOmega _v\).

We use the ordering \(\prec _V\) of V to obtain an ordering \(\prec _{\varOmega }\) of \(\varOmega \) in a natural way by

$$\begin{aligned} \omega _i^u\prec _\varOmega \omega _j^v \Leftrightarrow {\left\{ \begin{array}{ll} u\prec _V v; \quad \text {or}\\ u=v \quad \wedge \quad i<j. \end{array}\right. } \end{aligned}$$

Next, we define bundles and sets of the instance. As in the construction of i) for every edge \(\{u,v\}\in E\) with \(u\prec _V v\) we are given two bundles \(\{\omega _1^{u}\}\) and \(\{\omega ^v_1\}\). In addition, we define a dummy bundle \(U_{\{u,v\}}\) containing all elements in \(\varOmega \) that are between \(\omega _1^u\) and \(\omega _1^v\). Formally, we define

$$\begin{aligned} U_{\{u,v\}}:=\varOmega _u \setminus \{\omega _1^u\} \cup \bigcup _{u\prec _V w \prec _V v}\varOmega _w. \end{aligned}$$

Note, that all constructed bundles are indeed interval with respect to the ordering of \(\varOmega \) induced by \(\prec _\varOmega \). The sets are then formally given by

$$\begin{aligned} \mathcal F= \left\{ \left\{ \{\omega ^u_1 \},\{\omega ^v_1 \}, U_{\{u,v\}} \right\} ~:~\{u,v\}\in E\right\} . \end{aligned}$$

The dummy bundle \(U_{\{u,v\}}\) contains a large number of elements, i.e. at least \(d\) and, hence, is never entirely hit in an optimal solution. Its only purpose is to ensure convexity of the respective family.

Any vertex cover C corresponds to a solution of the Mhsib instance of the same size. This simply follows from the fact that the vertices in C cover every edge and therefore all sets \(F\in \mathcal F\) contain at least one covered bundle. Additionally, a solution S of Mhsib of minimum size only contains elements that correspond to vertices of the graph, i.e. \(S\subseteq \{\omega _1^v:v\in V \}\). This follows from the fact that if S covers any bundle \(U_{\{u,v\}}\), this immediately implies \(\vert S\vert \ge d-1=\vert V\vert +1\). Also, any element in \(\varOmega \setminus \{\omega _1^v:v\in V \}\) can be omitted unless it was needed to cover some \(U_{\{u,v\}}\).

Finally, we have that S hits all sets \(F\in \mathcal F\) and covers the bundle \(\{\omega ^u_1 \}\) or \(\{\omega ^v_1 \}\) for all \(\{u,v\} \in E\), with either \(\omega ^u_1\) or \(\omega ^v_1\). Therefore, the set of vertices represented by elements in S form a vertex cover in the graph.

  1. (iii)

    Again, we prove the statement by a reduction of vertex cover to Mhsib. Let \(G=(V,E)\) and let \(\prec _V\) an arbitrary ordering of the vertices. This time we have to ensure that the constructed family is convex and a-simple. For every edge \(\{u,v\}\) we construct a corresponding set \(F_{\{u,v\}}\) in the family. The set contains multiple a-simple bundles, amongst them the bundles \(U_u\) and \(U_v\) which correspond to the vertices u and v. Instead of a single large dummy bundle as in ii), the sets contain a collection of dummy bundles, each of size a. To ensure that the optimal solution to Mhsib does indeed hit every set by covering a bundle corresponding to a vertex in V, we introduce additional dummy sets \(F_v\) for every \(v \in V\). The set \(F_v\) contains a single bundle, that overlaps with the bundle \(U_v\) in all but one element. Since there is only one bundle in \(F_v\), it has to be in every feasible solution of the instance of Mhsib. This ensures that hitting a set \(F_{\{u,v\}}\) by a bundle \(U_v\) or \(U_u\) only requires adding one additional element to the solution, whereas hitting \(F_{\{u,v\}}\) by any dummy bundle immediately increases the solution size significantly (by at least \(\tfrac{a-1}{2}\)).

Again, we start by constructing the ground set of the corresponding instance of Mhsib. Formally, let \(a:=4\vert V \vert +1\) and let \(v_0\) be a dummy vertex with \(v_0 \prec _V v\) for all \(v \in V\). For every vertex in V and the dummy vertex \(v_{0}\) we introduce 3a elements, that is, \(\varOmega _v:=\{\omega ^{v}_i: i\in [3a]\}\) for all \(v\in V\cup \{v_{0}\}\). Intuitively, the element \(\omega ^v_1\) corresponds to the vertex \(v \in V\) and all other elements in \(\varOmega _v\) are dummy elements. The ground set of elements is then given by \(\varOmega :=\bigcup _{v\in V\cup \{v_0\}} \varOmega _v\). As in ii) we extend the ordering \(\prec _V\) of the vertices in a natural way to an ordering \(\prec _{\varOmega }\) of the elements in \(\varOmega \).

We continue by constructing the bundles and the sets of the family \(\mathcal F\) of the corresponding instance of Mhsib. For a vertex \(v \in V\) let \(U_{v}\) be the bundle that contains the element \(\omega _{1}^{v}\) and the \(a-1\) previous elements with respect to \(\prec _{\varOmega }\). Formally,

$$\begin{aligned} U_{v}:=\{\omega ^{v'}_{2a+2},\omega ^{v'}_{2a+3},\dots ,\omega ^{v'}_{3a-1},\omega ^{v'}_{3a},\omega ^{v}_{1} \} \end{aligned}$$

where \(v'\) is the vertex preceding v in the natural ordering. Note that all bundles \(U_{v}\) are indeed a-simple.

Next, we define a collection B of dummy bundles, which is a partition of \(\varOmega \setminus \{\omega ^{v_{0}}_{1},\omega ^{v_{0}}_{2},\dots ,\omega ^{v_{0}}_{\frac{5a+1}{2}}\}\) in a-simple, disjoint bundles. B will be later used to ensure convexity of the family \(\mathcal F\). More explicitly B is given by

$$\begin{aligned} B:=\bigcup _{v \in V} \Big \{ \{\underbrace{\omega ^{v'}_{\frac{5a+3}{2}},\dots ,\omega ^{v'}_{3a},\omega ^{v}_{1},\dots ,\omega ^{v}_{\frac{a+1}{2}}}_{a}\},\{\underbrace{\omega ^{v}_{\frac{a+3}{2}},\dots ,\omega ^{v}_{\frac{3a+1}{2}}}_{a}\},\{\underbrace{\omega ^{v}_{\frac{3a+3}{2}},\dots ,\omega ^{v}_{\frac{5a+1}{2}}}_{a}\}\Big \}, \end{aligned}$$

where, again, \(v'\) is the vertex preceding v in the natural ordering. Note that the bundles in B are well-defined, since we chose a to be odd. We continue by constructing the sets of the family \(\mathcal F\). For every edge \(\{u,v\} \in E\) the family \(\mathcal F\) contains a set

$$\begin{aligned} F_{\{u,v\}}= \{ U_{u},U_{v}\}\cup B. \end{aligned}$$

In addition to that, for every vertex \(v\in V\) we add a dummy set containing a single bundle with the a elements preceding \(\omega ^v_1\) to \(\mathcal F\). More explicitly,

$$\begin{aligned} F_{v}:=\{\{\omega ^{v'}_{2a+1},\omega ^{v'}_{2a-2},\dots ,\omega ^{v'}_{3a-1},\omega ^{v'}_{3a}\}\} \end{aligned}$$

where, again, \(v'\) is the vertex preceding v in the natural ordering. This concludes the construction of the family. A visualization can be seen in Fig. 2.

Fig. 2
figure 2

Illustration of a constructed Mhsib instance in the proof of proof of Theorem 1 (iii)

Let C be a minimum vertex cover in G. We obtain the corresponding hitting set of bundles in the following way. As already mentioned, every bundle of a dummy set \(F_v\) has to be part of a feasible solution. Next, we ensure that every \(F_{\{u,v\}}\) is hit. Adding the element \(\omega _1^v\) for every \(v \in C\) to the hitting set of bundles, we cover the bundle \(U_{v}\) and, hence, hit the corresponding set \(F_{\{u,v\}}\). All in all, we obtain a hitting set of bundles of size \(a\cdot \vert V\vert +\vert C\vert \). Note, every set \(F_{\{u,v\}}\) is hit by one of the bundles \(U_{u}\) or \(U_{v}\) as either u or v must be contained in C. We claim that there is no solution to Mhsib of smaller cardinality. Observe, that an optimal solution to Mhsib has to hit every set \(F_v\), each containing only one bundle of size a. Since all these sets are pairwise disjoint, a solution must be of size at least \(a\cdot \vert V \vert \). Any bundle from the collection B contains at least \(a/2>\vert V \vert \) elements not contained in the bundles of the dummy sets \(F_v\) that are covered by the solution. Again, if such a bundle from B is covered the solution is of size at least \(a\cdot \vert V \vert +\vert V \vert +1\), which cannot be optimal by the previous argument, that there always exists a solution of size at most \(a\cdot \vert V\vert +\vert V\vert \). \(\square \)

Theorem 1 states that Mhsib remains NP-hard for convex families \(\mathcal F\) with \(F_{\max } \le 3\) for all \(F\in \mathcal F\). However, the following theorem shows that the computational complexity changes for convex instances if we reduce \(F_{\max }\) by one.

Theorem 2

Let \(([n], \mathcal F)\) be an instance of Mhsib. If \(\mathcal F\) is convex and \(F_{\max }=2\), then it is solvable in polynomial time.

Proof

We describe a reduction to a shortest path computation. The high level idea is to construct a layered graph, such that the vertices contained in an s-t-path describe an interval decomposition of the solution.

Before explaining the construction in more details, let us take a closer look at the bundle structure of our instance to give some intuition. Since every set contains at most two bundles, w.l.o.g., we may assume that for all \(F\in \mathcal F\), if \(U,U'\in F\) then \(U\cap U'=\emptyset \), as elements in the intersection have to be contained in any feasible solution. Given some \(F\in \mathcal F\), since \(\mathcal F\) is convex, there exist li and \(u \in [n]\) such that \(F= \{[l,i], [i+1,u] \}\). To make sure that the bundle is hit, we have to guarantee that either the set of elements \(\{l, \dots i\}\) or \(\{i+1,\dots ,u\}\) is contained in the minimum hitting set of bundles.

Let \(([n], \mathcal F)\) be an instance of Mhsib. We define a graph, with layers \(V_1,\dots V_n\) corresponding to the elements \(1,\dots n\). A set \(V_i\) contains a vertex for every interval in [1, n] containing i and, additionally, a vertex \(v^i_{\varnothing }\) representing the empty set. Formally, for every \(i \in [n]\) we define \(V_i:=\{ v_{[a,b]}^i \mid 1 \le a\le i \le b \le n\} \cup \{v^i_{\varnothing }\}.\) Note that \(a,b\le n\) and therefore the number of vertices introduced for each \(i\in [n]\) is polynomial in the size of the input. Let \(V_0:= \{s\}\) and \(V_{n+1}:=\{t\}\). Then the vertex set of the layered graph is given by \(V:= \bigcup _{i=0}^{n+1} V_i\).

The graph being layered, means that there only exist edges from layer \(V_i\) to the subsequent layer \(V_{i+1}\). In particular, every shortest s-t-path contains exactly one vertex \(v^i\) from every layer \(V_i\). Intuitively, if \(v^i=v_{\varnothing }^i\), then the element i is not part of the corresponding minimum hitting set of bundles. If \(v^i=v_{[a,b]}^i\), we add i to the solution. In particular, the solution contains all elements of the interval [ab]. To ensure this, we define the edges and edge weights of the layered graph in the following way. There are four feasible types of edges:

  1. (1)

    \(\{v^i_{\varnothing },v^{i+1}_{\varnothing }\}\) of weight 0,

  2. (2)

    \(\{v^i_{\varnothing },v_{[i+1,b]}^{i+1}\}\) of weight 1,

  3. (3)

    \(\{v_{[a,b]}^{i},v_{[a,b']}^{i+1}\}\) for some \(b'\ge b\) of weight 1, and

  4. (4)

    \(\{v_{[a,i]}^{i},v_{\varnothing }^{i+1}\}\) of weight 0.

Additionally, we add edges \(\{s,v^1\}\) for all \(v^1 \in V_1\) of weight 0 if \(v^1=v_\emptyset ^1\) and of weight 1 in all remaining cases. Also, we add edges \(\{v^n,t\}\) for all \(v^n \in V_n\), all of weight 0. We only have positive weights of 1 on arcs of type \(\{s,v_{[1,b]}^{1}\}\) and \(\{v^i_{\varnothing },v_{[i+1,b]}^{i+1}\}\) and \(\{v_{[a,b]}^{i},v_{[a,b']}^{i+1}\}\). These correspond to adding the element 1 and \(i+1\) to the hitting set of bundles, respectively.

In a next step, we tailor the graph to the set structure of our specific instance by deleting certain edges. This step ensures that every set is hit. Consider a set \(F\in \mathcal F\) with \(F= \{[l,i], [i+1,u] \}\). If not all elements \(\{l,l+1,\dots ,i\}\) are part of the solution, we have to make sure that all elements in \(\{i+1,\dots ,u\}\) are added to the solution. To do so, for every \(F\in \mathcal F\) with \(F= \{[l,i], [i+1,u] \}\), we delete all edges of the following types:

  1. (1)

    the edge \(\{v^i_{\varnothing },v^{i+1}_{\varnothing }\}\),

  2. (2)

    edges \(\{v_{\varnothing }^{i},v_{[i+1,b']}^{i+1}\}\) for which \(b'<u\),

  3. (3)

    edges \(\{v_{[a,b]}^{i},v_{[a,b']}^{i+1}\}\) for which \(a>l\) and \(b'<u\), and

  4. (4)

    edges \(\{v_{[a,i]}^{i},v_{\varnothing }^{i+1}\}\) for which \(a>l\).

As argued above, doing so for all sets in \(\mathcal F\), any s-t-path gives rise to a hitting set of bundles. More formally, let \(s=v_{I_0}^0,v_{I_1}^1,\ldots ,v_{I_n}^n,v_{I_{n+1}}^{n+1}=t\) an s-t-path (where \(I_0=I_{n+1}=\varnothing \)). Then \(S = \bigcup _{0\le i\le n+1} I_i\) forms a hitting set of bundles of cardinality of the length of the path. To see that, let \(F= \{[l,i], [i+1,u] \} \in \mathcal F\) be an arbitrary set of bundles. Since \(\{v_{I_i}^i,v_{I_{i+1}}^{i+1}\}\) is an edge in the graph, we have that \([l,i] \subseteq I_i\) or \([i+1,u] \subseteq I_{i+1}\). This implies that S hits \(F\) and, hence, forms a feasible hitting set of bundles. Observe, that S is a collection of intervals and every \(\{v_{I_i}^i,v_{I_{i+1}}^{i+1}\}\) of the path has weight 1 if and only if \({i+1}\in I_{i+1}\). Hence \(i+1 \in S\) for \(0\le i \le n-1\) by the above construction. Note, \(\{v_{I_n}^n,t\}\) has weight 0. Additionally, any hitting set of bundles S with its interval decomposition \(J_1=[l_1,u_1],\ldots ,J_k=[l_k,u_k]\), such that for all \(j,j'\in [k]\) with \(j\ne j'\) we have \(J_{j} \cap J_{j'} = \varnothing \), gives rise to an s-t-path. I.e., let

$$\begin{aligned} I_i= {\left\{ \begin{array}{ll} \varnothing \quad \text {if}~i \notin S=\bigcup _j J_j,\\ J_j \quad \text {if}~i \in J_j \subseteq S, \end{array}\right. } \end{aligned}$$

then \(s,v_{I_1}^1,\ldots ,v_{I_n}^n,t\) is an s-t-path of length of the cardinality of S, as for every \(F= \{[l,i], [i+1,u] \}\) we have some J with \([l,i]\subseteq J\) or \([i+1,u]\subseteq J\) and hence \((v_{I_i}^i,v_{I_{i+1}}^{i+1})\in E\). A depiction of the constructed graph for a small instance is displayed in Fig. 3.

We now compute a shortest s-t-path in the graph. Correctness of the algorithm follows from the one-to-one correspondence of s-t-paths and the respective interval decomposition of a hitting set of bundles and the fact that the weight of any path equals the sum of the cardinality of the sets (intervals). \(\square \)

Fig. 3
figure 3

Illustration of the constructed graph on an instance of 3 elements \(\varOmega =\{1,2,3\}\) and set family \(\mathcal F=\{F_1, F_2\}\) with \(F_1=\{\{1\},\{2,3\}\}\) and \(F_2=\{\{2\},\{3\}\}\). Blue edges have weight 1, black edges weight 0. A shortest s-t-path is highlighted in bold. It corresponds to the minimum hitting set of bundles containing the elements \(\{1\}\) and \(\{3\}\)

Theorem 1 also states that Mhsib remains NP-hard if \(\mathcal F\) is a-simple and convex. The following theorem considers two different types of convex instances with an additional property. In scheduling terminology, the first type of instances corresponds to the case, where the difference of any job’s release date and deadline is bounded by k. In ii) the processing time of every job is bounded by k and the number jobs intersection with time slot i is bounded by \(f_{\max }\).

Theorem 3

Let \(([n], \mathcal F)\) be an instance of Mhsib. The problem is solvable

  1. (i)

    in \(O(2^{2k} n)\) if \(\vert i - j \vert \le k\) for all \(i, j\in \bigcup _{U\in F}U\) and all \(F\in \mathcal F\); or

  2. (ii)

    in \(O(2^{2(k+f_{\max })} n)\) if \(\vert U \vert \le k\) for all \(U\in \mathcal {U}\), \(\mathcal F\) is convex and for all \(i\in [n]\),

    $$\begin{aligned} \vert \lbrace F\in \mathcal F~\vert ~\exists ~ U\in F: i \in U\rbrace \vert \le f_{\max }. \end{aligned}$$

Proof

  1. (i)

    The key idea is to construct a weighted graph, whose shortest s-t-path gives rise to a decomposition of a solution of the corresponding Mhsib instance. Here, we make use of the fact that the size of every set is bounded by k and, thus, all subsets of elements that intersect and hit a set \(F\) can be represented by a bounded number of sets. Note that we say that an element i intersects a set \(F\) if there exists a bundle \(U\in F\) with \(i\in U\).

We start by giving a formal construction of the graph \(G=(V,E)\). First, define the vertex set \(V:=V_{0}\cup \dots \cup V_{n+1}\) with \(V_i:=\{v^i_S : S\subseteq \{i-\min \{i,k\},\dots ,i\} \}\), \(V_0:=\{s\}\) and \(V_{n+1}:=\{t\}\). Note that \(\vert V_{i}\vert \le 2^k\). In the following, for all \(i\in [n]\) we let \(\mathcal F_i\) be the family of sets with largest element i, i.e. \(i=\max \{j \in \bigcup _{U\in F}U\}\) for all \(F\in \mathcal F_i\). The set of arcs E is then defined as follows. Our graph G only has arcs between subsequent layers, that is, there only exist arcs from \(V_{i}\) to \(V_{i+1}\). More specifically, arcs from a vertex \(v^i_S \in V_i\) to all other vertices in \(V_{i+1}\) exist if S hits all sets in \(\mathcal F_i\). If \(\mathcal F_i=\emptyset \), we include all arcs in \(V_i\times V_{i+1}\). Finally, all arcs from s to \(V_1\) are contained in the edge set.

The weight of each arc \((v^i_S,v^{i+1}_{S'})\) is given by the number of elements in \(S'\setminus S\). Arcs \((s,v^1_S)\) have weight \(\vert S\vert \), and the exiting arcs \((v^n_S, t)\) have weight 0. This concludes the construction of the graph G. For a schematic picture of such a constructed graph see Fig. 4.

Fig. 4
figure 4

Illustration of the constructed graph. Exemplary edge weights are highlighted in blue

Since the size of every set is bounded by k, i.e. \(\vert i-j\vert \le k\) for all \(i,j\in \bigcup _{U\in F}U\), every subset of elements that hits all sets in \(F_i\) is a subset of \(\{i-\min \{i,k\},\dots ,i\} \}\). Observe that any shortest s-t-path P passes through exactly one vertex of every \(V_i\). On that path, each vertex \(v^{i}_S\) represents a set of elements S. Let \(S^*:= \bigcup _{S: v^i_S \in P} S\). We claim that \(S^*\) is an optimal solution to the corresponding Mhsib instance. Every set \(F\) has a largest element i, and only vertices \(v^i_S\) where S hits all sets in \(\mathcal F_i\) have outgoing arcs. Thus, \(S^*\) hits every set in \(\mathcal F\) and, hence, \(S^*\) is feasible. Observe, that by the choice of the weight function the length of any shortest s-t-path P equals \(\vert S^*\vert \).

On the other hand every hitting set of bundles \(S^*\) gives rise to an s-t-path of same cost in the corresponding graph G. The path is obtained by traversing the respective vertices of the sets \(\{i-\min \{i,k\},\dots ,i\}\cap S^*\) for all \(i\in [n]\). Note that the size of the constructed graph is in \(O(2^{2k}n)\), following by the bounds on \(\vert V_i \vert \) and the fact that we only have edges between subsequent layers \(V_i\) and \(V_{i+1}\). A simple BFS finds a shortest path in linear time of the size of the graph.

  1. (ii)

    Again we make use of a graph \(G=(V,E)\) defined on \(V:=V_0\cup \dots \cup V_{n+1}\). Here,

    $$\begin{aligned} V_i:={\mathcal {P}}(\{i-\min \{i,k\},\dots ,i\})\times {\mathcal {P}}( \{F\in \mathcal F~\vert ~\exists U\in F: i\in U\}), \end{aligned}$$

where \({\mathcal {P}}\) denotes the power set. \(V_0:=\{s\}\) and \(V_{n+1}:=\{t\}\). In other words, every vertex in \(V_i\) for \(i\in [n]\) corresponds to a tuple \((S^{\varOmega },S^{\mathcal F})\) with \(S^{\varOmega }\subseteq \{i-\min \{i,k\},\dots ,i\}\) and \(S^{\mathcal F} \subseteq \{F\in \mathcal F~\vert ~\exists U\in F: i\in U\}\). The set \(S^{\mathcal F}\) shall be used to encode which of the sets containing element i already have been hit. Note that by assumption, the number of vertices in each set \(V_i\) is bounded by the number of tuples, which is at most \(2^{k}\cdot 2^{f_{\max }}\).

Given the vertex set as defined above, we only allow arcs between subsequent layers, that is, between vertices in \(V_{i}\) and vertices in \(V_{i+1}\). More specifically, there exists an arc from \((S^{\varOmega },S^{\mathcal F})\in V_i\) towards \((\bar{S}^{\varOmega },\bar{S}^{\mathcal F})\in V_{i+1}\) if the two following conditions hold:

  1. (a)

    for all \(F\in \bar{S}^{\mathcal F}\setminus S^{\mathcal F}\) there exists a bundle \(U\) in \(F\) such that \(U\subseteq \bar{S}^\varOmega \). That is, \(F\in \bar{S}^{\mathcal F}\setminus S^{\mathcal F}\) if \(\bar{S}^{\varOmega }\) covers one bundle in \(F\); and

  2. (b)

    if \(F\in \mathcal F_{i}\), then \(F\in S^{\mathcal F}\). This ensures that any s-t-path corresponds to a feasible solution of the respective instance of Mhsib.

All arcs in \(\{s\}\times \{(S^{\varOmega },S^{\mathcal F}) \in V_1: S^{\mathcal F}=\emptyset \}\) are contained in E. Let the weight function \(w: E \rightarrow {\mathbb {N}}\) be defined as

$$\begin{aligned} w((S^{\varOmega },S^{\mathcal F}),(\bar{S}^{\varOmega },\bar{S}^{\mathcal F}))= \vert \bar{S}^{\varOmega }\setminus S^{\varOmega }\vert . \end{aligned}$$

We claim, that an optimal solution to Mhsib can be obtained by computing a shortest s-t-path in the corresponding graph G. A solution to the Mhsib instance is, again, obtained by taking the union of all sets \(S^{\varOmega }\) of elements represented by the traversed vertices. Feasibility of the hitting set of bundles follows directly from the construction of the arcs in b). Moreover, the size of the solution equals the length of the path.

Similarly to i), by the choice of the weights, any solution to the corresponding instance of Mhsib gives rise to an s-t-path of same length. At this point we make use of the fact that the family is convex, which implies that the set of elements that is contained in a set forms an interval. Without this condition, there might exist layers i,j with \(i<j-1\), and a set \(F\) such that i and j intersect \(F\) but \(j-1\) does not. Due to condition b), this forces the set \(F\) to be hit by the elements from \(\{j,\ldots ,n\}\), even though in an optimal solution the set \(F\) might only be hit by elements from \(\{1,\ldots ,j-1\}\).

Note that the size of the constructed graph is in \(O(2^{2(k+f_{\max })}n)\), following by the bounds on \(\vert V_i \vert \) and the fact that we only have edges between subsequent layers \(V_i\) and \(V_{i+1}\). A simple BFS finds a shortest path in linear time of the size of the graph. \(\square \)

Fig. 5
figure 5

Example of an instance of railway maintenance. We are given a railway corridor between location A and location B with bidirectional traffic. The parallel lines represent train paths. The paths \(\{1,\dots n_1\}\) correspond to trains from A to B and the paths \(\{n_1+1, \dots n\}\) correspond to trains from B to A. Dashed boxes represent maintenance jobs. In particular, the height of a box corresponds to the section of the railway corridor that requires maintenance work and the length corresponds to the time window in which a job has to be carried out. The length of a solid box represents the processing time of a job. This can be formulated as an instance of 2-dim Mhsib in the following way: The ground set of elements is given by the set of train paths. For every dashed box we introduce a set F. The bundles in F are determined by the sets of train path that interfere with the solid box, given a certain position within the dashed box

3 Minimum Hitting Set of 2-Dimensional Interval Bundles

So far we focused on instances with a given total ordering of \(\varOmega \). In this section, we consider instances with \(\varOmega := N_{1}{\dot{\cup }} N_{2}\) where we are given a total ordering \(\prec _1\) of the elements in \(N_{1}\) and a total ordering \(\prec _2\) of the elements in \(N_{2}\). Throughout this section we may assume, w.l.o.g., that \(N_{1}:=\lbrace 1,\dots ,n_{1}\rbrace \) as well as \(N_{2}:=\lbrace n_{1}+1,\dots ,n\rbrace \) with cardinalities \(n_{i}:=\vert N_{i} \vert \) for \(i \in \lbrace 1,2\rbrace \).

As mentioned in the introduction, this setting is also motivated by an application to railway maintenance ( [9] and see Fig. 5). Interpret \(N_1\) and \(N_2\) as two sets of train paths in opposite direction on a railway track and let \(\varOmega := N_{1}{\dot{\cup }} N_{2}\). If we assume that maintenance jobs are executed without preemption, we obtain bundles with a very specific structure, so-called 2-dimensional interval bundles. More specifically, a set of bundles is 2-dimensional interval if for all \(U\in \mathcal {U}\), if \(i_1,i_2\in U\) with \(i_1,i_2 \in N_{1}\), then for all \(i' \in N_{1}\) with \(i_1 \le i' \le i_2\) we also have that \(i' \in U\). Analogously, if \(j_1,j_2\in U\) with \(j_1,j_2\in N_{2}\), then for all \(j' \in N_{2}\) with \(j_1 \le j' \le j_2\) we also have that \(j' \in U\). We refer to this problem as the Minimum Hitting Set of 2-dimensional Interval Bundles Problem (2-dim Mhsib) and define the properties of being convex and a-simple as follows.

Definition 2

Let \((N_{1}{\dot{\cup }}N_{2},\mathcal F)\) be an instance of 2-dim Mhsib. For \(i \in \{1,2\}\), let \(\mathcal F_{\vert N_i}\) be the family of sets restricted to \(N_i\), that is, \(\mathcal F_{\vert N_i}:=\lbrace \left( \bigcup _{U\in F} \{U\cap N_i\}\right) \setminus \{\emptyset \}: F\in \mathcal F\rbrace \). We call the family \(\mathcal F\)

  1. (i)

    2-dim-convex if \(\mathcal F_{\vert N_{1}}\) and \(\mathcal F_{\vert N_{2}}\) are convex.

  2. (ii)

    a-simple for some \(a \in {\mathbb {N}}\) if \(\vert U\vert =a\) for all \(U\in \mathcal {U}\).

Remark 3

In applications of 2-dim Mhsib, such as the maintenance scheduling problem, the property of \(\mathcal F\) being 2-dim-convex refers to a setting where the starting time of a job can be anywhere in a given time window. a-simple is a natural property in a regular train schedule setting where, independently of the starting time, a job always interferes with the same number of train paths.

2-dim Mhsib remains NP-hard even on 2-dim-convex families [9]. We are able to prove a slightly stronger result.

Theorem 4

The 2-dim Mhsib problem remains NP-hard if \(\mathcal F\) is 2-dim-convex, 1-simple, and \(F_{\max }=2\).

Proof

We use the following graph construction to reduce vertex cover to an instance with the required properties. Given a graph \(G=(V,E)\), subdivide each edge \(e=\{u,v\}\in E\) into three edges \(\{u,e_u\}\), \(\{e_u,e_v \},\{e_v,v\}\) and denote the resulting graph by \(G'=(V',E')\). Note that any vertex cover S in G yields a vertex cover in \(G'\) if we add exactly one vertex \(e_u,e_v\) for every edge \(e= \{u,v\}\in E\) to the cover. This new cover \(S'\) has size \(S+\vert E\vert \).

Also, w.l.o.g. a minimum vertex cover \(S'\) in \(G'\) contains exactly one vertex \(e_u,e_v\) for every edge \(e= \{u,v\}\in E\) (If not, it is either not a cover or we can add u and remove \(e_u\) from the cover without changing its cardinality). We claim that \(S'\cap V\) is a vertex cover in G. Assume the contrary, i.e. there is an edge \(e=\{u,v\}\) that is not covered. Then, since \(S'\) was a cover in \(G'\), the edges \(\{e_u,e_v \}\) must be covered by \(S'\) by either \(e_u\) or \(e_v\). Assume it is covered by \(e_u\), then it immediately follows by assumption that \(v\in S'\). A contradiction. Note that the cover \(S=S'\cap V\) has cardinality \(\vert S'\vert -\vert E\vert \).

Now, it is easy to see that \(\mathcal F:=\{\{\{u\},\{v\}\}:\{u,v\}\in E'\}\) is an instance which is 1-simple, \(F_{\max }=2\) and 2-dim convex. The latter follows from the fact that we do not have edges in V and edges \(\{e_u,e_v\}\) only. Thus, all sets are convex using an arbitrary ordering on V and an arbitrary ordering on the set of edge-vertices \(V'\setminus V\) as long as \(e_v,e_u\) follow one after another for all \(e\in E\). Figure 6 shows the construction and indicates the partition of elements into V and \(V'\setminus V\). \(\square \)

In the following, we outline a polynomial-time approximation algorithm for 2-dim Mhsib where the family \(\mathcal F\) is 1-simple and \(\vert F\vert \le 2\) for all \(F\in \mathcal F\). Since \(F_{\max }\le 2\) in this setting, there already exists a 2-factor approximation algorithm [1]. We improve the approximation guarantee slightly by using techniques from approximation algorithms for the vertex cover problem.

Theorem 5

Let \((N_{1}{\dot{\cup }}N_{2}, \mathcal F)\) be an instance of 2-dim Mhsib. If \(\mathcal F\) is 1-simple and \(F_{\max }= 2\), then there exists a polynomial-time \(\bigl ( 2-\frac{1}{k+1}\bigr )\)-factor approximation algorithm with

$$\begin{aligned} k:=\max \Bigl \lbrace \vert i-j\vert : \lbrace \lbrace i\rbrace , \lbrace j\rbrace \rbrace \in \mathcal F\text { with either } i,j\in N_{1} \text { or } i,j\in N_{2} \Bigr \rbrace . \end{aligned}$$

Proof

The proof of the theorem is based on a result by Hochbaum [12] for which we are going to convert our instance of 2-dim Mhsib into a vertex cover instance. The theorem is stated below.

Theorem 6

(Hochbaum [12]) Let G be a k-colorable graph. There exists a polynomial-time algorithm that finds a vertex cover with a size of at most \(\left( 2-\frac{2}{k}\right) \) times the size of an optimal vertex cover.

Fig. 6
figure 6

Construction of \(G'\) from G and corresponding vertex covers S and \(S'\) in G and \(G'\), respectively

Given a 1-simple instance \((N_{1}{\dot{\cup }}N_{2}, \mathcal F)\) of 2-dim Mhsib with \(F_{\max }=2\), we construct a graph \(G=(V,E)\) with \(V:=N_{1}{\dot{\cup }}N_{2}\) and \(E:=\lbrace \lbrace i,j\rbrace : \{\{i\},\{j\}\} \in \mathcal F\rbrace \). Then, an optimal solution of 2-dim Mhsib corresponds to a minimum vertex cover in the graph G and vice versa.

Consider the subgraphs of \(G[N_{1}]\) and \(G[N_{2}]\) and observe that by the construction of G and the definition of k it holds that \(\varDelta (G[N_{1}])\le k\) and \(\varDelta (G[N_{2}])\le k\). Therefore, by Brooks’ Theorem both induced subgraphs admit a \((k+1)\)-coloring. These colorings can be extended to a \((2k+2)\)-coloring of G.

Applying Theorem 6, a chromatic number of at most \(2k+2\) immediately guarantees a polynomial-time algorithm that computes a vertex cover whose size is at most \((2-\frac{1}{k+1})\) times the size of an optimal vertex cover. \(\square \)

Finally, we present a first polynomial-time algorithm for a special case of 2-dim Mhsib. It is based on a decomposition of the problem.

Theorem 7

Let \((N_{1}{\dot{\cup }} N_{2}, \mathcal F)\) be a 2-dim-convex instance of 2-dim Mhsib with \(F_{\max }\le 2\). The problem is solvable in polynomial time if for all \(F\in \mathcal F\) and \(U,U'\in F\), the symmetric difference only contains two elements, i.e. \(\vert U\triangle U'\vert =2\) and either

  1. (i)

    those elements belong to \(N_{1}\) or \(N_{2}\), i.e. \(U\triangle U' \subset N_{1}\) or \(U\triangle U'\subset N_{2}\); or

  2. (ii)

    the symmetric difference always contains one element from \(N_{1}\) and one element from \(N_{2}\), i.e. \(U\triangle U' \cap N_{1}\ne \emptyset \) and \(U\triangle U'\cap N_{2}\ne \emptyset \).

Proof

First, note that for any \(F=\{U,U'\}\) all elements in \(U\cap U'\) must be contained in any hitting set of bundles. Let \((N_{1}'{\dot{\cup }}N_{2}', \mathcal F')\) be the instance after removing all elements in the respective intersections. Note that there is a one-to-one correspondence between a minimum hitting set of bundles of \(\mathcal F'\) and a minimum hitting set of bundles of \(\mathcal F\) by adding back or removing the aforementioned elements of the intersections. Due to 2-dim-convexity and a 2-dimensional interval family \(\mathcal F\), we have that the reduced family \(\mathcal F'\) on the reduced set of elements \(N_{1}'{\dot{\cup }}N_{2}'\) is again 2-dim-convex and 2-dimensional interval. Furthermore, w.l.o.g., \(\vert F\vert = 2\) for all \(F\in \mathcal F'\). (If \(\vert F\vert =1\) all elements in \(U\in F\) are contained in any hitting set of bundles and can be removed to reduce the instance even further.) Additionally, by assumption the family \(\mathcal F'\) is 1-simple.

Observe, the graph on \(N_{1}'{\dot{\cup }}N_{2}'\) with edges corresponding to the sets in \(\mathcal F'\) is bipartite. This follows from the fact that on the one hand, if \(U\triangle U' \subset N_{1}\) or \(U\triangle U'\subset N_{2}\) for all \(\{U,U'\}\in \mathcal F\), due to convexity, the graph consists of a collection of paths since each element in \(N_{1}'{\dot{\cup }}N_{2}'\) can neighbor at most two others and cycles cannot occur. On the other hand, if \(U\triangle U' \cap N_{1}\ne \emptyset \) and \(U\triangle U'\cap N_{2}\ne \emptyset \) for all \(\{U,U'\}\in \mathcal F\), edges only occur between vertices representing the sets \(N_{1}'\) and \(N_{2}'\), but not within the sets \(N_{1}'\) and \(N_{2}'\), respectively. Now, any vertex cover in the constructed graph corresponds to a solution of 2-dim Mhsib and vice versa. Since the graph is bipartite, we can find a vertex cover in polynomial time, which follows from König’s theorem and any polynomial time maximum matching algorithm [17]. Finally, a solution to \(\mathcal F'\) can easily be lifted to a solution to \(\mathcal F\). \(\square \)

4 Conclusion

In this paper, we study the minimum hitting set of bundles problem on interval and 2-dimensional interval bundles. In the following, we provide an outlook on future research directions. In Theorem 1, we show that Mhsib remains NP-hard if the family \(\mathcal F\) is convex and a-simple, where a is some function in n. For convex, a-simple families with constant a the complexity remains open.

The following application of Mhsb gives rise to several interesting questions for future research. The universe is given by a collection of courses. Every set represents a student and the bundles of the set correspond to a feasible subset of courses, the student should take. In this case, the bundle constraints do not only express time overlaps, but can also be used to encode that some courses might only be taken in combination (e.g. lectures and corresponding exercise classes, lab experiments, etc.). In this setting, the introduction of an upper bound on the number of students per course seems natural. This corresponds to introducing an upper bound on the number of sets that are hit by a bundle, containing a specific element. If the bundles are interval, this problem is equivalent to active time minimization with capacity B.