1 Introduction

Adequate and differentiated service of customers becomes more and more important and product availability is an important aspect of service (cf. Isotupa [2006, p. 687]). There are many situations where it would be financially beneficial to provide different service levels to different customer classes (cf. Isotupa [2015, p. 411], Liu et al. [2013, pp. 1544f.] and Yadavalli et al. [2015, p. 637]): Companies often need to provide different service levels to different customers based on their contracts. This means that orders with long term contracts have higher priority than unscheduled orders since they may bear lower shortage cost than the booked orders. Another example is the case where different customers pay different prices for the same product. Then the company has an incentive to meet more of the demand of the customer who pays a higher price than a customer who pays a lower price. Theory and application of priority queues are well established parts of classical queueing theory (see e.g. Jaiswal (1968) and Wang et al. (2015)). Similarly, theory and application of inventory holding for streams of differentiated demand has found much interest in research (see e.g.  Isotupa (2015, 2011, 2007, 2006), Isotupa and Samanta (2013), Chen et al. (2012), Liu et al. (2013, 2014)). However, up to now, one of the key assumptions of queueing-inventory models in literature is that customers are indistinguishable.

The research communicated in this article is devoted to queueing-inventory systems where customers of two classes of different priorities are served. For serving a customer a piece of raw material from an associated finished goods inventory is needed. The inventory management is according to a standard base stock policy. Replenishment orders are fulfilled by an external supplier and the lead time is positive. Customers arriving when the inventory is depleted are rejected (lost sales). Additionally arrivals are regulated by a flexible admission control which respects priorities.

Our main interest are stability conditions for this system and the stationary behaviour of a stable system. We derive structural properties of the steady state distribution which provide insights into the equilibrium behaviour of the system. An explicit expression of the complete stationary distribution is still an open problem. Inspection of the literature shows that up to now there is no result available which provides rigorously proved conditions for stability in the case that both customer classes have unbounded queues. Our main result is therefore to derive a sufficient condition for ergodicity by the Foster-Lyapunov stability criterion. For a subclass of the admission policies we show that this condition is necessary as well. We expect that this is in general not the case. Furthermore, we consider the case of instant service, where we determine the stationary distribution explicitly.

The paper is organised as follows. In Sect. 2 we describe the related literature. In Sect. 3 we introduce our integrated model for production and inventory management and provide a Markov process Z for the description of the time evolution of the integrated system. In Sect. 4.1 ergodicity is investigated in detail. In Sect. 4.2 we assume that the queueing-inventory process Z is ergodic to analyse the properties of the stationary system. In Sect. 5 we consider the case of zero service time. In Sect. 6 we summarise our main findings and indicate further research directions.

2 Related literature and own contributions

Our research is connected to various parts of queueing theory and inventory control and their interplay in queueing-inventory models. Our fundamental production system is a classical \(M/M/1/\infty \) queueing system, see e.g. Wolff [1989, Chapter 5]. The specific features added to the \(M/M/1/\infty \) in this article are the following:

  • Service of a customer needs a piece of raw material available in an associated inventory (for a review of research on these models see e.g. Krishnamoorthy et al. (2021)).

  • Arriving customers have different priorities which result in a complicated admission control problem (for fundamentals on priority queues see Jaiswal (1968)).

Although in our models the customers are of different priority classes,

  • they require the same type of raw material stored in a single inventory and

  • they experience the same service time distributions.

The first property means that the items of raw material in the inventory are exchangeable with respect to the requesting customers’ services. This property is common in certain applications of e.g. maintenance and repair, see e.g. investigations by Ravid et al. (2013) and Daduna (1990) and references therein.

The second property is discussed and motivated by Wang et al. [2015, p. 733] who identify three main motivations for prioritization and argue that two of these lead to identical service times over classes:

  • Different customers have different willingness to pay for the same product.

  • Customers may require different products or services, where some of these products are more profitable than others.

  • Different service levels may substantially affect long-term profitability.

In our model we follow their conclusion: “Modelling the effects of prioritization due to the first and third motivations can be achieved with identical service time distributions for different segments.” [Wang et al. (2015), p. 733]

Literature on inventory theory related to our problems encompass the following problems and articles. Studies by Gruen et al. (2002) and Verhoef and Sloot (2006) analyse customers’ behaviour in practice and show that in many retail settings most of the original demand can be considered to be lost in case of a stockout. For an overview of the literature on systems with lost sales we refer to Bijvank and Vis (2011). They present a classification scheme for the replenishment policies most often applied in literature and practice, and they review the proposed replenishment policies, including the base stock policy.

Tempelmeier [2005, p. 84] argued that base stock control is economically reasonable if the order quantity is limited because of technical reasons. The base stock policy is “(...) more suitable for item with low demand, including the case of most spare parts” Rego and Mesquita [2011, p. 661].

Morse [1958, p. 9] investigated (pure) inventory systems that operate under a base stock policy. He gives a very simple example where the concept “re-order for each item sold” is useful: Items in inventory are bulky, and expensive (automobiles or TV setsFootnote 1). He uses queueing theory to model the inventory systems, analogously to e.g.  Reed and Zhang (2017).

Using queueing-theoretical methods to solve inventory models is well established, often under the heading of “production-inventory systems”. In this setting the production usually refers to the replenishment systems, early references are e.g.  Morse (1958), Karush (1957) (lost sales), Kaplan (1970) (backordering).

More recent examples are  Rubio and Wein (1996), Lee and Zipkin (1992, 1995), Zazanis (1994), and Li and Arreola-Risa (2021) who investigated classical single item and multi-item inventory systems. Especially they evaluate the performance of base stock control policies in complex situations.

For a review of literature on inventory control systems (= queueing-inventory systems with service time equal to 0, i.e. “instant service” Melikov et al. (2018a)) with multiple customer classes, we refer to Isotupa ([2011, Sect. 2, pp. 3ff.], [2015 Sect. 1, pp. 411ff.]) and Arslan et al. [2007, pp. 1486ff.]. It is differentiated between priority disciplines that regulate customer arrivals and priority disciplines that regulate customer services. We combine both features in our model.

Literature on integrated queueing-inventory models seemingly appeared only from 1992 on, see e.g. Sigman and Simchi-Levi (1992), Melikov and Molchanov (1992). For a recent extensive review we refer to Krishnamoorthy et al. (2021). Because the research described in the present article is on queueing-inventory systems with priority classes for customers we concentrate here on articles which deal with priority problems in queueing-inventory systems.

Importance of research on this topic is emphasized by e.g. Yadavalli et al. (2015) who remarked that patients with serious illnesses are given priority over patients opting for routine checks or else in multi-speciality hospitals. Similarly,  Liu et al. [2013, pp. 1544f.] stated that orders with long term contracts have higher priority than unscheduled orders since they may bear lower shortage cost than the booked orders.

To the best of our knowledge, queueing-inventory systems (under the heading “inventory in counter-stream serving systems”) with different classes of customers were first considered by Melikov and Fatalieva (1998) who formulated a Markov decision model to minimise a cost function which encompasses costs for waiting, inventory holding, loss of demand and for dispatching items. In this problem setting there is no direct prioritization of classes.

Zhao and Lian (2011) seemingly were the first to investigate a system with Poisson arrivals of different priority classes, exponentially distributed service and lead times under backordering. They found a priority service rule to minimise the long-run expected waiting cost by a dynamic programming method. They formulate the model as a level-dependent quasi-birth-and-death process such that the steady state probability distribution of their production-inventory systems can be computed by the Bright-Taylor algorithm.

Chen et al. (2012) and Liu et al. (2013, 2014) introduce a flexible admission control with a priority parameter \(0\le p\le 1\) “for controlling the application of priority” [Liu et al. (2014), p. 181]. The priority parameter p indicates the probability with which the arrivals of ordinary customers are treated like the arrivals of priority customers. If \(p=1\), there is no priority in regulating arrivals. If \(p=0\), there is a strict priority in regulating arrivals. In the last case, their model is the same as the model of Isotupa (2007). They derive the stationary distribution of the inventory levels and some performance measures. To obtain the optimal inventory control policy they construct a mixed integer optimization problem in Liu et al. (2013, 2014) and develop an efficient searching algorithm in Chen et al. (2012).

Similar randomized differentiated admission control is incorporated in many models and is discussed in-depth by Isotupa in a sequence of papers (Isotupa (2006, 2007, 2011, 2015); Isotupa and Samanta (2013)). Isotupa investigates inventory systems with (rQ)- and base stock policy for two classes of customers. Especially, the lost sales property for ordinary customers is of interest, e.g. for inventories of spare parts in the airline or shipping industries, (cf. Isotupa [2011, pp. 1f.]).

Jeganathan and coauthors investigate in a sequence of papers (e.g. Jeganathan et al. (2016), Jeganathan (2015), Yadavalli et al. (2015)) queueing-inventory systems with two classes of customers. They consider models with impatient customers, an optional second service and a mixed priority service (non-preemptive priority and preemptive priority).

Li and Zhao (2009) investigate a preemptive priority queueing system (without inventory) with two classes of customers and an exponential single server who serves the two classes of customers at potentially different rates.

Krishnamoorthy and Manjunath (2015) investigate a two-queue service system (without inventory) where customers arriving in a Poisson stream enter the high-priority queue. An ongoing service of the customers in the high-priority queue may be interrupted and the interrupted customer is sent to the low-priority queue. Service times are priority dependent exponential. In Krishnamoorthy and Manjunath (2018) customers arriving in a marked Poisson process enter either the high or the low priority queue. Furthermore, after being served at the high priority queue the customers may decide to ask for an additional service at the low priority queue. Service times are phase-type distributed.

Melikov et al. (2018a, 2018b) investigate variants of queueing-inventory systems with demand of two classes with high and low priority. They consider an admission control scheme which allows access for high priority customers in any case (backordering) and rejects low priority customers when the stock level is below a critical value. Using approximation methods they evaluate various performance measures of the systems.

Following the authors’ description the papers of Melikov et al. (2022a, 2022b) are (at a first glance) not on priority systems. But after a reinterpretation the models fit into our modelling framework. The authors consider a typical (exponential) queueing-inventory system with some additional features e.g. a special reorder policy with the possibility of emergency reorders, randomized admission in case of stockout, impatient customers. Using the terms of priority systems these customers are those having low priority. Additionally, there is a second incoming Poisson stream of (very) high priority customers which have interrupting priority over the low priority customers. The service time of the high priority customers are negligible (\(=0\)) and in case of stockout they immediately depart (lost sales). In terms of Melikov et al. (2022a, 2022b) the “high priority customers” are “destructive customers” which describe sudden deterioration of items in the inventory which resemble negative customers in pure queueing systems (cf. Gelenbe (1991), Gelenbe et al. (1991)).

The problem of stability for queueing-inventory systems with priority classes of demand has not found much interest in the literature. Most of the papers with priorities in queueing-inventory systems use state space truncation, i.e. consider finite waiting rooms for tackling the stability problem, either both customer queues are assumed to be finite (then stability is not an issue) or one of the queues is truncated (most often the low priority queue, but in some articles high priority customers never queue up). Then the system fits into the realm of QBD processes. Some representative examples are

for finite state space:Yadavalli et al. (2015), Liu et al. (2013), Chen et al. (2012) (service time \(=0\)), Jeganathan (2015), Jeganathan et al. (2013, 2016), Melikov et al. (2018a), Wang (2015),

for countable state space:Shajin et al. (2020) and Baek et al. (2017) (only a queue for low priority customers), Melikov et al. (2018a, 2018b) (steady state distribution obtained with approximation methods), (Zhao & Lian, 2011) (heuristic/intuitive criterion for ergodicity for two demand classes, extending the single demand class case of Schwarz et al. (2006)), Melikov et al. (2022b, 2022a) (no queue for the “very high priority” customers \(=\) destructive customers).

The conclusion is that (best to our knowledge) there exists up to now no rigorous result (criterion) for stability of queueing-inventory systems with differentiated priorities for demand classes in case of state spaces which are two-dimensional infinite, i.e. including \(\mathbb {N}_0^2\), counting for high and low priority customers (demands). This observation is not surprising because in a general context this problem can be termed as “ergodicity of random walks in the quarter plane” (\(\mathbb {N}_0^2\)), which is known to be a notoriously hard problem, see Fayolle et al. (1999). Even more, in the present problem setting this random walk is influenced by the additional (finite) dimension of the inventory. So our investigation is on ergodicity of random walks in the quarter plane in a random environment. A very recent investigation of such problems is from Dimitriou (2022).

Our main contributions are the following:

Starting from a Markovian description for a queueing-inventory system with two unbounded queues for customers of different priority classes and flexible admission control we derive (rigorously) sufficient conditions which guarantee stability of the system. For special cases we even show that the condition is sharp (necessary for stability). Although the complete stationary distribution seems to be an open problem we are able to derive several partial balance properties of the system which are of independent interest. Furthermore, we consider the special case of zero service time and determine the stationary distribution explicitly.

Notations and Conventions:

  • \(\mathbb {N}:=\left\{ 1,2,3,\ldots \right\} \), \(\mathbb {N}_{0}:=\{0\}\cup \mathbb {N}\).

  • The notation \(\subset \) between sets means “subset or equal” and \(\subsetneq \) means “proper subset”. For a set A we denote by \(\vert A\vert \) the number of elements in A.

  • \(1_{\left\{ expression\right\} }\) is the indicator function which is 1 if expression is true and 0 otherwise.

  • Empty sums are 0, and empty products are 1.

  • All random variables are defined on a common probability space \((\Omega ,\mathcal{F},P)\).

  • By Markov process we mean time-homogeneous continuous-time strong Markov process with discrete state space (\(=\) Markov jump process). Markov processes occurring are assumed to be regular and having paths which are right-continuous with left limits (cadlag). A Markov process is regular if it is non-explosive (i.e. the sequence of jump times of the process diverges almost surely), its transition intensity matrix is conservative (i.e. row sums are 0) and stable (i.e. all diagonal elements of the transition intensity matrix are finite).

3 Description of the model

The supply chain of interest is depicted in Fig. and consists of two arrival streams of priority and ordinary customers, a production system (a single server with two unlimited waiting rooms), an inventory and a supplier.

Fig. 1
figure 1

The production-inventory system with two customer classes

The production system manufactures units according to customers’ demand on a make-to-order basis. There are two types of customers—priority customers (type 1) and ordinary customers (type 2). \(\overline{C}=\left\{ 1,2\right\} \) is the set of customer classes. Priority customers arrive according to a Poisson process with rate \(\lambda _{1}>0\) and ordinary customers arrive according to a Poisson process with rate \(\lambda _{2}>0\).

Each customer needs exactly one item from the inventory for service. The service time for both types of customers is exponentially distributed with parameter \(\mu >0\). If the server is ready to serve a customer, who is at the head of the line, and the inventory is not depleted, the service begins immediately. Otherwise, the service starts at the instant of time when the next replenishment arrives at the inventory. A served customer departs from the system immediately and the associated item is removed from the inventory at this time instant.

An outside supplier replenishes raw material to the inventory according to a base-stock policy. Hence, each item taken from the inventory results in a direct order sent to the supplier. This means, if a served customer departs from the system, an order for one item of the consumed raw material is placed at the supplier at this instant of time. The base stock level \(b\ge 2\) is the maximal size of the inventory. The replenishment lead time is exponentially distributed with parameter \(\nu >0\). (Note that there can be more than one outstanding order.) Customers’ arrivals are regulated by a flexible admission control with priority parameter p, \(0\le p\le 1\): If the inventory is depleted all arriving customers are rejected (“lost sales”). If the on-hand inventory is greater than a prescribed threshold level s, \(0<s<b\), customers of both classes are admitted to enter the system. If the on-hand inventory reaches or falls below the threshold level s, priority customers still enter the system but ordinary customers are allowed to enter only with probability p and are rejected with probability \(1-p\).

There is a single server with two separate infinite waiting rooms—one waiting room for priority customers (priority queue) and one waiting room for ordinary customers (ordinary queue) both under an FCFS regime. If both customer queues are not empty, the server needs to decide which one of them should be served. The choice is made according to the preemptive resume discipline. An overview of various priority disciplines can be found in Jaiswal [1968, p. 53].

According to the preemptive resume discipline a newly arriving priority customer interrupts immediately an ongoing service of an ordinary customer. The preempted ordinary customer is put at the head of the ordinary customer queue and has to wait until the priority queue is exhausted before he re-enters service. The preempted customer resumes service from the point of interruption so that his service time upon re-entry has been reduced by the amount of time the customer has already spent in service (cf. Miller [1958, p. 1]). Since it is assumed that the service time is exponential, the ordinary customer requires on its re-entry stochastically the same amount of service as it required on its earlier entry. Thus, the preemptive resume discipline is equal to the preemptive repeat-identical discipline where the preempted customer requires the same amount of service on its re-entry as he required on his earlier entry (cf. Jaiswal [1968, p. 53]).

It is assumed that transmission times for orders are negligible and set to zero and that the transportation time between the production system and the inventory is negligible. All service times, inter-arrival times and replenishment lead times constitute an independent family of random variables.

A Markovian process description of the integrated queueing-inventory system is obtained as follows. Denote by \(X_{1}(t)\) the number of priority customers present in the system at time \(t\ge 0\), and by \(X_{2}(t)\) the number of ordinary customers in the system at time \(t\ge 0\). So \(X_{j}(t), j=1,2,\) counts the respective number of customers either waiting or in service. Since the customer in service will always be of the priority class when at least one priority customer is present, the value of the vector \((X_{1}(t),X_{2}(t))\) determines uniquely the type of the customer in service at time \(t\ge 0\), if any. By Y(t) we denote the on-hand inventory at time \(t\ge 0\).

We define the joint queueing-inventory process of this system by

$$\begin{aligned} Z=((X_{1}(t),X_{2}(t),Y(t)):t\ge 0). \end{aligned}$$

Then, due to the usual independence and memoryless assumptions Z is a homogeneous Markov process. The state space of Z is

$$\begin{aligned} E=\left\{ (n_{1},n_{2},k):(n_{1},n_{2})\in \mathbb {N}_{0}^{2},\,k\in \{0,\ldots ,b\}\right\} , \end{aligned}$$

where b is the maximal size of the inventory. Z has an infinitesimal generator \(\textbf{Q}=\left( q(z;\tilde{z}):z,\tilde{z}\in E\right) \) with the following transition rates for \((n_{1},n_{2},k)\in E\):

$$\begin{aligned} q((n_{1},n_{2},k);(n_{1}+1,n_{2},k))&=\lambda _{1}\cdot 1_{\left\{ k>0\right\} },\\ q((n_{1},n_{2},k);(n_{1},n_{2}+1,k))&=p\,\lambda _{2}\cdot 1_{\left\{ 0<k\le s\right\} }+\lambda _{2}\cdot 1_{\left\{ k>s\right\} },\\ q((n_{1},n_{2},k);(n_{1}-1,n_{2},k-1))&=\mu \cdot 1_{\left\{ n_{1}>0\right\} }\cdot 1_{\left\{ k>0\right\} },\\ q((n_{1},n_{2},k);(n_{1},n_{2}-1,k-1))&=\mu \cdot 1_{\left\{ n_{1}=0\right\} }\cdot 1_{\left\{ n_{2}>0\right\} }\cdot 1_{\left\{ k>0\right\} },\\ q((n_{1},n_{2},k);(n_{1},n_{2},k+1))&=\nu \cdot 1_{\left\{ k<b\right\} }. \end{aligned}$$

Furthermore, \(q(z;\tilde{z})=0\) for any other pair \(z\ne \tilde{z}\), and

$$\begin{aligned} q\left( z;z\right) =-\sum _{\begin{array}{c} \tilde{z}\in E,\\ z\ne \tilde{z} \end{array} }q\left( z;\tilde{z}\right) \qquad \forall z\in E. \end{aligned}$$

Definition 1

If the queueing-inventory process Z on state space E is ergodic, we denote its limiting and stationary distribution by

$$\begin{aligned} \pi:= & {} \left( \pi \left( n_{1},n_{2},k\right) :\left( n_{1},n_{2},k\right) \in E\right) \\{} & {} \text {with}\quad \pi \left( n_{1},n_{2},k\right) :=\lim _{t\rightarrow \infty }P\left( Z(t)=\left( n_{1},n_{2},k\right) \right) . \end{aligned}$$

Discussion of modelling assumptions

  1. (1)

    Our admission control is a modification of the admission schemes described in the related literature in Sect. 2. The strict rejection of ordinary customers below the control limit s is weakened by a randomized decision. If the inventory level is zero, demands due to both types of customers are lost, cf. Isotupa (2007, 2011, 2015).

  2. (2)

    Preemptive resume regime of priority customers over ordinary customers is a common scheme in queueing models, c.f. Jaiswal (1968). For queueing-inventory systems coupling each service with obtaining an item from the inventory implies that an item dedicated at first to an ordinary customer can be attached to an interrupting priority customer as well. Here the property that items from stock are exchangeable is necessary.

4 Stability and stationary behaviour

In this section we investigate stability of the queueing-inventory system. This especially means to find conditions on the system which guarantee that the system approaches in the long run a steady state (i.e. stabilises). For queueing systems with priority classes of customers this is a non-trivial problem because in general (e.g. in our setting of a queue without inventory) a solution of the global balance equations for the stationary distribution is not available and the standard ergodicity criterion for QBDs (Quasi-Birth-Death processes) is not applicable, see e.g. Latouche and Ramaswami (1999).

In Sect. 4.1 we find natural conditions for Z to be ergodic by constructing a suitable Lyapunov function in the spirit of the classical Foster-Lyapunov stability criterion. Foster (1953) introduced a technique for proving stability of Markov chains which was generalised in several directions. Our proof relies on Kelly and Yudovina [2014, Proposition D.3].

Proposition 1

Let \(X:=(X(t):t\ge 0)\) be an irreducible regular Markov process with countable state space E and transition rate matrix \(\textbf{Q}:=(q(x,y):x,y\in E)\). Suppose that \(\mathcal{L}:E\rightarrow [0,\infty )\) is a function such that for constants \(\varepsilon >0\) and \(b\in \mathbb {R}\), and some finite exception set \(F\subset E\) and all \(x\in E\) it holds

$$\begin{aligned} \sum _{y\in E\setminus \{x\}}q(x,y)\left[ \mathcal{L}(y)-\mathcal{L}(x)\right] \le {\left\{ \begin{array}{ll} -\varepsilon &{}x\notin F,\\ b-\varepsilon &{} x\in F. \end{array}\right. } \end{aligned}$$
(1)

Then X is ergodic.

Although we do not find a solution of the global balance equations for Z in Sect. 4.2, we present interesting results of the behaviour of Z in steady state. These results rely on the presence of partial balance properties inherent in the dynamics of the queueing-inventory system in equilibrium. Some of the resulting properties are surprising.

4.1 Ergodicity

Irreducibility of the state process Z can be seen directly from the transition rates at the end of Sect. 3. Our main result is the following theorem in the spirit of the Foster-Lyapunov stability criterion.

Theorem 2

The queueing-inventory process Z is ergodic if \(\lambda _{1}+\lambda _{2}<\mu \).

Proof

Positive recurrence will be shown by the Foster-Lyapunov stability criterion with \(\mathcal {L}:E\rightarrow \mathbb {R}_{0}^{+}\) as Lyapunov function with

$$\begin{aligned} \mathcal {L}(n_{1},n_{2},k):=n_{1}+n_{2}+\alpha (k), \quad (n_1,n_2,k)\in E, \end{aligned}$$
(2)

where \(\alpha :\{0,1,\ldots ,b\}\rightarrow [0,\infty )\) is strictly decreasing with

$$\begin{aligned} \alpha (k)=(b-k)\cdot \frac{\mu -\lambda _{1}-\lambda _{2}}{2\mu } \end{aligned}$$
(3)

and the finite exception set is given by

$$\begin{aligned} F:=\left\{ (n_{1},n_{2},k):n_{1}+n_{2}=0\right\} . \end{aligned}$$

Furthermore, we define

$$\begin{aligned} \eta :=\mu -\lambda _{1}-\lambda _{2} \end{aligned}$$
(4)

and

$$\begin{aligned} \varepsilon :=\frac{\eta }{2}\cdot \min \left\{ 1, \frac{\nu }{\mu } \right\} . \end{aligned}$$

Due to the assumption, that \(\mu >\lambda _1+\lambda _2\) it holds \(\varepsilon >0\).

\(\blacktriangleright \) First, we will check \(\left( \mathbf {Q\cdot \mathcal {L}}\right) (n_{1},n_{2},k)<\infty \) for \((n_{1},n_{2},k)\in F\). Since \(0<\lambda _{1}<\infty \), \(0<\lambda _{2}<\infty \) and \(0<\nu <\infty \),

for \(k=0\) it holds

$$\begin{aligned} \left( \textbf{Q}\cdot \mathcal {L}\right) (0,0,0)&=\nu \cdot \left( \mathcal {L}(0,0,1)-\mathcal {L}(0,0,0)\right) \overset{(2)}{=}\nu \cdot (\alpha (1)-\alpha (0)) <\infty , \end{aligned}$$

for \(k=1,\ldots ,s\) it holds

$$\begin{aligned} \left( \textbf{Q}\cdot \mathcal {L}\right) (0,0,k)&=\lambda _{1}\cdot \left( \mathcal {L}(1,0,k)-\mathcal {L}(0,0,k)\right) +p\,\lambda _{2}\cdot \left( \mathcal {L}(0,1,k)-\mathcal {L}(0,0,k)\right) \\&\quad +\nu \cdot \left( {\mathcal {L}}(0,0,k+1)-\mathcal {L}(0,0,k)\right) \\&\overset{(2)}{=} \lambda _{1}\cdot [(1+\alpha (k))-\alpha (k)] +p\,\lambda _{2}\cdot [(1+\alpha (k))-\alpha (k)] \\&\quad + \nu \cdot (\alpha (k+1)-\alpha (k)) <\infty , \end{aligned}$$

for \(k=s+1,\ldots ,b-1\) it holds

$$\begin{aligned} \left( \textbf{Q}\cdot \mathcal {L}\right) (0,0,k)&=\lambda _{1}\cdot \left( \mathcal {L}(1,0,k)-\mathcal {L}(0,0,k)\right) +\lambda _{2}\cdot \left( \mathcal {L}(0,1,k)-\mathcal {L}(0,0,k)\right) \\&\quad +\nu \cdot \left( \mathcal {L}(0,0,k+1)-\mathcal {L}(0,0,k)\right) \\&\overset{(2)}{=} \lambda _{1}\cdot [(1+\alpha (k))-\alpha (k)] +\lambda _{2}\cdot [(1+\alpha (k))-\alpha (k)] \\&\quad + \nu \cdot (\alpha (k+1)-\alpha (k)) <\infty , \end{aligned}$$

for \(k=b\) it holds

$$\begin{aligned} \left( \textbf{Q}\cdot \mathcal {L}\right) (0,0,b)&=\lambda _{1}\cdot \left( \mathcal {L}(1,0,b)-\mathcal {L}(0,0,b)\right) +\lambda _{2}\cdot \left( \mathcal {L}(0,1,b)-\mathcal {L}(0,0,b)\right) \\&\overset{(2)}{=} \lambda _{1}\cdot [(1+\alpha (b))-\alpha (b)] +\lambda _{2}\cdot [(1+\alpha (b))-\alpha (b)] <\infty . \end{aligned}$$

\(\blacktriangleright \) Second, we will check \(\left( \mathbf {Q\cdot \mathcal {L}}\right) \left( n_{1},n_{2},k\right) \le -\varepsilon \) for \(z=(n_{1},n_{2},k)\notin F\) with

$$\begin{aligned} -\varepsilon&=-\frac{\eta }{2}\cdot \max \left\{ 1, \frac{\nu }{\mu } \right\} . \end{aligned}$$
(5)

For \(k=0\), \(n_{1}\in \mathbb {N}\) and \(n_{2}\in \mathbb {N}_{0}\) it holds

$$\begin{aligned} \left( \textbf{Q}\cdot \mathcal {L}\right) (n_{1},n_{2},0)&= \nu \cdot \left( \mathcal {L}(n_{1},n_{2},1)-\mathcal {L}(n_{1},n_{2},0)\right) \\&\overset{(2)}{=}\nu \cdot \left( (n_{1}+n_{2}+\alpha (1))-(n_{1}+n_{2}+\alpha (0))\right) =\nu \cdot (\alpha (1)-\alpha (0))\\&\overset{(3)}{=}\nu \cdot [(b-1)-(b-0)]\cdot \frac{\mu -\lambda _{1}-\lambda _{2}}{2\mu } \overset{(4)}{=}-\frac{\eta }{2}\cdot \frac{\nu }{\mu } \overset{(5)}{\le }-\varepsilon .\\ \end{aligned}$$

For \(k=0\), \(n_{1}=0\) and \(n_{2}\in \mathbb {N}\) it holds

$$\begin{aligned} \left( \textbf{Q}\cdot \mathcal {L}\right) (0,n_{2},0)&= \nu \cdot \left( \mathcal {L}(0,n_{2},1)-\mathcal {L}(0,n_{2},0)\right) \\&\overset{(2)}{=}\nu \cdot \left( (n_{2}+\alpha (1))-(n_{2}+\alpha (0))\right) =\nu \cdot (\alpha (1)-\alpha (0))\\&\overset{(3)}{=}\nu \cdot [(b-1)-(b-0)]\cdot \frac{\mu -\lambda _{1}-\lambda _{2}}{2\mu } \overset{(4)}{=}-\frac{\eta }{2}\cdot \frac{\nu }{\mu } \overset{(5)}{\le }-\varepsilon . \end{aligned}$$

For \(k=1,\ldots ,s\), \(n_{1}\in \mathbb {N}\) and \(n_{2}\in \mathbb {N}_{0}\) it holds

$$\begin{aligned}&\left( \textbf{Q}\cdot \mathcal {L}\right) (n_{1},n_{2},k)\\&= \lambda _{1}\cdot \left( \mathcal {L}(n_{1}+1,n_{2},k)-\mathcal {L}(n_{1},n_{2},k)\right) +p\,\lambda _{2}\cdot \left( \mathcal {L}(n_{1},n_{2}+1,k)-\mathcal {L}(n_{1},n_{2},k)\right) \\&\quad +\mu \cdot \left( \mathcal {L}(n_{1}-1,n_{2},k-1)-\mathcal {L}(n_{1},n_{2},k)\right) +\nu \cdot \left( \mathcal {L}(n_{1},n_{2},k+1)-\mathcal {L}(n_{1},n_{2},k)\right) \\&\overset{(2)}{=}\lambda _{1}\cdot \left( n_{1}+1+n_{2}+\alpha (k)-n_{1}-n_{2}-\alpha (k)\right) \\&\quad +p\,\lambda _{2}\cdot \left( n_{1}+n_{2}+1+\alpha (k)-n_{1}-n_{2}-\alpha (k)\right) \\&\quad +\mu \cdot \left( n_{1}-1+n_{2}+\alpha (k-1)-n_{1}-n_{2}-\alpha (k)\right) \\&\quad +\nu \cdot \left( n_{1}+n_{2}+\alpha (k+1)-n_{1}-n_{2}-\alpha (k)\right) \\&\overset{\begin{array}{c} (3) \end{array}}{=}\lambda _{1}+p\,\lambda _{2}-\mu +\mu \cdot \left[ (b-(k-1))-(b-k)\right] \cdot \frac{\mu -\lambda _{1}-\lambda _{2}}{2\mu }\\&\quad +\nu \cdot \left[ (b-(k+1))-(b-k)\right] \cdot \frac{\mu -\lambda _{1}-\lambda _{2}}{2\mu }\\&= \lambda _{1}+p\,\lambda _{2}-\mu - \frac{\mu -\lambda _{1}-\lambda _{2}}{\mu } - \frac{\mu -\lambda _{1}-\lambda _{2}}{2\mu } \cdot \nu \overset{\begin{array}{c} (4) \end{array}}{\le } -\eta + \frac{\eta }{2} - \frac{\eta }{2}\cdot \frac{\nu }{\mu } \overset{(5)}{\le }-\varepsilon . \end{aligned}$$

For \(k=1,\ldots ,s\), \(n_{1}=0\) and \(n_{2}\in \mathbb {N}\) it holds

$$\begin{aligned}&\left( \textbf{Q}\cdot \mathcal {L}\right) (0,n_{2},k)\\&=\lambda _{1}\cdot \left( \mathcal {L}(1,n_{2},k)-\mathcal {L}(0,n_{2},k)\right) +p\,\lambda _{2}\cdot \left( \mathcal {L}(0,n_{2}+1,k)-\mathcal {L}(0,n_{2},k)\right) \\&\quad +\mu \cdot \left( \mathcal {L}(0,n_{2}-1,k-1)-\mathcal {L}(0,n_{2},k)\right) +\nu \cdot \left( \mathcal {L}(0,n_{2},k+1)-\mathcal {L}(0,n_{2},k)\right) \\&\overset{(2)}{=}\lambda _{1}\cdot \left( 1+n_{2}+\alpha (k)-n_{2}-\alpha (k)\right) +p\,\lambda _{2}\cdot \left( n_{2}+1+\alpha (k)-n_{2}-\alpha (k)\right) \\&\quad +\mu \cdot \left( n_{2}-1+\alpha (k-1)-n_{2}-\alpha (k)\right) +\nu \cdot \left( n_{2}+\alpha (k+1)-n_{2}-\alpha (k)\right) \\&=\lambda _{1}+p\,\lambda _{2}-\mu +\mu \cdot \left( \alpha (k-1)-\alpha (k)\right) +\nu \cdot \left( \alpha (k+1)-\alpha (k)\right) \\&\overset{\begin{array}{c} (3) \end{array}}{=}\lambda _{1}+p\,\lambda _{2}-\mu +\mu \cdot \left[ (b-(k-1))-(b-k)\right] \cdot \frac{\mu -\lambda _{1}-\lambda _{2}}{2\mu }\\&\quad +\nu \cdot \left[ (b-(k+1))-(b-k)\right] \cdot \frac{\mu -\lambda _{1}-\lambda _{2}}{2\mu }\\&\overset{\begin{array}{c} (4) \end{array}}{\le } -\eta + \frac{\eta }{2} - \frac{\eta }{2}\cdot \frac{\nu }{\mu } \overset{(5)}{\le }-\varepsilon . \end{aligned}$$

For \(k=s+1,\ldots ,b-1\), \(n_{1}\in \mathbb {N}\) and \(n_{2}\in \mathbb {N}_0\) it holds

$$\begin{aligned}&\left( \textbf{Q}\cdot \mathcal {L}\right) (n_{1},n_{2},k)\\&=\lambda _{1}\cdot \left( \mathcal {L}(n_{1}+1,n_{2},k)-\mathcal {L}(n_{1},n_{2},k)\right) +\lambda _{2}\cdot \left( \mathcal {L}(n_{1},n_{2}+1,k)-\mathcal {L}(n_{1},n_{2},k)\right) \\&\quad +\mu \cdot \left( \mathcal {L}(n_{1}-1,n_{2},k-1)-\mathcal {L}(n_{1},n_{2},k)\right) +\nu \cdot \left( \mathcal {L}(n_{1},n_{2},k+1)-\mathcal {L}(n_{1},n_{2},k)\right) \\&\overset{(2)}{=}\lambda _{1}\cdot \left( n_{1}+1+n_{2}+\alpha (k)-n_{1}-n_{2}-\alpha (k)\right) \\&\quad +\lambda _{2}\cdot \left( n_{1}+n_{2}+1+\alpha (k)-n_{1}-n_{2}-\alpha (k)\right) \\&\quad +\mu \cdot \left( n_{1}-1+n_{2}+\alpha (k-1)-n_{1}-n_{2}-\alpha (k)\right) \\&\quad +\nu \cdot \left( n_{1}+n_{2}+\alpha (k+1)-n_{1}-n_{2}-\alpha (k)\right) \\&= \lambda _{1}+\lambda _{2}-\mu +\mu \cdot \left( \alpha (k-1)-\alpha (k)\right) +\nu \cdot \left( \alpha (k+1)-\alpha (k)\right) \\&\overset{\begin{array}{c} (3) \end{array}}{=}\lambda _{1}+\lambda _{2}-\mu +\mu \cdot \left[ (b-(k-1))-(b-k)\right] \frac{\mu -\lambda _{1}-\lambda _{2}}{2\mu }\\&\quad +\nu \cdot \left[ (b-(k+1))-(b-k)\right] \frac{\mu -\lambda _{1}-\lambda _{2}}{2\mu }\\&\overset{\begin{array}{c} (4) \end{array}}{=} -\eta + \frac{\eta }{2} - \frac{\eta }{2}\cdot \frac{\nu }{\mu } \overset{(5)}{\le }-\varepsilon . \end{aligned}$$

For \(k=s+1,\ldots ,b-1\), \(n_{1}=0\) and \(n_{2}\in \mathbb {N}\)

$$\begin{aligned}&\left( \textbf{Q}\cdot \mathcal {L}\right) (0,n_{2},k)\\&=\lambda _{1}\cdot \left( \mathcal {L}(1,n_{2},k)-\mathcal {L}(0,n_{2},k)\right) +\lambda _{2}\cdot \left( \mathcal {L}(0,n_{2}+1,k)-\mathcal {L}(0,n_{2},k)\right) \\&\quad +\mu \cdot \left( \mathcal {L}(0,n_{2}-1,k-1)-\mathcal {L}(0,n_{2},k)\right) +\nu \cdot \left( \mathcal {L}(0,n_{2},k+1)-\mathcal {L}(0,n_{2},k)\right) \\&\overset{(2)}{=} \lambda _{1}\cdot \left( 1+n_{2}+\alpha (k)-n_{2}-\alpha (k)\right) +\lambda _{2}\cdot \left( n_{2}+1+\alpha (k)-n_{2}-\alpha (k)\right) \\&\quad +\mu \cdot \left( n_{2}-1+\alpha (k-1)-n_{2}-\alpha (k)\right) +\nu \cdot \left( n_{2}+\alpha (k+1)-n_{2}-\alpha (k)\right) \\&=\lambda _{1}+\lambda _{2}-\mu +\mu \cdot \left( \alpha (k-1)-\alpha (k)\right) +\nu \cdot \left( \alpha (k+1)-\alpha (k)\right) \\&\overset{\begin{array}{c} (3) \end{array} }{=}\lambda _{1}+\lambda _{2}-\mu +\mu \cdot \left[ (b-(k-1))-(b-k)\right] \frac{\mu -\lambda _{1}-\lambda _{2}}{2\mu }\\&\quad +\nu \cdot \left[ (b-(k+1))-(b-k)\right] \frac{\mu -\lambda _{1}-\lambda _{2}}{2\mu }\\&\overset{\begin{array}{c} (4) \end{array}}{=} -\eta + \frac{\eta }{2} - \frac{\eta }{2}\cdot \frac{\nu }{\mu } \overset{(5)}{\le }-\varepsilon . \end{aligned}$$

For \(k=b\), \(n_{1}\in \mathbb {N}\) and \(n_{2}\in \mathbb {N}_{0}\) it holds

$$\begin{aligned}&\left( \textbf{Q}\cdot \mathcal {L}\right) (n_{1},n_{2},b)\\&= \lambda _{1}\cdot \left( \mathcal {L}(n_{1}+1,n_{2},b)-\mathcal {L}(n_{1},n_{2},b)\right) +\lambda _{2}\cdot \left( \mathcal {L}(n_{1},n_{2}+1,b)-\mathcal {L}(n_{1},n_{2},b)\right) \\&\quad +\mu \cdot \left( \mathcal {L}(n_{1}-1,n_{2},b-1)-\mathcal {L}(n_{1},n_{2},b)\right) \\&\overset{(2)}{=}\lambda _{1}\cdot \left( n_{1}+1+n_{2}+\alpha (b)-n_{1}-n_{2}-\alpha (b)\right) \\&\quad +\lambda _{2}\cdot \left( n_{1}+n_{2}+1+\alpha (b)-n_{1}-n_{2}-\alpha (b)\right) \\&\quad +\mu \cdot \left( n_{1}-1+n_{2}+\alpha (b-1)-n_{1}-n_{2}-\alpha (b)\right) \\&\overset{\begin{array}{c} (3) \end{array}}{=} \lambda _{1}+\lambda _{2}-\mu +\mu \cdot \left( (b-(b-1))-(b-b)\right) \cdot \frac{\mu -\lambda _{1}-\lambda _{2}}{2\mu }\\&\overset{\begin{array}{c} (4) \end{array}}{=} -\eta + \frac{\eta }{2} \overset{(5)}{\le }-\varepsilon . \end{aligned}$$

For \(k=b\), \(n_{1}=0\) and \(n_{2}\in \mathbb {N}\) it holds

$$\begin{aligned}&\left( \textbf{Q}\cdot \mathcal {L}\right) (0,n_{2},b)\\&=\lambda _{1}\cdot \left( \mathcal {L}(1,n_{2},b)-\mathcal {L}(0,n_{2},b)\right) +\lambda _{2}\cdot \left( \mathcal {L}(0,n_{2}+1,b)-\mathcal {L}(0,n_{2},b)\right) \\&\quad +\mu \cdot \left( \mathcal {L}(0,n_{2}-1,b-1)-\mathcal {L}(0,n_{2},b)\right) \\&\overset{(2)}{=}\lambda _{1}\cdot \left( 1+n_{2}+\alpha (b)-n_{2}-\alpha (b)\right) +\lambda _{2}\cdot \left( n_{2}+1+\alpha (b)-n_{2}-\alpha (b)\right) \\&\quad +\mu \cdot \left( n_{2}-1+\alpha (b-1)-n_{2}-\alpha (b)\right) \\&=\lambda _{1}+\lambda _{2}-\mu +\mu \cdot \left( \alpha (b-1)-\alpha (b)\right) \\&\overset{\begin{array}{c} (3) \end{array}}{=} \lambda _{1}+\lambda _{2}-\mu +\mu \cdot \left( (b-(b-1))-(b-b)\right) \cdot \frac{\mu -\lambda _{1}-\lambda _{2}}{2\mu }\\&\overset{\begin{array}{c} (4) \end{array}}{=} -\eta + \frac{\eta }{2} \overset{(5)}{\le }-\varepsilon . \end{aligned}$$

\(\square \)

We do not expect that the result of the theorem is sharp for general \(p\in [0,1]\), i.e. the condition \(\lambda _{1}+\lambda _{2}<\mu \) is in general only sufficient for stability. We have only

Theorem 3

Consider the system from Theorem 2 for \(p=1\), i.e. low priority customers are admitted as long as the inventory is not depleted. Then Z is ergodic if and only if \(\lambda _{1}+\lambda _{2}<\mu \) holds.

The result of Theorem 3 at a first glance seems to be intuitive because no distinction is made between the arriving customers. The problem is that the influence of the interrupts of arrivals and service due to stockout is not accessible to intuition. The theorem states especially that the interrupts of arrivals and service balance over time. The proof of Theorem 3 is postponed to the next section because we need some preparations which we consider to be of independent interest.

4.2 Properties of the stationary system

In this section we assume that the queueing-inventory process Z is ergodic. So the limiting and stationary distribution \(\pi \) of Z (see Definition 1) exists but seems to be not accessible directly. Nevertheless we are able to present results on the stationary behaviour of Z. We will exploit the structural information inherent in the global balance equations of Z for \(\pi \).

The global balance equations \(\pi \cdot \mathbf {Q=0}\) of the ergodic queueing-inventory process Z are for \((n_{1},n_{2},k)\in E\) given by

$$\begin{aligned}&\pi (n_{1},n_{2},k)\cdot \left( (\lambda _{1}+p\,\lambda _{2})\cdot 1_{\left\{ 0<k\le s\right\} }+(\lambda _{1}+\lambda _{2})\cdot 1_{\left\{ k>s\right\} }\right. \nonumber \\&\quad \left. +\mu \cdot 1_{\left\{ n_{1}+n_{2}>0\right\} }\cdot 1_{\left\{ k>0\right\} }+\nu \cdot 1_{\left\{ k<b\right\} }\right) \nonumber \\&=\pi (n_{1}-1,n_{2},k)\cdot \lambda _{1}\cdot 1_{\left\{ n_{1}>0\right\} }\cdot 1_{\left\{ k>0\right\} }\nonumber \\&\quad +\pi (n_{1},n_{2}-1,k)\cdot p\,\lambda _{2}\cdot 1_{\left\{ n_{2}>0\right\} }\cdot 1_{\left\{ 0<k\le s\right\} }\nonumber \\&\quad +\pi (n_{1},n_{2}-1,k)\cdot \lambda _{2}\cdot 1_{\left\{ n_{2}>0\right\} }\cdot 1_{\left\{ k>s\right\} }\nonumber \\&\quad +\pi (n_{1}+1,n_{2},k+1)\cdot \mu \cdot 1_{\left\{ k<b\right\} }+\pi (n_{1},n_{2}+1,k+1)\cdot \mu \cdot 1_{\left\{ n_{1}=0\right\} }\cdot 1_{\left\{ k<b\right\} }\nonumber \\&\quad +\pi (n_{1},n_{2},k-1)\cdot \nu \cdot 1_{\left\{ k>0\right\} }. \end{aligned}$$
(6)

Let \((X_{1},X_{2},Y)\) be a random vector that is distributed according to the stationary distribution \(\pi \). So, Y is distributed as the stationary distribution of the inventory process in equilibrium and \(X_{1}\) resp. \(X_{2}\) are respectively distributed as the processes counting for the number of priority resp. ordinary customers in equilibrium.

The proofs of the main results in this section rely on partial balance relations inherent in the global balance equations of Z. These relations are extracted and formulated in the next lemma.

Lemma 1

For the queueing-inventory process Z it holds

$$\begin{aligned}&P(X_{1}=n_{1},Y>0)\cdot \lambda _{1} =P(X_{1}=n_{1}+1,Y>0)\cdot \mu ,\quad \; n_{1}\in \mathbb {N}_{0}, \end{aligned}$$
(7)
$$\begin{aligned}&P(X_{2}=n_{2},0<Y\le s)\cdot p\,\lambda _{2}+P(X_{2}=n_{2},Y>s)\cdot \lambda _{2}\nonumber \\&=P(X_{1}=0,X_{2}=n_{2}+1,Y>0)\cdot \mu ,\qquad \qquad \qquad \qquad \quad \quad n_{2}\in \mathbb {N}_{0},\end{aligned}$$
(8)
$$\begin{aligned}&P(X_{1}+X_{2}=n,0<Y\le s)\cdot (\lambda _{1}+p\,\lambda _{2})\nonumber \\&\quad +P(X_{1}+X_{2}=n,Y>s)\cdot (\lambda _{1}+\lambda _{2})\nonumber \\&=P(X_{1}+X_{2}=n+1,0<Y\le b)\cdot \mu ,\qquad \qquad \qquad \qquad \quad \; n\in \mathbb {N}_{0}. \end{aligned}$$
(9)

All the equations can be proven by applying the cut-criterion for positive recurrent processes (see Kelly [1979, Lemma 1.4, p. 8]). This is

Lemma 2

For ergodic Z with stationary distribution \(\pi \) it holds for any non-empty proper subset \(A\subset E\)

$$\begin{aligned} \sum _{z\in A} \sum _{\tilde{z}\in A^c}\pi (z) q(z,\tilde{z}) = \sum _{\tilde{z}\in A^c} \sum _{z\in A} \pi (\tilde{z}) q(\tilde{z},z) \end{aligned}$$

Proof of Lemma 1

For \(n_{1}\in \mathbb {N}_{0}\), equation (7) can be proven by a cut, which divides E into complementary sets according to the queue length of priority customers that is less than or equal to \(n_{1}\) or greater than \(n_{1}\). These are the sets

$$\begin{aligned} A:= \Big \{(m_{1},m_{2},k):m_{1}\in \{0,1,\ldots .n_{1}\},\,m_{2}\in \mathbb {N}_{0},\,k\in \{0,\ldots ,b\}\Big \}, \end{aligned}$$

and \(A^c\). Then for \(n_{1}\in \mathbb {N}_{0}\) it holds by direct evaluation

$$\begin{aligned}&\underbrace{\sum _{m_{1}=n_{1}}^{n_{1}}\sum _{m_{2}=0}^{\infty }\sum _{k=1}^{b}\pi (m_{1},m_{2},k)\cdot \lambda _{1}}_{=P(X_{1}=n_{1},Y>0)\cdot \lambda _{1}}=\underbrace{\sum _{\widetilde{m}_{1}=n_{1}+1}^{n_{1}+1}\sum _{\widetilde{m}_{2}=0}^{\infty }\sum _{\widetilde{k}=1}^{b}\pi (\widetilde{m}_{1},\widetilde{m}_{2},\widetilde{k})\cdot \mu }_{=P(X_{1}=n_{1}+1,Y>0)\cdot \mu }. \end{aligned}$$

Hence, for \(n_{1}\in \mathbb {N}_{0}\) it holds (7).

For \(n_{2}\in \mathbb {N}_{0}\), equation (8) can be proven by a cut, which divides E into complementary sets according to the queue length of ordinary customers that is less than or equal to \(n_{2}\) or greater than \(n_{2}\). These are the sets

$$\begin{aligned} A = \Big \{(m_{1},m_{2},k):m_{1}\in \mathbb {N}_{0},m_{2}\in \{0,1,\ldots .n_{2}\},\,k\in \{0,\ldots ,b\}\Big \}, \end{aligned}$$

and \(A^c\). Then for \(n_{2}\in \mathbb {N}_{0}\) it holds by direct evaluation

$$\begin{aligned}&\underbrace{\sum _{m_{1}=0}^{\infty }\sum _{m_{2}=n_{2}}^{n_{2}}\sum _{k=1}^{s}\pi (m_{1},m_{2},k)\cdot p\,\lambda _{2}}_{=P(X_{2}=n_{2},0<Y\le s)\cdot p\,\lambda _{2}}+\underbrace{\sum _{m_{1}=0}^{\infty }\sum _{m_{2}=n_{2}}^{n_{2}}\sum _{k=s+1}^{b}\pi (m_{1},m_{2},k)\cdot \lambda _{2}}_{=P(X_{2}=n_{2},Y>s)\cdot \lambda _{2}}\\&=\underbrace{\sum _{\widetilde{m}_{1}=0}^{0}\sum _{\widetilde{m}_{2}=n_{2}+1}^{n_{2}+1}\sum _{\widetilde{k}=1}^{b}\pi (\widetilde{m}_{1},\widetilde{m}_{2},\widetilde{k})\cdot \mu }_{=P(X_{1}=0,X_{2}=n_{2}+1,Y>0)\cdot \mu }. \end{aligned}$$

Thus, for \(n_{2}\in \mathbb {N}_{0}\) it holds (8).

For \(n\in \mathbb {N}_{0}\), equation (9) can be proven by a cut, which divides E into complementary sets according to the size of the total queue length that is less than or equal to n or greater than n. These are the sets

$$\begin{aligned} A= \Big \{(m_{1},m_{2},k):m_{1}\in \mathbb {N}_{0},\,m_{2}\in \mathbb {N}_{0},\,(m_{1}+m_{2})\in \{0,1,\ldots ,n\},\,k\in \{0,\ldots ,b\}\Big \}, \end{aligned}$$

and \(A^c\). Then for \(n\in \mathbb {N}_{0}\) holds by direct evaluation

$$\begin{aligned}&\underbrace{\sum _{m_{1}+m_{2}=n}^{n}\sum _{k=1}^{s}\pi (m_{1},m_{2},k)\cdot (\lambda _{1}+p\,\lambda _{2})}_{=P(X_{1}+X_{2}=n,0<Y\le s)\cdot (\lambda _{1}+p\,\lambda _{2})}+\underbrace{\sum _{m_{1}+m_{2}=n}^{n}\sum _{k=s+1}^{b}\pi (m_{1},m_{2},k)\cdot (\lambda _{1}+\lambda _{2})}_{=P(X_{1}+X_{2}=n,Y>s)\cdot (\lambda _{1}+\lambda _{2})}\\&=\underbrace{\sum _{\widetilde{m}_{1}+\widetilde{m}_{2}=n+1}^{n+1}\sum _{\widetilde{k}=1}^{b}\pi (\widetilde{m}_{1},\widetilde{m}_{2},\widetilde{k})\cdot \mu }_{=P(X_{1}+X_{2}=n+1,Y>0)\cdot \mu }. \end{aligned}$$

Thus, for \(n\in \mathbb {N}_{0}\) it holds (9). \(\square \)

Consider the queueing-inventory system with lead time zero for the replenishment orders. Then Y is constant b and the distribution of \((X_{1},X_{2})\) is the equilibrium of the priority system as described in Sect. 3 without inventory. A little reflection shows that the distribution of \(X_1\) is geometrical with parameter \(\lambda _{1}/\mu \), i.e. the priority customers are served as in a standard \(M/M/1/\infty \) with parameters \(\lambda _{1}\) and \(\mu \). (Indeed, this observation applies to a classical M/M/c queue with two priority classes under a preemptive priority discipline as well. The priority customers behave as customers in a classic M/M/c queue (cf. Wang et al. (2015)).)

Our first proposition embeds a similar observation into the setting of the model in Sect. 3. The problem with our queueing-inventory system is that the two customer classes have to share the same inventory. Therefore, the ordinary (= non-priority) customers impose restrictions on the priority customers’ service because they consume items from the inventory and generate more stock-outs experienced by the priority customers.

Proposition 4

For the queueing-inventory system from Sect. 3 the stationary number of priority customers in the system conditioned on positive inventory is geometrical with parameter \(\lambda _{1}/\mu \), i.e. for \(n_{1}\in \mathbb {N}_{0}\) it holds

$$\begin{aligned} P(X_{1}=n_{1}\mid Y>0)=P(X_{1}=0\mid Y>0)\cdot \left( \frac{\lambda _{1}}{\mu }\right) ^{n_{1}}, \end{aligned}$$
(10)

with normalisation constant

$$\begin{aligned} P(X_{1}=0\mid Y>0) = 1-\frac{\lambda _{1}}{\mu }. \end{aligned}$$

Proof

From (7) it follows by iteration for \(n_{1}\in \mathbb {N}_{0}\)

$$\begin{aligned} P(X_{1}=n_{1},Y>0)=P(X_{1}=0,Y>0)\cdot \left( \frac{\lambda _{1}}{\mu }\right) ^{n_{1}}. \end{aligned}$$
(11)

Conditioning and exploiting \(\lambda _{1}<\mu \) for summation ends the proof. \(\square \)

A further consequence of Lemma 1 are the following intuitive rate equations. These equalize the mean admitted arrivals per customer classes to class dependent throughputs (= effective departure rates). The proof is in any case by direct summation of the respective formulas in Lemma 1.

Proposition 5

For the queueing-inventory process it holds the following equilibrium of probability flows

$$\begin{aligned}{} & {} \underbrace{P(Y>0)\cdot \lambda _{1}}_{\begin{array}{c} \text {effective departure rate of priority customers}\\ \text {of priority customers} \end{array} }=\underbrace{P(X_{1}>0,Y>0)\cdot \mu }_{\begin{array}{c} \text {effective departure rate of ordinary customers} \\ \text {of priority customers} \end{array}} , \end{aligned}$$
(12)
$$\begin{aligned}{} & {} \underbrace{P(0<Y\le s)\cdot p\,\lambda _{2}+P(Y>s)\cdot \lambda _{2}}_{\begin{array}{c} \text {effective arrival rate}\\ \text {of ordinary customers} \end{array} }=\underbrace{P(X_{1}=0,X_{2}>0,Y>0)\cdot \mu }_{\begin{array}{c} \text {effective departure rate}\\ \text {of ordinary customers} \end{array}} , \end{aligned}$$
(13)
$$\begin{aligned}{} & {} \underbrace{P(Y>0)\cdot \lambda _{1}+P(0<Y\le s)\cdot p\,\lambda _{2}+P(Y>s)\cdot \lambda _{2}}_{\begin{array}{c} \text {effective arrival rate}\nonumber \\ \text {of customers} \end{array} },\nonumber \\{} & {} \quad =\underbrace{P(X_{1}+X_{2}>0,Y>0)\cdot \mu }_{\begin{array}{c} \text {effective departure rate}\\ \text {of customers} \end{array} } \end{aligned}$$
(14)

Similar to the flow equations in Lemma 1 it is possible to derive flow equations with respect to the inventory level. The derivation is more tedious and can be found in Otten [2018, Proposition 11.1.6]. The main result is that for \(k=0,1,\dots ,b-1\) it holds

$$\begin{aligned} P(Y=k)\cdot \nu = P(Y=k+1, X_1 + X_2 >0)\cdot \mu . \end{aligned}$$
(15)

Note that the statement of (15) exhibits an insensitivity property with respect to variation of the parameters of the system. More specifically it is independent of the threshold level s and the arrival intensities \(\lambda _{1}\) and \(\lambda _{2}\).

Remark 1

(1) The results in this section can be generalised in a direct way to the case of a system with C customer classes, where \(\overline{C}=\{1,\ldots ,C\}\) is the set of customer classes. Customers of type c arrive with rate \(\lambda _{c}>0\), a priority parameter \(p_{c}\) (\(0\le p_{c}\le 1\)) and a threshold level \(s_{c}\), \(c\in \overline{C}\). (2) In the models of Isotupa (2015), the replenishment rate depends on the number of pending orders. We can extend our model so that the replenishment lead time depends on the number of orders at the supplier. If there are \(k>0\) orders present at the supplier, the intensity of the replenishment lead time is \(\nu (k)>0\). For more details see Otten [2018, Chapter 11.1].

Proof of Theorem 3

Recall that the global balance equations for Z usually are stated with unknown positive \(x=(x(z):z\in E)\). In case of irreducibility (under Q) of E, Z is ergodic if and only if there exist a strictly positive solution x which is summable, i.e. \(\sum _{z\in E} x(z)<\infty \). The first observation is that the partial balance relations of Lemma 1 hold for any solution x, even if the solution is not summable. We shall exploit equation (9) which reads in case of \(p=1\) for a general solution

$$\begin{aligned}{} & {} \sum _{n_{1}+n_{2}=n}\left( \sum _{k=1}^b x(n_1,n_2,k)\right) \cdot (\lambda _{1}+\lambda _{2})\nonumber \\{} & {} \quad =\sum _{n_{1}+n_{2}=n+1}\left( \sum _{k=1}^b x(n_1,n_2,k)\right) \cdot \mu ,\qquad n\in \mathbb {N}_0 \,. \end{aligned}$$

It follows for \(n\in \mathbb {N}_0\)

$$\begin{aligned} \sum _{n_{1}+n_{2}=n}\left( \sum _{k=1}^b x(n_1,n_2,k)\right) =\left( \sum _{k=1}^b x(0,0,k)\right) \cdot \left( \frac{\lambda _{1}+\lambda _{2}}{\mu }\right) ^n. \end{aligned}$$

Assuming ergodicity of Z the summability condition yields

$$\begin{aligned}{} & {} \infty > \sum _{n_{1}=0}^\infty \sum _{n_{2}=0}^\infty \sum _{k=0}^b x(n_1,n_2,k)\\{} & {} \quad =\sum _{n_{1}=0}^\infty \sum _{n_{2}=0}^\infty \left( \sum _{k=1}^b x(n_1,n_2,k)\right) +\sum _{n_{1}=0}^\infty \sum _{n_{2}=0}^\infty x(n_1,n_2,0)\\{} & {} \quad =\sum _{n=0}^\infty \left\{ \sum _{n_1+n_{2}=n} \left( \sum _{k=1}^b x(n_1,n_2,k)\right) \right\} +\sum _{n_1=0}^\infty \sum _{n_{2}=0}^\infty x(n_1,n_2,0)\\{} & {} \quad =\sum _{n=0}^\infty \left\{ \left( \sum _{k=1}^b x(0,0,k)\right) \cdot \left( \frac{\lambda _{1}+\lambda _{2}}{\mu }\right) ^n \right\} +\underbrace{\sum _{n_1=0}^\infty \sum _{n_{2}=n}^\infty x(n_1,n_2,0)}_{\in (0,\infty )}. \end{aligned}$$

Because of \({\sum _{k=1}^b x(0,0,k)\in (0,\infty )}\) the last term is finite if and only if \(\lambda _{1}+\lambda _{2}<\mu \) holds. This finishes the proof. \(\square \)

5 The case of instant service

In this section we consider the case of instant service (zero production time), which turns the model into the formalism of classical inventory theory. The supply chain of interest is depicted in Fig.  and consists of priority and ordinary customers, an inventory and a supplier.

A short reflection shows that due to the lost sales assumption no customer queues will arise. Therefore, the on-hand inventory process \(Y=(Y(t):t\ge 0)\) carries all information for a Markovian description of the system, which is a slight generalisation of the pure inventory model in Isotupa (2015). Isotupa derives the stationary distribution but her model does not incorporate the priority parameter p. Chen et al. (2012), Liu et al. (2013, 2014) determine the stationary distribution for a pure inventory model with priority parameter but with a different replenishment policy.

Fig. 2
figure 2

The pure inventory system with two customer classes

The state space of Y is

$$\begin{aligned} K=\{0,\ldots ,b\}, \end{aligned}$$

where b is the maximal size of the inventory. Y is irreducible and therefore ergodic and we define the limiting and stationary distribution of Y by

$$\begin{aligned} \theta :=\left( \theta \left( k\right) :k\in K\right) ,\quad \theta \left( k\right) :=\lim _{t\rightarrow \infty }P\left( Y(t)=k\right) . \end{aligned}$$

\(\theta \) satisfies the following global balance equations in case of \(s<b\).

$$\begin{aligned}&\theta (0)\cdot \nu =\theta (1)\cdot (\lambda _{1}+p\,\lambda _{2}),\\&\theta (k)\cdot (\lambda _{1}+p\,\lambda _{2}+\nu )=\theta (k+1)\cdot (\lambda _{1}+p\,\lambda _{2})+\theta (k-1)\cdot \nu ,{} & {} k=1,\ldots ,s-1,\\&\theta (s)\cdot (\lambda _{1}+p\,\lambda _{2}+\nu )=\theta (s+1)\cdot (\lambda _{1}+\lambda _{2})+\theta (s-1)\cdot \nu ,\\&\theta (k)\cdot (\lambda _{1}+\lambda _{2}+\nu )=\theta (k+1)\cdot (\lambda _{1}+\lambda _{2})+\theta (k-1)\cdot \nu ,{} & {} k=s+1,\ldots ,b-1,\\&\theta (b)\cdot (\lambda _{1}+\lambda _{2})=\theta (b-1)\cdot \nu . \end{aligned}$$

It is direct to see that these are the balance equations for a finite birth-death process and we have immediately the following result.

Proposition 6

The inventory process \(Y=(Y(t):t\ge 0)\) has the following limiting and stationary distribution

$$\begin{aligned} \theta (0)&=\left[ \sum _{j=0}^{s}\left( \frac{\nu }{\lambda _{1}+p\,\lambda _{2}}\right) ^{j}+\left( \frac{\nu }{\lambda _{1}+p\,\lambda _{2}}\right) ^{s}\cdot \sum _{j=1}^{b-s}\left( \frac{\nu }{\lambda _{1}+\lambda _{2}}\right) ^{j}\right] ^{-1}\\&=\left[ \frac{1-\left( \frac{\nu }{\lambda _{1}+p\,\lambda _{2}}\right) ^{s+1}}{1-\left( \frac{\nu }{\lambda _{1}+p\,\lambda _{2}}\right) }+\left( \frac{\nu }{\lambda _{1}+p\,\lambda _{2}}\right) ^{s}\cdot \left( \frac{1-\left( \frac{\nu }{\lambda _{1}+\lambda _{2}}\right) ^{b-s+1}}{1-\left( \frac{\nu }{\lambda _{1}+\lambda _{2}}\right) }-1\right) \right] ^{-1}\\&=\left[ \frac{1-\left( \frac{\nu }{\lambda _{1}+p\,\lambda _{2}}\right) ^{s+1}}{1-\left( \frac{\nu }{\lambda _{1}+p\,\lambda _{2}}\right) }+\left( \frac{\nu }{\lambda _{1}+p\,\lambda _{2}}\right) ^{s}\cdot \left( \frac{\nu }{\lambda _{1}+\lambda _{2}}\right) \cdot \left( \frac{1-\left( \frac{\nu }{\lambda _{1}+\lambda _{2}}\right) ^{b-s}}{1-\left( \frac{\nu }{\lambda _{1}+\lambda _{2}}\right) }\right) \right] ^{-1},\\ \theta (k)&=\left( \frac{\nu }{\lambda _{1}+p\,\lambda _{2}}\right) ^{k}\cdot \theta (0),\qquad \qquad \qquad \qquad \ k=1,\ldots ,s,\\ \theta (k)&=\left( \frac{\nu }{\lambda _{1}+p\,\lambda _{2}}\right) ^{s}\left( \frac{\nu }{\lambda _{1}+\lambda _{2}}\right) ^{k-s}\cdot \theta (0),\qquad k=s+1,\ldots ,b. \end{aligned}$$

6 Conclusion

We investigated ergodicity for queueing-inventory systems with two priority classes in case of unbounded queues for both customer classes. The admission control to the queues is flexible and incorporates two parameters (sp) which mainly restrict entrance of low priority customers. The main result is a stability condition which is proved by construction of a suitable Lyapunov function. Although we proved that in case of parameter constellation (s, 1) the condition is sufficient and necessary we believe that in general the condition is not sharp. To clarify the situation in general is part of our ongoing research.

Another direction of our research in this field will be to investigate the system with priority classes where arriving customers which find the inventory depleted are backlogged. This means the lost sales behaviour of customers is replaced by backordering unsatisfied demand.