1 Introduction

We consider an integrated production–inventory system consisting of a finished goods inventory controlled by a base stock policy and an affiliated production network which serves as a replenishment system in a make-to-stock mode. When there is inventory, an arriving exogenous demand for a produced item from the system is satisfied immediately with exactly one item, and an order for producing a new item is instantly sent to the replenishment system. Arriving demand that finds the inventory depleted departs immediately without fulfillment and is lost. The control parameter for the inventory is the base stock level \(z\ge 1\). That means whenever an item from stock is dispatched to an arriving demand, and the inventory level is below this base stock level an order for the production of a new item is sent to the replenishment system.

We address the problem of determining a base stock level that minimizes a standard cost function. This cost function encompasses holding costs for items in the inventory, production costs for items in the replenishment system, and costs for unfulfilled demand (lost sales).

Due to the lost sales behavior of arriving demand, this integrated production–inventory network can be modeled as a closed network of queues with a fixed number of customers cycling in the system. Assuming the servers of the replenishment network to be exponential single servers and the stream of demands to be a Poisson process, Rubio and Wein (1996) show that the integrated system is a Gordon–Newell network (Gordon and Newell, 1967). Using queueing theory, the corresponding problem with backordering unsatisfied demand at integrated production–inventory systems under base stock control has already been successfully investigated for many years. However, almost only single replenishment servers have been subject of the investigations.

Until now, the case of production–inventory systems under base stock control with rejection of arriving demand due to depleted inventory has not been well understood, see Sections 4.2 and 4.3 in Rubio and Wein (1996), where only a very simple example could be solved analytically. We tackle this problem in the present article and consider a single item production–inventory system with lost sales. A sale is lost when there is no inventory available to serve an arriving demand (customer) immediately. We model our base stock system as a Gordon Newell network where a production order is released whenever demand is fulfilled. We analyze the cost function and focus on the queue length behavior of the system. Via numerical experiments, Rubio and Wein (1996) have found out that the cost function of a production–inventory system under base stock control with lost sales is in general not convex. For a production–inventory system where all replenishment stations and the finished goods inventory have the same traffic intensity (called “balanced system”), they prove that the cost function is a convex function of the base stock level. For a general replenishment network with an affiliated finished goods inventory, our main result states that if the holding costs dominate production costs the total cost function is convex in the base stock level under a few natural conditions.

Our paper is structured as follows: In Sect. 1.1, we provide an introduction into relevant literature on production–inventory systems. In Sect. 2, the detailed system is described, and the transformation into a closed network of exponential queues is given explicitly. In Sect. 3, the case of replenishment networks that consist of only a single station is analyzed. We demonstrate that the main technical problems can be solved by considering this fundamental setup. In a second step, we are able to reduce the general situation to this case. Details are presented in Sect. 4. In Sect. 5, we provide numerical evaluations for a two-station network, and Sect. 6 contains a short conclusion. Technical lemmata and lengthy proofs are collected in the “Appendix”.

Notation and conventions: \({\mathbb {N}}=\{1,2,\dots \}\), \({\mathbb {N}}_0 = {\mathbb {N}} \cup \{0\}\); increasing means nondecreasing and decreasing means nonincreasing.

1.1 Literature review

Analyzing production–inventory systems by using methods and models from queueing theory dates back to at least the 1950’s, see “Queues, Inventories and Maintenance” (Morse, 1958). A great variety of different systems have been investigated since then and the available literature is overwhelming. We sketch a selection of such work with emphasis on papers related to our analysis of an inventory under base stock control with an attached production network. Since in general the replenishment servers or the replenishment networks have limited capacity, a base stock policy is a reasonable control scheme (Federgruen and Zipkin, 1986a, b) although not necessarily optimal for systems without backordering unsatisfied demand (Hill, 1999).

Fundamentals of “Continuous time stochastic demand models” with different reorder policies for a single inventory are surveyed by Lee and Nahmias (1993)[Section 5]. In Section 5.4, base stock policies under the heading of “Order-for-order inventory policies” are described, which are special cases of (sS)-policies (with \(s=S-1\)). The case of multi-stage inventories under base stock policies are dealt with by Axsäter (1993)[Section 2]. As an introductory survey of the history of “production–inventory systems”, the “Notes” in Zipkin (2000) can be used, especially the “Notes” in Chapter 6 (p. 237) and Chapter 7 (p. 289).

Base stock policy and backordering of unsatisfied demand. Most of the queueing theory applications that analyze inventory systems tackle problems with backordering unsatisfied demand. Usually the early investigations consider either a single replenishment server or assume that reorders arrive after a random lead time. The latter can be modeled as an \(M/G/\infty \) single station replenishment facility. Kaplan (1970) derives a dynamic programming formulation of the random lead time model where the outstanding orders cannot cross (lead times are therefore strongly correlated). Karmarkar (1987) investigates the interplay of lot sizing, manufacturing lead times and in-process inventories using standard queueing models to evaluate performance of manufacturing systems. Similar investigations in the realm of Flexible Manufacturing Systems (FMS) are numerous, see e.g. the survey by Karmarkar (1993). Different aspects of optimization criteria are “cost targets” where various risk measures are considered as foundation for management decisions in production–inventory systems and as a measure of quality. Recent works in this direction are by Li et al. (2020) (criterion: probability of loss and expected loss), Li and Arreola-Risa (2021) (criterion: total cost conditional value-at-risk).

Bradley and Glynn (2002) investigate the interplay of the base stock level and the capacity of a single replenishment server using heavy-traffic analysis of queueing systems.

Analysis of simple (open tandem) networks as replenishment systems have been conducted by Lee and Zipkin (1992, 1995). For production–inventory systems with a general production network of Jacksonian type (Chen and Yao, 2001)[Chapter 2] and with backordering under base stock control, Rubio and Wein (1996) prove that setting the optimal base stock resembles decision making in the newsvendor problem. For some special cases, they have determined structural properties of the optimal policies. These results have been extended and generalized by de la Cruz and Daduna (2016, 2019).

A problem different from that investigated in these articles but closely related emerges when the production servers in the replenishment system have attached local inventories. An example with explicit stationary distribution for an open tandem supply network is provided by Song and Zipkin (2009)[Section 3] with a modified base stock policy. In general, the joint decision for optimal local inventory control is not analytically tractable. In their review, Atan et al. (2017) state “...an important result for continuous-review single end product models with multiple demand classes is the optimality of state-dependent base-stock production/replenishment policy ...”.

Production–inventory systems with unreliable replenishment server are considered by Hu and Xiang (1993) for the case of backordering.

Related models are production–inventory systems under base stock control where the time to dispatch an item to the arriving demand is positive (usually random), for surveys see Krishnamoorthy et al. (2011) and Krishnamoorthy et al. (2021).

Base stock policy and lost sales of unsatisfied demand. Compared with the case of backordering unsatisfied demand, the lost sales behavior of demand that is not immediately fulfilled has found only limited interest. The first analysis of the topic seems to be by Karush (1957) who investigates an inventory in continuous time controlled by a base stock policy with random lead times (independent, identically distributed) for reorders. This can be considered as an \(M/M/\infty \)-replenishment server. Karush proves convexity of stock-out probability in the base stock level.

Related but different models are investigated e.g. in case of periodic review of inventory level (Janakiraman and Roundy, 2004) and in discrete time (Zipkin, 2008). Janakiraman and Roundy (2004) prove for the case of stochastic lead times which do not cross (i.e. an \(M/M/\infty \)-replenishment server with strongly correlated service times) convexity of the expected discounted sum of holding and lost sales costs in the base stock level. Zipkin (2008) proves structural properties of several (“plausible”) policies and evaluates numerically some of the plausible policies. Johansen (2013) considers the counterexample of Hill (1999) and modifies the pure base stock policy in lost sales systems by introducing a lower bound on the time between placing replenishment orders.

The main reference of the system analyzed in this article is Rubio and Wein (1996) where an inventory under backordering and under lost sales for unsatisfied demand are considered. The replenishment system in both cases is a complex network of exponential service stations with random routing (processing networks in the terminology of Zipkin (2000)[Section 7.3.6]). In case of a tandem supply system under lost sales, Song and Zipkin (2009) show that for the case of a normal and an emergency supply source (node) the stationary distribution is accessible via generalized Gordon–Newell network models.

For general exponential networks with lost sales in the context of stock-out under base stock control, Rubio and Wein (1996) prove convexity of the cost function in the base stock level when the inventory and processing network is totally symmetric (“balanced” in their terminology). Thus far, more results similar to those for backordering by Rubio and Wein (1996), and de la Cruz and Daduna (2016, 2019) seem to be not available for the lost sales case.

Production–inventory systems with unreliable replenishment server are considered in Liu and Cao (1997) for the case of lost sales.

For production–inventory systems with positive service time, early results with product form steady state (similar to Rubio and Wein (1996) and the present article) are derived in Schwarz et al. (2006), Schwarz et al. (2007), and more recently in Krishnamoorthy and Viswanath (2013) and Otten et al. (2016). Toktay et al. (2000) illustrate a practical application of a closed queueing network to model a remanufacturing system with lost sales.

A comparison of different reorder policies, including base stock policy, for systems under lost sales are provided by Bijvank and Vis (2011, 2012).

2 The model

The production–inventory system in this article consists of a replenishment network of K service stations, indexed by \(\{1,2,\dots ,K\}\), and the finished goods inventory, indexed by \(K+1\). The base stock level is denoted by \(z\ge 1\). An arriving demand that finds finished goods available at the inventory takes one item, triggers an order for production of a new unit in the replenishment network and leaves the system. If the inventory is depleted, an arriving demand will depart immediately without obtaining an item (lost sales).

Fig. 1
figure 1

Structure of the production–inventory system

To model the entire production–inventory system, we use a Gordon Newell network with \(z\ge 1\) customers and \(K+1\) exponential single server stations \(\overline{J}=\{1,\dots ,K,K+1\}\). Stations \(j=1,\dots ,K\) represent the (replenishment-)production system, and the finished goods inventory is represented by station \(K+1\). Service intensities at nodes \(j=1,\dots ,K\) are allowed to be queue length dependent \(\mu _j(n)>0, n=1,\dots ,z\). The population size z equals the base stock level. Routing of customers is governed by an irreducible stochastic matrix \(R=(r(i,j):i,j\in \{1,\dots ,K+1\})\): With probability \(r(K+1,j)\), a new order for production of an unit is sent to station \(j\in \{1,\dots ,K\}\). An order served at station j is sent to station \(i\in \{1,\dots ,K\}\) with probability r(ji), and with probability \(r(j,K+1)\) it is sent as finished good to the inventory at station \(K+1\).

Along their paths through the network, the customers of the Gordon–Newell network therefore take different identities with respect to the production–inventory system. Customers at station \(K+1\) are items of the finished goods in stock. When leaving station \(K+1\) (triggered by an arrival of demand), a customer is transformed into an order which passes through the replenishment subsystem according to the routing matrix R and is processed at some subsequent stations out of \(1,\dots ,K\). When reentering station \(K+1\), the customer is converted to a new finished goods item (see Fig. 1).

The lost sales property of the external demand, which arrives in a Poisson-\(\lambda \) stream, implies that an arrival that finds no customer at \(K+1\) ( = stock-out) does not change the state of the network. An arrival that finds positive queue length at \(K+1\) (= finished goods stock \(>0\)) takes one item (= customer) from the queue, departs from the system, and triggers a replenishment order.

Hence by construction and the lost sales property, customers present at \(K+1\) leave that node with exponential-\(\lambda \) interdeparture times (= service times) as long as customers are present there. Therefore, station \(K+1\) operates with a state independent service rate \(\mu _{K+1}(n)=:\mu _{K+1}=\lambda , n\ge 1\).

Note that whenever the total population of the network resides in the finished goods station \(K+1\) there is no ongoing replenishment activity in the production sub-system \(\{1,\dots ,K\}\), and if there are fewer than z “customers” present at \(K+1\) there are ongoing production processes at some of the stations \(1,\dots ,K\).

Imposing the usual (conditional) independence assumptions on the stochastic behavior of the systems implies that the joint queue length vector for the system is a continuous time ergodic Markov process \(X=((X_j(t), j=1,...,K+1):t\ge 0)\) on state space

$$\begin{aligned} S(z,{\overline{J}}) := \{(n_1,\dots ,n_{K+1})\in {\mathbb {N}}_0^{K+1}: n_1+\dots +n_{K+1}=z \}. \end{aligned}$$
(1)

Here, \(X_{K+1}(t)\) is the size of the on-hand inventory, while \(X_{j}(t)\) is the number of replenishment orders waiting or in service at station j at time \(t\ge 0\), \(j\in \{1,\dots ,K\}\). The dynamics of the system is governed by the transition intensity matrix \(Q=(q(x,y):x,y\in S(z,{\overline{J}}))\) as follows. For a generic state \((n_1,\dots n_i,\dots n_j,\dots n_{K+1})\in S(z,{\overline{J}})\), we have:

$$\begin{aligned}{} & {} q((n_1,\dots n_i,\dots n_j,\dots n_{K+1}), (n_1,\dots n_i-1,\dots n_j+1,\dots n_{K+1}) \\{} & {} \quad = \mu _i(n_i)\cdot r(i,j),\quad n_i>0,\\{} & {} q((n_1,\dots n_j,\dots n_{K+1}), (n_1,\dots n_j+1,\dots n_{K+1}-1)\\{} & {} \quad = \lambda \cdot r(K+1,j),\quad n_{K+1}>0, \\{} & {} q((n_1,\dots n_j,\dots n_{K+1}), (n_1,\dots n_j-1,\dots n_{K+1}+1) \\{} & {} \quad = \mu _j(n_j) \cdot r(j,K+1),\quad n_{j}>0 . \end{aligned}$$

The stationary distribution \(\pi _z\) of X exists and is the (unique) probability solution of the global balance equation \(\pi \cdot Q=0\) with

$$\begin{aligned}{} & {} \pi _z(n_1,\dots ,n_{K+1}) \nonumber \\= & {} \prod _{j=1}^{K}\left( \prod _{k_j=1}^{n_j}\frac{\eta _j}{\mu _j(k_j)} \right) \left( \frac{\eta _{K+1}}{\mu _{K+1}}\right) ^{n_{K+1}} G(z,{\overline{J}})^{-1}, (n_1,\dots ,n_{K+1})\in S(z,{\overline{J}}), \end{aligned}$$
(2)

where \(G(z,{\overline{J}})\) is the normalization constant and \(\eta =(\eta _j:j=1,\dots ,K+1)\) is the probability solution of the traffic equation \(\eta = \eta \cdot R\).

We denote henceforth by \((X_j, j=1,...,K+1)\) a random vector distributed according to the stationary distribution (2) of X.

The stationary throughput of the network with z customers, i.e. the mean total number of departures per time unit over all nodes \(1,\dots ,K+1\) is \(TH(z)=G(z-1,{\overline{J}})/G(z,{\overline{J}})\), see Chen and Yao (2001)[(2.14) in Section 2.5]. Therefore, the mean total number of departures per time unit from node j (throughput at j) is \(TH_j(z)=\eta _j\cdot TH(z)\) .

The cost structure of our problem is as follows: Work-in-process (WIP) cost per item is c (per time unit), finished goods inventory cost per item is h (per time unit), and cost \(\ell \) is incurred for every lost sale. It follows from the definition of the throughput that the steady state mean number of lost sales per time unit under base stock level z is

$$\begin{aligned} \lambda - TH_{K+1}(z). \end{aligned}$$
(3)

The global steady state cost function is (with \(E_z(\cdot )\) the expectation under base stock level z)

$$\begin{aligned} C(z)=hE_z[X_{K+1}]+c\sum _{j=1}^KE_z[X_{j}]+ \ell \left( \lambda -TH_{K+1}(z)\right) . \end{aligned}$$
(4)

For the companion problem with backordering in Rubio and Wein (1996), it is proved that the optimal base stock \(z^*\) can be computed explicitly and the solution is of the form of the news-vendor problem’s solution. A similar procedure seems not possible for the lost sales system considered here. Rubio and Wein notice that the cost function is convex if \(h=c\). For the “balanced” case in which all \(K+1\) single server stations have state independent service rates \(\mu _j(n_j)=\mu _j, n_j=1,\dots ,z\) and the same utilization \(\rho _j=\eta _j/\mu _j= \rho \) for \(j=1,\dots ,K+1\), they deliver a closed form expression for the cost-optimal z (Rubio and Wein, 1996)[Formula (14)]. Moreover, they state that for \(h\ge c\) the total cost function is in general not convex, see Rubio and Wein (1996)[Section 4.2]. We therefore will disclose structural properties of the cost function (4) and prove that under some natural conditions it is indeed convex if \(h\ge c\).

3 Single station replenishment network, \(K=1\)

We start our analysis with the simplest case: an inventory with a replenishment network that consists of only one station. A similar setting has already been dealt with by Karush (1957), but with a somewhat different replenishment station. As shown in Rubio and Wein (1996)[4.2], this is a two-station Gordon–Newell network. We consider a slightly more general network which will enable us to approach the case of a general processing network in the next section.

In this section, we study the queue length behavior of a Gordon–Newell network with two nodes \(\overline{J}:=\left\{ 1,2\right\} \) where the service rate of node 1, which represents the processing network, has queue length dependent service rates.

There are in total \(z>0\) customers cycling in the network. Node \(j=1\) has state dependent service intensities \(\mu _1(n)\), \(n\le z,\) which are increasing and concave in n. Node \(j=2\) has exponential service times with rate \(\mu _2\). (Recall that in the production–inventory system represented by this Gordon–Newell network holds \(\mu _2:=\lambda \).)

The routing matrix \(R=(r(i,j):i,j\in \{1,2\})\) is determined by \(r(2,1)=1\) and \(r(1,2)=1-p\) for \(p\in [0,1)\). This indicates that an order is sent immediately to the replenishment server and that production of an item needs rework with probability \(r(1,1)=p\).

The cost function for this system is:

$$\begin{aligned} C(z)&=cE_z[X_{1}]+hE_z[X_{2}]+\ell \left( \lambda -\eta _2\cdot TH(z)\right) \nonumber \\&=hz+(c-h)E_z[X_{1}]+\ell \left( \lambda -\eta _2\cdot TH(z)\right) \end{aligned}$$
(5)

The following theorem presents the main result of Section 3:

Theorem 1

In the production–inventory system with a single replenishment station, the cost function (5) is convex in the base stock level if the following holds:

(i) The service rate function \(\mu _1(\cdot )\) is increasing and concave on {1,2,...}.

(ii) It holds

$$\begin{aligned} \frac{\eta _1}{\mu _1(1)}\le \frac{\eta _2}{\mu _2}. \end{aligned}$$
(6)

(iii) The costs rates fulfill \(h\ge c\).

Proof

For a Gordon–Newell network with local service rates that are increasing and concave in the local queue lengths, the throughput TH(z) is increasing and concave in the population size z (Shanthikumar and Yao, 1988). Hence, it suffices to show that \(E_z[X_{1}]\) is concave in z which is the result of Proposition 1 below. \(\square \)

Some comments on Theorem 1 and the natural conditions imposed on the production–inventory system are in order here. (a) The natural conditions (i) and (ii) constitute a setting which guarantees that the cost function is well-behaved. The interpretation of these conditions is as follows: The result in Shanthikumar and Yao (1988) indicates that (i) is a minimal condition. It means that higher load enforces the production controller to provide more production capacity until the capacity limit is reached. The meaning of condition (ii) is best understood in case of no rework in the production system (no feedback in queueing network terms), because then \(\eta _1=\eta _2=1/2\) holds. In this case, condition (ii) is simply \(\mu _2\le \mu _1(1)\) and states that the service capacity of the production system is adapted to the arrival stream such that on average there is enough capacity to satisfy the arriving demand of intensity \(\lambda =\mu _2\). The lost sales are then due to second order effects: the variability of inter arrival times and service times. In terms of closed network theory: for state-independent service rates at node 1, condition (6) indicates that node 2 is the bottleneck of the system. This interpretation also applies to the general case when using an intuitive but more complex definition of a bottleneck.

(b) Holding costs include all costs involved in storing unused inventory, such as costs for warehouse space and also insurance and opportunity costs. Condition (iii) states that holding costs dominate production costs. This property is discussed in Rubio and Wein (1996)[Section 4.2, p. 266]. Under this condition, they conjecture that for production networks with state-independent service rates \(\mu _j\) the cost function is quasi-convex, but not convex in general for \(h\ge c\). In case of \(h=c\), they propose that C(z) is convex.

The result of the next proposition is purely queueing-theoretical because it relies only on the conditions (i) and (ii). Moreover, it seems to be of independent interest because it provides second order properties for the mean queue lengths in a two-station network.

Proposition 1

Consider the two-node Gordon–Newell network with \(z\ge 1\) customers and \(\eta =(\eta _1,\eta _2)\) as the solution of the traffic equation. Assume that the service rate at node 2 is constant \(\mu _2\) and that the service rate \(\mu _1(n), n \in {\mathbb {N}},\) at node 1 is increasing in n. Let \((X_1,X_2)\) be distributed according to the stationary distribution (7).

If \(\eta _1/\mu _1(1) \le \eta _2/\mu _2\) holds, then the expected queue length \(E_z[X_1]\) at station 1 is a concave function of the population size z.

The proof is postponed to “Appendix 1”. Here, we add a remark indicating relations to a well-known fact from queueing theory which has been already used in the literature of production–inventory systems (Karush, 1957). To prove Proposition 1, it will be convenient to rewrite the stationary distribution (2) of the two-station production–inventory system as follows. The stationary distribution \(\pi _z\) of the joint queue lengths vector \((X_1,X_2)\) can be rewritten for \(n_1=n, n_2=z-n,~0 \le n \le z,\) as

$$\begin{aligned} \pi _z(n,z-n)=&\left( \prod ^{n}_{k=1}\frac{\eta _1}{\mu _1(k)}\right) \left( \frac{\eta _2}{\mu _2}\right) ^{z-n} \left( \sum ^z_{l=0} \left( \prod ^{l}_{k=1}\frac{\eta _1}{\mu _1(k)}\right) \left( \frac{\eta _2}{\mu _2}\right) ^{z-l} \right) ^{-1}\nonumber \\ =&\left( \prod ^{n}_{k=1}\frac{\mu _2\cdot (\eta _1/\eta _2)}{\mu _1(k)}\right) {\bar{G}}(z,\left\{ 1,2\right\} )^{-1}, \end{aligned}$$
(7)

with \({\bar{G}}(z,\left\{ 1,2\right\} )=\sum ^z_{m=0}\left( \prod ^{m}_{k=1}\frac{\mu _2\cdot (\eta _1/\eta _2)}{\mu _1(k)}\right) \). Formula (7) indicates that the stationary distribution of the two-station Gordon–Newell network equals the stationary distribution of an exponential single server queue with finite waiting room z, state-dependent service rates \(\mu _1(\cdot )\) and arrival intensity \(\mu _2\cdot (\eta _1/\eta _2)\) which satifies the natural condition \(\mu _2\cdot (\eta _1/\eta _2)<\mu _1(1)\). This has been already utilized by Karush (1957) when applying Palm’s formula for evaluating the performance of the production–inventory system.

Considering the case of state-independent service rates \(\mu _1(\cdot ) =\mu _1\) at node 1, (6) reads

$$\begin{aligned} \frac{\eta _1}{\mu _1} \le \frac{\eta _2}{\mu _2}, \end{aligned}$$
(8)

which means that station 2 is the bottleneck of the system.

As a result, Proposition 1 says that the expected queue length at the non-bottleneck station 1 is a concave function of the population size in the network, which is the base stock level in our motivating system. On the other hand, we see that at the bottleneck node 2 the expected queue length \(E_z[X_2] =z - E_z[X_1]\) is convex in z. Consequently, if the equality in (8) holds the expected queue length at both stations will be the same and linear in z. This is the case of a “balanced” production–inventory system where convexity of the cost function can be derived analytically by differentiating the cost function. Indeed, this generalizes to the multi-station network (Rubio and Wein, 1996) [Section 4.2].

Example 1

We consider the production–inventory system where an reorder has to undergo only a random lead time which is exponentially distributed with mean \(\nu ^{-1}\) and all lead times are assumed to be independent. This is the first model investigated by Karush (1957) and it can be easily seen that the replenishment network can be modeled as an infinite server queue, i.e. \(\mu _1(n)=n\cdot \nu , n=1,2,\dots ,z\). Karush proves that the stock-out probability \(\pi _z(z,0)\) is convex in z. This implies that the mean number of lost sales is convex in z. Note that \(\mu _1(\cdot )\) is increasing and concave. Thus, Theorem 1 applies to this system as well.

In a second approach, he proves that this property is maintained even under the assumption that the replenishment lead time is a mixture of generalized Erlang distributions, which he realizes as a network of a finite number of different branches of multistage tandem networks, each stage containing an infinite server. As Karush indicates, the class of mixtures of generalized Erlangian distributions is a very versatile class of distributions on \([0,\infty )\). Indeed, this class is dense in the class of all distributions on \([0,\infty )\) (Schassberger, 1973)[Section I. 6].

The important observation is that the stock-out probability does not reflect the complexity of the lead time distribution because it only depends on the mean lead time \(\nu ^{-1}\). This result dates back to Palm (1938) and is in fact a property of M/G/z/z queues, i.e. the Erlang loss system. Such properties are now labeled as “insensitivity properties” of queueing systems, see e.g. König et al. (1974) and Taylor (2011). For more details on inventory systems see Zipkin (2000)[Chapter 7].

4 Multi-station replenishment network, \(K\ge 1\)

In this section, we consider the general production–inventory problem as described in Sect. 2 where the system is modeled as a Gordon–Newell network with node set

\(\overline{J}:=\left\{ 1,...,K+1\right\} \) and cost function (4). This problem can be reduced to the one investigated in Sect. 3 by exploiting the fact \(E_z[X_{K+1}]=z-\sum _{j=1}^{K} E_z[X_j]\) such that equation (4) reduces to

$$\begin{aligned} C(z)= hz+(c-h) E_z[\sum _{j=1}^{K}X_j] +\ell \left( \lambda -\eta _{K+1}\cdot TH(z)\right) . \end{aligned}$$
(9)

(Recall that in the associated production–inventory system it holds \(\mu _{K+1}:=\lambda \).)

The main idea is to apply Norton’s Theorem for Gordon–Newell networks (Chandy et al., 1975) which says:

For the \(K+1\)-node Gordon–Newell network of Sect. 2 with z customers, we can construct a reduced two-node Gordon–Newell network consisting of

\(\bullet \) a node \(\widetilde{K+1}\) which has the same service rate as the node \(K+1\) in the original network (in our setting with a constant service rate \(\mu _{\widetilde{K+1}}:=\mu _{K+1}=\lambda \)), and

\(\bullet \) a single “composite queue” \({\widetilde{1}}\) substituting the nodes 1, ..., K from the original network with queue length dependent service rates \(\phi (n), n=0,1,2,\dots \) (to be determined below),

such that the steady state marginal distributions of the nodes \(K+1\) and \(\widetilde{K+1}\) in the two networks are identical although the rest of the network is very different.

This transforms the cost function (9) into

$$\begin{aligned} C(z)&= hz+(c-h) \underbrace{(z - E_z[X_{K+1}])}_{\text {original network}} +\ell \left( \lambda -\eta _{K+1}\cdot TH(z)\right) \nonumber \\&= hz+(c-h) \underbrace{(z - E_z[X_{\widetilde{K+1}}])}_{\text {using Norton's Theorem}} +\ell \left( \lambda -\eta _{K+1}\cdot TH(z)\right) \nonumber \\&= hz+(c-h) \underbrace{ E_z[X_{{\widetilde{1}}}]}_{\text { composite queue }{\widetilde{1}}} +\ell \left( \lambda -\eta _{K+1}\cdot TH(z)\right) , \end{aligned}$$
(10)

which is the analog of the cost function (5).

With that, we are prepared to state and prove our main theorem:

Theorem 2

Consider the production–inventory network with base stock level \(z\ge 1\) and with Poisson-\(\lambda \) demand process under lost sales from Sect. 2 and the associated Gordon–Newell network with exponential single server nodes \(\overline{J}=\{1,\dots ,K,K+1\}\) and \((\eta _1,\dots ,\eta _{K+1})\) as probability solution of the traffic equation. Node \(K+1\) has constant service rate \(\mu _{K+1}=\lambda \) while the other nodes may have state dependent service rates \(\mu _j(\cdot )\).

The cost function (10) is convex if the following holds:

\({\widetilde{{i}}}\) ) The service rate functions \(\mu _j(\cdot ), j=1,\dots ,K\) are increasing and concave.

\({\widetilde{{ii}}}\) ) It holds

$$\begin{aligned} \sum _{j=1}^{K}\frac{\eta _j}{\mu _j(1)}\le \frac{\eta _{K+1}}{\mu _{K+1}}. \end{aligned}$$
(11)

\({\widetilde{{iii}}}\) ) The costs fulfill \(h\ge c\).

Proof

The first observation is: from ( \({\widetilde{{i}}}\) ) it follows that the throughput TH(z) of the original network in (10) is concave (Shanthikumar and Yao, 1988). Consequently, for \(h=c\) the proof is finished. We therefore consider henceforth the case \(h>c\).

We will apply Norton’s Theorem (Chandy et al., 1975) to the associated Gordon–Newell network from Sect. 2 and obtain a two-node network with node set \({\widetilde{J}}=\{{\widetilde{1}},\widetilde{K+1}\}\) as described above. The service rate at \(\widetilde{K+1}\) is \(\mu _{\widetilde{K+1}}=\mu _{K+1}=\lambda \) and the routing matrix \({\widetilde{R}}\) is selected such that the solution of the traffic equation \({\widetilde{\eta }}\cdot {\widetilde{R}}={\widetilde{\eta }}\) has the probability solution \({\widetilde{\eta }}= ({\widetilde{\eta }}_{{\widetilde{1}}}, {\widetilde{\eta }}_{\widetilde{K+1}})=(1-\eta _{K+1},\eta _{K+1})\). (This is always possible.)

The service rates \(\phi (\cdot )\) at node \({\widetilde{1}}\) are queue length dependent with \(\phi (0)=0\) and

$$\begin{aligned} \phi (h)=\frac{\sum _{n_1+\dots +n_K=h-1}\prod _{j=1}^{K}\left( \prod _{k_j=1}^{n_j} \frac{\eta _j/(1-\eta _{K+1})}{\mu _j(k_j)}\right) }{\sum _{n_1+\dots +n_K=h}\prod _{j=1} ^{K}\left( \prod _{k_j=1}^{n_j}\frac{\eta _j/(1-\eta _{K+1})}{\mu _j(k_j)}\right) },\quad h>0\,. \end{aligned}$$
(12)

According to Norton’s Theorem, the stationary distribution \({\widetilde{\pi }}_z\) of the substitute network is with normalization constant \({\widetilde{G}}(z,\{{\widetilde{1}},\widetilde{K+1}\})\)

$$\begin{aligned} {\widetilde{\pi }}_z(n,z-n)= \left( \frac{\eta _{K+1}}{\mu _{K+1}} \right) ^n \left( \prod _{h=1}^{z-n}\frac{1-\eta _{K+1}}{\phi (h)}\right) \cdot {\widetilde{G}}(z,\{{\widetilde{1}},\widetilde{K+1}\})^{-1},~~n=0,1,\dots ,z. \end{aligned}$$
(13)

With this reduction, we are now in the framework of Theorem 1. To apply this theorem, we need to show that the following holds:

(a) The marginal steady state queue lengths at node \(K+1\) of the original system and node \(\widetilde{K+1}\) in the Norton’s Theorem substitute are the same.

(b) \( (1-\eta _{K+1})/\mu _{{\widetilde{1}}}(1)\le \eta _{K+1}/\mu _{\widetilde{K+1}}. \) (Analog to (ii) of Theorem 1.)

(c) The service rate function \(\phi (\cdot )\) is increasing and concave. (Analog to (i) of Theorem 1.)

If we verify (a),(b), (c), then from Proposition 1 it follows that \(E_z[X_{{\widetilde{1}}}] = E_z[\sum _{j=1}^{K}X_j]\) is concave, and we conclude that C(z) in (10) and thus in (9) is convex.

Verification of (a) The marginal steady state queue length distribution \({\widetilde{\pi }}_z^{\widetilde{K+1}}\) at node \(\widetilde{K+1}\) in the Norton’s Theorem substitute is \({\widetilde{\pi }}_z^{\widetilde{K+1}}(n) ={\widetilde{\pi }}_z(n,z-n)\) for \(n\ge 0\). By using the definition of \(\phi (\cdot )\) and simplifying the expression, we obtain from (13)

$$\begin{aligned}{} & {} {\widetilde{\pi }}_z^{\widetilde{K+1}} (n)\\{} & {} \quad =\left( \prod _{\ell =1}^{n}\frac{\eta _{K+1}}{\mu _{K+1}(\ell )} \right) \left( 1-\eta _{K+1}\right) ^{z-n}\\{} & {} \qquad {\sum _{n_1+\dots +n_K=z-n}\prod _{j=1}^{K}\left( \prod _{k_j=1} ^{n_j}\frac{\eta _j/(1-\eta _{K+1})}{\mu _j(k_j)}\right) } {\widetilde{G}}(z,\{{\widetilde{1}},\widetilde{K+1}\})^{-1}\\{} & {} =\underbrace{\left( \prod _{\ell =1}^{n}\frac{\eta _{K+1}}{\mu _{K+1}(\ell )} \right) {\sum _{n_1+\dots +n_K=z-n}\prod _{j=1}^{K}\left( \prod _{k_j=1}^{n_j}\frac{\eta _j}{\mu _j(k_j)}\right) }}_{(\star )} {\widetilde{G}}(z,\{{\widetilde{1}},\widetilde{K+1}\})^{-1}. \end{aligned}$$

Because \((\star )\) is the non-normalized marginal stationary distribution of node \(K+1\) in the original network, we conclude

$$\begin{aligned} {\widetilde{G}}(z,\{{\widetilde{1}},\widetilde{K+1}\})= G(z,\{1,\dots ,K,K+1\}), \end{aligned}$$

where \(G(z,\{1,\dots ,K,K+1\})\) is the normalization constant in (2). Thus, we have proved that the marginal distribution of the nodes \(K+1\) and \(\widetilde{K+1}\) are the same: \({\widetilde{\pi }}_z^{\widetilde{K+1}}=\pi _z^{K+1}\).

Verification of (b) By definition, we have

$$\begin{aligned} \mu _{{\widetilde{1}}}(1)=\phi (1)= \frac{1}{\sum _{j=1}^{K}\frac{\eta _j/(1-\eta _{K+1})}{\mu _j(1)}} =\frac{1-\eta _{K+1}}{\sum _{j=1}^{K}\frac{\eta _j}{\mu _j(1)}}. \end{aligned}$$

So,

$$\begin{aligned} \frac{1-\eta _{K+1}}{\phi (1)}=\sum _{j=1}^{K}\frac{\eta _j}{\mu _j(1)} {\mathop {\le }\limits ^{(\,{{\widetilde{ii}}}\,)}} \frac{\eta _{K+1}}{\mu _{K+1}} \end{aligned}$$

completes this part of the verification.

Verification of (c) According to Norton’s Theorem, the service rate function at the composite queue \({\widetilde{1}}\) is \(\phi (0)=0\) and for \(h>0\)

$$\begin{aligned} \phi (h)=\frac{\sum _{n_1+\dots +n_K=h-1}\prod _{j=1}^{K}\left( \prod _{k_j=1}^{n_j} \frac{\eta _j/(1-\eta _{K+1})}{\mu _j(k_j)}\right) }{\sum _{n_1+\dots +n_K=h}\prod _{j=1}^{K} \left( \prod _{k_j=1}^{n_j}\frac{\eta _j/(1-\eta _{K+1})}{\mu _j(k_j)}\right) }\,. \end{aligned}$$

A short reflection reveals that this is the throughput in a Gordon–Newell network with node set {1,...,K}, a routing (which can be selected irreducible) with stationary distribution \((\eta _j/\sum _{i=1}^{K}\eta _i: j=1,\dots ,K)\) of the routing process (solution of the relevant traffic equation), and queue length dependent service rates \(\mu _j(\cdot )\).

Because these service rate functions are by ( \({\widetilde{{ii}}}\) ) increasing and concave, \(\phi (\cdot )\) is increasing and concave (Shanthikumar and Yao, 1988). This verifies (c). \(\square \)

Example 2

In Example 1, we have mentioned that for the case of processing networks consisting of a single \(\cdot /M/\infty \) system which represents independent identically distributed lead times the result of Palm (1938) states that the stock-out probability is invariant under changing the shape of the lead time distribution as long as the mean \(\nu ^{-1}\) remains fixed. Having the K-station replenishment now at our disposal, we can reproduce the invariance result for the stock-out probability immediately for networks of infinite server queues and suitably defined Markovian random routing of the reorders before the lead time expires.

5 Numerical experiments

We have performed a series of numerical experiments, mainly to investigate the structure of cost functions of production–inventory systems which are not covered by our theoretical results. For our computations, we only consider queueing networks without feedback, i.e. \(\eta _1=\eta _2 =1/2\). We also set the lost sales costs \(\ell =30\) to clearly illustrate the convexity of the expected total costs function in the figures.

Fig. 2
figure 2

Expected total costs: \(\mu _1(n)=\ln (n)+\mu _2+0.1, \mu _2=5, \eta _1=\eta _2=0.5,c=1<h=2, \ell =30\)

Fig. 3
figure 3

Expected queue lengths (parameters as in Fig. 2)

First, we consider a closed Gordon Newell network with \({\bar{J}}={1,2}\) we have dealt with in Theorem 1. Taking parameters from Rubio and Wein, we set the WIP cost per item \(c=1\), and the finished goods inventory cost per item \(h=2\) (Rubio and Wein, 1996)[Section 3, p. 265]. The inventory (node 2) is the bottleneck of the system and generates higher costs than production at station \(j=1\) does (holding costs dominate production costs). As anticipated, the expected total costs are convex for state dependent increasing and concave service rates at station \(j=1\) with \(\frac{\eta _1}{\mu _1(1)} \le \frac{\eta _2}{\mu _2}\) and \(h>c\) (see Fig. 2 for a typical behavior of the cost function). Although the lost sales cost is rather high, the optimal base stock level is astonishingly low. This is explicable by the fact that node 2 (the inventory) is the bottleneck of the system: On average, there is more replenishment capacity available than demand arrives. In Fig. 3, we can observe the concavity of the expected queue length at station \(j=1\) and the convexity of the expected queue length at station \(j=2\), proved in Proposition 1.

Fig. 4
figure 4

Expected total costs: \(\mu _1(n)=\ln (n)+\mu _2+0.1, \mu _2=5, \eta _1=\eta _2=0.5,h=1, \ell =30\)

Fig. 5
figure 5

Expected total costs: \(\mu _1(n)=\ln (n)+\mu _2+0.1, \mu _2=5, \eta _1=\eta _2=0.5,h=1, \ell =30\)

Secondly, we investigate the same system in more detail when the production costs (at station 1) dominate the holding costs at the bottleneck (the inventory). Surprisingly, in these experiments the expected total costs are convex as long as the cost ratio \(\frac{c}{h}\) is moderately higher than 1 (see Fig. 4 with \(\frac{c}{h}\le 6\)). In these cases, the optimal base stock level z is decreasing in c. But for large cost ratios of \(\frac{c}{h}\), the expected total cost function becomes increasing and concave in z (see Fig. 5 with \(\frac{c}{h}\ge 20\)). In this situation, the optimal base stock level is \(z=1\) in all cases we have investigated.

Fig. 6
figure 6

Expected total costs: \(\mu _1=3<\mu _2=5, \eta _1=\eta _2=0.5, c=2>h=1, \ell =30\)

Fig. 7
figure 7

Expected queue lengths (parameters as in Fig. 6)

For completeness, we consider an example where station 1 with state-independent reproduction rate \(\mu _1<\mu _2\) is the bottleneck and the production costs dominate the holding costs (\(\frac{c}{h} > 1\)). As it can be seen in Fig. 6, the expected total costs are convex for this case. Furthermore, the expected queue length at station \(j=1\) is convex and the expected queue length at station \(j=2\) is concave (see Fig. 7). Note that this result is covered by Theorem 1 as well (by renumbering the stations).

6 Conclusion

For a production–inventory system with a complex processing network as replenishment system, we have proved convexity of a standard cost function. Although the system could be transformed into a standard closed Gordon–Newell network, which is in general easy to understand with respect to steady state distribution and expected queue lengths (first order properties), we have demonstrated that proving the second order properties of interest is quite challenging. Our numerical results clearly indicate that further research in the field of replenishment systems with state dependent service rates is needed. Our ongoing research encompasses the case of cost functions where production costs dominate holding costs and especially proving similar properties for production–inventory systems with positive dispatching times at the inventory. The case of multi-stage inventories under base-stock policy at every stage is another direction of our research.