1 Introduction

In this paper we consider a single-server queue with deterministic service time, which is assumed of unitary length for the sake of simplicity. The ith customer arrives to the system at time

$$\begin{aligned} t_i=i+\xi _i\,, \qquad i\in \mathbb {N}\,, \end{aligned}$$
(1)

where \(\{\xi _i\}\) are i.i.d. exponential random variables with parameter \(\beta \).

In the limit \(\beta \rightarrow 0\) the point process (1) weakly converges to a Poisson process of parameter 1, whereas for fixed \(\beta \) the arrivals are negatively autocorrelated, see Collings (1976), Guadagni et al. (2011) and references therein.

We study the system described above for fixed \(\beta \) and we assume that arriving customers might balk with independent probability \(1-\rho \). In other words, each customer can be deleted independently with probability \(1-\rho \) before joining the queue. Besides being a mathematical expedient that ensures the existence of a stationary state,Footnote 1 balking is a way to model empty intervals in a constant stream of customers. Again, the point process (1) with balking weakly converges to a Poisson process, but with parameter \(\rho \) (Guadagni et al. 2011). In what follows, we refer to the balked version of (1) as Exponentially Delayed Arrivals (EDA).

Service can be delivered by the unique server only at discrete times. The length of the queue at time t is \(n_t\); it represents the number of customers waiting to be served, including the customer that will be served precisely at time t, if any. Due to the balking, it is immediate to see that the traffic intensity of the system is \(\rho \). Using Kendall’s notation, we hereafter refer to the queue model described so far as EDA / D / 1.

The EDA / D / 1 model is motivated by the description of public and private transportation systems, including buses, trains, aircraft (Ball et al. 2001; Guadagni et al. 2011; Gwiggner and Nagaoka 2010; Iovanella et al. 2011) and vessels (Govier and Lewis 1963; Jagerman and Altiok 2003), appointment scheduling in outpatient services ( a; Cayirli and Veral 2003; Mercer 1960, 1973) crane handling in dock operations (Daganzo 1990; Edmond 1975), and in general any system where scheduled arrivals are intrinsically subject to random variations. Preliminary results show that the model described above fits very well with actual data of inbound air traffic over a large hub (Caccavale et al. 2014; Lancia and Lulli 2017).

The appearance of the stochastic point process (1) can be traced back to Winsten’s seminal paper (Winsten 1959). Winsten named such a queueing model late process and obtained results for the special case \(\xi _i\in [0, 2]\) and service time exponentially distributed. At the end of Winsten’s paper there is a discussion by Lindley, Wishart, Foster, and Takács, stating that Winsten’s work can be considered as the first treatment of a queueing model with correlated arrivals (Winsten 1959, pp. 22–28).

The same problem was investigated also by Kendall (1964, page 11). Kendall remarked the great importance of systems with arrivals like (1): “[...] perhaps too much attention has been paid to rather uninteresting variations on the fundamental Poisson stream. As soon as one considers variations dictated by the exigencies of the real world, rather than by the pursuit of mathematical elegance, severe difficulties are encountered; this is particularly well illustrated by the notoriously difficult problem of late arrivals.” Kendall also provided the following elegant interpretation: if the random variables \(\xi _i\) are non-negative then the process defined by (1) is the output of the stationary \(D/G/\infty \) queueing system (Kendall 1964). In particular, if the random variables \(\xi _i\) are exponentially distributed then EDA / D / 1 can be viewed as a 2-stage tandem queueing network

$$\begin{aligned} D/M/\infty \rightarrow \cdot /D/1 \,. \end{aligned}$$

However, this is not the approach followed in this work.

Some years later, under the hypothesis that \(\xi _i>0\), Nelsen and Williams (1970) exactly characterised the distribution of the inter-arrival time intervals and their correlations. They also gave an explicit expression of these quantities in the particular case of \(\xi _i\)’s exponentially distributed.

After the ’70s only approximations of the arrival process (1) (Bloomfield and Cox 1972; Sabria and Daganzo 1989) or numerical studies of its output (Almaz and Altiok 2012; Ball et al. 2001; Nikoleris and Hansen 2012) seem to have appeared in the literature. In particular, Guadagni et al. (2011) presented a self-contained study of an arrival process like (1), assuming for \(\xi _i\) a compact-support distribution. They also proposed an approximation scheme that keeps the correlation of the arrivals and is able to compute in a quite accurate way the quantitative features of the queue. To the best of our knowledge, a queueing system with arrivals described by (1) still remains an open problem and the best results obtained so far are due to Winsten (1959).

EDA / D / 1 is an example of a queueing system with correlated arrivals, a subject broadly studied in past years. There are many ways to impose a correlation to the arrival process. For instance, the parameters of the process may depend on their past realisation (Drezner 1999), or on some on/off sources (Wittevrongel and Bruneel 1999). Another relevant example of a queue model with correlated arrivals is the so-called Markov Modulated Queueing System. In Markov Modulated Queueing Systems the parameters are driven by an independent external Markovian process (Adan and Kulkarni 2003; Asmussen and Kella 2000; Combé and Boxma 1998; Lucantoni 1991; Pacheco et al. 2009). Our model shares with Markov Modulated Queueing Systems the property that one can define an external and independent Markovian process that drives the arrival rates. However, we see in Sect. 2 that the output of this external drive has also a deterministic effect on the queue length. More precisely, EDA can be interpreted as an independent drawing from an external pool of customers late at time t, see (3) below. Due to the memoryless property of the exponential delays, each customer late at time t will be still in the pool at time \(t+1\) independently with probability \(q\equiv e^{-\beta }\). This leads to binomial transitions in the number of late customers.

In Sect. 2 we show that EDA / D / 1 can be described as a bivariate Markov chain representing the queue length and the number of late customers. We show that the distribution of the late customers is described by a INAR(1) process (Al-Osh and Alzaid 1987). Then we prove that the bivariate chain is ergodic and write the balance equations of the stationary distribution, finding a functional equation for the bivariate generating function.

There exists an extensive literature about two-dimensional Markov models. Many methods for attacking the problem are available under two assumptions, namely, spatial homogeneity and finiteness of at least one marginal chain (Bini et al. 2005; Gail et al. 2000; Grassmann 2002; Latouche and Ramaswami 1999; Mitrani and Chakka 1995; Neuts 1989, 1995). Unfortunately, the Markov chain defined in Sect. 2 does not satisfy any of the mentioned requirements.

When both components of the Markov chain are infinite but space homogeneity is still ensured, the problem is typically attacked by reduction to a Riemann-Hilbert Boundary Value Problem. These may be solved, for example, by the uniformisation technique (Kingman 1961), conformal mappings (Cohen 1988; Cohen and Boxma 1983; Fayolle and Iasnogorodski 1979), the compensation method (Adan et al. 1993), or the Power Series Approximation (Blanc 1987a, b; Koole 1997; Hooghiemstra et al. 1988). The aforesaid binomial transitions are responsible for the lack of spatial homogeneity and are often encountered in Mathematical Biology (Artalejo et al. 2007; Brockwell et al. 1982; Economou 2004).

To the best of our knowledge, the functional equation (14) below has never previously appeared in the literature. Yet it is possible to mark some analogies with the functional equation studied by Economou and Kapodistria (2009), Economou et al. (2010),or Kapodistria (2011), the most important being that in both equations the right hand side exhibits the generating function computed in a convex combination in the parameter q. Adan et al. (2009), Altman and Yechiali (2006), Neuts (1994), and Yechiali (2007) give other examples of chains with binomial transitions.

In Sect. 3 we list some known results on INAR(1) process in order to describe the properties of the marginal distribution of the number of late customers and we write its exact analytical expression, which reveals the rich combinatorial structure of the problem. This intermediate result allows us to show that the stationary distribution of the EDA / D / 1 queue has a super-exponential decay. Finally, in Sect. 4 we show that such a super-exponential behaviour enables a simple, yet very effective, numerical approximation scheme of the system balance equations. For a wide range of the system parameters, including typical values for real traffic applications of the model, we give a very good a priori estimate of the total-variation distance between the true and the approximate solution. Moreover, we compare the queue of EDA / D / 1 with that of the standard M / D / 1 system, i.e. with Poissonian arrivals. We show that the two queueing distributions may be very different, especially for heavy traffic load.

2 Stationary distribution: generating function and balance equations

Let us consider the process \(n_t\), which describes the length of the queue at time t. This process is governed by the stochastic recursion

$$\begin{aligned} n_{t+1}= n_t+m_{(t,t+1]} -(1-\delta _{n_t,0}) \,, \end{aligned}$$
(2)

where \(m_{(t,t+1]}\) is the number of arrivals in the interval \((t,t+1]\) according to the arrival process (1), and \(\delta _{i,j}\) is Kronecker’s delta. The term \(1-\delta _{n_t,0}\) represents the action of the server in decreasing the queue length by one unit if this is non-empty at time t. Since the service time is deterministic, we focus on the so-called embedded process by observing the system at \(t\in \mathbb {N}\), i.e. at departure instants.

The quantity \(m_t \equiv m_{(t,t+1]}\) depends in general on the whole previous history of the system. Indeed, if for some large value of T, \(m_{s}=0\) for any \(s\in \{t-T, t-T+1,...,t-1\}\) then \(m_{t}\) is large with great probability. Conversely, if in the recent past the values of \(m_{s}\) have been large then \(m_{t}\) is expected to be small. This suggests that the arrival process is negatively autocorrelated, as proven by Guadagni et al. (2011). Hence, the recursion (2) does not depend only on the present value of \(n_t\), and the memory of the process is infinite since T can be arbitrarily large.

Let now \(l_t\) denote the number of customers that are late at time t, i.e.

$$\begin{aligned} l_t \equiv \big \vert \{ 0 \le i \le t-1 \text { such that } \xi _i > t-i \}\big \vert \,, \end{aligned}$$
(3)

and let \(p \equiv \int _0^1f_\xi (t)dt=\int _0^1\beta e^{-\beta t}dt=1-e^{-\beta }\) and \(q \equiv e^{-\beta }\). According to the memoryless property of the exponential delays \(\{\xi _i\}\), independently on its scheduled arrival time, each customer late at time \(t-1\) will continue to be late at time t with independent probabilty q, and it will arrive in queue with probabilty p. The process \(l_t\) is known in literature as INAR(1), also known as Galton–Watson branching process with Bernoulli offspring with parameter q and Bernoulli immigration of parameter \(q\rho \). This is equivalently defined as

$$\begin{aligned} l_t=\sum _{i=1}^{l_{t-1}}\zeta _{t,i}+\eta _t\,, \end{aligned}$$

where \(\zeta _{t,i}\) and \(\eta _t\) are independent Bernoulli variables (of parameters \(q\rho \) and q, respectively). The variables \(\zeta \) are interpreted in terms of random offsprings, while the variables \(\eta \) represent random immigrations.

In our case, at time \((t-1)^-\) we have \(l_{t-1}\) late customers and each of them can independently arrive in queue in the time interval \([t-1,t)\) with probability \(p=1-q\). We interpret the fact that the customer arrives in queue in terms of having no offspring, while if the customer continues to be late it has an offspring. The immigration variable represents the new late customer, which is balked with probability \(1-\rho \) and hence continues to be late at time t with probability \(\rho q\).

We can explicitly write the probability distribution of the quantity \(m_t\) conditioned to the value \(l_t\): if the customer scheduled in the interval \((t,t+1]\) has balked, then

$$\begin{aligned} P\left( m_t=j \,|\, l_t = l\right) = \left( {\begin{array}{c}l\\ j\end{array}}\right) p^j q^{l-j} \equiv b_{j,l} \,, \end{aligned}$$
(4)

otherwise

$$\begin{aligned} P\left( m_t=j \,|\, l_t = l\right) = \left( {\begin{array}{c}l+1\\ j\end{array}}\right) p^j q^{l+1-j} = b_{j,l+1}\,. \end{aligned}$$

All in all,

$$\begin{aligned} P\left( m_t=j \,|\, l_t = l\right) = b_{j,l}\,(1-\rho ) + b_{j,l+1}\, \rho \,. \end{aligned}$$
(5)

We describe the EDA / D / 1 queue by the embedded process \((n_t, l_t)\), \(t\in \mathbb {N}\), a discrete-time Markov chain with the following transition probabilities:

$$\begin{aligned}&\text {For } n>0, \nonumber \\&\quad \mathcal {P}\left( (n,l), (n+a-1,l-a+1)\right) = \rho \,b_{a,l+1}\,,&\;\;&0\le a\le l+1\,, \nonumber \\&\quad \mathcal {P}\left( (n,l), (n+a-1,l-a)\right) = (1-\rho )\,b_{a,l}\,,&\;\;&0\le a\le l\,; \end{aligned}$$
(6)
$$\begin{aligned}&\text {For } n=0, \nonumber \\&\quad \mathcal {P}\left( (0,l), (a,l-a+1)\right) = \rho \,b_{a,l+1}\,,&\;\;&0\le a\le l+1\,, \nonumber \\&\quad \mathcal {P}\left( (0,l), (a,l-a)\right) = (1-\rho )\,b_{a,l}\,,&\;\;&0\le a\le l\,. \end{aligned}$$
(7)

Figure 1 displays the transitions of the embedded chain \((n_t, l_t)\) in the quarter plane.

Fig. 1
figure 1

Transitions of the EDA / D / 1 queueing system in the quarter plane. They happen along the lines of Cartesian equation \(x+y = n_t+l_t\) and \(x+y = n_t+l_t - 1\) if \(n_t\ne 0\), and along the lines of Cartesian equation \(x+y = n_t+l_t\) and \(x+y = n_t+l_t + 1\) if \(n_t=0\). Green transitions happen with probability \(\rho \) (no balking), red transitions happen with probability \(1-\rho \) (balking). (Color figure online)

Remark 1

The idea of describing the system as a bivariate Markov chain has been already used by Guadagni et al. (2011). However, they find more convenient to work with the number of customers that have already arrived at time t rather than the number of customers late at time t. This is a direct consequence of the assumption that delays have compact support.

From the theory of INAR(1) processes, \(l_t\) is an ergodic Markov chain for \(q<1\). The existence of the stationary state of the embedded chain \((n_t, l_t)\) is guaranteed for \(\rho , q < 1\) by the following

Lemma 1

The bivariate chain \((n_t, l_t)\) is ergodic if and only if \(q<1\) and \(\rho <1\).

Proof

If \(q<1\) and \(\rho <1\), the bivariate chain \((n_t, l_t)\) is irreducible, see Fig. 1. Let us consider the process \(\alpha _t = n_t + l_t\), which represent the diagonal in the quarter plane where the point \((n_t, l_t)\) lies on, see Fig. 1. The bivariate chain has the property that \(|\alpha _{t+1}-\alpha _t|\le 1\). Equations (6)–(7) yield

$$\begin{aligned}&P(\alpha _{t+1} = \alpha _t + 1 \,|\, n_t\ne 0) = 0\,, \end{aligned}$$
(8)
$$\begin{aligned}&P(\alpha _{t+1} = \alpha _t - 1 \,|\, n_t\ne 0) = (1-\rho )\,, \end{aligned}$$
(9)
$$\begin{aligned}&P(\alpha _{t+1} = \alpha _t + 1 \,|\, n_t=0) = \rho \,, \end{aligned}$$
(10)
$$\begin{aligned}&P(\alpha _{t+1} = \alpha _t - 1 \,|\, n_t=0) = 0\,. \end{aligned}$$
(11)

In order to prove the positive recurrence of \((n_t, l_t)\), we use Foster’s criterion (Robert 2003, Cor. 8.7) setting \(f(n_t, l_t) \equiv M\alpha _t + l_t + 1\) with \(M=\nicefrac {2}{(1-\rho )}\) as the Lyapunov function. Thus, we need to show that there exist suitable positive constants \(K, \gamma \) such that

  1. 1.

    \(\mathbb {E}\left[ f(n_1,l_1) - f(n_0,l_0) \,|\,n_0=N,\ l_0=L\right] \le -\gamma \) for \(f(N,L) > K\);

  2. 2.

    \(\mathbb {E}\left[ f(n_1,l_1) \,|\, n_0=N,\ l_0=L\right] < \infty \) for \(f(N,L) \le K\);

  3. 3.

    the set \(\{n,l \ge 0 \,:\, f(n,l) \le K\}\) is finite.

Indeed

$$\begin{aligned}&\mathbb {E}\left[ f(n_1,l_1) - f(n_0,l_0) \,|\, n_0=N, l_0=L\right] \\&\qquad = M\mathbb {E}\left[ \alpha _1-\alpha _0 \,|\, n_0=N, l_0=L\right] + \mathbb {E}\left[ l_1 - l_0 \,|\, l_0=L\right] \,, \\&\qquad \le M\rho \delta _{N,0} - M(1-\rho )(1-\delta _{N,0}) + \rho q - L(1-q) \,. \end{aligned}$$

Then, using a little algebra, it can be shown that the first point of Foster’s criterion is satisfied by \(\gamma = 1\) and \(K > 1 + \frac{(M+1)(1+\rho q + M\rho )}{1-\rho }\) (for example, \(K=\frac{7}{(1-\rho )^2(1-q)}\)). Point 2 of Foster’s criterion holds because

$$\begin{aligned} \mathbb {E}\left[ f(n_1,l_1) \,|\, n_0=N, l_0=L\right] \le M(N+L+1) + L + 2, \end{aligned}$$

where we have used the property that \(\alpha _t\) has only nearest-neighbour transitions. Point 3 is fulfilled by simply considering the definition of \(f(n_t, l_t)\). For \(\rho =1\), the bivariate chain \((n_t, l_t)\) is not ergodic because it is no longer irreducibleFootnote 2. \(\square \)

For \(\rho , q < 1\), Lemma 1 guarantees both the existence and the uniqueness of the stationary distribution

$$\begin{aligned} P_{n,l} \equiv \lim _{t\rightarrow \infty } P(n_t=n, l_t=l) \,. \end{aligned}$$
(12)

Let us consider the following bivariate generating function:

$$\begin{aligned} P(z,y) \equiv \sum _{n,l \ge 0} \, z^n \, y^l \, P_{n,l}\,, \quad |z|,|y| \le 1 \,. \end{aligned}$$
(13)

We are now ready to prove the main result of this section.

Theorem 1

The bivariate generating function (13) satisfies

$$\begin{aligned} P(z,y)&= \frac{1+\rho \,(\upsilon -1)}{z}\left[ (z-1) \,P(0,\upsilon )+P(z,\upsilon )\right] \,, \end{aligned}$$
(14)

where \(\upsilon = \upsilon (z,y) \equiv z + q\,(y-z)\).

Proof

For each \(n,l \ge 0\), the balance equations of EDA / D / 1 are the following:

$$\begin{aligned} P_{n,l}= & {} (1-\rho ) \left( \sum _{j=0}^n P_{j+1, l+n-j} \, b_{n-j, l+n-j} + P_{0, l+n} \, b_{n, l+n}\right) \nonumber \\&+\, \rho \left( \sum _{j=0}^n\! P_{j+1, l+n-j-1} \, b_{n-j, l+n-j} + P_{0, l+n-1} \, b_{n, l+n}\right) , \end{aligned}$$
(15)

where \(b_{j,l}\) are given by (4) and we agree that \(P_{n,l}=0\) whenever \(n,l<0\). The special cases \(n=0\) and \(n=l=0\) respectively lead to

$$\begin{aligned}&P_{0,l} = (1-\rho )\left( P_{1,l}+P_{0,l}\right) \,b_{0,l} + \rho \left( P_{1,l-1}+P_{0,l-1}\right) \,b_{0,l} \,, \end{aligned}$$
(16)
$$\begin{aligned}&P_{0,0} = (1-\rho )(P_{1,0}+P_{0,0}) \,. \end{aligned}$$
(17)

To show that (15)–(17) hold, it suffices to write \(P_{n_{t+1},l_{t+1}}\) in terms of \(P_{n_{t},l_{t}}\) and then send t to infinity, in order to obtain the stationary state of the system.

Let us take (15) and multiply both sides by \(z^n\,y^l\). Then take (16) and multiply both sides by \(y^l\). Finally consider all such contributions, included the one in (17), and sum on nl. The summation of all terms multiplied by \((1-\rho )\) yields, after a bit of algebra,

$$\begin{aligned} (1-\rho ) \left[ P(0,zp+yq) + \frac{1}{z}\left( P(z,zp+yq) - P(0,zp+yq) \right) \right] . \end{aligned}$$

Analogously, the terms multiplied by \(\rho \) yield

$$\begin{aligned} \rho (zp+yq)\Big [ P(0,zp+yq) + \frac{1}{z}\left( P(z,zp+yq) - P(0,zp+yq) \right) \Big ]. \end{aligned}$$

Summing up the two contributions, we get (14).

\(\square \)

Remark 2

The functional equation (14) does not admit simple or immediate solutions. It is radically different from the functional equations typically studied in the literature (Cohen and Boxma 1983; Fayolle et al. 1999) and it is rather special in this respect. A simple solution can be found only in the particular case \(z=1\), see Sect. 3 below.

Remark 3

We end this section with a discussion of the special case \(q=0\). This means \(\beta \rightarrow \infty \) or, in other words, no delay \(\xi \). In this regime, the right-hand side of Eq. (14) does not depend on y anymore, and \(P(z,y) \equiv Q(z)\). The number of late customers is in fact \(l_t = 0\) as the ith customer can not have a delay \(\xi _i \ge 1\).

Then, Eq. (14) yields directly

$$\begin{aligned} Q(z) = \frac{1+\rho (z-1)}{z} [(z-1)Q(0) + Q(z)] \,, \end{aligned}$$
(18)

where \(Q(0)=1-\rho \) is the stationary probability of a void queue. Therefore, Eq. (18) is equivalent to

$$\begin{aligned} Q(z) = 1+\rho (z-1)\,, \end{aligned}$$

which is the classical result of a D / D / 1 queue with balking.

3 Asymptotic properties of \(P_{n,l}\)

In this section we first focus on \(P_{l}\), the marginal distribution of late customers. Starting from the known expression of the generating function of INAR(1) process in the form of an infinite product, we find the exact analytical expression of \(P_{l}\). Then, we derive the asymptotic behaviour of \(P_{l}\) and use it to infer asymptotics for \(P_{n,l}\).

The marginal distribution of late customers is

$$\begin{aligned} P_{l}\equiv \sum _{n\ge 0} P_{n,l} \end{aligned}$$

and its generating function is

$$\begin{aligned} \sum _{l\ge 0} P_{l}y^l = \sum _{n,l\ge 0} P_{n,l} \, y^l = P(1,y) \,. \end{aligned}$$

Setting \(z=1\) into Eq. (14) yields

$$\begin{aligned} P(1,y) = [1+\rho q(y-1)] P(1,1+q(y-1)) \,. \end{aligned}$$
(19)

This is exactly the recurrence relation of the INAR(1) generating function, that can be written in terms of an infinite product as

$$\begin{aligned} P(1,y) = \prod _{k \ge 0} \left[ 1+\rho q^{k+1}(y-1) \right] \,. \end{aligned}$$
(20)

Remark 4

The infinite product (20) has an interesting combinatorial interpretation:

$$\begin{aligned} P(1,y) = \prod _{k \ge 0} \left[ 1+\rho q^{k+1}(y-1) \right] = \frac{(\rho (1-y); q)_\infty }{1+\rho (y-1)}\,, \end{aligned}$$
(21)

where \((a; q)_\infty \equiv \prod _{k \ge 0} [1 - aq^k]\) is the infinite q-Pochhammer symbol, also known as infinite q-ascending factorial in a. For \(y= 1-\nicefrac {q}{\rho }\), \(P(1,y) = \phi (q) \,(1-q)^{-1}\), where \(\phi (q)\) is the well-known Euler function.

Remark 5

For \(q<1\), the right-hand side of (21) is analytic for each \(y\in \mathbb {C}\). Therefore, the power series \(P(1,y) = \sum _{l\ge 0} P_l \, y^l\), convergent for each \(|y|\le 1\), can be analytically continued in the whole complex plane. As such, we expect the marginal distribution \(P_l\) to decrease super-exponentially fast in l.

Following the insight given by Remark 5, we shift now the focus to the asymptotic behaviour of \(P_{l}\) and \(P_{n,l}\). Expanding the product and rearranging it in powers of \(\rho (y-1)\) yields

$$\begin{aligned} P(1,y)&= \prod _{k\ge 0}\left( 1 + \rho \, q^{k+1}(y-1)\right) \nonumber \\&= 1 + \sum _{k\ge 1} \rho ^k (y-1)^k \left[ \sum _{m\ge {k+1 \atopwithdelims ()2}} d(m;k) q^m \right] , \end{aligned}$$
(22)

where d(mk) is the number of partitions of m in k distinct parts.

Recall now the following two results from number theory.

Lemma 2

(Yaglom and Yaglom 1964) If \(m > \left( {\begin{array}{c}k+1\\ 2\end{array}}\right) \) then the number of partitions of m in k distinct parts equals the number of partitions of \(m-\left( {\begin{array}{c}k+1\\ 2\end{array}}\right) \) into at most k parts (not necessarily distinct).

Lemma 3

(Andrews 1998; Hardy and Wright 1979) Let \(p_{\le k}(m)\) be the number of partitions of m in parts that do not exceed k. Then \(p_{\le k}(m)\) equals the number of partitions of m into at most k parts and

$$\begin{aligned} P_{\le k}(q) \equiv \sum _{m\ge 0} p_{\le k}(m) \, q^m = \prod _{i=1}^k \frac{1}{1-q^i} \,. \end{aligned}$$

Using Lemmas 2 and 3 we can recast after some algebra (22) as

$$\begin{aligned} P(1,y)= \sum _{k\ge 0} \frac{\rho ^k q^{{k+1 \atopwithdelims ()2}} (y-1)^k}{\prod _{i=1}^k [1-q^i] }\,, \end{aligned}$$
(23)

and by differentiation

$$\begin{aligned} P_{l}= \sum _{k\ge l} \frac{(-1)^{k-l}\rho ^k q^{{k+1 \atopwithdelims ()2}} {k\atopwithdelims ()l}}{\prod _{i=1}^k [1-q^i] } \,. \end{aligned}$$
(24)

Remark 6

To the best of our knowledge, the explicit expression (24) has been proved in literature in relatively recent times, see Uchimura and Saitô (2011, 2015). Our proof, based solely on the combinatorial structure of the generating function and on the two Lemmas above, is slightly shorter.

Now we use (24) to obtain an upper bound on \(P_{l}\). Let \((q;q)_l = \prod _{i=1}^l \left[ 1-q^i\right] \) be the q-Pochammer symbol of the pair (qq). The following inequalities hold:

$$\begin{aligned} {m+l \atopwithdelims ()l}&= \prod _{k=1}^m \left( 1+\frac{l}{k} \right) \le (1+l)^m \,,\\ \prod _{i=l+1}^{l+m} \left[ 1-q^i\right]&\ge \prod _{i=1}^{m} \left[ 1-q^i\right] = (q;q)_m \,,\\ (q;q)_l&\ge (q;q)_\infty \,. \end{aligned}$$

Thus,

$$\begin{aligned} P_{l}&= \frac{\rho ^l\,q^{{l+1 \atopwithdelims ()2}}}{(q;q)_l} \sum _{m\ge 0} \frac{(-\rho )^m q^{(l+1)m} q^{{m \atopwithdelims ()2}} \, {m+l \atopwithdelims ()l}}{\prod _{i=l+1}^{l+m} [1-q^i]}\;, \nonumber \\&\le \frac{\rho ^l\,q^{{l+1 \atopwithdelims ()2}}}{(q;q)_\infty } \sum _{m\ge 0} \frac{ q^{{m \atopwithdelims ()2}} [\rho \, q^{l+1} \, (l+1)]^m}{ (q;q)_m }\;, \end{aligned}$$
(25)
$$\begin{aligned}&= \rho ^l\,q^{{l+1 \atopwithdelims ()2}} \frac{ \prod _{k\ge 0} \left[ 1 + q^{k+l+1} \rho (l+1) \right] }{(q;q)_\infty } \,, \end{aligned}$$
(26)

where from (25) to (26) we have used the properties of q-ascending factorials and q-binomial coefficients (Gasper and Rahman 2004). If l is sufficiently large then \(q^{l}\rho (l+1) \le 1\), and (26) yields

$$\begin{aligned} P_{l}\le \rho ^l\,q^{{l+1 \atopwithdelims ()2}} \frac{(-q;q)_\infty }{(q;q)_\infty } . \end{aligned}$$
(27)

Remark 7

Equations (24) and (27) show that, asymptotically in l, the leading order of \(P_{l}\) is \(\rho ^l q^{{l+1 \atopwithdelims ()2}}\). This fact can be directly implied from the arrival process (1). In fact, the most likely way to have l late customers (l large) is that each of the customers originally scheduled in the interval \([t-l, t)\) do not balk and are late at time t, an event of probability \(\rho q^l \, \rho q^{l-1} \cdots \rho q = \rho ^l q^{{l+1\atopwithdelims ()2}}\).

Since \(P_{n,l} \le P_{l}\), we have just obtained the following asymptotic result: uniformly in n, the equilibrium distribution \(P_{n,l}\) decays super-exponentially fast in l. More precisely,

$$\begin{aligned} P_{n,l} = O\left( \rho ^l q^{{l+1 \atopwithdelims ()2}} \right) \quad \text {for } l \rightarrow \infty \,. \end{aligned}$$
(28)

As a matter of fact, the super-exponential decay of \(P_{n,l}\) may be proved asymptotically in n too. Considering the auxiliary process \(\alpha _t = n_t + l_t\), which we have already encountered in the proof of Lemma 1, we can write

$$\begin{aligned} p_a \equiv P(\alpha _t = a) = \sum _{\begin{array}{c} n,l \ge 0\\ n+l=a \end{array}} P_{n,l} \,,\quad a \ge 0 \,. \end{aligned}$$

It is straightforward to prove that the generating function of \(p_a\) is

$$\begin{aligned} P(z,z) = \frac{1+\rho (z-1)}{1-\rho } P(0,z) \,. \end{aligned}$$
(29)

From (29), it follows immediately that

$$\begin{aligned} p_a = P_{0,a} + \frac{\rho }{1-\rho }P_{0,a-1}\,. \end{aligned}$$
(30)

For \(a=n+l\), formulas (28) and (30) then yield

$$\begin{aligned} P_{n,l} \le p_a = P_{0,a} + \frac{\rho }{1-\rho }P_{0,a-1} = O(\rho ^{a} q^{\left( {\begin{array}{c}a\\ 2\end{array}}\right) }) \,. \end{aligned}$$
(31)

Therefore, the following asymptotic result holds:

Theorem 2

The equilibrium distribution \(P_{n,l}\) decays super-exponentially fast as either \(n\rightarrow \infty \) or \(l\rightarrow \infty \). More precisely,

$$\begin{aligned} P_{n,l} = O\left( \rho ^{n+l} q^{{n+l \atopwithdelims ()2}} \right) \quad \text {for } n,l \rightarrow \infty \,. \end{aligned}$$
(32)

Remark 8

Equation (29) gives an interesting connection between the equilibrium distribution of the quantity \(\alpha _t = n_t+l_t\) and the stationary probability of having l late customers given that the queue is void. Figure 1 shows that the latter event drives the dynamic of \(\alpha _t\) through an independent Bernoulli random variable with parameter \(\rho \), which explains the factor \(1+\rho (z-1)\).

4 Approximate computation of \(P_{n,l}\)

As we have just shown that the joint stationary probability \(P_{n,l}\) decreases super-exponentially fast in the limit of either \(n, l \rightarrow \infty \), the natural question arising is whether a bare truncation of the infinite linear system (15)–(17) is sufficient to obtain a satisfactory numerical expression of \(P_{n,l}\). As we will see, in this case the answer is positive due to (32), and this may prove crucial in contexts where practical solutions are needed.

Another natural question, once the numerical expressions of the probability distribution of our process are obtained, is whether the average performance of this queueing system is comparable with a classical memoryless system, say an M / D / 1 with the same traffic intensity \(\rho \), or if it is definitely different. This question is natural also because we know, see above, that for q very close to 1 the arrival process converges to a Poisson process, and we may ask what are the actual values of q such that, with respect to the average congestion, the two processes, Poisson and EDA, become indistinguishable. As it will be clear, the difference between the average queue lengths remains large also for q very close to 1, due to the negative autocorrelation of the EDA arrivals. This can be interesting also in many applications, for instance in the air traffic case.

In order to compute numerically the probability distribution, we truncate the infinite system of balance equations (15)–(17) by fixing an integer \(\alpha _{\max }\) and imposing \(P_{n,l} = 0\) for \(n+l > \alpha _{\max }\). Then we map the quarter plane \(\{n,l \in \mathbb {N}\times \mathbb {N}\}\) onto the set of non-negative integers and relabel the unknowns \(P_{n,l}\) as \(\pi _i\), recasting (15)–(17) as \(\pi = \pi Q\). For the details of both mapping and relabeling, see “Appendix A”

Next, we map the set \(\{n,l \in \mathbb {N}\times \mathbb {N} \text { such that } n+l \le \alpha _{\max }\}\) onto the set of non-negative integers \(\{0,1,\ldots ,k_{\max }\}\), where \(k_{\max } \equiv {\alpha _{\max } + 1 \atopwithdelims ()2}\). We want to consider the truncated system

$$\begin{aligned} {\left\{ \begin{array}{ll} \tilde{\pi }_i = \sum _{j=0}^{k_{\max }} \tilde{\pi }_j Q_{j,i} \quad \text {for} \quad i = 1,2,\ldots ,k_{\max }-1 \,,\\ \sum _{j=0}^{k_{\max }} \tilde{\pi }_j = 1 \,. \end{array}\right. } \end{aligned}$$
(33)

The idea we present is not new and has been already discussed, for instance by Tijms (1994) for stationary distributions with geometric tail. “Appendix B” shows that there exists a sequence \(\{\varepsilon _j\}\) such that

$$\begin{aligned}&\sum _{i> k_{\max }} \pi _i \, Q_{i,j} \le \varepsilon _j \,, \qquad j=0,1,\ldots ,k_{\max }-1\,,\\&\sum _{i > k_{\max }} \pi _i \le \varepsilon _{k_{\max }} \,. \end{aligned}$$

An a priori estimate of the error introduced by the truncation can be obtained from perturbation theory (Golub and Loan 2012, § 2.6.2):

$$\begin{aligned} \sum _{j=0}^{k_{\max }} |\pi _j - \tilde{\pi }_j| \le \kappa (A) \sum _{i=0}^{k_{\max }} \varepsilon _i \,, \end{aligned}$$
(34)

where A is the \(k_{\max } \times k_{\max }\) matrix

$$\begin{aligned} A_{ij} \equiv {\left\{ \begin{array}{ll} \delta _{i,j} - Q_{i,j} \,, &{}\quad i=0,1,\ldots ,k_{\max }-1\,,\\ 1 &{} \quad i=k_{\max } \,, \end{array}\right. } \end{aligned}$$
(35)

\(\delta _{i,j}\) is the usual Kronecker’s delta, and

$$\begin{aligned} \kappa (A) \equiv \Vert A \Vert _1 \Vert A^{-1} \Vert _1 \end{aligned}$$

is the norm-1 condition number of the matrix A.

From “Appendix B”,

$$\begin{aligned} \sum _{j=0}^{k_{\mathrm{max}}} \varepsilon _j \le 2 \alpha _{\max } \frac{(-q;q)_\infty }{(q;q)_\infty } \rho ^{\alpha _{\mathrm{max}}+1} q^{{\alpha _{\mathrm{max}}+ 1 \atopwithdelims ()2}} \le 2 \alpha _{\max } \frac{(-q;q)_\infty }{(q;q)_\infty } q^{{\alpha _{\mathrm{max}}+ 1 \atopwithdelims ()2}}\,. \end{aligned}$$
(36)

Numerical evaluations show that \(\frac{(-q;q)_\infty }{(q;q)_\infty } q^{{\alpha _{\mathrm{max}}\atopwithdelims ()2}}\) remains smaller than \(10^{-16}\) for \(\alpha _{\max }=100\) and \(q\le .98\).

Estimating the condition number of a matrix is a notably difficult problem and a vast literature exists on this topic (Golub and Kahan 1965; Qi 1984; Johnson 1989). Since one of the aims of the present section is showing that a bare truncation of the balance equations (6)–(7) may be sufficient for an approximate computation of \(P_{n,l}\), we evaluate \(\kappa (A)\) numerically in the \(\rho q\)-plane when \(\alpha _{\max } = 100\), finding that the condition number is not larger than \(10^5\) for \(\rho , q\le 0.99\). Combining these two results we obtain that, uniformly in \(\rho \le 0.99\), the a priori norm-1 approximation error is less than \(10^{-12}\) for q up to 0.98.

Remark 9

For air traffic applications, \(q=0.98\) corresponds to typical delays of the order of one hour. The same value of q for other transport systems, e.g. trains or buses, correspond to even higher delays. Consider also that typical values of traffic intensity \(\rho \) in extremely congested systems, e.g. London Heathrow Airport, do not exceed 0.98, see Caccavale et al. (2014). Therefore, the approximation scheme presented in this section is very fit for real life applications.

We conclude this section by analysing the performance of EDA/D/1 under congestion, i.e. \(\rho , q\) close to 1. First, we offer a comparison of the marginal distribution of the queue according to EDA / D / 1 and the classical M / D / 1 system. Figure 2 shows the distribution of the queue length in 25 different scenarios, with \(\rho \) and q increasing from 0.9 to 0.98 and 0.95, respectively. EDA / D / 1 and M / D / 1 are compared varying q keeping \(\rho \) fixed. We see that, even for very large values of parameter q, the super-exponential decay of EDA / D / 1 yields both a much thinner tail and a much shorter average queue length. The plot also suggests that the distribution of the queue approaches the M / D / 1 cases for \(q\rightarrow 1\), which is expected in view of the weak convergece of 1 to a Poisson process. Yet this phenomenon is much more evident for low values of \(\rho \) – not reported in Fig. 2.

Fig. 2
figure 2

Comparison of the queue distribution of EDA/D/1 and M/D/1 models for \(\rho = 0.90, 0.92, 0.94, 0.96, 0.98\) and \(q = 0.90, 0.9125, 0.925, 0.9375, 0.95\). (Color figure online)

Figure 3 shows the average queue length. We note that the mean value of the queue never exceed 6, while for \(\rho =.98\) the average queue of M / D / 1 is equal to 25. This shows, as outlined above, that the negative autocorrelation of the arrivals 1 has an heavy impact on the congestion when the traffic intensity is high. As mentioned, such difference tends to vanish for low traffic intensity. A deeper comparison, extended to a wider range of values \(\rho , q\) is available at the address https://github.com/clancia/EDA/blob/master/Asymptotics.ipynb.

Fig. 3
figure 3

Mean queue length of EDA/D/1 for different pairs \((\rho , q)\)

5 Conclusions

In this paper we have addressed a single-server queueing system with deterministic service time and exponentially delayed arrivals. The point process describing these arrivals dates back to the ’50s of the past century and was studied by Kendall and others.

We have described the model as a bivariate Markov chain, proved that the latter is ergodic, wrote the balance equations of the stationary distribution, and found a functional equation for the bivariate generating function. Then we have focused on the marginal distribution of the number of late customers and found its exact expression. This intermediate step has enabled the fundamental result on the super-exponential decay of the joint stationary distribution. The characterisation of the asymptotic behaviour has finally led us to show that the solution to the balance equations can be approximately computed in a simple yet very accurate way. We have than compared this queueing system with a classical memoryless system, and we have found that the two systems are deeply different for high traffic intensity

In spite of the big efforts we have put forward to find the solution to the functional equation (14), the complete solution of the problem is still out of reach. An expansion in powers of q seems to be a promising approach to obtain (at least) an approximate expression of the bivariate generating function. This method allows to set up a recursive scheme to compute the coefficients of the power series (Lancia 2013). Unfortunately, the computations are quite involved and need some additional work to be refined. This is the subject of ongoing research.