Introduction

The integrated vendor–buyer problem is inspired by the expanding focus on supply chain management, which has been proved to be an adequate means by which both the vendor and the buyer can be benefited simultaneously. In this context, a significant amount of literature can be found in Banerjee (1986), Goyal (1976, 1988), Ha and Kim (1997), Hill (1997, 1999), Pan and Yang (2002), Ouyang et al. (2006), Giri and Bardhan (2015), etc. and the references therein. In the vendor–buyer literature, the market demand is generally assumed to be deterministic and the shortages are not allowed in the buyer’s inventory. For instance, Otake et al. (1999) investigated an inventory model with investment in setup operations under return on investment maximization. Wee et al. (2009) studied multi-objective joint replenishment inventory models for deteriorating items under fuzzy environment. Ghasemy et al. (2014a) developed an enhanced joint pricing and lot-sizing problem with logit demand function. Ghasemy et al. (2014b) also studied pricing and lot-sizing decisions in the retail industry under fuzzy chance constraint approach. Using game theoretic approach, recently Noori-daryan et al. (2019) analyzed pricing, promised delivery lead time, supplier-selection, and ordering decisions of a multi-national supply chain under price and delivery lead time dependent demand.

Ben-Daya and Hariga (2004) were the first to extend an integrated vendor–buyer model with stochastic customer demand. Since then a lot of researchers have studied the integrated model with stochastic demand under various assumptions (Hsiao 2008b; Glock 2009, 2012). However, in most of these works, the production process is presumed to be perfect. Relaxing this perfect production assumption, Huang (2004) developed an optimal integrated inventory policy for defective items in a just-in-time manufacturing environment. Ouyang et al. (2006) also studied an integrated vendor–buyer inventory model for defective items. Hsiao (2008b) proved that an inventory system which is controlled by the reorder and shipping points is preferable compared to the system where only the reorder point acts as a control parameter. Glock (2009) developed a model with variable lead-time by extending Ben-Daya and Hariga (2004)’s model with an increase in the batch shipments by a fixed factor. Glock (2012) further extended the previous (Qr) model by studying the alternative methods for reducing the lead-time. Taleizadeh and Noori-daryan (2016) developed pricing, inventory and production policies in a three-layer supply chain with rework process under a game theoretic approach. Taleizadeh et al. (2015) also studied pricing and ordering decisions in a three-layer supply chain with imperfect quality items under buyback of defective items. More studies on imperfect production process were contributed by Ben-Daya and Hariga (2000), Salameh and Jaber (2000), Ben-Daya and Hariga (2004), Lin (2012), Shu and Zhou (2013), to name a few. However, in these works, the quality of the production process was not taken to be a control parameter. But it is well established that making an investment in the production process, in terms of buying new equipment, regular machine maintenance and repair, worker training, etc. can help improving the production process quality (Porteus 1986). In an integrated scenario, since the production process is vendor-controlled, and the vendor has to pay warranty cost for defective items, it is beneficial for him/her to invest in order to reduce the number of defective items produced. For instance, Otake and Min (2001) and Li et al. (2008) studied inventory and investment in quality improvement under return on investment maximization.

In an integrated system with imperfect production process, it is very likely that the buyer performs some sort of inspection activity before selling the products to the customers. Neglecting this inspection/screening or assuming it to be non-negligible is not very practical. Keeping these issues in mind, Dey and Giri (2014) developed an integrated single-vendor single-buyer production inventory model with imperfect production process, finite screening time and vendor investment to improve process quality. This model was extended by Mukherjee et al. (2019) to include controllable backorder rate by means of backorder price discount. However, in both the above mentioned papers, the lead-time is assumed to be a constant. But, in reality, the lead-time is, more often than not, variable. A significant amount of works on variable lead time is available in the literature. Hosseini et al. (2013) adopted a multiple objective approach for joint ordering and pricing policy for an inventory system with stochastic lead-time. Ben-Daya and Hariga (2004) illustrated the benefits of a lot-size dependent lead time on the total expected costs. Glock (2012) showed how reducing the lead-time can affect the operational costs. With this view point, Mukherjee and Dey (2018) extended the model of Dey and Giri (2014) to include lot-size dependent lead-time. However, most of the papers mentioned above did not consider learning in production which is one of the most relevant issues of production-inventory systems in recent times.

Learning is an important human factor that has been proved to enhance the overall performance of a supply chain. In systems where workers are involved in repetitive type of production process, learning plays a very important role. Wright (1936) was among the early researchers who described the learning process with the help of a power curve. According to Wright’s learning curve, per unit production cost decreases by some fixed percentage when the production quantity doubles. Learning in production affects the optimal lot-sizes and dispatch time (Jaber and Boney 1999). Elmaghraby (1990) and Jaber and Boney (1996) addressed the forgetting phenomenon during non-production period. Glock and Jaber (2013) developed a mathematical model for imperfect production process where production and reproduction processes both are subject to learning and forgetting. Giri and Glock (2017) considered a closed-loop supply chain where production and inspection processes are subject to learning and forgetting. Recently, Dey and Giri (2018) presented a new approach to deal with learning in inspection for a single-vendor single-buyer integrated imperfect inventory model. However, the combined effects of vendor’s investment in terms of quality control, lot-size dependent lead-time and learning in production on the optimal decisions of an integrated vendor–buyer inventory model remains unexplored.

Keeping this point in mind, in this paper, an attempt is made to extend Dey and Giri (2014) and Mukherjee and Dey (2018)’s models further by taking into consideration lot-size dependent lead-time at the buyer’s end and learning in production at the vendor’s end. The safety stock factor is further assumed to be different for the first batch and the rest of the batches (Hsiao 2008a). The remainder of the paper is organized as follows. Notations and assumptions for developing the proposed model are given in the next section. Section 3 is devoted to model development from buyer’s and vendor’s perspectives as well as using integrated approach. The solution procedure of the model is outlined in Sect. 4. In Sect. 5, the model is illustrated through numerical examples. The paper is concluded with some remarks in Sect. 6.

Notations and assumptions

Notations

The following notations are used to develop the proposed model.

Q:

Shipment size (decision variable)

n:

Number of shipments (decision variable)

y:

Percentage of defective items produced (decision variable)

\(k_{1}\):

Safety stock factor for the first batch (decision variable)

r:

Reorder point

\(y_0\):

Original percentage of defective items produced

D:

Expected demand rate for non-defective items(units/year)

P:

Production rate, \(p = \frac{1}{P}\)

A:

Buyer’s ordering cost per order

F:

Transportation cost per delivery

K:

Vendor’s setup cost

L:

Lead-time

\({h}_{v}\):

Vendor’s holding cost per item per year

\({h}_{b1}\):

Buyer’s holding cost for defective items per item per year

\({h}_{b2}\):

Vendor’s holding cost for non-defective items per item per year

s:

Unit screening cost

x:

Screening rate

w:

Unit warranty cost for defective items

\(\pi\):

Buyer’s shortage cost per item per year

\(\eta\):

Fractional opportunity cost

\(\delta\):

Percentage decrease in defective items per dollar increase in investment

c:

Vendor’s production cost per unit per year

l:

Learning exponent in the production process

\(k_{2}\):

Safety stock factor for the \(j{\mathrm{th}}\) batch; \(j= 2,3,\ldots ,n\)

Assumptions

The proposed model is developed with the following assumptions:

  • The supply chain consists of a single vendor and a single buyer, and deals with a single product.

  • Demand is stochastic and normally distributed with mean D and standard deviation \(\sigma\).

  • The buyer follows the classical (Qr) continuous review inventory policy.

  • An order of nQ (non-defective) items is placed by the buyer to the vendor. These items are then produced and transferred to the buyer in n equal-sized shipments by the vendor, n being a positive integer.

  • Lead-time L is not a constant; it depends on the lot-size Q as given below:

    \(L=pQ+b\), where b denotes a fixed delay due to transportation, production time of other products scheduled during the lead-time on the same facility, etc. The mean and variance for lead time demand are \(D\sqrt{pQ+b}\) and \(\sigma ^2 \sqrt{pQ+b}\), respectively.

  • The re-order point r = expected demand during lead-time + safety stock (SS) i.e., \(r = D(pQ+b)+k\sigma \sqrt{pQ+b}\), where k is the safety stock factor.

  • Shortages at the buyer’s inventory are allowed and completely backlogged.

  • There is no overshooting of orders, i.e., there is no more than a single order outstanding in any cycle.

  • \(y_{0}~(0\le y_{0}\le 1)\) is the percentage of defective items produced in each batch of size Q.

  • The vendor’s production rate of non-defective items is greater than the mean demand rate i.e., \(P(1-y_{0})>D\).

  • Upon the arrival of each batch, the buyer inspects all the items. It is assumed that the screening process is non-destructive and error-free. The screening rate x is fixed and greater than the mean demand rate D.

  • The vendor incurs a warranty cost for each defective item produced.

  • The vendor invests money to improve the production process quality in terms of buying new equipment, improving machine maintenance and repair, worker training, etc. We consider the following logarithmic investment function I(y) (Porteus 1986):

    $$\begin{aligned} I(y)=\frac{1}{\delta }\ln \left( \frac{y_{0}}{y}\right) \end{aligned}$$

    where \(\delta\) is the percentage decrease in y per dollar (or any other suitable currency) increase in investment and \(y_{0}\) is the original percentage of defective items produced prior to investment.

Model development

Buyer’s perspective

We assume that the vendor accepts an order of size nQ for non-defective items from the buyer. The vendor produces these nQ items all in one go so as to minimize the production costs, and then n batches of size Q are delivered each at regular intervals of \(Q(1-y_{0})/D\) units of time. Thus each ordering cycle is of length \(Q(1-y_{0})/D\) and the complete production cycle is of length \(nQ(1-y_{0})/D\).

When the inventory of non-defective items reaches the re-order level r, the buyer requests the vendor for the next shipment. The vendor’s delivery reaches the buyer after a lead time L (see Fig. 1).

Fig. 1
figure 1

Inventory of the buyer

We assume the lead-time L as \(L(Q) = pQ+b\), where b denotes a fixed delay due to transportation, production time of other products scheduled during the lead-time on the same facility, etc. The fixed delay factor b can be decomposed into the waiting and set-up time \(T_{s}\) and the transportation time \(T_{b}\) (Hsiao 2008a). The production rate for non-defective items is assumed to be greater than the mean demand rate, and thus the vendor, having sufficient stock, can deliver the second batch at time \(t_{2} = t_{1} + \frac{Q}{D} - T_{b}\). Thus, the lead-time for the second batch is \(T_{b}\). Similarly, the lead-time for the jth batch is \(T_{b}\), for all \(j= 2,3,\ldots ,n\).

We assume that the demand during the lead time is normally distributed with mean DL(Q) and standard deviation \(\sigma \sqrt{L(Q)}\). In this case, the safety stock, S, is given by

$$\begin{aligned} S = k_{1}\sigma \sqrt{pQ+b} \end{aligned}$$
(1)

\(k_{1}\) being the safety factor for the first batch.

The expected shortage during this first batch is \(b(s,L(Q))=\sigma \sqrt{pQ+b}\psi (k_{1})\), where \(\psi (k_{1})=\int _{k}^{\infty }(z-k_{1})\phi (z)dz\), \(\phi (z)\) being the standard normal density function.

For the jth batch, \(j=2,3,\ldots ,n\), the demand during lead time is normally distributed with mean \(DT_{b}\) and standard deviation \(\sigma \sqrt{T_{b}}\) and the safety stock, S, is given by

$$\begin{aligned} S=k_{2}\sigma \sqrt{T_{b}} \end{aligned}$$
(2)

\(k_{2}\) being the safety factor for the \(j{{\mathrm{th}}}\) batch, \(j=2,3,\ldots ,n\).

The expected shortage during all other batches is given by \(b(sr,T_{b})=\sigma \sqrt{T_{b}}\psi (k_{2})\), where \(\psi (k_{2})=\int _{k}^{\infty }(z-k_{2})\phi (z)dz\)

From Eqs. (1) and (2), \(k_{2}\) can be expressed in terms of \(k_{1}\) as given below (Hsiao 2008a):

$$\begin{aligned} k_{2}=k_{1}\sqrt{\frac{pQ+b}{T_{b}}} \end{aligned}$$
(3)

It is assumed that the safety factors \(k_{1}\) and \(k_{2}\) are related to the re-order point r. Since \(k_{2}\) is expressed in terms of \(k_{1}\), we consider \(k_{1}\) as a decision variable instead of r.

The defective items which are discovered gradually in each batch are kept in hold separately and then returned to the vendor on the arrival of the next batch. The buyer, thus, has holding costs for defective items and non-defective items. The buyer’s average inventory level for non-defective items (including those defective items which have not yet been detected before the end of the screening time Q / x) is given by

$$\begin{aligned} \frac{nQ(1-y)}{D}\left[ k\sigma \sqrt{pQ+b}+\frac{Q(1-y)}{2}+\frac{DQy}{2x(1-y)}\right] \end{aligned}$$
(4)

The average inventory level for defective items is

$$\begin{aligned} nQ^2y\left[ \frac{1-y}{D}-\frac{1}{2x}\right] \end{aligned}$$
(5)

Therefore, the buyer’s annual expected total cost including the ordering cost, shipment cost, holding cost, shortage cost and screening cost is given by

$$\begin{aligned}&{{\mathrm{ETCB}}}(Q,k_{1},n) = \frac{D(A+nF)}{nQ(1-y)}+h_{b1}\left[ Qy-\frac{DQy}{2x(1-y)}\right] \\&\quad + h_{b2}\left[ k_{1}\sigma \sqrt{pQ+b}+\frac{Q(1-y)}{2}+\frac{DQy}{2x(1-y)}\right] \\&\quad + \frac{\pi D \sigma }{Q(1-y)}[\sqrt{pQ+b}\psi (k_{1})+(n-1)T_{b}\psi (k_{2})] \\&\quad + \frac{sD}{1-y} \end{aligned}$$
(6)

Now, if the buyer follows a deterministic demand, places his order only when the inventory level falls to zero and receives the order instantaneously then \(k_{1}=k_{2}=0,L=0,\pi =0\). Also, if the buyer screens the items received as a single batch and rejects them as a single batch at the end of the screening process then we have \(h_{b1}=0\). Under these assumptions, the above expression for the annual expected total cost for the buyer modifies to

$$\begin{aligned}&{{\mathrm{ETCB}}}(Q,n) = \frac{D(A+nF)}{nQ(1-y)} \\&\quad + h_{b2}\left[ \frac{Q(1-y)}{2}+\frac{DQy}{x(1-y)}\right] +\frac{sD}{1-y} \end{aligned}$$
(7)

which is the same expression as given in Huang (2004).

If the production process is assumed to be perfect, i.e., \(y=0\), then the annual expected total cost for the buyer given in (6) reduces to

$$\begin{aligned}&{{\mathrm{ETCB}}}(Q,n,k) = \frac{D(A+nF)}{nQ} \\&\quad +h_{b2}\left[ k\sigma \sqrt{pQ+b}+\frac{Q}{2}\right] +\frac{\pi D \sigma \sqrt{pQ+b}\psi (k)}{Q} \end{aligned}$$
(8)

which is the same expression as given in Ben-Daya and Hariga (2004).

Vendor’s perspective

During the production process, the vendor produces Q items in the first instance and delivers those to the buyer. After that, the vendor delivers a quantity Q to the buyer every \(T=\frac{Q(1-y)}{D}\) units of time. This process continues till the vendor’s production run is completed after \(\frac{nQ}{P}\) units of time (Fig. 2).

Fig. 2
figure 2

Inventory of the vendor

By assumption, the vendor’s production rate for non-defective items is greater than the mean demand rate. Therefore, the vendor’s inventory level gradually increases as long as the production continues and when the production stops, the inventory level starts decreasing according to the demand rate. Then the vendor’s average inventory holding cost (see Fig. 3 for holding area) can be calculated as follows (Huang 2004):

$$\begin{aligned} {{\mathrm{EHCV}}} = h_{v}\frac{Q}{2}\left[ (n-1)-(n-2)\frac{Dp}{1-y}\right] \end{aligned}$$
(9)
Fig. 3
figure 3

Vendor’s inventory holding area

The annual expected total cost incurred by the vendor is, thus, obtained as the sum of the setup cost, holding cost, warranty cost for the defective items and production cost (Huang 2004):

$$\begin{aligned}&{{\mathrm{ETCV}}}(Q,n) = \frac{BD}{nQ(1-y)} \\&\quad +h_{v}\frac{Q}{2}\left[ (n-1)-(n-2)\frac{Dp}{1-y}\right] +\frac{wDy}{1-y}+\frac{cD}{P(1-y)} \end{aligned}$$
(10)

If the production process is perfect, i.e., \(y=0\), then (10) reduces to

$$\begin{aligned} {{\mathrm{ETCV}}} = \frac{BD}{nQ}+h_{v}\frac{Q}{2}\left[ n(1-Dp)-1+2Dp\right] \end{aligned}$$
(11)

which is the same expression as given in Ben-Daya and Hariga (2004).

The total cost given in Eq. (10) does not include any investment on part of the vendor to improve the process quality. We now assume that the vendor makes an investment to reduce the number of defective items produced. Assuming a logarithmic investment function of the form \(I(y)=\frac{1}{\delta }\ln \left( \frac{y_{0}}{y}\right)\), the expected annual total cost of the vendor can be obtained as

$$\begin{aligned}&{{\mathrm{ETCV}}}(Q,y,n) = \frac{BD}{nQ(1-y)} \\&\quad +h_{v}\frac{Q}{2}\left[ (n-1)-(n-2)\frac{Dp}{1-y}\right] +\frac{wDy}{1-y} \\&\quad +\frac{\eta }{\delta }\ln \left( \frac{y_{0}}{y}\right) \end{aligned}$$
(12)

where \(\eta\) is the fractional opportunity cost. It may be noted here that this logarithmic function is convex in y.

Learning in production

Here we assume that \(Q_{p} = nQ\) units of items are produced in every cycle and the learning in the vendor’s production process follows the learning curve proposed by Wright (1936). In the following, we calculate the vendor’s average cost for the \(i{{\mathrm{th}}}\) cycle, \(i=1,2,3,\ldots\).

The production time for cycle i is given by

$$\begin{aligned} T_{pi}&= {} \int _{(i-1)Q_{p}}^{iQ_{p}}T_{1}x^{-l}{\mathrm{d}}x \\&= {} \frac{T_{1}Q_{p}^{1-l}[i^{1-l}-(i-1)^{1-l}]}{1-l} \end{aligned}$$
(13)

where l denotes learning exponent.

From (13), the production quantity in the \(i{{\mathrm{th}}}\) cycle can be written as

$$\begin{aligned} Q(t) = \left[ \frac{(1-l)t}{T_{1}}\left( \frac{1}{i^{1-l}-(i-1)^{(1-l)}}\right) \right] ^{(1/1-l)} \end{aligned}$$
(14)

The average inventory of the product during the production process in the \(i{{\mathrm{th}}}\) cycle is given by

$$\begin{aligned} I_{T_{pi}}&= {} \int _{0}^{T_{pi}}Q(t){\mathrm{d}}t \\&= {} \frac{T_{1}[i^{1-l}-(i-1)^{1-l}](nQ)^{2-l}}{2-l} \end{aligned}$$
(15)

After the production starts in the \(i{{\mathrm{th}}}\) cycle, the time for the first dispatch is given by

$$\begin{aligned} T_{1i}&= {} \int _{(i-1)nQ}^{Q+(i-1)nQ}T_{1}x^{-l}{\mathrm{d}}x \\&= {} \frac{T_{1}Q^{1-l}[(1+(i-1)n)^{1-l}-((i-1)n)^{1-l}]}{1-l} \end{aligned}$$
(16)

Now, we calculate the vendor’s average inventory in the \(i{{\mathrm{th}}}\) cycle as follows:

From Fig. 4, we have

$$\begin{aligned} {{\mathrm{Area}}}_{\mathrm{ABFE}}&= {} nQT_{1i} \\&= {} \frac{nT_{1}Q^{2-l}[(1+(i-1)n)^{1-l}-((i-1)n)^{1-l}]}{1-l} \end{aligned}$$
(17)
$$\begin{aligned} {{\mathrm{Area}}}_{\mathrm{BJHF}}&= {} \frac{n(n-1)Q^2(1-y)}{D} \\ {{\mathrm{Area}}}_{\mathrm{ACGE}}&= {} nQT_{pi} \\&= {} \frac{T_{1}(nQ)(nQ)^{1-l}[i^{1-l}-(i-1)^{1-l}]}{1-l} \\&= {} \frac{T_{1}(nQ)^{2-l}[i^{1-l}-(i-1)^{1-l}]}{1-l} \end{aligned}$$
(18)
Fig. 4
figure 4

Vendor’s total inventory in the ith cycle under learning in production

Thus, the vendor’s average inventory in the production period of the \(i{{\mathrm{th}}}\) cycle is determined from the three areas given above as

$$\begin{aligned} I_{T_{di}}&= {} {\mathrm{Area}}_{\mathrm{ABFE}} + {\mathrm{Area}}_{\mathrm{BJHF}} + {\mathrm{Area}}_{\mathrm{ACGE}} \\ I_{T_{di}}(Q,n)&= {} \frac{nT_{1}Q^{2-l}[(1+(i-1)n)^{1-l}-((i-1)n)^{1-l}]}{1-l} \\&\quad + \frac{n(n-1)Q^2(1-y)}{D} \\&\quad - \frac{T_{1}(nQ)^{2-l}[i^{1-l}-(i-1)^{2-l}]}{1-l} \end{aligned}$$
(19)

Thus, the vendor’s average inventory in the \(i{{\mathrm{th}}}\) cycle is

$$\begin{aligned} I_{vi}&= {} I_{T_{pi}}+I_{T_{di}} - \frac{n(n-1)Q^2(1-y)}{2D} \\&= {} \frac{T_{1}[i^{1-l}-(i-1)^{1-l}](nQ)^{2-l}}{2-l} \\&\quad + \frac{nT_{1}Q^{2-l}[(1+(i-1)n)^{1-l}-(i-1)n^{1-l}]}{1-l} \\&\quad + \frac{n(n-1)Q^2(1-y)}{D}-\frac{n(n-1)Q^2(1-y)}{2D} \\&\quad - \frac{T_{1}(nQ)^{2-l}[i^{1-l}-(i-1)^{1-l}]}{1-l} \\&= {} T_{1}(nQ)^{2-l}{i^{1-l}-(i-1)^{1-l}}\left[ \frac{1}{2-l}-\frac{1}{1-l}\right] \\&\quad + \frac{n(n-1)Q^2(1-y)}{D} \\&\quad +\frac{nT_{1}Q^{2-l}[(1+(i-1)n)^{1-l}-(i-1)n^{1-l}]}{1-l} \\&= {} \frac{nT_{1}Q^{2-l}[(1+(i-1)n)^{1-l}-(i-1)n^{1-l}]}{1-l} \\&\quad + \frac{n(n-1)Q^2(1-y)}{D} \\&\quad - \frac{T_{1}(nQ)^{2-l}{i^{1-l}-(i-1)^{1-l}}}{(2-l)(1-l)} \end{aligned}$$
(20)

The vendor’s total cost in the \(i{{\mathrm{th}}}\) cycle is

$$\begin{aligned} C_{vi}(Q,n)&= {} A_{v}+cT_{pi} \\&\quad + h_{v}\left[ \frac{nT_{1}Q^{2-l}[(1+(i-1)n)^{1-l}-(i-1)n^{1-l}]}{1-l} \right. \\&\quad \left. +\frac{n(n-1)Q^2(1-y)}{2D} \right. \\&\quad \left. - \frac{T_{1}(nQ)^{2-l}{i^{1-l}-(i-1)^{1-l}}}{(2-l)(1-l)}\right] \end{aligned}$$
(21)

On simplification, (21) reduces to

$$\begin{aligned} C_{vi}(Q,n)&= {} A_{v}+ \frac{h_{v}Q^{2-l}}{P(1-l)} \\&\quad \left[n[(1+(i-1)n)^{1-l}-(i-1)n^{1-l}]\right. \\\quad &\left. -\frac{n^{2-l}(i^(1-l)-(i-1)^{1-l})}{2-l}\right] \\&\quad + h_{v}\frac{n(n-1)Q^2(1-y)}{2D} \\&\quad + \frac{c(nQ)^{1-l}[i^{1-l}-(i-1)^{1-l}]}{P(1-l)} \end{aligned}$$
(22)

Integrated approach

The annual expected total cost of the integrated system is the sum of the vendor’s and the buyer’s annual expected total costs for the \(i{{\mathrm{th}}}\) production cycle, which is given by

$$\begin{aligned}&ETC_i(Q,y,k_{1},n) = \frac{D(A+B+nF)}{nQ(1-y)} \\&\quad +h_{b1}\left[ Qy-\frac{DQy}{2x(1-y)}\right] +h_{v}\frac{Q(n-1)}{2} \\&\quad + h_{b2}\left[ k_{1}\sigma \sqrt{pQ+b}+\frac{Q(1-y)}{2}+\frac{DQy}{2x(1-y)}\right] \\&\quad + \frac{\pi D \sigma }{Q(1-y)}\frac{[\sqrt{pQ+b}\psi (k_{1})+(n-1)\sqrt{T_{b}}\psi (k_{2})]}{den} \\&\quad +\frac{(s+wy)D}{1-y} \\&\quad + \frac{\eta }{\delta }\ln \left( \frac{y_{0}}{y}\right) + \frac{c(nQ)^{-l}D[i^{1-l}-(i-1)^{1-l}]}{P(1-y)(1-l)} \\&\quad + \frac{h_{v}DQ^{1-l}}{P(1-y)(1-l)}\left[ [1+(i-1)n]^{1-l}-[(i-1)n]^{1-l} \right. \\&\quad \left. - \frac{n^{1-l}(i^{1-l}-(i-1)^{1-l})}{2-l}\right] \end{aligned}$$
(23)

Now, assuming the safety factor to be the same for all batches, \(j=1,2,\ldots ,n\), i.e., \(k_{1}=k_{2}=k\), and letting the lead-time to be zero and no investment on the part of the vendor, we get,

$$\begin{aligned}&ETC_i = \frac{D}{nQ(1-y)}[A+B+nF] + \frac{(s+wy)D}{1-y} \\&\quad + \frac{h_{v}Q(n-1)}{2} \\&\quad + \frac{h_{v}DQ^{1-l}}{P(1-y)}\left[ (1+(i-1)n)^{1-l}-((i-1)n)^{1-l} \right. \\&\quad \left. -\frac{n^{1-l}(i^{1-l}-(i-1)^{1-l})}{{2-l}}\right] \\&\quad +\frac{c(nQ)^{-l}D[i^{1-l}-(i-1)^{1-l}]}{P(1-y)(1-l)} \\&\quad + h_{b2}\left[ \frac{Q(1-y)}{2}+\frac{DQy}{2x(1-y)}\right] \\&\quad + h_{b1}\left[ Qy-\frac{DQy}{2x(1-y)}\right] \end{aligned}$$
(24)

Further, assuming that the buyer does not hold the defective items, but rejects them in a batch at the end of the screening period, we get \(h_{b1}=0\). In this case, Eq. (24) reduces to

$$\begin{aligned}&ETC_i = \frac{D}{nQ(1-y)}[A+B+nF] + \frac{(s+wy)D}{1-y} \\&\quad + \frac{h_{v}Q(n-1)}{2} \\&\quad + \frac{h_{v}DQ^{1-l}}{P(1-y)}\left[ (1+(i-1)n)^{1-l}-((i-1)n)^{1-l} \right. \\&\quad \left. -\frac{n^{1-l}(i^{1-l}-(i-1)^{1-l})}{{2-l}}\right] \\&\quad + \frac{c(nQ)^{-l}D[i^{1-l}-(i-1)^{1-l}]}{P(1-y)(1-l)} \\&\quad + h_{b2}\left[ \frac{Q(1-y)}{2}+\frac{DQy}{2x(1-y)}\right] \end{aligned}$$
(25)

which is the same expression as given in Khan et al. (2014).

Again, neglecting the learning in production and the cost of production [putting \(l=0\) and \(c=0\) in Eq. (23)], we get

$$\begin{aligned}&ETC = \frac{D}{nQ(1-y)}[A+B+nF] + \frac{D(s+wy)}{(1-y)} \\&\quad + \frac{h_{v}Q(n-1)}{2} \\&\quad + \frac{h_{v}QD}{P(1-y)}\left[ 1-\frac{n}{2}\right] \\&\quad + h_{b1}\left[ Qy- \frac{DQy}{2x(1-y)}\right] + \frac{\eta }{\delta }ln\left( \frac{y_{0}}{y}\right) \\&\quad + h_{b2}\left[ \frac{DQy}{2x(1-y)}+\frac{Q(1-y)}{2}+k_{1}\sigma \sqrt{pQ+b}\right] \\&\quad + \frac{\pi D\sigma }{Q(1-y)}\left[ \sqrt{pQ+b}\psi (k_{1})+(n-1)\sqrt{T_{b}}\psi (k_{2})\right] \end{aligned}$$
(26)

Now, if the safety factor is assumed to be the same for all the batches, Eq. (26) reduces to

$$\begin{aligned}&ETC = \frac{D}{nQ(1-y)}[A+B+nF] + \frac{D(s+wy)}{(1-y)} \\&\quad + \frac{h_{v}Q}{2}\left[ (n-1)-(n-2)\frac{D}{P(1-y)}\right] \\&\quad + h_{b1}\left[ Qy- \frac{DQy}{2x(1-y)}\right] \\&\quad + \frac{\pi D\sigma }{Q(1-y)}\sqrt{pQ+b}\psi (k) + \frac{\eta }{\delta }ln\left( \frac{y_{0}}{y}\right) \\&\quad + h_{b2}\left[ \frac{DQy}{2x(1-y)}+\frac{Q(1-y)}{2}+k_{1}\sigma \sqrt{pQ+b}\right] \end{aligned}$$
(27)

Also, assuming the lead time to be constant instead of a variable quantity in the above equation, we get,

$$\begin{aligned} ETC&= {} \frac{D}{nQ(1-y)}[A+B+nF] + \frac{D(s+wy)}{(1-y)} \\&\quad + \frac{h_{v}Q}{2}\left[ (n-1)-(n-2)\frac{D}{P(1-y)}\right] \\&\quad + h_{b1}\left[ Qy- \frac{DQy}{2x(1-y)}\right] + \frac{\pi D\sigma }{Q(1-y)}\sqrt{L}\psi (k) + \frac{\eta }{\delta }ln\left( \frac{y_{0}}{y}\right) \\&\quad + h_{b2}\left[ \frac{DQy}{2x(1-y)}+\frac{Q(1-y)}{2}+k\sigma \sqrt{L}\right] \end{aligned}$$
(28)

The cost function in Eq. (28) is the same as given by Dey and Giri (2014).

In the objective function given in Eq. (23), the control parameters are \(Q,y,k_{1}\) and n of which \(Q, y, k_1\) are positive real numbers (\(Q>0, 0\le y\le y_0,k_1>0\)) and n is a positive integer. It is not possible to show that the objective function is convex in all four decision variables. However, it can be verified numerically that, for given values of ni (positive integer) and \(y~ (0\le y \le y_{0}\le 1)\), the cost function \(ETC_i\) is convex in Q and \(k_1\). One instance of 3D-graph of \(ETC_i\) is shown in Fig. 5. In the following section, we develop a solution procedure to derive the optimal values of \(Q,y,k_{1}\) and n such that the joint expected annual total cost \(ETC_i\) is minimized.

Solution procedure

To develop a solution algorithm for the proposed model, we assume that for fixed values of i, Q, \(k_{1}\) and y, \(ETC_i\) is convex with respect to real n. Further, we assume that, for fixed values of ni and \(y~(0\le y \le y_{0}\le 1)\), \(ETC_i\) is convex in \(k_{1}\) and Q and for fixed values of ni, Q, \(k_{1}\), the expected cost function \(ETC_i\) is convex in y.

If we consider the first production cycle (\(i=1\)), and there is no learning in production (\(l=0\)), then considering the safety factor to be equal for all batches (\(k_{1}=k_{2}=k\)), \(ETC_i\) can be shown to be convex in n(real), k and Q as given below:

$$\begin{aligned} \frac{\partial ^{2}ETC_i}{\partial n^{2}}&= {} \frac{2D(A+K)}{n^3 Q(1-y)} >0 \end{aligned}$$
(29)
$$\begin{aligned} \frac{\partial ^{2} ETC_i}{\partial k^{2}}&= {} \frac{D\sqrt{pQ+b} \sigma \pi \phi (k)}{Q(1-y)}>0 \\ \frac{\partial ^{2} ETC_i}{\partial Q^{2}}&= {} \frac{2DG(n)}{Q^3(1-y)} - \frac{h_{b2}k \sigma p^2}{4(pQ+b)^{\frac{3}{2}}} \\&\quad + \frac{\pi D \sigma \psi (k)}{(1-y)} \left[ \frac{2 \sqrt{pQ+b}}{Q^3} \right. \\&\left. - \frac{p}{ Q^2 \sqrt{pQ+b}} - \frac{p^2}{4 Q (pQ+b)^{\frac{3}{2}}}\right] >0 \end{aligned}$$
(30)
$$\begin{aligned}&= {} \frac{2DG(n)}{Q^3(1-y)} - \frac{h_{b2}k \sigma p^2}{4(pQ+b)^{\frac{3}{2}}} \\&\quad + \frac{8b^2+12pQb+3p^2Q^2}{4Q^3(pQ+b)^(3/2)} \end{aligned}$$
(31)

where \(G(n)=\frac{A+B+nF}{n}\).

It is to be noted here that the sum of the first and third terms in Eq. (31) is much greater than the second term. Hence, the second derivative in (31) is effectively positive.

Hence, from Eqs. (30) and (31), \(ETC_i\) is seen to be convex in k and Q for fixed values of n and \(y~(0\le y \le y_{0}\le 1)\). Although y is bounded, it is not possible to prove conclusively that \(ETC_i\) is convex in y. So, in order to arrive at an optimal solution, the procedure suggested by Dey and Giri (2014) is followed here:

For fixed value of n, the first derivative of \(ETC_i\) w.r.t. k is set to zero. That is,

$$\begin{aligned} \frac{\partial ETC_i}{\partial k}=h_{b2}+\frac{\pi D}{Q(1-y)}(F(k)-1)=0 \end{aligned}$$
(32)

where \(F(\cdot )\) is the cumulative distribution function.

This gives

$$\begin{aligned} \overline{F}(k)=\frac{h_{b2}Q(1-y)}{\pi D} \end{aligned}$$
(33)

where \(\overline{F}(\cdot )=1-F(\cdot )\).

Next, taking the first derivatives of ETC with respect to Q and y and setting those equal to zero, we get

$$\begin{aligned} \frac{\partial ETC_i}{\partial Q}&= {} -\frac{DG(n)}{Q^2(1-y)}+yh_{b1}\left\{ 1-\frac{D}{2x(1-y)}\right\} \\&\quad +h_{b2}\left\{ \frac{1-y}{2}+ \frac{Dy}{2x(1-y)}\right\} \\&\quad + \frac{h_{b2}k \sigma p}{2 \sqrt{pQ+b}} \\&\quad + \frac{h_{v}}{2}\left\{ -1+n\left( 1-\frac{Dp}{1-y}\right) +\frac{2Dp}{1-y}\right\} \\&\quad - \frac{\pi D \sigma \psi (k)}{(1-y)} \\&\left[ - \frac{\sqrt{pQ+b}}{Q^2} + \frac{p}{2Q \sqrt{pQ+b}}\right] = 0 \end{aligned}$$
(34)

and

$$\begin{aligned} \frac{\partial ETC_i}{\partial y}&= {} \frac{Dw}{1-y}+\frac{D(s+wy)}{(1-y)^2}-\frac{\eta }{y\delta }+\frac{DG(n)}{Q(1-y)^2} \\&\quad +Qh_{b1}\left\{ 1-\frac{D}{2x(1-y)}\right\} \\\quad - \frac{DQyh_{b1}}{2x(1-y)^2} \\&\quad +h_{b2}\left\{ -\frac{Q}{2}+\frac{DQ}{2x(1-y)}+\frac{DQy}{(1-y)^2}\right\} \\&\quad + \frac{Qh_{v}}{2}\left\{ \frac{2Dp}{(1-y)^2}-\frac{Dnp}{(1-y)^2}\right\} \\\quad -\frac{\pi D \sigma \sqrt{pQ+b}\psi (k)}{Q(1-y)^2} = 0, \end{aligned}$$
(35)

respectively. From Eqs. (29)–(35), we see that the control parameters \(Q, y, k_1,\) and n are dependent. So, we follow an iterative procedure and modify the algorithm proposed by Dey and Giri (2014) to obtain the optimal solution of the present problem.

Algorithm

Step 1::

Set \(i = 1\) and \(ETC^{*}=\infty\).

Step 2::

Set \(n=1\) and \(ETC_i^{*}=\infty\).

Step 3::

Set \(y = y_{0}\), \(k_{1} = 0, k_{2}=0\), compute \(\psi (k_{1})\) and \(\psi (k_{2})\) and then compute \(Q = Q_{0}\) using the values of \(y_{0}, k_{1}, k_{2}, \psi (k)\) in equation (34).

Step 4::

Compute \(k_{1}\) from (33) using \(Q_{0},y\) and \(\psi (k_{1})=\int _{k_{1}}^{\infty }(z-k_{1})\phi (z)dz\).

Step 5::

Compute \(k_{2}\) using \(k_{1}\) and the relation \(k_{2}=k_{1}\sqrt{\frac{pQ+b}{T_b}}\).

Step 6::

Compute y from (35) using the values of k and \(Q_{0}\) obtained in the previous step. If \(y\ge y_{0}\), then we set \(y=y_{0}\).

Step 7::

Compute Q from (34) using the updated values of k and y. If \(|Q-Q_{0}|\le \epsilon\), then compute \(ETC_i(Q,k_1,y,n)\) and go to Step 8. Else, set \(Q_{0}=Q\) and go to Step 4.

Step 8::

If \(ETC_i^{*}\ge ETC_i\), then set the followings \(ETC_i^{*}=ETC_i, Q^{*}=Q, y^{*}=y,k_{1}^{*}=k_{1}, n=n+1\) and go to Step 3. Else, put \(n^{*}=n-1\) and stop. The corresponding values of the control parameters for \(n^{*}=n-1\) give the optimal solution for the fixed value of i.

Step 9::

If \(|ETC^* - ETC_i^*|>\epsilon\), set \(i=i+1\) and go to Step 2. Else, stop.

It is to be noted here that we only get a local optimum by adopting the solution procedure mentioned above. Since it is difficult to prove analytically that the objective function \(ETC_i\) is convex in all control parameters, we cannot claim that the solution obtained is a global optimum.

Now, we take the partial derivatives of ETC with respect to \(w, \delta , y_{0}\) and get

$$\begin{aligned} \frac{\partial ETC_i}{\partial w}&= {} \frac{Dy}{1-y} \end{aligned}$$
(36)
$$\begin{aligned} \frac{\partial ETC_i}{\partial \delta }&= {} \frac{- \eta log\left( \frac{y_{0}}{y}\right) }{\delta ^2} \end{aligned}$$
(37)
$$\begin{aligned} \frac{\partial ETC_i}{\partial y_{0}}&= {} \frac{\eta }{y_{0} \delta } \end{aligned}$$
(38)

From Eq. (36), we can infer that the annual expected total cost \(ETC_i\) increases as the warranty cost increases. ETC also increases with an increase in the (original) percentage of defective items, i.e., with an increase in \(y_{0}\), as evident from (38). Equation (37) shows that \(ETC_i\) decreases as \(\delta\) increases, i.e., there is a reduction in the number of defective items with each dollar increase in investment. Thus, we can say that our effective total cost may decrease if we improve the process quality. To further showcase the effects of the process quality, the investment option and other model-parameters on the optimal decisions, numerical studies are carried out in the following section.

Numerical results and discussion

For numerical study, we consider the following data set:

\(D = 1000\), \(P = 3200\), \(A = 50\), \(F = 35\), \(K = 400\), \(L = 10/365\), \({h}_{v} = 4\), \({h}_{b1} = 6\), \({h}_{b2} = 10\), \(s = 0.25\), \(x = 175{,}200\), \(w = 20\), \(\pi = 100\), \(b = 0.01\), \(T_b=0.005\), \(c=100{,}000\), \(l=0.32\), \(\sigma = 5\), \(y = 0.22\), \(\eta = 0.2\), \(\delta = 0.0002\), in appropriate units.

Fig. 5
figure 5

Convexity of \(ETC_i\)

Fig. 6
figure 6

Convexity of \(ETC_i\) with respect to Q and \(k_1\)

For fixed values of \(Q,k_1\) and n, we find that the annual expected cost function \(ETC_i\) is convex in \(y~(0\le y \le y_{0})\), see Fig. 5a. For fixed values of \(Q,k_1\) and \(y~(0\le y \le y_{0})\), we find that the annual expected cost function \(ETC_i\) is convex in n, see Fig. 5b. Further, the convexity of \(ETC_i\) for given values of n and y is shown with the help of a 3D-graph of \(ETC_i\) in Fig. 6. From Figs. (5) and (6), we can see that \(ETC_i\) is convex w.r.t. y, n(real), Q and \(k_{1}\). Applying the algorithm developed in the previous section, the optimal solution of the model is obtained for consecutive 10 cycles. The results are shown in Table 1.

Table 1 Optimal results

From Table 1, we see that, as the learning cycle increases, the expected total cost of the system decreases and also the investment made to improve the production process quality also decreases.

We now examine the sensitivity of the optimal results with respect to some important parameters of the model. Table 2 shows the effects of demand on the optimal results. For each value of D, we see that the expected total cost and the investment required to improve the production process quality of the system decrease as the number of cycles increases. It is further seen that an increased demand incurs a greater investment in terms of process quality control. Also, we can see that the smaller the value of D, the faster the learning curve becomes plateau.

Table 3 shows the effects of the warranty cost on the optimal result. We see that the percentage of defective items decreases with the increase in the amount of warranty. Table 4 shows that, with an increase in the original percentage of defective items present, y also increases. Also, with an increase in \(y_0\), the investment needed to improve the production process quality increases.

Table 5 shows that, as the production cost of the system decreases, the expected total cost of the system also decreases. Also it is seen that, a decrease in the value of c (vendor’s unit production cost) results in a faster plateauing of the learning curve. From Table 6, we see that the expected total cost and the investment required to improve the production process quality decrease as the learning exponent l increases. It is also seen that the average Q increases with l. All trends found in the results are intuitively correct and are similar to trends obtained in the existing literature.

Table 2 Sensitivity with respect to D
Table 3 Effect of w on the optimal solution
Table 4 Effect of \(y_{0}\) on the optimal solution
Table 5 Effect of changes in the production cost on the optimal solution
Table 6 Effect of learning rate on the optimal solution

Concluding remarks

This paper presents a single-vendor single-buyer integrated imperfect production-inventory model with learning in production and investment for process quality improvement. The lead-time is assumed to be lot-size dependent, and the safety stock factor is assumed to be different for the first batch and the rest of the batches. The annual expected total cost of the integrated system is derived and a simple iterative procedure is suggested to obtain the optimal values of the decision variables so as to minimize this cost. Numerical studies show that, as the cumulative number of production cycles increases, the expected annual total cost incurred by the integrated system decreases. It is also seen that the expected annual total cost and the investment required to improve the production process quality decrease, as the value of learning exponent increases. It is further observed that an increased demand rate requires an increased investment to minimize the expected annual cost incurred. There are ample scopes of future research based on the current work. The model studied here can be extended in terms of investment for controllable lead-time. Inspection errors can be introduced into the model as a possible extension. The proposed model can also be studied to include variable shipment size or multiple buyers.