Multiunit dynamic pricing with different types of observable customer information

Schur, Rouven

doi:10.1007/s00291-024-00759-x

Multiunit dynamic pricing with different types of observable customer information

Original Article
Open access
Published: 08 April 2024

Volume 46, pages 589–636, (2024)
Cite this article

Download PDF

You have full access to this open access article

OR Spectrum Aims and scope Submit manuscript

Multiunit dynamic pricing with different types of observable customer information

Download PDF

Rouven Schur ORCID: orcid.org/0000-0003-0247-8238¹

432 Accesses
Explore all metrics

Abstract

In various sectors, such as retail, firms encounter customers with multiunit demand and often implement nonlinear pricing to accommodate this demand structure. While effective, this pricing strategy lacks the adaptability offered by dynamic pricing, a trend gaining significance in the retail landscape due to technological advancements. Neglecting multiunit demand in dynamic pricing, however, can result in suboptimal prices and revenue losses. In response, this paper introduces multiunit dynamic pricing which integrates the strengths of both nonlinear and dynamic pricing strategies. We formulate a stage-wise optimization problem, considering customer preferences for batches of a product through a model based on random willingness-to-pay. The willingness-to-pay is influenced by a combination of the customer’s attraction to and consumption of the product—both private information. The firm, functioning as a monopoly, has the ability to price-discriminate between various order sizes by quoting nonlinear batch prices. Our investigation explores three cases of observable information: attraction to the product, consumption of the product, or both. Optimality conditions are derived for all cases, establishing a closed-form expressions for two of them. Additionally, we demonstrate the preservation of desirable monotonicity in time and capacity. Leveraging this monotonicity, we showcase the dynamics of the optimal pricing policy. A simulation study underscores the potential of our approach, highlighting the value of information in supporting strategic decisions, particularly regarding investments in customer profiling and segmentation. Furthermore, we illustrate how our solutions enable firms to make informed stocking and restocking decisions, providing practical insights for firms in multiunit dynamic pricing environments.

Supplier selection and order allocation: a literature review

Article 17 May 2021

A perspective analysis of obligatory vacation and retention of impatient purchaser on queueing-inventory with retrial policy

Article 11 June 2024

A strategic analysis of timing of wholesale pricing and information sharing strategy in dual-channel retailing

Article 14 June 2024

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

Nonlinear pricing has been a widespread practice in many industries, particularly in retail, for quite some time. The objective of this pricing strategy is to increase overall demand by tempting customers to buy more. Examples of nonlinear pricing include special offers such as “buy 3, pay 2” or “take an additional item, get 25% off,” as well as volume discounts where customers pay lower unit prices for purchasing more units.

Dynamic pricing, on the other hand, is a relatively new field but its significance has been growing in recent years, particularly with the emergence of e-commerce and digital price tags in physical stores. With the capability of quickly adjusting prices, dynamic pricing has had a significant impact on various industries such as travel, hospitality, entertainment, electricity, and retail. Through e-commerce platforms and loyalty cards, sellers have access to more information about a customer’s purchasing behavior which, combined with the ability to adjust prices, can significantly influence a seller’s earnings.

Standard dynamic pricing assumes that customers will buy only one unit at a time. While this assumption may be reasonable in some cases (such as car rental or hotel rooms), it is not applicable in many other situations, such as for most grocery and fashion products. Neglecting the possibility to influence customers’ purchase quantity can lead to suboptimal prices and lost revenue in these cases. To fully leverage the revenue potential in such fields, a combination of nonlinear and dynamic pricing is highly desirable.

This paper addresses precisely such a scenario by introducing nonlinear prices in a multiunit dynamic pricing setting. Here, customers are assumed to have multiunit demand and the product is available for purchase in all batch sizes, ranging from a single unit to the entirety of the remaining stock. The purchase quantity is influenced by a nonlinear pricing scheme, deviating from the traditional approach of quoting a single unit price with batch prices derived from multiplying batch size by unit price. The objective is to dynamically quote batch prices for a single product to maximize expected revenue. The selling horizon and product inventory are limited, and after prices are quoted for each batch size, customers purchase one of these batches or nothing at all.

Our model assumes customers hold an undisclosed willingness-to-pay for each batch size, optimizing their utility by choosing the batch size with the greatest surplus over the quoted price. To capture this, we use a two-parameter approach—integrating a base willingness-to-pay reflecting interest and a consumption indicator signaling diminishing marginal appreciation. Both parameters are modeled as random variables. Additionally, we explore scenarios where the firm gains insight into the next customer’s choice parameters, observing their base willingness-to-pay, consumption indicator, or both before quoting batch prices. This mirrors practices in personalized online pricing, profiling logged-in customers based on purchase history. For non-logged-in customers, technologies like applets and cookies facilitate customer profiling [see e.g., Raghu et al. (2001)].

We contribute to the multiunit dynamic pricing literature through a stochastic dynamic optimization model. This model quotes batch prices, influencing random customer demand to maximize expected revenue. We consider various types of observable information about customer choice parameters, adapting the model for each type and solving it analytically. Our study reveals key properties of the value function and optimal batch prices. Notably, we prove the monotonicity of expected revenues with respect to capacity and time, in the context of multiunit purchases. This property aligns with an intuitive understanding of pricing dynamics relative to product scarcity. Importantly, the monotonicity in capacity ensures a unique optimal solution in our stage-wise optimization.

In our simulation study, we examine the value of information by comparing the three types of observation. Additionally, we consider a scenario where the firm lacks the ability to observe customers’ choice parameters. For this situation, Schur (2023) proposed a heuristic solution mechanism, which we briefly explain and apply. Furthermore, we assess the impact of distribution on expected revenues, assuming both uniform and normal distributions. In another study, we relax the assumption of precise customer information observation. Instead, we allow the firm to accurately assign customers to predefined segments, narrowing down the distribution of corresponding random variables. Lastly, we introduce an additional layer of decision-making: stocking and restocking.

The implications of our work can be summarized as follows:

When the firm accurately observes one or both of customers’ choice parameters, our models offer optimal batch prices for every state in the selling horizon. Moreover, knowing the optimal expected revenue for every stocking level enables the firm to make optimal stocking and restocking decision.
Understanding the value of information allows firms to evaluate the profitability of potential investments in customer profiling or segmenting, contributing to strategic decision-making in the ever-evolving landscape of dynamic pricing.
Our value function serves as an upper bound in restricted pricing scenarios, aiding firms in assessing potential revenue losses from pricing structure limitations (e.g., linear price with volume discounts like “3 for 2”).
In settings where customer parameters are unobservable, our findings offer valuable structural insights. These insights can serve as the foundation for effective heuristics, ranging from simple business rules (e.g., “additional units at a 10% discount”) to more sophisticated strategies (refer to Schur 2023).

This paper is organized as follows: In Sect. 2, we give a short overview of existing literature connected to multiunit dynamic pricing. We then present the setting, the customer choice as well as a general optimization model in Sect. 3. In Sect. 4, we present three adjustments to the general model to deal with three types of observations regarding customer choice parameters. In Sect. 5, we conduct our simulation study.

2 Literature review

In this paper, we extend dynamic pricing with nonlinear pricing. Accordingly, we start by shortly reviewing literature from both streams, nonlinear pricing and dynamic pricing. Thereafter, we focus on research belonging to multiunit or multiproduct dynamic pricing. The first category also covers the (scarce) literature on nonlinear dynamic pricing (as nonlinear pricing requires multiunit demand). The second category is primarily related to our setting because of the applied customer choice models where customer choose one of several options.

Nonlinear pricing is an often-applied pricing scheme that can be found in many industries including e.g. telecommunications, transportation, energy, supply chains, and retail. This broadness results in a diverse body of literature and is addressed by Wilson (1993) by giving an overview of application, substantial economics and marketing. Most nonlinear pricing research from the economics literature considers only static pricing. This can be observed in the review articles of Stole (2007) and Armstrong (2016) where only a small portion of covered literature assumes a dynamic environment. This literature commonly assumes these dynamics stem from competing firms and lock-in effects of recurring customers. More relevant to our setting is literature that focus on a dynamic environment stemming from dynamic demand [e.g., Dhebar and Oren (1986), and Braden and Oren (1994)]. However, different to our setting, this research does not consider a product with limited stock which is one of the core assumptions in most of dynamic pricing literature [cf., e.g., Talluri and van Ryzin (2004, Chapter 5) and Phillips (2005, Chapter 10)].

In the following, we turn our attention on dynamic pricing of a product with limited stock, finite selling horizon, and customer choice behavior. One of the first to consider such a setting were Gallego and van Ryzin (1994) who showed that the optimal price increases with remaining time and decreases with remaining stock. Their work laid the foundation for dynamic pricing as an emerging discipline in revenue management, which, during that period, was predominantly influenced by capacity control. Afterwards, dynamic pricing gained a lot of attention by researchers. They often focused on finding optimality conditions and showing monotonic behavior in time and capacity. This research was summarized by several review articles [e.g., Bitran and Caldentey (2003), Chiang et al. (2007), and, with a special focus, Gönsch et al. (2013) and den Boer (2015)] as well as textbooks [e.g. Talluri and van Ryzin (2004, Chapter 5) and Phillips (2005, Chapter 10)].

By dropping the common assumption that customers purchase at most one unit of a product, a new stream in the dynamic pricing community was born. Multiunit dynamic pricing considers multiunit demand, which, in turn, is the basis for combining nonlinear and dynamic pricing. An early publication in this stream is Elmaghraby et al. (2008) where the optimal design of a markdown pricing mechanism in the presence of multiunit demand was analyzed. They assume a full information setting meaning that the firm knows at the beginning of the selling horizon every customer and the respective willingness-to-pay values. This setting aligns to one of the three scenarios in our study (refer to Sect. 4.3). In this scenario, we assume full information availability regarding the current customer. In contrast, the other two scenarios (outlined in Sects. 4.1 and 4.2) involve customer decisions based on private information, rendering them unpredictable in advance. Furthermore, future revenues remain uncertain across all scenarios. Thereby, we acknowledge that firms typically cannot accurately predict specific customer streams or their purchasing behavior with absolute certainty. Levin et al. (2014) introduce a dynamic pricing model with stochastic batch demand. They assume customers have a certain batch size they want to purchase and request exactly and only this batch size from the firm. The firm then quotes a price and customers either buy the batch or leave the shop without purchasing anything. The authors show optimality conditions and prove monotonic properties of optimal policy and value function. However, their setting does not accommodate nonlinear pricing as customers exhibit inflexibility in their purchase quantities (such as a family buying flight tickets for their vacation), rendering firms unable to influence these quantities through the application of appropriate nonlinear batch prices. In our study, we presume customers to be flexible concerning batch sizes, as is common in retail scenarios. Instead of specifying a particular batch size, customers observe quoted nonlinear prices for various batch sizes and select the one that maximizes their utility. This flexibility introduces complexity to the optimization model, as firms must decide on several prices simultaneously while anticipating a broader range of potential customer reactions.

There are currently only two other research articles that consider stochastic flexible multiunit demand that can be influenced by nonlinear dynamic pricing: Gallego et al. (2020) and Schur (2023). Gallego et al. (2020) consider three dynamic pricing models: nonlinear, linear, and block pricing. They consider utility maximization choice models where customers are characterized by one single parameter. This parameter cannot be observed by the firm and is modeled as random variable. The authors give optimality conditions for their nonlinear dynamic pricing model and show structural properties like the monotonicity in time and inventory. Their work is related to this paper in the following way: Our scenarios where a firm can observe one of two choice parameters is an extension to a special case of their nonlinear pricing model. The key distinction in our approach lies in our customer choice model, where customers are characterized by two parameters: one reflecting the product’s attractiveness and another indicating the inclination to purchase multiple units. This allows for nuanced variations among customers. For instance, a parent may value diapers more than a childless individual, and a family might be more prone to buying several packs of toilet paper compared to someone living alone. Schur (2023) considers the same customer choice model as we do, but without the possibility to (partially) observe customers’ private information. In the absence of such information, the optimization model becomes analytically intractable, leading Schur (2023) to develop three heuristics. These heuristics, utilizing fluid approximation, are designed to be asymptotically optimal. In our setting, where we assume partial observation of current customer information, we encounter distinct yet related optimization problems that can be solved to optimality (numerically). Additionally, we demonstrate that the well-known monotonicity in time and inventory persists in our scenario. Lastly, our simulation study explores the value of knowing customers’ private information, contributing to the understanding of the situational and contextual value of different types of information.

Multiproduct dynamic pricing is another field in the domain of dynamic pricing that emerged more and more in recent years [see for a review, e.g., Chen and Chen (2015)]. By defining batches of a single product as several different products, we can draw a parallel between multiproduct and multiunit dynamic pricing. In multiproduct dynamic pricing the products often are substitutes and an upcoming customer can pick at most one of these products. Customers’ demand is stochastic and the firm is facing a finite selling horizon with scarce product-dependent inventory [see, e.g., Zhang and Cooper (2009), Dong et al. (2009), and Akçay et al. (2010)]. The main difference between multiunit (i.e., our work) and these multiproduct dynamic pricing models is the inventory structure. Whereas every product has its own inventory in multiproduct dynamic pricing, every batch (“product”) exploits the same inventory (but in another quantity) in multiunit dynamic pricing. One exception to the product-specific inventory setting is Maglaras and Meissner (2006). They analyze a setting where every product consumes one unit of the same resource. This assumption leads to different pricing dynamics when compared to our setting, where each product has varying resource consumption based on batch size. Consequently, in our context, each batch price reacts differently to changes in dynamic scarcity, unlike their setting where all products equally respond to the dynamic scarcity of the common resource. Maglaras and Meissner (2006) show that dynamic pricing and capacity control can be reduced to a common formulation. Instead of concentrating on dynamic pricing or capacity control, the firm finds optimal decisions by controlling the consumption rate of every product regarding resource capacity.

In our literature review, it becomes evident that research on nonlinear dynamic pricing is exceptionally limited, with only two notable exceptions: Gallego et al. (2020) and Schur (2023). However, these works have distinctive characteristics that set them apart. Gallego et al. (2020) focuses on a model where customer behavior is characterized by a single (random) parameter, whereas our approach involves two (random) parameters. This enables us to capture a more individualized customer choice behavior and introduces additional uncertainty into the optimization problem. On the contrary, Schur (2023) employ the same customer choice model as we do. However, different from our setting, they cannot observe customers’ private information. With these observations, we (numerically) solve the optimization model to optimality and determine the value of information in a simulation study. Notably, other works in the field diverge significantly in at least one critical assumption, leading them to analyze distinct settings. In many cases, these works do not consider customers with flexible multiunit demand, and consequently, do not explore the application of nonlinear pricing schemes to influence stochastic purchase quantities.

3 Problem definition

After introducing general setting and notation in Sect. 3.1, we present the customer choice model in Sect. 3.2. Building on this, we finally introduce the optimization model in Sect. 3.3.

3.1 General setting and notation

We introduce the following framework to combine nonlinear and dynamic pricing: A monopolistic firm sells a single product with fixed stock $C$ over a finite selling horizon with $T$ periods. The selling horizon is indexed backwards in time, i.e., periods $T$ and $0$ mark the beginning and the end, respectively. We assume that exactly one customer arrives in each period $t = T, T-1, \dots , 1$ and is interested in buying one or more units of the product depending on the batch prices the firm is quoting. The capacity of the product is nonreplenishable and any capacity left at the end of the selling horizon ($t=0$) is worthless to the firm. At any point in time $t$, the firm decides on batch prices ${\varvec{r}}={\left({r}_{1}, {r}_{2}, \dots , {r}_{c}\right)}^{T}$ based on remaining capacity $c\le C$ and expectations of future demand. Thereby, the remaining capacity $c$ defines the maximal possible batch size $j$ that could be offered. Each ${r}_{j}$ represents the price a customer must pay for a batch of $j$ units. The firm’s goal is to set the prices that maximize overall revenue, taking into account future demand and customer behavior. Arriving customers react on quoted batch prices and decide on the batch size to purchase, with ${p}_{j}\left({\varvec{r}}\right)$ denoting the probability that an arriving customer chooses to buy $j$ units. In this case, the firm immediately earns ${r}_{j}$ in revenue and product’s capacity is lowered by $j$. Throughout the remainder of this paper, to simplify our notation, we adopt the convention that ${r}_{0}=0$.

3.2 Customer choice model

In our setting, customers face several options (i.e., batch sizes, including also a batch of zero) and pick exactly one of these. We assume that customers have a personal (unknown to the firm) evaluation for each option and this evaluation can be expressed monetarily via customers’ willingness-to-pay. The utility, representing the difference between customers’ willingness-to-pay and the price, determines the choice, with customers opting for the option that yields the highest utility. This model is commonly employed in economic and pricing literature as it captures customers heterogeneity regarding their preferences (via personal willingness-to-pay) and firm’s influence on customers’ decision (via price). Moreover, it relies on a sound theoretical groundwork, as it aligns with economic principles and the rationale that individuals make decisions based on perceived value and cost considerations. Specifically, if customers face multiple options rather than a binary decision (such as purchasing or not), this model is often applied [see, e.g., Braden and Oren (1994) with their nonlinear (static) pricing setting, and Akçay et al. (2010) with their multiproduct dynamic pricing setting].

Customers’ willingness-to-pay ${X}_{j}$ for a batch of size $j$ is private information and unknown to the firm. This makes ${X}_{j}$ a random variable and a proper model is needed to reflect customers’ preferences. In literature, a common assumption is that marginal willingness-to-pay, i.e., ${X}_{j+1}-{X}_{j}$, is non-negative and decreasing [see, e.g., Baucells and Sarin (2007), Goldman et al. (1984), Iyengar and Jedidi (2012), and Gallego et al. (2020)]. This assumption translates to: “An additional unit is never bad, but it is less appreciated than the previous one.” There are several methods to model random willingness-to-pay in settings where customers buy in batches. The model we apply is based on a formulation of Iyengar and Jedidi (2012) and was also applied by Schur (2023). Iyengar and Jedidi (2012) introduce a willingness-to-pay function that depends on known parameters. Uncertainty regarding customers’ behavior is then added with the help of an error term. Schur (2023) adapt this willingness-to-pay function. However, instead of using known parameters and adding randomness via an error term, the parameters itself are assumed to be private information, and thus, depicted by random variables. We follow the latter approach and define the willingness-to-pay $X_{j}$ for a batch size of $j$ by:

$$X_{j} = \omega \cdot \mathop \sum \limits_{k = 0}^{j - 1} \left( \lambda \right)^{k} \quad {\text{for}}\;j = 1, \ldots , c,$$

(1)

with independent continuous random variables $\omega$ and $\lambda$. We denote the corresponding density functions by ${f}_{\omega }$ and ${f}_{\lambda }$. Likewise, the cumulative distribution functions are given by ${F}_{\omega }$ and ${F}_{\lambda }$. We assume the support of both density functions is $\left[0, 1\right]$. Furthermore, we make an assumption regarding the continuous failure rates of both random variables, $\omega$ and $\lambda$, defined over the interval $\left(0, 1\right]$ by ${h}_{\omega }\left(x\right)=\frac{{f}_{\omega }\left(x\right)}{1-{F}_{\omega }\left(x\right)}$ and ${h}_{\lambda }\left(x\right)=\frac{{f}_{\lambda }\left(x\right)}{1-{F}_{\lambda }\left(x\right)}$, respectively. We assume that these failure rates are increasing in $x$. This assumption ensures the existence of a unique solution to our optimization problem, as evidenced by the proof of Propositions 1 and 5. This is consistent with common practices in the standard literature, as random variables with increasing failure rates have an increasing generalized failure rate [see Lariviere (2006)]. It aligns with one of the three standard assumptions mentioned in Ziya et al. (2004). Furthermore, it is compatible with numerous probability distributions, including but not limited to the uniform, triangular, normal, exponential, Weibull, Gumbel, gamma distributions, and their truncated variants (some of them with restrictions regarding parameter choice) [see Banciu and Mirchandani (2013)].

By restricting $\lambda$ on $\left[0, 1\right]$, we assure that marginal willingness-to-pay, i.e., ${X}_{j+1}-{X}_{j}=\omega \cdot {\lambda }^{j}$, is non-negative and decreases in quantity $j$ (given $\omega \ge 0$). Thereby, this model covers the common assumption regarding customers’ preferences that was stated earlier in this section. Restricting $\omega$ on $\left[0, 1\right]$ is only a matter of scaling and normalizes marginal willingness-to-pay. The interpretation of the random variables, $\omega$ and $\lambda$, is the following: As $\omega$ equals ${X}_{1}=\omega \cdot \sum_{k=0}^{0}{\left(\lambda \right)}^{k}=\omega$ and influences ${X}_{j}=\omega \cdot \left(\sum_{k=0}^{j-1}{\left(\lambda \right)}^{k}\right)$, $j\ge 2$, in a linear manner, we can interpret it as attractiveness of the product to the customer. We call this parameter base willingness-to-pay. In contrast, the consumption indicator $\lambda$ has no influence on ${X}_{1}=\omega$, but depicts the rate at which marginal willingness-to-pay is diminishing in $j$. This can be observed by ${X}_{j+1}-{X}_{j}=\omega \cdot {\lambda }^{j}=\lambda \cdot \left(\omega \cdot {\lambda }^{j-1}\right)=\lambda \cdot \left({X}_{j}-{X}_{j-1}\right)$. We can interpret $\lambda$ as customers’ willingness to stockpile or consume.

The following figure provides an illustrative representation of willingness-to-pay curves for three specific customers, each characterized by unique realizations of random variables $\omega$ and $\lambda$, denoted as $w$ and $l$, respectively.

In this example, customer 1 (dashed line) shares the same base willingness-to-pay ($w=0.8$) with customer 2 (dotted line), and the same consumption indicator ($l=0.8$) with customer 3 (solid line). Consequently, customers 1 and 2 exhibit identical willingness-to-pay values for a batch size of 1, implying they have a similar valuation of the product. However, a notable distinction arises when we examine the curves further. While the solid curve steadily increases until $j=10$, the dotted curve reaches a relatively constant level at $j=4$. This divergence stems from the fact that customer 1, with a consumption indicator twice as high, is significantly more interested in purchasing larger batches compared to customer 2.

Comparing customer 1 and 3, we observe that the solid line consistently falls exactly between the dashed line and zero. This is a direct consequence of both customers having the same consumption indicator, but with customer 3 having only half the base willingness-to-pay of customer 1. As a result, customer 1 is willing to pay twice as much as customer 3, indicating a substantially higher appreciation for the product.

From a theoretical perspective, if the firm were given the choice among the three customers, it would naturally prefer to serve customer 1, as it can charge the highest prices for each batch size. However, when deciding between customer 2 and 3, the choice is less clear-cut. When facing a stock shortage, serving customer 2 might be preferable, while in situations with ample stock availability, customer 3 could be the better option.

Briefly leaving the example behind us allows for the definition of customers’ utility. The utility ${u}_{j}\left({\varvec{r}}\right)$ for purchasing $j$ units is the difference between their willingness-to-pay $X_{j}$ and price $r_{j}$:

$$u_{j} \left( {\varvec{r}} \right) = X_{j} - r_{j} \quad {\text{for}}\;j = 1, \ldots , c.$$

(2)

Customers act rational and choose the option that yields the highest utility. Thus, they purchase $j$ units if and only if ${u}_{j}\left({\varvec{r}}\right)=\underset{j=0, \dots ,c}{{\text{max}}}\left\{{u}_{j}\left({\varvec{r}}\right)\right\}$ with ${u}_{0}\left({\varvec{r}}\right)=0$ denoting the no-purchase option.

Resuming the previous example (Fig. 1), we introduce an arbitrary price vector (as shown by the red line in Fig. 2, left side). It’s important to highlight that while we use a linear pricing scheme in this particular illustration, our model is not restricted to linear pricing and explicitly accommodates non-linear pricing structures. The application of Eq. (2) results in the generation of three distinct utility curves (depicted in Fig. 2, right side), one for each customer.

Upon close examination, we can observe that customer 1 (dashed line) has maximal utility at $j=3$, customer 2 (dotted line) at $j=1$, and customer 3 (solid line) at $j=0$. In a scenario where these three customers collectively constitute the entire market and each customer’s arrival is equally likely, the firm would have the following probabilities of selling units with this price vector: $0$ units, $1$ unit, or $3$ units, each with a probability of $\frac{1}{3}$.

Given our assumption that $\omega$ and $\lambda$ are continuous random variables, we find ourselves in a realm with an infinite number of willingness-to-pay curves, each representing a specific customer. In this expansive landscape, it is impractical to individually assess every customer to pinpoint where their maximum utility lies, as we did in the example. Instead, when presented with a specific price vector ${\varvec{r}}$, we want to determine which utility curves, described as combinations of $w$ and $l$, have their maximum at batch size $j$. In essence, for any given $j$, we seek all $\left(w, l\right)$ pairs for which ${u}_{j}\left({\varvec{r}}\right)=\underset{j=0, \dots ,c}{{\text{max}}}\left\{{u}_{j}\left({\varvec{r}}\right)\right\}$. According to Eq. (2), this condition holds for all $\left( {w, l} \right)$ that satisfy:

$$\begin{gathered} w \cdot \mathop \sum \limits_{k = 0}^{j - 1} l^{k} - r_{j} \ge w \cdot \mathop \sum \limits_{k = 0}^{i - 1} l^{k} - r_{i} \quad {\text{for}}\;i = 1, \ldots , c\;{\text{and}} \hfill \\ w \cdot \mathop \sum \limits_{k = 0}^{j - 1} l^{k} - r_{j} \ge 0. \hfill \\ \end{gathered}$$

(3)

To compute the probability that the next utility curve we encounter attains its maximum at $j$, we must calculate the probability that $\left(w, l\right)$ meets these conditions. This can be achieved using the density functions ${f}_{\omega }$ and ${f}_{\lambda }$ in combination with an indicator function ${1}_{\left\{{u}_{j}\left({\varvec{r}}\right)=\underset{j=0, \dots ,c}{{\text{max}}}\left\{{u}_{j}\left({\varvec{r}}\right)\right\}\right\}}\left(w,l\right)$. This indicator function equals $1$ when the condition is met and $0$ otherwise. Notably, this probability is technically equivalent to the probability $p_{j} \left( {\varvec{r}} \right)$ of selling $j$ units for a given price ${\varvec{r}}$ and we can express it as:

$$p_{j} \left( {\varvec{r}} \right) = \mathop \smallint \limits_{0}^{1} \mathop \smallint \limits_{0}^{1} f_{\omega } \left( w \right)f_{\lambda } \left( l \right)1_{{\left\{ {u_{j} \left( {\varvec{r}} \right) = \mathop {\max }\limits_{j = 0, \ldots ,c} \left\{ {u_{j} \left( {\varvec{r}} \right)} \right\}} \right\}}} \left( {w,l} \right) dwdl\quad {\text{for}}\;j = 1, \ldots , c.$$

(4)

3.3 Dynamic programming formulation

A firm maximizes expected revenue over the whole selling horizon by solving a dynamic optimization problem. Thereby, it searches for the optimal batch prices ${r}_{j}$, $1\le j\le c$, to offer at every time $t$ with remaining capacity $c$. The maximal number of purchasable units equals the remaining capacity in every state $\left(t, c\right)$. To take the varying character of remaining capacity into account, we define a state-dependent action space ${\mathcal{R}}_{c}=\left\{{\varvec{r}}\in {\mathbb{R}}^{c}: {r}_{j}\ge 0, j=1, \dots ,c\right\}$ with ${\mathcal{R}}_{0}=\varnothing$. Action space ${\mathcal{R}}_{c}$ defines the set of feasible solutions to our maximization problem. By taking the remaining capacity $c$ into account, it makes sure that only available batch sizes $j\le c$ are offered. The dynamic problem is given by:

$${V}_{t}\left(c\right)=\underset{{\varvec{r}}\in {\mathcal{R}}_{c}}{{\text{max}}}\left\{\sum_{j=1}^{c}{p}_{j}\left({\varvec{r}}\right)\cdot \left({r}_{j}+{V}_{t-1}\left(c-j\right)\right)+\left(1-\sum_{j=1}^{c}{p}_{j}\left({\varvec{r}}\right)\right)\cdot {V}_{t-1}\left(c\right)\right\}$$

(5)

where ${V}_{t}\left(c\right)$ denotes the optimal expected revenue-to-go from period $t$ onwards with remaining capacity $c$. The boundary conditions are ${V}_{0}\left(c\right)=0$ for $c\ge 0$ and ${V}_{t}\left(0\right)=0$ for $t\ge 0$.

In every state, one out of $c+1$ random events occurs: A customer purchases $0\le j\le c$ units at a price of ${r}_{j}$ with probability ${p}_{j}\left({\varvec{r}}\right)$. Additionally, the firm can expect future revenues from remaining capacity $c-j$ and time $t-1$. We denote the optimal batch prices selected in a state $\left(t,c\right)$ by ${{\varvec{r}}}_{t}\left(c\right)\in {\mathcal{R}}_{c}$.

An alternative formulation of (5) focuses on opportunity costs regarding selling $j$ units, i.e.

$${\Delta }_{j} V_{t} \left( c \right) = V_{t} \left( c \right) - V_{t} \left( {c - j} \right)\quad {\text{for}}\;j = 1, \ldots , c,$$

(6)

and is given by

$$V_{t} \left( c \right) = \mathop {\max }\limits_{{{\varvec{r}} \in {\mathcal{R}}_{c} }} \left\{ {\mathop \sum \limits_{j = 1}^{c} p_{j} \left( {\varvec{r}} \right) \cdot \left( {r_{j} - {\Delta }_{j} V_{t - 1} \left( c \right)} \right)} \right\} + V_{t - 1} \left( c \right).$$

(7)

Thus, the goal to maximize expected revenues can be achieved by maximizing additional revenue gains that are realized by selling up to $c$ units in period $t$ instead of retaining the capacity for later customers. This formulation offers several advantages over (5). The first and most apparent advantage is the immediate insight that optimal prices should surpass opportunity costs. Failing to do so would result in no gain in expected revenue by selling, or worse, it could even lead to a net loss in overall expected revenue. Another advantage becomes evident in later sections as we establish key properties based on formulation (7). These properties are crucial in our pursuit to find the optimal solution of our optimization problem. Lastly, it underscores the significance of opportunity costs, which constitute the sole state-dependent component and are the primary driver behind the dynamic changes in optimal prices over time.

4 Different types of observable information

In this section, we consider different degrees of observability regarding next customer’s private information, i.e. base willingness-to-pay $\omega$ and consumption indicator $\lambda$. In three subsections, we assume that the firm knows at customer’s arrival the exact value of base willingness-to-pay, consumption indicator, or both parameters, respectively. Each of these subsections shows the adapted problem formulation, structural properties, and optimal solution (or at least a sufficient condition for optimality).

4.1 Observable base willingness-to-pay

We now consider the case where a firm can observe the base willingness-to-pay of the next customer in line, i.e. the realization $w$ of random variable $\omega$ becomes known at the moment the firm decides upon the next batch prices. Consumption indicator $\lambda$ remains stochastic. Thereby, we eliminate some but not all of uncertainty regarding customers’ behavior.

4.1.1 Customer choice and model formulation

Selling at least one unit of the product is now a deterministic occurrence. Notably, for ${r}_{1}<w$, we know for certain that a customer has a higher utility for purchasing one unit than for purchasing nothing at all (${u}_{1}\left({\varvec{r}}\right)=w-{r}_{1}$ is deterministic and positive). However, we still face uncertainty regarding the precise number of units purchased, as we do not know if there are ${u}_{j}\left({\varvec{r}}\right)$ values exceeding ${u}_{1}\left({\varvec{r}}\right)$.

A customer is indifferent between purchasing zero and one unit of the product when ${r}_{1}=w$. As a tiebreaker, a firm could quote a price that is slightly above or below $w$ (${w}^{+}$ and ${w}^{-}$, respectively), depending on which outcome would be more suitable. Taking these two strategies explicitly into account would result in increased complexity of notation without adding to understandability. In most instances, a firm prefers customers to purchase at price $w$. Consequently, we will assume $w$ to act as ${w}^{-}$ without further mention. However, there are situations where the firm may not want to sell at $w$ (e.g., if $w$ is too low). In such cases, we will explicitly indicate that the firm employs ${w}^{+}$. Moreover, we ignore the case where a customer might have $w=0$. This case almost surely does not occur (recall that $\omega$ is continuously distributed), and even if it were to occur, it would have no impact. For a customer with a willingness-to-pay of zero for every batch size (as per Eq. (1)), there would be no price at which the customer desires to buy while the firm wishes to sell simultaneously. Consequently, the optimal solution in this case would be not to sell anything to that customer.

Observing realization $w$ has the advantage that we can formulate necessary conditions for a customer to purchase $j$ units:

(a)
$\sum_{k=0}^{j-1}{\lambda }^{k}\ge \frac{{r}_{j}}{w}$,
(b)
$\sum_{k=i}^{j-1}{\lambda }^{k}\ge \frac{{r}_{j}-{r}_{i}}{w}$ for all $i\in \left\{1, 2, \dots , j-1\right\}$, and
(c)
$\sum_{k=j}^{i-1}{\lambda }^{k}\le \frac{{r}_{i}-{r}_{j}}{w}$ for all $i\in \left\{j+1, j+2, \dots , c\right\}$.

Conditions (a)–(c) arise from (3) with $\lambda$ instead of $l$ and by separating all known variables, i.e., decision variable ${\varvec{r}}$ and realization $w$, from random variable $\lambda$. These conditions ensure that purchasing $j$ units yields at least the same utility for customers as purchasing nothing (condition (a)), purchasing less than $j$ units (condition (b)), and purchasing more than $j$ units (condition (c)).

Based on these conditions, there are several ways to eliminate demand for $j$ units:

Picking batch price ${r}_{j}>j\cdot w$ makes it impossible to fulfill condition (a) for any $\lambda \in \left[\mathrm{0,1}\right]$.
If ${r}_{j}>{r}_{i}$ for any $i>j$, then there is no $\lambda \in \left[\mathrm{0,1}\right]$ that satisfies condition (c).
Picking ${r}_{j}$ such that ${\left(\frac{{r}_{j}-{r}_{j-1}}{w}\right)}^{\frac{1}{j-1}}>1$ makes it impossible to fulfill condition (b) for any $\lambda \in \left[\mathrm{0,1}\right]$.
With batch prices ${r}_{j-1}$, ${r}_{j}$, and ${r}_{j+1}$ such that ${\left(\frac{{r}_{j+1}-{r}_{j}}{w}\right)}^\frac{1}{j}<{\left(\frac{{r}_{j}-{r}_{j-1}}{w}\right)}^{\frac{1}{j-1}}$, there is no $\lambda \in \left[\mathrm{0,1}\right]$ such that ${\lambda }^{j-1}\ge \frac{{r}_{j}-{r}_{j-1}}{w}$ and ${\lambda }^{j}\le \frac{{r}_{j+1}-{r}_{j}}{w}$ simultaneously (conditions (b) and (c) with $i=j-1$ and $i=j+1$, respectively).

In a scenario characterized by limited capacity, it becomes crucial to possess the capability to eliminate demand for any batch size $j$. There are two primary reasons why we seek this capability: firstly, we might encounter a situation where our capacity $c$ is insufficient to fulfill an order of $j$ units (i.e., $c<j$), and secondly, it may be more financially advantageous to reserve capacity for potential future customers. The latter circumstance arises when we are currently serving a customer with an exceptionally low willingness-to-pay, which is indicated by an exceedingly low value of $w$. We can establish a formal criterion for $w$ being too low by referring to Eq. (7). This equation reveals that ${r}_{j}$ should exceed ${\Delta }_{j}{V}_{t-1}\left(c\right)$ to increase overall expected revenue. The maximum possible willingness-to-pay for $j$ units by a customer is given by $j\cdot w$ (as per Eq. (1) with $\lambda =1$). When dealing with a customer whose $w$ falls below $\frac{{\Delta }_{j}{V}_{t-1}\left(c\right)}{j}$, there is no viable way to sell $j$ units without incurring a loss in overall expected revenue. In such cases, the firm’s preference is not to sell $j$ units to this customer, and we must ensure that at least one ${\varvec{r}}$ is feasible such that ${p}_{j}\left({\varvec{r}}\right)=0$.

Referring back to the previous points, we have ascertained that there exist numerous potential choices of ${r}_{j}$ to eliminate demand for $j$ units. Given that the primary goal of these ${r}_{j}$ is to abstain from selling, it becomes immaterial which specific ${r}_{j}$ is employed for this purpose. These observations prompt us to exclude the majority, though not all, of these alternatives from the action space ${\mathcal{R}}_{c}$. In the ensuing lemma, we define a refined action space that assumes a crucial role in this section. This set is denoted as ${\mathcal{R}}_{c}\left(w\right)$ and its elements are referred to as relevant prices, as we have removed only those prices deemed irrelevant.

Lemma 1

Relevant prices ${\varvec{r}}$ are given by

$${\mathcal{R}}_{c}\left(w\right)=\left\{{\varvec{r}}\in {\mathbb{R}}^{c}: 0\le {r}_{1}\le {w}^{+}, \text{and }0\le {\left(\frac{{r}_{j}-{r}_{j-1}}{w}\right)}^{\frac{1}{j-1}}\le {\left(\frac{{r}_{j+1}-{r}_{j}}{w}\right)}^\frac{1}{j}\le 1 \text{for }2\le j\le c-1\right\}.$$

Proof

Firstly, it is essential to recognize that the definition of ${\mathcal{R}}_{c}\left(w\right)$ is derived exclusively by excluding any price vector that satisfies one of the conditions outlined in the bullet points above. Specifically, the first and third bullet points correspond to ${r}_{1}\le {w}^{+}$ and ${\left(\frac{{r}_{j}-{r}_{j-1}}{w}\right)}^{\frac{1}{j-1}}\le 1$, the second to $0\le {r}_{1}$ and $0\le {\left(\frac{{r}_{j}-{r}_{j-1}}{w}\right)}^{\frac{1}{j-1}}$, and the fourth to ${\left(\frac{{r}_{j}-{r}_{j-1}}{w}\right)}^{\frac{1}{j-1}}\le {\left(\frac{{r}_{j+1}-{r}_{j}}{w}\right)}^\frac{1}{j}$.

The fundamental concept behind this proof is straightforward: We show that for any excluded price vector, there exists a price vector ${\varvec{r}}\in {\mathcal{R}}_{c}\left(w\right)$ that results in the same customer decisions and earned revenues. W.l.o.g., let us assume that an excluded price vector satisfies any of the bullet points for some $j$ (if there are multiple instances, we iteratively apply the following steps). The implication is that demand for $j$ units is eliminated. By substituting a certain value for ${r}_{j}$, we can ensure that demand for $j$ units is still eliminated, while the resulting price vector belongs to ${\mathcal{R}}_{c}\left(w\right)$. Considering the bullet points mentioned earlier, we want to shortly discuss what happens if we replace the inequality of these conditions with equality: Thereby, there is at most one $\lambda \in \left[\mathrm{0,1}\right]$ such that conditions (a) to (c) are fulfilled. As we assume $\lambda$ to be a continuously distributed random variable, the probability of $\lambda$ being exactly this value is zero. Thus, we can eliminate demand almost surely by choosing ${r}_{j}$ such that ${r}_{j}=j\cdot w$ (first bullet point with “$=$”), ${r}_{j}={r}_{i}$ (second bullet point with “$=$”), ${\left(\frac{{r}_{j}-{r}_{j-1}}{w}\right)}^{\frac{1}{j-1}}=1$ (third bullet point with “$=$”), or ${\left(\frac{{r}_{j+1}-{r}_{j}}{w}\right)}^\frac{1}{j}={\left(\frac{{r}_{j}-{r}_{j-1}}{w}\right)}^{\frac{1}{j-1}}$ (fourth bullet point with “$=$”). Please note that the definition of ${\mathcal{R}}_{c}\left(w\right)$ always covers at least one of these four alternatives. This is sufficient for the purpose of maximizing expected revenue, and we can exclude all the cases mentioned in the bullet points without limiting possibilities for our optimization problem. □

In Eq. (1), we can observe that ${\left(\frac{{r}_{j}-{r}_{j-1}}{w}\right)}^{\frac{1}{j-1}}$ represents the lowest realization $l\in \left[\mathrm{0,1}\right]$, for which a customer has nonnegative marginal utility when purchasing the $j$th unit ($j\ge 2$): ${u}_{j}\left({\varvec{r}}\right)-{u}_{j-1}\left({\varvec{r}}\right)=\left({X}_{j}-{r}_{j}\right)-\left({X}_{j-1}-{r}_{j-1}\right)=\left({X}_{j}-{X}_{j-1}\right)-\left({r}_{j}-{r}_{j-1}\right)=w\cdot {\lambda }^{j-1}-\left({r}_{j}-{r}_{j-1}\right)\ge 0\iff \lambda \ge {\left(\frac{{r}_{j}-{r}_{j-1}}{w}\right)}^{\frac{1}{j-1}}$. This threshold is crucial, and we define

$$\mathop {\underline {l} }\nolimits_{j} \left( {r_{j} - r_{j - 1} } \right) = \left( {\frac{{r_{j} - r_{j - 1} }}{w}} \right)^{{\frac{1}{j - 1}}} \quad {\text{for }} j = 2, \ldots ,c$$

(8)

Let us discuss the implication of this threshold in a short example: Assume we are dealing with a customer with a specific observable base willingness-to-pay (e.g., $w=0.8$) and an unobservable consumption indicator $\lambda$ with realizations $l\in \left[0, 1\right]$. The firm quotes an arbitrary price vector ${\varvec{r}}$ with ${\varvec{r}}\in {\mathcal{R}}_{c}\left(w\right)$ (e.g., the same price vector as depicted in Fig. 2). Now, we can calculate for every batch size $j$ the marginal utility ${u}_{j}\left({\varvec{r}}\right)-{u}_{j-1}\left({\varvec{r}}\right)$. As the marginal utility depends on random variable $\lambda$, we portray it as function of every possible realization $l\in \left[0, 1\right]$, i.e., $l \mapsto w \cdot l^{j - 1} - \left( {r_{j} - r_{j - 1} } \right)$. To provide a clear illustration in Fig. 3, we only show the marginal utilities for $j\le 3$.

The marginal utility for the first unit is positive for all $l\in \left[0, 1\right]$. Therefore, in this example, every customer with $w=0.8$ prefers purchasing one unit over purchasing nothing at all. The marginal utility for the second and third unit becomes positive at ${\underline{l}}_{2}\left({r}_{2}-{r}_{1}\right)$ and ${\underline{l}}_{3}\left({r}_{3}-{r}_{2}\right)$, respectively. Customers with $l\ge {\underline{l}}_{2}\left({r}_{2}-{r}_{1}\right)$ and $l\ge {\underline{l}}_{3}\left({r}_{3}-{r}_{2}\right)$ can increase their utility by purchasing the second and third unit, respectively. We can now partition the interval $\left[0, 1\right]$ into $\left[0, {\underline{l}}_{2}\left({r}_{2}-{r}_{1}\right)\right]$, $\left[{\underline{l}}_{2}\left({r}_{2}-{r}_{1}\right), {\underline{l}}_{3}\left({r}_{3}-{r}_{2}\right)\right]$, and $\left[{\underline{l}}_{3}\left({r}_{3}-{r}_{2}\right), 1\right]$. Customers with $l={\underline{l}}_{j}\left({r}_{j}-{r}_{j-1}\right)$ almost surely do not arrive (remember, $\lambda$ is continuously distributed). Thus, it is irrelevant which of the adjacent intervals contains them. For presentation purposes, we decided to include them in both and work with closed intervals. Customers belonging to the first interval (based on their personal $l$) have positive marginal utility for purchasing one unit. They also have negative marginal utility for purchasing the second and third unit. Consequently, these customers attain their maximal utility by purchasing one unit. Analogously, customers belonging to the second and third interval decide to purchase two and three units, respectively.

We can generalize these considerations and partition $\left[0, 1\right]$ into $\left[0, {\underline{l}}_{2}\left({r}_{2}-{r}_{1}\right)\right]$, $\left[{\underline{l}}_{j}\left({r}_{j}-{r}_{j-1}\right), {\underline{l}}_{j+1}\left({r}_{j+1}-{r}_{j}\right)\right]$ for $j=2, 3, \dots , c-1$, and $\left[{\underline{l}}_{c}\left({r}_{c}-{r}_{c-1}\right), 1\right]$. For ${\varvec{r}}\in {\mathcal{R}}_{c}\left(w\right)$, by definition, ${\underline{l}}_{j}\left({r}_{j}-{r}_{j-1}\right)$ is increasing in $j$. Hence, these intervals are well-defined, cover the entire interval $\left[0, 1\right]$, and are ordered such that $\left[{\underline{l}}_{j}\left({r}_{j}-{r}_{j-1}\right), {\underline{l}}_{j+1}\left({r}_{j+1}-{r}_{j}\right)\right]$ contains lower values than $\left[{\underline{l}}_{i}\left({r}_{i}-{r}_{i-1}\right), {\underline{l}}_{i+1}\left({r}_{i+1}-{r}_{i}\right)\right]$ if $j<i$. Moreover, we can conclude that customers belonging to $\left[{\underline{l}}_{j}\left({r}_{j}-{r}_{j-1}\right), {\underline{l}}_{j+1}\left({r}_{j+1}-{r}_{j}\right)\right]$ decide to purchase $j$ units (as they have positive marginal utilities for $i\le j$ and negative marginal utilities for $i>j$). Building on this, we can easily calculate the probability a customer purchase $j$ units ($1<j<c$) by calculating the probability a customer belongs to $\left[{\underline{l}}_{j}\left({r}_{j}-{r}_{j-1}\right), {\underline{l}}_{j+1}\left({r}_{j+1}-{r}_{j}\right)\right]$:

$${\mathbb{P}}\left( {l \in \left[ {\underline {l}_{j} \left( {r_{j} - r_{j - 1} } \right), \underline {l}_{j + 1} \left( {r_{j + 1} - r_{j} } \right)} \right]} \right) = F_{\lambda } \left( {\underline {l}_{j + 1} \left( {r_{j + 1} - r_{j} } \right)} \right) - F_{\lambda } \left( {\underline {l}_{j} \left( {r_{j} - r_{j - 1} } \right)} \right).$$

This remains true for $j=c$ by replacing ${\underline{l}}_{c+1}\left({r}_{c+1}-{r}_{c}\right)$ with $1$. However, the $j=1$ case is somewhat distinct: As utility for purchasing the first unit is independent of $l$, the marginal utility is constant. With ${r}_{1}\le {w}^{+}$ (one of the conditions that defines ${\mathcal{R}}_{c}\left(w\right)$), it is either positive (${r}_{1}<w$) or zero (${r}_{1}=w$ and ${r}_{1}={w}^{+}$). In the latter case, customers are indifferent between purchasing and not purchasing. We emphasize with ${r}_{1}=w$ and ${r}_{1}={w}^{+}$ which of these equally viable options customers choose. As $w$ and ${w}^{+}$ are the same value, the resulting intervals, i.e., $\left[0, {\underline{l}}_{2}\left({r}_{2}-{r}_{1}\right)\right]$, are identical in the sense that they cover the same area. And yet, they differ in meaning. One represents all customers that purchase exactly one unit (resulting from ${r}_{1}=w$), the other represents all customers that purchase nothing at all (resulting from ${r}_{1}={w}^{+}$). Please note that this ambiguity has no impact on customers that belong to $\left[{\underline{l}}_{j}\left({r}_{j}-{r}_{j-1}\right), {\underline{l}}_{j+1}\left({r}_{j+1}-{r}_{j}\right)\right]$ with $j\ge 2$. These customers have a zero valued marginal utility for the first unit, a positive marginal utility for each $i$th unit with $i\le j$, and negative marginal utilities for every other unit. Thus, they still attain their maximal utility by purchasing $j$ units.

These considerations were enabled by restricting the action space to ${\mathcal{R}}_{c}\left(w\right)$. Only with this restriction, the intervals are guaranteed to be correctly ordered which, in turn, allows us to simplify the formulation of the selling probability $p_{j} \left( {{\varvec{r}}|w} \right)$ for a batch of size $j$:

$$p_{j} \left( {{\varvec{r}}|w} \right) = \left\{ {\begin{array}{*{20}l} {F_{\lambda } \left( {\underline {l}_{j + 1} \left( {r_{j + 1} - r_{j} } \right)} \right) - F_{\lambda } \left( {\underline {l}_{j} \left( {r_{j} - r_{j - 1} } \right)} \right)} \hfill & {{\text{for }} 1 \le j \le c - 1} \hfill \\ {1 - F_{\lambda } \left( {\underline {l}_{c} \left( {r_{c} - r_{c - 1} } \right)} \right)} \hfill & {{\text{for }} j = c} \hfill \\ \end{array} } \right.$$

(9)

with ${\underline{l}}_{1}\left({r}_{1}-{r}_{0}\right)={\underline{l}}_{2}\left({r}_{2}-{r}_{1}\right)\cdot {1}_{\left\{{r}_{1}={w}^{+}\right\}}$ to properly reflect the ambiguous behavior of customers belonging to $\left[0, {\underline{l}}_{2}\left({r}_{2}-{r}_{1}\right)\right]$.

The optimization problem with observable base willingness-to-pay is given by

$${V}_{t}^{\omega }\left(c\right)=\underset{0}{\overset{1}{\int }}\underset{{\varvec{r}}\in {\mathcal{R}}_{c}\left(w\right)}{{\text{max}}}\left\{\sum_{j=1}^{c}{p}_{j}\left({\varvec{r}}|w\right)\cdot \left({r}_{j}-{\Delta }_{j}{V}_{t-1}^{\omega }\left(c\right)\right)\right\}{f}_{\omega }\left(w\right) dw+{V}_{t-1}^{\omega }\left(c\right)$$

(10)

with boundary conditions ${V}_{0}^{\omega }\left(c\right)=0$ for $c\ge 0$ and ${V}_{t}^{\omega }\left(0\right)=0$ for $t\ge 0$. By definition of ${p}_{j}\left({\varvec{r}}|w\right)$, ${r}_{j}$ influences the probability of three possible outcomes: selling $j-1$, $j$, and $j+1$ units. This interconnection thwarts maximizing $\sum_{j=1}^{c}{p}_{j}\left({\varvec{r}}|w\right)\cdot \left({r}_{j}-{\Delta }_{j}{V}_{t-1}^{\omega }\left(c\right)\right)$ separately. We can circumvent this obstacle by defining $\Delta {r}_{j}={r}_{j}-{r}_{j-1}$ for $j\le c$ (${r}_{0}=0$) and reformulating $\sum_{j=1}^{c}{p}_{j}\left({\varvec{r}}|w\right)\cdot \left({r}_{j}-{\Delta }_{j}{V}_{t-1}^{\omega }\left(c\right)\right)=\sum_{j=1}^{c}{p}_{j}\left({\varvec{r}}|w\right)\cdot \left(\sum_{i=1}^{j}\Delta {r}_{i}-{\Delta }_{1}{V}_{t-1}^{\omega }\left(c+1-i\right)\right)=\sum_{j=1}^{c}\left(1-{F}_{\lambda }\left({\underline{l}}_{j}\left(\Delta {r}_{j}\right)\right)\right)\cdot \left(\Delta {r}_{j}-{\Delta }_{1}{V}_{t-1}^{\omega }\left(c+1-j\right)\right)$ for ${\varvec{r}}\in {\mathcal{R}}_{c}\left(w\right)$. Moreover, with (8), we write ${\mathcal{R}}_{c}\left(w\right)=\left\{{\varvec{r}}\in {\mathbb{R}}^{c}:0\le\Delta {r}_{1}\le {w}^{+}, \text{and }0\le {\underline{l}}_{j}\left(\Delta {r}_{j}\right)\le {\underline{l}}_{j+1}\left(\Delta {r}_{j+1}\right)\le 1 {\text{for}} \, 2\le j\le c-1\right\}$. The only remaining connection between marginal unit prices $\Delta {r}_{j}$ is given by the imposed order ${\underline{l}}_{j}\left(\Delta {r}_{j}\right)\le {\underline{l}}_{j+1}\left(\Delta {r}_{j+1}\right)$ for $2\le j\le c-1$. When defining ${\underline{l}}_{j}\left(\Delta {r}_{j}\right)$ in (8), we showed that this is the threshold between negative ($l<{\underline{l}}_{j}\left(\Delta {r}_{j}\right)$) and positive ($l>{\underline{l}}_{j}\left(\Delta {r}_{j}\right)$) marginal utility for purchasing the $j$th unit. So, in conclusion, this order ensures a pricing scheme where customers only consider buying the $j+1$th unit if they also buy the $j$th.

Currently, this order, imposed by conditions ${\underline{l}}_{j}\left(\Delta {r}_{j}\right)\le {\underline{l}}_{j+1}\left(\Delta {r}_{j+1}\right)$, $2\le j\le c-1$, is preventing us from optimizing every decision variable independently. By removing these conditions, we formulate an optimization problem that is entirely separable in each decision variable and serves as an upper bound to (10):

$$\mathop \sum \limits_{j = 1}^{c} \mathop {\max }\limits_{{{\Delta }r_{j} \in \left[ {0,w} \right]}} \left\{ {\left( {1 - F_{\lambda } \left( {\underline {l}_{j} \left( {{\Delta }r_{j} } \right)} \right)} \right) \cdot \left( {{\Delta }r_{j} - {\Delta }_{1} V_{t - 1}^{\omega } \left( {c + 1 - j} \right)} \right)} \right\}.$$

(11)

The roadmap for the remaining section is as follows: We first determine the solution of upper bound problem (11), show that under certain conditions this solution is also the solution of (10) resulting in the same expected revenue, and finally show by induction that these conditions are indeed met.

4.1.2 Solution and structural properties

For every $j$, we check if we can economically sell the $j$th unit. We use the term “economic selling” to refer to selling an additional unit at a price that covers at least the lost expected revenue of the additionally sold capacity (opportunity cost), i.e. $\Delta {r}_{j}\ge {\Delta }_{1}{V}_{t-1}^{\omega }\left(c-j+1\right)$. Whenever ${\Delta }_{1}{V}_{t-1}^{\omega }\left(c+1-j\right)$ is exceeding $w$, we cannot economically sell the $j$th unit and choose to eliminate demand for it, i.e. we pick $\Delta {r}_{j}=w$.

Corollary 1

If ${V}_{t-1}^{\omega }\left(\cdot \right)$ is increasing and concave, ${N}_{t,c}\left(w\right)=\underset{j=1, \dots ,c}{max} \left\{j: {\Delta }_{1}{V}_{t-1}^{\omega }\left(c-j+1\right)<w\right\}$ denotes the highest additional unit that can be sold economically. It holds that $\left\{j: {\Delta }_{1}{V}_{t-1}^{\omega }\left(c-j+1\right)<w\right\}=\left\{1, 2, \dots , {N}_{t,c}\left(w\right)\right\}$.

Proof

As ${V}_{t-1}^{\omega }\left(\cdot \right)$ is concave, ${\Delta }_{1}{V}_{t-1}^{\upomega }\left(c-j+1\right)$ is increasing in $j$. Thus, there is $\widehat{j}\in \left\{\mathrm{1,2},\dots ,c\right\}$ with ${\Delta }_{1}{V}_{t-1}^{\upomega }\left(c-j+1\right)<w\iff j\le \widehat{j}$. □

By definition, it holds that ${\Delta }_{1}{V}_{t-1}^{\omega }\left(c-{N}_{t,c}\left(w\right)+1\right)<w$ and ${\Delta }_{1}{V}_{t-1}^{\omega }\left(c-\left({N}_{t,c}\left(w\right)+1\right)+1\right)\ge w$. Consequently, it also holds that ${\Delta }_{1}{V}_{t-1}^{\omega }\left(\left(c-1\right)-\left({N}_{t,c}\left(w\right)-1\right)-1\right)<w$ and ${\Delta }_{1}{V}_{t-1}^{\omega }\left(\left(c-1\right)-{N}_{t,c}\left(w\right)+1\right)\ge w$ as well as ${\Delta }_{1}{V}_{t-1}^{\omega }\left(\left(c+1\right)-\left({N}_{t,c}\left(w\right)+1\right)-1\right)<w$ and ${\Delta }_{1}{V}_{t-1}^{\omega }\left(\left(c+1\right)-\left({N}_{t,c}\left(w\right)+2\right)+1\right)\ge w$. This observation leads to the following remark.

Remark 1

It holds that ${N}_{t,c-1}\left(w\right)+1={N}_{t,c}\left(w\right)={N}_{t,c+1}\left(w\right)-1$.

The solution to $\underset{\Delta {r}_{1}\in \left[0,w\right]}{{\text{max}}}\left\{\left(1-{F}_{\lambda }\left({\underline{l}}_{1}\left(\Delta {r}_{1}\right)\right)\right)\cdot \left(\Delta {r}_{1}-{\Delta }_{1}{V}_{t-1}^{\omega }\left(c\right)\right)\right\}$ is $\Delta {r}_{1}=w$ (${w}^{+}$ if ${\Delta }_{1}{V}_{t-1}^{\omega }\left(c\right)\ge w$). However, for every other $j$, the solution is less apparent.

In the proof of Proposition 1, we show that there exists exactly one solution $\Delta {r}_{j}$ to (11). We determine this solution with the help of the optimal customer threshold ${\underline{l}}_{j}$, which is (implicitly) defined in Proposition 1. There, we observe that the optimal solution depends on realization $w$ and opportunity costs ${\Delta }_{1}{V}_{t-1}^{\omega }\left(c+1-j\right)$.

Proposition 1

In every state $\left(t,c\right)$ and for every $w\in \left[\mathrm{0,1}\right]$, there is a unique optimal solution $\Delta {r}_{t,j}\left(c|w\right)$, $j\le c$, for (11):

If ${\Delta }_{1}{V}_{t-1}^{\omega }\left(c+1-j\right)\ge w$, $\Delta {r}_{t,j}\left(c|w\right)={w}^{+}$ with ${\underline{l}}_{j}\left(w\right)=1$.
If ${\Delta }_{1}{V}_{t-1}^{\omega }\left(c+1-j\right)\in \left[0,w\right)$, $\Delta {r}_{t,1}\left(c|w\right)=w$ with ${\underline{l}}_{1}\left(w\right)=0$ for $j=1$ and $\Delta {r}_{t,j}\left(c|w\right)=w\cdot {\left({\underline{l}}_{j}\left(w\right)\right)}^{j-1}$ with ${\underline{l}}_{j}\left(w\right)$ implicitly defined by $w\cdot {\underline{l}}_{j}^{j-2}\left(w\right)\cdot \left({\underline{l}}_{j}\left(w\right)-\frac{j-1}{{h}_{\lambda }\left({\underline{l}}_{j}\left(w\right)\right)}\right)={\Delta }_{1}{V}_{t-1}^{\omega }\left(c+1-j\right)$ for $j\ge 2$.

Proof

We have already established that $\Delta {r}_{{\text{t}},1}\left(c|w\right)=w$ when ${\Delta }_{1}{V}_{t-1}^{\omega }\left(c+1-j\right)\in \left(0,w\right)$. Hence, our focus will be on $j\ge 2$ in the subsequent discussion. In this proof, we aim to achieve two objectives. First, we intend to derive the implicit definition and argue that there is at least one solution meeting this criterion. Second, we aim to prove that there could only be one solution meeting this criterion. To accomplish the first goal, we will formulate the first-order condition and examine the values of the first derivative at the interval boundaries. The second goal will be secured by establishing the second derivative and demonstrating its negativity for every solution that satisfies the first-order condition. With the continuity of the first derivative, this is sufficient to conclude that there is exactly one point where the first derivative equals zero. Hence, there is exactly one solution meeting the first-order condition and maximizing the optimization problem.

We commence with a reformulation of the optimization problem, a convenient step to simplify the second derivative.

There are different approaches to tackle this optimization problem: we can try to find the optimal marginal price $\Delta {r}_{j}$, the optimal customer threshold ${\underline{l}}_{j}$, or the optimal probability $\theta =1-{F}_{\lambda }\left({\underline{l}}_{j}\right)$. As ${\underline{l}}_{j}\left(\Delta {r}_{j}\right)={\left(\frac{\Delta {r}_{j}}{w}\right)}^{\frac{1}{j-1}}$ is bijective on $\left[0, w\right]$, and the distribution function is bijective on its support $\left[0, 1\right]$, there is a unique mapping between $\Delta {r}_{j}$, ${\underline{l}}_{j}$, and $\theta$. This enables us to treat each of these variables as a decision variable and use the mapping to calculate the other two.

In this proof, it is more convenient to focus on $\theta$ as our decision variable. Thereby, we do not have to deal with (varying) opportunity costs in the second derivative. We reformulate our optimization problem with $\Delta {r}_{j}=w\cdot {\left({\underline{l}}_{j}\right)}^{j-1}$, $\theta =1-{F}_{\lambda }\left({\underline{l}}_{j}\right)$, and ${F}_{\lambda }^{-1}$ being the inverse to ${F}_{\lambda }$:

$$\begin{aligned} \mathop {\max }\limits_{{{\Delta }r_{j} \in \left[ {0,w} \right]}} \left\{ {\left( {1 - F_{\lambda } \left( {\left( {\frac{{{\Delta }r_{j} }}{w}} \right)^{{\frac{1}{j - 1}}} } \right)} \right) \cdot \left( {{\Delta }r_{j} - {\Delta }_{1} V_{t - 1}^{\omega } \left( {c + 1 - j} \right)} \right)} \right\} & = \mathop {\max }\limits_{{\underline {l}_{j} \in \left[ {0,1} \right]}} \left\{ {\left( {1 - F_{\lambda } \left( {\underline {l}_{j} } \right)} \right) \cdot \left( {w \cdot \left( {\underline {l}_{j} } \right)^{j - 1} - {\Delta }_{1} V_{t - 1}^{\omega } \left( {c + 1 - j} \right)} \right)} \right\} \\ & = \mathop {\max }\limits_{{{\uptheta } \in \left[ {0,1} \right]}} \left\{ {\theta \cdot \left( {w \cdot \left( {F_{\lambda }^{ - 1} \left( {1 - \theta } \right)} \right)^{j - 1} - {\Delta }_{1} V_{t - 1}^{\omega } \left( {c + 1 - j} \right)} \right)} \right\}. \\ \end{aligned}$$

We can now approach our first goal. The optimal solution has to meet the first-order condition:

$$\frac{d}{d\theta }\theta \cdot \left(w\cdot {\left({F}_{\lambda }^{-1}\left(1-\theta \right)\right)}^{j-1}-{\Delta }_{1}{V}_{t-1}^{\omega }\left(c+1-j\right)\right)=w\cdot {\left({F}_{\lambda }^{-1}\left(1-\theta \right)\right)}^{j-1}-{\Delta }_{1}{V}_{t-1}^{\omega }\left(c+1-j\right)-\theta \cdot w\cdot \left(j-1\right)\cdot {\left({F}_{\lambda }^{-1}\left(1-\theta \right)\right)}^{j-2}\cdot \frac{1}{{f}_{\lambda }\left({F}_{\lambda }^{-1}\left(1-\theta \right)\right)}=0.$$

This condition is well-defined as ${f}_{\lambda }>0$ on the distribution’s support $\left[0, 1\right]$. The existence of a solution is ensured by the continuity of the first derivative as well as the fact that ${\text{it}}$ is non-negative for $\theta =1$, and positive for $\theta =0$ (remember that $w>{\Delta }_{1}{V}_{t-1}^{\omega }\left(c+1-j\right)$).

With ${\underline{l}}_{j}={F}_{\lambda }^{-1}\left(1-\theta \right)$ and the definition of the failure rate, we can reformulate the first-order condition to

$$w\cdot {\left({\underline{l}}_{j}\right)}^{j-1}-{\Delta }_{1}{V}_{t-1}^{\omega }\left(c+1-j\right)-w\cdot \left(j-1\right)\cdot {\left({\underline{l}}_{j}\right)}^{j-2}\cdot \frac{1}{{h}_{\lambda }\left({\underline{l}}_{j}\right)}=0.$$

To achieve our second objective, we derive the second derivative on this formulation and write ${\underline{l}}_{j}\left(\theta \right)$ to emphasize that ${\underline{l}}_{j}$ depends on $\theta$ (our decision variable in this proof). Subsequently, we demonstrate that the second derivative is negative for every $\theta$ that satisfies the first-order condition. With the continuity of the first derivative, this is sufficient to establish the uniqueness of such a $\theta$:

$$\frac{{d}^{2}}{d{\theta }^{2}}\theta \cdot \left(w\cdot {\left({F}_{\lambda }^{-1}\left(1-\theta \right)\right)}^{j-1}-{\Delta }_{1}{V}_{t-1}^{\omega }\left(c+1-j\right)\right)=\frac{d}{d\theta }w\cdot {\left({\underline{l}}_{j}\left(\theta \right)\right)}^{j-1}-{\Delta }_{1}{V}_{t-1}^{\omega }\left(c+1-j\right)-w\cdot \left(j-1\right)\cdot {\left({\underline{l}}_{j}\left(\theta \right)\right)}^{j-2}\cdot \frac{1}{{h}_{\lambda }\left({\underline{l}}_{j}\left(\theta \right)\right)}=w\cdot \left(j-1\right)\cdot {\left({\underline{l}}_{j}\left(\theta \right)\right)}^{j-3}\left({\underline{l}}_{j}\left(\theta \right)-\frac{\left(j-2\right)\cdot {h}_{\lambda }\left({\underline{l}}_{j}\left(\theta \right)\right)-{\underline{l}}_{j}\left(\theta \right)\cdot {h}^{\prime}_{\lambda }\left({\underline{l}}_{j}\left(\theta \right)\right)}{{\left({h}_{\lambda }\left({\underline{l}}_{j}\left(\theta \right)\right)\right)}^{2}}\right)\cdot \frac{d}{d\theta }{\underline{l}}_{j}\left(\theta \right)$$

for $j\ge 3$, and

$$\frac{{d}^{2}}{d{\theta }^{2}}\theta \cdot \left(w\cdot {\left({F}_{\lambda }^{-1}\left(1-\theta \right)\right)}^{j-1}-{\Delta }_{1}{V}_{t-1}^{\omega }\left(c+1-j\right)\right)=\left(w+w\cdot \left(j-1\right)\cdot \frac{{h}_{\lambda }^{\prime}\left({\underline{l}}_{j}\left(\theta \right)\right)}{{\left({h}_{\lambda }\left({\underline{l}}_{j}\left(\theta \right)\right)\right)}^{2}}\right)\cdot \frac{d}{d\theta }{\underline{l}}_{j}\left(\theta \right)<0$$

for $j=2$. With $\frac{d}{d\theta }{\underline{l}}_{j}\left(\theta \right)<0$ and ${h}_{\lambda }^{\prime}\left({\underline{l}}_{j}\left(\theta \right)\right)\ge 0$, the latter case is trivial. Thus, we will focus on $j\ge 3$ for the remaining part of the proof. Again with $\frac{d}{d\theta }{\underline{l}}_{j}\left(\theta \right)<0$, we only need to show that ${\underline{l}}_{j}\left(\theta \right)-\frac{\left(j-2\right)\cdot {h}_{\lambda }\left({\underline{l}}_{j}\left(\theta \right)\right)-{\underline{l}}_{j}\left(\theta \right)\cdot {h}_{\lambda }^{\prime}\left({\underline{l}}_{j}\left(\theta \right)\right)}{{\left({h}_{\lambda }\left({\underline{l}}_{j}\left(\theta \right)\right)\right)}^{2}}>0$ for every $\theta$ that meets the first-order condition. It holds that

$$\underline {l}_{j} \left( \theta \right) - \frac{{\left( {j - 2} \right) \cdot h_{\lambda } \left( {\underline {l}_{j} \left( \theta \right)} \right) - \underline {l}_{j} \left( \theta \right) \cdot h_{\lambda }^{\prime } \left( {\underline {l}_{j} \left( \theta \right)} \right)}}{{\left( {h_{\lambda } \left( {\underline {l}_{j} \left( \theta \right)} \right)} \right)^{2} }} = \underbrace {{{\Delta }_{1} V_{t - 1}^{\omega } \left( {c + 1 - j} \right)}}_{ \ge 0} + \underbrace {{\frac{1}{{h_{\lambda } \left( {\underline {l}_{j} \left( \theta \right)} \right)}}}}_{ > 0} + \underbrace {{\frac{{\underline {l}_{j} \left( \theta \right) \cdot h_{\lambda }^{\prime } \left( {\underline {l}_{j} \left( \theta \right)} \right)}}{{\left( {h_{\lambda } \left( {\underline {l}_{j} \left( \theta \right)} \right)} \right)^{2} }}}}_{ \ge 0} > 0.$$

The equality follows by the first-order condition. Also note that ${h}_{\lambda }$ is positive on the distribution’s support and increasing by assumption.□

Remark 2

For $\lambda \sim U\left[0, 1\right]$ and ${\Delta }_{1}{V}_{t-1}^{\omega }\left(c+1-j\right)=0$, the optimality condition leads to a closed-form solution: $\Delta {r}_{t,j}\left(c|w\right)={w\cdot \left(\frac{j-1}{j}\right)}^{j-1}$, $j\ge 2$.

For now, we have (implicitly) given the solution of upper bound (11). If we can show that this solution is a feasible solution to (10), i.e. $\Delta {r}_{t,j}\left(c|w\right)\in {\mathcal{R}}_{c}\left(w\right)$, we can immediately conclude that $\Delta {r}_{t,j}\left(c|w\right)$ results in the same expected revenue in (10) and is the unique optimal solution. It holds that $\Delta {r}_{t,j}\left(c|w\right)\in {\mathcal{R}}_{c}\left(w\right)\iff \left(\Delta {r}_{t,1}\left(c|w\right)\in \left[0, {w}^{+}\right] {\text{and}} \, {\underline{l}}_{j}\left(w\right)\le {\underline{l}}_{j+1}\left(w\right) {\text{for}} \, 2\le j\le c-1\right)$.

In proof of Proposition 1, we have seen that ${\underline{l}}_{j}\left(w\right)<1$ (resulting from $\theta >0$) if ${\Delta }_{1}{V}_{t-1}^{\omega }\left(c+1-j\right)<w$ and ${\underline{l}}_{j}\left(w\right)=1$ if ${\Delta }_{1}{V}_{t-1}^{\omega }\left(c+1-j\right)\ge w$. This could lead to contradicting condition ${\underline{l}}_{j}\left(w\right)\le {\underline{l}}_{j+1}\left(w\right)$ if there is $j$ such that ${\Delta }_{1}{V}_{t-1}^{\omega }\left(c+1-j\right)\ge w$ and ${\Delta }_{1}{V}_{t-1}^{\omega }\left(c-j\right)<w$. Therefore, a necessary condition for $\Delta {r}_{t,j}\left(c|w\right)\in {\mathcal{R}}_{c}\left(w\right)$ is ${\Delta }_{1}{V}_{t-1}^{\omega }\left(c+1-\widehat{j}\right)\ge w\Rightarrow \left({\Delta }_{1}{V}_{t-1}^{\omega }\left(c+1-j\right)\ge w\ \forall j\ge \widehat{j}\right)$. This is ensured when ${V}_{t-1}^{\omega }\left(\cdot \right)$ is concave, so we will stick with this condition.

Proposition 2

If ${V}_{t-1}^{\omega }\left(\cdot \right)$ is increasing and concave, $\Delta {r}_{t,j}\left(c|w\right)$ defined by Proposition 1 is the optimal solution for (10).

Proof

We formulated optimization problem (11) by removing conditions ${\underline{l}}_{j}\left(w\right)\le {\underline{l}}_{j+1}\left(w\right) {\text{ for }} \, 2\le j\le c-1$. Therefore, demonstrating that $\Delta {r}_{t,j}\left(c|w\right)$ satisfies these conditions is sufficient to show Proposition 2. According to Corollary 1, ${\Delta }_{1}{V}_{t-1}^{\omega }\left(c-j+1\right)<w$ holds for $j\le {N}_{t,c}\left(w\right)$, and ${\Delta }_{1}{V}_{t-1}^{\omega }\left(c-j+1\right)\ge w$ holds for $j>{N}_{t,c}\left(w\right)$. Combined with Proposition 1, this implies that ${\underline{l}}_{j}\left(w\right)=1$ for $j>{N}_{t,c}\left(w\right)$, which evidently aligns with ${\underline{l}}_{j}\left(w\right)\le {\underline{l}}_{j+1}\left(w\right)$. Therefore, in the subsequent discussion, we exclusively focus on the case where $j\le {N}_{t,c}\left(w\right)$.

We know from Proposition 1 (and its proof) that $0={\underline{l}}_{1}\left(w\right)\le {\underline{l}}_{2}\left(w\right)$, and ${\underline{l}}_{j}\left(w\right)$ such that $w\cdot {\underline{l}}_{j}^{j-2}\left(w\right)\cdot \left({\underline{l}}_{j}\left(w\right)-\frac{j-1}{{h}_{\lambda }\left({\underline{l}}_{j}\left(w\right)\right)}\right)={\Delta }_{1}{V}_{t-1}^{\omega }\left(c+1-j\right)$, $j\ge 2$.

Focusing on $w\cdot {\underline{l}}^{j-2}\cdot \left(\underline{l}-\frac{j-1}{{h}_{\lambda }\left(\underline{l}\right)}\right)$, we can observe that this formulation is decreasing in $j$ if $\underline{l}-\frac{j-1}{{h}_{\lambda }\left(\underline{l}\right)}\ge 0$. As ${\underline{l}}_{j}\left(w\right)-\frac{j-1}{{h}_{\lambda }\left({\underline{l}}_{j}\left(w\right)\right)}\ge 0$, it holds that

$$0=w\cdot {\underline{l}}_{j}^{j-2}\left(w\right)\cdot \left({\underline{l}}_{j}\left(w\right)-\frac{j-1}{{h}_{\lambda }\left({\underline{l}}_{j}\left(w\right)\right)}\right)-{\Delta }_{1}{V}_{t-1}^{\omega }\left(c+1-j\right)>w\cdot {\underline{l}}_{j}^{j-1}\left(w\right)\cdot \left({\underline{l}}_{j}\left(w\right)-\frac{j}{{h}_{\lambda }\left({\underline{l}}_{j}\left(w\right)\right)}\right)-{\Delta }_{1}{V}_{t-1}^{\omega }\left(c+1-j\right)\ge w\cdot {\underline{l}}_{j}^{j-1}\left(w\right)\cdot \left({\underline{l}}_{j}\left(w\right)-\frac{j}{{h}_{\lambda }\left({\underline{l}}_{j}\left(w\right)\right)}\right)-{\Delta }_{1}{V}_{t-1}^{\omega }\left(c-j\right),$$

$j\ge 2$, where the equation follows by Proposition 1, the first inequality by increasing $j$ to $j+1$, and the last inequality by concavity of ${V}_{t-1}^{\omega }\left(\cdot \right)$. The negativity of the last term proves that the optimal solution ${\underline{l}}_{j}\left(w\right)$ for selling the $j$th unit does not satisfy the optimality condition for selling the $j+1$th unit. More precisely, it proves that the point where this optimality condition is fulfilled, denoted as ${\underline{l}}_{j+1}\left(w\right)$, must be positioned above ${\underline{l}}_{j}\left(w\right)$. In simpler terms, ${\underline{l}}_{j}\left(w\right)<{\underline{l}}_{j+1}\left(w\right)$. □

So far, we have shown $\Delta {r}_{t,j}\left(c|w\right)$ (implicitly) defined by Proposition 1 is the optimal solution to (10) for every $t$ if ${V}_{t-1}^{\omega }\left(\cdot \right)$ is increasing and concave. We will show that this condition indeed holds for the whole horizon. In the upcoming proof, the optimal expected margin ${m}_{j}$ for selling the $j$th unit, and its sensitivity to changes in opportunity costs, will play a crucial role. Hence, we introduce ${m}_{j}\left(\delta \right)=\underset{\Delta {r}_{j}\in \left[0,w\right]}{{\text{max}}}\left\{\left(1-{F}_{\lambda }\left({\underline{l}}_{j}\left(\Delta {r}_{j}\right)\right)\right)\cdot \left(\Delta {r}_{j}-\delta \right)\right\}$ as a function of variable $\delta$ which represents any opportunity costs. This allows us to analyze the impact of varying opportunity costs on the optimal expected margin. Lemma 2 outlines certain properties of ${m}_{j}\left(\delta \right)$ that will prove useful in establishing concavity of ${V}_{t-1}^{\omega }\left(\cdot \right)$ later in this section.

Lemma 2

If $\delta \in \left[0,w\right)$, it holds that:

(a)
${m}_{j+1}\left(\delta \right)-{m}_{j}\left(\delta \right)\le 0$
(b)
${m}_{j+1}\left(\delta \right)-{m}_{j}\left(\delta \right)$ is increasing in $\delta$
(c)
${m}_{j}\left(\delta \right)$ is decreasing in $\delta$

Proof

We will address (a), (b), and (c) separately, though not in this order. To streamline the proof of (b), we will employ a formulation derived in (c), so we will modify the order accordingly.

(a): The assertion that the optimal expected margin declines with $j$, i.e., ${m}_{j}\left(\delta \right)\ge {m}_{j+1}\left(\delta \right)$, is rooted in two observations: the suboptimality of the solution of ${m}_{j+1}\left(\delta \right)$ for ${m}_{j}\left(\delta \right)$, and the fact that expected margin decreases with $j$ for any $l\in \left[0, 1\right]$.

${m}_{j}\left(\delta \right)$ is the optimal value of $\underset{\Delta {r}_{j}\in \left[0,w\right]}{{\text{max}}}\left\{\left(1-{F}_{\lambda }\left({\underline{l}}_{j}\left(\Delta {r}_{j}\right)\right)\right)\cdot \left(\Delta {r}_{j}-\delta \right)\right\}=\underset{{\underline{l}}_{j}\in \left[\mathrm{0,1}\right]}{{\text{max}}}\left\{\left(1-{F}_{\lambda }\left({\underline{l}}_{j}\right)\right)\cdot \left(w\cdot {\left({\underline{l}}_{j}\right)}^{j-1}-\delta \right)\right\}=\left(1-{F}_{\lambda }\left({\underline{l}}_{j}\left(w\right)\right)\right)\cdot \left(w\cdot {\left({\underline{l}}_{j}\left(w\right)\right)}^{j-1}-\delta \right)$ with ${\underline{l}}_{j}\left(w\right)$ representing the optimal solution. As ${\underline{l}}_{j+1}\left(w\right)$ (the optimal solution of ${m}_{j+1}\left(\delta \right)$) is suboptimal for ${m}_{j}\left(\delta \right)$ and $1-{F}_{\lambda }\left({\underline{l}}_{j+1}\left(w\right)\right)\ge 0$ as well as ${\underline{l}}_{j+1}\left(w\right)\le 1$, it holds that

$${m}_{j}\left(\delta \right)=\underset{{\underline{l}}_{j}\in \left[\mathrm{0,1}\right]}{{\text{max}}}\left\{\left(1-{F}_{\lambda }\left({\underline{l}}_{j}\right)\right)\cdot \left(w\cdot {\left({\underline{l}}_{j}\right)}^{j-1}-\delta \right)\right\}\ge \left(1-{F}_{\lambda }\left({\underline{l}}_{j+1}\left(w\right)\right)\right)\cdot \left(w\cdot {\left({\underline{l}}_{j+1}\left(w\right)\right)}^{j-1}-\delta \right)\ge \left(1-{F}_{\lambda }\left({\underline{l}}_{j+1}\left(w\right)\right)\right)\cdot \left(w\cdot {\left({\underline{l}}_{j+1}\left(w\right)\right)}^{j}-\delta \right)={m}_{j+1}\left(\delta \right)$$

(c): To prove (c), we will derive the first derivative of ${m}_{j}\left(\delta \right)$ with respect to $\delta$ and demonstrate its nonpositivity.

Based on its implicit definition $w\cdot {\underline{l}}_{j}^{j-2}\left(w\right)\cdot \left({\underline{l}}_{j}\left(w\right)-\frac{j-1}{{h}_{\lambda }\left({\underline{l}}_{j}\left(w\right)\right)}\right)=\delta$ (refer to Proposition 1), the optimal solution ${\underline{l}}_{j}\left(w\right)$ of ${m}_{j}\left(\delta \right)$ depends also on $\delta$. As we are about to vary $\delta$, we highlight this fact by writing ${\underline{l}}_{j}\left(\delta \right)$ instead of ${\underline{l}}_{j}\left(w\right)$ ($w$ acts as a parameter in this proof). The same applies for ${\underline{l}}_{j+1}\left(\delta \right)$ and ${m}_{j+1}\left(\delta \right)$. Building the first derivative, we get

$$\frac{d}{d\delta }{m}_{j}\left(\delta \right)=\frac{d}{d\delta }\left(\left(1-{F}_{\lambda }\left({\underline{l}}_{j}\left(\delta \right)\right)\right)\cdot \left(w\cdot {\left({\underline{l}}_{j}\left(\delta \right)\right)}^{j-1}-\delta \right)\right)=-{f}_{\lambda }\left({\underline{l}}_{j}\left(\delta \right)\right)\cdot \frac{d}{d\delta }\left({\underline{l}}_{j}\left(\delta \right)\right)\cdot \left(w\cdot {\left({\underline{l}}_{j}\left(\delta \right)\right)}^{j-1}-\delta \right)+\left(1-{F}_{\lambda }\left({\underline{l}}_{j}\left(\delta \right)\right)\right)\cdot \left(w\cdot \left(j-1\right)\cdot {\left({\underline{l}}_{j}\left(\delta \right)\right)}^{j-2}\cdot \frac{d}{d\delta }\left({\underline{l}}_{j}\left(\delta \right)\right)-1\right)=\frac{d}{d\delta }\left({\underline{l}}_{j}\left(\delta \right)\right)\cdot {f}_{\lambda }\left({\underline{l}}_{j}\left(\delta \right)\right)\cdot \left(w\cdot {\left({\underline{l}}_{j}\left(\delta \right)\right)}^{j-2}\cdot \left(\frac{j-1}{{h}_{\lambda }\left({\underline{l}}_{j}\left(\delta \right)\right)}-{\underline{l}}_{j}\left(\delta \right)\right)+\delta \right)-\left(1-{F}_{\lambda }\left({\underline{l}}_{j}\left(\delta \right)\right)\right)=-\left(1-{F}_{\lambda }\left({\underline{l}}_{j}\left(\delta \right)\right)\right)\le 0.$$

The last equation holds because of the implicit definition of ${\underline{l}}_{j}\left(\delta \right)$.

(b): Similarly to (c), we aim to calculate the first derivative $\frac{d}{d\delta }\left({m}_{j+1}\left(\delta \right)-{m}_{j}\left(\delta \right)\right)$. Fortunately, we can leverage the first derivative of ${m}_{j}\left(\delta \right)$ with respect to $\delta$. It is important to note that substituting $j$ by $j+1$ does not alter the reasoning in (c). Consequently, we find that $\frac{d}{d\delta }{m}_{j+1}\left(\delta \right)=-\left(1-{F}_{\lambda }\left({\underline{l}}_{j+1}\left(\delta \right)\right)\right)$. Combining the first derivative of ${m}_{j}\left(\delta \right)$ and ${m}_{j+1}\left(\delta \right)$ leads to

$$\frac{d}{d\delta }\left({m}_{j+1}\left(\delta \right)-{m}_{j}\left(\delta \right)\right)={F}_{\lambda }\left({\underline{l}}_{j+1}\left(\delta \right)\right)-{F}_{\lambda }\left({\underline{l}}_{j}\left(\delta \right)\right).$$

Recalling the argumentation while developing Proposition 2, we know that ${\underline{l}}_{j+1}\left(\delta \right)\ge {\underline{l}}_{j}\left(\delta \right)$. Hence, we can conclude that $\frac{d}{d\delta }\left({m}_{j+1}\left(\delta \right)-{m}_{j}\left(\delta \right)\right)\ge 0$. □

Even though we developed Lemma 2 mainly to show the desired concavity of ${V}_{t}^{\omega }\left(\cdot \right)$, it also brings interesting implications with it: The optimal expected margin for selling the $j$th unit is greater than the optimal expected margin for selling the $j+1$th unit given both cases result in the same additional opportunity costs. With a concave value function ${V}_{t}^{\omega }\left(\cdot \right)$, we can conclude that selling the $j+1$th unit results in higher additional opportunity costs and, thus, selling the $j+1$th unit definitely leads to a lower optimal expected margin than selling the $j$th unit does.

Before delving into the proof of the preservation of concavity across periods, let us examine a small example. Assume that $\omega$ and $\lambda$ follow a uniform distribution. We address the optimization problem for all states $\left(t, c\right)$ with $t=1, 2$ and $c=1, \dots , 5$. We start with $t=1$, as ${V}_{2}^{\omega }\left(\cdot \right)$ depends on ${V}_{1}^{\omega }\left(\cdot \right)$ which in turn depends on ${V}_{0}^{\omega }\left(\cdot \right)$. After the selling horizon, no revenue can be earned, leading to the boundary condition ${V}_{0}^{\omega }\left(c\right)=0$ for $c\ge 0$.

In $t=1$, observe that ${V}_{0}^{\omega }\left(c\right)$ is (as a constant, not strictly) increasing and concave. Propositions 1 and 2 allow us to calculate ${V}_{1}^{\omega }\left(c\right)$. In addition, with no opportunity costs (${\Delta }_{1}{V}_{0}^{\omega }\left(c\right)=0$), we can use the closed-form expression of the optimal solution from Remark 2, i.e., $\Delta {r}_{1}\left(c|w\right)=w$ and $\Delta {r}_{j}\left(c|w\right)={w\cdot \left(\frac{j-1}{j}\right)}^{j-1}$, for every possible realization $w$ of $\omega$. This yields a closed-form expression for ${V}_{1}^{\omega }\left(c|w\right)={V}_{0}^{\omega }\left(c\right)+\sum_{j=1}^{c}{m}_{j}\left(0\right)={V}_{0}^{\omega }\left(c\right)+w\cdot {1}_{\left\{w>{\Delta }_{1}{V}_{0}^{\omega }\left(c\right)\right\}}+\sum_{j=2}^{c}\left(1-{F}_{\lambda }\left(\frac{j-1}{j}\right)\right)\cdot {w\cdot \left(\frac{j-1}{j}\right)}^{j-1}$ and we calculate ${V}_{1}^{\omega }\left(c\right)={\int }_{0}^{1}{V}_{1}^{\omega }\left(c|w\right)\cdot {f}_{\omega }\left(w\right) dw$. The results of these calculations are presented in Table 1, revealing that ${V}_{1}^{\omega }\left(c\right)$ is increasing in $c$. As ${\Delta }_{1}{V}_{1}^{\omega }\left(c\right)={V}_{1}^{\omega }\left(c\right)-{V}_{1}^{\omega }\left(c-1\right)$ is decreasing in $c$, ${V}_{1}^{\omega }\left(c\right)$ is also concave.

Table 1 Example with $\omega , \lambda \sim U\left[0, 1\right]$ and $c\le 5, t=1, 2$

Full size table

Moving to $t=2$, Propositions 1 and 2 remain applicable (we just observed that ${V}_{1}^{\omega }\left(c\right)$ is increasing and concave). However, Remark 2 is no longer relevant (${\Delta }_{1}{V}_{1}^{\omega }\left(c\right)\ne 0$). Consequently, we can no longer rely on the closed-form expression of the optimal solution. Without the closed-form solution, solving the optimization problem for every realization $w$ becomes more challenging. As an example, we focus on the specific realization $w=0.1$ and $c=5$ in detail.

The maximal number of units that can be sold economically is given by ${N}_{\mathrm{2,5}}\left(0.1\right)=\underset{j=1, \dots ,5}{max} \left\{j: {\Delta }_{1}{V}_{t-1}^{\omega }\left(6-j\right)<0.1\right\}=3$. Consequently, the optimization problem becomes ${V}_{2}^{\omega }\left(5|0.1\right)={V}_{1}^{\omega }\left(5\right)+\sum_{j=1}^{3}{m}_{j}\left({\Delta }_{1}{V}_{1}^{\omega }\left(6-j\right)\right)={V}_{1}^{\omega }\left(5\right)+\left(0.1-{\Delta }_{1}{V}_{1}^{\omega }\left(5\right)\right)+{m}_{2}\left({\Delta }_{1}{V}_{1}^{\omega }\left(4\right)\right)+{m}_{3}\left({\Delta }_{1}{V}_{1}^{\omega }\left(3\right)\right)$. To apply the optimality condition, we note that $\frac{1}{{h}_{\lambda }\left({\underline{l}}_{j}\left(w\right)\right)}=\frac{1-{F}_{\lambda }\left({\underline{l}}_{j}\left(w\right)\right)}{{f}_{\lambda }\left({\underline{l}}_{j}\left(w\right)\right)}=1-{\underline{l}}_{j}\left(w\right)$ for $\lambda \sim U\left[0, 1\right]$. The optimality condition becomes $w\cdot {\underline{l}}_{j}^{j-2}\left(w\right)\cdot \left(j\cdot {\underline{l}}_{j}\left(w\right)-\left(j-1\right)\right)={\Delta }_{1}{V}_{t-1}^{\omega }\left(c+1-j\right)$.

Optimal $\Delta {r}_{2}$ is given by $\Delta {r}_{2}=0.1\cdot {\underline{l}}_{2}\left(0.1\right)$ with ${\underline{l}}_{2}\left(0.1\right)$ such that $0.1\cdot \left(2\cdot {\underline{l}}_{2}\left(0.1\right)-1\right)={\Delta }_{1}{V}_{1}^{\omega }\left(4\right)$. Thus, ${\underline{l}}_{2}\left(0.1\right)=\frac{\left(10\cdot {\Delta }_{1}{V}_{1}^{\omega }\left(4\right)+1\right)}{2}\approx 0.7635$ and $\Delta {r}_{2}=\left(\frac{{\Delta }_{1}{V}_{1}^{\omega }\left(4\right)+0.1}{2}\right)\approx 0.0764$. Optimal $\Delta {r}_{3}$ is given by $\Delta {r}_{3}=0.1\cdot {\left({\underline{l}}_{3}\left(0.1\right)\right)}^{2}$ with ${\underline{l}}_{3}\left(0.1\right)$ such that $0.1\cdot {\underline{l}}_{3}\left(0.1\right)\cdot \left(3\cdot {\underline{l}}_{3}\left(0.1\right)-2\right)={\Delta }_{1}{V}_{1}^{\omega }\left(3\right)$. Thus, ${\underline{l}}_{3}\left(0.1\right)=\frac{2+\sqrt{4+120\cdot {\Delta }_{1}{V}_{1}^{\omega }\left(3\right)}}{6}\approx 0.9318$ and $\Delta {r}_{3}\approx 0.0868$. Consequently, ${V}_{2}^{\omega }\left(5|0.1\right)\approx 0.7928+0.059+0.0056+0.0009\approx 0.8583$.

Similarly, we can calculate ${N}_{\mathrm{2,4}}\left(0.1\right)=2$, ${N}_{\mathrm{2,3}}\left(0.1\right)=1$, and ${N}_{\mathrm{2,2}}\left(0.1\right)={N}_{\mathrm{2,1}}\left(0.1\right)=0$ as well as ${V}_{2}^{\omega }\left(c|0.1\right)$, $c\le 4$ (cf, Table 1). Once again, we observe that ${V}_{2}^{\omega }\left(c|0.1\right)$ increases, and ${V}_{2}^{\omega }\left(c|0.1\right)-{V}_{2}^{\omega }\left(c-1|0.1\right)$ decreases in $c$.

Finally, we numerically derive ${V}_{2}^{\omega }\left(c\right)$, $c\le 5$, and, again, observe that these properties are still intact.

In our example, we have seen that these conditions stayed intact. Now, we want to prove that these conditions indeed hold for every $t\le T$ and any distribution that meets the assumption formulated in Sect. 3.2.

Proposition 3

For every $t$, ${V}_{t}^{\omega }\left(\cdot \right)$ is increasing and concave.

Proof

See Supplement S.1.

Proposition 3 confirms the optimality of prices defined by Proposition 1 in a scenario where base willingness-to-pay is observable. The optimality condition is influenced by two factors: the specific customer type indicated by the observed base willingness-to-pay and opportunity costs. While the former is stochastic and, hence, ex ante unpredictable, the latter is state-dependent and can be determined beforehand. Consequently, understanding the dynamics of opportunity costs is crucial for comprehending the optimal pricing policy. The subsequent proposition illustrates how opportunity costs and the value function evolve over time. Notably, the increase in opportunity costs over time is intriguing, suggesting that optimal marginal prices may also experience an upward trend.

Proposition 4

For every $c$, it holds:

(a)
${\Delta }_{1}{V}_{t}^{\omega }\left(c\right)$ is increasing in $t$
(b)
${V}_{t}^{\omega }\left(c\right)$ is increasing and concave in $t$

Proof

See Supplement S.2.

For a first impression regarding dynamics of optimal prices, we start with a generic look at the optimality condition given by Proposition 1. We will use $\delta$ as variable for opportunity costs, and replace ${\underline{l}}_{j}\left(w\right)$ with $\left( {\frac{{{\Delta }r_{j} }}{w}} \right)^{{\frac{1}{j - 1}}}$. With some algebra, we can reformulate the (sufficient) first-order condition to

$$\frac{\delta }{{\left( {j - 1} \right) \cdot {\Delta }r_{j} }} + \frac{1}{{h_{\lambda } \left( {\left( {\frac{{{\Delta }r_{j} }}{w}} \right)^{{\frac{1}{j - 1}}} } \right) \cdot \left( {\frac{{{\Delta }r_{j} }}{w}} \right)^{{\frac{1}{j - 1}}} }} = \frac{1}{j - 1}.$$

(12)

We will momentarily set aside the fact that $\Delta {r}_{j}$ is our decision variable and consider $\delta$, $w$, and $\Delta {r}_{j}$ as arbitrary variables whose sole purpose is to satisfy the equality in (12). The left side of this equation increases with $\delta$ and $w$, while it decreases with $\Delta {r}_{j}$. To maintain equality, a change in one of these variables must result in a change in at least one of the other two variables. There are several possible combinations of such variations, but we will emphasize three particularly important ones:

(a)
An increase (decrease) of $\delta$ can be compensated by a decrease (increase) of $w$ while keeping $\Delta {r}_{j}$ constant
(b)
An increase (decrease) of $\delta$ can be compensated by an increase (decrease) of $\Delta {r}_{j}$ while keeping $w$ constant
(c)
An increase (decrease) of $w$ can be compensated by an increase (decrease) of $\Delta {r}_{j}$ while keeping $\updelta$ constant

These observations are crucial for understanding in which situations the marginal price for the $j$th unit stays constant, increases, or decreases.

Now, let us consider a specific situation: In state $\left(t,c\right)$ with (marginal) opportunity costs ${\Delta }_{1}{V}_{t-1}^{\omega }\left(c+1-j\right)$, we encounter a particular customer ${w}_{t,c}$, and calculate the corresponding optimal marginal price $\Delta {r}_{t,j}\left(c|{w}_{t,c}\right)$ (based on Eq. (12)). The question of whether we increase (decrease) the marginal price for the $j$th unit in a follow-up state $\left(t-1,c-i\right)$, $i<j$, where (marginal) opportunity costs are ${\Delta }_{1}{V}_{t-2}^{\omega }\left(c-i+1-j\right)$, ultimately depends on the future stochastic customer ${w}_{t-1,c-i}$ we will face.

Hence, we search for the specific customer type $w$ where we maintain optimality of the same marginal price $\Delta {r}_{t,j}\left(c|{w}_{t,c}\right)$ in the follow-up state $\left(t-1,c-i\right)$. With Eq. (12), $w$ must fulfill:

$$\frac{{\Delta }_{1}{V}_{t-2}^{\omega }\left(c-i+1-j\right)}{\left(j-1\right)\cdot\Delta {r}_{t,j}\left(c|{w}_{t,c}\right)}+\frac{1}{{h}_{\lambda }\left({\left(\frac{\Delta {r}_{t,j}\left(c|{w}_{t,c}\right)}{w}\right)}^{\frac{1}{j-1}}\right)\cdot {\left(\frac{\Delta {r}_{t,j}\left(c|{w}_{t,c}\right)}{w}\right)}^{\frac{1}{j-1}}}=\frac{1}{j-1}.$$

Certainly, the possibility of solving this equation in closed-form with respect to $w$ is heavily contingent on the failure rate ${h}_{\lambda }$, and consequently, on the distribution function of $\lambda$. Distributions featuring a simple failure rate, such as the uniform distribution, allow us to formulate a closed-form expression for such a $w$. However, achieving this for every distribution is not possible.

Nevertheless, we can still glean some insights into the characteristics of a scenario where $\Delta {r}_{t,j}\left(c|{w}_{t,c}\right)$ maintains optimality in a follow-up state: As mentioned earlier (observation (a) from above), we observed that an increase (decrease) of opportunity costs can be offset by an appropriate decrease (increase) of $w$ without altering $\Delta {r}_{j}$. The formal description of this observation is provided in the following lemma.

Lemma 3

For $i+j\le c$, it holds

$${\Delta }_{1}{V}_{t-2}^{\omega }\left(c-i+1-j\right)\ge {\Delta }_{1}{V}_{t-1}^{\omega }\left(c+1-j\right)\iff \left(\Delta {r}_{t-1,j}\left(c-i|{w}_{t-1,c-i}\right)=\Delta {r}_{t,j}\left(c|{w}_{t,c}\right)\Rightarrow {w}_{t-1,c-i}\le {w}_{t,c}\right)$$

Proof

This lemma is a formal description of the previous discussion and its results. □

In this section, our discussion has revolved around a scenario where the current customer ${w}_{t,c}$ is observed. Now, let us strive for a more comprehensive understanding of the dynamics of a marginal price that is not contingent on the observation of customer ${w}_{t,c}$.

These dynamics are inherently stochastic since marginal prices in both $\left(t, c\right)$ and $\left(t-1, c-i\right)$ hinge on the arrival of two independent customers, ${w}_{t,c}$ and ${w}_{t-1,c-i}$, with their values being (ex ante) unknown. Nonetheless, we can quantify the probability of marginal prices increasing or decreasing. For the sake of simplicity, let us assume ${\Delta }_{1}{V}_{t-2}^{\omega }\left(c-i+1-j\right)\ge {\Delta }_{1}{V}_{t-1}^{\omega }\left(c+1-j\right)$. Drawing from observation (b), we know that $\Delta {r}_{t-1,j}\left(c-i|{w}_{t,c}\right)\ge\Delta {r}_{t,j}\left(c|{w}_{t,c}\right)$ for any ${w}_{t,c}$. This implies that the marginal unit price in a follow-up state increases compared to the current state, given the same costumer type ${w}_{t,c}$ in both states. Observation (c) further establishes that $\Delta {r}_{t-1,j}\left(c-i|w\right)\ge\Delta {r}_{t-1,j}\left(c-i|{w}_{t,c}\right)$ for every $w\ge {w}_{t,c}$ (with constant $\delta ={\Delta }_{1}{V}_{t-2}^{\omega }\left(c-i+1-j\right)$). This indicates that if the customer type in the follow-up state increases compared to the current one, the marginal unit price increases even further. In summary, it follows ${w}_{t-1,c-i}\ge {w}_{t,c}\Rightarrow\Delta {r}_{t-1,j}\left(c-i|{w}_{t-1,c-i}\right)\ge\Delta {r}_{t,j}\left(c|{w}_{t,c}\right)$, and thus,

$${\mathbb{P}}\left(\Delta {r}_{t-1,j}\left(c-i|{w}_{t-1,c-i}\right)\ge\Delta {r}_{t,j}\left(c|{w}_{t,c}\right)\right)\ge {\mathbb{P}}\left({w}_{t-1,c-i}\ge {w}_{t,c}\right)=\underset{0}{\overset{1}{\int }}\left(\underset{{w}_{t,c}}{\overset{1}{\int }}{f}_{\omega }\left({w}_{t-1,c-i}\right) d{w}_{t-1,c-i}\right) {f}_{\omega }\left({w}_{t,c}\right) d{w}_{t,c}=\underset{0}{\overset{1}{\int }}\left(1-{F}_{\omega }\left({w}_{t,c}\right)\right) {f}_{\omega }\left({w}_{t,c}\right) d{w}_{t,c}=1-\underset{0}{\overset{1}{\int }}{F}_{\omega }\left({w}_{t,c}\right) {f}_{\omega }\left({w}_{t,c}\right) d{w}_{t,c}=1-{\left[\frac{{F}_{\omega }{\left({w}_{t,c}\right)}^{2}}{2}\right]}_{0}^{1}=\frac{1}{2}.$$

This result implies that the probability of the marginal price in the follow-up state being greater than or equal to the marginal price in the current state is at least $\frac{1}{2}$. Analogously, we can conclude that ${\mathbb{P}}\left(\Delta {r}_{t-1,j}\left(c-i|{w}_{t-1,c-1}\right)\le\Delta {r}_{t,j}\left(c|{w}_{t,c}\right)\right)\le \frac{1}{2}$ if ${\Delta }_{1}{V}_{t-2}^{\omega }\left(c-i+1-j\right)\le {\Delta }_{1}{V}_{t-1}^{\omega }\left(c+1-j\right)$.

We are now poised to consolidate all the dynamics related to the optimal pricing policy. Theorem 1(a) and (b) naturally follow from the already outlined dynamics of opportunity costs. They show how optimal marginal prices, quoted to the same customer type $w$, change with $t$ and $c$, respectively. Theorem 1(c) states that customers with a higher base willingness-to-pay encounter higher marginal prices. Lastly, Theorem 1(d) takes the stochasticity of $\omega$ into account. It states that it is more likely for marginal prices to decrease from one state to a follow-up state if the opportunity costs are higher in the follow-up state.

Theorem 1

For every $c, t, j$ and $w$, it holds:

(a)
$\Delta {r}_{t,j}\left(c|w\right)$ is increasing in $t$, ${N}_{t,c}\left(w\right)$ is decreasing in $t$
(b)
$\Delta {r}_{t,j}\left(c|w\right)$ is decreasing in $c$, ${N}_{t,c}\left(w\right)$ is increasing in $c$
(c)
$\Delta {r}_{t,j}\left(c|w\right)$ is increasing in $w$, ${N}_{t,c}\left(w\right)$ is increasing in $w$
(d)
${\Delta }_{1}{V}_{t-2}^{\omega }\left(c-i+1-j\right)\ge {\Delta }_{1}{V}_{t-1}^{\omega }\left(c+1-j\right)\iff {\mathbb{P}}\left(\Delta {r}_{t-1,j}\left(c-i|{w}_{t-1}\right)\ge\Delta {r}_{t,j}\left(c|{w}_{t}\right)\right)\ge \frac{1}{2}$

Proof

In observation (b) regarding (12), we established marginal prices are increasing in marginal opportunity costs. Thus, (a) and (b) immediately follow by Propositions 3 and 4.

(c) This is merely a repetition of observation (c) regarding (12).

(d) Proof can be found above Theorem 1□

4.2 Observable consumption indicator

We now assume the firm is able to observe the value of next customer’s consumption indicator, i.e. realization $l$ of random variable $\lambda$ is known at the moment the firm decides upon prices. Through this partially revealed information about customers’ preferences, the corresponding choice behavior can be more accurately assessed. Nevertheless, there is still uncertainty present as the base willingness-top-pay $\omega$ is still stochastic.

In this section, we follow the same structure as in the previous section: we will discuss the implications of the observable consumption indicator on our customer choice model, reduce the action space to exclude irrelevant prices, solve the resulting optimization model, and show several structural properties regarding optimization model und optimal policy.

In the following, we will often face a similar structure and use similar arguments as with observable base willingness-to-pay. Whenever possible, we try to keep our explanations brief and focus more on the differences. In particular, the characteristics related to the value function, opportunity costs, and dynamics of optimal marginal prices remain consistent when considering an observable consumption indicator. Therefore, Propositions 6 and 7 convey analogous insights to Propositions 3 and 4, respectively. Similarly, Theorem 2 aligns with the conclusions drawn in Theorem 1. Consequently, we will omit detailed explanations and discussion, and generally refer to Sect. 4.1. However, it is important to note that Propositions 6 and 7, as well as Theorem 2, necessitate specific, new proofs due to alterations in the mathematical formulation.

4.2.1 Customer choice and model formulation

We have seen in the previous section that selling at least one unit is a deterministic occurrence with observable base willingness-to-pay. This does not transfer to a setting where the consumption indicator is observable. Every decision a customer might make is now stochastic. To ease notation, we solely focus on customers with $l>0$.

Aiming at utility maximization, necessary conditions for purchasing $j$ units are:

(a)
$\omega \ge \frac{{r}_{j}}{\sum_{k=0}^{j-1}{l}^{k}}$,
(b)
$\omega \ge \frac{{r}_{j}-{r}_{i}}{\sum_{k=i}^{j-1}{l}^{k}}$ for all $i\in \left\{1, 2, \dots , j-1\right\}$, and
(c)
$\omega \le \frac{{r}_{i}-{r}_{j}}{\sum_{k=j}^{i-1}{l}^{k}}$ for all $i\in \left\{j+1, j+2, \dots , c\right\}$.

Analogue considerations as in Sect. 4.1.1 lead to the definition of relevant prices:

Lemma 4

Relevant prices ${\varvec{r}}$ are given by

$${\mathcal{R}}_{c}\left(l\right)=\left\{{\varvec{r}}\in {\mathbb{R}}^{c}: 0\le \frac{{r}_{j}-{r}_{j-1}}{{l}^{j-1}}\le \frac{{r}_{j+1}-{r}_{j}}{{l}^{j}}\le 1 \ {\text{for}} \, 1\le j\le c-1\right\} .$$

Proof

With ${r}_{j}$ such that $\frac{{r}_{j}-{r}_{j-1}}{{l}^{j-1}}=\frac{{r}_{j+1}-{r}_{j}}{{l}^{j}}$, there is only one customer type that is considering purchasing $j$ units: the one with realizations $w,l$ such that $w=\frac{{r}_{j+1}-{r}_{j}}{{l}^{j}}$. As $\omega$ is continuously distributed, the probability of arrival of exactly this customer type is zero. This is sufficient for the purpose of maximizing expected revenue. The same argumentation applies for $j=c$ with $\frac{{r}_{{c}}-{r}_{{\text{c}}-1}}{{{l}}^{{c}-1}}=1$. □

Similarly to our exploration following Lemma 1, we note that $\frac{{r}_{j}-{r}_{j-1}}{{l}^{j-1}}$ represents the minimum value of realization $w\in \left[\mathrm{0,1}\right]$ where a customer would have nonnegative marginal utility for purchasing the $j$th unit. This threshold is important and we define

$$\underline {w}_{j} \left( {r_{j} - r_{j - 1} } \right) = \frac{{r_{j} - r_{j - 1} }}{{l^{j - 1} }}\quad {\text{for}} \ j = 1, \ldots ,c$$

(13)

As established in Lemma 4, these thresholds separate the interval $\left[0, 1\right]$ in well-defined and ordered segments $\left[{\underline{w}}_{j}\left({r}_{j}-{r}_{j-1}\right), {\underline{w}}_{j+1}\left({r}_{j+1}-{r}_{j}\right)\right]$, $0\le j\le c$ (setting ${\underline{w}}_{0}\left({r}_{0}-{r}_{-1}\right)=0$ and ${\underline{w}}_{c+1}\left({r}_{c+1}-{r}_{c}\right)=1$). By definition, and employing the same arguments that led to (9), any customer with $w\in \left[{\underline{w}}_{j}\left({r}_{j}-{r}_{j-1}\right), {\underline{w}}_{j+1}\left({r}_{j+1}-{r}_{j}\right)\right]$ achieves maximum utility when purchasing $j$ units.

To illustrate this point, consider the following scenario: Imagine a customer with a specific observable consumption indicator (e.g., $l=0.8$) and an unobservable base willingness-to-pay $\omega$ with realizations $w\in \left[0, 1\right]$. The firm quotes an arbitrary price vector ${\varvec{r}}$ with ${\varvec{r}}\in {\mathcal{R}}_{c}\left(w\right)$ (for instance, the same price vector depicted in Fig. 2). In Fig. 4, we portray the marginal utility as a function of every possible realization $w\in \left[0, 1\right]$, i.e., $l \mapsto w \cdot l^{j - 1} - \left( {r_{j} - r_{j - 1} } \right)$. We only display the marginal utilities for $j\le 3$.

The marginal utility is a linear function in $w$ with gradient ${l}^{j-1}$. Moreover, the marginal utility at $w=0$ is $-\left({r}_{j}-{r}_{j-1}\right)$. As we employed a linear pricing scheme in this example, every of the portrayed lines start at $-\left({r}_{j}-{r}_{j-1}\right)=-0.45$. We can clearly observe that the interval $\left[0, 1\right]$ (at the red line) is separated in four intervals, $\left[0, {\underline{w}}_{1}\left({r}_{1}-{r}_{0}\right)\right]$, $\left[{\underline{w}}_{1}\left({r}_{1}-{r}_{0}\right), {\underline{w}}_{2}\left({r}_{2}-{r}_{1}\right)\right]$, $\left[{\underline{w}}_{2}\left({r}_{2}-{r}_{1}\right), {\underline{w}}_{3}\left({r}_{3}-{r}_{2}\right)\right]$, and $\left[{\underline{w}}_{3}\left({r}_{3}-{r}_{2}\right), 1\right]$.

Just like the derivation of (9), restricting the action space on ${\mathcal{R}}_{c}\left(l\right)$ has the advantage that the probability of selling $j$ units simplifies to:

$$p_{j} \left( {{\varvec{r}}|l} \right) = \left\{ {\begin{array}{*{20}l} {F_{\omega } \left( {\underline {w}_{j + 1} \left( {r_{j + 1} - r_{j} } \right)} \right) - F_{\omega } \left( {\underline {w}_{j} \left( {r_{j} - r_{j - 1} } \right)} \right)} \hfill & {{\text{for}} \;1 \le j \le c - 1} \hfill \\ {1 - F_{\omega } \left( {\underline {w}_{c} \left( {r_{c} - r_{c - 1} } \right)} \right)} \hfill & {{\text{for}} \;j = c} \hfill \\ \end{array} } \right.$$

The optimization problem with observable consumption indicator is given by

$${V}_{t}^{\lambda }\left(c\right)=\underset{0}{\overset{1}{\int }}\underset{{\varvec{r}}\in {\mathcal{R}}_{c}\left(l\right)}{{\text{max}}}\left\{\sum_{j=1}^{c}{p}_{j}\left({\varvec{r}}|l\right)\cdot \left({r}_{j}-{\Delta }_{j}{V}_{t-1}^{\lambda }\left(c\right)\right)\right\}\cdot {f}_{\lambda }\left(l\right) dl+{V}_{t-1}^{\lambda }\left(c\right)$$

(14)

with boundary conditions ${V}_{0}^{\lambda }\left(c\right)=0$ for $c\ge 0$ and ${V}_{t}^{\lambda }\left(0\right)=0$ for $t\ge 0$. ${V}_{t}^{\lambda }\left(c\right)$ is the optimal expected revenue-to-go from period $t$ onwards (before observing the customer in $t$). In contrast to the general setting, the firm has access to realization $l$ of customers’ consumption indicator $\lambda$ before quoting prices. For every possible $l$, we denote the corresponding optimal batch prices selected in state $\left(t,c\right)$ by ${{\varvec{r}}}_{t}\left(c|l\right)\in {\mathcal{R}}_{c}\left(l\right).$

4.2.2 Solution and structural properties

The maximum number of units we can economically sell depends on the state $(t,c)$ and the realized consumption indicator $l$.

Corollary 2

If ${V}_{t-1}^{\lambda }\left(\cdot \right)$ is increasing and concave, ${N}_{t,c}\left(l\right)=\underset{j=1, \dots ,c}{max} \left\{j: {\Delta }_{1}{V}_{t-1}^{\lambda }\left(c-j+1\right)<{l}^{j-1}\right\}$ denotes the highest number of units that can be economically sold. It holds that $\left\{j: {\Delta }_{1}{V}_{t-1}^{\lambda }\left(c-j+1\right)<{l}^{j-1}\right\}=\left\{1, 2, \dots , {N}_{t,c}\left(l\right)\right\}.$

Proof

${\Delta }_{1}{V}_{t-1}^{\lambda }\left(c-j+1\right)$ is increasing in $j$, while ${l}^{j-1}$ is decreasing. □

In the following, we ignore selling more than ${N}_{t,c}\left(l\right)$ units. Technically, we choose prices ${r}_{j}$ for $j>{N}_{t,c}\left(l\right)$ sufficiently large such that no sell occurs almost surely. This holds, e.g., for ${r}_{j}={l}^{j-1}+{r}_{j-1}$.

By definition, we have ${\Delta }_{1}{V}_{t-1}^{\lambda }\left(c-{N}_{t,c}\left(l\right)+1\right)<{l}^{{N}_{t,c}\left(l\right)-1}$ and ${\Delta }_{1}{V}_{t-1}^{\lambda }\left(c-\left({N}_{t,c}\left(l\right)+1\right)+1\right)\ge {l}^{{N}_{t,c}\left(l\right)}$. Consequently, it follows that ${\Delta }_{1}{V}_{t-1}^{\lambda }\left(\left(c-1\right)-\left({N}_{t,c}\left(l\right)-1\right)+1\right)<{l}^{{N}_{t,c}\left(l\right)-1}$ and ${\Delta }_{1}{V}_{t-1}^{\lambda }\left(\left(c-1\right)-{N}_{t,c}\left(w\right)+1\right)\ge {l}^{{N}_{t,c}\left(l\right)}$, respectively. Utilizing ${\Delta }_{1}{V}_{t-1}^{\lambda }\left(\left(c-1\right)-\left({N}_{t,c}\left(l\right)-1\right)+1\right)<{l}^{{N}_{t,c}\left(l\right)-1}<{l}^{\left({N}_{t,c}\left(l\right)-1\right)-1}$, we can conclude that ${N}_{t,c}\left(l\right)-1\in \left\{j: {\Delta }_{1}{V}_{t-1}^{\lambda }\left(c-j+1\right)<{l}^{j-1}\right\}$, thus establishing ${N}_{t,c-1}\left(l\right)\ge {N}_{t,c}\left(l\right)-1$. Similarly, with ${\Delta }_{1}{V}_{t-1}^{\lambda }\left(\left(c-1\right)-\left({N}_{t,c}\left(w\right)+1\right)+1\right)\ge {\Delta }_{1}{V}_{t-1}^{\lambda }\left(\left(c-1\right)-{N}_{t,c}\left(w\right)+1\right)\ge {l}^{\left({N}_{t,c}\left(l\right)+1\right)-1}$, we deduce that ${N}_{t,c}\left(l\right)+1\notin \left\{j: {\Delta }_{1}{V}_{t-1}^{\lambda }\left(c-j+1\right)<{l}^{j-1}\right\}$, indicating that ${N}_{t,c-1}\left(l\right)<{N}_{t,c}\left(l\right)+1$. Consequently, ${N}_{t,c-1}\left(l\right)$ either equals ${N}_{t,c}\left(l\right)-1$ or ${N}_{t,c}\left(l\right)$. This observation leads to the following remark.

Remark 3

It holds that ${N}_{t,c-1}\left(l\right)\le {N}_{t,c}\left(l\right)\le {N}_{t,c-1}\left(l\right)+1$.

Unlike the previous section where the base willingness-to-pay was observable, there are now two cases to consider for the maximal number of units sold in adjacent states. This introduces additional complexity in our upcoming proofs.

Proposition 5

If ${V}_{t-1}^{\lambda }\left(c\right)$ is increasing and concave in $c$, it holds: In every state $\left(t,c\right)$ and for every $l\in \left[\mathrm{0,1}\right]$ the optimal marginal price $\Delta {r}_{t,j}\left(c|l\right)$ for the $j$th unit, $j=1, \dots , {N}_{t,c}\left(l\right)$, is given by $\Delta {r}_{t,j}\left(c|l\right)={l}^{j-1}\cdot {\underline{w}}_{j}$ with ${\underline{w}}_{j}$ such that ${l}^{j-1}\cdot \left({\underline{w}}_{j}-\frac{1}{{h}_{\omega }\left({\underline{w}}_{j}\right)}\right)={\Delta }_{1}{V}_{t-1}^{\lambda }\left(c+1-j\right)$.

Proof

See Supplement S.3.

Remark 4

For $w\sim U\left[\mathrm{0,1}\right]$, the optimality condition leads to a closed-form solution:

$$\Delta {r}_{t,j}\left(c|l\right)=\frac{1}{2}\cdot \left({l}^{j-1}+{\Delta }_{1}{V}_{t-1}^{\lambda }\left(c+1-j\right)\right).$$

Remark 5

The pricing structure divides customers with the same consumption indicator into groups based on their base willingness-to-pay. The higher a customer’s willingness-to-pay, the more units are being sold. Specifically, a base willingness-to-pay of ${r}_{t,1}$ separates customers who buy nothing at all and customers who purchase at least one unit.

Remark 6

In the supplement (namely S.4), we show that Lemma 2 carries over to Sect. 4.2. Hence, it still holds that the expected margin for selling the $j$th unit is greater than the expected margin for selling the $j+1$th unit.

So far, we found the optimal solution in period $t$ under the condition that the value function in period $t-1$ is increasing and concave in $c$. We will now proof that this condition indeed holds for the whole selling horizon.

Proposition 6

For every $t$, ${V}_{t}^{\lambda }\left(\cdot \right)$ is increasing and concave.

Proof

See Supplement S.5.

In addition to Proposition 6, further structural properties of value function ${V}_{t}^{\lambda }\left(\cdot \right)$ and resulting opportunity costs ${\Delta }_{1}{V}_{t}^{\lambda }\left(c\right)$ are given by the following proposition:

Proposition 7

For every $c$, it holds:

(a)
${\Delta }_{1}{V}_{t}^{\lambda }\left(c\right)$ is increasing in $t$
(b)
${V}_{t}^{\lambda }\left(c\right)$ is increasing and concave in $t$

Proof

See Supplement S.6.

We have seen in the previous section that dynamics of opportunity costs are an important driver to pricing dynamics. Similarly, based on the optimality condition $\Delta {r}_{j}-\frac{1}{{h}_{\omega }\left(\frac{\Delta {r}_{j}}{{l}^{j-1}}\right)}=\delta$, it again holds that optimal marginal prices $\Delta {r}_{j}$ are increasing in customer type $l$ and in opportunity costs $\delta$.

Theorem 2

For every $c, t, j$ and $l$, it holds:

(a)
$\Delta {r}_{t,j}\left(c|l\right)$ is increasing in $t$, ${N}_{t,c}\left(l\right)$ is decreasing in $t$
(b)
$\Delta {r}_{t,j}\left(c|l\right)$ is decreasing in $c$, ${N}_{t,c}\left(l\right)$ is increasing in $c$
(c)
$\Delta {r}_{t,j}\left(c|l\right)$ is increasing in $l$, ${N}_{t,c}\left(l\right)$ is increasing in $l$
(d)
$\Delta {r}_{t,1}\left(c|l\right)$ is independent of $l$
(e)
${\Delta }_{1}{V}_{t-2}^{\lambda }\left(c-i+1-j\right)\ge {\Delta }_{1}{V}_{t-1}^{\lambda }\left(c+1-j\right)\Rightarrow {\mathbb{P}}\left(\Delta {r}_{t-1,j}\left(c-i|{l}_{t-1}\right)\ge\Delta {r}_{t,j}\left(c|{l}_{t}\right)\right)\ge \frac{1}{2}$

Proof

(a)–(d) are immediate results of Propositions 5, 6, and 7.

(e) holds with ${\Delta }_{1}{V}_{t-2}^{\lambda }\left(c-i+1-j\right)\ge {\Delta }_{1}{V}_{t-1}^{\lambda }\left(c+1-j\right)$

$\Rightarrow {\mathbb{P}}\left(\Delta {r}_{t-1,j}\left(c-i|{l}_{t-1,c-1}\right)\ge\Delta {r}_{t,j}\left(c|{l}_{t,c}\right)\right)\ge {\mathbb{P}}\left({l}_{t-1,c-1}\ge {l}_{t,c}\right)=\underset{0}{\overset{1}{\int }}\left(\underset{{l}_{t,c}}{\overset{1}{\int }}{f}_{\lambda }\left({l}_{t-1,c-1}\right) d{l}_{t-1,c-1}\right) {f}_{\lambda }\left({l}_{t,c}\right) d{l}_{t,c}=\underset{0}{\overset{1}{\int }}\left(1-{F}_{\lambda }\left({l}_{t,c}\right)\right) {f}_{\lambda }\left({l}_{t,c}\right) d{l}_{t,c}=1-\underset{0}{\overset{1}{\int }}{F}_{\lambda }\left({l}_{t,c}\right) {f}_{\lambda }\left({l}_{t,c}\right) d{l}_{t,c}=1-{\left[\frac{{F}_{\lambda }{\left({l}_{t,c}\right)}^{2}}{2}\right]}_{0}^{1}=\frac{1}{2}$ □

4.2.3 Special case: uniform distribution

In Remark 5, we provided the closed-form expression of optimal marginal prices. This allows us to compute selling probabilities and the expected revenue, ${V}_{t}^{\lambda }\left(c|l\right)$, for every possible realization $l$ of random consumption indicator $\lambda$. Subsequently, these $l$-dependent expected revenues can be employed to calculate the overall expected revenue ${V}_{t}^{\lambda }\left(c\right)$ of state $\left(t,c\right)$:

$${V}_{t}^{\lambda }\left(c\right)=\frac{1}{4}\cdot \left({\left(1-{\Delta }_{1}{V}_{t-1}^{\lambda }\left(c\right)\right)}^{2}+\frac{1}{2}-2{\Delta }_{1}{V}_{t-1}^{\lambda }\left(c-1\right)+\left(\frac{3}{2}-{\text{ln}}\left({\Delta }_{1}{V}_{t-1}^{\lambda }\left(c-1\right)\right)\cdot {\left({\Delta }_{1}{V}_{t-1}^{\lambda }\left(c-1\right)\right)}^{2}\right)+\sum_{j=3}^{c}\left(\frac{1}{j}-2{\Delta }_{1}{V}_{t-1}^{\lambda }\left(c+1-j\right)-\frac{{\left({\Delta }_{1}{V}_{t-1}^{\lambda }\left(c+1-j\right)\right)}^{2}}{j-2}+\frac{2{\left(j-1\right)}^{2}}{j\left(j-2\right)}{\left({\Delta }_{1}{V}_{t-1}^{\lambda }\left(c+1-j\right)\right)}^{\frac{1}{j-1}}\right)\right)+{V}_{t-1}^{\lambda }\left(c\right)$$

Moreover, we want to point out the special structure of ${r}_{t, j}\left(c|l\right)$: Consisting of $\sum_{k=0}^{j-1}{l}^{k}$ and ${\Delta }_{j}{V}_{t-1}^{\lambda }\left(c\right)$, ${r}_{t, j}\left(c|l\right)$ is increasing in $j$. While the first component is apparently concave in $j$ ($l\in \left[\mathrm{0,1}\right]$), the second component is convex in $j$ (as ${\Delta }_{j}{V}_{t-1}^{\lambda }\left(c\right)=\sum_{k=1}^{j}{\Delta }_{1}{V}_{t-1}^{\lambda }\left(c+1-k\right)$ and ${\Delta }_{1}{V}_{t-1}^{\lambda }\left(c-k\right)$ is increasing in $k$ (cf. Proposition 6)).

4.3 Observable base willingness-to-pay and consumption indicator

In this section, we assume that a firm can observe next customer’s base willingness-to-pay and consumption indicator, i.e. realizations $w$ and $l$ of random variables $\omega$ and $\lambda$, respectively, are known when the firm decides upon prices. Thereby, we eliminate every stochasticity of customers’ behavior and the whole optimization problem becomes deterministic:

$${p}_{j}\left({\varvec{r}}|w, l\right)={1}_{\left\{\underset{i=1,\dots ,c}{{\text{max}}}\left\{w\cdot \sum_{k=0}^{i-1}{l}^{k}-{r}_{i}, 0\right\}=w\cdot \sum_{k=0}^{j-1}{l}^{k}-{r}_{j}\right\}}, 1\le j\le c$$

$${p}_{0}\left({\varvec{r}}|w, l\right)={1}_{\left\{\underset{i=1,\dots ,c}{{\text{max}}}\left\{w\cdot \sum_{k=0}^{i-1}{l}^{k}-{r}_{i}\right\}<0\right\}},$$

for ${\varvec{r}}\in {\mathcal{R}}_{c}\left(w,l\right)=\left\{{\varvec{r}}\in {\mathbb{R}}^{c}: {p}_{0}\left({\varvec{r}}|w, l\right)+\sum_{j=1}^{c}{p}_{j}\left({\varvec{r}}|w, l\right)=1\right\}$. Restricting the action space to ${\mathcal{R}}_{c}\left(w,l\right)$ is a technical decision to make the ${p}_{j}\left({\varvec{r}}|w, l\right)$ work the way it is intended. Otherwise, we would allow for selling a single customer every batch size at once by setting ${r}_{j}=w\cdot \sum_{i=0}^{j-1}{l}^{i}$ for every $j$. Alternatively, we could use a more elaborate definition of ${p}_{j}\left({\varvec{r}}|w, l\right)$ together with a set of assumptions regarding tiebreakers when a customer faces equally good options. As both ways have the same outcome, we preferred to have a simple definition of ${p}_{j}\left({\varvec{r}}|w, l\right)$.

The optimization problem is given by:

$${V}_{t}^{\omega , \lambda }\left(c\right)=\underset{0}{\overset{1}{\int }}\underset{0}{\overset{1}{\int }}\underset{{\varvec{r}}\in {\mathcal{R}}_{c}\left(w,l\right)}{{\text{max}}}\left\{\sum_{j=1}^{c}{p}_{j}\left({\varvec{r}}|w, l\right)\cdot \left({r}_{j}-{\Delta }_{j}{V}_{t-1}^{\omega , \lambda }\left(c\right)\right)\right\}\cdot {f}_{\lambda }\left(l\right){f}_{\omega }\left(w\right) dldw+{V}_{t-1}^{\omega , \lambda }\left(c\right),$$

(15)

with boundary conditions ${V}_{0}^{\omega , \lambda }\left(c\right)=0$ for $c\ge 0$ and ${V}_{t}^{\omega , \lambda }\left(0\right)=0$ for $t\ge 0$. Note that we still calculate expected revenue even though maximizing is now deterministic.

Without eliminating demand, the highest possible batch price ${r}_{j}$ for $j$ units is ${r}_{j}=w\cdot \sum_{i=0}^{j-1}{l}^{i}$. Thus, we are looking for the batch size with the highest possible additional revenue, i.e. ${j}^{*}={\text{arg}}\underset{1\le k\le c}{{\text{max}}}\left\{w\cdot \sum_{i=0}^{k-1}{l}^{i}-{\Delta }_{k}{V}_{t-1}^{\omega , \lambda }\left(c\right)\right\}$. If $w\cdot \sum_{i=0}^{{j}^{*}-1}{l}^{i}-{\Delta }_{{j}^{*}}{V}_{t-1}^{\omega , \lambda }\left(c\right)<0$, we are not able to economically sell something to the current customer. In this case, we prefer not selling anything and pick ${r}_{j}>w\cdot \sum_{i=0}^{j-1}{l}^{i}$ for every $j$. If $w\cdot \sum_{i=0}^{{j}^{*}-1}{l}^{i}-{\Delta }_{{j}^{*}}{V}_{t-1}^{\omega , \lambda }\left(c\right)\ge 0$, we can earn additional revenue. By setting ${r}_{{j}^{*}}=w\cdot \sum_{i=0}^{{j}^{*}-1}{l}^{i}$ and ${r}_{j}>w\cdot \sum_{i=0}^{j-1}{l}^{i}$, $j\ne {j}^{*}$, we ensure ${\varvec{r}}\in {\mathcal{R}}_{c}\left(w,l\right)$ and have the optimal solution for given $w,l$.

Lemma 5

For every $w,l$, the best batch size greater than zero is given by

$$j=\mathit{arg}\underset{1\le k\le c}{\mathit{max}}\left\{w\cdot \sum_{i=0}^{k-1}{l}^{i}-{\Delta }_{k}{V}_{t-1}^{\omega , \lambda }\left(c\right)\right\}.$$

The optimal solution to the maximization in (15) is given by:

${r}_{t,j}\left(c|w,l\right)=w\sum_{i=0}^{j-1}{l}^{i}$ and ${r}_{t,k}\left(c|w,l\right)>w\sum_{i=0}^{k-1}{l}^{i}$, $k\ne j$, if $w\sum_{i=0}^{j-1}{l}^{i}-{\Delta }_{j}{V}_{t-1}^{\omega , \lambda }\left(c\right)\ge 0$
${r}_{t,k}\left(c|w,l\right)>w\sum_{i=0}^{k-1}{l}^{i}$ for every $k$, if $w\sum_{i=0}^{j-1}{l}^{i}-{\Delta }_{j}{V}_{t-1}^{\omega , \lambda }\left(c\right)<0$

Proof

Above Lemma 5.

Even though solving the maximization problem is trivial, calculating ${V}_{t}^{\omega , \lambda }\left(c\right)$ is not. There are many cases to consider, and thus, it is not easy to find for every unit size $j$ the subset of $\left(w, l\right)\in {\left[\mathrm{0,1}\right]}^{2}$ where $j={\text{arg}}\underset{1\le k\le c}{{\text{max}}}\left\{w\cdot \sum_{i=0}^{k-1}{l}^{i}-{\Delta }_{k}{V}_{t-1}^{\omega , \lambda }\left(c\right)\right\}$ as well as $w\cdot \sum_{i=0}^{j-1}{l}^{i}-{\Delta }_{j}{V}_{t-1}^{\omega , \lambda }\left(c\right)\ge 0$.

Again, it is useful to look at marginal prices and opportunity costs: For $j={\text{arg}}\underset{1\le k\le c}{{\text{max}}}\left\{w\cdot \sum_{i=0}^{k-1}{l}^{i}-{\Delta }_{k}{V}_{t-1}^{\omega , \lambda }\left(c\right)\right\}$ it holds that $w\cdot {l}^{j-1}\ge {\Delta }_{1}{V}_{t-1}^{\omega , \lambda }\left(c+1-j\right)$ and $w\cdot {l}^{j}<{\Delta }_{1}{V}_{t-1}^{\omega , \lambda }\left(c-j\right)$. For the time being, this is a necessary but no sufficient condition on $\left(w, l\right)\in {\left[\mathrm{0,1}\right]}^{2}$. It only ensures that selling $j$ units is better than selling $j-1$ and $j+1$ units. Neither does it automatically make $j$ the best batch size nor does it ensure the firm is earning additional revenue, i.e. $w\cdot \sum_{i=0}^{j-1}{l}^{i}-{\Delta }_{j}{V}_{t-1}^{\omega , \lambda }\left(c\right)\ge 0$. Looking at the aforementioned necessary condition, we observe $w\cdot {l}^{i-1}\ge w\cdot {l}^{j-1}\ge {\Delta }_{1}{V}_{t-1}^{\omega , \lambda }\left(c+1-j\right)$, $i\le j$, and $w\cdot {l}^{i}<w\cdot {l}^{j}<{\Delta }_{1}{V}_{t-1}^{\omega , \lambda }\left(c-j\right)$, $i>j$. Assuming a suitable structure of ${\Delta }_{1}{V}_{t-1}^{\omega , \lambda }\left(\cdot \right)$, we can derive the following lemma:

Lemma 6

If ${V}_{t-1}^{\omega , \lambda }\left(\cdot \right)$ is increasing and concave, it holds: $j$ units is the optimal batch size to sell to every customer with $\left(w, l\right)\in {\left[\mathrm{0,1}\right]}^{2}$ such that $w\cdot {l}^{j-1}\ge {\Delta }_{1}{V}_{t-1}^{\omega , \lambda }\left(c+1-j\right),$ and $w\cdot {l}^{j}<{\Delta }_{1}{V}_{t-1}^{\omega , \lambda }\left(c-j\right)$. We define this number by ${N}_{t,c}\left(w, l\right)=\underset{j=1, \dots ,c}{max} \left\{j: {\Delta }_{1}{V}_{t-1}^{\omega , \lambda }\left(c-j+1\right)\le {w\cdot l}^{j-1}\right\}$.

Proof

${V}_{t-1}^{\omega , \lambda }\left(\cdot \right)$ is increasing and concave, thus ${\Delta }_{1}{V}_{t-1}^{\omega , \lambda }\left(c+1-j\right)$ is increasing in $j$. With $\left(w, l\right)\in {\left[\mathrm{0,1}\right]}^{2}$ such that $w\cdot {l}^{j-1}\ge {\Delta }_{1}{V}_{t-1}^{\omega , \lambda }\left(c+1-j\right)$, it holds:

$$w\cdot {l}^{i-1}\ge w\cdot {l}^{j-1}\ge {\Delta }_{1}{V}_{t-1}^{\omega , \lambda }\left(c+1-j\right)\ge {\Delta }_{1}{V}_{t-1}^{\omega , \lambda }\left(c+1-i\right), i\le j,$$

and

$$w\cdot {l}^{i}\le w\cdot {l}^{j}<{\Delta }_{1}{V}_{t-1}^{\omega , \lambda }\left(c-j\right)\le {\Delta }_{1}{V}_{t-1}^{\omega , \lambda }\left(c-i\right), i>j.$$

Finally, we can conclude $\underset{1\le k\le c}{{\text{max}}}\left\{w\cdot \sum_{i=0}^{k-1}{l}^{i}-{\Delta }_{k}{V}_{t-1}^{\omega , \lambda }\left(c\right)\right\}=w\cdot \sum_{i=0}^{j-1}{l}^{i}-{\Delta }_{j}{V}_{t-1}^{\omega , \lambda }\left(c\right)\ge 0$ making $j$ the optimal batch size to sell. □

Remark 7

In Sects. 4.1 and 4.2, ${N}_{t,c}$ served as an upper bound on the number of units a firm could sell economically, a consequence of the uncertainty arising from the unobservable part of customers’ information. During these instances, the firm lacked precise knowledge regarding the actual number of units it might sell to a current customer, but it recognized that overall expected revenues could be optimized by selling up to ${N}_{t,c}$ units. However, in this section, stochasticity is entirely eliminated, and the firm is fully aware of the quantity of units it sells for a given price. Therefore, ${N}_{t,c}$ precisely denotes the number of units a firm sells to a customer to maximize overall expected revenues.

Proof of Lemma 6 also showed that a firm sells in optimality at least $j$ units to a customer with $\left(w, l\right)\in {\left[\mathrm{0,1}\right]}^{2}$ such that $w\cdot {l}^{j-1}\ge {\Delta }_{1}{V}_{t-1}^{\omega , \lambda }\left(c+1-j\right)$. This implies that every such customer is purchasing the $j$th unit. We can make use of this observation to show concavity of ${V}_{t}^{\omega , \lambda }\left(\cdot \right)$ and concentrate on ${V}_{t}^{\omega , \lambda }\left(\cdot |w,l\right)=\sum_{j=1}^{c}{1}_{\left\{w\cdot {l}^{j-1}\ge {\Delta }_{1}{V}_{t-1}^{\omega , \lambda }\left(c+1-j\right)\right\}}\cdot \left(w\cdot {l}^{j-1}-{\Delta }_{1}{V}_{t-1}^{\omega , \lambda }\left(c+1-j\right)\right)+{V}_{t-1}^{\omega , \lambda }\left(\cdot \right)$ for every realization $w$, $l$. In the proof of concavity, we need the following property regarding ${N}_{t,c}\left(w, l\right)$, the number of units a certain customer is purchasing.

Lemma 7

If ${V}_{t-1}^{\omega , \lambda }\left(\cdot \right)$ is increasing and concave, for every $\left(w, l\right)\in {\left[\mathrm{0,1}\right]}^{2}$, it holds that

$${N}_{t,c+1}\left(w, l\right)-1\le {N}_{t,c}\left(w, l\right)\le {N}_{t,c+1}\left(w, l\right).$$

Proof

${V}_{t-1}^{\omega , \lambda }\left(\cdot \right)$ is increasing and concave, thus ${\Delta }_{1}{V}_{t-1}^{\omega , \lambda }\left(c+1-{N}_{t,c}\left(w, l\right)\right)\ge {\Delta }_{1}{V}_{t-1}^{\omega , \lambda }\left(c+2-{N}_{t,c}\left(w, l\right)\right)$. Together with ${w\cdot l}^{{N}_{t,c}\left(w, l\right)-1}\ge {\Delta }_{1}{V}_{t-1}^{\omega , \lambda }\left(c+1-{N}_{t,c}\left(w, l\right)\right)$, it holds that ${N}_{t,c}\left(w, l\right)\le {N}_{t,c+1}\left(w, l\right)$. Based on ${w\cdot l}^{{N}_{t,c}\left(w, l\right)+1}\le {w\cdot l}^{{N}_{t,c}\left(w, l\right)}<{\Delta }_{1}{V}_{t-1}^{\omega , \lambda }\left(c-{N}_{t,c}\left(w, l\right)\right)$, it also holds that ${N}_{t,c}\left(w, l\right)+2>{N}_{t,c+1}\left(w, l\right)$. As ${N}_{t,c}\left(w, l\right)$ and ${N}_{t,c+1}\left(w, l\right)$ are integer, we can use ${N}_{t,c}\left(w, l\right)+2\ge {N}_{t,c+1}\left(w, l\right)+1$ instead. □

We now have everything to state and show the following proposition.

Proposition 8

${V}_{t}^{\omega , \lambda }\left(\cdot \right)$ is increasing and concave.

Proof

See Supplement S.7.

Other dynamics of opportunity costs and value function are given in the following proposition.

Proposition 9

For every $c$, it holds:

(a)
${\Delta }_{1}{V}_{t}^{\omega ,\lambda }\left(c\right)$ is increasing in $t$
(b)
${V}_{t}^{\omega ,\lambda }\left(c\right)$ is increasing and concave in $t$

Proof

See Supplement S.8.

Proposition 9 has an immediate implication on dynamics of optimal marginal prices: As ${\Delta }_{1}{V}_{t}^{\omega ,\lambda }\left(c\right)$ is increasing in $t$, it is less likely that a customer arrives with $w\cdot {l}^{j-1}\ge {\Delta }_{1}{V}_{t}^{\omega ,\lambda }\left(c+1-j\right)$ for higher $t$. Thereby, the probability of selling the $j$th unit ${\mathbb{P}}\left(w\cdot {l}^{j-1}\ge {\Delta }_{1}{V}_{t}^{\omega ,\lambda }\left(c+1-j\right)\right)$ decreases. Moreover, as selling $j$ units is increasingly restricted to customers with high $w$ and $l$ in the optimal solution, the average price ${r}_{t,j}\left(c\right)$ that can be earned by selling $j$ units increases.

We conclude this section with a summary of all dynamics regarding optimal marginal prices we found.

Theorem 3

For every $c, t, w$, and $l$, it holds:

(a)
${r}_{t,j}\left(c|w,l\right)$ is constant in $t$ as long as $j={N}_{t,c}\left(w, l\right),$ ${N}_{t,c}\left(w, l\right)$ is decreasing in $t$
(b)
${r}_{t,j}\left(c\right)$ is increasing in $t$ for every $j$
(c)
${r}_{t,j}\left(c|w,l\right)$ is constant in $c$ as long as $j={N}_{t,c}\left(w, l\right)$, ${N}_{t,c}\left(w, l\right)$ is increasing in $c$
(d)
${r}_{t,j}\left(c\right)$ is decreasing in $c$ for every $j$
(e)
${r}_{t,j}\left(c|w, l\right)$ is increasing in $w$ and $l$ for every $j$, ${N}_{t,c}\left(w, l\right)$ is increasing in $w$ and $l$

Proof

(a)–(e) follow by Lemma 5, 6, Proposition 8, and 9. □

In light of Theorem 3, it is evident that a firm maintains the same price for two customers with identical $w$ and $l$ in adjacent states as long as the optimal batch size ${N}_{t,c}\left(w, l\right)$ remains unchanged (refer to (a) and (c)). However, the optimal batch size tends to decrease over time and increase with capacity. Essentially, the scarcer the product, the smaller the optimal batch size. Additionally, the firm quotes higher prices to customers with higher $w$ or $l$ and tends to increase the offered batch size (cf., (e)).

Moreover, we’ve observed that the average price quoted by a firm for $j$ units increases with $t$ and decreases with $c$ (cf., (b) and (d)). Understanding the dynamics of average prices is advantageous as they are not contingent on a specific customer represented by $w$ and $l$. In any selling process, the realization of a customer stream with specific ${w}_{t}$ and ${l}_{t}$ can lead to counterintuitive price changes (such as raising prices even if the firm did not sell in the previous period). However, on average, the optimal policy adheres to the conventional intuitive structure where prices increase if the product becomes scarcer (due to an increase of $t$ or decrease of $c$).

5 Simulation study

In this section, we compare earned revenues of (up to) four different kinds of observable information:

Full information (FI): Observable base willingness-to-pay $\omega$ and consumption indicator $\lambda$ (refer to Sect. 4.3).

As there is perfect personalized pricing and deterministic customer behavior, this scenario reflects the highest possible revenues earned. We will often refer to this case as upper bound.
Partial information (PI-$\omega$): Observable base willingness-to-pay $\omega$ (refer to Sect. 4.1).

In this scenario, we have no closed-form solution for the optimization problem, and thus, solve it numerically.
Partial information (PI-$\lambda$): Observable consumption indicator $\lambda$ (refer to Sect. 4.2).

In this scenario, we have a closed-form solution if $\omega \sim U\left[0, 1\right]$. Otherwise, we solve it numerically.
No information (NI):

We use heuristic $D$ from Schur (2023) and describe it briefly in Sect. 5.1. This heuristic solves the optimization problem without observable information for $t=1$ optimally, and for $t>1$ approximately.

For our simulation study, we align our setting with Gallego et al. (2020). Accordingly, we set $T=1, \dots , 40$, $C=1, \dots , 120$, and consider $\omega , \lambda \sim U\left[0, 1\right]$. In each state, we employ a random sample of $\mathrm{10,000}$ realizations for both $\omega$ and $\lambda$. Throughout Sect. 5, each presented revenue is derived from this randomized dataset and the corresponding policy generated by one of our mechanisms or heuristics.

In Sect. 5.1, we describe all three heuristics developed in Schur (2023) for the no information case (NI). Specifically, we elaborate on heuristic D, as it has proven to be the best-performing one. In Sect. 5.2, we determine the optimal solution for all four types of observable information: FI, PI-$\omega$, PI-$\lambda$, and NI, across every state $\left(t,c\right)$, $t\le 40, c\le 120$. The pair-wise differences in the resulting expected revenues represent the value of information. For instance, the discrepancy between the revenues of FI and PI-$\omega$ indicates the additional revenue that could be earned if both $\omega$ and $\lambda$ were observable instead of only $\omega$.

Moving to Sect. 5.3, we delve into the impact of the distribution of $\omega$ and $\lambda$. Alongside the uniform distribution, we opt for a (truncated) normal distribution with a mean of $0.5$ and a standard deviation of $0.1$. This introduces two distributions with the same mean but significantly different deviations. In Sect. 5.4, we relax our assumption that parameters can be precisely observed. Instead, we operate with predefined distinct intervals, assuming that the firm can accurately allocate (formerly observable) realizations to these intervals. Finally, in Sect. 5.5, we delve into an additional layer of decision-making. Specifically, we explore the scenario where the firm has the autonomy to determine its initial stock and investigate the implications of allowing the firm to decide on restocking in the middle of the planning horizon.

5.1 Heuristics for the no information case

The following heuristics $E\left(\lambda \right)$, $E\left(\omega \right)$, and D were developed in Schur (2023), and we refer to this work for a detailed analysis. However, we want to shortly explain how these heuristics work and why we chose to employ heuristic D.

Heuristics $E\left(\lambda \right)$ and $E\left(\omega \right)$ share the same underlying idea and rely on the results of our work. In our research, we demonstrated that we can find the optimal solution if we can observe the realization of $\lambda$ (Sect. 4.2) or $\omega$ (Sect. 4.1). The optimal price vectors are dependent on the realization of these random variables, becoming random optimal price vectors. Consequently, we can build the expected value and obtain a price vector known as the expected optimal price in Schur (2023). Both heuristics differ in the realization they use to define these random optimal price vectors. $E\left(\lambda \right)$ employs the realization of $\lambda$, utilizing our work discussed in Sect. 4.2, while $E\left(\omega \right)$ builds on the realization of $\omega$, stemming from our work discussed in Sect. 4.1.

Heuristic D decomposes batches into distinguished units ($1$st, $2$nd, etc.) and separately optimizes prices for each $i$th unit, where $i=1, \dots , c$. This approach utilizes a simplified customer choice behavior and is similar to the one applied in Sects. 4.1 and 4.2 to solve the optimization problem (see, e.g., (11)). However, in our case, we initially introduced this decomposition as an upper bound to our problem and later proved that it yields in the same values and optimal solutions as the original problem. This equivalence does not hold for a setting where neither random variable is observable. In such a scenario, this decomposition does not result in the same values and solutions and does not constitute an upper bound. However, in a simulation study, this heuristic yielded the highest revenues. It is worth noting that $E\left(\omega \right)$ produced almost the same revenues. This could be interpreted as an indication that both heuristics might be relatively close to the (unknown) optimal value. The choice to employ heuristic D in our current work was driven by its demonstrated effectiveness and higher revenue outcomes in comparison to the other two heuristics.

All three heuristics are further enhanced with the help of a fluid approximation. The fluid approximation finds the optimal solutions in states without opportunity costs (i.e., for $t=1$). Additionally, it forms a policy that is asymptotically optimal [refer to, e.g., Schur (2023), Maglaras and Meissner (2006), and Gallego and van Ryzin (1997)] and transfers this property to heuristics it is combined with.

5.2 Value of information

Table 2 shows expected revenues for all kinds of observable information with $C\in \left\{1, 20, 40, 60, 80, 100, 120\right\}$. Scenarios FI and NI are the upper and lower bound, respectively. In between, PI-$\omega$ is outperforming PI-$\lambda$ in every state. For $C=1$, there is just one unit of the product for sale, and thus, no multiunit demand can be served. In this state, PI-$\omega$ is performing like the full information scenario FI, and PI-$\lambda$ like the no information scenario NI. The more capacity, the higher is the importance of attending customers’ demand for more than one unit. This can be seen by comparing mechanisms PI-$\omega$ and PI-$\lambda$. While the absolute difference is increasing for $C\le 100$, we can observe that the relative difference is shrinking between those two scenarios for $C\ge 60$.

Table 2 Revenues for $C\le 120, T=40$

Full size table

To get a clear image regarding the relative value of information, we divide expected revenue of every scenario by upper bound FI. Thereby, we show the percentage of the best possible outcome every kind of information yields.

Figure 5 displays the same order as shown in Table 2, i.e. FI $\ge$ PI-$\omega$ $\ge$ PI-$\lambda \ge$ NI. For a lower amount of capacity ($C\le 20$), mechanisms FI and PI-$\omega$ as well as PI-$\lambda$ and NI are performing similarly with a significant gap between both groups. For a higher amount of capacity ($C\ge 40$), mechanism FI is significantly outperforming PI-$\omega$ while PI-$\lambda$ is marginally better than NI. The gap between PI-$\omega$ and PI-$\lambda$ is decreasing with capacity. However, it is still noticeably large.

These observations lead to the following conclusions: Observing the base willingness-to-pay is considerably more valuable than observing the consumption indicator. However, observing the consumption indicator is not useless. This information adds value in settings where the capacity is only moderately scarce or where the firm is able to also observe the base willingness-to-pay. In the latter case, the increase in revenue is especially large for higher capacity levels (ca. $30\%$ for $C=120$).

5.3 Different distributions

In this section, we explore the impact of the distribution of $\omega$ and $\lambda$ on expected revenues resulting from partial (PI-$\omega$ and PI-$\lambda$) and full information (FI) about customers’ private information. We consider two different distributions: a uniform distribution (denoted as $U\left[\mathrm{0,1}\right]$) and a (truncated) normal distribution with mean of $0.5$ and standard deviation of $0.1$ (denoted as $N\left[\mathrm{0.5,0.1,0}, 1\right]$). Both distributions share the same mean ($0.5$) but have significantly different deviations ($\sqrt{\frac{1}{12}} \approx 0.28$ vs. $0.1$). We investigate every combination of $\omega$ and $\lambda$ following one of the two distributions.

Table 3 presents the results of our simulation study. One noticeable effect is that expected revenues are higher for distributions with higher deviation. This holds true for every kind of observable information (PI-$\omega$, PI-$\lambda$, and FI) as well as for random variables $\omega$ and $\lambda$. However, the magnitude of this effect varies across different scenarios. A smaller deviation of $\omega$, i.e., $\omega \sim N\left[\mathrm{0.5,0.1,0}, 1\right]$ instead of $\omega \sim U\left[\mathrm{0,1}\right]$, has a more (less) significant impact on settings with low (high) capacity $C$. Conversely, for $\lambda$, we observe the opposite effect. Furthermore, the order observed in Sect. 5.1 is validated for every combination of distributions. Notably, for $\omega \sim N\left[\mathrm{0.5,0.1,0}, 1\right]$ and $\lambda \sim U\left[\mathrm{0,1}\right]$, PI-$\lambda$ is very close to PI-$\omega$, and the gap between both mechanisms diminishes for higher $C$.

Table 3 Revenues for $C\le 120, T=40$ and different distributions

Full size table

To provide a clearer overview of the influence of different distributions on different kinds of observable information, we depict the relative performances of PI-$\omega$ and PI-$\lambda$ in comparison to FI in Fig. 6. Once again, it is evident that observing the realization of $\omega$ is more crucial than observing the realization of $\lambda$ in each of the displayed scenarios. The relative difference between both partial information mechanisms is more pronounced for a state with severe scarcity ($C\le T$) than for one with moderate scarcity ($C\ge 2\cdot T$). Moreover, the gap between those two mechanisms is greatest for $\omega \sim U\left[\mathrm{0,1}\right], \lambda \sim N\left[\mathrm{0.5,0.1,0}, 1\right]$ and smallest for $\omega \sim N\left[\mathrm{0.5,0.1,0}, 1\right], \lambda \sim U\left[\mathrm{0,1}\right]$. This emphasizes that observing a random variable with a higher deviation carries more potential than observing a random variable with a lower deviation (although it is still not enough for PI-$\lambda$ to surpass PI-$\omega$ in the latter scenario). Lastly, in states with severe scarcity ($C\le T$), observing $\omega$ is almost as beneficial as observing both $\omega$ and $\lambda$. This is most noticeable in the third and fourth scenarios where $\omega \sim N\left[\mathrm{0.5,0.1,0}, 1\right]$, there is a high chance of a moderate to high realization of $\omega$. For example, there is roughly a $70\%$ chance of observing a realization of $w\ge 0.45$. Thereby, most of the time, it is favorable to sell at least the first unit in every period. As there is not enough capacity to sell more than one unit on average, a second unit is seldom sold in any period (the price of the second unit is going to be quite high, and thus, a second unit is only sold if the realization of $\lambda$ is close to $1$). For $C=T$, this is most apparent. The expected revenue in every period is close to the expected value of $\omega$ ($0.5$), and accordingly, the expected revenue for $\left(C,T\right)=\left(\mathrm{40,40}\right)$ is close to $20$ for PI-$\omega$ and FI (cf. Table 3).

Finally, we have a closer look at the third scenario, i.e., $\omega \sim N\left[\mathrm{0.5,0.1,0}, 1\right], \lambda \sim U\left[\mathrm{0,1}\right]$. We have seen that observing $\lambda$ was almost as good as observing $\omega$ for $C=120$. Indeed, it is evident that observing $\lambda$ becomes more crucial in states with less scarcity. Scarcity can be described by the ratio $T/C$, as less time (i.e., demand) or more capacity decreases scarcity. In our simulation study, scarcity varies from $1/120$ to $120/1$. For each $T\le 40$, we assessed whether PI-$\lambda$ outperforms PI-$\omega$ for some capacity $C\le 120$. We found that for $T\le 27$, there is always a capacity ${C}^{s}\left(T\right)$ such that PI-$\lambda$ outperforms PI-$\omega$ for $C\ge {C}^{s}\left(T\right)$. This ${C}^{s}\left(T\right)$ forms a line with a slope of approximately $4.5$ (cf. Fig. 7). It is worth noting that this slope represents the minimum scarcity for which PI-$\lambda$ outperforms PI-$\omega$.

5.4 Customer segmentation

In this section, we relax our initial assumption that realizations of random variables can be precisely observed. Instead, we consider predefined customer segments and assume the firm can accurately assign arriving customers to these segments. Technically, we divide $\left[0, 1\right]$ into several disjunct intervals, and we assume that the firm can only observe the specific interval to which a realization of the random variable belongs.

There are different approaches to designing $N$ intervals $\left[{a}_{n}, {b}_{n}\right]$, where $n\le N$, with ${b}_{n}={a}_{n+1}$ for $n\le N-1$, ${a}_{1}=0$, and ${b}_{N}=1$. Note that these intervals are almost surely disjunct, which is sufficient in our setting. One approach could be to employ equidistant intervals, i.e., ${b}_{n}-{a}_{n}={b}_{m}-{a}_{m}$ for all $m,n\le N$. Another approach is to use equally likely intervals, i.e., $F\left({b}_{n}\right)-F\left({a}_{n}\right)=F\left({b}_{m}\right)-F\left({a}_{m}\right)$ for all $m,n\le N$. Under a uniform distribution, which is employed in this section, both approaches lead to the same intervals. We assume that the firm can observe the correct interval $\left[{a}_{n}, {b}_{n}\right]$ to which the realization of $\omega$ (PI-$\omega$), $\lambda$ (PI-$\lambda$), or both (FI) belongs.

The firm then utilizes the (conditional) mean of this interval, calculated as $\frac{1}{F\left({b}_{n}\right)-F\left({a}_{n}\right)}{\int }_{{a}_{n}}^{{b}_{n}}x\cdot f\left(x\right) dx$, as an estimate for the unknown precise realization. Unobserved parameters, such as $\omega$ in PI-$\lambda$, are treated as random variables. Employing such an estimate transforms our mechanisms (PI-$\omega$, PI-$\lambda$, and FI) into heuristics (H-$\omega$, H-$\lambda$, and H-FI), resulting in calculated revenues (based on the estimate) that may differ from simulated revenues (based on realizations).

Moreover, we adapted the main idea behind heuristic D from Schur (2023) to create another heuristic designed to work with truncated uniform distributions. We made two modifications to the original formulation of D: First, the underlying uniform distribution is no longer required to be $U\left[0, 1\right]$ but can be truncated on any interval $\left[{a}_{n}, {b}_{n}\right]$, i.e. $U\left[{a}_{n}, {b}_{n}\right]$. Second, we omitted the part involving the fluid approximation as we lacked the necessary analytical results to efficiently solve it for truncated uniform distributions.

We implemented three versions of this heuristic, namely, D-$\omega$, D-$\lambda$, and D-FI. For these versions, we assume the observation of the correct interval $\left[{a}_{n}, {b}_{n}\right]$ to which the realization of $\omega$ (D-$\omega$), $\lambda$ (D-$\lambda$), or both (D-FI) belongs, and utilize the truncated probability distribution $U\left[{a}_{n}, {b}_{n}\right]$ for the corresponding random variable.

In our simulation study, we examine six scenarios resulting from a combination of three different kinds of observable information ($\omega$, $\lambda$, or both) and two different degrees of customer segmentation (size of $N$). We chose a very low $N$ ($=2$) and a medium-sized $N$ ($=5$). Apparently, for a large $N$, we would obtain almost identical results to those presented in Sect. 5.1. In each scenario, we apply two heuristics, one from our mechanisms with the corresponding estimate (H-$\omega$, H-$\lambda$, and H-FI) and one of the three versions of D (D-$\omega$, D-$\lambda$, and D-FI).

The findings from our simulation study are presented in Table 4, with each column corresponding to one of the six scenarios and showcasing the revenues generated by the respective H and D heuristics. A notable observation emerges: consistently, D outperforms H. This suggests that neglecting uncertainty in observed parameters (by assuming an estimate instead of a random variable on a truncated distribution) is more detrimental than substituting the true customer choice model with a simplified version.

Table 4 Revenues for $C\le 120, T=40$ under customer segmentation

Full size table

In D, the order of the value of observable information aligns with the one presented in Sect. 5.2. However, this is not the case for H. Specifically, H-$\omega$ surpasses H-FI and H-$\lambda$ for $N=2$ and $N=5$, and H-$\lambda$ outperforms H-FI for $N=2$. The descending order of performance in H-FI provides further evidence of the adverse effects of replacing truncated distributions with estimates, given that two random variables in H-FI are replaced by estimates.

Unsurprisingly, simulated revenues exhibit an upward trend with a more detailed observation of customer segments, applicable to both H and D. To better grasp the impact of the granularity in customer segmentation, a comparison between simulated revenues for D (presented in Table 4) and those of Table 2 is helpful. The simulated revenues for FI, P-$\omega$, and P-$\lambda$ in Table 2 represent an upper bound, stemming from a scenario where the exact realization was observable, akin to a scenario with $N=\infty$.

The performance of D under $N=2$ (dashed line) and $N=5$ (solid line) in relation to their respective upper bounds is visually depicted in Fig. 8. It becomes evident that segmenting customers based solely on their consumption indicator (dark green) is notably more robust than segmenting based on their base willingness-to-pay (light green) or a combination of both parameters (blue). However, an $\omega$-based costumer segmentation consistently achieves over $90\%$ of its upper bound. Furthermore, it results in considerably higher simulated revenues than a $\lambda$-based customer segmentation (cf. Table 4). This emphasizes the significance of observing $\omega$—the finer the granularity, the better the results.

5.5 Stocking and restocking

In this section, we introduce an additional layer of decision-making: stocking. Specifically, we consider the firm’s ability to determine the initial stocking level. Furthermore, in an extended scenario, we allow the firm to replenish its stock in the middle of the planning horizon at $t=20$.

For both decisions, we assume that the firm incurs constant unit acquisition costs denoted by $s$. Therefore, at the beginning of the planning horizon ($T=40$), the firm must expend $C\cdot s$ to acquire a stock of $C$ units. Consequently, the optimization of the initial decision is expressed as:

$$\underset{{\varvec{C}}\in {\mathbb{Z}}}{{\text{max}}}\left\{{V}_{T}\left(C\right)-C\cdot s\right\}$$

(16)

Within this maximization framework, ${V}_{T}\left(C\right)$ can be substituted with ${V}_{T}^{\omega , \lambda }\left(C\right)$, ${V}_{T}^{\omega }\left(C\right)$, or ${V}_{T}^{\lambda }\left(C\right)$, depending on the type of information that is observable.

Additionally, in the restocking scenario the firm has the flexibility to determine the restocking quantity between customers in periods $t=21$ and $t=20$. This decision is based on $\underset{{\varvec{x}}\in {\mathbb{Z}}}{{\text{max}}}\left\{{V}_{20}\left(c+x\right)-x\cdot s\right\}$ with $x$ denoting the restocking quantity. By additionally updating ${V}_{20}\left(c+x\right)$ with $\underset{{\varvec{x}}\in {\mathbb{Z}}}{{\text{max}}}\left\{{V}_{20}\left(c+x\right)-x\cdot s\right\}$, we proactively incorporate the possibility of restocking between $t=20$ and $t=21$ into our pricing decisions for $t\ge 21$. This adaption results in a new optimal policy that leans slightly towards selling more units between $t=20$ and $t=40$, as scarcity can be mitigated through the restocking option.

Table 5, displays simulated profits for stocking and restocking scenarios. We examine three distinct acquisition costs ($s\in \left\{0.3, 0.4, 0.5\right\}$). However, the integration of a restocking option notably amplifies overall profit, showcasing an improvement of up to $6\%$ (observable $\omega$, $s=0.5$). Additionally, in the restocking scenario, the optimal initial stock is consistently lower compared to the stocking in a scenario without restocking.

Table 5 Profits for $C\le 120, T=40$ under stocking and restocking

Full size table

6 Conclusion

In this study, we delved into a dynamic pricing framework encompassing multiunit demands, driven by customers’ base willingness-to-pay and consumption indicator. Our exploration considered three scenarios, each involving the firm’s observation of the current customer’s base willingness-to-pay, consumption indicator, or both. We found the optimality condition for each case. For the second (under uniform distribution) and third case, we derived a closed-form expression of the optimal batch prices.

In contrast to standard singleunit dynamic pricing with time-homogenous demand, economically selling is not always possible in our multiunit dynamic pricing context. In particular, larger batches were frequently priced-out, as convex increasing opportunity costs tended to surpass concave increasing willingness-to-pay. This stands in contrast to singleunit dynamic pricing, where there always exists a price at which the firm can increase its overall expected revenue.

We showed well-known monotonicity in time and capacity holds for all cases, inducing an intuitive structure with regard to scarcity of the product, and ensuring the existence of a unique optimal solution. By solving all cases to optimality, we calculated the value of all three types of information a firm might obtain from profiling its current customer. Additionally, we analyzed the impact of customer segmentation when precise observation of customers’ private information is unattainable. With this knowledge, a firm gains the ability to assess the profitability of potential investments in customer profiling and segmentation. Furthermore, we provide guidance on leveraging our results to make informed decisions regarding optimal initial stocking and restocking strategies.

References

Akçay Y, Natarajan HP, Xu SH (2010) Joint dynamic pricing of multiple perishable products under consumer choice. Manag Sci 56(8):1345–1361. https://doi.org/10.1287/mnsc.1100.1178
Article Google Scholar
Armstrong M (2016) Nonlinear pricing. Annu Rev Econ 8(1):583–614. https://doi.org/10.1146/annurev-economics-080614-115650
Article Google Scholar
Banciu M, Mirchandani P (2013) Technical note—new results concerning probability distributions with increasing generalized failure rates. Oper Res 61(4):925–931. https://doi.org/10.1287/opre.2013.1198
Article Google Scholar
Baucells M, Sarin RK (2007) Satiation in discounted utility. Oper Res 55(1):170–181. https://doi.org/10.1287/opre.1060.0322
Article Google Scholar
Bitran G, Caldentey R (2003) An overview of pricing models for revenue management. Manuf Serv Oper Manag 5(3):203–229. https://doi.org/10.1287/msom.5.3.203.16031
Article Google Scholar
Braden DJ, Oren SS (1994) Nonlinear pricing to produce information. Mark Sci 13(3):310–326. https://doi.org/10.1287/mksc.13.3.310
Article Google Scholar
Chen M, Chen Z-L (2015) Recent developments in dynamic pricing research: multiple products, competition, and limited demand information. Prod Oper Manag 24(5):704–731. https://doi.org/10.1111/poms.12295
Article Google Scholar
Chiang WC, Chen JC, Xu X (2007) An overview of research on revenue management: current issues and future research. Int J Revenue Manag 1(1):97. https://doi.org/10.1504/IJRM.2007.011196
Article Google Scholar
den Boer AV (2015) Dynamic pricing and learning: historical origins, current research, and new directions. Surv Oper Res Manag Sci 20(1):1–18. https://doi.org/10.1016/j.sorms.2015.03.001
Article Google Scholar
Dhebar A, Oren SS (1986) Dynamic nonlinear pricing in networks with interdependent demand. Oper Res 34(3):384–394. https://doi.org/10.1287/opre.34.3.384
Article Google Scholar
Dong L, Kouvelis P, Tian Z (2009) Dynamic pricing and inventory control of substitute products. Manuf Serv Oper Manag 11(2):317–339. https://doi.org/10.1287/msom.1080.0221
Article Google Scholar
Elmaghraby W, Gülcü A, Keskinocak P (2008) Designing optimal preannounced markdowns in the presence of rational customers with multiunit demands. Manuf Serv Oper Manag 10(1):126–148. https://doi.org/10.1287/msom.1070.0157
Article Google Scholar
Gallego G, van Ryzin G (1994) Optimal dynamic pricing of inventories with stochastic demand over finite horizons. Manag Sci 40(8):999–1020. https://doi.org/10.1287/mnsc.40.8.999
Article Google Scholar
Gallego G, van Ryzin G (1997) A multiproduct dynamic pricing problem and its applications to network yield management. Oper Res 45(1):24–41. https://doi.org/10.1287/opre.45.1.24
Article Google Scholar
Gallego G, Li MZF, Liu Y (2020) Dynamic nonlinear pricing of inventories over finite sales horizons. Oper Res 68(3):655–670. https://doi.org/10.1287/opre.2019.1891
Article Google Scholar
Goldman MB, Leland HE, Sibley DS (1984) Optimal nonuniform prices. Rev Econ Stud 51(2):305. https://doi.org/10.2307/2297694
Article Google Scholar
Gönsch J, Klein R, Neugebauer M, Steinhardt C (2013) Dynamic pricing with strategic customers. J Bus Econ 83(5):505–549. https://doi.org/10.1007/s11573-013-0663-7
Article Google Scholar
Iyengar R, Jedidi K (2012) A conjoint model of quantity discounts. Mark Sci 31(2):334–350. https://doi.org/10.1287/mksc.1110.0702
Article Google Scholar
Lariviere MA (2006) A note on probability distributions with increasing generalized failure rates. Oper Res 54(3):602–604. https://doi.org/10.1287/opre.1060.0282
Article Google Scholar
Levin Y, Nediak M, Bazhanov A (2014) Quantity premiums and discounts in dynamic pricing. Oper Res 62(4):846–863. https://doi.org/10.1287/opre.2014.1285
Article Google Scholar
Maglaras C, Meissner J (2006) Dynamic pricing strategies for multiproduct revenue management problems. Manuf Serv Oper Manag 8(2):136–148. https://doi.org/10.1287/msom.1060.0105
Article Google Scholar
Phillips RL (2005) Pricing and revenue optimization (Reprinted with corr). Stanford Business Books
Raghu TS, Kannan PK, Rao HR, Whinston AB (2001) Dynamic profiling of consumers for customized offerings over the Internet: a model and analysis. Decis Support Syst 32(2):117–134. https://doi.org/10.1016/S0167-9236(01)00106-3
Article Google Scholar
Schur R (2024) Asymptotically optimal solutions for nonlinear dynamic pricing in the presence of multiunit demand. Working Paper. Advance online publication. https://doi.org/10.13140/RG.2.2.18970.11207
Stole LA (2007) Chapter 34: price discrimination and competition. In: Handbook of industrial organization, vol 3, pp 2221–2299. https://doi.org/10.1016/S1573-448X(06)03034-2
Talluri KT, van Ryzin GJ (2004) The theory and practice of revenue management. Springer New York, NY. https://doi.org/10.1007/b139000
Wilson R (1993) Nonlinear pricing (1. Publ). Oxford University Press. http://www.loc.gov/catdir/enhancements/fy0639/91032603-d.html
Zhang D, Cooper WL (2009) Pricing substitutable flights in airline revenue management. Eur J Oper Res 197(3):848–861. https://doi.org/10.1016/j.ejor.2006.10.067
Article Google Scholar
Ziya S, Ayhan H, Foley RD (2004) Relationships among three assumptions in revenue management. Oper Res 52(5):804–809. https://doi.org/10.1287/opre.1040.0134
Article Google Scholar

Download references

Acknowledgements

The author would like to thank the anonymous referees for their valuable suggestions and feedback which contributed to an improved quality of the results of the paper.

The author acknowledges support by the Open Access Publication Fund of the University of Duisburg-Essen.

Funding

Open Access funding enabled and organized by Projekt DEAL.

Author information

Authors and Affiliations

Chair of Production and Logistics Planning, Mercator School of Management, University of Duisburg-Essen, Lotharstraße 65, 47057, Duisburg, Germany
Rouven Schur

Authors

Rouven Schur
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Rouven Schur.

Ethics declarations

Conflict of interest

The author has no competing interests to declare that are relevant to the content of this article.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Below is the link to the electronic supplementary material.

Supplementary file1 (DOCX 69 KB)

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Schur, R. Multiunit dynamic pricing with different types of observable customer information. OR Spectrum 46, 589–636 (2024). https://doi.org/10.1007/s00291-024-00759-x

Download citation

Received: 31 March 2023
Accepted: 26 February 2024
Published: 08 April 2024
Issue Date: June 2024
DOI: https://doi.org/10.1007/s00291-024-00759-x

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Multiunit dynamic pricing with different types of observable customer information

Abstract

Similar content being viewed by others

Supplier selection and order allocation: a literature review

A perspective analysis of obligatory vacation and retention of impatient purchaser on queueing-inventory with retrial policy

A strategic analysis of timing of wholesale pricing and information sharing strategy in dual-channel retailing

1 Introduction

2 Literature review

3 Problem definition

3.1 General setting and notation

3.2 Customer choice model

3.3 Dynamic programming formulation

4 Different types of observable information

4.1 Observable base willingness-to-pay

4.1.1 Customer choice and model formulation

Lemma 1

Proof

4.1.2 Solution and structural properties

Corollary 1

Proof

Remark 1

Proposition 1

Proof

Remark 2

Proposition 2

Proof

Lemma 2

Proof

Proposition 3

Proof

Proposition 4

Proof

Lemma 3

Proof

Theorem 1

Proof

4.2 Observable consumption indicator

4.2.1 Customer choice and model formulation

Lemma 4

Proof

4.2.2 Solution and structural properties

Corollary 2

Proof

Remark 3

Proposition 5

Proof

Remark 4

Remark 5

Remark 6

Proposition 6

Proof

Proposition 7

Proof

Theorem 2

Proof

4.2.3 Special case: uniform distribution

4.3 Observable base willingness-to-pay and consumption indicator

Lemma 5

Proof

Lemma 6

Proof

Remark 7

Lemma 7

Proof

Proposition 8

Proof

Proposition 9

Proof

Theorem 3

Proof

5 Simulation study

5.1 Heuristics for the no information case

5.2 Value of information

5.3 Different distributions

5.4 Customer segmentation

5.5 Stocking and restocking

6 Conclusion

References

Acknowledgements

Funding