Minimum quality standards and benchmarking in differentiated duopoly

We study a two-period model of a duopoly with goods differentiated by quality. The periods’ length corresponds to the goods’ useful lifespan, and consumers are heterogeneous in their valuation of quality. In the second period, the regulator fixes a minimum quality standard based either on the quality supplied by the high-quality firm in the first period (strict regulation) or on the average quality supplied in the first period (average regulation). Assuming a covered market, we show that such an approach leads to decreasing qualities in the first period, and increasing qualities in the second one. In both periods, net utility aggregated over consumers is increasing and profits aggregated over firms are decreasing. Taken together, average regulation always leads to an increase in the present value of welfare, whereas strict regulation can cause a decline. If the discount factor exceeds a certain threshold, a policy based on average regulation is even superior to implementing the optimal minimum quality standard already in the first period.


Introduction
We study the welfare effects of introducing a minimum quality standard (MQS) based on a benchmarking mechanism into a two-period model of duopoly where products are differentiated by quality. The related literature traces back to the seminal contributions of Ronnen (1991) and Crampes and Hollander (1995) who assume that the demand side is described by a continuum of consumers differing by the value that they assign to quality, and the supply side is given by two firms 1 3 each offering one single quality. Both firms share the same technology where costs depend on quality. Under the assumption of quality competition in the first stage and price competition in the second stage, the authors show that an appropriately fixed MQS can increase welfare since it intensifies price competition by limiting product differentiation. Subsequently, this model has been modified and extended in several ways. For a detailed overview see Michaelis and Ziesemer (2017, pp. 620).
In contrast to the advanced theoretical topics dealt with in the recent MQS-literature, 1 we are concerned with the more basic question of how to fix an appropriate MQS in practical policies. Of course, in a first-best world, it should be fixed in such a way that it maximizes welfare. In practice, however, any such attempt would be hardly feasible due to the regulator's limited information about preferences and technologies. Therefore, we will analyze an alternative approach that relies on a simple benchmarking procedure. We borrowed this idea from the Japanese Top Runner Program (JTRP) which aims at enhancing the energy-efficiency of energy-consuming goods. The JTRP covers 31 products and obliges their producers to establish the currently highest efficiency level for the respective good by a certain target year (see, e.g., Kimura 2014, METI 2015. Applied to the topic of our paper, this mechanism can be described by a two-period model where the regulator introduces an MQS in period t = 2 which is based on the qualities offered in period t = 1 . We consider two different cases: Under strict regulation, which is perfectly in line with the JTRP, the MQS is fixed according to the quality chosen by the high-quality firm in t = 1 . Under average regulation, which is a softened variant of the JTRP, the MQS is fixed according to the average quality offered in t = 1. The complete time sequence of our model is shown in Fig. 1. If we consider only the first period in Fig. 1, the MQS to be introduced in the second period is irrelevant. In this case, our approach corresponds to the standard setup of a differentiated duopoly with Bertrand competition (e.g., Shaked and Sutton 1982). If we consider only the second period, the MQS is exogenously fixed and our approach resembles the standard set-up used for analyzing the impacts of an MQS in a differentiated duopoly with Bertrand competition (e.g., Crampes and Hollander, 1995). With both periods taken together, however, the MQS becomes endogenous and depends on the firms' decisions in the first period. In contrast to this, the models employed in the literature usually assume that the MQS either is completely exogenous or results from welfare maximization by a social planner. In virtually none of these models are the firms able to influence the MQS by their choice of quality since the regulator is the first mover and the firms only react to the standard imposed. The only exception is a paper by Lutz et al. (2000) where the high-quality firm moves first and makes a sunk investment in quality which influences the standard resulting from welfare maximization by the regulator. Hence, our model is related to Lutz 1 3 The Japanese Economic Review (2022) 73: 515-537 et al. (2000) but we replace the demanding procedure of welfare maximization by a simple benchmarking approach which can be implemented in practice more easily. 2 The economic rationale behind the timing of regulation in our model results from the observation that practical policies often involve a considerable delay between the legal adoption and the actual implementation of a new standard (for empirical evidence see Lutz et al. 2000). Normally, in this transition period the firms are no longer able to influence the forthcoming regulation. In contrast, within a benchmarking approach according to the JTRP, the firms have ample time to shape the impending standard by altering the qualities supplied. This, of course, gives rise to strategic decision-making. In particular, there is an incentive to lower quality in t = 1 to dampen the standard to be introduced in t = 2 . Nevertheless, the following analysis shows that a benchmarking approach based on average regulation can increase welfare in both periods. The underlying effects are an increase in consumers' utility that always outweighs the decrease in firms' profits. In contrast, under strict regulation the forthcoming standard is too severe such that the gains in welfare are smaller or even negative depending on the discount factor applied by the firms. For the case of more than two firms, we find in line with the basic literature on the effects of an MQS in oligopoly (Scarpa 1998;Pezzino 2010) that welfare will decrease under both regulatory schemes.
The remainder of our paper is organized as follows. In Sect. 2, we introduce the model and in Sect. 3, we calculate the unregulated equilibrium. In Sect. 4, we derive the regulated equilibria for two different benchmarking approaches. In Sect. 5, we analyze the resulting welfare effects and compare them to the outcome obtained from applying the optimal MQS. In Sect. 6, we briefly consider the case of more than two firms, and finally in Sect. 7 we discuss our main conclusions.

The model
Due to the complexity added by the dynamics to be considered below, we keep our model as simple as possible. 3 The time horizon covers two periods t = 1, 2 where the length of each period corresponds to the useful lifespan of the considered goods. There exist two firms j = H, L each producing a single variant differentiated by quality q jt ≥ 0 . Without loss in generality, we assume q Ht ≥ q Lt , i.e., firm H is the high-and firm L is the low-quality producer. The price of variant j in period t is denoted by p jt . Both firms share the same technology where costs per unit are independent of quantity but increasing in quality. In line with several other studies on MQS, 4 we assume quadratic unit costs c q jt = q 2 jt with > 0.

3
The demand side consists of a unit mass of consumers indexed by i , each buying one unit per period. The net utility of consumer i buying variant j in period t is given by u + i q jt − p jt . The term u indicates a baseline utility independent of quality which is assumed to be sufficiently large to ensure that the market is always completely covered (see, e.g., Kuhn 2007). 5 Moreover, i which is distributed uniformly on the interval i ∈ [a, a + z] with a ≥ 0 and z > 0 represents the individual valuation of quality by the consumer i . Solving the equation u + i q Ht − p Ht = u + i q Lt − p Lt for i yields the indifferent consumer's position in period t : ̂t = p Ht − p Lt ∕ q Ht − q Lt . The accompanying market shares are given by s Lt = ̂t − a ∕z for firm L and s Ht = 1 − s Lt = a + z −̂t ∕z for firm H.
In each period, firms compete in two stages: In the quality game in stage one, they simultaneously choose qualities q jt , and in the price game in stage two they simultaneously choose prices p jt . In period t = 1 there is no regulation of quality, whereas in period t = 2 an MQS denoted by q is introduced. Strict regulation implies q = q H1 and average regulation implies q = q L1 + q H1 ∕2 . The solution concept is subgame perfect equilibrium, i.e., we solve the model backwards.

Unregulated equilibrium
Without regulation, there are no dynamic effects and the equilibria in both periods will be identical. Hence, it suffices to calculate the equilibrium for a representative period t . As can easily be checked for the price game in the second stage, Period t=1 P eriod t=2 t Step 1 Step 2 Step 3 Step 4 Step 5 Step 6 Step 7 Step 8 Step 5: Regulator calculates the standard and obliges the firms to comply with it Step 6: Firms simultaneously decide on qualities Step 7: Firms simultaneously decide on prices Step 8: Consumers decide which product to buy Step 1: Information on the impending standard and its calculation becomes public Step 2: Firms simultaneously decide on qualities Step 3: Firms simultaneously decide on prices Step 4: Consumers decide which product to buy Fig. 1 Time sequence of the model 5 The implications of assuming a covered market will be discussed in more detail in sections 5 and 7. We are aware that this assumption considerably limits the scope of our analysis. However, if we allow for an uncovered market, our model is solvable only in numerical applications.

3
The Japanese Economic Review (2022) 73:515-537 maximizing the firms' profits jt = s jt p jt − q 2 jt and solving the resulting reaction functions for the prices p jt yields: Inserting (1) and (2) into ̂t as calculated in Sect. 2 yields the indifferent consumer's position solely in terms of qualities: ̂t q Ht , q Lt = 2a + z + q Ht + q Lt ∕3 . The corresponding market shares are: Hence, a duopoly equilibrium with both firms active in the market (which is the one of interest in this paper) requires a − z < q Ht + q Lt < a + 2z such that s jt q Ht , q Lt > 0 for j = H, L . In the next step, the reduced profit functions that we need for the quality game in the first stage can be calculated by inserting (1) to (4) into jt = s jt p jt − q 2 jt : The first-order conditions jt q Ht , q Lt ∕ q jt = 0 lead to the following reaction functions: Due to q Lt q Ht ∕ q Ht > 0 and q Ht q Lt ∕ q Lt > 0 qualities are strategic complements. Solving (7) and (8)  (1) p Lt q Ht , q Lt = (z − a) q Ht − q Lt + q 2 Ht + 2q 2 Lt 3 (2) p Ht q Ht , q Lt = (a + 2z) q Ht − q Lt + 2q 2 Ht + q 2 Lt 3 (3) s Lt q Ht , q Lt = z − a + q Ht + q Lt 3z (4) s Ht q Ht , q Lt = 2z + a − q Ht + q Lt 3z Ht q Ht , q Lt = q Ht − q Lt a + 2z − q Ht + q Lt 2 9z.

3
In the following, we concentrate on the more interesting case of an interior solution with z ≤ 4a . In the next step, inserting q o Lt and q o Ht into (1) and (2) yields the accompanying prices p o Lt = 8a(2a − z) + 25z 2 ∕64 and p o Ht = 8a(2a + 5z) + 49z 2 ∕64 . Moreover, it can easily be calculated that the indifferent consumer is located at the center of the market, i.e. ̂o t = a + (z∕2) , and the market shares are s o jt = 1∕2 with profits of o jt = 3z 2 ∕16 for j = H, L . Hence, everything else equal, the higher the degree of heterogeneity in consumers' preferences (given by the parameter z ), the higher the firms' profits are. The economic explanation is straightforward: Preferences that are more heterogeneous increase the importance of differences in quality from the viewpoint of consumers. This, in turn, relaxes price competition and increases profits (see, e.g., Shaked and Sutton 1982).

Regulated equilibria
Before analyzing the equilibria for strict regulation with q s = q H1 and average regulation with q a = q L1 + q H1 ∕2 , it is useful to consider the general effects of introducing a binding standard q > q o Lt in period t . Concerning the price game in the second stage, there is no difference from the unregulated case analyzed above. However, in the quality game in the first stage, the standard forces firm L to increase its quality up to q Lt q = q and firm H will decide for q Ht q = a + 2z + q ∕3 according to its reaction function (8). Inserting q Lt q and q Ht q into the reduced profit functions (5) and (6) yields the profits solely in terms of q: Comparing Ht q with the unregulated profit o Ht and taking into account q > q o Lt reveals Ht q < o Ht such that firm H is worse off under the standard. In contrast, Hence, firm L gains from the standard as long as it is not too severe and satisfies the condition q <q . The economic intuition is based on two effects: First, both firms suffer since the standard limits the scope for product differentiation and tightens price competition. Second, as already emphasized by Ronnen (1991, p.500) and Crampes and Hollander (1995, p.76), the standard applied enables firm L to commit to quality which has the same effect as granting a first-mover advantage to it. If the standard satisfies q <q , firm L ′ s advantage from the commitment effect outweighs its disadvantage from the tightened price competition and its profit increases.

Strict regulation
Replacing q in (9) and (10) for t = 2 by the strict standard q s = q H1 yields the firms' second-period profits solely as a function of firm H ′ s decision on quality in period t = 1: With their decisions in t = 1 , both firms j = H, L aim at maximizing the present value of total profits Π s j q H1 , q L1 ∶= j1 q H1 , q L1 + ⋅ s j2 q H1 . 8 The parameter ∈ (0, 1] indicates the discount factor. In the following, we use the abbreviation ∶= √ 9 − 8 for simplifying terms. It should carefully be noted that is a strictly decreasing function of with ∈ [1, 3).
From the first-order conditions, Π s j q H1 , q L1 ∕ q j1 = 0 we derive the reaction functions q s L1 q H1 = a − z + q H1 ∕3 and q s H1 q L1 = (a + 2z) + 3 q L1 ∕ (3 + 2 ) . For firm L we obtain the same reaction function as in the unregulated case because L is not able to influence the second-period outcome via the choice of q L1 . In contrast, the new reaction function of firm H implies that for any given q L1 firm H will choose a lower level of quality than in the unregulated case: q s H1 q L1 < q o H1 q L1 . The economic rationale is obvious since everything else equal the profits of firm H in period t = 2 are the lower, the higher is q s . Next, solving q s L1 q H1 and q s H1 q L1 for q j1 yields the qualities resulting in period t = 1 under strict regulation 9 : In the following, we again concentrate on the more interesting case of an interior solution. 10 Comparing q s j1 with the outcome of the unregulated case reveals that both firms will lower their quality: . The economic reasoning is that firm H reduces its quality because of the standard's detrimental effect on its second-period 8 Note that the profits in the first period, j1 q H1 , q L1 , still follow from inserting t = 1 into the reduced profit functions (5) and (6). 9 Inserting q s L1 and q s H1 into the second derivatives 2 Π s j q H1 , q L1 ∕ q 2 j1 proves that the second order conditions are satisfied. Moreover, Appendix A.1 shows that leapfrogging can be ruled out for both periods if leapfrogging by firm L in t = 1 leads to positive but arbitrarily small costs. 10 Due to ∈ [1, 3) , a sufficient condition for such an interior solution is z ≤ 2a . In contrast, for z > 2a a corner solution with q s L1 = 0 and q s H1 = (a + 2z)∕ (3 + 2 ) cannot be ruled out depending on the magnitude of the discount factor . 1 3 profits, and this decrease in q H1 induces firm L also to reduce its quality since qualities are strategic complements.
Moreover, due to q s H1 q L1 < q o H1 q L1 we observe a decreasing degree of product differentiation: . The economic rationale for this result is the assumption 2 c(q)∕ q 2 > 0 which implies for any q H > q L that improving quality is more costly for firm H than for firm L (see also Ronnen 1991, p.498).
In the next step, from inserting q s L1 and q s H1 into (1) and (2) for t = 1 we obtain the equilibrium prices: Compared to the unregulated case, both prices are decreasing: . From an economic point of view, this result does not come as a surprise since production costs decrease due to q s j1 < q o j1 and price competition intensifies due to . Hence, despite the intensified price competition, in t = 1 the high-quality firm gains from the regulation. The economic reason is obvious: Analogous to the findings of Ronnen (1991, p. 500) and Crampes and Hollander (1995, p.76), firm H ′ s influence on the forthcoming standard enables it to commit to quality and this advantage outweighs its disadvantage caused by the more intense price competition.
We now turn to period t = 2 . The results above imply q s H1 > q o L2 such that the standard q s is binding for firm L. 11 Consequently, we obtain from q s L2 = q s H1 : Next, inserting q s L2 into firm H ′ s reaction function (8) for t = 2 yields: Compared to the unregulated case we obtain q s j2 > q o j2 and q s H2 − q s L2 < q o H2 − q o L2 . Hence, qualities increase and the degree of product differentiation decreases again. Moreover, inserting the qualities q s L2 and q s H2 into (1) and (2) for t = 2 yields the equilibrium prices:

3
The Japanese Economic Review (2022) 73:515-537 At first glance, the standard's impact on prices in t = 2 is ambiguous because qualities and production costs increase but product differentiation decreases. However, as shown in Appendix A.2, for the case of an interior solution the effect of a smaller product differentiation dominates such that prices will increase in total.
Next, inserting q s j2 into (3) The economic explanation of this result is straightforward: From the discussion at the beginning of Sect. 4, we already know that firm L is better off under a minimum quality standard that is not too severe. However, if the discount factor falls short of the threshold ̂ , the adaption of quality by firm H in period t = 1 will be too weak. Therefore, the standard introduced in t = 2 will be too high and the profit of firm L will decrease because its disadvantage from an intensified price competition outweighs its advantage from being able to commit to quality. The analysis presented so far, however, entails a possible caveat that should carefully be recognized 12 : To ensure that q s H1 , q s L1 is indeed a Nash-equilibrium it must hold that none of the two firms has an incentive to change its quality as long as the other firm sticks to q s j1 . Concerning firm L this condition is obviously satisfied since L is not able to influence the forthcoming standard. In contrast, firm H could choose a quality q H1 ≤ q o Lt such that the standard in the second period is non-binding. Compared to q s H1 this would lead to a smaller degree of product differentiation and H1 would decrease whereas H2 would increase up to the unregulated level o H2 . In Appendix 2 we show that such a strategy does not pay for H and our solution q s H1 , q s L1 is indeed a Nash-equilibrium. Before proceeding to summarize our key findings in Proposition 1, we note that for the sake of clarity we display all thresholds regarding the discount factor in our Propositions as numerical figures rounded to three digits. We chose this approach because the formulas describing the exact magnitudes of the thresholds are extremely complex since they result from solving power equations of a higher order.
Proposition 1 Introducing a standard q s = q H1 in t = 2 induces a unique subgame perfect equilibrium with qualities q s jt according to (13), (14), (17) and (18). A comparison with the unregulated case reveals the following effects (see Appendix A.2): Finally, we note that everything else equal, the effects described in Proposition 1 are in period t = 1 the weaker and in period t = 2 the stronger, the lower is the discount factor . The economic intuition behind this result is clear: Lowering the discount factor reduces the weight attached to the outcome in the second period. This, in turn, diminishes the incentive for a pre-emptive lowering of qualities in the first period and strengthens the standard imposed in the second period.

Average regulation
We now turn to the case of average regulation. The main difference to strict regulation relates to the additional strategic incentive stemming from firm L ′ s ability to influence the standard directly. Replacing q in (9) and (10) for t = 2 by q a = q L1 + q H1 ∕2 yields the firms' second-period profits as a function of their decisions on quality in period t = 1: The profits in the first period, j1 q H1 , q L1 , still follow from inserting t = 1 into (5) and (6), and the present value of profits is Π a j q H1 , q L1 ∶= j1 q H1 , q L1 + ⋅ a j2 q H1 , q L1 . From the first-order conditions Π a j q H1 , q L1 ∕ q j1 = 0 we obtain the following reaction functions with Θ ∶= 144a(a − 2z) + z 2 711 − 72 − 2 2 13 : As indicated by (24), the reaction function of firm H is still linearly increasing in q L1 but its slope has changed. Compared to the reaction function under strict regula- Hence, for a wide range of possible discount factors firm H reacts stronger to a change in q L1 to compensate for its partial loss of control over the forthcoming standard.
With q L1 drawn at the vertical axis, the reaction function of firm L as given by (23) is now U-shaped with a minimum at q a H1 q L1 = (a + 2z) 9 + 2 + 27 − 2 q L1 45 + 2 .

3
The Japanese Economic Review (2022) 73:515-537 q H1 = 12(a − z) + z 27 − 2 √ 9 − 2 ∕ 9 + 2 ∕24 . The economic rationale behind this shape is that a decrease in q H1 induces two opposite effects on the optimal decision of firm L . On the one hand, a decrease in q H1 leads to an incentive to reduce q L1 to maintain a certain level of product differentiation. This is the same mechanism as under strict regulation. On the other hand, however, there is now also an incentive to increase q L1 because within a certain range a higher standard intensifies firm L ′ s advantage from being able to commit to quality in t = 2. 14 Hence, for q H1 >q H1 the former incentive dominates over the latter one, whereas for q H1 <q H1 the opposite holds. 15 Solving the above reaction functions (23) and (24) for q j1 leads to the qualities supplied in period t = 1 under average regulation 16 : Inserting q a L1 and q a H1 into q a = q L1 + q H1 ∕2 yields the standard to be introduced in the second period: 3) this standard is binding for firm L . Consequently, we obtain: Moreover, inserting q a L2 into firm H ′ s reaction function (8) for t = 2 yields: The remaining lines of calculation concerning prices, market shares and profits are the same as in Sect. 4.1. We, therefore, confine ourselves to summarize the corresponding results as compactly as possible using the abbreviations ∶= 405 − 2 18 − 2 1∕2 and ∶= 63 − − 2 for simplifying terms. In t = 1 , the resulting prices are: (28) q a H2 = 72a + z 2 + 81 + √ 405 − 18 − 2 2 144 .
14 Note that (9) implies L2 q ∕ q > 0 as long as q lies within the range q o L2 < q < (2a + z)∕4 . 15 Moreover, since L ′ s commitment-advantage becomes effective not before t = 2 we obtain q H1 ∕ > 0 such that the turning point q H1 occurs the earlier, the lower is the discount factor. 16 The assumption z ≤ 2a introduced in Sect. 4.1 is still sufficient to guarantee an interior solution. Inserting q a L1 and q a H1 into 2 Π a j q H1 , q L1 ∕ q 2 j1 proves that the second order conditions are satisfied. Moreover, in Appendix A.1 we show that leapfrogging can be ruled out.
The market share of firm L in t = 1 is s a L1 = 9 + 2 + ∕72, 17 and the accompanying profits are a L1 = z 2 9 + 2 9 + 2 + 2 ∕4478976 and a H1 = z 2 3 9 + 2 ∕4478976 . For t = 2 , we obtain prices of: The market share of firm L in t = 2 is s a L2 = 45 + 2 + ∕108 , and the accompanying profits are a L2 = z 2 45 + 2 + 2 ∕839808 and a H2 = z 2 3 ∕839808 . Finally, in Appendix A.3 we proof analogously to the case of strict regulation that none of the two firms has an incentive to strive or a non-binding standard. Proposition 2 summarizes our key findings from comparing average regulation with the unregulated case: Proposition 2 Introducing a standard q a = q L1 + q H1 ∕2 in t = 2 induces a unique subgame perfect equilibrium with qualities according to (25) .
As expected, a comparison of Propositions 1 and 2 shows that the general impacts of both regulations are quite similar: qualities decrease in the first period and increase in the second period, whereas product differentiation decreases in both periods. The only differences relate to the change in the firms' profits compared to the unregulated equilibrium. In contrast to strict regulation, firm H ′ s profit in t = 1 will now decrease if the discount factor is sufficiently high, and firm L ′ s profit in t = 2 will now always increase. The economic reason is that switching from strict to average regulation advantages firm L and disadvantages firm H which is no longer able to decide on the standard alone.
17 Remind that the market share of firm H is always s Ht = 1 − s Lt since we consider a covered market.
The Japanese Economic Review (2022) 73:515-537 Moreover, due to q a L2 < q s L2 (see Proposition 3 below) the standard under average regulation is always lower compared to its counterpart under strict regulation: q a < q s .

Welfare analysis
We denote welfare in period t by W t q Lt , q Ht ∶= Π t q Lt , q Ht + U t q Lt , q Ht . The first summand indicates profits aggregated over both firms and the second summand indicates net utility aggregated over consumers. Aggregated profits can easily be calculated by adding up the reduced profit functions [5] and [6]: To derive aggregated net utility, we start with the observation that consumers characterized by i <̂t q Lt , q Ht buy variant L , whereas consumers characterized by i >̂t q Lt , q Ht buy variant H. Since i is distributed uniformly on i ∈ [a, a + z] , the position of the average consumer buying variant L is ̃L t q Lt , q Ht = 0.5 a +̂t q Lt , q Ht , and the position of the average consumer buying H is ̃H t q Lt , q Ht = 0.5 a + z +̂t(q Lt , q Ht ) . Consequently, the net utility of the average consumer buying variant j in period t can be calculated as ũ jt q Lt , q Ht = u +̃j t q Lt , q Ht ⋅ q jt − p jt q Lt , q Ht . Finally, weighting ũ jt q Lt , q Ht by market shares s jt q Lt , q Ht and adding up over both variants j = L, H yields the consumers' aggregated net utility: As point of reference, we first calculate the optimal standard, denoted by q * . Inserting q Lt q = q as well as q Ht q = a + 2z + q ∕3 into (33), (34) and adding up both expressions, we obtain welfare in period t directly as a function of the standard, i.e. W t q . Solving W t q ∕ q = 0 for q and verifying the second-order condition yields: Due to q * > q o Lt the standard is binding for firm L . Hence, the resulting qualities are q * Lt = q * and q * Ht = � 40a + z � √ 145 + 45 �� ∕80 . However, before proceeding, it is important to note that applying q * will increase welfare but it will not lead to the optimal combination of qualities that maximizes W t q Lt , q Ht . 18 The economic reason is that the introduction of an MQS only allows the regulator to directly control the decision of firm L . In contrast, firm H continues to follow its reaction function (34) U t q Lt , q Ht ∑ j=L,H s jt q Lt , q Ht u +̃j t q Lt , q Ht ⋅ q jt − p jt q Lt , q Ht .
(8) such that for any quality q Lt enforced by an MQS the corresponding quality q Ht is predetermined. Hence, like strict or average regulation, even an optimized MQS is also only a second best solution.
Comparing q * with q s reveals q s > q * for ∈ (0, 1] . Hence, under strict regulation the standard is always too severe compared to the optimal one. In contrast, for aver- In the following welfare analysis, we compare strict and average regulation not only with each other but also with the (hypothetical) case that the regulator has complete information about the firms' cost functions and the consumers' preferences and introduces the optimal standard q * already in period t = 1 . As a pre-requisite, Proposition 3 summarizes our results from comparing qualities and product differentiation: Based on our previous analysis, most of the results stated in Proposition 3 are not surprising. Concerning the first period, however, we find for a sufficiently small discount factor that the pre-emptive reduction in quality is under average regulation stronger than under strict regulation (i.e., q a j1 < q s j1 ). In this case, the less stringent regulation leads to the stronger adaption in t = 1 . The economic reason for this somewhat unexpected result is the joint-effect caused by the impact of discounting on the firms' reaction functions as discussed above. In particular, lowering the discount factor increases firm L ′ s incentive to lower quality and at the same time firm H ′ s reaction to a decrease in q L1 becomes stronger compared to the case of strict regulation. Both effects reinforce each other in diminishing qualities.
Next, inserting the equilibrium-qualities calculated above for the unregulated case as well as for the different standards into (33) yields the firms' aggregated profits. Comparing the resulting expressions leads to Proposition 4:

, 2 and (see Appendix A.5):
Compared to the unregulated case, firms' aggregated profits are decreasing in both periods and under each of the standards considered. Moreover, a direct comparison between strict and average regulation reveals that the decrease in aggregated profits under average regulation is in the first period stronger than under strict regulation whereas in the second period the opposite holds. The economic explanation are the standards' different impacts on the degree of product differentiation and the intensity of price competition as stated in Proposition 3: In t = 1 , average regulation leads to a lower degree of product differentiation (implying a more intense price competition) than strict regulation, and in t = 2 the reverse is true.
Finally, the ranking between strict or average regulation on the one hand and the optimal standard, on the other hand, depends on the discount factor . In general, the higher the discount factor, the more likely is the optimal standard superior in the first period, and the opposite holds in the second period.
We now turn to the demand side. Inserting the equilibrium-qualities calculated above into (34) yields the consumers' aggregated net utility. By comparing the resulting expressions, we derive Proposition 5: Proposition 5 Denoting net utility aggregated over consumers in period t by U t and comparing the different standards with each other as well as with the unregulated case reveals U o t < min {U s t , U a t , U * t } for t = 1, 2 and (see Appendix A.6): Compared to the unregulated case, consumers 'aggregated net utility is increasing in both periods and under each of the standards considered. The economic reason is obvious: In the first period, under strict or average regulation the positive effects of decreasing prices dominate the negative effects of decreasing qualities, whereas under the optimal standard the positive effects of increasing qualities dominate the negative effects of increasing prices. In the second period, the latter relation between positive and negative effects holds under the optimal standard as well as under strict or average regulation.
Moreover, a direct comparison between strict and average regulation shows that in the first period average regulation is always superior. In contrast, in the second period the relation depends on the discount factor: The higher , the more likely is strict regulation superior compared to average regulation. Likewise, the ranking between strict or average regulation on the one hand and the optimal standard, on the other hand, depends on the discount factor: In the first period, increasing diminishes the relative attractiveness of the optimal standard, whereas in the second period there is no uniform pattern. Next, we consider welfare in period t. Inserting the equilibrium-qualities calculated above into W t q Lt , q Ht and comparing the expressions obtained leads to Proposition 6: Hence, compared to the unregulated situation, average regulation is beneficial in both periods since the consumers' increase in net utility dominates the firms' losses in profits. In contrast, the increase in welfare caused by strict regulation is always smaller compared to average regulation and can even become negative in the second period. The latter occurs if the discount factor falls short of the threshold ≈ 0.901 . In this case, the reduction of quality by firm H in period t = 1 is not strong enough such that the standard introduced in period t = 2 will be too severe. The economic rationale behind this result is in line with Crampes and Hollander (1995, p. 77) who conclude that an MQS "sufficiently close to the quality chosen by the low-quality producer in the unregulated equilibrium" can improve welfare.
Moreover, a comparison with the welfare effects of the optimal standard q * leads for t = 1 to the (possibly surprising) result that average regulation is superior to applying q * if the discount factor exceeds the threshold ≈ 0.520 . In this case, the reduction in qualities (accompanied by lower prices) induced under average regulation is more beneficial than the increase in qualities (accompanied by higher prices) enforced by the optimal standard. The economic background of this result is the above-mentioned observation that average regulation as well as applying the optimal standard are both only second-best solutions. Hence, there is no a priori reason to suppose that one of these solutions is always superior to the other. In the last step, we consider the standards, impact on the present value of total welfare denoted by W(q Lt, q Ht ) ∶= W 1 (q L1, q H1 ) + ⋅ W 2 (q L2, q H2 ). 20 Inserting the The reason for these exceptions is as follows: With drawn at the horizontal axis, the graphs of the differences ΔW s 1 ∶= W * 1 − W s 1 and ΔW a 2 ∶= W * 2 − W a 2 are U-shaped with a minimum of ΔW s ≈ 0.806. 20 This calculation implies that the social planner applies the same discount factor as the firms to evaluate the welfare effects of the MQS.

3
The Japanese Economic Review (2022) 73:515-537 equilibrium-qualities calculated above and analyzing the resulting expressions leads to our final proposition: Under average regulation, the gains in welfare compared to the unregulated case are a strictly increasing function of the discount factor .
Hence, despite of the firms' strategic incentive to dampen the forthcoming standard, the introduction of an MQS based on the average quality initially supplied always leads to an increase in the present value of welfare. If the discount factor exceeds the threshold ≈ 0.558 , the gains in welfare under average regulation are even higher than those obtained from applying the optimal MQS. In contrast, an MQS based on the high quality initially supplied leads only to a smaller increase in welfare that becomes even negative if the discount factor falls short of the threshold ≈ 0.747. Moreover, everything else equal, the gains in welfare under average regulation are the higher, the higher is the discount factor applied. Of course, one might suspect that this result is trivial since it might be solely driven by the diminishing effect of discounting on the present value of welfare enjoyed in the second period. To dispel this suspicion we also calculated normalized welfare W (q Lt , q Ht ) ∶= [1∕(1 + )] ⋅ W 1 (q L1 , q H1 ) + [ ∕(1 + )] ⋅ W 2 (q L2 , q H2 ) . This transformation eliminates the direct effect of on the weight attached to the second period and leaves only the strategic effects on the firms' decisions. In Appendix A.8, we show that the difference W (q a Lt , q a Ht ) −W(q o Lt , q o Ht ) , which indicates the gains in welfare compared to the unregulated situation, is strictly increasing in .
Finally, we note that all results derived above rely on the assumption of a covered market. This requires that even the consumer with the lowest valuation of quality (i.e., i = a ) decides to buy the good. Since this consumer chooses variant L, the general condition for market-coverage in period t is u + aq Lt − p Lt > 0 . This implies that the baseline utility u has to exceed the threshold û t ∶= p Lt − aq Lt . Inserting p o Lt and q o Lt yields the threshold for the unregulated case: û o t = 25z 2 − 16a 2 ∕64 . This threshold is the lower, the higher is the minimum valuation of quality given by the parameter a . Hence, assuming a covered market in the unregulated case is justified if either the baseline utility or the minimum valuation of quality (or a suitable combination of both) is sufficiently high such that u >û o t . Concerning market-coverage under quality regulation, we concentrate on the more beneficial case of average regulation. Calculating along the same lines as above we arrive at the respective thresholds û a 1 and û a 2 . As shown in Appendix A.9, both thresholds are strictly increasing in . Hence, to derive a sufficient condition for market coverage, it suffices to examine û a t for the case of = 3 . In doing so, we obtain û a 1 =û o t for t = 1 . Consequently, if the market is covered without regulation it will also be covered under average regulation in t = 1. 21 For t = 2 , however, we obtain û a 2 = 19z 2 − 12a 2 ∕48 >û o t . I.e., even if the market is covered in the first period, it might switch to an uncovered one in the second period.
6 The case of more than two firms Our analysis above concentrated on the case of duopoly although this might lead to results that are not robust if the number of firms becomes larger. To our best knowledge, Scarpa (1998) and Pezzino (2010) are the only two studies yielding a comprehensive analysis of this issue. 22 Both authors concentrate on a constellation with three firms. In contrast to our approach, they assume that (1) the MQS is exogenously given such that it does not depend on the firms' decisions, (2) the market is always uncovered and (3) there are only quality-dependent fixed cost. Contrary to the common findings for the case of duopoly, both authors show that introducing a binding MQS always reduces welfare.
Since our approach partly differs from Scarpa (1998) und Pezzino (2010, it is a priori unclear whether their result can be transferred to an MQS based on benchmarking as studied in the present paper. We, therefore, analyzed an extended numerical version of our model assuming that there is a third firm M that offers a quality between q Lt and q Ht . In line with the studies cited above, our results suggest that strict as well as average regulation will reduce welfare if there are three firms. 23 The main economic reason is that with more than two firms the differences between the optimal and the unregulated qualities become almost negligible (see also Schmidt 2009). Hence, introducing an MQS in such a situation implies harmful overregulation even if the standard is based on benchmarking.

Conclusions
We analyzed a two-period model of an MQS based on benchmarking in a differentiated duopoly. Although we are aware that our model is by far too simple to derive definite policy conclusions, we identified at least two interesting implications: 21 The economic reason is obvious since = 3 implies a discount factor of = 0 such that the firms' decisions on quality in t = 1 completely ignore the effects on the forthcoming standard and there is no change compared to the unregulated case. 22 Scarpa (1998) considers Bertrand competition in the second stage, whereas Pezzino (2010) assumes Cournot competition. 23 The details of these calculations are available in the Online Resource 1. −First, in principle, a standard based on benchmarking can increase welfare but this approach should be handled with care since it entails the risk that the resulting standard is too severe and might even reduce welfare. Therefore, the average quality supplied in the market seems to be a more suitable benchmark than the high quality. −Second, everything else equal, the gains in welfare resulting from a benchmark based on the average quality are the higher, the higher is the discount factor applied by the firms. For a given annual discount rate, however, the discount factor will be the higher the shorter is the considered good's useful lifespan that determines the length of each period in our model. Consequently, our benchmarking approach seems to be particularly beneficial for goods that are not too long-lived.
However, the significance of our results is limited by several restrictive assumptions. Examples are the use of a quadratic cost function and the assumption that the valuation of quality is uniformly distributed across consumers. The most serious limitation of our model presumably stems from the assumption of a covered market. In particular, our analysis has shown that a market, which is covered without regulation, might switch to an uncovered market after the MQS has been introduced in the second period.
Within the framework of our model, the assumption of full market-coverage is justified if either the baseline utility or the minimum valuation of quality (or a suitable combination of both) is sufficiently high. Whether this condition will be satisfied in practice, mainly depends on the specific good under consideration and on the question which particular feature of this good is regulated by the MQS. Suitable examples where the above condition is likely to be satisfied are safety regulations of cars and pharmaceuticals as well as regulations concerning the quality of medical services and devices. Moreover, there exists a wide range of quality-regulated goods and services whose consumption is mandatory due to legal requirements. In this case, the market will automatically be covered irrespective of the magnitude of baseline utility or the minimum valuation of quality. Prominent examples are several kinds of mandatory safety products like cyclists' helmets, smoke detectors or child safety seats for cars. Further examples relate to mandatory insurances for, e.g., health care or vehicle third-party liability.
Of course, in practice there also exist numerous cases of quality-regulated goods, where the assumption of full market-coverage is at least questionable. For instance, the regulated feature of goods sometimes relates to ecological characteristics. Although environmental awareness seems to increase steadily, there are still a lot of consumers who do not care much about the environment. With respect to our model, this implies a rather small minimum valuation of quality. Hence, as far as the baseline utility of the regulated good is also small, marketcoverage is unlikely. In this case, it cannot be ruled out that after introducing an MQS overall welfare will decrease since the net utility of consumers at the lowerquality end of the market who decide not to buy at all will drop to zero. However, the analytical difficulties associated with considering an uncovered market are left to future research. ruled out. With respect to firm L , the same holds in both periods. The only exception is the special case = 1 (i.e., no discounting at all) which leads to s L1 =̂s L1 . However, if a cheap low-quality producer like firm L tries to penetrate the high-quality segment of the market, it will most likely be forced to change marketing strategies or distribution channels. Although the associated costs are not quantifiable, it is obvious that even arbitrary small costs will suffice to prevent leapfrogging.

A.2 Proof of Proposition 1
To show that firm H will not try to achieve a non-binding standard under strict regulation, we denote the quality that maximizes H ′ s profit for given q s L1 and under the restriction that the standard in t = 2 is non-binding by q n H1 . In the first step, we show that q n H1 = q o Lt holds. The reason is that a further reduction of quality would lead to an unnecessarily strong decrease in product differentiation. This, in turn, would reduce H ′ s profit in t = 1 without changing the outcome in t = 2 . In the second step, we show that the presents value of H ′ s profit with the strategy q n H1 , q s L1 is always smaller compared to the strategy q s H1 , q s L1 . To derive the results from comparing strict regulation with the unregulated case as stated in Proposition 1, we first calculate the respective differences for the considered variables (e.g., Δq s L1 ∶= q s L1 − q o L1 for the qualities offered by firm L in t = 1 ). The signs of the resulting differences depend only on the magnitude of . Due to = √ 9 − 8 and ∈ (0, 1] , this magnitude is restricted to the domain ∈ [1, 3) . Hence, the results stated in Proposition 1 can easily be obtained by plotting the graphs of the respective differences for ∈ [1, 3) and by calculating zeros if necessary. In the latter case, to derive the accompanying thresholds in terms of the discount factor the resulting zeros are transformed using = (9 − 2 )∕8.

A.3 Proof of Proposition 2
In case of average regulation, a non-binding standard q a ≤ q o Lt requires (q L1 + q H1 )∕2 ≤ q o Lt . The main difference compared to strict regulation is that now both firms are able to achieve q a ≤ q o Lt with their decision on quality in period t = 1 . The proof that such a strategy does not pay for firm H proceeds analogously to the corresponding proof for strict regulation outlined in Appendix A.2. With respect to firm L , however, an additional complication occurs: The argument used in Appendix A.2, that due to the impact on product differentiation firm H will lower its quality only so far that the standard just does not bind, cannot directly be transferred to firm L . The reason is that for any given q H1 a reduction in q L1 would actually increase product differentiation. Nevertheless, the complete Appendix provided in Online Resource 2 shows that also firm L will not lower its quality any further than necessary to guarantee q a ≤ q o Lt . After this step, it is easy to show that the present value of L ′ s profit in case of a non-binding standard is always smaller than in the equilibrium calculated in Sect. 4.2.
To derive the results from comparing average regulation with the unregulated case as stated in Proposition 2, we analyze the corresponding differences between variables following follow the same steps as already outlined in Appendix A.2.

A.4-A.8 Proof of Propositions 3-7
To derive the results stated in Propositions 3-7, we analyze the corresponding differences between variables following the same steps as already outlined in Appendix A.2.

A.9 Market coverage under average regulation
To prove that the thresholds û a 1 and û a 1 are strictly increasing in we calculate the expressions û a t ∶= p Lt (q a Lt , q a Ht ) − aq Lt (q a Lt , q a Ht ) for t = 1, 2 and show that the sign of the first derivatives with respect to satisfy the condition û a t ∕ > 0 for ∈ [1, 3).