1 Introduction

Casual evidence suggests important differences in the quality of journalism from one country to another. The traditional press is one example to look at. Among other possible differences, a striking one is the coexistence of countries where high-quality newspapers have large readerships with others where high-circulation newspapers have surprisingly low editorial standards. The cases of the USA and Spain, where quantity aspects (newspaper circulation) and quality aspects (accuracy of information and trustworthiness) go hand in hand, are thus in sharp contrast to the case of the UK, where the newspapers that lead the market in terms of circulation, The Sun and The Daily Mail, are far down the ranking in terms of trust and accuracy of news. That ranking is led by The Times and The Guardian.Footnote 1

Differences in quantity and quality go beyond country-specific issues and can be found when different media sectors from the same region are compared. An example is the proliferation of fake news, a phenomenon that affects many spheres and sectors of the media industry but is especially severe and important on social media and online information platforms. How can these differences be explained? What determines which media outlets are more successful in a society?

This paper proposes a model of competition for audience that serves to draw predictions as to the fundamental aspects that determine what type of media outlets lead a market. In our opinion, this question requires a dynamic approach, because the accuracy of a story printed today may well affect an outlet’s future audience. Our objective is to determine what ingredients can play a role in explaining the differences observed in the real world. We acknowledge that the problem is complex and that many variables may be involved. For example, issues such as press penetration in different social classes, ideological considerations, and prices, which are not considered in this paper, may have major effects. In this sense, the present paper should be seen as an attempt to identify some initial general properties of the problem.

Our contribution is to pin down the importance of two variables in explaining the dynamics: On the one hand, the consumers’ discomfort with fake news, i.e., how harsh consumers are with the publication of false stories, and on the other hand the nature of competition between media outlets, i.e., whether they are homogeneous in their editorial standards for quality and thus compete for the same stories, or differ and offer different products.

In this paper, we use two approaches to characterize the dynamics of the market. On the one hand, we use a mean-field approach to find analytical solutions for the long-term average state of a duopoly media market. On the other hand, we numerically follow the stochastic dynamics of a market with several media outlets.Footnote 2 These approaches are not standard in the literature of media economics. In this respect, our simple model could serve as a guideline for future analyses of other, more complex economic systems.

The model is as follows. We consider a media industry with N non-strategic media outlets and a finite but large enough time horizon. At each step in time, referred to as a day, one scoop is released. We assume that each scoop can be published by only one media outlet. Media outlets compete for scoops according to a rule that determines the probability of an outlet receiving a scoop at a given step in time. The rule establishes that this probability is an increasing function of the outlet’s audience on that particular day.Footnote 3 Scoops contain information on a relevant variable and the information contained can be ex-ante more or less accurate. We refer to the ex-ante accuracy of the information as the quality of the scoop. We assume that the quality is i.i.d. across periods and that each scoop is either true or false with a probability proportional to its quality. Media outlets are characterized by their own editorial standards for quality, which establish the minimum quality that each outlet requires from a scoop in order to publish it. We consider that the editorial standards of outlets are exogenously given and invariant throughout the dynamics. We further consider that an outlet’s editorial standard determines the behavior of that outlet in that it prescribes that the outlet will publish the scoop when it is of sufficiently high quality and will reject it otherwise.Footnote 4 A scoop that is published is referred to as a story. After publication, consumers learn whether the story is true or false. If no story is published, consumers learn nothing. The behavior of an outlet at a time step, namely whether it publishes a story or not and whether the story is true or false, affects that outlet’s future audience and hence its probability of receiving the next day’s scoop.

We analyze the dynamics of the audiences of the outlets and the frequency of scoop publication (referred to as their share of the news). Our objective is to understand the effect on the dynamics of two issues: The valuation of news by consumers and the market dispersion. To that end, we take two complementary approaches: first we propose a mean-field analysis that substitutes the stochastic dynamics by a deterministic one.Footnote 5 In particular, at each time step we substitute the random assignation of a scoop to an outlet by its expected value. This simplification enables us to obtain analytical results for the case of two media outlets. Second, we perform numerical simulations on the stochastic dynamics and obtain predictions for the general case with more than two outlets.Footnote 6

Our analytical results for the mean-field approach with two outlets show that there are two stationary states: one in which the two outlets coexist, i.e., they both receive positive audiences and shares of the news, and one in which only the outlet with the lower editorial standard is active in the market, i.e., the high-standard outlet ends up with zero audience and zero share of the news. We also study the stability of these two solutions and find that the stationary state in which the outlets coexist is the only stable one. The results that follow therefore focus on this last case. We then study what market features determine which outlet receives a higher audience and/or a higher share of the news. We show that whether it is the low-standard outlet or the high-standard outlet depends on a combination of the two issues described above. In particular, we obtain that when the punishment for publishing false stories is not very high the low-standard outlet always has more audience. In this case, that outlet also has the higher share of the news. In this sense, our results predict that societies that are especially generous with outlets that break news and do not penalize them when their stories are shown to be false will have media industries dominated by outlets with low editorial standards. However, when the cost of publishing a false story is sufficiently high and the outlets are similar enough in their editorial standards, the high-standard outlet may lead the market (in terms of both audience and share of the news). It requires the media outlets to be sufficiently homogeneous. Otherwise, the high-standard outlet may be the leader in audience but the low-standard one will always have a higher share of the news.

Regarding the stochastic dynamics, the results of the numerical simulations show that most of the insights gained in the mean-field approach with two outlets are robust to the consideration of more outlets. We also use numerical simulations to obtain predictions that cannot be inferred from the mean-field approach, such as predictions as to how many outlets will survive in the industry in the long run and their characteristics. We find that the harsher consumers are with the publication of false stories and/or the more similar the editorial standards of the media outlets are, the more outlets will survive in the long run. By contrast, when consumers like consuming news and do not care whether printed stories are true or false and/or when outlets are very different in their vetting processes for stories, the results suggest that only a small number of outlets endure in the long run.

This work is related to the literature on the economics of the media, an extensive area of literature that has grown rapidly in the last twenty years. The first papers in this literature were mostly interested in analyzing the effects of the media on economic policies and outcomes. See Besley and Burgess (2001), Djankov et al. (2003), Strömberg (2004b). A second generation of papers studied whether news provision was biased (Groseclose and Milyo 2005; Egorov et al. 2009 and Larcinese et al. 2011) and what the determinants of the biases were (Strömberg 2004a; Mullainathan and Shleifer 2005; Gentzkow and Shapiro 2006; Baron 2006; Petrova 2008; Ellman and Germano 2009). At the same time, a different group of researchers focused their attention on the study of the media as a two-sided market. See Rochet and Tirole (2003), Anderson (2006), Doyle (2013). Currently, research on the economics of the media is extremely varied, but there is a tendency toward analyzing online news markets (Yang and Chyi 2011; Halberstam and Knight 2016; Germano and Sobbrio 2017) and their effects on political outcomes (Allcott and Gentzkow 2017; Boxell et al. 2017; Campante et al. 2018), collective action (Acemoglu et al. 2017; Little 2016; Enikolopov et al. 2016, 2017) and more generally, social outcomes (Gentzkow and Shapiro 2011; Quattrociocchhi et al. 2014; Bakshy et al. 2015).

Our contribution to this literature is to present a new approach to the study of competition and audience in the media industry. Previous research shows that competition can reduce the quality of news (Zaller 1999; Cagé 2014), can induce outlets to ideologically differentiate from competitors (Gentzkow et al. 2014) and can lead them to bias the news in an attempt to product-differentiate (Anand et al. 2007). It has also been shown that competition can reduce the possibility of media capture by the government (Besley and Prat 2006), increase media self-censorship on issues sensitive to advertisers (Germano and Meier 2013), and increase giving and volunteering (Adena 2016). In contrast to previous research, our interest lies not in the effects of competition but in its determinants, from a long-run perspective. To the best of our knowledge, this is new in the literature.

The rest of the paper is organized as follows. In Sect. 2, we propose a stochastic dynamic model of competition and audience in the media industry consisting of N recursive equations that account for changes in audience in a market with N outlets. In Sect. 3, we substitute the stochastic dynamics by a deterministic dynamics and analyze the resulting mean-field model for the case of two media outlets. This simplification enables us to obtain analytical solutions for the audiences of outlets and study the stability of the stationary states. In Sect. 4, we perform numerical simulations to study the stochastic dynamics with two, five, and ten media outlets. Finally, Sect. 5 concludes. The paper also contains Appendices, which are organized as follows. In Appendix A, we propose and analyze a one-shot model with rational agents that helps us show the rationale behind the behavioral assumptions that we make in Sect. 2. In Appendix B, we present results which are complementary to the stochastic dynamics analyzed in Sect. 4. Finally, Appendix C contains the proofs.

2 The model

We consider a market for news with N media outlets and a mass of consumers. All the agents in the model, i.e., media outlets and consumers, are non-strategic and take decisions according to behavioral rules. Appendix A presents a model with rational agents that micro-founds the behavioral assumptions made in this section.

Each \(t\in T=\{1,2,\ldots \}\) represents a time step which, without loss of generality, we identify with a day. For the sake of simplicity, two states of the world are assumed. At each time t, the state of the world is \(\omega (t)\in \Omega \), with \(\Omega \) representing the set of states. We assume that \(\omega \) is distributed according to a known distribution function. At each time t, there is a binary signal s(t) on the state of the world. We refer to the signal s(t) as the scoop at time t. Let \(\mu \) be the probability that a scoop is true, i.e., the quality of the scoop. We consider \(\mu \) to be a random variable distributed according to a uniform distribution function in [0, 1]. The realization of the random variable at t is denoted by \(\mu (t)\), where \(\mu (t)=P(s(t)=j|\omega (t)=j)\).

Each media outlet \(i\in \{1,2,\ldots ,N\}\) is characterized by a threshold \(\mu _{i}\in [0,1]\), which sets a lower bound on the quality of a scoop required by outlet i in order to publish it. We refer to \(\mu _{i}\) as the editorial standard of outlet i. Without loss of generality, we order media outlets according to their editorial standards, so that \(0\le \mu _{1}<\mu _{2}<\ldots <\mu _{N}\le 1\) and assume that an outlet’s editorial standard is invariant over time. This is a reasonable assumption if it is considered that an outlet’s editorial standard and editorial line constitute a kind of moral or ethical charter that a media outlet abides by once they have been adopted.

Conditionally on an outlet having received a scoop, threshold \(\mu _{i}\) determines whether outlet i publishes it or not. More precisely, we consider that if \(\mu (t)>\mu _{i}\) outlet i publishes the scoop and suppresses it otherwise. The optimality of this decision rule is analyzed in Appendix A. We denote by \(a_{i}(t)\in \{0,1\}\) the action of outlet i at time t, where \(a_{i}(t)=1\) when the outlet reports the scoop and \(a_{i}(t)=0\) otherwise. A scoop that is printed/published is referred to as a story.

We assume that at every t, if a story is published consumers immediately learn the value of the random variable \(\omega (t)\) with probability 1, whereas if no story is published there is no learning.Footnote 7 Thus, after a report, consumers always receive information that is valuable for assessing the accuracy of a story, which enables them to reward/punish a media outlet according to its behavior and the consumers’ valuation of the news. In this respect, we consider that consumers reward the publication of a story and punish the publication of a false story. The model in Appendix A proposes a rationale for this behavior. For the sake of simplicity, let \(\alpha \in (0,1)\) be the punishment imposed by the consumers on an outlet that publishes a false story and let \(1-\alpha \) be the reward for an outlet that obtains a story and publishes it. Thus, \(\alpha \) measures the relative weight of the punishment. Hence, the higher \(\alpha \) is, the greater the discomfort of consumers with fake news is, then the harsher the punishment by consumers for the publication of false stories (in relative terms).Footnote 8

The reward/punishment may be expected to have a positive/negative impact on an outlet’s reputation, market share, and/or audience. Although the intensity of the impact may differ from one of these points to another, it is reasonable to consider that they will all move in the same direction. For the sake of specificity, we focus our attention here on the evolution of one of these values: The audiences of outlets. Let \(A_{i}(t)\) denote the audience of media outlet i at date t, and let \({\widetilde{A}}_{i}(t)=\frac{A_{i}(t)}{\sum _{j=1}^{N}A_{j}(t)}\in [0,1]\), with \(\sum _{i=1}^{N}{\widetilde{A}}_{i}(t)=1\), be outlet i’s normalized audience. With a focus on normalized audiences, note that \({\widetilde{A}}_{i}(t+1)\) should belong to the interval \([{\widetilde{A}}_{i}(t),1]\) after outlet i publishes a story. This is so when \({\widetilde{A}}_{i}(t+1)={\widetilde{A}}_{i}(t)+(1-\alpha )(1-{\widetilde{A}}_{i}(t))\). This illustrates the reward that an outlet receives for publishing a story. However, if the story is false, consumers impose a punishment on the outlet that reduces its audience by \(\alpha {\widetilde{A}}_{i}(t)\). Putting the reward and the punishment together, we have the following recursive equation:

$$\begin{aligned} A_{i}(t+1)={\widetilde{A}}_{i}(t)+\varPhi \left[ (1-\alpha )(1-{\widetilde{A}} _{i}(t))I_{a}-\alpha {\widetilde{A}}_{i}(t)I_{s\ne \omega }\right] , \end{aligned}$$
(1)

where \(\varPhi >0\) is a parameter that modulates the change in an outlet’s audience due to the publication of a single scoop, and \(I_{a}\) and \(I_{s\ne \omega }\) are the following indicator functions:Footnote 9

$$\begin{aligned} I_{a}=\left\{ \begin{array}{l} 1 \text { if }a_{i}(t)=1,\\ 0 \text { otherwise}, \end{array} \right. \text { and } I_{s\ne \omega }=\left\{ \begin{array} [l]{l} 1 \text { if }a_{i}(t)=1 \text { and } s(t)\ne \omega (t),\\ 0 \text { otherwise}. \end{array} \right. \end{aligned}$$

Finally, the dynamic equation can be written either in terms of the audience [Eq. (2)] or the normalized audience [Eq. (3)], which are equivalent. For simplicity, in the analysis we use Eq. (2).

$$\begin{aligned} A_{i}(t+1)= & {} \frac{A_{i}(t)}{\sum _{j=1}^{N}A_{j}(t)}\nonumber \\&+\varPhi \left[ (1-\alpha )\left( 1-\frac{A_{i}(t)}{\sum _{j=1}^{N}A_{j}(t)}\right) I_{a}-\alpha \frac{A_{i}(t)}{\sum _{j=1}^{N}A_{j}(t)}I_{s\ne \omega }\right] , \end{aligned}$$
(2)
$$\begin{aligned} {\widetilde{A}}_{i}(t+1)= & {} \frac{{\widetilde{A}}_{i}(t)+\varPhi \left[ (1-\alpha )(1-{\widetilde{A}}_{i}(t))I_{a}-\alpha {\widetilde{A}}_{i}(t)I_{s\ne \omega }\right] }{\sum _{j=1}^{N}\left( {\widetilde{A}}_{j}(t)+\varPhi \left[ (1-\alpha )(1-{\widetilde{A}}_{j}(t))I_{a}-\alpha {\widetilde{A}}_{j}(t)I_{s\ne \omega }\right] \right) }. \end{aligned}$$
(3)

Note that the rule above considers that variations in an outlet’s audience are proportional to the outlet’s previous normalized audience \({\widetilde{A}}(t)\). In particular, it implies that the higher an outlet’s audience at t, the smaller the gain from publishing a scoop and the higher the cost of being shown to have printed a false story. On the other hand, the smaller an outlet’s audience, the greater the gain from breaking news and the smaller the cost of publishing a false story. We consider this property to give an accurate picture of the functioning of the real-world media industry.

We assume that an outlet’s audience at t affects the probability of it receiving a scoop at that time step. The idea is to model a situation in which outlets with higher audiences are more able to capture scoops than smaller ones. This is a reasonable assumption if we consider that outlets with larger market shares have larger newsrooms and are thus more likely to uncover events. Let \(p_{i}(\mu (t))\) denote the probability of outlet i receiving a scoop of quality \(\mu (t)\) at time t. Remember that there is one scoop per day and that only one media outlet can publish it. We assume that probability \(p_{i}(\mu (t))\) is given by:

$$\begin{aligned} p_{i}(\mu (t))=\frac{\theta _{i}(t)A_{i}(t)}{\sum _{k=1}^{N} \theta _{k}(t) A_{k}(t)} , \end{aligned}$$
(4)

where

$$\begin{aligned} \theta _{i}(t)= \left\{ \begin{array} [c]{ll} 1 &{} ~~\mathrm {if}\, \mu _{i} \le \mu (t)\\ 0 &{} ~~\mathrm {if}\, \mu _{i} > \mu (t). \end{array} \right. \end{aligned}$$
(5)

Note that the process above determines that given a scoop of quality \(\mu (t)\), only media outlets with editorial standards lower than \(\mu (t)\) are active recipients of the scoop, i.e., have a positive probability of receiving it. Note also that because the publication strategy of any outlet, say i, is to publish a scoop whenever \(\mu (t)>\mu _{i}\), the specification above determines that any media outlet that receives a scoop publishes it. This simple modeling approach should be seen as a reduced form of a more general mechanism consisting of the following: (i) At all t, any outlet has a positive probability of receiving the scoop; (ii) the probability of an outlet receiving a scoop is the same as its normalized audience; (iii) on receiving a scoop, the outlet decides whether to publish it or not; and (iv) if the outlet chooses to publish it, the mechanism at t stops; however, if the outlet rejects it, the process of assigning the scoop starts again, with the outlet that rejected the scoop being taken out of the pool and the rest of the outlets now having probabilities of receiving the scoop that are equal to each outlet’s new normalized audience.Footnote 10 Note that the outcome of this more general mechanism is equivalent to that under the simpler form, where scoops are offered directly to those outlets that are willing to publish them. For the sake of simplicity, we stick to the proposed formulation.

Finally, note also that the above process determines a division of the scoop’s quality space [0, 1] into different intervals, according to which competition for a scoop is softer or tougher. In particular, ceteris paribus N, the lower the quality \(\mu (t)\), the softer the competition, as only outlets with low editorial standards are willing to publish it. When \(\mu (t)< \mu _{1}\), no outlet wants to publish the scoop. On the other hand, the higher \(\mu (t)\), the tougher the competition, as more outlets are willing to publish it. Note also that the degree of competition for scoops depends on how similar or different the editorial standards of the media outlets are. In particular, the more similar the editorial standards are, the tougher the competition is. In contrast, the more different the media outlets are, the softer the competition in the market for scoops is.

The model described considers a stochastic sequence of scoops and N media outlets that repeatedly interact over time, with interactions determining the dynamics of the outlets’ audiences. This process defines a system of N stochastic recursive equations, which is analyzed in Sect. 4. Prior to this, in Sect. 3 we study a mean-field approach to the problem that allows us to study the average dynamics, characterize the stationary states, study the stability and perform a comparative static exercise for the case of an industry with two media outlets.

3 The mean-field dynamics

This section considers a mean-field approach to the problem. It makes the following simplification: We substitute the stochastic dynamics described in Eq. (2) by a deterministic dynamics. In particular, for every t and outlet i, we substitute the random variables \(I_{a}\) and \(I_{s\ne w}\) by their expected values \(F_{i,\mathrm{news}}\) and \(F_{i,\mathrm{false}}\):

$$\begin{aligned} A_{i}(t+1)= & {} \frac{A_{i}(t)}{\sum _{j=1}^{N}A_{j}(t)}+\left[ (1-\alpha )\left( 1-\frac{A_{i}(t)}{\sum _{j=1}^{N}A_{j}(t)}\right) F_{i,\mathrm{news}}(t)\nonumber \right. \\&\left. -\alpha \frac{A_{i}(t)}{\sum _{j=1}^{N}A_{j}(t)}F_{i,\mathrm{false}}(t)\right] . \end{aligned}$$
(6)

Note that the dynamics above makes \(\varPhi =1\). This is without loss of generality.Footnote 11

Since the quality of a scoop \(\mu (t)\) is assumed to be a random variable uniformly distributed in [0,1], the expected value of the random variable \(I_{a}\) is:

$$\begin{aligned} F_{i,\mathrm{news}}(t)= & {} \int _{\mu _{i}}^{\mu _{i+1}}p_{i,i}(t)\mathrm{d}\mu \\&+\int _{\mu _{i+1}} ^{\mu _{i+2}}p_{i,i+1}(t)\mathrm{d}\mu +\cdots +\int _{\mu _{N-1}}^{\mu _{N}}p_{i,N-1} (t)\mathrm{d}\mu +\int _{\mu _{N}}^{1}p_{i,N}(t)\mathrm{d}\mu , \end{aligned}$$

where \(p_{i,k}(t)=\frac{A_{i}(t)}{\sum _{l=1}^{k}A_{l}(t)}\) is the probability of outlet i receiving a scoop of quality \(\mu _{k}\le \mu \le \mu _{k+1}\) at time t. Note that \(F_{i,\mathrm{news}}\) represents the fraction of news published by i in stationary conditions (out of the total number of news items received by the outlets). Note also that \(\sum _{i=1}^{N}F_{i,\mathrm{news}}(t)=1-\mu _{1}\), since a fraction \(\mu _{1}\) of stories does not meet the minimum quality for publication by any outlet.

Let \(\mu _{N+1}=1\). This fraction can thus be rewritten as:Footnote 12

$$\begin{aligned} F_{i,\mathrm{news}}(t)=\sum _{k=i}^{N}\int _{\mu _{k}}^{\mu _{k+1}} p_{i,k}(t)\mathrm{d}\mu . \end{aligned}$$
(7)

Additionally, since the probability of a scoop being true is proportional to its quality \(\mu \), the expected value \(F_{i,\mathrm{true}}(t)\) of outlet i publishing a true story at t is:

$$\begin{aligned} F_{i,\mathrm{true}}(t)=\sum _{k=i}^{N}\int _{\mu _{k}}^{\mu _{k+1}}\mu p_{i,k}(t) \mathrm{d}\mu . \end{aligned}$$
(8)

Therefore,

$$\begin{aligned} F_{i,\mathrm{false}}(t)=F_{i,\mathrm{news}}(t)-F_{i,\mathrm{true}}(t) \end{aligned}$$
(9)

is the expected value of the random variable \(I_{s\ne w}\) and represents the fraction of the stories published by i at t that are false.

3.1 Two outlets

Next we restrict the analysis to the case of two media outlets. This simplification allows us to obtain analytical results and identify some general properties of the system that we subsequently test with numerical simulations in the stochastic dynamics.

Let 1 and 2 denote the two media outlets, and consider \(\mu _{1}<\mu _{2}\). In the following, we refer to outlet 1 as the low-standard outlet and to outlet 2 as the high-standard one. We start by analyzing the average participation of an outlet on the distribution of scoops and how it relates to the audiences and editorial standards of the two outlets.Footnote 13 Let \({\widetilde{A}}_{1}=\frac{A_{1}}{A_{1}+A_{2}}\) denote the normalized audience of the low-standard outlet, where \(A_{1}\) and \(A_{2}\) are any arbitrary values. From (7) we have:

$$\begin{aligned} F_{1,\mathrm{news}}&=(\mu _{2}-\mu _{1})+{\widetilde{A}}_{1}(1-\mu _{2}),\\ F_{2,\mathrm{news}}&=(1-{\widetilde{A}}_{1})(1-\mu _{2}). \end{aligned}$$

Figure 1 represents expressions \(F_{1,\mathrm{news}}\) and \(F_{2,\mathrm{news}}\) as a function of \({\widetilde{A}}_{1}\). Note that \(F_{1,\mathrm{news}}\) is increasing in \({\widetilde{A}}_{1}\) and \(F_{2,\mathrm{news}}\) is decreasing in \({\widetilde{A}}_{1}\). Additionally, note that if \(1-\mu _{2}>\mu _{2}-\mu _{1}\) the two functions cross at \(A_{F}=\frac{1}{2}-\frac{\mu _{2}-\mu _{1}}{2(1-\mu _{2})}\), with \(A_{F}\in (0,\frac{1}{2})\). Otherwise, \(F_{1,\mathrm{news}}\) is always above \(F_{2,\mathrm{news}}\), i.e., \(A_{F}\le 0\). Because the latter requires \(\mu _{2}-\mu _{1}>1-\mu _{2}\), or equivalently \(\mu _{2}>\frac{1+\mu _{1}}{2}\), we observe that if \(\mu _{1}\) and \(\mu _{2}\) are different enough the low-standard outlet will always receive a higher share of the news. By contrast, if \(\mu _{2}<\frac{1+\mu _{1}}{2}\), which requires \(\mu _{2}\) and \(\mu _{1}\) not to be very different, there are values of \({\widetilde{A}}_{1}\) for which the high-standard outlet receives a higher share of the news. This is the case when \(0<{\widetilde{A}}_{1}<A_{F}\).

Fig. 1
figure 1

We represent the share of the news of outlets 1 and 2. The red line corresponds to the fraction of news published by the low-standard outlet (with \(\mu _{1}=0.2\)) and the blue line to the fraction of news published by the high-standard outlet (with \(\mu _{2}=0.4\)). The vertical lines at \(A_{F}\) (value of \({\widetilde{A}}_{1}\) such that \(F_{1,\mathrm{news}}=F_{2,\mathrm{news}}\)) and at 1 / 2 (value of \({\widetilde{A}}_{1}\) such that \(A_{1}=A_{2}\)) delimit Regions I, II, and III described in the text

Figure 1 also shows three regions delimited by the values of \({\widetilde{A}}_{1}\) for which \(F_{1,\mathrm{news}}=F_{2,\mathrm{news}}\) on the one hand, and \(A_{1}=A_{2}\) on the other hand. Note that by definition the latter occurs when \({\widetilde{A}}_{1}=1/2\). Accordingly, Region I corresponds to values \(\frac{1}{2}<{\widetilde{A}}_{1}\le 1\), for which \(A_{1}>A_{2}\) and \(F_{1,\mathrm{news}}>F_{2,\mathrm{news}}\). That is, in that region the low-standard outlet leads the market in terms of both audience and news share. Region II corresponds to values \(A_{F}<{\widetilde{A}}_{1}<\frac{1}{2}\), for which \(A_{2}>A_{1}\) and \(F_{1,\mathrm{news}}>F_{2,\mathrm{news}}\). That is, in that region the low-standard outlet receives a higher share of the news but the high-standard outlet has a higher audience. Finally, Region III corresponds to values \(0\le {\widetilde{A}}_{1}<A_{F}\), for which \(A_{2}>A_{1}\) and \(F_{2,\mathrm{news}}>F_{1,\mathrm{news}}\). Note that in contrast to previous regions, this region does not always exist, as \(A_{F}\) may be smaller than zero. When it does exist, it represents a situation in which the high-standard outlet leads the market in terms of both audience and news share.

The analysis above determines the general conditions under which outlet i publishes more stories than outlet j, with \(i,j\in \{1,2\}\), \(i\ne j\). It also determines the way in which the fraction of news published by an outlet relates to that outlet’s audience. This analysis reveals that in general it is the low-standard outlet that publishes more scoops. Interestingly, this situation can occur for any pair of editorial standards, i.e., independently of whether media outlets are similar in their vetting processes for stories or not. By contrast, for the high-standard outlet to receive and publish a higher share of the news the market needs to be homogenous enough.

Next, we move into the analysis of the equilibrium of the mean-field dynamics. The stationary condition of the dynamics is \(\Delta A_{i}(t)=A_{i}(t+1)-A_{i}(t)=0\) in Eq. (6). In the case of a media industry with two outlets, this defines a system of two equations. The pairs \((A_{1}^{*},A_{2}^{*})\) that satisfy the system are the equilibrium values. Next proposition presents the result.

Proposition 1

The system has two stationary states.

  1. 1.

    In the first one, \(A_{2}^{*}=0\). This stationary state is unstable.

  2. 2.

    In the second one, \(0<A_{1}^{*}<1\) and \(\ 0<A_{2}^{*}<1\). This stationary state is asymptotically stable.

Proposition 1 states that the system has two stationary states and that only the second one is stable. To see the intuition for this, note that if an outlet reaches zero audience at a time t, it will have no ability to attract further scoops (unless it is the low-standard outlet, in which case it receives and publishes all the scoops whose quality lies within the interval \((\mu _{1},\mu _{2})\)) so it will remain with zero audience forever. Accordingly, there is a stationary state in which \(A_{2}^{*}=0\). However, if we perturb the dynamics and assign a scoop to this outlet, its audience will eventually increase. Accordingly, we say that this stationary state is unstable.

Figure 2 presents the phase diagram of the system (left panel) and the vector field (right panel) for a particular case of the parameter values, in particular \(\alpha =0.5\), \(\mu _{1}=0.4\) and \(\mu _{2}=0.5\). The left panel shows the pairs \((A_{1},A_{2})\) such that \(\Delta A_{1}(t)=0\) (in red) and \(\Delta A_{2}(t)=0\) (in blue). As observed, there are two stationary states: In the first one \(A_{2}^{*}=0\). In the second one \(0<A_{1}^{*}<1\) and \(\ 0<A_{2}^{*}<1\). See Appendix C for the explicit expressions of \({A}_{1}^{*}\) and \({A}_{2}^{*}\).Footnote 14 The right panel presents a graphic analysis of the stability of the system. As observed, only the stationary state with \(0<A_{1}^{*}<1\) and \(\ 0<A_{2}^{*}<1\) is stable.

The rest of the analysis focuses on the stable stationary state. We next seek to establish the conditions on \(\alpha \), \(\mu _{1}\) and \(\mu _{2}\) for which the stable stationary values \(({\widetilde{A}}_{1}^{*},{\widetilde{A}}_{2}^{*})\) belong to each of the regions described, i.e., Regions I, II, and III. This is done in Propositions 2 and 3 and the discussion that follows.

Fig. 2
figure 2

The left panel represents the phase diagram of the system. The red line corresponds to the pairs \((A_{1},A_{2})\) such that \(\Delta A_{1}(t)=0\). The blue lines to the pairs \((A_{1},A_{2})\) such that \(\Delta A_{2}(t)=0\). When \(\Delta A_{2}(t)=0\) for all \(A_{1}\), in the stationary state \(A_{2}^{*}=0\). When \(\Delta A_{2}(t)=0\) defines a decreasing function in \(A_{1}\), in the stationary state \(0<A_{1}^{*}<1\) and \(0<A_{2}^{*}<1\). In this case, the intersection of the described line with the red line defines four regions: A, B, C and D. The arrows in each region indicate the direction of a change in the outlets’ audiences. The right panel represents the vector field. In both panels, we consider \(\alpha =0.5\), \(\mu _{1}=0.4\) and \(\mu _{2}=0.5\)

First we show the existence of threshold \({\hat{\alpha }}_{A}\), which delimits Region I from Regions II and III. In this sense, threshold \({\hat{\alpha }}_{A}\) establishes which outlet receives a higher audience in equilibrium.

Proposition 2

There is a threshold \({\hat{\alpha }}_{A}=\frac{2}{4-(\mu _{1}+\mu _{2})}\) such that in the stable stationary state \({\widetilde{A}}_{2}^{*}\ge {\widetilde{A}}_{1}^{*}\) if and only if \(\alpha \ge {\hat{\alpha }}_{A}\), and \({\widetilde{A}}_{1}^{*}> {\widetilde{A}}_{2}^{*}\) otherwise.

Proposition 2 says that when societies are harsh enough with the publication of false stories, in equilibrium the outlet that obtains a higher audience will be the high-standard one. Otherwise, it will be the low-standard outlet. Note that since \({\hat{\alpha }}_{A}>1/2\), if \(\alpha \le 1/2\) it is always the case that \({\widetilde{A}}_{1}^{*}>{\widetilde{A}}_{2}^{*}\). That is, if the cost of publishing false stories is smaller than the reward for publishing a story, then the outlet with greater audience will always be the low-standard one, no matter whether outlets are similar in their vetting processes for stories or are very different. Lastly, note that \({\hat{\alpha }}_{A}\) is increasing in \(\mu _{1}+\mu _{2} \), so if \(\mu _{1}\) and \(\mu _{2}\) are small enough there are values of \(\alpha \) for which \({\widetilde{A}}_{2}^{*}>{\widetilde{A}}_{1}^{*}\). Note also that the higher the editorial standards of the outlets are, the harsher a society must be for the high-standard outlet to end up with a higher audience.

The next proposition shows that there is a threshold \({\hat{\alpha }}_{F}\) such that together with an additional condition on the outlets’ editorial standards, Region III is delimited both from Regions I and II. In other words, these conditions establish which outlet receives and publishes a higher share of the news.

Proposition 3

There is a threshold \({\hat{\alpha }}_{F}=\frac{4-4\mu _{2}}{5-\mu _{1}^{2}-6\mu _{2}+2\mu _{1}\mu _{2}}\) such that in the stable stationary state \(F_{2,\mathrm{news}}^{*}\ge F_{1,\mathrm{news}}^{*}\) if and only if \(\alpha \ge {\hat{\alpha }}_{F}\) and \(\mu _{2}<\frac{1+\mu _{1}}{2}\), and \(F_{1,\mathrm{news}}^{*}>F_{2,\mathrm{news}}^{*}\) otherwise.

Proposition 3 states that for punishment values higher than the threshold, if the editorial standards of the two outlets are not very different, in equilibrium the high-standard outlet receives and publishes a higher share of the news than the low-standard outlet. Note that \({\hat{\alpha }}_{F}\) is decreasing in both \(\mu _{1}\) and \(\mu _{2}\), thus \({\hat{\alpha }}_{F}>4/5\) always. Hence, if \(\alpha \le 4/5\) then \(F_{1,\mathrm{news}}^{*}>F_{2,\mathrm{news}}^{*}\) always, independently of the outlets’ editorial standards. That is, unless society is harsh enough with the publication of false stories, the outlet receiving and publishing a higher share of the news will always be the low-standard one.

Now, we put together the results from Propositions 2 and 3 and determine the equilibrium size of Regions I, II, and III and the values of \(\alpha \) for which the stable stationary values \(({\widetilde{A}}_{1}^{*},{\widetilde{A}}_{2}^{*})\) belong to each of the regions. To that end, observe that given a pair \((\mu _{1},\mu _{2})\), the equilibrium value \({\widetilde{A}}_{1}^{*}\) is decreasing in \(\alpha \), with \({{\widetilde{A}}}_{1}^{*}(\alpha =1)\ge 0\) and \({{\widetilde{A}}}_{1}^{*}(\alpha =0)\le 1\). Let \(A_\mathrm{low}={{\widetilde{A}}}_{1}^{*}(\alpha =1)\) denote the lower limit and \(A_\mathrm{up}={{\widetilde{A}}}_{1}^{*}(\alpha =0)\) denote the upper limit.Footnote 15 Figure 3 below represents these ideas in two cases. In the left panel, we consider \(\mu _{1}=0.2\) and \(\mu _{2}=0.4\); in the right panel, we consider \(\mu _{1}=0.4\) and \(\mu _{2}=0.6\). The colored strips indicate the range of values of the audiences for which the equilibrium of the system belongs to Regions I, II, and III. Note that together with values \(A_{F}\) and \({{\widetilde{A}}}_{1}^{*}=1/2\), thresholds \(A_\mathrm{low}\) and \(A_\mathrm{up}\) fully characterize Regions I, II, and III in equilibrium.Footnote 16

Fig. 3
figure 3

We represent Regions I, II, and III in equilibrium. The left panel considers \(\mu _{1}=0.2\) and \(\mu _{2}=0.4\); the right panel considers \(\mu _{1}=0.4\) and \(\mu _{2}=0.6\). In each panel, the red line corresponds to the fraction of news published by the low-standard outlet and the blue line to the fraction of news published by the high-standard outlet. The orange and cyan lines correspond to the fraction of false stories published by the low and the high-standard outlet, respectively. The black curve gives the value of parameter \(\alpha \) that is required to reach a particular equilibrium audience \({\widetilde{A}}_{1}^{*}\). The vertical lines at \(A_\mathrm{low}\), \(A_{F}\), 1 / 2 and \(A_\mathrm{up}\) determine the equilibrium size of Regions I, II, and III. The left boundary \(A_\mathrm{low}\) corresponds to the case with no reward, i.e., \({{\widetilde{A}}}_{1}^{*}(\alpha =1)\); the right boundary \(A_\mathrm{up}\) to the case with no punishment, i.e., \({{\widetilde{A}}}_{1}^{*}(\alpha =0)\). Note that at \(A_\mathrm{low}\), the fraction of false stories published in equilibrium by the two outlets is the same

Lastly, we conduct the exercise for the editorial standards. This is done in Fig. 4, where we determine the equilibrium size of Regions I, II, and III and the values of \(\mu _{1}\) and \(\mu _{2}\) for which the stable stationary values \(({\widetilde{A}}_{1}^{*},{\widetilde{A}}_{2}^{*})\) belong to each region. Since \({\widetilde{A}}_{1}^{*}>{\widetilde{A}}_{2}^{*}\) \(\forall \alpha \le 1/2\), and \(F_{1,\mathrm{news}}^{*}>F_{2,\mathrm{news}}^{*}\) \(\forall \alpha \le 4/5\), we only consider values of \(\alpha \) higher than 1/2. We observe that the higher \(\alpha \) is, the higher the region is in which the high-standard outlet receives a higher audience in equilibrium than the low-standard outlet. Note that this occurs in both Regions II and III. To see why, note that from Proposition 2 it is known that \({\widetilde{A}}_{2}^{*}\ge {\widetilde{A}}_{1}^{*}\) if and only if \(\alpha \ge \frac{2}{4-(\mu _{1}+\mu _{2})}\). This condition can be rewritten as \(\mu _{1}+\mu _{2}\le 4-\frac{2}{\alpha }\). Hence, the greater the punishment, the higher the pair of values \((\mu _{1},\mu _{2})\) for which \({\widetilde{A}}_{2}^{*}\ge {\widetilde{A}}_{1}^{*}\). Similarly, we observe that the higher \(\alpha \) is, the higher the region is in which the high-standard outlet receives and publishes a higher share of the news. This occurs only in Region III. Lastly, we observe that Region III is never the whole region. In fact, when \(\alpha =1\) there are pairs \((\mu _{1},\mu _{2})\) for which \({\widetilde{A}}_{2}^{*}>{\widetilde{A}}_{1}^{*}\) but \(F_{1,\mathrm{news}}^{*}>F_{2,\mathrm{news}}^{*}\). This is the case when the two outlets have relatively different editorial standards. Accordingly, our model predicts that low-standard outlets will receive and publish a greater share of the news whenever the media industry is made up of very different media outlets.

Fig. 4
figure 4

We represent Regions I, II, and III in equilibrium as a function of the pairs \((\mu _{1},\mu _{2})\) satisfying \(\mu _{1}<\mu _{2}\), for different values of parameter \(\alpha \). In blue we represent the pairs \((\mu _{1},\mu _{2})\) for which \({\widetilde{A}}_{1}^{*}={\widetilde{A}}_{2}^{*}\) and \(F_{1,\mathrm{news}}^{*}=F_{2,\mathrm{news}}^{*}\) (color figure online)

4 The stochastic dynamics

In this section, we discuss some of the results obtained when we consider the stochastic dynamics described in Eq. (2). The analysis of the stochastic dynamics has two purposes: On the one hand to check the robustness of the mean-field results to the consideration of a stochastic dynamics with a number N of media outlets. The idea is to study whether the relationship between audience, consumers’ valuation of news and market dispersion found previously holds in the present case. Figures 7 and 8 in Appendix B show that it does for most of the insights. For the sake of conciseness, these results are relegated to the Appendix. The second purpose of the stochastic dynamics analysis is to draw up new predictions on the dynamics of the system when there are more than two outlets. A relevant question here is what characteristics of the media industry make it more likely that a higher number of outlets will endure in the long run. Figures 5 and 6 below present results along this line. In these figures, we consider \(\varPhi =1/10\), which represents a situation in which the public needs less than a year to build a quasi-stationary opinion about a media outlet and at the same time the publication of a single scoop can have a substantial impact on an outlet’s audience.Footnote 17 See Appendix B for a discussion of the effect of \(\varPhi \) on the results.

Figure 5 represents the final audiences of a media industry with \(N=5\) after a 20-year period, for a representative combination of \(\mu _{1},\ldots ,\mu _{5}\) and \(\alpha \). The top, center and bottom panels show high, intermediate and low punishments, respectively. Each vertical line—connecting five dots— corresponds to a single simulation, where the lowest dot corresponds to \(\mu _{1}\) and the highest to \(\mu _{5}\). The color of a dot represents the final audience of the outlet, as indicated in the palette. Finally, the crosses indicate outlets that loose all their audience and go out of business. The first result that can be observed is that when \(\alpha \) is high there is a positive correlation between an outlet’s editorial standard and its final audience. By contrast, when \(\alpha \) is low the correlation is negative. The second (and new) result is that there are configurations of parameters for which the dynamics drive some outlets out of the market.Footnote 18 A look at Fig. 5 suggests that consumers’ valuation of news and market dispersion may both affect the degree of competition in the long run. Figure 6 explores this question in more detail.

Fig. 5
figure 5

We represent the audiences of \(N=5\) outlets after 20 years of evolution for different combinations of the editorial standards of the outlets. The top, center and bottom panels show the audiences when \(\alpha =0.9\), \(\alpha =0.5\), and \(\alpha =0.2\), respectively. Each vertical line—connecting five dots—corresponds to a single simulation, where the lowest dot corresponds to \(\mu _{1}\) and the highest dot to \(\mu _{5}\). The color of a dot represents the final audience of the outlet, as indicated in the palette. The simulations shown are a small sample of all possible \(\mu _{i}\) combinations, with \(i=1,\ldots ,5\). The value of \(\mu _{1}\) increases from left to right to facilitate the presentation. The crosses indicate the outlets that lost all their audience and went out of business (color figure online)

Figure 6 represents the probability that \(m\le N\) outlets will survive in the industry in the long run. The left and center panels represent the probability that \(m\le N\) outlets will survive as a function of \(\alpha \). It considers \(N=5\) (left panel) and \(N=10\) (center panel). For each value of \(\alpha \) (\(\alpha =0.05, 0.1, 0.15, \ldots 1\)), we perform twenty thousand simulations with random editorial standards and random initial audiences, and we run them for the equivalent of 20 years. Two results are worth mentioning. First, the probability of more outlets surviving in the long run seems to be positively correlated with parameter \(\alpha \). Hence, the more harshly consumers punish the publication of false stories, the more outlets survive in the long run. Second, for a given \(\alpha \), the number of outlets m that survive presents a low dispersion and an inverted U-shaped form, with dispersion being higher when \(\alpha \sim 0.5\) and lower when \(\alpha \sim 0\) and \(\alpha \sim 1\).

Fig. 6
figure 6

The left and center panels represent the probability that \(m\le N\) media outlets will survive in the long run as a function of \(\alpha \). The left panel considers \(N=5\) and the center panel \(N=10\). The label by each curve indicates the number of surviving outlets. The right panel represents the probability that \(m\le N\) outlets will survive in the long run as a function of the standard deviation \(\sigma _{\mu }\) of the editorial standards. We consider \(N=10\) and \(\alpha =0.7\)

The right panel represents the probability that \(m\le N\) outlets will survive in the long run as a function of the market dispersion. We define the market dispersion as the standard deviation \(\sigma _{\mu }\) of the N outlets’ editorial standards, i.e., \(\sigma _{\mu }=\sqrt{\frac{\sum _{i=1}^{N} (\mu _{i}-\langle \mu \rangle )^{2}}{N}}\). We consider \(\alpha =0.7\), 10 outlets and run a representative sample of all \(\mu _{i}\) combinations, with \(i=1,\ldots ,10\), two hundred and fifty thousand times. On average, we find that the number of outlets that survive in the industry in the long run increases as the standard deviation of the outlets’ editorial standards decreases. Hence, the more homogeneous outlets are, the tougher the competition in the industry will be in the long run.Footnote 19

5 Conclusion

This paper presents a dynamic model of competition for audience in the market for scoops. Our main contribution is to pin down two important variables to explain the dynamics: The consumers’ valuation of news and the composition of the market. We obtain that the harsher consumers are with the publication of false stories, the more likely it is that high-standard media outlets receive higher equilibrium audiences. In contrast, when consumers are especially generous with outlets that break news and do not penalize them when their stories are shown to be false, our prediction is that the market will be dominated (in terms of both audience and share of the news) by low-standard outlets. Regarding the composition of the market, we obtain that only if media outlets are similar enough in their editorial standards for quality, it is possible that high-standard outlets publish a higher share of the news. Otherwise, it will be the low-standard outlets that lead the market.

Although moving from theory to the real world is always difficult, we consider that our results help explain the great heterogeneity that characterizes the traditional press all over the world. Coming back to the examples discussed in Introduction, we propose an argument to explain why countries such as the USA and Spain show a positive correlation between newspapers’ editorial standards, audiences (circulation) and share of the news, whereas countries such as the UK have a negative correlation. The argument relies on the consumers’ valuation of news and builds on the idea that the average consumer of a newspaper in the UK is quite different from the average consumer of a newspaper in the US or Spain. The reason is the strength of the tabloid press in the UK and the distinctive features of its readership.Footnote 20 Based on this, can the average UK newspaper reader be described as more lenient with the publication of false stories and/or a more eager consumer of scoops than the average reader in the US or Spain? It is difficult and possibly controversial to argue so, but if we agree on this point then the result follows.Footnote 21 Namely, we have an argument to explain why countries with harsh consumers and homogeneous media industries (such as the US and Spain) have media outlets with high-quality standards, whereas countries such as the UK, where consumers are more lenient and the industry is more heterogeneous, show the opposite features.Footnote 22

Beyond the traditional press, we believe that our model also sheds light on the use of social media as a source of information, the lack of fact-checking and editorial judgment that characterize social media platforms, and the subsequent proliferation of fake news. In line with the discussion above, our work contributes to this debate by highlighting the importance of consumers’ behavior in monitoring fake news. In particular, it suggests that as long as consumers continue rewarding the mere publication of news (with shares, likes, comments and such) and not penalizing the publication of false stories, there is little to be done in this respect.

Our analysis also draws up predictions as to how many outlets will survive in the media industry in the long run. In this respect, our results show that the harsher consumers are and/or the more homogeneous outlets are, the more outlets will survive in the long run. Otherwise, the dynamics will drive outlets out of the industry, thus reducing competition over time.

To some extent, the results on the reward/punishment scheme and the number of outlets operating in the market in the long run are related to Duverger’s Law, which describes a nexus between electoral systems and the number of parties in a political system. More precisely, Duverger’s Law states that a plurality rule tends to favor a two-party system, whereas proportional representation tends to favor a multi-party system. In our view, the determinants of this result are similar to those of our case. Namely, high rewards lead to inequalities and the emergence of large-scale players and so erode competition, whereas low rewards leave room for more players and foster competition for longer. In our view, this conclusion extends beyond the current setup and can help explain other relevant empirical phenomena.

This study makes a number of assumptions. First, we consider media outlets and consumers that behave according to predetermined behavioral rules. Although the model that we present in Appendix A seeks to show the rationale behind these assumptions, we consider it an interesting, though complex, exercise, to analyze the long run outcomes of a dynamic model with strategic agents. Other assumptions in the model include the absence of ideological aspects and prices. These assumptions greatly simplify the analysis, as they limit the number of variables that may affect the probability of receiving a scoop. A more general model that seeks to relax these assumptions should carefully reformulate this rule. We consider this an interesting issue for future research.