1 Introduction

Risk taking underpins much of the economic activity in any modern society. At one extreme of the risk-taking spectrum lies financial investment, which is deliberate, consequential, and focused especially in the capital markets. At the other extreme, we have recreational risk taking and gambling which tend to be impulsive, low-stake, augmented by non-conscious cues and insensitivity to the inherently unfavorable odds. The perception of risk ranges from being a necessary evil to be tamed and managed in financial risk taking to an object of fun and entertainment to be enjoyed in moderation in recreational risk taking. Apart from pathological gamblers, recreational risk takers tend to be conscious of the prospect of losing while they enjoy gaming activities in the hope of winning, as prefaced by the quotation from Laplace in the epigraph.

Recreational gambling such as participating in lotteries has been part and parcel of human societies since time immemorial. In Selling Hope, Clotfelter and Cook (1989) argue that lottery products, characterized by small chances of sizable winnings at low prices, fulfill a demand for hope often by those of lower socioeconomic status. On the supply side, the state lottery has been viewed as a form of regressive taxation (Price & Novak, 1999). Earlier attempts to modeling a demand for lottery products in various non-expected utility models center around a sense of optimism arising from the overweighting of small winning probabilities. From this perspective, Chark et al. (2020) investigate the popular approach to modeling optimism using a probability weighting function directly as in Kahneman and Tversky (1979), and alternatively by applying an outcome-based salience function based on Chew (1983). Intuitively, optimism may underpin the consumer’s hope to win despite unfavorable odds (Lopes, 1987).

Chew and Ho (1994, this journal, henceforth CH94) and Lovallo and Kahneman (2000) offer a novel strand of thinking of hope as a sentiment underpinning lottery demand, in which individuals enjoy delaying the resolution of uncertainty. CH94 provide preference specifications that capture a hope-induced utility for preferring later resolution of uncertainty. For some lottery products, such as New York State’s Instant Scratch-off, the lottery outcome can be resolved immediately after purchase. Most other lottery products involve some delay between the time of purchase and the drawing of winning outcomes. A popular form of gambling in mainland China today is the daily three-digit lottery (3D). A 3D ticket costs CNY 2 and pays CNY 1,000 if the three digits picked by a bettor are drawn during the live broadcast on national TV the evening of purchase.Footnote 1 On average 21 million 3D lottery tickets are sold daily by China’s Welfare Lottery authority, equivalent to about 2 daily bets for every 100 persons aged 16 years and above.

The empirical literature points to a stylized phenomenon of an increase in demand for hope following adversity, for instance, during economic downturns or after natural disasters. This is corroborated by studies that find a positive relation between economic hardship and gambling or lottery purchases. Mikesell (1994) discusses the positive relationship between state unemployment and lottery sales in the US. Olason et al. (2017) find an increase in gambling participation but not problem gambling after the 2008 economic collapse in Iceland. Writing in the aftermath of the Great Recession, Zezima (2008) notes that “sweet dreams in hard times add to lottery sales.” In the wake of heavy snow and an earthquake in China in 2008, Li et al. (2011) report increased lottery demand (a 0.1% chance to win CNY 10,000 at a price of CNY 10) in affected vs. non-affected areas.Footnote 2Isidore (2017) reports on the “billions of dollars [that] go unclaimed in lottery prizes” pointing out that some people may even not bother to check if they won. For such consumers, the hope component of a lottery itself may be worth the ticket price.

We are led to posit an “adversity-hope hypothesis,” as prefaced by Horace in the epigraph, with lottery sales serving as a proxy for hope. To test this hypothesis, we examine a routine source of moderate-level adversity in the form of environmental pollution. Particles in ambient air—that contribute to haze or smog—are arguably the most visible time-varying manifestation of environmental pollution in developing countries, including China (Lelieveld et al., 2015; Marlier et al., 2016). When it is visible, this form of environmental degradation can affect a healthy human not only through inhalation but also through perception (Watson, 2002; Zhang et al., 2015). It is hard to miss a hazy vs. blue-sky day in Beijing, Delhi, or Jakarta even if one is able to stay inside, behind windows that properly shut. Recent papers have also documented the acute effect of air pollution on subjective well-being (Levinson, 2012; Zhang et al., 2017; Zheng et al., 2019). In this context of air pollution, our adversity-hope hypothesis would entail an increase in lottery ticket sales when air quality declines—with the mechanism being a preference for later resolution of uncertainty but not optimism.

We examine a possible relation between air quality proxied by PM2.5 (particulate matter up to 2.5 μ m in diameter) and daily sales of the 3D lottery which is generally purchased during the day and is drawn at 9 pm. The fixed-prize-fixed-odds nature of the 3D lottery, paying CNY 1,000 at 1/1000 winning odds and for which we have daily provincial-level sales data over multiple years is ideal for our investigation. We find evidence that is compatible with the adversity-hope hypothesis. Controlling for month-of-sample, day-of-week, and weather, flexibly by province, 3D betting rises and falls in step with PM2.5 (haze) pollution. The positive causal relationship that we document can readily be seen in Fig. 1 on combining the high-frequency, spatially resolved lottery sales and PM2.5 records.

Fig. 1
figure 1

Demeaned 3D lottery sales against demeaned daytime PM2.5, after subtracting province-level means. An observation is a province by day in the five years from 2013 to 2017. There are 32 provinces in the data. The top panel shows sales in CNY million, the bottom panel reports the natural logarithm of sales. For better visualization, we include locally weighted polynomials of best fit and omit a few outlying realizations (e.g., percentiles 1 and 99 of the distribution of demeaned log 3D sales are -0.6 and 0.7)

We provide regression evidence based on alternative identifying assumptions. Studies of the socioeconomic impact of pollution need to contend with the possibility that pollution is associated with unobserved day-to-day variation in economic activity. For instance, high-frequency shifts in labor demand and changes in daily wages could directly impact lottery sales. For this reason, one estimator we implement focuses on air quality variation attributable to naturally and exogenously occurring shifts in atmospheric regime (“thermal inversions”). We are then able to rule out economic activity as a candidate explanation for the positive relationship between lottery sales and pollution, which is co-produced with economic activity.

We further find that, fixing a day’s average pollution dose, daily 3D sales are higher when the pollution dose is allocated toward the morning and away from the afternoon. This is consistent with the expectation sense of hope since morning purchases deliver more time for the consumption of hope in a lottery ticket purchase prior to its resolution in the evening.

The empirical analysis in this paper supports the adversity-hope hypothesis with air quality shifting hope attitudes. Our paper contributes to a recent literature documenting the acute impacts of air pollution on socioeconomic outcomes. These include cognitive performance (Zhang et al., 2018; Archsmith et al., 2018), self-reported happiness and subjective well-being (Levinson, 2012; Zhang et al., 2017; Zheng et al., 2019), individual investor activity (Meyer and Pagel, 2017; Li et al., 2017), decision-making (Chew et al., 2021), test scores (Ham et al., 2014; Ebenstein et al., 2016; Graff Zivin et al., 2020), and crime and unethical behavior (Lu et al., 2018; Herrnstadt et al., 2019, 2021).Footnote 3

In the next section, we posit the adversity-hope hypothesis. Section 3 tests the adversity-hope hypothesis in the setting of daily lottery sales and an environment with routinely high and exogenously varying air pollution in China. We present the institutional setting, show evidence of a relationship in the raw data, and implement empirical models of lottery demand that seek to establish both internal and external validity. Section 4 summarizes and provides a discussion.

2 Adversity-hope hypothesis

To model the adversity-hope hypothesis exposited in the Introduction, we apply the CH94 model about hope being experienced “when there is enjoyment in delaying the resolution of uncertainty.” A lottery ticket typically involves a future period at which the final uncertainty is resolved. For such lotteries, attitude toward timing of uncertainty resolution has some bearing on the perceived attitude toward risk. Even if one is averse toward risks that resolve in the present period, one may still buy “hope” by purchasing a lottery ticket to enjoy the anticipation of winning.

We adopt the notation from CH94 in a two-period setting. Period-1 consumption is certain and is denoted by y whereas the uncertain consumption in period 2 is denoted by \(m = \sum q_{i}[z_{i}]\), a lottery which pays zi with probability qi (subscript i denotes a realization). Together, these flows constitute a consumption plan[y,m] in which y is consumed in period 1 and lottery m is consumed as period 2 begins but the consumer does not yet know which zi payoff she will receive until uncertainty is resolved.

The above \(\sum q_{i}[z_{i}]\) notation has the advantage of enabling us to differentiate it from another case, denoted by \(\sum q_{i}[y, z_{i}]\), in which uncertainty about period-2 consumption zi is resolved early along with the certain period-1 consumption y, i.e., the consumer already knows the specific zi payoff to be consumed in period 2 as period 1 begins. More generally, we refer to a temporal lottery, denoted by \(\sum q_{i}[y_{i}, m_{i}]\), as a stochastic consumption plan [yi,mi] which delivers the specific plan [yi,mi] with probability qi.Footnote 4

Let M denote the set of temporal lotteries. Consider two alternative temporal lotteries, each involving consumption of y in period 1:

$$ \alpha[y, m] + (1 - \alpha)[y, m^{\prime}] \ \text{and} \ [y, \alpha m + (1 - \alpha)m^{\prime}]. $$
(1)

The LHS of (1) represents early resolution in period 1 of the period-2 consumption that obtains—i.e., m versus \(m^{\prime }\), with probabilities α and 1 − α. By contrast, in the RHS the uncertainty of period-2 consumption resolves later. The enjoyment of hope corresponds to a preference for the temporal lottery described in the RHS of (1) rather than in the LHS.

We refer to a situation in which a consumer is indifferent between either temporal lottery, LHS vs. RHS irrespective of α,y,m, and \(m^{\prime }\), as one of timing indifference, following Kreps and Porteus (1978), which offers the following preference specification over temporal lotteries. Let W, called an aggregator (in the sense that it aggregates utility across periods), be a continuous and increasing function on Y × Y. For instance, consider a simple form of the intertemporal aggregator function of consumption y and z in periods 1 and 2:Footnote 5

$$ W(y, z) = v(y) + v(z), $$
(2)

The recursive expected utility (REU) of a temporal lottery d which yields [yi,mi] with probability qi is given by:

$$ V(d) = \sum q_{i} W(y_{i},\mu_{e}(m_{i})), $$
(3)

where μe(m) = u− 1(E[u,m]) is the certainty equivalent of lottery m under expected utility, i.e., accounting for the within-period aversion to risk under concave utility function u. As shown in CH94, an REU consumer is hopeful if \([y, \alpha m + (1 - \alpha )m^{\prime }]\) is preferred to early resolution \(\alpha [y, m] + (1 - \alpha )[y, m^{\prime }]\) when the concavity of the v function dominates and the composite h = vu− 1 is concave, i.e.,

$$ \begin{array}{@{}rcl@{}} V([y, \alpha m &+& (1 - \alpha)m^{\prime}]) = v(y) + h(E[u, \alpha m + (1 - \alpha)m^{\prime}]) \\ &=& v(y) + h(\alpha E[u, m] + (1 - \alpha)E[u, m^{\prime}]) \\ &> &v(y) + \alpha h(E[u, m]) + (1 - \alpha)h(E[u, m^{\prime}])\\ &=& V(\alpha[y, m] + (1 - \alpha)[y, m^{\prime}]) \end{array} $$
(4)

Note that standard discounted additive expected utility theory corresponds to the case of a linear h function. This case necessarily satisfies timing indifference and cannot exhibit hopefulness. To capture our adversity-hope hypothesis, we posit that the hope function h would become more concave at times of adversity.Footnote 6 To induce a baseline demand for a state lottery, we can augment the above analysis by applying some level of probability overweighting through a non-expected utility (NEU) model, such as weighted utility (WU) or reference-dependent utility (RDU), of the certainty equivalent of the period-2 uncertainty (Chark et al., 2020). The hope function then plays a modulating role of the baseline demand through changes in its curvature.

In the next section, we test the implication of the CH94 hope model in the setting of air quality and daily sales of a fixed-prize-fixed-odds lottery in China. An adverse environment would tend to increase the value of a lottery ticket purchased in the morning whose outcome will be revealed in the evening, giving rise to greater demand for the lottery. Absent a hope channel, deteriorating air quality associated with increased risk aversion or reduced optimism might lower, rather than increase, lottery demand.

3 Evidence from daily lottery sales

3.1 Institutional setting and data

Gambling was banned in China from 1949, the year the People’s Republic of China was founded, until 1984. That year, the Chinese Sports Association was allowed to offer betting on sporting events with the aim of funding the Beijing International Marathon. The following year, the State Council authorized the National Sports Commission to offer betting on a regular basis. In 1987, the Ministry of Civil Affairs was allowed to sell lottery products to fund social welfare and provide for public goods. Three decades later, the state-run Welfare Lottery and Sports Lottery authorities have similar revenue shares of China’s regulated lottery duopoly (Betting Guide, 2018; AGTech, 2018).Footnote 7

“Number picking” (hereafter, number) lotteries, supplied by the two national-level agencies, are the most popular accounting for 62% of lottery sales (Ministry of Finance, 2017). These consist of 3D, Double Color Balls, and Lotto 7 supplied by the Welfare Lottery, and PL3, PL5, Big Lotto, and Seven Stars from the Sports Lottery. The Double Color Balls, Lotto 7, Big Lotto, and Seven Stars run thrice weekly and have variable prizes that depend on whether there are rollovers from the preceding rounds. By contrast, the 3D, PL3, and PL5 have fixed winning prizes and are run daily.

Table 1 describes the sales data available for 32 areas, including 22 provinces, five Autonomous Regions, and five major cities, hereafter 32 provinces.Footnote 8 Detailed daily sales data are publicly available for 3D for all 32 provinces. Sales for the other number lotteries are available only as aggregated data for the entire country without geographic resolution. With about 21 million bets placed daily, 3D sales in 2017 of CNY 16 billion were double that of the other two daily fixed-prize number lotteries, PL3 and PL5, combined. With the availability of daily ambient particle levels by province starting in 2013, we examine daily 3D sales data over the five years 2013 to 2017.

Table 1 Summary statistics

In the 3D lottery, a player pays CNY 2 and picks three integers (0-9) and one of three possible formats: (i) exact order, (ii) two numbers repeated, and (iii) no repeated numbers. The winning odds is 1/1000 in format (i). For format (ii), the winning odds is 3/1000 since there are three equivalent combinations of any three winning numbers, e.g., 122, 212 or 221. Similarly, the winning odds for format (iii) is 6/1000 since there are six equivalent combinations of three winning numbers, e.g., 149, 194, 419, 491, 914 or 941. The corresponding winning amounts are CNY 1,040, CNY 346, and CNY 173 with expected payoffs of CNY 1.04 for format (i) and CNY 1.038 for formats (ii) and (iii). Online distribution remains restricted, and there were no officially sanctioned online platforms by 2018 (Betting Guide, 2018; xinhuanet.com, 2018b). The day’s sales, winning numbers, and number of winners by bet format are broadcast live, at 9 pm, on channel one of the Chinese Education TV Network (CETV-1) (cwl.gov.cn, 2017; AGTech, 2018).

Starting on January 18, 2013, PM2.5 concentrations at all surface-level air monitoring sites—about 1,500 across the country—became available, by hour, through the Chinese Ministry of Ecology and Environment (MEE). For each day and hour, we average across sites within a province to obtain a province-level mean.Footnote 9 For each day, we then average these province-level means over daytime/daylight hours, from 6 am to 8 pm inclusive.

Figure 2 shows wide PM2.5 variability within province over the 1,760 days in the study period, as well as across the 32 provinces. The 10th percentile in the combined lottery-environment sample over days and provinces is 17 μ g/m3, already in the range of human visibility (Watson, 2002). Most province-day observations amply exceed the US National Ambient Air Quality Standards, marked by the vertical lines (NAAQS annual at 12 μ g/m3, 24-hour at 35 μ g/m3). The 90th percentile at 99 μ g/m3 is severe. At such a level it is conceivable that individuals reduce their time outdoors including, where not too costly, staying more at home (He et al., 2017). Such potential avoidance behavior, as we subsequently discuss, would render our estimates conservative.

Fig. 2
figure 2

Distribution of daily mean PM2.5 mass concentrations (× 102 μ g/m3) over days (within panel) and across provinces (across panels). We compute daily mean PM2.5 during daytime hours from 6 am to 8 pm, based on hourly PM2.5 averaged across monitoring sites within each province. The sample period is January 2013 to December 2017. For better visualization, we show density up to 200 μ g/m3 and rescale the density function such that the maximum is 1 (otherwise plots for provinces with high PM2.5 such as Hebei are too flat). The annual and 24-hour US National Ambient Air Quality Standards of 12 and 35 μ g/m3 are marked by the vertical lines. Source: MEE

Besides PM2.5 and seasonal factors, our empirical models allow ambient temperature and humidity to directly affect lottery demand. For example, people may behave differently on a hot and humid summer day. From the US National Oceanic and Atmospheric Administration (NOAA), we obtain surface temperature and dew point depression, recorded every 12 hours, for 93 observatories across China.Footnote 10

In addition, NOAA data for these locations and times include wind speed and direction at surface as well as atmospheric thermal gradients (Table 1). These gradients, which fluctuate exogenously, indicate how temperature varies with altitude and whether there is a layer of warmer air overhead that traps pollutants close to the surface where they are emitted. We use atmospheric thermal gradients to instrument for PM2.5, to the extent that high-frequency unobservable shocks to economic activity are present and significantly influence PM2.5 (Seibert et al., 2000; Tang et al., 2016).

3.2 Adversity-hope hypothesis in the raw data

With lottery proxying for the demand for hope, the adversity-hope hypothesis is corroborated by Fig. 1 showing a positive relationship between 3D lottery sales against PM2.5 by province-day observation in the absence of any regression controls. To help visualize the data, we subtract province-level means from each variable and show locally weighted polynomial fits. Lottery sales are high in province-days with high and visible environmental degradation. The relationship seen in the raw data is further confirmed by the rigorous regression evidence provided next. The hypothesis is that, via a hope channel, smog and haze shift households’ preferences over the timing of uncertainty resolution, thus raising lottery sales.

3.3 Empirical models of lottery demand

We estimate variations of the empirical model of lottery demand:

$$ \begin{array}{@{}rcl@{}} q_{it}&=f(P_{it},\alpha)+W_{it}{\upbeta}_{i}+\gamma_{it}+\epsilon_{it} \end{array} $$
(5)

where the natural logarithm of 3D lottery sales in province i and day t, qit, is regressed on a parametric (e.g., quadratic) function of the key variable of interest Pit, province-day specific PM2.5 exposure, and controls. As in other developing countries, PM2.5 dominates China’s local air quality indices, with particle levels in outdoor air well in the visible range. The interpretation is that of PM2.5 as a general indicator (Dominici et al., 2010) of the severity of haze, including even smaller particles (ultrafine PM 0.01 to 0.1) that are not routinely monitored in China or elsewhere (He et al., 2019).Footnote 11

Province-day specific weather controls Wit include two complete sets of bins, each bin 5C wide, for ambient temperature and dew point depression.Footnote 12 We average variables across a day’s two readings, at 8 am and 8 pm local time, prior to taking 5C bins. We interact each set of bins with a complete set of province fixed effects (FE), flexibly allowing for the impact of weather on lottery sales to vary by province.

Time controls γit include complete sets of month(-of-year) FE, year FE, day-of-week FE, and a dummy variable for public holidays, to allow for systematic demand shifts over the years in the sample, within year, within week, and with the holiday calendar. We interact each set of time controls (month, year, day-of-week, public holiday) with province FE. This flexibly allows the effects of season on lottery sales to vary by province. We note that 3D sales and PM2.5 co-vary over the seasons, both variables increasing in the more adverse winter months of December-January relative to June-August. Since many factors can explain seasonal correlation, it is absorbed by the γit. A model variant replaces month-by-province FE and year-by-province FE with even tighter month-by-year-by-province FE, i.e., month-of-sample interacted with province.

Winning prizes for the 3D lottery were raised in August 2014, namely from CNY 1,000 to CNY 1,040 (for the exact-order format), from CNY 320 to CNY 346 (two numbers repeated format), and from CNY 160 to CNY 173 (no repeated numbers format). To allow for a change in winning prizes to shift demand, akin to a price change, we include a post August 2014 dummy interacted with province FE in the vector of time controls; this allows for cross-sectional variation in prize sensitivity. Beyond these increases in winning prizes, we note that the price per 3D bet has not changed from CNY 2 and that the supply of 3D bets is elastic. We thus interpret 𝜖it as an idiosyncratic demand shock.

By way of sensitivity analysis, we specify a cubic function of PM2.5 instead of a quadratic. Another variant of demand model (5) includes PM2.5 on each of the three preceding calendar days in addition to PM2.5 on the concurrent day of 3D sales. We also interact PM2.5 with dummies for China’s six different regions, allowing for geographic variation in the estimated impact of pollution on lottery demand.

Allowing for endogenous PM2.5. We estimate models both by ordinary least squares (OLS) and by two-stage least squares (2SLS). The OLS estimator assumes that PM2.5 exposure is measured without error and that omitted variables that correlate with lottery demand and PM2.5 are absent. This assumption may fail in the presence of components of economic activity that co-vary with emissions and that are not already accounted for by controls γit that capture systematic province-specific seasonal and day-of-week influences and time trends.

An alternative 2SLS estimator assumes that whether the atmosphere is stagnant or well ventilated has no bearing on other possible drivers of lottery demand, such as economic activity. These atmospheric regime shifts, which we observe through recorded thermal gradients, are excluded from equation (5), only affecting lottery demand through PM2.5 (Zhang et al., 2015; Tang et al., 2016; Liu & Salvo, 2018; He et al., 2019). Province-day specific atmospheric regime conditions, denoted by Ait, are a key determinant of local PM2.5 (Figs. A.4 and A.5) and are unlikely to correlate with unobserved lottery demand shocks, 𝜖it (exogeneity).Footnote 13

The 2SLS identifying assumption would be violated were a special kind of pollution control policy in place. Consider a hypothetical scenario in which emissions permits issued by a strong and well-informed regulator responded to day-to-day atmospheric regime shifts. In this hypothetical scenario, 2SLS estimates may be downward biased (conservative). For instance, atmospheric stagnation would lower industrial activity as regulators attempted to mitigate the high PM2.5 levels; canceled work shifts would reduce the purchase of lottery tickets through an income channel or a decline in commuter traffic (less workers passing by street-based lottery outlets on their way to work). We do not believe that such a scenario befits China’s environmental regulations and, in any case, such a scenario would render our 2SLS estimates conservative.

To use a household analogy for air pollutant dispersion and removal, shifting from a stagnant to a well-ventilated atmosphere is akin to flushing the toilet. Figs. A.4 and 5 illustrate the strong correlation between PM2.5 and key elements of Ait: (i) thermal gradients and (ii) wind. PM2.5 tends to be high: (i) when there is a positive (less negative) atmospheric temperature gradient in altitude, as the thermal inversion traps surface emissions;Footnote 14 and (ii) when horizontal dissipation is poor, characterized by low wind speeds.

We thus use atmospheric removal conditions Ait to form an instrument for measured pollution Pit, estimating the following auxiliary equation by OLS flexibly by province i:

$$ \begin{array}{@{}rcl@{}} P_{it}&=A_{it}\phi_{i}+\delta_{it}+\nu_{it}, \end{array} $$
(6)

Included in Ait are linear and quadratic terms for thermal gradients in the atmosphere’s five layers closest to the surface and wind speed, as well as wind direction.Footnote 15 To account for the build-up of particles during multi-day inversion episodes, we include same-day and prior-day conditions, and interact a dummy for light wind (up to 1 m/s) with each thermal gradient. δit are time fixed effects (month-by-year, day-of-week, public holiday), and νit is a disturbance.

We fit \(\hat {P}_{it}\) using the auxiliary atmospheric regime-pollution model (6) and employ these fitted values to instrument for measured Pit in the lottery demand equation (5). To obtain 2SLS estimates, we still generate first-stage predictions of P from the excluded instrument \(\hat {P}_{it}\) and the non-pollution covariates in the second-stage lottery demand equation.Footnote 16

To be clear, Ait does not include surface temperature or humidity, which we allow to directly affect lottery demand. Our results are robust to dropping wind speed from Ait and instead modeling this variable as a lottery demand shifter Wit in (5). In our setting, winds are mild and unlikely to affect lottery demand. Wind speeds average 2.2 m/s compared to 4.6 m/s and 3-4 m/s in Chicago and Los Angeles (Herrnstadt et al., 2019, 2021; Anderson 2020).

3.4 Adversity in environment increases lottery demand

Table 2 reports estimates for alternative models of the causal effect of PM2.5 pollution on daily 3D lottery demand. Impacts estimated with 2SLS, in specifications 5 to 8, are somewhat larger than with OLS, in specifications 1 to 4. This is consistent with either some measurement error in pollution exposure or some unobserved day-to-day variation in economic activity and emissions, which the 2SLS models account for. For instance, on unobservable busy days workers/individuals have less time to purchase lotteries and emissions are higher, suggesting downward biased OLS estimates. In specifications 1 (OLS) and 5 (2SLS), with a quadratic in daytime PM2.5, a shift from 10 to 50 μ g/m3 increases a day’s 3D sales by 1.1-1.3 log points, with day-clustered standard errors of 0.2-0.3 log points (the dependent variable is the log of sales). 2SLS estimates are generally less precise. The coefficient on the quadratic term is negative such that pollution’s marginal effect falls as PM2.5 rises from 10 μ g/m3 (the first percentile of PM2.5 over days and provinces). Figure 3 visualizes the impact of a large—but still in sample—PM2.5 shift from 10 to 200 μ g/m3, converted to a 95% confidence interval (CI) of percentage increase: Lottery demand increases by about 3.0 percent in specifications 1 and 5.

Table 2 The impact of particle pollution on daily 3D lottery sales across China’s provinces: OLS and 2SLS estimates
Fig. 3
figure 3

Percent increase in daily 3D lottery demand due to a 10 to 200 μ g/m3 in-sample shift in daytime PM2.5 levels (6 am to 8 pm mean). The figure plots the 95% confidence interval for the PM2.5 impact based on each of the 8 OLS and 2SLS regression models reported in Table 2, implemented on the 3D province-day sample

In specifications 2 and 6, estimates grow on replacing month-by-province and year-by-province intercepts with more flexible month-by-year-by-province intercepts. The effect of PM2.5 on lottery demand is now estimated from co-variation within month-of-sample and province. These detailed controls absorb a few isolated spikes (e.g., Beijing in October 2014) and breaks (e.g., Jianxi from March 2015 on) that we observe in the sales data. In specifications 3 and 7, estimated impacts are similar when we add a cubic term in PM2.5, suggesting that the quadratic form is not restrictive. Estimates on same-day PM2.5 are similar, in specifications 4 and 8, when we include linear and quadratic terms for PM2.5 in each of the three days preceding the specific daily draw. Across all models in Fig. 3 (Table 2), the impact of a 10 to 200 μ g/m3 PM2.5 shift on daily 3D lottery demand ranges from + 2.9 to + 5.1 percent, and is significant at the 1% level throughout.

In Table A.1, we repeat the quadratic PM2.5 specification on separate sub-samples by region of China (north, northeast, northwest, east, southwest, and southcentral). OLS estimates are positive for all regional sub-samples, and statistically significant for six of the seven regions. 3D sales impacts range from + 1.6 to + 7.4 percent for a 10 to 200 μ g/m3 PM2.5 shift. 2SLS estimates are more dispersed, ranging from 0.0 to + 13.3 percent, and less precise. (We discuss the case of independently managed Taiwan below.)

Table 3 restricts the sample to the two municipalities with province status in the 3D lottery data, Beijing and Shanghai, where the US State Department monitors PM2.5 since 2008 and 2011, respectively. We use US State Department rather than MEE PM2.5 records at these granular geographic areas (cities, not provinces), add controls for daytime precipitation (obtained from NASA), and temporally extend the sample over one decade. Similar estimates based on alternative US State Department pollution records for two major cities reassure us of our findings.Footnote 17

Table 3 PM2.5 recorded by the US State Department for two municipalities over a decade

Table 4 shows evidence that morning PM2.5 impacts lottery sales more than afternoon PM2.5. Fixing a change in the daytime mean PM2.5 (6 am to 8 pm), say from 10 to 200 μ g/m3, the impact on lottery sales is more pronounced when the morning is polluted (and the afternoon has cleared) than when the onset of haze happens in the afternoon (and the morning was clear). We interpret this pattern as being consistent with an individual spending the day savoring the prospect of winning, ahead of the resolution of uncertainty.

Table 4 The impact of morning vs. afternoon pollution on daily 3D lottery sales

3.4.1 Robustness

The Appendix provides further sub-sample analysis and robustness tests. Table A.2 shows OLS and 2SLS estimates for the quadratic PM2.5 specification implemented separately on (i) the 16 provinces with higher PM2.5 in the sample; (ii) the 16 less polluted provinces; (iii) the colder and often more polluted season from October to March; and (iv) the warmer April to September season. PM2.5’s estimated impact on daily 3D demand is similar and significant whether we stratify the sample by provinces’ typical air quality or by season.

Table A.3 reports on several robustness tests, with similar positive and significant effects estimated throughout. Estimates hardly change when we replace the 5C-wide ambient temperature bins by a quadratic in temperature, continuing to interact temperature covariates with province intercepts for flexibility. Effects shrink slightly when we progressively drop dew point depression and temperature controls, or when we correct for seasonality using week-of-year rather than month-of-year intercepts (always interacted with province). Very granular week controls likely absorb some variation of interest—and Table A.4 follows Isen et al. (2017b) and goes further still, specifying day-of-year rather than month-of-year or week-of-year.

Continuing with Table A.3, similar results obtain when replacing year intercepts with a trend (again province-specific), or when the estimation sample is trimmed at the 1st and 99th percentiles of the province-specific distribution of daily 3D sales or restricted to weekdays outside of public holidays. We also specify daily PM2.5 as the 24-hour mean rather than the 15-hour daytime mean. Table A.7 shows sensitivity analysis to the parametric form of PM2.5, specifying PM2.5 (i) in bins, (ii) as a linear spline function with knots set at 50 and 100 μ g/m3, and (iii) in logarithmic form, as alternatives to the quadratic and cubic functional forms of Table 2. We find a robust concave lottery demand response to air pollution. Finally, weighting the regression by a province’s 2010 Census population yields similar estimates (not shown).

3.4.2 Demand for other number lotteries

We find that demand for other number lotteries, with available data aggregated for the entire country, is positively associated with a weighted average for PM2.5 across China. Besides the 3D examined above, we consider the two other fixed-prize lotteries and the four variable-prize lotteries offering highly skewed prospects of winning millions of CNY (in contrast to CNY 1,000 in the 3D). The PL3 lottery has the same rules as the 3D. The PL5 lottery pays CNY 100,000 if a player picks five winning integers (0-9) in the exact order. Like the 3D, each PL3 and PL5 bet costs CNY 2. The rules for variable-prize lotteries are detailed in the Appendix.

Table 5 reports OLS estimates for the log of each product’s nationwide sales—varying daily in the case of fixed-prize lotteries, and thrice weekly otherwise—regressed on a quadratic function of average PM2.5 with controls for weather and time. Nationally aggregated environmental variables are weighted averages of their provincial-level counterparts, where the weights are the provincial shares of China’s 2013-2017 3D lottery sales (results are similar if we use population shares as weights).

Table 5 Robustness: The relationship between nationwide sales for seven number lotteries and average PM2.5

The top panel controls for month and year intercepts whereas the bottom panel specifies month-by-year intercepts to better account for fluctuating sales over the period. For comparability, columns 1 and 8 report on aggregated 3D lottery sales. Figure 4 shows the corresponding 95% CI. In columns 2 and 3, an in-sample aggregate PM2.5 shift from 10 to 140 μ g/m3 is associated with statistically significant increases in aggregate PL3 and PL5 lottery sales, of 4.2 and 5.7 percent respectively. With more flexible month-by-year intercepts, columns 9 and 10 show increases of 3.3 and 6.0 percent. Despite the use of a different data source, namely the Sports Lottery, the observed pattern is consistent with that of the 3D from the Welfare Lottery.

Fig. 4
figure 4

Percent increase in the nationwide sales of seven number lotteries associated with a 10 to 140 μ g/m3 in-sample shift in PM2.5, averaged across China. The figure plots the 95% confidence interval associated with a PM2.5 shift based on the OLS regression models reported in Table 5, implemented on daily time series (3D, PL3, and PL5 fixed-prize lotteries drawn every day) or thrice weekly time series (Double Color Balls, Lotto 7, Big Lotto, and 7 Stars variable-prize lotteries drawn three days every week). Models 1-7 specify month and year intercepts (top panel). Models 8-14 specify month-by-year intercepts (bottom panel)

Estimated associations for PM2.5 and variable-prize lottery sales are mostly significantly positive. In columns 11 to 14, a 10 to 140 μ g/m3 PM2.5 shift is associated with Double Color Balls, Lotto 7, Big Lotto, and 7 Stars sales increases of 6.8, 6.5, 6.9, and 4.4 percent, all positive and the first three separately statistically significant (7 Stars is the smaller of the four by sales). Since the variable-prize lotteries are drawn only three times per week, on spaced-out days, here we add lagged PM2.5 exposure (and report the cumulative impact) to account for sales on days preceding a draw. Despite the loss of geographic resolution, we are reassured by the results from the nationwide time series. These are illustrated in the scatterplots of Fig. 5, showing aggregated sales for the different lotteries against aggregated PM2.5.

Fig. 5
figure 5

Residuals of log nationwide sales, for the six largest number lotteries, against residuals of nationally aggregated PM2.5 on the day of the draw, after partialing out time fixed effects (month-by-year, day-of-week, and public holiday) and weather (quadratics in temperature and dew point depression) from each time series. An observation is a day (draw) in the period January 2013 to December 2017. The fixed-prize lotteries in panels a to c have draws every day, so the time series are daily. The variable-prize lotteries in panels d to f have three draws per week, so the frequency of the time series are thrice weekly. Environmental variables are aggregated across provinces as described in Table 1. For better visualization, the panels omit a few outlying residuals and show locally weighted polynomial fits

3.4.3 External validity: Taiwan

Authorities in Taiwan sell daily fixed-prize number lotteries that are very similar to (mainland) China’s 3D, PL3, and PL5 (Taiwanlottery.com.tw, 2019). Taiwan’s three- and four-digit 3D and 4D are similarly popular as their counterparts in China. A time series of daily sales across Taiwan is publicly available for the period 2014 to 2018. Exposure to particle pollution in Taiwan tends to be lower than in mainland China, yet PM2.5 routinely rises above the US NAAQS and varies in the range of human visibility. Here we look to this independently provided dataset for validity of our main analysis. We find that in Taiwan, too, PM2.5 significantly raises daily lottery demand.

The provider, Taiwan’s state-run Welfare Lottery, is not related to China’s lottery authorities. Similar to China’s 3D, Taiwan’s 3D and 4D run daily (except on Sunday), with winning numbers drawn between 8 and 9 pm. To place a bet in either the 3D or the 4D, a player pays 25 New Taiwan dollars (NT$), or about US$ 0.80, and picks three integers in the 3D, or four integers in the 4D. Top prizes are paid out if a bet matches the winning number in exact order, with winning odds and winning amounts of, respectively, 1/1000 and NT$12,500 in the 3D, and 1/10,000 and NT$125,000 in the 4D (and expected payoffs of NT$12.5 alike). As in China’s 3D, betting formats other than the exact-order format can be purchased. We obtained publicly available daily sales spatially aggregated for Taiwan over the five-year period between January 2014 and December 2018.Footnote 18

Hourly PM2.5 concentrations at all surface-level air monitoring sites—76 sites across 22 Taiwanese cities—are available from Taiwan’s Environmental Protection Administration. Following our main analysis of lottery demand in China, for each day and hour, we average PM2.5 across sites within a city; for each day, we then average these city-level means over daytime hours, from 6 am to 8 pm. Finally, Taiwan-aggregated PM2.5 are population-weighted averages of city-level daytime PM2.5. We obtain Taipei’s surface and atmospheric meteorological conditions, recorded every 12 hours, from NOAA. Again following our main analysis, NOAA data provides times series of weather controls Wt, namely ambient temperature and dew point depression, and of atmospheric pollutant removal variables At.

Table 6 reports estimates for OLS and 2SLS models of the causal effect of PM2.5 pollution on daily 3D and 4D lottery demand in Taiwan. Since the 90th and 99th percentiles of daytime PM2.5 in the combined lottery-environment sample for Taiwan are 35 μ g/m3 and 50 μ g/m3 (compared with 99 μ g/m3 and 214 μ g/m3 in the mainland China sample), we report the impact of a PM2.5 shift from 10 to 50 μ g/m3. We find that PM2.5 significantly increases 3D and 4D lottery sales. While point estimates tend to be larger, particularly under the 2SLS identifying assumption, evidence based on Taiwanese daily time series is consistent with our results using the province-day sample for mainland China. We note that some presence of online sales in Taiwan (Taiwanlottery.com.tw, 2019) can, by diminishing the potential role of outdoor avoidance in reducing lottery sales, explain the larger PM2.5 impact compared to what we find in mainland China.

Table 6 External validity: Daily 3D and 4D lottery sales and PM2.5 in Taiwan

3.4.4 Heterogeneity

We return to the province-day sample for 3D lottery sales in China. We cautiously explore whether the relationship between haze-induced adversity and lottery demand exhibits heterogeneity along some margins, for example, over the weekly cycle, with colder weather, and with short-run variation in the increasingly popular Shanghai Stock Exchange (SSE) composite index. To our baseline specifications in Table 2 (OLS and 2SLS), we include as regressors interaction terms between PM2.5 (and its square) and variables that might be related to the demand for hope. We control for the levels of these variables as well, allowing them to correlate with lottery demand directly.Footnote 19 Table 7 reports on a subset of the specifications we have implemented.

Table 7 The impact of particle pollution on daily 3D lottery sales: Heterogeneity

We first allow the economic and ambient environments to interact. As motivation, recall that Mikesell (1994) finds that US state lottery sales increase with state-level unemployment, suggesting that lottery demand rises when the economic environment deteriorates. Kumar (2009) documents that lottery expenditure is positively correlated with investment in lottery-type stocks, and that the demand for lottery-type stocks increases during economic downturns. It is plausible that stock market performance (a variable that moves daily) may correlate with lottery demand, and the participation rate in China’s stock market is already quite high, e.g., 138 million Chinese owned stocks in 2018 (xinhuanet.com, 2018a).

Columns 1 and 5 consider short-run stock market performance. A sharp fall in the stock market can be adversely perceived by individuals and potentially affect their hope attitudes and demand for lotteries. We generate a binary indicator that equals one if the one-day return in the Shanghai Stock Exchange on the previous day is in the bottom 10% of the distribution of one-day returns over the study period. The estimation sample thus excludes Sunday and Monday as there is no trading on Saturday and Sunday. As explained, we interact PM2.5 covariates with the Poor Stock Market indicator.

We find that the positive relationship between a day’s lottery sales and PM2.5 grows stronger when the stock market performed badly the day before, that is, when high-frequency economic adversity and environmental adversity interact. We also explored interactions between PM2.5 and five-day or seven-day returns, obtaining similar associations. One possible interpretation is that when “bad news” interact, such as haze and a poor stock market, the desire for “good news” to look forward to may grow stronger. While finding this association interesting, we caution that both short-run stock market performance and lottery demand may be responding to wider economic variables.

Research suggests that daily returns in the US stock market may vary systematically with day of the week (Birru, 2018). We consider whether the effect of PM2.5 on lottery demand differs by type of day. To our baseline specifications we include interactions between PM2.5 and separate indicators for Monday, Saturdays and Sundays/public holidays (we have explored other specifications). Columns 2 and 6 show that the PM2.5 effect is not significantly heterogeneous over the different weekdays and is perhaps stronger on Saturday.

We consider the interaction between PM2.5 and cold weather. We generate an indicator for cold days, defined as temperature in the bottom 10% of the province-day sample. Columns 3 and 7 report that the effect of PM2.5 on lottery demand is not significantly different on cold vs. normal days. If anything, the effect of PM2.5 on lottery demand is somewhat lower on cold days, which is consistent with avoidance of the cold outdoors (less gamblers visiting street-based lottery outlets irrespective of haze).

It is possible that individuals stay inside with space-heating on cold days. Studies have documented that, unlike their northern counterparts, cities south of the Huai river and Qin mountains do not enjoy subsidized heating, despite winter temperatures occasionally falling below 0C (e.g., Chen et al., 2013; Chu et al., 2018). We generate an indicator for the absence of subsidized heating, equal to one if the province’s capital city lies south of the Huai-Qin boundary. To check whether the impact of haze on lottery demand depends on temperature when it is difficult for households to seek comfort indoors, to our baseline specification we add a triple interaction between the cold-day indicator, the no-subsidized-heating indicator, and PM2.5. Columns 4 and 8 suggest that the impact of PM2.5 on lottery demand may be stronger in cold vs. non-cold days in cities where heating is not subsidized, but this difference is not statistically significant.

4 Discussion

Drawing on both laboratory and field observations, the economics and psychology literatures reveal a tendency for individuals to display longshot preference, i.e., risk affinity towards small chances of sizable gains, while simultaneously exhibiting aversion to risk involving moderate chances of winning (Kahneman and Tversky, 1979; Chark et al., 2020). Theoretical models offer preferential foundations for the observed concurrence of longshot preference behavior revealed in lottery purchases and pervasive risk aversion evident in capital markets (see, e.g., Quiggin 1991; Chew & Tan 2005). The literature has also developed models of preference over the timing of uncertainty resolution—or hope attitude—being distinct from attitude towards risk (Kreps & Porteus, 1978; Chew & Epstein, 1989).

Empirical evidence supports the notion that individuals tend to exhibit hopefulness when facing longshots, even choosing to delay their resolution (Chew & Ho, 1994). Lotteries have been described as “selling hope” (Clotfelter & Cook, 1989), and “are perhaps best understood in terms of the anticipatory fantasies and hopeful excitement that they permit” (Lovallo & Kahneman, 2000). In this regard, one may draw a further distinction between hope underpinned by optimism as in probability overweighting versus hope in terms of savoring the prospect of a positive resolution of uncertainty in the future.

An empirical literature points to increased demand for hope products, including lotteries and lottery-type stocks, at times of economic and environmental adversity (e.g., Mikesell 1994; Kumar 2009; Li et al., 2011; Olason et al.2017).Footnote 20 To some, candles are symbols of hope, and the press commonly shows footage of crowds holding lit candles in the aftermath of adverse events—whether natural or manmade, for example, an earthquake, a mass shooting, or the death of a celebrity (such as the Princess of Wales killed in a car accident in 1997). Besides an increase in hope, the literature finds that adversity may also induce less optimism and thus more pessimism in people (Kivimäki et al., 2005).

The empirical relationship we present adds to these findings across disciplines. Our design and robustness across data sources heightens our confidence that the result is not a statistical artifact or due to confounding factors. We study adversity in the form of pollution shocks that vary widely and visibly over time and space in a country of land area comparable to the US. The purchase of lottery tickets is widespread across Chinese households, not confined to problem or pathological gamblers.Footnote 21 Morning pollution has a larger impact than pollution in the afternoon, when the time to uncertainty resolution is shorter. We estimate a similarly large and statistically significant relationship using data from different state lottery providers—two separate authorities for mainland China, yet another for Taiwan.

We obtain a positive relationship despite possible avoidance behavior (Chu et al., 2020). Were routine haze to induce some individuals to stay at home, and given the limited opportunities for online lottery purchase directly from home, we conjecture that pollution’s impact on lottery demand would be higher still in the absence of avoidance or the presence of online sales. Indeed, this is consistent with our estimates for Taiwan.

Recent papers point to a positive association between air pollution and risk aversion based on overall stock market data (Levy & Yagil, 2011; Li & Peng, 2016; Li et al., 2017). This is in line with the direct finding of increased aversion to even-chance risk in (Chew et al., 2021), because financial investments generally involve deliberation over more-or-less evenly distributed risks with positive returns in which risk aversion plays a key role. In contrast, lotteries concern skewed prospects with negative actuarial values, typically involving delayed resolutions, in which a more impulsive motive such as hope may be at play. In this regard, there also are lottery-type stocks (Kumar, 2009) which exhibit positive skewness and excess kurtosis with poorer returns relative to the overall market.

Future work using observational or experimental evidence can verify whether adversity shocks—including pollution, snow, stock markets, conflict, terrorism, pandemics—shift the demand for lotteries in terms of the timing of uncertainty resolution, or induce substitution across lottery and investment products. Lottery sales data at a granular spatial level such as neighborhood or city may be available and combined with demographic data such as income. Spain’s two-century-old multi-billion Christmas lottery, with advance sales taking place as early as July, points to the prevalence of consumption of hope. Our adversity-hope hypothesis predicts that, all else equal, such lottery sales would grow with, for example, terrorist-group ETA bombings in time and space.Footnote 22 Individual-level data can further allow one to assess whether adversity induces a compositional change in the demand for lottery and investment products.

In the context of a theoretical literature that models hope and optimism as drivers of lottery demand, we interpret our results for pollution-induced daily lottery sales in China through the lens of the adversity-hope hypothesis. In the face of adversity, an increase in lottery demand is plausibly due to hope rather than to optimism (feeling lucky or confident). An increase in optimism when confronted with air pollution would be hard to square with studies that find that haze reduces subjective well-being (Zhang et al., 2017; Zheng et al., 2019). By the adversity-hope hypothesis, hazy and smoggy days shift preferences over the timing of uncertainty resolution. As the environment around them grows more severe, individuals may seek “good news” with greater fervor. Says an Arab proverb (noted in CH94), “there is fear in every hope, and hope in every fear.”