1 Introduction

Although the traditional body of literature on air pollution focuses on direct health impacts (Graff Zivin and Neidell 2013), recent work suggests that exposure has broader implications, such as reduced worker productivity (Chang et al. 2016), human capital formation (Ebenstein et al. 2016), and cognitive capabilities (Powdthavee and Oswald 2020). Failing to consider these (harder-to-measure) sub-clinical effects can lead policymakers to underestimate the negative impacts of contaminated air on human societies (Aguilar-Gomez et al. 2022).

In this article, we explore the impact of air pollution on decision-making by examining its effects on judicial rulings. Although judge decisions should be (in theory and by law) unaffected by biases and emotions (Eren and Mocan 2018), realists contend that there is substantial room for external factors to transpire into sentencing and sway human reasoning (Danziger et al. 2011). Our central argument relies on the notion that air pollution is one of these factors.

To the best of our knowledge, this is the first study of pollution-induced bias in the Indian judiciary and the first paper to find a significant effect of air pollution on judicial decisions. As such, we add to the growing literature suggesting that traditional cost-benefit analyses understate the actual costs of air pollution as they fail to incorporate its sub-clinical consequences. Specifically, we provide evidence that, in addition to the environmental, health, and productivity costs of air pollution, exposure can affect high-stakes decision-making. Our contribution also includes the empirical analysis of air pollution effects in the context of limited data availability and low-quality pollution measures.

Exposure to air pollution can confound judicial choices due to its physiological and psychological effects on humans. Air pollutants alter the brain’s chemistry and cause systemic inflammation in the central nervous system, leading to reduced cognitive performance, unstable risk preferences, fatigue, and a greater propensity to punish others (Lu 2020). The effect of air pollution on sentencing can lean in either direction. On the one hand, judges could refrain from convicting individuals as a mitigation measure when pollution affects their focus and memory (Aguilar-Gomez et al. 2022). On the other hand, judges may sentence more people if air pollution increases feelings of aggression, discomfort, and apathy (Lu 2020). Understanding the direction of the effect is critical as judges make daily decisions with long-lasting impacts on the lives of citizens (Ash et al. 2021). Hearings are also a relevant context to investigate the effect of air pollution on human behavior as judges’ routine tasks involve characteristics shared across other professions, such as sensory awareness, decision-making, social interaction, and critical reasoning (Sarmiento 2022b).

To examine the relationship of interest, we consider the universe of criminal cases in the Republic of India from 2010 to 2018. Our data comes from more than twenty million penal cases from the government E-Courts platform. We aggregated the individual observations into a panel of monthly cases and convictions per Indian subdistrict while building similar measures of corresponding temporal frequency and spatial resolution for air pollution and weather controls with remote sensing data from the North American and European Space Agencies (see van Donkelaar et al. 2021).

The core empirical strategy relies on high-dimensional fixed-effects Poisson pseudo-maximum likelihood-estimator (PPMLE) panel models of the relationship between PM2.5 and the number of monthly convictions per Indian subdistrict. We substantiate these estimates into causal evidence with a control function approach using strength-weighted atmospheric thermal inversions as a source of exogenous variation in air pollution.Footnote 1

Empirical results suggest a positive relationship between PM2.5 and convictions. Estimates from the fixed-effects model imply that an increase of 10 μg/m3 in monthly PM2.5 concentrations raises the number of convicted individuals by 1.62%. Causal estimates that rely entirely on the variation in pollution generated by thermal inversions indicate that the proposed impact is more sizable and stands at 7.24%.

These effects are statistically and economically significant. Back-of-the-envelope calculations suggest that decreasing the average concentration of PM2.5 by 10μg/m3 (or 38% of a standard deviation) would decrease the number of convictions in currently active cases by as much as 145,000. Robustness exercises show that the influence of contaminated air is geographically homogeneous and is primarily driven by extreme pollution episodes, that is, periods of contamination within the top quintiles of the PM2.5 distribution.

Although evaluating the costs associated with these wrongful convictions is challenging, we estimate that decreasing PM25 by 10 μg/m3 can lead to national annual savings of between ninety-six and four-hundred and four million dollars. For context, the Indian National Clean Air Programme (NCAP) aims to reduce PM2.5 by up to 30% in 2024 cf. 2017. This decrease implies a fourteen μg/m3 reduction when using the average 2017 concentration in our sample (47.5 μg/m3), meaning that in addition to the health advantages of improved air quality associated with NCAP, the decrease in exposure would also lead to sub-clinical benefits such as productivity improvements (Chang et al. 2016), lower crime rates (Bondy et al. 2020), and fewer convictions. It is also worth noting that aside from the fact that wrongful convictions have immense implications for the future of the concerned individual, they can also reduce the confidence of citizens in the legal system (Norris et al. 2020).

Our findings stand opposite to a similar analysis that found no impact of air pollution on sentence severity (Hou and Wang 2020). We posit that this difference occurs due to discrepancies in the institutional setting and other factors such as building standards, air pollution control capabilities, and adaptation. For instance, there is evidence that the impact of environmental factors like temperature on sentencing decisions depends on the setting; that is., while there is no evidence of temperature affecting Australian judges (Evans and Siminski 2021), recent work shows that temperature may affect conviction probabilities in India (Craigie et al. 2022).Footnote 2 In a similar context to ours, judges’ productivity has also been found to be affected by air pollution in both China (Kahn and Li 2020) and Mexico (Sarmiento 2022b).

We divide the rest of the study into eight sections. The Literature Review contextualizes our research within studies on the sub-clinical effects of air pollution and the effects of external factors transpiring into sentencing decisions. We divide the Background Section into three sub-sections; Air Pollution in India introduces the reader to the current state of affairs regarding air pollution levels, sources, and policies in India; Air Pollution and Human Behavior outlines the current state of research on the relationship between air pollution and human behavior; and The Indian Judiciary describes the structure of the Indian judicial system. The Data Section presents the judicial and environmental data sources we use in our empirical model. In the Theoretical Background, we present a small theoretical model of the relationship between air pollution and convictions; this section also works as a bridge between the Data Section and the Research Methodology. Research Methodology explains the empirical method we use to answer our research question, i.e., is there a statistically significant effect of air pollution (proxied by PM2.5) on the sentencing behavior of judges? We divide this section into two subsections; the Fixed Effects Model and the Control Function Approach. The Results Section presents the estimates from these two methods and provides econometric evidence on the relationship between PM2.5 and sentencing in India. Finally, the Discussion and Conclusion Sections contextualize, summarize, and conclude the study.

2 Literature Review

The World Health Organization (WHO) considers air pollution one of the most significant environmental risks to human health; in 2019, 99% of people lived in areas with exposure levels above WHO air quality guidelines (The World Health Organization 2023). Globally, 4.2 million premature deaths are linked to outdoor air pollution, with approximately 90% of the burden occurring in low and middle-income countries (The World Health Organization 2023). Averaged globally, particulate pollution alone decreases average life expectancy by 2.2 years compared to the counterfactual scenario of concentrations below WHO guidelines (Greenstone and Fan 2018).Footnote 3

Aside from mortality, air pollution affects health through its effects on strokes, chronic respiratory diseases,Footnote 4 reduced lung function, heart attacks, hypertension, and lung cancer (Jiang et al. 2016; Cao et al. 2020; Manisalidis et al. 2020; Shah et al. 2015). Exposure can result in systemic inflammation, oxidative stress, and the formation of blood clots, leading to cardiovascular conditions (Brook et al. 2010; Münzel et al. 2018). Further health consequences involve adverse birth outcomes such as preterm birth, low birth weight, and developmental issues linked to maternal exposure during pregnancy (Shah et al. 2011; Stieb et al. 2012). Additionally, air pollution was further linked to neurodevelopmental disorders, cognitive deterioration in older adults, and cancer (Power et al. 2016; Costa et al. 2020; Turner et al. 2020).

While the best-known consequence of exposure is its direct impact on mortality and morbidity (Deschenes et al. 2017; Greenstone and Fan 2018), recent work explores the sub-clinical effects of air pollution on other variables like human productivity, behavior, emotions, and cognitive capacity. Exploring the sub-clinical costs of exposure is critical to estimate its marginal effects (Chay and Greenstone 2005; Ebenstein et al. 2016). One stream of literature looking at the sub-clinical effects of air pollution examines the relationship between exposure and worker productivity, with several studies providing evidence of exposure’s negative impact on blue-collar workers and cognitively taxing jobs (He et al. 2019; Chang et al. 2019; Archsmith et al. 2018).

The effect of air pollution on cognitive abilities is quite relevant for modern societies. Late studies prove that air pollution lowers California reading and math exam scores (Zweig et al. 2009; Zou 2021), decreases performance in Chinese verbal tests (Zhang et al. 2018), and affects high-stake exam results in Brazil, Israel, Iran, England, and China (Ebenstein et al. 2016; Carneiro et al. 2021; Amanzadeh et al. 2020; Roth 2020; Zivin et al. 2020). Bedi et al. (2021) show that PM2.5 is especially relevant for fluid reasoning, and Powdthavee and Oswald (2020) calculate the influence of NO2 and PM10 on memory quality to be equivalent to ten years of aging when comparing the most to the least polluted areas of England. Results from randomized control trials substantiate the above evidence, wherein people score higher in cognitive function tests if randomly allocated to better air quality (Allen et al. 2016).

Air pollution can impair cognitive function by decreasing blood flow and cell oxygenation (Lu 2020; Aguilar-Gomez et al. 2022). Upon reaching the bloodstream, either through the lungs or directly from the air, contaminants interfere with the chemical composition of the central nervous system (CNS) via neuroinflammation and oxidative stress (Beurel and Jope 2014). Changes to the chemical composition of the CNS are especially relevant for the cerebral cortex as it plays a fundamental role in memory, insight, emotions, and consciousness (Bechara et al. 2000; Peters et al. 2006).

Exposure to contaminated air further results in sensory irritation, which can lead to claustrophobia or mild tension (Chang et al. 2016; Li et al. 2017). Polluted air may also provoke impatience, impede attention, make us more aggressive, and trigger tiredness (Anderson et al. 2002; Aguilar Gomez et al. 2022). Even a judge’s perception of air pollution can further increase anxiety as exposure triggers worries about personal health (Lu et al. 2018), which in turn could result in elevated immoral and self-interested violent and nonviolent behavior (Kouchaki and Desai 2015; Barlett and Anderson 2014). For instance, evidence from longitudinal studies shows that air pollution raises reports of psychological distress (Sass et al. 2017), depression (Szyszkowicz et al. 2009), suicide attempts (Szyszkowicz et al. 2010), and actual suicides (Yang et al. 2020).

Animal studies demonstrate that exposure can result in neurological impairments such as decreased novel object recognition, spatial learning, memory, and performance (Win-Shwe et al. 2008, 2014; Salvi et al. 2017). Animal trials also suggest that exposure can increase anxiety and depression by lowering bloodstream serotonin (Ehsanifar et al. 2019; Murphy et al. 2013), which is particularly relevant for inhibiting aggression and impulsive behavior in humans (Coccaro et al. 2011; Murphy et al. 2013). For instance, previous research has found a weak inverse link between serotonin, impulsive aggression, anger, and hostility (Frankle et al. 2005; Duke et al. 2013). Furthermore, Crockett et al. (2013) link lower serotonin levels with increased eagerness to punish adversaries and a lower probability of accepting fair deals, both key elements in judicial decision-making.

In previous studies on the effects of pollution on human decision-making, researchers provide evidence of financial investors decreasing optimism following exposure to contaminated air (Dong et al. 2021), higher instances of ambiguity aversion and impatience when making decisions (Chew et al. 2021), and changes to risk preferences (Levy and Yagil 2011; Bondy et al. 2020). This study adds to the growing literature on the cognitive consequences of exposure to air pollution by providing empirical evidence on the effects of exposure on high-stakes decision-making.

Concerning studies looking at the determinants of judicial decisions, the current literature shows that sentencing can change along the lines of religion (Shayo and Zussman 2011), race (Alesina and La Ferrara 2014; Arnold et al. 2018), and gender (Didwania 2018; Anwar et al. 2019). Regarding external factors, there is also evidence that they can be affected by news coverage (Lim et al. 2015), food break schedules (Danziger et al. 2011), and even the performance of local sports teams (Eren and Mocan 2018; Chen 2016).

In the context of environmental variables, while Heyes and Saberian (2019) relate higher temperatures to decreased favorable asylum decisions by US immigration judges,Footnote 5 Evans and Siminski (2021) find no evidence of temperature or PM2.5 affecting sentencing when examining 2.8 million criminal court cases in Australia. Nevertheless, Evans and Siminski (2021) point out that several factors may undermine the external validity of their findings, e.g., discrepancies in legal systems across countries, building standards, or climate control capabilities. For instance, their study’s average daily particle concentration is 5.18 μg/m3, 88.6/% lower than the average level in our sample.Footnote 6

Contrary to Evans and Siminski (2021), recent findings from Craigie et al. (2022) show that temperatures can affect Indian judicial processes, implying that the harmful consequences of rising temperatures or other environmental factors may be more prominent in low- and middle-income countries. In a similar spirit to this paper, Kahn and Li (2020) and Sarmiento (2022b) find that polluted air affects the productivity of judicial hearings by extending the length of Chinese and Mexican decision processes. Concerning studies solely focusing on air pollution effects on sentencing, Hou and Wang (2020) probed the universe of drug offense court decisions in five major Chinese cities between 2014 and 2015. The authors find that judges are unaffected by air pollution and temperature. However, they analyze sentence severity instead of convictions and measure air pollution with monitoring station data instead of remote sensing values.

Our study adds to the voluminous work on the biases of judicial choices by exploring the effects of one of the most relevant environmental externalities (air pollution) in one of the most polluted world regions (India). To the best of our knowledge, we are the first to look at the relationship between pollution and judges’ behavior in India and the first to find an effect of exposure on sentencing. Our contribution also includes the empirical analysis of air pollution effects in the context of limited data availability and low-quality pollution measures.

3 Background

3.1 Air Pollution in India

India has one of the worst ambient air qualities in the world (IQAir, 2021). While WHO guidelines set a maximum level of 5 μg/m3 of PM2.5 to remain within healthy boundaries, an analysis by the Financial Times estimated that some Indian cities could have surmounted that threshold by over ten times in 2018 (Bernard and Kazmin 2018; The World Health Organization 2023). Air pollution in India takes various shapes and forms, including PM pollution (PM2.5 and PM10), household indoor air pollution, sulfur dioxide (SO2), nitrogen dioxide (NO2), and ozone pollution (O3) (Central Pollution Control Board 2021). Contaminants come from multiple sources, including diesel exhaust fumes, coal-powered thermal power plants, festival fireworks, municipal waste, household combustion, construction, industrial emissions, and crop burning (Gurjar et al. 2016).

High levels of contaminated air in India have significant socio-economic and health impacts. Figures from the Air Quality Life Index (AQLI) suggest that around 500 million people in Northern India could increase average life expectancy by at least 8.5 years if the region would decrease pollution levels to WHO guidelines (Chen et al. 2013; Greenstone and Fan 2018). A study on the Global Burden of Disease in 2019 attributed 1.67 million deaths in India to air pollution (17.8% of aggregate mortality that year), with around 1 million traced to ambient PM pollution (Pandey et al. 2021). The high levels of PM pollution in India also increase respiratory and cardiovascular morbidity with relevant consequences for productivity and healthcare system utilization (Cohen et al. 2017; Balakrishnan and Tsaneva 2021).

Air pollution also has significant repercussions on education. For instance, it depresses reading and mathematics outcomes, lowers academic attendance, and causes teacher absenteeism (Balakrishnan and Tsaneva 2021). In 2021, excessive concentrations forced the Supreme Court of India to demand action from the government leading to the development of India’s National Clean Air Programme. Moreover, contaminated air in India brings about substantial direct economic losses. For example. Pandey et al. (2021) estimate that the costs of air pollution surmounted 37 billion USD in 2019 alone, i.e., around 1.4% of the country’s Gross Domestic Product (GDP). It is also worth noting that the burden of air pollution is unevenly distributed across the population, with lower-income groups more exposed to unsafe levels (Garg 2011). Contaminated air is also very present in rural areas, where satellite figures suggest it may even be higher than in urban agglomerations. However, it appears less apparent in governmental data because measurement stations are relatively scarce in rural vs. urban environments (Chatterjee 2019).

In 2019, India launched its first National Clean Air Programme (NCAP) financed by the Central Pollution Control Board (CPCB) to decrease PM2.5 and PM10 ambient air pollution in around one hundred cities by an estimated 20–30% by 2024 cf. its baseline level (CPCB 2021). The NCAP informs the government on air quality status, trends, and regulation performance. The government further strengthened its monitoring efforts by introducing the National Air Quality Index, which combines measures of eight pollutants into a single figure for public awareness of real-time air quality status (Board 2021). Aside from monitoring, government response measures involve a shift towards using compressed natural gas instead of traditional fuels and introducing the Bharat Stage VI (BS-VI) Emission Standards for vehicles and fuel beginning in April 2020 (International Energy Agency 2023). Decentralized solutions have been varied. Delhi’s Odd-Even Rule, launched in November 2017, determines the eligibility of car owners to drive on a given day based on the end digits of their license plates. Some local governments also tend to set higher vehicle emissions standards than nationwide ones, levy penalties for crop burning, and conduct strict oversight of road dust (Gurjar et al. 2016). An example is The Graded Response Action Plan (GRAP), introduced across the Delhi-NCR area, which imposes strict vehicle, construction activity, and industrial emissions controls during extreme pollution events (Board, 2023b).

3.2 The Indian Judiciary

India is a common law nation with legal principles and rules established through courts and parliamentary decisions (Central Intelligence Agency 2023). Courts use common law to interpret and apply the provisions of the Constitution and other statutes.Footnote 7 The judicial system includes the Supreme Court, High Courts, and Subordinate Courts. The Supreme Court and High Courts are the primary appeal institutions in the country (The Constitution of India, Art 124 (1) 1950). The Subordinate or District Courts are subordinate to the state High Court and comprise the lower judiciary (The Times of India 2023). They include Civil Courts, Criminal or Session Courts, People’s Courts, and Nyaya Panchayats (E-Justice India 2023).Footnote 8 The number of Subordinate Courts per district depends on the number of cases and population; one district can have more than one court, and one court can attend more than one district.

In 2001, the Supreme Court of India started the e-courts system to modernize the Indian judiciary and improve the efficiency of the courts (Nalanda District Court 2023). The system is a digital platform that offers litigants, lawyers, and other stakeholders court-related services. The first phase of the e-courts project started in 2005 and involved the automation of the country’s Supreme and High Courts. The second phase (launched in 2007) involved automating the Subordinate Courts in all states and union territories (E-Committee of the Supreme Court of India 2021a, b).

The e-courts system aims to increase transparency, accountability, and efficiency in the Indian Judiciary by introducing several innovative technologies like video conferencing, digital evidence management, online filing, legal information management, and case lists (E-Committee of the Supreme Court of India 2021a, b). The system functions in 3256 court complexes, and as of 2021, it managed more than 1360 million civil, criminal, and revenue cases (E-Committee of the Supreme Court of India 2021a, b). Essential for this study, the e-courts system provides data on cases’ characteristics necessary to identify the relationship between exposure to air pollution and sentencing.

Judges in India have significantly more power than judges in other common law nations like the United States. While the legal system in the United States is adversarial,Footnote 9 India follows a more inquisitorial system, where the judges play a more active role (Wilkins et al. 2017). They do not just passively receive information but actively direct and control the proceedings, with the authority to research, investigate, and question witnesses. Indian judges are not elected, but appointed by other judges, which insulates them from direct political pressures but can raise questions about nepotism and favoritism.

Concerning the juries and the court staff, different from the US, India abolished jury trials in 1960 following the famous K.M. Nanavati vs. State of Maharashtra case (Banerjee 2022), citing concerns about the potential for emotions and public opinion to influence a verdict unduly. Consequently, Indian judges are responsible for determining both the facts and the law, which can result in a heavier workload and more prolonged proceedings than their American counterparts (More et al. 2021).

The influence of court staff also varies significantly between the two systems. In the U.S., court staff such as law clerks, court reporters, and administrative staff play substantial roles in assisting the judge and ensuring the smooth operation of court proceedings. Law clerks often influence the judicial process by conducting legal research, preparing memos, and drafting preliminary opinions, thereby indirectly shaping a judge's decision-making process. In contrast, while the administrative staff in Indian courts handle logistical details, they do not play as significant a role in judicial decision-making (Wilkins et al. 2017). While the lack of professional legal support for Indian judges in the form of law clerks or legal assistants is often highlighted as a deficiency in the system, it is ideal to explore the effects of air pollution on the judges' decision-making.

4 Theoretical Background

Analyzing if external factors affect the judicial process is a long-discussed legal issue with two leading schools of thought; legal formalism and legal realism (Tampubolon et al. 2023). Legal formalism assumes that judges make decisions by systematically applying facts and arguments within the legal framework (Aiken et al. 2016). Legal realism contends that, although the law and previous rules are relevant, other factors like lunch breaks, religion, political views, race, or the environment can influence judicial rulings (Bator et al. 2020). Most realists contend that these external factors are more important than the law by arguing that judges use heuristics and biases to make decisions and only use the law to support their rulings. For instance, eighteenth-century US Supreme Court Judge Oliver Holmes defended that, besides legal reasoning, social and psychological factors are relevant when applying the law (Aletras et al. 2016).

Figure 1 shows the main difference between legal formalism and legal realism. While in legal formalism, rulings depend on facts, arguments, the law, and precedence, in legal realism, they depend on heuristics, biases, and external factors. In legal realism, the judges only use the law and information of preceding cases to justify their ruling.

Fig. 1
figure 1

Legal formalism and legal realism

The discussion between both groups affects the basic principles of constitutional democracy. In principle, the constitution and national laws safeguard impartial rulings and leave no room for biases and external factors to transpire into sentencing (Tampubolon et al. 2023). If legal realism is correct and external factors are the main determinants of judicial decisions, it could lead to a generalized lack of trust in the system’s impartiality with relevant consequences for the rule of law in modern democracies.

Although previous research has found consistent evidence of external factors like lunch breaks (Danziger et al. 2011), religion (Shayo and Zussman 2011), and gender (Anwar et al. 2019) affecting sentencing decisions, no in-lab experiments or randomized control trials have yet unquestionably proven their existence (Tampubolon et al. 2023). Most studies rely on observational data or natural experiments due to the ethical challenges of randomizing external factors like air pollution across judges or the reluctance of judicial authorities to test the impartiality of the system.

Our hypothesis that air pollution affects sentencing aligns with legal realism. We support this hypothesis by providing evidence on the effects of air pollution on human behavior with relevant consequences for cognitive capacity (Aguilar-Gomez et al. 2022), aggression (Frankle et al. 2005), and risk preferences (Levy and Yagil 2011) (see Sect. 3.2). Figure 2 presents the decision-making process for penal cases under legal realism. In it, a plethora of external factors like race, anchoring, and temperature affect the probability of convictions. Of these, we concentrate on the effects of air pollution. I.e., we test if air pollution affects convictions through its impact on human behavior and cognition.

Fig. 2
figure 2

Legal decision-making according to legal realism

From a mathematical perspective, while formalists believe that the probability of sentencing is only a function of the felony and legal characteristics of a case, realists would argue that external factors like weather, air pollution, lunch breaks, and judge characteristics are also relevant. Equation 1 presents the probability of conviction [P(Convicted = 1)] for individual i at time t as a function of a matrix of n case characteristics (Cit) and k external factors like biases, weather, air pollution, or news coverage (Xit).

$$\begin{gathered} f(C_{it} ,X_{it} ) = P(Convicted = 1)_{it:} \;{\mathbf{Formalism}}:\partial P(Convicted = 1)_{it} :0\;{\text{for}}\;{\text{all}}\;k \hfill \\ {\mathbf{Realism}}:\partial P(Convicted = 1)_{it} \ne 0\;{\text{for}}\;{\text{all}}\;k \hfill \\ \end{gathered}$$
(1)

Formalists contend that the partial derivative of the k variables in Xit is equal to zero and that the only relevant sentencing factors are the case characteristics and preceding rulings. However, as mentioned in Sect. 2, several studies have provided evidence of external factors like religion, gender, news coverage, and temperature affecting the conviction probability. To relate Eq. 1 to our short-form specification, we aggregate all convictions in subdistrict i happening in month t such that the total number of convictions for an entire subdistrict s and period t is a Poisson distributed count variable of the form:

$$Convictions_{st} = \left[ {f\left( {C_{st} ,X_{st} } \right) = \sum\limits_{i} {\left( {Conviction = 1} \right)_{st} } } \right]$$
$$\forall \lambda = E(Convictions_{st} ) = Var\left( {Convictions_{st} } \right)$$

This Poisson count variable is a function of case characteristics (Cit) and external factors (Xit), including judge biases, heuristics, weather, temperature, and lunch breaks. As such, we can estimate the effect of air pollution on the number of monthly convictions for subdistrict s as a function of case characteristics, external factors, and air pollution (Pst) according to

$$\exp \left[ {f\left( {C_{st} + X_{st} + P_{st} } \right)} \right] = Convictions_{st}$$
(2)

As long as we provide evidence that \(\frac{\partial Convictions}{\partial {P}^{st}}\ne 0\), we can confirm that there is a relationship between air pollution and sentencing. In the following section, we present the empirical methodology we use to identify \(\frac{\partial Convictions}{\partial {P}^{st}}\) with fixed effects Poisson estimators and instrumental variable techniques.

5 Data

5.1 Judicial Data

Hearings data comes from the e-courts platform of the Indian Judiciary.Footnote 10 The data set has approximately twenty million criminal records across more than seven thousand Subordinate Courts between 2010 and 2018. The raw data contains the filing, registration, hearing, and decision date of all criminal processes; the name of the petitioner and the respondent; the act and section that identifies the felony; the position of the judge; and the final ruling.Footnote 11 We restrict the data to criminal cases filed under the Indian Penal Code or the Code of Criminal Proceedings to distinguish between convicted and non-convicted individuals. We focus on criminal instead of civil cases to avoid ambiguity. For instance, it is hard to classify the outcome of most civil or commercial cases with a dichotomous decision rule. Likewise, if there is an agreement between parties, it is purely subjective whether or not it was a positive or negative outcome.

In line with Ash et al. (2021), we define convictions as all those cases when the defendant is either convicted, pleads guilty, or ends up in prison. We could only classify one-third of court decisions as convicted or acquitted due to limitations in the raw data. However, we assume missing classifications as measurement errors unrelated to monthly variations in air pollution (a claim we substantiate in the empirical section with instrumental variable designs). We match the judicial rulings with air pollution at the subdistrict level by aggregating the count of cases to the relevant geographical resolution. This study concentrates on variation at the GADM-3 (subdistrict) level. The final judicial file is a monthly panel with the number of cases, convictions, acquittals, and decisions for each Indian subdistrict between 2010 and 2018.Footnote 12

Table 1 compiles key summary statistics for the monthly-aggregated judicial data. The data set contains 128,755 monthly sub-district observations. On average, there are one hundred and twenty-six monthly cases per subdistrict, seven convictions, thirty-five acquittals, and eighty-four processes where we cannot correctly classify the trial’s outcome.

Table 1 Descriptive statistics on judicial hearings

Figure 3 shows the distribution of cases and convictions. In line with the count nature of the data, both distributions violate the standard normality assumption necessary for inference with traditional OLS estimators. Consequently, we estimate the effect with Poisson pseudo-maximum likelihood estimator (PPMLE) panel models, for they are consistent under heteroskedasticity, large shares of zero values, and overdispersion (see Wooldridge 2010). In Sect. 6, we present the implementation of the PPMLE within our framework.

Fig. 3
figure 3

Number of cases and convictions (histogram)

Figure 4 presents time series for the monthly average number of cases and convictions. In line with the growing digitalization of the judicial system, the number of reports increases during our sample period. We control for this trend by estimating the within-district variation in cases conditional on the year and month of observation.

Fig. 4
figure 4

Number of cases and convictions—twelve-month moving average time series

A particular worry is the quality of the underlying legal data. Entry errors and omitted observations from small or remote districts are possible and can bias our results (Rao 2019). Moreover, the basic information on the e-courts platform only allows for identifying the hearing outcome for one-third of the cases. Still, entry errors and missed recordings are not an issue as long as they are unrelated to air pollution (Hausman 2001). In the empirical section, we deal with this potential measurement-error bias by instrumenting for air pollution with thermal inversions.

5.2 Air pollution Data

Acquiring reliable pollution values poses an additional challenge. India’s Central Pollution Control Board (CPCB) only provides continuous monitoring station-level data as of 2016 for a small subset of urban districts. Moreover, the CPCB claims that because of inconsistencies in the measurement and data curation process, air pollution measures are only indicative and subject to biases (CPCB 2021). As CPCB only measures air pollution on a subset of urban districts, covers less than half of our sample period, and is prone to bias, we use representative PM2.5 estimates from state-of-the-art satellite measurements (van Donkelaar et al. 2021).Footnote 13

Satellite data is advantageous in contexts like India, with limited data availability, where it can help mitigate the risks of strategic behavior or capacity constraints by environmental authorities (Zou 2021). Furthermore, previous research has called into question the representatives of papers relying on measuring stations as they often restrict their study’s sample because of a lack of spatially resolved data (Kloog et al. 2013; Manisalidis et al. 2020).

Satellite measures come from monthly PM2.5 estimates constructed by van Donkelaar et al. (2021) using aerosol optical depth (AOD) values from NASA MODIS, MISR, and SeaWiFS instruments. The authors combine these data sources alongside the GEOS-Chem chemical transport model and Geographically Weighted Regression (GWR) to create a global 0.01 degrees grid of PM2.5 measures. Figure 5 shows the average ACAG PM2.5 value and population density across Indian subdistricts.

Fig. 5
figure 5

Spatial distribution of PM2.5 and population

Air pollution is higher in the Indo-Gangetic plain (north of the country) and large Urban areas like Mumbai, Calcutta, and Ahmedabad. The Indo-Gangetic plain is a highly fertile area between the Indus, Ganges, and Brahmaputra rivers. It is one of the more densely populated areas on the planet, with close to seven hundred million persons inhabiting less than one-eighth of the area of the continental United States. According to late estimates, decreasing the average level of PM2.5 in the Indo-Gangetic Plain to WHO guidelines could increase average regional life expectancy for up to seven years (Greenstone et al. 2022).

5.3 Weather Data

Weather can affect individual behavior and the concentration of air pollution (Deschênes and Greenstone 2011; Graff Zivin and Neidell 2013; Ranson 2014; Blakeslee and Fishman 2018). Failing to account for its effect on judicial decisions and PM2.5 could induce omitted variable bias or increase the uncertainty of our point estimates. We obtain weather controls from the ERA5-Land reanalysis data set (AgERA5) curated by the European Centre for Medium-Term Weather Forecasting (ECMWF) and measured from the Copernicus satellite (Hersbach et al. 2020).

AgERA5 provides high-resolution daily imagery (10x10 Km) of temperature, precipitation flux, wind speed, and wind direction.Footnote 14 Figure 6 shows the spatial distribution of average temperature and precipitation over India. Temperatures are higher in the Northwests (Rajasthan and Gujarat) and Southeast (Andhra Pradesh and Tamil Nadu) of the country and lower in the Western Gahti and Himalaya Mountains.Footnote 15 For precipitation, rain is low in the Thar Desert to the Northwest and significantly higher in the Northeastern region and Western coast.

Fig. 6
figure 6

Spatial distribution of weather controls

Table 2 presents summary statistics for the PM2.5 measures and weather controls. The average and maximum monthly PM2.5 concentration is 45.55 μg/m3 (SD 26.65) and 308 μg/m3. This maximum level is equivalent to an average air quality index higher than 300 units, according to the AQI of the US Environmental Protection Agency (EPA). For the weather controls, there is an average temperature of 25.4 degrees Celsius with a standard deviation of 4.87 degrees and an average precipitation flux of 3.28 mm3/mm2.

Table 2 Descriptive statistics for fine particulate matter and weather controls at the subdistrict level

In line with previous studies using thermal inversions to instrument for air pollution, we estimate the weighted number of thermal inversions within a month by computing the weighted sum of daily temperature differences (at 1:30 am) between 925 and 1000 hPa (Klauber et al. 2020; Jans et al. 2018; Sager 2019). The left panel of Fig. 7 shows the number of months with temperature inversions between 2010 and 2018 across India. Thermal inversions are common events in the states of Gujarat and Surat, the Indo-Gangetic plain, and the east coast of the country. The right panel of Fig. 7 shows the relationship between thermal inversions and average PM2.5. Following previous studies showing an increasing relationship between thermal inversions and air pollution, stronger inversions lead to higher PM2.5 values.

Fig. 7
figure 7

Thermal Inversions in India and their relationship with PM2.5

6 Research Methodology

6.1 Fixed Effects Model

A wide range of possible (observable and unobservable) confounding factors can affect the estimates of air pollution on convictions. For instance, traffic contributes to air pollution and is a possible determinant of individuals’ attitudes and behavior (Fenger 1999), for judges stuck in traffic during their work commute can be more aggressive and stressed before work (Stokols et al. 1978).Footnote 16Other potential sources of omitted variable bias (OVB) relate to cross-sectional differences across subdistricts, discrepancies in the legal capacity of different courthouses (Ash et al. 2021), or public policies affecting air pollution and convictions as industrial emissions regulation, gasoline taxes, or road-space rationing mechanisms.

While reverse causality is an unlikely problem since convictions should not change air pollution, there may be issues related to measurement error (ME). For instance, we do not know the actual exposure of judges to air pollution. Instead, we can only measure the average concentration in their subdistrict. Still, as is typical with studies on air pollution, we rely on average concentration to approximate real exposure values (Aguilar-Gomez et al. 2022). Another issue relates to measuring errors from administrative and judicial workers. However, as long as these measurement errors are orthogonal to air pollution at the time of the hearing, they should not affect our point estimates.

A skeptic may argue that there is no reason why outdoor air pollution would affect indoor activities. However, there is evidence that contaminants can penetrate indoor settings even in climate control facilities (Thatcher and Layton 1995; Vette et al. 2001; Scheepers et al. 2017). For instance, PM2.5 outdoor-indoor ratio can be as high as 70% to 100% (Thatcher and Layton 1995; Vette et al. 2001), with some studies even suggesting that most exposure to ambient PM2.5 may occur within indoor environments (Martins and Da Graca 2018; Krebs et al. 2021).

Another thread to the empirical strategy is if less-skilled judges come to work on more polluted days and have a different probability of convicting individuals. In the same way, if the Indian judicial system scheduled less-serious crimes for months with less pollution, we would again run into spurious correlations. Case sorting is unlikely to affect our estimates as judges (or other parties) have no inference on the assignment of cases (Ash et al. 2021). The Indian judiciary assigns cases to judges through a centrally determined set of rules that leave no space for self-selection (Ash et al. 2021). Scheduling cases with vast temporal anticipation also ensure independence between the number of convictions and the impacts of air pollution on criminality (Burkhardt et al. 2019). Finally, delayed case timing further guarantees that one cannot target a specific judge and expect to know the level of air pollution during the hearing (Ash et al. 2021).

People consider air quality an amenity relevant to housing decisions (Chay and Greenstone 2005).Better judges may self-select to work in cleaner regions. Subdistricts with more economic activity (and air pollution) may also attract more skilled legal workers. Two factors reduce residential sorting concerns. First, the judicial system forces judges to stay between two and three years in each courtroom (Ash et al. 2021). Second, even if they move, they are unlikely to get their location of preference (Rao 2019). Lastly, short-term avoidance behavior like closing windows or wearing masks can also confound our estimates; however, there is little scope for adaptation as individuals cannot entirely escape from PM2.5 due to its ability to penetrate indoors (Air Quality Life Index 2022).Footnote 17

The baseline specification uses high-dimensional fixed-effects Poisson pseudo-maximum likelihood estimator panel models (from now on PPMLE) to estimate the effects of variations in PM2.5 on the number of subdistrict judicial convictions (Hausman et al. 1984; Wooldridge 1999). We use PPMLEs because the count nature of the dependent variable violates the Ordinary Least Squares (OLS) assumptions of homoskedasticity and normally distributed errors. Panel data methods for count data are attractive in terms of statistical properties when the cross-sectional dimension (subdistricts) is much larger than the time dimension (month-years) (Wooldridge 1999). Moreover, even if the total number of convictions does not perfectly follow a Poisson distribution because of overdispersion, estimating such a model via quasi-MLE yields unbiased, consistent, and asymptotically normal coefficients (Wooldridge 1999; Azoulay et al. 2010; Burkhardt et al. 2019).

Our central assumption is that conditional on various fixed effects and weather covariates, PM2.5 concentrations are exogenous to court rulings. This conditional fixed-effects quasi-maximum likelihood estimator nets out (in an additive fashion) unobserved heterogeneity across courthouses, months, and years (Lin and Wooldridge 2019). As such, it decreases identification concerns regarding the effect of cross-sectional and seasonal unobservables on our point estimates.

The preferred baseline specification takes the following form:

$$C_{st} = \exp \left[ {\beta PM2.5_{st} + \Phi W_{st} + \lambda_{s} + \Omega_{t} } \right] + \epsilon_{st}$$
(3)

In it, Cst is the number of convictions at subdistrict s at time t; PM2.5st the average PM2.5 concentration; and β the coefficient of interest capturing the impact of a unit (1 μg/m3) increase in PM2.5 on the log difference of convictions. We augment the specification with subdistrict (λs) and temporal (Ωt) fixed effects to capture observed and unobserved subdistrict and temporal heterogeneity. These fixed effects allow the model to account for seasonality, time trends, and cross-sectional differences across subdistricts. As discussed by (Burkhardt et al. 2019), month fixed effects isolate confounders like allergens, influenza, or other seasonal conditions. Wst is a matrix of weather controls we use to improve the precision of the econometric design and avoid biased estimates arising from the influence of weather on air pollution and cognitive performance (Ranson 2014; Heyes and Saberian 2019). For example, rain affects the presence of PM2.5 in the air, wind displaces it, and temperature defines human behavior and the efficiency of internal combustion engines (Graff Zivin and Neidell, 2013). For the preferred specification, we remain agnostic about the potential effect of weather on convictions by nonparametrically controlling for temperature and precipitation with decile bin indicators of average levels.

The estimated coefficients come from within-subdistrict changes in convictions conditional on seasonality (month fixed effects), the year of observation (year fixed effects), and weather controls.Footnote 18 Standard errors, \(\epsilon\)st, are two-way clustered at the subdistrict-year level to address within subdistrict correlation in the error term and autocorrelation over time.Footnote 19

Notably, the inclusion of subdistrict-fixed effects decreases concerns about policy changes at the national level affecting our point estimates as we are estimating the effect from within-district variation. I.e., any change at the national level would not affect the estimates on the relationship between exposure and convictions as they are captured by the year, month, or year-by-month fixed effects. Regarding state or district policies, we further decrease the possibility of policy-related OVB by including a more restrictive specification with year-by-district and month-by-district fixed effects that accounts for all policy changes within a year in the same district. Despite the fact that most policy interventions in India occur at the national, state, or district levels, local policies can still affect our coefficients if they are correlated (or determinants) with (of) pollution and convictions. The Fixed Effects strategy does not allow us to control for this source of omitted variable bias effectively. However, we account for it with the control-function instrumental-variable approach we present in Sect. 6.2.

We explore the robustness of the baseline specification using a selection of possible fixed effects, weather controls, and clustering specifications. For instance, besides the baseline model with year and month fixed effects, we estimate more flexible specifications with year-by-month, year-by-district, and month-by-district fixed effects. These new specifications allow us to capture shocks common to all subdistricts in a district (e.g., state and district legal or environmental policies). Further robustness exercises include looking at non-linearities with nonparametric specifications of PM2.5 and examining if air pollution affects the total number of cases and not only convictions. We also present a model including the first and second pre-treatment PM2.5 lag as done by Ebenstein et al. (2016) and Burkhardt et al. (2019) to dispel worries regarding the econometric specification.

6.2 Control Function (Instrumental Variable) Approach

Even though our baseline specification can provide credible estimates on the effects of PM2.5 on judicial hearings, there is still the possibility of measuring error (ME) and omitted variable bias (OVB) affecting our results. To reduce concerns regarding ME and OVB, we rely on a control function (instrumental variable) approach that is easier to implement in the presence of nonlinear Poisson models (Lin and Wooldridge 2019; Burkhardt et al. 2019; Klauber et al. 2020). The first stage of the control function specification is an OLS estimation of the endogenous variable. In the second stage, we use the previously discussed PPMLE with the fitted values of the first stage as the outcome variable (Lin and Wooldridge 2019). Although this control function approach differs from a traditional Two-Stage Least Squares (2SLS) in that the second stage is nonlinear and estimated via pseudo-MLE instead of OLS, the intuition remains similar.

We use strength-weighted atmospheric thermal inversions to instrument for PM2.5. Thermal inversions shift the expected behavior of temperature in the troposphere. Under normal conditions, the air cools with altitude; however, during an inversion, warm air rises and acts as a lid over colder layers. This lid effect traps air pollution by avoiding its dispersion into the upper atmosphere, increasing the concentration of air pollutants, and acting as a natural experiment that creates exogenous spatio-temporal variation in exposure (Sager 2019). Similar to previous studies (see Sager 2019; Klauber et al. 2020), we aggregate daily thermal inversions to monthly frequencies by summing the number of daily inversions in a given subdistrict weighted by their intensity, i.e., the continuous difference between the temperature at 925 and 1000 hPa. While constructing the instrument, we also allow variation in its effects across states by interacting the intensity of inversions with state indicator variables.

Even though first-stage regressions confirm that thermal inversions increase PM2.5 values, there is the possibility that inversions correlate with \(\epsilon\) through weather conditions affecting both convictions and pollution (Ranson 2014; Heyes et al. 2016). We explicitly control for this by including a set of different specifications of weather covariates in the regression analysis and robustness exercises. There is also the possibility of topography playing a role in shaping inversions (Sager 2019). However, it is arguably fixed over time and captured by the subdistrict fixed effects. Concerning other possible associations between inversions and judges’ behavior, there is no evidence that they could directly influence health, well-being, or cognitive performance (Sager 2019; Klauber et al. 2020).

The primary IV assumption is that, after netting out the fixed effects and conditioning on meteorological conditions, thermal inversions can only affect the number of convictions through their influence on air pollution. However, as Klauber et al. (2020) pointed out, weather conditions behind thermal inversions can lead to more individuals using cars or staying at home. For this reason (and following previous studies), we use thermal inversions at 1:30 am local time (Jans et al. 2018; Sager 2019), which also helps circumvent the fact that daytime inversions may be visible and change human behavior (Sager 2019).

We interpret the point estimate as a local average treatment effect (LATE) on the population of compliers, i.e., subdistricts with variation in thermal inversions and a monotonic relationship between inversions and PM2.5. Equation 4 presents the first stage of the IV strategy. In it, invst is a vector of quintile bins of state-specific (strength-weighted) thermal inversions. Although we use the same control variables as the previous specifications, we estimate Eq. 4 with OLS.

$$\hat{PM2.5_{st}} = \delta Inv_{st} \times State_{s} + \Phi W_{st} + \lambda_{s} + \Omega_{t} + \eta_{st}$$
(4)

In the second stage (Eq. 5), we use the fitted values (PM2.5̂ st) from the above regression as a proxy for actual PM2.5. Besides using the fitted PM2.5 instead of actual measures, we also rely on nonparametric bootstrapped standard errors to account for the fact that we use fitted values instead of actual PM2.5 in the estimation (Lin and Wooldridge 2019).

$$C_{st} = \exp \left[ {\beta \hat{PM2.5_{st}} + \Phi W_{st} + \lambda_{s} + \Omega_{t} } \right] + \epsilon_{st}$$
(5)

The control function approach allows us to reduce concerns regarding the influence of omitted variable bias in our point estimates. Specifically, we account for the impact of traffic, economic, and policy confounders affecting air pollution and convictions by identifying our coefficients from the exogenous variation in air pollution due to thermal inversions. The core assumption is that, as long as thermal inversions remain orthogonal to these sources of bias, the estimates from the control function approach, though local, would remain unbiased. For instance, there is no apparent reason to think that drivers (or policymakers) would change their driving patterns (or policies) because of the temperature difference between 925 and 1000 hPa at 1:30 am.

7 Results

7.1 Fixed Effects Model

Table 3 presents the results of the PPMLE panel model across four specifications: (1) only includes subdistrict fixed effects; (2) accounts for seasonality through year and month fixed effects; (3) includes weather controls in discrete temperature and precipitation bins; and (4) examines a more flexible specification with year-by-month fixed effects. To simplify the interpretation of coefficients, we transform the value of β to [ex(β)−1]×1000 and interpret it as the percentage increase in the number of convictions because of a ten units increase in PM2.5.

Table 3 Effects of PM25 on judicial convictions in India

While all models display a positive and statistically significant coefficient, they range from 1.31 to 2.93%. The effect size decreases between the first and second columns after we include year and month fixed effects, suggesting the presence of time-varying unobservables affecting our results. After we include weather controls in column (3), the coefficient decreases slightly and remains statistically significant. Including year-by-month fixed effects increases the coefficient to a 1.64% increase in convictions after a ten units rise in PM2.5; this is our preferred specification as it flexibly controls for seasonality, cross-sectional differences across districts, and the effect of weather on air pollution and convictions.

In Tables 11, 12, and 13, we present robustness exercises across different specifications of weather controls (A.11), fixed effects (A.12), and clustering (A.13). Point estimates remain positive and statistically significant at the five percent level across all alternative specifications, increasing the robustness of our results and decreasing concerns of unobservable confounders driving our coefficients.

These results imply a positive association between the monthly number of convicted individuals and the average PM2.5 concentration in a given subdistrict. Although, to the best of our knowledge, these are the first estimates suggesting a negative effect of exposure to air pollution on sentencing, results can remain correlational if we fail to account for relevant unobservables like the incidence of traffic and noise. Moreover, point estimates may also be potentially affected by measurement error (ME) from aggregating and the use of remote sensing data. In the next section, we soothe worries related to OVB and ME with the control function (IV) approach (Lin and Wooldridge 2019).

7.2 Control Function Approach

Figures 11 and 12 of the appendix present the first stage point estimates on the effect of thermal inversions on PM2.5. Coefficients confirm that thermal inversions significantly increase PM2.5. Moreover, even though we cannot explicitly test for monotonicity, we find no evidence of inversions decreasing air pollution. As the instrument relates to weather conditions, there is still the possibility of confounding structural differences on days with and without thermal inversions (Sager 2019). Nevertheless, these differences are unlikely to bias our estimates after flexibly accounting for weather-related variables and high-dimensional fixed effects (Sager 2019).

Table 4 presents the results of our instrumental variable approach across four specifications; (1) only includes subdistrict fixed effects; (2) adds year and month fixed effects; (3) includes weather controls in discrete temperature and precipitation bins; and (4) presents a more flexible specification with year-by-month fixed effects. We interpret point estimates as the local average treatment effect (LATE) of PM2.5 on convictions for the sample of subdistricts where thermal inversions affect the concentrations of PM2.5.Footnote 20

Table 4 Effects of PM25 on judicial convictions in India (IV-PPML)

In line with positive and significant first-stage coefficients, F-statistics exceed the rule-of-thumb value for weak instrument detection suggested by Staiger and Stock (1994), implying that thermal inversions are a relevant determinant of monthly PM2.5. The coefficient of the preferred specification implies that a ten units increase in monthly PM2.5 raises the number of monthly convictions by a relevant 7.41%.

The coefficients of the control function approach represent the LATE for the set of states affected by thermal inversions. This distinction between the base and IV specifications is relevant in the presence of nonlinear effects, as the control function approach would put more weight on the upper side of the PM2.5 distribution. In this regard, while most epidemiological literature considers a linear link between air pollution and health outcomes (e.g., Medina-Ramon et al. 2006; Zanobetti and Schwartz 2006), some studies suggest the existence of nonlinearities for cardiovascular mortality (Smith and Peel 2010), respiratory morbidity (Dimeo et al. 1981; Shen et al. 2017), birth weight (Winckelmans et al. 2015), infant mortality (Chay and Greenstone 2003), outpatient visits (Lin et al. 2013), and pneumonia (Yang et al. 2022). For non-health outcomes, there is evidence of nonlinearities between air pollution and labor supply (Aragon et al. 2017), productivity (Chang et al. 2016, Chen and Zhang (2021), speech quality (Heyes et al. 2019), athletes’ performance (Guo and Fu 2019; Lichter et al. 2017), demand for health insurance (Chang et al. 2018), crime (Sarmiento 2022a), and cognitive function (Allen et al. 2016; Bedi et al. 2021). For instance, most effects of PM2.5 on cognitive performance happen above 100 AQI (Ebenstein et al. 2016).

In our context, effects may be nonlinear if a concave relationship exists between air pollution and judicial convictions. This nonlinear effect can happen if, for example, decision fatigue kicks in after a certain threshold. Another possibility is if there is an increasing marginal effect of PM2.5 on the number of convictions. For instance, if the impact of exposure on the propensity to convict increases with exposure. Table 5 explores the existence of nonlinearities in the relationship between PM2.5 and monthly convictions by dividing PM2.5 into exposure quintiles and estimating the effect of exposure concerning the lowest quintile.Footnote 21

Table 5 Nonlinear effects of PM25 on judicial convictions in India

In line with nonlinearities, results show that point estimates grow with exposure. While a month in the second quintile increases convictions by 8.17%, a month in the highest quintile does it by a significant 16.20%. We should consider these nonlinearities when comparing the point estimates of the fixed effects and IV coefficients. While the base specification may suffer from OVB and ME, the IV is a local average treatment effect that only applies to the population of complier districts. Nonetheless, finding qualitatively similar results for both strategies increases the credibility of the econometric strategy.

7.3 Robustness

Given the relevance of weather conditions for human behavior and air pollution, Table 6 presents results across five different specifications of weather covariates. (1) contains no weather controls; (2) accounts for temperature and precipitation linearly; (3) includes a second-order polynomial of atmospheric temperature to consider nonlinearities in its relationship with air pollution; (4) adds wind speed as an additional control; and (5) contain the estimates from the preferred specification with decile indicator variables of average temperature and precipitation. Across specifications, estimates on the effect of PM2.5 on sentencing remain robust and statistically significant at the one percent level, suggesting that our results are not con-founded by weather covariates.

Table 6 Effects of PM25 on monthly judicial convictions in India (weather—robustness)

Next, Table 7 presents point estimates for five subdistrict and time-fixed effects specifications. Across all specifications, we control for the weather with decile indicator variables of average rain and precipitation. (1) contains no subdistrict nor time fixed effects; (2) adds subdistrict fixed effects to account for cross-sectional differences in convictions between areas with high and low air pollution; (3) adds year and month fixed effects; (4) is our preferred specification accounting for seasonality with year-by-month fixed effects; and (5) interacts the time-fixed effects with district indicator variables, i.e., year-by-district and year-by-month fixed effects. This last specification allows us to effectively account for district-level seasonality and policy changes occurring for all subdistricts within a district (or state) in the same year. Point estimates remain positive and significant across all specifications, but the first one without controlling for cross-sectional differences.Footnote 22 Finding negative coefficients for the first specification aligns with the fact that counties with higher air pollution also have lengthier judicial processes because of cases backlog, lack of personnel, and overworking, highlighting the relevance of properly accounting for cross-sectional differences across sub-districts in the econometric design. In the second specification, although the point estimates align qualitatively with our results, they are almost 300% larger when failing to control for time-related unobservables with fixed effects, further highlighting the importance of controlling for unobservables in panel settings.

Table 7 Effects of PM25 on monthly judicial convictions in India (fixed effects—robustness)

Table 8 presents standard errors for four different cluster specifications of the error term. (1) is the preferred specification with two-way clustered standard errors at the subdistrict-year level to address within subdistrict correlation in the error term and autocorrelation over time; (2) assumes that the error correlates within districts by clustering at the district-year level; (3) only allows for one-way clustering at the subdistrict level; and (4) estimates standard errors by assuming that errors only correlate within all subdistricts in a district. Point estimates remain statistically significant across specifications, decreasing concerns that our statistical significance happens because of not adjusting for the correct correlation of unobservables across units.

Table 8 Effects of PM25 on monthly judicial convictions in India (clustering—robustness)

We complement the control function approach by using state-specific wind direction indicator variables as an additional instrument for PM2.5. By allowing the wind effect to change between states, our approach is similar to previous studies in economics using wind direction to instrument for air pollution like Bondy et al. (2020) and Deryugina et al. (2019a). The identifying assumption is that conditional on covariates, monthly wind direction affects convictions only through PM2.5. Table 9 shows the results of using wind direction instead of thermal inversions as an instrument. Reassuringly, point estimates remain positive, significant, and similar in size to the baseline fixed effects model. Finding similar results with two different instruments strengthens our causal claim as we are using a different source of variation to identify the local average treatment effect of PM2.5 on judicial decisions.

Table 9 Effects of PM25 on judicial convictions in India (IV-PPML)

Current literature suggests that exposure to air pollution increases criminal behavior (Herrnstadt et al. 2016; Burkhardt et al. 2019; Bondy et al. 2020; Sarmiento 2022a). If elevated PM2.5 increases the number of convictions only through more hearings, then the number of convictions would be higher by construction. Although high criminality is unlikely to drive our results because of the long gaps between crimes and hearings (typically weeks or months),Footnote 23 we conduct a formal check by re-estimating the baseline specification with the number of cases and not convictions as the outcome variable. Results displayed in Table 10 substantiate this theoretical and intuitive argument by showing a null effect of air pollution on the number of cases.

Table 10 Effects of PM25 on monthly judicial cases in India

Besides directly testing for the null effect of air pollution on the number of cases, we included two additional robustness exercises in Appendix Table 14 to probe that our result is not an artifact of the number of cases. First, we estimate the effect on the rate of convictions instead of the count of cases with OLS. Second, we directly control for the number of cases on the left-hand side of our preferred specification. The results of our main analysis hold in these two exercises.

Finally, we present a model including the first and second lag values of PM2.5 as done by Ebenstein et al. (2016) and Burkhardt et al. (2019) to dispel worries regarding the econometric specification. Figure 8 shows the coefficients from re-estimating the four initial models with two lagged values of PM2.5 as additional explanatory variables. The coefficients on the lags are of relatively small magnitudes and statistically insignificant, suggesting that air pollution levels for the months before the hearing do not affect sentencing decisions.

Fig. 8
figure 8

Effects of PM2.5 on convictions (dynamic model)

8 Discussion

While legal formalism obligates Indian judges to solely apply statutory considerations to case evidence in a rational, automated, and deliberative fashion (Danziger et al. 2011), we find monthly PM2.5 concentrations to increase convictions. Evidence from the fixed effects model suggests that a 10 μg/m3 increase in monthly PM2.5 raises convictions by 1.6%. The corresponding causal estimate from the control function approach demonstrates that its impacts are even more substantial at 7.41%. Although our results stand contrary to previous studies looking at the influence of contaminated air on judicial decisions (see Hou and Wang 2020), they are in line with a selection of articles suggesting a positive relationship between air pollution, aggression, anxiety, or the likelihood of punishing others (Herrnstadt et al. 2016; Younan et al. 2018; Burkhardt et al. 2019; Bondy et al. 2020; Herrnstadt et al. 2021).

To illustrate the repercussions of PM2.5 on convictions, we consider the universe of currently active criminal cases in the Indian lower Judiciary. There are one hundred and forty-four monthly cases and eight convictions per Indian subdistrict within our sample period. At his conviction rate, we expect 1.9 million convictions out of the 31.38 million active cases as of the 30 th of August 2022 (E-Courts 2022). As such, increasing the concentration of PM2.5 by 10 μg/m3 across the entire country (assuming homogeneous treatment effects) would increase convictions by 31,160 and 145,450 cases with the fixed effects model and the control function approach, respectively.

Although assessing the economic cost of a wrongful conviction is challenging, some studies have tried to estimate it through a combination of social, administrative, and individual costs. For instance, Silbert et al. (2015) look at 692 revoked convictions in California to estimate the legal costs of a single exoneration to approximately $400,000. Their figure includes incarceration costs, legal spending, and victim compensation. The 692 examined cases spent 2346 years of jail time, implying around 3.4 years per wrongful conviction. Taking costs to be linear, a year of wrongful conviction would have cost approximately $118,000 per case in California.

Let us consider a simplified scenario where the corresponding cost in India is proportionate to differences in 2022 GDP per capita, according to the World Bank. The equivalent figure would stand at $3068 per wrongful conviction, implying that the increase in convictions of 1.6–7.4% following a 10 μm3 increase in PM2.5 concentration would translate into a cost of around $96 to $444 million annually. While this back-of-the-envelope calculation is limited in insights, we find it helpful to illustrate that the effect of air pollution may not only be significant in terms of the number of convictions but also place a relevant cost burden on society.

Aside from the fact that convictions swayed by external factors have immense implications for the fate of the concerned individual, their influence can also undermine trust in the entire judicial system and pose a risk to public trust (Norris et al. 2020). Moreover, while it is impossible to estimate the actual proportion of wrongful convictions, previous studies find evidence that citizens’ perception of their share is often higher than the actual rate (Huff et al. 1996).

9 Conclusion

In this study, we explore the impact of air pollution on the number of judicial convictions in India. For this, we probe the universe of Indian criminal cases from 2010 to 2018 and proxy PM2.5 with state-of-the-art remote sensing data (van Donkelaar et al. 2021). Our identification strategy relies on a high-dimensional fixed-effects Poisson pseudo-maximum likelihood estimator panel model and a control function approach to estimate the causal relationship between PM2.5 and the number of monthly convictions in Indian subdistricts.

While the estimates from the fixed effects model imply that a 10 μg/m3 increase in monthly PM2.5 increases the number of convictions by 1.62%, the control function results show a more significant coefficient of 7.24%. Simple back-of-the-envelope calculations indicate that the estimated effects are statistically and economically significant. Notably, this paper does not explore the bio-physiological channels driving the effects. As such, the finding remains a black-box estimate hindering our capacity to explore the dynamics of the proposed mechanism, e.g., we cannot assert if the increase in convictions is due to physiological or psychological workings (Lu 2020). Nonetheless, our findings match an array of possible explanations. I.e., pollution-induced irritability (as in the context of crime (Herrnstadt et al. 2016)), higher anxiety (Kouchaki and Desai 2015), altered reasoning (Künn et al. 2019), and increased proclivity to punish (Lu 2020).

This study is the first to look at pollution-induced bias in Indian legal processes and the first to find a significant effect of contaminated air on judicial outcomes. Our contribution also includes the empirical analysis of external influences on sentencing decisions in the context of limited data availability and low-quality pollution measures. Although our findings oppose previous evidence from China (Huang et al. 2020), the relevant study only considers a selection of big cities and is not representative of the entire population. Moreover, setting, institutional capabilities, climate control, and environmental factors are likely to be different across contexts. Furthermore, the external validity of our results hinges on the particularities of the Indian Judiciary. Although there is little reason to believe that the bio-physiological mechanisms differ worldwide, Indian court processes are exceptionally long, and air pollution is particularly high in the Indian subcontinent.

Our results add to the growing literature claiming that traditional cost-benefit analyses understate the actual costs of air pollution because they fail to measure its substantial sub-clinical effects (Chay and Greenstone 2005; Ebenstein et al. 2016; Lu et al. 2018). According to this study, besides the enormous environmental, health, and resource misallocation cost of air pollution, exposure also entails a severe ethical burden on the judicial system with potential consequences for human highstakes decision-making. This new evidence on the effects of air pollution on decision-making puts further pressure on governments to enact environmental policies like the NCAP to decrease citizens’ exposure to nocive air.

Future work could examine heterogeneous treatment effects across different dimensions like age, gender, or experience. For instance, there is evidence that the effects of air pollution on cognitive performance are more prominent for men (Ebenstein et al. 2016; Zhang et al. 2018; Roth 2020; Lu 2020); This could be particularly relevant given the well-known gender imbalance in India’s Judiciary (Ash et al. 2021). Another extension would be to look at heterogeneous treatment effects by crime category. Likewise, we only look at the binary decision of convicting people. Researchers could explore air pollution’s influence on sentence severity instead of the number of convicted individuals, as done by Hou and Wang (2020).