1 Introduction

More than a hundred countries have introduced quotas for women in parliament or in party lists in the last two decades (Pande & Ford, 2012; Besley et al., 2017; Dahlerup, 2006) and the percentage of women in parliament worldwide has more than doubled, standing at 25.2 percent in October 2020.Footnote 1 The feminization of politics is one of the most exciting political phenomena of our time. Yet, we do not know what it portends for growth, the rising tide that is thought to lift all boats. In this paper we present the first systematic examination of whether women politicians are good for economic growth.

The association of women with redistributive policies and a tolerance of higher taxes (Edlund & Pande, 2002; Edlund et al., 2005; Campbell, 2004) makes it plausible that, at least in the short to medium term, women politicians are less effective than men at promoting growth. Women have been shown to favour public goods investments, such as in education and health (Bhalotra & Clots-Figueras, 2014; Clots-Figueras, 2012), which may have only long term returns. However, women legislators might promote growth if women who select into public office have a stronger sense of public mission, are more motivated to meet higher expectations, or are less corrupt (Beaman et al., 2006; Brollo & Troiano, 2016; Dollar et al., 2001; Swamy et al., 2001; Mauro, 1995; Prakash et al., 2019; Non et al., 2022).Footnote 2

We know of no causal estimates linking economic performance to the gender of politicians, but a few recent studies examine impacts on firm performance of women on corporate boards. The results of these studies are ambiguous, with many suggesting negative impacts or no impact (Ahern & Dittmar, 2012; Matsa & Miller, 2013; Gagliarducci & Paserman, 2014; Baltrunaite et al., 2021). However, this evidence base is too small to be conclusive, and the gender composition of decision makers may influence economic performance differently in the political and corporate sectors.

Two factors probably contribute to the scarcity of causal evidence on the relationship between legislator gender and economic performance. The first is that constituency level data on economic activity are not available in most countries. We use satellite imagery of nighttime luminosity as a measure of growth in economic activity. A number of studies examine the validity of this measure, and use it to proxy growth, including studies set in India, see Henderson et al. (2012), Chen and Nordhaus (2011), Costinot et al. (2016), Donaldson and Storeygard (2016), Bruederle and Hodler (2018) and Baragwanath et al. (2019).Footnote 3 We also show, using constituency-level data, that women are more effective than men at overseeing road building and at raising the share of non-farm employment, and that women are less prone to corruption. Each of these is an indicator of economic progress in its own right.

A second reason for the paucity of causal research on women legislators and growth is that it poses an identification challenge. Constituencies in which women win elections may be systematically different in ways that are correlated with economic performance. To isolate the role of legislator gender from voter preferences and other potentially omitted variables at the constituency level, we use a regression discontinuity design on close elections between men and women. In first-past-the-post elections in which the winner takes all, there is a sharp discontinuity at the zero vote margin between the top two candidates. In this setting, the identity (and hence gender) of the winner can be considered quasi-random (Lee, 2008; Eggers et al., 2015; Imbens & Lemieux, 2008). Comparing constituencies in which a woman wins against a man by a narrow margin (‘treated’) with those in which a man wins against a woman by a narrow margin (‘control’) can thus isolate the causal influence of legislator gender.Footnote 4

We examine data for 4265 state assembly constituencies for 1992–2012, during which time most states had four elections. This is a period of strong economic growth in India. It was also a period in which the share of female state legislators increased from about 4.5 percent to close to 8 percent. Moreover, there was vast regional variation in both the gender composition of state legislators and luminosity growth (see Figs. 1 and 2).

We find that women legislators in India raise economic performance in their constituencies by 2.3 percentage points per year more than male legislators. We discuss below the numerous specification checks that this coefficient is robust to. When we condition upon state-specific election-term fixed effects, the coefficient falls to half the size but, in all cases, we can reject that electing women compromises growth relative to electing men.Footnote 5

Fig. 1
figure 1

The geography of women legislators: 1992–2008. Subfigure a indicates constituencies in which a female candidate won an assembly constituency seat in state elections between 1992 and 2008. Subfigure b indicates constituencies in which the female candidate won in a close mixed-gender election. Subfigure c additionally shows constituencies where the female candidate lost in a close mixed-gender election

Fig. 2
figure 2

Luminosity in India. Subfigures a and b show the level of average luminosity in India in 1992 and 2009, respectively. The average growth rate of GDP in India during this period was about 120%. Source for all figures: DMSP-OLS v4 Time Stable Annual Composites from NOAA National Geophysical Data Center

Fig. 3
figure 3

GDP against luminosity- state data. Scatter of log of night lights per capita and log of GDP per capita, both using the state population in the denominator. The time period is 1992–2009

Now, if women-led constituencies do better on account of winning resources away from male-led constituencies, then overall impacts on economic growth of increasing the share of women legislators would depend upon the size of these negative spillovers. Assessing this by mapping growth in neighbouring constituencies to legislator gender in the index constituency, we reject negative spillovers.Footnote 6 Thus, our estimates suggest that increasing the share of women legislators favours economic progress.Footnote 7

In probing mechanisms using the same close election strategy, we find evidence that women legislators are more likely than their male counterparts to achieve completion of road infrastructure projects and that they are less corrupt, by two different indicators of corruption. In a different exercise that compares the male–female gap in performance in close vs. non-close elections, we find evidence consistent with women being less prone to distortions arising from electoral incentives. Each of these potential mechanism variables has been shown in previous work to impact growth. Thus, this sequence of results supports and strengthens our finding that women increase economic growth. Indeed, each of these results, on its own, is a contribution to the literature and, pulled together, they point to likely improvements in economic growth under women’s leadership.

We now elaborate the evidence on these intermediate outcomes. Since economic infrastructure is an important input to economic growth, especially in developing countries (Jacoby, 2000), we analysed legislator performance in implementation of a massive federally-funded village road construction program involving state legislators bidding for federal funds and delivering goods at the local level. Using administrative programme data we find that, although male and female politicians are equally likely to negotiate federal projects for road building in their constituencies, women are more likely to oversee completion of these projects. The share of incomplete road projects in woman-led constituencies is 22 percentage points lower than in male-led constituencies, a mechanism that plausibly contributes to the better growth performance of woman-led constituencies. Like George (2019), we interpret the share of projects completed vs. stalled as a measure of politician effort.

To investigate corruption, we use asset growth in office, a measure devised and validated in Fisman et al. (2014) (who do not look at gender differences). We find that the rate at which women accumulate assets while in office is 12 percentage points lower per annum than for men. We analyse an alternative measure of (potential) corruption that is measured before the legislator enters office, which is an indicator for whether the contestant has pending criminal charges against them. Comparing characteristics of male and female legislators in the close election sample, we find that men are about three times as likely as women to contest with pending charges. Following Prakash et al. (2019), we estimate the growth penalty associated with criminal legislators on our sample. Using this parameter, we estimate that criminal tendencies can explain close to one fourth of the identified difference in growth between male and female-led constituencies.Footnote 8

India is a large country and there is considerable variation across the states in governance or in opportunities for corruption. Using a crude marker of this, we divide the states into two samples which we refer to as more and less developed. We find a larger difference in economic performance in favour of women in the sample of less developed states. This is mirrored in larger gender difference in both criminality and corruption in office in the less developed states. However the legislator gender gap in road completion rates is not significantly different between the more and less developed states. These results suggest that, as economic development progresses, the growth advantage from electing women may narrow but is unlikely to be eliminated.

We also divide the Indian states into groups that are more vs less gender-unequal. We find that women do not perform significantly better than men in the most gender-unequal states (with a population female to male ratio below the 25th percentile). This is consistent with women, including women leaders, having more limited agency in these states.

As regards internal validity, the data satisfy a suite of checks on the RD design. We show that there is no evidence of sorting at the threshold, and that a rich set of constituency-level pre-determined electoral and demographic covariates are balanced around the threshold. This mitigates concerns that the estimates are driven by pre-existing differences in constituency characteristics, in particular the weaker performance of men cannot be attributed to mean reversion or to their being elected in places with weaker growth potential. Luminosity growth (and, similarly, road completion rates and non-farm employment share) in the preceding election term exhibit balance at the threshold. We nevertheless conduct a stricter test to allay the concern that constituencies in which women are narrowly elected against men are different in ways that favour growth. We posit that unobservable imbalance between constituencies with female and male legislators will tend to be smaller among neighbouring constituencies, and re-estimate the main equation limiting the estimation sample to constituencies with female legislators and their neighbours (mostly male-led). The coefficient of interest is almost identical, suggesting balance on unobservables in the original sample. In a further set of specification checks, we show results conditional upon covariates, constituency fixed effects and state-specific election term fixed effects. We further evaluate the RD design by re-estimating the model with a series of placebo thresholds, demonstrating that the placebo coefficients are smaller than the true coefficient, and not significantly different from zero.

In considering external validity of our results it is important to highlight that a third of all mixed-gender races are won with narrow margins (i.e. are in the close election sample). Indian elections are competitive in general, the share of all elections that is close also being a third. So elections in which women contest against men are neither more nor less competitive. Figure 1 shows that constituencies in which women win are geographically dispersed, so our analysis does not pertain to a specific region. Women are also not significantly more likely to be from any one of the main political parties, though we nevertheless show that our estimates are robust to controlling for legislator party.

We discuss selection into the close election sample. In particular, we consider whether men who win against women with narrow margins are negatively selected relative to men who win against women with wide margins. We find no significant difference in constituency or candidate characteristics between these two groups of men, except that men winning in close elections are more likely to have dynastic links than men winning with a wider margin. Dynasts are less effective leaders over an election term (George, 2019) but dynastic links cannot explain the better performance of women because, in the close election sample, men and women are equally likely to be dynasts.Footnote 9 To investigate whether men who win in close elections are negatively selected on unobservables, we adapt a strategy suggested in George (2019), using swings in the state-level vote share of the candidate’s party as a measure of luck. We show that our results are robust to accounting for negative selection among men in close elections. Even if we focus only on higher quality candidates (who won despite a negative party swing), women outperform men.

To explain why women appear to do no better than men (though no worse) in non-close elections while decisively doing better in close elections, we highlight that electoral incentives are sharper in close than in non-close election constituencies, and posit that men and women respond differently to these incentives. In particular, women are less likely than men to distort economic policies to pursue a narrow electoral agenda. We provide descriptive evidence consistent with this, showing that re-election rates for women vs. men are lower in close elections, while being similar in non-close elections,Footnote 10

Overall, we set out to find out if electing women, while being positive for certain redistribution-related outcomes, compromised economic growth. We find that it does not, indeed, electing women results in improved economic growth. So as to allay any concerns over luminosity growth as a measure of growth, and so as to bolster the result with evidence of plausible mechanisms, we provide estimates for five different outcomes, all of which show significant movements in a consistent direction.

1.1 Relation to existing literature

Recent research by economists on women in politics has been rather dominated by research on India because of the opportunities for identification created by randomisation of local government gender quotas. A constitutional amendment mandating that a random one third of village council positions be reserved for women was passed in India in 1993. A number of studies analyse these reservations, for example, Chattopadhyay and Duflo (2004), Beaman et al. (2009), Iyer et al. (2012); Afridi et al. (2017). The evidence in this study is different for four reasons. First, we analyse the performance of women vs. men who win in competitive elections, which is not comparable to relative performance if women are elected to reserved seats. Second, we analyse state legislatures, which have different powers and functions compared to village councils in India. Third, the village gender quota was implemented jointly for council membership and council leadership, while we isolate the role of membership of the state legislature. Fourth, previous studies of women competitively elected to state legislatures have focused upon the composition of state-provided public goods (Bhalotra & Clots-Figueras, 2014; Clots-Figueras, 2012), while we focus on growth and plausible determinants of growth. This is a step change because while there is considerable evidence that men and women in government have different preferences or priorities, it is unclear how a social planner would determine the trade-offs that arise. Economic growth, on the other hand, shifts the entire possibilities frontier outward.

We contribute new evidence to a literature on political identity and substantive representation (Osborne & Slivinski, 1996; Besley & Coate, 1997) that has tended to focus more narrowly upon differences in priorities and hence on the composition of government spending, rather than on growth. For instance, see Chattopadhyay and Duflo (2004); Iyer et al. (2012); Brollo and Troiano (2016); Bhalotra and Clots-Figueras (2014); Clots-Figueras (2012); Miller (2007); Edlund et al. (2005); Chaney et al. (1998); Thomas (1991); Svaleryd (2009); Bhalotra et al. (2019); Bhalotra et al. (2023).

Amongst the findings of these studies are that women in politics have influenced the passage of abortion laws in the US; equal inheritance rights legislation, the reporting of crime against women, and the promotion of public health inputs to child survival in India; government spending on childcare, expenditures on education and elderly care in Sweden; and maternal mortality decline in developing countries. A few studies find no significant influence of the gender of local politicians on policy choices (Ferreira & Gyourko, 2014; Rigon & Tanzi, 2012). Since road construction has higher returns for men (Asher & Novosad, 2019) and economic growth, in principle, favours all, our finding indicates that women politicians are not exclusively focused upon serving the interests of women voters, but are also more generally effective in providing public goods.

Our study is relevant as this is a time when women are increasingly participating in government across the globe. In India, a historic constitutional amendment proposing to reserve one third of all federal and state assembly seats for women was passed by the upper house of the federal parliament in 2010. However, it was not voted on in the lower house and lapsed in 2014. Our findings are of considerable interest beyond India, given the scarcity of evidence on the question of how legislator gender is associated with economic performance, and in view of the fact that the share of women in government is small but rising in many (rich and poor) countries. In addition to contributing the first causal estimates indicating how election of a female vs. male legislator influences luminosity growth, we also provide new causal evidence on how legislator gender influences road infrastructure, sectoral change and corruption and we present evidence suggesting that men and women respond differently to electoral incentives. We conclude the paper with remarks on how the growth premium associated with women leaders might evolve with economic development.

The remainder of this paper is structured as follows. Section 2 offers contextual information on Indian elections and women’s political participation. Section 3 presents our empirical strategy. In Sect. 4, we discuss the electoral, luminosity, road building and candidate characteristics data. Section 5 presents the main results. Section 6 explores geographical spillovers. In Sect. 7, we investigate mechanisms. Sections 8 and 9 present a number of extensions, and Sect. 10 concludes.

2 Context

India is a large federal country with highly competitive multi-party elections monitored by an independent electoral commission. Electoral fraud is uncommon, although some areas suffer from clientelism and elite capture (Anderson et al., 2015). The current 29 states of the Indian Union are parliamentary democracies in which, typically, a new legislative assembly is elected every five years. There is a high degree of turnover at the state level with state governments often voted out of office. In contrast to the case of the USA, but similar to Brazil, incumbents in India are less likely to win than challengers (Uppal, 2009). Members of Legislative Assemblies (legislators) are chosen according to a first-past-the-post system in single member constituencies. Voters vote for individual candidates rather than party lists. Successful candidates are typically fielded by an established party,Footnote 11 While there are political quotas for certain minority tribes and castes at the local, state and national level, gender quotas in India are only at the local level (village, town) and only since 1993 (Chattopadhyay & Duflo, 2004).

State legislators shape policy. They influence the flow of federal funds and the financing of village councils and they are responsible, inter alia, for roads, electricity, law and order, health and education. They act to serve their constituents, whether on account of electoral incentives or mission-driven preferences. They are able to implement their preferences because they are each endowed with a development fund that they can spend in their constituencies in the manner they think fit. Political manipulations by state officials can further influence the allocation of federal transfers (Khemani, 2006) and of federally funded development programs (Gupta & Mukhopadhyay, 2016). The state government needs the support of a majority of legislators to rule effectively, which gives them substantial influence at the state level and the power to negotiate for state resources for their constituencies. Legislators can directly influence economic conditions in their constituencies by exerting effort to pursue development opportunities or implementing federal or state government programs more or less effectively (Baskaran et al., 2015; Min, 2015).

In India state legislators are not only active in the state capital, influencing legislation or working in committees. They spend a lot of their time acting as intermediaries between their constituents and state bureaucracies and private firms. Jensenius (2015, p. 197), for example, summarizes their role as follows:

Members of Legislative Assemblies (MLAs) in India are often thought to matter more as “fixers” in their constituencies than as legislators (Chopra, 1996). At the state level, meeting activity in legislative assemblies is limited (Jensenius & Suryanarayan, 2015), so politicians spend most of their time in their constituencies, where they help people access government schemes, try to influence the bureaucracy to implement projects, or use their networks to attract construction or business projects (Chopra, 1996; Asher & Novosad, 2017; Bussel, 2014).

Evidence emerging from political quotas in village and town councils and analysis of close elections to state assemblies suggest that women politicians have different priorities from men, tending to favor the concerns of women and children (see references in Sect. 1). Despite a secular increase in the share of women legislators, women remain vastly under-represented in Indian federal and state politics, their share oscillating around 10 percent in recent years (Beaman et al., 2012). This reflects not so much lower chances of winning conditional on standing, but that fewer women come forward as candidates (Bhalotra et al., 2017). This may be because women dislike competitive or corrupt environments or because party leaders discriminate against women in the nomination process (Spary, 2014).

3 Empirical strategy

We aim to estimate the causal effect of election of a woman legislator on economic activity in her constituency. If the election of women was randomly determined, constituencies that elected a man would serve as a valid counterfactual. However, the election of women is unlikely to be random. For instance, one might expect that constituencies with more progressive voters are more likely to elect women. This creates the identification challenge that unobserved differences between constituencies that elect women vs. men are potentially correlated with the outcome (economic activity).

To address this challenge, we exploit the discontinuity in electoral outcomes that arises in first-past-the-post electoral systems by comparing female and male winners in close elections, defined as elections in which the margin of victory between the winner and the runner-up is arbitrarily small. Previous work shows that, in these circumstances, the identity of the winner is quasi-random (Lee, 2008).

The estimated model is:

$$\begin{aligned} y_{ist} = \alpha + \tau *femalelegislator_{ist} + f(margin_{ist}) + \varepsilon _{ist} \end{aligned}$$
(1)

where \(y_{ist}\) is average growth of light in constituency i in state s over the election term t. We calculated the growth of light as the difference in the logarithm of light density in years \(t+1\) and t. The margin of victory in constituency i in state s for election in t (\(margin_{ist}\)) is the forcing variable. Since we restrict the sample to elections in which the top two vote winners are a man and a woman, \(margin_{ist}\) is defined as the difference between the vote shares of the female and the male candidate. So, by construction, it is positive when a woman wins against a male runner-up and negative when a male wins against a female runner-up. At a (notional) margin of zero, the gender of the constituency leader changes discontinuously from male to female. We can think of the treatment \(femalelegislator_{ist}\), as an indicator for the winner being a woman, defined as follows:

$$\begin{aligned} femalelegislator_{ist}= & {} 1 \ if \ margin_{ist}>0 \nonumber \\= & {} 0 \ if \ margin_{ist}\le 0, \end{aligned}$$
(2)

The RD design considers a close neighbourhood, \(\lambda \), around the threshold margin of zero and premises that as \(\lambda \) goes to 0 the differences between constituencies that elected a female candidate and those that elected a male vanish, allowing us to identify the causal effect of electing a woman legislator:

$$\begin{aligned} \lim _{\lambda \rightarrow 0^+}E[y_{ist}\mid 0< margin_{ist}\le \lambda ] - \lim _{\lambda \rightarrow 0^-}E[y_{ist}\mid -\lambda \le margin_{ist}< 0] = \tau , \end{aligned}$$
(3)

This is the difference in the average outcomes of constituencies that barely elected a female legislator against a male runner-up and constituencies that barely elected a male legislator against a female runner-up. The RDD assumption that the distribution of the error term, \(\varepsilon _{ist}\), is continuous in the forcing variable is weaker than the identifying assumptions that other selection-on-observables methods rely upon. Since there is no within election term variation in our treatment variable (\(femalelegislator_{ist}\)), we average the growth of light over an election term. Standard errors are clustered at the constituency level to allow for within constituency correlation of the errors over different election terms.

We estimate the discontinuity using local linear regressions as suggested in Gelman and Imbens (2019). We report results for several bandwidth choices including the optimal bandwidth procedure suggested in Imbens and Kalyanaraman (2012). In further robustness checks, we retain only neighbours of female-led constituencies as any unobservable differences are likely to be smaller and we investigate sensitivity of our results to an alternative definition of the victory margin, using the larger sample of all races in which a female contested, irrespective of whether or not she was ranked among the top two in voteshare (Meyerson, 2014). We also show results conditional on party, allowing for measurement error and we show results for the early vs. late years of the electoral term. We then present estimates for spillovers and potential mechanisms before investigating heterogeneity in impact. The empirical specifications for these extensions of the main analysis are presented together with the findings below.

4 Data

Table 8 collects the variable definitions and sample periods. Table 9 provides summary statistics of the main outcome variables (Panel A) and the predetermined covariates (Panel B) in our data. It also provides summary statistics for variables available from the candidate affidavits (Panel C). In this section, we discuss the electoral data and the data on luminosity, road construction and non-farm employment.

4.1 Night lights data

We use nighttime light imagery data gathered by satellites from the U.S. Air Force Defense Meteorological Satellite Program’s Operational Linescan System. The data are processed by the National Oceanic and Atmospheric Agency to exclude pixels with low quality data due to clouds, stray light, lunar illuminance, auroral lights, and active fires. Annual composites are produced by averaging across all remaining good quality data from across the calendar year. Each pixel is encoded with a measure of its annual average brightness on a 6-bit scale from 0 to 63, and geo-referenced onto a 30 arc-second grid (approximately 1 km\(^2\) at the equator). Night lights data were first digitized in 1992 and our electoral data run through to 2012.

We overlaid a map of 4265 Indian State Assembly constituencies to create constituency level light density data as the sum of total light emitted by each pixel within constituency boundaries divided by the area of the constituency. Figure 2 shows considerable growth in the intensity and spread of lit areas over time, consistent with the substantial economic growth during this period.

To examine the relationship between growth in nighttime light output and economic growth, we use state-level GDP data, which is the smallest administrative unit for which consistent time series data are available. Figure 3 plots the data, showing a strong correlation. Panel data estimates, conditional on state and year fixed effects, indicate that a 1 percent increase in night lights is associated with a 0.15 percent increase in GDP (see Table 10).Footnote 12

Henderson et al. (2012) argue that although GDP data is widely reported, it is often unreliable in developing countries where accounting biases arise because the informal sector is large, making it harder to verify inputs, outputs, incomes and profit (see also Jerven, 2013, Bhalotra & Umana-Aponte, 2015). Thus GDP and night lights are both error-prone measures of economic activity, and it is unclear which is measured with more error. The compelling advantage of nighttime lights data, exploited here, is that it is available for disaggregated areas and can be measured for state assembly constituencies.

We nevertheless consider three technical limitations of the sensor that may generate measurement error in the use of night time lights to estimate economic activity: saturation, low sensitivity and blooming. Saturation occurs because of the limited dynamic range of the satellite sensor, leading to a limitation in recording high levels of brightness on the ground. This results in data censoring, with the brightest pixels being assigned the highest digital number value of 63 pixels. This is most common in the centers of large cities and will tend to result in an underestimate of growth if growth occurs within city centers where light output is saturated. On the other hand, the limited sensitivity of the sensors implies that dimly lit areas are not detectable, and assigned a value of zero.Footnote 13

In the close mixed-gender election sample, we have 7 cases (also 1% of observations) with a luminosity of zero and also 7 cases of top-coding (1% of observations). In the robustness checks section, we re-estimate the baseline model excluding these cases and, to anticipate those results, they are very similar in magnitude and not statistically significantly different. In the main analysis we have retained the top-coded cases and added 1 to each zero value before taking logs. If instead we use the inverse hyperbolic transformation, we get similar results, available on request.

The third potential source of measurement error is blooming, which refers to light output from a brightly lit area dispersing over neighbouring areas. Blooming is most prominent around the edges of large cities and can increase in the presence of nearby water sources that reflect light into space. This decreases the precision of light output measurement. If blooming occurs within constituencies, there is no problem. However, there is potential for bias in our estimates if substantial increases in light output in bright constituencies spill over into neighbouring constituencies. We will report a specification in which we estimate spillovers to neighbouring constituencies, and discuss there a robustness check in which we drop brightly lit areas to adjust for blooming potentially affecting neighboring constituencies.

Henderson et al. (2012) provide a detailed discussion of the satellite data, and the premise for interpreting light growth as economic activity. As most lights observable from space are from electric illumination, in principle, electricity consumption could be used to predict GDP growth, but electricity data are unavailable at the constituency level both for India and more generally. Among studies documenting an association of night lights and electricity use are Chand et al. (2009), Shi et al. (2016), Xie and Qihao (2016), the first for India and the other two on a global scale.

Electricity is the lifeblood of the modern economy. The quality and quantity of electricity service provision, including hours of supply, are a known constraint on output, see Allcott et al. (2016), Dinkelman (2011), Rud (2006), Lipscomb et al. (2013).Footnote 14 Politicians can influence availability of electricity through providing more connections and ensuring higher reliability (fewer power cuts), and electricity often features as one of the top priorities of Indian voters in election surveys (Chhibber et al., 2004). A number of recent studies highlight the relevance of political control over electricity distribution in India, see Burgess et al. (2020), Mahadevan (2019), Dubash (2018), Kale (2014), Baskaran et al. (2015). However, none of these studies is focused on distinguishing the behaviour of male and female politicians.

4.2 Election data

The election data are drawn from successive editions of the Statistical Reports on General Elections to Legislative Assembly of States, published by the Election Commission of India. For each election, the reports contain candidate names, vote counts, gender and party affiliation; assembly constituency names and codes, year of the election, size of the electorate, total number of votes cast, and number of valid votes. India currently has 29 states. Our data, which cover about 99% of the population in India, include all states and the union territory of Delhi, and exclude the disputed northern state of Jammu and Kashmir and smaller union territories.Footnote 15

A constitutional amendment in 1976 fixed the boundaries of constituencies until 2001 to avoid adversely affecting representation of states that implemented population control measures. The fourth Delimitation Commission empowered by the Delimitation Act of 2002 set out to redraw constituency boundaries based on the 2001 census data. However, the Commission’s order was only accepted in 2008 and the first election to use new boundaries was held in 2008 in the state of Karnataka. Due to non-comparability of the pre- and the post-delimitation constituencies, we only consider elections held before 2008. However, our data extend until 2012 for states which had not yet held new elections under the newly drawn boundaries.Footnote 16

In the analysis period, 1992–2012, there are 16,857 constituency-election years. Of these, 1709 (10.3%) constituency-election years are in the mixed-gender sample, defined as a sample in which a woman and a man are the top two vote-winners.Footnote 17 Among mixed-gender elections, 471 (27.6%) are close elections, defined as elections with a victory margin of less than 5%. In fact a third of all Indian elections are won with a victory margin of less than 5%, a marker of how competitive Indian elections are in general.

So elections in which women contest against men are, in general, neither more nor less competitive. Figure 1 shows that constituencies in which women win are fairly evenly distributed across the country, so our analysis does not pertain to a specific region.Footnote 18

We utilize data on candidate characteristics drawn from affidavits submitted to the Election Commission of India. The submission of an affidavit became mandatory for all political candidates following a Supreme Court of India order in 2003, the Right to Information Act. The Election Commission of India publishes the affidavits and they contain information on education, assets, liabilities, and pending criminal charges. The Association of Democratic Reforms (ADR), an election watchdog, has compiled the information since 2004.Footnote 19 The part of the analysis using candidate characteristics is thus restricted to state elections held between 2004–2008, encompassing one election for each state.

4.3 Road construction data

We investigate acquisition and completion of federally awarded village road building contracts as a proxy for public goods provision at the constituency level. We use administrative data on a centrally sponsored rural roads construction program, Pradhan Mantri Gram Sadak Yojana (PMGSY), launched in 2000 that aims to provide all weather road connectivity in rural areas, and forms an integral part of the Government of India’s poverty reduction strategy. This program is unprecedented in its scale and scope (Aggarwal, 2017). We obtained road sanctioning and completion dates. The data are available at the census block level, a sub-district census administrative unit. We matched the roads data to state assembly constituencies.Footnote 20

4.4 Non-farm employment data

In general, it is difficult to find conventional measures of economic activity such as GDP at the constituency level, but luminosity can be mapped to any coordinates. Recently, Asher et al. (2019) have made available constituency level data on non-farm employment. We use this share as a proxy for economic activity. The data are drawn from the Socioeconomic High-resolution Rural-Urban Geographic Data Platform for India (SHRUG), sourced from the 3rd through the 6th rounds of the Economic Census of India, covering the years 1990, 1998, 2005, and 2013. The Economic Census is a complete enumeration of all non-crop producing economic establishments in India including both public and private firms in the formal and non-formal sectors. The SHRUG files are available aggregated at the constituency-year level. Since the data are not annual, we assume that non-farm employment is constant between rounds. This is not ideal but as it is difficult to obtain constituency level economic activity data, we nevertheless use these data to provide a crude check on the luminosity data. As we do not have total employment at the constituency level we normalise on constituency-level population to arrive at the share of non-farm employment.

5 Results

5.1 Validity of RD design

Validity of the RD design requires continuity of predetermined characteristics of constituencies across the threshold of a zero victory margin. We use a rich set of variables determined before the election in t, either variables from the previous election in (\(t-1\)), or outcome variables averaged over the previous electoral term. These include the growth of night lights, the share of incomplete road projects, the share of non-farm employment, electorate size (i.e. number of registered voters), number of candidates, turnout, female turnout, whether the legislator was a woman, whether the legislator (in (\(t-1\))) was an incumbent, whether the head of the winning party was a woman, as well as whether the constituency was reserved for lower castes (Scheduled Castes or Scheduled Tribes), aligned with the state government, and aligned with the central government.

Figure 4 reports graphical evidence of the validity of the continuity assumption, and Table 11 shows tests of mean differences and the corresponding RD regression results.Footnote 21 To elaborate the graphs in Fig. 4, consider Panel (a) which plots average growth of light output in the previous election term against the margin of victory in t. The scatter plot depicts the local averages of growth of light in each successive interval of 0.5% of the margin of victory. The local linear curve is estimated using a triangular kernel and a 5% bandwidth and the 95% confidence interval is shown. The average growth of light in the previous term is a continuous function of the margin of victory. So there is no evidence here that women are more likely to be elected in constituencies that were performing either less well or better on luminosity growth in the electoral term preceding their election. Put differently, the balance test shows us that there is no differential pre-trend in the outcome in the “treated” constituencies (women win) as compared with the “control” constituencies (men win). We also find balance on the many other constituency characteristics listed above. Overall, the evidence suggests that only the gender of the legislator changes abruptly at the zero margin of victory and that, therefore, we can take the RD design as identifying the causal effect of the election of a woman.

Fig. 4
figure 4figure 4

Continuity checks- RD tests of balance on predetermined covariates. Each variable is plotted against female margin of victory in mixed gender races, which is the difference between vote shares of a female candidate and male candidate in mixed gender races. Mixed gender races are in which a woman either won or was a runnerup against a man. By construction, margin of victory is positive for female legislators and negative for male legislators. Each dot represents a local average in bins of 0.5 percent margin of victory. The solid lines are the smooth curves estimated using a local linear regression of each variable on margin of victory separately on either side of the cutoff of zero, triangular kernel and a 5 percent bandwidth. The figures also depict a 95 percent confidence interval for each variable around the solid curve

Fig. 5
figure 5

Density of the forcing variable. The figures plot the density of the margin of victory, which is the difference between vote shares of the female and male candidates in mixed gender races. Mixed gender races are defined as those in which a man and a woman rank in the top two. By construction, margin of victory is positive for female legislators and negative for male legislators. The magnitude of the discontinuity (log difference in height) is 0.13 (with a standard error of 0.15)

Fig. 6
figure 6

Legislator gender and luminosity growth. The dependent variable is the growth of light averaged over an election term against female margin of victory in mixed gender races. The victory margin is the difference between the vote shares of the female and male candidate in mixed gender races. These are races in which a man and a woman are the top two vote-winners. By construction, the margin of victory is positive when women win and negative when men win. Each dot represents a local average in bins of 0.5 percent margin of victory. The solid lines are the smooth curves estimated using a local linear regression of each variable on margin of victory separately on either side of the cutoff of zero, using a triangular kernel and a 5 percent bandwidth. The figures also depict a 95 percent confidence interval for each variable around the solid curve

Another RD validity check that we did is for sorting around the cutoff. Sorting has been documented in the case of close elections between Republicans and Democrats in the United States, and associated with manipulation of the margin of victory that renders the close election experiment invalid (Snyder, 2005; Caughey & Sekhon, 2011; Grimmer et al., 2012). To investigate this, Fig. 5 depicts the density of the margin of victory as suggested in McCrary (2008). There is no apparent discontinuity in the density around the cutoff. The point estimate of the discontinuity is 0.043 with a standard error of 0.075. This suggests there is no evidence of sorting in our sample of close mixed-gender races, and female and male candidates are equally likely to win. Observe that Fig. 5 also shows that the distribution of the margin by which women win is broadly similar to the distribution of the margin by which men win in mixed-gender races.

5.2 Results: legislator gender and economic performance

In this section we present estimates of the causal effect of female relative to male legislators on economic activity over the electoral term in the constituency from which they were elected. The RD estimate of the impact of electing a woman rather than a man is the difference in luminosity at the zero margin of victory.

The regression estimates are in Table 1. We estimate a local linear regression of growth of night lights on the margin of victory in the RD framework. The bandwidth is calculated using the optimal bandwidth procedure suggested by Imbens and Kalyanaraman (2012) (IK). We find that annual luminosity growth averaged over the electoral term is 15.25 percentage points higher in constituencies in which a woman won by a small margin than in constituencies in which a man won by a small margin, and this difference is significant at the 5% level (column 1). Using our estimate (from state-year data) of an elasticity of GDP to night lights of 0.15 (see Table 10), a 15.25 percentage point difference in luminosity growth translates into a 2.3 percentage point difference in GDP growth. Given that average growth in India during the period of study was about seven percent per year, our estimates indicate that the growth premium for constituencies stemming from them having a female legislator is about 32 percent.

The RD plot is in Fig. 6, which depicts average growth in luminosity against margin of victory. The data are averaged across bins that each cover 0.5 percentage points in the margin of victory and provide local linear smooths of the underlying data using a bandwidth of 5 percent. We observe a discontinuous jump in light output at the threshold margin of victory of zero, in line with the regression results. The graph plots coefficients for elections with victory margins larger than the optimal RD bandwidth, where we see the difference even out. These estimates are potentially contaminated by selection, which we examine in relation to the external validity of the RD results in the penultimate section of the paper.

Table 1 Legislator gender and luminosity growth
Table 2 Robustness tests
Table 3 Legislator gender and non-farm employment

Sensitivity to bandwidth Estimates using bandwidths that are half and twice the size of the optimal bandwidth are in Columns (2)–(3) of Table 1. The estimated coefficient declines as the bandwidth increases, but remains statistically significant.

We do not expect coefficient stability as we move outside the optimal bandwidth, but it is useful as a marker of how selection sets in as we move away from the threshold, and we discuss this later. In Panel A Column (5) of Table 2, we show that results using CCT optimal bandwidths are similar to the baseline results.

Sensitivity to functional form Column (4) of Table 1 shows that estimates with a second order local polynomial smoother are similar to those estimated with a local linear control function in Column (1). Gelman and Imbens (2019) argue against the use of polynomials in RD of higher order than the quadratic.

Sensitivity to controls While we have shown that pre-determined covariates are balanced at the RD threshold, a straightforward test for the effect of any imbalances is to directly control for pre-determined covariates. In Panel A Column (3) of Table 2, we thus re-estimate our RD specification for the optimal bandwidth while controlling for the pre-determined covariates discussed in Sect. 5.1 and we also control for constituency fixed effects. The resulting estimate is 18.07 percentage points, which is statistically similar to the baseline estimate. In Panel A Column (2), we show results with the less demanding inclusion of district fixed effects. In Panel A Column (4), we add state specific election term fixed effects. The coefficient is now half the size of the baseline coefficient, albeit not statistically different.Footnote 22

Neighbour sample We investigated the validity of the RD design using another strategy as follows. The idea is that any (unobservable) imbalances between constituencies with female and male legislators should be particularly small among neighbouring constituencies. We thus re-estimate the main equation limiting the estimation sample to constituencies with female legislators and their neighbours; see Panel B Column (2) of Table 2. The estimates are similar to those in Table 1, which suggests the absence of significant imbalances.

Alternative margin As a further sensitivity test, we estimated regressions with a larger sample that includes all mixed-gender races in which a woman contested, rather than just races in which a woman ranked among the top two, as in Meyerson (2014). The margin of victory is again defined as the difference in the vote shares of the top-ranked female and the top-ranked male candidate, except that now the top-ranked female may not be one of the top two vote-winners.Footnote 23 The results are similar to those in Table 1; see Panel B Column (1) of Table 2. This is because the victory margin in the additional races that are incorporated is likely to be away from the discontinuity and hence unlikely to influence estimates that exploit variation around the threshold of a zero victory margin.Footnote 24

Weighted regression We re-estimated the model weighting each mixed-gender race with the inverse of the proportion of mixed-gender races in the state over the sample period. The baseline (unweighted) model delivers an average coefficient that puts more weight on the relationship in states that have more close elections, and it is plausible that the relationship is heterogeneous across states (indeed we will show later that it is). The estimate is close to the baseline estimate, see Panel A Column (6).Footnote 25

Placebo outcome Luminosity growth is the primary outcome of interest, proxying economic growth, and we find supporting evidence when considering relevant mechanisms. We now model a placebo outcome- an outcome that we expect cannot change on account of legislator gender, which is rainfall. Panel B Column (6) shows a coefficient close to zero for the deviation of rainfall from its long-time trend.

Fig. 7
figure 7

Cross-country scatter–women in parliament and economic growth

Fig. 8
figure 8

Legislator gender and luminosity growth: placebo regressions with fake thresholds. This figure displays 62 placebo coefficient estimates for the gender dummy with confidence intervals. We obtain 31 placebo coefficients by estimating Eq. (1) on a subsample of male winners, redefining the margin of victory as placebo margin of victory = true margin of victory - x in steps of 0.5 within the interval { -20,-5}, thus effectively defining 31 placebo thresholds. We repeat this exercise on the subsample of female winners. The true coefficient estimate and confidence intervals are highlighted in red. Most placebo coefficients are not significantly different from zero and smaller than the true coefficient

Fig. 9
figure 9

Legislator gender and other economic outcomes. The dependent variable is the share of non-farm employment in Panel (a), the share of incomplete roads in Panel (b), and Asset growth in Panel (c). In Panel c the sample is restricted to candidates who re-contest the next election. Each variable is plotted against female margin of victory in mixed gender races, which is the difference between vote shares of a female candidate and male candidate in mixed gender races. Mixed gender races are in which a woman either won or was a runnerup against a man. By construction, the margin of victory is positive when women win and negative when men win. Each dot represents a local average in bins of 0.5 percent margin of victory. The solid lines are the smooth curves estimated using a local linear regression of each variable on margin of victory separately on either side of the cutoff of zero, using a triangular kernel and a 5 percent bandwidth. The figures also depict a 95 percent confidence interval for each variable around the solid curve

Placebo estimates Yet another strategy to evaluate our RD design is to estimate Eq. (1) with placebo thresholds using subsamples of only male and female winners, respectively. We estimate 62 placebo coefficients (and their confidence intervals) and collect them in Fig. 8 (see the figure notes for further details regarding the placebo regressions). We also include the true coefficient estimate in red. We find that all placebo coefficients are clearly smaller than the true coefficient (and also generally insignificant).

Gender versus constituency and other legislator characteristics We may be concerned that we are capturing the effects of other characteristics of the winning legislator or the constituency in question rather than those of gender per se. Specifically, women legislators could be (i) more or less likely to run in constituencies reserved for scheduled castes or tribes, (ii) more or less likely to be from scheduled castes or tribes, (iii) more or less likely to be Muslim, (iv) more or less likely to be from the BJP or the Congress, (v) more or less likely to be aligned with upper-level governments. Overall, this seems unlikely since we checked for balance in various individual and constituency characteristics (see Tables 11 and 13).

Nevertheless, to investigate this concern, we included indicators for these legislator and constituency characteristics. The estimates are robust to this (see Table 14). It is interesting that there is a positive association of luminosity growth with Congress rather than BJP Party leaders, with the alignment of the party of the legislator with the ruling party at the state and with the legislator being from a scheduled tribe.

Distribution of effects through the electoral term So as to investigate how the growth effects of having a female rather than a male legislator evolve, we re-estimated the model separating the first two years of growth from the last two years of growth in the electoral term. The coefficients are imprecise in these split samples and not significantly different from one another. However, the growth difference (between women and men) is more than twice as large later in the electoral term, consistent with any legislator activity cumulating or taking effect with an administrative lag (Table 15).Footnote 26

Non-farm employment share Non-farm employment share is a proxy for structural change, a process associated with economic growth as productivity in manufacturing and services tends to be higher than in agriculture. Using recently available data on non-farm employment at the constituency level we find that women perform better by 4% points over the electoral term, or 0.84% points p.a., see Table 3. This result is also robust to using different bandwidth choices and a local polynomial. Panel A in Fig. 9 is the corresponding RD plot, which displays a jump in non-farm employment at the threshold. Later we will show that heterogeneity in impacts of legislator gender on luminosity is mirrored in heterogeneity in impacts of legislator gender on non-farm employment share.Footnote 27

Measurement issues We discuss potential issues with the lights data in Sect. 4.1, including saturation, low sensitivity and blooming, explaining how we check that our estimates are not biased by these issues. We dropped all constituencies that are top-coded with respect to their luminosity, all observations with zero luminosity, and both top-coded and zero luminosity constituencies. The estimates are essentially unchanged, see Panel B Columns (3)-(5) of Table 2. We have presented the impact of legislator gender on luminosity growth as an impact on GDP growth using the luminosity-GDP elasticity derived from a state level panel data regression. We checked that this elasticity is not sensitive to the exact sample used. However this conversion is only indicative as, for example, the state-level elasticity may not be the correct elasticity if there are non-linearities in the relationship at the constituency level. We also provide estimates for a range of outcomes so that our results do not rely entirely upon the luminosity estimates. It is also worth reiterating that the motivation for this work is to test whether having women leaders might compromise economic growth so, strictly, we only require that we can reject this null.

Magnitude of effects To contextualise the magnitude of the effects we highlight two points. In our analysis sample (of mixed gender close elections) the share of constituencies won by a woman is 48% (and in the full unrestricted sample it is 5.4%). Thus the growth gains that we identify refer to a small share of constituencies, not to the country as a whole. As discussed in the section on spillovers, this does not translate into a discernible impact on state (and hence) national growth. Second, India is experiencing rapid growth over the analysis period, and potential growth in some less developed regions is high. Related, we show that the result that women are more growth producing than men emerges largely from the less-developed states of India.

6 Spillovers

We have shown that women are more effective than men at raising growth in their own constituencies. If this comes at the cost of lower growth in other constituencies, then effects of increasing the share of women on total growth are ambiguous. We therefore examine spillovers to contiguous constituencies. Spillovers can, in principle, go in either direction. They may be negative if legislators play a zero-sum game with fixed state resources. Alternatively positive spillovers may arise for the following reasons. First, legislators may build roads or electricity networks that continue across constituency boundaries, or road construction in one constituency may increase access to markets in neighbouring constituencies. Second, the success of women legislators may encourage yardstick competition if voters evaluate politicians in their jurisdiction by comparing outcomes with those in neighbouring jurisdictions (Besley & Case, 1995).

To implement this test, we define the dependent variable as light growth averaged over neighbours of constituency j identified using a constituency map. The mean (s.d.) of number of neighbours of a constituency is 5.8 (1.6).Footnote 28 The independent variable of interest is as before: the gender of the legislator in constituency j. The sample is still restricted to mixed gender races for j, and we use the RD approach described for the main analysis. This yields estimates of the difference in light growth in constituencies neighbouring female vs. male led constituencies.

The estimated coefficient is positive, but the difference is not significant (Panel A of Table 4). As discussed in the Data section, blooming in the night lights data could bias estimates of geographic spillovers from highly luminous constituencies. To assess the potential of any such bias to influence the estimates here, we dropped constituencies with top-coded light levels, and the results are robust to this- see Panel B of Table 4. Overall, there is no evidence of negative growth spillovers from female-led to neighbouring constituencies, allowing us to conclude that women legislators have a positive impact on overall growth.Footnote 29

Table 4 Spillovers to neighbouring constituencies
Table 5 Legislator gender and road completion
Table 6 Legislator gender and asset growth
Table 7 Heterogeneity by level of development and gender inequality

7 Mechanisms

7.1 Road infrastructure

We first investigate a hard outcome that is growth producing. In general and especially in developing countries, road infrastructure is a key ingredient to growth. Rural roads are estimated to have significant positive effects on local economic outcomes including growth and structural transformation, involving the decline of agricultural work in favour of wage work (which we also capture in the share of non-farm employment) (Jacoby, 2000; Shrestha, 2015; Jacoby & Minten, 2009; Casaburi et al., 2013; Asher & Novosad, 2019). In one of the few previous studies that uses luminosity growth to measure changes in economic activity in India, Asher and Novosad (2019) estimate that construction of a village road increases village-level luminosity by 2.5 percent per annum.

We use administrative data from the Prime Minister’s Village Road Program (PMGSY) described in Sect. 4.3. The PMGSY is a flagship programme that, between 2000 and 2015, funded the construction of over 400,000 km of roads (in over 100,000 new roads), benefiting almost 200,000 villages at a cost of almost 40 billion US dollars (Asher & Novosad, 2019). It is a program of considerable political and economic significance and effective delivery of this program is a good marker for public goods delivery, involving state legislators bidding for federal funds and delivering goods at the local level. PMGSY is federally funded but responsibility for road construction is delegated to state governments, and the program by definition involves village-level roads.

Program eligibility involved the village having a population above 1000 till the year 2003 and above 500 after then. Therefore validity of the RD design we use requires that constituencies won by men vs. women in close elections are not systematically different in population size, in particular around these thresholds. Using the 2001 census files, and using both threshold and average population figures at the village level, we test this premise just like we test for continuity across the zero vote margin threshold for other constituency characteristics. The results are in Table 12 and show no significant differences in population size.

Using data for 2004–2012 and the RD approach used for the main analysis, we investigate whether the share of incomplete roads relative to awarded road projects is a function of legislator gender. Table 5 reports the point estimate of the discontinuity. We find no significant difference in contracts allocated (Panel B of Table 5).Footnote 30 However, the share of incomplete roads is 22 percentage points lower in constituencies with female legislators (Panel A of Table 5).Footnote 31 This difference is significant across a range of bandwidth choices and robust to replacing the linear with a quadratic smoother.Footnote 32 Panel B of Fig. 9 shows the RD plot of the share of incomplete roads against the margin of victory.Footnote 33 We observe a discontinuous drop in the share of incomplete roads at the threshold margin of victory of zero, in line with the regression results.Footnote 34

Our findings reject the presumption that men are more effective at delivering growth-producing infrastructure. Since road construction in India has been shown to produce higher returns in terms of job mobility for men than for women (Asher & Novosad, 2019), our findings establish that women deliver public goods beyond those that serve the interests of women. The qualities that lead women to achieve higher completion rates may include efficiency, mission or lower corruption, all of which are related to effective delivery of public goods. In the next section we examine corruption and in the section on external validity we discuss evidence consistent with women legislators having greater intrinsic motivation than men.Footnote 35

7.2 Corruption in office

Following Fisman et al. (2014), we use growth in assets during office as a proxy for corruption. Since assets are only recorded in affidavits submitted by candidates when standing for election, Fisman et al. (2014), restrict the sample to candidates who contest for two consecutive elections, whether or not they win. They find higher asset growth for winners than for runners-up in close races, estimated as a difference of 3 to 5% p.a. and interpret this as evidence that politicians leverage public office for private benefits by engaging in rent-seeking activities.Footnote 36

Fisman et al. (2014) do not distinguish between male and female legislators. We adopt their sampling and measurement strategy but rather than compare winners with runners up in close races, we compare women who won in a close race with men who won in a close race. Regression estimates are in Table 6. Column (1), using the IK bandwidth, shows that asset growth during an electoral term is about 60 percentage points lower among female legislators. This translates into a 12 percentage point per annum difference in the rate at which male vs. female legislators accumulate rents in office.Footnote 37 As a benchmark, note that the mean annual growth rate of assets in the sample (averaging over all legislators) is 23 percentage points.

If we halve the bandwidth, this coefficient is similar but less precisely determined (column 2). If we double the bandwidth, the coefficient falls a bit more but is statistically significant. The result is robust to replacing the linear with a quadratic polynomial (column 4). Across the columns, the coefficients are not significantly different from the coefficient in the first column. Panel C in Fig. 9 plots asset growth between elections t and t+1 against the margin of victory between winners and losers (of opposite gender) in election t, confirming a discontinuity in asset growth at the zero margin of victory.Footnote 38

Overall, this constitutes compelling evidence that women legislators are less likely than men to exploit their office for personal financial gain. It indicates lower corruption as one likely contributor to the economic advantage of women legislators given evidence that lower corruption is conducive to economic growth (Dollar et al., 2001; Swamy et al., 2001; Mauro, 1995; Prakash et al., 2019).Footnote 39

A possible take on our finding of lower corruption among women legislators is that they tend to have less political experience and have not yet learned the ropes (Chaudhuri et al., 2022). If this were the case, gender differences in corruption would disappear as women’s political tenure lengthens. We respond to this potential concern in three ways. First, we note evidence that the association of experience in politics with corruption is not necessarily positive.Footnote 40 Second, we emphasise that even if tenure rather than gender were driving this result, policies the world over that are introducing new women into politics will tend to lead to lower corruption. Second, in the following section we investigate a measure of corruption that is available before the candidate takes office. If we were to find gender differences in this measure of criminality that project onto differences in growth once the candidate is elected, this result would indicate a role for corruption that is independent of legislator tenure. If at all, we may expect larger differences in pre-election characteristics if politicians in office face stricter scrutiny and are subject to a re-election constraint which encourages them to act in more accountable ways. Alternatively, they may develop a sense of duty once they attain office if “office ennobles” (Brennan & Pettit, 2002; Benabou & Tirole, 2003).Footnote 41

7.3 Candidate characteristics

In India, following passage of the Right to Information Act, all political candidates are required to file affidavits that include various information including whether or not they are carrying pending criminal charges. Using these data, we compare characteristics of male and female legislators in the analysis sample of mixed-gender close elections, see Appendix Fig. 10 and Table 13. In the close election sample (and also in the full sample of all mixed gender elections), there is no significant difference in education and wealth between male and female legislators. However, women legislators are significantly less likely than men to be carrying criminal charges and slightly younger on average.

In the close election sample, about 10% of women legislators face pending charges,Footnote 42 in contrast to about 32% of men.Footnote 43 It seems plausible that legislators with a criminal record are more likely to practice corruption, to have priorities other than economic development and, to be less likely to provide a stable business environment for growth. Using the RD approach developed in Prakash et al. (2019) on the expanded set of states in our sample, we estimate that luminosity growth is 16.8% points smaller in constituencies led by a legislator carrying pending criminal charges. Scaling this (gender-neutral) estimate by the difference in the propensity for criminality between men and women (a 21.8 percentage points difference in our close election sample—see Table 13) suggests that it can explain about 24% of the estimated growth premium associated with women legislators.

While the validity of a close election design depends on balance in constituency characteristics around the RD threshold (which we demonstrated above), it does not require balance on candidate characteristics. In fact, if men and women were identical, then the question of whether legislator gender influences economic performance would be void.Footnote 44 However, if criminality were to predict winning this could be problematic for our identification strategy. We therefore examined this on the mixed-gender sample, and we find no evidence of it (Table 19).

Differences in criminality between men and women legislators are consistent with experimental evidence that women are more risk-averse than men (Eckel & Grossman, 2008) and more patient (Silverman, 2003) since risk taking and high discount factors are positively associated with crime (Mastrobuoni & Rivers, 2016). If experimental evidence captures inherent personality traits, then differences in criminality are unlikely to erode over time, as more women join politics, or as women acquire longer political tenure.

7.4 Discussion

The results for intermediate outcomes serve two purposes. First, they lend plausibility to the main result by identifying mechanisms by which women legislators achieve higher luminosity growth. It is compelling to find that women do better on five different outcomes, drawn from different data sets. Second, they allay potential concerns over what luminosity growth captures. Consider two likely concerns. One is that luminosity growth captures expansion of street lighting, and that women leaders invest more in street lighting to ensure the safety of women in public places. There is some evidence that increasing public safety for women increases women’s economic participation which in turn increases economic activity (Borker, 2020; Siddique, 2020). Another natural contention is that our results for luminosity growth demonstrate that electricity provision improves under women legislators. This would be consistent with women legislators improving growth as electricity is the lifeblood of the modern economy and electricity supply is a known constraint on output in India (and other developing countries) (Allcott et al., 2016; Dinkelman, 2011; Rud, 2006; Lipscomb et al., 2013).Footnote 45

Nevertheless, for the skeptic who is not persuaded by the evidence for India and other countries that luminosity growth proxies economic growth (Sect. 4.1), our findings for road infrastructure, non-farm employment and corruption point to women legislators improving growth prospects. To summarise, if women leaders do improve street lighting and electricity provision (outcomes that we are unable to measure at the constituency level over time), (a) these outcomes are growth-enhancing, and (b) our results for the five outcomes we can analyse show that women leaders achieve more than merely an increase in street lighting and electricity.

Part of the explanation for why women legislators perform better than men is likely to be that they are positively selected. In view of evidence that women face stronger barriers to political candidacy than men, we expect that women candidates are positively selected relative to male candidates. However, even if the ability distribution of men and women was similar, the marginal female entrant would be positively selected because the baseline share of women is small (also see Besley et al. (2017)). The latter tendency will dissipate with an increase in the share of women legislators, but it seems plausible that a performance gap in favour of women will persist if women have stronger intrinsic motivation, or are intrinsically less corrupt (Andreoni and Vesterlund (2001) provide experimental evidence that women are intrinsically more fair). In line with our broad findings, Ashraf et al. (2022) find that female workers are, on average, more productive than male workers and that the gap is larger when the baseline share of women is lower. They attribute this to positive selection, highlighting that this symptomizes a misallocation of talent. Our estimates may be seen as a measure of the potential gain from lowering barriers to entry to women in politics and thus correcting some of this distortion.

8 Heterogeneity

In this section we investigate differences in the relative performance of male and female legislators in sub-samples distinguished, first, by an indicator of human development (a correlate of corruption) and, then by the sex ratio at the state level (Table 7).Footnote 46 We also explore heterogeneity by party alignment and gender of the state minister and the education and incumbency status of the legislator (Table 20). The differences in coefficients reported in Table 19 are in general not statistically significant but, in most cases, are of a considerable magnitude.Footnote 47

Institutional environment If clean governance is a reason that women-led constituencies experience higher growth, we may expect that women make a larger difference in institutional environments where (male-dominated) corruption is pervasive. Using the Human Development Index as a proxy for the prevailing quality of government (Sen & Dreze, 2005) and splitting the sample into states with HDI above or below the median value in 1999, we find that women are only significantly better than men at producing growth in the less developed states, where the coefficient is twice as large, see columns 1–2, Table 7.

Indicator of gender progressively at state level We used data from the 2001 census to construct the ratio of females to males at birth, widely used as a measure of progressiveness with regard to the status of women (Sen, 1992). We re-ran the main specification on two groups of states, defined by their sex ratio being below the 25th percentile and above the 25th percentile, see columns 3–4 in Table 7. We see a clear pattern, indicating that women legislators do no better than men (though still no worse) in states with the most male-biased sex ratios. This is consistent with women having more limited agency, their hands may be tied by men. For example, male contractors for road works may not take their cues from them, or their husbands may force them to be corrupt while in office.

Party alignment and gender of state minister State governments may have an incentive to favor aligned politicians in the allocation of public resources (Brollo & Nannicini, 2012; Asher & Novosad, 2017). If aligned legislators have more resources to work with and if the growth results emerge from women legislators making better use of these resources, then we should expect to see larger differences in female vs. male led constituencies in the sample of constituencies that is aligned. This is what we find. The difference between female and male legislators is 50% larger in the aligned sample. Although the difference between the two samples is not statistically significant, it is large. See columns 1–2, Table 20.Footnote 48

On the other hand, if female chief ministers favor female legislators, women may outperform men under female chief ministers not because they use resources better but because they are favoured. To investigate this, we estimate the baseline RD specification on subsamples of states ruled by female vs. male chief ministers (column 3–4, Table 20). We find no evidence of favoritism along the lines of gender. The sample with male chief ministers, which contains 85% of cases (states) exhibits a growth difference in favor of female legislators similar to the full sample results, while the smaller female chief minister sample shows a small and insignificant coefficient.Footnote 49

Education, caste and incumbency of legislator We showed earlier that there is on average no significant difference in the level of education of female and male legislators in the close mixed-gender sample. So education is unlikely to be a mechanism. However, given an interest in the relationship between politician education and policy choices (Besley et al., 2011), we investigate whether the relative success of women emerges from samples of more or less educated legislators. We separate the sample into constituencies led by legislators with at least ten years of education vs. those with less (column 5–6, Table 20). Growth in luminosity is only higher in women-led constituencies in the sample in which leaders are more educated. The results are similar if we cut at twelve years of education. Examining heterogeneity by caste of the legislator (columns 7–8), we find that the growth premium derived from electing women leaders is driven by high caste women. This is consistent with high caste women being more educated.Footnote 50 Finally, dividing the sample into incumbents and non-incumbents, we identify a larger male–female growth difference among incumbents (columns 9–10). Our proposed explanations of these results are speculative but they line up with our earlier results in suggesting that women use available resources with more effect for growth than men, insofar as their education and experience are such resources.

9 Analysis of behaviour outside the RD sample

Our first result, that luminosity growth is discontinuously lower when a man rather than a woman wins by a narrow margin was displayed in Fig. 6. The RD estimate shows a statistically significant difference. However, Fig. 6 also shows that outside the IK bandwidth (which, as noted in the Tables, is roughly 6%) luminosity growth in constituencies won by men vs. women is similar (note that men do not do better at any victory margin). It is not unusual that the causal RD estimates for close victory margins differ from the descriptive estimates for non-close victory margins as the latter are potentially contaminated by selection. In this section we discuss how representative close elections are likely to be of all elections in India, and then consider selection into the close election sample at constituency and candidate level.

Close mixed-gender elections in India are representative of all mixed-gender elections. In particular, a third of all mixed-gender elections are within the optimal bandwidth and about half have a victory margin of less than 10%. The median victory margin is 10.5% for women and 10.4% for men in the entire sample (the 25th percentile is about 4% and the 75th percentile is about 19% for both female and male winners). This directly diminishes concerns that our results have limited validity. We nevertheless now consider why men who win with narrow margins perform worse than men who win with wider margins to address the possible concern that the poorer performance of men relative to women in close elections stems from their being a bad lot.Footnote 51

Constituency characteristics Constituencies won by men with narrow margins may have been a bad selection compared with constituencies won by men with wider margins. For example, they may have historically struggled with generating growth. However, the balance plots in Fig. 4 and the corresponding data in Panel A of Table 24 show no meaningful differences between these two sets of constituencies.Footnote 52

Candidate characteristics including dynastic links An alternative possibility is that men who win in narrow races are selectively worse than men who win with wide margins. We find no evidence of this using characteristics available in the affidavit data, including education and wealth, see Panel B of Table 24. Using data recently created by George (2019), we compare the dynastic links of candidates, that is, whether a parent or spouse preceded them in political office. We find that men who win in close elections are more likely to have dynastic links (17.4%) than men who win with wide margins (13.6%). Since dynasts are less effective leaders over an election term (George, 2019), this can explain their poorer performance, evident in the dip to the left of the threshold in Fig. 6. However, dynastic links cannot explain the male–female performance gap in close elections. Using our RD design, we show that the probability that the winner is a dynast is invariant to the victory margin (Fig. 11). In close elections, the share of dynasts is 15.9% among women and 17.4% among men, and the difference is not statistically significant.Footnote 53

Unobservable candidate characteristics/quality We further investigate if men who win in close races are negatively selected on unobservables, adapting to our setting a test proposed in George (2019). The idea is that candidates who win with a narrow margin—relative to candidates who win with a wide margin—are either weaker candidates or unlucky. The trick is to use swings in the state-level vote share of the candidate’s party to measure luck, as aggregate party swings constitute a shock to the individual candidate’s victory margin.

The party swing of the winning candidate, \( \text {Swing}_i\), in a mixed-gender race is defined as follows:

$$\begin{aligned} \text {Swing}_i=\Delta \text {Party of winning candidate}_{t} - \Delta \text {Party of losing candidate}_{t}. \end{aligned}$$
(4)

\(\Delta _{t}\) is the state-level vote share of candidate k’s party in the state election in t minus the same share in the preceding state election in t-1. \(\text {Swing}_i\) hence captures the swing experienced by the party of the winning candidate i, relative to the party of the runner-up.Footnote 54

Candidates who win in a close race in a year with a positive net party swing (\(\text {Swing}_i > 0\)) are a relatively “bad” selection (they won with a narrow margin despite a positive party swing) and those winning during a negative party swing are a relatively “good” selection. We estimate impacts of legislator gender on luminosity growth for candidates winning during positive vs. negative swings. The estimates are similar and statistically indistinguishable, see Table 25. This makes it unlikely that candidate quality drives our results. Our main result is robust to accounting for negative selection among men in close elections. Even if we focus only on good candidates (who won despite a negative party swing), women perform better than men.

Electoral incentives A potential explanation of the difference in outcomes of close vs. non-close elections is that legislators who win in close races face more stringent electoral incentives than those who win with comfortable margins (because their re-election is more uncertain). That politicians pursuing a narrow electoral agenda have an incentive to distort economic policies has been discussed in a literature on distributive politics, which highlights this as a drawback of democratic politics (see e.g., Mani & Mukand, 2007; Cole, 2009; Golden & Min, 2013). Politicians may induce electoral cycles, engage in vote buying, or target resources to key electoral groups for purely electoral reasons; see Cole (2009), Mitra et al. (2017), Arulampalam et al. (2009) for evidence from India. With the exception of Brollo and Troiano (2016), this literature provides limited evidence of whether men are more likely than women to fall prey to electoral incentives.

We argue that if men are more opportunistic than women then we may expect the pattern seen in Fig. 6. We find (descriptive) support for this in comparing re-election rates of men and women in the mixed-gender election sample, see Table 26. Men and women elected with wide margins are equally likely to be re-elected, the chances being 30–35%. Among legislators who win in close races, men have a similarly high re-election rate of 27%, but women have a substantially lower re-election rate of about 18%, despite their better growth performance.Footnote 55 These estimates are consistent with women being less likely to engage in economic distortions even if it costs them electoral defeat. The results generalize in the sense that if a non-close man were to find himself in a close election, he would also behave opportunistically. We note again that close elections are not special cases, a third of all elections being close.

There are other possible explanations of lower growth in competitive constituencies with male legislators. One is that politicians with shorter expected tenure have less influence over the promotion of bureaucrats. In line with this, Nath (2016) shows that the performance of bureaucrats is worse in such constituencies. Women may be able to improve bureaucratic performance even without explicit control over promotions if they are more efficacious or intrinsically motivated. For instance, our result that road completion rates are higher in constituencies with female legislators is consistent with women exerting more effort to monitor bureaucrats effectively.Footnote 56

10 Conclusion

We estimate that women legislators in India raise economic growth (GDP) in their constituencies by 2.3 percentage points per annum more than male legislators. We find no evidence of negative spillovers from female-led constituencies, which suggests considerable overall growth gains. These are, as far as we know, the first causal estimates of the impact of legislator gender on economic activity.

Investigating mechanisms we find that women legislators are more effective at overseeing completion of road infrastructure projects (the share of incomplete projects being 22 percentage points lower) and increasing non-farm employment (by 0.84% points p.a.), they are less likely to rent-seek while in office (personal asset growth is about 12 percentage points p.a. lower), and only about a third as likely as men to be carrying pending criminal charges when they enter office. We also find evidence consistent with women legislators being less likely than men to distort economic policies in order to achieve electoral gains. Thus it seems that economic activity improves under women legislators on account of them being more efficacious, less corrupt and more intrinsically motivated. We note that this array of results makes it unlikely that what we capture is only that street lighting or electrification (which manifest in luminosity growth) improve under women leaders, also noting that both are potentially important contributors to growth in developing countries.

A lower initial share of women in government implies that the marginal female entrant will be higher ability than the marginal male entrant, and this may be reinforced by discrimination against women. Against this, as the share of women grows, average female tenure will fall. Our results are consistent with female politicians having higher ability.Footnote 57 Our findings are potentially relevant to the many (richer and poorer) countries in the world that have a small but growing share of women in the legislature.

To the extent that opportunities for corruption decline with development, any female-advantage that derives from lower corruption will tend to dissipate with development. In line with this, we find some evidence that the gender gap in legislator performance is smaller in the more developed states of India but, in general, it is unclear that these differences will disappear altogether if lower criminality and corruption are intrinsic to women. Gender differences in intrinsic motivation may persist, and our finding that women achieve higher road completion rates is not significantly different in more vs. less developed states of India. Overall, our analysis suggests that differences in economic performance by legislator gender may narrow but not necessarily close with economic development. Further work in other settings is merited.