Abstract
In some industry segments, more than 70% of leadership positions are held by men. This can often lead to the suggestion that men are more successful in leadership roles, particularly in areas where women are viewed to have no practical experience. Unfortunately, it is often difficult to find women and men in leadership positions where performance of the leader can be objectively evaluated, which can make it likewise difficult to establish if the gender identity of a leader makes a measurable difference. While men’s sports are an example of an industry dominated by gendered thinking in leadership, as women are assumed to be worse candidates for coaching positions due to a lack of familiarity with playing men’s sports, there is an exception to this general trend in women’s sports. In women’s sports, both women and men work as coaches, often in equivalent roles and positions. Consequently, we can scientifically evaluate if the gender of the coach impacts the outcomes we observe. In this article, we specifically consider the sport of college softball, where we note a more equal breakdown in coaching by gender than most other sports. The evidence from college softball indicates the gender of the coach does not impact outcomes. Specifically, we find the gender of the head coach does not appear to alter the performance of individual hitters and thus a team’s offensive production. Such findings challenge the gender stereotypes we see in coaching and leadership hiring.
Similar content being viewed by others
In 1950, the labor force participation rate for women was only approximately 33%.Footnote 1 Paid employment outside of the home was predominantly an activity for men. By the end of the 20th century, however, the labor force participation rate for women was 60%. Both women and men had become active participants in the paid labor force.
Despite this increased participation by women, many inequalities clearly remain. As Lennon (2013) documents, women only occupy 20–25% of all leadership positions across a variety of the top industry segments (i.e., technology, business, law, education). This underrepresentation of women in leadership roles has been attributed to gender discrimination and unequal opportunities for advancement (Koenig et al., 2011). In general, previous research focused on this stark discrepancy in leadership representation has attributed the discrimination and unequal opportunities to the presence of gender stereotypes that limit a woman’s ability to advance in an organization and thrive in a leadership position when one appears (Badura et al. 2018; Begeny et al. 2020; Darvin et al. 2018; Pullen and Vachhani 2021).
This poses a natural question, then: are women preferred less than men for general leadership roles, or would women be preferred if discrimination due to gendered stereotype thinking could be eliminated? Heilman (2012) noted that there are two distinct forms of gendered stereotype thinking. The first is descriptive, defined as stereotypes about how men and women are alike. Descriptive stereotypes will lead one to conclude that a woman can only succeed as a leader if she adopts the characteristics and mannerisms seen with male leaders (Heilman 2012; Manzi and Heilman 2021). The second set of stereotypes are prescriptive, and these are defined as stereotypes about what women and men should be like. Footnote 2 In these stereotypes, women are penalized when they do not act as people think women should act.
Thus, women in leadership positions often face what Janet Holmes refers to as “the tightrope of impressions management”.Footnote 3 If a woman behaves as a man stereotypically behaves in a leadership position, she is penalized for not acting like a woman. On the other hand, if a woman follows the stereotypes of how women are perceived to behave, she is penalized for not acting like a male leader. In sum, both descriptive and prescriptive gender stereotypes (i.e., both sides of the tightrope) can impede the ability of a woman to advance into a leadership position and maintain and succeed once she arrives (Badura et al. 2018; Begeny et al. 2020; Pullen and Vachhani 2021).
Unfortunately, examining leadership stereotypes is difficult in many industry settings. In addition, in many industry settings, it is highly difficult to objectively test the impact a leader has on the outcomes observed and the performance of their subordinates. Thus, attempts at debunking the discriminatory stereotyping women face are limited and serves to further institutionalize the notion that women are not inherently equipped to be successful leaders (Manzi and Heilman, 2021; Walker et al., 2017). However, if these stereotypes are indeed true, this would suggest it is not only harder for women to succeed in leadership positions, but that they are somehow presumed to maintain skillsets that would equate to less success in these roles. In contrast, men would instead be inherently equipped to perform better as leaders regardless of the context of the leadership role (Badura et al. 2018; Begeny et al. 2020).
Most researchers think of leadership in the context of boardrooms and business management exclusively, though, which can be limiting. Would we expect the same results to hold true in more specialized fields, particularly fields where women might instead have a perceived subject matter advantage over men?
Athletics provides just such a case study. Nowhere is this gendered thinking on greater display than in the area of men’s sports leadership, specifically coaching. Women are rarely if ever considered for coaching positions at the highest levels of collegiate and professional men’s sports due to the perception that playing experience is required in order to coach. There is however an exception to this issue of examining leadership performance and success: the women’s sports industry (Darvin et al. 2018; Wicker et al. 2019). In women’s sports, both men and women are found in leadership positions (i.e., the head coach role) at substantial numbers (Darvin & Lubke, 2021). As a result, the women’s sports industry can be used to examine the impact both women and men have on the organizations they lead in a setting where the organizational circumstances and expected outcomes are roughly the same for all coaches (Darvin et al. 2018; Wicker et al. 2019). Further, we can focus on a women’s sport that few if any men play at a significant college or professional level: softball.Footnote 4 If gendered stereotypes for leadership are true, men should perform better as coaches for women’s softball despite not having played the game they are coaching. If playing a sport is required to be a successful coach of that sport, then women should perform better as coaches for women’s softball.
The organization of our inquiry will be as follows. We begin with a discussion of leadership in sports that will review the impact coaches have on performance, as well as the prevalence of women leading in college softball, the sport of choice for our analysis. From there we will move on to our empirical study of this topic. This begins with the measurement of hitter performance in college softball. With measure in hand, we will then build and estimate an empirical model to evaluate how the gender of the coach impacts player performance. A discussion of this model’s estimation will then conclude the paper.
Gendered Sport Representation
In 1971, an estimated 3.7 million boys played high school sports in the United States. At the same time, only 300,000 girls played high school sports (Acosta & Carpenter, 2014). These proportions fed the long-standing stereotype that boys were simply more interested in sports than girls (Staurowsky et al. 2022).
If that stereotype were true, however, the events of 1972 would not have changed sports participation rates. In 1972, President Richard Nixon signed the following statement into law:
No person in the United States shall, on the basis of sex, be excluded from participation in, be denied the benefits of, or be subjected to discrimination under any education program or activity receiving Federal financial assistance.
This law, known colloquially as Title IX, prohibited institutions that receive federal funding from discriminating on the basis of gender (Stevenson, 2014). Although the statement does not mention sports explicitly, after Title IX passed into law, high schools and universities were no longer permitted to provide athletic opportunities to men that were not available to women (Staurowsky et al. 2022; Stevenson 2014).
As Zimbalist (2001) notes, by 1978 there were 2.1 million girls playing high school sports. In other words, participation in girls’ athletics increased about 700%. Zimbalist notes a similar story at the college level as well. In 1971, fewer than 32,000 women played college sports. By 1977 there were more than 64,000 women participating in college athletics (Zimbalist, 2001).
According to the NCAA (2022), today women comprise 47% of all student-athletes. While this demonstrates a significant and substantial move toward participation equity, this may not be entirely consistent with Title IX. One of the primary interpretations of Title IX is that opportunity should match student population. Women comprise more than 50% of college students in the United States, but they are decidedly less than 50% of athletes. Nevertheless, substantial progress has been made.Footnote 5
That is, substantial progress has been made in participation for collegiate athletics. Although women have made considerable gains with respect to athletic overall opportunities, the same cannot be said for sports leadership positions.
Gendered Sport Leadership
In terms of leader and employee representation, the sport industry has endured as one of the most male prominent segments of society (Fink 2016; Lapchick et al. 2020). Previous literature has established a variety of obstacles and barriers that exist due to the low proportion of women employees in this industry, including gender stereotyping and discrimination (Burton, 2015; Fink 2016; Forsyth et al. 2019; LaVoi, 2016). More specifically, the masculine nature of the sport industry is ripe for both descriptive and prescriptive gender stereotypes regarding how women should behave and act in the setting (Burton, 2015).
The outcomes of such gendered stereotypes result in much lower proportions of women leaders in widely varying occupations throughout the industry. For example, in terms of the athletic director position, women only account for roughly 25% of all collegiate athletic director roles regardless of governing body or level of play (Lapchick et al. 2020). Specific to sport coaching, women as head coaches, the most prominent leadership-level role of sport coaching, only account for roughly 40% of all head coach positions for collegiate women’s teams and less than 1% of men’s college teams (Darvin 2020; Lapchick et al. 2020). The underrepresentation of women as head coaches has been attributed to the same traditional gender stereotypes that women face in obtaining leadership positions across industry segments (Fink 2016).
Specific to this gendered representation, when Title IX was passed in 1972, roughly 90% of women’s college teams were coached by women (Hruby 2021). Across the next 50 years, while the number of women playing college sports continued to increase, the proportion of coaching opportunities that went to men also increased (Darvin and Sagas 2017). According to the NCAA (2022), 58.7% of head coaches in women’s college sports were men. At the same time, only 5.8% of head coaching jobs in men’s collegiate sports went to women (NCAA, 2022).Footnote 6 Consequently – just as Zimbalist observed in broader leadership roles across industries – men hold about 75% of all head coaching positions in college sports. In 2019, 88% of athletic directors in Division I college athletics were men (Hruby 2021). As LaVoi argues (Hruby 2021):
One of the theories that we often use is what social scientists call homologous reproduction. It basically means that people like to hire people like themselves. It’s very similar to affinity bias, in which we warm up to people like ourselves. We like to surround ourselves with people who are like us because it affirms our identities – and that’s true for athletic directors, too.
Elsesser (2019) offered a similar explanation for this pattern:
The truth is men dominate coaching for the same reason that they run most of our Fortune 500 companies and our country. When we think of leaders, we tend to think of men. We want someone to lead our team, our company or our country, then our experience and unconscious bias makes us gravitate toward men.
This pattern suggests that somewhere, women are being discriminated against in the hiring process.Footnote 7 However, there is another explanation. The market – both in sports and outside of sports – may indicate men are simply better leaders than women.
Outside of sports, it can be difficult to say if the market is correct in this assumption. In women’s sports, however, we have an opportunity to test the hypothesis. Both men and women coach women’s sports. If men truly are better coaches, we should see male coaches performing better than female coaches. This would indicate that even in an arena where women have more direct experience (women’s sports), the natural leadership advantages of being male support hiring a male coach over a female coach.
Before this can be tested, we must first define what coaches are being asked to accomplish to determine “better” performance. This is not hard to define when examined from the perspective of a team’s owner/operator. Whenever a team fires a coach and hires a new one, even if the team intends to rebuild, it is implicitly making the following statement: “A different coach would get more wins for this team.” While coaching changes may not be made purely on the basis of player productivity (for example, a belief that a new coach would be stronger at recruiting may factor in), the statement does indicate that at the very least a current coach has not been productive with the players they have. There is an underlying assumption that coaches have some basic ability to alter the productivity of the athletes they lead, which can be seen in players’ decisions to play for a particular coach rather than a particular school or program. Much of the basis of player recruitment is in the argument, “I am the coach who can take your playing to the next level.”
Bum Phillips put the argument this way in discussing the coaching skills of NFL legend Don Shula:Footnote 8
“Don Shula can take his’n and beat your’n. Or he can take your’n and beat his’n.“
Given this view of coaching, if men are being hired over women, it must be the case that men are better at getting improved performance from the athletes they lead. If that were not the case, all else being equal, why prefer men? Or put another way, why not prefer women, even in arenas where women are not traditionally found?
Coaching College Softball
To address this question, we will consider the sport of college softball. This is an interesting sport for men to coach. Unlike sports like basketball, volleyball, soccer, or lacrosse, men do not typically play in dedicated fastpitch softball competitions. There is a men’s fastpitch team who competes for the USA in international competitions,Footnote 9 but unlike the opportunities available for women, fastpitch softball is not offered to men at the collegiate level, and there has been little effort to create a professional men’s fastpitch league. Consequently, men without experience playing the game themselves are being hired to lead women’s fastpitch softball teams. Even though arguments can be made that the rules of men’s baseball and women’s softball demonstrate clear similarities, this similarity does not translate to the collegiate experience. As noted across the literature evaluating Title IX funding in college sports (e.g., Shaw 1995; Yanus & Karen, 2016; Druckman et al. 2018), women’s sports are significantly underfunded relative to men’s sports. Because of this, recruiting, playing facilities, and even player experience varies significantly as well. Playing collegiate baseball may give some insight into playing collegiate softball, but they are clearly not direct equivalents.
This issue of equivalency is often raised in the opposite direction when women are not considered for coaching positions in football or baseball, sports traditionally offered almost exclusively to men. Since women are thus often excluded from both sports, organizations argue women are not qualified to lead those teams. If this is true, though, certainly a similar argument must apply to men with respect to fastpitch softball. In practice, however, athletic directors have often looked past this issue to still hire men – who again, have never truly played the same game they are coaching – to lead these teams of women.
NCAA Softball Data
For this study, we employ an extensive set of individual player data scraped from the NCAA softball statistics website. We have collected data on NCAA Division-I softball teams from 2012 to 2022. For concision, we will consider a “season” to be a single team over a single year, so these years provide a sample of 2,662 team/season observations, or 2,662 “seasons.”
After further review, we verified 2,627 of these observations as seasons where a single person coached the team for the entire season. In this sample, there are 1,775 seasons where the team was solely coached by a woman, and 852 seasons where a team was solely coached by a man. In other words, 68% of the time a team was coached by a woman compared to 32% coached by a man.
The data set includes 595 total coaches, where a coach may have coached across multiple seasons and/or teams (though always completing a full season with a given team). Of these 595 coaches, 391 were women, and 204 were men. In other words, 66% of the coaches in the data sample are female. Subsequently, that means from 2012 to 2022, there are approximately 200 cases where a Division I athletic department hired a person to coach their softball team who has little to no experience playing fastpitch softball. If experience does truly matter to coaching, even 34% of coaching positions going to men would be unexpected if not disastrous for their programs.
Of course, if men are naturally superior leaders, this decision might be justified. Looking at the win-loss records of the teams, though, this story seems to lack support. Across the data sample, teams coached by women played 91,908 games and won 49.5% of the time.Footnote 10 Teams coached by men played 43,698 games and won 50.8% of the time. This appears to indicate there is not much difference between men and women in terms of team records.Footnote 11
Although coaches are often evaluated in terms of their team’s win-loss records, such records are not necessarily about the coach. Players on the field win or lose games. As noted, one key function of the coach is to find a way to get those players to play better. That is, a coach with great players they never impact is not as good a coach as a coach with relatively poor players who manages to elicit some improvement from those athletes. The former coach would likely have a better record. But it is the latter coach that is better at coaching.
Given this, our approach to evaluating coaching performance is similar to the approach used in studies of coaching in basketball and baseball.Footnote 12 We will begin with a measure of player performance. More specifically, we will focus on the productivity of hitters in softball.Footnote 13 After measuring the performance of these players, we then construct a model designed to evaluate if the gender of the coach systematically alters the performance of softball players. Again, the story told in many places is that men are better at coaching, and in fact, they are so much better at coaching that gender trumps experience. If that is true, then having a man as a coach should systematically lead to improvements in hitter performance. If instead experience wins overall, having a woman as a coach should instead lead to improvements in hitter performance.
We report the overall summary statistics of the measurement variables in our data in Table 1. The statistics used here are further defined when used in the modeling in the following sections, for those unfamiliar with terms such as “batting average” or “on-base percentage.” (We would also note that although we have included both “men coach” and “women coach” dummy variables in Table 1, only one dummy variable for the gender of the coach is included in the empirical modeling: a women coach or not a women coach. It is separated here to more clearly demonstrate the differences between male and female coaches only. The two separate variables exist in the data set to account for situations where teams do not have a single gendered coach, which does happen in approximately 1% of situations as seen in the table below as the difference in means between the male coach and female coach dummy variables. Examples of this rare situation would be a nongendered coach, a female/male coaching team, etc.)
Measuring the Performance of Hitters
In 2000, Lawrence Kahn offered the following justification for why economists should study sports:
There is no research setting other than sports where we know the name, face, and life history of every production worker and supervisor in the industry.
This extensive data on worker productivity allows economists to examine a host of issues. Before such studies can commence, though, researchers must decide how productivity should be measured.
For hitters in baseball and softball, the choices to evaluate productivity are varied and may seem endless. In 1872, H.A. Dobson introduced batting average.Footnote 14 Batting average is simply a batter’s hits divided by the batter’s at-bats. Of course, not all hits are equal, and hitters will also score runs with actions besides hits. Thus, Dobson’s simple measure has been recently discounted as the best measure of an overall hitter’s value.
Approximately 100 years after Dobson introduced batting average, an economist began using the statistics tracked for hitters in baseball to study the economic value of workers. Gerald Scully’s (1974) classic study compared the wages and marginal revenue product of Major League Baseball batters. To measure the productivity of a hitter, Scully employed slugging percentage, which is calculated as follows:
Slugging Percentage = Total Bases /At-Bats (1).
where Total Bases = Singles + 2*Doubles + 3*Triples + 4*Home Runs.
This measure does assign different values to different hits and as such makes a significant improvement over batting average. Across the next thirty years, slugging percentage was employed in a host of studies.Footnote 15 But slugging percentage – like batting average – also ignores many things a hitter can do to alter the outcome of a game.
More recently, researchers have put additional effort into capturing more of what hitters do to impact their team’s offense. For example, Hakes and Sauer (2006) employed both slugging and on-base percentage, where the latter is calculated as follows:
On-Base Percentage = [Hits + Walks + Hit by Pitch] /.
[At Bats + Walks + Hit by Pitch + Sacrifice Flies] (2).
As noted in Holmes et al. (2018), whereas on-base percentage explains 79% of the variation in runs scored, and slugging percentage explains 82%, on-base percentage plus slugging average (or OPS) explains 91% of the variation in runs scored.Footnote 16 So, slugging percentage and on-base percentage together capture much of what a hitter does to impact a team’s offense.
Holmes et al. (2018) argued that one could also explain 92% of the variation in runs scoredFootnote 17 with three different variables.Footnote 18 The first of these is batting average. Footnote 19 Beyond this measure, these authors also employed isolated power and eye. These are calculated as follows:
Isolated Power = Total Bases / Hits (3).
Eye = (Walks + Hit-by-Pitches)/Plate Appearances (4).
Each of the aforementioned measures is relatively simple to calculate and – by themselves – incomplete. Blass (1992) offered an approach that was both more sophisticated and more complete. More specifically, Blass estimates a regression – similar to the followingFootnote 20 – that incorporates nearly everything a hitter does to impact a team’s runs scored per game:
Runs Scored per Game = b0 + b1*1B + b2*2B + b3*3B + b4*HR + b5*BB + b6*HBP + b7*SB + b8*CS + b9*SF + b10*DP + b11*SO + b12*GO + b13*FO + et (5).
where.
1B = Singles per game 2B = Doubles per game.
3B = Triples per game HR = Home Runs per game.
BB = Walks per game HBP = Hit by pitch.
SB = Stolen bases per game CS = Caught stealing per game.
SF = Sacrifice flies per game DP = Double plays per game.
SO = Strike outs per game GO = Grounded out per game.
FO = Fly outs per game.
This model was estimated with team observations from Division-I NCAA softball from 2012 to 2022.Footnote 21 The results of this estimation are reported in the following table 2.
The results are highly consistent with Blass’ (1992) conclusions for Major League Baseball hitters decades ago. The model explains 94.5% of the variation in runs scored (a mark that is better than observed in any other approach taken). The model also demonstrated some important lessons about understanding softball play. For example, a single in softball appears to be worth an average of 0.55 runs scored. A naïve approach to evaluating hitters and productivity would lead to an assumption that a double should be worth twice that amount, but – just as Blass reported – it is worth slightly less. Likewise, a home run is also not worth four times the value of a single despite being worth four bases rather than the one base of a single.
Turning to the other variables, similar patterns exist with outs in that not all outs are created equal. An out from being caught stealing is significantly more damaging than an out achieved by a hitter at the plate. The reported results indicates that a baserunner in softball has to be successful on 66% of stolen base attempts just to break even on this activity.
From this model we can estimate how many runs each hitter created. Holmes et al. (2018) notes that this model can also be employed to estimate how many wins each hitter produces.
We first estimate the relationship between a team’s winning percentage and both that team’s runs scored per game and their opponent’s runs scored per game. The results of this simple model are presented in Table 3.
All variables in the above regression are per-game measures. Therefore, these results indicate that each additional run per game increases a team’s winning percentage (in decimal form) by 0.073, on average.
Given this measure of runs created, a natural follow-up would be to simply multiply the value of runs in terms of wins by our measure of each player’s runs created. But as Holmes et al. (2018) notes, that is not exactly correct. Pitchers also produce wins. However, pitching only directly impacts the opponent’s runs scored. As long as a pitcher allows the opponent to score, their impact on wins – according to the above model – will always be negative. In other words, the above model indicates that only hitters have a (direct) positive impact on team wins.
Obviously, this cannot be the full specification. Consequently, Holmes et al. (2018) propose a two-step approach, which can be easily illustrated by examining the productivity of Jocelyn Alo in 2022. The raw numbers for Alo are impressive. She finished her career at Oklahoma in 2022 as the all-time leader in career home runs. In 2022, she led all players in home runs, slugging percentage, on-base percentage, and OPS. Her 90.2 runs created also led all of Division-I softball. For this impressive performance, she was the clear first overall pick in the very first draft of the Women’s Professional Fastpitch league.
Clearly Alo was well above the average player. But how many wins was she worth to Oklahoma? The answer begins by ascertaining how many runs an average player would produce if they were given the same opportunities at the plate as the player we are examining. For example, the 51,504 hitter observations in our data setFootnote 22 produced 409,754.8 runs in 4,181,073 plate appearances. Therefore, per plate appearance the average softball hitter produced 0.098 runs. Alo had 226 plate appearances in 2022. An average player – producing 0.098 runs per plate appearance – would have created 22.2 runs given Alo’s opportunities.
Again, Alo produced 90.2 runs. This mark was 68.1 runs beyond what an average hitter would have accomplished. As shown above, these additional runs translate directly to an outsized increase in wins for Oklahoma purely based on Alo’s presence in the lineup.
Modeling the Impact of Coaching
As demonstrated in the previous section, the productivity of hitters in softball can be measured with batting average, slugging percentage, on-base percentage, OPS, isolated power, eye, runs created, and wins produced. Due to the nature of softball, this productivity should not be unduly impacted by the hitter’s teammates, at least not relative to the theoretical impact of the hitter’s coach. Thus, our primary focus is the impact to this productivity due to the hitter’s coach. (We note that better teammates may raise a player’s ability as well, though this can be difficult to quantify. Nevertheless, we will extend our analysis to attempt to take this into account as a later robustness and completeness check.)
This work builds on past studies looking at how coaches and managers impact player performance in sports. For example, in a study similar to this study of softball, Bradbury (2017) looked at how managers in baseball impacted a hitter’s performance in Major League Baseball.Footnote 23 Beyond this work, we also note three studies that looked at player performance in different basketball leagues. Coaching in the NBA was explored by Berri et al. (2009), while Croft et al. (2023) looked at how coaching impacted performance of players in men’s college basketball.
The paper that most closely follows our study of gender and coaching is Darvin et al. (2018). This paper examined how the gender of the coach impacted performance in both the WNBA and women’s college basketball. In both cases, gender of the coach did not impact the performance of women in basketball.
This earlier work inspires the model detailed in Eq. (6). Our dependent variable, Perf, will be the seven measures of performance detailed earlier.
Perf = γ0 + γ1 GENDER + γ4 NewCoach + γ3Class + γ4 DifGP + γ5 LagPerf + ε (6).
Our list of independent variables in Eq. (6) begins with a dummy variable equal to one if the head coach was a woman. As Table 1 shows, 67% of the players in the sample had a woman as head coach. This is especially relevant to our analysis, as the central question in the model concerns the impact of the gender of the coach on player performance.
We acknowledge, however, that other factors may also play a significant role. Beyond the gender of the coach, the model also includes whether or not a player is playing for a new coach. Presumably a new coach will have a different impact than an established coach. We would also expect a player to improve as they progress through college, hence the inclusion of the class a player is in (i.e., second year, third year, etc.). Progress could be derailed, though, by injuries, which we seek to capture by noting differences in games played (i.e., games played this season minus games played the previous season). This also leads to questions of intangible individual player characteristics, such as motivation or work ethic.
Thus, the final factor we consider is a player’s performance the previous season (denoted as lagPerf, where each Performance measure would have a particular lag, e.g., lagPref for batting average would be the lagged batting average). This allows for comparison to measure improvement against the same player, presumably with the same motivation and other personal characteristics. Again, if gender of the coach matters, then player performance should be systematically different from lagged performance depending on whether or not the coach is a woman or a man. As a further robustness check, we also interact the gender variable with the lagged performance variable, under the assumption that bigger gains could be observed due to coaching gender. If so, this would also provide evidence that gender contributes to a change in performance.
Estimation Results and Discussion
The parameters of the model are estimated via a fixed effects model with robust standard errors. The fixed effects here are the conference each player played in. These were employed to control for the level of competition the player faced.
We estimate the model with each of the seven performance variables reported above. In each case, we find that lagged performance and changes in games played matter. Class – or year in school – matters with respect to eye and on-base percentage. This suggests hitters do learn as their college careers proceed and consequently get better at discerning good pitches from bad pitches.
This improvement, though, seems independent of the gender of the coach. Regardless of how we measure performance, the gender of the coach does not appear statistically significant. Notably, the interaction term between each performance measure and the gender of the coach also appears insignificant for all performance measures (Table 4). Footnote 24
Once again, outside of softball, men are consistently the preferred leader, particularly in athletic organizations. Given that men typically do not play fastpitch softball, it seems odd for programs to still choose men as leaders in this environment if player experience truly does make one a better coach. However, since we note no discernible difference in player performance due to coach’s gender, it may be that teams should be indifferent to both gender and playing experience, in which case their decision will be based on external factors.Footnote 25
For this reason, we re-estimate the model by including program expenditures. The data in this case is more limited, but it provides a valuable proxy for many of the otherwise difficult to quantify measures of the quality of a program.Footnote 26 Due to the disruption of the COVID-19 pandemic, we only consider program expense for years up to 2019. This should give all schools accurate representation, since some programs were better able to weather the canceled seasons and low attendance of the pandemic.
Table 5 shows the results of this re-run analysis, this time including program expense (adjusted for real values for equal comparison across years and programs) as a variable to control for the quality of the program and its ability to attract top talent:
As Table 5 shows, expense is clearly significant, but again, the effects of the gender variables remain insignificant. The quality of the program does not appear to materially affect the relationships (or lack thereof) between the gender of a coach and the impact to player performance.Footnote 27
Concluding Observations
There is extensive literature indicating women in leadership positions face more obstacles than men. Hence, it would not be surprising to find some evidence that men – who appear to be faced with the easier task in leadership positions – would perform better overall in leadership roles, regardless of particular experience. However, that is not the evidence we find in exploring college softball data. The evidence of our empirical modeling shows no difference in how women and men perform with respect to changing the productivity of the hitters they lead.
Such a result is consistent with the work of Darvin et al. that found women and men are equally skilled at coaching basketball. Such results appear to be contrary to the arguments presented in both the world of sports and beyond. Men are overwhelmingly the preferred choice when people are hiring for leadership positions. This pattern suggests that many people simply perceive men are better leaders than women.
Obviously, this suggests that qualified women are being passed over for management and coaching positions in areas where they could be considered equally. It also has another, more concerning implication. As Nicole LaVoi observed, the tendency for people to prefer male leaders to women has a negative impact on young girls and women (Hruby 2021): “I think the effect in general – that we know from the literature – is that same-sex, same-identity role models matter. 100% of young men get a male coaching role model. A majority of young women don’t.”
While this is true of role models outside the coaching world, we do know when young people, and particularly young athletes, fail to see people like them in certain roles, they tend to conclude such roles are closed to them. Hence, the failure to consider and hire women in general has implications beyond the organizations doing the hiring. This is especially obvious in sports organizations, where many young women grow up with predominantly male professional sports leagues as their exposure to the world of professional sports and coaching opportunities. It is not that young women do not see female role models in their coaches but rather that seeing women as only qualified to coach female sports will hurt aspirations young women may have outside women’s sports. For example, seeing no female coaches or female referees in professional male sports will lead women to assume those roles are not available to them because of their gender. Young men do not find coaching women’s sports closed to them, since they can see evidence of role models in male coaches even in women’s sports. This seems to suggest a history of playing a given sport should neither qualify nor disqualify someone from consideration to coach that sport.
It is important to note in closing that this work does have limitations. Although we would argue that improving the play of players is a primary job of coaches, it is not the only job. Coaches in college also are charged with the task of recruiting talent and serving in other roles beyond improving current players. For example, this work does not take into account a coach’s role in athletics fundraising. These roles could provide important context to choosing a bigger-name male coach over a lesser-known female coach (though this would again naturally lead to a larger discussion on the opportunities given to males in sports to achieve that greater renown). Further, we would also note that many unobserved variables could contribute to player performance, some of which are likely also impossible to measure.
Where the function of a coach is in improving quantifiable player performance, however, our results show no difference in how men and women demonstrate impact. While extrapolating to the wider business world is beyond the scope of this single study, it does pose a question for men’s sports at both the collegiate and professional levels: do women truly deserve to be discounted on the basis of gender and lack of perceived experience?
Appendix A: Performance Results without Lagged Interactions
.
Availability of data and materials
The data used in this article is publicly available for Division I softball teams and players on the NCAA.com website. Authors scraped the data directly from the website with no changes or additions except the determination of a coach’s gender, which was done based on available official Athletics photographs and noted gender on team websites.
Notes
Data on the labor force participation rate can be found at the St. Louis Federal Reserve’s website: https://fred.stlouisfed.org/series/LNS11300002.
This is a phrase coined by Janet Holmes, author of Gendered Talk, cited by Cameron (2007, p. 141). It was also noted in Cordelia Fine’s book Delusions of Gender.
The authors acknowledge that many male softball coaches do have experience playing collegiate or professional baseball, which may be seen as equivalent to experience in softball. However, we note that playing professional women’s softball is not considered substitute experience in coaching professional baseball, for example, and so we apply the same logic here and in further detail in our methodology discussion.
This progress has not come at the expense of opportunities for men. The (2022) also reports that since 2002, men have gained 73,000 participation opportunities, while women have only added 67,000 such opportunities.
In major men’s professional sports, the picture is even more bleak. The major men’s sports leagues in North America are the National Football League, National Basketball Association, Major League Baseball, and the National Hockey League. In none of these leagues has a woman ever served as a head coach. And Kim Ng – the general manager of the Miami Marlins in MLB since 2020 – is the only woman to serve in that position in any of these leagues.
Previous literature has established a variety of obstacles and barriers that exist due to the low proportion of women employees in this industry, including gender stereotyping (Burton, 2015; Forsyth et al. 2019; LaVoi, 2016). More specifically, the masculine nature of the sport industry is ripe for both descriptive and prescriptive gender stereotypes regarding how women should behave and act in the setting (Burton, 2015). The outcomes of such gendered stereotypes result in a much lower proportion of women leaders in varying occupations throughout the industry.
It is possible for a softball game to end in a tie. So winning percentage is calculated as (wins + 0.5*ties)/ games.
We do note this is a statistically significant difference at the 0.05 level when using a simple two-sample proportion test. However, this is not surprising given the size of the samples, and it ignores any controlling factors such as the strength of a particular program or the opponents a team faced.
Most studies in looking at baseball have only examined the performance of hitters. Though we again point out softball and baseball are not perfect substitutes, we do borrow the measurement mechanic here as an attempt to provide a player performance metric already well-studied in the literature. As noted in Bradbury (2007), a pitcher’s performance is dependent to some extent on the defensive players around them. This is much less true for hitters.
Schwartz (2004, p. 11).
A number of studies in the sports economics literature have used slugging percentage as a measure of performance. A brief list would include Sommers and Quinton [1982], Raimondo [1983], Bruggink and Rose [1990], Hill [1985], Durland and Sommers [1991], Sommers [1993], Krautmann and Oppenheimer [1994], Krautmann [1999], Krautmann, Gustafson, and Hadley [2000], Maxcy, Fort, and Krautmann [2002], Krautmann and Oppenheimer [2002] and Goff, McCormick, and Tollison [2002]. Sommers [1993] also employed a player’s batting average. Slugging percentage has not been the only measure of productivity chosen. Medoff [1976], Hill and Spellman [1983], and MacDonald and Reynolds [1994] measured a hitter’s productivity with runs scored. Such a choice ignores the impact a player’s hitting has upon the scoring of teammates. Sommers [1990] utilized a player’s batting average, or simply hits divided by at-bats. Batting average ignores the quality of a player’s hits and is generally considered inferior to slugging average. Finally, Zimbalist [1992a, Zimbalist 1992b>] utilized slugging percentage with a player’s on-base percentage in the construction of a summary statistic he labeled PROD.
Holmes et al. (2018) regressed runs scored on a variety of measures for hitters using Major League Baseball team data from 1969 to 2016. We repeated the same exercise with our data from NCAA softball. Using 2,656 Division-I team observations from 2012 to 2022, we found that batting average explained 78% of the variation in a team’s runs scored. The explanatory power of on-base percentage, slugging percentage, and OPS were 0.86, 0.83, and 0.90 respectively.
We repeated the same exercise with our data from NCAA softball. Using 2,656 Division-I team observations from 2012 to 2022, we found that batting average, isolated power, and eye explained 91% of the variation in a team’s runs scored.
Holmes et al. (2018) argued that batting average, isolated power, and eye are better measures of performance because they are relatively independent of each other. In contrast, there is a high correlation between slugging percentage and on-base percentage.
Holmes et al. (2018) notes that batting average – by itself – explains 66% of the variation in runs scored.
This data came from the NCAA. The NCAA does not report complete data for softball prior to 2012.
Again, we are looking at Division I softball teams from 2012 to 2022.
Bradbury’s (2017) work looked at how a player’s OPS was explained by a hitter’s past performance, age, and a dummy variable for a player’s manager. The model we employ is similar to this earlier work.
For readers interested in seeing how these results do not change when the insignificant interaction term is not included, please see Table A in Appendix A.
We note this result should hold true regardless of gender and sport. Although we focus on the expansion of coaching to women’s roles in men’s sports, the same could be said in reverse: men should be equally represented in leadership for women’s sports. However, we also note this does not appear to be the same problem in women’s sports as it is in men’s sports currently.
This procedure follows the reasoning outlined in Von Allmen (2013), among others. Because the quality of an athletics program is difficult to measure objectively, the funds for the program can be used to provide a quantifiable measure instead. A better funded program will be able to attract and recruit better players; it will have better playing facilities; it will provide players with more “perks” and updated equipment; etc.
We again note here that our goal is not to create a model to explain each performance metric but rather to demonstrate gender of the coach does not affect the relationships that exist. While it is highly likely some unobserved variables would contribute to models with higher explanatory power, these variables are also likely to be innumerable and prohibitively difficult to quantify, as well as being outside the scope of our argument.
References
Acosta RV, Carpenter LJ. Woman in intercollegiate sport: a longitudinal, national study. thirty-seven year update, 1977-2014. Acosta-Carpenter. 2014.
Anderson D. (1992) “Sports of the Times; at Age 62, Don Shula is still going strong.” N Y Times. (November 1)
Badura KL, Grijalva E, Newman DA, Yan TT, Jeon G. Gender and leadership emergence: a meta-analysis and explanatory model. Pers Psychol. 2018;71(3):335–67.
Begeny CT, Ryan MK, Moss-Racusin CA, Ravetz G. In some professions, women have become well represented, yet gender bias persists—perpetuated by those who think it is not happening. Sci Adv. 2020;6(26):eaba7814.
Berri D, Schmidt M. (2010). Stumbling on Wins. Financial Times.
Berri DJ, Leeds M, Leeds EM, and Michael Mondello. The role of managers in Team Performance. Int J Sport Finance. 2009;4(May):n2.
Blass AA. Does the Baseball Labor Market contradict the Human Capital Model of Investment? Rev Econ Stat. 1992;74:261–8.
Bradbury JC. The Baseball Economist: the real game exposed. Plume Publishing; 2007.
Bradbury JC. 2017. “Hired to be fired: the publicity value of coaches.” Managerial and Decision Economics v38, n7. (October) pp. 929–940.
Burton LJ. Underrepresentation of women in sport leadership: A review of research. Sport Manage Rev. 2015;18(2):155–165.
Croft C, Paulson, Courtney, Stokowski S, Berri DJ, Mike Mondello. The player makes the coach: exploring player development among Division I basketball coaches. Int J Sport Sci Coaching. February 2023;9. https://doi.org/10.1177/17479541221146384. Published Online.
Darvin L. Voluntary occupational turnover and the experiences of former intercollegiate women assistant coaches. J Vocat Behav. 2020;116:103349.
Darvin L, Lubke L. Assistant coach hiring trends: an updated investigation of homologous reproduction in intercollegiate women’s sport. Sports Coaching Review. 2021;10(1):38–60.
Darvin L, Sagas M. An examination of homologous reproduction in the representation of assistant coaches of women’s teams: a 10-year update. Gend Issues. 2017;34:171–85.
Darvin L, Pegoraro A, Berri D. Are men better leaders? An investigation of head coaches’ gender and individual players’ performance in amateur and professional women’s basketball. Sex Roles. 2018;78(7):455–66.
Druckman JN, Rothschild JE, Sharrow EA. Gender policy feedback: perceptions of sex equity, title IX, and political mobilization among college athletes. Polit Res Q. 2018;7(3):642–53.
Durland D Jr, Paul M, Sommers. “Collusion in Major League Baseball: An Empirical Test.” Journal of Sport Behavior. 14, no.1. (March 1991): 19–29.
Elsesser K. 2019. “Here’s Why Women’s Teams Are Coached By Men.” Forbes. (March 1). https://www.forbes.com/sites/kimelsesser/2019/03/01/heres-why-womens-teams-are-coached-by-men/?sh=31576790b3f9.
Fink J. Hiding in plain sight: the embedded nature of sexism in sport. J Sport Manage. 2016;30:1–7. https://doi.org/10.1123/jsm.2015-0278.
Forsyth JJ, Jones J, Duval L, Bambridge A. Opportunities and barriers that females face for study and employment in sport. J Hospitality Leisure Sport Tourism Educ. 2019;24:80–9.
Goff BL, Robert E, McCormick, Tollison RD. Racial integration as an Innovation: empirical evidence from Sports Leagues. Am Econ Rev. v March 2002;92(1):16–26.
Heilman ME. Gender stereotypes and workplace bias. Res organizational Behav. 2012;32:113–35.
Hill JR. “The Threat of Free Agency and Exploitation in Professional Baseball: 1976–1979.” Quarterly Review of Economics and Business. 25, no.4. (Winter, 1985): 68–82.
Hill JR, Spellman W. “Professional Baseball: The Reserve Clause and Salary Structure.” Industrial Relations. 22, no.1. (Winter, 1983): 1–19.
Holmes PM, Simmons R, Berri DJ. Moneyball and the baseball players’ labor market. Int J Sport Finance. 2018;13(2):141–55.
Hruby P. (2021). “More Athletes, Fewer Leaders: The ‘Paradox’ Facing Women in College Sports. Global Sports Matters.” (May 20). https://globalsportmatters.com/research/2021/05/20/paradox-women-college-sports-dr-nicole-lavoi/.
Koenig AM, Eagly AH, Mitchell AA, Ristikari T. Are leadership stereotypes masculine? A meta-analysis of three research paradigms. Psychol Bull. 2011;137:616–642. https://doi.org/10.1037/a0023557.
Krautmann A. and Margaret Oppenheimer (1994). “Free Agency and the Allocation of Labor in Major League Baseball” Managerial and Decision Economics, 15, no.5. (Sept-Oct 1994): 459 – 69.
Krautmann A. (1999). “What’s Wrong with Scully-Estimates of a Player’s Marginal Revenue Product.” Economic Inquiry, 37, no. 2; (April 1999): 369–381.
Krautmann A. Elizabeth Gustafson, and Lawrence Hadley (2000). “Why Pays for Minor League Training Costs?” Contemporary Economic Policy. 18, no.1. (January 2000): 37–47.
Krautmann A. and Margaret Oppenheimer (2002). “Contract Length and the Return to Performance in Major League Baseball.” Journal of Sports Economics, 3, no.1. (February 2002): 6–17.
Lapchick R, Marfatia S, Bloom A, Sylverain S. (2020). “The 2019 racial and gender report card: College sport (TIDES).” The Institute for Diversity and Ethics in Sport. https://43530132-36e9-4f52-811a-182c7a91933b.filesusr.com/ugd/7d86e5_d69e3801bb8146f2b08f6e619bcddf22.pdf
LaVoi NM. A framework to understand experiences of women coaches around the globe: The Ecological-Intersectional Model. In Women in sports coaching. Routledge;2016. (pp. 13–34).
Lennon T. Benchmarking Women’s Leadership in the United States. University of Denver - Colorado Women’s College; 2013.
Manzi F, Heilman ME. Breaking the glass ceiling: for one and all? J Personal Soc Psychol. 2021;120(2):257.
Maxcy J, Fort R. Anthony Krautmann (2002). “The Effectiveness of Incentive Mechanisms in Major League Baseball.” Journal of Sports Economics 3, no.3. (August 2002): 246–255.
Medoff MH. On Monopsonistic Exploitation in Professional Baseball. Q Rev Econ Bus. 1976;16:113–21.
Pullen A, Vachhani SJ. Feminist ethics and women leaders: from difference to intercorporeality. J Bus Ethics. 2021;173(2):233–43.
Raimondo HJ. (1983). “Free Agents’ Impact on the Labor Market for Baseball Players.” Journal of Labor Research. 4, no.2: 183 – 93.
Scully GW. Pay and performance in Major League Baseball. ” Am Economic Rev. 1974;64:917–30.
Shaw P. Achieving title IX gender equity in college athletics in an era of fiscal austerity. J Sport Social Issues. 1995;19(1):6–27.
Sommers PM. and Noel Quinton (1982). “Pay and Performance in Major League Baseball: The Case of the First Family of Free Agents.” Journal of Human Resources. 17, no.3: 426 – 36.
Sommers PM. (1990). “An Empirical Note on Salaries in Major League Baseball.” Social Science Quarterly. 71, no.4: 861 – 67.
Sommers PM. (1993). “The Influence of Salary Arbitration on Player Performance.” Social Science Quarterly. 74, no.2: 439 – 43.
Staurowsky EJ, Flowers CL, Buzuvis E, Darvin L, Welch N. The women’s Sports Foundation 50 years of title IX: we’re not done yet executive summary and policy recommendations. Women in Sport and Physical Activity Journal. 2022;30(2):71–84.
The NCAA. (2022). “Title IX report shows gains in female participation, though rates lag increases by men.” (June 23). https://www.ncaa.org/news/2022/6/23/media-center-title-ix-report-shows-gains-in-female-participation-though-rates-lag-increases-by-men.aspx.
Thorn J, Palmer P. The hidden game of baseball: A Revolutionary Approach to Baseball and its statistics. New York: Doubleday; 1984.
Von Allmen P. (2013). “Coaching Women and Women Coaching: Pay Differentials in the Title IX Era.” In Handbook on the Economics of Women in Sports. pp. 269–289.
Walker NA, Schaeperkoetter C, Darvin L. “Institutionalized practices in sport leadership.” Women in sport leadership, 2017;33–46.
Wicker P, Cunningham GB, Fields D. Head coach changes in women’s college soccer: an investigation of women coaches through the lenses of gender stereotypes and the glass cliff. Sex Roles. 2019;81:797–807.
Yanus A, Karen O’Connor. To comply or not to comply: evaluating compliance with title IX of the Educational amendments of 1972. J Women Politics Policy. 2016;37(3):341–58.
Zimbalist A. “Salaries and Performance: Beyond the Scully Model.” In Diamonds are Forever: The Business of Baseball, Sommers, Paul, editor The Brookings Institution: Washington D.C.; 109–133. 1992a.
Zimbalist A. Baseball and billions. New York: Basic Books; 1992b.
Zimbalist A. Unpaid Professionals. Princeton University Press; 2001.
Funding
Not Applicable. No external funding was received for this work.
Author information
Authors and Affiliations
Contributions
Dr. Courtney Paulson served as lead author and editor, data expert/preparation, and statistical methodology contributor. Dr. Lindsey Darvin served as lead research author and sports management expert for literature review and research development. Dr. David Berri served as lead methodology author and economic expert for methodology creation/implementation.
Corresponding author
Ethics declarations
Competing interests
The authors have no competing interests in the submission or production of this manuscript. The authors are full-time research faculty at their respective institutions and do not receive financial or other personal compensation for this work.
Ethics approval and consent to participate
Not Applicable. No direct subject interaction was involved in this research study, and no ethical concerns apply to the use of the data and methods in the subjects of this research study. Data was collected from the public records at the NCAA website, and no subject experimentation was performed in the course of this study.
Additional information
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Paulson, C., Darvin, L. & Berri, D. Head Coach Gender and Player Performance in NCAA Softball. J Econ Race Policy 6, 173–186 (2023). https://doi.org/10.1007/s41996-023-00121-3
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s41996-023-00121-3