What happens after enrollment? An analysis of the time path of racial differences in GPA and major choice
- First Online:
- Cite this article as:
- Arcidiacono, P., Aucejo, E.M. & Spenner, K. IZA J Labor Econ (2012) 1: 5. doi:10.1186/2193-8997-1-5
At the private university we analyze, the gap between white and black grade point averages falls by half between the students' freshmen and senior year. This outcome could suggest that affirmative action policies are playing a key role to reduce racial differences. However, this convergence masks two effects. First, the variance of grades given falls across time. Hence, shrinkage in the level of the gap may not imply shrinkage in the class rank gap. Second, grading standards differ across courses in different majors. We show that controlling for these two features virtually eliminates any convergence of black/white grades. In fact, black/white gpa convergence is symptomatic of dramatic shifts by blacks from initial interest in the natural sciences, engineering, and economics to majors in the humanities and social sciences. We show that natural science, engineering, and economics courses are more difficult, associated with higher study times, and have harsher grading standards; all of which translate into students with weaker academic backgrounds being less likely to choose these majors. Indeed, we show that accounting for academic background can fully account for average differences in switching behavior between blacks and whites.
KeywordsGrade inflationAffirmative actionMajor choiceI2I20I23
Scholars have known since the Coleman Report in 1966 that the black white educational achievement gap is a robust empirical regularity. Since then, a prolific literature in economics has emerged trying to describe the evolution, causes and consequences of the racial test scores gap in primary and secondary schools. The main findings indicate that African American children enter kindergarten lagging behind their white counterparts, and these differences are likely to persist for the foreseeable future (Neal ). Cunha et al. () argue that schooling raises measured ability, but does not close gaps between children from different racial and economic strata, and if anything widens them. Fryer and Levitt (), using the Early Childhood Longitudinal Study database, find that by the end of first grade, black children lost the equivalent of almost three months of schooling relative to whites. These trends continue through middle school with both Phillips and Chin () and Hanushek and Rivkin ([2006, 2009]) documenting increases in the math achievement gap between blacks and whites through the eighth grade.
The divergence in black/white outcomes at early ages is not surprising given disparities in resources between black and white families. It could also be the case that disparities may continue to grow in college due to differences in parental resources, support, and information that also matter for performing well in college. However, the college environment is substantially different in that students are more separated from their families. Hence, it is also possible to expect, by taking students whose academic background is weak due to lack of resources but whose academic potential is strong, that these students perform poorly at first as they acquire the needed skills to succeed and then, with time, catch up. By way of illustration, consider the case of Ph.D. economics programs in the United States. International students, who often have Master's degrees upon entry, typically come in better prepared than their American counterparts, with American students gradually catching up over time. With affirmative action promoting access to those who are otherwise less prepared, it is possible that the beneficiaries of affirmative action may also catch up, at least partially, over the course of their college career.
In this paper, we examine the evolution of racial disparities in college, focusing in particular on students at Duke University. While researchers have documented lower grades for black students in college (see, for example, Betts and Morell ), this is to be expected given differences in college preparation. Here, we are interested in the time path of racial differences. Clearly using data from one highly-selective school may lead to questions about how the results carry over to other environments. Weighed against this, however, is the ability to use within-school variation, ensuring that our results our not driven by grading patterns being different across the different types of schools blacks and whites attend.
There are, however, at least two reasons to be skeptical of Figure 1: variance and course selection. With regard to variance, instructors use much less of the grade distribution in upper year coursesb. Indeed, the standard deviation of grades for second-semester seniors is 86% percent of the standard deviation of grades for first-semester freshmen. For convergence to occur, it is therefore important to examine differences in class rank over time rather than GPA levels.
The second concern is course selection. Grading standards differ wildly across majors at Duke (see Johnson [1997, 2003]), with similar differences seen across many universities (see Sabot and Wakeman-Linn , Grove and Wasserman , Bar and Zussman  and Koedel ).c In particular, natural science, engineering, and economics classes have average grades that are 8% lower than the average grades in humanities and social science classes. Note that these averages do not take into account selection into courses: average SAT scores of natural science, engineering, and economic majors are over 50 points higher than their humanities and social science counterparts. Although blacks and whites initially have similar interests regarding whether to major in the more strictly graded fields, the patterns of switching result in 68% of blacks choosing humanities and social science majors compared to less than 55% of whitesd. We show that accounting for these two issues can explain virtually all the convergence of black white grades.
Accounting for shrinking grade variances and course selection also explains the convergence in grades for a group where we would expect catch up to not occur: legacies. Legacies at Duke start out behind their white non-legacy counterparts (though not as far back as blacks) with 65% of the gape removed by the end of the senior year. Similar major-switching patterns occur for legacies as well, with large shifts away from the natural sciences, engineering, and economics towards humanities and social sciences. The different grading standards across courses legacies and blacks take, coupled with the tighter variances on the grade distributions of upper year courses, accounts for their catch up to their white non-legacy counterparts.
The convergence of black/white grades is then a symptom of the lack of representation among blacks in the natural sciences, engineering, and economics. Over 54% of black men who express an initial interest in majoring in the natural sciences, engineering, or economics switch to the humanities or social sciences compared to less than 8% of white men. While the similar numbers for females are less dramatic across races, they are nonetheless large: 33% of white women switch out of the natural sciences, engineering, and economics with 51% of black women switching.
These cross-race differences in switching patterns can be fully explained by differences in academic background. We show that natural science, engineering, and economics courses are more difficult, associated with higher study times, and are more harshly graded than their humanities and social science counterparts. These trends are particularly true for students with weaker academic backgrounds resulting in those with relatively weaker academic backgrounds being much less likely to persist in natural science, engineering, and economics majors.f
2 The Campus Life and Learning Project Data (CLL)
The data we analyze come from the Campus Life and Learning Project (CLL). The data was collected from surveys of two consecutive cohorts of Duke University students before college and during the first, second and fourth college years. The target population was defined as all undergraduate students in the Trinity College of Arts & Sciences and the Pratt School of Engineering. The sampling design randomly selected about one third of white students, two thirds of Asian students, one third of bi- and multiracial students and all black and Hispanic students. As a result, the final sample (including both cohorts) consists of 1536 students: 602 white, 290 Asian, 340 black, 237 Hispanic and 67 bi- or multiracial students.
Each cohort was surveyed via mail in the summer before initial enrollment at the university; the questionnaire was completed by 1181 students, a 77% response rate. However, response rates declined in the years following enrollment: in the first year of college 71% of students responded to the survey; in the second year 65% and in the third year 59%.g In addition to the information provided by the surveys, the survey asked permission to access their confidential student records. Since the students were given the opportunity to answer yes to this question on each survey, permission was granted at a very high rate: 91% of the sample granted confidential access to their student records. These records include complete college transcripts, major selection, graduation outcomes, test scores (i.e. SAT, ACT), Duke Admission Officers rankings based on high school curriculum, reader rating scores, high school extracurricular activities, and financial aid and support.
Summary statistics for selected variables by race
Mother BA or more
Mother Doctorate/Professional Degree
Mother Ed missing
Father BA or more
Father Doctorate/Professional Degree
Father Ed missing
Family Inc ≤ $50,000
Family Inc missing
Private School missing
SAT (Math + Verbal)
Duke Admissions Office Rank
Letters of Recommendation
The second set of rows show the Duke Admission Office evaluations which are scaled from 1 to 5. The largest cross-racial gaps are on achievement and curriculum. Two evaluators are given each file and the scores for each of the categories are averaged across evaluators. The largest cross-racial gaps are on achievement and curriculum. Asian students are ranked highest on average in these two categories, followed closely by whites. Among the different races, blacks score on average the worst in all categories but the gap is smaller on personal qualities and letters of recommendation.
3 The time path of black/white GPA differences and their sources
Median GPA and percent A's for black and white students by academic year
Table 2 also shows that the fraction of grades given that are A's rises substantially over time for both blacks and whites. Both races see a fifteen percentage point increase in the fraction of A grades given. This censoring has the effect of compressing the actual grade distribution. In addition to censoring, grading practices may vary by course, and black and white students may select courses differently, particularly over time. The next two subsections investigate the importance of censoring and course selection in explaining black/white grade convergence.
3.1 Class rank adjustments with no selection
Median class rank for black and white students unadjusted and adjusted for average course grades by academic year
Adjusting for Mean Grade
Year 1 to Year 4
3.2 Class rank adjusted for selection
where ϵijt is assumed to be orthogonal to δjand αit. Given the composition of class ability, differences in δjthen reflect differences in grading practices. Given estimates of δj, we can purge the grades of inflation by subtracting these estimates off of observed grades and using the purged grades to form a new measure of class rank. This new measure will then provide a clear picture of how black performance changes across years.
There are, however, at least two issues associated with this specification. First, there are many individual and course fixed effects that we need to recover. Second, grades are censored from above and become more censored in later years. In particular, 41% of grades given for seniors are A's. Combining the iterative strategy in Arcidiacono et al. () to handle multiple fixed effects in large state space problems and the Expectations Maximization (EM) algorithm applied to a Tobit in Amemiya (), we are able to obtain estimates of the parameters of interest while circumventing the dimensionality and censoring problems. Given the censoring, however, we need to make a distributional assumption on ϵ and we assume that it is distributed N(0,σ).
The algorithm begins with an initial guess of the parameters . It then iterates on the following steps with the m th iteration given by:
Calculate . Take the mean across grades for i at time t to obtain
Calculate . Take the mean across grades for course j to obtain
Median class rank for black and white students adjusting for course selection
Year 1 to Year 4
3.3 Robustness check: legacies
Time path of median legacy and white non-legacy class rank
Adjusting for mean grade
Adjusting for selection
Year 1 to Year 4
The last set of columns adjust for selection into courses. Selection into courses has no effect on legacy rank as seniors relative to the second set of columns. However, controlling for course selection as freshmen raises legacy rank. The net effect is then a widening of the gap between white non-legacies and legacies over time. While the unadjusted class rank showed the median legacy improving their position relative to the median white non-legacy by 5.5 percentage points, adjusting for selection shows their position actually falls by 3.8 percentage points. The convergence pattern between legacies and white non-legacies are then similar to African Americans, though the legacy estimates are less stable.m
3.4 Which students improve their position?
With blacks showing little evidence of catching up once we account for selection into classes, what groups do improve their position? To answer this question we begin by transforming class ranks such that they are distributed N(0,1). Then, we differenced the transformed class rank for seniors with that of freshmen. Finally, we regressed this gain in class rank on a series of characteristics.
Estimates of gains in class rank
Senior rank−Freshman rank
Junior rank−Freshman rank
Duke Personal Qualities
The third column of Table 6 adds measures of the Duke ranking of the applicant. Here, we create five dummy variables, one for each of the Duke ranking measures.n Being highly ranked on achievement is associated with decreases in class rank as are having relatively strong letters. In contrast, being highly ranked on personal qualities and the essay is associated with gains in class rank. Controlling for Duke rankings renders the SAT score results insignificanto.
In order to address concerns related to possible lower levels of effort exerted by some individuals in their senior year; the second set of columns of Table 6 repeats the analysis but uses changes between the freshman and junior year. The negative effect on Asians disappears, suggesting that Asian students are particularly prone to decreasing their effort in their senior year. The coefficient on male, while still significant is now half the value. In addition, columns (2-3) and (5-6) show the same patterns on SAT scores and the Duke rankings, suggesting that those with lower SAT scores and exhibiting potential (as opposed to preparation) improve their relative position. Finally, the coefficient on black becomes negative and significant once controls for SAT scores are included. This outcome could be explained by blacks with lower SAT scores not being able to significantly improve their ranking over time, unlike their other low SAT counterparts.
4 Racial disparities in major choice
In the introduction, two post-enrollment trends in black/white educational outcomes were described. First, black students see their grade point averages come closer to their white counterparts as students move from their freshman to senior year. But in the previous section, we showed that this cross-race convergence of grades is driven not by black students catching up, but rather by differences in grading patterns and course selection. We now turn to the second trend, namely that black students are much more likely to leave natural science, engineering, and economics majors than their white counterparts. Next, we present evidence indicating that there are differences in the grading patterns and the demands that courses in different majors place on their students. As a consequence, these differences then lead to students with worse academic backgrounds being more likely to move away from the natural sciences, engineering, and economics majors. Indeed, we show that differences in academic background can fully account for the cross-race disparities in persistence in the natural sciences, engineering, and economics.
4.1 Patterns of major switching by race
Final major and expected major open by gender and race
Final Major (%)
Expected Major (%)
Do not Know
More specifically, the proportion of white males choosing natural science, engineering, or economics majors is over 19 percentage points higher than the corresponding proportion of black males. This occurs despite black males showing a much greater initial interest in natural science, engineering, and economics majors, though this result is clouded by white males being more likely to report uncertainty about their future major.q White females are also more likely to choose natural science, engineering, or economics majors than black females, but the gap is small. Again, black females express a greater preference for natural science, engineering, and economics majors but are also less likely to report that they are uncertainty about their future major.
Final major and expected major open by gender and race conditional on not reporting “Do not Know”
Final Major (%)
Expected Major (%)
4.2 Differences in selection and major demands
To explain why individuals leave the natural sciences, engineering, and economics as well as the large differences across races, we first examine how this group of majors is different from their humanities and social science counterparts. Three main differences emerge. First, similar to Johnson ([1997, 2003]), we show that grading practices vary dramatically across these major groupings. Second, those students who are better prepared academically are more likely to persist in the natural sciences and economics. Finally, and perhaps related to the differences in grading practices, students are working harder in natural science and economics classes and perceive these classes to be more challenging than classes in the humanities and social sciences.
Average grades received by type of course and year
The differences in grades across the two groups does become smaller over time, which is in part reflective of selection out of natural science, engineering, and economics. While the rise in average grades across years is small in humanities and social science classes, this is dwarfed by the rise in grades in natural sciences, engineering, and economics classes. The average grade given to non-black seniors in natural science, engineering, and economics classes is almost 0.4 points higher than the average for freshmen. This increase over time is even larger for blacks at over 0.6 points. However, despite this increase in grades over time, for non-blacks and blacks, seniors in natural science, engineering, and economics classes have lower grades on average than freshmen in humanities and social science classes.
Major migration by initial major and SAT
Percent of students
Do not know
Do not know
Given the different grading practices as well as the sorting across majors, we may also suspect that study times vary across courses taken in these major categories as well. We are then interested in the relationship between number of courses taken in the natural sciences, engineering, and economics category and study time. The CLL survey asked students in both their freshman and sophomore years the following question:
· Since entering college, how much time have you spent during a typical week doing the following activities?
of which “studying/homework” was one of the options. Respondents were given a menu of time intervals as possible answers.v Over 20% of observations in both years are censored at the top category, 16 or more hours. We used midpoints for the time intervals except for the last interval and then estimated censored regressions where study time was the dependent variable.
Number of Nat. Sci./Eng./Econ. Courses
Total Number of Courses
Only Typical Load
Table 11 shows that the coefficients on female are always significant and positive, while on races are insignificant in most of the cases. In this regard, the results suggest that females spend around two to two and a half hours more studying a week than their male counterpartsx, with the stronger effects found in the sophomore year. Given that the median study time reported is eight hours a week, this is a substantial difference.
The total number of courses and number of natural science, engineering, and economics courses are scaled to correspond to the number of classes taken in a semester as opposed to the whole year. Switching one humanities or social science class to a natural science, engineering, or economics class is associated with a half-hour to forty-five minute increase in weekly study time.y Comparing the coefficient on the number of natural science, engineering, and economics classes to the coefficient on total number of courses suggest that natural science, engineering, and economics courses are associated with 50% more study time that social science and humanities coursesz. Note that these results should not be interpreted as causal. Rather, we are describing the correlations seen in the data: whether it is selection into the courses or actual work requirements, more studying is occurring in natural science, engineering, and economics classes.
Most challenging course
SAT one standard deviation below the mean
SAT one standard deviation below the mean
The third column shows that, in first semester freshmen courses, a natural science, engineering, or economics course will be 46% more likely to be chosen as the most challenging courses than if the most challenging course was randomly assigned. The ratios for females are higher, with the ratios higher still for blacks. As freshmen, blacks are 69% more likely than random to choose a natural science, engineering, or economics course as most challenging. The results for blacks can be partly explained by academic background mattering more in the natural sciences, engineering, or economics. This is shown by those who have SAT scores one standard deviation below the mean also having higher ratios than the average for the population. The gap between humanities and social sciences versus natural science, engineering, and economics classes in terms of which classes are most challenging increases over the first two years of colleges as the ratios for all groups are higher in the sophomore year.
4.3 Explaining racial disparities in switching behavior
Logit marginal effects on the probability of switching out of the natural sciences, engineering and economics
Duke Personal Qualities
Year 1 Student Effect
Column (2) controls for SAT score. None of the race coefficients are statistically significant and the coefficients on black and Hispanic are cut by more than half. Those with high SAT scores are significantly less likely to move out of natural science, engineering, and economics, consistent with Table 10. The next two columns add measures of the ranking of the Duke admission's office as well as the first period student effect from the grades analysis (αi 1). Adding more controls further lowers the black coefficient while affirming that those with stronger backgrounds are more likely to persist in the natural sciences, engineering, and economics. Both the first period student effect and having a strong high school curriculum make switching out of the natural sciences, engineering, and economics less likely. Overall, while the gap between males and females persistsab, racial differences can be full explained with observable characteristicsac.
Logit marginal effects on the probability of social sciences or humanities final major conditional on social sciences or humanities not being the initial major
No Initial Major
Duke Personal Qualities
Year 1 Student Effect
To further reenforce the point that the cross-race differences in persistence in natural sciences, engineering, and economics is driven by academic background, we examine data on the reasons individuals switched majors. In particular, the CLL survey asked students during their sophomore year if they had changed their major and, if so, why. Students were given a series of reasons and could check more than one reason for switching. Two of the potential answers relate directly to academic preparation:ad
· Lack of pre-college academic preparation for the major course requirements
· Academic difficulty in the major course requirements
We categorized an individual as switching because of their academic background if they marked either of the two answers above as a reason they switched majors. Over 30% of individuals who switched majors in their sophomore year did so in part because of their academic background. We then estimated a logit model of switching majors because of academic background on the sample of those who switched majors. Our controls include those from Table 13 with additional controls for initial major choice. Note that these are switches in the sophomore year and may not be across the broad categories we have been using in the previous parts of this section.
Change of major because of difficulty
Initial Major Nat Sci/Eng/Econ
Duke Personal Qualities
Year 1 Student Effect
In this paper we have analyzed how black and white educational outcomes at an elite university vary over time. We have focused on two outcomes: grades and choice of major. An argument in favor of affirmative action in college admissions is that it identifies students with much potential but weak preparation, suggesting recipients should catch up to their more-prepared counterparts over time. While at first blush there appears to be evidence of this as the differences in grades between blacks and whites diminishes over their college careers, we show that this is not due to differential learning. Rather, it results from both changes in how the grade distribution is used over time (the grading distribution is more censored in later years) and changes in course selection.
Changes in course selection result from black and white students having very different persistence rates in the natural sciences, engineering, and economics. While conditional on sex black students have stronger initial preferences than whites for majoring in the natural sciences, engineering, or economics, they are significantly less likely to choose one of these majors for their final major. We show that these differences in persistence rates are fully explained by differences in academic background. Courses in the natural sciences, engineering, and economics are rated more difficult, are associated with higher study times, and have harsher grade distributions than those in the humanities and social sciences. The differences in difficulty levels across course types then works to dissuade individuals with relatively worse academic backgrounds from persisting in natural science, engineering, or economics majors.
The lack of minority representation in the sciences is of national interest and much money has been spent on encouraging minorities to enter the sciences. Seymour and Hewitt () point out that the National Science Foundation alone has spent more than $1.5 billion to increase participation of minorities in the sciences, and two programs at the National Institute of Health have invested $675 million in the same endeavor. It is possible, however, that affirmative action, is working against these goals. Namely, affirmative action primarily affects where minorities enroll in college, not whether they enroll, pushing students up through the school quality distribution (Arcidiacono et al. ). With the difference in course difficulty and grading standards between the natural sciences, engineering, and economics and their humanities and social sciences counterparts naturally leading the (relatively) least prepared students away from the sciences, affirmative action may be working to increase the number of non-science majors at top schools at the expense of science majors at less-selective schools. That is, minority students would be higher up in the preparation distribution at a less-selective school, potentially resulting in a higher probability of persisting in a science major. However, more work is needed on a larger set of schools in order to assess the counterfactual of how persistence rates in the sciences would change absent affirmative action.
Appendix: drop-out bias and non-response bias
The Registrar's Office data provided information on students who were not enrolled at the end semester in each survey year. Non-enrollment might occur for multiple reasons including academic or disciplinary probation, medical or personal leave of absence, dismissal or voluntary (including a small number of transfers) or involuntary withdrawal. Fewer than one percent of students (n=12) were not enrolled at the end of the first year; about three percent by the end of the second year (n=48) and just over five percent (n=81) by the end of the senior year. We combined all of these reasons and tested for differences in selected admissions file information of those enrolled versus not enrolled at the end of each survey year. The test variables included racial ethnic group, SAT verbal and mathematics score, high school rank (where available), overall admission rating (a composite of five different measures), parental education, financial aid applicant, public-private non-religious-private religious high school and US citizenship. Of over 40 statistical tests, only two produced significant differences (with p-value less than 0.05): (1). At the end of the first year, dropouts had SAT-verbal scores of 734 versus 680 for non-dropouts; (2). by the end of the fourth year, those who had left college had an overall admissions rating of 46.0 (on a 0-60 scale) while those in college had an average rating of 49.7. No other differences were significant. We conclude that our data contain very little drop-out bias.
We conducted similar tests for respondents versus non-respondents for each wave for the same variable set plus college major (in 4 categories: engineering, natural science/mathematics, social science, humanities), whether or not the student was a legacy admission, and GPA in the semester previous to the survey semester. Seven variables show no significant differences or only a few small sporadic differences (one wave but not others), including racial ethnic category, high school rank, admissions rating, legacy, citizenship, financial aid applicant, and major group. However, several other variables show more systematic differences:
· Non-respondents at every wave have lower SAT scores (math: 9-15 points lower, roughly one-tenth to one-fifth of a standard deviation; verbal: 18-22 points lower, roughly one-third of a standard deviation).
· Non-respondents have slightly better educated parents at waves one and three, but not waves two and four.
· Non-respondents at every wave are less likely to be from a public high school and somewhat more likely to be from a private (non-religious) high school.
· Non-respondents have somewhat lower GPA in the previous semester compared with respondents (by about one-quarter of a letter grade).
These differences are somewhat inconsistent in that they include lower SAT and GPA for non-respondents, but higher parental education and private (more expensive) high schools. In general, the non-response bias is largest in the pre-college wave and smaller in the in-college waves even though the largest response rates are in the pre-college wave. In general, we judge the non-response bias as relatively minor on most variables and perhaps modest on SAT measures.
1Graduation rates are quite high at Duke University, with 96% of the students finishing their studies.
2Grove and Waserman  show similar trends in grades for a large private university in the northeast. Moreover, data of four years college graduates from the NLSY97 also shows that students GPA increase in upper years of college while their standard deviation decreases. More specifically, mean GPA increased from 3.18 to 3.33, while their standard deviation decreased form 0.574 to 0.481 between the freshman and senior years.
3For instance, Koedel () shows that the grades awarded by education departments are substantially higher than the grades awarded by all other academic departments. The classroom level average GPAs in the education departments are 0.5 to 0.8 grade points higher than in other department groups.
4The high proportion of students that switch major can be explained by students learning about their ability and preferences in the first few years of college. Stange () finds that uncertainty about college completion and final major is empirically important. Similarly, Stinebrickner and Stinebrickner ([2011a]) show that students learning about academic matters plays a particularly prominent role in educational decisions.
5Based on comparing non-cumulative semester GPA.
6Stinebrickner and Stinebrickner ([2011b]) show that, in Berea College, the proportion of students who reported that math/science is their most likely major is higher than the proportion for any other major. However, by the second semester of the third year in college, the proportion of students who reported that math/science is the most likely major decreased by 45%. In this regard, they highlight the potential importance of policies at younger ages that lead students to enter college better prepared to study math or science.
7In the appendix we discuss the patterns of non-response and attrition.
8Note that the median student for each race is changing by year.
9Note that the median student is changing across years.
10See Bar et al. () for an analysis of Cornell's program, with Bar et al. () developing a theoretical model of how students change their course-taking behavior in response to programs such as this one.
11In principle there is a lower bound on grades. In practice, very few F's are given suggesting that censoring at the bottom end of the distribution is not an issue.
12The formula of the inverse Mill's ratio is given by
13Comparing junior class rank to freshmen class rank still shows a small legacy improvement even after controlling for course selection. However, course selection clearly matters as the gains would be much larger without these adjustments. Overall, the legacy estimates are less stable than the estimates for African Americans. This may be a result of having a smaller number of legacies (175).
14For each ranking category, we created the dummy variables by choosing splits such that a significant fraction received both a high and low ranking. For achievement, recommendations, personal qualities and the essay a high ranking was above 3.5, above 3.75, above 3.7, and above 3.7 respectively. The student needed to receive a 5 to obtain the high ranking on curriculum.
15It is important to highlight that the negative coefficient on SAT is not given by a mechanical result (i.e. students at the top of the distribution initially having less room to move up in later years). This result only implies that SAT is more correlated with the freshman class rank than the senior or the sophomore ones.
16The total sample size of this table (which only includes black and white students) is 663.
17Uncertainty is captured by individuals responding to the expected major question with “Do not know”.
18The proportion of students that reported “Do not know” is 30%.
19The National Longitudinal Survey of Freshmen, which follows a cohort of first-time freshman at 28 selective colleges and universities, shows a similar pattern in major persistence.
20The total number of grades in humanities/social science for non black (black) considering all years is 18535 (5340) while in natural sci/engineering/economics is 13100 (2530).
21Similarly, Bar and Zussman () shows that humanities courses at the College of Arts and Sciences of an elite university in the United States provide higher grades than natural sciences ones.
22The intervals are: 0 hours per week, less than 1 hour, 1 to 5 hours, 11 to 15 hours, 16 or more.
23We use the same set of dummy variables as in Table 7.
24Stinebrickner and Stinebrickner () find similar results. They show that males study half an hour less per day than females.
25Babcock (), Babcock and Marks () and Stinebrickner and Stinebrickner () also show large differences in study time across majors. Babcock () provides evidence that harsher grade distributions result are associated with more study time.
26If economics were classified as social science, then there would be a slight decrease in the the study time for engineering and natural sciences relative to humanities, social sciences, and economics. However, the coefficient would remain statistically significant.
27Given that so little switching occurs in the opposite direction (i.e. from humanities or social sciences to natural sciences or economics), we only focus on switches away from natural sciences and economics.
28The higher proportion of females relative to males leaving sciences is an empirical regularity that has been analyzed in Carrell et al. (). They show that professor gender affects female students' propensity to persist in the sciences.
29If (instead) economics is classified as a social science, the coefficient on female and black will fall slightly but they will remain statistically significant.
30The other reasons were: 1) Academic interests and values have changed since arriving at Duke, 2) Career interests have changed since arriving at Duke, 3) Career values have changed since arriving at Duke, 4) Lack of pre-professional learning opportunities available (e.g., internships, research opporutnities, and 5) Other .
We thank Nathan Martin, Todd Stinebrickner, and seminar participants at Stanford Education school for comments. The Campus Life and Learning data were collected by A. Y. Bryant, Claudia Buchmann and Kenneth Spenner, Principal Investigators, with support provided by the Andrew W. Mellon Foundation and Duke University. They bear no responsibility for conclusions, recommendations and opinions found in this paper. Partial funding was provided by Project SEAPHE.
Responsible Editor: Pierre Cahuc
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.