1 Introduction

A vast amount of evidence has piled up about serious negative consequences of smoking since the 1964 Surgeon General’s report on the health effects of smoking (Levine et al. 1997). This evidence has lead economists to investigate the potential short term and long term relationship between smoking and various life outcomes such as labor market performance; finding a strong negative association between the two. The literature offers several reasons for this association. The first is the set of causal mechanisms through which smoking adversely affects labor market performance. Some examples of such mechanisms are employer discrimination; health problems, absenteeism and resulting productivity decrease; and hours lost due to smoking breaks (Kristein 1983; Levine et al. 1997; Lee 1999; Halpern et al. 2001; Heineck and Schwarze 2003; Weng et al. 2013).

There is one more causal mechanism through which smoking can affect labor market performance. Smoking negatively affects educational attainment if it is initiated early (Zhao et al. 2012); thus, indirectly deteriorates labor market performance through education. Even though the majority of the documented adverse health effects of smoking is observed in the long term, smoking may have adverse immediate health consequences on young people; if so, early smoking affects education. A report of the Surgeon General in 1994 shows that teenagers who smoke suffer from shortness of breath, increased heart beat and other respiratory problems. Furthermore, they are more vulnerable to the risk of other drug use. Levine et al. (1997) reveal that smoking is associated with decreased physical endurance. In addition, early smoking shows strong association with mental health problems and depression (Andreski and Breslau 1993). Although the nature of this association is yet to be established, there is evidence that mild depression may follow smoking initiationFootnote 1 (Steuber and Danner 2006; Goodman and Capitman 2000). Moreover, brain development, cognitive abilities and memory skills of young individuals can also be adversely affected by smoking (Trauth et al. 2000; Jacobsen et al. 2005). Consequently, all these negative effects on health can distort academic achievement.

Health condition of the individuals is not the only way early smoking can affect educational attainment. Since it is forbidden to smoke at schools,Footnote 2 smokers need to leave the campus during the breaks and turn back to classrooms after the break. Therefore, they are more likely to be distracted by life outside the school and more likely to return late to classrooms. Moreover, in their seminal paper (Rosenthal and Jacobson 1968) showed that expectations breed the performance. In other words, teachers’ expectations about potential performance of students actually affect eventual performance. In the case of early smoking, if teachers form lower expectations about smokers, then early smokers might actually perform worse Furthermore, early smoking can lead students to search for side jobs because they need to finance their new habit. Time spent at such jobs eventually reduces the time spent for studying; thus, it subsequently harms students’ performance at school.

Admittedly, the negative association between smoking and labor market performance does not have to be causal. It could be the result of a non-causal correlation. Such a correlation between smoking and labor market performance may occur when they are jointly determined by a set of observable and unobservable factors, e.g. parental characteristics, important life events, general attitude towards risk in life, myopic behavior or time preferences. Finally, the third reason is the reverse causality. In other words, labor market performance affects the smoking decision; for example, loss of a job might nudge individuals towards substance use including tobacco.

Keeping such mechanisms in mind, this study analyzes the effects of early smoking on educational attainment and labor market performance. This is not an easy task. The reason is that the aforementioned causal and correlated mechanisms complicate any analysis. The first method to deal with such a complication is to take advantage of instrumental variables. However, most of the instruments used in the literature so far, such as religiosity or parental characteristics, suffer from endogeneity as well. It is hard to assume that such type of individual-level or family-level factors do not have direct effects on educational attainment or labor market performance. French and Zarkin (1995) argue that it is very hard to find reasonable instruments to estimate the effects of alcohol use on wages. Perhaps, the same goes for the effects of smoking. Moreover, several studies discuss the weakness of instruments used for risky health behaviors including smoking and its consequences (French and Popovici 2011; Bound et al. 1995; Conley et al. 2012). Another problem with the IV estimation in the literature is that the negative smoking effect on labor market performance increases in magnitude once the instruments are used (Auld 1998; Zarkin et al. 1998; Van Ours 2004). This finding suggests that unobserved factors that make an individual more likely to smoke also make them perform better in the job market.Footnote 3 Although it is technically not possible to refute such a case, the more likely scenario is that unobserved factors that make an individual more likely to smoke, such as ability, time preferences or parental characteristics, make them perform worse in the job market. The same probably goes for educational attainment as well. That means the coefficient for the effect of smoking on labor market performance (or education) should actually decrease in magnitude once the endogeneity is taken into account.

The current study uses a correlated discrete factor approach in order to deal with the endogeneity issue rather than using exclusion restrictions. Heckman and Singer (1984) introduced this approach in order to control for unobserved heterogeneity in hazard rates, and Mroz (1999), for example, used it to estimate the effects of dummy endogenous variables.Footnote 4 First, the dynamics of smoking—accounting for both starting rates and quit rates are analyzed through mixed proportional hazard models. Although the main interest is on the early starting behavior—smoking before the age of 15—the analysis also includes quit rates of smoking to have a complete picture of the unobserved heterogeneity affecting the smoking dynamics. Hazard models provide the best fit to analyze the smoking dynamics as the smoking and quitting decisions are taken in a dynamic setting. Second, educational attainment and labor market performance, are analyzed using mixed ordered probit models with unobserved heterogeneity. Unlike previous studies, this study uses not only hourly wage information but also other indicators to measure labor market performance. I use information on the jobs that respondents have to construct an ordered variable, i.e. job rankings. The advantage of using the job rankings instead of wage information is that the data at hand enable construction of the job ranking variable for both the first job, the first jobs that individuals had, and the current job, the jobs that the individuals had at the time of the survey. However, wage information is available for only current jobs. Finally, smoking dynamics and ordered outcomes (educational attainment and job rankings) are modelled jointly to allow for correlation between unobserved heterogeneity. This controls for unobserved factors that can jointly affect smoking, education and labor market performance. Since reverse causality is not an issue here, because the early smoking behavior occurs before the age of 15, this method corrects for possible endogeneity caused by omitted variables.

The results show that early smoking has a negative effect on educational attainment. Once education is controlled for, the effect of early smoking on the first job rankings vanishes. However, there is still an effect on the current job rankings, which is a finding in line with the existing literature on the wage effects of smoking. An analysis of the probability of moving upward in the job rankings over time supports the aforementioned effects on the current job; showing that those who start smoking early are less likely to move upward. In other words, early smokers not only end up with worse first jobs due the effects through education, but also they are less likely to make a career. Finally, an investigation into the log-hourly-wages shows that reported wage effects of early smoking may be due to the smoking effects on the type of jobs, rather than the wage differentials within the same job.

There are several contributions of this study to the literature on the smoking effects on labor market performance. First, the empirical analysis uses not only the classical hourly wages information to measure labor market performance, but also the initial and the current job rankings. Since the current literature only focuses on the effects on wages, this study provides additional evidence about the effects of smoking on labor market performance. Second, this is the first study which explores the effects of smoking on labor market performance through the effect of educational attainment. It shows that there is an early smoking effect on labor market performance through educational attainment. Third, empirical analysis contrasts the effects of early smoking on the first job rankings and the effects on the current job rankings. Analysis of the first job rankings is interesting as it shows whether early smokers start their job career from a disadvantaged point early in life.

The remainder of this paper is set up as follows. Section 2 introduces the data and briefly presents some stylized facts. Section 3 gives details about the econometric strategy. Section 4 presents and discusses parameter estimates obtained through maximum likelihood estimations, and Sect. 5 concludes.

2 Data and Stylized Facts

2.1 Data

The data used in the empirical part of this study are from the Longitudinal Internet Studies for the Social Sciences (LISS), which comprises detailed data for a representative sample of Dutch population above 16 years old.Footnote 5 More specifically, a combined data set—from three specific single-wave collections of information within the LISS data-is used; namely Alcohol and Drugs Study, Work and Schooling Study and Wage Indicator Study. These three surveys, which are explained later in detail, constitute a rich set of information about smoking dynamics, labor market performance, and history of labor market transitions for a sample individuals representing the Dutch population.

Alcohol and Drugs study (2008) is a single wave data set, which is ideal for the purpose of the current study because it contains answers to detailed questions on smoking. Respondents in the LISS panel report whether they have ever used tobacco. If so, they also answer the following question: At what age, approximately, did you first use tobacco?. This information allows for the investigation of the determinants of uptake of tobacco, i.e, starting rates of smoking. The respondents who reported ever smoking also report whether they smoked in the last 30 days prior to the survey time. This information is used to estimate the determinants of tobacco cessation, i.e, quit rates of smoking. Analyses of starting and quit rates, then, enables a complete picture of smoking dynamics. This single wave data set consists of 5597 observations in total.

Work and Schooling (2008) and Wage Indicator (2009) are two data sets which focus on the working history of the respondents and their educational attainment. Respondents in the LISS panel answer detailed questions on their educational background (the highest degree of education with a diploma), type of the first and the current job (the job that an individual has at the time of the survey) as well as many other questions on wages, working hours, job satisfaction, etc. Merging these two data sets with Alcohol and Drugs study results in considerable number of missing observations. The resulting merged data set consists of 4030 observations. The reason is that 1567 individuals who participated in Alcohol and Drugs study did not participate in Work and Schooling and Wage Indicator studies. However, in terms of observables, these 1567 respondents and the remaining 4030 respondents are comparable; therefore, there is no immediate evidence for a selection problem between the data sets.

This paper focuses on three outcome variables: educational attainment, the first job rankings and the current job rankings. Each of the outcomes variables are constructed as ordered variables on a scale from 1 to 9; 1 denotes the lowest educational attainment or the lowest ranked job and 9 denotes the highest educational attainment or the highest ranked job. The exact details of the scales of education and job variables are given in “Appendix 1”.Footnote 6 As for the job variables, Table 2 displays the details of the rankings. The main idea is that clerical jobs are ranked higher than manual jobs, and non-manual jobs are ranked the highest. Within non-manual jobs, professional ones are ranked higher than managerial ones. As noted earlier the previous literature uses mainly hourly wages to analyze labor market performance. This is neither worse or better than using the job rankings at hand. The advantage of using the job ranking in this study is that this information is available for both the first jobs and the current jobs. Moreover, Table 2 displays the mean hourly wages corresponding to each category in the ordered jobs. Both for males and females, there is a strong positive correlation between hourly wages and job categories, except for the last category. Higher academic jobs pay less than higher supervisory jobs, on average. However, job rankings seem to capture the overall wage differentials, and swapping the last two categories do not cause changes in the empirical findings.

Empirical analysis in the current study uses information only on respondents between 22 and 60 years old. This restriction and the missing observations decrease the sample size to 2174 respondents, 1021 of whom are males. The age restriction is imposed on the sample because most individuals complete their education around the age of 22 and enter the labor market. Moreover, many of them leave the labor market mainly due to early retirement around the age of 60. Not surprisingly, the data at hand also demonstrates this phenomenon. Percentage of those who are in paid employment rises sharply after the age of 22 and drops sharply after the age of 60. Similarly the percentage of those without a job is 24% for under 22 and 57% for above 60, whereas it is only 4% for between 22 and 60.

In addition, there are several sensitivity analyses throughout this study to provide evidence for the robustness of the results. Some of these sensitivity analyses were only possible after merging the data set with other assembled studies within the LISS data. “Appendix 3” briefly discusses data coming from other assembled studies.

2.2 Stylized Facts

Figure 1 highlights unconditional dynamics of starting age of smoking in the sample. Panel (a) displays the empirical hazard rates of tobacco uptake for both males and females. The figure shows that starting rates make a peak at the age of 16 and then another—smaller peak at the age of 18 for both males and females. The first peak indicates that, conditional on not smoking before, individuals have the highest risk of smoking at the age of 16.Footnote 7 Starting rates virtually become zero after the age of 25 for males and 23 for females; indicating that those who do not start smoking until mid 20s are very unlikely to do so afterwords. In other words, individuals mature out of smoking risk in their mid 20s regardless of gender. This finding is replicated by the cumulative starting probability figures in panel (b), where the slope of cumulative probability becomes almost zero around the age of 23 for females and 25 for males. The vertical axis displays the probabilities where the slope of cumulative probability becomes almost zero; indicating that more than 60% of females and 65% of males start using tobacco at some point in time.

“Appendix 2” presents the details and the descriptive statistics of the control variables and the variables of interest. The second row on the right panel shows that around 25% of the individuals, male or female, start using tobacco before the age of 15. The first sub-panel on the right presents the statistics of education variables and shows that, for both males and females, approximately 10% of the respondents have a university degree. Most individuals obtain an applied or a higher vocational degree. Around 4% of the respondents report that their education level is below the compulsory education in the Netherlands (a VMBO degree).Footnote 8 The last two sub-panels present the statistics of labor market performance variables. A quick comparison of the figures in the table reveals that there is an upward movement. The percentage of individuals having a lower ranked job is smaller in the current job variable whereas that having a higher ranked job is larger. Since the individuals can move upward in the job rankings after years of experience, this observation is reasonable.

Figure 2 displays the percentage of early smokers in each category of the ordered variables. For males, there is a clear pattern showing that lower ranked categories are mostly filled with early smokers. For females, as can be seen in Fig. 3, there seems to be no obvious pattern in educational attainment and labor market performance. This unconditional and purely descriptive evidence suggests that there is a negative association between early smoking and educational attainment as well as early smoking and labor market performance for males. Whether this association is causal or not is an empirical question.

3 Empirical Model

3.1 Dynamics of Smoking

Two main components of the smoking dynamics are analyzed: starting rates and quit rates. In the starting rates analysis, I assume that individuals become vulnerable to the risk of smoking from age 13 onwards, as only a handful of respondents report a smaller starting age. Specification of the starting rate at time t (t = 0 at age 12), conditional on observed characteristics x and unobserved characteristics u, is

$$\begin{aligned} \theta _s (t\mid x,u)=\lambda _s (t)\exp \left( x^{\prime }\beta _s + u\right) \end{aligned}$$
(1)

where \(\beta _s\) represents the effects of independent variables; \(\lambda _s (t)\), individual duration (age) dependence. u denotes Heckman and Singer type discrete unobserved heterogeneity (Heckman and Singer 1984), which is unmeasurable set of differences in individuals’ susceptibility to smoking. Duration (age) dependence has a form of flexible step function; \(\lambda _s(t)=\exp (\Sigma _{k}\lambda _{k}I_{k}(t))\), where k (= 1,...,9) is a subscript for age categories. \(I_{k}(t)\) presents time-varying dummy variables that are one in subsequent categories, 8 of which are for individual ages (age \(13,\ldots ,20\)) and the last interval is for ages above 20. Given that the model has a constant term in \(x^{\prime }\beta _c\), the first parameter in duration dependence, \(\lambda _{1}\), is normalized to 0.

Similar to starting rates of smoking, quit rates are also assessed using a duration model. The LISS panel includes questions on the last month use of tobacco. The specification below assumes that if an individual reports no use of tobacco in the last 30 days, that individual quit smoking in the time period starting from the first use of tobacco until 30 days prior to the survey. Specification of the quit rate at time \(\tau \) (\(\tau \) = 0 at the age of initiation), conditional on observed characteristics \(x_1\) and unobserved characteristics v, is

$$\begin{aligned} \theta _q (\tau \mid x_1,v)=\exp \left( x_1^{\prime }\beta _q + v\right) \end{aligned}$$
(2)

Note that this analysis does not contain any duration dependence, because observing the exact time of quitting in terms of respondents’ ages is not possible. However, interval censored nature of the data allows for the quit duration analysis (i.e., total duration of use) thanks to the information on the year in which the first use of tobacco takes place. This information gives an interval for quit duration; in other words, even though total duration of smoking is not observed, minimum and maximum values of this duration are known. Explicitly, duration of smoking, denoted by \(\tau \), will lie in the interval [0,\(\tau _q\)] where \(\tau _q\) is the difference between age at the time of survey and the age of the first use.

The joint density of completed durations until initiation of smoking and completed durations of smoking is specified asFootnote 9:

$$\begin{aligned} g_1(t,\tau \mid x,x_1)=\int _v \int _{u} f_s(t \mid x,u) f_q(\tau \mid x_1,v)dG(u,v) \end{aligned}$$
(3)

where G(uv) is the discrete joint mixing distribution of unobserved heterogeneity which allows for the possibility that conditional on the observed characteristics, starting age of smoking and total duration of smoking are correlated through unobserved characteristics. The number of support points in G(uv) is not predetermined and chosen using the likelihood ratio tests. For example, G(uv) can have 3 points of support \((u_{1},v_{1})\), \((u_{1}, v_{2})\), \((u_{2})\); with \(v_{2}=u_{2}=-\infty \). The associated probabilities denoted as \(\Pr (u_{1},v_{1})=p_{1}\), \(\Pr (u_{1}, v_{2})=p_{2}\) and \(\Pr (u_{2})=p_{3}\) are assumed to follow a logistic distribution, \(p_{i}=\frac{\exp (\alpha _{i})}{\Sigma _{i=1}^3 \exp (\alpha _{i})}\), where \(\alpha _{3}\) is normalized to zero. This indicates that the model identifies three types of individuals regarding starting and quitting smoking. The first group consists of individuals with a positive starting and positive quit rate. The second group consists of those with a positive starting rate but a zero quit rate. The third group has a zero starting rate, therefore the quit rate does not exist at all.

3.2 Educational Attainment and Labor Market Performance

I use ordered probit models to investigate how early smoking affects educational attainment and labor market performance. First, I assume that the smoking decision is independent from all the unobserved factors than can be correlated with educational attainment, i.e. that the smoking decision is exogenous. Given that such an assumption is, by and large, not plausible, the following section (Sect. 3.3) will present the model that takes account of possible endogeneity.

Educational attainment is measured as an ordinal variable in a scale of 1–9. To exploit the ordinal character of the dependent variable, I use an ordered probit model with discrete unobserved heterogeneity. Such unobserved heterogeneity captures time-invariant person specific unobserved factors that cause systematic differences in educational attainment. The unobserved latent variable in the ordered probit model is

$$\begin{aligned} y_{ed}^*=x^{\prime }\beta _{ed}+ \rho _{ed} smoke_{15-} + \epsilon _{ed} + e_{ed} \end{aligned}$$
(4)

where \(\rho _{ed}\) represents the effect of early smoking. \(\epsilon _{ed}\) controls for discrete type of unobserved heterogeneity, which is different from the error term \(e_{ed}\). Furthermore, \(\beta _{ed}\) measures the effect of the control variables. The observed ordered categories and the rest of the specification of the ordered model are given in “Appendix 4”.

Similar to educational attainment, labor market performance is also investigated through an ordered probit model. The unobserved latent variable in the analysis of labor market performance is

$$\begin{aligned} y_{j}^*=x^{\prime }\beta _{j}+ \phi _{j} education + \rho _{j} smoke_{15-} + \epsilon _{j} + e_{j} \end{aligned}$$
(5)

where \(\rho _{j}\) represents the effect of early smoking. \(\phi _{j}\) controls for the effect of educational attainment on labor market performance. \(\epsilon _{j}\) controls for discrete type of unobserved heterogeneity. \(\beta _{j}\) measures the effect of the control variables. The rest of the analysis is analogous to the analysis of educational attainment; therefore, the details of the model specifications are omitted.The analysis of labor market performance is the same for the initial and the current job rankings.

3.3 Joint (Correlated) Model

Assuming that smoking is exogenous to educational attainment and labor market performance might be unrealistic. The exogeneity assumption requires that the early smoking decision is orthogonal to any factor that affects educational attainment and labor market performance. It is, however, likely that there are unobserved personal characteristics that affect all three processes. Some individuals, to exemplify, can exhibit myopic behavior in general by opting for immediate pleasure rather than long term achievement. If such a behavior is formed early in life, then these individuals will be more likely to smoke at an early age and will be less likely to complete higher levels of education and less likely to invest in human capital. Thus, an estimated negative effect of smoking will reflect a correlation rather than causality. Distinguishing causality from correlation by relaxing the exogeneity assumption is crucial to explore the true effects of early smoking.

To distinguish causality from correlation, I adopt a model that controls for correlation between unobserved heterogeneity affecting the smoking decision, educational attainment and labor market performance. To establish a causal effect, all processes are modeled simultaneously such that unobserved factors are allowed to be correlated by using discrete mixing distributions. This correlated discrete factor approach is equivalent to a correlated random effects model. The main idea is that unobserved heterogeneity affecting these three processes can be correlated, i.e. they come from a joint mixing distribution. This is akin to assume that the endogeneity of the smoking decision stems from unobserved time invariant factors affecting early smoking and educational attainment, such as innate ability or rate of time preferences.Footnote 10 Since the early smoking decision is taken before the age of 15, reverse causality is not an issue here. Therefore, this assumption fits well in the empirical question that this study investigates. In the absence of reverse causality, this method corrects the endogeneity problem stemming from possible omitted variables.

The joint density function of the completed duration of smoking initiation, duration of smoking, educational attainment and labor market performance—\(g_3(t,\tau ,y_{ed}=k_{ed},y_{j}=k_{j} \mid x,x_1,x_{2,ed},x_{2,j})\) is specified as:

$$\begin{aligned} \int _{\epsilon _{ed}} \int _{\epsilon _{j}} \int _v \int _{u}&f_s(t \mid x,u) f_q(\tau \mid x_1,v) Prob(y_{ed}=k_{ed}\mid x_{2,ed},\epsilon _{ed}) \nonumber \\&Prob(y_{j}=k_{j}\mid x_{2,j}, \epsilon _{j}) dG(u,v,\epsilon _{ed}, \epsilon _{j}) \end{aligned}$$
(6)

where \(G(u,v,\epsilon _{ed}, \epsilon _{j})\) is a discrete mixing distribution underlying unobserved heterogeneity affecting age of onset of smoking, duration of use, educational attainment and labor market performance.

4 Parameter Estimates

4.1 The Dynamics of Smoking

Table 5 presents the parameter estimates of mixed proportional hazard models for starting and quit rates, for both males and females. The negative coefficient estimates on religiosity and age-cohort dummies show that individuals who were living with religious parents during their adolescence and individuals who belong to older birth cohorts have smaller hazard rates. In other words, they have a lower probability of initiating smoking. Moreover, males in couples seem to have higher quit rates compared to singles. In addition, those who start smoking at early ages are less likely to quit smoking. The parameter estimates for females display very similar results. Panel (b) in Table 5 presents the estimates for duration (age) dependence parameters. In line with the patterns observed in Fig. 1, smoking initiation makes a peak at the age of 16 and then a smaller peak at the age of 18 for both gender groups.

Panel (c) in Table 5 presents the parameter estimates of unobserved heterogeneity. In all columns, I set the second mass points to minus infinity, thereby allowing for the possibility of zero starting and quit rates. Columns (2) and (3) show that three mass points are identified in the joint mixing distribution. The finding of 3 points of support suggests that there are three types of individuals regarding starting and quitting smoking. The first group consists of individuals with a positive starting and positive quitting rate. The second group consists of those with a positive starting rate but a zero quitting rate; the third group, those with a zero starting rate. For the last group, therefore, quitting rate does not exist at all. The parameter estimates of probabilities associated to these mass points show that 47% of the males and 45% of the females have a positive starting rate and a positive quitting rate; 22% of the males and 18% of the females have a positive starting rate but a zero quitting rate. 31% of the males and 37% of the females have a zero starting rate. Finally, the log-likelihood test statistics presented in the same panel shows that correlation between unobserved heterogeneity affecting starting rates and quit rates is statistically significant.Footnote 11 Therefore, it is important to jointly model starting and quit rates to identify the unobserved heterogeneity behind the dynamics of smoking.

4.2 Educational Attainment

Table 6 displays the estimated parameters of individual and correlated ordered probit models. Columns (1) and (3) show that early smoking has a negative association with educational attainment for both males and females. As clarified before, these regressions ignore the possible endogeneity of the smoking decision. Accordingly, the parameter estimates are bound to be inconsistent.

Columns (2) and (4) present the results of the joint models that control for the possible correlation between unobserved heterogeneity affecting education and smoking. As can be seen in both columns, the parameter estimate of the early smoking effect decreases in size. For females early smoking does not have a causal effect on the educational attainment, and previously reported negative effect is purely due to the correlation through unobserved factors. For males, on the other hand, even though the coefficient estimate decreases in the joint model, it remains significant. Therefore, I cannot rule out the possibility that early smoking has a causal effect on educational attainment for males. Finally, the statistics for likelihood ratio tests that appear in panel (c) show that correlation between unobserved heterogeneity affecting education and smoking dynamics is statistically significant.Footnote 12

Admittedly, the finding that early smoking has a causal effect on education might seem surprising. One can argue that a few years of smoking would not possibly cause significant health problems that can impair the youth and prevent him or her from completing education. Such an argument has, of course, merit for the current analysis as well because even those who start using tobacco at an early age will not consume it for long years before they finish their education. Undeniably, certain adverse health effects can naturally be observed if individuals use tobacco early. Some of such affects are briefly discussed before. However, for the sake of the argument, I assume that these effects are not observed. If negative health effects are not driving the significant results presented in Table 6, and if none the other mechanisms discussed before is strong enough for an early smoking effect, then what can explain the results?

An alternative mechanism is the possibility of exogenous time-varying shocks that can simultaneously affect educational attainment and the smoking decision. For example, loss of a friend or a family member or parental divorce can cause frustration and depression; resulting in both lower education and involvement in risky health behaviors including smoking. Panels (a) and (b) of Table 7 attempt to control for some of such possibilities. Panel (a) introduces a dummy variable for early loss of parent(s); panel (b), a dummy variable for early parental divorce. In both cases, “early” means that the mentioned frustrating and depressive event takes place before the age of 15. Under both specifications the smoking effect remains unchanged; therefore, I conclude that the early smoking effect is robust to exogenous childhood shocks.

One can alternatively argue that the proposed joint model is unable to capture unobserved factors affecting the smoking decision and educational attainment. To investigate such a possibility I perform several robustness analyses by introducing control variables that are expected to be highly correlated with the unobserved factors that can affect both processes. One of such unobserved factors could be systematic differences between the rate of time preferences between individuals. If such preferences are formed early in life, then they can explain the negative coefficient estimates of early smoking. Individuals with high rates of time preference will place a higher value on present than on future. Consequently, they will be more likely to enjoy risky health behaviors and less likely to invest in human capital (Levine et al. 1997). In panel (c) of Table 7, I control for certain preference patterns to check the robustness of the smoking effect. If the early smoking effect changes after adding the preference variables, then it means the joint model fails to capture unobserved systematic differences. The preference patterns for which the specification controls are risk aversion, prudence and temperance. “Appendix 3” gives more information on the measurement and the use of these preference variables. For both males and females, parameter estimates in the table show that the results do not change; therefore, such preferences are already captured by the joint model and not driving the main results.

Another unobserved factor can be the selection into peer groups. For any risky health behavior, peers can have an effect on the individual. If for example, certain students select into certain peer groups, their education and smoking behavior can be affected simultaneously. It should be noted that current econometric model is exactly designed for such unobserved factors. If individuals select into peer groups based on some unobserved factors before smoking initiates, then this will be captured by the model. If selection happens after smoking initiates, based on the smoking behavior, then this is a consequence of smoking rather being a problem. Even though the econometric model is designed to capture such issues, I performed several sensitivity analysis using proxies for peer effects. Panel (d) of Table 7 introduces control variables for parental education. The smoking effect is robust to the inclusion of these variables. Finally, the last panel of Table 7 displays the results of an estimation where I include a dummy variable for the early use of alcohol (before age of 15). There is no data on the peer use of tobacco during childhood. Therefore, the idea here is that the peer effects that can lead an individual to use tobacco can also lead the same individual to involve in other risky health behaviors such as early alcohol use. In other words early alcohol use can be used as an imperfect proxy for the general peer effects. The results show that the early smoking effect is unchanged.Footnote 13 All in all, the results of these sensitivity checks show that the joint model successfully controls for correlated unobserved factors that can jointly affect the smoking decision and educational attainment.

Finally, the remaining mechanism possibly explaining the negative smoking effect is that early smoking has indeed a causal effect on education via the various channels mentioned before.Footnote 14 Unfortunately the data at hand does not allow for exploration of these mechanisms for there is no information on physical or mental health of individuals at young ages, or on attendance patterns at schools. If there is indeed a causal effect, then those who start smoking at an early age unintentionally enter a different life-labor path. Subsequently, such an early path diversion between smokers and non-smokers can result in serious disadvantages for the former through the accumulation of the effects. The following section will partly shed light on 2 of these disadvantages by presenting the analysis of the first and the current job rankings.

4.3 Labor Market Performance

4.3.1 The First Job Rankings

Table 8 presents the coefficient estimates of the joint models on the first job rankings for males and females. The first column suggests that the early smoking decision has a negative effect on the type of the first job that a male has. Nonetheless, this effect alters once the specification includes educational attainment variables. Considering the discussion about the effects of early smoking on education in the previous section, one can indeed expect a change in the coefficient estimate. However, the change is so manifest that as presented in the second column of the table, keeping the level of education constant, early smoking does not affect the first job. Considering the aforementioned mechanisms through which smoking can affect labor market performance, this empirical finding is reasonable. For the first jobs, no employer discrimination is expected as the smoking status of the first time job applicants will not be observable. Moreover, the serious health consequences that can affect the productivity are probably not observed since they take place after long years of use. The remaining mechanism is the educational attainment, which is what the results also indicate.

Additionally, column (3) controls for possible endogeneity of the educational attainment in the first job rankings estimation. The coefficient estimate of early smoking is unchanged; showing that it is robust to the extension of the functional form defining unobserved heterogeneity. The last three columns present the same results for females. For both gender groups, statistics of likelihood ratio tests that appear in panel (d) reveal that the correlation between unobserved heterogeneity affecting smoking dynamics, education and the first job rankings is significant. Therefore, estimations that fail in controlling for this correlation would suffer inconsistency.

Table 9 presents the results of various sensitivity analysis on the joint model for males and females, respectively. The first sensitivity analysis controls for the preference patterns to take account of different rate of time preferences, prudence and patience. Although the sample size decreases substantially, a quick comparison with the columns (3) and (6) of Table 8 reveals that the results remain similar. The second sensitivity analysis controls for the search efforts before an individuals finds his or her first job. Similar to the previous sensitivity analysis, the purpose is to take account of different preferences. This sensitivity analysis also takes account of possible exogenous shocks that can impair abilities to put search effort. The results show that the coefficient estimate of early smoking is robust to this specification for both gender groups. The final sensitivity check controls for possible calendar effects by introducing the year in which an individuals starts his or her first job. The results are also robust to this final specification for both gender groups.

All in all, the results indicate that early smoking has an adverse effect on the first job rankings, but only through education. Therefore, if the academic problems stemming from early smoking can be prevented, early smokers will be less likely to start their career in the labor market from a disadvantaged point. Admittedly, the first job is only a part of the life-time labor market performance, and the next section will explore the possible long-term effects of early smoking by investigating the effects on the current jobs of individuals.

4.3.2 The Current Job Rankings

Table 10 presents the parameter estimates of the current job ranking regressions.Footnote 15

Column (1) in Table 10 present the results when educational attainment is not controlled for. Column (2) adds the educational attainment dummies into the analysis. Comparing the coefficient estimates of early smoking in columns (1) and (2) shows that, similar to the first job rankings, early smoking has an effect on the current job rankings through education for males.

Column (2) shows the estimates of individual model for males, where early smoking is assumed to be exogenous. There is a negative and significant association between early smoking and the current job rankings. Column (3) presents the results of the correlated model, where the correlation between unobserved heterogeneity is taken into account. The coefficient estimate of early smoking decreases from column (2) to column (3). However, it remains significant.Footnote 16 Columns (5) and (6) present the results for females. Neither in the individual model nor in the joint model there is evidence for a negative effect. Likelihood ratio tests show that the joint model is preferred over individual models, which ignore the correlation between unobserved heterogeneity.

Contrasting the results in the previous section with the ones here shows that there is a difference between the first job rankings and the current job rankings in terms of the early smoking effect. Even though there is no evidence for the early smoking effect on the first job rankings (conditional on education), the results in Table 10 show that there is an effect on the current job rankings. Statistically, there can be two main reasons for such a phenomenon. The first is that those who start smoking at an early age are more likely to move downward in the labor market performance rankings, compared to those who do not start smoking early. The second is that early smokers are less likely to move upward. Since there are only a handful of observations where the first job is higher ranked than the current job, the second explanation sounds more plausible.

To check if early smokers are indeed less likely to more upward, I perform a similar ordered probit estimation, where the dependent variable is the difference between the current job rankings and the first job rankings. This variable takes a value of 1 if there is no change or the difference is negative; 2 if the difference is 1; 3 if the difference is 2; and 4 if the difference is more than 2. Table 4 displays the sample statistics of these job transitions. The parameter estimates are reported in Table 11. Since the chances of moving upward in the rankings can depend on the initial position, I also control for the first job type. Coefficient estimate of early smoking shows that males who start using tobacco at an early age are less likely to move upward in the job rankings. This inertia explains why there is no effect on the first job, but there is on the current job. This finding is in line with the expected long term adverse effects of smoking (especially on physical health). Apparently, the accumulation of effects distorts labor market performance and harms individuals’ ability to move upward in the job rankings.

Finally, Table 12 succinctly connects the labor market effects of early smoking to the wider literature on the wage effects of tobacco consumption. The dependent variable, in this estimation, is the log hourly wages of the individuals calculated as \(\frac{Monthly~wages}{hours~worked~per~week*4.29}\). For hourly wage estimations, I use information from individuals who report at least 20 h of work per week or at most 60.Footnote 17 Unlike the foregoing sections, the dependent variable is not an ordered one in this case. Therefore, “Appendix 4” briefly presents the econometric model that produces the parameter estimates in the table. The main idea behind the estimation is the same; the joint model allows for correlation between unobserved heterogeneity affecting hourly wages, smoking dynamics, education and job rankings.Footnote 18

In order to analyze the effect of early smoking, I initially include the early smoking dummy and the available control variables in the wage equation. The results show that early smoking negatively affect the wages of males. There seems to be no negative effect for females. In order to see the occupation-specific effects of early smoking, I include interactions between the early smoking variable and the job type dummies into the wage equation. Lower panel of Table 12 report the results. For almost all of the job types of males early smoking negatively affects wages. The effect seems to be the highest for lower ranked jobs. For females, on the other hand, there is no evidence for a smoking effect.

Although not reported, I performed several other sensitivity analysis on the current job rankings, wages and the probability of switching to a better job. First, I re-estimated the effect of early smoking on current labor market performance by using information about the current status of smoking. It seems that negative effect on the wages and the current job is higher for those who start smoking early and continue smoking until the time of the survey. However, the coefficient estimate of interaction between early smoking and current smoking is imprecisely estimated. The possible reason is that the vast majority of those who start using tobacco before the age of 15 consists of still-smokers. Second, I controlled for risk attitudes in the analysis of current labor market performance. The reason is that preferences and risk attitudes can affect incentives to invest further in human capital, e.g. through on the job trainings. The results in Tables 11 and 12 are found to be robust to the inclusion of the risk attitude and preference variables. Furthermore, using ordered logit models instead of probit ones did not also change the results, only rescaled the coefficient estimates.

4.4 Magnitude of the Effects

Figure 4 briefly displays the magnitude of the early smoking effect on educational attainment and the current job rankings for only males. The numbers in the figure are obtained as follows. First, I simulated the probabilities of belonging to the each category in the ordered choices for those who smoked before the age of 15 using the estimates presented in column 2 of Table 6 and column 2 of Table 10. Then, the same probabilities are simulated for those who did not smoke before the age of 15. The numbers in the figure reflect the differences between these probabilities for the each category. For all of the other control variables, sample means are used in the simulations.

The results show that early smoking decreases the probability of finishing a high level of education. The effect is the largest on the probability of finishing a scientific degree. The probability of completing scientific education is 4%-point lower for someone who smoked before the age of 15. Similarly, early smoking decreases the probability of having an academic job by almost 4%-point in the long run.

5 Conclusion

There is a small literature studying the causal effects of smoking on labor market performance. The majority of the studies within this literature focuses on earnings or hourly wages. The literature on the relationship between smoking and educational attainment is even smaller. A handful of studies explore the association between smoking behavior and education without establishing causal effects.

This study focuses on the effects of early initiation of smoking on educational attainment and labor market performance. It uses not only hourly wage information but also ranking of jobs to measure labor market performance as this information is available for both the first jobs and the current jobs. Since educational attainment and job ranking variables have ordinal character, ordered probit models are used in estimations. The results indicate that there is a strong negative association between smoking and education as well as smoking and labor market performance for both the first job and the current job rankings. However, it is possible that smoking and education, and smoking and labor market performance are jointly determined by a set of unobserved factors. To tackle this endogeneity problem, the current study uses a correlated discrete factor approach, which is equivalent to a correlated random effects model in which the main idea is that unobserved personal characteristics affecting smoking, education and labor market performance can be correlated. In the absence of reverse causality, this method yields causal effects as it controls for the endogeneity problem stemming from omitted variables.

The results show that early smoking has a negative effect on educational attainment for males only. This negative effect is robust to several sensitivity checks such as controlling for risk preferences and depressive childhood events. Apparently, smoking does not only have long term negative consequences, but also can start affecting one’s life earlier on. Once education is controlled for, there is no evidence for the early smoking effect on the first jobs. The only effect seems to be through education. This effect suggests that early smokers start their labor market career from a disadvantaged point, and this disadvantage is due to that early smokers perform worse in schools. No causal effect is found for females.

Unlike the first job rankings, educational attainment is not the only channel through which early smoking affects the current job rankings of males. The results show that there is still an effect on the current job conditional on educational attainment; which is a finding in line with the existing literature on the wage effects of smoking. Proposed mechanisms for the wage effects of smoking are mainly discrimination, serious health consequences and smoking breaks at workplaces. All these mechanisms might work after years in the labor market. All in all, it seems that the adverse effects of early smoking accumulates over time and early smokers who start their career with low ranked jobs become stuck in those jobs or become less likely to make a career. Finally, an analysis of the log-hourly-wages shows that reported wage effects of early smoking may be due to the possible effects on the type of jobs.

The reported effects of early smoking on educational attainment of males suggest that policies against smoking, targeting especially youth, can be indeed effective. It is not only the case that early smoking affects solely education, but it also affects other important life outcomes through education. Therefore fight against the adverse effects of smoking needs to begin very early in schools for once the smoking effect on educational attainment materializes, there might be several long term consequences. If the negative effect of early smoking on education can be prevented, the indirect adverse effects can also be prevented. The easiest way to do so is, of course, to prevent young individuals from initiation into smoking. This requires a much more detailed analysis of the determinants of tobacco uptake. Only then it is possible to identify the more vulnerable individuals and fight against the negative aspects of tobacco uptake. Furthermore, the difference between males and females in terms of the early smoking effect indicates that there is need for further analysis of the mechanisms through which smoking affects education and labor market performance. Apparently, some of the proposed mechanisms work only for males.

Table 1 Ordered categories of education variable
Table 2 Ordered categories of the job variables

Finally, the existence of early smoking effect on the current job conditional on educational attainment suggests that early smoking affects labor market performance through other channels in the long run. Apparently, the problems related to early initiation of smoking accumulate, and it is not only the education that matters in the long run. This further calls for preventive measures.