Introduction

Most urban areas suffer from serious traffic congestion that is attributable to high population density. Traffic congestion causes many problems, such as traveler stress and health effects (e.g., Cox et al. 2006; Currie and Walker 2011) and negative environmental externalities attributable to increased vehicle emissions (Zhang et al. 2011). Reducing congestion has great and obvious benefits. Policy makers and transportation planners need to know how much benefit will be gained from a costly project aimed at decreasing congestion. In the case of a public road, the benefit can be calculated by multiplying the reduced travel time by the value of time. Valuing in-vehicle crowding on public transport is somewhat dissimilar to the valuation for public road travel; the value is related to a comfort loss in the former but a time loss in the latter (Prud’homme et al. 2012).

Researchers have shown that in-vehicle crowding causes stress and health problems (Cox et al. 2006; Mohd Mahudin et al. 2011, 2012). In-vehicle crowding is also a determinant of waiting time, anxiety, exhaustion, safety, and choice of route and vehicle (Tirachini et al. 2013). Ignoring the effect of crowding causes an overestimation of demand for crowded public transport, which subsequently leads to overestimation of the benefits of travel in cost–benefit analysis (Batarce et al. 2016).

In-vehicle crowding on public transportation is also related to sustainable transportation policies. Although shifting to public transportation is repeatedly proposed as a solution to the environmental costs of mass automobile usage, this modal shift will lead to an increased level of crowding on public transportation, ultimately resulting in another welfare loss (Haywood et al. 2018). Several methods have been proposed to deal with crowding: increasing frequency, increasing the number of seats on public transit, increasing vehicle size, and imposing time-dependent fares (de Palma et al. 2017; Tirachini et al. 2013, 2014).

Previous studies have conducted stated preference (SP) surveys to investigate the value of in-vehicle crowding reduction on public transport (Basu and Hunt 2012; Batarce et al. 2016; Björklund and Swärdh 2017; Haywood and Koning 2015; Tirachini et al. 2017; Whelan and Crockett 2009). These studies required participants to select their preferred travel mode among hypothetical alternatives, which included attributes such as the level of crowdedness described by a picture showing the inside of the train/bus. The willingness to pay to reduce crowdedness was then calculated using coefficients in a utility function estimated by a choice model. Most of the studies focused on commuters: Batarce et al. (2016) included only workers, whereas Björklund and Swärdh (2017), Haywood and Koning (2015) and Whelan and Crockett (2009) included various types of users, although commuters occupied the largest share.

Some studies found that the value of crowding on public transport differs among individuals. Tirachini et al. (2017) showed that male, young and high-income travelers are sensitive to travel time but not to crowding compared with other types of travelers. In contrast, Haywood et al. (2017), who analyzed the effect of in-vehicle density on comfort satisfaction, showed that higher-income travelers are more sensitive to crowding.

Other studies have investigated in-vehicle crowding by revealed preference (RP) data. Hörcher et al. (2017), who used the RP method, suggested that SP studies overestimate the cost of in-vehicle crowding. Their study investigated travelers’ route choice in public transport using automated fare collection data on travelers, including smart card data. Despite the attempt to simulate reality, there are a limited number of RP studies that have estimated in-vehicle crowding costs (Halvorsen et al. 2016; Tirachini et al. 2016). While the RP studies are superior in terms of controlling the real variability of in-vehicle conditions during a trip, the SP method still has the advantage of collecting target variables more easily even if the variable is unobservable in reality.

The Japanese government recommends using choice-based methods to investigate in-vehicle crowding. The Japanese guideline for railway project evaluations (MLIT 2012) recommends that transportation planners use a discrete choice model to calculate the value of crowding. Before they include in-vehicle crowding as an explanatory variable in the choice equation, they convert the crowding rate into a generalized time based on a formula suggested by MLIT (1999). Based on this method, Kato et al. (2003) calculated the benefit of easing in-vehicle crowding from a project related to the relocation of a station in the Tokyo area.

Although many studies and the Japanese government have explored travelers’ mode choice behavior with SP and RP data to value in-vehicle crowding, evidence from recent studies implies that we should focus on the ex post evaluation of travel experiences because of the fundamental limitations of choice-based methods. The utility inferred from choice behavior is referred to as decision utility (Kahneman et al. 1997). The underlying assumption of the inference is that an individual predicts which travel mode is most likely to provide the highest utility among a set of alternatives. In other words, decision utility becomes a precise estimate of utility if an individual can accurately predict, at the time of mode choice, their future experiences of using transportation. However, previous studies have found that people are likely to underestimate the negative aspects of future commuting experiences and suffer more than they expected under the negative influences of travel (Frey and Stutzer 2014; Stutzer and Frey 2008). Although people can generally switch their commuting modes if they are dissatisfied with the current modes, such mode switching lacks feasibility if alternative modes to railway are not easily accessible. Despite the important roles that discrete choice methods play in demand prediction, the ex-ante choice model cannot precisely capture the stressful experience of crowding and its effects, especially for commuters living and working in an area with limited availability of alternative modes.

While SP and RP studies have focused on travelers’ decision utility, other studies have analyzed travelers’ experienced utility. Experienced utility refers to the experienced outcome of a choice (Ettema et al. 2010). One of the measures of experienced utility is the perceived quality of service (de Oña et al. 2013; dell’Olio et al. 2010). Some attributes such as comfort, safety, and space are recognized as determinants of the perceived quality of public transportation (de Oña and de Oña 2014). Furthermore, perceived quality of service has been found to determine travel satisfaction (Eboli and Mazzulla 2015). Subjective well-being (SWB) is another representative measure of experienced utility. Previous studies have analyzed the process of how daily travel affects life satisfaction, a frequently used proxy for SWB (Bergstad et al. 2011; Ettema et al. 2010; Friman et al. 2017).

Adaptation is a critical characteristic of experienced utility. Some researchers have found that a person’s level of SWB has a specific reference point (e.g., Lucas 2007). A major life event, such as marriage or an increase in salary, has short-term effects on SWB but no long-lasting effect, which suggests that SWB returns to the reference point after some time. If we consider a change in the commuting mode as a life event, we can formulate a hypothesis that the effects of commuting discomfort might be different depending on how long of a period an individual has commuted by train. Although previous studies have found the negative effects of crowding on people’s stress and health (Cox et al. 2006; Mohd Mahudin et al. 2012), whether adaptation can reduce the impact of in-vehicle crowding on stress and health remains an open question. Even if the direct effects on SWB do not exist, it is possible that the effects on people’s health and stress subsist for commuters who used trains not only for the short term but also for the long term. There is little evidence for the existence of adaptation to train commuting.

In this study, we analyzed the effects of dissatisfaction with in-vehicle crowding on experienced utility using two-wave panel data. More specifically, we examined whether the effects of dissatisfaction with in-vehicle crowding on life satisfaction might differ by the length of time the commuters have been using the train. We hypothesized that the effects of dissatisfaction with crowding on life satisfaction are divided into two groups: direct effects and indirect effects (i.e., through stress and health).

We use a sample from Tokyo, where travelers frequently experience Japan’s most severe in-vehicle crowding on trains. According to MLIT (2018), the average congestion rate of major railways in the greater Tokyo area during rush hour is 163%, which is higher than that in other major cities in Japan, such as the Chukyo area (around Nagoya) at 131% and the Keihanshin area (around Osaka) at 125%.Footnote 1 Commuters in Tokyo are experiencing the inconvenience of travel mode alternatives to trains due to the high parking rates and the poor bus route options. Thus, it is highly possible that the commuters tolerate train commuting even if they are dissatisfied with it, without switching to another mode.

The remainder of this paper is structured as follows. “Modeling framework” section describes the modeling framework built on previous studies, and “Data” section explains the data and the method used in the empirical analysis. “Results” section shows the results, and “Discussion” section discusses the results and concludes.

Modeling framework

Conceptual model

A structural model is specified in reference to two relationships identified by previous studies. One relationship concerns satisfaction with daily travel, emotional factors and SWB. Ettema et al. (2010) constructed a theoretical framework that describes how daily travel impacts SWB and hypothesized both direct and indirect effects of daily travel on SWB. Based on their framework, an empirical analysis was conducted, and evidence on the relationships was shown (Bergstad et al. 2011; De Vos 2019; De Vos and Witlox 2017; Friman et al. 2017). These studies suggest that satisfaction with travel attributes has both direct and indirect effects on SWB through affective, emotional factors.

The other relationship focuses on the influence of in-vehicle crowding in public transportation on stress and health. Cox et al. (2006) constructed a theoretical framework for how the passenger density of rail transport impacts stress and health by referring to extensive literature. They separated the meanings of density and crowding: density refers to an objective characteristic of an in-vehicle situation, whereas crowding stands for a subjective perception of the objective density. They then suggested that high passenger density is not necessarily recognized as a high level of crowding. Following this evidence, Mohd Mahudin et al. (2011, 2012) analyzed a model that specified the effects of passenger density and crowding and showed that affective reactions to crowding significantly predict commuter stress, which influences health, and feelings of exhaustion, which affect life satisfaction. They also showed that passenger density does not influence either stress or feelings of exhaustion directly but does so indirectly through affective reactions to crowding. This evidence indicates that to evaluate a reduction in in-vehicle crowding, passengers’ subjective evaluation of crowding should be a focus. The effects of crowding on stress are also found in neuroscientific studies. In an experiment, Kennedy et al. (2009) showed that close proximity to another person is found to activate one of the parts of the human brain related to stress reaction. In this experiment, the participants were facing each other. Although their study provides useful neuroscientific insights about stress and crowding, passengers are not always facing each other in reality. Taking this evidence into account, whether passenger density leads to crowding discomfort might be affected by whether passengers face each other.

Stress is known as a risk factor for health problems in the field of medical science. Mental stress is recognized as a major cause of cardiovascular disease (Greenwood et al. 1996). For instance, there is epidemiological evidence that perceived mental stress causes increased mortality from stroke and coronary heart disease (Iso et al. 2002). Furthermore, stress increases the risk of other diseases such as obesity (Torres and Nowson 2007), diabetes (Surwit et al. 2002), and asthma (Wright et al. 1998).

The two findings from these studies are key to constructing our model: (1) satisfaction with the travel conditions has direct and indirect effects on SWB through affective factors, and (2) passengers’ affective reactions to in-vehicle crowding impact stress and health. Based on these findings, we construct a model outlining the relationships between in-vehicle crowding, stress, health and life satisfaction. The hypothesized model is shown in Fig. 1.

Fig. 1
figure 1

A hypothesized model outlining the relationships between stress, health, life satisfaction, and dissatisfaction with crowding

Endogenous variables in this model explain the major relations between the main components of our hypothesis: dissatisfaction with in-vehicle crowding has negative effects on life satisfaction both directly and indirectly via stress and health. We hypothesize the main relationships by combining the two findings from the frameworks mentioned above. On the basis of the finding about the relationship between satisfaction with travel and SWB, we use dissatisfaction with in-vehicle crowding as our main explanatory variable for satisfaction with travel. Because in-vehicle crowding may be perceived by commuters as having only negative aspects, we focus on dissatisfaction with crowding whereas previous studies have used satisfaction with overall travel. Then, on the basis of the finding related to crowding and stress, we include stress and health in our model as mediators between dissatisfaction with crowding and SWB. Finally, we use overall life satisfaction as a proxy for SWB; this measure is used more frequently than other measures, such as happiness and eudemonic well-being, in the context of travel behavior.

Exogenous variables play a role in controlling the variables used to refine the estimates of interest. Travel attributes influence satisfaction with travel (Friman et al. 2017), stress (Ettema et al. 2010; Evans and Wener 2006), health (Christian 2012), and life satisfaction (Frey and Stutzer 2014; Stutzer and Frey 2008). Sociodemographic factors such as age, gender and income have been found to affect various indicators in many studies on transportation, health and happiness; therefore, we assume that these factors affect all of the main variables.

Major life events are control variables that we also include in the modeling framework to avoid omitted variable bias. We should control events related to changes in the commuting mode, such as a move or job change, because they may be correlated with whether passengers are dissatisfied with in-vehicle crowding.

In addition, we hypothesize that the effects of dissatisfaction with crowding vary depending on whether individuals are accustomed to train commuting. As a proxy for the level of adaptation to train commuting, we focus on whether individuals are long-term or short-term train commuters. We also assume that the probability of being a long-term or short-term train commuter is determined by sociodemographic factors and life events.

Estimation method

We use structural equation modeling (SEM) with observed variables (e.g. Bollen 1989) to estimate our hypothetical model and test differences depending on whether commuters are long-term train commuters.

We utilize the first-difference equation (e.g. Johnson 2005; Wooldridge 2010) to address the possibility of an endogeneity problem caused by time-invariant omitted variables. Whereas the first-difference equation is usually estimated by using OLS in many studies, some researchers have shown that the first-difference equation can also be estimated by the maximum likelihood method (Han and Phillips 2013; Kripfganz 2016). Differencing the regression equation across the two time periods can eliminate the time-invariant unobservable. This differencing procedure makes it possible to obtain the consistent estimated effects of independent variables if all omitted variables that might cause the endogeneity are constant over time. The time-varying error term needs to be uncorrelated with independent variables for all time periods, which is called the strict exogeneity of the independent variables.

The specification of our SEM model is shown in the following equations:

$$\begin{aligned}\Delta LS_{i} & = \beta_{L,0} + \beta_{L,1}\Delta Crowding_{i} +\varvec{\beta}_{L,2}^{'} (\Delta Crowding_{i} \varvec{Usertype}_{i} ) +\varvec{\beta}_{L,3}^{'} \varvec{Usertype}_{i} \\ & \quad + \beta_{L,4}\Delta Stress_{i} + \beta_{L,5}\Delta Health_{i} +\varvec{\beta}_{L,6}^{'} \varvec{events}_{\varvec{i}} +\varvec{\beta}_{L,7}^{'}\Delta \varvec{Control}_{\varvec{i}} + e_{L,i} , \\ \end{aligned}$$
(1)
$$\Delta Health_{i} = \beta_{H,0} + \beta_{H,1}\Delta Stress_{i} +\varvec{\beta}_{H,2}^{'} \varvec{Usertype}_{i} +\varvec{\beta}_{H,3}^{'} \varvec{events}_{\varvec{i}} +\varvec{\beta}_{H,4}^{'}\Delta \varvec{Control}_{\varvec{i}} + e_{H,i} ,$$
(2)
$$\begin{aligned}\Delta Stress_{i} & = \beta_{S,0} + \beta_{S,1}\Delta Crowding_{i} +\varvec{\beta}_{S,2}^{'} (\Delta Crowding_{i} \varvec{Usertype}_{i} ) +\varvec{\beta}_{S,3}^{'} \varvec{Usertype}_{i} \\ & \quad +\varvec{\beta}_{S,4}^{'} \varvec{events}_{\varvec{i}} +\varvec{\beta}_{S,5}^{'}\Delta \varvec{Control}_{\varvec{i}} + e_{S,i} , \\ \end{aligned}$$
(3)
$$\Delta Crowding_{i} = \beta_{C,0} +\varvec{\beta}_{C,1}^{'} \varvec{events}_{\varvec{i}} +\varvec{\beta}_{C,2}^{'}\Delta \varvec{Control}_{\varvec{i}} + e_{C,i}$$
(4)
$$\varvec{Usertype}_{\varvec{i}} = \left[ {\begin{array}{*{20}c} {\begin{array}{*{20}c} {Bus_{15,i} } \\ {Car_{15,i} } \\ \end{array} } \\ {\begin{array}{*{20}c} {MC_{15,i} } \\ {BC_{15,i} } \\ {FT_{15,i} } \\ \end{array} } \\ {\begin{array}{*{20}c} {NC_{15,i} } \\ {Switched_{i} } \\ \end{array} } \\ \end{array} } \right] = \left[ {\begin{array}{*{20}c} {\begin{array}{*{20}c} {\beta_{Bus,0} } \\ {\beta_{Car,0} } \\ \end{array} } \\ {\begin{array}{*{20}c} {\beta_{MC,0} } \\ {\beta_{BC,0} } \\ {\beta_{FT,0} } \\ \end{array} } \\ {\begin{array}{*{20}c} {\beta_{NC,0} } \\ {\beta_{Switched,0} } \\ \end{array} } \\ \end{array} } \right] + \left[ {\begin{array}{*{20}c} {\begin{array}{*{20}c} {\varvec{\beta}_{Bus,1}^{'} } \\ {\varvec{\beta}_{Car,1}^{'} } \\ \end{array} } \\ {\begin{array}{*{20}c} {\varvec{\beta}_{MC,1}^{'} } \\ {\varvec{\beta}_{BC,1}^{'} } \\ {\varvec{\beta}_{FT,1}^{'} } \\ \end{array} } \\ {\begin{array}{*{20}c} {\varvec{\beta}_{NC,1}^{'} } \\ {\varvec{\beta}_{Switched,1}^{'} } \\ \end{array} } \\ \end{array} } \right]\varvec{events}_{i} + \left[ {\begin{array}{*{20}c} {\begin{array}{*{20}c} {\varvec{\beta}_{Bus,2}^{'} } \\ {\varvec{\beta}_{Car,2}^{'} } \\ \end{array} } \\ {\begin{array}{*{20}c} {\varvec{\beta}_{MC,2}^{'} } \\ {\varvec{\beta}_{BC,2}^{'} } \\ {\varvec{\beta}_{FT,2}^{'} } \\ \end{array} } \\ {\begin{array}{*{20}c} {\varvec{\beta}_{NC,2}^{'} } \\ {\varvec{\beta}_{Switched,2}^{'} } \\ \end{array} } \\ \end{array} } \right]\varvec{control}_{{\varvec{15,i}}} + \left[ {\begin{array}{*{20}c} {\begin{array}{*{20}c} {e_{Bus,i} } \\ {e_{Car,i} } \\ \end{array} } \\ {\begin{array}{*{20}c} {e_{MC,i} } \\ {e_{BC,i} } \\ {e_{FT,i} } \\ \end{array} } \\ {\begin{array}{*{20}c} {e_{NC,i} } \\ {e_{Switched,i} } \\ \end{array} } \\ \end{array} } \right],$$
(5)

where ΔLS, ΔHealth, ΔStress, and ΔCrowding are the changes in life satisfaction, health, stress, and dissatisfaction with in-vehicle crowding, respectively; Control is a vector of control variables; i refers to the respondents; L, H, S, and C are the initial letters of dependent variables in the equations; β refers to an unknown parameter; and e refers to an error term. A list of all of the explanatory variables and explained variables in our specification is shown in Table 8 in the “Appendix”.

Equations (1)–(4) are first-difference equations in which the time-invariant unobserved effects have been eliminated. To obtain consistent estimators of the coefficients, the strict exogeneity of the independent variables must be satisfied. This means that if any independent variables are correlated with the time-varying unobserved factors for any time period, the consistency will be violated.

The reason we use the differencing equations is to address the potential issues of endogeneity and self-selection, both of which will violate the consistency of estimators. For example, there may be some omitted variables that affect both stress and dissatisfaction with crowding. There may also be unobserved factors such as travel attitudes that affect the probability of being a long/short-term train commuter, stress, health and life satisfaction. Assuming that both omitted factors and determinants of self-selection are time-invariant unobserved effects, a first-difference equation can eliminate such unobserved effects.Footnote 2

Following the method of setting up a first-difference equation, we include time-varying factors (e.g., commuting time and income) and dummy variables for the event occurring between the two time periods (e.g., a modal shift, job change, or move) and exclude time-invariant characteristics (e.g., gender and education year). We also exclude age as an independent variable in the first-difference equation because ΔAge with a 1-year change equals 1 among all respondents; then, the effect of age is contained in the y-intercepts.

We assume that the effects of ΔCrowding differ by user type because of adaptation. As shown in Eq. (5), Usertype is a vector of dummy variables. Bus15 equals 1 if a respondent used the bus as his or her main commuting mode in 2015. Similarly, Car15, MC15, BC15, FT15, and NC15 refer to car commuters, motorcycle (MC) commuters, bicycle (BC) commuters, on-foot (FT) commuters, and noncommuters (NC), respectively, in the previous year. We include the information about the previous commuting mode because a satisfaction rating is affected by a change compared with the reference point (Abou-Zeid et al. 2012). Our assumption is that commuters who start commuting by train feel a certain level of crowding differently depending on their reference points based on their previous commuting modes. In Eqs. (1) and (3), the interaction terms between ΔCrowding and Usertype are included. We recognize a respondent as a short-term train commuter if any one of these dummy variables equals 1. If a respondent’s main commuting mode was the train in 2015, no dummy variable equals 1. Therefore, βL,1 and βS,1 refer to the effects of ΔCrowding for long-term train commuters, and βL,2 and βS,2 describe the differences in the effects between long-term and short-term train commuters.

We also include Switched in Usertype, a dummy variable that equals 1 if a commuter commuted by train in 2015 but switched to another mode in 2016, to test whether a potential selection bias exists. There is a possibility that commuters with strong dissatisfaction with crowding have switched from train commuting to other modes. If this is the case, the short-term group may contain a larger proportion of highly dissatisfied users who have not yet switched to another mode than the long-term group, and this may be another reason for the difference in the effects of ΔCrowding. To explore this possibility, we also estimate the difference in the effects of ΔCrowding between long-term train commuters and commuters who switched to another mode of transport. If the effects of ΔCrowding are more negative among the switched group than among the long-term group, being freed from the dissatisfaction with crowding will improve the switchers’ life satisfaction, which implies that strong dissatisfaction with crowding is one of the reasons for switching to other modes.

The set of control variables, Control, consists of commuting time and sociodemographic factors. We include the change in commuting time (ΔCommuting time), a natural log of household income (Δln(Household income)), and household size (ΔHousehold size) as explanatory variables in Eqs. (1), (2), (3), and (4). We apply the logarithmic function of income, which is frequently adopted in studies of subjective evaluations of satisfaction and health (Jones and Wildman 2008; Stutzer 2004). In terms of Eq. (5), because Usertype is a set of dummy variables showing whether commuters shifted their major commuting mode between 2015 and 2016, age, gender, education year, household income in 2015, household size in 2015, and car ownership in 2015 are included as determinants of the modal shift. In contrast to Eqs. (1) through (4), household income is specified as linear in Eq. (5) because previous studies on modal choice typically adopted the linear function of income as an explanatory variable (e.g., Belgiawan et al. 2016; Clark et al. 2016; Ding et al. 2017). The set of dummy variables of life events that occurred between 2015 and 2016 (events) is included as explanatory variables in all the equations.

Maximum likelihood estimation, the most frequently used estimator, is applied. The maximum likelihood method in SEM can be performed under the assumption that all dependent and independent observed variables in the model follow a multivariate normal distribution.Footnote 3 In practical situations, the multivariate normality of data is often violated (Micceri 1989). However, the maximum likelihood approach is recommended because of its advantage in terms of robustness against violations of the multivariate normality assumption when the sample size is sufficiently large (Golob 2003; Satorra 1990).

Data

We use two-wave panel data from a nationwide survey that we conducted over the Internet in Japan in 2015 and 2016. The sample was randomly collected from a research company’s nationwide web panel. This survey investigated individuals’ SWB and its determinants by asking a wide range of questions related to their socioeconomic characteristics, environmental preferences, transportation environment and commuting form.

We received 247,000 responses in 2015; among them, 131,000 individuals continued to participate in the 2016 survey. From this dataset, we extract 19,835 respondents who selected Tokyo as their prefecture of residence, representing 0.15% of all inhabitants in Tokyo. Among them, 14,187 respondents are workers.

From the sample of workers in Tokyo, we extract the respondents who used trains as a typical commuting mode. The respondents were asked, “Which transportation mode do you usually use when commuting?” and selected relevant responses from thirteen transportation modes. From the responses to this question, we identify train commuters who selected “train” as our target sample.

The aim of this study is to investigate the difference in the effects of dissatisfaction with in-vehicle crowding by how long the commuters have been using trains. We use the information about commuting modes in the previous year to identify accustomed and non-accustomed train commuters. Prior to performing the quantitative analysis, we categorized the respondents according to the following criteria:

  1. (1)

    Commuters who used a train as a main commuting mode in 2015 and continued to commute by train in 2016;

  2. (2)

    Commuters who used other forms of transportation as a main commuting mode in 2015 and switched to train commuting in 2016;

  3. (3)

    Respondents who were not commuters in 2015 but started to commute by train in 2016;

  4. (4)

    Commuters who used trains as a main commuting mode in 2015 and switched to other commuting modes in 2016 without changing their residence.

We identify the respondents who satisfy condition (1) as “long-term train commuters” (6971 observations) and the respondents who satisfy conditions (2) or (3) as “short-term train commuters” (1076 observations). We additionally derive the respondents who meet condition (4) and identify them as “commuters who switched from train to another mode” (249 observations) to test the existence of the selection problem that we mentioned in the previous section. Because the number of observations for analysis is insufficient, we exclude commuters who reported that their main commuting mode was the express bus, taxi, or another transportation mode from our analysis. In addition, the observations of the respondents who responded with “do not know” or “do not want to answer” regarding their income and/or education level are deleted. This leaves us with 8296 observations for our analysis.

Table 1 shows the average age, percentage of females, average commuting time and average household income in both our survey and government statistics to assess the sample representativeness of our survey. In our survey, male commuters are slightly overrepresented compared to their proportion in the worker population in Tokyo.

Table 1 Representativeness of the sample of workers in Tokyo

Table 2 shows the characteristics of the sample and the percentages of the categories or mean values of the variables reported by the respondents that we include in our analytical model. The rightmost column shows the statistical significance of the difference in the means (t test) or proportions (z test) between the long-term group and the other two groups.

Table 2 Characteristics of the three groups of train commuters

Dissatisfaction with in-vehicle crowding The respondents were asked, “Please select all items that you are dissatisfied with about transportation in your living environment.” We identify the respondents who selected “crowding inside of public transportation” as train commuters who are dissatisfied with in-vehicle crowding. The percentage of dissatisfied commuters is significantly higher among the long-term group than among the short-term and switched groups, which means that the long-term train commuters are more likely to be dissatisfied with in-vehicle crowding than the other groups.

Stress and health The respondents also reported the frequency of feeling a high level of stress and a low level of stress on a 5-point scale: 1 “hardly ever” to 5 “very often”. We average the frequencies of high and low levels of stress and include them as a stress variable in the analytical model. In addition, we derive the level of the respondents’ health from their answers to the question “All in all, how would you describe your state of health?” on a 5-point scale from 1 “very poor” to 5 “very good”. No significant differences in the means of these variables were observed among the groups.

Life satisfaction We use the indicator of life satisfaction as a response variable. Life satisfaction is based on the question, “Overall, how satisfied are you with your life?” The responses range on a scale from 1 “completely dissatisfied” to 5 “completely satisfied”. The mean value of life satisfaction for the switched group is significantly lower than that for the long-term group.

We use three types of control variables: travel attributes, sociodemographic factors and major life events from the previous year.

Travel attributes We use respondents’ commuting time as one of the variables for travel attributes in Eqs. (1) through (4). The respondents reported their usual one-way commuting time in increments of ten minutes. The average commuting time is lower among commuters who switched from trains to other modes than among the long-term group. The short-term group also has a lower average commuting time than the long-term group. Respondents also reported their commuting cost and commuting allowance. We calculate the net commuting cost by subtracting the amount of reported commuting allowance from the amount of reported commuting cost. Nominal and net commuting costs are shown only to describe the sample characteristics and were not included in our analytical model because this variable is available only for data from the 2016 survey. While the average nominal cost for the switched group is lower than that for the long-term group, the average net cost is higher among the short-term group than among the long-term group. The potential reason for this difference is that commuting allowance may be lower or there may be more workers who do not receive a commuting allowance in the short-term group.

Sociodemographic and attitudinal factors We include household income and household size as control variables in Eqs. (1) through (5). In addition to these variables, age, gender, education year, car ownership, and attitude toward crowding are included in Eq. (5). Attitude toward crowding is a dummy variable that equals 1 if a respondent checked the item, “I want to avoid crowding and congestion even if I spend money and time to do so”.

Major life events from last year The respondents reported major life events that had occurred within the past year. We then use 12 dummy variables to indicate whether the respondents had experienced each life event within the one-year period. In particular, work-related life events are controlled because these factors may be correlated with whether respondents are long-term train commuters. Divorce, traffic accidents and a death in the family may be triggers of other events, such as a job change or modal shift. Purchasing a new house or a condominium may also be correlated with the change in commuting time.

The percentage of female commuters is higher among the short-term group than among the other groups. In addition, the short-term and switched groups have a lower average education year and lower average household income than the long-term group. These characteristics may be explained by some sociodemographic trends. In Japan, the proportion of permanent regular workers is lower for female workers than for male workers (Kawaguchi and Ueno 2013). The group of nonpermanent workers also has a high proportion of individuals with lower years of education (Honda 2004). Additionally, nonpermanent workers might be more likely than permanent workers to switch their commuting mode frequently because of a job change. The correlations among these factors may be the determinants of the differences in sociodemographic variables among the groups.

Table 3 summarizes the changes in the variables. For the long-term group, the change in dissatisfaction with crowding might have been caused by a change in the level of crowding for some reason, such as a change in commuting hours (from or to rush hour). For the short-term and switched groups, the major reason for the change in dissatisfaction with crowding appears to have been the change in the commuting mode. Moreover, the summary shows that commuting time increased for the group of commuters who began train commuting, while commuting time decreased for the group who quit train commuting. In terms of the other variables, there were no substantial differences among the groups. In our estimation, we average the change in high and low stress levels and used them as an explanatory variable.

Table 3 Summary of changes in the variables

Results

Overall tendency of direct and indirect effects

First, we investigate the overall significance of the direct and indirect effects of dissatisfaction with crowding without consideration of the user types. We compared the three suggested models shown in Fig. 2 to test whether including the set of direct effects and the set of indirect effects will significantly improve the model fit. Model 1 includes only direct effects, Model 2 includes only indirect effects through stress and health, and Model 3 includes both direct and indirect effects. We used the following goodness-of-fit indices, which have been frequently used in previous studies: the Chi square over degrees of freedom, the standardized root mean residual (SRMR), the comparative fit index (CFI), the root mean square error of approximation (RMSEA) and Akaike’s information criterion (AIC). Next, we interpreted the coefficient estimates and compared the estimated effect between long-term and short-term train commuters. Finally, we calculated the indirect effects of dissatisfaction with crowding from the direct effects.

Fig. 2
figure 2

Suggested models for comparison (only related variables and paths are shown)

According to the values of the goodness-of-fit indices, in general, the models fit the data well. Table 4 shows the most frequently used fit indices for each model and a cutoff value for each index (see, for instance, Brown 2006; Mohd Mahudin et al. 2012; Ye and Titheridge 2017). Significant Chi squares were observed for all of the posited models. Because these indices are influenced by the large sample size (n = 8296), we looked at other fit indices that can take a large sample size into account. SRMR, CFI, and RMSEA showed that the models were acceptable.

Table 4 Goodness-of-fit indices of the suggested models

We showed that Model 3 fit significantly better than Model 1 according to the likelihood-ratio (LR) tests. We conducted LR tests because the differences in the values of goodness of fit were too trivial to decide which model was the most suitable. The rightmost column of Table 3 shows the results of the LR tests of Model 1 and Model 2 compared to Model 3. Model 3 was not significantly different from Model 2 in terms of goodness of fit, which implies that there were no overall direct effects of dissatisfaction with crowding. However, this procedure cannot clarify the difference in the significance of the direct effects by the user types in detail. To take the possible difference into account, we adopt Model 3 to perform the estimation considering differences among the groups in detail. The estimation results of Model 3 are described in the next subsection. The results of Models 1 and 2 are shown in Tables 9 and 10 in the “Appendix”.

Model estimation

The results of the hypothesized model estimation, including standardized coefficients and significance, are shown in Table 5. Standardized coefficients are used to make the interpretation of the estimated effects comparable between independent variables (e.g., You 2017). Standardized coefficients describe the change in a dependent variable when an independent variable increases by a standard deviation. Therefore, the coefficients can be interpreted as the relative importance of different independent variables.

Table 5 Standardized coefficient estimates of the hypothesized model

Regardless of whether the respondents were long-term or short-term train commuters, there was a significant relationship between crowding and life satisfaction mediated by stress and health (1st to 10th rows of Table 5). Stress was significantly predicted by crowding and significantly predicted health. These results suggest that dissatisfaction with in-vehicle crowding is associated with a higher frequency of stress and that a higher frequency of stress induces a lower level of health. In addition, both stress and health have significant effects on life satisfaction; a high frequency of stress significantly lowers life satisfaction, and a high level of health significantly increases life satisfaction. Because none of the interaction terms between ΔCrowding and the dummy variables of short-term train commuters were significant, there was no difference in the effect of dissatisfaction with crowding on stress between long-term and short-term train commuters. Similarly, there was no significant difference in the stress-related effects between long-term train commuters and commuters who switched from a train to other travel modes.

We found a difference in the direct effects of dissatisfaction with crowding on life satisfaction depending on whether respondents were long-term or short-term train commuters (1st to 8th rows of Table 5). The significance levels of ΔCrowding and the interaction terms indicate that dissatisfaction with in-vehicle crowding induced a lower level of life satisfaction among short-term train commuters who originally commuted by taking a bus, driving a car or walking in the previous year, whereas this was not the case among long-term train commuters; crowding did not have significant effects on life satisfaction for those who commuted by train for a longer period. Dissatisfaction with crowding had no significant effects directly on life satisfaction among the switched group, similar to long-term train commuters.

The results of the coefficients of Usertype dummies indicate that life satisfaction increased if the respondents were short-term train commuters who commuted mainly by foot and that health status increased if the respondents were not commuters in 2015 (11th to 17th rows of Table 5).

Commuting time significantly increases the level of stress and decreases the levels of health and life satisfaction (18th row of Table 5), whereas higher income results in better health status and a higher degree of life satisfaction (19th row). While a job change or promotion increases the likelihood of being dissatisfied with crowding, some life events result in a better situation in terms of stress, health and life satisfaction (28th to 39th rows). However, a divorce causes stress, and experiencing a promotion results in a lower level of health.

Sociodemographic factors are the determinants of modal change and of becoming a short-term train commuter (21st to 27th rows of Table 5). Major life events are also significant predictors (28th to 39th rows). In particular, the results show that work-related life events and moving are major triggers to shifting one’s commuting mode.

Direct and indirect effects of dissatisfaction with in-vehicle crowding

We calculated the indirect effects of dissatisfaction with crowding on life satisfaction from the estimates of direct effects. The direct and indirect effects by group were calculated using the estimated coefficients shown in Table 5 by the following equations:

$$Direct_{j} = \beta_{L,1} + \beta_{L,2,j} ,$$
(6)
$$Indirect_{j} = \left( {\beta_{S,1} + \beta_{S,2,j} } \right) \times \beta_{L,4} + \left( {\beta_{S,1} + \beta_{S,2,j} } \right) \times \beta_{H,1} \times \beta_{L,5} .$$
(7)

Here, j refers to the following groups: the group of long-term train commuters; the 6 groups of short-term train commuters divided according to their previous commuting modes; and the group of commuters who switched from trains to other commuting modes. Before the calculation, we assumed that the statistically insignificant coefficients equaled zero. If a coefficient of an interaction term between dissatisfaction with crowding and a certain user type is insignificant, we recognize the effect for that user type as identical to that for long-term train commuters.

Table 6 shows the results of the calculation. A negative indirect effect of dissatisfaction with crowding on life satisfaction existed regardless of the types of users. On the other hand, direct effects existed among only some subgroups of short-term train commuters. Because a direct effect did not exist for the long-term train commuters, these effects were expected to disappear due to adaptation. In addition, because no significant direct effect was shown for the switched group, we did not observe the existence of the selection effect, the possibility that commuters experiencing strong effects from dissatisfaction with crowding switched from commuting by train to another mode.

Table 6 Standardized direct and indirect effects of dissatisfaction with in-vehicle crowding on LS

Prefectural differences in Japan

Our analysis focused only on Tokyo, where both the train commuting rate and the level of in-vehicle crowding in trains are the highest in Japan; thus, the situation might differ in other cities depending on the city characteristics related to train commuting. We conducted a supplementary analysis in which our SEM model was applied to other prefectures in Japan. Figure 3 shows the rates of train commuting by prefecture, which were calculated using our survey data. Train commuting rates were especially high in the following three metropolitan areas: Greater Tokyo (including the prefectures around Tokyo prefecture), Keihanshin (around Osaka), and Chukyo (around Nagoya).

Fig. 3
figure 3

Train commuting rates by prefecture with the name of the major cities in Japan

Figure 4 and Table 7 show the results of the prefectural analysis. Although the sample of train commuters was too small to analyze in most of the prefectures, the model fit the data well for Saitama, Chiba, Aichi, Osaka, and Hyogo prefectures. All these prefectures belong to three metropolitan areas. No prefectures except for Tokyo had both direct and indirect negative effects. We found a negative direct effect of dissatisfaction with crowding on life satisfaction among only short-term train commuters in Hyogo and a negative indirect effect of dissatisfaction with crowding on life satisfaction among all train commuters in Saitama. There was also a negative indirect effect among only short-term train commuters who previously commuted by car in Chiba. Moreover, there were positive direct effects of dissatisfaction with crowding among short-term train commuters in Saitama. The coefficients may have been affected by some outliers because of the somewhat small size of the subsample of short-term train commuters in Saitama (n(MC15 = 1) = 16).

Fig. 4
figure 4

Effects of dissatisfaction with in-vehicle crowding on LS by prefecture in Japan

Table 7 Direct and indirect effects of dissatisfaction with in-vehicle crowding on LS by prefecture in Japan

Negative effects of dissatisfaction with crowding were also found in other prefectures, similar to the case of Tokyo. In Saitama, regardless of the types of users, dissatisfaction with in-vehicle crowding had indirect negative effects on life satisfaction through stress and health. In Hyogo prefecture, which encompasses the sixth largest city in Japan, namely, Kobe City, the direct effects exist only for those who switched from commuting by car, bus, or walking to train commuting, but the effects disappeared with habituation, similar to the phenomenon in Tokyo.

In summary, there was generally no effect of dissatisfaction with crowding among commuters in the Keihanshin and Chukyo areas. Although there were direct effects on some commuters, these effects disappeared with habituation. In the greater Tokyo area, there were indirect effects of dissatisfaction with crowding through stress and health, which did not disappear with habituation.

In addition, the greater Tokyo area has a high proportion of dissatisfied train commuters. Figure 5 presents the average congestion rate of the major railway in each metropolitan area and the percentage of train commuters dissatisfied with in-vehicle crowding in each prefecture. According to this figure, train commuters seem more likely to be dissatisfied with in-vehicle crowding if they are in areas with higher congestion rates on the railway. This is especially true of the greater Tokyo area. On the basis of our results, in-vehicle crowding in the greater Tokyo area of Japan should be considered more seriously.

Fig. 5
figure 5

Average congestion rate of major railways in three metropolitan areas (MLIT 2018) and the percentage of train commuters dissatisfied with in-vehicle crowding (only for the prefectures shown in Table 7)

Despite the high level of in-vehicle crowdedness of trains, commuters in Osaka and Aichi (encompassing Nagoya) did not experience significant effects of dissatisfaction with crowding. There might be other determinants of whether dissatisfaction with crowding has a substantial effect, such as the personality of commuters. New interesting findings might be obtained by further analysis including other determinants of regional differences such as personality variables.

Discussion

This study has the following two main findings: (1) Dissatisfaction with in-vehicle crowding directly lowers life satisfaction among short-term train commuters who previously commuted by bus, car or walking. (2) Dissatisfaction decreases life satisfaction indirectly through stress and health regardless of the types of transportation used. In our sample, direct effects were found on 50% of short-term train commuters but not on long-term train commuters. These results implied the possibility that the direct effects will disappear after commuting by train for a long time.

Why the direct effects disappear can, perhaps, be explained by the effects of the productive use of travel time. In the digital age, commuters can easily spend their in-vehicle time working, messaging, entertaining themselves, etc. by using electronic devices such as smartphones and laptop computers. Engaging in such activities makes commuting time more productive, thus reducing the disutility of travel time (Gripsrud and Hjorthol 2012; Wardman and Lyons 2016). Accordingly, commuters may adapt to in-vehicle crowding by being productive during the time spent commuting. Hence, all or part of the reduction in the direct effects can be attributed to the diminution of the disutility of travel on crowded train by using the travel time productively.

We did not observe a selection effect, the possibility that commuters experiencing strong direct effects from dissatisfaction with crowding have switched from commuting by train to another mode. In Tokyo, while 64% of workers commute mainly by train, only 3% of workers commute mainly by bus, and 8% commute by car (calculated using our survey data). This situation arises because of the inconvenience of all travel modes other than trains, such as the high parking price and the poor bus route options. Because of the low availability of other travel modes in Tokyo, dissatisfaction with crowding may not have substantial effects on mode switching.

Despite the inconvenience of other travel modes in Tokyo, it is possible that commuters will be dissatisfied with crowding on trains when they shift from other modes to rail transportation. There is evidence that commuters tend to overestimate the effects of monetary aspects and underestimate the burden of physical and mental stress from commuting when they choose their commuting modes (Frey and Stutzer 2014). Based on these insights, we can assume that commuters underestimate the discomfort of in-vehicle crowding before a shift to rail, which would bring them stressful experiences of crowding that cannot be fully compensated by being freed from other types of inconvenience, such as high parking prices.

Our estimation implied that among short-term train commuters, the previous commuting mode determines whether dissatisfaction with crowding has direct effects on life satisfaction. The possible reason is that the past commuting experience might form an individual’s reference point about commuting, and then, the commuter evaluates a new commuting method on the basis of the reference point. Car commuters may think that they can commute for long distances without any exposure to in-vehicle crowding. With this reference point, they may feel strongly dissatisfied with circumstances in which they must commute while dealing with serious in-vehicle crowdedness if they switch from a car to trains. For commuters who regularly walk to their workplace over relatively short distances with some flexibility, it will seem less flexible and stressful to commute on a crowded train for intermediate or long distances. On the other hand, new train commuters who were previously noncommuters might perceive in-vehicle crowding without a strong preconception because they do not seem to have a concrete reference point about commuting formed from daily experiences of regular commuting.

If this interpretation is correct, the direct effects of dissatisfaction with crowding can be regarded as the gap between a commuter’s reference point and a new commuting situation immediately after switching their commuting mode. Furthermore, adaptation may also be defined as a change in the commuting reference point after a change in the commuting mode. However, it is difficult to explain the direct effects on commuters who previously commuted by bus, a similar travel mode to a train. Further analysis on the reference point for commuting would be useful for testing this assumption.

The main results of this study indicate that even if direct effects on SWB do not exist, the effects on people’s health and stress subsist among not only short-term but also long-term train commuters. To avoid underestimating the value of the reduction in crowding, policy makers should also focus on indirect effects via stress and health conditions.

Most of the estimated effects of the main and control variables are consistent with those found in previous studies. The indirect effects of dissatisfaction with crowding are consistent with studies that investigated the relationship among crowding, stress and health (Cox et al. 2006; Mohd Mahudin et al. 2011, 2012). The results regarding commuting time are consistent with studies that found negative effects of commuting time on stress (Evans and Wener 2006), health (Christian 2012), and life satisfaction (Stutzer and Frey 2008). Our estimation showed similar results for the effects of income on health (Jones and Wildman 2008) and SWB (Stutzer 2004).

Although we found significant effects of commuting time on life satisfaction, previous studies, as well as our supplementary analysis, have inferred that the influence of commuting time differs from that of the previous commuting mode. Stutzer and Frey (2008) showed that a longer commuting time induces poorer life satisfaction by using data on workers that comprised a larger proportion of car commuters compared to our data. Wardman (2004) indicated that the value of time varies depending on travel modes, which leads to a variation in the marginal utility of time by modes. To test this variation, we conducted some supplementary analysis and investigated the variation in the effects of a commuting time change by the previous commuting modes. We eventually found that the respondents who commuted mainly by car in 2015 might be more sensitive to a commuting time change than the other respondents (Table 11, “Appendix”). While this supplementary model provided some interesting insight about the commuting time, the reliability seems somewhat poor because of the relatively low suitability of the model (χ2/df = 5.866 > 5; CFI = .866 < .90).

We also found inconsistent effects of sociodemographic variables on commuting mode in the previous year compared to existing studies of modal choice. For instance, our results show that travelers from smaller households are likely to have used a car for commuting in the previous year, while previous studies have found that travelers from larger households tend to commute by car (Ding et al. 2017). This different result may come from the characteristics of our sample that are different from those of previous studies: the choice of a regular commuting mode might be different from the commuting mode just before the mode shift.

Our findings have an implication for the difference in crowding cost estimation between SP and RP studies. There is some evidence that SP studies overestimate the cost of crowding, which we mentioned in the first section of the paper. This overestimation could be partially explained by adaptation. The possibility that respondents evaluated the condition of a travel mode that was different from what they regularly used was greater in SP studies than in RP studies, which focus on the travel mode that respondents actually use. Evaluating an unfamiliar in-vehicle condition may cause an overestimation of the effect of crowding in the SP studies. More studies should be conducted with RP methods, including sufficient individual characteristics for crowding cost estimation and for investigating adaptation to crowding.

Even if the direct effects were eliminated, there is a possibility that the effects will reappear due to some external cause such as an epidemic disease. When a communicable disease is being spread, people may become more sensitive to crowded situations, and thus the effects of dissatisfaction with crowding increase. In such an event, even a person who had once been accustomed to crowding will possibly suffer the direct effects again. In other words, the value of a reduction in crowding increases only temporally. Such a situation has become a reality due to the outbreak of the 2019 coronavirus disease (COVID-19), which emerged in Wuhan, China and has spread throughout the world. In fact, there are reports that some passengers contracted COVID-19 while traveling by public transport (e.g., Liu and Zhang 2020). It is also reported that at the beginning of May 2020, more than 80% of the Japanese population were worried about contracting COVID-19 (YouGov 2020).

When the epidemic disease is finally eliminated in Japan, the level of commuter dissatisfaction with crowding will likely remain higher than it was before the disease given that the concept of social distancing has become prevalent throughout the country. That said commuters behavior will partially, if not completely, return to what it was prior to the outbreak. Although the percentage of people in Tokyo who stopped commuting by train reached a peak of approximately 60% at the end of April 2020 (e.g., Tokyo Metropolitan Government 2020), commuters in Tokyo gradually began to return to normal in May 2020 (Asahi Shimbun Company, 2020). It is expected that even if the effects of dissatisfaction with crowding become stronger due to the epidemic, there will be little change in the behavioral level among commuters once the epidemic has passed.

Our study has some limitations. First, there is a possibility that reverse causality caused an endogeneity problem. The possible hypotheses are, for example, that a high level of stress causes a high probability of being dissatisfied with crowding or that low life satisfaction decreases an individual’s health conditions. The first-difference estimation that we adopted can address only the endogeneity problem originating from time-invariant omitted variables. Therefore, another method, such as the instrumental variable approach, should be adopted in analyses to address potential reverse causality in the future, which could not be addressed in this study due to data availability.

Another limitation is the criterion used to classify the respondents as long-term or short-term train commuters. Our data cannot identify how many years are necessary to reduce the direct effects of dissatisfaction by adaptation. This study focused only on whether the respondents had commuted by train for at least one year. Longer-term surveys that track commuters for additional years can more precisely analyze how long it takes for commuters to become accustomed to commuting by train.

In addition, the extent of adaptation might be determined by factors such as the frequency of use, the objective level of crowding, past experiences with crowding in transportation, and how long commuters have not used such travel modes. Moreover, we cannot classify the adaptation that we found in the analysis as psychological or behavioral. Some commuters may “adapt” to train commuting by changing the time that they leave home and by avoiding rush hour.

One of the possible determinants that was not included but should be discussed is the probability of securing a seat on the train. The value of time savings for train commuters has been determined to be greater when they are standing as compared to when they are than sitting (Wardman and Whelan 2011). For train commuters who largely prefer sitting to standing, dissatisfaction with in-vehicle crowding would be mitigated if they could secure a seat. Recently, some railway companies have introduced reservation services that allow passengers to secure their seats during rush hour by paying an extra fee (Mainichi Newspapers, 2016). Such services have great potential for re-optimization among those commuters who are sensitive to standing and crowding, and accordingly, the level of dissatisfaction due to crowding can be reduced among train commuters as a whole.

In conclusion, this study has implied the possibility that adaptation can reduce the direct effects of dissatisfaction with in-vehicle crowding on SWB, which might not be the case for indirect effects through stress and health.