Introduction

Travel is considered a derived demand. Few people travel for the sake of travelling. People must travel for activities at their destination, such as working in an office or studying at school. Therefore, people usually have a negative utility for the time spent travelling. Long travel distances and travel times are a serious social problem in Japan. For people living in Tokyo and its suburbs, commuting to work or school takes an extremely long time. For example, the average time to commute to and from office or school (averaged over commuters) is 107 min per day in 2016 for men living in Kanagawa Prefecture, which is adjacent to Tokyo (e-Stat 2023). These long-distance travellers primarily use rail, which allows them to enjoy a wider range of activities while commuting. Reading books and newspapers has been a popular activity, but recent technological advancements have expanded the range of activities that can be carried out while travelling (Pawlak 2020). Engaging in meaningful activities while travelling can reduce the negative utility. According to Wardman et al. (2020), the value of travel time has reportedly decreased due to the more meaningful use of travel time. This is an important factor in project evaluation (Wardman and Lyons 2016). Additionally, railway companies may invest in creating environments that promote such meaningful activities. Therefore, the present analysis is strongly linked to practice.

Keseru and Macharis (2018) strongly argued for investigating the historical changes in the onboard activities of rail passengers, but this is difficult. Few travel surveys in Japan have asked rail passengers about their onboard activities, so the authors relied on other sources of information, including academic and non-academic articles that have reported the number of passengers who were directly observed by surveyors to have engaged in activities on trains. Although this information spanned the period between 1983 and 2013, the period covered by each source was limited. Therefore, the present analysis utilises meta-analysis by combining several independent studies. Furthermore, the present analysis used YouTube videos as a data source. While observing passengers’ activities in the past is impossible, it can be done retrospectively by utilising photo or video archives. The use of YouTube videos extended the period of the analysis up to 2019. Certain types of travel-based multitasking, such as the use of mobile phones, have increased over time, but looking at trends still has value. Even if some general trends are obvious, specific figures and trends are crucial for understanding the phenomenon from a statistical point of view. To the best of the authors’ knowledge, no other studies have conducted a meta-analysis of rail-passenger activities. The meta-analysis in this study provides insights into previously obscure behaviours.

The main interest of this study is to demonstrate whether a particular perception is valid: People feel something was true in the past, but it is no longer true. For the example of car ownership, this statement is translated as follows: People feel that car ownership was low in the past, but it is high now. People’s feelings about car ownership can be expressed as a proportion of car ownership (e.g., the number of cars per household or per head). Past and present express the passage of time. It is, therefore, natural to try to explain the proportion of car ownership by the passage of time (e.g., years). In the context of the present study, people feel that travel-based multitasking in Japan was like ‘that’ in the past, but it is like ‘this’ now. Therefore, an attempt is made to explain the percentage of passengers engaged in each type of travel-based multitasking by the years. Factors affecting the choice of travel-based multitasking (socio-economics and trip characteristics) are of interest for researchers, including the present authors. Although not the main interest of the present study, ideas also are presented for research methods that use directly observed data to explain the possible social factors behind longitudinal variation in travel-based multitasking.

The rest of this paper is structured as follows. The “Background” Section presents the background of the study. The “Methodology” Section describes the methods, and the “Results” Section reports the results and the abovementioned ideas. Finally, the “Discussion and conclusions” Section discusses the results and ideas with reference to two longitudinal studies and presents the conclusions.

Background

This section discusses some important topics relating to the present analysis with reference to previous studies. The “Questionnaires versus direct observation” Section discusses the data collection methods and explains why the present analysis uses data collected only through direct observation. The “Longitudinal studies” Section reviews studies that have analysed travel-based multitasking over the years. The “YouTube videos as retrospective direct observation” Section reviews literature that used photo or video archives to retrospectively observe events in the past. The “Some historical events relating to onboard activities in Japanese railways” Section reviews historical events that are closely related to travel-based multitasking by rail passengers in Japan.

Questionnaires versus direct observation

Two of the most commonly used data collection methods for travel-based multitasking are questionnaires and direct observation (Keseru and Macharis 2018).Footnote 1 The questionnaire method asks respondents to report their activities while travelling, while the direct observation method places a person on board a train to observe and record the activities of the passengers. Cost-effectiveness in data collection is a huge advantage for the questionnaire method, while direct observations incur huge costs in both time and money.

Table 1 contrasts the two methods. Direct observation has three advantages. First, the observed activities are similar across studies. This is a benefit that derives from the fact that the surveyors assess the activities with their own eyes. To ensure consistent data collection, the surveyors are given criteria for activity registration. This is beneficial compared to questionnaires, in which respondents register their activities based on their own criteria. For example, if a passenger has their eyes closed, an observer cannot determine if the passenger is sleeping, thinking, or doing nothing. Therefore, the observer must register the passenger as ‘sleeping’, which is a criterion usually adopted. A careful review of the literature collected for the present meta-analysis shows that the classification of activities is similar across studies. This is likely due to the fact that the observers show little difference in their ability to identify the types of activities. On the other hand, questionnaires can include as many activities as interest the researchers, resulting in marked differences in the types of activities reported across studies.Footnote 2

Table 1 Differences between questionnaire and direct observation surveys

The second advantage of direct observation is it ensures that an activity actually occurred, and usually, only one activity is recorded per passenger. This is not always true in questionnaires since respondents are asked to report their activities retrospectively, and they do not always remember what they did at a particular moment (e.g., 9:01:01 this morning). To rectify this problem with human memory, Clayton et al. (2017) asked bus passengers about their current activities on the bus. However, this method is rarely adopted. Therefore, questionnaires can only ask respondents about their activities during the entire journey, with respondents ticking off either all of the activities that were conducted at any time during the journey or one activity that was conducted most of the time. Allowing multiple choices, however, results in the problem of aggregation.

This is the third advantage of direct observation: it is free from the problem of aggregation since activities are similar across studies, and only one activity is registered per respondent. This is very different from questionnaires, which include all of the activities that interest the researchers and allow respondents to report multiple activities. Consistent activity classification is essential for inter-study comparison, but this is not easy to do. For example, one study may classify reading as ‘reading (newspapers)’ and ‘reading (books)’, while another study classifies it as ‘reading’. The broader classification of ‘reading’ is usually adopted, but when multiple responses are allowed, adding together the numbers for ‘reading (newspapers)’ and ‘reading (books)’ results in double-counting.

In addition to these three advantages, direct observation also has the potential advantage of better representativeness of the population of interest. Random sampling, in which everyone in the population has an equal chance of being selected as a respondent, is generally thought to be ideal. For the present analysis, however, this representativeness may not be suitable. Respondents differ in the frequency of their train use and the length of their journeys, and the activities undertaken by these respondents likely differ from those observed on randomly selected trains at randomly selected moments. In direct observation, surveyors are asked to report on the activities of the passengers around them, and the passengers around them are the respondents. The representativeness of direct observation varies from survey to survey. In some studies, volunteers are recruited to report their observations of passengers on trains that they use daily. Other studies focus on specific railway lines and make the same number of direct observations on each line. While representativeness is not a main topic of the present analysis, when achieved through direct observation, representativeness differs across survey protocols, but it is likely to be similar to observations on randomly selected trains at randomly selected moments.

Longitudinal studies

Travel-based multitasking by rail passengers has a long history of interest, but historical trends in multitasking have rarely been studied. Lyons et al. (2013) examined data from the British National Rail Passenger Survey between the years 2004 and 2010, while Lyons et al. (2016) expanded the analysis to include data from 2014, a span of 10 years. The two studies compared how passengers used in-vehicle time for each trip purpose. In Japan, Sanko (2023) investigated travel-based multitasking using data from 2001, 2006, 2011, and 2016, covering a span of 15 years. The data came from Questionnaire B of the Japanese Surveys on Time Use and Leisure Activities. Questionnaire B asked for primary and secondary activities in 15-minute time slots, with travel-based multitasking identified by time slots where the primary and secondary activities involved travelling and other activities, respectively. However, the transportation modes used were unknown.

In the extensive review of the literature for the present analysis, some studies were found that collected data via direct observations from various time points. For example, a research group led by Itoga conducted direct observations every year between 2004 and 2013 (Osada et al. 2014). Kataoka et al. (2002) compared the activities of rail passengers in Tokyo in 1987, 1997, and 1999. The Shuppan Forum published a special article focusing on reading by rail passengers. Direct observation was carried out at five points in time: 1986, 1988, 1997, 2004, and 2007 (Shuppan Forum 1986, 1988, 1997, 2004, 2007). These studies are useful for drawing more concrete conclusions and are included in the meta-analysis below.

YouTube videos as retrospective direct observation

While the activities of passengers in the past cannot be directly observed, activities can be observed retrospectively by utilising archived photos and videos. Such photos and videos would be a snapshot or series of snapshots taken when a surveyor was hypothetically on the train at the time. However, photos and videos are not usually taken to record the daily activities of passengers. As a result, the authors found very few photos and videos in books, newspapers, videos, and their archives that met the aims of the present analysis. YouTube videos, however, did meet the aims.Footnote 3 Established in 2005, YouTube has become popular worldwide. According to a survey of 1500 people aged 13–69 by MIAT (2023), YouTube is the most popular video-sharing service in any age group in Japan. In 2022, more than 90% of people aged 13–49 watched YouTube videos, while 87% and 68% of people 50–59 years old and 60–69 years old, respectively, did so. Although YouTube videos may not be representative, they were a valuable source of data for the present analysis.

Some transport researchers have used archival data as a data source. Gruber et al. (2018) used photographic archives to understand changes in street use and street design in Vienna, Austria, between 1860 and 1949. YouTube videos have been increasingly used as a source of information in research (Mostafa et al. 2023). In a transportation study, Seeley et al. (2019) used 65 YouTube videos of street racing, stunt driving, and ghost driving and examined the content of the videos. However, using YouTube videos as a retrospective data source is less common due to the relatively short history of the service. While videos taken earlier than 2005 are available, they are relatively rare.

Some historical events relating to onboard activities in Japanese railways

The first railway in Japan opened in 1872 between Shimbashi and Yokohama, a distance of 29 km. The railway network subsequently expanded rapidly. This section describes some historical events related to passenger activities on trains.

Reading has been one of the most popular activities on trains. Nagamine (2004), whose main interest is Japan’s reading culture between 1897 and 1906, describes the reading behaviour of railway passengers at the beginning of railway service. Reading was already a popular activity in railway carriages. However, reading habits differed from person to person. It was common to read aloud, which annoyed other passengers and caused conflicts. The change in Japanese reading habits to silent reading is partly a consequence of these conflicts. Businesses that facilitated reading on board trains were established. In 1897, rental bookshops were opened at major stations, and passengers could borrow books and return them to any of the bookshops operated by the same company. In 1911, a railway library (a bookshelf) was installed in the first- and second-class carriages of express trains. Reading materials have traditionally been paper-based. In recent years, reading materials have taken many different forms, such as portable electronic devices.

Activities other than reading also became popular as a result of new technologies. Many of these technologies created new types of conflicts among passengers. For example, portable audio players were introduced around 1980, which made it possible to listen to music on trains. However, sound leaking from earphones annoyed other passengers. Mobile phones, with their loud ring tones and users’ loud voices during calls, also caused conflicts. Train operators have struggled to develop rules of social conduct. In September 2003, 17 train operators in the Tokyo area agreed to publish the same request to passengers regarding mobile phone etiquette: switch off the phone near priority seats and in other places use the silent mode and not talk (East Japan Railway Company 2003). Since October 2015, switching off while near priority seats has only been requested when trains are crowded (East Japan Railway Company 2015). Some activities that were allowed in the past are now prohibited. East Japan Railway Company’s local trains became completely smoke-free in 1997 (East Japan Railway Company 2023). Since then, there has been a growing trend to prohibit smoking in train carriages and stations. Controversial in-vehicle behaviours have been revealed in the results of surveys asking about annoying behaviours in trains and in letters to newspapers. Examples include putting on makeup and eating.

Methodology

The literature review was conducted and reported following the PRISMA statement (Page et al. 2021). The review was not registered in any public database, since the protocol had not yet been developed at the start of the present analysis. Ethical approval was not required, as the data used in the study were publicly available.

Search strategy and eligibility criteria

A comprehensive literature search was conducted using the CiNii, J-STAGE, and Web of Science databases. This was supplemented by a manual search using Google Scholar. An example of the search strategy can be found in Supplementary Table 1. The eligibility of an article was assessed based on the title and abstract; if eligibility could not be determined from the title and abstract, the full text was used for assessment. Also conducted was a snowball search of studies listed in the references of articles that had passed the eligibility check. The search for YouTube videos was entirely manual.

The meta-analysis included studies that met all of the following criteria: (1) subjects were rail passengers on Japanese railways; (2) passengers in reserved seats were excluded; (3) activity counts were included; (4) at least ten passengers were surveyed; (5) studies were conducted before and during 2019 (before the spread of COVID-19); and (6) studies were in English or Japanese.

The literature searches and counts for academic and non-academic journals were carried out by one author (SY) in 2022 and early 2023, while the data searches and counts for YouTube videos were carried out in 2019 and 2020 by a person mentioned in the Acknowledgements section under the supervision of one author (NS). One author (NS) reviewed both datasets and made the final decision in August and September 2023.

Data extraction

The number of times each activity was conducted and the year of data collection were extracted. Also noted were the respondents’ socio-economic attributes, which were used in the descriptive analysis in the “Study characteristics” Section, but not in the meta-analysis. Classifications of activities differ across studies, and even independent studies by the same research group may use different classifications based on their previous experience or on activities that have emerged due to technological developments. After careful consideration, the following five classifications were adopted: ‘(a) mobile phones’, ‘(b) sleeping’, ‘(c) reading’, ‘(d) music’, and ‘(e) other’. Some studies have finer classifications, but they have been grouped into the above five classifications. Note that ‘(c) reading’ is further divided into subclasses: ‘(c1) newspapers’, ‘(c2) books’, and ‘(c3) magazines and comics’.

The procedure used for extracting data from a YouTube video is described. In each video, the activities of each passenger were counted. The count was done for a single scene per video to avoid double-counting. The choice of scene was based on the ease of counting, i.e., a scene in which a smaller number of standing passengers and a larger number of seated passengers were observed. The activities were identified in the order of passengers closest to the camera to passengers farthest from the camera; identification was stopped when a passenger whose activity could not be identified appeared. In many cases, the process of identification stopped when seated passengers were hidden from the camera by other seated passengers. Passengers in more distant positions were not included, even if their activities were identifiable. As the activities of passengers in hidden positions would be easily identifiable if the onboard surveyors changed their viewpoint, the inclusion of passengers only in distant positions differs from the practice of direct observation. The content analysis here was designed to avoid arbitrary selection of passengers. The time of their recording was not considered to be the time of upload, as older videos may have been uploaded later. Rather, the time of their recording was determined from notes in the description section of the video or by clues (e.g., advertisements) recorded as part of the video.

Statistical analyses

R was used to conduct meta-analyses for percentages of travel-based multitasking (R Core Team 2023; Wang 2018). The percentages of passengers who performed activities were the effect size measure. A meta-analysis can be either fixed-effects or random-effects models. The fixed-effects model is used when groups of studies are to be combined under almost homogeneous circumstances using exactly the same methods, while the random-effects model is used when groups of studies are to be combined, each with different study qualities and methods (Borenstein et al. 2021). In this paper, the data include studies conducted by different researchers for different purposes. Therefore, the random-effects model was adopted. Logit transformation was performed, with continuity correction by adding 0.5 as zero frequencies. Each study was weighted by the inverse-variance method. The between-studies variance (τ2) was calculated using the DerSimonian and Laird method, and its confidence interval was calculated using the Jackson method. For the purpose of reference, this study presents I2 (the ratio of true heterogeneity to total observed variation) and Cochran’s Q-test (χ2 values) (a measure of weighted squared deviations).

Publication bias is not an issue in the present analysis. Activities were classified as (a)–(e), and all classifications were used in the meta-analysis. The sum of the proportions for classifications (a)–(e) is unity. Therefore, the deliberate selection of studies reporting a higher proportion for one activity leads directly to the deliberate selection of studies reporting a lower proportion for another activity. As the aims of the surveys differ across studies, it is unlikely that certain behaviours are over-reported. Publication bias should be assessed from a qualitative perspective. Barker et al. (2021) mentioned that no definition or consensus was made regarding a positive result in the proportional meta-analysis. Therefore, Egger’s test, Begg’s test, and funnel plots were not appropriate. However, for the purpose of reference, this study presents funnel plots.

Subgroup analyses were a suitable methodology for dividing all studies into subgroups, where the studies in each subgroup were homogeneous. However, the present analysis does not have such a factor, and no subgroup analyses were conducted. Meta-regression is a method for quantitatively analysing the degree to which the dependent variable can be explained as explanatory variables that are thought to create heterogeneity in individual studies (Borenstein et al. 2021). A univariate meta-regression was performed with the survey year (not the publication year but the year of data collection) as an explanatory variable. The year of the survey as the moderator is an obvious choice for examining how travel-based multitasking has changed over time. One possible concern is that the factors behind the trends over time are unknown. Although not the main aim of this study, the authors attempted to explain travel-based multitasking using other moderators that reflect the social trends of the time.

Results

Study characteristics

The PRISMA flow diagram in Fig. 1 shows how the studies were selected, with comprehensive searches carried out in the following databases: CiNii (158 hits), J-STAGE (5268 hits), and Web of Science (25 hits).

Fig. 1
figure 1

PRISMA diagram

First, all titles and abstracts were screened against the inclusion criteria. This strategy eliminated 5435 irrelevant studies. Second, the full texts of the remaining 16 studies that appeared to meet the inclusion criteria were assessed. Of these, one met the inclusion criteria and was included in the meta-analysis, while 15 were excluded for the following reasons: 11 studies did not include direct observation, three studies conducted direct observation but did not report counts, and one study (Ohmori and Harata 2008) included passengers in reserved seats. In addition, a manual search using Google Scholar identified one study. Finally, a snowball search for studies listed in references identified eight studies. Some articles include observations from multiple time points, and data from different time points were considered to be different observations. The ten articles included 21 independent studies. Where multiple independent survey results were reported in one article, each result was referred to as an independent study to avoid confusion.

YouTube videos were regarded as a single data source, so Fig. 1 mentions ‘N = 1’. Eighteen videos were identified from the years 2005, 2009, 2016, 2017, 2018, and 2019. After manually identifying activities and applying the eligibility criterion of having at least ten passengers, only data from the years 2016 and 2019, consisting of 12 videos, passed the eligibility criteria. Therefore, YouTube videos comprised two independent studies.

Table 2 summarises the studies included in the meta-analysis. The review identified 23 independent studies from 11 sources. All but one publication study reported data from the Tokyo metropolis. The exception was Harada et al. (2014), which was from the Osaka metropolis. Although Shuppan Forum (2007) did not specify the study area, the data likely was from Tokyo due to the survey’s continuity. Most of the surveys were conducted by university researchers with support from their students, whereas the Shuppan Forum used volunteers for data collection. Both standing and seated passengers were observed except for Kataoka et al. (2002), who observed only seated passengers. Research interests differed across the studies: group (1) investigated passenger activities (Kataoka et al. 2002; Harada et al. 2014; Osada et al. 2014), whereas group (2) focused on passenger reading habits (Shuppan Forum 1986, 1988, 1997, 2004, 2007; Aikyou 1984; Yada 1993). However, the studies in group (2) also reported non-reading activities, as did the studies in group (1). The YouTube group included two independent studies, and the data are from all of Japan.

Table 2 Studies included in meta-analysis

Several studies have collected passengers’ socio-economics data in order to understand the relationships between activity types and socio-economics. The data collected include gender, age group, posture (seated/standing), time of day, day of week, and railway line. Surveyors subjectively determined the gender and age group of the passengers. Some findings obtained from the Shuppan Forum are noteworthy regarding the relationship between reading materials and socio-economic status. Newspapers are more likely to be read by older males, although this tendency has weakened in recent years. Books are more likely to be read by females, while magazines are more likely to be read by younger males. In terms of time of day, newspapers usually are read in the morning, while books and magazines are read in the evening. Electronic devices are used more often at night. In the case of weekly magazines, a relationship was noted between reading material (media) and the day of the week of publication.

Meta-analysis

Table 3 presents the number of counts for each activity included in each independent study used for the meta-analysis. The published studies were sorted by the number of independent studies (largest number to smallest number), followed by the YouTube videos. As mentioned in the “Questionnaires versus direct observation” Section, only one activity per passenger usually is recorded. Not all studies mention the protocol for selecting a single activity when surveyors must choose from more than one activity; some studies explicitly stated that surveyors should only select a single activity that appears to have more weight. Some studies allowed surveyors to report more than one activity per passenger. However, this is a minor concern as, at most, only 12% of passengers had more than one activity. Some studies included typographical errors in the figures due to unknown reasons. However, the errors were minor, and the figures are used as is. The dataset has 11 sources (ten articles and one YouTube) with 23 independent studies from 332,355 passengers. Since the five classifications were not always examined in each independent study, the number of independent studies included in the meta-analysis differs for each activity classification. Activities included in ‘(e) other’ may not be directly comparable since some independent studies record parts of (a)–(d). Analysis for ‘(e) other’ is limited to independent studies that have all four (a)–(d) classes. However, this does not apply to independent studies without ‘(a) mobile phones’ in Shuppan Forum (1988) when data is collected from a time before mobile phones became widespread. The number of included independent studies ranged from 17 for ‘(e) other’ to 23 for ‘(c) reading’. The activities defined in each independent study were grouped as outlined in Supplementary Table 2.

Table 3 Datasets–activity counts

Activity types examined were: ‘(a) mobile phones’, ‘(b) sleeping’, ‘(c) reading’, ‘(d) music’, and ‘(e) other’. ‘(c) reading’ was further classified into ‘(c1) newspapers’, ‘(c2) books’, and ‘(c3) magazines and comics’. Supplementary Fig. 1 presents forest plots. The pooled estimates and 95% CI were: 15.22% (95% CI 11.86–19.32%) for ‘(a) mobile phones’, 14.20% (95% CI 12.83–15.68%) for ‘(b) sleeping’, 17.70% (95% CI 14.88–20.91%) for ‘(c) reading’, 6.97% (95% CI 5.71–8.48%) for ‘(d) music’, 45.45% (95% CI 41.63–49.32%) for ‘(e) other’, 4.01% (95% CI 2.97–5.41%) for ‘(c1) newspapers’, 8.99% (95% CI 8.07–10.00%) for ‘(c2) books’, and 3.82% (95% CI 2.88–5.04%) for ‘(c3) magazines and comics’. In all eight classifications, I2 (the ratio of true heterogeneity to total observed variation) was greater than or equal to 99%, and Cochran’s Q-test (χ2 values) (a measure of weighted squared deviations) of heterogeneity demonstrated a lack of homogeneity among the studies (p < 0.0001). For the purpose of reference, the funnel plots in Supplementary Fig. 2 indicated no clear publication bias. The funnel plots include a line of summary effect and pseudo 95% confidence limits. Scatter plots were generated in Fig. 2 with a solid regression line and two dotted lines showing the corresponding 95% confidence interval bounds. The size of the bubble represents the weight of the studies. Numeric labels attached to each bubble correspond to the number (‘No.’) representing each independent study shown in Table 3. The regression results also are presented in Table 4.

Fig. 2
figure 2

Meta-regression scatter plot using data year as moderator

Note: The numbers next to the bubbles are the ‘No.’ mentioned in Table 3

Table 4 Meta-regression using data year as moderator

The percentages of passengers engaging in activities utilising ‘(a) mobile phones’ and ‘(d) music’ showed increasing trends with statistical significance. On the other hand, the percentage of ‘(c) reading’ has decreased over the years with statistical significance. The percentages of ‘(b) sleeping’ and ‘(e) other’ were stable and did not change with any reasonable level of statistical significance. Among the (c) subcategories, ‘(c1) newspapers’ and ‘(c3) magazines and comics’ showed a statistically significant decreasing trend over time. The temporal evolution of rail passengers’ travel-based multitasking has never been statistically analysed using direct-observation data, and the results presented here are unique. Estimates with respect to the signs and level of statistical significance presented in Table 4 did not change when the independent studies from YouTube were excluded. The results support the use of YouTube for the purposes of this study, as the inclusion of YouTube data did not interfere with the results. Although the small sample size of YouTube data may also have been a reason, note that the random effects model is characterised by the relatively large weights given to studies with small sample sizes (compared to fixed effects models) for analysis (Borenstein et al. 2021).

However, the meta-analysis in the present analysis does not consider the socio-economic status of passengers. This is because only a small number of independent studies (fewer than 10) reported the number of activities for each socio-economic status. The meta-analysis was performed separately for men and women since the gender attribute provided the largest number of independent studies and so was the most appropriate choice for sub-group analysis. The results were slightly different from those based on the full data sets in some parts, but the authors have chosen not to report them. This might be misleading because of the smaller number of independent studies included and the much shorter time span of the data used.

An extension

Data collected by direct observation is potentially useful for investigating social trends that influence travel-based multitasking. As the socio-economic attributes collected through direct observation are limited, questionnaire surveys and in-depth interviews are more appropriate for a deeper understanding of socio-economics. However, as mentioned in Table 1, data collected by questionnaire surveys and in-depth interviews are rarely comparable across time points, and analysis using these data collection methods must rely on data from a single time point. Exceptions are Lyons et al. (2016) and Sanko (2023), which are discussed in the “Discussion and conclusions” Section. Data from a single time point express differences among respondents, which also is important, but do not explain changes of activities over time (Kitamura2003). Since data at a single time point have limited variations in some variables, such as mobile phone penetration and the availability of internet access, such data cannot be used to analyse the impact of factors that may change over time in a society as a whole. On the other hand, direct observation data collected from multiple time points have variations in variables that change over time in a society as a whole. Methods to improve forecasts by using cross-sectional data from multiple time points—also known as the updating function model—are proposed by Sanko (2014, 2016, 2018, 2022), which express parameters as functions of social trends, such as gross domestic product per capita.

Another direction of study is building a meta-model using factors expressing the social trends of the times as moderators to forecast the future. For comparison, the analysis is repeated when the moderator variable of ‘year’ is replaced by one of the following variables: fixed broadband subscriptions (per 100 people), mobile cellular subscriptions (per 100 people), and individuals using the Internet (% of population). The estimates for the moderator in Table 4 did not change in terms of sign and level of statistical significance (five per cent). This is because the correlation coefficients are close to unity: 0.929 between year and fixed broadband subscriptions (per 100 people), 0.980 between year and mobile cellular subscriptions (per 100 people), and 0.958 between year and individuals using the Internet (% of population) using 37 data points between 1983 and 2019. As an example, mobile cellular subscriptions (per 100 people) as a moderator is presented in Supplementary Fig. 3 for bubble plots and Supplementary Table 3 for meta-regressions.

Discussion and conclusions

This was the first analysis of the long-term changes in travel-based multitasking by rail passengers in Japan. The data were collected from direct observations published in both academic and non-academic articles and from content analysis of YouTube videos. In terms of results, no other relevant studies can compare with this study. However, two studies have reported results from multiple time points. These studies are useful for the following discussion.

The first study, Lyons et al. (2016), analysed the National Rail Passengers Survey for Great Britain in 2004, 2010, and 2014. While the analysis of the rail passengers is the same as the present meta-analysis, both the location of the study in Great Britain and the questionnaire method used for data collection differ from the present meta-analysis. Respondents who were about to board trains at stations received questionnaires and were asked to report any activities and on which activity they spent the most time. Table 5 summarises the results of this study using a slightly different presentation. There is no one-to-one correspondence between the classifications used in Lyons et al. (2016) and those used in the present meta-analysis. However, if the activities in Table 5 were re-classified using the (a)–(e) classifications of the present meta-analysis (noted in front of the names of the activities), then the following interpretations can be made. ‘Working/studying (reading/writing/thinking)’ in Lyons et al. (2016) was self-reported and the form it took is unclear; it could include any of (a)–(e) as defined in this study: ‘(a) mobile phones’ when writing emails for work, ‘(b) sleeping’ when thinking about their business with their eyes closed, ‘(c) reading’ when reading business-related documents, ‘(d) music’ when studying languages using portable music devices, and ‘(e) other’ when thinking about maths problems with their eyes open. The authors decided to assign ‘working/studying (reading/writing/thinking)’ to ‘(e) other’ as there is no good reason to assign it to any of (a)–(d). If ‘working/studying (reading/writing/thinking)’ were exactly allocated to (a)–(e), then the proportions of (a)–(d) and (e) would be greater and smaller, respectively.Footnote 4 The results in Table 5 include ‘any time’ and ‘most time’, but only the latter is utilised here. The sum of the percentages in ‘most time’ is unity, which is also true for direct observations. Each activity with the (a) notation increases over time, and the sum of the activities with the (a) notation comprises 1.9%, 8.1%, and 14.4% in 2004, 2010, and 2014, respectively. This is in line with the trend of ‘(a) mobile phones’ in the present meta-analysis. The percentage was slightly higher in the present meta-analysis at around 25% in 2010. ‘(b) sleeping/snoozing’ trends lower: 3.7%, 3.5%, and 2.8% in 2004, 2010, and 2014, respectively. The percentages in the present meta-analysis were higher. The activity ‘(c) reading for leisure’ also trended lower: 37.9%, 37.0%, and 32.6% in 2004, 2010, and 2014, respectively. In the present meta-analysis, between 2004 and 2014, the percentage points in reduction were similar in ‘(c) reading’, but the percentages themselves were smaller. The activity ‘(d) listening to music/radio/podcast’ trended higher overall between 2004 and 2014, with 3.7%, 8.6%, and 6.8% in 2004, 2010, and 2014, respectively. The sum of the activities with the (e) notation trended lower between 2004 and 2014, with 52.8%, 42.7%, and 43.3% in 2004, 2010, and 2014, respectively. Regarding activities (d) and (e), not only the trend of increasing/decreasing but also the percentages are similar between Lyons et al. (2016) and the present meta-analysis. The present meta-analysis had higher percentages in ‘(a) mobile phones’ and ‘(b) sleeping’ but lower percentages in ‘(c) reading’. This is partly due to the identification of activities made by surveyors’ eyes. Some people use mobile devices for reading, which are registered as ‘(a) mobile phones’ in direct observations but ‘(c) reading’ in questionnaires. Some people close their eyes, which is registered as ‘(b) sleeping’ under direct observation but can be registered as other activities in questionnaires. Percentages of ‘(c) reading for leisure’ were similar between ‘any time’ and ‘most time’, which suggests that people who read tend to spend most of their time reading.

Table 5 Results of Lyons et al. (2016): activities conducted by passengers at any time or most of the time while on the train (% of respondents in each autumn survey for all purposes)

The discussion of similarities and differences here does not lead to clear conclusions, as it is possible that onboard activities differ between Japan and Great Britain in the first place. However, the historical changes in the onboard activities of rail passengers collected by direct observation in Japan and by questionnaires asking the most time spent in Great Britain were similar, and reasonable interpretations were possible in terms of differences in trend and percentages.

The second study uses the Japanese Surveys on Time Use and Leisure Activities in 2001, 2006, 2011, and 2016. The use of data from Japan is the same as in the present meta-analysis, while the survey method of questionnaires and the analysis of trips irrelevant to transportation modes differ from the present meta-analysis. The dataset was the same as that used in Sanko (2023). Although Sanko (2023) analysed travel-based multitasking for separate trip purposes, the present analysis re-analysed it for all trip purposes combined. As explained in the “Longitudinal studies” Section, the Time Use Surveys asked respondents to report their primary and secondary activities for each 15-minute time slot. The authors identified the time slots in which the primary activity was travel for any purpose, and calculated the percentage of secondary activities in the identified time slots. The results are presented in Table 6. Again, there was no one-to-one correspondence in the classifications between the Time Use Surveys and the present meta-analysis. However, if it were possible to classify the activities in Table 6 in the way described in front of the names of the activities, then the following considerations would be made. ‘(e) No secondary’ had an extremely large percentage, and only ‘(a or d) TV and radio’, ‘(a or d) Video, DVDs, and CDs’, and ‘(e) Other activities’ exceeded 1%. The sums of ‘(a) Telephone conversation’, ‘(a) Communication by computing and correspondence’, ‘(a or d) TV and radio’, and ‘(a or d) Video, DVDs, and CDs’ were 3.47%, 2.34%, 2.92%, and 3.30% in 2001, 2006, 2011, and 2016, respectively, showing a surprisingly stable trend. Table 6 reveals significant changes between 2011 and 2016. ‘No secondary’ activities accounted for 89.71% in 2016, a decrease of 5.34% points since 2011, while ‘(e) other activities’ accounted for 5.05% in 2016, an increase of 4.68% points since 2011. The classification of ‘(e) Other activities’ should be treated with care. ‘Computing (use of PC)’ and ‘Computer and video games’ are separate categories in 2011 and 2016, but no such categories were present in 2001 and 2006. For the sake of comparability over time, ‘Computing (use of PC)’ and ‘Computer and video games’ were merged into ‘(e) Other activities’ in Table 6. However, most of ‘(e) Other activities’ (0.37% in 2011 and 5.05% in 2016) belong to ‘Computing (use of PC)’ (0.18% in 2011 and 4.47% in 2016) and ‘Computer and video games’ (0.07% in 2011 and 0.39% in 2016), which should have been classified under (a). This explains the increasing trend of ‘(a) mobile phones’ in this study. Activities with (c) notation related to reading showed an overall decreasing trend from 0.73% in 2001 to 0.49% in 2016. The present meta-analysis found an increasing trend for activities with notations (a) and (d) and a decreasing trend for activities with notation (c). However, the percentages in the Time Use Survey were extremely small.Footnote 5 There are some interpretations: (1) a smaller percentage of travellers other than rail has secondary activitiesFootnote 6, and (2) activities carried out only a part of a 15-minute time slot may not be reported. This means that the percentages calculated from Time Use Surveys are very different from those calculated from direct observations. Note that the percentages in Table 6 were calculated on a time slot basis, which is very similar to the idea of ‘any time’ by Lyons et al. (2016). Time slots consecutively used for travelling can be grouped into a single trip, and the activity that is registered most frequently in the grouped trips can represent the activity of the journey. This is more similar to the idea of ‘most time’ in Lyons et al. (2016).

Table 6 Activities by passengers while travelling (% of respondents for all modes of transport)

Before concluding the paper, the strengths and limitations of the present analysis are noted. Six important strengths are: (1) this paper was the first to describe a meta-analysis of travel-based multitasking; (2) the use of observational data from multiple sources, including non-academic literature, covered a long period; (3) the content analysis utilising YouTube videos as a data source is novel; (4) historical changes in the proportion of passenger engagement in each activity were clearly demonstrated by moderator analysis; (5) conclusions did not change for studies without YouTube data, which supports (or at least does not deny) the use of YouTube data for this purpose; and (6) results obtained by direct observations were compared with those from questionnaires. Three limitations are noted: (1) the data from YouTube videos is not comprehensive and is small in size, and its collection was entirely manualFootnote 7; (2) most of the studies in the available literature were conducted in Tokyo; and (3) factors other than time trends are not discussed in depth.

In conclusion, this meta-analysis of travel-based multitasking of rail passengers utilised data from direct observations as well as retrospective data from YouTube. Historical changes were statistically examined, and the results showed that activities ‘(a) mobile phones’ and ‘(d) music’ have significantly increased in usage in recent years. The activity ‘(c) reading’ trended downward significantly. However, this may be due to changes in how passengers read, from reading printed matter to reading using electronic devices, the use of which can also be evaluated as ‘(a) mobile phones’. However, in any case, it is evident that the use of electronic devices is trending upward strongly, and an environment that is friendly to mobile activities in railway carriages has had a huge impact on passenger satisfaction. Railway operators may invest in facilities to improve the environment, and governments may build a policy to facilitate the investment. As mentioned in the “Introduction” Section, a decrease in the value of travel time savings from meaningful use of in-vehicle time can impact the project evaluation. Without considering the proportions of travel-based multitasking on trains, any discussion of a project’s evaluation would surely fail. Such a discussion would require knowing 1) the historical trends in the proportions of travel-based multitasking, 2) the value of travel time savings used for project evaluation, and 3) the project evaluations when different values of travel time are utilised. Even if the general trends of the travel-based multitasking are obvious, such as the increase in mobile phone use, the actual figures should be examined, as this study does.