1 Introduction

Public transportation has been the subject of increasing interest in recent years, chiefly due to its potential to alleviate congestion, reduce emissions and protect the environment, provide critical support during emergencies and disasters, and enhance mobility in small urban and rural areas. To increase the ridership, public transport service needs to be more market-oriented, which can help maintain consumer loyalty and improve the long-term financial performance of public transit companies [1].

Satisfaction is considered as the main driver of consumer loyalty and behavior [2]. Coffel [3] found that the satisfaction level of public transit customers has a significant influence on whether they choose public transit as their primary commute method. Declining satisfaction levels among transit users lead to significant decrease in their customer loyalty regarding using transit again or recommending transit to a friend or relative. Lai and Chen [1] revealed the vital role of customer satisfactory in understanding the behavioral intention of public transit users. The authors found that passenger behavioral intentions significantly rely on passenger satisfaction.

Customer satisfaction also reflects the performance of a transit system regarding meeting customers’ needs [4]. Customer satisfaction measurement has been translated into service quality measures in the existing literature. For example, Eboli and Mazzulla [5] developed a customer satisfaction index to evaluate transit service quality. This index enables service quality monitoring, dissatisfaction identification, and future strategy definition. Nathanail [6] proposed a multi-criteria evaluation method to provide railway operator with a quality control toolbox. Results based on the multi-criteria evaluation provide transit planners and practitioners with valuable information for effective decision-making and marketing strategies.

For the reasons discussed above, researches on user satisfaction toward public transit service allow a better understanding of their behavior and provide directions for future planning and improvement strategies. However, most of the existing work in this area has relied on the use of “customer satisfaction surveys,” where participants express their point of view about services by filling out sample surveys. Two major concerns about the questionnaire-based studies are (1) the low response rate and (2) the potential lack of comprehensiveness due to the design of the questionnaire. To overcome these limitations, we propose a text mining framework utilizing online customer reviews to investigate customer satisfaction toward public transport services. In the following sections, previous studies on public transit customer satisfaction evaluation are reviewed and the latent Dirichlet allocation (LDA) topic model is discussed. Case study results and comparison with questionnaire surveys are presented at the end of the paper.

2 Related Work

As public transit becomes a more promising mode to serve all travel purposes, dedicated efforts were made to improve the existing service from various perspectives including accessibility, pricing, comfort, etc. The dimensions of customer satisfaction addressed by previous studies are summarized and discussed in the following sections.

2.1 Fare

Researchers have found that fare price has a great influence on the ridership of public transportation. For example, Cervero and Wachs [7] found that annual U.S. transit ridership declined by about 6 %, while average fares increased by 35 % between 1984 and 1987. According to the authors, customer dissatisfaction with fare price is the main reason for ridership decline. Goodwin [8] used fare elasticity index as an indicator to study customer satisfaction in public transportation. The author confirmed the significance of fare price in transit customer satisfaction. Coffel [3] also found that “service received for the fare paid” is one of the top satisfaction discriminators between “somewhat satisfied” and “very satisfied.” Wallin [9] identified “price level” as one of the nine service attributes that are believed to impact customer satisfaction. The author stated that price becomes an important factor when the offered service is considered to be of low quality. Perone and Volinski [10] found that while a fare-free policy is appropriate for smaller transit systems, it does not have the same effect for larger transit systems in major urban areas. Eboli and Mazzulla [5] used an index of customer satisfaction to evaluate transit service quality. The weight of ticket price was estimated to be 9.12 (scale from 1 to 10), which indicates that ticket price is crucial to customer satisfaction.

2.2 Wait and Travel Time

Wait and travel time are usually considered as critical measurements of transit customer satisfaction. For example, Cervero [11] found that transit riders are more sensitive to schedule reliability than almost any other service attributes. The author also found that riders are especially sensitive to out-of-vehicle travel time. Wall and McDonald [12] reported that in North West London, when the bus service frequency was changed from every 20 min to every 10 min, the estimated demand increased of around 20 %. More recently, Friman and Fellesson [13] found that there is a significant relationship between average public transportation speed and overall user satisfaction. Moreover, a survey report by Metropolitan Transportation Authority (MTA) in New York identified “how fast the public transit gets you where you want to go” as one of the highest satisfaction attributes concerned by the subway customers [14].

2.3 Cleanliness

Cleanliness is another popular topic in public transportation satisfaction surveys. Coffel [3] identified four top satisfaction discriminators between “somewhat satisfied” and “very satisfied.” These discriminators include “cleanliness of light rail vehicle interior,” “cleanliness of light rail vehicle exterior,” “cleanliness of heavy rail vehicle interior,” and “cleanliness of stations (waiting area).” Eboli and Mazzulla [5] studied cleanliness of both interior and exterior of transit vehicles and found that the weights of exterior and interior features are estimated to be 7.85 and 9.51, respectively (scale from 1 to 10). MTA [14] found that cleanliness had received a high attention from the subway user and the demand for improvement in cleanliness was overwhelming. The research concluded that cleanliness is one of the most important service attributes that transportation companies need to improve in the public transportation service.

2.4 Customer Service

Customer service is defined as the services provided by the employees of the public transportation agencies. It includes the behavior of the driver, conductor, in station customer service employees, etc. Wallin [9] developed a conceptual model to determine the relationships among customer preferences, customer satisfaction, and customer segments. The author reported that information service such as schedule timetables and corresponding lines have a significant impact on customer satisfaction. Coffel [3] identified “courtesy of bus drivers,” “courtesy and helpfulness of station staff (waiting area),” and “courtesy of the operator/conductor” as several staff behaviors that could attract more people to use public transit service.

2.5 Accessibility

The access to public transportation was another significant component of the overall transportation system. Coffel [3] identified “ease of making transfers from the station” as an important customer satisfaction dimension. Daganzo [15] investigated the structural effect of transit system on accessibility and proposed a combination of grid and hub-and-spoke network structure to improve the overall competitiveness of transit system over driving. Woldeamanuel and Cyganski [16] used a panel binomial probit model to analyze the parametric relationship between levels of traveler satisfaction and accessibility to public transport services. The results showed that travelers who tend to make frequent trips by public transportation demonstrate a higher probability of satisfaction with accessibility. According to MTA [14], “convenience of stops” is one of the most important dimensions among all public transit satisfaction measurements.

2.6 Safety

Turner [17] found that safety has great influence on commute experience of public transportation customers. Roberts et al. [18] reported that improving the security culture of public transportation will significantly improve the customer satisfaction as well as other aspects such as efficiency and employee morale. Safety is also among the nine service attributes identified by Wallin [9] that are believed to have impacts on customer satisfaction of public transportation. Coffel [3] identified the category of “safety from crime after getting off the bus” as an important factor to improve the customer satisfaction level from “somewhat satisfied” to “very satisfied.”

2.7 Crowdedness

Many researchers found that crowdedness has considerable influence on public transit customer satisfaction. For example, Lundberg [19] found that the crowdedness condition of public transit contributes more to travelers’ stress experience than trip duration. MTA [14] identified crowdedness as the most unsatisfied service attribute of New York City Transit. More recently, researchers found that user satisfaction is lower when individuals lack space in transit vehicles and the space between transit passengers is found as one of the main qualities desired by users [2022].

2.8 Comfortability

The comfortability of public transportation is related to conditions such as seat condition, temperature in the vehicles, and smoothness of the ride. Coffel [3] identified “smoothness of ride” and “seating comfort” as top attributes that can enhance the customer satisfaction level from “somewhat satisfied” to “very satisfied.” It was also found that improving these attributes can increase loyalty and ridership among current and potential customers. MTA [14] also identified “comfort of temperature on vehicles” as an important service attribute.

2.9 Summary

The most common methods of data collection in the studies discussed above were interviews and surveys. However, these approaches are limited by the response rate and the variability and subjective nature of the response. Recently, a variety of new data sources and an expanding set of novel analysis methods open up new opportunities for studying transit user satisfaction. For example, Aranguren and Tonnelat [23] use transit users’ facial expression to study their willingness to cope with the crowdedness in the Paris Metro. In this paper, we propose studying public transit customer satisfaction by analyzing online reviews and comments, which contain words expressing user sentiment or opinions about public transit service. In this research, we downloaded user comments from public transit review website and applied an unsupervised topic model to identify sets of satisfaction dimensions. A total of 17,747 reviews and comments were collected, and the extracted dimensions were compared with the findings reported in the previous studies.

3 Methodology

In this research, the LDA topic model was employed to extract opinions from user reviews. Topic models are usually used to analyze and summarize topics from large volume of textual documents. A topic is defined as a group of words that tend to occur together frequently and a document is defined as the mixture of different topics [24]. The LDA model is a generative probabilistic approach to analyze the collections of discrete data [25]. In this research, customer satisfaction attributes are considered as topics to be obtained from the documents (review comments). The following gives a short overview of the mathematical basis of LDA.

Let M be the number of review comments, \(N_{m}\) be the number of words in the mth comments, V be the number of distinct words, and K be the number of topics. The number of topics, K, is a user-specified parameter that provides control over the level of details of the discovered topics. Also, let \(w_{m,n}\) be the nth word in the mth comment, \(z_{m,n}\) be the topic of topic of \(w_{m,n}\), \({\varvec{\theta }}_{m}\) be the topic distribution for the mth document, \({\varvec{\phi }}_{k}\) be the word distribution for the kth topic, \({\varvec{\alpha }}\) be the prior distribution for topics in a review, and \({\varvec{{\varvec{\beta }}}}\) be the prior distribution for words in a topic. The words of the review comments are assumed to be generated in the following steps.

Step 1 The word distribution of the kth topic, \(\phi _{k}\), is generated from a Dirichlet distribution with parameter \({\varvec{\beta }}\).

$$ {\varvec{\phi }}_{k}\sim Dir({\varvec{\beta }})\,{\text { for }}\,1\le k\le K, $$
(1)

where Dir represents the Dirichlet distribution.

Step 2 The topic distribution of the mth review comment, \({\varvec{\theta }}_{m}\), is generated from a Dirichlet distribution with parameter \({\varvec{\alpha }}\).

$$ {\varvec{\theta }}_{m}\sim Dir({\varvec{\alpha }})\,{\text { for }}\,1\le m\le M. $$
(2)

Step 3 The topic of the nth word in the mth review comment, \(z_{m,n}\), is generated from the \({\varvec{\theta }}_{m}\) distribution as a discrete random variable.

$$ z_{m,n}\sim Disc({\varvec{\theta }}_{m})\,{\text { for }}\,1\le m\le M\,{\text { and }}\,1\le n\le N_{m}. $$
(3)

Step 4 The nth word in the mth comment, \(w_{m,n}\), is generated from the \({\varvec{\phi }}_{z_{m,n}}\) distribution as a discrete random variable.

$$ w_{m,n}\sim Disc({\varvec{\phi }}_{z_{m,n}})\,{\text { for }}\,1\le m\le M\,{\text { and }}\,1\le n\le N_{m}. $$
(4)

Given the data generating process above, the joint probability of all the parameters is

$$ p\left( w,z,\theta ,\phi |\alpha ,\beta \right) =p\left( w|\phi ,z\right) p\left( \phi |\beta \right) p\left( z|\theta \right) p\left( \theta |\alpha \right), $$
(5)

where

\(p\left( w|\phi ,z\right) =\prod _{m=1}^{M}\prod _{n=1}^{N_{m}}p\left( w_{m,n}|\phi _{z_{m,n}}\right) \);

\(p\left( \phi |\beta \right) =\prod _{k=1}^{K}p\left( \phi _{k}|\beta \right) \);

\(p\left( z|\theta \right) =\prod _{m=1}^{M}\prod _{n=1}^{N_{m}}p\left( z_{m,n}|\theta _{m}\right) \); and

\(p\left( \theta |\alpha \right) =\prod _{m=1}^{M}p\left( \theta _{m}|\alpha \right) \).

By integrating \(\theta \) and \(\phi \) out, we have

$$ p\left( w,z|\alpha ,\beta \right) =\prod _{m=1}^{M}\left( \frac{\Gamma \left( A\right) }{\Gamma \left( A+N_{m}\right) }\prod _{k=1}^{K}\frac{\Gamma \left( \alpha _{k}+n_{m,k}\right) }{\Gamma \left( \alpha _{k}\right) }\right) \times \prod _{k=1}^{K}\left( \frac{\Gamma \left( B\right) }{\Gamma \left( B+n_{k}\right) }\prod _{v=1}^{V}\frac{\Gamma \left( \beta _{v}+n_{k}^{v}\right) }{\Gamma \left( \beta _{v}\right) }\right), $$
(6)

where

\(n_{m,k}\) represents the number of words in the mth document that are assigned to the \(k-\)th topic,

\(n_{k}^{v}\) represents the number of v word assigned to the \(k-\)th topic,

\(A=\sum _{k=1}^{K}\alpha _{k},\)

\(B=\sum _{v=1}^{V}\beta _{v},\)

and \(n_{k}\) represents the number of words assigned to the \(k-\)topic.

In order to use Gibbs sampling to implement the LDA model, we need the following conditional probability:

$$ p\left( z_{m,n}=k|z^{\left( -m,n\right) },w,\alpha ,\beta \right) =\frac{p\left( z_{m,n}=k,z^{\left( -m,n\right) },w|\alpha ,\beta \right) }{p\left( z^{\left( -m,n\right) },w|\alpha ,\beta \right) }\propto p\left( z_{m,n}=k,z^{\left( -m,n\right) },w|\alpha ,\beta \right). $$
(7)

By plugging Eq. (6) into (7) and ignoring the terms that do not involve \(z_{m,n}\), the conditional posterior of \(z_{m,n}\) becomes as follows:

$$ p\left( z_{m,n}=k|z^{\left( -m,n\right) },w,\alpha ,\beta \right)\propto {} \frac{\left( \alpha _{k}+n_{m,k}^{\left( -m,n\right) }\right) \times \left( \beta _{v}+n_{k}^{w_{m,n},\left( -m,n\right) }\right) }{\left( B+n_{k}^{\left( -m,n\right) }\right) }, $$
(8)

where

\({\varvec{z}}^{\left( -m,n\right) } \,\)represents all the topic assignments other than \(z_{m,n}\);

\(n_{m,k}^{\left( -m,n\right) } \,\)represents the number of words (excluding the nth word) in the mth review comment that have been assigned to the kth topic.

\(n_{k}^{\left( -m,n\right) }\, \)represents the number of words (excluding the nth word in the mth review comment) assigned to the kth topic;

\(n_{k}^{v,\left( -m,n\right) } \,\)represents the number of the vth word assigned to the kth topic (excluding the nth word in the mth document).

In this paper, we used the Gibbs sampling algorithm to draw random samples from the derived condition posterior distribution [Eq. (8)]. The idea behind Gibbs sampling is that we can obtain random samples from the joint posterior distribution by sequentially simulating individual parameters from the set of conditional distributions. Draws from this simulation algorithm will converge to the target posterior distribution.

4 Case Study

4.1 Data Description

In this study, the LDA topic model was applied to 17,747 review comments extracted from 1452 different public transportation agencies on the website of www.yelp.com. These comments and reviews were posted between 2005 and 2013. Table 1 shows the descriptive statistics of the collected data. As shown in Table 1, most of the comments were collected from rapid transit services and only 0.9 % was collected from semi-rapid transit. A majority of the review ratings (five being the best) were between 2 and 4 (79.2 %). More than a half (52.4 %) of the comments was posted between 2011 and 2013, only 9.7 % of the comments were posted before 2008. Figure 1 shows the word cloud of reviews from four different transit methods (rapid transit, semi-rapid transit, street transit, and others). A word cloud is a visualization of the words frequency in a given text with words of higher frequency displayed in larger size. The word cloud using the online review in Fig. 1 clearly shows four themes of transit methods.

Table 1 Descriptive statistics of collected comments
Fig. 1
figure 1

The word cloud of reviews of different transit methods

The 10 agencies/facilities with the most reviews are listed in Table 2. The Bay Area Rapid Transit (BART) received 755 reviews, which is the highest among the ten agencies/facilities. Moreover, seven of the 10 agencies/facilities are related to rapid transportation services, while two of the ten agencies/facilities are related to boat transportation service. Only one of them is related to street transportation service.

Table 2 Agencies/facilities with the most reviews

4.2 Topic Model Results

In this case study, the proposed unsupervised LDA topic model was used to summarize the top 10 customer satisfaction dimensions from the collected review comments. In the topic model, words with high associations were grouped together. The topic model results are presented in Table 3. The dimensions most frequently identified include the following:

  1. 1.

    Waiting and travel time (the 1st topic of street transit, 1st and 2nd topics of semi-rapid transit, the 1st topic of rapid transit, the 7th and 9th topics of other public transits in Table 3).

  2. 2.

    Cleanliness of the vehicle (the 4th topic of street transit, the 9th topic of semi-rapid transit, the 10th topic of rapid transit, 2nd topic of other transit in Table 3). This is consistent with the existing studies, which have demonstrated the significance of the cleanliness in public transportation customer satisfaction.

  3. 3.

    Customer service (the 7th topic of street transit, the 4th topic of semi-rapid transit, the 3rd topic of rapid transit, the 5th topic of other transits in Table 3).

  4. 4.

    Transit price (the 9th topic of street transit, the 10th topic of semi-rapid transit, the 9th topic of rapid transit, the 8th topic of other transits in Table 3).

  5. 5.

    Accessibility (the 2nd topic of street transit, the 3rd topic of semi-rapid transit, the 4th topic of rapid transit, the 4th topic of other transits in Table 3).

  6. 6.

    Crowdedness (the 8th topic of street transit, the 2nd topic of rapid transit, the 8th topic of semi-rapid transit in Table 3).

  7. 7.

    Comfortability (the 3rd topic of street transit, the 6th topic of semi-rapid transit, the 6th topic of rapid transit, the 3rd and 6th topics of other public transits in Table 3).

  8. 8.

    Safety (the 10th topic of street transit, the 7th topic of semi-rapid transit, the 8th topic of rapid transit in Table 3).

  9. 9.

    Transfer service (the 6th topic in street transit, the 8th topic for semi-rapid transit, the 5th and 7th topics for rapid transit, the 10th topic for other public transit methods in Table 3).

  10. 10.

    Aesthetics (the 5th topic of street transit, the 1st topic for other public transits). This dimension was not found in the existing studies.

Table 3 Extracted top 10 topics

5 Conclusion

Transit customer satisfaction study helps understand customers’ behavior intentions and lays foundation for toolbox development to monitor service quality, evaluate system performance, identify customers' dissatisfaction, and develop improvement strategy. In this research, we applied a topic model to analyze the needs and expectations expressed by public transit customers. The proposed model identifies the most frequent customer satisfaction dimensions, which include waiting and travel time, cleanliness, customer service, price, accessibility, crowdedness, comfortability, safety, transfer service, and aesthetics. This research serves as a pilot study to test the feasibility and reliability of using online review comments to investigate transit users' satisfaction dimensions. With the research results confirming previous work on transit users’ perception of service quality, the proposed method prove to be a reliable way to study various dimensions of customer satisfaction toward transit system. Since online review comments can be obtained with low cost and labor, transit agencies can use this method to collectively gather feedback from its users, which contrasts the expensive and low responsive survey/interview approaches. Moreover, when combining the text mining method with the traditional survey/interview approach, the joint investigation will ensure the comprehensiveness of the results. The future directions of this research include testing other text mining models and integrating data collected from both online reviews and questionnaire-based surveys. These approaches have the potential to enhance model efficiency and effectiveness and will facilitate future model selection and modification to serve the emerging needs of transit customer satisfaction analysis.