Precise estimation of connections of metro passengers from Smart Card data

Hong, Sung-Pil; Min, Yun-Hong; Park, Myoung-Ju; Kim, Kyung Min; Oh, Suk Mun

doi:10.1007/s11116-015-9617-y

Precise estimation of connections of metro passengers from Smart Card data

Published: 01 April 2015

Volume 43, pages 749–769, (2016)
Cite this article

Transportation Aims and scope Submit manuscript

Sung-Pil Hong¹,
Yun-Hong Min¹^nAff3,
Myoung-Ju Park¹^nAff4,
Kyung Min Kim¹^nAff2 &
…
Suk Mun Oh²

1580 Accesses
40 Citations
3 Altmetric
Explore all metrics

Abstract

The aim of this study is to estimate both the physical and schedule-based connections of metro passengers from their entry and exit times at the gates and the stations, a data set available from Smart Card transactions in a majority of train networks. By examining the Smart Card data, we will observe a set of transit behaviors of metro passengers, which is manifested by the time intervals that identifies the boarding, transferring, or alighting train at a station. The authenticity of the time intervals is ensured by separating a set of passengers whose trip has a unique connection that is predominantly better by all respects than any alternative connection. Since the connections of such passengers, known as reference passengers, can be readily determined and hence their gate times and stations can be used to derive reliable time intervals. To detect an unknown path of a passenger, the proposed method checks, for each alternative connection, if it admits a sequence of boarding, middle train(s), and alighting trains, whose time intervals are all consistent with the gate times and stations of the passenger, a necessary condition of a true connection. Tested on weekly 32 million trips, the proposed method detected unique connections satisfying the necessary condition, which are, therefore, most likely true physical and schedule-based connections in 92.6 and 83.4 %, respectively, of the cases.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Isolating high-priority metro and feeder bus transfers using smart card data

Article 18 June 2016

Route choice estimation in rail transit systems using smart card data: handling vehicle schedule and walking time uncertainties

Article Open access 19 July 2022

Weaving Social Networks from Smart Card Data: An On-Journey-Accompanying Approach

References

Asakura, Y., Iryo, T., Nakajima, Y., Kusakabe, T., Takagi, Y., Kashiwadani, M.: Behavioural analysis of railway passengers using smart card data. In: Proceedings of the Urban Transport, pp. 599–608. Malta (2008)
Bagchi, M., White, P.R.: The potential of public transport smart card data. Transp. Policy 12(5), 464–474 (2005)
Article Google Scholar
Bureau of Public Roads: Traffic Assignment Manual. U.S, Department of Commerce (1964)
Cox, T., Houdmont, J., Griffiths, A.: Rail passenger crowding, stress, health and safety in Britain. Transp. Res. Part A 40, 244–258 (2006)
Google Scholar
Cronbach, L.J.: Coefficient alpha and the internal structure of tests. Psychometrika 16(3), 297–334 (1951)
Article Google Scholar
De Cea, J., Fernandez, J.E.: Transit assignment for congested public tranport system: an equilibrium model. Transp. Sci. 27(2), 133–147 (1993)
Article Google Scholar
Einmahl, J.H.J., Smeets, S.G.W.R.: Ultimate 100 m world records through through extreme-value theory. Stat. Neerl. 65(1), 32–42 (2011)
Article Google Scholar
Fu, Q., Liu, R., Hess, S.: A bayesian modelling framework for individual passenger’s probabilistic route choices: a case study on the London underground. In: 93rd Transportation Research Board (TRB) Annual Meeting (2014)
Guo, Z., Wilson, N.: Transfer behavior and transfer planning in public transport systems: a case of the London underground. In: Proceedings of the 11th International Conference on Advanced Systems for Public Transport, Hong Kong (2009)
Jang, W.: Travel time and transfer analysis using transit smart card data. Transp. Res. Rec. 2144, 142–149 (2010)
Article Google Scholar
Kato, H., Kaneko, Y., Inoue, M.: Comparative analysis of transit assignment: evidence from urban railway system in the Tokyo metropolitan area. Transportation 37, 775–799 (2010)
Article Google Scholar
Ko, S.-J., Kim, K.M., Hong, S.-P.: Estimation of transfer times and alighting times of the metro passengers in Seoul metropolitan area. Working paper
Kusakabe, T., Iryo, T., Asakura, Y.: Estimation method for railway passengers’ train choice behaviour with smart card transaction data. Transportation 37, 731–749 (2010)
Article Google Scholar
Lam, W.H.K., Lo, H.K.: Traffic assignment methods. In: Hensher, D.A., Button, K.J., Haynes, K.E., Stopher, P.R. (eds.) Handbook of Transport Geography and Spatial Systems, pp. 609–625 (2004)
Lehtonen, M., Rosenberg, M., Rasanen, J., Sirkia, A.: Utilization of the smart card payment system (scps) data in public tranport planning and statistics. In: Proceedings of the 9th World Congress on Intelligent Transport Systems, Chicago, Illinois, 14–17 October 2002
Morency, C., Trépanier, M., Agard, B.: Measuring transit use variability with smart-card data. Transp. Policy 14(3), 193–203 (2007)
Article Google Scholar
Nielsen, O.A.: A stochastic transit assignment model considering differences in passengers utility functions. Transp. Res. Part B 34(5), 377–402 (2000)
Article Google Scholar
Nour, A., Casello, J.M., Hellinga, B.: Anxiety-based formulation to estimate generalized cost of transit travel time. Transp. Res. Rec. 2143, 108–116 (2010)
Article Google Scholar
Park, J.Y., Kim, D.-J., Lim, Y.: Use of smart card data to define public transit use in Seoul, South Korea. Transp. Res. Rec. 2063, 3–9 (2008)
Article Google Scholar
Pelletier, M.-P., Trèpanier, M., Morency, C.: Smart card data use in public transit: a literature review. Transp. Res. Part C 19, 557–568 (2011)
Article Google Scholar
Raveau, S., Muñoz, J.C., de Grange, L.: A topological route choice model for metro. Transp. Res. Part A 45, 138–147 (2011)
Google Scholar
Rinks, D.B.: Revenue allocation methods for integrated transit systems. Transp. Res. Part A 20(1), 39–50 (1986)
Article Google Scholar
Seaborn, C.: Application of smart card fare payment data to bus network planning in London. UK. MS thesis, Massachusetts Institute of Technology, Cambridge (2008)
Seaborn, C., Attanucci, J., Wilson, N.: Analyzing multimodal public transport journeys in London with smart card fare payment data. Transp. Res. Rec. 2121, 55–62 (2009)
Article Google Scholar
Shin, S.G., Cho, Y., Lee, C.: Integrated transit service evaluation methodologies using transportation card data (In Korean). Technical Report 2007-R-09, Seoul Development Institute (2007)
Trépanier, M., Tranchant, N., Chapleau, R.: Individual trip destination estimation in a transit smart card automated fare collection system. J. Intell. Transp. Syst. 11(1), 1–14 (2007)
Article Google Scholar
Tsamboulas, D.A., Antoniou, C.: Allocating revenues to public transit operators under an integrated fare system. Transp. Res. Rec. 1986, 29–37 (2006)
Article Google Scholar
Utsunomiya, M., Attanuchi, J., Wilson, N.H.: Potential uses of transit smart card registration and transaction data to improve transit planning. Transp. Res. Rec. 1971, 119–126 (2006)
Article Google Scholar
Weidmann, U., Orth, H., Dorbritz, R.: Development of measurement system for public transport performance. Transp. Res. Rec. 2274, 135–143 (2012)
Article Google Scholar
Zhou, F., Xu, R.-H.: Model of passenger flow assignment for urban rail transit based on entry and exit time constraints. J. Transp. Res. Board 2284, 57–61 (2012)
Article Google Scholar

Download references

Acknowledgments

This research was supported in part by Basic Science Research Program (2014R1A2A1A11049663) through the National Research Foundation of Korea (NRF), and by the BK21 Plus Program(Center for Sustainable and Innovative Industrial Systems) funded by the Ministry of Education, Korea.

Author information

Kyung Min Kim
Present address: Policy-Technology Convergence Research Division, Korea Railroad Research Institute, 360-1 Woulam-dong, Uiwang-city, Geonggi-do, 437-757, South Korea
Yun-Hong Min
Present address: Intelligence Computing Laboratory, Samsung Electronics Co. Ltd., Suwon City, South Korea
Myoung-Ju Park
Present address: Department of Industrial Engineering and Management Systems Engineering, Kyung Hee University, Yongin City, South Korea

Authors and Affiliations

Department of Industrial Engineering, Seoul National University, San 56-1 Shilim-dong, Kwanahk-gu, Seoul, 151-742, South Korea
Sung-Pil Hong, Yun-Hong Min, Myoung-Ju Park & Kyung Min Kim
Policy-Technology Convergence Research Division, Korea Railroad Research Institute, 360-1 Woulam-dong, Uiwang-city, Geonggi-do, 437-757, South Korea
Suk Mun Oh

Authors

Sung-Pil Hong
View author publications
You can also search for this author in PubMed Google Scholar
Yun-Hong Min
View author publications
You can also search for this author in PubMed Google Scholar
Myoung-Ju Park
View author publications
You can also search for this author in PubMed Google Scholar
Kyung Min Kim
View author publications
You can also search for this author in PubMed Google Scholar
Suk Mun Oh
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Kyung Min Kim.

Appendices

Appendix

Probability estimation of schedule-based connections

Suppose the current physical connection requires a single transfer, say, at Station $A$. The schedule-based connections on a physical connection can be represented by a time-expanded network as in Fig. 11.

The consistency check is initiated by finding consistent trains at both $O$ and $D$. By this assumption, there can be at most two trains, say $X_1$ and $X_2$, at $O$, whose time intervals contain the entry time, while at most one train, say $Y$, can be consistent with the exit time at $D$. If there are no such trains at either $O$ or $D$, the passenger did not use the physical connection.

If neither $X_1$ and $X_2$ can be connected to $Y$, in the sense that there is no relevant transfer reference passenger, we conclude that the passenger did not use the physical connection.

If there is only one such train, say $X_1$, whose connection to $Y$ can be verified by transfer reference passengers, then the schedule-based connection, $X_1-Y$ is confirmed as the unique connection of the passenger.

Finally, if there are two trains, say $X_1$ and $X_2$, from both of which we can find transfer reference passengers to $Y$ as in Fig. 11, we need to return both $X_1-Y$ and $X_2-Y$. It is a worst case in that the maximum number of schedule-based connections are confirmed as consistent connections.

The estimation, however, can be refined by a probability distribution over the two connections. In Fig. 11, we introduce some notations as follows:

$p$: The fraction of the boarding reference passengers from the overlap of the two time intervals that boarded train $X_1$
$1-p$: The fraction of the boarding reference passengers from the overlap of the two time intervals that boarded train not $X_1$ but $X_2$
$1-q_1$: The fraction of the transfer reference passenger from $X_1$ to $Y$
$q_2$: The fraction of the transfer reference passenger from $X_2$ to $Y$

It is not then difficult to show that

$$\begin{aligned} \begin{array}{rl} \Pr \left\{ {\text{Passenger}} {\text{ chose }}\; X_1-Y\right\} &{}= \frac{p (1-q_1)}{p (1-q_1) + (1-p) q_2},\\ \Pr \left\{ {\text{Passenger}} {\text{ chose} }\; X_2-Y\right\} &{}= \frac{(1-p) q_2}{p (1-q_1) + (1-p) q_2}. \end{array} \end{aligned}$$

(1)

Table 6 Numbers and lists of consistent schedule-based connection(s), the corresponding conditions, and the probability distributions for a single-transfer physical connection

Full size table

Table 6 summarizes the numbers and list of consistent schedule-based connection(s), the corresponding conditions, and the probability distributions. If none of the conditions from Table 6 is satisfied, no schedule-based connection can be consistent with the quadruple of our passenger and hence the physical connection is rejected.

For a physical connection that requires two transfers, there may be up to 3 schedule-based connections consistent with a quadruple if the trip is not abnormally delayed. The previous arguments can be easily extended to such a case.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Hong, SP., Min, YH., Park, MJ. et al. Precise estimation of connections of metro passengers from Smart Card data. Transportation 43, 749–769 (2016). https://doi.org/10.1007/s11116-015-9617-y

Download citation

Published: 01 April 2015
Issue Date: September 2016
DOI: https://doi.org/10.1007/s11116-015-9617-y

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Precise estimation of connections of metro passengers from Smart Card data

Abstract

Access this article

Similar content being viewed by others

Isolating high-priority metro and feeder bus transfers using smart card data

Route choice estimation in rail transit systems using smart card data: handling vehicle schedule and walking time uncertainties

Weaving Social Networks from Smart Card Data: An On-Journey-Accompanying Approach

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Appendices

Appendix

Probability estimation of schedule-based connections

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Precise estimation of connections of metro passengers from Smart Card data

Abstract

Access this article

Similar content being viewed by others

Isolating high-priority metro and feeder bus transfers using smart card data

Route choice estimation in rail transit systems using smart card data: handling vehicle schedule and walking time uncertainties

Weaving Social Networks from Smart Card Data: An On-Journey-Accompanying Approach

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Appendices

Appendix

Probability estimation of schedule-based connections

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation