Understanding Job-Housing Relationship from Cell Phone Data Based on Hadoop

  • Miaoyi Li
  • Nawei WuEmail author
  • Xiaoyong Tang
  • Jia Lu
Part of the Advances in Geographic Information Science book series (AGIS)


Job-housing relationship has been a significant and classic topic in urban studies for many years. Compared with the studies based on data from census and public transportation surveys, cell phone data make it possible to understand job-housing relationship in finer time and spatial scales. Therefore, cell phone data contribute to the study of job-housing relationship at least in three aspects. First, cell phone data can make the indicators of job-housing relationship closer to their real meanings. Second, it makes analysis units more flexible according to analysis needs. Third, algorithms can be used to describe people’s movement traces more effectively and to classify various groups like commuters or travelers. In this chapter, a Hadoop-based cell phone data processing platform for job-housing relationship analysis is introduced as a solution to the storage and processing of large volumes of real-time cell phone data. The platform is innovative in its big data computation framework, cell phone data processing procedures and algorithms, as well as job-housing relationship and commuting analysis application system design. The big data computation framework is based on Hadoop for the collection and the processing of mass structured, semi-structured, and nonstructured data. The cell phone data processing platform is to realize the collection, processing, storage, export, and other processing of cell phone data and to save all the intermediate and final computational results for reuse. The job-housing relationship and commuting analysis application system provides functions such as the display, query, statistics, analysis, and comparison of indicators under multiple spatiotemporal scales. Chongqing is used as an example to demonstrate how the platform works by three main steps: original data cleansing and quality evaluation, trace series analysis, and user stop recognition and analysis. Job-housing relationship from cell phone data is observed and described by job-housing distributions, OD connections among analysis units, as well as three indicators, including external employment rate, foreign employment rate, and independence index.


Cell phone data Job-housing relationship Hadoop Spark 


  1. Auld, J., Williams, C., et al. (2009). An automated GPS-based prompted recall survey with learning algorithms. Transportation Letters, 1(1), 59–79.CrossRefGoogle Scholar
  2. Bar-Gera, H., Konduri, K. C., et al. (2009). Estimating survey weights with multiple constraints using entropy optimization methods. Transportation Research Board 88th Annual Meeting (No. 09–1354).Google Scholar
  3. Bonnel, P., Le Nir, M. (1998). “The quality of survey data: Telephone versus face-to-face interviews”. Transportation, 25(2), 147–167.Google Scholar
  4. Calabrese, F., Di Lorenzo, G., et al. (2011). Estimating origin-destination flows using mobile phone location data. IEEE Pervasive Computing, 10(4), 0036–0044.CrossRefGoogle Scholar
  5. Calabrese, F., Diao, M., et al. (2013). Understanding individual mobility patterns from urban sensing data: A mobile phone trace example. Transportation Research Part C: Emerging Technologies, 26, 301–313.CrossRefGoogle Scholar
  6. Cervero, R. (1989). Jobs-housing balancing and regional mobility. Journal of the American Planning Association, 55(2), 136–150.CrossRefGoogle Scholar
  7. Chai, Y. (2013). Space–time behavior research in China: Recent development and future prospect: Space–time integration in geography and GIScience. Annals of the Association of American Geographers, 103(5), 1093–1099.CrossRefGoogle Scholar
  8. Dou, A., Kalogeraki, V., et al. (2010). Misco: A mapreduce framework for mobile systems. In Proceedings of the 3rd International Conference on Pervasive Technologies Related to Assistive Environments, 32, New York.Google Scholar
  9. Eagle, N., & Pentland, A. S. (2006). Reality mining: Sensing complex social systems. Personal and Ubiquitous Computing, 10(4), 255–268.CrossRefGoogle Scholar
  10. Gao, H. Q., & Zhai, Y. J. (2010). System design of cloud computing based on mobile learning. In Knowledge Acquisition and Modeling (KAM), 3rd International Symposium on IEEE (pp. 239–242).Google Scholar
  11. Girardin, F., Calabrese, F., et al. (2008). Digital footprinting: Uncovering tourists with user-generated content. IEEE Pervasive Computing, 7(4), 36–43.CrossRefGoogle Scholar
  12. Giuliano, G. (1991). Is jobs-housing balance a transportation issue?. Davis: University of California Transportation Center.Google Scholar
  13. Gonzalez, M. C., Hidalgo, C. A., et al. (2008). Understanding individual human mobility patterns. Nature, 453(7196), 779–782.CrossRefGoogle Scholar
  14. Horner, M. W., & Schleith, D. (2012). Analyzing temporal changes in land-use–transportation relationships: A LEHD-based approach. Applied Geography, 35(1), 491–498.CrossRefGoogle Scholar
  15. Hu, P. S., & Young, J. R. (1999). Summary of travel trends: 1995 nationwide personal transportation survey. Demographics.Google Scholar
  16. Humbetov, S. (2012). Data-intensive computing with map-reduce and hadoop. In Application of Information and Communication Technologies (AICT), 6th International Conference on IEEE (pp. 1–5).Google Scholar
  17. Jiang, S., Ferreira, J., et al. (2016). Activity-based human mobility patterns inferred from mobile phone data: A case study of Singapore, PP(99), 1–1.Google Scholar
  18. Kovachev, D., Cao, Y., et al. (2011). Mobile cloud computing: A comparison of application models. Computer Science.Google Scholar
  19. Lai, J., Chen, Y., et al. (2014). Identification method of residence and employment locations based on cellular phone data. Journal of Networks, 9(8), 2183–2188.CrossRefGoogle Scholar
  20. Lenormand, M., & Ramasco, J. J. (2016). Towards a better understanding of cities using mobility data. Built Environment, 42(3).Google Scholar
  21. Levine, J. (1998). Rethinking accessibility and jobs-housing balance. Journal of the American Planning Association, 64(2), 133–149.CrossRefGoogle Scholar
  22. Levinson, D. M. (1998). Accessibility and the journey to work. Journal of Transport Geography, 6(1), 11–21.CrossRefGoogle Scholar
  23. Levinson, D. M., & Kumar, A. (1994). The rational locator: Why travel times have remained stable. Journal of the American Planning Association, 60(3), 319–332.CrossRefGoogle Scholar
  24. Liu, Z., Zhang, Y., et al. (2009). Home-work separation in the context of institutional and spatial transformation in urban China: Evidence from Beijing household survey data. Urban Studies, 16(9), 110–117.Google Scholar
  25. Long, Y., & Thill, J. C. (2015). Combining smart card data and household travel survey to analyze jobs–housing relationships in Beijing. Computers, Environment and Urban Systems, 53, 19–35.CrossRefGoogle Scholar
  26. Louail, T., Lenormand, M., et al. (2014). From mobile phone data to the spatial structure of cities. Scientific Reports, 4, 5276.CrossRefGoogle Scholar
  27. Mallik, B. B., & Visvanathan A. (2016). Smart-phone as measuring device. 2016 IEEE First International Conference on Control, Measurement and Instrumentation (CMI) (pp. 414–418). Kolkata.Google Scholar
  28. Margolis, J. (1957). Municipal fiscal structure in a metropolitan region. The Journal of Political Economy, 225–236.Google Scholar
  29. Marinelli, E. E. (2009). Hyrax: Cloud computing on mobile devices using MapReduce (No. CMU-CS-09-164). Carnegie-Mellon University, School of Computer Science, Pittsburgh.Google Scholar
  30. Meng, X. C., Wu, J., et al. (2009). A literature review on job-housing balance studies. Urban Studies, 06, 23–28+35.Google Scholar
  31. Niu, X., Ding, L., et al. (2014). Understanding urban spatial structure of shanghai central city based on mobile phone data. Urban Planning Forum, 6, 61–67.Google Scholar
  32. Picornell, M., Ruiz, T., et al. (2015). Exploring the potential of phone call data to characterize the relationship between social network and travel behaviour. Transportation, 42, 647–668.CrossRefGoogle Scholar
  33. Qi, H., & Gani, A. (2012). Research on mobile cloud computing: Review, trend and perspectives. In Digital Information and Communication Technology and It’s Applications (DICTAP), 2012 Second International Conference on (pp. 195–202).Google Scholar
  34. Ratti, C., Frenchman, D., et al. (2006). Mobile landscapes: Using location data from cell-phones for urban analysis. Environment and Planning B: Planning and Design, 33(5), 727–748.CrossRefGoogle Scholar
  35. Reades, J., Calabrese, F., et al. (2007). Cellular census: Explorations in urban data collection. IEEE Pervasive Computing, 6(3), 30–38.CrossRefGoogle Scholar
  36. Shin, D., Aliaga, D., et al. (2015). Urban sensing: Using smartphones for transportation mode classification. Computers, Environment and Urban Systems, 53, 76–86.CrossRefGoogle Scholar
  37. Soto, V., & Frias-Martinez, E. (2011). Robust land use characterization of urban landscapes using cell phone data. In Proceedings of the 1st Workshop on Pervasive Urban Applications, in conjunction with 9th Int. Conf. Pervasive Computing. San FranciscoGoogle Scholar
  38. Sultana, S. (2002). Job/housing imbalance and commuting time in the Atlanta metropolitan area: Exploration of causes of longer commuting time. Urban Geography, 23(8), 728–749.CrossRefGoogle Scholar
  39. Sun, B., Li, N., et al. (2010). A study on the impact of job-housing balance on commuting travels: An empirical test of a traditional idea in the field of urban planning. Urban Planning Forum, 6, 55–60.Google Scholar
  40. Syed, A., Gillela, K., et al. (2013). The future revolution on big data. Futures, 2(6), 2446–2451.Google Scholar
  41. Tal, G., & Handy, S. (2010). Travel behavior of immigrants: An analysis of the 2001 national household transportation survey. Transport Policy, 17(2), 85–93.CrossRefGoogle Scholar
  42. Toole, J. L., Herrera-Yaqüe, C., et al. (2015). Coupling human mobility and social ties. Journal of the Royal Society Interface, 6(4), 266–271.Google Scholar
  43. Vandersmissen, M. H., Villeneuve, P., et al. (2003). Analyzing changes in urban form and commuting time. The Professional Geographer, 55(4), 446–463.CrossRefGoogle Scholar
  44. Witayangkurn, A., Horanont, T., et al. (2012). Performance comparisons of spatial data processing techniques for a large scale mobile phone dataset. Com.Geo (pp. 1–6). Washington, DC. Google Scholar
  45. Yang, H., Sasaki, T., et al. (1992). Estimation of origin-destination matrices from link traffic counts on congested networks. Transportation Research Part B: Methodological, 26(6), 417–434.CrossRefGoogle Scholar
  46. Zaharia, M., Chowdhury, M., et al. (2012). Fast and interactive analytics over Hadoop data with spark. USENIX Login, 37(4), 45–51.Google Scholar
  47. Zhao, P., Lv, B., et al. (2009). The effects of transport accessibility and jobs–housing balance on commuting time: Evidence from Beijing. International Planning Studies, 14(1), 65–83.CrossRefGoogle Scholar

Copyright information

© Springer International Publishing AG 2018

Authors and Affiliations

  1. 1.Joint International FZUKU Lab SPSDFuzhou UniversityFuzhou CityChina
  2. 2.Innovation Center for TechnologyTsinghua Tongheng Urban Planning and Design InstituteBeijingChina
  3. 3.Chongqing Transport Planning InstituteChongqingChina
  4. 4.Digital Intelligence System TechnologyShanghaiChina

Personalised recommendations