Skip to main content
Log in

Investigating Relationships Between Phone Use While Driving Behavior and Drivers’ Socio-demographic Characteristics: An Interpretable Machine Learning Approach

  • Research
  • Published:
Data Science for Transportation Aims and scope Submit manuscript

Abstract

Phone use while driving (PUWD) is the most common distracted driving behavior. Considering that distracted driving is a preventable cause of crashes, researchers and practitioners want to understand this behavior to offer effective interventions. This research utilizes a dataset containing massive phone use events from Android phone users in Texas from 2018 to 2020 to explore the relationships between PUWD behavior and drivers’ socio-demographic factors at the census tract level. EXtreme Gradient Boosting (XGBoost) algorithm is adopted to perform the classification task: high PUWD rate or low PUWD rate. SHapley Additive exPlanations (SHAP) algorithm is established on the classification model to investigate the possible associations between socio-demographic factors and PUWD behavior. Our analysis indicates poverty, education attainment, sex, age groups, and income levels are dominantly associated with PUWD behavior. The census tracts with a higher- or lower-income level are more likely to be classified to have a high PUWD rate, while drivers from census tracts with middle-income levels (between 50 and 120 k) drive more defensively. The impact of income on the PUWD rate is also reflected through the education attainment factor. The results also demonstrate the strong association between a younger male population and a high PUWD rate. The census tracts are less likely to be classified to have a high PUWD rate while the median age of the male population increases. These findings demonstrate the potential to help transportation agencies target regions in greater need of anti-distracted driving interventions.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11

Similar content being viewed by others

Availability of data and materials

Raw data is not available for sharing with the public based on the confidentiality agreement with the data provider.

References

  • Bingham CR (2014) Driver distraction: a perennial but preventable public health threat to adolescents. J Adolesc Health 54:S3–S5

    Article  Google Scholar 

  • Calabrese F, Diao M, Di Lorenzo G, Ferreira J Jr, Ratti C (2013) Understanding individual mobility patterns from urban sensing data: a mobile phone trace example. Transp Res Part C Emerg Technol 26:301–313

    Article  Google Scholar 

  • Cameron TA, DeShazo JR, Stiffler P (2010) Demand for health risk reductions: a cross-national comparison between the U.S. and Canada. J Risk Uncertain 41:245–273. https://doi.org/10.1007/s11166-010-9106-9

    Article  Google Scholar 

  • Carson RT, Louviere JJ, Anderson DA, Arabie P, Bunch DS, Hensher DA, Johnson RM, Kuhfeld WF, Steinberg D, Swait J, Timmermans H, Wiley JB (1994) Experimental analysis of choice. Mark Lett 5:351–367. https://doi.org/10.1007/BF00999210

    Article  Google Scholar 

  • Chen T, He T, Benesty M, Khotilovich V, Tang Y, Cho H (2015) Xgboost: extreme gradient boosting. R Package Version 04-2 1

  • Cortes C, Vapnik V (1995) Support-vector networks. Mach Learn 20:273–297

    Article  Google Scholar 

  • Côté S, Stellar JE, Willer R, Forbes RC, Martin SR Bianchi EC (2020) The psychology of entrenched privilege: high socioeconomic status individuals from affluent backgrounds are uniquely high in entitlement. Pers Soc Psychol Bull 0146167220916633

  • de Dios Ortúzar J, Willumsen LG (2011) Modelling transport. Wiley, New York

  • Elander J, West R, French D (1993) Behavioral correlates of individual differences in road-traffic crash risk: an examination of methods and findings. Psychol Bull 113:279–294. https://doi.org/10.1037/0033-2909.113.2.279

    Article  Google Scholar 

  • Freund Y, Schapire RE (1997) A decision-theoretic generalization of on-line learning and an application to boosting. J Comput Syst Sci 55:119–139

    Article  MathSciNet  Google Scholar 

  • Hallett C, Lambert A, Regan MA (2011) Cell phone conversing while driving in New Zealand: prevalence, risk perception and legislation. Accid Anal Prev 43:862–869. https://doi.org/10.1016/j.aap.2010.11.006

    Article  Google Scholar 

  • He J, Chaparro A, Nguyen B, Burge RJ, Crandall J, Chaparro B, Ni R, Cao S (2014) Texting while driving: is speech-based text entry less risky than handheld text entry? Accid Anal Prev 72:287–295. https://doi.org/10.1016/j.aap.2014.07.014

    Article  Google Scholar 

  • Iio K, Guo X, Lord D (2020) Examining driver distraction as a function of driving speed: an observational study using disruptive technology and naturalistic data. Presented at the 99th annual meeting of the transportation research board, Washington DC, United States

  • Ishak S, Osman OA, Codjoe J, Jenkins S, Karbalaieali S, Theriot M, Bakhit P, Ye M (2017) Exploring naturalistic driving data for distracted driving measures

  • Ismeik M, Al-Kaisy A (2010) Characterization of cell phone use while driving in Jordan. Transport 25:252–261. https://doi.org/10.3846/transport.2010.31

    Article  Google Scholar 

  • Jiang S, Ferreira J, Gonzalez MC (2017) Activity-based human mobility patterns inferred from mobile phone data: a case study of Singapore. IEEE Trans Big Data 3:208–219

    Article  Google Scholar 

  • Kahn CA, Cisneros V, Lotfipour S, Imani G, Chakravarthy B (2015) Distracted driving, a major preventable cause of motor vehicle collisions: “just hang up and drive.” West J Emerg Med 16:1033

    Article  Google Scholar 

  • Ke G, Meng Q, Finley T, Wang T, Chen W, Ma W, Ye Q, Liu T-Y (2017) Lightgbm: a highly efficient gradient boosting decision tree. Adv Neural Inf Process Syst 30:3146–3154

    Google Scholar 

  • Kita E, Luria G (2020) Differences between males and females in the prediction of smartphone use while driving: mindfulness and income. Accid Anal Prev 140:105514

    Article  Google Scholar 

  • Klauer SG, Dingus TA, Neale VL, Sudweeks JD, Ramsey DJ (2006) The impact of driver inattention on near-crash/crash risk: an analysis using the 100-car naturalistic driving study data

  • Kong X, Das S, Jha K, Zhang Y (2020) Understanding speeding behavior from naturalistic driving data: applying classification based association rule mining. Accid Anal Prev 144:105620

    Article  Google Scholar 

  • Kong X, Li Z, Wallis JR, Zhang Y (2023) Investigating factors influencing deck conditions of concrete bridge and steel bridge using an interpretable machine learning framework. Data Sci Transp 5:1

    Article  Google Scholar 

  • Laberge-Nadeau C, Maag U, Bellavance F, Lapierre SD, Desjardins D, Messier S, Saı̈di A (2003) Wireless telephones and the risk of road crashes. Accid Anal Prev 35:649–660

    Article  Google Scholar 

  • Lamble D, Rajalin S, Summala H (2002) Mobile phone use while driving: public opinions on restrictions. Transportation 29:223–236

    Article  Google Scholar 

  • Lee SE, Klauer SG, Olsen ECB, Simons-Morton BG, Dingus TA, Ramsey DJ, Ouimet MC (2008) Detection of road hazards by novice teen and experienced adult drivers. Transp Res Rec 2078:26–32. https://doi.org/10.3141/2078-04

    Article  Google Scholar 

  • Li W, Gkritza K, Albrecht C (2014) The culture of distracted driving: evidence from a public opinion survey in Iowa. Transp Res Part F Traffic Psychol Behav SI Traffic Saf Cult 26:337–347. https://doi.org/10.1016/j.trf.2014.01.002

  • Li Z, Kong X, Zhang Y (2023a) Exploring factors associated with crossing assertiveness of pedestrians at unsignalized intersections. Transp Res Rec 2677:182–198

    Article  Google Scholar 

  • Li Z, Wei Z, Zhang Y, Kong X, Ma C (2023b) Applying an interpretable machine learning framework to study mobility inequity in the recovery phase of COVID-19 pandemic. Travel Behav Soc 33:100621

    Article  Google Scholar 

  • Lipovac K, Đerić M, Tešić M, Andrić Z, Marić B (2017) Mobile phone use while driving-literary review. Transp Res Part F Traffic Psychol Behav 47:132–142

    Article  Google Scholar 

  • Lundberg SM, Lee S-I (2017) A unified approach to interpreting model predictions. In: Advances in neural information processing systems, pp 4765–4774

  • Ma C, Peng Y, Wu L, Guo X, Wang X, Kong X (2021) Application of machine learning techniques to predict the occurrence of distraction: affected crashes with phone-use data. Transp Res Rec. https://doi.org/10.1177/03611981211045371

    Article  Google Scholar 

  • Machado-León JL, de Oña J, de Oña R, Eboli L, Mazzulla G (2016) Socio-economic and driving experience factors affecting drivers’ perceptions of traffic crash risk. Transp Res Part F Traffic Psychol Behav 37:41–51. https://doi.org/10.1016/j.trf.2015.11.010

    Article  Google Scholar 

  • Mantouka EG, Barmpounakis EN, Vlahogianni EI (2019) Identifying driving safety profiles from smartphone data using unsupervised learning. Saf Sci 119:84–90

    Article  Google Scholar 

  • Mccartt AT, Hellinga LA, Bratiman KA (2006) Cell phones and driving: review of research. Traffic Inj Prev 7:89–106. https://doi.org/10.1080/15389580600651103

    Article  Google Scholar 

  • McEvoy SP, Stevenson MR, Woodward M (2006) Phone use and crashes while driving: a representative survey of drivers in two Australian states. Med J Aust 185:630–634

    Article  Google Scholar 

  • National Highway Traffic Safety Administration (2017) Traffic safety facts, speeding

  • National Safety Council (2021) Distracted parents—survey results

  • National Safety Council (2013) Crashes involving cell phones: challenges of collecting and reporting reliable crash data. National Safety Council Itasca, IL

  • NHTSA (2020a) Traffic safety facts—research note. Distracted driving 2018

  • NHTSA (2020b) Distracted driving 2018 (no. DOT HS 812 926), TRAFFIC SAFETY FACTS. Washington, DC

  • Niu J, Zhou Y, Wang D, Liu X (2021) Influences of gesture-based mobile phone use while driving. Transp Res Rec 03611981211026306

  • Overton TL, Rives TE, Hecht C, Shafi S, Gandhi RR (2015) Distracted driving: prevalence, problems, and prevention. Int J Inj Contr Saf Promot 22:187–192

    Article  Google Scholar 

  • Oviedo-Trespalacios O, Haque MM, King M, Demmel S (2018) Driving behaviour while self-regulating mobile phone interactions: a human-machine system approach. Accid Anal Prev 118:253–262. https://doi.org/10.1016/j.aap.2018.03.020

    Article  Google Scholar 

  • Papantoniou P, Papadimitriou E, Yannis G (2017) Review of driving performance parameters critical for distracted driving research. Transp Res Procedia 25:1796–1805

    Article  Google Scholar 

  • Parsa AB, Movahedi A, Taghipour H, Derrible S, Mohammadian AK (2020) Toward safer highways, application of XGBoost and SHAP for real-time accident detection and feature analysis. Accid Anal Prev 136:105405

    Article  Google Scholar 

  • Petroulias T (2014) Community attitudes to road safety: 2013 survey report

  • Pöysti L, Rajalin S, Summala H (2005) Factors influencing the use of cellular (mobile) phone during driving and hazards while using it. Accid Anal Prev 37:47–51. https://doi.org/10.1016/j.aap.2004.06.003

    Article  Google Scholar 

  • Python API Reference—xgboost 1.5.0-dev documentation [WWW Document] (2020). https://xgboost.readthedocs.io/en/latest/python/python_api.html. Accessed 29 Sept 2021

  • Qu Y, Lin Z, Li H, Zhang X (2019) Feature recognition of urban road traffic accidents based on GA-XGBoost in the context of big data. IEEE Access 7:170106–170115

    Article  Google Scholar 

  • Regev S, Rolison J, Feeney A, Moutari S (2017) Driver distraction is an under-reported cause of road accidents: an examination of discrepancy between police officers’ views and road accident reports. In: DDI2017 E-Proceedings collection. The fifth international conference on driver distraction and inattention

  • Reimer B, Mehler B, Coughlin JF, Roy N, Dusek JA (2011) The impact of a naturalistic hands-free cellular phone task on heart rate and simulated driving performance in two age groups. Transp Res Part F Traffic Psychol Behav 14:13–25. https://doi.org/10.1016/j.trf.2010.09.002

    Article  Google Scholar 

  • Rudisill TM, Zhu M (2017) Hand-held cell phone use while driving legislation and observed driver behavior among population sub-groups in the United States. BMC Public Health 17:437. https://doi.org/10.1186/s12889-017-4373-x

    Article  Google Scholar 

  • Schroeder P, Wilbur M, Peña R (2018) National survey on distracted driving attitudes and behaviors—2015 (no. DOT HS 812 461). National Highway Traffic Safety Administration, Washington, DC

  • Shi X, Wong YD, Li MZ-F, Palanisamy C, Chai C (2019) A feature learning approach based on XGBoost for driving assessment and risk prediction. Accid Anal Prev 129:170–179

    Article  Google Scholar 

  • Shinar D, Tractinsky N, Compton R (2005) Effects of practice, age, and task demands, on interference from a phone task while driving. Accid Anal Prev 37:315–326. https://doi.org/10.1016/j.aap.2004.09.007

    Article  Google Scholar 

  • Strayer DL, Drews FA, Crouch DJ (2006) A comparison of the cell phone driver and the drunk driver. Hum Factors 48:381–391

    Article  Google Scholar 

  • Sullman MJ, Baas PH (2004) Mobile phone use amongst New Zealand drivers. Transport Res Part F Traffic Psychol Behav 7:95–105

    Article  Google Scholar 

  • TTI (2020) TTI’s teens in the Driver Seat® launches new app to encourage safety in young drivers [WWW Document]. Tex. AM Transp. Inst. https://tti.tamu.edu/news/ttis-teens-in-the-driver-seat-launches-new-app-to-encourage-safety-in-young-drivers/. Accessed 7 Nov 2020

  • TxDOT (2020) TxDOT urbanized areas [WWW Document]. http://gis-txdot.opendata.arcgis.com/datasets/txdot-urbanized-areas. Accessed 26 July 2020

  • TxDOT (2017) Cell phone ordinances [WWW Document]. https://www.txdot.gov/driver/laws/cellphones.html. Accessed 31 July 2020

  • TxDPS (2020) How to apply for a Texas Driver License as a Teen [WWW Document]. https://www.dps.texas.gov/driverlicense/teendriver.htm. Accessed 2 Aug 2020

  • US Census Bureau (2019) American Community Survey 5-Year Data (2009-2018) [WWW Document]. U. S. Census Bur. URL https://www.census.gov/data/developers/data-sets/acs-5year.html. Accessed 27 Aug 2020

  • Vanhoof M, Reis F, Ploetz T, Smoreda Z (2018) Assessing the quality of home detection from mobile phone data for official statistics. J off Stat 34:935–960

    Article  Google Scholar 

  • Virginia Tech Transportation Institute (2009) New data from Virginia Tech Transportation Institute provides insight into cell phone use and driving distraction [WWW Document]. https://www.vtnews.vt.edu/content/vtnews_vt_edu/en/articles/2009/07/2009-571.html. Accessed 2 Aug 2020

  • Wen X, Xie Y, Wu L, Jiang L (2021) Quantifying and comparing the effects of key risk factors on various types of roadway segment crashes with LightGBM and SHAP. Accid Anal Prev 159:106261

    Article  Google Scholar 

  • Yang C, Chen M, Yuan Q (2021) The application of XGBoost and SHAP to examining the factors in freight truck-related crashes: an exploratory analysis. Accid Anal Prev 158:106153

    Article  Google Scholar 

  • Ye M, Osman OA, Ishak S (2017) Accounting for driver distraction and socioeconomic characteristics in a crash risk index: naturalistic driving study. Transp Res Rec 2659:204–211. https://doi.org/10.3141/2659-22

    Article  Google Scholar 

Download references

Acknowledgements

The authors acknowledge SAFE2SAVE, LLC for providing valuable datasets under their terms and conditions. We would also like to express our greatest appreciation to our four reviewers who have provided many valuable suggestions and insights to make this paper better and more robust.

Funding

No funding was obtained for this study.

Author information

Authors and Affiliations

Authors

Contributions

Dr. Yunlong Zhang and Dr. Xiaoqiang Kong were responsible for formulating the problem and building the analytical structure. Dr. Xiaoqiang Kong, Amy Zhang, Xiaoyu Guo, and Xiao Xiao collaborated to write the main manuscript. Zihao Li made significant contributions to the coding process.

Corresponding author

Correspondence to Yunlong Zhang.

Ethics declarations

Conflict of interest

There is no competing interest with other entities.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Kong, X., Zhang, A., Zhang, Y. et al. Investigating Relationships Between Phone Use While Driving Behavior and Drivers’ Socio-demographic Characteristics: An Interpretable Machine Learning Approach. Data Sci. Transp. 6, 8 (2024). https://doi.org/10.1007/s42421-024-00092-3

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1007/s42421-024-00092-3

Keywords

Navigation