Abstract
Physical activity is important for people’s health. The physical activity intervention program reported here includes daily wearing of an activity monitor to provide people with insight into their activity behavior. The activity monitor consists of a triaxial accelerometer, where measured accelerations are transformed to a physical activity level (PAL). The PAL data quantifies the level of the daily physical activity and reflects the daily energy expenditure of the wearer. In the program, coaches provide e-mail based intervention to motivate participants to increase their activity step-by-step within 12 weeks. However, a significant portion of participants (∼41%) failed to complete the program. This paper examines methods to predict participants who are at risk of dropping out of the program based on a classification task. This allows for a timely delivery of tailored interventions and motivating triggers to prevent stopping of the program. In particular, this paper proposes to combine the features extracted from participants’ personal information, their behaviors during the use of the device, the observed PAL data and the features extracted from the process of predicting future PAL data to classify dropouts and non-dropouts every week. Experiment results show that a k-nearest-neighbor classifier achieved a dropout and a non-dropout prediction accuracy of 66.4 ± 13.8% and 74.1 ± 7.3%, respectively.
Similar content being viewed by others
References
Penedo FJ, Dahn JR. Exercise and well-being: review of mental and physical health benefits associated with physical activity. Curr Opin Psychiatry 2005;18(2):189–93.
Pate RR, Pratt M, Blair SN, Haskell WL, Macera CA, Bouchard C, Buchner D, Ettinger W, Heath GW, King AC. Physical activity and public health: A recommendation from the centers for disease control and prevention and the American college of sports medicine. J Am Med Assoc 1995;273(5):402–7.
Fox KR. The influence of physical activity on mental well-being. Public Health Nutr 1999;2(3a):411–8.
Driver HS, Taylor SR. Exercise and sleep. Sleep Med Rev 2000;4(4):387–402.
Ware LJ, Hurling R, Bataveljic O, Fairley BW, Hurst TL, Murray P, Rennie KL, Tomkins CE, Finn A, Cobain MR, Pearson DA, Foreyt JP. Rates and determinants of uptake and use of an internet physical and weight management program in office and manufacturing work sites in England: cohort study. J Med Internet Res 2008;10(4):e56.
Goris AHC, Holmes R. The effect of a lifestyle activity intervention program on improving physical activity behavior of employees. In Proc. 3rd Int Conf Persuasive Technol (PERSUASIVE), Oulu, Finland. 2008.
Lacroix J, Saini P, Holmes R. The relationship between goal difficulty and performance in the context of a physical activity intervention program. In Proc 10th Conf MobileHCI, Amsterdam, Netherlands. 2008;415–8.
Plasqui G, Joosen AM, Kester AD, Goris AHC, Westerterp KR. Measuring free-living energy expenditure and physical activity with triaxial accelerometry. Obes Res 2005;13(8):1363–9.
Hurling R, Catt M, Boni MD, Fairley BW, Hurst T, Murray P, Richardson A, Sodhi JS. Using internet and mobile phone technology to deliver an automated physical activity program. Randomized controlled trial. J Med Internet Res 2008;9(2):e7.
Clarkson BP. Life patterns: structure from wearable sensor. Ph.D Thesis: MIT Media Lab;2002.
Long X, Yin B, Aarts RM. Single-accelerometer-based daily physical activity classification. In Proc 31st Ann Int Conf IEEE EMBS, Minneapolis, MN, 2009;6107–10.
Bouten C, Westerterp K, Verduin M, Janssen J. Assessment of energy expenditure for physical activity using a triaxial accelerometer. Med Sci Sports Exerc 2003;26(12):1516–23.
Bonomi AG, Plasqui G, Goris AHC, Westerterp KR. Estimation of free-living energy expenditure using a novel activity monitor designed to minimize obtrusiveness. Obes 2010;18(9):1845–51.
Leenders NY, Sherman WM, Nagaraja HN, Kien CL. Evaluation of methods to assess physical activity in free-living conditions. Med Sci Sports Exerc 2001;33(7):1233–40.
Marcus BH, Lewis BA, Williams DM, Dunsiger S, Jakicic JM, Whiteley JA, Albrecht AE, Napolitano MA, Bock BC, Tate DF, Sciamanna CN, Parisi AF. A comparison of internet and print-based physical activity interventions. Arch Intern Med 2007;167(9):944–9.
van den Berg MH, Schoones JW, Vliet Vlieland TP. Internet-based physical activity interventions: a systematic review of the literature. J Med Internet Res 2007;9(3):e26.
Adams J White M. Are activity promotion interventions based on the transtheoretical model effective? A critical review. Br J Sports Med 2003;37(2):106–14.
Marcus BH, Bock BC, Pinto BM, Forsyth LH, Robeerts MB, Traficante RM. Efficacy of an individualized, motivationally-tailored physical activity intervention. Ann Behav Med 1998;20(3):174–80.
Segerstahl K, Oinas-Kukkonen H. Distributed user experience in persuasive technology environments. In Proc. 2nd Int Conf Persuasive Technol (PERSUASIVE), Palo Alto, CA. 2007.
Xu W, Zhang M, Sawchuk AA, Sarrafzadeh M. Robust Human activity and sensor location co-recognition via sparse signal representation. IEEE Trans Biomed Eng 2012;59(11):3169–76.
Khan AM, Lee YK, Kim TS. A triaxial accelerometer-based physical-activity recognition via augmented-signal features and a hierarchical recognizer. IEEE Trans Inf Technol Biomed 2010;14(5):1166–72.
Zhu C, Sheng W. Realtime recognition of complex human daily activities using human motion and location data. IEEE Trans Biomed Eng 2012;59(9):2422–30.
Mazilu S, Hardegger M, Zhu Z, Roggen D. Online detection of freezing of gait with smartphones and machine learning techniques. In Proc 6th Int Conf PervasiveHealth 2012:123–30.
Lustrek M, Kaluza B. Fall detection and activity recognition with machine learning. Informatica 2009;33:205–12.
Jin A, Yin B, Morren G, Duric H, Aarts RM. Performance evaluation of a tri-axial accelerometry-based respiration monitoring for ambient assisted living. In Proc. 31st Ann Int Conf IEEE EMBS, Minneapolis, MN, 2009;5677–80.
Nijsen TME, Aarts RM, Cluitmans PJM, Griep PAM. Time-frequency analysis of accelerometry data for detection of myoclonic seizures. IEEE Trans Inf Technol Biomed 2010;14(5):1197–203.
Pentland A, Liu A. Modeling and prediction of human behavior. Neur Comp 1999;11:229–42.
Yang M, Wong SCP, Coid J. The efficacy of violence prediction: A meta-analytic comparison of nine risk assessment tools. Psych Bullet 2010;136(5):740–67.
Cheng S, Tom K, Thomas L, Pecht M. Wireless sensor system for prognostics and health management. IEEE Sensors J 2010;10(4):856–62.
Long X, Pauws S, Pijl M, Lacroix J, Goris AHC, Aarts RM. Predicting daily physical activity in a lifestyle intervention program. In: Gottfried B, Aghajan H, editors. Behaviour Monitoring and Interpretation - Well-Being. The Netherlands: IOS Press; 2011, pp. 131–46.
Si XS, Hu CH, Yang JB, Zhou ZJ. A new prediction model based on belief rule base for system’s behavior prediction. IEEE Trans Fuzzy Syst 2011;19(4):636–51.
Schwabacher M, Goebel K. A survey of artificial intelligence for prognostics. In Proc AAAI Fall Sym, Arlington, VA. 2007;107–14.
Niu G, Yang BS. Dempster-Shafer regression for multi-step-ahead time-series prediction towards data-driven machinery prognosis. Mech Syst Sig Process 2009;23:740–51.
Japkowicz N, Stephen S. The class imbalance problem: A systematic study. Intell Data Anal 2002;6(5):429–49.
Ling CX, Li C. Data mining for direct marketing: problems and solutions. In Proc 4th ACM SIGKDD Int Conf Knowl Disc Data Min 1998:73–9.
Zhang J, Mani I. kNN approach to unbalanced data distributions: a case study involving information extraction. In Proc ICML Workshop Learning from Imbalanced Datasets. Washington, DC. 2003.
Bradley AP. The use of area under the ROC curve in the evaluation of machine learning algorithms. Patt Recogn 1997;30(7):1145–59.
Duda R, Hart P, Stork D. Pattern Classification, 2nd ed: Wiley;2001.
Cover T, Hart P. Nearest neighbor pattern classification. IEEE Trans Inf Theory 1967;13(1):21–7.
Hand DJ, Yu K.Idiot’s Bayes – not so stupid after all. Int Stat Review 2001;69(3):385–98.
Bhatia N. Vandana. Survey of nearest neighbor techniques. Int J Comput Sci Inf Secur 2010;8(2):302–5.
Weiss GM, Learning with rare cases and small disjuncts. In Proc 12th Int Conf Mach Learn Lake Tahoe CA. 1995;558–65.
Kubat M, Matwin S. Addressing the curse of imbalanced datasets: One-sided sampling. In Proc 14th Int Conf Mach Learn 1997:179–86.
Pazzani M, Merz C, Murphy P, Ali K, Hume T, Brunk C. Reducing Misclassification Costs. In Proc 11th Int Conf Mach Learn 1994:217–25.
Yang J, Honavar V. Feature subset selection using a genetic algorithm. IEEE Trans Intell Syst App 1998;13(2):44–9.
Bonomi AG, Plasqui G, Goris AHC, Westerterp KR. Improving assessment of daily energy expenditure by identifying types of physical activity with a single accelerometer. J Appl Physiol 2009;107(3):655–61.
Shetty PS, Henry CJ, Black AE, Prentice AM. Energy requirements of adults: an update on basal metabolic rates (BMRs) and physical activity levels (PALs). Eur J Clin Nutr 1996;50(1):S1—23.
World Health Organization. Human energy requirements. Rome: Report of a Joint FAO/WHO/UNI Expert Consultation;2011.
Clements MP, Hendry DF. Forecasting economic time series. Cambridge: Cambridge University Press;1998.
Helfenstein U. Box-Jenkins modelling in medical research. Stat Meth Med Res 1996;5(1):3–22.
Peng JY, Aston JAD. The state space models toolbox for MATLAB. J Stat soft 2011;41(6):1–26.
Box GEP, Jenkins GM. Time series analysis forecasting and control. San Francisco: Holden-Day;1976.
Bowerman BL, O’Connell RT. Forecasting and time series: an applied approach. Belmont: Duxbury Press;1993.
Harvey AC. Forecasting, structural time series models and the Kalman filter. Cambridge: Cambridge University Press;1989.
Schwarz G. Estimating the dimension of a model. Ann Stat 1978;6(2):416–64.
Koza J. Genetic programming: on the programming of computers by means of natural selection: MIT Press;1992.
Abdi H, Williams LJ. Principal component analysis. WIREs: Comp Stat 2010;2(4):433–59.
Parzen E. On estimation of a probability density function and mode. Ann Math Stat 1962;333(3):1065–76.
Hellinger E. Neue Begründung der Theorie quadratischer formen von unendlichvielen veränderlichen. J für die Reine und Angew Math 1909;136:210–71.
Provost F, Fawcett T. Analysis and visualization of classifier performance: comparison under imprecise class and cost distribution. In Proc 3rd ACM SIGKDD 1997:43–48.
van Rijsbergen CJ. Information retrieval. London: Butterworths;1979.
Xie Y, Li X, Ngai EWT, Ying W. Customer churn prediction using improved balanced random forests. Exp Syst Appl 2009;36(3):5445–9.
Joachims T. A support vector method for multivariate performance measures. In Proc 22nd Int Conf Mach Learn 2005:377–84.
King AC, Rejeski WJ, Buchner DM. Physical activity interventions targeting older adults: a critical review and recommendations. Am J Prev Med 1998;15(4):316–33.
Acknowledgements
The authors would like to thank three anonymous reviewers, and Dr. A. Bonomi, Dr. R. Haakma, and Dr. S. Jelfs from Philips Research Laboratories for their insightful comments.
Author information
Authors and Affiliations
Corresponding author
Additional information
Conflict of interests
The authors declare that they have no conflict of interest.
Rights and permissions
About this article
Cite this article
Long, X., Pijl, M., Pauws, S. et al. Towards tailored physical activity health intervention: Predicting dropout participants. Health Technol. 4, 273–287 (2014). https://doi.org/10.1007/s12553-014-0084-9
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s12553-014-0084-9