Abstract
Health surveillance is the process of ongoing systematic collection, analysis, interpretation, and dissemination of health data for the purpose of preventing and controlling disease, injury, and other health problems. Health surveillance data is often recorded continuously over a selected time interval or intermittently at several discrete time points. These can often be treated as functional data, and hence functional data analysis (FDA) can be applied to model and analyze these types of health data. One objective in health surveillance is early event detection. Statistical process monitoring tools are often used for online event detecting. In this paper, we explore the usefulness of FDA for prospective health surveillance and propose two strategies for monitoring using control charts. We apply these strategies to monthly ovitrap index data. These vector data are used in Hong Kong as part of its dengue control plan.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Ai-Leen, G. T., & Jin Song, R. (2000). The use of GIS in ovitrap monitoring for dengue control in Singapore. WHO Regional Office for South-East Asia
Althouse, B. M., Ng, Y. Y., & Cummings, D. A. (2011). Prediction of dengue incidence using search query surveillance. PLoS Neglected Tropical Diseases, 5(8), e1258.
Buczak, A. L., Koshute, P. T., Babin, S. M., Feighner, B. H., & Lewis, S. H. (2012). A data-driven epidemiological prediction method for dengue outbreaks using local and remote sensing data. BMC Medical Informatics and Decision Making, 12(1), 124.
Buczak, A. L., Baugher, B., Babin, S. M., Ramac-Thomas, L. C., Guven, E., Elbert, Y., et al. (2014). Prediction of high incidence of dengue in the Philippines. PLoS Neglected Tropical Diseases, 8(4), e2771.
Chen, P., Fu, X., & Goh, R. S. M. (2019). Determining epidemic threshold for dengue incidences in Singapore based on extreme value theory. Presentation at the 11th Mathematical Methods in Reliability S34
Colosimo, B. M., & Pacella, M. (2010). A comparison study of control charts for statistical monitoring of functional data. International Journal of Production Research, 48(6), 1575–1601.
Department of Health in Hong Kong(2019). Dengue Fever. https://www.fehd.gov.hk/english/pestcontrol/dengue_fever/index.html Accessed 30 June 2019.
Erbas, B., Hyndman, R. J., & Gertig, D. M. (2007). Forecasting age-specific breast cancer mortality using functional data models. Statistics in Medicine, 26(2), 458–470.
Department of Health in Hong Kong(2019). Statistics on Dengue fever. https://data.gov.hk/en-data/dataset/hk-dh-chpsebcdde-dengue-fever-cases. Accessed 30 June 2019.
Fricker, R. D. (2013). Introduction to statistical methods for biosurveillance: With an emphasis on syndromic surveillance. Cambridge: Cambridge University Press.
Golub, G. H., Heath, M., & Wahba, G. (1979). Generalized cross-validation as a method for choosing a good ridge parameter. Technometrics, 21(2), 215–223.
Goto, K., Kumarendran, B., Mettananda, S., Gunasekara, D., Fujii, Y., & Kaneko, S. (2013). Analysis of effects of meteorological factors on dengue incidence in Sri Lanka using time series data. PLoS One, 8(5), e63,717.
Jackson, J. E. (2005). A user’s guide to principal components (Vol. 587). New York: Wiley.
Jackson, M. L., Baer, A., Painter, I., & Duchin, J. (2007). A simulation study comparing aberration detection algorithms for syndromic surveillance. BMC Medical Informatics and Decision Making, 7(1), 6.
Joner, M. D, Jr., Woodall, W. H., Reynolds, M. R, Jr., & Fricker, R. D, Jr. (2008). A one-sided mewma chart for health surveillance. Quality and Reliability Engineering International, 24(5), 503–518.
Leng, X., & Müller, H. G. (2005). Classification using functional data analysis for temporal gene expression data. Bioinformatics, 22(1), 68–76.
Lin, P. S. (2018). Cluster-temporal models for disease surveillance with an application to dengue fever infection in Taiwan. Presentation at the Joint Statistical Meeting 21
Qiu, P., & Xiang, D. (2014). Univariate dynamic screening system: An approach for identifying individuals with irregular longitudinal behavior. Technometrics, 56(2), 248–260.
Ramadona, A. L., Lazuardi, L., Hii, Y. L., Holmner, Å., Kusnanto, H., & Rocklöv, J. (2016). Prediction of dengue outbreaks based on disease surveillance and meteorological data. PloS One, 11(3), e0152,688.
Ramsay, J. O., Hooker, G., & Graves, S. (2009). Functional data analysis with R and MATLAB. New York: Springer.
Ramsay, J. O., & Silverman, B. W. (2005). Functional data analysis (2nd ed.). New York: Springer.
Ratcliffe, S. J., Leader, L. R., & Heller, G. Z. (2002). Functional data analysis with application to periodically stimulated foetal heart rate data. I: functional regression. Statistics in Medicine, 21(8), 1103–1114.
Shmueli, G., & Burkom, H. (2010). Statistical challenges facing early outbreak detection in biosurveillance. Technometrics, 52(1), 39–51.
Sonesson, C., & Bock, D. (2003). A review and discussion of prospective statistical surveillance in public health. Journal of the Royal Statistical Society: Series A (Statistics in Society), 166(1), 5–21.
Sparks, R. (2013). Challenges in designing a disease surveillance plan: What we have and what we need? IIE Transactions on Healthcare Systems Engineering, 3(3), 181–192.
Tatum, L. G. (1997). Robust estimation of the process standard deviation for control charts. Technometrics, 39(2), 127–141.
Teutsch, S., & Churchill, R. (2000). Principles and practice of public health surveillance (2nd ed.). Oxford: Oxford University Press.
Tsui, K. L., Chiu, W., Gierlich, P., Goldsman, D., Liu, X., & Maschek, T. (2008). A review of healthcare, public health, and syndromic surveillance. Quality Engineering, 20(4), 435–450.
Ullah, S., & Finch, C. F. (2013). Applications of functional data analysis: A systematic review. BMC Medical Research Methodology, 13(1).
Unkel, S., Farrington, C., Garthwaite, P. H., Robertson, C., & Andrews, N. (2012). Statistical methods for the prospective detection of infectious disease outbreaks: a review. Journal of the Royal Statistical Society: Series A (Statistics in Society), 175(1), 49–82.
Wang, J. L., Chiou, J. M., & Müller, H. G. (2015). Review of functional data analysis. arXiv:150705135.
Woodall, W. H. (2006). The use of control charts in health-care and public-health surveillance. Journal of Quality Technology, 38(2), 89–104.
World Health Organization. (2019). Dengue and severe dengue. https://www.who.int/news-room/fact-sheets/detail/dengue-and-severe-dengue Accessed 30 June 2019.
Yuan, M., Boston-Fisher, N., Luo, Y., Verma, A., & Buckeridge, D. L. (2019). A systematic review of aberration detection algorithms used in public health surveillance. Journal of Biomedical Informatics 103181
Yuan, H. Y., Wen, T. H., Kung, Y. H., Tsou, H. H., Chen, C. H., Chen, L. W., et al. (2019). Prediction of annual dengue incidence by hydro-climatic extremes for southern Taiwan. International Journal of Biometeorology, 63(2), 259–268.
Acknowledgements
We would like to express our sincere thanks to Dr. Rob Goedhart from the University of Amsterdam for his expert advice. We would also like to thank all the participants in the ISQC workshop for their constructive comments on our work. And the reviewer of this paper for his/her constructive comments. The work of Inez M. Zwetsloot was partially supported by a grant from the City University of Hong Kong (project 7005090).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Appendices
Appendix A: Choosing Smooth Parameters
The generalized cross-validation method, developed by Craven and Wahba (2013), is used to determine the smoothing parameter in (2). The GCV for one curve is defined as
where SSE is the sum of squared error calculated by
and where \(df(\lambda )\) is the degree of freedom of the fit defined by \(\lambda \) and computed as \(df(\lambda )=trace[\boldsymbol{H}(\lambda )]\). Where \(\boldsymbol{H}(\lambda )=\varPhi (\varPhi ^{T}\varPhi +\lambda \boldsymbol{R})^{-1}\varPhi ^{T}\), and \(\boldsymbol{R}=\int L\phi (t)L\phi '(t) dt\) is the symmetric roughness penalty matrix with order K.
In the case study, we predefined \(K=7\), and choose \(\lambda \) subject to it minimized the GCV as given in (14). Figure 9 illustrates how to choose smoothing parameter \(\lambda \) given the GCV values. Based on this, we select \(\lambda =10^{-1.4}\) (as \(log(\lambda )=-1.4\)).
Appendix B: Verify the Signal in Q Chart
We plot the mean cure shown in Fig. 3b and the fitted MOI curve of 2013 on Fig. 10. We can see the significant differences between these two lines showed in July and August, but reviewing the principal components in Fig. 5, it seems none of three principal components can capture the shifts in 2013 perfectly.
Rights and permissions
Copyright information
© 2021 Springer Nature Switzerland AG
About this paper
Cite this paper
Wang, Z., Zwetsloot, I.M. (2021). Exploring the Usefulness of Functional Data Analysis for Health Surveillance. In: Knoth, S., Schmid, W. (eds) Frontiers in Statistical Quality Control 13. ISQC 2019. Frontiers in Statistical Quality Control. Springer, Cham. https://doi.org/10.1007/978-3-030-67856-2_14
Download citation
DOI: https://doi.org/10.1007/978-3-030-67856-2_14
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-67855-5
Online ISBN: 978-3-030-67856-2
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)