Skip to main content

Exploring the Usefulness of Functional Data Analysis for Health Surveillance

  • Conference paper
  • First Online:
Frontiers in Statistical Quality Control 13 (ISQC 2019)

Part of the book series: Frontiers in Statistical Quality Control ((FSQC))

Included in the following conference series:

Abstract

Health surveillance is the process of ongoing systematic collection, analysis, interpretation, and dissemination of health data for the purpose of preventing and controlling disease, injury, and other health problems. Health surveillance data is often recorded continuously over a selected time interval or intermittently at several discrete time points. These can often be treated as functional data, and hence functional data analysis (FDA) can be applied to model and analyze these types of health data. One objective in health surveillance is early event detection. Statistical process monitoring tools are often used for online event detecting. In this paper, we explore the usefulness of FDA for prospective health surveillance and propose two strategies for monitoring using control charts. We apply these strategies to monthly ovitrap index data. These vector data are used in Hong Kong as part of its dengue control plan.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 219.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  • Ai-Leen, G. T., & Jin Song, R. (2000). The use of GIS in ovitrap monitoring for dengue control in Singapore. WHO Regional Office for South-East Asia

    Google Scholar 

  • Althouse, B. M., Ng, Y. Y., & Cummings, D. A. (2011). Prediction of dengue incidence using search query surveillance. PLoS Neglected Tropical Diseases, 5(8), e1258.

    Article  Google Scholar 

  • Buczak, A. L., Koshute, P. T., Babin, S. M., Feighner, B. H., & Lewis, S. H. (2012). A data-driven epidemiological prediction method for dengue outbreaks using local and remote sensing data. BMC Medical Informatics and Decision Making, 12(1), 124.

    Article  Google Scholar 

  • Buczak, A. L., Baugher, B., Babin, S. M., Ramac-Thomas, L. C., Guven, E., Elbert, Y., et al. (2014). Prediction of high incidence of dengue in the Philippines. PLoS Neglected Tropical Diseases, 8(4), e2771.

    Article  Google Scholar 

  • Chen, P., Fu, X., & Goh, R. S. M. (2019). Determining epidemic threshold for dengue incidences in Singapore based on extreme value theory. Presentation at the 11th Mathematical Methods in Reliability S34

    Google Scholar 

  • Colosimo, B. M., & Pacella, M. (2010). A comparison study of control charts for statistical monitoring of functional data. International Journal of Production Research, 48(6), 1575–1601.

    Article  Google Scholar 

  • Department of Health in Hong Kong(2019). Dengue Fever. https://www.fehd.gov.hk/english/pestcontrol/dengue_fever/index.html Accessed 30 June 2019.

  • Erbas, B., Hyndman, R. J., & Gertig, D. M. (2007). Forecasting age-specific breast cancer mortality using functional data models. Statistics in Medicine, 26(2), 458–470.

    Article  MathSciNet  Google Scholar 

  • Department of Health in Hong Kong(2019). Statistics on Dengue fever. https://data.gov.hk/en-data/dataset/hk-dh-chpsebcdde-dengue-fever-cases. Accessed 30 June 2019.

  • Fricker, R. D. (2013). Introduction to statistical methods for biosurveillance: With an emphasis on syndromic surveillance. Cambridge: Cambridge University Press.

    Google Scholar 

  • Golub, G. H., Heath, M., & Wahba, G. (1979). Generalized cross-validation as a method for choosing a good ridge parameter. Technometrics, 21(2), 215–223.

    Google Scholar 

  • Goto, K., Kumarendran, B., Mettananda, S., Gunasekara, D., Fujii, Y., & Kaneko, S. (2013). Analysis of effects of meteorological factors on dengue incidence in Sri Lanka using time series data. PLoS One, 8(5), e63,717.

    Google Scholar 

  • Jackson, J. E. (2005). A user’s guide to principal components (Vol. 587). New York: Wiley.

    MATH  Google Scholar 

  • Jackson, M. L., Baer, A., Painter, I., & Duchin, J. (2007). A simulation study comparing aberration detection algorithms for syndromic surveillance. BMC Medical Informatics and Decision Making, 7(1), 6.

    Article  Google Scholar 

  • Joner, M. D, Jr., Woodall, W. H., Reynolds, M. R, Jr., & Fricker, R. D, Jr. (2008). A one-sided mewma chart for health surveillance. Quality and Reliability Engineering International, 24(5), 503–518.

    Article  Google Scholar 

  • Leng, X., & Müller, H. G. (2005). Classification using functional data analysis for temporal gene expression data. Bioinformatics, 22(1), 68–76.

    Article  Google Scholar 

  • Lin, P. S. (2018). Cluster-temporal models for disease surveillance with an application to dengue fever infection in Taiwan. Presentation at the Joint Statistical Meeting 21

    Google Scholar 

  • Qiu, P., & Xiang, D. (2014). Univariate dynamic screening system: An approach for identifying individuals with irregular longitudinal behavior. Technometrics, 56(2), 248–260.

    Article  MathSciNet  Google Scholar 

  • Ramadona, A. L., Lazuardi, L., Hii, Y. L., Holmner, Ã…., Kusnanto, H., & Rocklöv, J. (2016). Prediction of dengue outbreaks based on disease surveillance and meteorological data. PloS One, 11(3), e0152,688.

    Google Scholar 

  • Ramsay, J. O., Hooker, G., & Graves, S. (2009). Functional data analysis with R and MATLAB. New York: Springer.

    Book  Google Scholar 

  • Ramsay, J. O., & Silverman, B. W. (2005). Functional data analysis (2nd ed.). New York: Springer.

    Book  Google Scholar 

  • Ratcliffe, S. J., Leader, L. R., & Heller, G. Z. (2002). Functional data analysis with application to periodically stimulated foetal heart rate data. I: functional regression. Statistics in Medicine, 21(8), 1103–1114.

    Article  Google Scholar 

  • Shmueli, G., & Burkom, H. (2010). Statistical challenges facing early outbreak detection in biosurveillance. Technometrics, 52(1), 39–51.

    Article  MathSciNet  Google Scholar 

  • Sonesson, C., & Bock, D. (2003). A review and discussion of prospective statistical surveillance in public health. Journal of the Royal Statistical Society: Series A (Statistics in Society), 166(1), 5–21.

    Article  MathSciNet  Google Scholar 

  • Sparks, R. (2013). Challenges in designing a disease surveillance plan: What we have and what we need? IIE Transactions on Healthcare Systems Engineering, 3(3), 181–192.

    Article  Google Scholar 

  • Tatum, L. G. (1997). Robust estimation of the process standard deviation for control charts. Technometrics, 39(2), 127–141.

    Article  MathSciNet  Google Scholar 

  • Teutsch, S., & Churchill, R. (2000). Principles and practice of public health surveillance (2nd ed.). Oxford: Oxford University Press.

    Google Scholar 

  • Tsui, K. L., Chiu, W., Gierlich, P., Goldsman, D., Liu, X., & Maschek, T. (2008). A review of healthcare, public health, and syndromic surveillance. Quality Engineering, 20(4), 435–450.

    Article  Google Scholar 

  • Ullah, S., & Finch, C. F. (2013). Applications of functional data analysis: A systematic review. BMC Medical Research Methodology, 13(1).

    Google Scholar 

  • Unkel, S., Farrington, C., Garthwaite, P. H., Robertson, C., & Andrews, N. (2012). Statistical methods for the prospective detection of infectious disease outbreaks: a review. Journal of the Royal Statistical Society: Series A (Statistics in Society), 175(1), 49–82.

    Article  MathSciNet  Google Scholar 

  • Wang, J. L., Chiou, J. M., & Müller, H. G. (2015). Review of functional data analysis. arXiv:150705135.

  • Woodall, W. H. (2006). The use of control charts in health-care and public-health surveillance. Journal of Quality Technology, 38(2), 89–104.

    Article  Google Scholar 

  • World Health Organization. (2019). Dengue and severe dengue. https://www.who.int/news-room/fact-sheets/detail/dengue-and-severe-dengue Accessed 30 June 2019.

  • Yuan, M., Boston-Fisher, N., Luo, Y., Verma, A., & Buckeridge, D. L. (2019). A systematic review of aberration detection algorithms used in public health surveillance. Journal of Biomedical Informatics 103181

    Google Scholar 

  • Yuan, H. Y., Wen, T. H., Kung, Y. H., Tsou, H. H., Chen, C. H., Chen, L. W., et al. (2019). Prediction of annual dengue incidence by hydro-climatic extremes for southern Taiwan. International Journal of Biometeorology, 63(2), 259–268.

    Article  Google Scholar 

Download references

Acknowledgements

We would like to express our sincere thanks to Dr. Rob Goedhart from the University of Amsterdam for his expert advice. We would also like to thank all the participants in the ISQC workshop for their constructive comments on our work. And the reviewer of this paper for his/her constructive comments. The work of Inez M. Zwetsloot was partially supported by a grant from the City University of Hong Kong (project 7005090).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Inez Maria Zwetsloot .

Editor information

Editors and Affiliations

Appendices

Appendix A: Choosing Smooth Parameters

The generalized cross-validation method, developed by Craven and Wahba (2013), is used to determine the smoothing parameter in (2). The GCV for one curve is defined as

$$\begin{aligned} GCV(\lambda )=\Big (\frac{n}{n-df(\lambda )}\Big )\Big (\frac{SSE}{n-df(\lambda )}\Big ) \,, \end{aligned}$$
(14)

where SSE is the sum of squared error calculated by

$$\begin{aligned} SSE(x)=\sum _{j}^{n}[y_{j}-x(t_{j})]^{2} \end{aligned}$$

and where \(df(\lambda )\) is the degree of freedom of the fit defined by \(\lambda \) and computed as \(df(\lambda )=trace[\boldsymbol{H}(\lambda )]\). Where \(\boldsymbol{H}(\lambda )=\varPhi (\varPhi ^{T}\varPhi +\lambda \boldsymbol{R})^{-1}\varPhi ^{T}\), and \(\boldsymbol{R}=\int L\phi (t)L\phi '(t) dt\) is the symmetric roughness penalty matrix with order K.

In the case study, we predefined \(K=7\), and choose \(\lambda \) subject to it minimized the GCV as given in (14). Figure 9 illustrates how to choose smoothing parameter \(\lambda \) given the GCV values. Based on this, we select \(\lambda =10^{-1.4}\) (as \(log(\lambda )=-1.4\)).

Fig. 9
figure 9

The value of the GCV as a function of \(\lambda \) the smoothing parameter for fitting the MOI curve

Appendix B: Verify the Signal in Q Chart

We plot the mean cure shown in Fig. 3b and the fitted MOI curve of 2013 on Fig. 10. We can see the significant differences between these two lines showed in July and August, but reviewing the principal components in Fig. 5, it seems none of three principal components can capture the shifts in 2013 perfectly.

Fig. 10
figure 10

The comparison between mean curve and fitted MOI in 2013

Rights and permissions

Reprints and permissions

Copyright information

© 2021 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Wang, Z., Zwetsloot, I.M. (2021). Exploring the Usefulness of Functional Data Analysis for Health Surveillance. In: Knoth, S., Schmid, W. (eds) Frontiers in Statistical Quality Control 13. ISQC 2019. Frontiers in Statistical Quality Control. Springer, Cham. https://doi.org/10.1007/978-3-030-67856-2_14

Download citation

Publish with us

Policies and ethics