Abstract
Methods for multivariate spatio-temporal data analysis usually assume fairly complete data, whereas gaps often occur in time or in space. In air quality data, long gaps may be due to instrument malfunctions; moreover, not all pollutants of interest are measured at every monitoring station of a network. Many statistical methods have been proposed in the literature for imputing short sequences of missing values, but most of them are not valid when the fraction of missing values is high. Furthermore, the commonly used methods are limited in that they exploit only the temporal, or only the spatial, correlation of the data. The objective of this paper is to provide an approach based on spatio-temporal functional principal component analysis (FPCA) that simultaneously exploits the spatial and temporal correlations of multivariate data in order to impute missing values accurately. To this end, a previously proposed methodology is applied to obtain a good reconstruction of temporal/spatial series, especially in the presence of long gap sequences, and spatial and spatio-temporal FPCA are compared.
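To make the idea of principal-component-based gap filling concrete, the following is a minimal sketch and not the method of the chapter. It swaps in a simpler, discrete technique: iterative truncated-SVD (EOF-style) imputation on a stations-by-time matrix, with no spline smoothing and no functional basis. The function name `svd_impute`, the chosen rank, the synthetic station loadings, and the position of the gap are all assumptions made for illustration.

```python
import numpy as np


def svd_impute(X, rank=2, n_iter=200, tol=1e-8):
    """Fill NaNs in X (stations x time) by iterative truncated-SVD reconstruction.

    Hypothetical helper for illustration; assumes every time point is
    observed at one or more stations so the initial fill is defined.
    """
    X = np.asarray(X, dtype=float)
    missing = np.isnan(X)
    # Initial guess: cross-station mean at each time point.
    filled = np.where(missing, np.nanmean(X, axis=0), X)
    for _ in range(n_iter):
        mean = filled.mean(axis=0)
        U, s, Vt = np.linalg.svd(filled - mean, full_matrices=False)
        # Low-rank reconstruction from the leading components.
        approx = (U[:, :rank] * s[:rank]) @ Vt[:rank] + mean
        change = np.max(np.abs(approx[missing] - filled[missing])) if missing.any() else 0.0
        # Only missing cells are updated; observed values are never altered.
        filled[missing] = approx[missing]
        if change < tol:
            break
    return filled


# Synthetic example (assumed data): 6 stations sharing two temporal patterns.
t = np.arange(100)
patterns = np.vstack([np.sin(2 * np.pi * t / 24), np.cos(2 * np.pi * t / 24)])
loadings = np.array([[1.0, 0.5], [0.5, 1.0], [-1.0, 0.8],
                     [0.9, -1.1], [-0.7, -0.9], [1.2, 0.2]])
X_true = loadings @ patterns

# One long continuous gap: station 0 is unobserved for 30 consecutive time points.
X_gappy = X_true.copy()
gap = slice(40, 70)
X_gappy[0, gap] = np.nan

X_filled = svd_impute(X_gappy, rank=2)

# Baseline for comparison: fill the gap with the mean of the other stations.
mean_fill = np.nanmean(X_gappy, axis=0)[gap]
rmse_svd = np.sqrt(np.mean((X_filled[0, gap] - X_true[0, gap]) ** 2))
rmse_mean = np.sqrt(np.mean((mean_fill - X_true[0, gap]) ** 2))
print(f"RMSE low-rank fill: {rmse_svd:.4g}, RMSE mean fill: {rmse_mean:.4g}")
```

The sketch borrows information across stations through the shared components, which is the intuition behind using spatial correlation to bridge a long temporal gap. A functional version would first represent each series in a smooth basis and would handle noise and several pollutants jointly, none of which is attempted here.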
© 2018 Springer International Publishing AG
About this paper
Cite this paper
Ruggieri, M., Plaia, A., Di Salvo, F. (2018). Comparing Spatial and Spatio-temporal FPCA to Impute Large Continuous Gaps in Space. In: Mola, F., Conversano, C., Vichi, M. (eds) Classification, (Big) Data Analysis and Statistical Learning. Studies in Classification, Data Analysis, and Knowledge Organization. Springer, Cham. https://doi.org/10.1007/978-3-319-55708-3_22
DOI: https://doi.org/10.1007/978-3-319-55708-3_22
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-55707-6
Online ISBN: 978-3-319-55708-3
eBook Packages: Mathematics and Statistics (R0)