Imputation Strategies for Missing Data in Environmental Time Series for an Unlucky Situation

  • Daria Mendola
Part of the Studies in Classification, Data Analysis, and Knowledge Organization book series (STUDIES CLASS)


After a detailed review of the main specific solutions for treatment of missing data in environmental time series, this paper deals with the unlucky situation in which, in an hourly series, missing data immediately follow an absolutely anomalous period, for which we do not have any similar period to use for imputation. A tentative multivariate and multiple imputation is put forward and evaluated; it is based on the possibility, typical of environmental time series, to resort to correlations or physical laws that characterize relationships between air pollutants.


Multiple Imputation ARMA Model Imputation Procedure Imputation Strategy Anomalous Period 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. GOMEZ, V. and MARAVALL, A. (1994): Estimation, Prediction, and Interpolation for Nonstationary Series with the Kalman Filter. JASA, 89,426, 611–624.MathSciNetGoogle Scholar
  2. HOPKE, P.K., LIU, C, and RUBIN, D.B. (2001): Multiple Imputation for Multivariate Data with Missing and Below-Treshold Measurements: Time-Series Concentrations of Pollutants in the Artic. Biometrics, 57, 22–33.MathSciNetCrossRefGoogle Scholar
  3. JONES, R.H. (1980): Maximum Likelihood Fitting of ARMA Models to Time Series With Missing Observations. Technomtrics, 22,3, 389–395.MATHCrossRefGoogle Scholar
  4. LITTLE, R.J. and RUBIN, D.B. (1987): Statistical analysis with Missing Data. Wiley, New York.Google Scholar
  5. LOVISON, G. and MENDOLA (2003): Car-free Sundays and Air Pollution. Conference Book of Abstract of The ISI International Conference on Environmental Statistics and Health, Santiago de Compostela (Spain) July 16–18, 2003.Google Scholar
  6. MENDOLA, D. (2002): Road traffic restricions and air pollution in an urban area. A case study in Palermo. Working Paper GRASPA, n.15, Scholar
  7. MENDOLA, D. and LOVISON, G. (2002): Are car-free days effective. A case study in Palermo using Dynamic Linear Models. Invited Paper in TIES2002, The Annual Conference of The International Environmetric Society, Conference book of Abstracts, Genova, June 18–22, 2002.Google Scholar
  8. MENDOLA, D. and LOVISON, G. (2003): Multivariate Monitoring of Air Pollutants and of Effects of Meterological Conditions. Proceedings of SIS2003, Annual meeting of Italian Statistical Society.Google Scholar
  9. RUBIN, D.B. (1987): Multiple Imputation for Nonresponse in Surveys. Wiley, New York.Google Scholar
  10. RUBIN, D.B. (1996): Multiple Imputation After 18+ Years. JASA, 91,434, 473–489.MATHGoogle Scholar
  11. SHUMWAY, R.H. and STOFFER, D.S. (1982): An approach to time series smooth-ing and forecating using EM algorithm. J. Time Series Anal., 3, 253–264.Google Scholar
  12. SHUMWAY, R.H. and STOFFER, D.S. (2000): Time Series Analysis and Its Applications, Springer, New York.Google Scholar
  13. STOFFER, D.S. (1986): Estimation and Identification of Space-Time ARMAX Models in the Presence of Missing Data. JASA, 81,395, 762–772.MATHMathSciNetGoogle Scholar
  14. VAN BUUREN, S. and OUDSHOORN, C.G.M. (1999): Flexible multivariate imputation by MICE TNO report PG/VGZ/99.054, TNO Prevention and Health, Leiden.Google Scholar
  15. VAN BUUREN, S. and OUDSHOORN, C.G.M. (2000): Multivariate Imputation by Chained Equations,-MICE V1.0 User’s manual. TNO report PG/VGZ/00.038, TNO Prevention and Health, Leiden.Google Scholar

Copyright information

© Springer-Verlag Berlin · Heidelberg 2005

Authors and Affiliations

  • Daria Mendola
    • 1
  1. 1.Dipartimento di Scienze Statistiche e Matematiche “Silvio Vianelli”Universitá degli Studi di PalermoPalermoItaly

Personalised recommendations