Skip to main content
Log in

Functional ANOVA starting from discrete data: an application to air quality data

  • Published:
Environmental and Ecological Statistics Aims and scope Submit manuscript

Abstract

A nonparametric functional approach is proposed to compare the mean functions of \(k\) samples of curves. In practice, curves data are usually collected in a discrete form and hence they must be pre-processed to use purely functional techniques. However, in the context of \(k\)-sample tests, the pre-processing step can have effects in terms of power reduction. Hall and Van Keilegom (Stat Sin 17:1511–1531, 2007) proposed a methodology to minimizing these effects in the context of tests for the equality of two distribution functions. Their procedure is here extended to the case of \(k\)-sample hypothesis tests. The asymptotic validity of the procedure is established and its finite sample performance is analyzed through Monte Carlo experiments. As an illustration, the method is applied to air quality data collected from several monitoring stations placed at different geographical locations at the center of Spain.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9

Similar content being viewed by others

References

  • Abramovich F, Angelini C (2006) Testing in mixed-effects fanova models. J Stat Plan Inference 136:4326–4348

    Article  Google Scholar 

  • Abramovich F, Antoniadis A, Sapatinas T, Vidakovic B (2004) Optimal testing in functional analysis of variance models. Int J Wavelets Multiresolution Inf Process 2:323–349

    Article  Google Scholar 

  • Aneiros G, Cardot H, Estévez G, Vieu P (2004) Maximum ozone concentration forecasting by functional non-parametric approaches. Environmetrics 15:675–685

    Article  Google Scholar 

  • Angelini C, De Canditiis D, Leblanc F (2003) Wavelet regression estimation in nonparametric mixed effect models. J Multivar Anal 85:267–291

    Article  Google Scholar 

  • Besse P, Cardot H, Stephenson D (2000) Autoregressive forecasting of some functional climatic variations. Scand J Stat 27:673–687

    Article  Google Scholar 

  • Bugli C, Lambert P (2006) Functional anova with random functional effects: an application to event-related potentials modelling for electroencephalograms analysis. Stat Med 25:3718–3739

    Article  PubMed  Google Scholar 

  • Cardot H, Faivre R, Goulard M (2003) Functional approaches for predicting land use with the temporal evolution of coarse resolution remote sensing data. J Appl Stat 30:1185–1200

    Article  Google Scholar 

  • Fernández de Castro B, Prada Sánchez J, González Manteiga W, Febrero Bande M (2003) Prediction of \({\rm SO}_{2}\) levels using neural networks. J Air Waste Manag Assoc 53:532–539

  • Fernández de Castro B, Guillas S, González Manteiga W (2005) Functional samples and bootstrap for predicting sulfur dioxide levels. Technometrics 47:212–222

    Article  Google Scholar 

  • Crambes C, Kneip A, Sarda P (2009) Smoothing splines estimators for functional linear regression. Ann Stat 37:35–72

    Article  Google Scholar 

  • Cuesta-Albertos J, Febrero-Bande M (2010) A simple multiway anova for functional data. Test 19:537–557

    Article  Google Scholar 

  • Cuevas A, Febrero M, Fraiman R (2004) An anova test for functional data. Comput Stat Data Anal 47: 111–122

    Article  Google Scholar 

  • Damon J, Guillas S (2002) The inclusion of exogenous variables in functional autoregressive ozone forecasting. Environmetrics 13:759–7747

    Article  CAS  Google Scholar 

  • Davis J, Speckman P (1999) A model for predicting maximum and 8 hour average ozone in Houston. Atmos Environ 33:2487–2500

    Article  CAS  Google Scholar 

  • Fan J, Gijbels I (1996) Local polynomial modelling and its applications, monographs on statistics and applied probability, vol 66. Chapman and Hall, London

    Google Scholar 

  • Fan J, Lin SK (1998) Test of significance when data are curves. J Am Stat Assoc 93:1007–1021

    Article  Google Scholar 

  • Febrero M, Galeano P, González-Manteiga W (2007) A functional analysis of nox levels: location and scale estimation and outlier detection. Comput Stat 22:411–427

    Article  Google Scholar 

  • Febrero M, Galeano P, González-Manteiga W (2008) Outlier detection in functional data by depth measures, with application to identify abnormal nox levels. Environmetrics 19:331–345

    Article  CAS  Google Scholar 

  • Ferraty F, Vieu P (2009) Additive prediction and boosting for functional data. Comput Stat Data Anal 53:1400–1413

    Article  Google Scholar 

  • Freedman D (1981) Bootstrapping regression models. Ann Stat 9(6):1218–1228

    Article  Google Scholar 

  • Gardner M, Dorling S (1999) Neural network modelling and prediction of hourly No and No\(_2\) concentrations in urban air in London. Atmos Environ 33:709–719

    Article  CAS  Google Scholar 

  • Ghattas B (1999) Prévisions des pics d’ozone par arbres de régression simples et agrégés par bootstrap. Revue de Statistique Appliquée XLVII:61–80

    Google Scholar 

  • Gu C (2002) Smoothing spline ANOVA models, springer series in statistics, vol xiii. Springer, New York

    Book  Google Scholar 

  • Hall P, Van Keilegom I (2007) Two-sample test in functional data analysis starting from discrete data. Stat Sin 17:1511–1531

    Google Scholar 

  • Huang J (1998) Projection estimation in multiple regression with application to functional anova models. Ann Stat 26:242–272

    Article  Google Scholar 

  • Ignaccolo R, Ghigo S, Giovenali E (2008) Analysis of air quality monitoring networks by functional clustering. Environmetrics 19:672–686

    Article  CAS  Google Scholar 

  • Kim S, Park S, Temiyasathit C (2009) Spatial prediction of ozone concentration profiles. Comput Stat Data Anal 53:3892–3906

    Article  Google Scholar 

  • Lin Y (2000) Tensor product space anova models. Ann Stat 28:734–755

    Article  Google Scholar 

  • Muñoz Maldonado Y, Staniswalis JG, Irwin LN, Byers D (2002) A similarity analysis of curves. Can J Stat 30:373–381

    Article  Google Scholar 

  • Marx B, Eilers P (1999) Generalized linear regression on sampled signals and curves: a p-spline approach. Technometrics 41:1–13

    Article  Google Scholar 

  • Mélard G, Roy R (1988) Modéles de series chronologiques avec seuils. Revue de Statistique Appliquée XXXVI:5–24

    Google Scholar 

  • R Development Core Team (2004) A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria, URL www.R-project.org

  • Ramsay J, Silverman B (1997) Functional data analysis, springer series in statistics, vol xiv. Springer, New York

    Book  Google Scholar 

  • Shen Q, Faraway J (2004) An \(f\) test for linear models with functional responses. Stat Sin 14:1239–1257

    Google Scholar 

  • Yi J, Prybutok V (1996) A neural network model forecasting for prediction of daily maximum ozone concentration in an industrialized urban area. Environ Pollut 3:349–357

    Article  Google Scholar 

  • Zhang JT, Chen J (2007) Statistical inferences for functional data. Ann Stat 35(3):1052–1079

    Article  Google Scholar 

  • Zoglat A (2008) Functional analysis of variance. Appl Math Sci, Ruse 2:1115–1129

    Google Scholar 

Download references

Acknowledgments

This research was supported by the Xunta de Galicia (Spain) under research projects 2012-PG226 and CN2012/130, and by Spanish grant from the Ministerio de Ciencia y Tecnología (MTM2011-22392). The authors would like to thank the suggestions of the three anonymous referees and the Associate Editor that helped to improve the quality of this paper.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Graciela Estévez-Pérez.

Additional information

Handling Editor: Pierre Dutilleul.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Estévez-Pérez, G., Vilar, J.A. Functional ANOVA starting from discrete data: an application to air quality data. Environ Ecol Stat 20, 495–517 (2013). https://doi.org/10.1007/s10651-012-0231-2

Download citation

  • Received:

  • Revised:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10651-012-0231-2

Keywords

Navigation