Abstract
A nonparametric functional approach is proposed to compare the mean functions of \(k\) samples of curves. In practice, curves data are usually collected in a discrete form and hence they must be pre-processed to use purely functional techniques. However, in the context of \(k\)-sample tests, the pre-processing step can have effects in terms of power reduction. Hall and Van Keilegom (Stat Sin 17:1511–1531, 2007) proposed a methodology to minimizing these effects in the context of tests for the equality of two distribution functions. Their procedure is here extended to the case of \(k\)-sample hypothesis tests. The asymptotic validity of the procedure is established and its finite sample performance is analyzed through Monte Carlo experiments. As an illustration, the method is applied to air quality data collected from several monitoring stations placed at different geographical locations at the center of Spain.
Similar content being viewed by others
References
Abramovich F, Angelini C (2006) Testing in mixed-effects fanova models. J Stat Plan Inference 136:4326–4348
Abramovich F, Antoniadis A, Sapatinas T, Vidakovic B (2004) Optimal testing in functional analysis of variance models. Int J Wavelets Multiresolution Inf Process 2:323–349
Aneiros G, Cardot H, Estévez G, Vieu P (2004) Maximum ozone concentration forecasting by functional non-parametric approaches. Environmetrics 15:675–685
Angelini C, De Canditiis D, Leblanc F (2003) Wavelet regression estimation in nonparametric mixed effect models. J Multivar Anal 85:267–291
Besse P, Cardot H, Stephenson D (2000) Autoregressive forecasting of some functional climatic variations. Scand J Stat 27:673–687
Bugli C, Lambert P (2006) Functional anova with random functional effects: an application to event-related potentials modelling for electroencephalograms analysis. Stat Med 25:3718–3739
Cardot H, Faivre R, Goulard M (2003) Functional approaches for predicting land use with the temporal evolution of coarse resolution remote sensing data. J Appl Stat 30:1185–1200
Fernández de Castro B, Prada Sánchez J, González Manteiga W, Febrero Bande M (2003) Prediction of \({\rm SO}_{2}\) levels using neural networks. J Air Waste Manag Assoc 53:532–539
Fernández de Castro B, Guillas S, González Manteiga W (2005) Functional samples and bootstrap for predicting sulfur dioxide levels. Technometrics 47:212–222
Crambes C, Kneip A, Sarda P (2009) Smoothing splines estimators for functional linear regression. Ann Stat 37:35–72
Cuesta-Albertos J, Febrero-Bande M (2010) A simple multiway anova for functional data. Test 19:537–557
Cuevas A, Febrero M, Fraiman R (2004) An anova test for functional data. Comput Stat Data Anal 47: 111–122
Damon J, Guillas S (2002) The inclusion of exogenous variables in functional autoregressive ozone forecasting. Environmetrics 13:759–7747
Davis J, Speckman P (1999) A model for predicting maximum and 8 hour average ozone in Houston. Atmos Environ 33:2487–2500
Fan J, Gijbels I (1996) Local polynomial modelling and its applications, monographs on statistics and applied probability, vol 66. Chapman and Hall, London
Fan J, Lin SK (1998) Test of significance when data are curves. J Am Stat Assoc 93:1007–1021
Febrero M, Galeano P, González-Manteiga W (2007) A functional analysis of nox levels: location and scale estimation and outlier detection. Comput Stat 22:411–427
Febrero M, Galeano P, González-Manteiga W (2008) Outlier detection in functional data by depth measures, with application to identify abnormal nox levels. Environmetrics 19:331–345
Ferraty F, Vieu P (2009) Additive prediction and boosting for functional data. Comput Stat Data Anal 53:1400–1413
Freedman D (1981) Bootstrapping regression models. Ann Stat 9(6):1218–1228
Gardner M, Dorling S (1999) Neural network modelling and prediction of hourly No and No\(_2\) concentrations in urban air in London. Atmos Environ 33:709–719
Ghattas B (1999) Prévisions des pics d’ozone par arbres de régression simples et agrégés par bootstrap. Revue de Statistique Appliquée XLVII:61–80
Gu C (2002) Smoothing spline ANOVA models, springer series in statistics, vol xiii. Springer, New York
Hall P, Van Keilegom I (2007) Two-sample test in functional data analysis starting from discrete data. Stat Sin 17:1511–1531
Huang J (1998) Projection estimation in multiple regression with application to functional anova models. Ann Stat 26:242–272
Ignaccolo R, Ghigo S, Giovenali E (2008) Analysis of air quality monitoring networks by functional clustering. Environmetrics 19:672–686
Kim S, Park S, Temiyasathit C (2009) Spatial prediction of ozone concentration profiles. Comput Stat Data Anal 53:3892–3906
Lin Y (2000) Tensor product space anova models. Ann Stat 28:734–755
Muñoz Maldonado Y, Staniswalis JG, Irwin LN, Byers D (2002) A similarity analysis of curves. Can J Stat 30:373–381
Marx B, Eilers P (1999) Generalized linear regression on sampled signals and curves: a p-spline approach. Technometrics 41:1–13
Mélard G, Roy R (1988) Modéles de series chronologiques avec seuils. Revue de Statistique Appliquée XXXVI:5–24
R Development Core Team (2004) A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria, URL www.R-project.org
Ramsay J, Silverman B (1997) Functional data analysis, springer series in statistics, vol xiv. Springer, New York
Shen Q, Faraway J (2004) An \(f\) test for linear models with functional responses. Stat Sin 14:1239–1257
Yi J, Prybutok V (1996) A neural network model forecasting for prediction of daily maximum ozone concentration in an industrialized urban area. Environ Pollut 3:349–357
Zhang JT, Chen J (2007) Statistical inferences for functional data. Ann Stat 35(3):1052–1079
Zoglat A (2008) Functional analysis of variance. Appl Math Sci, Ruse 2:1115–1129
Acknowledgments
This research was supported by the Xunta de Galicia (Spain) under research projects 2012-PG226 and CN2012/130, and by Spanish grant from the Ministerio de Ciencia y Tecnología (MTM2011-22392). The authors would like to thank the suggestions of the three anonymous referees and the Associate Editor that helped to improve the quality of this paper.
Author information
Authors and Affiliations
Corresponding author
Additional information
Handling Editor: Pierre Dutilleul.
Rights and permissions
About this article
Cite this article
Estévez-Pérez, G., Vilar, J.A. Functional ANOVA starting from discrete data: an application to air quality data. Environ Ecol Stat 20, 495–517 (2013). https://doi.org/10.1007/s10651-012-0231-2
Received:
Revised:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10651-012-0231-2