Skip to main content
Log in

kNN estimation in functional partial linear modeling

  • Regular Article
  • Published:
Statistical Papers Aims and scope Submit manuscript

Abstract

A statistical procedure combining the local adaptivity and the easiness of implementation of k-nearest-neighbours (kNN) estimates together with the semiparametric flexibility of partial linear modeling is developed for regression problems involving functional variable. Various asymptotic results are stated, both for the linear parameters and for the nonparametric operator involved in the model. A simulation study compares the finite sample behaviour of the kNN method with alternative estimation procedures. Finally, comparison with alternative functional regression models is carried out by means of a real curves data application which exhibits the interest both of the kNN method and of the semi-parametric modeling.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7

Similar content being viewed by others

References

  • Aneiros G, Bongiorno EG, Cao R, Vieu P (2017) Functional statistics and related fields. Contributions to statistics. Springer, Cham

    MATH  Google Scholar 

  • Aneiros G, Ling N, Vieu P (2015) Error variance estimation in semi-functional partially linear regression models. J Nonparametr Stat 27(3):316–330

    MathSciNet  MATH  Google Scholar 

  • Aneiros-Pérez G, Vieu P (2006) Semi-functional partial linear regression. Stat Probab Lett 76(11):1102–1110

    MathSciNet  MATH  Google Scholar 

  • Aneiros-Pérez G, Vieu P (2011) Automatic estimation procedure in partial linear model with functional data. Stat Pap 52(4):751–771

    MathSciNet  MATH  Google Scholar 

  • Attouch M, Benchikh T (2012) Asymptotic distribution of robust k-nearest neighbour estimator for functional nonparametric models. Mat Vesnik 644:275–285

    MathSciNet  MATH  Google Scholar 

  • Berlinet A, Servien R (2011) Necessary and sufficient condition for the existence of a limit distribution of the nearest-neighbour density estimator. J Nonparametr Stat 23(3):633–643

    MathSciNet  MATH  Google Scholar 

  • Biau G, Cérou F, Guyader A (2010) Rates of convergence of the functional k-nearest neighbor estimate. IEEE Trans Inf Theory 56(4):2034–2040

    MathSciNet  MATH  Google Scholar 

  • Bosq D, Blanke D (2007) Inference and prediction in large dimension. Wiley series in probability and statistics. Wiley, Chichester

    MATH  Google Scholar 

  • Burba F, Ferraty F, Vieu P (2009) \(k\)-Nearest Neighbour method in functional nonparametric regression. J Nonparametr Stat 21(4):453–469

    MathSciNet  MATH  Google Scholar 

  • Cerou F, Guyader A (2006) Nearest neighbor classification in infinite dimension. ESAIM Probab Stat 10:340–355

    MathSciNet  MATH  Google Scholar 

  • Chen Z, Wang H, Wang X (2016) The consistency for the estimator of nonparametric regression model based on martingale difference errors. Stat Pap 57(2):451–469

    MathSciNet  MATH  Google Scholar 

  • Chu B, Huynh K, Jacho-Chávez D (2013) Functionals of order statistics and their multivariate concomitants with application to semiparametric estimation by nearest neighbours. Sankhya B 75(2):238–292

    MathSciNet  MATH  Google Scholar 

  • Collomb G (1979) Estimation de la régression par la méthode des k points les plus proches: propriétés de convergence ponctuelle (French). C R Acad Sci Paris 289(3):245–247

    MathSciNet  MATH  Google Scholar 

  • Collomb G, Hassani S, Sarda P, Vieu P (1985) Convergence uniforme d’estimateurs de la fonction de hazard pour des observations dépendantes: méthodes du noyau et des k points les plus proches. (French). C R Acad Sci Paris Sér I Math 301(12):653–656

    MathSciNet  MATH  Google Scholar 

  • Cover T-M (1968) Estimation by the nearest neighbor rule. IEEE Trans Inf Theory IT–14:50–55

    MATH  Google Scholar 

  • Cuevas A (2014) A partial overview of the theory of statistics with functional data. J Stat Plann Inference 147:1–23

    MathSciNet  MATH  Google Scholar 

  • Dette H, Gefeller O (1995) The impact of different definitions of nearest neighbour distances for censored data on nearest neighbour kernel estimators of the hazard rate. J Nonparametr Stat 4(3):271–282

    MathSciNet  MATH  Google Scholar 

  • Devroye L, Györfi L, Krzyzak A, Lugosi G (1994) On the strong universal consistency of nearest neighbor regression function estimates. Ann Stat 22:1371–1385

    MathSciNet  MATH  Google Scholar 

  • Devroye L, Wagner T (1977) The strong uniform consistency of nearest neighbor density estimates. Ann Stat 5(3):536–540

    MathSciNet  MATH  Google Scholar 

  • Devroye L, Wagner T (1982) Nearest neighbor methods in discrimination. In: Classification, pattern recognition and reduction of dimensionality. Handbook of statistics, vol 2. North-Holland, Amsterdam. pp 193–197

    Google Scholar 

  • Engle RF, Granger CWJ, Rice J, Weiss A (1986) Semiparametric estimates of the relation between weather and electricity sales. J Am Stat Assoc 81:310–320

    Google Scholar 

  • Ferraty F, Vieu P (2006) Nonparametric functional data analysis. Theory and practice. Springer, New York

    MATH  Google Scholar 

  • Goia A, Vieu P (2014) Some advances in semiparametric functional data modelling. In: Contributions in infinite-dimensional statistics and related topics. Esculapio, Bologna, pp 135–141

  • Goia A, Vieu P (2016) An introduction to recent advances in high/infinite dimensional statistics. J Multivar Anal 146:1–6

    MathSciNet  MATH  Google Scholar 

  • Györfi L, Kohler M, Krzyzak A, Walk H (2002) A distribution-free theory of nonparametric regression. Springer series in statisics. Springer, New York

    MATH  Google Scholar 

  • Härdle W, Liang H, Gao J (2000) Partially linear models. Physica-Verlag, Heidelberg

    MATH  Google Scholar 

  • Hong S (1992) Estimation theory of a class of semiparametric regression models. Sci China Ser A 35(6):657–674

    MathSciNet  MATH  Google Scholar 

  • Horváth L, Kokoszka P (2012) Inference for functional data with applications. Springer series in statistics. Springer, New York

    MATH  Google Scholar 

  • Hsing T, Eubank R (2015) Theoretical foundations to functional data analysis with an introduction to linear operators. Wiley series in probability and statistics. Wiley, Chichester

    MATH  Google Scholar 

  • Kara-Zaitri L, Laksaci A, Rachdi M, Vieu P (2017a) Data-driven kNN estimation in nonparametric functional data-analysis. J Multivar Anal 153:176–188

    MATH  Google Scholar 

  • Kara-Zaitri L, Laksaci A, Rachdi M, Vieu P (2017b) Uniform in bandwidth consistency for various kernel estimators involving functional data. J Nonparametr Stat 29(1):85–107

    MathSciNet  MATH  Google Scholar 

  • Kudraszow N, Vieu P (2013) Uniform consistency of \(k\)NN regressors for functional variables. Stat Probab Lett 83(8):1863–1870

    MATH  Google Scholar 

  • Laloë T (2008) A k-nearest approach for functional regression. Stat Prob Lett 10:1189–1193

    MathSciNet  MATH  Google Scholar 

  • Li H, Li Q, Liu R (2016) Consistent model specification tests based on k-nearest-neighbor estimation methods. J Econom 194(1):187–202

    MathSciNet  MATH  Google Scholar 

  • Lian H (2011) Functional partial linear model. J Nonparametr Stat 23(1):115–128

    MathSciNet  MATH  Google Scholar 

  • Ouadah S (2013) Uniform-in-bandwidth nearest-neighbor density estimation. Stat Probab Lett 83(8):1835–1843

    MathSciNet  MATH  Google Scholar 

  • Paindaveine D, Van Bever G (2015) Nonparametrically consistent depth-based classifiers. Bernoulli 21(1):69–82

    MathSciNet  MATH  Google Scholar 

  • Ramsay J, Silverman B (2005) Functional data analysis. Springer series in statistics. Springer, New York

    MATH  Google Scholar 

  • Robinson P (1995) Nearest-neighbour estimation of semiparametric regression models. J Nonparametr Stat 5(1):33–41

    MathSciNet  MATH  Google Scholar 

  • Sancetta A (2010) Nearest neighbor conditional estimation for Harris recurrent Markov chains. J Multivar Anal 100(10):2224–2236

    MathSciNet  MATH  Google Scholar 

  • Shang H (2014) Bayesian bandwidth estimation for a semi-functional partial linear regression model with unknown error density. Comput Stat 29(3–4):829–848

    MathSciNet  MATH  Google Scholar 

  • Zhang J (2013) Analysis of variance for functional data. Monographs on statistics and applied probability. Chapman & Hall/CRC, Boca Raton

    Google Scholar 

Download references

Acknowledgements

This work has received financial support from the National Social Science Funds of China (NSSF 14ATJ005), the NNSF of China (NNSF 11501005), the Spanish Ministerio de Economía y Competitividad (Grant MTM2014-52876-R), the Xunta de Galicia (Centro Singular de Investigaciń de Galicia accreditation ED431G/01 2016-2019 and Grupos de Referencia Competitiva ED431C2016-015) and the European Union (European Regional Development Fund—ERDF). The authors would like to thank the Associate Editor and the two anonymous referees for their constructive and helpful comments, which have greatly improved the paper.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Nengxiang Ling.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Ling, N., Aneiros, G. & Vieu, P. kNN estimation in functional partial linear modeling. Stat Papers 61, 423–444 (2020). https://doi.org/10.1007/s00362-017-0946-0

Download citation

  • Received:

  • Revised:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00362-017-0946-0

Keywords

Navigation