Abstract
Spherical data arise widely in various settings. Spherical statistics is an analysis of data on a unit hyper-spherical domain. In this paper, we mainly consider the local kernel estimators for regression models with a binary response and the predictors including spherical variables. We apply the random forests kernel to nonparametric binary regression models with spherical predictors. Simulation experiments and real examples are used to validate the performance of the new models. Compared with the classical von Mises–Fisher kernel and the linear-spherical kernel, the random forests kernel has better fitting effect and faster computation speed. Compared with other classifiers, the models proposed in this paper have better classification performance in both low and high dimensional cases.
Similar content being viewed by others
References
Al-Daffaie K, Khan S (2017) Logistic regression for circular data. In: AIP conference proceedings
Bai ZD, Rao CR, Zhao LC (1987) Kernel estimators of density function of directional data. J Multivar Anal 27:24–39
Bock RK, Chilingarian A, Gaug M (2003) Methods for multidimensional event classification: a case study using images from a Cherenkov gamma-ray telescope. Nuclear Inst Methods Phys Res A 516:511–528
Breiman L (2001) Random forests. Mach Learn 45(1):5–32
Di Marzio M, Panzera A, Taylor CC (2014) Nonparametric regression for spherical data. J Am Stat Assoc 109(506):748–763
Di Marzio M, Fensore S, Panzera A et al (2019a) Local binary regression with spherical predictors. Stat Probab Lett 144:30–36
Di Marzio M, Panzera A, Taylor CC (2019b) Kernel density classification for spherical data. Stat Probab Lett 144:23–29
Dua D, Graff C (2017) UCI machine learning repository. http://archive.ics.uci.edu/ml
Fan J, Gijbels I (1996) Local polynomial modelling and its applications. Chapman & Hall Press, Cambridge
Friedberg R, Tibshirani J, Athey S et al (2020) Local linear forests. https://arxiv.org/pdf/1807.11408.pdf
García-Portugués E, Crujeiras RM, González-Manteiga W (2013) Kernel density estimation for directional-linear data. J Multivar Anal 121:152–175
García-Portugués E, Van KI, Crujeiras RM et al (2016) Testing parametric models in linear-directional regression. Scand J Stat 43(4):1178–1191
Hall P, Watson GS, Cabrera J (1987) Kernel density estimation with spherical data. Biometrika 74(4):751–762
Pewsey A, García-Portugués E (2021) Recent advances in directional statistics. TEST 30(3):1–58
Qin X, Zhang JS, Yan XD (2011) A nonparametric circular-linear multivariate regression model with arule-of-thumb bandwidth selector. Comput Math Appl 62(8):3048–3055
R Core Team R (2020) A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna
Scornet E (2016) Random forests and kernel methods. IEEE Trans Inf Theory 62(3):1485–1500
Signorini DF, Jones MC (2004) Kernel estimators for univariate binary regression. J Am Stat Assoc 99(465):119–126
Sra S (2016) Directional statistics in machine learning: a brief review. https://arxiv.org/pdf/1605.00316.pdf
Tsagris M, Athineou G, Adam C, et al (2023) Directional: a collection of functions for directional data analysis. R package version 5.9
Acknowledgements
The authors thanks for the two anonymous reviewers with their comments about the work.
Funding
This work is supported by the National Natural Science Foundation of China (Grant No. 72033002) and Distinguished Young Scholars of Sichuan Province (No. 2022JDJQ0035).
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Qin, X., Gao, H. Nonparametric binary regression models with spherical predictors based on the random forests kernel. Comput Stat (2023). https://doi.org/10.1007/s00180-023-01422-9
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s00180-023-01422-9