Skip to main content
Log in

Clusters of effects curves in quantile regression models

  • Original Paper
  • Published:
Computational Statistics Aims and scope Submit manuscript

Abstract

In this paper, we propose a new method for finding similarity of effects based on quantile regression models. Clustering of effects curves (CEC) techniques are applied to quantile regression coefficients, which are one-to-one functions of the order of the quantile. We adopt the quantile regression coefficients modeling (QRCM) framework to describe the functional form of the coefficient functions by means of parametric models. The proposed method can be utilized to cluster the effect of covariates with a univariate response variable, or to cluster a multivariate outcome. We report simulation results, comparing our approach with the existing techniques. The idea of combining CEC with QRCM permits simplifying computation and interpretation of the results, and may improve the ability to identify clusters. We illustrate a variety of applications, highlighting the advantages and the usefulness of the described method.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7

Similar content being viewed by others

References

  • Abramowitz M, Stegun I (1964) Handbook of mathematical functions: with formulas, graphs, and mathematical tables, vol 55. Courier Corporation, Chelmsford

    MATH  Google Scholar 

  • Adelfio G, Chiodi M, D’Alessandro A, Luzio D (2011) Fpca algorithm for waveform clustering. J Commun Comput 8(6):494–502

    Google Scholar 

  • Adelfio G, Chiodi M, D’Alessandro A, Luzio D, D’Anna G, Mangano G (2012) Simultaneous seismic wave clustering and registration. Comput Geosci 44:60–69

    Article  Google Scholar 

  • Adelfio G, Di Salvo F, Chiodi M (2016) Space-time FPCA algorithm for clustering of multidimensional curves. In: Proceeding of the 48th scientific meeting of the Italian Statistical Society, Salerno

  • Bouveyron C, Brunet-Saumard C (2014) Model-based clustering of high-dimensional data: a review. Comput Stat Data Anal 71:52–78

    Article  MathSciNet  MATH  Google Scholar 

  • Clogg C, Petkova E, Haritou A (1995) Statistical methods for comparing regression coefficients between models. Am J Sociol 100(5):1261–1293

    Article  Google Scholar 

  • Fisher R (1936) The use of multiple measurements in taxonomic problems. Ann Hum Genet 7(2):179–188

    Google Scholar 

  • Frumento P (2017) QRCM: quantile regression coefficients modeling. https://CRAN.R-project.org/package=qrcm, r package version 2.1

  • Frumento P, Bottai M (2016) Parametric modeling of quantile regression coefficient functions. Biometrics 72(1):74–84

    Article  MathSciNet  MATH  Google Scholar 

  • Garcia-Escudero L, Gordaliza A (2005) A proposal for robust curve clustering. J Classif 22(2):185–201

    Article  MathSciNet  MATH  Google Scholar 

  • Gower J (1975) Generalized procrustes analysis. Psychometrika 40(1):33–51

    Article  MathSciNet  MATH  Google Scholar 

  • Hotelling H (1933) Analysis of a complex of statistical variables into principal components. J Educ Psychol 24(6):417

    Article  MATH  Google Scholar 

  • Jacques J, Preda C (2014) Functional data clustering: a survey. Adv Data Anal Classif 8(3):231–255

    Article  MathSciNet  Google Scholar 

  • James G (2007) Curve alignment by moments. Ann Appl Stat 1:480–501

    Article  MathSciNet  MATH  Google Scholar 

  • Kneip A, Gasser T (1992) Statistical tools to analyze data representing a sample of curves. Ann Stat 20:1266–1305

    Article  MathSciNet  MATH  Google Scholar 

  • Koenker R (2005) Quantile regression, vol 38. Cambridge University Press, Cambridge

    Book  MATH  Google Scholar 

  • Koenker R, Bassett G Jr (1978) Regression quantiles. Econom J Econom Soc 46:33–50

    MathSciNet  MATH  Google Scholar 

  • Pearson K (1901) On lines and planes of closest fit to systems of points in space. In: Proceedings of of the 17th ACM SIGACT-SIGMOD-SIGART symposium on Principles of database systems (SIGMOD)

  • Ramsay J (2006) Functional data analysis. Wiley, New York

    Google Scholar 

  • Ramsay J, Li X (1998) Curve registration. J R Stat Soc Ser B 60(2):351–363

    Article  MathSciNet  MATH  Google Scholar 

  • Sangalli L, Secchi P, Vantini S, Veneziani A (2009) A case study in exploratory functional data analysis: geometrical features of the internal carotid artery. J Am Stat Assoc 104(485):37–48

    Article  MathSciNet  MATH  Google Scholar 

  • Silverman B (1995) Incorporating parametric effects into functional principal components analysis. J R Stat Soc Ser B 57:673–689

    MathSciNet  MATH  Google Scholar 

  • Sottile G, Adelfio G (2017) clustEff: clusters of effect curves in quantile regression models. R package version 0.1.1. https://CRAN.R-project.org/package=clustEff

  • Vichi M, Saporta G (2009) Clustering and disjoint principal component analysis. Comput Stat Data Anal 53(8):3194–3208

    Article  MathSciNet  MATH  Google Scholar 

  • Wang K, Gasser T (1997) Alignment of curves by dynamic time warping. Ann Stat 25(3):1251–1276

    Article  MathSciNet  MATH  Google Scholar 

Download references

Acknowledgements

We would like to thank the two anonymous reviewers for their suggestions and comments, that allowed us to considerably improve the manuscript.

Funding This paper has been partially supported by the national grant of the Italian Ministry of Education University and Research (MIUR) for the PRIN-2015 program (Progetti di ricerca di Rilevante Interesse Nazionale), “Prot. 20157PRZC4 - Research Project Title Complex space-time modelling and functional analysis for probabilistic forecast of seismic events. PI: Giada Adelfio”.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Giada Adelfio.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Sottile, G., Adelfio, G. Clusters of effects curves in quantile regression models. Comput Stat 34, 551–569 (2019). https://doi.org/10.1007/s00180-018-0817-8

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00180-018-0817-8

Keywords

Navigation