Skip to main content
Log in

Nonparametric density estimation for symmetric distributions by contaminated data

  • Published:
Metrika Aims and scope Submit manuscript

Abstract

A semiparametric two-component mixture model is considered, in which the distribution of one (primary) component is unknown and assumed symmetric. The distribution of the other component (admixture) is known. We consider three estimates for the pdf of primary component: a naive one, a symmetrized naive estimate and a symmetrized estimate with adaptive weights. Asymptotic behavior and small sample performance of the estimates are investigated. Some rules of thumb for bandwidth selection are discussed.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  • Bordes L, Delmas C, Vandekerkhove P (2006) Semiparametric estimation of a two-component mixture model where one component is known. Scand J Stat 33: 733–752

    Article  MathSciNet  MATH  Google Scholar 

  • Borovkov AA (1998) Mathematical statistics. Gordon and Breach Science Publishers, Amsterdam

    MATH  Google Scholar 

  • Bowman AW (1984) An alternative method of cross-validation for the smoothing of density estimates. Biometrika 71: 353–360

    Article  MathSciNet  Google Scholar 

  • Chacón JE, Montanero J, Nogales AG (2008) Bootstrap bandwidth selection using an h-dependent pilot bandwidth. Scand J Stat 35: 139–157

    Article  MATH  Google Scholar 

  • Devroye L, Györfi L, Lugosi G (1996) A probabilistic theory of pattern recognition. Springer, New York

    MATH  Google Scholar 

  • Devroye L (1997) Universal smoothing factor selection in density estimation: theory and practice (with discussion). Test 6: 223–320

    Article  MathSciNet  MATH  Google Scholar 

  • Efron B, Tibshirani R, Storey JD, Tusher V (2001) Empirical Bayes analysis of a microarray experiment. J Am Stat Assoc 96: 1151–1160

    Article  MathSciNet  MATH  Google Scholar 

  • Hall P (1990) Using the bootstrap to estimate mean squared error and select smoothing parameter in nonparametric problems. J Multivar Anal 32: 177–203

    Article  MATH  Google Scholar 

  • Hall P (1992) Effect of bias estimation on coverage accuracy of bootstrap confidence intervals for a probability density. Ann Stat 20: 675–694

    Article  MATH  Google Scholar 

  • Hall P, Marron JS, Park BU (1992) Smoothed cross-validation. Probab Theor Relat Fields 92: 1–20

    Article  MathSciNet  MATH  Google Scholar 

  • Hardle W, Muller M, Sperlich S, Werwatz A (2004) Nonparametric and semiparametric models. Springer, Berlin

    Book  Google Scholar 

  • Hedenfalk I, Duggan D, Chen YD, Radmacher M, Bittner M, Simon R, Meltzer P, Gusterson B, Esteller M, Kallioniemi OP et al (2001) Gene-expression profiles in hereditary breast cancer. N Engl J Med 344: 539–548

    Article  Google Scholar 

  • Ho YHS, Lee SMS (2008) Iterated bootstrap-t confidence intervals for density functions. Scand J Stat 35: 295–308

    Article  MathSciNet  MATH  Google Scholar 

  • Ibragimov IA, Khasminsky RZ (1979) Asymptotic estimation theory. Nauka, Moscow (in Russian)

  • Kraft CH, Lepage Y, van Eeden C (1985) Estimation of a symmetric density function. Commun Stat A Theor Methods 14: 273–288

    Article  MATH  Google Scholar 

  • Liu K, Tsokos CP (2001) Kernel estimates of symmetric density function. Int J Appl Math 6: 23–34

    MathSciNet  MATH  Google Scholar 

  • Macdonald P, Du J (2008) mixdist: finite mixture distribution models. R package version 0.5-2. http://www.math.mcmaster.ca/peter/mix/mix.html

  • Maiboroda R, Sugakova O (2010) Generalized estimating equations for symmetric distributions observed with admixture. Commun Stat Theor Method (to appear)

  • Meloche J (1991) Estimation of a symmetric density. Canad J Stat 19: 151–164

    Article  MathSciNet  MATH  Google Scholar 

  • Pearson K (1894) Contribution to the mathematical theory of evolution. Phil Trans Roy Soc A 185: 71–110

    Article  MATH  Google Scholar 

  • Robin S, Bar-Hen A, Daudin J, Pierre L (2007) A semi-parametric approach for mixture models: application to local false discovery rate estimation. Comput Statist Data Anal 51: 5483–5493

    Article  MathSciNet  MATH  Google Scholar 

  • Rudemo M (1982) Empirical choice of histograms and kernel density estimators. Scand J Stat 9: 65–78

    MathSciNet  MATH  Google Scholar 

  • Shao J (1998) Mathematical statistics. Springer, New York

    Google Scholar 

  • Silverman BW (1986) Density estimation for statistics and data analysis. Chapman & Hall, London

    MATH  Google Scholar 

  • Storey JD, Tibshirani R (2003) Statistical significance for genome-wide studies. Proc Natl Acad Sci 100: 9440–9445

    Article  MathSciNet  MATH  Google Scholar 

  • Sugakova O (2009) Estimation of location parameter by observations with admixture. Teorija Imovirnosti ta Matematychna Statystyka 80: 81–91

    Google Scholar 

  • Sugakova O (2010) Density estimation by observations with admixture. Theory of Stochastic Processes (to appear)

  • Wand MP, Jones MC (1995) Kernel smoothing. Chapman & Hall, London

    MATH  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Rostyslav Maiboroda.

Additional information

Research supported in part by Swedish Institute grant SI-01424/2007.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Maiboroda, R., Sugakova, O. Nonparametric density estimation for symmetric distributions by contaminated data. Metrika 75, 109–126 (2012). https://doi.org/10.1007/s00184-010-0317-5

Download citation

  • Received:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00184-010-0317-5

Keywords

Navigation