Skip to main content

Problems and Challenges in the Analysis of Complex Data: Static and Dynamic Approaches

  • Conference paper
  • First Online:

Part of the book series: Studies in Theoretical and Applied Statistics ((STASSPSS))

Abstract

This paper summarizes results in the use of the Forward Search in the analysis of corrupted datasets, and those with mixtures of populations. We discuss new challenges that arise in the analysis of large, complex datasets. Methods developed for regression and clustering are described.

Keywords

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   129.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD   169.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

  • Atkinson, A. C. and M. Riani (2000). Robust Diagnostic Regression Analysis. New York: Springer–Verlag.

    Google Scholar 

  • Atkinson, A. C. and M. Riani (2007). Exploratory tools for clustering multivariate data. Computational Statistics and Data Analysis 52, 272–285. doi:10.1016/j.csda.2006.12.034.

    Article  MathSciNet  MATH  Google Scholar 

  • Atkinson, A. C. and M. Riani (2008). A robust and diagnostic information criterion for selecting regression models. Journal of the Japanese Statistical Society 38, 3–14.

    MathSciNet  Google Scholar 

  • Atkinson, A. C., M. Riani, and A. Cerioli (2004). Exploring Multivariate Data with the Forward Search. New York: Springer–Verlag.

    Google Scholar 

  • Fraley, C. and A. E. Raftery (1999). Mclust: software for model-based cluster analysis. Journal of Classification 16, 297–306.

    Article  MATH  Google Scholar 

  • Mallows, C. L. (1973). Some comments on C p . Technometrics 15, 661–675.

    MATH  Google Scholar 

  • Maronna, R. A., D. R. Martin, and V. J. Yohai (2006). Robust Statistics: Theory and Methods. New York: Wiley.

    Book  MATH  Google Scholar 

  • Riani, M. and A. C. Atkinson (2007). Fast calibrations of the forward search for testing multiple outliers in regression. Advances in Data Analysis and Classification 1, 123–141. doi:10.1007/s11634-007-0007-y.

    Article  MathSciNet  Google Scholar 

  • Riani, M. and A. C. Atkinson (2010). Robust model selection with flexible trimming. Computational Statistics and Data Analysis 54, 3300–3312. doi:10.1016/j.csda.2010.03.007.

    Article  MathSciNet  Google Scholar 

  • Riani, M., A. C. Atkinson, and A. Cerioli (2009). Finding an unknown number of multivariate outliers. Journal of the Royal Statistical Society, Series B 71, 201–221.

    MathSciNet  Google Scholar 

  • Riani, M., A. Cerioli, A. Atkinson, D. Perrotta, and F. Torti (2008). Fitting mixtures of regression lines with the forward search. In F. Fogelman-Soulié, D. Perrotta, J. Piskorski, and R. Steinberger (Eds.), Mining Massive Data Sets for Security, pp. 271–286. Amsterdam: IOS Press.

    Google Scholar 

  • Torti, F. and D. Perrotta (2010). Size and power of tests for regression outliers in the forward search. In S. Ingrassia, R. Rocci, and M. Vichi (Eds.), New Perspectives in Statistical Modeling and Data Analysis. Heidelberg: Springer-Verlag, pp. 377–384.

    Google Scholar 

  • Tufte, E. (2001). The Visual Display of Quantitative Information. Cheshire, CT: Graphics Press.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Marco Riani .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2012 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Riani, M., Atkinson, A., Cerioli, A. (2012). Problems and Challenges in the Analysis of Complex Data: Static and Dynamic Approaches. In: Di Ciaccio, A., Coli, M., Angulo Ibanez, J. (eds) Advanced Statistical Methods for the Analysis of Large Data-Sets. Studies in Theoretical and Applied Statistics(). Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-21037-2_14

Download citation

Publish with us

Policies and ethics