Abstract
This paper summarizes results in the use of the Forward Search in the analysis of corrupted datasets, and those with mixtures of populations. We discuss new challenges that arise in the analysis of large, complex datasets. Methods developed for regression and clustering are described.
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Atkinson, A. C. and M. Riani (2000). Robust Diagnostic Regression Analysis. New York: Springer–Verlag.
Atkinson, A. C. and M. Riani (2007). Exploratory tools for clustering multivariate data. Computational Statistics and Data Analysis 52, 272–285. doi:10.1016/j.csda.2006.12.034.
Atkinson, A. C. and M. Riani (2008). A robust and diagnostic information criterion for selecting regression models. Journal of the Japanese Statistical Society 38, 3–14.
Atkinson, A. C., M. Riani, and A. Cerioli (2004). Exploring Multivariate Data with the Forward Search. New York: Springer–Verlag.
Fraley, C. and A. E. Raftery (1999). Mclust: software for model-based cluster analysis. Journal of Classification 16, 297–306.
Mallows, C. L. (1973). Some comments on C p . Technometrics 15, 661–675.
Maronna, R. A., D. R. Martin, and V. J. Yohai (2006). Robust Statistics: Theory and Methods. New York: Wiley.
Riani, M. and A. C. Atkinson (2007). Fast calibrations of the forward search for testing multiple outliers in regression. Advances in Data Analysis and Classification 1, 123–141. doi:10.1007/s11634-007-0007-y.
Riani, M. and A. C. Atkinson (2010). Robust model selection with flexible trimming. Computational Statistics and Data Analysis 54, 3300–3312. doi:10.1016/j.csda.2010.03.007.
Riani, M., A. C. Atkinson, and A. Cerioli (2009). Finding an unknown number of multivariate outliers. Journal of the Royal Statistical Society, Series B 71, 201–221.
Riani, M., A. Cerioli, A. Atkinson, D. Perrotta, and F. Torti (2008). Fitting mixtures of regression lines with the forward search. In F. Fogelman-Soulié, D. Perrotta, J. Piskorski, and R. Steinberger (Eds.), Mining Massive Data Sets for Security, pp. 271–286. Amsterdam: IOS Press.
Torti, F. and D. Perrotta (2010). Size and power of tests for regression outliers in the forward search. In S. Ingrassia, R. Rocci, and M. Vichi (Eds.), New Perspectives in Statistical Modeling and Data Analysis. Heidelberg: Springer-Verlag, pp. 377–384.
Tufte, E. (2001). The Visual Display of Quantitative Information. Cheshire, CT: Graphics Press.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Riani, M., Atkinson, A., Cerioli, A. (2012). Problems and Challenges in the Analysis of Complex Data: Static and Dynamic Approaches. In: Di Ciaccio, A., Coli, M., Angulo Ibanez, J. (eds) Advanced Statistical Methods for the Analysis of Large Data-Sets. Studies in Theoretical and Applied Statistics(). Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-21037-2_14
Download citation
DOI: https://doi.org/10.1007/978-3-642-21037-2_14
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-21036-5
Online ISBN: 978-3-642-21037-2
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)