Skip to main content
Log in

Visual clustering and classification: The Oronsay particle size data set revisited

  • Published:
Computational Statistics Aims and scope Submit manuscript

Summary

Interactive statistical graphics can be effectively used to find natural groupings in observations. In this paper we want to demonstrate how clustering and classification can be done with three approaches based on highly interactive graphical environments: high-dimensional scatterplots as available in XGobi, parallel coordinate plots as available in ExplorN, and linked low-dimensional views as available in Manet. We will point out the strengths and the weaknesses of these techniques by comparing their behavior when applied to the Oronsay particle size data set.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6
Figure 7
Figure 8
Figure 9
Figure 10
Figure 11
Figure 12
Figure 13
Figure 14
Figure 15
Figure 16
Figure 17
Figure 18
Figure 19
Figure 20

Similar content being viewed by others

References

  • Andrews, M. V., Gilbertson, D. D. & Kent, M. (1987), Storm Frequencies along the Mesolithic Coastline, in P. A. Meilars, ed., ‘Excavations on Oronsay: Prehistoric Human Ecology on a Small Island’, Edinburgh University Press, Edinburgh, UK, pp. 108–114.

    Google Scholar 

  • Asimov, D. (1985), ‘The Grand Tour: A Tool for Viewing Multidimensional Data’, SIAM Journal on Scientific and Statistical Computing 6(1), 128–143.

    Article  MathSciNet  MATH  Google Scholar 

  • Becker, R., Chambers, J. & Wilks, A. (1988), The New S LanguageA Programming Environment for Data Analysis and Graphics, Wadsworth and Brooks/Cole, Pacific Grove, CA.

    MATH  Google Scholar 

  • Buja, A. & Asimov, D. (1986), Grand Tour Methods: An Outline, in D. M. Allen, ed., ‘Proceedings of the 17th Symposium on the Interface between Computer Science and Statistics, Lexington, KY’, Elsevier, pp. 63–67.

    Google Scholar 

  • Carr, D. B., Wegman, E. J. & Luo, Q. (1997), ExplorN: Design Considerations Past and Present, Technical Report 137, Center for Computational Statistics, George Mason University, Fairfax, VA.

    Google Scholar 

  • Cleveland, W. S. & McGill, M. E., eds (1988), Dynamic Graphics for Statistics, Wadsworth & Brooks/Cole, Belmont, CA.

    Google Scholar 

  • Cook, D., Buja, A. & Cabrera, J. (1993), ‘Projection Pursuit Indices Based on Expansions with Orthonormal Functions’, Journal of Computational and Graphical Statistics 2(3), 225–250.

    Article  MathSciNet  Google Scholar 

  • Cook, D., Buja, A., Cabrera, J. & Hurley, C. (1995), ‘Grand Tour and Projection Pursuit’, Journal of Computational and Graphical Statistics 4(3), 155–172.

    Google Scholar 

  • Cook, D., Majure, J. J., Symanzik, J. & Cressie, N. (1996), ‘Dynamic Graphics in a GIS: Exploring and Analyzing Multivariate Spatial Data Using Linked Software’, Computational Statistics: Special Issue on Computeraided Analysis of Spatial Data 11(4), 467–480.

    Google Scholar 

  • Cook, D., Symanzik, J., Majure, J. J. & Cressie, N. (1997), ‘Dynamic Graphics in a GIS: More Examples Using Linked Software’, Computers and Geosciences: Special Issue on Exploratory Cartographic Visualization 23(4), 371–385. Paper, CD, and http://www.elsevier.nl/locate/cgvis.

    Article  Google Scholar 

  • Eick, S. G. & Wills, G. J. (1995), ‘High Interaction Graphics’, European Journal of Operational Research 84, 445–459.

    Article  MATH  Google Scholar 

  • Fieller, N. R. J., Flenley, E. C. & Olbricht, W. (1992), ‘Statistics of Particle Size Data’, Journal of the Royal Statistical Society, Series C (Applied Statistics) 41(1), 127–146.

    MATH  Google Scholar 

  • Fieller, N. R. J., Gilbertson, D. D. & Olbricht, W. (1984), ‘A new Method for Environmental Analysis of Particle Size Distribution Data from Shoreline Sediments’, Nature 311, 648–651.

    Article  Google Scholar 

  • Fieller, N. R. J., Gilbertson, D. D., Olbricht, W. & Timmins, D. A. Y. (1983), ‘A Computer-Compatible Archive of Sedimentological Data from Oronsay, Inner Hebrides’, Archive No. 2, Department of Prehistory and Archaeology, University of Sheffield, Sheffield, UK.

    Google Scholar 

  • Fieller, N. R. J., Gilbertson, D. D. & Timmins, D. A. Y. (1987), Sedimentological Analyses of the Shell-Midden Sites, in P. A. Mellars, ed., ‘Excavations on Oronsay: Prehistoric Human Ecology on a Small Island’, Edinburgh University Press, Edinburgh, UK, pp. 78–90.

    Google Scholar 

  • Flenley, E. C. & Olbricht, W. (1993), Classification of Archaeological Sands by Particle Size Analysis, in O. Opitz, B. Lausen & R. Klar, eds, ‘Information and Classification. Concepts, Methods and Applications. Proceedings of the 16th Annual Conference of the “Gesellschaft für Klassifikation e. V.”’, Springer, Berlin, Heidelberg, pp. 478–489.

    Google Scholar 

  • Friedman, J. H. & Tukey, J. W. (1974), ‘A Projection Pursuit Algorithm for Exploratory Data Analysis’, IEEE Transactions on Computers C-23, 881–889.

    Article  MATH  Google Scholar 

  • Huber, P. J. (1985), ‘Projection Pursuit (with Discussion)’, Annals of Statistics 13, 435–525.

    Article  MathSciNet  MATH  Google Scholar 

  • Inselberg, A. (1985), ‘The Plane with Parallel Coordinates’, The Visual Computer 1, 69–91.

    Article  MathSciNet  MATH  Google Scholar 

  • Klinke, S. & Cook, D. (1997), ‘Binning of Kernel-based Projection Pursuit Indices in XGobi’, Computational Statistics & Data Analysis 27(3), 363–369.

    Article  Google Scholar 

  • Koschat, M. A. & Swayne, D. F. (1996), ‘Interactive Graphical Methods in the Analysis of Customer Panel Data (with Discussion)’, Journal of Business and Economic Statistics 14(1), 113–132.

    Google Scholar 

  • Kruskal, J. B. (1969), Toward a Practical Method which Helps Uncover the Structure of a Set of Observations by Finding the Line Transformation which Optimizes a New “Index of Condensation”, in R. C. Milton & J. A. Neider, eds, ‘Statistical Computation’, Academic Press, New York, NY, pp. 427–440.

    Chapter  Google Scholar 

  • Olbricht, W. (1982), ‘Modern Statistical Analysis of Ancient Sand’. MSc Thesis, University of Sheffield, Sheffield, UK.

    Google Scholar 

  • Swayne, D. F., Cook, D. & Buja, A. (1998), ‘XGobi: Interactive Dynamic Graphics in the X Window System’, Journal of Computational and Graphical Statistics 7(1), 113–130.

    Google Scholar 

  • Symanzik, J., Majure, J. J. & Cook, D. (1996), ‘Dynamic Graphics in a GIS: A Bidirectional Link between ArcView 2.0 and XGobi’, Computing Science and Statistics 27, 299–303.

    Google Scholar 

  • Theus, M., Hofmann, H. & Wilhelm, A. (1998), ‘Selection Sequences — Interactive Analysis of Massive Data Sets’, Computing Science and Statistics 29(1), 439–444.

    Google Scholar 

  • Theus, M. & Wilhelm, A. F. X. (1998), Counts, Proportions, Interactions — A View on Categorical Data, in ‘ASA Proceedings of the Section on Statistical Graphics’, American Statistical Association, Alexandria, VA. Forthcoming.

    Google Scholar 

  • Timmins, D. A. Y. (1981), ‘Study of Sediment in Mesolithic Middens on Oronsay’. MA Thesis, University of Sheffield, Sheffield, UK.

    Google Scholar 

  • Unwin, A. R., Hawkins, G., Hofmann, H. & Siegl, B. (1996), ‘MANET — Missings Are Now Equally Treated’, Journal of Computational and Graphical Statistics 5, 113–122.

    Google Scholar 

  • Wegman, E. J. (1990), ‘Hyperdimensional Data Analysis Using Parallel Coordinates’, Journal of the American Statistical Association 85, 664–675.

    Article  Google Scholar 

  • Wegman, E. J. (1991), ‘The Grand Tour in k-Dimensions’, Computing Science and Statistics 22, 127–136.

    Google Scholar 

  • Wegman, E. J. (1995), ‘Huge Data Sets and the Frontiers of Computational Feasibility’, Journal of Computational and Graphical Statistics 4(4), 281–295.

    Google Scholar 

  • Wegman, E. J. & Carr, D. B. (1993), Statistical Graphics and Visualization, in C. R. Rao, ed., ‘Handbook of Statistics, Vol. 9’, Elsevier Science Publishers, Amsterdam, pp. 857–958.

    Google Scholar 

  • Wegman, E. J. & Luo, Q. (1997a), ‘High Dimensional Clustering using Parallel Coordinates and the Grand Tour’, Computing Science and Statistics 28, 361–368.

    Google Scholar 

  • Wegman, E. J. & Luo, Q. (1997b), High Dimensional Clustering using Parallel Coordinates and the Grand Tour, in R. Klar & O. Opitz, eds, ‘Classification and Knowledge Organization’, Springer, pp. 93–101.

  • Wilhelm, A., Unwin, A. R. & Theus, M. (1996), Software for Interactive Statistical Graphics — A Review, in F. Faulbaum & W. Bandilla, eds, ‘SoftStat’ 95 — Advances in Statistical Software 5’, Lucius & Lucius, Stuttgart, pp. 3–12.

    MATH  Google Scholar 

Download references

Acknowledgments

We would like to thank Nick Fieller for providing us with the Oronsay particle size data set and additional background information. Thanks are also due to Walter Olbricht for his additional comments and to Qiang Luo who assisted with the preparation of data and the analysis. We are also grateful to Debby Swayne and the anonymous referees for their helpful comments and suggestions. The work of all three authors was supported in part by the NSF with a Group Infrastructure Grant DMS-9631351. In addition, the work of Edward Wegman was supported in part by the Army Research Office under grant DAAH04-94-G-0267. This work was initiated when Adalbert Wilhelm was visiting the Center for Computational Statistics as a Habilitanden-Stipendiat of the Deutsche Forschungsgemeinschaft under contract Wi 1584/1-1.

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

About this article

Cite this article

Wilhelm, A.F.X., Wegman, E.J. & Symanzik, J. Visual clustering and classification: The Oronsay particle size data set revisited. Computational Statistics 14, 109–146 (1999). https://doi.org/10.1007/PL00022701

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/PL00022701

Keywords

Navigation