Skip to main content

Stochastic Algorithms for Exploratory Data Analysis: Data Clustering and Data Visualization

  • Chapter
Learning in Graphical Models

Part of the book series: NATO ASI Series ((ASID,volume 89))

Abstract

Iterative, EM-type algorithms for data clustering and data visualization are derived on the basis of the maximum entropy principle. These algorithms allow the data analyst to detect structure in vectorial or relational data. Conceptually, the clustering and visualization procedures are formulated as combinatorial or continuous optimization problems which are solved by stochastic optimization.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 259.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 329.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 329.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  • J. Buhmann and H. Kiihnel. Vector quantization with complexity costs. IEEE Transactions on Information Theory, 39 (4): 1133–1145, July 1993.

    Article  MATH  Google Scholar 

  • J. Buhmann and H. Kiihnel. Vector quantization with complexity costs. IEEE Transactions on Information Theory, 39 (4): 1133–1145, July 1993.

    Article  MATH  Google Scholar 

  • T. F. Cox and M.A.A. Cox. Multidimensional Scaling. Number 59 in Monographs on Statistics and Applied Probability. Chapman & Hall, London, 1994.

    MATH  Google Scholar 

  • Т. М. Cover and J. A. Thomas Elements of Information Theory. John Wiley & Sons, New York, 1991.

    Book  MATH  Google Scholar 

  • R. O. Duda and P. E. Hart. Pattern Classification and Scene Analysis. Wiley, New York, 1973. Sect. 3.2.

    MATH  Google Scholar 

  • A. P. Dempster, N. M. Laird, and D. B. Rubin. Maximum likelihood from incomplete data via the EM algorithm. J. Royal Statist. Soc. Ser. B, 39: 1–38, 1977.

    MathSciNet  MATH  Google Scholar 

  • Thomas Hofmann and Joachim M. Buhmann. Hierarchical pairwise data clustering by mean—field annealing. In Proceedings of ICANN’95, NEURONÎMES’95, volume II, pages 197–202. EC2 & Cie, 1995.

    Google Scholar 

  • Thomas Hofmann and Joachim M. Buhmann. An annealed “neural gas” network for robust vector quantization. In Proceedings of ICANN’96, pages 151–156, Berlin, Heidelberg, New York, 1996. Springer.

    Google Scholar 

  • Thomas Hofmann and Joachim M. Buhmann Infering hierarchical clustering structures by deterministic annealing. In Proceedings of the Knowledge Discovery and Data Mining Conference 1996, Portland, Redwood City, CA, USA, 1996. AAAI Press. (in press).

    Google Scholar 

  • Thomas Hofmann and Joachim M. Buhmann Pairwise data clustering by deterministic annealing. IEEE Transactions on Pattern Analysis and Machine Intelligence, 19 (1): 1–14, 1997.

    Article  Google Scholar 

  • J. Hertz, A. Krogh, and R. G. Palmer. Introduction to the Theory of Neural Computation. Addison Wesley, New York, 1991.

    Google Scholar 

  • Thomas Hofmann. Data Clustering and Beyond: A Deterministic Annealing Framework for Exploratory Data Analysis. PhD thesis, Mathematisch-Naturwissenschaftliche Fakultät, Rheinische Friedrich-Wilhelms-Universität Bonn, D-53117 Bonn, Fed. Rep. Germany, 1997.

    Google Scholar 

  • A. K. Jain and R. C. Dubes. Algorithms for Clustering Data. Prentice Hall, Englewood Cliffs, NJ 07632, 1988.

    Google Scholar 

  • Hansjörg Klock and Joachim M. Buhmann. Multidimensional scaling by deterministic annealing. In M.Pellilo and E.R.Hancock, editors, Proceedings EMMCVPR’97, Lecture Notes In Computer Science. Springer Verlag, 1997.

    Google Scholar 

  • T. Kohonen. Self-organization and Associative Memory. Springer, Berlin, 1984.

    MATH  Google Scholar 

  • S.P. Luttrell. Hierarchical vector quantizations. IEE Proceedings, 136: 405–413, 1989.

    Google Scholar 

  • J. MacQueen. Some methods for classification and analysis of multivariate observations. In Proceedings of the 5th Berkeley Symposium on Mathematical Statistics and Probability, pages 281–297, 1967.

    Google Scholar 

  • G. J. McLachlan and K. E. Basford. Mixture Models. Marcel Dekker, INC, New York, Basel, 1988.

    MATH  Google Scholar 

  • K. Rose, E. Gurewitz, and G. Fox. A deterministic annealing approach to clustering. Pattern Recognition Letters, 11 (11): 589–594, 1990.

    Article  MATH  Google Scholar 

  • H. Ritter, T. Martinetz, and K. Schulten. Neural Computation and Self-organizing Maps. Addison Wesley, New York, 1992.

    MATH  Google Scholar 

  • J. W Sammon Jr. A non-linear mapping for data structure analysis. IEEE Transactions on Computers, 18: 401–409, 1969.

    Article  Google Scholar 

  • Yoshio Takane and Forest W. Young. Nonmetric individul differences multidimensional scaling: An alternating least squares method with optimal scaling features. Psychometrika, 42 (1): 7–67, March 1977.

    Article  MATH  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 1998 Springer Science+Business Media Dordrecht

About this chapter

Cite this chapter

Buhmann, J.M. (1998). Stochastic Algorithms for Exploratory Data Analysis: Data Clustering and Data Visualization. In: Jordan, M.I. (eds) Learning in Graphical Models. NATO ASI Series, vol 89. Springer, Dordrecht. https://doi.org/10.1007/978-94-011-5014-9_14

Download citation

  • DOI: https://doi.org/10.1007/978-94-011-5014-9_14

  • Publisher Name: Springer, Dordrecht

  • Print ISBN: 978-94-010-6104-9

  • Online ISBN: 978-94-011-5014-9

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics