Part of the book series: Undergraduate Topics in Computer Science (UTICS)

Abstract

This introductory chapter does three things. (i) It outlines the goals of data analysis as a tool for enhancing and augmenting knowledge of the domain. Since knowledge is represented by concepts and statements of relation between them, the two main pathways for data analysis are summarization, for developing and augmenting concepts, and correlation, for enhancing and establishing relations. (ii) It presents a set of seven cases involving small datasets and related data analysis problems. The datasets are taken from various fields, such as monitoring market towns, computer security protocols, bioinformatics, and cognitive psychology. (iii) It gives an overview of data visualization, its goals, and some techniques.

Copyright information

© 2011 Springer-Verlag London Limited

About this chapter

Cite this chapter

Mirkin, B. (2011). Introduction: What Is Core. In: Core Concepts in Data Analysis: Summarization, Correlation and Visualization. Undergraduate Topics in Computer Science. Springer, London. https://doi.org/10.1007/978-0-85729-287-2_1

  • DOI: https://doi.org/10.1007/978-0-85729-287-2_1

  • Publisher Name: Springer, London

  • Print ISBN: 978-0-85729-286-5

  • Online ISBN: 978-0-85729-287-2

  • eBook Packages: Computer Science, Computer Science (R0)
