Abstract
This is an introductory chapter in which(i)Goals of data analysis as a tool helping to enhance and augment knowledge of the domain are outlined. Since knowledge is represented by the concepts and statements of relation between them, two main pathways for data analysis are summarization, for developing and augmenting concepts, and correlation, for enhancing and establishing relations. (ii)A set of seven cases involving small datasets and related data analysis problems is presented. The datasets are taken from various fields such as monitoring market towns, computer security protocols, bioinformatics, cognitive psychology. (iii)An overview of data visualization, its goals and some techniques is given.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Abdi, H., Valentin, D., Edelman, B.: Neural Networks, Series: Quantitative Applications in the Social Sciences, 124. Sage Publications, London, ISBN 0-7619-1440-4 (1999)
Berthold, M., Hand D.: Intelligent Data Analysis. Springer, Berlin-Heidelberg (2003)
Betts, M.J., Russell, R.B.: Amino acid properties and consequences of subsitutions. In: Barnes, M.R., Gray, I.C. (eds.) Bioinformatics for Geneticists. Wiley, New York, NY (2003)
Card, S.K., Mackinlay, J.D., Shneiderman B.: Readings in Information Visualization: Using Vision to Think. Morgan Kaufmann Publishers, San Francisco, CA, ISBN 1-55860-533-9 (1999)
Duda, R.O., Hart, P.E., Stork D.G.: Pattern Classification. Wiley-Interscience, New York, NY, ISBN 0-471-05669-3 (2001)
Engelbrecht, A.P.: Computational Intelligence. Wiley, Chichester, ISBN 0-470-84870-7 (2002)
Fisher, R.: The use of multiple measurements in taxonomic problems. Annals Eugen. 7, 179–188 (1936)
Gama, J.: Knowledge Discovery from Data Streams. Boca Raton, Chapman & Hall/CRC (2010)
Hair, J.F., Black, W.C., Babin, B.J., Anderson, R.E.: Multivariate Data Analysis, 7th edn. Prentice Hall, Upper Saddle River, NJ, ISBN-10: 0-13-813263-1 (2010)
Han, J., Kamber, M., J. Pei: Data Mining: Concepts and Techniques, 2nd edn. Morgan Kaufmann Publishers, San Francisco (2006)
Hartigan, J.A.: Clustering Algorithms. Wiley, New York, NY (1975)
Haykin, S. S.: Neural Networks, 2nd edn. Prentice Hall, Upper Saddle River NJ, ISBN 0132733501 (1999)
Henikoff, S., Henikoff, J.G.: Amino acid substitution matrices from protein blocks. Proc. Natl. Acad. Sci.USA 89(22), 10915–10919 (1992)
Kendall, M.G., Stewart, A.: Advanced Statistics: Inference and Relationship, 3rd edn. Griffin, London, ISBN: 0852642156 (1973)
Lebart, L., Morineau, A., Piron, M.: Statistique Exploratoire Multidimensionelle. Dunod, Paris, ISBN 2-10-002886-3 (1995)
Lohninger, H.: Teach Me Data Analysis. Springer, Berlin-New York-Tokyo, ISBN 3-540-14743-8 (1999)
Manning, C.D., Raghavan, P., Schütze, H.: Introduction to Information Retrieval. Cambridge University Press, Cambridge, England (2008)
Mazza, R.: Introduction to Information Visualization. Springer, London, ISBN: 978-1-84800-218-0 (2009)
Mirkin, B.: Mathematical Classification and Clustering. Kluwer Academic Press, Boston-Dordrecht (1996)
Mirkin, B.: Clustering for Data Mining: A Data Recovery Approach. Chapman & Hall/CRC, London, ISBN 1-58488-534-3 (2005)
Mitchell, T.M.: Machine Learning. McGraw Hill, New York, NY (2005)
Mitsa, T.: Temporal Data Mining. Chapman & Hall/CRC, Boca Raton (2010)
Murtagh, F.: Multidimensional Clustering Algorithms. Physica-Verlag, Vienna (1985)
Polyak, B.: Introduction to Optimization. Optimization Software, Los Angeles, CA, ISBN: 0911575146 (1987)
Schölkopf, B., Smola, A.J.: Learning with Kernels. The MIT Press, Cambridge, MA (2005)
Spence, R.: Information Visualization. ACM Press, New York, NY, ISBN 0-201-59626-1 (2001)
Tukey, J.W.: Exploratory Data Analysis. Addison-Wesley, Reading, MA (1977)
Vapnik, V.: Estimation of Dependences Based on Empirical Data, 2d edn. Springer Science + Business Media Inc., New York, NY (2006)
Webb, A.: Statistical Pattern Recognition. Wiley, Chichester (2002)
Weiss, S.M., Indurkhya, N., Zhang, T., Damerau, F.J.: Text Mining: Predictive Methods for Analyzing Unstructured Information. Springer Science+Business Media, New York, NY, ISBN 0-387-95433-3 (2005)
Zhang, Z., Zhang, R.: Multimedia Data Mining. Chapman & Hall/CRC, Boka Raton (2009)
Author information
Authors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag London Limited
About this chapter
Cite this chapter
Mirkin, B. (2011). Introduction: What Is Core. In: Core Concepts in Data Analysis: Summarization, Correlation and Visualization. Undergraduate Topics in Computer Science. Springer, London. https://doi.org/10.1007/978-0-85729-287-2_1
Download citation
DOI: https://doi.org/10.1007/978-0-85729-287-2_1
Published:
Publisher Name: Springer, London
Print ISBN: 978-0-85729-286-5
Online ISBN: 978-0-85729-287-2
eBook Packages: Computer ScienceComputer Science (R0)