Skip to main content

High Dimensional Big Data and Pattern Analysis: A Tutorial

  • Conference paper
Big Data Analytics (BDA 2013)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 8302))

Included in the following conference series:

Abstract

Sensors and actuators embedded in physical objects being linked through wired/wireless networks known as “internet of things” are churning out huge volumes of data (McKinsey Quarterly report, 2010). This phenomenon has led to the archiving of mammoth amounts of data from scientific simulations in the physical sciences and bioinformatics, to social media and a plethora of other areas. It is predicted that over 30 billion devices with 200 billion intermittent connections will be connected by 2020. The creation and archival of the massive amounts of data spawned a multitude of industries. Data management and up-stream analytics is aided by data compression and dimensionality reduction. This review paper will focus on some foundational methods of dimensionality reduction by examining in extensive detail some of the main algorithms, and points the reader to emerging next generation methods that seek to identify structure within high dimensional data not captured by 2nd order statistics.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Similar content being viewed by others

References

  1. Committee on the Analysis of Massive Data, Frontiers in Massive Data Analysis. National Academies Press (2013)

    Google Scholar 

  2. Shalizi, C.R.: Advanced Data Analysis from an Elementary Point of View (2013), http://www.stat.cmu.edu/~cshalizi

  3. Johnson, R.A., Wichern, D.W.: Applied Multivariate Statistical Analysis, 3rd edn. Prentice Hall, Englewood Cliffs (1992)

    MATH  Google Scholar 

  4. Hotelling, H.: Relations Between Two Sets of Variables. Biometrika 28, 321–377 (1936)

    MATH  Google Scholar 

  5. Mood, A.M., Graybill, F.A., Boes, D.C.: Introduction to the Theory of Statistics, 3rd edn. McGraw-Hill (1974)

    Google Scholar 

  6. Hardle, W.K., Simar, L.: Applied Multivariate Statistical Analysis, 3rd edn. Springer (2011)

    Google Scholar 

  7. Friedman, J.H.: Exploratory Projection Pursuit. Journal of the American Statistical Association 82(397), 249–266 (1987)

    Article  MathSciNet  MATH  Google Scholar 

  8. Hastie, T., Tibshirani, R., Friedman, J.: The Elements of Statistical Learning, Data Mining, Inference, and Prediction. Springer (2001)

    Google Scholar 

  9. Hyvärinen, A., Karhunen, J., Oja, E.: Independent Component Analysis. Wiley, Inter-Science (2001)

    Book  Google Scholar 

  10. Lakshminarayan, C.K., Baron, M.I.: Pattern Recognition in Large-Scale Data Sets: Application in Integrated Circuit Manufacturing. In: Bhatnagar, V. (ed.) BDA 2013. Springer, Heidelberg (2013)

    Google Scholar 

  11. Press, W.H., Flannery, B.P., Teukolsky, S.A., Vetterling, W.T.: Numerical Recipes in C, The Art of Scientific Computing. Cambridge University Press (1990)

    Google Scholar 

  12. Strang, G.: Linear Algebra and its Applications, 4th edn. Brooks/Cole Publishing Company (2005)

    Google Scholar 

  13. Burgess, C.J.C.: Dimension Reduction: A guided Tour. Foundation and Trends in Machine Learning 2(4), 275–365 (2010)

    Article  Google Scholar 

  14. Hardoon, D.R., Szedmak, S., Shawe-Taylor, J.: Canonical correlation analysis; An overview with application to learning methods, technical report, CSD-TR-03-02, Dept. of Computer Science, Royal Holloway, University of London (2003)

    Google Scholar 

  15. Timm, N.H.: Applied Multivariate Analysis. Springer (2002)

    Google Scholar 

  16. Lee, J.A., Verleysen, M.: Nonlinear Dimensionality Reduction. Springer (2007)

    Google Scholar 

  17. Strang, G.: Introduction to Applied Mathematics. Wellesley-Cambridge Press (1986)

    Google Scholar 

  18. Ng, A.: Independent Component Analysis, CS229, Lecture Notes. Stanford University

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2013 Springer International Publishing Switzerland

About this paper

Cite this paper

Lakshminarayan, C.K. (2013). High Dimensional Big Data and Pattern Analysis: A Tutorial. In: Bhatnagar, V., Srinivasa, S. (eds) Big Data Analytics. BDA 2013. Lecture Notes in Computer Science, vol 8302. Springer, Cham. https://doi.org/10.1007/978-3-319-03689-2_5

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-03689-2_5

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-03688-5

  • Online ISBN: 978-3-319-03689-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics