Introduction

Data Science

Abstract

According to the international technology standard [1], data are a "formalized representation of information suitable for communication, interpretation, or processing". The Duden dictionary [2] offers a further characterization: data are "[numerical] values (obtained through observations, measurements, statistical surveys, etc.), information (based on observations, measurements, statistical surveys, etc.), formulable findings".

References

  1. ISO Central Secretary. Information technology – Vocabulary. Standard ISO/IEC 2382:2015. Geneva, Switzerland: International Organization for Standardization, 2015, p. 2121272.

  2. Dudenredaktion. "Daten" on Duden online. url: https://www.duden.de/node/30506/revision/30535.

  3. Clifford M. Will. Theory and Experiment in Gravitational Physics. Cambridge University Press, Sep. 2018. doi: https://doi.org/10.1017/9781316338612.

  4. Peter Walde et al. "Erstellung von Technologie- und Wettbewerbsanalysen mithilfe von Big Data". In: Wirtschaftsinformatik & Management 5.2 (Feb. 2013), pp. 12–23. doi: https://doi.org/10.1365/s35764-013-0274-7.

  5. infas Institut für angewandte Sozialwissenschaft GmbH. Mobilität in Deutschland – MiD. 2017. url: http://www.mobilitaet-in-deutschland.de/publikationen2017.html.

  6. Deutscher Wetterdienst – Zentraler Vertrieb Klima und Umwelt. Klimadaten Deutschland. Accessed Apr. 1, 2020. Offenbach. url: https://www.dwd.de/DE/leistungen/klimadatendeutschland/klimadatendeutschland.html.

  7. Jiawei Han, Micheline Kamber, and Jian Pei. Data Mining: Concepts and Techniques. 3rd ed. Elsevier, 2012. isbn: 9-380-93191-3.

  8. Dudenredaktion. "Data-Mining" on Duden online. url: https://www.duden.de/node/30498/revision/30527.

  9. R Core Team. R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing. 2020. url: https://www.R-project.org/.

  10. Guido van Rossum and Fred L. Drake. Python 3 Reference Manual. Scotts Valley, USA: CreateSpace, 2009. isbn: 1441412697.

  11. Hadley Wickham. ggplot2. Elegant Graphics for Data Analysis. Springer, New York, 2009. doi: https://doi.org/10.1007/978-0-387-98141-3.

  12. Till Tantau. The TikZ and PGF Packages. Manual for version 3.1.7. Nov. 2020. url: https://pgf-tikz.github.io/pgf/pgfmanual.pdf.

  13. Simon Urbanek and Jeffrey Horner. Cairo: R Graphics Device using Cairo Graphics Library for Creating High-Quality Bitmap (PNG, JPEG, TIFF), Vector (PDF, SVG, PostScript) and Display (X11 and Win32) Output. R package, version 1.5-12.2. 2020. url: https://CRAN.R-project.org/package=Cairo.

  14. Claus O. Wilke. cowplot: Streamlined Plot Theme and Plot Annotations for ’ggplot2’. R package, version 1.1.0. 2020. url: https://CRAN.R-project.org/package=cowplot.

  15. Hadley Wickham et al. dplyr: A Grammar of Data Manipulation. R package, version 1.0.2. 2020. url: https://CRAN.R-project.org/package=dplyr.

  16. Winston Chang. extrafont: Tools for using fonts. R package, version 0.17. 2014. url: https://CRAN.R-project.org/package=extrafont.

  17. Daniel Müllner. "fastcluster: Fast Hierarchical, Agglomerative Clustering Routines for R and Python". In: Journal of Statistical Software 53.9 (2013), pp. 1–18. url: http://www.jstatsoft.org/v53/i09/.

  18. Alina Beygelzimer et al. FNN: Fast Nearest Neighbor Search Algorithms and Applications. R package, version 1.1.3. 2019. url: https://CRAN.R-project.org/package=FNN.

  19. Hadley Wickham. forcats: Tools for Working with Categorical Variables (Factors). R package, version 0.5.0. 2020. url: https://CRAN.R-project.org/package=forcats.

  20. Andrie de Vries and Brian D. Ripley. ggdendro: Create Dendrograms and Tree Diagrams Using ’ggplot2’. R package, version 0.1.22. 2020. url: https://CRAN.R-project.org/package=ggdendro.

  21. Thomas Lin Pedersen. ggforce: Accelerating ’ggplot2’. R package, version 0.3.2. 2020. url: https://CRAN.R-project.org/package=ggforce.

  22. Kamil Slowikowski. ggrepel: Automatically Position Non-Overlapping Text Labels with ’ggplot2’. R package, version 0.8.2. 2020. url: https://CRAN.R-project.org/package=ggrepel.

  23. Herve Cardot. Gmedian: Geometric Median, k-Median Clustering and Robust Median PCA. R package, version 1.2.5. 2020. url: https://CRAN.R-project.org/package=Gmedian.

  24. Baptiste Auguie. gridExtra: Miscellaneous Functions for ’Grid’ Graphics. R package, version 2.3. 2017. url: https://CRAN.R-project.org/package=gridExtra.

  25. Gabor Csardi and Tamas Nepusz. "The igraph software package for complex network research". In: InterJournal Complex Systems (2006), p. 1695. url: https://igraph.org.

  26. Stefano Meschiari. latex2exp: Use LaTeX Expressions in Plots. R package, version 0.4.0. 2015. url: https://CRAN.R-project.org/package=latex2exp.

  27. Garrett Grolemund and Hadley Wickham. "Dates and Times Made Easy with lubridate". In: Journal of Statistical Software 40.3 (2011), pp. 1–25. url: https://www.jstatsoft.org/v40/i03/.

  28. Jeroen Ooms. magick: Advanced Graphics and Image-Processing in R. R package, version 2.5.2. 2020. url: https://CRAN.R-project.org/package=magick.

  29. Doug McIlroy et al. mapproj: Map Projections. R package, version 1.2.7. 2020. url: https://CRAN.R-project.org/package=mapproj.

  30. W. N. Venables and B. D. Ripley. Modern Applied Statistics with S. 4th ed. ISBN 0-387-95457-0. Springer, New York, 2002. doi: https://doi.org/10.1007/978-0-387-21706-2.

  31. Friedrich Leisch and Evgenia Dimitriadou. mlbench: Machine Learning Benchmark Problems. R package, version 2.1-1. 2010. url: https://CRAN.R-project.org/package=mlbench.

  32. Alan Genz et al. mvtnorm: Multivariate Normal and t Distributions. R package, version 1.1-1. 2020. url: https://CRAN.R-project.org/package=mvtnorm.

  33. Alan Genz and Frank Bretz. Computation of Multivariate Normal and t Probabilities. Lecture Notes in Statistics. Springer, Berlin, Heidelberg, 2009. isbn: 978-3-642-01688-2.

  34. Stefan Fritsch, Frauke Günther, and Marvin N. Wright. neuralnet: Training of Neural Networks. R package, version 1.44.2. 2019. url: https://CRAN.R-project.org/package=neuralnet.

  35. David Meyer and Christian Buchta. proxy: Distance and Similarity Measures. R package, version 0.4-24. 2020. url: https://CRAN.R-project.org/package=proxy.

  36. Damian W. Betebenner. randomNames: Function for Generating Random Names and a Dataset. R package, version 1.4-0.0. 2019. url: https://cran.r-project.org/package=randomNames.

  37. Hadley Wickham. "Reshaping Data with the reshape Package". In: Journal of Statistical Software 21.12 (2007), pp. 1–20. url: http://www.jstatsoft.org/v21/i12/.

  38. Hadley Wickham and Dana Seidel. scales: Scale Functions for Visualization. R package, version 1.1.1. 2020. url: https://CRAN.R-project.org/package=scales.

  39. Carter T. Butts. sna: Tools for Social Network Analysis. R package, version 2.6. 2020. url: https://CRAN.R-project.org/package=sna.

  40. Edzer J. Pebesma and Roger S. Bivand. "Classes and methods for spatial data in R". In: R News 5.2 (Nov. 2005), pp. 9–13. url: https://CRAN.R-project.org/doc/Rnews/.

  41. Roger S. Bivand, Edzer Pebesma, and Virgilio Gomez-Rubio. Applied spatial data analysis with R. 2nd ed. Springer, New York, 2013. url: https://asdar-book.org/.

  42. Mark P. J. van der Loo. "The stringdist package for approximate string matching". In: The R Journal 6.1 (2014), pp. 111–122. url: https://CRAN.R-project.org/package=stringdist.

  43. Hadley Wickham. stringr: Simple, Consistent Wrappers for Common String Operations. R package, version 1.4.0. 2019. url: https://CRAN.R-project.org/package=stringr.

  44. Hadley Wickham et al. "Welcome to the tidyverse". In: Journal of Open Source Software 4.43 (2019), p. 1686. doi: https://doi.org/10.21105/joss.01686.

  45. Julia Silge and David Robinson. "tidytext: Text Mining and Analysis Using Tidy Data Principles in R". In: JOSS 1.3 (2016). doi: https://doi.org/10.21105/joss.00037.

  46. Justin Donaldson. tsne: t-Distributed Stochastic Neighbor Embedding for R (t-SNE). R package, version 0.1-3. 2016. url: https://CRAN.R-project.org/package=tsne.

  47. Kyle Bittinger. usedist: Distance Matrix Utilities. R package, version 0.4.0. 2020. url: https://CRAN.R-project.org/package=usedist.

Author information

Correspondence to Matthias Plaue.

Copyright information

© 2021 The Author(s), under exclusive license to Springer-Verlag GmbH, DE, part of Springer Nature

About this chapter

Cite this chapter

Plaue, M. (2021). Einführung. In: Data Science. Springer Spektrum, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-63489-9_1

  • DOI: https://doi.org/10.1007/978-3-662-63489-9_1

  • Publisher Name: Springer Spektrum, Berlin, Heidelberg

  • Print ISBN: 978-3-662-63488-2

  • Online ISBN: 978-3-662-63489-9

  • eBook Packages: Computer Science and Engineering (German Language)