An Insight on Big Data Analytics

Sparks, Ross; Ickowicz, Adrien; Lenz, Hans J.

doi:10.1007/978-3-319-26989-4_2

Ross Sparks⁴,
Adrien Ickowicz⁴ &
Hans J. Lenz⁵

Part of the book series: Studies in Big Data ((SBD,volume 16))

4154 Accesses
4 Citations

Abstract

This paper discusses the opportunities big data offers decision makers from a statistical perspective. It calls for a multidisciplinary approach by computer scientists, statisticians and domain experts to providing useful big data solutions. Big data calls for us to think in new ways and communicate effectively within such teams. We make a plea for linking data-driven and model-driven analytics, and stress the role of cause-effect models for knowledge enhancement in big data analytics. We remember Kant’s statement that theory without data is blind, but facts without theories are meaningless. A case is made for each discipline to define the contribution they offer to big data solutions so that effective teams can be formed to improve inductions. Although new approaches are needed much of the past learning related to small data are valuable in providing big data solutions. Here we have in mind the long-term academic training and field experience of statisticians concerning reduction of dataset volumes, sampling in a more general setting, data depreciation and quality, model design and validation, visualisation, etc. We expect that combining the present approaches will give incentives for increasing the chances for “real big solutions”.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 149.00; Price excludes VAT (USA)

Softcover Book: USD 199.99; Price excludes VAT (USA)

Hardcover Book: USD 199.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Bolt, S., Sparks, R.: Detecting and diagnosing hotspots for the enhanced management of hospital emergency departments in Queensland, Australia. Med. Inform. Dec. Making 13, 134 (2013)
Google Scholar
Breiman, L.: Statistical modeling: the two cultures (with comments and a rejoinder by the author). Statis. Sci. 16(3), 199–231 (2001)
Google Scholar
Deming, W.E.: The new economics: for industry, government, education, 2nd edn. The MIT Press, Cambridge (2000)
Google Scholar
Friedman, J.H., Stuetzle, W.: Projection pursuit regression. J. Am. Statis. Assoc. 76, 817–823 (1981)
Google Scholar
Friedman, J.H.: Fast sparse regression and classification. Int. J. Forecast. 28, 722–738 (2012)
Google Scholar
Harford, T.: Big data: are we making a big mistake? Significance 11(5), 14–19 (2014)
Google Scholar
Lahiri, P., Larsen, M.: Regression analysis with linked data. J. Am. Statis. Assoc. 100, 222–230 (2005)
Google Scholar
Megahed, F.M., Jones-Farmer, L.A.: A statistical process monitoring perspective on big data. In: XIth International Workshop on Intelligent Statistical Quality Control, CSIRO, Sydney (2013)
Google Scholar
Popper, K.: Science as falsification. Conject. Refutat. Readings in the Philosophy of Science, 33–39 (1963)
Google Scholar
Savage, L.J.: The Foundations of Statistics, Dover edn, 352pp (1972)
Google Scholar
Sparks, R.S., Okugami, C.: Data quality: algorithms for automatic detection of unusual measurements. Front. Statis. Proc. Control 10, 385–400 (2012)
Google Scholar
Tibshirani, R.: Regression shrinkage and selection via the lasso. J. Royal Statis. Soc. Series B (Methodological), 24, 267–288 (1996)
Google Scholar
West, M., Harrison, P.J.: Bayesian Forecasting and Dynamic Models. Springer, New York (1997)
Google Scholar
Williams, C., Rasmussen, C.: Gaussian processes for regression (1996). http://eprints.aston.ac.uk/651/1/getPDF.pdf

Download references

Author information

Authors and Affiliations

CSIRO Computational Informatics, North Ryde, NSW, 1670, Australia
Ross Sparks & Adrien Ickowicz
Institut Für Statistik und Ökonometrie, Inst. Für Wirtschaftsinformatik, Freie Universität Berlin, Boltzmannstr.20, K30 D-14195, Berlin, Germany
Hans J. Lenz

Authors

Ross Sparks
View author publications
You can also search for this author in PubMed Google Scholar
Adrien Ickowicz
View author publications
You can also search for this author in PubMed Google Scholar
Hans J. Lenz
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Hans J. Lenz .

Editor information

Editors and Affiliations

University of Ottawa, Ottawa, Ontario, Canada
Nathalie Japkowicz
Institute of Computing Sciences, Poznań University of Technology, Poznań, Poland
Jerzy Stefanowski

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Sparks, R., Ickowicz, A., Lenz, H.J. (2016). An Insight on Big Data Analytics. In: Japkowicz, N., Stefanowski, J. (eds) Big Data Analysis: New Algorithms for a New Society. Studies in Big Data, vol 16. Springer, Cham. https://doi.org/10.1007/978-3-319-26989-4_2

Download citation

DOI: https://doi.org/10.1007/978-3-319-26989-4_2
Published: 17 December 2015
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-26987-0
Online ISBN: 978-3-319-26989-4
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics