Foundations of Big Data, Machine Learning, and Artificial Intelligence and Explainable Artificial Intelligence

Digitalization of Power Markets and Systems Using Energy Informatics

Abstract

This chapter provides a compact overview of the basic terminology and definitions of Big Data (BD), machine learning (ML), artificial intelligence (AI), and explainable AI (XAI). BD focuses on collecting, cleansing, storing, analyzing, extracting information from, and interpreting large datasets in science and business, in fields ranging from healthcare, media, and energy systems to defense. AI and ML-based algorithms are widely used in research to perform tasks that typically require human intelligence, such as driving cars, translating speech, and recognizing images. Machine learning algorithms are generally split into four main categories: Supervised Learning, Unsupervised Learning, Reinforcement Learning, and Ensemble Learning. Traditional AI, which functions as a black-box model, typically gives decision-makers and domain experts no guidance as to why a particular decision was made. This lack of transparency often leads them to question the results and ultimately reject them, because they cannot explain the approach to stakeholders, politicians, and others who want to know the reasoning behind a decision. XAI provides a more transparent and explanatory approach, making a decision more acceptable to the people who must justify it to others.
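The contrast between a black-box prediction and an explained one can be illustrated with a minimal sketch. It is not taken from the chapter: the model, its weights, the feature names, and the data are all hypothetical, and the technique shown is a simple perturbation-based attribution (replace one feature at a time with a baseline value and record how the prediction changes), not an implementation of any particular XAI library.

```python
from statistics import mean


def black_box_predict(row):
    """Hypothetical opaque model: callers see only inputs and an output."""
    temperature, hour, wind = row
    return 3.0 * temperature + 0.5 * hour - 0.1 * wind


def perturbation_attribution(predict, row, baseline):
    """Return, per feature, the change in prediction when that feature
    alone is replaced by its baseline value."""
    reference = predict(row)
    scores = []
    for i in range(len(row)):
        perturbed = list(row)
        perturbed[i] = baseline[i]
        scores.append(reference - predict(perturbed))
    return scores


# Hypothetical observations: (temperature, hour of day, wind speed)
feature_names = ["temperature", "hour", "wind"]
data = [(20.0, 8.0, 5.0), (25.0, 12.0, 10.0), (30.0, 18.0, 15.0)]

# Assumption: the per-feature mean serves as the "neutral" baseline.
baseline = [mean(column) for column in zip(*data)]

scores = perturbation_attribution(black_box_predict, data[2], baseline)
explanation = dict(zip(feature_names, scores))

for name, score in explanation.items():
    print(f"{name}: {score:+.2f}")
```

The black box alone returns a single number for the last observation. The attribution adds a per-feature account of that number, which is the kind of information a decision-maker could pass on to stakeholders. The choice of baseline is an assumption and changes the scores.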

Copyright information

© 2021 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this chapter

Cite this chapter

Cali, U., Kuzlu, M., Pipattanasomporn, M., Kempf, J., Bai, L. (2021). Foundations of Big Data, Machine Learning, and Artificial Intelligence and Explainable Artificial Intelligence. In: Digitalization of Power Markets and Systems Using Energy Informatics. Springer, Cham. https://doi.org/10.1007/978-3-030-83301-5_6

  • DOI: https://doi.org/10.1007/978-3-030-83301-5_6

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-83300-8

  • Online ISBN: 978-3-030-83301-5

  • eBook Packages: Energy, Energy (R0)
