Foundations of Big Data, Machine Learning, and Artificial Intelligence and Explainable Artificial Intelligence

Cali, Umit; Kuzlu, Murat; Pipattanasomporn, Manisa; Kempf, James; Bai, Linquan

doi:10.1007/978-3-030-83301-5_6



Abstract

This chapter provides a brief overview of the basic terminology and definitions in Big Data (BD), machine learning (ML), artificial intelligence (AI), and explainable AI (XAI). BD focuses on collecting, cleansing, storing, analyzing, extracting information from, and interpreting large datasets in science and business, in fields ranging from healthcare, media, and energy systems to defense. AI and ML-based algorithms are widely used in research to perform tasks that typically require human intelligence, such as driving cars, translating speech, and recognizing images. Machine learning algorithms are generally split into four main categories: supervised learning, unsupervised learning, reinforcement learning, and ensemble learning. Traditional AI, which functions as a black-box model, typically does not give decision-makers and domain experts any guidance as to why a particular decision was made. This lack of explanation often leads decision-makers and domain experts to question the results and ultimately reject them, because they cannot explain the approach to stakeholders, politicians, and others who want to know why a particular decision was made. XAI provides a more transparent and explanatory approach, making decisions more acceptable to people who must explain them to others.
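The contrast between a black-box model and an explanation of its behavior can be illustrated with a small sketch. The example below uses permutation feature importance, one common model-agnostic technique; the abstract does not name it, so it is offered only as an illustration of the general idea and not as the chapter's method. The model `black_box`, the feature names (`temperature`, `hour`, `meter_id`), and the synthetic data are all hypothetical.

```python
import random


def black_box(row):
    # Hypothetical stand-in for an opaque model: a caller sees only
    # inputs and outputs, not this formula.
    temperature, hour, meter_id = row
    return 2.0 * temperature + 0.5 * hour  # meter_id is ignored


def mean_abs_error(model, rows, targets):
    return sum(abs(model(r) - t) for r, t in zip(rows, targets)) / len(rows)


def permutation_importance(model, rows, targets, n_features, seed=0):
    # Shuffle one feature column at a time and record how much the
    # error grows; a larger increase means the model relies on that
    # feature more heavily.
    rng = random.Random(seed)
    baseline = mean_abs_error(model, rows, targets)
    scores = []
    for j in range(n_features):
        column = [r[j] for r in rows]
        rng.shuffle(column)
        shuffled = [r[:j] + (v,) + r[j + 1:] for r, v in zip(rows, column)]
        scores.append(mean_abs_error(model, shuffled, targets) - baseline)
    return scores


# Synthetic data (an assumption for illustration only).
data_rng = random.Random(42)
rows = [
    (data_rng.uniform(0, 30), data_rng.randrange(24), data_rng.randrange(100))
    for _ in range(200)
]
targets = [black_box(r) for r in rows]

importances = permutation_importance(black_box, rows, targets, n_features=3)
for name, score in zip(["temperature", "hour", "meter_id"], importances):
    print(f"{name}: {score:.3f}")
```

The scores give a decision-maker a ranked answer to "which inputs drove the output" without opening the model itself: the ignored `meter_id` feature scores zero, while `temperature` dominates.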



Author information

Authors and Affiliations

Elektro E/F, E427, Gløshaugen, Norwegian University of Science and Technology, Trondheim, Norway
Umit Cali
Electrical Engineering Technology, Old Dominion University, Norfolk, VA, USA
Murat Kuzlu
Smart Grid Research Unit (SGRU), Chulalongkorn University, Bangkok, Thailand
Manisa Pipattanasomporn
Kempf and Associates, Mountain View, CA, USA
James Kempf
Systems Engineering & Engineering Management, University of North Carolina at Charlotte, Charlotte, NC, USA
Linquan Bai



About this chapter

Cite this chapter

Cali, U., Kuzlu, M., Pipattanasomporn, M., Kempf, J., Bai, L. (2021). Foundations of Big Data, Machine Learning, and Artificial Intelligence and Explainable Artificial Intelligence. In: Digitalization of Power Markets and Systems Using Energy Informatics. Springer, Cham. https://doi.org/10.1007/978-3-030-83301-5_6


DOI: https://doi.org/10.1007/978-3-030-83301-5_6
Published: 27 September 2021
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-83300-8
Online ISBN: 978-3-030-83301-5
eBook Packages: Energy, Energy (R0)
