Abstract
The quality of water has a direct influence on both human health and the environment. Water is utilized for a variety of purposes, including drinking, agriculture, and industrial use. The water quality index (WQI) is a critical indication for proper water management. The purpose of this work was to use machine learning techniques such as RF, NN, MLR, SVM, and BTM to categorize a dataset of water quality in various places across India. Water quality is dictated by features such as dissolved oxygen (DO), total coliform (TC), biological oxygen demand (BOD), Nitrate, pH, and electric conductivity (EC). These features are handled in five steps: data pre-processing using min-max normalization and missing data management using RF, feature correlation, applied machine learning classification, and model’s feature importance. The highest accuracy Kappa, Accuracy Lower, and Accuracy Upper findings in this research are 99.83, 99.17, 99.07, and 99.99, respectively. The finding showed that Nitrate, PH, conductivity, DO, TC, and BOD are the key qualities that contribute to the orderly classification of water quality, with Variable Importance values of 74.78, 36.805, 81.494, 105.770, 105.166, and 130.173, respectively.
Article PDF
Avoid common mistakes on your manuscript.
References
H.A.N. Silva, A. Rosato, R. Altilio, M. Panella, Water quality prediction based on wavelet neural networks and remote sensing, 2018 International Joint Conference on Neural Networks (IJCNN), IEEE, Rio de Janeiro, Brazil, 2018, pp. 1–6.
S. Chatterjee, S. Sarkar, N. Dey, S. Sen, T. Goto, N.C. Debnath, Water quality prediction: multi objective genetic algorithm coupled artificial neural network based approach, 2017 IEEE 15th International Conference on Industrial Informatics (INDIN), IEEE, Emden, Germany, 2017, pp. 963–968.
T.H. Aldhyani, M. Al-Yaari, H. Alkahtani, M. Maashi, Water quality prediction using artificial intelligence algorithms, Applied Bionics and Biomechanics 2020 (2020), 6659314.
U. Ahmed, R. Mumtaz, H. Anwar, A.A. Shah, R. Irfan, J. García-Nieto, Efficient water quality prediction using supervised machine learning, Water 11 (2019), 2210.
P. Liu, J. Wang, A.K. Sangaiah, Y. Xie, X. Yin, Analysis and prediction of water quality using LSTM deep neural networks in IoT environment, Sustainability 11 (2019), 2058.
L. Wang, Z. Zhu, L. Sassoubre, G. Yu, C. Liao, Q. Hu, et al., Improving the robustness of beach water quality modeling using an ensemble machine learning approach, Science of The Total Environment 765 (2021), 142760.
C.V. Sillberg, P. Kullavanijaya, O. Chavalparit, Water quality classification by integration of attribute-realization and support vector machine for the chao phraya river, Journal of Ecological Engineering 22 (2021), 70–86.
M. Yilma, Z. Kiflie, A. Windsperger, N. Gessese, Application of artificial neural network in water quality index prediction: a case study in little Akaki River, Addis Ababa, Ethiopia, Modeling Earth Systems and Environment 4 (2018), 175–187.
D.T. Bui, K. Khosravi, J. Tiefenbacher, H. Nguyen, N. Kazakis, Improving prediction of water quality indices using novel hybrid machine-learning algorithms, Science of The Total Environment 721 (2020), 137612.
Y.R. Ding, Y.J. Cai, P.D. Sun, B. Chen, The use of combined neural networks and genetic algorithms for prediction of river water quality, Journal of Applied Research and Technology 12 (2014), 493–499.
A. Azad, H. Karami, S. Farzin, A. Saeedian, H. Kashi, F. Sayyahi, Prediction of water quality parameters using ANFIS optimized by intelligence algorithms (case study: Gorganrood river), KSCE Journal of Civil Engineering 22 (2018), 2206–2213.
Y. Zhang, X. Gao, K. Smith, G. Inial, S. Liu, L.B. Conil, et al., Integrating water quality and operation into prediction of water production in drinking water treatment plants by genetic algorithm enhanced artificial neural network, Water research 164 (2019), 114888.
V. Geethanjali, N. Anbarivan, Indian water pollution monitoring and forecasting for anomaly with fail-safe wireless sensor networks using machine learning techniques, International Research Journal of Engineering and Technology (IRJET) 6 (2019), 771–781.
F. Tang, H. Ishwaran, Random forest missing data algorithms, Statistical Analysis and Data Mining: The ASA Data Science Journal 10 (2017), 363–377.
S. Jain, S. Shukla, R. Wadhvani, Dynamic selection of normalization techniques using data complexity measures, Expert Systems with Applications 106 (2018), 252–262.
M.M. Hassan, Z.J. Peya, S. Mollick, M.A. Billah, M.M. Hasan Shakil, A.U. Dulla, Diabetes prediction in healthcare at early stage using machine learning approach, 2021 12th International Conference on Computing Communication and Networking Technologies (ICCCNT), IEEE, Kharagpur, India, 2021, pp. 1–5.
S.V.S. Ganga Devi, Random forest advice for water quality prediction in the regions of Kadapa district, International Journal of Innovative Technology and Exploring Engineering 8 (2019), 1464–1466.
S. Bedi, A. Samal, C. Ray, D. Snow, Comparative evaluation of machine learning models for groundwater quality assessment, Environmental Monitoring and Assessment 192 (2020), 1–23.
M.R. Islam, S. Liu, X. Wang, G. Xu, Deep learning for misinformation detection on online social networks: a survey and new perspectives, Social Network Analysis and Mining 10 (2020), 1–20.
A.M. El-Habil, An application on multinomial logistic regression model, Pakistan Journal of Statistics and Operation Research 8 (2012), 271–291.
A. Mosavi, F.S. Hosseini, B. Choubin, M. Goodarzi, A.A. Dineva, E.R. Sardooi, Ensemble boosting and bagging based machine learning models for groundwater potential prediction, Water Resources Management 35 (2021), 23–37.
M.M. Hassan, M.A. Mamun Billah, M.M. Rahman, S. Zaman, M.M. Hasan Shakil, J.H. Angon, Early predictive analytics in healthcare for diabetes prediction using machine learning approach, 2021 12th International Conference on Computing Communication and Networking Technologies (ICCCNT), IEEE, Kharagpur, India, 2021, pp. 1–5.
G. Tan, J. Yan, C. Gao, S. Yang, Prediction of water quality time series data based on least squares support vector machine, Procedia Engineering 31 (2012), 1194–1199.
W.C. Leong, A. Bahadori, J. Zhang, Z. Ahmad, Prediction of water quality index (WQI) using support vector machine (SVM) and least square-support vector machine (LS-SVM), International Journal of River Basin Management 19 (2021), 149–156.
Author information
Authors and Affiliations
Corresponding author
Additional information
Peer review under responsibility of KEO (Henan) Education Technology Co. Ltd
Rights and permissions
This is an open access article distributed under the CC BY-NC 4.0 license (https://doi.org/creativecommons.org/licenses/by-nc/4.0/).
About this article
Cite this article
Hassan, M.M., Hassan, M.M., Akter, L. et al. Efficient Prediction of Water Quality Index (WQI) Using Machine Learning Algorithms. Hum-Cent Intell Syst 1, 86–97 (2021). https://doi.org/10.2991/hcis.k.211203.001
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.2991/hcis.k.211203.001