Abstract
Algal blooms caused by climate change and human activities have received considerable attention in recent years. Since chlorophyll a (Chl-a) can be used as an indicator of phytoplankton biomass, it has been selected as a direct indicator for monitoring and early warning of algal blooms. With the development of artificial intelligence, data-driven approaches with small sample data and high accuracy prediction have been gradually applied to water quality prediction. This study aimed at using environment factors (water quality and meteorological data) to assist the prediction of Chl-a concentration based on the optimization support vector machine (SVM) model. The most relevant environment factors were extracted from the commonly used environment factors according to the method of cosine similarity. The traditional particle swarm optimization (PSO) algorithm was adopted to optimize the ANN and SVM models, respectively. Then, the better prediction model PSO-SVM can be obtained according to the results of three scientific evaluation indicators. The latest optimization algorithm of grey wolf optimizer (GWO) was also proposed to optimize the SVM to realize high-accuracy Chl-a concentration predication. The GWO-SVM model achieved higher accuracy than the other models both in training and validation processes. Therefore, the dimension of the input vector could be reduced with using the cosine similarity method, and the prediction of Chl-a concentration in high accuracy and the early warning of algal blooms in the study area of this paper could also achieved.
Similar content being viewed by others
Availability of data and materials
The datasets used and/or analyzed during the current study are available from the corresponding author on reasonable request.
Code availability
The code used during the current study is available from the corresponding author on reasonable request.
References
Antanasijević, D., Pocajt, V., Perić-Grujić, A., & Ristić, M. (2014). Modelling of dissolved oxygen in the danube river using artificial neural networks and Monte carlo simulation uncertainty analysis. Journal of Hydrology, 519, 1895–1907. https://doi.org/10.1016/j.jhydrol.2014.10.009
Bendu, H., Deepak, B. B. V. L., & Murugan, S. (2016). Application of GRNN for the prediction of performance and exhaust emissions in HCCI engine using ethanol. Energy Conversion and Management, 122, 165–173. https://doi.org/10.1016/j.enconman.2016.05.061
Boyer, J. N., Kelble, C. R., Ortner, P. B., & Rudnick, D. T. (2009). Phytoplankton bloom status: Chlorophyll a biomass as an indicator of water quality condition in the southern estuaries of Florida, USA. Ecological Indicators, 9, 56–67. https://doi.org/10.1016/j.ecolind.2008.11.013
Burges, C. J. C. (1998). A tutorial on support vector machines for pattern recognition. Data Mining and Knowledge Discovery, 2, 121–167.
Cho, K. H., Sthiannopkao, S., Pachepsky, Y. A., Kim, K.-W., & Kim, J. H. (2011). Prediction of contamination potential of groundwater arsenic in Cambodia, Laos, and Thailand using artificial neural network. Water Research, 45, 5535–5544. https://doi.org/10.1016/j.watres.2011.08.010
Corinna Cortes, V. V. (1995). Support-vector networks. Maching Learning, 3, 273–297. https://doi.org/10.1109/64.163674
Cristianini, N., & Shawe-Taylor, J. (2000). An introduction to support vector machines and other kernel-based learning methods. Cambridge: Cambridge University Press.
Eberhart, R., Kennedy, J. (1995). “A new optimizer using particle swarm theory,” Sixth Int. Symp. Micro Mach. Hum. Sci., pp. 39–43.
Elbisy, M. S. (2015). Sea wave parameters prediction by support vector machine using a genetic algorithm. Journal of Coastal Research, 314, 892–899. https://doi.org/10.2112/jcoastres-d-13-00087.1
Fernandez, M., Caballero, J., Fernandez, L., & Sarai, A. (2011). Genetic algorithm optimization in drug design QSAR: Bayesian-regularized genetic neural networks (BRGNN) and genetic algorithm-optimized support vectors machines (GA-SVM). Molecular Diversity, 15, 269–289. https://doi.org/10.1007/s11030-010-9234-9
Ge, J., Shuyuan, W., Touré, D., Cheng, L., Miao, W., Cao, H., Pan, X., Li, J., Yao, M., & Feng, L. (2017). Analysis on biomass and productivity of epilithic algae and their relations to environmental factors in the Gufu River basin, Three Gorges Reservoir area, China. Environmental Science and Pollution Research, 24, 26881–26892. https://doi.org/10.1007/s11356-015-4112-8
He, Z., Wen, X., Liu, H., & Du, J. (2014). A comparative study of artificial neural network, adaptive neuro fuzzy inference system and support vector machine for forecasting river flow in the semiarid mountain region. Journal of Hydrology, 509, 379–386. https://doi.org/10.1016/j.jhydrol.2013.11.054
Heddam, S., & Kisi, O. (2018). Modelling daily dissolved oxygen concentration using least square support vector machine, multivariate adaptive regression splines and M5 model tree. Journal of Hydrology, 559, 499–509. https://doi.org/10.1016/j.jhydrol.2018.02.061
Ji, X., Shang, X., Dahlgren, R. A., & Zhang, M. (2017). Prediction of dissolved oxygen concentration in hypoxic river systems using support vector machine: A case study of Wen-Rui Tang River, China. Environmental Science and Pollution Research, 24, 16062–16076. https://doi.org/10.1007/s11356-017-9243-7
Jiao, J., Ma, W., & Pei, Q. (2018). Study on the characteristics of eutrophication evolution of tributaries of the Three Gorges Reservoir[J]. Joumal of China Institute Water Resource and Hydropower Research, 6(06), 544–548.
Keerthi, S. S., & Lin, C. J. (2003). Asymptotic behaviors of support vector machines with gaussian kernel. Neural Computation, 15, 1667–1689. https://doi.org/10.1162/089976603321891855
Kennedy J. (1995). “Particle swarm optimisation,” IEEE Int. Conf. Neural Networks, Perth,Australia, pp. 1942–1948.
Li, W., Yang, M., Liang, Z., Zhu, Y., Mao, W., Shi, J., & Chen, Y. (2013). Assessment for surface water quality in Lake Taihu Tiaoxi River Basin China based on support vector machine. Stochastic Environmental Research and Risk Assessment, 27, 1861–1870. https://doi.org/10.1007/s00477-013-0720-3
Liu, M., & Lu, J. (2014). Support vector machine-an alternative to artificial neuron network for water quality forecasting in an agricultural nonpoint source polluted river? Environmental Science and Pollution Research, 21, 11036–11053. https://doi.org/10.1007/s11356-014-3046-x
Mirjalili, S., Mirjalili, S. M., & Lewis, A. (2014). Grey wolf optimizer. Advances in Engineering Software, 69, 46–61.
Nawi, N. M., Atomi, W. H., & Rehman, M. Z. (2013). The effect of data pre-processing on optimized training of artificial neural networks. Procedia Technology, 11, 32–39. https://doi.org/10.1016/j.protcy.2013.12.159
Noori, R., Der, Y. H., Abbasi, M., et al. (2015). Uncertainty analysis of support vector machine for online prediction of five-day biochemical oxygen demand. Journal of Hydrology, 527, 833–843. https://doi.org/10.1016/j.jhydrol.2015.05.046
Park, Y., Cho, K. H., Park, J., Cha, S. M., & Kim, J. H. (2015). Development of early-warning protocol for predicting chlorophyll-a concentration using machine learning models in freshwater and estuarine reservoirs, Korea. Science of the Total Environment, 502, 31–41. https://doi.org/10.1016/j.scitotenv.2014.09.005
Qiu, G. S., Hu, S., Ye, D., Yuan, L., & Zang, X. P. (2011). Investigation on the present situation of eutrophication and water bloom in the branches of three Gorges Reservoir. Resources and Environment in the Yangtze Basin, 20(3), 311–316.
Renno, C., Petito, F., & Gatto, A. (2015). Artificial neural network models for predicting the solar radiation as input of a concentrating photovoltaic system. Energy Conversion and Management, 106, 999–1012. https://doi.org/10.1016/j.enconman.2015.10.033
Sha, Y., Wei, Y., Li, W., Fan, J., & Cheng, G. (2015). Artificial tide generation and its effects on the water environment in the backwater of Three Gorges Reservoir. Journal of Hydrology, 528, 230–237. https://doi.org/10.1016/j.jhydrol.2015.06.020
Singh, K. P., Basant, N., & Gupta, S. (2011). Support vector machines in water quality management. Analytica Chimica Acta, 703, 152–162. https://doi.org/10.1016/j.aca.2011.07.027
Xia, J., Gaohong, Xu., Guo, P., Hong Peng, X., Zhang, Y. W., & Zhang, W. (2018). Tempo-spatial analysis of water quality in the Three Gorges Reservoir, China, after its 175-m experimental impoundment. Water Resources Management, 32, 2937–2954. https://doi.org/10.1007/s11269-018-1918-4
Xiao, Q., Hu, D., & Xiao, Y. (2017). Assessing changes in soil conservation ecosystem services and causal factors in the Three Gorges Reservoir region of China. Journal of Cleaner Production, 163, S172–S180. https://doi.org/10.1016/j.jclepro.2016.09.012
Yanhong, X., Peng, H., Yang, Y., Zhang, W., & Wang, S. (2014). A cumulative eutrophication risk evaluation method based on a bioaccumulation model. Ecological Modelling, 289, 77–85. https://doi.org/10.1016/j.ecolmodel.2014.07.006
Yan, H., Zhang, J., & Rahman, S. S. (2020). Predicting permeability changes with injecting CO2 in coal seams during CO2 geological sequestration: A comparative study among six SVM-based hybrid models. Science of the Total Environment, 705, 135941. https://doi.org/10.1016/j.scitotenv.2019.135941
Yuan, Y., Zhao, S., & Wang, Y. (2016). Study on distribution law and influencing factors of chlorophyll a in backwater area of Three Gorges Reservoir[J]. Yangtze River, 47(23), 23–27.
Zeng, Q., Liu, Y., Zhao, H., Sun, M., & Li, X. (2017). Comparison of models for predicting the changes in phytoplankton community composition in the receiving water system of an inter-basin water transfer project. Environmental Pollution, 223, 676–684. https://doi.org/10.1016/j.envpol.2017.02.00
Zhang, K., Xiong, X., Hu, H., Wu, C., Bi, Y., Wu, Y., Zhou, B., Lam, P. K. S., & Liu, J. (2017a). Occurrence and characteristics of microplastic pollution in Xiangxi Bay of Three Gorges Reservoir, China. Environmental Science and Technology, 51, 3794–3801. https://doi.org/10.1021/acs.est.7b00369
Zhang, T., Huang, M., & Wang, Z. (2020). Estimation of chlorophyll-a concentration of lakes based on SVM algorithm and landsat 8 OLI images. Environmental Science and Pollution Research, 27, 14977–14990. https://doi.org/10.1007/s11356-020-07706-7
Zhang, Y., Xia, R., Zhang, M., Jing, Z., Zhao, Q., & Fan, J. (2017b). Research progress on cause analysis and modeling of river algal blooms under background of mega water projects. Research of Environmental Sciences, 30(8), 1163–1173.
Zhou, C., Yin, K., Cao, Y., & Ahmed, B. (2016). Application of time series analysis and PSO-SVM model in predicting the Bazimen landslide in the Three Gorges Reservoir, China. Engineering Geology, 204, 108–120. https://doi.org/10.1016/j.enggeo.2016.02.009
Zhou, S., Chu, X., Cao, S., Liu, X., & Zhou, Y. (2020). Prediction of the ground temperature with ANN, LS-SVM and fuzzy LS-SVM for GSHP application. Geothermics, 84, 101757.
Acknowledgements
The supports of this work is by the Natural Science Foundation of Hubei Province (2020CFB292). And the authors wish to thank Chongqing Academy of Environmental Science for providing the water quality dataset used in this research. Additionally, we greatly appreciate the helpful comments on the manuscript by the anonymous reviewers.
Funding
The support of this work is by the Natural Science Foundation of Hubei Province (2020CFB292).
Author information
Authors and Affiliations
Contributions
Dr. X is mainly responsible for data processing and model establishment and Dr. Z mainly responsible for result analysis and paper writing.
Corresponding author
Ethics declarations
Conflict of interest
No conflict of interest exits in the submission of this manuscript, and manuscript is approved by all authors for publication.
Consent to participate and consent to publish in the manuscript
No conflict of interest exits in the submission of this manuscript, and manuscript is approved by all authors for publication. I would like to declare on behalf of my co-authors that the work described is original research that has not been published previously and not under consideration for publication elsewhere.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Below is the link to the electronic supplementary material.
Rights and permissions
About this article
Cite this article
Xia, J., Zeng, J. Early warning of algal blooms based on the optimization support vector machine regression in a typical tributary bay of the Three Gorges Reservoir, China. Environ Geochem Health 44, 4719–4733 (2022). https://doi.org/10.1007/s10653-022-01203-1
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10653-022-01203-1