Abstract
Analysis of temporal patterns of high-dimensional time-series water quality data is essential for pollution management worldwide. This study has applied dynamic factor analysis (DFA) and cluster analysis (CA) to analyze time-series water quality data monitored at the five stations installed along the La Buong river in Southern Vietnam. Application of the DFA identified two types of temporal patterns, one of the run-off driven parameters (total suspended solid (TSS), turbidity, and iron) and the other of diffuse source pollution. The association of the variables like BOD5 and COD at most stations to the run-off-driven parameters revealed their sharing of drivers. On the contrary, separating variables like phosphate (PO43) at the three upstream stations from the run-off patterns suggested their local point-source origin. The DFA-derived factors were later used in the time-point CA to explore the seasonality of water quality parameters and their pollution intensities compared to regulatory levels. The result suggested intensification in wet season of Fe, TSS, BOD5, and COD concentrations at most sites, which are unobservable in run-off detached parameters like reactive nitrogen, phosphate (PO43−), and E. coli. These findings generated robust insights to support water quality management for river habitat conservation.
Similar content being viewed by others
Availability of data and materials
The datasets used and/or analyzed for the study are available on reasonable request (Dr. Nguyen Hong Quan with the email address nh.quan@iced.org.vn).
References
Aguilera, R., Sabater, S., & Marcé, R. (2018). A methodological framework for characterizing the spatiotemporal variability of river water-quality patterns using dynamic factor analysis. Journal of Environmental Informatics, 31(2), 97–110. https://doi.org/10.3808/jei.201600333
Akaike, H. (1974). A new look at the statistical model identification. IEEE Transactions on Automatic Control, 19(6), 716–723. https://doi.org/10.1109/TAC.1974.1100705
Albanese, D., Filosi, M., Visintainer, R., Riccadonna, S., Jurman, G., & Furlanello, C. (2013). Minerva and minepy: A C engine for the MINE suite and its R Python and MATLAB Wrappers. Bioinformatics, 29(3), 407–408. https://doi.org/10.1093/bioinformatics/bts707
Albanese, D., Riccadonna, S., Franceschi, P., & Donati, C. (2018). A practical tool for maximal information coefficient analysis. GigaScience, (April), 1–8. https://doi.org/10.1093/gigascience/giy032
Ansari, A. A., Gill, S. S., Lanza, G. R., & Rast, W. (2011). Eutrophication: causes, consequences and control. In Eutrophication: Causes, Consequences and Control (pp. 1–394). https://doi.org/10.1007/978-90-481-9625-8
Barbato, G., Barini, E. M., Genta, G., & Levi, R. (2011). Features and performance of some outlier detection methods. Journal of Applied Statistics, 38(10), 2133–2149. https://doi.org/10.1080/02664763.2010.545119
Budaev, S. V. (2010). Using principal components and factor analysis in animal behaviour research: Caveats and guidelines. Ethology, 116(5), 472–480. https://doi.org/10.1111/j.1439-0310.2010.01758.x
Chiew, F. H. S., & McMahon, T. A. (1993). Assessing the adequacy of catchment streamflow yield estimates. Australian Journal of Soil Research, 31(5), 665–680. https://doi.org/10.1071/SR9930665
Diamantini, E., Lutz, S. R., Mallucci, S., Majone, B., Merz, R., & Bellin, A. (2018). Driver detection of water quality trends in three large European river basins. Science of the Total Environment, 612, 49–62. https://doi.org/10.1016/j.scitotenv.2017.08.172
Dixon, W., & Chiswell, B. (1996). Review of aquatic monitoring program design. Water Research, 30(9), 1935–1948. https://doi.org/10.1016/0043-1354(96)00087-5
Efron, B. (1979). Bootstrap methods: Another look at the jackknife. The Annals of Statistics, 7(1), 1–26. https://doi.org/10.1214/aos/1176348654
Emerson, C. H., & Traver, R. G. (2008). Multiyear and seasonal variation of infiltration from storm-water best management practices. Journal of Irrigation and Drainage Engineering, 134(5), 598–605. https://doi.org/10.1061/(asce)0733-9437(2008)134:5(598)
Felipe-Sotelo, M., Andrade, J. M., Carlosena, A., & Tauler, R. (2007). Temporal characterisation of river waters in urban and semi-urban areas using physico-chemical parameters and chemometric methods. Analytica Chimica Acta, 583(1), 128–137. https://doi.org/10.1016/j.aca.2006.10.011
Glavan, M., & Pintar, M. (2012). Strengths, weaknesses, opportunities and threats of catchment modelling with Soil and Water Assessment Tool (SWAT) model. Water Resources Management and Modeling, (March). https://doi.org/10.5772/34539
Ishii, S., & Sadowsky, M. J. (2008). Escherichia coli in the environment: Implications for water quality and human health. Microbes and Environments, 23(2), 101–108. https://doi.org/10.1264/jsme2.23.101
Jang, J., Hur, H. G., Sadowsky, M. J., Byappanahalli, M. N., Yan, T., & Ishii, S. (2017). Environmental Escherichia coli: Ecology and public health implications—a review. Journal of Applied Microbiology, 123(3), 570–581. https://doi.org/10.1111/jam.13468
Juahir, H., Zain, S. M., Kamil, M., Hanidza, T. I. T., Armi, A. S. M., Ekhwan, M., & Mazlin, T. (2011). Spatial water quality assessment of Langat River Basin ( Malaysia ) using environmetric techniques, 625–641. https://doi.org/10.1007/s10661-010-1411-x
Khoi, D. N., Nguyen, V. T., Sam, T. T., & Nhi, P. T. T. (2019). Evaluation on effects of climate and land-use changes on streamflow and water quality in the La Buong River Basin, Southern Vietnam. Sustainability (Switzerland), 11(24). https://doi.org/10.3390/SU11247221
Kinney, J. B., & Atwal, G. S. (2014). Equitability , mutual information , and the maximal information coefficient, 2014, 21–26. https://doi.org/10.1073/pnas.1309933111
Kisekka, I., Migliaccio, K. W., Muñoz-Carpena, R., Schaffer, B., & Li, Y. C. (2013). Dynamic factor analysis of surface water management impacts on soil and bedrock water contents in Southern Florida Lowlands. Journal of Hydrology, 488, 55–72. https://doi.org/10.1016/j.jhydrol.2013.02.035
Kjelland, M. E., Woodley, C. M., Swannack, T. M., & Smith, D. L. (2015). A review of the potential effects of suspended sediment on fishes: Potential dredging-related physiological, behavioral, and transgenerational implications. Environment Systems and Decisions, 35(3), 334–350. https://doi.org/10.1007/s10669-015-9557-2
Kowalkowski, T., Zbytniewski, R., Szpejna, J., & Buszewski, B. (2006). Application of chemometrics in river water classification. Water Research, 40(4), 744–752. https://doi.org/10.1016/j.watres.2005.11.042
Kuo, Y. -M., & Chang, F. -J. (2010). Dynamic factor analysis for estimating ground water arsenic trends. Journal of Environmental Quality, 39(1), 176–184. https://doi.org/10.2134/jeq2009.0098
Kuo, Y. M., Chiu, C. H., & Yu, H. L. (2014). Influences of ambient air pollutants and meteorological conditions on ozone variations in Kaohsiung Taiwan. Stochastic Environmental Research and Risk Assessment, 29(3), 1037–1050. https://doi.org/10.1007/s00477-014-0968-2
Kuo, Y. M., Jang, C. S., Yu, H. L., Chen, S. C., & Chu, H. J. (2013). Identifying nearshore groundwater and river hydrochemical variables influencing water quality of Kaoping River Estuary using dynamic factor analysis. Journal of Hydrology, 486(May 2015), 39–47. https://doi.org/10.1016/j.jhydrol.2013.01.027
Kupka, D., Kania, M., Pietrzykowski, M., Łukasik, A., & Gruba, P. (2021). Multiple factors influence the accumulation of heavy metals (Cu, Pb, Ni, Zn) in forest soils in the vicinity of roadways. Water, Air, & Soil Pollution, 232(5), 194. https://doi.org/10.1007/s11270-021-05147-7
Le, T. T. H., Fettig, J., & Meon, G. (2019). Kinetics and simulation of nitrification at various pH values of a polluted river in the tropics. Ecohydrology and Hydrobiology, 19(1), 54–65. https://doi.org/10.1016/j.ecohyd.2018.06.006
Le, T. T. H., Zeunert, S., Lorenz, M., & Meon, G. (2017). Multivariate statistical assessment of a polluted river under nitrification inhibition in the tropics. Environmental Science and Pollution Research, 24(15), 13845–13862. https://doi.org/10.1007/s11356-017-8989-2
Liu, C. W., Lin, K. H., & Kuo, Y. M. (2003). Application of factor analysis in the assessment of groundwater quality in a blackfoot disease area in Taiwan. Science of the Total Environment, 313(1–3), 77–89. https://doi.org/10.1016/S0048-9697(02)00683-6
Liu, X., Zhang, X., & Zhang, M. (2008). Major factors influencing the efficacy of vegetated buffers on sediment trapping: A review and analysis. Journal of Environmental Quality, 37(5), 1667–1674. https://doi.org/10.2134/jeq2007.0437
Magyar, N., Hatvani, I. G., Székely, I. K., Herzig, A., Dinka, M., & Kovács, J. (2013). Application of multivariate statistical methods in determining spatial changes in water quality in the Austrian part of Neusiedler See. Ecological Engineering, 55, 82–92. https://doi.org/10.1016/j.ecoleng.2013.02.005
Molenaar, P. C. M. (1985). A dynamic factor model for the analysis of multivariate time series. Psychometrika, 50(2), 181–202. https://doi.org/10.1007/BF02294246
Muñoz-Carpena, R., Ritter, A., & Li, Y. C. (2005). Dynamic factor analysis of groundwater quality trends in an agricultural area adjacent to Everglades National Park. Journal of Contaminant Hydrology, 80(1–2), 49–70. https://doi.org/10.1016/j.jconhyd.2005.07.003
Naef, F., Scherrer, S., & Weiler, M. (2002). A process based assessment of the potential to reduce flood runoff by land use change. Journal of Hydrology, 267(1–2), 74–79. https://doi.org/10.1016/S0022-1694(02)00141-5
Nash, J. E., & Sutcliffe, I. V. (1970). River flow forecasting through conceptual models. Part I - A discussion of principles. Journal of Hydrology, 10(2), 282–290. https://doi.org/10.1080/00750770109555783
Nguyen, H. D., Hong Quan, N., Quang, N. X., Hieu, N. D., & Thang, L. V. (2019). Spatio-temporal pattern of water quality in the Saigon-Dong Nai river system due to waste water pollution sources. International Journal of River Basin Management, 0(0), 1–34. https://doi.org/10.1080/15715124.2019.1700513
Pedregosa, F., Varoquaux, G., Gramfort, A., Michel, V., Thirion, B., Grisel, O., et al. (2011). Scikit-learn: Machine Learning in Python. Journal of Machine Learning Research, 12, 2825–2830.
Pesce, S. F., & Wunderlin, D. A. (2000). Use of water quality indices to verify the Córdoba City ( Argentina ) on Suquía River. Wat. Res., 34(11), 2915–2926.
Razmkhah, H., Abrishamchi, A., & Torkian, A. (2010). Evaluation of spatial and temporal variation in water quality by pattern recognition techniques: A case study on Jajrood River (Tehran, Iran). Journal of Environmental Management, 91(4), 852–860. https://doi.org/10.1016/j.jenvman.2009.11.001
Reshef, D., Reshef, Y., Finucane, H., Grossman, S., Mcvean, G., Turnbaugh, P., et al. (2011). Detecting novel associations in large data sets. Science Translational Medicine, 334(6062), 1518–1524.
Riemersma, S., Little, J., Ontkean, G., & Moskal-Hebert, T. (2006). Phosphorus sources and sinks in watersheds: A review. Alberta Soil Phosphorus Limits Project, 5, 82.
Rivers, E. N., Heitman, J. L., McLaughlin, R. A., & Howard, A. M. (2021). Reducing roadside runoff: Tillage and compost improve stormwater mitigation in urban soils. Journal of Environmental Management, 280(October), 111732. https://doi.org/10.1016/j.jenvman.2020.111732
Rousseeuw, P. J. (1987). Silhouettes: A graphical aid to the interpretation and validation of cluster analysis. Journal of Computational and Applied Mathematics, 20(C), 53–65. https://doi.org/10.1016/0377-0427(87)90125-7
Rybczyńska, K., & Korniłłowicz-Kowalska, T. (2015). Evaluation of dye compounds’ decolorization capacity of selected H. haematococca and T. harzianum strains by principal component analysis (PCA). Water, Air, & Soil Pollution, 226(7), 228. https://doi.org/10.1007/s11270-015-2473-8
Seabold, S., & Perktold, J. (2010). statsmodels: econometric and statistical modeling with python. In 9th Python in Science Conference.
Shrestha, S., & Kazama, F. (2007). Assessment of surface water quality using multivariate statistical techniques: A case study of the Fuji river basin Japan. Environmental Modelling and Software, 22(4), 464–475. https://doi.org/10.1016/j.envsoft.2006.02.001
Singh, K. P., Malik, A., & Sinha, S. (2005). Water quality assessment and apportionment of pollution sources of Gomti river (India) using multivariate statistical techniques - a case study. Analytica Chimica Acta, 538(1–2), 355–374. https://doi.org/10.1016/j.aca.2005.02.006
Urbaniak, M., Urbaniak, M., Moran, S., & Urbaniak, M. (2014). Point sources of nutrient pollution in the lowland river catchment in the context of the Baltic Sea eutrophication. Ecological Engineering.
Van Drecht, G., Bouwman, A. F., Knoop, J. M., Beusen, A. H. W., & Meinardi, C. R. (2003). Global modeling of the fate of nitrogen from point and nonpoint sources in soils, groundwater, and surface water. Global Biogeochemical Cycles, 17(4). https://doi.org/10.1029/2003gb002060
Van Elsas, J. D., Semenov, A. V., Costa, R., & Trevors, J. T. (2011). Survival of Escherichia coli in the environment: Fundamental and public health aspects. ISME Journal, 5(2), 173–183. https://doi.org/10.1038/ismej.2010.80
Vega, M., Pardo, R., Barrado, E., & Debán, L. (1998). Assessment of seasonal and polluting effects on the quality of river water by exploratory data analysis. Water Research, 32(12), 3581–3592. https://doi.org/10.1016/S0043-1354(98)00138-9
Vialle, C., Sablayrolles, C., Lovera, M., Jacob, S., Huau, M. C., & Montrejaud-Vignoles, M. (2011). Monitoring of water quality from roof runoff: Interpretation using multivariate analysis. Water Research, 45(12), 3765–3775. https://doi.org/10.1016/j.watres.2011.04.029
Vuori, K. M. (1995). Direct and indirect effects of iron on river ecosystems. Annales Zoologici Fennici, 32(3), 317–329.
Wang, Y., Wang, P., Bai, Y., Tian, Z., Li, J., Shao, X., et al. (2013). Assessment of surface water quality via multivariate statistical techniques: A case study of the Songhua River Harbin region China. Journal of Hydro-Environment Research, 7(1), 30–40. https://doi.org/10.1016/j.jher.2012.10.003
Winterbourn, M. J., Mcdiffett, W. F., & Eppley, S. J. (2000). Aluminium and iron burdens of aquatic biota in New Zealand streams contaminated by acid mine drainage: Effects of trophic level. Science of the Total Environment, 254(1), 45–54. https://doi.org/10.1016/S0048-9697(00)00437-X
Wunderlin, D. A., María Del Pilar, D., María Valeria, A., Fabiana, P. S., Cecilia, H. A., & María De Los Ángeles, B. (2001). Pattern recognition techniques for the evaluation of spatial and temporal variations in water quality. A Case Study: Suquía River basin (Córdoba-Argentina). Water Research, 35(12), 2881–2894. https://doi.org/10.1016/S0043-1354(00)00592-3
Zuur, A. F., Fryer, R. J., Jolliffe, I. T., Dekker, R., & Beukema, J. J. (2003a). Estimating common trends in multivariate time series using dynamic factor analysis. Environmetrics, 14(7), 665–685. https://doi.org/10.1002/env.611
Zuur, A. F., Tuck, I. D., & Bailey, N. (2003b). Dynamic factor analysis to estimate common trends in fisheries time series. Canadian Journal of Fisheries and Aquatic Sciences, 60(5), 542–552. https://doi.org/10.1139/F03-030.
Funding
This research is funded by the Vietnam National University of Ho Chi Minh (VNU-HCM) city under grant number C2020-24–09/HD-KHCN.
Author information
Authors and Affiliations
Contributions
All authors contributed to the study in various aspects. LHP developed the concept, conducted data analysis, and wrote the first draft of the manuscript, while DDT contributed to revising the structure and editing. TDHL performed correlation analysis and, together with QTD, writo various parts of the manuscript. NHQ provided monitoring data and revised the manuscript. DNK, NTTH, NHA, and DTA presented their comments on the methods and discussion.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
This article is part of Topical Collection on Environmental sustainability and impacts of climate change in the Mekong delta region
Appendix
Appendix
Rights and permissions
About this article
Cite this article
Pham, L.H., Tran, D.D., Le, T.D.H. et al. Dynamic multivariate analysis for pollution assessment and river habitat conservation in the Vietnamese La Buong watershed. Environ Monit Assess 194 (Suppl 2), 774 (2022). https://doi.org/10.1007/s10661-022-10184-8
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s10661-022-10184-8