Abstract
Regression is a powerful tool in statistical modeling suited for qualitative and quantitative analysis and widely used in forecasting and prediction. The partial least squares modeling (PLSM) is one of the regression tools used in statistical analysis. There are many fields in which PLSM has been used; water is one of them, which is an area of interest for many researchers and scientists for more than two decades. Since water has multiple parameters to analyze, there is a problem of dimensionality and collinearity. The problem of multidimensionality, as well as collinearity, can be solved by PLSM. PLS regression can be suitable for analysis as it is the most prominent multivariate regression tool. This paper describes the use of PLS regression modeling for water quality analysis of different kinds of water samples (groundwater, wastewater, river water, and coastal water). Various methods employing PLSM for water quality analysis has been discussed in detail.
Similar content being viewed by others
Data availability
Not applicable.
Code availability
Not applicable.
References
Abdi H (2010) Partial least squares regression and projection on latent structure regression (PLS Regression). Wiley Interdiscip Rev Comput Stat 2:97–106. https://doi.org/10.1002/wics.51
Abdul-Aziz OI, Ahmed S (2017) Relative linkages of stream water quality and environmental health with the land use and hydrologic drivers in the coastal-urban watersheds of southeast Florida. GeoHealth 1:180–195. https://doi.org/10.1002/2017gh000058
Afonso-Olivares C, Sosa-Ferrera Z, Santana-Rodríguez JJ (2012) Analysis of anti-inflammatory, analgesic, stimulant and antidepressant drugs in purified water from wastewater treatment plants using SPE-LC tandem mass spectrometry. J Environ Sci Heal Part A 47:887–895. https://doi.org/10.1080/10934529.2012.665005
Agüera A, Mezcua M, Mocholí F et al (2006) Application of gas chromatography-hybrid chemical ionization mass spectrometry to the analysis of diclofenac in wastewater samples. J Chromatogr A 1133:287–292. https://doi.org/10.1016/j.chroma.2006.08.017
Aguilera PA, Frenich AG, Castro H, Vidal JLM (2000) PLS and PCR methods in the assessment of coastal water quality. Environ Monit Assess 62:193–204. https://doi.org/10.1023/A:1006236804405
Amaral AL, Ferreira EC (2005) Activated sludge monitoring of a wastewater treatment plant using image analysis and partial least squares regression. Analytica chimica acta. Elsevier, Amsterdam, pp 246–253
Anan K (2003) Water-related diseases responsible for 80 per cent of all illnesses, deaths in developing world’, says secretary-general in environment day message. In: UN. https://www.un.org/press/en/2003/sgsm8707.doc.htm. Accessed 5 Dec 2017.
Artursson T, Eklov T, Lundstrom I et al (2000) Drift correction for gas sensors using multivariate methods. J Chemom 14:711–723. https://doi.org/10.1002/1099-128X(200009/12)14:5/6%3c711::AID-CEM607%3e3.0.CO;2-4
Baffi G, Martin EB, Morris AJ (1999a) Nonlinear projection to latent structures revisited: the quadratic PLS algorithm. Comput Chem Eng 23:395–411. https://doi.org/10.1016/S0098-1354(98)00283-X
Baffi G, Martin EB, Morris AJ (1999b) Nonlinear projection to latent structures revisited (the neural network PLS algorithm). Comput Chem Eng 23:1293–1307. https://doi.org/10.1016/S0098-1354(99)00291-4
Bang YH, Yoo CK, Lee IB (2002) Nonlinear PLS modeling with fuzzy inference system. Chemom Intell Lab Syst 64:137–155. https://doi.org/10.1016/S0169-7439(02)00084-9
Barker M, Rayens W (2003) Partial least squares for discrimination. J Chemom 17:166–173. https://doi.org/10.1002/cem.785
Bertrand-Krajewski JL, Winkler S, Saracevic E et al (2007) Comparison of and uncertainties in raw sewage COD measurements by laboratory techniques and field UV-visible spectrometry. Water Sci Technol 56:17–25. https://doi.org/10.2166/wst.2007.759
Bezdek JC, Pal SK (eds) (1992) Fuzzy models for pattern recognition: methods that search for structures in data (Ieee Press Selected Reprint Series). IEEE CS Press, New York
Boulesteix A, Strimmer K (2007) Partial least squares: a versatile tool for the analysis of high-dimensional genomic data. Brief Bioinform 8:32–44. https://doi.org/10.1093/bib/bbl016
Braak CJF, Piert FMV (1995) Canonical correspondence analysis and related multivariate methods in aquatic ecology. Aquat Sci 57:255–289
Camarrone F, Van Hulle MM (2019) Fast multiway partial least squares regression. IEEE Trans Biomed Eng 66:433–443. https://doi.org/10.1109/TBME.2018.2847404
Campos I, Alcañiz M, Aguado D et al (2012) A voltammetric electronic tongue as tool for water quality monitoring in wastewater treatment plants. Water Res 46:2605–2614. https://doi.org/10.1016/j.watres.2012.02.029
Chen BW (2020) Novel kernel orthogonal partial least squares for dominant sensor data extraction. IEEE Access 8:36131–36139. https://doi.org/10.1109/ACCESS.2020.2974873
Conzen JP, Bürck J, Ache HJ (1994) Determination of chlorinated hydrocarbons in water by fiber-optic evanescent wave spectroscopy and partial least-squares regression. Fresenius J Anal Chem 348:501–505. https://doi.org/10.1007/BF00323919
Cozzolino D, Parker M, Dambergs RG et al (2006) Chemometrics and visible-near infrared spectroscopic monitoring of red wine fermentation in a pilot scale. Biotechnol Bioeng 95:1101–1107
Dahlén J, Karlsson S, Bäckström M et al (2000) Determination of nitrate and other water quality parameters in groundwater from UV/Vis spectra employing partial least squares regression. Chemosphere 40:71–77. https://doi.org/10.1016/S0045-6535(99)00242-8
Davis R, Mauer LJ (2010) Fourier transform infrared (FT-IR) spectroscopy: a rapid tool for detection and analysis of foodborne pathogenic bacteria. Curr Res Technol Educ Top Appl Microbiol Microb Biotechnol 2:1582–1594
de Jong S (1993) SIMPLS: an alternative approach to partial least squares regression. Chemom Intell Lab Syst 18:251–263. https://doi.org/10.1016/0169-7439(93)85002-X
de Oliveira LFR, Santana RC (2020) Estimation of leaf nutrient concentration from hyperspectral reflectance in Eucalyptus using partial least squares regression. Sci Agric. https://doi.org/10.1590/1678-992x-2018-0409
Deepa S, Venkateswaran S (2018) Appraisal of groundwater quality in upper Manimuktha sub basin, Vellar river, Tamil Nadu, India by using Water Quality Index (WQI) and multivariate statistical techniques. Model Earth Syst Environ 4:1165–1180. https://doi.org/10.1007/s40808-018-0468-3
Deng X, Chen Y, Wang P, Cao Y (2020) Soft Sensor Modeling for unobserved multimode nonlinear processes based on modified kernel partial least squares with latent factor clustering. IEEE Access 8:35864–35872. https://doi.org/10.1109/ACCESS.2020.2974783
Djerioui M, Bouamar M, Ladjal M, Zerguine · Azzedine, (2019) Chlorine soft sensor based on extreme learning machine for water quality monitoring. Arab J Sci Eng 44:2033–2044. https://doi.org/10.1007/s13369-018-3253-8
Dodge Y, Whittaker J (2009) Partial quantile regression. Metrika 70:35–57. https://doi.org/10.1007/s00184-008-0177-4
Doltra A, Dietrich T, Schneeweis C et al (2013) Magnetic resonance imaging of cardiovascular fibrosis and inflammation: from clinical practice to animal studies and back cardiovascular MRI view project magnetic resonance imaging of cardiovascular fibrosis and inflammation: from clinical practice to ani. Biomed Res Int 676489:1–2. https://doi.org/10.1155/2013
FAO (2016) AQUASTAT website. Food and Agriculture Organization in the United Nations (FAO). https://www.fao.org/nr/water/aquastat/dams/index.stm. Accesed 27 June 2016.
Feng H, Zhou Y (2015) Analysis of soluble microbial products by mass spectrometry: potential in monitoring bioprocesses of wastewater treatment. J Anal Bioanal Tech. https://doi.org/10.4172/2155-9872.1000241
Feng YZ, Elmasry G, Sun DW et al (2013) Near-infrared hyperspectral imaging and partial least squares regression for rapid and reagentless determination of Enterobacteriaceae on chicken fillets. Food Chem 138:1829–1836. https://doi.org/10.1016/j.foodchem.2012.11.040
Feudale RN, Woody NA, Tan H et al (2002) Transfer of multivariate calibration models: a review. Chemom Intell Lab Syst 64:181–192
Fezai R, Abodayeh K, Mansouri M et al (2020) Reliable fault detection and diagnosis of large-scale nonlinear uncertain systems using interval reduced kernel PLS. IEEE Access. https://doi.org/10.1109/access.2020.2989917
Fritzsche A, Ritschel T, Schneider L, Totsche KU (2019) Identification and quantification of single constituents in groundwater with Fourier-transform infrared spectroscopy and Positive Matrix Factorization. Vib Spectrosc 100:152–158. https://doi.org/10.1016/j.vibspec.2018.09.008
Gan R, Tan J, Mo L et al (2020) Using partial least squares regression to fit small data of H7N9 incidence based on the baidu index. IEEE Access 8:60392–60400. https://doi.org/10.1109/ACCESS.2020.2983799
Garcia-Breijo E, Peris RM, Pinatti CO et al (2013) Low-cost electronic tongue system and its application to explosive detection. IEEE Trans Instrum Meas 62:424–431. https://doi.org/10.1109/TIM.2012.2215156
Geladi P, Kowalski BR (1986) Partial least-squares regression: a tutorial. Anal Chim Acta 185:1–17. https://doi.org/10.1016/0003-2670(86)80028-9
Gerlach RW, Kowalski BR, Wold HOA (1979) Partial least-squares path modelling with latent variables. Anal Chim Acta 112:417–421. https://doi.org/10.1016/S0003-2670(01)85039-X
Guidetti R, Beghi R, Giovenzana V (2012) Chemometrics in food technology. Chemometrics in practical applications. In Tech, London, pp 217–252
Habibi V, Ahmadi H, Jafari M, Moeini A (2019) Application of nonlinear models and groundwater index to predict desertification case study: Sharifabad watershed. Nat Hazards 99:715–733. https://doi.org/10.1007/s11069-019-03769-z
Haider H, IbrahimSalamah A-SSS et al (2017) Development of groundwater quality index using fuzzy-based multicriteria analysis for Buraydah, Qassim, Saudi Arabia. Arab J Sci Eng 42:4033–4051. https://doi.org/10.1007/s13369-017-2578-z
Hangbo L, Dai JJJQ (2004) Application of the neural network model with partial least-squares regression to the displacement prediction of roadway surrounding rock. J China Coal Soc 29:274–278
Hiroaki I, Toyonori N, Eiji T (2002) Measurement of pesticide residues in food based on diffuse reflectance IR spectroscopy. IEEE Trans Instrum Meas 51:886–890. https://doi.org/10.1109/TIM.2002.807791
Inagaki T, Shinoda Y, Miyazawa M et al (2010) Near-infrared spectroscopic assessment of contamination level of sewage. Water Sci Technol 61:1957–1963. https://doi.org/10.2166/wst.2010.070
Iranmanesh A, Locke RA, Wimmer BT (2014) Multivariate statistical evaluation of groundwater compliance data from the Illinois Basin-Decatur Project. Energy Proced 63:3182–3194. https://doi.org/10.1016/j.egypro.2014.11.343
Isiyaka HA, Mustapha A, Juahir H, Phil-Eze P (2019) Water quality modelling using artificial neural network and multivariate statistical techniques. Model Earth Syst Environ 5:583–593. https://doi.org/10.1007/s40808-018-0551-9
Jacobsen T, Kolset K, Vogt NB (1986) Partial least-squares regression and fuzzy clustering. A joint approach. Mikrochim Acta 89:125–138. https://doi.org/10.1007/BF01207312
Jia R, Mao Z, Chang Y (2010) A nonlinear robust partial least squares method with application. Chin Control Decis Conf. https://doi.org/10.1109/CCDC.2010.5498819
Jiang J-H, Berry RJ, Siesler HW, Ozaki Y (2002) Wavelength interval selection in multicomponet spectral analysis by moving window partial least-squares regression with application to mid-infrared and near-infrared spectroscopic data. Anal Chem 74:3555–3565
Jing X (2019) Rapid fluorescence analysis method for monitoring water environment with wide concentration range based on multidimensional partial least squares method. Int Geosci Remote Sens Symp. https://doi.org/10.1109/IGARSS.2019.8898214
Judran NH, Kumar A (2020) Evaluation of water quality of Al-Gharraf River using the water quality index (WQI). Model Earth Syst Environ 6:1581–1588. https://doi.org/10.1007/s40808-020-00775-0
Kadam AK, Wagh VM, Muley AA et al (2019) Prediction of water quality index using artificial neural network and multiple linear regression modelling approach in Shivganga River basin, India. Model Earth Syst Environ 5:951–962. https://doi.org/10.1007/s40808-019-00581-3
Karydis M (1996) Quantitative assessment of eutrophication: a scoring system for characterising water quality in coastal marine ecosystems. Environ Monit Assess 41:233–246
Killner MHM, Rohwedder JJR, Pasquini C (2011) A PLS regression model using NIR spectroscopy for online monitoring of the biodiesel production reaction. Fuel 90:3268–3273. https://doi.org/10.1016/J.FUEL.2011.06.025
Kommineni M, Reddy KV, Jagathi K et al (2020) Groundwater level prediction using modified linear regression. Institute of Electrical and Electronics Engineers (IEEE), New Jersey, pp 1164–1168
Kosanovic BR, Chaparro LF, Sclabassi RJ (1995) On estimation of temporal fuzzy sets for signal analysis: FCM vs. FMLE approaches. In: Proceedings of 3rd International Symposium on Uncertainty Modeling and Analysis and Annual Conference of the North American Fuzzy Information Processing Society. IEEE Comput. Soc. Press, pp 583–588
Krishnan A, Williams LJ, McIntosh AR, Abdi H (2011) Partial Least Squares (PLS) methods for neuroimaging: a tutorial and review. Neuroimage 56:455–475. https://doi.org/10.1016/j.neuroimage.2010.07.034
Kundu PK, Panchariya PC, Kundu M (2011) Classification and authentication of unknown water samples using machine learning algorithms. ISA Trans 50:487–495. https://doi.org/10.1016/j.isatra.2011.03.003
Langergraber G, Fleischmann N, Hofstädter F (2003) A multivariate calibration procedure for UV/VIS spectrometric quantification of organic matter and nitrate in wastewater. Water science and technology. IWA Publishing, London, pp 63–71
Laref R, Ahmadou D, Losson E, Siadat M (2017) Orthogonal signal correction to improve stability regression model in gas sensor systems. J Sensors. https://doi.org/10.1155/2017/9851406
Larraín M, Guesalaga AR, Agosin E (2008) A multipurpose portable instrument for determining ripeness in wine grapes using NIR spectroscopy. IEEE Trans Instrum Meas 57:294–302. https://doi.org/10.1109/TIM.2007.910098
Lawrence KC, Windham WR, Park B et al (2006) Partial least squares regression of hyperspectral images for contaminant detection on poultry carcasses. J Near Infrared Spectrosc 14:223–230. https://doi.org/10.1255/jnirs.639
Lee DS, Vanrolleghem PA, Park JM (2005) Parallel hybrid modeling methods for a full-scale cokes wastewater treatment plant. J Biotechnol 115:317–328. https://doi.org/10.1016/j.jbiotec.2004.09.001
Lee DS, Lee MW, Woo SH et al (2006) Nonlinear dynamic partial least squares modeling of a full-scale biological wastewater treatment plant. Process Biochem 41:2050–2057. https://doi.org/10.1016/j.procbio.2006.05.006
Li B, Hassel PA, Morris AJ, Martin EB (2005) A nonlinear nested partial least-squares algorithm. Comput Stat Data Anal 48:87–101. https://doi.org/10.1016/j.csda.2003.10.015
Liu Y, Yu H, Sun Y, Chen J (2016) Novel assessment method of heavy metal pollution in surface water: a case study of Yangping River in Lingbao City, China. Environ Eng Res 22:31–39. https://doi.org/10.4491/eer.2016.015
Lourenço ND, Menezes JC, Pinheiro HM, Diniz D (2008) Development of PLS calibration models from UV-Vis spectra for TOC estimation at the outlet of a fuel park wastewater treatment plant. Environ Technol 29:891–898. https://doi.org/10.1080/09593330802015581
Luo B, Yuan Z, Kai C, Xinhua Z (2009) Partial least squares regression model to predict water quality in urban water distribution systems. Trans Tianjin Univ 15:140–144. https://doi.org/10.1007/s12209-009-0025-2
Lyu Y, Kinouchi T (2018) Estimation of river water quality using diffential ultraviolet-visible spectra based on partial least squares regression. J Japan Soc Civ Eng Ser B1. https://doi.org/10.2208/jscejhe.74.i_301
MacGregor JF, Jaeckle C, Kiparissides C, Koutoudi M (1994) Process monitoring and diagnosis by multiblock PLS methods. AIChE J 40:826–838. https://doi.org/10.1002/aic.690400509
Maggio RM, Cerretani L, Chiavaro E et al (2010) A novel chemometric strategy for the estimation of extra virgin olive oil adulteration with edible oils. Food Control 21:890–895
Mardia KV, Kent JT, Bibby JM (1979) Multivariate analysis. Academic Press, New York
Marsili-Libelli S, Müller A (1996) Adaptive fuzzy pattern recognition in the anaerobic digestion process. Pattern Recognit Lett 17:651–659. https://doi.org/10.1016/0167-8655(96)00030-X
Martínez Galera M, Martínez Vidal JL, Garrido Frenich A, Gil García MD (1996) Determination of cypermethrin, fenvalerate and cis- and trans-permethrin in soil and groundwater by high-performance liquid chromatography using partial least-squares regression. J Chromatogr A 727:39–46. https://doi.org/10.1016/0021-9673(95)01151-X
Martins L, Pereira A, Oliveira A et al (2019) An assessment of groundwater contamination risk with radon based on clustering and structural models. Water 11:1107. https://doi.org/10.3390/w11051107
McIntosh AR, Lobaugh NJ (2004) Partial least squares analysis of neuroimaging data: applications and advances. Neuroimage. https://doi.org/10.1016/j.neuroimage.2004.07.020
McIntosh AR, Bookstein FL, Haxby JV, Grady CL (1996) Spatial pattern analysis of functional brain images using partial least squares. Neuroimage 3:143–157. https://doi.org/10.1006/nimg.1996.0016
Meacham-Hensold K, Montes CM, Wu J et al (2019) High-throughput field phenotyping using hyperspectral reflectance and partial least squares regression (PLSR) reveals genetic modifications to photosynthetic capacity. Remote Sens Environ 231:111176. https://doi.org/10.1016/j.rse.2019.04.029
Mehmood T, Bohlin J, Snipen L (2015) A partial least squares based procedure for upstream sequence classification in Prokaryotes. IEEE/ACM Trans Comput Biol Bioinforma 12:560–567. https://doi.org/10.1109/TCBB.2014.2366146
Mesquita DP, Quintelas C, Amaral AL, Ferreira EC (2017) Monitoring biological wastewater treatment processes: recent advances in spectroscopy applications. Rev Environ Sci Biotechnol 16:395–424
Mo C, Lim J, Kwon SW et al (2017) Hyperspectral imaging and partial least square discriminant analysis for geographical origin discrimination of white rice. J Biosyst Eng 42:293–300
Mokarram M (2016) Modeling of multiple regression and multiple linear regressions for prediction of groundwater quality (case study: north of Shiraz). Model Earth Syst Environ 2:1–7. https://doi.org/10.1007/s40808-015-0059-5
Nag SK (2014) Evaluation of hydrochemical parameters and quality assessment of the groundwater in Gangajalghati block, Bankura District, West Bengal, India. Arab J Sci Eng 39:5715–5727. https://doi.org/10.1007/s13369-014-1141-4
Nakar A, Schmilovitch Z, Vaizel-Ohayon D et al (2020) Quantification of bacteria in water using PLS analysis of emission spectra of fluorescence and excitation-emission matrices. Water Res 169:115197. https://doi.org/10.1016/j.watres.2019.115197
Nash MS, Chaloud DJ (2011) Partial least square analyses of landscape and surface water biota associations in the Savannah River Basin. Int Sch Res Netw ISRN Ecol 2011:11. https://doi.org/10.5402/2011/571749
Noori R, Sabahi MS, Karbassi AR et al (2010) Multivariate statistical analysis of surface water quality based on correlations and variations in the data set. Desalination. https://doi.org/10.1016/j.desal.2010.04.053
Nørgaard L, Saudland A, Wagner J et al (2000) Interval partial least squares regression (i PLS): a comparative chemometric study with an example from the near infrared spectroscopy. Appl Spectrosc 54:413–419. https://doi.org/10.1366/0003702001949500
Nosrati K (2015) Application of multivariate statistical analysis to incorporate physico-chemical surface water quality in low and high flow hydrology. Model Earth Syst Environ 1:1–13. https://doi.org/10.1007/s40808-015-0021-6
Omran ESE (2016) Inference model to predict heavy metals of Bahr El Baqar soils, Egypt using spectroscopy and chemometrics technique. Model Earth Syst Environ 2:200. https://doi.org/10.1007/s40808-016-0259-7
Padilla M, Perera A, Montoliu I et al (2010) Drift compensation of gas sensor array data by orthogonal signal correction. Chemom Intell Lab Syst 100:28–35. https://doi.org/10.1016/j.chemolab.2009.10.002
Pan T-T, Sun D-W, Cheng J-H, Pu H (2016) Regression algorithms in hyperspectral data analysis for meat quality detection and evaluation. Compr Rev Food Sci Food Saf 15:529–541. https://doi.org/10.1111/1541-4337.12191
Park S, Kazama F, Lee S (2014) Assessment of water quality using multivariate statistical techniques: a case study of the Nakdong river basin, Korea. Environ Eng Res 19:197–203. https://doi.org/10.4491/eer.2014.008
Parrilla P, Martinez Galera M, Garrido Frenich A, Martinez Vidal JL (1997) Determination of folpet, procymidone, and triazophos in groundwater by HPLC using partial least squares and principal component regression. J Liq Chromatogr Relat Technol 20:425–442. https://doi.org/10.1080/10826079708010661
Páscoa RNM, Lopes JA, Lima JLFC (2008) In situ near infrared monitoring of activated dairy sludge wastewater treatment processes. J Near Infrared Spectrosc 16:409–419. https://doi.org/10.1255/jnirs.803
Qin SJ, McAvoy TJ (1992) Nonlinear PLS modeling using neural networks. Comput Chem Eng 16:379–391. https://doi.org/10.1016/0098-1354(92)80055-E
Razmjoo A, Khalili N, Majidi Nezhad M et al (2020) The main role of energy sustainability indicators on the water management. Model Earth Syst Environ 6:1419–1426. https://doi.org/10.1007/s40808-020-00758-1
Razmkhah H, Abrishamchi A, Torkian A (2010) Evaluation of spatial and temporal variation in water quality by pattern recognition techniques: a case study on Jajrood River (Tehran, Iran). J Environ Manag 91:852–860. https://doi.org/10.1016/j.jenvman.2009.11.001
Rieger L, Langergraber G, Siegrist H (2006) Uncertainties of spectral in situ measurements in wastewater using different calibration approaches. In: Water Science and Technology. IWA Publishing, pp 187–197
Ryan K, Ali K (2016) Application of a partial least-squares regression model to retrieve chlorophyll-a concentrations in coastal waters using hyper-spectral data. Ocean Sci J 51:209–221. https://doi.org/10.1007/s12601-016-0018-8
Sakizadeh M, Ahmadpour E (2016) Geological impacts on groundwater pollution: a case study in Khuzestan Province. Environ Earth Sci 75:1–12. https://doi.org/10.1007/s12665-015-4944-z
Santos AB, De Albuquerque Araujo A, Schwartz WR, Menotti D (2015) Hyperspectral image interpretation based on partial least squares. In: Proceedings - International Conference on Image Processing, ICIP. IEEE Computer Society, pp 1885–1889
Sarraguça MC, Paulo A, Alves MM et al (2009) Quantitative monitoring of an activated sludge reactor using online UV-visible and near-infrared spectroscopy. Anal Bioanal Chem 395:1159–1166. https://doi.org/10.1007/s00216-009-3042-z
Shariati-Rad M, Irandoust M, Haghighi M (2015) Introduction of a spectrophotometric method for simultaneous determination of nitrite and nitrate in water samples using partial least squares. Int J Environ Sci Technol 12:3837–3842. https://doi.org/10.1007/s13762-015-0846-6
Shawul AA, Chakma S, Melesse AM (2019) The response of water balance components to land cover change based on hydrologic modeling and partial least squares regression (PLSR) analysis in the Upper Awash Basin. J Hydrol Reg Stud 26:100640. https://doi.org/10.1016/j.ejrh.2019.100640
Shrestha S, Kazama F (2007) Assessment of surface water quality using multivariate statistical techniques: a case study of the Fuji river basin, Japan. Environ Model Softw. https://doi.org/10.1016/j.envsoft.2006.02.001
Sim SF, Ling TY, Lau S, Jaafar MZ (2015) A novel computer-aided multivariate water quality index. Environ Monit Assess 187(4):181. https://doi.org/10.1007/s10661-015-4416-7
Singh KP, Malik A, Singh VK (2006) Chemometric analysis of hydro-chemical data of an alluvial river. A case study. Water Air Soil Pollut 170:383–404. https://doi.org/10.1007/s11270-005-9010-0
Singh KP, Malik A, Basant N, Saxena P (2007) Multi-way partial least squares modeling of water quality data. Anal Chim Acta 584:385–396. https://doi.org/10.1016/j.aca.2006.11.038
Sun W, Sun J (2017) Prediction of carbon dioxide emissions based on principal component analysis with regularized extreme learning machine: the case of China. Environ Eng Res 22:302–311. https://doi.org/10.4491/eer.2016.153
Tafreshi GM, Tafreshi AM (2020) Statistical approaches and hydrochemical modeling of groundwater in the Golpayegan Plain aquifer. Iran Model Earth Syst Environ. https://doi.org/10.1007/s40808-020-00822-w
Teófilo RF, Martins JPA, Ferreira MMC (2009) Sorting variables by using informative vectors as a strategy for feature selection in multivariate regression. J Chemom 23:32–48. https://doi.org/10.1002/cem.1192
Teppola P, Mujunen S-P, Minkkinen P (1998) A combined approach of partial least squares and fuzzy c-means clustering for the monitoring of an activated-sludge wastewater treatment plant. Chemom Intell Lab Syst 41:95–103. https://doi.org/10.1016/S0169-7439(98)00026-4
Thaysen D, van den Berg F, Rasmussen MA et al (2017) Monitoring process water quality using near infrared spectroscopy and partial least squares regression with prediction uncertainty estimation. Appl Spectrosc 71(3):410–421
Ukah BU, Ubido OE, Igwe O (2020) Geo-statistical assessment of the soil quality and its influence on groundwater pollution in some part of Lagos State Nigeria. Model Earth Syst Environ 6:953–965. https://doi.org/10.1007/s40808-020-00731-y
UNF and AO (2016) AQUASTAT. https://www.fao.org. Accessed 25 Aug 2017.
UNICEF, WHO (2009) Diarrhoea: why children are still dying and what can be done? WHO/ UNICEF Report: 1–16. Lancet 375:870–872
Ustaoğlu F, Tepe Y, Taş B (2019) Assessment of stream quality and health risk in a subtropical Turkey river system: a combined approach using statistical analysis and water quality index. Ecol Indic. https://doi.org/10.1016/j.ecolind.2019.105815
Walczak B, Massart DL (1996) The radial basis functions-partial least squares approach as a flexible nonlinear regression technique. Anal Chim Acta 331:177–185. https://doi.org/10.1016/0003-2670(96)00202-4
Wang P, Yu J, Zhang Y, Liu C (2013) Groundwater recharge and hydrogeochemical evolution in the Ejina Basin, northwest China. J Hydrol 476:72–86. https://doi.org/10.1016/j.jhydrol.2012.10.049
Wang Y, Wang P, Bai Y et al (2013) Assessment of surface water quality via multivariate statistical techniques: a case study of the Songhua River Harbin region, China. J Hydro-Environ Res 7:30–40. https://doi.org/10.1016/j.jher.2012.10.003
Wang Z, Kawamura K, Sakuno Y et al (2017) Retrieval of chlorophyll-a and total suspended solids using iterative stepwise elimination partial least squares (ISE-PLS) regression based on field hyperspectral measurements in irrigation ponds in Higashihiroshima. Jpn Remote Sens 9:264. https://doi.org/10.3390/rs9030264
Wen Q, Ge Z, Song Z, Wang P (2015) Dynamic process calibration based on sparse partial least squares. In: Proceedings of the World Congress on Intelligent Control and Automation (WCICA). IEEE, pp 1366–1371
Wold S, Kettaneh-Wold N, Skagerberg B (1989) Nonlinear PLS modeling. Chemom Intell Lab Syst 7:53–65. https://doi.org/10.1016/0169-7439(89)80111-X
Wold S, Sjöström M, Eriksson L (2001) PLS-regression: a basic tool of chemometrics. Chemom Intell Lab Syst 58:109–130. https://doi.org/10.1016/S0169-7439(01)00155-1
World Health Organization (WHO) (1996) WHO Drinking Water. Fact sheet No. 391. World Health Organization. 2017. https://www.who.int/mediacentre/factsheets/fs391/en/. Accessed 25 Nov 2019
World Resources Institute (2016) India Water Tool Version 2.1. https://maps.indiawatertool.in/iwt/av/IWTUserGuide.pdf. Accessed 5 Dec 2017.
Xing Z, Chen J, Zhao X et al (2019) Quantitative estimation of wastewater quality parameters by hyperspectral band screening using GC. VIP SPA PeerJ 2019:e8255. https://doi.org/10.7717/peerj.8255
Xu ZM, Wen XQZJL (2011) Cooling water fouling resistance prediction of plate heat exchanger based on partial least-squares regression. J Chem Ind Eng 62:1531–1536
Xu G, Ren X, Yang Z et al (2019) Influence of landscape structures on water quality at multiple temporal and spatial scales: a case study of Wujiang River watershed in Guizhou. Water (Switzerland). https://doi.org/10.3390/w11010159
Xu S, Cui Y, Yang C et al (2020) The fuzzy comprehensive evaluation (FCE) and the principal component analysis (PCA) model simulation and its applications in water quality assessment of Nansi Lake Basin, China. Environ Eng Res. https://doi.org/10.4491/eer.2020.022
Yan K, Zhang D (2016) Correcting instrumental variation and time-varying drift: a transfer learning approach with autoencoders. IEEE Trans Instrum Meas 65:2012–2022. https://doi.org/10.1109/TIM.2016.2573078
Yang Q, Liu Z, Yang J (2009) Simultaneous determination of chemical oxygen demand (COD) and biological oxygen demand (BOD5) in wastewater by near-infrared spectrometry. J Water Resour Prot 01:286–289. https://doi.org/10.4236/jwarp.2009.14035
Yazgan Karacaglar NN, Bulat T, Boyaci IH, Topcu A (2019) Raman spectroscopy coupled with chemometric methods for the discrimination of foreign fats and oils in cream and yogurt. J Food Drug Anal 27:101–110. https://doi.org/10.1016/j.jfda.2018.06.008
Ye X, Wu Z, Wang Z et al (2018) Seasonal prediction of the yangtze river runoff using a partial least squares regression model. Atmos Ocean 56:117–128. https://doi.org/10.1080/07055900.2018.1448751
Yoo C-K, Son H-R, Lee I-B (2005) Modeling and multiresolution analysis in a full-scale industrial plant. Environ Eng Res 10:88–103. https://doi.org/10.4491/eer.2005.10.2.088
Zamora D, Torres A (2014) Method for outlier detection: a tool to assess the consistency between laboratory data and ultraviolet-visible absorbance spectra in wastewater samples. Water Sci Technol 69:2305–2314. https://doi.org/10.2166/wst.2014.139
Zheng H, Jiang L, Lou H et al (2011) Application of Artificial Neural Network (ANN) and Partial Least-Squares Regression (PLSR) to predict the changes of anthocyanins, ascorbic acid, total phenols, flavonoids, and antioxidant activity during storage of red bayberry juice based on fractal ana. J Agric Food Chem 59:592–600. https://doi.org/10.1021/jf1032476
Ziyatdinov A, Marco S, Chaudry A et al (2010) Drift compensation of gas sensor array data by common principal component analysis. Sens Actuators B. https://doi.org/10.1016/j.snb.2009.11.034
Acknowledgement
The authors are pleased to acknowledge the Birla Institute of Technology and Science for providing an enabling environment for carrying out the research work.
Funding
The research was supported by the Council of Scientific and Industrial Research-Human Resource Development Group (CSIR-HRDG), New Delhi, India (Award No. 09/719(0101)/2019-EMR-1) as a fellowship.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no competing interests.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Khatri, P., Gupta, K.K. & Gupta, R.K. A review of partial least squares modeling (PLSM) for water quality analysis. Model. Earth Syst. Environ. 7, 703–714 (2021). https://doi.org/10.1007/s40808-020-00995-4
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s40808-020-00995-4