Introduction

Natural organic matter (NOM) is a complex mixture of heterogeneous organic matter present in natural waters. NOM affects the performance of the unit processes of water treatment, in addition to the biological stability in drinking water. The amount, character, and properties of NOM differ in waters and depend on the biogeochemical cycles of the surrounding environments. Disinfection by-products (DBPs), of which trihalomethanes (THMs) and haloacetic acids (HAAs) are the major groups, were formed when NOM reacts with a disinfectant such as chlorine. The varying leves of DBPs formation are dependent upon the reactivity of the NOM, chlorine dose, pH, temperature, reaction time, and bromide concentration (Roccaro et al. 2008; Han et al. 2015). Therefore, the removal of dissolved organic in drinking water treatment is important for controlling disinfection by-products (DBPs) formation and for preventing water quality deterioration in distribution system caused by biofilm formation.

Several treatment options have been studied for NOM removal from source water. Generally, coagulation is considered as the most common and feasible process for NOM removal. The removal of NOM by coagulation using inorganic coagulant is influenced by the character of NOM and factors such as pH and alkalinity of source water. NOM can be characterized in terms of its humic and non-humic fractions, its hydrophobic and hydrophilic properties, and the molecular weights of its constituents (Tran et al. 2015; Hidayah et al. 2016). Humic fractions, which generally have higher aromatic content, higher charge, and more hydrophobic tend to be more easily removed by coagulation through charge neutralization mechanism (Aslam et al. 2013; Hidayah et al. 2017). In charge neutralization, cationic metal hydrolysis products interacts with anionic NOM to form insoluble charge-neutral products. The non-humic fractions, which consist of biopolymers, such as polysaccharides and proteins, are more hydrophilic. The possible coagulation removal mechanism of these fractions is to be adsorbed onto metal hydroxide surface or floc (Chow et al. 2009). In addition, the formation of insoluble metal hydroxide will cause entrapment of NOM or sweep coagulation. Floc formed during various coagulation mechanisms has been observed to have different properties of size, structure, and strength, which is influenced by coagulant type and dosage (Sharp et al. 2006; Chow et al. 2009).

The characteristics of NOM were also found to affect DBPs formation. Hydrophobic NOM, content phenolic acidity, and ultraviolet absorbance showed higher halogenated organics formation potential (Chen et al. 2008; Wang et al. 2013). It is considered to be the more important source of DBPs precursors than hydrophilic. However, hydrophilic compounds are also found to contribute in the formation of DBPs, especially in low humic waters (Hua and Reckhow 2007; Han et al. 2015).

A number of characterization techniques have been employed to obtain a better understanding of the types of NOM present in the source water and their removal or transformation through the water treatment process train (Tran et al. 2015; Lai et al. 2015). High-performance size exclusion chromatography (HPSEC), especially with organic carbon detector (OCD), has proved to be an attractive option for NOM characterization, due to its ease of operation, its ability to detect any type of organic carbon bonded species, and also its simplicity of sample preparation and requirement for minimal sample volume (Lai et al. 2015; Hidayah et al. 2016).

In this study, HPSEC was used to characterize the NOM in the source water and the treated water from coagulation under various dosage of alum. Peak-fitting technique was applied to resolve the chromatographic peaks of HPSEC and provide quantitative information on NOM fractions removed by coagulation. Further, NOM fractions’ removal, trihalomethanes formation potential (THMFP), and haloacetic acid formation potential (HAAFP) reduction, and NPDOC removal were also explored.

Materials and methods

Source water

The source water used in this study was collected from the effluent of the slow sand filters in drinking water treatment plant in Kinmen island, Taiwan. The raw water of the plant was drawn from an eutrophic lake. The treatment train before slow sand filtration included coagulation, dissolved air floatation, and rapid sand filtration.

Coagulation

Alum (Al2(SO4)3·18H2O, Merck, Germany) was used as coagulant with various dosage 4, 8, and 16 mg/L as Al (0.15; 0.30; 0.59 mmol/L as Al). A variable speed, six-paddle gang stirrer (Jar test apparatus, Phipps and Bird, Richmond, Virginia, USA) was used. The filtered source water samples (1 L) were placed on the gang stirrer with three samples tested at a time and the coagulant was added while stirring at 100 rpm. After 3 min of rapid mixing at 100 rpm, the speed was reduced to 35 rpm for 15 min, then allowed to settle for 15 min. The supernatants were collected and filtered through 0.45 µm filter (cellulose acetate, Advantec, Japan) before further analysis.

Water quality analysis

The water samples, including source water and the supernatants, were analysed for general physicochemical characteristics, such as non-purgeable dissolved organic carbon (NPDOC), ultraviolet absorbance at 254 nm (UV254), pH, and alkalinity, based on procedures from Standard Methods (APHA et al. 2012). NPDOC was measured by the combustion-infrared method using a total organic carbon analyzer (Model TOC-500, Shimadzu, Kyoto, Japan). UV254 was measured using a UV/Vis spectrophotometer (Model U-2001, Hitachi, Japan) with a 1-cm quartz cell.

HPSEC-OCD and UVD analysis

To investigate the composition of dissolved organic matter as a function of apparent molecular weight (AMW), high-performance liquid chromatography (HPLC, LC-20 ATV, Shimadzu, Japan)-size exclusion chromatography (SEC) was conducted with sequential on-line detectors UVD-OCD. The chromatographic column (20 (I.D.) × 200 (L) mm TSK HW-50S, Toso, Japan) was a weak cation exchange resin (Toyopearl HW-50S, 30 µm) based on polymethacrylate basis. The eluent system consisted of phosphate buffer (2.4 mM NaH2PO4 + 1.6 mM Na2HPO4; pH 6.8), containing sodium sulphate (25 mM), to achieve an ionic strength of 100 mM. The eluent flow rate was 0.5 mL/min, and the sample volume is 2 mL. The system was calibrated with polyethylene glycol (PEG) standards ranging from 300 to 110,000 g/mol, as PEG was reported to have low undesirable interactions with Toyopearl HW resin within the column (Her et al. 2002). A semilog calibration curve (Fig. 1) was used to calculate the AMWs. Prior to chromatographic separation, the samples were made particle-free by passing through a 0.45-µm filter (cellulose acetate, Advantec, Japan).

Fig. 1
figure 1

PEG calibration curve for molecular weight distribution

Chromatograms were analysed using peak-fitting technique to resolve the overlapping peaks and to determine the area under each peak. PeakFit (Version 4.12, Systat Software Inc., CA, USA), a commercially available software package, was used for this analysis. The procedure was adapted from the method described by Lai et al. (2015).

Disinfection by-products formation potential (DBPFP) analysis

For the DBP precursor analysis, the pH value of water samples was kept at 7.0 ± 0.2 by first adding H2SO4 or NaOH (Merck, Germany) solution, and followed by adding adequate amount of phosphate buffer solution (0.8 M, Merck, Germany). Then, an adequate amount of sodium hypochlorite solution (30 mg Cl2/mL, Merck, Germany) was injected into the samples to ensure that at least 1 mg/L free residual chlorine existed at the end of a 7-day (25 °C) incubation period. Residual chlorine was measured by the DPD ferrous method (APHA et al. 2012). The samples were then dechlorinated by adding sodium thiosulfate solution (0.04 M, Merck, Germany).

Four species of trihalomethanes, namely chloroform, dichlorobromomethane, dibromochloromethane, and bromoform, were measured by the purge and trap packed-column gas chromatographic method using a gas chromatograph (Model 3400 GC, Varian, USA) equipped with a purge and trap module (Model LCS-2000, Tekmar USA) and an electron capture detector. The measurement of nine species of haloacetic acids, namely monochloroacetic acid, monobromoacetic acid, dichloroacetic acid, dibromoacetic acid, trichloroacetic acid, tribromoacetic acid, bromochloroacetic acid, dichlorobromoacetic acid, and dibromochloroacetic acid, basically involved liquid–liquid extraction with methyl tertiary butyl ether (MTBE) and esterification with diazomethane prior to GC-ECD analysis.

Results and discussion

Characteristics of source water

The general water quality of the source water, which was the effluent from slow sand filter, is shown in Table 1. It can be noticed that source water has neutral pH and low alkalinity. The NPDOC value indicates that the source water has high dissolved organic content. Further, the SUVA value indicates that the dissolved organics is rich in hydrophilic compounds. As certain types of NOM adsorb UV254 light per unit concentration of DOC to a great degree than other types, SUVA is an indicator of NOM composition in water. It has been reported that water samples with SUVA values higher than 4 indicate that NOM is mainly composed of aquatic humic matter, while water samples with SUVA values lower than 2 contain mainly non-humic matter, which generally is more hydrophilic (Silanpää et al. 2015a).

Table 1 Characteristics of source water prior to coagulation

Figure 2 shows the variations in the dissolved organic content with AMW of the source water, obtained using by HPSEC chromatography equipped with OCD, as well as the results from peak-fitting. The higher peak heights reflect higher concentration of dissolved organic compound. The major dissolved organic groups of the source water based on AMW include four peaks, namely Peaks A, B, C, and D, with AMW of about 50,000, 1650, 1300, and 630 Da, respectively.

Fig. 2
figure 2

Distribution molecular weight of organic fractions as detected by HPSEC-OCD in slow sand filter effluent

Figure 3 shows the overlapping figures of the HPSEC-OCD and UVD chromatograms. It indicates that Peak A of HPSEC-OCD has no corresponding peak in UVD chromatograms, while the Peaks B, C, and D have. As OCD can detect all NOM containing carbon, while only NOM with conjugated or aromatic organic constituents absorb ultraviolet light, Peak A is probably contributed by biopolymers, such as polysaccharide or protein, which are hydrophilic. The other peaks can be associated with humic substances, HS (Peak B), building blocks (Peak C) which represent HS-like materials of lower molecular weight, and low molecular weight acids (Peak D) (Huber et al. 2011; Lai et al. 2015). The peak area and percentage distribution of the resolved peaks of the HPSEC-OCD chromatogram in source water by peak-fitting are shown in Table 2. It can be noticed that the percentage area contributed by Peak A is higher than those of other peaks. This means the source water contains quite large amount of hydrophilic organic, which is also reflected in the low SUVA value of the source water.

Fig. 3
figure 3

HPSEC-OCD and -UVD of slow sand filter effluent

Table 2 The peak area and percentage distribution of the resolved peaks of the HPSEC-OCD chromatograms of source water with peak-fitting

The raw water of the treatment plant is from an eutrophic lake, and the water samples for this study were collected in March, which was in the middle of dry season. Therefore, the hydrophilic organic is probably due to the biological activities in the lake itself, as seen in the production of extracellular polymeric substances (EPS), which include polysaccharide, protein, and so on (Sillanpää et al. 2015b). However, the area of Peak B, C, and D, which contain UV-absorption hydrophobic organics, contributed significantly to total area of the chromatograms. Therefore, their effect on the water treatment, such as DBPs formation, could not be ignored. As the source water contains high concentration of organic matter, and coagulation is the most efficient unit for NOM removal in conventional water treatment process, it was employed for investigating the characteristic of organic fractions related to DBPs formation.

NOM fractions removal by coagulation

The percentage removal of NPDOC and UV254 under various dosage is shown in Fig. 4. First, it reveals that the removal of both parameters increased with increasing coagulant dosage. However, it levelled off when alum dosage was higher than 12 mg/L as Al (0.45 mmol/L as Al). The highest NPDOC removal is about 30%. This is consistent with that reported in the literature, which indicated that the DOC removal by coagulation was less than 20–40%, when the SUVA value of the source water was less than 2 (Silanpää and Matilainen 2015). This is because of low SUVA value indicates the NOM in source water contains high proportion of hydrophilic organics, which are more difficult to be removed by coagulation. Figure 4 also shows the percentage of UV254 removal is higher than that of NPDOC. This can be explained by the fact that NOM molecules with aromatic structure and conjugated C=C double bond, i.e. the humic fractions, absorb more UV light per unit concentration of DOC than the general NOM molecules (Silanpää et al. 2015a).

Fig. 4
figure 4

Organic removal under various alum dosage

Figure 5 shows the pH values of supernatants from coagulation drop consistently with increasing alum dosage. This is due to the reaction between alum and alkalinity; more alkalinity was reduced by increasing coagulant dose with a consequent reduction in pH.

Fig. 5
figure 5

pH values of supernatants under various alum dosage

Figure 6 shows the HPSEC-OCD and UVD chromatograms of the source water and supernatants under alum doses 4, 8, and 16 mg/L as Al (0.16; 0.30; and 0.59 mmol/L as Al). Table 3 lists the area of Peak A–D (from peak-fitting) of the source water and the supernatants after coagulation, and the percentage reduction in the area of each peak, as compared to that of the source water. First, it can be seen that the heights of all peaks of the HPSEC chromatograms decreased with increasing coagulant dosage, also shown in Table 3. Table 3 also shows that the area of individual Peak A to D and the sum of the area of all peaks decreased with increasing alum dosage. This is consistent with the increase in NPDOC removal with increasing alum dosage, as shown in Fig. 4. According to Figs. 4 and 6, significant removal of UV254 than NPDOC under alum doses less than 8 mg/L indicated removal of humic substance fraction as represented by Peaks B, C, and D removal. Further, NPDOC removal became close to UV254 removal under alum doses higher than 8 mg/L. It is probably related to the removal of biopolymer fractions, which is represented by Peak A.

Fig. 6
figure 6

a HPSEC-OCD and b HPSEC-UVD chromatograms of coagulated water under various alum dosages

Table 3 The organic fractions’ removal by coagulation under various alum dosage using HPSEC-OCD chromatograms combined with peak-fitting

Second, Table 3 also reveals that Peak A has almost similar reduction as that of Peak B, while Peaks C and D have much lower removal. The decrease in Peak A of coagulated water corresponded well to the reduction of larger molecular weight by charge neutralization and bridge action, while the remaining compound presented a low conjugated bond of Peak A that could not be removed through those mechanism (Han et al. 2015). According to Fig. 4, Peaks B, C, and D of OCD have corresponding peak in the UVD chromatograms. Peak B, which represents humic substances, also Peaks C and D, which represent lower molecular weight, are featuring molecules with aromatic structure and conjugated C=C double bonds, contain NOM which are more hydrophobic than Peak A. The percentage reduction of the sum of the areas of Peaks B, C, and D is higher than the reduction of Peak A. It has been well established in the literature that hydrophobic are more amenable to removal by coagulation than hydrophilic, since hydrophobic NOM has been noted to carry high levels of negative charge due to the presence of ionized groups such as carboxylic (Zhu et al. 2014; Sillanpää and Matilainen 2015; Hidayah et al. 2016).

Table 4 shows the NPDOC, THMFP, and HAAFP of the source water and supernatants from coagulation under alum dosage of 4 and 8 mg/L (as Al). It clearly shows that the reduction in THMFP is higher than that of NPDOC and HAAFP. It has been well established in the literature that hydrophobic NOM, which generally has higher aromatic content, phenolic acidity, and ultraviolet absorbance, shows higher halogenated organics formation potential (Chen et al. 2008; Wang et al. 2013). This means the subgroup of NOM represented by THMFP should have higher SUVA value and hydrophobicity than that of the general natural organics. As coagulation also tends to have higher removal of hydrophobic NOM than hydrophilic fractions, the THMFP reduction by coagulation is higher than HAAFP and NPDOC. This is also consistent with the results show in Table 3, which indicates that hydrophobic fractions, which is mainly represented by Peaks B reduction, has the largest reduction than hydrophilic fractions by coagulation with alum. In addition, it seems that THMFP formation could be inhibited by lower pH. Figure 5 shows that pH is less than 6 under alum dosage of 8 and 16 mg/L. It indicated that decreasing of THMFP depends on the decreasing of pH value, instead of organic removal (Liang and Singer 2003).

Table 4 The NPDOC and THMFP of the source water and treated water after alum coagulation

Conclusions

Source water from effluent of slow sand filter contains more aromatic humic substances as detected by HPSEC-OCD and HPSEC-UVD, in addition to aliphatic biopolymers. This study reveals that among all NOM fractions, alum coagulation mainly removed hydrophobic aromatic, which is mainly humic substance. The reduction in THMFP was found to be higher than that of NPDOC under specific alum dosage, and the former was also found to be proportional to the corresponding reduction in the area of hydrophobic aromatic fraction, mostly humic subtances, as obtained from HPSEC chromatogram with peak-fitting.