Introduction

Starch production plants produce high strength wastewater. Several planted crops such as corn, potato, tapioca, wheat, etc. are used to extract starch. Starch wastewater properties vary according to the type of feed stocks, extraction method, level of technology, and the purity of products. The type of feed stocks and wastewater properties of several starch factories are presented in Table 1. As it is shown, high chemical oxygen demand (COD), high biochemical oxygen demand (BOD), high solid contents, and acidic pH are the common characteristics of starchy wastewaters.

Table 1 Different starch wastewater characteristics

It is important to note that according to Tehran Province Water and Wastewater organization (TPWW), the maximum allowable COD of wastewater that is allowed to directly discharge into the surface waters is 60 mg L−1.

The technologies that are used in starch wastewater treatment plants are classified into three categories: (1) biological, (2) physical, and (3) chemical methods.

Colin et al. (2007) have studied the treatment of cassava starch wastewater using anaerobic horizontal flow filter. At steady state conditions and maximum organic loading rate (11.8 g COD L−1 d−1), 87% of the inlet COD was removed. Rajbhandari and Annachhatre (2004) assessed the possibility of an anaerobic pond system for treatment of starchy wastewater. Wastewater was treated in a series of anaerobic ponds with a total area of 7.39 ha followed by facultative ponds with an area of 29.11 ha. Overall COD and TSS removal of 90% was observed. Movahedyan et al. (2007) treated the starchy wastewater using an anaerobic baffled reactor. In optimum conditions, the COD removal of 67% was reported. Rajasimman and Karthikeyan (2007) used a fluidized bed bioreactor with low density particles to treat high organic concentration wastewater of starch industry. At the COD of 2250 mg L−1 and the hydraulic retention time of 24 h, the optimum COD removal of 93.8% was reported. Furthermore, there are several methods using aerobic biological processes to treat starch wastewater (Pirmoradian 1997; Kian 2010).

It is worthwhile noting that during biological and chemical treatment of wastewater, the possibility of starch extraction would be eliminated. Furthermore, the efficiency of chemical treatment is low and insufficient to achieve the stringent discharge standard. So, in current study, membrane technology has been selected to improve the efficiency of starch production process and wastewater treatment.

Cancino et al. (2006) used a hydrophilic polyethersulfone membrane to treat a corn starch wastewater. First, they treated the wastewater using a microfiltration membrane with a pore size of 0.2 µm at a trans-membrane pressure (TMP) of 250 kPa. Permeate contained only 17% of the original wastewater BOD5. In second step, they used a reverse osmosis module for further treatment of wastewater. The permeate had only 0.2% of the original wastewater BOD5. In another study, Mannan et al. (2007) investigated the possibilities of recycling the concentrated retentate back to production line by using MF and RO membranes. Permeate flux above 100 L m−2 h−1 was achieved for the 100 nm membrane. The reported COD and BOD5 removal percentages were approximately 60%.

There are several methods for wheat starch production. In Fig. 1, the block process diagram of wheat starch production (Dough–Batter process) is shown.

Fig. 1
figure 1

Block flow diagram of hydro cyclone production process (Dough–Batter)

In hydro cyclone (Dough–Batter) process, two types of starch are produced (type A and type B). The distinctive difference between these two types of starch is the degree of polymerization. The granule size of starch type A is larger than the other one.

First, dough, water, and salt are mixed for 2 h. Dough/gluten hydration process takes place in a hydration tank. Then, through using dough washer, gluten is separated from starch solution, dried, grinded and prepared for sale. The resulted solution contains two types of starch (A and B). Starch type A and B are separated using centrifuge device and dried.

As illustrated in Fig. 1, a large portion of wastewater originates from centrifuging the solution of starch type B. In other words, centrifuge device is not capable of separation of the starch type B from the solution completely and some of the starch would be existed in the stream discharged to wastewater treatment plant (Pirmoradian 1997; BeMiller and Whistler 2009).

Cleaner production is not necessarily using expensive tools to reduce the contamination of industries. Cleaner production also means applying simple tools and making innovative methods to improve conventional systems, which would result in less contamination (Sans et al. 1998). There are a few researches (Bujak 2009; Dakwala et al. 2009; Virunanon et al. 2012) that investigated the improvements in starch production systems to achieve cleaner production.

There are some theoretical approaches to predict the performance of membrane process. These approaches are based on some models such as mass transfer model (Bhattacharjee and Datta 1997; Lin and Juang 2001), gel-polarization model (Palacios et al. 2002), osmotic pressure model (Wijmans et al. 1984; Ghose et al. 2000), Brownian diffusion model (Samuelsson et al. 1997), and shear-induced diffusion model (Kromkamp et al. 2002; Vincent Vela et al. 2007). The complexity of these mathematical models and their non-universality would limit their application.

To face this issue, several works were fulfilled to investigate the applicability of intelligent systems to simulate membrane processes. These methods would be classified as artificial neural networks (ANNs) (Bowen et al. 1998; Farshad et al. 2011; Chen and Kim 2006), adaptive neuro-fuzzy inference systems (ANFISs) (Shahsavand and Chenar 2007; Rahmanian et al. 2012), and fuzzy inference systems (FISs) (Sargolzaei et al. 2008; Altunkaynak and Chellam 2010). Accordingly, these methods that are based on the direct analysis of obtained data could simulate the membrane processes. Also, they have the ability to determine the unpredictable relations between input and output variables in many processes, which are complicated (Raasimman et al. 2010).

In general, the main preference of the systems such as FIS over other methods is that the desired predictions can be performed in an easy, fast, and accurate way which is not achievable using other forecasting tools.

In this study, a hydrophilic polyethersulfone membrane fixed in a plate and frame module was applied to modify hydro cyclone (Dough–Batter) process. The aim of this modification is to (1) improve the efficiency of starch production line and (2) decrease the organic loading rate and COD discharged into wastewater treatment system or environment. Also, FIS was applied to model non-linearity of this system. Several operators (implication, aggregation, and defuzzification) were applied and their abilities to model the permeate flux and rejection factor values were examined and the best structures for each output were selected. Finally, a comparison has been made between actual values and data obtained by FIS.

Methods and materials

Membrane properties

Hydrophilic polyethersulfone membrane (GE company,USA) with pore size of 0.65 µm, minimum bubble point of 19 psi, typical flow rate of 100.875 mL min−1 cm−2, operating pH 1–14, membrane thickness 110–150 µm, and maximum operating temperature of 130 °C was used. The membrane area was 49 cm2 (7 × 7 cm). The plate and frame membrane module was made of steel.

Modifying the process

The process flow diagram of the modified process is shown in Fig. 2. The outlet stream from the centrifuge B that previously was discharged into wastewater treatment plant, in this way, is forwarded to a storage tank (ST). To prevent the starch settling in the tank, a mixer is also installed. Using a pipeline (PL-1), the feed stream containing low concentration of starch, leaves the tank and passes through a valve (V-1) and enters into the centrifugal pump (CP) with the maximum head of 50 m and maximum flow rate of 50 L min−1. In this study, several values of flow rates must be investigated; so the feed is divided into two streams. One is recycled to the storage tank (PL-4), and the other stream enters the heat exchanger (HE), after passing from a valve (V-2). The flow rate is regulated by valves V-2 and V-3. The effect of temperature on starch separation could be important due to affecting coagulation phenomena; so the temperature variations should be studied. To achieve the appropriate temperature interval, a heat exchanger (HE) which is mentioned earlier was used. As it is shown, the feed with the adjusted temperature passes through a flow meter (FM) with a maximum flow rate of 20 L min−1 and a valve (V-4) to enter into the plate and frame membrane module.

Fig. 2
figure 2

Process flow diagram of modified process

To measure and adjust the operative pressure, two manometers are installed on the feed stream (B-1) and the retentate stream (B-2). The operative pressure is adjusted using valves V-4 (inlet pressure) and V-5 (outlet pressure). TMP is calculated using the following equation:

$${\text{TMP}} = \frac{{p_{\text{i}} + p_{\text{o}} }}{2} - p_{\text{p}} ,$$
(1)

where \(p_{\text{i}}\) and \(p_{\text{o}}\) are inlet and outlet pressure, respectively, and \(p_{\text{p}}\) is the pressure of permeate side.

The permeate flow leaves the membrane module through the valve (V-6). A portion of the retentate stream is recycled to the feed tank, and the left would be recycled to the centrifuge B for re-extracting the starch type B. It is worthwhile considering that the ratio of these two streams is adjusted by valve V-5.

The performance of membrane filtration was evaluated by two variables; permeate flux and rejection factor. The permeate flux was defined as follows:

$$J_{\text{p}} = \frac{V}{{{{A}} \times {{t}}}},$$
(2)

where \(J_{\text{p}}\) is the permeate flux (L m−2 min−1), V is the permeate volume (m3) that has been collected during time t (min), and A is the active area of membrane. The rejection factor was defined as follows:

$$R = 1 - \frac{{{\text{COD}}_{\text{p}} }}{{{\text{COD}}_{\text{f}} }},$$
(3)

where \({\text{COD}}_{\text{p}}\) and \({\text{COD}}_{\text{f}}\) are the value of COD in permeate and feed streams, respectively. Influent and effluent COD were measured by standard methods (APHA 1998). Fouling could limit using membrane technology, strongly. In other works of these authors, the procedure of flux retrieving, type of backwashing, etc. for starch wastewater treatment using membrane filtration have been discussed in details (Moghaddam et al. 2013a, b, 2016).

Fuzzy inference system

Fuzzy inference systems (FISs) have the ability to figure out unpredictable relations between input and output variables in many processes, which are too complicated to model by other techniques. Many investigations have been carried out in using FIS to model the membrane processes and wastewater treatment (Sadrzadeh et al. 2009). Modeling and simulation by FIS consist of two steps: (1) determining the input–output space partition and the number of rules that have to be used by the fuzzy system, and (2) finding the optimum value of the parameters that were involved in the fuzzy system (Lin and Ho 2005). The fuzzy rules that are used in fuzzy set theory are very close to human language. Therefore, this property would make the explanation and justification of the predictions easier (Rahmanian et al. 2011).

Figure 3 shows a FIS, which consists of fuzzifier, defuzzifier, and fuzzy inference engine. In Fig. 3 X and Y are input and output data sets, respectively. A fuzzy set is characterized by a membership function µ f ϵ[0, 1], which determines a grade of membership for each element within the fuzzy set (Freissinet et al. 1999).

Fig. 3
figure 3

Fuzzy expert system approach

The two most important types of fuzzy inference method are Mamdani’s fuzzy inference method, which is used to predict the permeate flux and rejection factor, and so called Sugeno or Takagi Sugeno–Kang method (Yaqiong et al. 2011).

In the Mamdani fuzzy model, the “if–then” rules take the place of the usual set of equations used to characterize a system. The general “if–then” rule structure of the Mamdani algorithm is given in the following form (Rahmanian et al. 2011):

$${\text{If }}\;X\;{\text{ is}}\; \, A_{i} \;{\text{and }}\;Y\;{\text{ is}}\; \, B_{i} \;{\text{and }} \ldots \;{\text{ then }}\;Z\;{\text{ is }}\;C_{i} \;{\text{and}} \ldots$$
(4)

i = 1, 2,…, n where X and Y are input variables and Z is the output.

The first step is to fuzzify the input variables. The fuzzifier maps the input data X into the fuzzy set A, Y into the fuzzy set B, and so on. The next step is to evaluate the truth value for the premise of each rule, and then applying the result to the conclusion part of each rule using the fuzzy implication. The membership functions defined on the input variables are applied to their actual values to determine the degree of truth for each rule premise. MATLAB software R2010a (7.10.0.499) was used to construct and simulate the membrane performance.

Design experiment

The operative parameters that affect the performance of the membrane process are TMP, flow rate, temperature, pH, and feed concentration. It was found that TMP has no considerable influence on rejection factor but improves the value of permeate flux (Sargolzaei et al. 2008). So, during conducting the experiments, the value of TMP was set at possible maximum value of 2.5 bar. The experiments layout that was adjusted by response surface methodology (RSM) and output variables, have been shown in Table 2. Sometimes, there are variables, out of control, that would affect the result of experiment. In these situations, the effects of these variables (noises) would be alleviated using blocks. In Table 2, one block was introduced because no uncontrollable variable has been detected.

Table 2 Design layout of experiments in one block

The experimental plan and data analyzing generation were performed using Design Expert software (Vaughn 2011). Applying RSM to arrange experimental layout resulted in a considerable decrease in the number of experiments and also the interactions between parameters were considered. In previous work (Moghaddam et al. 2013a, b), according to the results that have been shown in Table 2, two regression models for the permeate flux (PF), rejection factor (RF), and COD of permeate were generated as follows:

$${\text{PF}} = 2.22 + 0.54 \times F + 0.028\; \times \;T - 0.20 \; \times \;{\text{pH}} - 0.15 \times C - 0.04 \times F^{2} - 0.0006632 \times T^{2} ,$$
(5)
$${\text{RF}} = 78.26 - 0.43 \times F + 0.54 \times T - 0.11 \times {\text{pH}} + 2.04 \times C - 0.023 \times T \times {\text{pH}} - 0.029 \times T \times C ,$$
(6)
$${\text{COD}}_{\text{f}} = 93912 \times C - 516 \times F \times C + 648 \times T \times C - 132 \times {\text{pH}} \times C + 2448 \times C^{2} - 27.6 \times T \times {\text{pH}} \times C - 34.8 \times T \times C^{2} .$$
(7)

The accuracy of models was measured using correlation coefficient index (R 2) and mean square error (MSE), which are defined as follows:

$${\text{MSE}} = \frac{{\mathop \sum \nolimits \left( {y_{{{ \exp } .}} - y_{{{\text{pred}} .}} } \right)^{2} }}{N},$$
(8)
$$R^{2} = 1 - \frac{{\mathop \sum \nolimits \left( {y_{{{ \exp }.}} - y_{{{\text{pred}} .}} } \right)^{2} }}{{\mathop \sum \nolimits \left( {y_{{{ \exp } .}} - \bar{y}} \right)^{2} }},$$
(9)
$$\bar{y} = \frac{{\mathop \sum \nolimits y_{{{ \exp } .}} }}{N},$$
(10)

where \(y_{{{ \exp } .}}\) and \(y_{{{\text{pred}}.}}\) are experimental and predicted values, respectively, and N is the number of data.

Result and discussion

Fuzzy model

The ranges of four input variables were divided into three sections: low (L), middle (M) and high (H). In Fig. 4 the membership function of flow rate has been illustrated. The membership function of other input variables is just like flow rate. As it is shown in Fig. 5, to improve the preciseness of the model, nine membership functions were defined for permeate flux and rejection factor. For input and output parameters, Gaussian and triangle membership functions were used, respectively. Gaussian membership function is as follows:

$$f\left( x \right) = e^{{\frac{{ - \left( {x - c} \right)^{2} }}{{2\sigma^{2} }}}} ,$$
(11)

where c and σ are the parameters for the determination of the shape of the curve.

Fig. 4
figure 4

Membership function of flow rate

Fig. 5
figure 5

a Membership function of permeate flux, b membership function of rejection factor (VL very low, L low, VMO very moderate, MO moderate, M medium, I increase, VI very increase, H high, VH very high)

Figure 6 shows the architecture of FIS that was built for modeling the permeate flux and rejection factor. To increase the model accuracy, two separate FIS were generated for permeate flux and rejection factor. In current research, both Mamdani and Sugino model were investigated and no distinctive difference was observed between the data predicted by these two models. So, only Mamdani results have been shown.

Fig. 6
figure 6

The generated fuzzy inference system

Permeate flux (J p)

Several structures of FIS have been generated and their ability to predict the output variables was examined in terms of the decreasing values of R 2 and MSE. As is presented in Table 3, the best agreement between experimental and predicted data for permeate flux is specified by R 2 of 0.9752 and MSE of 0.0072 where defuzzification operator was centroid. It is important to note that the number of rules for FIS of permeate flux is 48. Figure 7 shows the values of residuals vs. run number for permeate flux predicted by FIS. According to Fig. 7, the maximum difference between the actual and the predicted value in this case is 0.2. Figure 8 represents a comparison between the actual and the predicted permeate flux through a bar plot. As is shown, there is a good agreement between predicted and actual values. This agreement shows the excellent ability of this model in data predicting.

Table 3 Comparison and selection of the best structures of FIS by Mamdani method for permeate flux
Fig. 7
figure 7

Residual vs. run no. for best FIS model of permeate flux

Fig. 8
figure 8

Comparison between the fuzzy predicted and experimental data for the permeate flux (J P)

Figure 9 shows the relationship between the permeate flux and four operative parameters through surfaces generated by FIS.

Fig. 9
figure 9

FIS surfaces for permeate flux

It would be expected that the cake resistant would be raised after increasing the concentration, and the permeate flux would decrease. But, as it is shown in Fig. 9, the permeate flux increases with increasing concentration at 20 °C. In the following, the reason of this unusual phenomenon would be explained:

Specific cake resistance, α, is defined as the cake resistance, Rc, normalized by the accumulated cake mass per unit of membrane area (Listiarini et al. 2009):

$$\alpha = \frac{{R_{c} }}{{{\raise0.7ex\hbox{$M$} \!\mathord{\left/ {\vphantom {M A}}\right.\kern-0pt} \!\lower0.7ex\hbox{$A$}}}},$$
(12)

where M is the mass of the cake deposited on membrane surface, and A is the effective area of membrane. According to the Carman–Kozeny equation, α is inversely proportional to cake porosity (Chang and Kim 2005):

$$\alpha = \frac{{180\left( {1 - \varepsilon } \right)}}{{\rho d_{\text{p}}^{2} \varepsilon^{3} }},$$
(13)

where d p is the particle diameter, ε is the porosity of cake layer, and ρ is the particle density.

According to Eq. 12, specific cake resistance is strongly dependent on d p. So that by increasing the size of solids the cake resistant would decrease. In this case, increasing concentration at low temperature led to forming larger particles that caused decreasing cake resistant and increasing permeate flux.

Rejection factor

Similarly to permeate flux, the values of R 2 and MSE corresponded to several structures of FIS for the rejection factor have been shown in Table 4.

Table 4 Comparison and selection of the best structures of FIS by Mamdani method for rejection factor

To check the adequacy of the final model, the predicted rejection factor vs. the actual values was checked and illustrated in Fig. 10. It is important to note that these predicted values were corresponded to the best FIS structure. In this case, the points that follow a straight line (y = x) confirm that errors are normally distributed with a mean of zero. Furthermore, the R 2 of 0.9711 implies the normal distribution of predicted values versus experimental values.

Fig. 10
figure 10

Predicted rejection factor by FIS vs. experimental data

In Fig. 11, four 3-D plots that show the rejection factor in terms of two operative parameters have been illustrated. As is shown, in middle range, change in temperature has no considerable effect on rejection factor and the rejection factor remains relatively constant. But at temperature below 30 °C or higher than 50 °C, considerable change in rejection factor would be seen by changing the temperature. Similar effect was observed for change in pH values.

Fig. 11
figure 11

FIS surfaces for rejection factor

Amount of recovered starch

We assume a wheat starch production plant discharges 1500 m3 d−1 (1041.67 L min−1) of wastewater with the average COD of 7000 ppm, pH of 6, and temperature of 30 °C to the wastewater treatment system. Starch concentration of 5.83 g L−1 resulted in COD value of 7000 ppm.

As it is mentioned earlier, the performance of modified system was evaluated by rejection factor and the permeate flux. So, the optimum design of system was investigated in two states. In states Ι and ΙΙ, the aim is to obtain the maximum permeate flux and the maximum rejection factor, respectively.

State Ι

According to the values of COD, pH, and temperature, Eq. 5 was simplified to the following equation:

$${\text{PF}} = 0.39 + 0.54 \times F - 0.04 \times F^{2} .$$
(14)

Figure 12 shows block scheme of calculation for State I. Using Eq. 14 it is found that the maximum permeate flux would be obtained at the flow rate of 6.75 L min−1.

Fig. 12
figure 12

The flowchart of calculating the total recovered starch for state Ι

At this value of flow rate, according to Eq. 6, the value of rejection factor was 93.58%. Consequently, 4787.48 kg of starch would be recovered from wastewater, annually. Additionally the COD value of wastewater decreased from 7000 to 449 ppm.

State ΙΙ

According to Eq. 6, the rejection factor is maximum when the flow rate is at its minimum value. So, the flow rate was adjusted at 3 L min−1. 8183.24 kg of starch could be recovered each year, accordingly. Figure 13 shows the block diagram for calculating the recovered starch for state ΙΙ.

Fig. 13
figure 13

The flowchart of calculating the total recovered starch for state ΙΙ

The minimum and maximum values of flow rate are 3 and 10 L min−1, respectively. Wastewater flow rate is 1041.67 L min−1. By selecting 347 and 104 membrane module, minimum and maximum values of flow rate for each membrane module would be achieved, respectively. Figure 14 shows the permeate flux of each membrane module (a), rejection factor (b), and total recovered starch (c) vs. number of membrane modules. Just similar to state ΙΙ, the maximum starch would be recovered when the number of membrane module was at maximum value or flow rate was at minimum value. In Table 5, the amount of water and energy saving achieved by modified system have been shown.

Fig. 14
figure 14

a The permeate flux of one membrane module, b rejection factor, and c total recovered starch vs. number of membrane modules

Table 5 The advantages of modified system

Conclusion

In this study, a hydrophilic polyethersulfone membrane was applied for recovering the starch and recycling it into the production line. The maximum permeate flux and rejection factor were observed to be 2.42 L m−2 min−1 and 97.5%, respectively, that were acceptable. Also, the capabilities of several structures of fuzzy inference system (FIS) for simulating the membrane filtration of starch were investigated and compared with each other. The best correlation coefficient index (R 2) and mean square error (MSE) for predicted permeate flux were 0.9704 and 0.0086, respectively. Similarly, R 2 and MSE for predicted rejection factor were 0.9711 and 0.4743, respectively. According to the regression models that were obtained by response surface methodology (RSM), the recovered starch under different conditions has been calculated. The maximum recovered starch was 8183.24 kg year−1, which caused a huge amount of energy and water saving.