Introduction

Tea has been under scientific attention for many years, because, as one of the most commonly consumed beverages worldwide, it provides significant amount of bioactive compounds in human diet. Tea is produced from the leaves of Camellia sinensis and is usually classified into three types: unfermented, semi–fermented and fermented tea based on the extent of fermentation. Green tea as non-fermented tea is obtained by an inactivation of the enzymes (polyphenol oxidase and peroxidase) in the fresh tea leaves either by roasting or steaming tea shoots. Thus, it contains a great number of polyphenolic compounds, mainly catechins, and alkaloids, including caffeine. Catechins account for about 30% of the dry weight of green tea leaves and most of the health benefits of green tea is referred to the high antioxidant activity of these flavonoids [1]. The fermentation of tea leaves induces enzymatic oxidation of tea catechins and their condensation to theaflavins and further polymerization of the latter to thearubigins. Both groups of compounds (theaflavins and thearubigins) are responsible for typical colour and strong, astringent flavour of black tea.

The antioxidant activity and polyphenol content of tea infusion are well documented and strong correlation between the two parameters has been observed [2]. Since the content of polyphenols has been used as the quality factor of many food products such as fruits and vegetables, also the quality of tea infusion could be expressed as phenolic content or the antioxidant activity. Green tea infusion is typically prepared by brewing tea leaves in hot water of 80 °C for 3–5 min [3], whereas black tea at 100 °C during 6–8 min [4]. Higher temperatures and prolonged time of extraction lowered the catechin (epi–form) content [3] which is not necessarily correlated with the decreasing antioxidant activity. At high temperature non-epi forms of catechins possessing high antioxidant activities could be formed [1, 5]. Although scientific research does not provide unequivocal evidence of therapeutic effects of edible plants, some of them, such as tea, are used to prevent or support the treatment of certain diseases. Because some consumers drink significant amounts of tea to maintain or improve their health, the information how to obtain tea infusion of the highest quality i.e. with high antioxidant capacity is needed. It was previously shown that factors such as temperature and time of infusion, leaf size or quality of water influence the extraction process of bioactive compounds and their antioxidant properties, but as far as is known the predictive approach to tea infusion quality had been not previously discussed.

Kinetic modelling for tea was used to describe extraction of total phenolics, individual polyphenols and caffeine [6,7,8]. The modeling techniques were also applied in food science for an objective, fast and economic monitoring of the food quality changes under various conditions [9].

Kinetic models could be divided into two categories: physical and empirical ones. Physical models are based on the physical phenomena of mass transfer through plant particles and from external solid surfaces into bulk of the liquid phase [10]. The empirical ones do not describe any physical phenomena and include rather limited information, if any, on the fundamental principles involved in the extraction process. Nevertheless, the empirical models are used as an excellent tool for curve fitting and allow presentation in a function of physical properties and process conditions of both, the extraction medium and the extracting plant material.

The typical mathematical models include Spiro’s steady state model—pseudo–first order model [8, 11,12,13], Peleg’s model [14,15,16], Page’s model [14], second-order model and logarithmic model [14, 15]. For the investigation of the temperature influence on the extraction rate constant, the Arrhenius law is often implemented [8, 17,18,19].

Recently, an attempt based on data mining techniques application for the quality modeling of various products has been undertaken. Among these techniques, multivariate adaptive regression splines (MARS) and artificial neural networks (ANNs) have gained more attention. MARS is a nonparametric regression procedure that constructs the relation between the dependent and independent variables from a set of coefficients and basis functions that are entirely "driven" from the regression data [20, 21]. This means that the model does not previously assume any functional relationship between the variables [20]. ANNs is an alternative approach to MARS techniques implemented in the finding of predictive models. They were applied in a wide range of problem domains since they could applicate very complex functions (linear and non-linear regressions) [22]. Moreover, these techniques do not need the explicit expression of physical meaning of the system/process under study, since the networks learn throughout successive trainings and the knowledge on how it works is not needed to perform the analysis thus ANNs are often ascribed to the “black–box” models [20, 22].

The aim of the study was to develop and compare the predictive models of tea infusion quality by investigating total polyphenol content and the antioxidant activity of leaf black and green teas in the function of infusion time and temperature. To this end, Spiro’s steady state, Peleg’s and logarithmic models were used as classical mathematical models. The effect of temperature was introduced to the models by the Arrhenius equation. MARS and ANNs models were also constructed as data mining techniques. Determination coefficients (R2), adjusted determination coefficients (adj.R2), correlation coefficients (r), and/or root mean square errors (RMSE) were used to assess how the experimental data fit to each model. The extraction of bioactive compounds was studied at four temperatures, namely 60, 70, 90 and 100 °C to build the predictive models. The samples extracted at 80 °C were used as controls, in order to verify the applied conditions and to validate models.

Materials and methods

Chemicals

2,2-diphenyl-1-picrylhydrazyl (DPPH), 6-hydroxy-2,5,7,8-tetramethylchroman-2-carboxylic acid (Trolox—water–soluble vitamin E analogue), Folin-Ciocalteu’s reagent were purchased from Sigma-Aldrich (Steinheim, Germany). Sodium carbonate was obtained from POCh (Gliwice, Poland). Methanol and n–hexane were purchased from ChemPur (Poland). Water was purified using the Milli–Q system (Millipore, Bedford, MA, USA).

Tea samples and tea infusion preparation

Green and black leaf teas were purchased in China and Poland, respectively. Green tea was Lv Mao Hou, grown in Zhang Yi Yuan (Yongtai, Fujian). The black tea was packaged by Poznańska Palarnia Kawy ASTRA in Poznań, and the dried tea leaves were from China’s Yunnan province.

Tea infusions were prepared by brewing of 2 g of tea leaves with 100 ml of purified water, at five different temperatures (60, 70, 80, 90 and 100 °C) and at ten extraction times (0.25, 0.5, 1, 3, 5, 8, 10, 12, 15, 30 and 60 min). The initial time was set when the water was added to the tea leaves. The mixture was stirred during brewing using the magnetic stirrer (MS–H–Pro plus, Chemland, Poland) with electronic control for constant speed. When the time of extraction was reached the leaves were separated from the aqueous extract and the extract was cooled down to room temperature. The tea extract was then filtered twice through the filter (type 388 filter, Filtrak, Niederschlag Bärenstein, Germany). All samples were prepared in triplicate.

Antioxidant activity

The DPPH free radical scavenging activity measurements were carried out according to the procedure of Sánchez-Moreno et al. [23] with some modifications [24]. The DPPH radical scavenging activity expressed in the TEAC values (Trolox equivalent antioxidant capacity in mmol of Trolox per g of tea leaves) may reflect the concentration of antioxidants (AOX) as it was previously shown by Jurinjak Tušek et al. [10].

Total polyphenol content

Total polyphenol content (TPC) was determined using Folin–Ciocalteu’s reagent (FCR) as described by Singleton and Rossi [25].The results were expressed as mg of gallic acid equivalents (GAE) per g of tea leaves.

HPLC determination of bioactive compound content

The composition and content of catechins and gallic acid in tea infusions were determined using a HPLC method developed by Enko and Gliszczyńska-Świgło [26].

Mathematical models

Analysis of the effects of infusion time and temperature was performed by fitting the experimental data of total polyphenol content and the antioxidant activity to kinetic models: Spiro’s steady state [13], Peleg’s [14, 16] and logarithmic [14] models. The Spiro’s steady state model which predicts first order behavior is expressed by the equation (Eq. 1):

$$dc/dt=k\times ({C}_{\infty }-C)$$
(1)

The integrated equation is as follows (Eq. 2):

$$\mathrm{ln}{(C}_{\infty }/({C}_{\infty }-C))= k\times t+a$$
(2)

where C is the concentration of TPC or AOX at equilibrium and C is the concentration at any time, k is the observed first order rate constant and a is the intercept that is typically small but not predicted by the model.

Peleg’s model was developed to describe the sorption isotherms of food materials [16] (Eq. 3):

$$C\left(t\right)={C}_{0}+ t/{(K}_{1}+{K}_{2}\times t)$$
(3)

where C(t) is the concentration of bioactive compounds at time t, K1 is Peleg’s rate constant and K2 is Peleg’s capacity constant and C0 is the concentration of bioactive compounds at time t = 0. The model was adopted for description of solid–liquid extraction process [15] since the shape of the extraction curves of bioactive compounds from plant sources was very similar to the one of sorption curves. As initial concentration of TPC and AOX is zero, C0 value equals zero and modified Peleg’s model was fitted to the experimental data according the equation (Eq. 4):

$$C\left(t\right)= t/{(K}_{1}+{K}_{2}\times t)$$
(4)

The Peleg’s rate constant K1 relates to extraction rate (R0) at the very beginning of the extraction process thus at t = t0 the Eq. (4) becomes Eq. (5):

$$C\left(t=0\right)={R}_{0}= 1/{K}_{1}$$
(5)

The Peleg’s capacity constant K2 corresponds to maximum extraction yield that is equilibrium concentration (Ce) of bioactive compounds thus at t →  the Eq. (4) becomes Eq. (6):

$$C\left(t\to \infty \right)={C}_{e}= 1/{K}_{2}$$
(6)

Logarithmic model was used to the experimental data [14] following the equation (Eq. 7)

$$C\left(t\right)=a\times \mathrm{ln}t+b$$
(7)

where a and b are logarithmic model constants and C(t) is the concentration of bioactive compounds at time t.

Temperature effect

Temperature dependence of the extraction kinetics could be assessed using Arrhenius equation (Eq. 8):

$$k={k}_{0 }\times \mathrm{exp }({-E}_{a}/(R\times T))$$
(8)

where k represents the extraction rate, k0 is pre-exponential factor, Ea is the activation energy, R is the universal gas constant and T is the absolute temperature.

The modified logistic Arrhenius equation was given by the equation (Eq. 9):

$$\mathrm{ln}k= \mathrm{ln}{k}_{0 }- {E}_{a}/(R\times T)$$
(9)

A plot of lnk versus 1/T gives a regression line with the slope –Ea/R and lnk0 as intercept. Arrhenius equation is also used for empirical models, but then the parameters –Ea/R and lnk0 do not really have a physical meaning and the Eq. (8) is often presented as Arrhenius–like equation (Eq. 10):

$$k={A}\times \mathrm{exp }(-B/T)$$
(10)

where A and B are the model parameters.

Multivariate adaptive regression splines (MARS)

The main idea in MARS is its ability for dividing the whole space of each independent variable into various sub–regions (the joining points of these sub–regions are called knots and are derived from the data set) [20]. Then two–sided truncated spline functions are defined as basis functions for describing the relationships between the dependent and predictor variables in each distinct intervals of the predictor variable [20, 21]. The basis functions are as follows (Eqs. 11, 12):

$${h(x-t)}_{+}= \left\{\begin{array}{l}x-t, if\;x>t\\ 0\;otherwise\end{array}\right.$$
(11)
$${h(t-x)}_{+}= \left\{\begin{array}{l}t-x, if\;x<t\\ 0\;otherwise\end{array}\right.$$
(12)

where parameter t is the knot of the basis functions and the " + " signs next to the terms (t-x) and (x-t) indicate that only positive results of the respective equations are considered; otherwise the respective functions evaluate to zero [20].

For each of the predictor variables MARS selects the pair of splines and the knot location, that best describe the dependent variable. In a next step, the basis functions together with the model parameters (estimated via least squares estimation) are combined to build the general MARS model equation, described as (Eq. 13):

$$y=f\left(x\right)={\beta }_{0 }+\sum_{m=1}^{M}{\beta }_{m}{H}_{m}(x)$$
(13)

where y is the predicted value for the dependent variable, β0 is the intercept term and βm represents a regression coefficient that is estimated by minimizing the residual sum of squares, M—the number of basis functions in the current model and Hm(x) is the product of basis functions (Eq. 14) [20, 21]:

$$H_{m} = \prod_{(k = 1)}^{K} \,h(t_{k} ;x).$$
(14)

To avoid an excessive number of spline functions in the final MARS model (so-called overfitting) the generalized cross-validation (GCV) error as an adjusted residual sum of squares (Eq. 15), in which a penalty for the model complexity (C) is incorporated (Eq. 16), is calculated

$$GCV = \,{{\sum\limits_{i = 1}^{N} {\left( {y_{i} - f\left( {x_{i} } \right)} \right)}^{2} } \mathord{\left/ {\vphantom {{\sum\limits_{i = 1}^{N} {\left( {y_{i} - f\left( {x_{i} } \right)} \right)}^{2} } {\left( {1 - \frac{C}{N}} \right)^{2} }}} \right. \kern-\nulldelimiterspace} {\left( {1 - \frac{C}{N}} \right)^{2} }}$$
(15)

with

$$C\, = \,1\, + \,cd$$
(16)

where N is the number of cases in the data set, d is the effective degrees of freedom, which is equal to the number of independent basis functions (M). The quantity c is the penalty for adding a basis function. The value of C increases with the number of basis functions in the model [20].

In this study, MARS models created for the data matrix included maximum 21 basis functions, two degrees of interaction. The penalty was set to 2, and threshold to 0.0005. Increased numbers of basis functions did not increase the values of R2 or adj. R2 nor influence the value of GCV.

Artificial neural networks (ANNs)

The data set collected in this study has been divided into: learning (65%), training (15%) and validating (20%) sets. Multilayer feed–forward fully connected ANN has been trained with the Broyden-Fletcher-Goldfarb-Shanno learning algorithm (200 epoch). The search for an appropriate ANN model was performed using multilayer perceptron (MLP) and radial basis function (RBF) networks. In total, 20 networks were tested of which the best five were retained. The network architecture created for the tea data matrix included an input layer, one hidden layer of neurons, and an output layer. The input layer consisted of two neurons, 3–9 neurons in the hidden layer for TPC values and 4–9 neurons for the TEAC values (depending on the ANN model), and one neurons in the output layer (TPC or the TEAC values). The sums of squares error function was used during the network training process. The success of the model to predict TPC and the TEAC values was evaluated based on the correlation coefficients for: training—a percentage of the samples in the learning set correctly predicted during the networks learning step; test—a percentage of the samples in the testing set correctly predicted during the networks testing step and validation—a percentage of the samples in the validation set (samples not used in the learning and testing steps) correctly predicted by the models during the networks validation step.

Statistical analysis

All analyses were run in triplicate. The statistical tests were performed using Statistica 13.0 software (Statsoft, Oklahoma, USA) at α = 0.05.

Values of kinetic parameters were estimated by non-linear estimation analysis using least–squares criterion with Levenberg–Marquardt algorithm. The goodness of fits of the models was assessed using determination coefficient (R2) (Eq. 17):

$$R^{2} = 1 - (total\,sum\,of\,squares)/(residual\,sum\,of\,squares),$$
(17)

correlation coefficients (r), root mean square error (RMSE) (Eq. 18):

$$RMSE= \sqrt{\sum_{i=1}^{n}{({y}_{i}- {\widehat{y}}_{1})}^{2}/n}$$
(18)

and coefficient of variation (CV) (Eq. 19):

$$CV=100/\stackrel{-}{y} \times \sqrt{\sum_{i=1}^{n}{({y}_{i}- {\widehat{y}}_{1})}^{2}/n}$$
(19)

and/or generalized cross-validation (GCV) error (Eq. 15). Analysis of variance (ANOVA) was performed to statistically evaluate all parameters obtained.

Results and discussion

Bioactive compound content and the antioxidant activity of tea

The changes in the TPC of green and black teas are shown in Fig. 1. The antioxidant activity of the infusions expressed in the TEAC values calculated from the DPPH· radical scavenging assay is presented in Fig. 2. By analyzing the data obtained it could be concluded that TPC and the TEAC values of green and black teas were affected by the time and temperature of extraction (p < 0.05). With the increasing temperature and time of brewing, the content of total polyphenols increased (Fig. 1a, b). After 60 min of brewing at 100 °C TPC values reached the highest level of 78.3 mg GAE/g for green tea and 55.6 mg GAE/g for black tea.

Fig. 1
figure 1

Total polyphenol content (TPC) in green (a) and black (b) teas brewed at temperatures: 60, 70, 90, 100 °C

Fig. 2
figure 2

The antioxidant activity expressed in the TEAC values of green (a) and black (b) teas brewed at temperatures: 60, 70, 90, 100 °C

It was found that also the composition of both teas varied (Tables 1 and 2). The green tea contained EGCG, EGC, ECG, and GC as the main catechins which accounted for about 90% of catechins (after 15 min of brewing at 100 °C). The main catechins found in black tea prepared at 100 °C were as follows: EGC, EGCG and ECG which together constituted about 86% of catechins. These findings are in agreement with the results of Enko & Gliszczyńska-Świgło [26], Muzolf et al. [27] and Gramza et al. [28]. However, the effect of time and temperature on the particular black and green tea components was different comparing to their TPC changes (Tables 1 and 2).

Table 1 The effect of temperature (after 15 min of brewing) on gallic acid (GA) and catechin contents in green and black tea
Table 2 The effect of time (at 70 °C) on gallic acid (GA) and catechin contents in green and black tea

For green tea, the maximum levels of GC, GCG, ECG, ECGC and EGC were reached at 80 °C (Table 1). According to our results, there were no significant differences between EC content at various temperature, whereas, gallic acid (GA) content increased with temperature showing the maximum level at 100 °C. It was observed in previous studies that extraction at 85 °C gave the highest contents of ECG and EGC, whereas for non-epistructured catechins the highest levels were reached at 95 °C [3]. The levels of GA and catechins, namely: EC, ECG, EGCG and GCG slightly increased with increasing time of brewing at 70 °C (Table 2). EGC increased with time up to 10 min of brewing and, there was no trend observed in GC changes in green tea infusion with time.

For black tea, the contents of EGC and GCG increased with the increasing temperature (Table 1). EC showed the maximum level at 80 °C, then further increase of temperature did not change the content of the compound. The levels of EGCG and ECG were stable between 60 and 70 °C and then increased up to 100 °C. The effect of time on particular catechins in black tea was various (Table 2). The contents of EGCG, GCG and ECG increased at the beginning of brewing process (between 5 and 8 min) and thereafter their levels were stable with time. There was observed a steady increase of EC and a decrease of GC with time. EGC increased, up to 8 min of extraction, and thereafter decreased.

A significant increase of the antioxidant activity of both teas with increasing temperature and time was also observed (Fig. 2a, b). The TEAC values increased after 60 min of brewing at 100 °C up to 706 mmol/g for green tea (Fig. 2a) and 577 mmol/g for black tea (Fig. 2b). Correlation coefficient between TPC and the TEAC values is very high (0.98) for both teas. The effect of temperature and time on TPC and the antioxidant activity of teas was previously reported by others [4, 29,30,31]. Moreover, TPC and the antioxidant activity of black tea was significantly lower than in green tea at each temperature tested, which is in accordance with the previous studies [32].

Development and evaluation of mathematical models for TPC and the antioxidant activity of tea

The extraction curves for both types of teas, obtained by plotting TPC or the TEAC values vs time of brewing were similar in shape (Figs. 1 and 2). A rapid increase of TPC and the TEAC values at the very beginning of the process was observed followed by slow increase in the further stage of extraction until the equilibrium concentration of bioactive compounds was achieved. Similar results were previously observed for compounds extracted from tea [8, 11], sage [33], Asteraceae plants [15], soybeans [14] or grape cane [34].

Based on the shape of the curves, Spiro’s, Peleg’s and logarithmic models were chosen for description of total polyphenol and the antioxidant activity changes in green and black teas and the predictive model development. Proposed models were fitted to the experimental data by non-linear regression. The R2 and RMSE values of each model were used for the assessment of model adequacies (Tables 3 and 4).

Table 3 Values of estimated model parameters for TPC in green tea (GT) and black tea (BT) with adjusted determination coefficients (R2) and root mean square errors (RMSE)
Table 4 Values of estimated models parameters for the antioxidant activity of green tea (GT) and black tea (BT) with adjusted determination coefficients (R2) and root mean square errors (RMSE)

Based on the results, it could be concluded that all models had very high goodness of fit as representing by high R2 and low RMSE values (Tables 3 and 4). Nevertheless, Peleg’s model could best describe the extraction of polyphenols as well as the antioxidant activity changes for both types of tea with R2 above 0.99 and similar, low RMSE values.

For Spiro’s model, the rate of polyphenol extraction (k) significantly increased with increasing temperature of brewing (Table 3). This effect could be explained by disruption of the cell wall by high temperature which made phenolic compounds easier to extract from tea leaves [8]. Moreover, k values for black tea were higher for about 37–48% than the corresponding k values for green tea. This indicates that the extraction of phenolic compounds is faster from black tea than from green tea at each temperature tested which was previously observed by Jaganyi and Price [8] for caffeine extraction. The difference could be explained by the various manufacturing process of the tea [35]. During green tea processing cellular structure remains almost intact, while in the black tea process, the leaves are rolled, so that the leaf cells are broken and the content of the cell cytoplasm is released. This makes the black tea phenolic compounds more available to the extraction than green tea catechins.

For Peleg’s model, K1 constant (Table 3), related to the initial extraction rate (R0, calculated from Eq. (5)) of polyphenols, decreased with increasing temperature indicating that the extraction of polyphenols at the beginning of the process was accelerated with temperature (R0 from 0.137 mg GAE/g/s to 0.586 mg GAE/g/s for green tea, and from 0.207 mg GAE/g/s to 0.450 mg GAE/g/s for black tea). Although, at the temperatures of 60 and 70 °C, the R0 values were higher for black tea comparing to green tea, reverse order was observed at the temperatures of 90 and 100 °C. This implies that, at higher temperatures, the extraction of polyphenols from green tea at the beginning of the process was faster compared to black tea. The effect could be partly explain by the significant increase of the extraction rate of some individual catechins (such as epigallocatechin gallate, gallocatechin gallate and catechin) in green tea infusion, at the temperature of 85 and 95 °C comparing to the extraction rate of these compounds at 75 °C [3]. The K2 values calculated for TPC (Table 3) also decreased with higher temperature which indicated the temperature–dependent increase of polyphenol equilibrium concentration (Ce) values (calculated from Eq. (6)). The Ce values obtained for TPC were higher for green than for black tea (the K2 values for green tea were lower than for black tea).

In case of logarithmic model the same trend was observed that for the previous ones; the extraction rate of total polyphenols from tea increased by increasing temperature.

The antioxidant activity of tea, expressed in the TEAC values, was also analyzed using the three selected models. Based on Spiro’s model, the rate of the antioxidant activity (k) significantly increased with an increase of temperature (Table 4). Interestingly, k values for black tea, which represent the Spiro’s rate constant, were lower than the corresponding k values for green tea. This indicates that the reaction of catechins extracted from green tea with the DPPH radicals is faster than the reaction of the bioactive compounds extracted from black tea. It may be due to the fact that during production of black tea more complex antioxidant active compounds are formed (theaflavins and thearubigins); the access of the DPPH radicals to their reactive hydroxyl groups could be more difficult (due to steric hindrance).

On the basis of Peleg’s model, the K1 constant (Table 4) decreased with increasing temperature indicating that the antioxidant activity of tea at the beginning of the process was accelerated with temperature (R0 calculated from K1 (Eq. 5) was from 2.80 mmol/g/s to 5.81 mmol/g/s for green tea, and from 0.99 mmol/g/s to 2.87 mmol/g/s for black tea). The K2 values calculated for the TEAC values also decreased with higher temperature which indicated the temperature–dependent increase of antioxidant equilibrium concentration (Ce) values (calculated from Eq. (6)). The Ce values obtained for the TEAC values were higher for green than for black tea (the K2 values for green tea were lower than for black tea).

For logarithmic model the same trend was observed that for the previous ones; the extraction rate of antioxidant active compounds from tea increased with an increase of temperature.

The effect of temperature was included to the mathematical models using Arrhenius relationship (Eq. 8). Based on the modified logistic Arrhenius equation (Eq. 9), the plot of lnk (for Spiro’s model), ln(1/K1) (for Peleg’s model) or lna (for Logarithmic model) versus 1/T gave straight regression lines for both teas with the slopes –Ea/R and k0 as intercepts (Fig. 3). The temperature dependence of the extraction rates of phenolics was described by Arrhenius kinetics in the whole temperature range under study. From the slope of these regression lines Ea values were calculated (Table 5).

Fig. 3
figure 3

Modeling temperature dependency of constant rates (k for Spiro’s model, K1 for Peleg’s model and a for logarithmic model) of total polyphenol content (TPC) and the antioxidant activity (TEAC) of green and black teas for the modified logistic Arrhenius equation

Table 5 Activation energies (Ea) and pre-exponential factors (k0) calculated, based on the Arrhenius equation, for the extraction of bioactive compounds (TPC) from green tea (GT) and black tea (BT) and their antioxidant activity (TEAC)

Ea values for polyphenol extraction from green tea were lower than for black tea based on the calculation from Spiro’s (respectively, 14.239 kJ/mol and 18.494 kJ/mol) and logarithmic models (respectively, 6.945 kJ/mol and 13.770 kJ/mol). The results imply that the polyphenol extraction rates for green tea calculated from the two models are less sensitive to temperature than the ones for black tea. However, based on the results obtained from Peleg’s model, Ea value calculated for the initial extraction rate (R0) of polyphenols from green tea was 35.622 kJ/mol and was higher than Ea (R0) obtained for black tea (17.916 kJ/mol). Thus, it could be stated that the temperature affects the initial extraction rate of polyphenols from green tea more than that from black tea. Since Ce calculated from the reciprocal of Peleg’s K2 constant (Eq. (6)) was also temperature–dependent, Arrhenius equation in its modified form (Eq. (10)) was used to calculate Ea (Ce) (Tables 3 and 4). The Ea (Ce) values for TPC in green tea were lower than in black tea. These results implied that Ce tends to be more temperature sensitive for black tea rather than for green tea.

The Ea values for the TEAC values of green tea were lower than those of black tea according to all three models included in the study (Table 5). Thus, the effect of temperature is more significant for the antioxidant activity changes of black than of green tea.

Since the phenolic content and the antioxidant activity of tea corresponds to the quality of tea infusions, these parameters could be used to create models predicting the quality changes of tea infusion in the function of time and temperature of infusion.

With the constant rates of empirical models (Tables 3 and 4) and the parameters calculated from Arrhenius equation (Table 5) the predictive models for tea quality (expressed as TPC and the TEAC values) were created based on the (Eqs. 2, 4 and 7). The validation of the models was performed by the comparison of the experimental data of the samples brewed at 80 °C with the predictive values calculated from the three models (Fig. 4) and R2, RMSE and r were calculated to verify the model accuracy (Table 6). Based on the data from Fig. 4a, b and Table 6 it could be concluded that Spiro’s model gives very good prediction of the quality changes of green tea infusion (both, for TPC and TEAC parameters). The R2 and r values were the highest, and RMSE values were the lowest. The coefficient of variation (CV) for RMSE is below 10% for the model, with CV equal to 6.64% and 5.45% for the prediction of TPC and the TEAC values, respectively. The following equations were created for green tea:

Fig. 4
figure 4

The validation of the models generated for total polyphenol content (TPC) (a and b) and the antioxidant activity (expressed in the TEAC values) (c and d) of green and black teas brewed at 80 °C

Table 6 Validation of generated models for TPC and TEAC values of green (GT) and black (BT) teas brewed at 80 °C

\(TPC{ }GT\left( t \right) = {{C_{\infty } - { }C_{\infty } }}/{{\exp \left( {k \times t} \right)}}\), where \(k = 0.340 \times {\text{exp }}( - 14239/\left( {R \times T} \right))\)

\(TEAC{ }GT\left( t \right) = { }C_{\infty } - { }C_{\infty } /\exp \left( {k \times t} \right)\), where \(k = 104.954 \times {\text{exp }}( - 20515/\left( {R \times T} \right))\)

The TPC and the TEAC values of black tea could be accurately predicted by Peleg’s model (Fig. 4c, d), with CV values of 6.1% and 10.99% respectively for TPC and TEAC parameters (Table 6). The following equations were created for black tea:

\(TPC BT(t)= t/(K_1+K_2 \times t)\), where \(K_{1} = 1/\left( {153.659 \times {\text{exp }}( - 17916/\left( {R \times T} \right)} \right))\) and \(K_{2} = 1/\left( {3813.568 \times {\text{exp }}( - 12891/\left( {R \times T} \right)} \right))\),

\(TEAC{ }BT\left( t \right) = { }t/(K_{1} + K_{2} \times t)\), where \(K_{1} = 1/\left( {32591.986{ } \times {\text{exp }}( - 28835/\left( {R \times T} \right)} \right))\) and \(K_{2} = 1/\left( {20245.501{ } \times {\text{exp }}( - 10287/\left( {R \times T} \right)} \right))\)

The worst among mathematical models for the prediction of tea quality in the function of time and temperature of brewing was the logarithmic model (Table 6).

Multivariate adaptive regression splines models

The second order MARS model has been used in this study and the maximum number of terms was limited by pruning. The results of MARS models calculated by the Statistica program were shown in Table 7. The model created for TPC of green and black teas included, respectively, five and six terms as well as five and six basis functions. All predictor variables; time and temperature, were statistically significant. The R2 values for TPC of green and black tea were equal to, respectively, 0.95 and 0.96, whereas adjusted R2 was 0.95 for both types of teas. The GCV calculated for TPC of green tea was 32.44 and of black tea 11.20. In Fig. 5a, c, the validation of the MARS models was shown by the comparison of the experimental data of the teas brewed at 80 °C with the predictive TPC values calculated from the MARS models (Table 7) and R2, RMSE and r were calculated to verify the model accuracy. Obtained models predicted the data very good, since the determination and correlation coefficients were high (R2 = 0.99, r = 0.98 for both types of tea), whereas RMSE values low (RMSE = 5.0 for green tea and 2.8 for black tea). The CV values were around 10% (9.4% for black tea and 11.5% for green tea).

Table 7 MARS model coefficients and basis functions calculated for TPC and the TEAC values of green (GT) and black (BT) teas
Fig. 5
figure 5

The validation of the MARS models generated for total polyphenol content (TPC) (a and b) and the antioxidant activity (expressed in the TEAC values) (c and d) of green and black teas brewed at 80 °C

The MARS models created for the prediction of the antioxidant activity changes of green and black teas were also presented in Table 7. For the TEAC values of green tea five terms and five basis functions were included in the model whereas the antioxidant activity of black tea could be described by the MARS model contained eight terms and eight basis functions. The goodness of fit of both models was very high. The adjusted R2 values were equal to 0.96 and 0.97 for green and black teas, respectively. To verify the model accuracy, the comparison of the experimental data of the teas brewed at 80 °C with the predictive TEAC values calculated from the MARS models was shown on Fig. 5b, d and R2 RMSE, and r values were calculated. The MARS models obtained showed very high accuracy, since R2 was equal to 0.99 for both types of teas and r = 0.98 for green tea and r = 0.99 for black tea, whereas RMSE = 42.0 for green tea and 27.3 for black tea. The CV values were equal to 10.09% for green tea and 10.42% for black tea.

Artificial neural network models

The prediction of tea quality based on TPC and the TEAC values was performed with an artificial neural networks. The best five ANN–MLP networks are presented in Tables 8 and 9. In the neural network obtained for TPC of green tea (Table 8), the logistic and exponential functions were used in the hidden layer, whereas the logistic, identity and Tanh functions were used in the output layer. For the TPC in black tea, the hidden layer was based on Tanh and logistic functions, while the output layer Tanh—on identity and exponential functions. For the TEAC values analysis by the ANN method mainly exponential and also logistic functions were used in the input and output layers (Table 9). The accuracy of the models to predict TPC and TEAC was very high and accounted for more than 0.99 for the training, test and validation sets of data. The accuracies of the models obtained could be also expressed as the sum of square error values, as presented in Tables 8 and 9. The values indicated satisfactory quality of the presented ANN models.

Table 8 ANN model parameters for TPC of green (GT) and black tea (BT)
Table 9 ANN model parameters for the antioxidant activity of green (GT) and black tea (BT)

Conclusions

The content of phenolics is highly correlated with the antioxidant activity of tea, and could be used as a quality parameter of tea. Thus, for the first time, the predictive models were developed for the description of the quality changes of green and black tea infusions under various time and temperature conditions. Most of the tea bioactive compounds were extracted from the samples during 30 min of the extraction process. The highest values of TPC and TEAC were achieved at the highest temperature of 100 °C for both teas.

The experimental data of TPC and TEAC values were fitted to three mathematical models: Spiro’s, Peleg’s and Logarithmic models as well as implemented to MARS and ANN models. This study demonstrated the high potential of the selected models for the prediction of tea quality during infusion, providing reliable results and models of simple application. Moreover, data mining techniques (MARS and ANNs) enable to create models very fast using the high number of data. The results presented here together with the models built could be developed further to fully validate the effectiveness of all models, using a higher number of green and black tea samples.