1 Introduction

Bioactive peptides are usually 3–20 amino acids in length and are normally inactive within the sequence of the parent animal or plant protein from which they are derived. However, despite their apparent lack of activity, bioactive peptides can exert a variety of different physiological functions after they have been released from the parent protein as a consequence of food processing or enzymatic proteolysis in the gastrointestinal tract (Moller et al. 2008). In contrast to pharmaceutical and chemical additives, protein-derived bioactive peptides exert very few side effects in humans, and there is also a growing preference among consumers for products derived from natural sources (Sakai et al. 1997; Moller et al. 2008). For this reason, considerable research efforts have been focused on the identification and characterization of bioactive peptides derived from various dietary proteins, such as milk, soy, fish, and egg (Samaranayaka and Li-Chan 2011).

β-Lactoglobulin (β-Lg) is a major whey protein that is considered to be an important source of biologically active peptides (Korhonen 2009). Bioactive peptides derived from β-Lg may function as antihypertensives, antioxidants, and antimicrobials, and can also be used to lower cholesterol (Hernandez-Ledesma et al. 2008). As for antioxidant activity, Liu et al. (2007) found that the antioxidant activity of bovine milk was reduced by 50% when β-Lg in the milk was depleted. Furthermore, the antioxidant activity of the hydrolysates derived from β-Lg was higher than that of butylated hydroxyanisole, which is the most commonly used antioxidant (Hernández-Ledesma et al. 2005). Therefore, β-Lg is a good template to investigate the structure–activity relationship of antioxidant peptides.

To prepare and isolate protein-derived antioxidant peptides, considerable research efforts have been made towards the optimization of the experimental variables involved in hydrolysis of various dietary proteins, and the following separation and purification of peptides with high levels of activity (Gu and Wu 2013). Despite progress in these areas, the isolation of bioactive peptides using these methods can be tedious, and it is hard to know beforehand the structure or activities of the peptides resulting from hydrolysis. To prepare peptides with high activities more effectively, one possible solution would be to develop a deeper understanding of the relationship between the activity of the bioactive peptides and their structure, which could be used to predict activities of any given structure and instruct whether the parent protein have the potential of preparing highly active peptides.

To identify relationships between the structural features of molecules and their activities, a mathematical method, quantitative structure–activity relationship (QSAR) modeling has been introduced. It provides a method to find mathematical expressions for the relationship between structure and activity, which may be useful for estimating or predicting highly active structures. It has been used extensively in this context in the fields of medical science and toxicology. With regard to bioactive peptides, QSAR has been successfully applied to ACE-inhibitory peptides (Gu and Wu 2013; Jing et al. 2014; Wu et al. 2006a, 2006b) and antimicrobial peptides (Jenssen et al. 2008; Yan et al. 2013).

The successful development of QSAR models has been reported to evaluate the antioxidant properties of peptides using the trolox-equivalent antioxidant capacity (TEAC), oxygen radical absorption capacity (ORAC), superoxide radical (SOR), and ferric thiocyanate (FTC) methods (Li and Li 2013; Li et al. 2011). Since the databases of these QSAR models were collected from different sources, there still lacks a systemic evaluation of each fragment that could be produced by a certain protein. Chemically synthesized peptides prepared with well-developed solid-phase synthesis technology provide an alternative solution to obtaining large amounts of peptides with known sequences in a relatively short time, which allows the fast screening of a large amount of sequences for potential biological activities as the database of QSAR modeling. Tripeptides have been reported to exhibit higher levels of antioxidant activity than other oligopeptides (Chen et al. 1996) and are not limited in the same way as dipeptides in terms of the diversity of their structural variations (Jian-Hui et al. 2013). Furthermore, the well-known multifunctional antioxidant, glutathione (GSH) is also a tripeptide composed of sequential Glu, Cys, and Gly residues, which exists in animals and plants to defend against oxidative stress (Gu et al. 2012).

In this study, we chemically synthesized all of the tripeptides that could possibly be derived from β-Lg based on its amino acid sequence, and evaluated their antioxidant activities using a ferric-reducing antioxidant power (FRAP) assay. A QSAR model was constructed to develop a relationship between the antioxidant activity of β-Lg tripeptides and their structure.

2 Materials and methods

2.1 Chemicals

TPTZ (2,4,6-tripyridyl-s-triazine) and glutathione (GSH) were purchased from Sigma-Aldrich (St. Louis, MO, USA). O-(Benzotriazol-1-yl)-N,N,N′,N′-tetramethyluronium hexafluoro-phosphate (HBTU), 1-hydroxybenzotriazole (HOBT), thioanisole, N,N-diisopropylcarbodiimide (DIC), ethanedithiol (EDT), trifluoroacetic acid (TFA), diisopropylethylamine (DIPEA), and all Fmoc-protected amino acids were purchased from GL Biochem (Shanghai, China). Wang resin and 2-chlorotrityl resin were purchased from NankaiHecheng Science and Technology Co. Ltd. (Tianjin, China). HPLC-grade methanol and acetonitrile were purchased from Fisher Scientific (Loughborough, UK). All of the other reagents used in the current study were obtained commercially as the analytical grade.

2.2 Peptide synthesis and purification

β-Lg was selected as the parent protein in the current study, and the sequence of the bovine β-Lg precursor (Mercadante et al. 2012) is shown below:

$$ \begin{array}{c}\hfill 1\ \mathrm{MKCLLLALAL}\ \mathrm{TCGAQALIVT}\ \mathrm{QTMKGLDIQK}\ \mathrm{VAGTWYSLA}\hfill \\ {}\hfill 41\ \mathrm{AASDISLLDA}\ \mathrm{QSAPLRVYVE}\ \mathrm{ELKPTPEGDL}\ \mathrm{EILLQKWENG}\hfill \\ {}\hfill 81\ \mathrm{ECAQKKIIAE}\ \mathrm{KTKIPAVFKI}\ \mathrm{DALNENKVLV}\ \mathrm{LDTDYKKYLL}\hfill \\ {}\hfill 121\ \mathrm{FCMENSAEPE}\ \mathrm{QSLACQCLVR}\ \mathrm{TPEVDDEALE}\ \mathrm{KDKALKALP}\hfill \\ {}\hfill 161\ \mathrm{MHIRLSFNPT}\ \mathrm{QLEEQCHI}\hfill \end{array} $$

This particular sequence provided 175 tripeptides, with each peptide having a two-amino-acid overlap. These 175 tripeptides contained three repeated sequences, which were removed to reduce the total number of tripeptides to 172 (Table 1).

Table 1 Antioxidant activities of the 172 tripeptides based on β-Lactoglobulin

All of the peptides were synthesized using solid phase Fmoc Chemistry (Ren et al. 2011). 2-Chlorotrityl resin was used for the peptide acids containing carboxyl-terminal Trp, Cys, and Pro residues, whereas Wang resin was used for all of the other peptide acids. When more than one tripeptide shared the same N-terminal sequence, resins bearing different pre-loaded C-terminal amino acids were mixed together, with up to four tripeptides being synthesized in one batch.

All of the individual peptides and peptide mixtures were purified using a Shimadzu (Kyoto, Japan) HPLC equipped with LC-10AT pumps and an SPD-10A VP Plus UV detector. HPLC purifications were conducted over an Agilent Eclipse XDB-C18 column (9.4 × 250 mm, 5 μm). The mobile phases used for the HPLC were 0.1% TFA in water (solvent A) and 0.1% TFA in acetonitrile (solvent B). The elution gradients were varied slightly depending on the peptide sequence. In a typical purification run, the solvent gradient was increased from 5 to 40% B in a linear manner over 40 min at a flow rate of 5 mL.min−1. The solvent eluting from the column was monitored at 230 (for those without Trp) or 280 nm (for those with Trp). The peptide purities were determined to be >95% in all cases by RP-HPLC using a Kramsil C18 column (4.6 × 250 mm) and an elution gradient of 5 to 50% B over 30 min at a flow rate of 1 mL.min−1. All of the peptides were characterized by ESI-MS using an Agilent 1100 Series LC/MSD (Santa Clara, CA, USA). The LC/MS system was operated under following conditions: HV capillary, 3,500 V; nebulizer pressure, 15 psi; dry gas flow rate, 5 L.min−1; and dry gas temperature, 325 °C.

2.3 Ferric-reducing antioxidant power assay (FRAP assay)

The FRAP assay was performed according to the method described by Benzie and Strain (1996) with some minor modifications. Ten microliters of a sample solution containing 100 mmol.mL−1 of peptides was incubated at 37 °C with 100 μL of FRAP reagent, containing 10 mmol.L−1 of TPTZ and 20 mmol.L−1 of FeCl3. The absorption values were read after 10 min at a wavelength of 570 nm using a microplate reader (Model 680, Bio-Rad, Hercules, CA, USA). Aqueous Fe2+ solutions at concentrations in the range of 10–1,000 μmol.L−1 were used to produce a calibration curve. The activity of glutathione (GSH) was also measured as a positive control. The results were expressed as micromoles Fe2+ equivalents per mole of the sample based on a standard curve of FeSO4•7H2O. All of these experiments were carried out in triplicate.

2.4 QSAR modeling

2.4.1 Structural characterization and regression analysis

The antioxidant activities of the 172 peptides synthesized in the current study were subjected to a data standardization protocol to make up the dependent variables. The divided physicochemical property score (DPPS) descriptors (Tian et al. 2009) were applied to describe the tripeptides, which make up the independent variables. The DPPS descriptors were made up of 10 variables which were used to represent the electronic (X1–4), steric (X5–6), hydrophobic (X78), and hydrogen-bonding properties (X910) of each individual amino acid. The amino acids in the sequence from N-terminal to the C-terminal were designed as n, m, and c, respectively. Subsequently, each tripeptides had 30 descriptors (i.e., 3 × 10), which were expressed as nX110, mX110, and cX110. The relationships between the biological activity and the peptide descriptor variable matrix were then modeled by multiple linear regression (MLR) analysis using the version R2013a of the MATLAB software (The MathWorks, Inc., Natick, MA, USA).

2.4.2 Model evaluation and cross validation

Before modeling, five peptides with the highest activities between which there were large gaps (that exceed 10–15% of the entire range of activities) and 14 inactive peptides were suppressed. The peptides left were then randomly divided into training and test sets in a 2:1 ratio. The training dataset was used to establish the QSAR models and perform an internal validation process, whereas the test set was used to perform an external validation process. The quality of the fit between the model-predicted and experimental values, and the suitability of the regression model in terms of its ability to describe the data were measured by the coefficient of determination (R 2) of the training set and Fisher’s test value (F value). The actual stability of the QSAR model was measured by Q 2 (the leave-one-out cross-validation R 2) and the R 2 test of the test dataset. The robustness was also measured by the randomization of the modeled property (Y-scrambling).

3 Results and discussions

3.1 The antioxidant activity of tripeptides

The antioxidant activities of the 172 synthetic tripeptides were evaluated using a FRAP assay, with GSH being used as a positive control. The activities observed in a FRAP assay are presented as the amount of Fe3+ reduced per mole of antioxidant, which allows for a direct comparison between results generated under different testing concentrations. The relative activities of the 172 tripeptide and GSH were calculated, and the results are listed in Table 1.

As glutathione, the well-known multifunctional antioxidant, shows activity around 273 mmol Fe3+.mol−1 peptide, we consider the range around its activity as “high,” that is to say, above 100 mmol Fe3+.mol−1 peptide. Marcuse (1960) found that some of the amino acids can act as primary antioxidants themselves; we assessed the activity of some free amino acids and found that Trp and Cys have activities around 5.96 and 50.74 mmol Fe3+.mol−1 amino acid. Therefore, we consider 1–100 mmol Fe3+.mol−1 peptide as “moderate,” and lower than 1 mmol Fe3+.mol−1 peptide as “low.” More than half of the peptides showed very poor antioxidant activities at a concentration of 100 mmol .L−1. Seventy-four of the peptides showed antioxidant activities greater than 1 mmol Fe3+.mol−1 peptide. Among this particular group of tripeptides, six peptides (LTC, CQC, GTW, LFC, CLV, and QKW) showed high activities, and three peptides, LTC, CQC, and GTW, showed higher activities than that of the GSH.

3.2 Model evaluation and cross validation

As shown in Table 2, the values of R 2 and F were 0.643 and 7.324, respectively, which suggested that the activities of these peptides shared a linear relationship with the physicochemical properties of the individual residues in their sequence at 5% significance level. Figure 1a shows a plot of the calculated activities against the observed activities for the training set. The relationship between the observed and predicted activities of the test set is shown in Fig. 1b. The scatter plots of the estimates vs the experimental values are rather evenly distributed along the regression line, as well for the training set than for the test set, suggesting a reliability of the model.

Table 2 Statistical parameters of the MLR model for the QSAR of the 172 antioxidative tripeptides based on β-Lactoglobulin
Fig. 1
figure 1

Calculated antioxidant values versus observed values a for the training set and b for the test set. The circles represent the chemosynthetic tripeptides, whereas the diamond represents the GSH. Outliers were excluded before the modeling process for the training set

The external validation of the current model was performed using the prediction dataset. The Q 2 and R 2 test values were 0.541 and 0.635, respectively, which suggested that the model was creditable and stable. The model was further validated by applying the Y-randomization test. Nineteen random shuffles of the Y-vector gave R 2 values in the ranges of 0.143–0.368 and Q 2 values in the ranges of 0.000–0.208 as shown in Table 3. The confidence level is 95%. The low R 2 and Q 2 values obtained show that the good results in our original model are not due to a chance correlation or structural dependency of the training set.

Table 3 Y-randomization results of the QSAR model of the 172 antioxidative tripeptides based on β-Lactoglobulin

3.3 Relationship between antioxidant activity and amino acids in the peptide sequences

Figure 2 shows the regression coefficients of the different variables for the tripeptides. These regression coefficients revealed that mX9, cX6, mX10, cX1, nX10, cX10, cX4, nX6, mX2, and nX1 made a much greater contribution than any of the other variables. Furthermore, nX8, nX9, mX9, cX5, and cX8 had a negative effect on the antioxidant activity; whereas, all of the other variables had a positive impact. Based on the meaning of the DPPS descriptors and the positions of different amino acids in the sequences, it was clear the electronic and hydrogen-bonding properties of the amino acids in the tripeptide sequences, as well as the steric properties of the amino acid residues at the C- and N-termini, played an important role in the antioxidant activities of the tripeptides. Considering that the dependent variables (i.e., antioxidant activities) obtained were presented in a reducing capacity based on their performance in the FRAP assay, which provides a direct measurement of the antioxidant activity through reduction reaction based on single electron transfer, it is perhaps not surprising that the results of the regression showed that the antioxidant activity was mainly related to the electronic and hydrogen-bonding properties of the amino acid residues.

Fig. 2
figure 2

Plot of regression coefficients for the 172 tripeptides based on β-Lactoglobulin

Based on the regression equation from the QSAR modeling, we calculated the antioxidant activities of tripeptides bearing one of the 20 common amino acids at their C- or N-terminus, or in the middle of their sequence in an attempt to predict which peptide sequence would provide the strongest activity. It turned out to be a similar rank for the predicted values, with Cys (C) and Trp (W) residues tending to provide higher predicted values.

3.4 Illustration of the structural characteristics of antioxidative peptides

Numerous studies have been reported in the literature that peptides containing Cys (C) and Trp (W) amino acid residues have higher antioxidant activities than peptides not containing these residues. An early study indicated that the antioxidant effect of β-Lg in the raw milk was associated with the free sulfhydryl group at Cys-121 (Liu et al. 2007). Saito et al. (2003) found that tripeptides containing Trp (W) at the C-terminus had strong radical scavenging ability. Other oligopeptides with high antioxidant activity were also separated and identified containing one or more Trp (W) from β-Lg (Contreras et al. 2011; Hernández-Ledesma et al. 2005), β-CN (Suetsuna et al. 2000), and sardinelle (Bougatef et al. 2010).

For a given amino acid or peptide to act as an antioxidant, it must be preferentially oxidized in the presence of other compounds in the food matrix. In principle, all 20 of the amino acids found in proteins could interact with free radicals if the energy of the free radical was high enough (Samaranayaka and Li-Chan 2011). The oxidative reactivities of a variety of amino acids were evaluated by photochemical oxidation with an OH-containing compound and it was found that the reactivities of the amino acids were of the order Cys > Trp and Tyr > Met > Phe > His > Ile > Leu > Pro (Sharp et al. 2004). The sulfhydryl (R-SH) group in Cys (C) has a unique antioxidant activity and interacts with radical species by hydrogen donation from the SH group or the loss of an electron from its sulfur atom (Elias et al. 2008). Residues with an aromatic ring structure (i.e., Trp W) have been reported to donate a proton to electron deficient radicals (Chen et al. 1998). Taken together, these results demonstrate that the nucleophilic sulfur-containing amino acids Cys (C) and the aromatic amino acids Trp (W) make a significant contribution to the antioxidant activities of certain peptide.

This result appeared to contradict the results of previous studies (Li and Li 2013; Li et al. 2011) in some cases where it may prefer different amino acids at different positions of the sequence, although these differences in the results were mainly due to the differences in the assay conditions. The FRAP assay measures the antioxidant activity through a reduction reaction based on single electron transfer reactions, where the redox potential of the compounds is important. Whereas the ORAC, SOR, and TEAC assays are based on a free radical scavenging system which are influenced quite strongly by the type of reactive species used. Although TEAC assays is also based on single electron transfer reactions and the redox potential of the redox couple, ABTS/ABTS+ (0.68 V), is comparable to that of Fe(II)/(III) (0.70 V), the TEAC and FRAP assays provided different results because the two assays operate under different reaction conditions, including different pH values (e.g., FRAP, pH 3.6; TEAC, pH 7.4) and different levels of steric hindrance for the molecules being oxidized (Müller et al. 2011). Assays carried out in the presence of lipids (e.g., FTC assay) or an emulsion to inhibit lipid peroxidation could correlate more closely with the amphiphilic and steric properties of the antioxidants, which would help them enter into the lipid system so that they could react with the free radicals.

It is interesting to note that despite peptides with Cys (C) or Trp (W) in the sequence show higher activities, their activities still vary. This may be attributed to the influence of neighboring amino acids beside these two particular amino acid residues in the peptide sequence. The strong radical scavenging ability of Trp (W) residues may be explained by the special capability of indolic group to serve as hydrogen donor and the stability of the indolic radical which depends on neighboring amino acid residues in the presence of an acid-ferric ion system (Saito et al. 2003). That is to say, the physicochemical properties of neighboring amino acids may influence the ability to donate electron of the Cys or Trp, and thus, influence the antioxidant activity. However, a further study is still needed to understand the influence of neighboring amino acid residues and its mechanism.

4 Conclusions

In summary, all the tripeptides that could possibly derive from bovine beta-lactoglobulin have been synthesized, and their antioxidant activities measured using a ferric-reducing antioxidant power assay. The results revealed that the antioxidant activities of three tripeptides, LTC, CQC, and GTW, were higher than that of GSH. A QSAR model of antioxidant activities of these peptides was also performed. The results of QSAR modeling showed that the electronic and hydrogen-bonding properties of the amino acids in the tripeptide sequences, as well as the steric properties of the amino acid residues at the C- and N-termini, played an important role in the antioxidant activities of the tripeptides. Tripeptides exhibiting the strongest antioxidant activities tended to contain Cys (C) and Trp (W) residues.