Introduction

Currently, healthy and nutritional products in human food are of tremendous importance. One such product is butternut squash, which is characterized by antioxidant and anticancer properties. Pumpkin fruits contain a high content of carotenoids, vitamins A, C and B, various minerals, and organic acids (see e.g., [1,2,3]). Additionally, pumpkin is low in calories; therefore, it can be successfully used in slimming diets. The quality of butternut squash is considered in the literature of the subject in different aspects, namely, on production and market (see e.g., [3,4,5,6]), sensory, chemical or mechanical properties (e.g., [6,7,8,9, 11]).

Prior to consumption, pumpkin fruits are subjected to different types of thermal treatments. During these processes, textural properties are changed (e.g., [1,2,3, 8]). The aim of this study was to determine changes in the textural properties of butternut squash caused by thermal treatment in a convection steam oven with different process parameters. Textural properties were measured in terms of five features (characteristics): hardness, chewiness, gumminess, springiness, and cohesiveness.

Separate statistical analysis of each feature are presented in [10]. However, the textural characteristics under consideration are mainly mutually significantly correlated; therefore, a multivariate approach was applied to identify any simultaneous influences of thermal treatment parameters on butternut squash texture. MANOVA analysis and selected multivariate techniques were employed, such as canonical correlation analysis, principal component analysis (PCA), factor analysis (FA), clustering analysis, and discriminant analysis. All calculations were performed in SAS 9.3 and SAS Enterprise Guide 5.1.

Materials and methods

In the experiment, butternut squash (Cucurbita moschata Duch.) originating from Portugal was selected as the research material. Details about the raw material, its properties, and the treatments are presented in [10]. In the experiment, three factors were introduced:

  • temperature at the levels of 80 and 100 °C,

  • steam at the levels of 0%, 20%, 40%, 60%, 80%, and 100% for the temperature of 80 °C, and 0%, 20%, 40%, and 60% for the temperature of 100 °C,

  • heating times: 5, 10, 15, 20, 25, and 30 min.

Immediately after the thermal treatment, warm samples were measured with the TPA test using a Zwick/Roell Z0.5 strength-testing device. Measurements in the form of texturegrams of strength and time were conducted in six replications. Based on the measurements, the following texture characteristics were determined: hardness [N]—the maximum force recorded during the first compression cycle; springiness [–] characterizing the degree of shape recovery (calculated as a quotient of sample deformation values during the second and the first compression cycle); gumminess [N]—quotient of hardness and cohesiveness, chewiness [N] denoting the force required to chew a bite of the food product before swallowing (quotient of hardness, cohesiveness, and elasticity); cohesiveness [–], characterizing internal binding forces ensuring the entity of the product (calculated as a quotient of the fields under the graph of the second and first compression cycles) in [11]. As the final result, a data set was collected containing 360 independent vectors (individual samples), each of five measurable characteristics (features), respectively. Each of the four texture features, i.e. hardness, springiness, chewiness, and cohesiveness, was analyzed separately in [10].

Results and discussion

The aim of this section is to analyze, using multivariate statistical techniques, the impact of treatment factors on the texture of butternut squash as measured by five features. First, we studied for any significant impact from the main factors and from interactions between them on butternut squash textural characteristics during the thermal treatment using multivariate analysis of variance (MANOVA). Furthermore, multivariate techniques available in SAS Enterprise Guide 5.1 were used to identify similarities and differences between the samples and the characteristics under consideration.

Multivariate analysis of variance (MANOVA)

To check for the significant impact of the analyzed factors during the thermal treatment—namely, temperature, steam, and duration of heating—on multivariate textural characteristics of butternut squash, a three-way classification model was used in [12] including the main factors and all double interactions between the main factors. To conduct an appropriate analysis, we considered only the same levels for steam at both temperatures: 0%, 20%, 40%, and 60%.

Six types of null hypotheses were defined:

\(H_{{Temp}}^{0}\) :

No overall temperature effect,

\(H_{{Steam}}^{0}\) :

No overall steam effect,

\(H_{{Time}}^{0}\) :

No overall time effect,

\(H_{{Temp \times Steam}}^{0}\) :

No overall temperature*steam interaction effect,

\(H_{{Temp \times Time}}^{0}\) :

No overall temperature*time interaction effect,

\(H_{{Steam \times Time}}^{0}\) :

No overall steam*time interaction effect.

Before testing the above hypotheses, an additional hypothesis, i.e. that measurements for each of the features under consideration represented normal distributions, was tested using Shapiro–Wilk statistics. Only for some factors of cohesiveness was normality rejected at a significance level of 0.05. For that reason, four features (hardness, springiness, gumminess, and chewiness) were considered in further analyses. The calculations were performed in SAS 9.3 using the following code:

figure a

Based on all measurements of the four textural characteristics simultaneously—hardness, springiness, gumminess, and chewiness—all the tests used in MANOVA (Wilks’ Lambda, Pillai’s Trace, Hotelling–Lawley Trace, and Roy’s Greatest Root) rejected all the null hypotheses under consideration. This demonstrated that all factors and their interactions had a simultaneous significant impact on the variability of the textural characteristics of butternut squash at a significance level of 0.05.

Pearson’s correlations

To check the linear relations between the considered features, Pearson’s correlation coefficients for each temperature, separately and together for both temperatures, were calculated. The obtained results are presented in Table 1, where the following abbreviations are used: Hard—hardness, Spr—springiness, Gum—gumminess, Chew—chewiness, and Coh—cohesiveness. In Table 1, Pearson’s correlations (significant at a 0.05 significance level) are shown in bold.

Table 1 Pearson’s correlations for textural characteristics with respect to the temperatures 80%, 100% and both (significant coefficients at the level of 0.05 are printed in bold)

The correlations between the features under consideration included in Table 1 support the fact that, based on all measurements (last row), most features are significantly correlated. In terms of the temperature of 80 °C (the second row), only the correlation between springiness and cohesiveness is not significant. For the temperature of 100 °C (the third row), hardness is significantly correlated with gumminess, chewiness, and cohesiveness, and chewiness is also correlated with springiness and gumminess. The most strongly and positively correlated characteristics were hardness and gumminess for temperatures both separately and in total. These facts indicate that, in addition to the statistical analysis of each characteristic separately, additional multivariate analyses should also be performed for all characteristics simultaneously to identify additional sources of variations in the textural attributes of butternut squash.

Canonical correlation analysis

In this subsection, we concentrate on exploring the relationships between two multivariate sets of variables. The three factors of temperature, steam, and time are considered as the first set of variables, and the textural features under consideration, i.e. hardness, springiness, gumminess, chewiness, and cohesiveness, are the second set of variables. All the regression coefficients between textural characteristics and factors presented in Table 2 are significant at the significance level of 0.05, except for the regression between cohesiveness and time.

Table 2 Regression coefficients between the group of textural features and the group of factors (significant coefficients at the level of 0.05 are given in bold)

The results in Table 2 show that all the considered factors have a significant negative influence on the texture of butternut squash during thermal treatment, for example, hardness, gumminess and chewiness, and a positive influence on cohesiveness and springiness, with the exception of time.

Based on the canonical analysis, the following first pair of canonical variables:

$$\left\{ {\begin{array}{*{20}{l}} {{V_1}= - 0.0047{\text{ Hard}}+1.0463{\text{ Spr}} - 0.0281{\text{ Gum}} - 0.0294{\text{ Chew}}+24.1402{\text{ Coh}}} \\ {{W_1}=0.0819{\text{ Temperature}}+0.0277{\text{ Steam}}+0.0185{\text{ Time}}} \end{array}} \right.$$

is very highly correlated (0.9134) and explains the total dependence between all canonical variables at a level of more than 90%.

Principal component and factor analyses

In cases when many features are measured on the same experimental sample, PCA or the more elaborate FA are proposed in the literature to reduce the dimensions of a multivariate data set [6, 7, 13,14,15]. To apply these methods, we proposed to stabilize the variances of the considered textural characteristics of butternut squash during thermal treatment; we divided the measurements of hardness by 10 and multiplied the measurements of springiness and cohesiveness by 100. In principal components analysis, new variables are created as linear combinations of the observed characteristics (features) and proper components as principal vectors. Principal component vectors are included in Table 3, and the explanation of variance by the principal components is illustrated in Fig. 1. The most important principal component is the first one and this explains about 52.6% of the total variability of the original data set. This principal component is strongly correlated with springiness (in bold); moreover, it increases with increasing springiness. The second principal component explained 44.8% of total data variability and was strongly and also positively correlated with hardness. Summarizing, the first two principal components explained more than 97% of the variability, so the remaining three principal components explained only about 3% of data set variability and could be neglected.

Table 3 Principal components of the covariance matrix for texture modified features
Fig. 1
figure 1

Eigenvalues of the covariance matrix (left panel) and explanation of variances by the principal component vectors (right panel)

Each variable contributes significantly to only one component (printed in bold; see Table 3).

Textural characteristics represented by Pearson’s correlations between original features and the first two principal components are illustrated in Fig. 2. We can see that the original textural characteristics, i.e. hardness, gumminess, and chewiness, and their principal component counterparts are close to each other. They are positively correlated with the second principal component and negatively correlated with the first principal component. Another conclusion can be formulated on the basis of springiness or cohesiveness. The correlations for these features are positive both with the first principal component and with the second principal component for springiness. Very small correlations can be observed for cohesiveness with the first two principal component.

Fig. 2
figure 2

Textural features in the space of the first two principal components

Another method for the reduction of the dimensions of data sets is FA in [7]. In a FA, the observed variables are modeled as linear functions of certain unknown factors. The results of FA for the modified data set (after variance stabilization) using the Varimax rotation method are illustrated in Fig. 3.

Fig. 3
figure 3

Textural features in the space of the first two factor components

Hardness, chewiness, and gumminess are very close, i.e. similar (relative to the PCA result). Moreover, cohesiveness and springiness are also closely located. In conclusion, the results of PCA and FA divided the texture characteristics into two separate groups of similar characteristics. The first group contains hardness, chewiness, and gumminess, and the second comprises cohesiveness and springiness.

Cluster analysis of textural properties

In this subsection, we concentrate on detection of the similarity between objects (samples) characterized by the same thermal treatment conditions. For comparison, we had a total of 6 × 6 + 4 × 6 = 60 different samples, characterizing by averages of all the textural characteristics under consideration. So, for the temperature of 80 °C, we considered six levels of steam and five periods of heating; and for the temperature of 100 °C, we considered four levels of steam and five periods. Cluster analysis was applied to identify the similarity of the variability of the textural characteristics between the samples (e.g., [13, 14]). The results are presented in Fig. 4.

Fig. 4
figure 4

Clusters for textural characteristics of butternut squash using Ward’s minimum-variance method (label T denotes temperature, St—steam and time of heating in minutes)

Figure 4 shows that, in the case where five clusters are considered, those samples for the temperature of 100 °C or 80 °C but with higher steam levels are located in the last cluster. Four other clusters mainly contain samples for 80 °C. The first cluster contains samples for the temperature of 80 °C without steam, and the second cluster contains samples characterized by steam 20% or without steam for the temperature of 100%. The higher amount of steam divides the samples into the third and fourth clusters.

Discriminant analysis of butternut squash textural properties

Discriminant analysis was used to answer the question of which of the three factors, i.e. temperature, steam, or duration of thermal treatment, is most responsible for the clear classification of butternut squash textural characteristics in [13]. For each individual factor, based on the original data set for all the features considered, the results are presented in Tables 4, 5 and 6, respectively. When the temperature is regarded as a discriminant variable, the results presented in Table 4 show that the samples at the temperature of 80 °C are well classified in almost 88% of cases, while the samples treated at the temperature of 100 °C are well classified in almost 72%, considering the entire set of measurements of textural characteristics.

Table 4 Percentage of well classified data with respect to temperature (in bold)
Table 5 Percentage of well classified measurements with respect to steam (in bold)
Table 6 Percentage of well classified measurements with respect to time (in bold)

To consider steam as a classification variable, the measurements of the steam values 80% and 100% for the temperature of 80 °C were not included in the calculation. The results in Table 5 support the fact that samples with added steam were not best identified. Only samples without steam were recognized the best (in about 67%), while samples with the steam level of 20% were recognized the worst (in about 29%).

The results when the heating period was considered as a classification variable are presented in Table 6.

Based on the results in Table 6, we can say that, in general, the samples were not well recognized with respect to the duration of heating. The best-recognized samples were those for the shortest time of 5 min (more than 48%). The other classifications with respect to duration of heating are not satisfactory (below 35%).

Therefore, we can conclude that the best recognized factor in the thermal treatment of butternut squash was the temperature of heating and the worst factor was the duration of heating.

Conclusions

The multivariate approach used in the analysis of five textural features of butternut squash during thermal treatment where three factors were applied at several levels allowed the formulation of six general conclusions:

  1. 1.

    The measurements of the textural characteristics (hardness, springiness, gumminess, chewiness, and cohesiveness) confirmed a significant mutual linear dependence according to Pearson’s correlations at the significance level of 0.05. This conclusion was the main motivation to apply multivariate statistical methods in the data set analysis.

  2. 2.

    Based on four-dimensional measurements including hardness, springiness, gumminess, and chewiness (without cohesiveness) satisfying the assumption on normality, the three-way MANOVA confirmed that the main factors in thermal treatment (temperature, steam, and the time of heating and all double interactions between them) significantly differentiate the texture of butternut squash pulp at the level of 0.05.

  3. 3.

    The canonical correlation analysis proved the significant dependence of the textural characteristics with the group of all factors under consideration in the thermal treatment.

  4. 4.

    The PCA and FA with Varimax rotation showed similarities between the variability of hardness, gumminess, and chewiness as well as between springiness and cohesiveness. Moreover, the dimensionality of the textural characteristics can be reduced to two principal components, as they explain 97% of total measurement variability.

  5. 5.

    Some similarities can be observed using clustering analysis when comparing all samples (pumpkin slices) treated at the temperatures of 80 °C and 100 °C, with steam added at the levels of 0%, 20%, 40%, 60%, 80% and 100% for the temperatures of 80 °C, and 0%, 20%, 40%, and 60% for the temperature of 100 °C and when heated for 5, 10, 15, 20, 25, and 30 min. Nevertheless, it is difficult to specify which of the main factors classifies the samples into the clusters.

  6. 6.

    The discriminant analysis applied to the whole data set of textural characteristics confirmed that the temperature in the treatment classified the samples properly in about 88% of cases for the temperature of 80 °C and in about 76% of cases for the temperature of 100 °C. The duration of heating and the added steam in the thermal treatment were not good discriminators for the samples. These factors frequently classified the samples to neighboring classes.