Introduction

Seed size is an element of the quality factors that affect the yield of the plant in field crops after harvest and is the determining factor in product development [1]. Few studies have been conducted examining the relationship between seed size and quality factors in field crops. In these studies, on different plant patterns, especially soybean [1] and canola [2], it has been reported that seed sizes have an effect on quality characteristics.

Tea (Camellia sinensis) is an important medicinal and aromatic plant in the Theaceae family. Tea seed has a high oil content ranging from 30 to 32%. This seed oil contains oleic acid, linoleic acid and low levels of saturated fatty acids [3, 4]. A healthy and edible oil desired for human health should contain a suitable mixture of oleic, linoleic and linolenic acids, and tea seed also has these properties. Therefore, it has been reported that tea seed oil is beneficial against many diseases [5, 6].

Tea is an income-generating agricultural product that is considered as a beverage as a result of processing its fresh shoots and grown in marginal areas such as rough lands in Turkey [7]. The tea plant, whose leaves are used in Turkey, achieved a yield of 15.620 kg ha−1 and a production of 1.3 million tons in areas of 83.200 ha in 2022 and was grown in the provinces of Artvin, Giresun, Ordu, Rize and Trabzon [8].

Different chemometric analysis methods were used in order to better see, understand and compare the results obtained for the parameters examined in recent studies. The methods used in these studies include correlation analysis [9, 10], principal component analysis [11, 12] and agglomerative hierarchical clustering analysis [13, 14] were also widely used. Some studies were conducted in which the important quality parameters of the tea plant were examined by chemometric analysis such as principal component analysis and cluster analysis [15, 16].

This study was planned to determine the effects of different seed size on seed internal ratio, crude oil content, fatty acid and mineral compositions in the tea plant, which has an economic and widespread use value, by using different chemometric analysis methods (correlation analysis, agglomerative hierarchical clustering analysis and principal component analysis).

Materials and methods

Plant material and grouping

The seeds of the tea plant used in the study were collected in 2021 from a local producer in Artvin/Turkey. After the harvest, the seeds were divided into four different groups with the size of 11.00–12.50 mm (SD1), 12.51–14.00 mm (SD2), 14.01–15.50 mm (SD3) and 15.51–17.00 mm (SD4).

Determination of seed internal ratio

The internal ratio of dry seeds divided into different groups was determined before the analysis. While determining the internal ratio, 10 g seeds were taken in three parallels for each group and their inner parts were separated from their shells and weighed. Afterwards, the seed kernels were ground with the help of a grinder and kept in the refrigerator at + 4 ℃ until analysis.

Crude oil content analysis

The ground 3 g dry seed samples were analyzed with petroleum ether in the oil analyzer and the oils obtained as a result of the analysis were kept in the refrigerator at + 4 ℃ until the component analysis. Crude oil content analysis was performed in three parallels and the results were expressed as % [17].

Fatty acid composition analysis

Fatty acid methyl esters (FAMEs) of seed samples were obtained according to one-step extraction-transesterification method [18] and oil profile was determined by gas chromatograph (Schimadzu, GC 2010 plus) instrument equipped with FID (flame ionization detector) (Schimadzu, Kyoto, Japan) detector and capillary column (60 m, i.d. = 0.53 mm, Restek RTX-200). During the analysis, the H2 was used as the carrier gas and the temperature gradient program was used to separate the FAMEs [19]. While defining the components obtained as a result of the analysis, the retention times of the peaks were compared using a standard (Sigma Supelco #37, Supelco Inc., Bellefonte, PA, USA).

Mineral composition analysis

The mineral composition (Al, B, Ca, Co, Cr, Cu, Fe, K, Mg, Mn, Na, Ni, P, Pb, S and Zn) of tea samples was determined with ICP/OES spectrophotometer (Inductively Couple Plasma spectrophotometer) device (Agilent, 5110 Optima, ICP/OES). During the analysis, 0.2 g of the ground seed samples were treated with a nitric acid-hydrogen peroxide (2:3) mixture and analyzed in a microwave wet burning unit (Anton Paar Microwave) resistant to 40 bar pressure. The results obtained as a result of the analysis were expressed as mg kg−1 [20, 21].

Statistical analysis

Statistical analysis of the data obtained from all parameters examined after the study were performed with the SPSS package program (IBM SPSS, Version 21.0, Chicago, USA) in triplicate. In the study, one-way analysis of variance (ANOVA) and Duncan's Test were used to determine the differences between the groups (p < 0.05). The XLSTAT program (Lumivero 2023, XLSTAT statistical and data analysis solution, https://www.xlstat.com/en) was used to compare the average results with correlation analysis, agglomerative hierarchical clustering analysis and principal component analysis methods.

Results and discussion

Seed internal ratio, crude oil content and fatty acid composition

The averages of seed internal ratio, crude oil content, fatty acid composition, SFA, MUFA and PUFA values according to different seed size are given in Table 1. According to the analysis of variance, crude oil content, palmitic acid (C16:0), stearic acid (C18:0), oleic acid (C18:1n9c), linoleic acid (C18:2n6c) and eicosenoic acid (C20:1n9), saturated fatty acid (SFA), monounsaturated fatty acid (MUFA) and polyunsaturated fatty acid (PUFA) values were statistically significant (p < 0.05), while the α-linolenic acid (C18:3n3) content and seed internal ratio were found to be insignificant (p > 0.05). In the study, the seed internal ratio value changed between 65.92 and 69.40%, and although it was statistically insignificant, it had the highest value in the SD2 group. The crude oil content varied between 26.53 and 29.39%, and SD2, SD3 groups had the highest values, which were not statistically different in terms of crude oil content. In addition, the crude oil content value (26.53%) in the SD4 group showed a significant decrease. Six different fatty acids were determined according to different seed size in fatty acid composition analysis (Table 1). Changes in the major fatty acids and SFA, MUFA and PUFA values are presented in Fig. 1. The oleic acid component was determined as the major component. This component was followed by linoleic acid, palmitic acid and stearic acid components, respectively (Fig. 1). The oleic acid component had the highest value in SD4 group (60.43%) compared to other groups. The linoleic acid component had the highest value (22.63%) in the SD1 group. In addition, palmitic acid and stearic acid components reached their highest values in SD2 (15.46%) and SD1 (2.28%) groups, respectively. The SFA values were between 16.71 and 17.52%, MUFA values were between 59.05 and 60.69% and PUFA values were between 22.27 and 23.43%. The SD1 group achieved the highest results in SFA and PUFA values, while SD4 group achieved the highest results in MUFA values (Table 1). In a study by Wang et al. (2011) [22] reported that the crude oil content of the tea seed was 25.3% as a result of Soxhlet extraction and the fatty acid compositions were oleic acid (52.9%), linoleic acid (24.2%) and palmitic acid (17.7%). In another study, Zhenggang et al. (2021) [4] reported that Camellia sinensis seed oil ranged from 31.04 to 33.20% and the oleic acid content was the highest (59.21%) among all determined fatty acids. In addition, they reported the rate of SFA as 16.83%. In another study, it was reported that the crude oil content in tea seeds varied between 22.5 and 28.9% [23]. Our study results regarding crude oil content were found to be higher than the value reported by Wang et al. (2011) [22], lower than the value reported by Zhenggang et al. (2021) [4] and similar to the value reported by Yahaya et al. (2019) [23]. These differences in crude oil content value may vary depending on the extraction method and the solvent preferred during analysis. Additionally, our results for oleic acid in fatty acid composition were found to be higher and similar to the values reported by Wang et al. (2011) [22] and Zhenggang et al. (2021) [4], respectively.

Table 1 Variation of crude oil content, fatty acid composition and seed internal rate values (%) according to different seed size
Fig. 1
figure 1

Change in major fatty acid compositions (components below 1% were not included in the figure) and SFA, MUFA, PUFA values according to different seed size, (Error bars represent standard deviation data, C16:0 palmitic acid, C18:0 stearic acid, C18:1n9c oleic acid, C18:2n6c linoleic acid, SFA saturated fatty acid, MUFA mono-unsaturated fatty acid, PUFA poly-unsaturated fatty acid)

Mineral composition

Average results of mineral composition (macro, micro elements and heavy metals) according to different seed size are given in Table 2. In analysis of variance of mineral composition values according to different seed size, Al, Fe and Mg values were found to be statistically significant (p < 0.05), while B, Ca, Co, Cr, Cu, K, Mn, Na, Ni, P, Pb, S and Zn values were found to be insignificant (p > 0.05) (Table 2). Changes in micro (Fig. 2A) and macro (Fig. 2B) element values according to different seed size are presented as detail in Fig. 2. The five micro (B, Cu, Fe, Mn and Zn) and six macro (Ca, K, Mg, Na, P and S) elements were determined in this study (Fig. 2). Considering the parameters found to be important in this study, the Al content varied between 68.36 and 85.69 mg kg−1 and although there was no statistical difference between the last three groups, it had the highest value in the SD3 group. The Fe content varied between 7.27 and 11.91 mg kg−1 and although there was no statistical difference between the last three groups, it had the highest value in the SD4 group. The Mg content varied between 678.9 and 833.9 mg kg−1 and although there was no statistically significant difference between the first three groups, it had the highest value in the SD3 group. In addition, although statistically insignificant, the B content was highest in the SD1 group, the Ca, Mn, P, Pb and Zn contents in the SD2 group, the Co, Cr, Cu, K, Na and Ni contents in the SD3 group, and the Co and S contents in the SD4 group (Table 2). In a study by Yahaya et al. (2019) [23] reported that the contents of Na, Zn, Mg, Fe, K and Ca in tea seed ranged between 2.91–3.20, 3.81–4.10, 145.9–163.1, 0.063–0.095, 21.94–23.65 and 16.16–18.41 mg 100 g−1, respectively.

Table 2 Variation of mineral composition values (mg kg−1) according to different seed size
Fig. 2
figure 2

Change in micro (A) and macro (B) element values according to different seed size, (Error bars represent standard deviation data, ns: not significant)

Chemometric analysis

The results of the seed internal ratio, crude oil content, fatty acid composition, mineral composition, which were made by grouping the tea seed according to different seed size, were examined using three different analysis methods (correlation analysis, agglomerative hierarchical clustering analysis and principal component analysis).

Correlation analysis

The correlation analysis results for seed internal ratio, crude oil content, fatty acid composition and SFA, MUFA, PUFA values according to different seed size are given in Table 3 and the correlation analysis results for mineral composition values are given in Table 4. The statistically significant positive and negative correlations between these properties are presented as detail in Fig. 3. According to correlation analysis, four different positive correlations were found between fatty acid composition, MUFA and PUFA values (Fig. 3A). Accordingly, positive correlations were found between oleic acid content and MUFA, between linoleic acid and α-linolenic acid, between linoleic acid and PUFA, and between α-linolenic acid and PUFA (p < 0.05) (Table 3). According to correlation analysis, seven positive and three negative correlations were found between the Al, B, Ca, Co, Cr, Cu, K, Mg, Na, Ni, P, Pb, S and Zn values (Fig. 3B). There were positive correlations between Al and Na, between Ca and Pb, between Ca and Zn, between Co and K, between Cr and Cu, between Mg and Ni, between Mg and P. Negative correlations were found between B and K, between Mg and S, between Ni and S (p < 0.05) (Table 4).

Table 3 Correlation relationships of crude oil content, fatty acid composition and seed internal rate values according to different seed size
Table 4 Correlation relationships of mineral composition values according to different seed size
Fig. 3
figure 3

Correlation diagrams of statistically significant parameters of fatty acids, MUFA and PUFA (A) and mineral (B) compositions according to different seed size, (PC positive correlation, NC negative correlation, C18:1n9c oleic acid, C18:2n6c linoleic acid, C18:3n3 α-linolenic acid, MUFA mono-unsaturated fatty acid, PUFA poly-unsaturated fatty acid, p < 0.05: significant at the 5% level)

Principal component analysis

The principal component analysis graph of seed internal ratio, crude oil content and fatty acid composition values according to different seed size is given in Fig. 4A, and the principal component analysis graph according to mineral composition values is given in Fig. 4B. Examining Fig. 4A, PC1 (62.18%) and PC2 (30.11%) explained 92.29% of the total variance. According to the graph (Fig. 4A), it was seen that the SD4 group was different from the other groups in terms of oleic acid and MUFA. Also, the SD1 group was different from the other groups in terms of stearic acid, linoleic acid, α-linolenic acid, eicosenoic acid and PUFA. When Fig. 4B is examined, PC1 (59.06%) and PC2 (29.35%) explained 88.41% of the total variance. According to the graph (Fig. 4B), it was seen that the SD1 group was different from the other groups in terms of B content. The SD2 group was different from the other groups in terms of Mn and P contents, the SD3 group was different from the other groups in terms of Al and Na contents and the SD4 group was different from the other groups in terms of Fe and S contents.

Fig. 4
figure 4

Principal component analysis graphs of crude oil content, fatty acid composition, seed internal ratio (A) and mineral composition (B) values by different seed size, (COC crude oil content, SIR seed internal ratio, C16:0 palmitic acid, C18:0 stearic acid, C18:1n9c oleic acid, C18:2n6c linoleic acid, C18:3n3 α-linolenic acid, C20:1n9 eicosenoic acid, SFA saturated fatty acid, MUFA mono-unsaturated fatty acid, PUFA poly-unsaturated fatty acid)

Agglomerative hierarchical clustering analysis

The dendrogram graph based on seed internal ratio, crude oil content and fatty acid composition values according to the different seed size is given in Fig. 5A, and the dendrogram graph based on the mineral composition values is given in Fig. 5B. When Fig. 5A is examined, two groups were formed according to seed internal ratio, crude oil content and fatty acid composition value. The SD1 was alone in the 1st group, SD2, SD3 and SD4 were in the 2nd group together (Fig. 5A). When Fig. 5B is examined, two groups were formed according to the mineral composition. SD1 and SD4 were in the 1st group together, SD2 and SD3 were in the 2nd group together (Fig. 5B). In some studies, some quality characteristics of the tea plant were examined using correlation, principal component analysis and cluster analysis [15, 16, 24,25,26,27]. In a study, Liu et al. (2022) [27] used cluster and correlation analyzes to examine the mineral composition in tea. In a study examining the fatty acid composition of leaves of different tea species collected at different times, Anh et al. (2016) [24] demonstrated the functional use of principal component analysis to compare fatty acids in Camellia sinensis. In another study, Wang et al. (2017) [25] used multivariate chemometric methods such as principal component analysis and hierarchical clustering analysis to reveal relationships between fatty acid components of the tea plant. In this study, it was reported that the samples examined as a result of principal component analysis were divided into three groups and the variability distribution was formed by two main PCs that explained 77.4% of the total variance.

Fig. 5
figure 5

Dendrogram graphs of crude oil content, fatty acid composition, seed internal ratio (A) and mineral composition (B) by different seed size

Conclusion

At the end of the study, crude oil content, palmitic acid (C16:0), stearic acid (C18:0), oleic acid (C18:1n9c), linoleic acid (C18:2n6c) and eicosenoic acid (C20:1n9), SFA, MUFA, PUFA, Al, Fe and Mg values were found to be significant according to different seed size. In terms of crude oil content, SD2 and SD3 groups had the highest values, although they were not statistically different. In the study, six different fatty acids were determined according to different seed size in the tea seed oil, and oleic acid was the main component. According to correlation analysis, positive correlations were found between oleic acid and MUFA, between linoleic acid and α-linolenic acid, between linoleic acid and PUFA, between α-linolenic acid and PUFA, between Al and Na, between Ca and Pb, between Ca and Zn, between Co and K, between Cr and Cu, between Mg and Ni, between Mg and P. According to the principal component analysis, it was determined that the SD1 group was different from the other groups in terms of the ratio of stearic acid, linoleic acid, α-linolenic acid, eicosenoic acid, PUFA and B. In addition, according to the agglomerative hierarchical clustering analysis, seed sizes were divided into two different groups according to seed internal ratio, crude oil content, fatty acid and mineral composition. At the end of the study, it can be said that different chemometric analysis methods such as correlation analysis, agglomerative hierarchical clustering analysis and principal component analysis were useful and decisive in determining the quality characteristics of tea seeds classified according to seed size.