Skip to main content

Advertisement

Log in

Geographical Origin Identification of Chinese Tomatoes Using Long-Wave Fourier-Transform Near-Infrared Spectroscopy Combined with Deep Learning Methods

  • Published:
Food Analytical Methods Aims and scope Submit manuscript

Abstract

Tomato cultivation in China is concentrated in Xinjiang, Henan, and Shandong provinces, and the quality of tomatoes produced in different regions shows huge differences. Xinjiang tomatoes have extraordinary nutritional compositions and sensory qualities, and counterfeit Xinjiang tomato products are proliferating in the market. This study aimed to investigate the feasibility of identifying the geographical origin of Chinese tomatoes by using long-wave Fourier-Transform near-infrared spectroscopy (FT-NIR, 10,000–4000 cm−1). First, principal component analysis (PCA) was conducted on the raw spectra, and it was found that the first two PCs effectively identified the geographical origin of the tomatoes. Meanwhile, the results of partial least squares discriminant analysis (PLS-DA) combined with different preprocessing methods showed that the PLS-DA model based on the raw spectra achieved the best performance. The optimal PLS-DA model achieved the correct classification rate (CCR) of 97.8% in an external prediction set, and showed that raw spectra contained sufficient valid information. Besides, six algorithms, grid search (GS), genetic algorithm (GA), particle swarm algorithm (PSO), grey wolf algorithm (GWO), improved grey wolf algorithm (IGWO), and sparrow search algorithm (SSA) were employed to optimize the parameters of support vector machine (SVM). The SSA-SVM model exhibited the best performance with a CCR of 97.8% in the prediction set. Afterward, 40 and 8 spectral features were extracted from the raw full spectra using stacked autoencoder (SAE) and PC loading, respectively. Finally, the qualitative analysis model based on the feature variables was further investigated, and the SAE-SSA-SVM simplified model showed the best performance with a CCR of 98.7%, 95.6%, and 95.6% in the calibration, cross-validation, and prediction set, respectively. The results of this study provide theoretical support for applying long-wave FT-NIRS combined with machine learning and deep learning to tomato origin identification.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7

Similar content being viewed by others

Data Availability

The data that support the findings of this study are available from the corresponding author upon reasonable request.

References

Download references

Funding

This work was supported by National Natural Science Foundation of China [grant number 32102071] and the Natural Science Foundation of the Jiangsu Higher Education Institutions of China [grant number 21KJB220013].

Author information

Authors and Affiliations

Authors

Contributions

Weidong Yuan: Methodology, data curation, investigation, validation, funding acquisition, writing—original draft. Hongzhe Jiang: conceptualization, writing—review & editing. Mengmeng Sun: data curation, software. Yu Zhou: formal analysis, methodology. Cong Zhang: investigation. Hongping Zhou: conceptualization, writing—review & editing.

Corresponding author

Correspondence to Hongping Zhou.

Ethics declarations

Competing Interests

Weidong Yuan declares that he has no conflict of interest. Hongzhe Jiang declares that she has no conflict of interest. Mengmeng Sun declares that she has no conflict of interest. Yu Zhou declares that he has no conflict of interest. Cong Zhang declares that he has no conflict of interest. Hongping Zhou declares that he has no conflict of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Yuan, W., Jiang, H., Sun, M. et al. Geographical Origin Identification of Chinese Tomatoes Using Long-Wave Fourier-Transform Near-Infrared Spectroscopy Combined with Deep Learning Methods. Food Anal. Methods 16, 664–676 (2023). https://doi.org/10.1007/s12161-023-02444-1

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s12161-023-02444-1

Keywords

Navigation