Methods of Authentication of Food Grown in Organic and Conventional Systems Using Chemometrics and Data Mining Algorithms: a Review
There is a general consensus that the consumption of organic food can contribute to a healthy diet; nevertheless, large-scale production of organic food is not an easy task, since it requires intense care because of the number of pests, fungi, and diseases that can wipe out an entire crop. Researchers evaluating food quality are often concerned with the use of pesticides, antibiotics, and hormones in agriculture, along with genetically modified organisms (GMOs) and additives in food processing. A major challenge in this context is therefore how to obtain products that are free of these toxic elements. In this review, we give an overview of research on chemometric tools for extracting variables from several types of food and on the use of data mining techniques and statistical analysis to classify samples grown in organic and conventional systems. The expansion of the organic sector, driven by growing demand and high prices, could lead to fraud. Verification mechanisms that can be used by regulators and supervisory bodies, or even installed in supermarkets so that customers can check products themselves, may therefore deter this type of deception. Recent research has shown that chemometric methods combined with data mining algorithms or statistical methods can successfully classify products grown in organic and conventional systems.
Keywords: Authenticity, Chemometrics, Organic, Data mining, Multivariate analysis, Food authentication, Food fraud
The authors would like to thank the editor and anonymous reviewers whose valuable comments and feedback have helped us to improve the content and presentation of the paper.
Compliance with Ethical Standards
Conflict of Interest
Author Márcio Lima declares that he has no conflict of interest. Author Rommel Barbosa declares that he has no conflict of interest.
This article does not contain any studies with human participants or animals performed by either of the authors.