On Combining Boosting with Rule-Induction for Automated Fruit Grading

Conference paper

Abstract

The automation of post-harvest fruit grading in the industry is a problem that is receiving considerable attention in the realm of computer vision and machine learning. Classification accuracy with automated systems in this domain is a challenge given the inherent variability in the visual appearance of fruit and its quality-determining features. While the accuracy of automated systems is of paramount importance, the usability and the interpretability of machine learning solutions to the operators are also crucial since many sophisticated algorithms involve numerous tunable parameters and are often “black-boxes”. This research presents a generalizable machine learning solution that balances the need for high accuracy and usability by decomposing the problem into sub-tasks. A powerful boosting algorithm (AdaBoost.ECC) with low interpretability is employed for learning fruit-surface characteristics. The classification outputs of boosting then become inputs for rule-induction algorithms (RIPPER and FURIA), generating human-interpretable rule sets that are amenable for review and revisions by operators. Using seven datasets of different fruit varieties, the performance of the proposed method was compared against a manually calibrated commercial fruit-grading system. The results showed that the proposed system is able to match the accuracy of machines calibrated by domain experts having many years of experience, while providing simpler rule sets possessing high interpretability and usability while yielding knowledge discovery.

Keywords

AdaBoost.ECC Boosting Classification Decomposition strategies Fruit grading FURIA Machine learning RIPPER Rule-induction 

Notes

Acknowledgment

The authors express their gratitude to Compac Sorting Ltd. for providing access to their datasets, grading maps and the necessary software for these experiments.

References

  1. 1.
    T. Brosnan, D.W. Sun, Improving quality inspection of food products by computer vision: a review. J. Food Eng. 61(1), 3–16 (2004)CrossRefGoogle Scholar
  2. 2.
    S. Cubero, N. Aleixos, E. Molto, J. Gómez-Sanchis, J. Blasco, Advances in machine vision applications for automatic inspection and quality evaluation of fruits and vegetables. Food Bioprocess. Tech. 4(4), 487–504 (2011). doi: 10.1007/s11947-010-0411-8 CrossRefGoogle Scholar
  3. 3.
    C.J. Du, D.W. Sun, Learning techniques used in computer vision for food quality evaluation: a review. J. Food Eng. 72(1), 39–55 (2006)CrossRefGoogle Scholar
  4. 4.
    D. Lorente, N. Aleixos, J. Gómez-Sanchis, S. Cubero, O. García-Navarrete, J. Blasco, Recent advances and applications of hyperspectral imaging for fruit and vegetable quality assessment. Food Bioprocess. Tech. 5(4), 1121–1142 (2012)CrossRefGoogle Scholar
  5. 5.
    J. Gómez-Sanchis, J.D. Martín-Guerrero, E. Soria-Olivas, M. Martínez-Sober, R. Magdalena-Benedito, J. Blasco, Detecting rottenness caused by penicillium genus fungi in citrus fruits using machine learning techniques. Expert Syst. Appl. 39(1), 780–785 (2012)CrossRefGoogle Scholar
  6. 6.
    T. Sugiyama, J. Sugiyama, M. Tsuta, K. Fujita, M. Shibata, M. Kokawa, T. Araki, H. Nabetani, Y. Sagara, Nir spectral imaging with discriminant analysis for detecting foreign materials among blueberries. J. Food Eng. 101(3), 244–252 (2010)CrossRefGoogle Scholar
  7. 7.
    J. Zhao, Q. Ouyang, Q. Chen, J. Wang, Detection of bruise on pear by hyperspectral imaging sensor with different classification algorithms. Sens. Lett. 8(4), 570–576 (2010)CrossRefGoogle Scholar
  8. 8.
    V. Leemans, M.F. Destain, A real-time grading method of apples based on features extracted from defects. J. Food Eng. 61(1), 83–89 (2004)CrossRefGoogle Scholar
  9. 9.
    C.S Nandi, B. Tudu, C. Koley, An automated machine vision based system for fruit sorting and grading, in Proceedings of the 6th International Conference on Sensing Technology (ICST, IEEE), 2012, pp. 195–200Google Scholar
  10. 10.
    K. Mollazade, M. Omid, A. Arefi, Comparing data mining classifiers for grading raisins based on visual features. Comp. Electron. Agr. 84, 124–131 (2012)CrossRefGoogle Scholar
  11. 11.
    H. Zheng, H. Lu, Y. Zheng, H. Lou, C. Chen, Automatic sorting of Chinese jujube (zizyphus jujuba mill. cv. ‘hongxing’) using chlorophyll fluorescence and support vector machine. J. Food Eng. 101(4), 402–408 (2010). doi: 10.1016/j.jfoodeng.2010.07.028 CrossRefGoogle Scholar
  12. 12.
    J. Qin, T.F. Burks, M.A. Ritenour, W.G. Bonn, Detection of citrus canker using hyperspectral reflectance imaging with spectral information divergence. J. Food Eng. 93(2), 183–191 (2009). doi: 10.1016/j.jfoodeng.2009.01.014 CrossRefGoogle Scholar
  13. 13.
    J. Qin, T.F. Burks, X. Zhao, N. Niphadkar, M.A. Ritenour, Development of a two-band spectral imaging system for real-time citrus canker detection. J. Food Eng. 108(1), 87–93 (2012). doi: 10.1016/j.jfoodeng.2011.07.022 CrossRefGoogle Scholar
  14. 14.
    J. Blasco, S. Cubero, J. Gómez-Sanchís, P. Mira, E. Moltó, Development of a machine for the automatic sorting of pomegranate (Punica Granatum) arils based on computer vision. J. Food Eng. 90(1), 27–34 (2009). doi: 10.1016/j.jfoodeng.2008.05.035 CrossRefGoogle Scholar
  15. 15.
    M. Barnes, T. Duckett, G. Cielniak, G. Stroud, G. Harper, Visual detection of blemishes in potatoes using minimalist boosted classifiers. J. Food Eng. 98(3), 339–346 (2010)CrossRefGoogle Scholar
  16. 16.
    W. Huang, C. Zhang, B. Zhang, in Identifying Apple Surface Defects Based on Gabor Features and SVM Using Machine Vision, eds. by D. Li, Y. Chen. Computer and Computing Technologies in Agriculture V, IFIP Advances in Information and Communication Technology, vol. 370 (Springer, Berlin, 2012), pp. 343–350. doi: 10.1007/978-3-642-27275-2_39
  17. 17.
    P. Domingos, A few useful things to know about machine learning. Commun. ACM 55(10), 78–87 (2012). doi: 10.1145/2347736.2347755 CrossRefGoogle Scholar
  18. 18.
    V. Guruswami, A. Sahai, Multiclass learning, boosting, and error-correcting codes, in Proceedings of the 12th Annual Conference on Computational Learning Theory (COLT’99) (ACM, New York, 1999), pp. 145–155Google Scholar
  19. 19.
    J. Fürnkranz, D. Gamberger, N. Lavrac, Foundations of Rule Learning (Springer, New York, 2012)Google Scholar
  20. 20.
    J. Hühn, E. Hüllermeier, Furia: an algorithm for unordered fuzzy rule induction. Data Min. Knowl. Disc. 19(3), 293–319 (2009). doi: 10.1007/s10618-009-0131-8 CrossRefGoogle Scholar
  21. 21.
    W. Cohen, Fast effective rule induction, in Proceedings of the 12th International Conference on Machine Learning, 1995, pp. 115–123Google Scholar
  22. 22.
    T. Susnjak, A. Barczak, N. Reyes, A decomposition machine-learning strategy for automated fruit grading, in Proceedings of the World Congress on Engineering and Computer Science (WCECS 2013), (San Francisco, 2013), pp. 819–825Google Scholar
  23. 23.
    Y. Freund, R. Schapire, A decision-theoretic generalization of on-line learning and an application to boosting. J. Comput. Syst. Sci. 55(1), 119–139 (1997)CrossRefMATHMathSciNetGoogle Scholar
  24. 24.
    D. Unay, B. Gosselin, Stem and calyx recognition on ‘jonagold’ apples by pattern recognition. J. Food Eng. 78(2), 597–605 (2007)CrossRefGoogle Scholar
  25. 25.
    O. Maimon, L. Rokach, Decomposition methodology for knowledge discovery and data mining: Data Mining and Knowledge Discovery Handbook (Springer, New York, 2005), pp. 981–1003Google Scholar
  26. 26.
    A.L. Samuel, Some studies in machine learning using the game of checkers ii: recent progress. IBM J. Res. Dev. 11(6), 601–617 (1967)CrossRefGoogle Scholar
  27. 27.
    A.J.C. Sharkey, G.O. Chandroth, N.E. Sharkey, A multi-net system for the fault diagnosis of a diesel engine. Neural Comput. Appl. 9(2), 152–160 (2000)CrossRefGoogle Scholar
  28. 28.
    L. Rokach, Decomposition methodology for classification tasks: a meta decomposer framework. Pattern Anal. Appl. 9(2), 257–271 (2006)CrossRefMathSciNetGoogle Scholar
  29. 29.
    Y. Freund, R.E. Schapire, A short introduction to boosting. J. JSAI 14(5), 771–780 (1999)Google Scholar
  30. 30.
    F. Yuan, X. Li, W. Li-ming, P. Le-ping, S. Ying, in Knowledge Discovery of Energy Management System Based on Prism, Furia and J48, vol. 100, ed. by M. Ma. Communication Systems and I.T (Lecture Notes in Electronic Engineering). (Springer, Berlin, Heidelberg, 2011) pp. 593–600Google Scholar
  31. 31.
    V. Kumari, P. Kumar, Fuzzy unordered rule induction for evaluating cardiac arrhythmia. Biomed. Eng. Lett. 3(2), 74–79 (2013). doi: 10.1007/s13534-013-0096- CrossRefGoogle Scholar
  32. 32.
    I. Witten, E. Frank, M. Hall, Data Mining: Practical Machine Learning Tools and Techniques (Morgan Kaufmann, San Francisco, 2011)Google Scholar
  33. 33.
    F. Provost, T. Fawcett, Analysis and visualization of classifier performance: comparison under imprecise class and cost distributions, in Proceedings of the 3rd International Conference on Knowledge Discovery and Data Mining (AAAI Press, 1997), pp. 43–48Google Scholar
  34. 34.
    Y. Sun, M. Kamel, Y. Wang, Boosting for learning multiple classes with imbalanced class distribution, in Proceedings of 6th International Conference on Data Mining ICDM’06, 2006, pp. 592–602Google Scholar
  35. 35.
    R. Iman, J. Davenport, Approximations of the critical region of the fbietkan statistic. Commun. Stat. A Theor. 9(6), 571–595 (1980)CrossRefGoogle Scholar

Copyright information

© Springer Science+Business Media Dordrecht 2014

Authors and Affiliations

  1. 1.School of Engineering and Advanced TechnologyMassey UniversityAlbanyNew Zealand
  2. 2.Institute of Natural and Mathematical SciencesAlbanyNew Zealand

Personalised recommendations