Abstract
Due to the availability of large-scale datasets (e.g., ImageNet, UECFood) and the advancement of deep Convolutional Neural Networks (CNN), computer vision image recognition has evolved dramatically. Currently, there are three major methods for using CNN: starting from scratch, using a pre-trained network off the shelf, and performing unsupervised pre-training with supervised changes. When it comes to those with dietary restrictions, automatic food detection and assessment are critical. In this research, we show how to address detection difficulties by combining three CNNs. The different CNN architectures are then assessed. The amount of parameters in the examined CNN models ranges from 5,000 to 160 million, depending on the number of layers. Second, the various CNNs under consideration are assessed based on dataset sizes and physical image context. The results are assessed in terms of performance vs. training time vs. accuracy. Finally, the accuracy of CNNs is investigated and examined using human knowledge and classification from the human visual system (HVS). Finally, additional categorization techniques, such as bag-of-words, are considered to solve this problem. Based on the findings, it can be concluded that the HVS is more accurate when a data set comprises a wide range of variables. When the dataset is restricted to niche photos, the CNN outperforms the HVS.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Abou Baker, N., Zengeler, N., Handmann, U.: A transfer learning evaluation of deep neural networks for image classification. Mach. Learn. Knowl. Extract. 4(1), 22–41 (2022)
Chen, F., Wei, J., Xue, B., Zhang, M.: Feature fusion and kernel selective in inception-v4 network. Appl. Soft Comput. 119, 108582 (2022)
Girshick, R., Donahue, J., Darrell, T., Malik, J.: Rich feature hierarchies for accurate object detection and semantic segmentation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 580–587 (2014)
Khan, R., Kumar, S., Dhingra, N., Bhati, N.: The use of different image recognition techniques in food safety: a study. J. Food Qual. 2021 (2021)
Lagani, G., Falchi, F., Gennaro, C., Amato, G.: Comparing the performance of Hebbian against backpropagation learning using convolutional neural networks. Neural Comput. Appl. 34, 1–17 (2022)
Lande, M.V., Ridhorkar, S.: A comprehensive survey on content-based image retrieval using machine learning. In: Gupta, D., Polkowski, Z., Khanna, A., Bhattacharyya, S., Castillo, O. (eds.) Proceedings of Data Analytics and Management. LNDECT, vol. 91, pp. 165–179. Springer, Singapore (2022). https://doi.org/10.1007/978-981-16-6285-0_14
Mao, R., He, J., Shao, Z., Yarlagadda, S.K., Zhu, F.: Visual aware hierarchy based food recognition. In: Del Bimbo, A., et al. (eds.) ICPR 2021. LNCS, vol. 12665, pp. 571–598. Springer, Cham (2021). https://doi.org/10.1007/978-3-030-68821-9_47
Ohri, K., Kumar, M.: Review on self-supervised image recognition using deep neural networks. Knowl.-Based Syst. 224, 107090 (2021)
Salim, N.O., Zeebaree, S.R., Sadeeq, M.A., Radie, A., Shukur, H.M., Rashid, Z.N.: Study for food recognition system using deep learning. In: Journal of Physics: Conference Series, vol. 1963, p. 012014. IOP Publishing (2021)
Sharma, P., Sharma, A., et al.: Hybrid approach for food recognition using various filters. Int. J. Adv. Comput. Technol. 11(1), 1–5 (2022)
Touvron, H., Cord, M., Sablayrolles, A., Synnaeve, G., Jégou, H.: Going deeper with image transformers. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 32–42 (2021)
Alom, M.Z., Hasan, M., Yakopcic, C., Taha, T.M., Asari, V.K.: Inception recurrent convolutional neural network for object recognition. Mach. Vis. Appl. 32(1), 1–14 (2021). https://doi.org/10.1007/s00138-020-01157-3
Zhu, Y., Urtasun, R., Salakhutdinov, R., Fidler, S.: segDeepM: exploiting segmentation and context in deep neural networks for object detection. In: 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 4703–4711. IEEE (2015)
Acknowledgements
“This work is funded by National Funds through the FCT Foundation for Science and Technology, IP, within the project Ref UIDB/05583/2020. Furthermore, we would like to thank the Research Centre in Digital Services (CISeD), the Polytechnic of Viseu, for their support”.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Abbasi, M., Wanzeller, C., Cardoso, F., Martins, P. (2023). Comparing Machine Learning vs. Humans for Dietary Assessment. In: de la Iglesia, D.H., de Paz Santana, J.F., López Rivero, A.J. (eds) New Trends in Disruptive Technologies, Tech Ethics and Artificial Intelligence. DiTTEt 2022. Advances in Intelligent Systems and Computing, vol 1430. Springer, Cham. https://doi.org/10.1007/978-3-031-14859-0_2
Download citation
DOI: https://doi.org/10.1007/978-3-031-14859-0_2
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-14858-3
Online ISBN: 978-3-031-14859-0
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)