Skip to main content
Log in

Mixed fuzzy pooling in convolutional neural networks for image classification

  • 1207: Innovations in Multimedia Information Processing & Retrieval
  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

Convolutional Neural Networks (CNN) are being widely practised in computer vision applications, where pooling indicates as an integral part. Pooling significantly reduces the training time and computational cost of any network. The pooling operations generally being used are max, average, or stochastic. The performance of the CNN architecture can be increased by using more than one pooling operations. Herein, a novel mixed fuzzy pooling is proposed for image classification in the CNN architecture. In the proposed mixed fuzzy pooling, max pooling and fuzzy pooling are combined together to boost the classification accuracy. The proposed mixed fuzzy pooling is designed as a function of max pooling, fuzzy pooling, and α parameter where the learning of the proposed pooling function takes place with the value of α. In fuzzy pooling, Type-2 fuzzy logic is used to get the convolved dominant features. This is obtained by the estimation of a threshold for image region to be pooled. Thereafter, Type-1 fuzzy logic is used to get the pooling output by a weighted average of convolved dominant features. The performance of various pooling strategies on MNIST dataset for handwritten digits classification and CIFAR-10 dataset for RGB images classification have been compared and analysed. The experimental results show better image classification accuracy for the proposed mixed fuzzy pooling than conventional pooling operations.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7

Similar content being viewed by others

References

  1. Bishop CM (1995) Neural networks for pattern recognition. Oxford University Press. ISBN: 978-0-19-853864-6

  2. Boureau Y, Ponce J, LeCun Y (2010) A theoretical analysis of feature pooling in visual recognition. In: Proceedings of international conference on machine learning, pp 111–118

  3. Boureau Y, Roux NL, Bach F, Ponce J, LeCun Y (2011) Ask the locals: multi-way local pooling for image recognition. In: Proceedings of international conference on computer vision, pp 2651–2658

  4. Chauhan R, Ghanshala KK, Joshi RC (2018) Convolutional neural network (CNN) for image detection and recognition. In: 2018 First international conference on secure cyber computing and communication (ICSCCC), Jalandhar, India, pp 278–282. https://doi.org/10.1109/ICSCCC.2018.8703316

  5. Chen J, Hua Z, Wang J, Cheng S (2017) A convolutional neural network with dynamic correlation pooling. In: 2017 13th international conference on computational intelligence and security (xCIS), Hong Kong, China, pp 496–499. https://doi.org/10.1109/CIS.2017.00115

  6. Greeshma MS, Bindu VR (2017) Single image super resolution using fuzzy deep convolutional networks. In: International conference on technological advancements in power and energy (TAP Energy), pp 1–6. https://doi.org/10.1109/TAPENERGY.2017.8397224

  7. Juang CF, Hsieh CD (2010) A locally recurrent fuzzy neural network with support vector regression for dynamic-system modeling. IEEE Trans Fuzzy Syst 18(2):261–273. https://doi.org/10.1109/TFUZZ.2010.2040185

    Google Scholar 

  8. Juang CF, Ku KC (2005) A recurrent fuzzy network for fuzzy temporal sequence processing and gesture recognition. IEEE Trans Syst Man Cybern Part B (Cybernetics) 35(4):646–658. https://doi.org/10.1109/tsmcb.2005.844594

    Article  Google Scholar 

  9. Kebria PM, Khosravi A, Nahavandi S, Wu D, Bello F (2020) Adaptive Type-2 fuzzy neural-network control for teleoperation systems with delay and uncertainties. IEEE Trans Fuzzy Syst 28(10):2543–2554. https://doi.org/10.1109/TFUZZ.2019.2941173

    Article  Google Scholar 

  10. Krizhevsky A, Nair V, Hinton G (2009) The CIFAR-10 dataset. Available [Online]: https://www.cs.toronto.edu/kriz/cifar.html. Accessed 11 Apr 2020

  11. Krizhevsky A, Sutskever I, Hinton GE (2021) Imagenet classification with deep convolutional neural networks. Advan Neural Inform Process Syst 25:1097–1105. https://doi.org/10.1145/3065386

    Google Scholar 

  12. Kumar A, Sharma T, Verma NK, Sircar P, Vasikarla S (2019) Detection and removal of salt and pepper noise by Gaussian membership function and Guided filter. In: 48th IEEE applied imagery pattern recognition workshop 2019 (AIPR 2019), Washington, DC, USA, pp 1–9. https://doi.org/10.1109/AIPR47015.2019.9174579

  13. Kumar M, Stoll N, Thurow K, Stoll R (2015) Fuzzy membership descriptors for images. IEEE Trans Fuzzy Syst 24(1):195–207. https://doi.org/10.1109/TFUZZ.2015.2451706

    Article  Google Scholar 

  14. LeCun Y, Boser B, Denker JS, Henderson D, Howard RE, Hubbard W, Jackel LD (1989) Backpropagation applied to handwritten zip code recognition. Neural Comput 1(4):541–551. https://doi.org/10.1162/neco.1989.1.4.541

    Article  Google Scholar 

  15. LeCun Y, Bottou K, Bengio Y, Haffner P (1998) Gradient based learning applied to document recognition. Proc IEEE 86(11):2278–2324. https://doi.org/10.1109/5.726791

    Article  Google Scholar 

  16. LeCun Y, Cortes C, Burges CJC (1998) The MNIST database of handwritten digits. Available [Online]: http://yann.lecun.com/exdb/mnist/. Accessed 30 Apr 2020

  17. Lee CY, Gallagher P, Tu Z (2017) Generalizing pooling functions in CNNs: mixed, gated, and tree. IEEE Trans Pattern Anal Mach Intell 40 (4):863–875. https://doi.org/10.1109/TPAMI.2017.2703082

    Article  Google Scholar 

  18. Lee C-S, Guo S-M, Hsu C-Y (2005) Genetic-based fuzzy image filter and its application to image processing. IEEE Trans Syst Man Cybern Part B (Cybernetics) 35(4):694–711. https://doi.org/10.1109/TSMCB.2005.845397

    Article  Google Scholar 

  19. Mizumoto M, Tanaka K (1976) Some properties of fuzzy sets of type-2. Inf Control 31(4):312–340. https://doi.org/10.1016/S0019-9958(76)80011-3

    Article  MathSciNet  MATH  Google Scholar 

  20. Ranzato M, Boureau YL, LeCun Y (2007) Sparse feature learning for deep belief networks. Adv Neural Inf Process Syst 20(NIPS 2007):1185–1192. https://doi.org/10.5555/2981562.2981711

    Google Scholar 

  21. Scherer D, Müller A, Behnke S (2010) Evaluation of pooling operations in convolutional architectures for object recognition. In: International conference on artificial neural networks (ICANN) pp 92–101. https://doi.org/10.1007/978-3-642-15825-4_10

  22. Sharma T, Singh V, Sudhakaran S, Verma NK (2019) Fuzzy based pooling in convolutional neural network for image classification. In: 2019 IEEE international conference on fuzzy systems (FUZZ-IEEE), New Orleans, LA, USA, pp 1–6. https://doi.org/10.1109/FUZZ-IEEE.2019.8859010

  23. Sharma T, Verma NK (2020) Estimating depth and global atmospheric light for image dehazing using Type-2 fuzzy approach. IEEE Trans Emerg Topic Comput Intell, 1–10. https://doi.org/10.1109/TETCI.2020.3032970

  24. Sharma T, Verma NK (2021) Adaptive interval type-2 fuzzy filter: an AI agent for handling uncertainties to preserve image naturalness. IEEE Trans Artif Intell 2(1):83–92. https://doi.org/10.1109/TAI.2021.3077522

    Article  Google Scholar 

  25. Zadeh LA (1975) The concept of a linguistic variable and its application to approximate reasoning. Inf Sci 8(3):199–249. https://doi.org/10.1016/0020-0255(75)90036-5

    Article  MathSciNet  MATH  Google Scholar 

  26. Zeiler MD, Fergus R (2013) Stochastic pooling for regularization of deep convolutional neural networks. In: Proceedings of International Conference on Learning and Representation (ICLR), 1-9. Available [Online]: arXiv:1301.3557v1

  27. Zhou S, Chen Q, Wang X (2014) Fuzzy deep belief networks for semisupervised sentiment classification. Neurocomputing 312322:131. https://doi.org/10.1016/j.neucom.2013.10.011

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Teena Sharma.

Ethics declarations

Conflicts of interests/Competing interests

The authors have no competing interests to declare that are relevant to the content of this article.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Sharma, T., Verma, N.K. & Masood, S. Mixed fuzzy pooling in convolutional neural networks for image classification. Multimed Tools Appl 82, 8405–8421 (2023). https://doi.org/10.1007/s11042-022-13553-0

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11042-022-13553-0

Keywords

Navigation