Abstract
Convolutional Neural Networks (CNN) are being widely practised in computer vision applications, where pooling indicates as an integral part. Pooling significantly reduces the training time and computational cost of any network. The pooling operations generally being used are max, average, or stochastic. The performance of the CNN architecture can be increased by using more than one pooling operations. Herein, a novel mixed fuzzy pooling is proposed for image classification in the CNN architecture. In the proposed mixed fuzzy pooling, max pooling and fuzzy pooling are combined together to boost the classification accuracy. The proposed mixed fuzzy pooling is designed as a function of max pooling, fuzzy pooling, and α parameter where the learning of the proposed pooling function takes place with the value of α. In fuzzy pooling, Type-2 fuzzy logic is used to get the convolved dominant features. This is obtained by the estimation of a threshold for image region to be pooled. Thereafter, Type-1 fuzzy logic is used to get the pooling output by a weighted average of convolved dominant features. The performance of various pooling strategies on MNIST dataset for handwritten digits classification and CIFAR-10 dataset for RGB images classification have been compared and analysed. The experimental results show better image classification accuracy for the proposed mixed fuzzy pooling than conventional pooling operations.
Similar content being viewed by others
References
Bishop CM (1995) Neural networks for pattern recognition. Oxford University Press. ISBN: 978-0-19-853864-6
Boureau Y, Ponce J, LeCun Y (2010) A theoretical analysis of feature pooling in visual recognition. In: Proceedings of international conference on machine learning, pp 111–118
Boureau Y, Roux NL, Bach F, Ponce J, LeCun Y (2011) Ask the locals: multi-way local pooling for image recognition. In: Proceedings of international conference on computer vision, pp 2651–2658
Chauhan R, Ghanshala KK, Joshi RC (2018) Convolutional neural network (CNN) for image detection and recognition. In: 2018 First international conference on secure cyber computing and communication (ICSCCC), Jalandhar, India, pp 278–282. https://doi.org/10.1109/ICSCCC.2018.8703316
Chen J, Hua Z, Wang J, Cheng S (2017) A convolutional neural network with dynamic correlation pooling. In: 2017 13th international conference on computational intelligence and security (xCIS), Hong Kong, China, pp 496–499. https://doi.org/10.1109/CIS.2017.00115
Greeshma MS, Bindu VR (2017) Single image super resolution using fuzzy deep convolutional networks. In: International conference on technological advancements in power and energy (TAP Energy), pp 1–6. https://doi.org/10.1109/TAPENERGY.2017.8397224
Juang CF, Hsieh CD (2010) A locally recurrent fuzzy neural network with support vector regression for dynamic-system modeling. IEEE Trans Fuzzy Syst 18(2):261–273. https://doi.org/10.1109/TFUZZ.2010.2040185
Juang CF, Ku KC (2005) A recurrent fuzzy network for fuzzy temporal sequence processing and gesture recognition. IEEE Trans Syst Man Cybern Part B (Cybernetics) 35(4):646–658. https://doi.org/10.1109/tsmcb.2005.844594
Kebria PM, Khosravi A, Nahavandi S, Wu D, Bello F (2020) Adaptive Type-2 fuzzy neural-network control for teleoperation systems with delay and uncertainties. IEEE Trans Fuzzy Syst 28(10):2543–2554. https://doi.org/10.1109/TFUZZ.2019.2941173
Krizhevsky A, Nair V, Hinton G (2009) The CIFAR-10 dataset. Available [Online]: https://www.cs.toronto.edu/kriz/cifar.html. Accessed 11 Apr 2020
Krizhevsky A, Sutskever I, Hinton GE (2021) Imagenet classification with deep convolutional neural networks. Advan Neural Inform Process Syst 25:1097–1105. https://doi.org/10.1145/3065386
Kumar A, Sharma T, Verma NK, Sircar P, Vasikarla S (2019) Detection and removal of salt and pepper noise by Gaussian membership function and Guided filter. In: 48th IEEE applied imagery pattern recognition workshop 2019 (AIPR 2019), Washington, DC, USA, pp 1–9. https://doi.org/10.1109/AIPR47015.2019.9174579
Kumar M, Stoll N, Thurow K, Stoll R (2015) Fuzzy membership descriptors for images. IEEE Trans Fuzzy Syst 24(1):195–207. https://doi.org/10.1109/TFUZZ.2015.2451706
LeCun Y, Boser B, Denker JS, Henderson D, Howard RE, Hubbard W, Jackel LD (1989) Backpropagation applied to handwritten zip code recognition. Neural Comput 1(4):541–551. https://doi.org/10.1162/neco.1989.1.4.541
LeCun Y, Bottou K, Bengio Y, Haffner P (1998) Gradient based learning applied to document recognition. Proc IEEE 86(11):2278–2324. https://doi.org/10.1109/5.726791
LeCun Y, Cortes C, Burges CJC (1998) The MNIST database of handwritten digits. Available [Online]: http://yann.lecun.com/exdb/mnist/. Accessed 30 Apr 2020
Lee CY, Gallagher P, Tu Z (2017) Generalizing pooling functions in CNNs: mixed, gated, and tree. IEEE Trans Pattern Anal Mach Intell 40 (4):863–875. https://doi.org/10.1109/TPAMI.2017.2703082
Lee C-S, Guo S-M, Hsu C-Y (2005) Genetic-based fuzzy image filter and its application to image processing. IEEE Trans Syst Man Cybern Part B (Cybernetics) 35(4):694–711. https://doi.org/10.1109/TSMCB.2005.845397
Mizumoto M, Tanaka K (1976) Some properties of fuzzy sets of type-2. Inf Control 31(4):312–340. https://doi.org/10.1016/S0019-9958(76)80011-3
Ranzato M, Boureau YL, LeCun Y (2007) Sparse feature learning for deep belief networks. Adv Neural Inf Process Syst 20(NIPS 2007):1185–1192. https://doi.org/10.5555/2981562.2981711
Scherer D, Müller A, Behnke S (2010) Evaluation of pooling operations in convolutional architectures for object recognition. In: International conference on artificial neural networks (ICANN) pp 92–101. https://doi.org/10.1007/978-3-642-15825-4_10
Sharma T, Singh V, Sudhakaran S, Verma NK (2019) Fuzzy based pooling in convolutional neural network for image classification. In: 2019 IEEE international conference on fuzzy systems (FUZZ-IEEE), New Orleans, LA, USA, pp 1–6. https://doi.org/10.1109/FUZZ-IEEE.2019.8859010
Sharma T, Verma NK (2020) Estimating depth and global atmospheric light for image dehazing using Type-2 fuzzy approach. IEEE Trans Emerg Topic Comput Intell, 1–10. https://doi.org/10.1109/TETCI.2020.3032970
Sharma T, Verma NK (2021) Adaptive interval type-2 fuzzy filter: an AI agent for handling uncertainties to preserve image naturalness. IEEE Trans Artif Intell 2(1):83–92. https://doi.org/10.1109/TAI.2021.3077522
Zadeh LA (1975) The concept of a linguistic variable and its application to approximate reasoning. Inf Sci 8(3):199–249. https://doi.org/10.1016/0020-0255(75)90036-5
Zeiler MD, Fergus R (2013) Stochastic pooling for regularization of deep convolutional neural networks. In: Proceedings of International Conference on Learning and Representation (ICLR), 1-9. Available [Online]: arXiv:1301.3557v1
Zhou S, Chen Q, Wang X (2014) Fuzzy deep belief networks for semisupervised sentiment classification. Neurocomputing 312322:131. https://doi.org/10.1016/j.neucom.2013.10.011
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflicts of interests/Competing interests
The authors have no competing interests to declare that are relevant to the content of this article.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Sharma, T., Verma, N.K. & Masood, S. Mixed fuzzy pooling in convolutional neural networks for image classification. Multimed Tools Appl 82, 8405–8421 (2023). https://doi.org/10.1007/s11042-022-13553-0
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-022-13553-0