Image Classification Using Deep Learning and Fuzzy Systems

Ravi, Chandrasekar

doi:10.1007/978-3-030-16660-1_50

Image Classification Using Deep Learning and Fuzzy Systems

Chandrasekar Ravi¹⁸

Conference paper
First Online: 14 April 2019

1152 Accesses
1 Citations

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 941))

Abstract

Classification of images is a significant step in pattern recognition and digital image processing. It is applied in various domains for authentication, identification, defense, medical diagnosis and so on. Feature extraction is an important step in image processing which decides the quality of the model to be built for image classification. With the abundant increase in data now-a-days, the traditional feature extraction algorithms are finding difficulty in coping up with extracting quality features in finite time. Also the learning models developed from the extracted features are not so easily interpretable by the humans. So, considering the above mentioned arguments, a novel image classification framework has been proposed. The framework employs a pre-trained convolution neural network for feature extraction. Brain Storm Optimization algorithm is designed to learn the classification rules from the extracted features. Fuzzy rules based classifier is used for classification. The proposed framework is applied on Caltech 101 dataset and evaluated using accuracy of the classifier as the performance metric. The results demonstrate that the proposed framework outperforms the traditional feature extraction based classification techniques by achieving better accuracy of classification.

Download conference paper PDF

1 Introduction

The discipline of Artificial Intelligence that describes the manner in which knowledge from videos or images to computers is termed as Computer Vision. This inter disciplinary branch enables the machines to see the real world just as a human eye can do in co-ordination with the brain [1,2,3]. It extracts the features from the images, analyses the images using the extracted features and finally derives useful knowledge automatically. This is all totally done with the help of mathematical techniques in the background [4]. The source of information for this computer vision system in often images or videos from single or several cameras. The data could also be multi dimensional like the data from medical imaging devices [5]. These systems finds several applications in various domains like identification (fingerprint, iris, face, voice recognition and so on), manufacturing industry which can use it to detect faults, robotics for building robots for various purposes, surveillance, medical image processing (cancer, neurological disorder detection and so on), unmanned vehicles and so on. The objective of such systems would be to decide whether the target image belongs to a particular class or not. This decision is made by using several steps like image acquisition, preprocessing, feature extraction, segmentation, image recognition, image registration and decision making. In this paper, few of the aforementioned steps are used for image classification. The remaining sections of this paper are organized as follows. Section 2 discusses literature review. Section 3 proposes the novel framework. Section 4 discusses the findings and finally conclusion summarizes the entire paper.

2 Literature Review

Table 1, summarizes the literature based on image classification using convolution neural network for feature extraction. The review clearly highlights that the researchers are recently focusing on feature extraction using convolution neural network. The benefit of this approach is that huge volume and wide variety of images can be used to train the convolution neural network to extract the features. Thus the extracted features would be of good quality and thus result in a better classifier model. Also the enormous time and hardware required to learn from the huge volume of images can be drastically reduced by employing a pre-trained convolution neural network for feature extraction. Fuzzy logic based classification is becoming popular recently due to its capacity to handle uncertainty and produce highly interpretable knowledge.

Table 1. Summary of literature.

Full size table

3 Proposed Framework

The proposed framework is depicted in Fig. 1. The Caltech-101 dataset [20] contains images of 101 categories of objects. ResNet [21] is the Convolutional Neural Networks trained on ImageNet dataset, that has 1000 categories of object and 1.2 million images. When the Caltech-101 dataset is given as input to Resnet-50, the features are extracted. Then, Caltech-101 is partitioned into training and test datasets which comprises the features extracted using Resnet-50. The training dataset is given as input to the Brain Storm optimization [22] algorithm which derives the optimal rule base for the image classification. The test set is then classified using the Fuzzy Inference System and the optimal rule base.

The Brain Storm optimization [22] algorithm, described below, is customized according to the proposed framework, to produce optimal rule base. The brainstorm optimization algorithm is preferred than other algorithms like Genetic algorithm, Particle Swarm Optimization and so on. This is because brainstorm optimization algorithm is based on brain storming activity done by humans, whereas other algorithms are based on the social behavior of birds, ants, etc. in Particle Swarm Optimization, Ant Colony Optimization respectively. Since humans are considered to be the superior most in the ecology, the brainstorm optimization algorithm is assumed to produce the best results.

The individuals in the population of the brainstorm optimization algorithm are called as ideas. The ideas are represented as vectors for easy of computation. The idea vector consists of ‘m + 2’ elements, where ‘m’ represents the number of features extracted by Resnet-50, (m + 1)^th element represents image class and the (m + 1)^th element represents the AND or OR method used by the Fuzzy Inference System. Initial population consists of ‘n’ ideas.

The fitness function for the brainstorm optimization algorithm to generate optimal rule base is designed based on two factors, namely, length of the rule and adaptiveness of the rules to the training dataset. Adaptiveness of the rules is described based on how well the rules match the training dataset. Generally optimal rules are those which are having small length and more adaptive. Hence the fitness function is inversely proportional to rule length and directly proportional to adaptivity. The below Eq. (1) describes the proposed fitness function.

$$ {\text{F}} = \left( {{\text{w}}*{\text{m/l}}} \right) + \left( {{\text{w}}*{\text{r/p}}} \right) $$

(1)

Where, ‘w’ is a constant deciding the weightage of the length of the rule and adaptiveness factor. Generally ‘w’ takes the value 0.5 indicating that both the factors are of equal weightage. ‘m’ represents the number of features extracted by Resnet-50, ‘l’ represents the length of the rule generated by brain storm optimization algorithm, ‘r’ represents the number of rules matching the training dataset and ‘p’ represents the total number of instances in the training dataset.

The ‘e’ and ‘k’ constants, which ranges between 0 to 1, are experimentally decided. The input probabilities P_5a, P_6b, P_6c are randomly chosen between 0 to 1. Initially ‘n’ ideas are generated and clustered into groups. The cluster center is decided based on the fitness value of the ideas in the cluster. New ideas are generated and worst ideas are replaced with them. This is repeated until the maximum fitness value is achieved for the cluster centers. These cluster centers form the optimal rule base for the proposed framework.

4 Results and Discussions

The features are extracted from Caltech-101 dataset using a traditional Local Binary Pattern (LBP) [23] approach and classified using traditional classifiers like support vector machine, naïve bayes, decision tree and k-nearest neighbours. Then the features are extracted from Caltech-101 dataset using Resnet-50 and classified using Fuzzy Inference System.

The ‘e’ and ‘k’ values for the brain storm optimization algorithm are experimentally decided. Figure 2 depicts that the average accuracy of classification is constant for ‘e’ values in the range 0.3 to 0.7 and Fig. 3 depicts that the average accuracy of classification is constant for ‘k’ values in the range 10 to 30. Thus ‘e’ and ‘k’ are chosen in this range.

The accuracy of the classifier is defined as the ratio of correctly classified images to total images in the dataset. The k-folds average accuracy of the above combinations of feature extraction and classification in tabulated in Table 2. From the results, it is evident that the proposed framework outperforms the traditional feature extraction techniques based classification.

Table 2. Comparison of accuracy.

Full size table

5 Conclusion

A novel image classification framework has been proposed. The framework employs a pre-trained convolution neural network for feature extraction. Brain Storm Optimization algorithm is designed to learn the classification rules from the extracted features. Fuzzy rules based classifier is used for classification. The proposed framework is applied on Caltech 101 dataset and evaluated using accuracy of the classifier as the performance metric. The results demonstrate that the proposed framework outperforms the traditional feature extraction based classification techniques by achieving better accuracy of classification.

References

Ballard, D.H., Brown, C.M.: Computer Vision. Prentice Hall, Upper Saddle River (1982)
Google Scholar
Huang, T., Vandoni, C.: Computer Vision: Evolution and Promise. 19th CERN School of Computing, pp. 21–25. CERN, Geneva (1996)
Google Scholar
Sonka, M., Hlavac, V., Boyle, R.: Image Processing, Analysis, and Machine Vision. Thomson, Pacific Grove (2008)
Google Scholar
http://www.bmva.org/visionoverview
Murphy, M.: Star Trek’s tricorder medical scanner just got closer to becoming a reality
Google Scholar
Yang, X., Gao, X., Song, B., Yang, D.: Aurora image search with contextual CNN feature. Neurocomputing 281, 67–77 (2018)
Article Google Scholar
Zhang, M., Li, W., Du, Q., Gao, L., Zhang, B.: Feature extraction for classification of hyperspectral and LiDAR data using patch-to-patch CNN. IEEE Trans. Cybern. (2018). Early Access
Google Scholar
Kong, Y., Wang, X., Cheng, Y.: Spectral–spatial feature extraction for HSI classification based on supervised hypergraph and sample expanded CNN. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 11, 4128–4140 (2018). Early Access
Article Google Scholar
Liu, B., Yu, X., Zhang, P., Yu, A., Fu, Q., Wei, X.: Supervised deep feature extraction for hyperspectral image classification. IEEE Trans. Geosci. Remote Sens. 56(4), 1909–1921 (2018)
Article Google Scholar
Ding, Y., Deng, R., Xie, X., Xu, X., Zhao, Y., Chen, X., Krylov, A.S.: No-reference stereoscopic image quality assessment using convolutional neural network for adaptive feature extraction. IEEE Access 6, 37595–37603 (2018)
Article Google Scholar
Wang, X., Chen, C., Cheng, Y., Wang, Z.J.: Zero-shot image classification based on deep feature extraction. IEEE Trans Cogn. Dev. Syst 10(2), 432–444 (2018)
Article Google Scholar
Lv, Y., Zhou, W., Tian, Q., Sun, S., Li, H.: Retrieval oriented deep feature learning with complementary supervision mining. IEEE Trans. Image Process. 27(10), 4945–4957 (2018)
Article MathSciNet Google Scholar
Wen, T., Zhang, Z.: Deep convolution neural network and autoencoders-based unsupervised feature learning of EEG signals. IEEE Access 6, 25399–25410 (2018)
Article Google Scholar
Ye, F., Su, Y., Xiao, H., Zhao, X., Min, W.: Remote sensing image registration using convolutional neural network features. IEEE Geosci. Remote Sens. Lett. 15(2), 232–236 (2018)
Article Google Scholar
Yang, Z., Dan, T., Yang, Y.: Multi-temporal remote sensing image registration using deep convolutional features. IEEE Access 6, 38544–38555 (2018)
Article Google Scholar
Nguyen, K., Fookes, C., Ross, A., Sridharan, S.: Iris recognition with off-the-shelf CNN features: a deep learning perspective. IEEE Access 6, 18848–18855 (2018)
Article Google Scholar
Claessens, B.J., Vrancx, P., Ruelens, F.: Convolutional neural networks for automatic state-time feature extraction in reinforcement learning applied to residential load control. IEEE Tran. Smart Grid 9(4), 3259–3269 (2018)
Article Google Scholar
Hao, W., Bie, R., Guo, J., Meng, X., Wang, S.: Optimized CNN based image recognition through target region selection. Optik-Int. J. Light Electron Opt. 156, 772–777 (2018)
Article Google Scholar
Yu, W., Sun, X., Yang, K., Rui, Y., Yao, H.: Hierarchical semantic image matching using CNN feature pyramid. Comput. Vis. Image Underst. 169, 40–51 (2018)
Article Google Scholar
Fei-Fei, L., Fergus, R., Perona, P.: Learning generative visual models from few training examples: an incremental Bayesian approach tested on 101 object categories. In: IEEE CVPR 2004, Workshop on Generative-Model Based Vision (2004)
Google Scholar
He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: LSVRC 2015 (2015)
Google Scholar
Ojala, T., Pietikäinen, M., Harwood, D.: Performance evaluation of texture measures with classification based on Kullback discrimination of distributions. In: Proceedings of the 12th IAPR International Conference on Pattern Recognition (ICPR 1994), vol. 1, pp. 582–585 (1994)
Google Scholar
Shi, Y.: Brain storm optimization algorithm. In: Advances in Swarm Intelligence, LNCS, vol. 6728, pp. 303–309 (2011)
Google Scholar

Download references

Author information

Authors and Affiliations

National Institute of Technology Puducherry, Karaikal, India
Chandrasekar Ravi

Authors

Chandrasekar Ravi
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Chandrasekar Ravi .

Editor information

Editors and Affiliations

Machine Intelligence Research Labs, Auburn, WA, USA
Ajith Abraham
School of Information Technology and Engineering, Vellore Institute of Technology, Vellore, Tamil Nadu, India
Aswani Kumar Cherukuri
Tijuana Institute of Technology, Tijuana, Mexico
Patricia Melin
Machine Intelligence Research Labs, Auburn, WA, USA
Niketa Gandhi

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Ravi, C. (2020). Image Classification Using Deep Learning and Fuzzy Systems. In: Abraham, A., Cherukuri, A., Melin, P., Gandhi, N. (eds) Intelligent Systems Design and Applications. ISDA 2018 2018. Advances in Intelligent Systems and Computing, vol 941. Springer, Cham. https://doi.org/10.1007/978-3-030-16660-1_50

Download citation

DOI: https://doi.org/10.1007/978-3-030-16660-1_50
Published: 14 April 2019
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-16659-5
Online ISBN: 978-3-030-16660-1
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)

Publish with us

Policies and ethics