Abstract
Classification of images is a significant step in pattern recognition and digital image processing. It is applied in various domains for authentication, identification, defense, medical diagnosis and so on. Feature extraction is an important step in image processing which decides the quality of the model to be built for image classification. With the abundant increase in data now-a-days, the traditional feature extraction algorithms are finding difficulty in coping up with extracting quality features in finite time. Also the learning models developed from the extracted features are not so easily interpretable by the humans. So, considering the above mentioned arguments, a novel image classification framework has been proposed. The framework employs a pre-trained convolution neural network for feature extraction. Brain Storm Optimization algorithm is designed to learn the classification rules from the extracted features. Fuzzy rules based classifier is used for classification. The proposed framework is applied on Caltech 101 dataset and evaluated using accuracy of the classifier as the performance metric. The results demonstrate that the proposed framework outperforms the traditional feature extraction based classification techniques by achieving better accuracy of classification.
1 Introduction
The discipline of Artificial Intelligence that describes the manner in which knowledge from videos or images to computers is termed as Computer Vision. This inter disciplinary branch enables the machines to see the real world just as a human eye can do in co-ordination with the brain [1,2,3]. It extracts the features from the images, analyses the images using the extracted features and finally derives useful knowledge automatically. This is all totally done with the help of mathematical techniques in the background [4]. The source of information for this computer vision system in often images or videos from single or several cameras. The data could also be multi dimensional like the data from medical imaging devices [5]. These systems finds several applications in various domains like identification (fingerprint, iris, face, voice recognition and so on), manufacturing industry which can use it to detect faults, robotics for building robots for various purposes, surveillance, medical image processing (cancer, neurological disorder detection and so on), unmanned vehicles and so on. The objective of such systems would be to decide whether the target image belongs to a particular class or not. This decision is made by using several steps like image acquisition, preprocessing, feature extraction, segmentation, image recognition, image registration and decision making. In this paper, few of the aforementioned steps are used for image classification. The remaining sections of this paper are organized as follows. Section 2 discusses literature review. Section 3 proposes the novel framework. Section 4 discusses the findings and finally conclusion summarizes the entire paper.
2 Literature Review
Table 1, summarizes the literature based on image classification using convolution neural network for feature extraction. The review clearly highlights that the researchers are recently focusing on feature extraction using convolution neural network. The benefit of this approach is that huge volume and wide variety of images can be used to train the convolution neural network to extract the features. Thus the extracted features would be of good quality and thus result in a better classifier model. Also the enormous time and hardware required to learn from the huge volume of images can be drastically reduced by employing a pre-trained convolution neural network for feature extraction. Fuzzy logic based classification is becoming popular recently due to its capacity to handle uncertainty and produce highly interpretable knowledge.
3 Proposed Framework
The proposed framework is depicted in Fig. 1. The Caltech-101 dataset [20] contains images of 101 categories of objects. ResNet [21] is the Convolutional Neural Networks trained on ImageNet dataset, that has 1000 categories of object and 1.2 million images. When the Caltech-101 dataset is given as input to Resnet-50, the features are extracted. Then, Caltech-101 is partitioned into training and test datasets which comprises the features extracted using Resnet-50. The training dataset is given as input to the Brain Storm optimization [22] algorithm which derives the optimal rule base for the image classification. The test set is then classified using the Fuzzy Inference System and the optimal rule base.
The Brain Storm optimization [22] algorithm, described below, is customized according to the proposed framework, to produce optimal rule base. The brainstorm optimization algorithm is preferred than other algorithms like Genetic algorithm, Particle Swarm Optimization and so on. This is because brainstorm optimization algorithm is based on brain storming activity done by humans, whereas other algorithms are based on the social behavior of birds, ants, etc. in Particle Swarm Optimization, Ant Colony Optimization respectively. Since humans are considered to be the superior most in the ecology, the brainstorm optimization algorithm is assumed to produce the best results.
The individuals in the population of the brainstorm optimization algorithm are called as ideas. The ideas are represented as vectors for easy of computation. The idea vector consists of ‘m + 2’ elements, where ‘m’ represents the number of features extracted by Resnet-50, (m + 1)th element represents image class and the (m + 1)th element represents the AND or OR method used by the Fuzzy Inference System. Initial population consists of ‘n’ ideas.
The fitness function for the brainstorm optimization algorithm to generate optimal rule base is designed based on two factors, namely, length of the rule and adaptiveness of the rules to the training dataset. Adaptiveness of the rules is described based on how well the rules match the training dataset. Generally optimal rules are those which are having small length and more adaptive. Hence the fitness function is inversely proportional to rule length and directly proportional to adaptivity. The below Eq. (1) describes the proposed fitness function.
Where, ‘w’ is a constant deciding the weightage of the length of the rule and adaptiveness factor. Generally ‘w’ takes the value 0.5 indicating that both the factors are of equal weightage. ‘m’ represents the number of features extracted by Resnet-50, ‘l’ represents the length of the rule generated by brain storm optimization algorithm, ‘r’ represents the number of rules matching the training dataset and ‘p’ represents the total number of instances in the training dataset.
The ‘e’ and ‘k’ constants, which ranges between 0 to 1, are experimentally decided. The input probabilities P5a, P6b, P6c are randomly chosen between 0 to 1. Initially ‘n’ ideas are generated and clustered into groups. The cluster center is decided based on the fitness value of the ideas in the cluster. New ideas are generated and worst ideas are replaced with them. This is repeated until the maximum fitness value is achieved for the cluster centers. These cluster centers form the optimal rule base for the proposed framework.
4 Results and Discussions
The features are extracted from Caltech-101 dataset using a traditional Local Binary Pattern (LBP) [23] approach and classified using traditional classifiers like support vector machine, naïve bayes, decision tree and k-nearest neighbours. Then the features are extracted from Caltech-101 dataset using Resnet-50 and classified using Fuzzy Inference System.
The ‘e’ and ‘k’ values for the brain storm optimization algorithm are experimentally decided. Figure 2 depicts that the average accuracy of classification is constant for ‘e’ values in the range 0.3 to 0.7 and Fig. 3 depicts that the average accuracy of classification is constant for ‘k’ values in the range 10 to 30. Thus ‘e’ and ‘k’ are chosen in this range.
The accuracy of the classifier is defined as the ratio of correctly classified images to total images in the dataset. The k-folds average accuracy of the above combinations of feature extraction and classification in tabulated in Table 2. From the results, it is evident that the proposed framework outperforms the traditional feature extraction techniques based classification.
5 Conclusion
A novel image classification framework has been proposed. The framework employs a pre-trained convolution neural network for feature extraction. Brain Storm Optimization algorithm is designed to learn the classification rules from the extracted features. Fuzzy rules based classifier is used for classification. The proposed framework is applied on Caltech 101 dataset and evaluated using accuracy of the classifier as the performance metric. The results demonstrate that the proposed framework outperforms the traditional feature extraction based classification techniques by achieving better accuracy of classification.
References
Ballard, D.H., Brown, C.M.: Computer Vision. Prentice Hall, Upper Saddle River (1982)
Huang, T., Vandoni, C.: Computer Vision: Evolution and Promise. 19th CERN School of Computing, pp. 21–25. CERN, Geneva (1996)
Sonka, M., Hlavac, V., Boyle, R.: Image Processing, Analysis, and Machine Vision. Thomson, Pacific Grove (2008)
Murphy, M.: Star Trek’s tricorder medical scanner just got closer to becoming a reality
Yang, X., Gao, X., Song, B., Yang, D.: Aurora image search with contextual CNN feature. Neurocomputing 281, 67–77 (2018)
Zhang, M., Li, W., Du, Q., Gao, L., Zhang, B.: Feature extraction for classification of hyperspectral and LiDAR data using patch-to-patch CNN. IEEE Trans. Cybern. (2018). Early Access
Kong, Y., Wang, X., Cheng, Y.: Spectral–spatial feature extraction for HSI classification based on supervised hypergraph and sample expanded CNN. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 11, 4128–4140 (2018). Early Access
Liu, B., Yu, X., Zhang, P., Yu, A., Fu, Q., Wei, X.: Supervised deep feature extraction for hyperspectral image classification. IEEE Trans. Geosci. Remote Sens. 56(4), 1909–1921 (2018)
Ding, Y., Deng, R., Xie, X., Xu, X., Zhao, Y., Chen, X., Krylov, A.S.: No-reference stereoscopic image quality assessment using convolutional neural network for adaptive feature extraction. IEEE Access 6, 37595–37603 (2018)
Wang, X., Chen, C., Cheng, Y., Wang, Z.J.: Zero-shot image classification based on deep feature extraction. IEEE Trans Cogn. Dev. Syst 10(2), 432–444 (2018)
Lv, Y., Zhou, W., Tian, Q., Sun, S., Li, H.: Retrieval oriented deep feature learning with complementary supervision mining. IEEE Trans. Image Process. 27(10), 4945–4957 (2018)
Wen, T., Zhang, Z.: Deep convolution neural network and autoencoders-based unsupervised feature learning of EEG signals. IEEE Access 6, 25399–25410 (2018)
Ye, F., Su, Y., Xiao, H., Zhao, X., Min, W.: Remote sensing image registration using convolutional neural network features. IEEE Geosci. Remote Sens. Lett. 15(2), 232–236 (2018)
Yang, Z., Dan, T., Yang, Y.: Multi-temporal remote sensing image registration using deep convolutional features. IEEE Access 6, 38544–38555 (2018)
Nguyen, K., Fookes, C., Ross, A., Sridharan, S.: Iris recognition with off-the-shelf CNN features: a deep learning perspective. IEEE Access 6, 18848–18855 (2018)
Claessens, B.J., Vrancx, P., Ruelens, F.: Convolutional neural networks for automatic state-time feature extraction in reinforcement learning applied to residential load control. IEEE Tran. Smart Grid 9(4), 3259–3269 (2018)
Hao, W., Bie, R., Guo, J., Meng, X., Wang, S.: Optimized CNN based image recognition through target region selection. Optik-Int. J. Light Electron Opt. 156, 772–777 (2018)
Yu, W., Sun, X., Yang, K., Rui, Y., Yao, H.: Hierarchical semantic image matching using CNN feature pyramid. Comput. Vis. Image Underst. 169, 40–51 (2018)
Fei-Fei, L., Fergus, R., Perona, P.: Learning generative visual models from few training examples: an incremental Bayesian approach tested on 101 object categories. In: IEEE CVPR 2004, Workshop on Generative-Model Based Vision (2004)
He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: LSVRC 2015 (2015)
Ojala, T., Pietikäinen, M., Harwood, D.: Performance evaluation of texture measures with classification based on Kullback discrimination of distributions. In: Proceedings of the 12th IAPR International Conference on Pattern Recognition (ICPR 1994), vol. 1, pp. 582–585 (1994)
Shi, Y.: Brain storm optimization algorithm. In: Advances in Swarm Intelligence, LNCS, vol. 6728, pp. 303–309 (2011)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
Ravi, C. (2020). Image Classification Using Deep Learning and Fuzzy Systems. In: Abraham, A., Cherukuri, A., Melin, P., Gandhi, N. (eds) Intelligent Systems Design and Applications. ISDA 2018 2018. Advances in Intelligent Systems and Computing, vol 941. Springer, Cham. https://doi.org/10.1007/978-3-030-16660-1_50
Download citation
DOI: https://doi.org/10.1007/978-3-030-16660-1_50
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-16659-5
Online ISBN: 978-3-030-16660-1
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)