Abstract
Annotating, retrieving and classifying images in an ever increasing large image datasets with semantic information is still an exigent task. Image pre-processing plays a vital role in partitioning an image into small meaningful regions and extracting features from these regions can be used to stimulate the learning models for better and accurate image classification. Deeplabv3+ framework based on dilated convolution is used to separate out the semantic regions and Histogram of Oriented Gradients (HOG) technique is applied to these regions and computed the distribution of gradients as features. A comparative analysis and examination of various Machine Learning classifiers using edge based, semantically segmented features are compared against the state-of-the-art deep learning techniques. The proposed deep learning based Inception-v3 architecture enables dynamic object recognition with factorized convolution and parameter reduction. The proposed model outperforms hand crafted ML classifiers, shows a significant performance improvement on much diverse dataset like Caltech-256 in comparison with Caltech 101.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Mahantesh K, Shubha Rao A (2019) Content based image retrieval - inspired by computer vision & deep learning techniques. In: 2019 4th international conference on electrical, electronics, communication, computer technologies and optimization techniques (ICEECCOT), pp 371–377. https://doi.org/10.1109/ICEECCOT46775.2019.9114610
Ray S (2019) A quick review of machine learning algorithms. In: 2019 international conference on machine learning, big data, cloud and parallel computing (COMITCon), pp 35–39. https://doi.org/10.1109/COMITCon.2019.8862451
Obulesu O, Mahendra M, ThrilokReddy M (2018) Machine learning techniques and tools: a survey. In: 2018 international conference on inventive research in computing applications (ICIRCA), pp 605–611. https://doi.org/10.1109/ICIRCA.2018.8597302
Shrestha A, Mahmood A (2019) Review of deep learning algorithms and architectures. IEEE Access 7:53040–53065. https://doi.org/10.1109/ACCESS.2019.2912200
Chowdhury S, Schoen MP (2020) Research paper classification using supervised machine learning techniques. In: 2020 intermountain engineering, technology and computing (IETC), pp 1–6. https://doi.org/10.1109/IETC47856.2020.9249211
Tamilselvi P, Kumar KA (2017) Unsupervised machine learning for clustering the infected leaves based on the leaf-colours. In: 2017 third international conference on science technology engineering & management (ICONSTEM), pp 106–110. https://doi.org/10.1109/ICONSTEM.2017.8261265
Le Nguyen MH, Gomes HM, Bifet A (2019) Semi-supervised learning over streaming data using MOA. In: 2019 IEEE international conference on big data (Big Data), pp 553–562. https://doi.org/10.1109/BigData47090.2019.9006217
Locatello F, Bauer S, Lucic M, Raetsch G, Gelly S, Schölkopf B, Bachem O (2019) Challenging common assumptions in the unsupervised learning of disentangled representations. In: Proceedings of the 36th international conference on machine learning. PMLR, vol 97, pp 4114–4124
Frankle J, Carbin M (2019) The lottery ticket hypothesis: finding sparse, trainable neural networks. In: International conference on learning representations
Metz L, Maheswaranathan N, Cheung B, Sohl-Dickstein J (2018) Meta-learning update rules for unsupervised representation learning. CoRR
Taher KA, Mohammed Yasin Jisan B, Rahman MM (2019) Network intrusion detection using supervised machine learning technique with feature selection. In: 2019 international conference on robotics, electrical and signal processing techniques (ICREST), pp 643–646. https://doi.org/10.1109/ICREST.2019.8644161
Liu L, Jiang H, He P, Chen W, Liu X, Gao J, Han J (2019) On the variance of the adaptive learning rate and beyond. CoRR
Korkmaz SA, Akçiçek A, Bínol H, Korkmaz MF (2017) Recognition of the stomach cancer images with probabilistic HOG feature vector histograms by using HOG features. In: 2017 IEEE 15th international symposium on intelligent systems and informatics (SISY), pp 000339–000342. https://doi.org/10.1109/SISY.2017.8080578
Weixing L, Haijun S, Feng P, Qi G, Bin Q (2015) A fast pedestrian detection via modified HOG feature. In: 2015 34th Chinese control conference (CCC), pp 3870–3873. https://doi.org/10.1109/ChiCC.2015.7260236
Nguyen N, Bui D, Tran X (2019) A novel hardware architecture for human detection using HOG-SVM co-optimization. In: 2019 IEEE Asia Pacific conference on circuits and systems (APCCAS), pp 33–36. https://doi.org/10.1109/APCCAS47518.2019.8953123
Park W, Kim D, Suryanto CL, Roh TM, Ko S (2012) Fast human detection using selective block-based HOG-LBP. In: 2012 19th IEEE international conference on image processing, pp 601–604. https://doi.org/10.1109/ICIP.2012.6466931
Shyla NSJ, Emmanuel WRS (2021) Automated classification of glaucoma using DWT and HOG features with extreme learning machine. In: Third international conference on intelligent communication technologies and virtual mobile networks (ICICV), pp 725–730. https://doi.org/10.1109/ICICV50876.2021.9388376
Wang X, Xu P, Yu Z, Li F (2020) Image object extraction based on semantic segmentation and label loss. IEEE Access 8:109325–109334. https://doi.org/10.1109/ACCESS.2020.2999942
Li X, Ma H, Luo X (2020) Weaklier supervised semantic segmentation with only one image level annotation per category. IEEE Trans Image Process 29:128–141. https://doi.org/10.1109/TIP.2019.2930874
Pan X, Li L, Yang D, He Y, Liu Z, Yang H (2019) An accurate nuclei segmentation algorithm in pathological image based on deep semantic network. IEEE Access 7:110674–110686. https://doi.org/10.1109/ACCESS.2019.2934486
Xie C et al (2021) Image style transfer algorithm based on semantic segmentation. IEEE Access 9:54518–54529. https://doi.org/10.1109/ACCESS.2021.3054969
Kim M, Park B, Chi S (2020) Accelerator-aware fast spatial feature network for real-time semantic segmentation. IEEE Access 8:226524–226537. https://doi.org/10.1109/ACCESS.2020.3045147
Zhang L, Zhou W, Li J, Li J, Lou X (2020) Histogram of oriented gradients feature extraction without normalization. In: IEEE Asia Pacific conference on circuits and systems (APCCAS), pp 252–255. https://doi.org/10.1109/APCCAS50809.2020.9301715
Libiao J, Wenchao Z, Changyu L, Zheng W (2021) Semantic segmentation based on DeeplabV3+ with multiple fusions of low-level features. In: IEEE 5th advanced information technology. Electronic and automation control conference (IAEAC), pp 1957–1963. https://doi.org/10.1109/IAEAC50856.2021.9390753
Rao AS, Mahantesh K (2021) Learning semantic features for classifying very large image datasets using convolution neural network. SN Comput. Sci. 2:187. https://doi.org/10.1007/s42979-021-00589-6
Bamne B, Shrivastava N, Parashar L, Singh U (2020) Transfer learning-based object detection by using convolutional neural networks. In: International conference on electronics and sustainable communication systems (ICESC), pp 328–332. https://doi.org/10.1109/ICESC48915.2020.9156060
Wu X, Liu R, Yang H, Chen Z (2020) An xception based convolutional neural network for scene image classification with transfer learning. In: 2020 2nd international conference on information technology and computer application (ITCA), pp 262–267. https://doi.org/10.1109/ITCA52113.2020.00063.
Berg TL, Berg AC, Malik J (2005) Shape matching and object recognition using low distortion correspondence. In: IEEE CVPR, vol 1, pp 26–33
Grauman K, Darell (2006) Pyramid match kernels: discriminative classification with sets of image features. Technical report MIT-CSAIL-TR-2006-020
Maire M, Malik J, Zhang H, Berg AC (2006) SVM-KNN: discriminative nearest neighbor classification for visual category recognition. In: IEEE-CVPR, vol 2, pp 2126–2136
Mc Cann S, Lowe DG (2012) Local naive bayes nearest neighbor for image classification. In: IEEE-CVPR, pp 3650–3656
Mutch J, Lowe DG (2006) Multiclass object recognition with sparse, localized features. In: IEEE CVPR, vol 1, pp 11–18
González G, Türetken E, Benmansour F, Rigamonti R, Lepetit V (2014) On the relevance of sparsity for image classification. Comput Vis Image Underst 125:115127
Mahantesh K, Aradhya VNM, Niranjan SK (2014) An impact of complex hybrid color space in image segmentation. In: Recent advances in intelligent informatics. Advances in intelligent systems and computing, vol 235. Springer, pp 73–83. https://doi.org/10.1007/978-3-319-01778-58
Liu Y-J-D, Wang Y-X (2012) Learning dictionary on manifolds for image classification. Pattern Recogn 46:1879–1890
Vieira AW, Campos MF, Oliveira GL, Nascimento ER (2012) Sparse spatial coding: a novel approach for efficient and accurate object recognition. In: IEEE international conference on robotics and automation (ICRA), pp 2592–2598
Zhang Y, Zheng Y, Liu B, Wang Y (2012) Discriminant sparse coding for image classification. In: Proceedings of the 37th international conference on acoustics, speech and signal processing, pp 2193–2196
Holub A, Griffin G, Perona P (2007) Caltech 256 object category dataset. Technical report, California Institute of Technology
Banerji CLS, Sinha A (2013) New image descriptors based on color, texture, shape, and wavelets for object and scene image classification. Neurocomputing 117:173–185
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Shubha Rao, A., Mahantesh, K. (2022). Image Classification Based on Inception-v3 and a Mixture of Handcrafted Features. In: Majhi, S., Prado, R.P.d., Dasanapura Nanjundaiah, C. (eds) Distributed Computing and Optimization Techniques. Lecture Notes in Electrical Engineering, vol 903. Springer, Singapore. https://doi.org/10.1007/978-981-19-2281-7_49
Download citation
DOI: https://doi.org/10.1007/978-981-19-2281-7_49
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-19-2280-0
Online ISBN: 978-981-19-2281-7
eBook Packages: Computer ScienceComputer Science (R0)