Skip to main content

Scene Classification Using Transfer Learning

  • Chapter
  • First Online:
Recent Advances in Computer Vision

Part of the book series: Studies in Computational Intelligence ((SCI,volume 804))

Abstract

Categorization of scene images is considered as a challenging prospect due to the fact that different classes of scene images often share similar image statistics. This chapter presents a transfer learning based approach for scene classification. A pre-trained Convolutional Neural Network (CNN) is used as a feature extractor for the images. The pre-trained network along with classifiers such as Support Vector Machines (SVM) or Multi Layer Perceptron (MLP) are used to classify the images. Also, the effect of single plane images such as, RGB2Gray, SVD Decolorized and Modified SVD decolorized images are analysed based on classification accuracy, class-wise precision, recall, F1-score and equal error rate (EER). The classification experiment for SVM was also done using a dimensionality reduction technique known as principal component analysis (PCA) on the feature vector. By comparing the results of models trained on RGB images with those grayscale images, the difference in the results is very small. These grayscale images were capable of retaining the required shape and texture information from the original RGB images and were also sufficient to categorize the classes of the given scene images.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Hardcover Book
USD 169.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Rasiwasia, N., Vasconcelos, N.: Scene classification with low-dimensional semantic spaces and weak supervision. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1–6 (2008)

    Google Scholar 

  2. Viitaniemi, V., Laaksonen, J.: Techniques for still image scene classification and object detection. In: International Conference on Artificial Neural Networks, pp. 35–44 (2006)

    Chapter  Google Scholar 

  3. Bengio, Y.: Learning deep architectures for AI. Found. Trends Mach. Learn. 2(1), 1–127 (2009)

    Article  MathSciNet  MATH  Google Scholar 

  4. LeCun, Y., Boser, B., Denker, J.S., Henderson, D., Howard, R.E., Hubbard, W., Jackel, L.D.: Backpropagation applied to handwritten zip code recognition. Neural Comput. 1(4), 541–551 (1989)

    Article  Google Scholar 

  5. Razavian, A.s., Hossein, A., Josephine, S., Stefan, C.: CNN features off-the-shelf: an astounding baseline for recognition. In: IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), pp. 512–519 (2014)

    Google Scholar 

  6. Oquab, M., Leon, B., Iva, L., Josef, S.: Learning and transferring mid-level image representations using convolutional neural networks. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1717–1724 (2014)

    Google Scholar 

  7. Jeff, D., Yangqing, J., Vinyals, O., Judy, H., Zhang, N., Eric, T., Trevor, D.: DeCAF: a deep convolutional activation feature for generic visual recognition. In: International Conference on Machine Learning, pp. 647–655 (2014)

    Google Scholar 

  8. Sachin, R., Sowmya, V., Govind, D., Soman, K.P.: Dependency of various color and intensity planes on CNN based image classification. In: International Symposium on Signal Processing and Intelligent Recognition Systems, pp. 167–177 (2017)

    Google Scholar 

  9. Aude, O., Torralba, A.: Modeling the shape of the scene: a holistic representation of the spatial envelope. Int. J. Comput. Vis. 42(3), 145–175 (2001)

    Article  MATH  Google Scholar 

  10. Liang, Z., Yali, Z., Shengjin, W., Jingdong, W., Tian, Q.: Good practice in CNN feature transfer (2016). arXiv:1604.00133

  11. Krizhevsky, A., Sutskever, I., Hinton, G.E.: Imagenet classification with deep convolutional neural networks. In: Advances in Neural Information Processing Systems, pp. 1097–1105 (2012)

    Google Scholar 

  12. Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition (2014). arXiv:1409.1556

  13. Jolliffe, I., Cadima, J.: Principal component analysis: a review and recent developments. Philos. Trans. Ser. A Math. Phys. Eng. Sci. 374(2065), 1–10 (2016)

    Article  MathSciNet  MATH  Google Scholar 

  14. Sowmya, V., Govind, D., Soman, K.: Significance of incorporating chrominance information for effective color-to-grayscale image conversion. Signal Image Video Process. 11(1), 129–136 (2017)

    Article  Google Scholar 

  15. Kede, M., Tiesong, Z., Kai, Z., Zhou, W.: Objective quality assessment for color-to-gray image conversion. IEEE Trans. Image Process. 24(12), 4673–4685 (2015)

    Article  MathSciNet  Google Scholar 

  16. Viswanathan, S., Divakaran, G., Soman, K.P.: Significance of perceptually relevant image decolorization for scene classification. J. Electron. Imaging 26(6), 063019 (2017)

    Article  Google Scholar 

  17. Zhou, B., Khosla, A., Lapedriza, A., Torralba, A., Oliva, A.: Places: an image database for deep scene understanding (2016). arXiv:1610.02055

  18. Kaiming, H., Xiangyu, Z., Shaoqing, R., Jian, S.: Spatial pyramid pooling in deep convolutional networks for visual recognition. IEEE Trans. Pattern Anal. Mach. Intell. 37(9), 1904–1916 (2015)

    Article  Google Scholar 

  19. Ross, G., Jeff, D., Trevor, D., Jitendra, M.: Rich feature hierarchies for accurate object detection and semantic segmentation. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 580–587 (2014)

    Google Scholar 

  20. Razavian, A.S., Sullivan, J., Carlsson, S., Maki, A.: Visual instance retrieval with deep convolutional networks. ITE Trans. Media Technol. Appl. 4(3), 251–258 (2016)

    Article  Google Scholar 

  21. Ali, S., Josephine, S., Stefan, C., Atsuto, M.: CNN features off-the-shelf: an astounding baseline for recognition. In: IEEE Conference on Computer Vision and Pattern Recognition Workshops, pp. 806–813 (2014)

    Google Scholar 

  22. Artem, B., Anton, S., Alexandr, C., Victor, L.: Neural codes for image retrieval. In: European Conference on Computer Vision, pp. 584–599. Springer (2014)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to V. Sowmya .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Switzerland AG

About this chapter

Check for updates. Verify currency and authenticity via CrossMark

Cite this chapter

Damodaran, N., Sowmya, V., Govind, D., Soman, K.P. (2019). Scene Classification Using Transfer Learning. In: Hassaballah, M., Hosny, K. (eds) Recent Advances in Computer Vision. Studies in Computational Intelligence, vol 804. Springer, Cham. https://doi.org/10.1007/978-3-030-03000-1_15

Download citation

Publish with us

Policies and ethics