Convolutional Neural Networks Features: Principal Pyramidal Convolution
- Cite this paper as:
- Guo Y., Lao S., Liu Y., Bai L., Liu S., Lew M.S. (2015) Convolutional Neural Networks Features: Principal Pyramidal Convolution. In: Ho YS., Sang J., Ro Y., Kim J., Wu F. (eds) Advances in Multimedia Information Processing -- PCM 2015. Lecture Notes in Computer Science, vol 9314. Springer, Cham
The features extracted from convolutional neural networks (CNNs) are able to capture the discriminative part of an image and have shown superior performance in visual recognition. Furthermore, it has been verified that the CNN activations trained from large and diverse datasets can act as generic features and be transferred to other visual recognition tasks. In this paper, we aim to learn more from an image and present an effective method called Principal Pyramidal Convolution (PPC). The scheme first partitions the image into two levels, and extracts CNN activations for each sub-region along with the whole image, and then aggregates them together. The concatenated feature is later reduced to the standard dimension using Principal Component Analysis (PCA) algorithm, generating the refined CNN feature. When applied in image classification and retrieval tasks, the PPC feature consistently outperforms the conventional CNN feature, regardless of the network type where they derive from. Specifically, PPC achieves state-of-the-art result on the MIT Indoor67 dataset, utilizing the activations from Places-CNN.