Abstract
This paper presents a method for localizing wooden knots in images of oak boards using deep convolutional networks (ConvNets). In particular, we show that transfer learning from generic images works effectively when only a limited amount of data is available for training a classifier in this highly specialized problem domain. We compare our method with a previous, commercially developed technique based on a kernel SVM with local feature descriptors. Our method improves detection performance significantly, achieving an \(F_1\) score of \(0.750 \pm 0.018\) versus 0.695 for the previous technique. Furthermore, we report some observations on the behavior of the KL divergence on the test set, which is counter-intuitive in relation to classification accuracy.
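As a rough illustration of how a divergence-based loss and accuracy-style metrics can disagree, the sketch below compares two hypothetical classifiers on five made-up patches. It is not the paper's experiment: the labels, the probabilities, the 0.5 decision threshold, and all function names are assumptions chosen for the example. With one-hot labels, the KL divergence from label to prediction reduces to the negative log of the probability assigned to the true class, so a single confident mistake can dominate the mean.

```python
import math

def predicted_labels(p_knot, threshold=0.5):
    """Turn predicted knot probabilities into hard 0/1 decisions."""
    return [1 if p >= threshold else 0 for p in p_knot]

def accuracy(p_knot, labels):
    preds = predicted_labels(p_knot)
    return sum(int(p == y) for p, y in zip(preds, labels)) / len(labels)

def f1_score(p_knot, labels):
    """F1 for the positive (knot) class: 2*TP / (2*TP + FP + FN)."""
    preds = predicted_labels(p_knot)
    tp = sum(1 for p, y in zip(preds, labels) if p == 1 and y == 1)
    fp = sum(1 for p, y in zip(preds, labels) if p == 1 and y == 0)
    fn = sum(1 for p, y in zip(preds, labels) if p == 0 and y == 1)
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0

def mean_kl_to_labels(p_knot, labels):
    """Mean KL(one-hot label || prediction) = mean of -log p(true class)."""
    total = 0.0
    for p, y in zip(p_knot, labels):
        p_true = p if y == 1 else 1.0 - p
        total += -math.log(p_true)
    return total / len(labels)

# Hypothetical data: 1 = knot, 0 = clear wood (invented for illustration).
labels = [1, 1, 0, 0, 1]
cautious = [0.60, 0.60, 0.40, 0.55, 0.45]   # two narrow mistakes
confident = [0.99, 0.99, 0.01, 0.01, 0.01]  # one very confident mistake

for name, probs in [("cautious", cautious), ("confident", confident)]:
    print(name,
          "accuracy=%.2f" % accuracy(probs, labels),
          "F1=%.3f" % f1_score(probs, labels),
          "mean KL=%.3f" % mean_kl_to_labels(probs, labels))
```

In this toy setting the confident classifier has both higher accuracy (0.8 against 0.6) and higher mean KL divergence (roughly 0.93 against 0.63), which shows that the two quantities need not move together.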
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Norlander, R., Grahn, J., Maki, A. (2015). Wooden Knot Detection Using ConvNet Transfer Learning. In: Paulsen, R., Pedersen, K. (eds) Image Analysis. SCIA 2015. Lecture Notes in Computer Science, vol 9127. Springer, Cham. https://doi.org/10.1007/978-3-319-19665-7_22
DOI: https://doi.org/10.1007/978-3-319-19665-7_22
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-19664-0
Online ISBN: 978-3-319-19665-7
eBook Packages: Computer Science (R0)