Abstract
This chapter gives a brief overview of machine learning and related fields of study. It then presents the idea of treating images and text in a similar fashion, and gives a few successful examples of knowledge transfer between computer vision and text mining. The chapter ends with a full overview of the organization of this book.
Copyright information
© 2016 Springer International Publishing Switzerland
About this chapter
Cite this chapter
Ionescu, R.T., Popescu, M. (2016). Motivation and Overview. In: Knowledge Transfer between Computer Vision and Text Mining. Advances in Computer Vision and Pattern Recognition. Springer, Cham. https://doi.org/10.1007/978-3-319-30367-3_1
DOI: https://doi.org/10.1007/978-3-319-30367-3_1
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-30365-9
Online ISBN: 978-3-319-30367-3
eBook Packages: Computer Science (R0)