Motivation and Overview

Ionescu, Radu Tudor; Popescu, Marius

doi:10.1007/978-3-319-30367-3_1

Radu Tudor Ionescu⁴ &
Marius Popescu⁴

Part of the book series: Advances in Computer Vision and Pattern Recognition ((ACVPR))

1676 Accesses

Abstract

This chapter gives a brief overview of machine learning and related fields of study. The concept of treating image and text in a similar fashion is then presented. A few successful examples of knowledge transfer between computer vision and text mining are also given. The chapter ends with a full overview of the organization of this book.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

eBook: USD 16.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Hardcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Agarwal S, Roth D (2002) Learning a sparse representation for object detection. In: Proceedings of ECCV, pp 113–127
Google Scholar
Agirre E, Edmonds PG (2006) Word Sense Disambiguation: Algorithms and applications. Springer
Google Scholar
Alexe B, Deselaers T, Ferrari V (2010) What is an object? In: Proceedings of CVPR, pp 73–80
Google Scholar
Alexe B, Deselaers T, Ferrari V (2012) Measuring the objectness of image windows. IEEE Trans Pattern Anal Mach Intell 34(11):2189–2202
Google Scholar
Barnard K, Duygulu P, Forsyth D, De Freitas N, Blei DM, Jordan MI (2003) Matching words and pictures. J Mach Learn Res 3:1107–1135
Google Scholar
Barnard K, Johnson M (2005) Word sense disambiguation with pictures. Artif Intell 167(1–2):13–30
Google Scholar
Bengio Y (2009) Learning deep architectures for AI. Found Trends Mach Learn 2(1):1–127
Google Scholar
Bishop CM (1995) Neural networks for pattern recognition. Oxford University Press Inc, New York, USA
Google Scholar
Breiman L (2001) Random forests. Mach Learn 45(1):5–32
Google Scholar
Caruana R, Niculescu-Mizil A (2006). An empirical comparison of supervised learning algorithms. In: Proceedings of ICML, pp 161–168
Google Scholar
Chen Y, Garcia EK, Gupta MR, Rahimi A, Cazzanti L (2009) Similarity-based classification: concepts and algorithms. J Mach Learn Res 10:747–776
Google Scholar
Chifu AG, Ionescu RT (2012) Word sense disambiguation to improve precision for ambiguous queries. Cent Eur J Comput Sci 2(4):398–411
Google Scholar
Cortes C, Vapnik V (1995) Support-vector networks. Mach Learn 20(3):273–297
Google Scholar
Dinu LP, Ionescu RT, Popescu M (2012) Local Patch Dissimilarity for images. In: Proceedings of ICONIP 7663:117–126
Google Scholar
Dinu LP, Ionescu RT (2013) Clustering based on median and closest string via rank distance with applications on DNA. Neural Comput Appl 24(1):77–84
Google Scholar
Dinu LP, Ionescu RT, Tomescu AI (2014) A rank-based sequence aligner with applications in phylogenetic analysis. PLoS ONE, 9(8):e104006. doi:10.1371/journal.pone.0104006
Google Scholar
Dinu LP, Manea F (2006) An efficient approach for the rank aggregation problem. Theoret Comput Sci 359(1–3):455–461
Google Scholar
Dinu LP, Popescu M (2009) Comparing statistical similarity measures for stylistic multivariate analysis. In: Proceedings of RANLP
Google Scholar
Duygulu P, Barnard K, De Freitas JFG, Forsyth DA (2002) Object recognition as machine translation: learning a lexicon for a fixed image vocabulary. In: Proceedings of ECCV 97–112
Google Scholar
Farhadi A, Hejrati M, Sadeghi MA, Young P, Rashtchian C, Hockenmaier J, Forsyth D (2010) Every picture tells a story: generating sentences from images. In: Proceedings of ECCV 15–29
Google Scholar
Fei-Fei L, Perona P (2005) A bayesian hierarchical model for learning natural scene categories. In: Proceedings of CVPR 2:524–531
Google Scholar
Forsyth DA, Ponce J (2002) Computer vision: a modern approach. Prentice Hall Professional Technical Reference
Google Scholar
Galleguillos C, Belongie S (2010) Context based object categorization: a critical survey. Comput Vis Image Underst 114:712–722
Google Scholar
Inza I, Calvo B, Armañanzas R, Bengoetxea E, Larrañaga P, Lozano JA (2010) Machine learning: an indispensable tool in bioinformatics. Meth Mol Biol (Clifton, N.J.) 593:25–48
Google Scholar
Ionescu RT, Chifu A-G, Mothe J (2015b) DeShaTo: describing the shape of cumulative topic distributions to rank retrieval systems without relevance judgments. In: Proceedings of SPIRE 9309:75–82
Google Scholar
Ionescu RT, Popescu AL, Popescu M, Popescu D (2015a) BiomassID: a biomass type identification system for mobile devices. Comput Electron Agric 113:244–253
Google Scholar
Ionescu RT, Popescu AL, Popescu D, Popescu M (2014a) Local Texton Dissimilarity with applications on biomass classification. In: Proceedings of VISAPP
Google Scholar
Ionescu RT, Popescu M (2013a) Speeding up Local Patch Dissimilarity. In: Proceedings of ICIAP 8156:1–10
Google Scholar
Ionescu RT, Popescu M (2013b) Kernels for visual words histograms. In: Proceedings of ICIAP 8156:81–90
Google Scholar
Ionescu RT, Popescu M (2015a) PQ kernel: a rank correlation kernel for visual word histograms. Pattern Recogn Lett 55:51–57
Google Scholar
Ionescu RT, Popescu M (2015b) Have a SNAK. Encoding spatial information with the Spatial Non-Alignment Kernel. In: Proceedings of ICIAP 9279:97–108
Google Scholar
Ionescu RT, Popescu M, Cahill A (2014b) Can characters reveal your native language? A language-independent approach to native language identification. In: Proceedings of EMNLP, pp 1363–1373
Google Scholar
Ionescu RT, Popescu M, Grozea C (2013) Local learning to improve bag of visual words model for facial expression recognition. In: Workshop on Challenges in Representation Learning, ICML, 2013
Google Scholar
Ionescu RT (2013) Local Rank Distance. In: Proceedings of SYNASC 221–228
Google Scholar
Johnson R, Zhang T (2015) Effective use of word order for text categorization with convolutional neural networks. In: Proceedings of NAACL, pp 103–112
Google Scholar
Jurafsky D, Martin JH (2000) Speech and language processing: an introduction to natural language processing, computational linguistics, and speech recognition, 1st edn. Prentice Hall PTR, Upper Saddle River, NJ, USA
Google Scholar
Krizhevsky A, Sutskever I, Hinton GE (2012) ImageNet classification with deep convolutional neural networks. In: Proceedings of NIPS 1106–1114
Google Scholar
Lazebnik S, Schmid C, Ponce J (2005) A sparse texture representation using local affine regions. IEEE Trans Pattern Anal Mach Intell 27(8):1265–1278
Google Scholar
Lazebnik S, Schmid C, Ponce J (2006) Beyond bags of features: spatial pyramid matching for recognizing natural scene categories. In: Proceedings of CVPR 2:2169–2178
Google Scholar
Lebret R, Legrand J, Collobert R (2013) Is deep learning really necessary for word embeddings? Deep learning workshop NIPS
Google Scholar
LeCun Y, Bengio Y, Hinton G (2015) Deep learning. Nature 521(7553):436–444
Google Scholar
Leslie CS, Eskin E, Noble WS (2002) The spectrum kernel: a string kernel for svm protein classification. In: Proceedings of Pacific Symposium on Biocomputing, pp 566–575
Google Scholar
Leung T, Malik J (2001) Representing and recognizing the visual appearance of materials using three-dimensional textons. Int J Comput Vis 43(1):29–44
Google Scholar
Lodhi H, Saunders C, Shawe-Taylor J, Cristianini N, Watkins CJCH (2002) Text classification using string kernels. J Mach Learn Res 2:419–444
Google Scholar
Lowe DG (1999) Object recognition from local scale-invariant features. In: Proceedings of ICCV 2:1150–1157
Google Scholar
Maji S, Berg AC, Malik J (2008) Classification using intersection kernel support vector machines is efficient. In: Proceedings of CVPR
Google Scholar
Manning CD, Raghavan P, Schütze H (2008) Introduction to information retrieval. Cambridge University Press, New York, USA
Google Scholar
Manning CD, Schütze H (1999) Foundations of statistical natural language processing. MIT Press, Cambridge, MA, USA
Google Scholar
Mikolov T, Sutskever I, Chen K, Corrado GS, Dean J (2013) Distributed representations of words and phrases and their compositionality. In: Proceedings of NIPS, pp 3111–3119
Google Scholar
Montavon G, Orr GB, Müller K-R (eds) (2012) Neural Networks: Tricks of the Trade. In: Lecture notes in computer science (LNCS), vol 7700, 2nd edn. Springer
Google Scholar
Philbin J, Chum O, Isard M, Sivic J, Zisserman A (2007) Object retrieval with large vocabularies and fast spatial matching. In: Proceedings of CVPR, pp 1–8
Google Scholar
Popescu M, Grozea C (2012) Kernel methods and string kernels for authorship analysis. CLEF (Online Working Notes/Labs/Workshop)
Google Scholar
Popescu M, Ionescu RT (2013) The story of the characters, the DNA and the native language. In: Proceedings of the Eighth Workshop on Innovative Use of NLP for Building Educational Applications, pp 270–278
Google Scholar
Rabinovich A, Vedaldi A, Galleguillos C, Wiewiora E, Belongie S (2007) Objects in context. In: Proceedings of ICCV
Google Scholar
Sadeghi MA, Farhadi A (2011) Recognition using visual phrases. In: Proceedings of CVPR, pp 1745–1752
Google Scholar
Sebastiani F (2002) Machine learning in automated text categorization. ACM Comput Surv 34(1):1–47
Google Scholar
Shawe-Taylor J, Cristianini N (2004) Kernel methods for pattern analysis. Cambridge University Press
Google Scholar
Simonyan K, Zisserman A (2014) Very deep convolutional networks for large-scale image recognition. CoRR, abs/1409.1556
Google Scholar
Sivic J, Russell BC, Efros AA, Zisserman A, Freeman WT (2005) Discovering objects and their localization in images. In: Proceedings of ICCV, pp 370–377
Google Scholar
Sutskever I, Vinyals O, Le QV (2014) Sequence to sequence learning with neural networks. In: Proceedings of NIPS, pp 3104–3112
Google Scholar
Tetreault J, Blanchard D, Cahill A, Chodorow M (2012) Native tongues, lost and found: resources and empirical evaluations in native language identification. In: Proceedings of COLING 2012:2585–2602
Google Scholar
Vedaldi A, Zisserman A (2010) Efficient additive kernels via explicit feature maps. In: Proceedings of CVPR, pp 3539–3546
Google Scholar
Zhang J, Marszalek M, Lazebnik S, Schmid C (2007) Local features and kernels for classification of texture and object categories: a comprehensive study. International J Comput Vis 73(2):213–238
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science, University of Bucharest, Bucharest, Romania
Radu Tudor Ionescu & Marius Popescu

Authors

Radu Tudor Ionescu
View author publications
You can also search for this author in PubMed Google Scholar
Marius Popescu
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Radu Tudor Ionescu .

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Ionescu, R.T., Popescu, M. (2016). Motivation and Overview. In: Knowledge Transfer between Computer Vision and Text Mining. Advances in Computer Vision and Pattern Recognition. Springer, Cham. https://doi.org/10.1007/978-3-319-30367-3_1

Download citation

DOI: https://doi.org/10.1007/978-3-319-30367-3_1
Published: 26 April 2016
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-30365-9
Online ISBN: 978-3-319-30367-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics