Skip to main content

Part of the book series: Advances in Computer Vision and Pattern Recognition ((ACVPR))

  • 1676 Accesses

Abstract

This chapter gives a brief overview of machine learning and related fields of study. The concept of treating image and text in a similar fashion is then presented. A few successful examples of knowledge transfer between computer vision and text mining are also given. The chapter ends with a full overview of the organization of this book.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

eBook
USD 16.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 109.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  • Agarwal S, Roth D (2002) Learning a sparse representation for object detection. In: Proceedings of ECCV, pp 113–127

    Google Scholar 

  • Agirre E, Edmonds PG (2006) Word Sense Disambiguation: Algorithms and applications. Springer

    Google Scholar 

  • Alexe B, Deselaers T, Ferrari V (2010) What is an object? In: Proceedings of CVPR, pp 73–80

    Google Scholar 

  • Alexe B, Deselaers T, Ferrari V (2012) Measuring the objectness of image windows. IEEE Trans Pattern Anal Mach Intell 34(11):2189–2202

    Google Scholar 

  • Barnard K, Duygulu P, Forsyth D, De Freitas N, Blei DM, Jordan MI (2003) Matching words and pictures. J Mach Learn Res 3:1107–1135

    Google Scholar 

  • Barnard K, Johnson M (2005) Word sense disambiguation with pictures. Artif Intell 167(1–2):13–30

    Google Scholar 

  • Bengio Y (2009) Learning deep architectures for AI. Found Trends Mach Learn 2(1):1–127

    Google Scholar 

  • Bishop CM (1995) Neural networks for pattern recognition. Oxford University Press Inc, New York, USA

    Google Scholar 

  • Breiman L (2001) Random forests. Mach Learn 45(1):5–32

    Google Scholar 

  • Caruana R, Niculescu-Mizil A (2006). An empirical comparison of supervised learning algorithms. In: Proceedings of ICML, pp 161–168

    Google Scholar 

  • Chen Y, Garcia EK, Gupta MR, Rahimi A, Cazzanti L (2009) Similarity-based classification: concepts and algorithms. J Mach Learn Res 10:747–776

    Google Scholar 

  • Chifu AG, Ionescu RT (2012) Word sense disambiguation to improve precision for ambiguous queries. Cent Eur J Comput Sci 2(4):398–411

    Google Scholar 

  • Cortes C, Vapnik V (1995) Support-vector networks. Mach Learn 20(3):273–297

    Google Scholar 

  • Dinu LP, Ionescu RT, Popescu M (2012) Local Patch Dissimilarity for images. In: Proceedings of ICONIP 7663:117–126

    Google Scholar 

  • Dinu LP, Ionescu RT (2013) Clustering based on median and closest string via rank distance with applications on DNA. Neural Comput Appl 24(1):77–84

    Google Scholar 

  • Dinu LP, Ionescu RT, Tomescu AI (2014) A rank-based sequence aligner with applications in phylogenetic analysis. PLoS ONE, 9(8):e104006. doi:10.1371/journal.pone.0104006

    Google Scholar 

  • Dinu LP, Manea F (2006) An efficient approach for the rank aggregation problem. Theoret Comput Sci 359(1–3):455–461

    Google Scholar 

  • Dinu LP, Popescu M (2009) Comparing statistical similarity measures for stylistic multivariate analysis. In: Proceedings of RANLP

    Google Scholar 

  • Duygulu P, Barnard K, De Freitas JFG, Forsyth DA (2002) Object recognition as machine translation: learning a lexicon for a fixed image vocabulary. In: Proceedings of ECCV 97–112

    Google Scholar 

  • Farhadi A, Hejrati M, Sadeghi MA, Young P, Rashtchian C, Hockenmaier J, Forsyth D (2010) Every picture tells a story: generating sentences from images. In: Proceedings of ECCV 15–29

    Google Scholar 

  • Fei-Fei L, Perona P (2005) A bayesian hierarchical model for learning natural scene categories. In: Proceedings of CVPR 2:524–531

    Google Scholar 

  • Forsyth DA, Ponce J (2002) Computer vision: a modern approach. Prentice Hall Professional Technical Reference

    Google Scholar 

  • Galleguillos C, Belongie S (2010) Context based object categorization: a critical survey. Comput Vis Image Underst 114:712–722

    Google Scholar 

  • Inza I, Calvo B, Armañanzas R, Bengoetxea E, Larrañaga P, Lozano JA (2010) Machine learning: an indispensable tool in bioinformatics. Meth Mol Biol (Clifton, N.J.) 593:25–48

    Google Scholar 

  • Ionescu RT, Chifu A-G, Mothe J (2015b) DeShaTo: describing the shape of cumulative topic distributions to rank retrieval systems without relevance judgments. In: Proceedings of SPIRE 9309:75–82

    Google Scholar 

  • Ionescu RT, Popescu AL, Popescu M, Popescu D (2015a) BiomassID: a biomass type identification system for mobile devices. Comput Electron Agric 113:244–253

    Google Scholar 

  • Ionescu RT, Popescu AL, Popescu D, Popescu M (2014a) Local Texton Dissimilarity with applications on biomass classification. In: Proceedings of VISAPP

    Google Scholar 

  • Ionescu RT, Popescu M (2013a) Speeding up Local Patch Dissimilarity. In: Proceedings of ICIAP 8156:1–10

    Google Scholar 

  • Ionescu RT, Popescu M (2013b) Kernels for visual words histograms. In: Proceedings of ICIAP 8156:81–90

    Google Scholar 

  • Ionescu RT, Popescu M (2015a) PQ kernel: a rank correlation kernel for visual word histograms. Pattern Recogn Lett 55:51–57

    Google Scholar 

  • Ionescu RT, Popescu M (2015b) Have a SNAK. Encoding spatial information with the Spatial Non-Alignment Kernel. In: Proceedings of ICIAP 9279:97–108

    Google Scholar 

  • Ionescu RT, Popescu M, Cahill A (2014b) Can characters reveal your native language? A language-independent approach to native language identification. In: Proceedings of EMNLP, pp 1363–1373

    Google Scholar 

  • Ionescu RT, Popescu M, Grozea C (2013) Local learning to improve bag of visual words model for facial expression recognition. In: Workshop on Challenges in Representation Learning, ICML, 2013

    Google Scholar 

  • Ionescu RT (2013) Local Rank Distance. In: Proceedings of SYNASC 221–228

    Google Scholar 

  • Johnson R, Zhang T (2015) Effective use of word order for text categorization with convolutional neural networks. In: Proceedings of NAACL, pp 103–112

    Google Scholar 

  • Jurafsky D, Martin JH (2000) Speech and language processing: an introduction to natural language processing, computational linguistics, and speech recognition, 1st edn. Prentice Hall PTR, Upper Saddle River, NJ, USA

    Google Scholar 

  • Krizhevsky A, Sutskever I, Hinton GE (2012) ImageNet classification with deep convolutional neural networks. In: Proceedings of NIPS 1106–1114

    Google Scholar 

  • Lazebnik S, Schmid C, Ponce J (2005) A sparse texture representation using local affine regions. IEEE Trans Pattern Anal Mach Intell 27(8):1265–1278

    Google Scholar 

  • Lazebnik S, Schmid C, Ponce J (2006) Beyond bags of features: spatial pyramid matching for recognizing natural scene categories. In: Proceedings of CVPR 2:2169–2178

    Google Scholar 

  • Lebret R, Legrand J, Collobert R (2013) Is deep learning really necessary for word embeddings? Deep learning workshop NIPS

    Google Scholar 

  • LeCun Y, Bengio Y, Hinton G (2015) Deep learning. Nature 521(7553):436–444

    Google Scholar 

  • Leslie CS, Eskin E, Noble WS (2002) The spectrum kernel: a string kernel for svm protein classification. In: Proceedings of Pacific Symposium on Biocomputing, pp 566–575

    Google Scholar 

  • Leung T, Malik J (2001) Representing and recognizing the visual appearance of materials using three-dimensional textons. Int J Comput Vis 43(1):29–44

    Google Scholar 

  • Lodhi H, Saunders C, Shawe-Taylor J, Cristianini N, Watkins CJCH (2002) Text classification using string kernels. J Mach Learn Res 2:419–444

    Google Scholar 

  • Lowe DG (1999) Object recognition from local scale-invariant features. In: Proceedings of ICCV 2:1150–1157

    Google Scholar 

  • Maji S, Berg AC, Malik J (2008) Classification using intersection kernel support vector machines is efficient. In: Proceedings of CVPR

    Google Scholar 

  • Manning CD, Raghavan P, Schütze H (2008) Introduction to information retrieval. Cambridge University Press, New York, USA

    Google Scholar 

  • Manning CD, Schütze H (1999) Foundations of statistical natural language processing. MIT Press, Cambridge, MA, USA

    Google Scholar 

  • Mikolov T, Sutskever I, Chen K, Corrado GS, Dean J (2013) Distributed representations of words and phrases and their compositionality. In: Proceedings of NIPS, pp 3111–3119

    Google Scholar 

  • Montavon G, Orr GB, Müller K-R (eds) (2012) Neural Networks: Tricks of the Trade. In: Lecture notes in computer science (LNCS), vol 7700, 2nd edn. Springer

    Google Scholar 

  • Philbin J, Chum O, Isard M, Sivic J, Zisserman A (2007) Object retrieval with large vocabularies and fast spatial matching. In: Proceedings of CVPR, pp 1–8

    Google Scholar 

  • Popescu M, Grozea C (2012) Kernel methods and string kernels for authorship analysis. CLEF (Online Working Notes/Labs/Workshop)

    Google Scholar 

  • Popescu M, Ionescu RT (2013) The story of the characters, the DNA and the native language. In: Proceedings of the Eighth Workshop on Innovative Use of NLP for Building Educational Applications, pp 270–278

    Google Scholar 

  • Rabinovich A, Vedaldi A, Galleguillos C, Wiewiora E, Belongie S (2007) Objects in context. In: Proceedings of ICCV

    Google Scholar 

  • Sadeghi MA, Farhadi A (2011) Recognition using visual phrases. In: Proceedings of CVPR, pp 1745–1752

    Google Scholar 

  • Sebastiani F (2002) Machine learning in automated text categorization. ACM Comput Surv 34(1):1–47

    Google Scholar 

  • Shawe-Taylor J, Cristianini N (2004) Kernel methods for pattern analysis. Cambridge University Press

    Google Scholar 

  • Simonyan K, Zisserman A (2014) Very deep convolutional networks for large-scale image recognition. CoRR, abs/1409.1556

    Google Scholar 

  • Sivic J, Russell BC, Efros AA, Zisserman A, Freeman WT (2005) Discovering objects and their localization in images. In: Proceedings of ICCV, pp 370–377

    Google Scholar 

  • Sutskever I, Vinyals O, Le QV (2014) Sequence to sequence learning with neural networks. In: Proceedings of NIPS, pp 3104–3112

    Google Scholar 

  • Tetreault J, Blanchard D, Cahill A, Chodorow M (2012) Native tongues, lost and found: resources and empirical evaluations in native language identification. In: Proceedings of COLING 2012:2585–2602

    Google Scholar 

  • Vedaldi A, Zisserman A (2010) Efficient additive kernels via explicit feature maps. In: Proceedings of CVPR, pp 3539–3546

    Google Scholar 

  • Zhang J, Marszalek M, Lazebnik S, Schmid C (2007) Local features and kernels for classification of texture and object categories: a comprehensive study. International J Comput Vis 73(2):213–238

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Radu Tudor Ionescu .

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer International Publishing Switzerland

About this chapter

Cite this chapter

Ionescu, R.T., Popescu, M. (2016). Motivation and Overview. In: Knowledge Transfer between Computer Vision and Text Mining. Advances in Computer Vision and Pattern Recognition. Springer, Cham. https://doi.org/10.1007/978-3-319-30367-3_1

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-30367-3_1

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-30365-9

  • Online ISBN: 978-3-319-30367-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics