Skip to main content

Part of the book series: Algorithms for Intelligent Systems ((AIS))

  • 536 Accesses

Abstract

Photographs are compact stores of special moments of our lives. Due to the convenience of photography tools such as mobiles and cameras, we can capture each and every moment of celebration. We often share those moments with our friends and colleagues and sometimes put them on our social media profiles. To make our social media posts attractive, we usually try to associate them with some inspirational quotes. Those quotes are either self-created or searched from the Internet. Searching and selecting the right quote from the Internet for our photograph is tedious as well as uninspiring at the same time. We have to first think about the theme which our image is depicting and then we have to search quotes related to that theme. After this, we have to filter out the most probable quotes for our image from about thousands of quotes. Our application aims to ease the process of searching and associating the right quote for the image. It accepts a picture as an input and after processing it, suggests relevant quotes for the image to the user. To achieve this functionality, we have used the computation power of Convolution neural networks, the concept of Long Short Term Memory and Similarity measures for suggesting the suitable caption for the image which are then further utilized to render quotes to the user. In this paper, we not only represent the method of conversion of image to quote but also a comparative study on performance measures (BLEU scores and validation loss in this work) of various pre-trained CNN models, and based on the comparative study, the best performing CNN model is chosen (MobileNetV2 in our study).

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 189.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 249.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 249.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Tanti M, Gatt A, Camilleri KP (2017) What is the role of recurrent neural networks (RNNs) in an image caption generator? arXiv:1708.02043

  2. Kiros R, Salakhutdinov R, Zemel RS (2014) Unifying visual-semantic embeddings with multimodal neural language models. arXiv:1411.2539

  3. Kiros R, Salakhutdinov R, Zemel R (2014) Multimodal neural language models. In: International conference on machine learning. PMLR, pp 595–603

    Google Scholar 

  4. Mitchell M, Dodge J, Goyal A, Yamaguchi K, Stratos K, Han X, Mensch A, Berg A, Berg T, Daumé H III (2012) Midge: generating image descriptions from computer vision detections. In: Proceedings of the 13th conference of the European chapter of the Association for Computational Linguistics, pp 747–756

    Google Scholar 

  5. Mnih V, Heess N, Graves A, Kavukcuoglu K (2014) Recurrent models of visual attention. arXiv:1406.6247

  6. Goel S, Madhok R, Garg S (2018) Proposing contextually relevant quotes for images. In: European conference on information retrieval. Springer, pp 591–597

    Google Scholar 

  7. Chaudhuri S, Ganti V, Kaushik R (2006) A primitive operator for similarity joins in data cleaning. In: 22nd international conference on data engineering (ICDE’06). IEEE, p 5

    Google Scholar 

  8. Ratcliff JW, Metzener DE (1988) Pattern-matching-the gestalt approach. Dr Dobbs J 13(7):46

    Google Scholar 

  9. Hodosh M, Young P, Hockenmaier J (2013) Framing image description as a ranking task: data, models and evaluation metrics. J Artif Intell Res 47:853–899

    Article  MathSciNet  Google Scholar 

  10. Famous quotes database. https://thewebminer.com/buy-famous-quotes-database. Accessed 15 Dec 2018

  11. Keras documentation. https://github.com/keras-team/keras. Accessed 10 Oct 2018

  12. NLTK documentation. https://www.nltk.org. Accessed 12 Dec 2018

  13. Google colab platform. https://colab.research.google.co. Accessed 15 Dec 2018

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Mukesh Kumar .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2022 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Sharma, A., Gupta, D., Kumar, M. (2022). Context-Based Quote Generation from Images. In: Mathur, G., Bundele, M., Lalwani, M., Paprzycki, M. (eds) Proceedings of 2nd International Conference on Artificial Intelligence: Advances and Applications. Algorithms for Intelligent Systems. Springer, Singapore. https://doi.org/10.1007/978-981-16-6332-1_66

Download citation

Publish with us

Policies and ethics