Skip to main content

MirBot: A Multimodal Interactive Image Retrieval System

  • Conference paper
Pattern Recognition and Image Analysis (IbPRIA 2013)

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 7887))

Included in the following conference series:

Abstract

This study presents a multimodal interactive image retrieval system for smartphones (MirBot). The application is designed as a collaborative game where users can categorize photographs according to the WordNet hierarchy. After taking a picture, the region of interest of the target can be selected, and the image information is sent with a set of metadata to a server in order to classify the object. The user can validate the category proposed by the system to improve future queries. The result is a labeled database with a structure similar to ImageNet, but with contents selected by the users, fully marked with regions of interest, and with novel metadata that can be useful to constrain the search space in a future work. The MirBot app is freely available on the Apple app store.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Lew, M.S., Sebe, N., Djeraba, C., Jain, R.: Content-based Multimedia Information Retrieval: State of the Art and Challenges. ACM Trans. on Multimedia Computing, Communications, and Applications 2(1), 1–19 (2006)

    Article  Google Scholar 

  2. Smeulders, A., Worring, M., Santini, S., Gupta, A., Jain, R.: Content-based image retrieval at the end of the early years. IEEE Trans. on Pattern Analysis and Machine Intelligence 22(12), 1349–1380 (2000)

    Article  Google Scholar 

  3. Datta, R., Joshi, D., Li, J., Wang, J.Z.: Image retrieval: Ideas, influences, and trends of the new age. ACM Computing Surveys 40(2), 1–60 (2008)

    Article  Google Scholar 

  4. Jegou, H., Douze, M., Schmid, C.: Recent Advances in Large Scale Image Search. In: Nielsen, F. (ed.) ETVC 2008. LNCS, vol. 5416, pp. 305–326. Springer, Heidelberg (2009)

    Chapter  Google Scholar 

  5. Deng, J., Dong, W., Socher, R., Li, L.-J., Li, K., Fei-Fei, L.: ImageNet: A Large-Scale Hierarchical Image Database. In: IEEE CVPR, pp. 248–255 (2009)

    Google Scholar 

  6. Torralba, A., Fergus, R., Freeman, W.T.: 80 Million Tiny Images: A Large Data Set for Non-parametric Object and Scene Recognition. IEEE Trans. on Pattern Analysis and Machine Intelligence 30(11), 1958–1970 (2008)

    Article  Google Scholar 

  7. Dinakaran, B., Annapurna, J., Kumar, C.A.: Interactive image retrieval using text and image content. Cybernetics and Information Technologies 10(3), 20–30 (2010)

    Google Scholar 

  8. Boutell, M., Luo, J.: Beyond pixels: Exploiting camera metadata for photo classification. Pattern Recognition 38(6), 935–946 (2005)

    Article  Google Scholar 

  9. Barrington, L.L., Turnbull, D.D., Lanckriet, G.G.: Game-powered machine learning. Proc. National Academy of Science (PNAS) 109(17), 6411–6416 (2012)

    Article  Google Scholar 

  10. Von Ahn, L., Dabbish, L.: Labeling images with a computer game. In: CHI 2004: Proceedings of the SIGCHI Conference on Human Factors in Computing Systems, pp. 319–326. ACM Press, NY (2004)

    Chapter  Google Scholar 

  11. Fellbaum, C.: WordNet: An Electronic Lexical Database. MIT Press (1998)

    Google Scholar 

  12. Lowe, D.: Distinctive image features from scale-invariant keypoints. IJCV 60(2), 91–110 (2004)

    Article  Google Scholar 

  13. Bay, H., Ess, A., Tuytelaars, T., Van Gool, L.: Speeded-Up Robust Features (SURF). Computer Vision and Image Understanding 110(3), 346–359 (2008)

    Article  Google Scholar 

  14. Mikolajczyk, K., Schmid, C.: Scale and affine invariant interest point detectors. IJCV 60(1), 63–86 (2004)

    Article  Google Scholar 

  15. Sivic, J., Zisserman, A.: Video Google: A text retrieval approach to object matching in videos. In: ICCV, pp. 1470–1477 (2003)

    Google Scholar 

  16. Thomee, B., Bakker, E.M., Lew, M.S.: TOP-SURF: a visual words toolkit. In: Proc. of the 18th ACM Int. Conf. on Multimedia, Firenze, Italy, pp. 1473–1476 (2010)

    Google Scholar 

  17. Philbin, J., Chum, O., Isard, M., Sivic, J., Zisserman, A.: Object retrieval with large vocabularies and fast spatial matching. In: Proc. of IEEE Conference on Computer Vision and Pattern Recognition (2007)

    Google Scholar 

  18. Salton, G., McGill, M.: Introduction to modern information retrieval. McGraw-Hill (1983)

    Google Scholar 

  19. Jeong, S.: Histogram-Based Color Image Retrieval, Stanford University (2001)

    Google Scholar 

  20. Exchangeable image file format for digital still cameras: Exif Version 2.3. CIPA, http://www.cipa.jp/english/hyoujunka/kikaku/pdf/DC-008-2010_E.pdf

  21. Lin, J.: Divergence measures based on the Shannon entropy. IEEE Trans. on Information Theory 37(1), 145–150 (1991)

    Article  MATH  Google Scholar 

  22. MirBot, http://www.mirbot.com

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2013 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Pertusa, A., Gallego, AJ., Bernabeu, M. (2013). MirBot: A Multimodal Interactive Image Retrieval System. In: Sanches, J.M., Micó, L., Cardoso, J.S. (eds) Pattern Recognition and Image Analysis. IbPRIA 2013. Lecture Notes in Computer Science, vol 7887. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-38628-2_23

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-38628-2_23

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-38627-5

  • Online ISBN: 978-3-642-38628-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics