Multimedia Interfaces for People Visually Impaired

  • Conference paper
Advances in Design for Inclusion

Part of the book series: Advances in Intelligent Systems and Computing (AISC, volume 500)

Abstract

A substantial number of people in our society are visually impaired. However, many social mechanisms are not designed with these people in mind, which makes electronic assistive tools essential for performing basic day-to-day activities. Owing to their widespread adoption and growing capabilities, mobile devices have become an ideal platform for solutions that aid the visually impaired. The objective of this research is to develop a multimedia user interface that aids visually impaired people. We propose and design a product recognition system that uses computer vision and machine learning techniques. Our system allows visually impaired individuals to identify products in grocery stores and supermarkets without additional assistance, encouraging them to carry out daily activities on their own and further promoting their independence within society. Our approach is composed of two main modules: one classifies grocery products using the unsupervised feature extraction methods provided by deep learning techniques, while the other recognizes products in an image using traditional handcrafted feature extraction algorithms. We considered multiple robust approaches to identify the one best suited to our task. Through evaluation, we determined that the best approach for classification is to fine-tune a convolutional neural network pre-trained on a larger dataset. With this approach we surpassed our baseline accuracy, obtaining an accuracy of 63%.
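
The idea of reusing a pre-trained network can be illustrated with a deliberately small sketch. It is not the authors' implementation and does not fine-tune a real convolutional network: it assumes a hypothetical `frozen_features` function standing in for the frozen layers of a pre-trained model, and retrains only a new softmax classification layer on top of those features, which is the simplest form of transfer learning. The toy "images", the two classes, and all function names are assumptions made for illustration.

```python
import math


def frozen_features(image):
    """Hypothetical stand-in for the frozen layers of a pre-trained network.

    An 'image' here is just a list of pixel intensities in [0, 1]. The
    returned vector holds the mean, the maximum, and a constant bias term.
    """
    mean = sum(image) / len(image)
    return [mean, max(image), 1.0]


def softmax(scores):
    """Convert raw class scores into probabilities (numerically stable)."""
    top = max(scores)
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def class_scores(weights, x):
    """One linear score per class: the dot product of its weights with x."""
    return [sum(w_j * x_j for w_j, x_j in zip(w, x)) for w in weights]


def train_head(features, labels, num_classes, epochs=1000, lr=0.5):
    """Train only the new classification layer with stochastic gradient descent.

    The feature extractor is never updated; this is the part that a full
    fine-tuning run would also adjust.
    """
    dim = len(features[0])
    weights = [[0.0] * dim for _ in range(num_classes)]
    for _ in range(epochs):
        for x, y in zip(features, labels):
            probs = softmax(class_scores(weights, x))
            for c in range(num_classes):
                # Gradient of the cross-entropy loss w.r.t. the class score.
                error = probs[c] - (1.0 if c == y else 0.0)
                for j in range(dim):
                    weights[c][j] -= lr * error * x[j]
    return weights


def predict(weights, x):
    """Return the index of the highest-scoring class."""
    scores = class_scores(weights, x)
    return max(range(len(scores)), key=lambda c: scores[c])


# Toy data (an assumption): class 0 = dark packaging, class 1 = bright packaging.
images = [
    [0.1, 0.2, 0.1, 0.3],
    [0.2, 0.2, 0.3, 0.1],
    [0.0, 0.1, 0.2, 0.1],
    [0.8, 0.9, 0.7, 0.9],
    [0.9, 0.8, 0.8, 0.7],
    [0.7, 0.9, 1.0, 0.8],
]
labels = [0, 0, 0, 1, 1, 1]

features = [frozen_features(img) for img in images]
weights = train_head(features, labels, num_classes=2)
predictions = [predict(weights, x) for x in features]
print(predictions)
```

In a realistic setting the feature extractor would be a convolutional network trained on a large image dataset, and fine-tuning would typically also update some or all of its layers with a small learning rate rather than keeping them fixed as in this sketch.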

Notes

  1. http://www.maltasupermarket.com.

  2. http://www.itemmaster.com/.

Author information

Corresponding author

Correspondence to Alexiei Dingli.

Copyright information

© 2016 Springer International Publishing Switzerland

About this paper

Cite this paper

Dingli, A., Mercieca, I. (2016). Multimedia Interfaces for People Visually Impaired. In: Di Bucchianico, G., Kercher, P. (eds) Advances in Design for Inclusion. Advances in Intelligent Systems and Computing, vol 500. Springer, Cham. https://doi.org/10.1007/978-3-319-41962-6_43

  • DOI: https://doi.org/10.1007/978-3-319-41962-6_43

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-41961-9

  • Online ISBN: 978-3-319-41962-6

  • eBook Packages: Engineering, Engineering (R0)
