Abstract
In our society, there is a substantial number of visually impaired individuals. However many social mechanisms are not designed with these people in mind thus making the development of electronic assistive tools essential in order to perform basic day-to-day activities. Due to the penetration of capabilities of mobile devices, such devices have become an ideal candidate for designing solutions to aid the visually impaired. The objective of this research is to develop a multimedia user interface whose scope is to aid the visually challenged. We propose and design a product recognition system utilizing computer vision and machine learning techniques. Our system allows visually impaired individuals to identify products in grocery stores and supermarkets without any additional assistance, thus encouraging them to perform daily activities without requiring any additional help thus further promoting their independence within society. Our approach is composed of two main modules one capable of classifying grocery products using an unsupervised feature extraction methods posed by deep learning techniques while the other module is capable of recognizing products in an image using the traditionally handcrafted feature extraction algorithms. We considered multiple robust approaches to identify the one most suited for our task. Through evaluation we determined that the best approach for classification is to fine-tune a convolutional neural network pre-trained on a larger dataset. We were successful in not only surpassing our base accuracy but also obtaining an accuracy of 63 %.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
George, M., Floerkemeier, C.: Recognizing products: a per-exemplar multi-label image classification approach. In: Computer Vision. Springer, Berlin (2014)
Rivera-Rubio, J., Idrees, S., Alexiou, I., Hadjilucas, L., Bharath, A.A.: Small hand-held object recognition test (short). In Applications of Computer Vision. IEEE (2014)
Merler, M., Galleguillos, C., Belongie, S.: Recognizing groceries in situ using in vitro training data. In: Computer Vision and Pattern Recognition. IEEE (2007)
Winlock, T., Christiansen, E., Belongie, S.: Toward real-time grocery detection for the visually impaired. In: Computer Vision and Pattern Recognition Workshops. IEEE (2010)
Csurka, G., Dance, C., Fan, L., Willamowski, J., Bray, C.: Visual categorization with bags of key points. In: Workshop on statistical learning in computer vision (2004)
Yao, B., Khosla, A., Fei-Fei, L.: Combining randomization and discrimination for fine-grained image categorization. In Computer Vision and Pattern Recognition. IEEE (2011)
Lowe, D.: Distinctive image features from scale-invariant keypoints. Int. J. Comput. Vis. (2004)
Krizhevsky, I.S., Hinton, G.E.: Imagenet classification with deep convolutional neural networks. In: Advances In Neural Information Processing Systems (2012)
Arel, I., Rose, D.C., Karnowski, T.P.: Deep machine learning-a new frontier in artificial intelligence research. In Computational Intelligence Magazine. IEEE (2010)
Razavian, A.S., Azizpour, H., Sullivan, J., Carlsson, S.: Cnn features off-the-shelf: an astounding baseline for recognition. In Computer Vision and Pattern Recognition Workshops. IEEE (2014)
Sunderhauf, N., McCool, C., Upcroft, B., Tristan, P.: Fine-grained plant classification using convolutional neural networks for feature extraction. In: Working notes of CLEF 2014 Conference (2014)
Yangqing, J., Shelhamer, E., Donahue J., Karayev, S., Long, J., Girshick, R., Guadarrama, S., Darrell, T.: Caffe: Convolutional architecture for fast feature embedding. In: arXiv preprint (2014)
Pedregosa, F., Varoquaux, G., Gramfort, A., Michel, V., Thirion, B., Grisel, O., Blondel, M., Prettenhofer, P., Weiss, R., Dubourg, V., Vanderplas, J., Passos, A., Cournapeau, D., Brucher, M., Perrot, M., Duchesnay, É.: Scikit-learn: machine learning in python. J. Mach. Learn. Res. 12, 2825–2830 (2011)
Bay, H., Tuytelaars, T., Van Gool, L.: Surf: speeded up robust features. In Computer Vision–ECCV 2006. Springer, Berlin (2006)
Muja, M., Lowe, D.G.: Fast approximate nearest neighbors with automatic algorithm configuration. VISAPP (2009)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing Switzerland
About this paper
Cite this paper
Dingli, A., Mercieca, I. (2016). Multimedia Interfaces for People Visually Impaired. In: Di Bucchianico, G., Kercher, P. (eds) Advances in Design for Inclusion. Advances in Intelligent Systems and Computing, vol 500. Springer, Cham. https://doi.org/10.1007/978-3-319-41962-6_43
Download citation
DOI: https://doi.org/10.1007/978-3-319-41962-6_43
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-41961-9
Online ISBN: 978-3-319-41962-6
eBook Packages: EngineeringEngineering (R0)