Deep Learning for Multimedia Data in IoT

Hiriyannaiah, Srinidhi; Akanksh, B. S.; Koushik, A. S.; Siddesh, G. M.; Srinivasa, K. G.

doi:10.1007/978-981-13-8759-3_4

Part of the book series: Intelligent Systems Reference Library (ISRL, volume 163)

Abstract

With the Internet driving a proliferation of multimedia data, the analytics of aggregated multimedia data has become an active area of research. Multimedia data includes audio, video, and images, and is associated with applications such as similarity search, entity resolution, and classification. Visual data mining is now an active field of study, covering surveillance applications such as object detection, fraud detection, and crime detection. Multimedia data mining poses many challenges: the data is large in volume, varied, unstructured, and nonstationary, and it often arrives in real time. Advanced processing capabilities are needed to make decisions in near real time. Traditional database systems and data mining techniques cannot be used because of their limitations. Hence, advanced techniques such as machine learning and deep learning can be used to process such large amounts of data. Multimedia data also includes sensor data, which is generated widely. Most healthcare applications include sensors for detecting heart rate, blood pressure, and pulse rate. The advancement of smartphones has resulted in fitness applications based on the number of steps walked, calorie count, kilometers run, etc. All these types of data can be classified as multimedia data for the Internet of Things (IoT). When sensor data is involved, many interfacing devices are interconnected, with a computer network as the backbone. The main aim of this chapter is to highlight the importance of deep learning techniques and their convergence with IoT. Emphasis is laid on classification of IoT data using deep learning and the essential fine-tuning of parameters. A virtual sensor device implemented in Python is used for simulation. The protocols used for communication among IoT devices are briefly discussed. A case study is provided on classification of the Air Quality dataset using deep learning techniques.
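The abstract mentions a virtual sensor device implemented in Python, but this page does not show the chapter's implementation. As a rough illustration of the idea only, the sketch below simulates a sensor that emits a baseline value with Gaussian noise. The class name `VirtualSensor`, the `Reading` record, the sensor identifier, and all parameter values are hypothetical and are not taken from the chapter.

```python
import random
import time
from dataclasses import dataclass


@dataclass
class Reading:
    """One simulated measurement (hypothetical record layout)."""
    sensor_id: str
    timestamp: float
    value: float


class VirtualSensor:
    """Hypothetical simulated sensor: baseline value plus Gaussian noise."""

    def __init__(self, sensor_id, baseline, noise_std, seed=None):
        self.sensor_id = sensor_id
        self.baseline = baseline
        self.noise_std = noise_std
        # A private generator keeps runs reproducible when a seed is given.
        self._rng = random.Random(seed)

    def read(self):
        """Return a single noisy reading stamped with the current time."""
        value = self._rng.gauss(self.baseline, self.noise_std)
        return Reading(self.sensor_id, time.time(), value)

    def stream(self, n):
        """Yield n consecutive readings."""
        for _ in range(n):
            yield self.read()


if __name__ == "__main__":
    # Assumed example values; a real deployment would publish these
    # readings over an IoT protocol instead of printing them.
    sensor = VirtualSensor("co-sensor-1", baseline=2.0, noise_std=0.3, seed=42)
    for reading in sensor.stream(5):
        print(f"{reading.sensor_id} {reading.timestamp:.0f} {reading.value:.3f}")
```

Seeding the generator makes the simulated stream repeatable, which can help when the readings are later fed to a classifier and results need to be compared across runs.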

Author information

Authors and Affiliations

Ramaiah Institute of Technology, Bengaluru, India
Srinidhi Hiriyannaiah, B. S. Akanksh, A. S. Koushik & G. M. Siddesh
National Institute of Technical Teacher Training Research, New Delhi, India
K. G. Srinivasa

Corresponding author

Correspondence to Srinidhi Hiriyannaiah.

Editor information

Editors and Affiliations

Department of Computer Science and Engineering, Institute of Technology, Nirma University, Ahmedabad, Gujarat, India
Sudeep Tanwar
Department of Electronics and Communication Engineering, Thapar Institute of Engineering and Technology, Deemed University, Patiala, Punjab, India
Sudhanshu Tyagi
Department of Computer Science and Engineering, Thapar Institute of Engineering and Technology, Deemed University, Patiala, Punjab, India
Neeraj Kumar

About this chapter

Cite this chapter

Hiriyannaiah, S., Akanksh, B.S., Koushik, A.S., Siddesh, G.M., Srinivasa, K.G. (2020). Deep Learning for Multimedia Data in IoT. In: Tanwar, S., Tyagi, S., Kumar, N. (eds) Multimedia Big Data Computing for IoT Applications. Intelligent Systems Reference Library, vol 163. Springer, Singapore. https://doi.org/10.1007/978-981-13-8759-3_4

DOI: https://doi.org/10.1007/978-981-13-8759-3_4
Published: 18 July 2019
Publisher Name: Springer, Singapore
Print ISBN: 978-981-13-8758-6
Online ISBN: 978-981-13-8759-3
eBook Packages: Intelligent Technologies and Robotics, Intelligent Technologies and Robotics (R0)
