Skip to main content

Deep Learning for Multimedia Data in IoT

  • Chapter
  • First Online:
Multimedia Big Data Computing for IoT Applications

Part of the book series: Intelligent Systems Reference Library ((ISRL,volume 163))

Abstract

With the advent of Internet leading to proliferation of large amounts of multimedia data, the analytics of the aggregated multimedia data is proven to be one of the active areas of research and study. Multimedia data includes audio, video, images associated with applications like similarity searches, entity resolution, and classification. Visual data mining is now one of the active learning fields that include surveillance applications for object detection, fraud detection, crime detection, and other applications. Multimedia data mining includes many challenges like data volume, variety, and unstructured nature, nonstationary, and real time. It needs advanced processing capabilities to make decisions in near real time. The existing traditional database systems, data mining techniques cannot be used because of its limitations. Hence, to process such large amounts of data advanced techniques like machine learning, deep learning methods can be used. Multimedia data also includes sensor data that is widely generated. Most of the healthcare applications include sensors for detecting heart rate, blood pressure, and pulse rate. The advancement of the smartphones has resulted in fitness based applications based on the number of steps walked, calories count, kilometers ran, etc. All these types of data can be classified as Multimedia data for Internet of Things (IoT). There are many interfacing devices that are interconnected to each other with backbone as a computer network when sensor data is involved. The main aim of this chapter is to highlight the importance and convergence of deep learning techniques with IoT. Emphasis is laid on classification of IoT data using deep learning and the essential fine-tuning of parameters. A virtual sensor device implemented in python is used for simulation. An account of protocols used for communication of IoT devices is briefly discussed. A case study is provided regarding classification of Air Quality Dataset using deep learning techniques.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

eBook
USD 16.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 89.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 119.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. A. Kumari, S. Tanwar, S. Tyagi, N. Kumar, M. Maasberg, K.-K.R. Choo, Multimedia big data computing and Internet of Things applications: a taxonomy and process model. J. Netw. Comput Appl. (2018)

    Google Scholar 

  2. P.K. Atrey, M. Anwar Hossain, A. El Saddik, M.S. Kankanhalli, Multimodal fusion for multimedia analysis: a survey. Multimed. Syst. 16(6), 345–379 (2010)

    Article  Google Scholar 

  3. J. Gubbi, R. Buyya, S. Marusic, M. Palaniswami, Internet of Things (IoT): a vision, architectural elements, and future directions. Future Gener. Comput. Syst. 29(7), 1645–1660 (2013)

    Article  Google Scholar 

  4. C.A. Bhatt, M.S. Kankanhalli, Multimedia data mining: state of the art and challenges. Multimed. Tools Appl. 51(1), 35–76 (2011)

    Article  Google Scholar 

  5. F. Venter, A. Stein, Images & videos: really big data. Anal. Mag. 14–47 (2012)

    Google Scholar 

  6. D. Che, M. Safran, Z. Peng, From big data to big data mining: challenges, issues, and opportunities, in Database Systems for Advanced Applications (Springer, Wuhan, China, 2013), pp. 1–15

    Google Scholar 

  7. Z. Wu, M. Zou, An incremental community detection method for social tagging systems using locality-sensitive hashing. Neural Netw. 58(1), 12–28 (2014)

    Google Scholar 

  8. P.K. Atrey, N.C. Maddage, M.S. Kankanhalli, Audio based event detection for multimedia surveillance, in 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings, vol. 5 (IEEE, 2006)

    Google Scholar 

  9. NLTK, https://www.nltk.org/

  10. Scikit learn, http://scikit-learn.org/

  11. Hadoop, https://hadoop.apache.org/

  12. Spark, https://spark.apache.org

  13. J. Herrera, G. Molto, Detecting events in streaming multimedia with big data techniques, in 2016 Proceedings of 24th Euromicro International Conference on Parallel, Distributed, and Network-Based Processing (PDP), Heraklion Crete, Greece (2016), pp. 345–349

    Google Scholar 

  14. S. Hershey, S. Chaudhuri, D.P. Ellis, J.F. Gemmeke, A. Jansen, R.C. Moore, M. Plakal, D. Platt, R.A. Saurous, B. Seybold et al., CNN architectures for large-scale audio classification, in 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (IEEE, 2017), pp. 131–135, https://www.tableau.com/ (14. Tableau)

  15. OpenCV, https://opencv.org/

  16. Librosa, https://librosa.github.io/librosa/

  17. K. Alex, I. Sutskever, E.H. Geoffrey, ImageNet Classification with Deep Convolutional Neural Networks (2012), pp. 1097–1105

    Google Scholar 

  18. J. Schmidhuber, Deep learning in neural networks: an overview. Neural Netw. 61, 85–117 (2015)

    Article  Google Scholar 

  19. C. Harris, M. Stephens, A combined corner and edge detector, in Alvey Vision Conference, vol. 15, no. 50 (1988), pp. 10–5244

    Google Scholar 

  20. T. Lindeberg, Scale invariant feature transform (2012)

    Article  Google Scholar 

  21. E. Rublee, V. Rabaud, K. Konolige, G. Bradski, ORB: an efficient alternative to SIFT or SURF, in 2011 IEEE International Conference on Computer Vision (ICCV) (IEEE, 2011), pp. 2564–2571

    Google Scholar 

  22. N. Dalal, B. Triggs, Histograms of oriented gradients for human detection, in IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2005, CVPR 2005, vol. 1 (IEEE, 2005), pp. 886–893

    Google Scholar 

  23. Air quality data, https://archive.ics.uci.edu/ml/datasets/Air+quality

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Srinidhi Hiriyannaiah .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer Nature Singapore Pte Ltd.

About this chapter

Check for updates. Verify currency and authenticity via CrossMark

Cite this chapter

Hiriyannaiah, S., Akanksh, B.S., Koushik, A.S., Siddesh, G.M., Srinivasa, K.G. (2020). Deep Learning for Multimedia Data in IoT. In: Tanwar, S., Tyagi, S., Kumar, N. (eds) Multimedia Big Data Computing for IoT Applications. Intelligent Systems Reference Library, vol 163. Springer, Singapore. https://doi.org/10.1007/978-981-13-8759-3_4

Download citation

Publish with us

Policies and ethics