Abstract
The number of connected smart devices enabling multimedia applications has expanded tremendously in Internet-of-Things (IoT) environments. Specifically, the requirement for a high quality of experience (QoE) for video streaming services is a crucial prerequisite for a range of use cases, including smart surveillance, smart healthcare, smart agriculture and many more. However, providing a high QoE for video streaming is challenging due to underlying dynamic network conditions. To address this issue, several adaptive bit rate (ABR) algorithms based on predetermined rules have been developed. However, they do not generalize well to a wide variety of network conditions. ABR algorithms, based on reinforcement learning (RL), have been proven to be more effective at generalizing to varying network conditions but they still have limitations, specifically, constrained exploration and high variance in value estimates. In this paper, we propose asynchronous advantage actor-critic (A3C) based actor-learner architectures for generating the adaptive bit rates for video streaming in IoT environments. To address the existing issues, we propose integrating two advanced A3C algorithms: Follow then Forage Exploration (FFE) and Averaged A3C. We demonstrate their efficacy in improving the QoE over vanilla A3C. Additionally, we also demonstrate the benefits of the proposed architecture for video streaming under different network conditions and for different variants of the QoE metric. We show that advanced A3C methods provide up to 30.70% improvement in QoE over vanilla A3C and a considerably higher QoE over other fixed-rule-based ABR algorithms.
Similar content being viewed by others
References
Mumtaz S, Al-Dulaimi A, Frascolla V, Hassan SA, Dobre OA (2019) Guest editorial special issue on 5G and beyond-mobile technologies and applications for IoT. IEEE Internet Things J 6(1):203–206
Alvi SA, Afzal B, Shah GA, Atzori L, Mahmood W (2015) Internet of multimedia things: vision and challenges. Ad Hoc Netw 33:87–111
Barakabitze AA, Barman N, Ahmad A, Zadtootaghaj S, Sun L, Martini MG et al (2019) QoE management of multimedia streaming services in future networks: a tutorial and survey. IEEE Commun Surv Tutor 22(1):526–565
Floris A, Atzori L (2015) Quality of experience in the multimedia internet of things: definition and practical use-cases. In: 2015 IEEE international conference on communication workshop (ICCW). IEEE, 2015. pp 1747–1752
Floris A, Atzori L (2016) Managing the quality of experience in the multimedia internet of things: a layered-based approach. Sensors 16(12):2057
Karaadi A, Sun L, Mkwawa IH (2017) Multimedia communications in internet of things QoT or QoE? In: 2017 IEEE international conference on internet of things (iThings) and IEEE green computing and communications (GreenCom) and IEEE cyber, physical and social computing (CPSCom) and IEEE smart data (SmartData). IEEE, 2017. pp 23–29
Fizza K, Banerjee A, Mitra K, Jayaraman PP, Ranjan R, Patel P et al (2021) QoE in IoT: a vision, survey and future directions. Discov Internet Things 1(1):1–14
Rajavel R, Ravichandran S, Harimoorthy K, Nagappan P, Kanagachidambaresan G (2021) IoT-based smart healthcare video surveillance system using edge computing. J Ambient Intell Humaniz Comput 03:1–13
Plageras A, Psannis K, Ishibashi Y, Kim BG (2016) IoT-based surveillance system for ubiquitous healthcare
Islam M, Rahaman A, Islam R (2020) Development of smart healthcare monitoring system in IoT environment. SN Comput Sci 05(1):185
Khan J, Li JP, Ahamad B, Parveen S, Ul Haq A, Khan GA et al (2020) SMSH: secure surveillance mechanism on smart healthcare IoT system with probabilistic image encryption. IEEE Access 8:15747–15767
Haque A, Milstein A, Fei-Fei L (2020) Illuminating the dark spaces of healthcare with ambient intelligence. Nature 09(585):193–202
Nayyar A, Puri V (2016) Smart farming: IoT based smart sensors agriculture stick for live temperature and moisture monitoring using Arduino, cloud computing and solar technology, pp 673–680
Farooq MS, Riaz S, Abid A, Umer T, Zikria YB (2020) Role of IoT technology in agriculture: a systematic literature review. Electronics 9(2). Available from: https://www.mdpi.com/2079-9292/9/2/319
Boobalan J, Jacintha V, Nagarajan J, Thangayogesh K, Tamilarasu S (2018) An IOT based agriculture monitoring system. In: 2018 international conference on communication and signal processing (ICCSP), pp 0594–0598
Ayaz M, Ammad-Uddin M, Sharif Z, Mansour A, Aggoune EHM (2019) Internet-of-Things (IoT)-based smart agriculture: toward making the fields talk. IEEE Access 7:129551–129583
Hussain T, Muhammad K, Khan S, Ullah A, Lee M, Baik S (2019) Intelligent baby behavior monitoring using embedded vision in IoT for smart healthcare centers. J Artif Intell Syst 01(1):110–124
Sharma S, Rajan udeja R, Gagangeet ujla S, Rasmeet et al (2020) DeTrAs: deep learning-based healthcare framework for IoT-based assistance of Alzheimer patients. Neural Comput Appl pp 1-13
Vasisht D, Kapetanovic Z, Won J, Jin X, Chandra R, Sinha S, et al (2017) FarmBeats: An IoT platform for data-driven agriculture. In: 14th USENIX symposium on networked systems design and implementation (NSDI 17). Boston, MA: USENIX Association, pp 515–529. Available from: https://www.usenix.org/conference/nsdi17/technical-sessions/presentation/vasisht
Datta SK, Dugelay JL, Bonnet C (2018) IoT Based UAV platform for emergency services. In: 2018 international conference on information and communication technology convergence (ICTC), pp 144–147
Sandvine Global Interrnet Phenomena Report (2020) Available from: https://www.sandvine.com/phenomena
Dash.js Available from: https://github.com/Dash-Industry-Forum/dash.js/
Shahzadi S, Iqbal M, Dagiuklas T, Qayyum ZU (2017) Multi-access edge computing: open issues, challenges and future perspectives. J Cloud Comput 6(1):1–13
Jiang X, Yu FR, Song T, Leung VC (2021) A survey on multi-access edge computing applied to video streaming: some research issues and challenges. IEEE Commun Surv Tutor 23(2):871–903
Li Y, Frangoudis PA, Hadjadj-Aoul Y, Bertin P (2016) A mobile edge computing-based architecture for improved adaptive HTTP video delivery. In: 2016 IEEE conference on standards for communications and networking (CSCN). IEEE, pp 1–6
Bentaleb A, Taani B, Begen AC, Timmerer C, Zimmermann R (2018) A survey on bitrate adaptation schemes for streaming media over HTTP. IEEE Commun Surv Tutor 21(1):562–585
Spiteri K, Urgaonkar R, Sitaraman RK (2020) BOLA: near-optimal bitrate adaptation for online videos. IEEE/ACM Trans Netw 28(4):1698–1711
Sutton RS, Barto AG (2011) Reinforcement learning: an introduction
Mao H, Netravali R, Alizadeh M (2017) Neural adaptive video streaming with pensieve. In: Proceedings of the conference of the ACM special interest group on data communication, pp 197–210
Saxena P, Naresh M, Gupta M, Achanta A, Kota S, Gupta S (2020) NANCY: neural adaptive network coding methodology for video distribution over wireless networks. arXiv preprint arXiv:2008.09559
Mnih V, Badia AP, Mirza M, Graves A, Harley T, Lillicrap TP, et al (2016) Asynchronous methods for deep reinforcement learning. In: Proceedings of the 33rd international conference on international conference on machine learning - Volume 48. ICML’16. JMLR.org, pp 1928–1937
Babaeizadeh M, Frosio I, Tyree S, Clemons J, Kautz J (2016) GA3C: GPU-based A3C for deep reinforcement learning. CoRR abs/161106256
Espeholt L, Soyer H, Munos R, Simonyan K, Mnih V, Ward T, et al (2018) Impala: scalable distributed deep-rl with importance weighted actor-learner architectures. In: International conference on machine learning. PMLR, pp 1407–1416
Chen S, Zhang XF, Wu JJ, Liu D (2018) Averaged-A3C for asynchronous deep reinforcement learning. International conference on neural information processing. Springer, Berlin, pp 277–288
Holliday JB, Le TN (2020) follow then forage exploration: improving asynchronous advantage actor critic. In: International conference on soft computing, artificial intelligence and applications (SAI). pp 107–118
Huang TY, Johari R, McKeown N, Trunnell M, Watson M (2014) A buffer-based approach to rate adaptation: evidence from a large video streaming service. In: Proceedings of the 2014 ACM Conference on SIGCOMM. SIGCOMM ’14. pp 187–198. Association for Computing Machinery, New York
Sun Y, Yin X, Jiang J, Sekar V, Lin F, Wang N, et al (2016) CS2P: Improving video bitrate selection and adaptation with data-driven throughput prediction. In: Proceedings of the 2016 ACM SIGCOMM Conference
Jiang J, Sekar V, Zhang H (2012) Improving fairness, efficiency, and stability in HTTP-based adaptive video streaming with FESTIVE. CoNEXT ’12. pp 97–108. Association for Computing Machinery, New York
Spiteri K, Urgaonkar R, Sitaraman RK (2016) BOLA: near-optimal bitrate adaptation for online videos. In: IEEE INFOCOM 2016 - The 35th Annual IEEE international conference on computer communications, pp 1–9
Federal Communications Commission (2016) Raw Data - Measuring Broadband America, Available from: https://www.fcc.gov/reports-research/reports/ measuring- broadband- america/raw- data- measuring- broadband- america- 2016
Riiser H, Vigmostad P, Griwodz C, Halvorsen P (2013) Commute path bandwidth traces from 3G networks: analysis and applications. MMSys ’13. pp 114–118. Association for Computing Machinery, New York
Akhtar Z (2018) Oboe: Auto-tuning Video ABR Algorithms to Network Conditions. Oboe: Auto-tuning Video ABR Algorithms to Network Conditions. August 20–25, Budapest, Hungary
Yin X, Jindal A, Sekar V, Sinopoli B (2015) A control-theoretic approach for dynamic adaptive video streaming over HTTP. SIGCOMM Comput Commun Rev 45(4):325–338
De Cicco L, Caldaralo V, Palmisano V, Mascolo S (2013) Elastic: a client-side controller for dynamic adaptive streaming over http (dash). In: 20th International packet video workshop. IEEE, pp 1–8
Yousef H, Feuvre JL, Storelli A (2020) ABR prediction using supervised learning algorithms. In: 2020 IEEE 22nd international workshop on multimedia signal processing (MMSP), pp 1–6
Sani Y, Raca D, Quinlan JJ, Sreenan CJ (2020) SMASH: A supervised machine learning approach to adaptive video streaming over HTTP. In: 2020 twelfth international conference on quality of multimedia experience (QoMEX), pp 1–6
Huang T, Sun L (2020) Deepmpc: a mixture abr approach via deep learning And Mpc. In: 2020 IEEE International conference on image processing (ICIP), pp 1231–1235
Amour L, Souihi S, Mellouk A, Mushtaq MS (2019) Q2ABR: QoE-aware adaptive video bit rate solution. Int J Commun Syst 11:33
Tian Z, Zhao L, Nie L, Chen P, Chen S (2019) Deeplive: QoE optimization for live video streaming through deep reinforcement learning. In: 2019 IEEE 25th international conference on parallel and distributed systems (ICPADS), pp 827–831
Liu J, Tao X, Lu J (2019) QoE-oriented rate adaptation for DASH with enhanced deep Q-learning. IEEE Access 7:8454–8469
Mao H, Netravali R, Alizadeh M (2017) Neural adaptive video streaming with pensieve. In: Proceedings of the conference of the ACM special interest group on data communication. SIGCOMM ’17. pp 197–210. Association for Computing Machinery, New York
Akhtar Z, Nam YS, Govindan R, Rao S, Chen J, Katz-Bassett E, et al (2018) Oboe: auto-tuning video ABR algorithms to network conditions. In: Proceedings of the 2018 conference of the ACM special interest group on data communication. SIGCOMM ’18. pp 44–58. Association for Computing Machinery, New York
Mnih V, Badia AP, Mirza M, Graves A, Lillicrap TP, Harley T et al (2016) Asynchronous Methods for Deep Reinforcement Learning. CoRR arXiv:1602.01783
Yang SR, Tseng YJ, Huang CC, Lin WC (2018) Multi-access edge computing enhanced video streaming: proof-of-concept implementation and prediction/QoE models. IEEE Trans Veh Technol 68(2):1888–1902
Schulman J, Moritz P, Levine S, Jordan M, Abbeel P (2018) High-dimensional continuous control using generalized advantage estimation
Tuli S, Ilager S, Buyya R (2020) Dynamic scheduling for stochastic edge-cloud computing environments using A3C learning and residual recurrent neural networks
Goudarzi M (2021) A distributed deep reinforcement learning technique for application placement in edge and fog computing environments
Wang X, Wang C, Li X, Leung VC, Taleb T (2020) Federated deep reinforcement learning for Internet of Things with decentralized cooperative edge caching. IEEE Internet Things J 7(10):9441–9455
Wu CL, Chiu TC, Wang CY, Pang AC (2020) Mobility-aware deep reinforcement learning with glimpse mobility prediction in edge computing. In: ICC 2020-2020 IEEE international conference on communications (ICC). IEEE, pp 1–7
Mondal A, Palit B, Khandelia S, Pal N, Jayatheerthan J, Paul K et al (2020) Efficient EnDASH-A mobility adapted energy, video streaming ABR, for cellular networks. In: IFIP networking conference (Networking). IEEE, pp 127–135
Chen S, Zhang XF, Wu JJ, Liu D (2018) Averaged-A3C for asynchronous deep reinforcement learning. In: Cheng L, Leung ACS, Ozawa S (eds) Neural information processing. Springer International Publishing, Cham, pp 277–288
Netravali R, Sivaraman A, Das S, Goyal A, Winstein K, Mickens J, et al (2015) Mahimahi: accurate record-and-replay for HTTP. USENIX ATC ’15. pp 417–429. USENIX Association, USA
Narayanan A, Ramadan E, Carpenter J, Liu Q, Liu Y, Qian F et al (2020) A first look at commercial 5G performance on smartphones. In: Proceedings of the web conference, pp 894–905
Acknowledgements
This work has been supported by TCS foundation, India under the TCS research scholar program, 2019-2023.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Naresh, M., Das, V., Saxena, P. et al. Deep reinforcement learning based QoE-aware actor-learner architectures for video streaming in IoT environments. Computing 104, 1527–1550 (2022). https://doi.org/10.1007/s00607-021-01046-1
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00607-021-01046-1
Keywords
- Deep reinforcement learning
- Internet of things (IoT)
- Quality of experience (QoE)
- Adaptive bit rates (ABR)
- Video streaming