Skip to main content
Log in

Joint Task Dispatching and Bandwidth Allocation with Hard Deadlines in Distributed Serverless Edge Computing Systems

  • Research
  • Published:
Journal of Grid Computing Aims and scope Submit manuscript

Abstract

Serverless computing lifts the burden of infrastructure maintenance from application developers and also reduces the usage cost of cloud/edge computing platforms. However, the pay-as-you-go pricing model offered by serverless computing complicates the task dispatching problem in serverless computing systems. Specifically, when the pay-as-you-go pricing model is adopted, the task response latency is not simply the task execution latency, but the cold-start latency and the container image downloading latency should also be considered as part of the task response latency. In this paper, we focus on the joint task dispatching and bandwidth allocation problem with hard deadlines in distributed serverless edge computing systems. To maximize the overall profit, a new algorithm called PN-GRD is presented. PN-GRD first uses a Pointer Network model that is well trained offline to inference a task permutation, which is used to determine the task priority. Then, multiple edge node selection steps are carried out to select an edge node for each task according to the task priority. The final task dispatching and bandwidth allocation is obtained once the best edge node for every task is chosen or none of the edge nodes is suitable to run a task. We validate the performance of PN-GRD through simulations. The results show that PN-GRD outperforms practical baselines in terms of the average overall profit.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. Castro, P., Ishakian, V., Muthusamy, V., Slominski, A.: The rise of serverless computing. Commun. ACM 62(12), 44–54 (2019). https://doi.org/10.1145/3368454

    Article  Google Scholar 

  2. Google: Google Cloud Functions (2023). https://cloud.google.com/functions

  3. Amazon: Amazon Lambda (2023). https://aws.amazon.com/lambda

  4. Microsoft: Microsoft Azure Functions (2023). https://azure.microsoft.com/en-us/services/functions

  5. Nastic, S., Rausch, T., Scekic, O., Dustdar, S., Gusev, M., Koteska, B., Kostoska, M., Jakimovski, B., Ristov, S., Prodan, R.: A serverless real-time data analytics platform for edge computing. IEEE Internet Comput. 21(4), 64–71 (2017). https://doi.org/10.1109/MIC.2017.2911430

    Article  Google Scholar 

  6. Ao, L., Izhikevich, L., Voelker, G.M., Porter, G.: Sprocket: A serverless video processing framework. In: ACM Symposium on Cloud Computing, pp. 263–274 (2018)

  7. Nesen, A., Bhargava, B.: Towards situational awareness with multimodal streaming data fusion: Serverless computing approach. In: Int. Workshop on Big Data in Emergent Distributed Environments (2021)

  8. Mvondo, D., Bacou, M., Nguetchouang, K., Ngale, L., Pouget, S., Kouam, J., Lachaize, R., Hwang, J., Wood, T., Hagimont, D., De Palma, N., Batchakui, B., Tchana, A.: OFC: an opportunistic caching system for faas platforms. In: 16th European Conf. on Computer Systems, pp. 228–244 (2021)

  9. Fuerst, A., P., S.: Faascache: Keeping serverless computing alive with greedy-dual caching. In: 26th ACM Int. Conf. on Architectural Support for Programming Languages and Operating Systems, pp. 386–400 (2021)

  10. Shahrad, M., Fonseca, R., Goiri, I., Chaudhry, G., Batum, P., Cooke, J., Laureano, E., Tresness, C., Russinovich, M., Bianchini, R.: Serverless in the wild: Characterizing and optimizing the serverless workload at a large cloud provider. In: 2020 USENIX Annual Technical Conference, pp.205–218 (2020)

  11. Wu, F., Wu, Q., Tan, Y.: Workflow scheduling in cloud: A survey. The Journal of Super computing 71, 3373–3418 (2015). https://doi.org/10.1007/s11227-015-1438-4

    Article  Google Scholar 

  12. Luo, Q., Hu, S., Li, C., Li, G., Shi, W.: Resource scheduling in edge computing: A survey. IEEE Communications Surveys & Tutorials 23(4), 2131–2165 (2021). https://doi.org/10.1109/COMST.2021.3106401

    Article  Google Scholar 

  13. Wu, Q., Zhou, M., Zhu, Q., Xia, Y., Wen, J.: Moels: Multiobjective evolutionary list scheduling for cloud workflows. IEEE Trans. Autom. Sci. Eng. 17(1), 166–176 (2021). https://doi.org/10.1109/TASE.2019.2918691

    Article  Google Scholar 

  14. Zhu, Z., Zhang, G., Li, M., Liu, X.: Evolutionary multi-objective workflow scheduling in cloud. IEEE Trans. Parallel Distrib. Syst. 27(5), 1344–1357 (2016). https://doi.org/10.1109/TPDS.2015.2446459

    Article  Google Scholar 

  15. Kao, Y.-H., Krishnamachari, B., Ra, M.-R., Bai, F.: Hermes: Latency optimal task assignment for resource-constrained mobile computing. IEEE Trans. Mob. Comput. 16(11), 3056–3069 (2017). https://doi.org/10.1109/TMC.2017.2679712

    Article  Google Scholar 

  16. Liu, L., Tan, H., Jiang, S.H.-C., Han, Z., Li, X., Huang, H.: Dependent task placement and scheduling with function configuration in edge computing. In: Int. Symposium on Quality of Service (2019)

  17. Goudarzi, M., Palaniswami, M., Buyya, R.: A distributed deep reinforcement learning technique for application placement in edge an fog computing environments. IEEE Trans. Mob. Comput. 22(5), 2491–2505 (2023). https://doi.org/10.1109/TMC.2021.3123165

    Article  Google Scholar 

  18. Zhang, W., Zhang, Z., Zeadally, S., Chao, H.-C.: Efficient task scheduling with stochastic delay cost in mobile edge computing. IEEE Commun. Lett. 23(1), 4–7 (2019). https://doi.org/10.1109/LCOMM.2018.2879317

    Article  Google Scholar 

  19. Zhang, D., Tan, L., Ren, J., Awad, M.K., Zhang, S., Zhang, Y., Wan, P.-J.: Near optimal and truthful online auction for computation offloading in green edge-computing systems. IEEE Trans. Mob. Comput. 19(4), 880–893 (2020). https://doi.org/10.1109/TMC.2019.2901474

    Article  Google Scholar 

  20. Wei, X., Rahman, A.B.M.M., Cheng, D., Wang, Y.: Joint optimization across timescales: Resource placement and task dispatching in edge clouds. IEEE Transactions on Cloud Computing 11(1), 730–744 (2023). https://doi.org/10.1109/TCC.2021.3113605

    Article  Google Scholar 

  21. Zeng, D., Gu, L., Guo, S., Cheng, Z., Yu, S.: Joint optimization of task scheduling and image placement in fog computing supported software-defined embedded system. IEEE Trans. Comput. 65(12), 3702–3712 (2016). https://doi.org/10.1109/TC.2016.2536019

    Article  MathSciNet  Google Scholar 

  22. Lei, Z., Xu, H., Huang, L., Meng, Z.: Joint service placement and request scheduling for multi-sp mobile edge computing network. In: IEEE 26th Int. Conf. on Parallel and Distributed Systems, pp. 27–34 (2020)

  23. Fei, Z., Wang, Y., Sun, R., Liu, Y.: Delay oriented task scheduling and bandwidth allocation in fog computing networks. In: IEEE Global Communications Conf. (2019)

  24. Arabnejad, V., Bubendorfer, K., Ng, B.: Budget and deadline aware e-science workflow scheduling in clouds. IEEE Trans. Parallel Distrib. Syst. 30(1), 29–44 (2019). https://doi.org/10.1109/TPDS.2018.2849396

    Article  Google Scholar 

  25. Rodriguez, M.A., Buyya, R.: Deadline based resource provisioningand scheduling algorithm for scientific workflows on clouds. IEEE Transactions on Cloud Computing 2(2), 222–235 (2014). https://doi.org/10.1109/TCC.2014.2314655

    Article  Google Scholar 

  26. Wu, Q., Ishikawa, F., Zhu, Q., Xia, Y., Wen, J.: Deadline-constrained cost optimization approaches for workflow scheduling in clouds. IEEE Trans. Parallel Distrib. Syst. 28(12), 3401–3412 (2017). https://doi.org/10.1109/TPDS.2017.2735400

    Article  Google Scholar 

  27. Zuo, X., Zhang, G., Tan, W.: Self-adaptive learning pso-based deadline constrained task scheduling for hybrid iaas cloud. IEEE Trans. Autom. Sci. Eng. 11(2), 564–573 (2014). https://doi.org/10.1109/TASE.2013.2272758

    Article  Google Scholar 

  28. Meng, J., Tan, H., Li, X., Han, Z., Li, B.: Online deadline-aware task dispatching and scheduling in edge computing. IEEE Trans. Parallel Distrib. Syst. 31(6), 1270–1286 (2020). https://doi.org/10.1109/TPDS.2019.2961905

    Article  Google Scholar 

  29. Lou, J., Tang, Z., Zhang, S., Jia, W., Zhao, W., Li, J.: Cost-effective scheduling for dependent tasks with tight deadline constraints in mobile edge computing. IEEE Trans. Mob. Comput. 22(10), 5829–5845 (2023). https://doi.org/10.1109/TMC.2022.3188770

    Article  Google Scholar 

  30. Sundar, S., Liang, B.: Offloading dependent tasks with communication delay and deadline constraint. In: IEEE Conf. on Computer Communications, pp. 37–45 (2018)

  31. Fan, J., Wei, X., Wang, T., Lan, T., Subramaniam, S.: Deadline-aware task scheduling in a tiered iot infrastructure. In: IEEE Global Communications Conf. (2017)

  32. Chai, R., Li, M., Yang, T., Chen, Q.: Dynamic priority-based computation scheduling and offloading for interdependent tasks: Leveraging parallel transmission and execution. IEEE Trans. Veh. Technol. 70(10), 10970–10985 (2021). https://doi.org/10.1109/TVT.2021.3110401

    Article  Google Scholar 

  33. Gu, L., Zeng, D., Hu, J., Li, B., Jin, H.: Layer aware microservice placement and request scheduling at the edge. In: IEEE Conf. on Computer Communications (2021)

  34. Gu, L., Chen, Z., Xu, H., Zeng, D., Li, B., Jin, H.: Layer-aware collaborative microservice deployment toward maximal edge through put. In: IEEE Conf. on Computer Communications, pp. 71–79 (2022)

  35. Wei, X., Lu, F., Wang, T., Gu, J., Yang, Y., Chen, R., Chen, H.: No provisioned concurrency: Fast rdma-codesigned remote fork for serverless computing. In: 17th USENIX Symposium on Operating Systems Design and Implementation, pp. 497–517 (2023)

  36. Puchinger, J., Raidl, G.R., Pferschy, U.: The multidimensional knapsack problem: Structure and algorithms. INFORMS J. Comput. 22(2), 250–265 (2010). https://doi.org/10.1287/ijoc.1090.0344

    Article  MathSciNet  Google Scholar 

  37. Vinyals, O., Fortunato, M., Jaitly, N.: Pointer networks. In: Advances in Neural Information Processing Systems (2015)

  38. Sutskever, I., Vinyals, O., Le., Q.V.: Sequence to sequence learning with neural networks. In: Advances in Neural Information Processing Systems (2014)

  39. Katoch, S., Chauhan, S.S., Kumar, V.: A review on genetic algorithm: Past, present, and future. Multimedia Tools and Applications 80, 8091–8126 (2021). https://doi.org/10.1007/s11042-020-10139-6

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Yuan Sun.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Sun, Y., Zhang, C. & Huang, T. Joint Task Dispatching and Bandwidth Allocation with Hard Deadlines in Distributed Serverless Edge Computing Systems. J Grid Computing 22, 51 (2024). https://doi.org/10.1007/s10723-024-09770-6

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1007/s10723-024-09770-6

Keywords

Navigation