Abstract
Serverless computing lifts the burden of infrastructure maintenance from application developers and also reduces the usage cost of cloud/edge computing platforms. However, the pay-as-you-go pricing model offered by serverless computing complicates the task dispatching problem in serverless computing systems. Specifically, when the pay-as-you-go pricing model is adopted, the task response latency is not simply the task execution latency, but the cold-start latency and the container image downloading latency should also be considered as part of the task response latency. In this paper, we focus on the joint task dispatching and bandwidth allocation problem with hard deadlines in distributed serverless edge computing systems. To maximize the overall profit, a new algorithm called PN-GRD is presented. PN-GRD first uses a Pointer Network model that is well trained offline to inference a task permutation, which is used to determine the task priority. Then, multiple edge node selection steps are carried out to select an edge node for each task according to the task priority. The final task dispatching and bandwidth allocation is obtained once the best edge node for every task is chosen or none of the edge nodes is suitable to run a task. We validate the performance of PN-GRD through simulations. The results show that PN-GRD outperforms practical baselines in terms of the average overall profit.
Similar content being viewed by others
References
Castro, P., Ishakian, V., Muthusamy, V., Slominski, A.: The rise of serverless computing. Commun. ACM 62(12), 44–54 (2019). https://doi.org/10.1145/3368454
Google: Google Cloud Functions (2023). https://cloud.google.com/functions
Amazon: Amazon Lambda (2023). https://aws.amazon.com/lambda
Microsoft: Microsoft Azure Functions (2023). https://azure.microsoft.com/en-us/services/functions
Nastic, S., Rausch, T., Scekic, O., Dustdar, S., Gusev, M., Koteska, B., Kostoska, M., Jakimovski, B., Ristov, S., Prodan, R.: A serverless real-time data analytics platform for edge computing. IEEE Internet Comput. 21(4), 64–71 (2017). https://doi.org/10.1109/MIC.2017.2911430
Ao, L., Izhikevich, L., Voelker, G.M., Porter, G.: Sprocket: A serverless video processing framework. In: ACM Symposium on Cloud Computing, pp. 263–274 (2018)
Nesen, A., Bhargava, B.: Towards situational awareness with multimodal streaming data fusion: Serverless computing approach. In: Int. Workshop on Big Data in Emergent Distributed Environments (2021)
Mvondo, D., Bacou, M., Nguetchouang, K., Ngale, L., Pouget, S., Kouam, J., Lachaize, R., Hwang, J., Wood, T., Hagimont, D., De Palma, N., Batchakui, B., Tchana, A.: OFC: an opportunistic caching system for faas platforms. In: 16th European Conf. on Computer Systems, pp. 228–244 (2021)
Fuerst, A., P., S.: Faascache: Keeping serverless computing alive with greedy-dual caching. In: 26th ACM Int. Conf. on Architectural Support for Programming Languages and Operating Systems, pp. 386–400 (2021)
Shahrad, M., Fonseca, R., Goiri, I., Chaudhry, G., Batum, P., Cooke, J., Laureano, E., Tresness, C., Russinovich, M., Bianchini, R.: Serverless in the wild: Characterizing and optimizing the serverless workload at a large cloud provider. In: 2020 USENIX Annual Technical Conference, pp.205–218 (2020)
Wu, F., Wu, Q., Tan, Y.: Workflow scheduling in cloud: A survey. The Journal of Super computing 71, 3373–3418 (2015). https://doi.org/10.1007/s11227-015-1438-4
Luo, Q., Hu, S., Li, C., Li, G., Shi, W.: Resource scheduling in edge computing: A survey. IEEE Communications Surveys & Tutorials 23(4), 2131–2165 (2021). https://doi.org/10.1109/COMST.2021.3106401
Wu, Q., Zhou, M., Zhu, Q., Xia, Y., Wen, J.: Moels: Multiobjective evolutionary list scheduling for cloud workflows. IEEE Trans. Autom. Sci. Eng. 17(1), 166–176 (2021). https://doi.org/10.1109/TASE.2019.2918691
Zhu, Z., Zhang, G., Li, M., Liu, X.: Evolutionary multi-objective workflow scheduling in cloud. IEEE Trans. Parallel Distrib. Syst. 27(5), 1344–1357 (2016). https://doi.org/10.1109/TPDS.2015.2446459
Kao, Y.-H., Krishnamachari, B., Ra, M.-R., Bai, F.: Hermes: Latency optimal task assignment for resource-constrained mobile computing. IEEE Trans. Mob. Comput. 16(11), 3056–3069 (2017). https://doi.org/10.1109/TMC.2017.2679712
Liu, L., Tan, H., Jiang, S.H.-C., Han, Z., Li, X., Huang, H.: Dependent task placement and scheduling with function configuration in edge computing. In: Int. Symposium on Quality of Service (2019)
Goudarzi, M., Palaniswami, M., Buyya, R.: A distributed deep reinforcement learning technique for application placement in edge an fog computing environments. IEEE Trans. Mob. Comput. 22(5), 2491–2505 (2023). https://doi.org/10.1109/TMC.2021.3123165
Zhang, W., Zhang, Z., Zeadally, S., Chao, H.-C.: Efficient task scheduling with stochastic delay cost in mobile edge computing. IEEE Commun. Lett. 23(1), 4–7 (2019). https://doi.org/10.1109/LCOMM.2018.2879317
Zhang, D., Tan, L., Ren, J., Awad, M.K., Zhang, S., Zhang, Y., Wan, P.-J.: Near optimal and truthful online auction for computation offloading in green edge-computing systems. IEEE Trans. Mob. Comput. 19(4), 880–893 (2020). https://doi.org/10.1109/TMC.2019.2901474
Wei, X., Rahman, A.B.M.M., Cheng, D., Wang, Y.: Joint optimization across timescales: Resource placement and task dispatching in edge clouds. IEEE Transactions on Cloud Computing 11(1), 730–744 (2023). https://doi.org/10.1109/TCC.2021.3113605
Zeng, D., Gu, L., Guo, S., Cheng, Z., Yu, S.: Joint optimization of task scheduling and image placement in fog computing supported software-defined embedded system. IEEE Trans. Comput. 65(12), 3702–3712 (2016). https://doi.org/10.1109/TC.2016.2536019
Lei, Z., Xu, H., Huang, L., Meng, Z.: Joint service placement and request scheduling for multi-sp mobile edge computing network. In: IEEE 26th Int. Conf. on Parallel and Distributed Systems, pp. 27–34 (2020)
Fei, Z., Wang, Y., Sun, R., Liu, Y.: Delay oriented task scheduling and bandwidth allocation in fog computing networks. In: IEEE Global Communications Conf. (2019)
Arabnejad, V., Bubendorfer, K., Ng, B.: Budget and deadline aware e-science workflow scheduling in clouds. IEEE Trans. Parallel Distrib. Syst. 30(1), 29–44 (2019). https://doi.org/10.1109/TPDS.2018.2849396
Rodriguez, M.A., Buyya, R.: Deadline based resource provisioningand scheduling algorithm for scientific workflows on clouds. IEEE Transactions on Cloud Computing 2(2), 222–235 (2014). https://doi.org/10.1109/TCC.2014.2314655
Wu, Q., Ishikawa, F., Zhu, Q., Xia, Y., Wen, J.: Deadline-constrained cost optimization approaches for workflow scheduling in clouds. IEEE Trans. Parallel Distrib. Syst. 28(12), 3401–3412 (2017). https://doi.org/10.1109/TPDS.2017.2735400
Zuo, X., Zhang, G., Tan, W.: Self-adaptive learning pso-based deadline constrained task scheduling for hybrid iaas cloud. IEEE Trans. Autom. Sci. Eng. 11(2), 564–573 (2014). https://doi.org/10.1109/TASE.2013.2272758
Meng, J., Tan, H., Li, X., Han, Z., Li, B.: Online deadline-aware task dispatching and scheduling in edge computing. IEEE Trans. Parallel Distrib. Syst. 31(6), 1270–1286 (2020). https://doi.org/10.1109/TPDS.2019.2961905
Lou, J., Tang, Z., Zhang, S., Jia, W., Zhao, W., Li, J.: Cost-effective scheduling for dependent tasks with tight deadline constraints in mobile edge computing. IEEE Trans. Mob. Comput. 22(10), 5829–5845 (2023). https://doi.org/10.1109/TMC.2022.3188770
Sundar, S., Liang, B.: Offloading dependent tasks with communication delay and deadline constraint. In: IEEE Conf. on Computer Communications, pp. 37–45 (2018)
Fan, J., Wei, X., Wang, T., Lan, T., Subramaniam, S.: Deadline-aware task scheduling in a tiered iot infrastructure. In: IEEE Global Communications Conf. (2017)
Chai, R., Li, M., Yang, T., Chen, Q.: Dynamic priority-based computation scheduling and offloading for interdependent tasks: Leveraging parallel transmission and execution. IEEE Trans. Veh. Technol. 70(10), 10970–10985 (2021). https://doi.org/10.1109/TVT.2021.3110401
Gu, L., Zeng, D., Hu, J., Li, B., Jin, H.: Layer aware microservice placement and request scheduling at the edge. In: IEEE Conf. on Computer Communications (2021)
Gu, L., Chen, Z., Xu, H., Zeng, D., Li, B., Jin, H.: Layer-aware collaborative microservice deployment toward maximal edge through put. In: IEEE Conf. on Computer Communications, pp. 71–79 (2022)
Wei, X., Lu, F., Wang, T., Gu, J., Yang, Y., Chen, R., Chen, H.: No provisioned concurrency: Fast rdma-codesigned remote fork for serverless computing. In: 17th USENIX Symposium on Operating Systems Design and Implementation, pp. 497–517 (2023)
Puchinger, J., Raidl, G.R., Pferschy, U.: The multidimensional knapsack problem: Structure and algorithms. INFORMS J. Comput. 22(2), 250–265 (2010). https://doi.org/10.1287/ijoc.1090.0344
Vinyals, O., Fortunato, M., Jaitly, N.: Pointer networks. In: Advances in Neural Information Processing Systems (2015)
Sutskever, I., Vinyals, O., Le., Q.V.: Sequence to sequence learning with neural networks. In: Advances in Neural Information Processing Systems (2014)
Katoch, S., Chauhan, S.S., Kumar, V.: A review on genetic algorithm: Past, present, and future. Multimedia Tools and Applications 80, 8091–8126 (2021). https://doi.org/10.1007/s11042-020-10139-6
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Sun, Y., Zhang, C. & Huang, T. Joint Task Dispatching and Bandwidth Allocation with Hard Deadlines in Distributed Serverless Edge Computing Systems. J Grid Computing 22, 51 (2024). https://doi.org/10.1007/s10723-024-09770-6
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s10723-024-09770-6