Abstract
Processing neural network inferences on edge devices, such as smartphones, IoT devices and smart sensors can provide substantial advantages compared to traditional cloud-based computation. These include technical aspects such as low latency or high data throughput – but also data sovereignty can be a concern in many applications. Even though general approaches of distributed inference have been developed recently, a transfer of these principles to the edge of the network is still missing. In an extreme edge setting, computations typically are severely constraint by available energy resources and communication limitations among individual devices, which makes the distribution and execution of large-scale neural networks particularly challenging. Moreover, since mobile devices are volatile, existing static networks are unsuited and instead a highly dynamic network architecture needs to be orchestrated and managed. We present a novel, multi-stage concept which tackles all associated tasks in one framework. Specifically, distributed inference approaches are complemented with necessary resource management and network orchestration in order to enable distributed inference in the field – hence paving the way towards a broad number of possible applications, such as autonomous driving, traffic optimization, medical applications, agriculture, and industry 4.0.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Badihi, B., Ghavimi, F., Jäntti, R.: On the system-level performance evaluation of bluetooth 5 in IoT: Open office case study. In: 2019 16th International Symposium on Wireless Communication Systems (ISWCS), pp. 485–489 (2019). https://doi.org/10.1109/ISWCS.2019.8877223
Bughin, J., Seong, J., Manyika, J., Chui, M., Joshi, R.: Notes from the AI frontier: modeling the impact of AI on the world economy. McKinsey Glob. Inst. 4 (2018)
Giordani, M., Polese, M., Mezzavilla, M., Rangan, S., Zorzi, M.: Toward 6G networks: use cases and technologies. IEEE Commun. Mag. 58(3), 55–61 (2020). https://doi.org/10.1109/MCOM.001.1900411
Hemmat, M., Davoodi, A., Hu, Y.H.: EdgenAI: distributed inference with local edge devices and minimal latency. In: Proceedings of the Asia and South Pacific Design Automation Conference, ASP-DAC, vol. 2022-January, pp. 544–549. Institute of Electrical and Electronics Engineers Inc. (2022). https://doi.org/10.1109/ASP-DAC52403.2022.9712496
Hemmat, M., Miguel, J.S., Davoodi, A.: CAP’NN: class-aware personalized neural network inference. In: 2020 57th ACM/IEEE Design Automation Conference (DAC), pp. 1–6. IEEE (2020). https://doi.org/10.1109/DAC18072.2020.9218741
Hinds, A., Ngulube, M., Zhu, S., Al-Aqrabi, H.: A review of routing protocols for mobile ad-hoc networks (manet). Int. J. Inf. Educ. Technol. 3(1), 1 (2013)
OpenSim Ltd.: Omnet++ discrete event simulator (2023). https://omnetpp.org
Nakamura, T.: 5G evolution and 6G. In: 2020 IEEE Symposium on VLSI Technology, pp. 1–5 (2020). https://doi.org/10.1109/VLSITechnology18217.2020.9265094
Parthasarathy, A., Krishnamachari, B.: DEFER: distributed edge inference for deep neural networks. In: 2022 14th International Conference on COMmunication Systems and NETworkS, COMSNETS 2022, pp. 749–753. Institute of Electrical and Electronics Engineers Inc. (2022). https://doi.org/10.1109/COMSNETS53615.2022.9668515, arXiv: 2201.06769
Peltonen, E., et al.: 6G white paper on edge intelligence. arXiv preprint arXiv:2004.14850 (2020)
Royer, E., Toh, C.K.: A review of current routing protocols for ad hoc mobile wireless networks. IEEE Pers. Commun. 6(2), 46–55 (1999). https://doi.org/10.1109/98.760423
Shahraki, A., Ohlenforst, T., Kreyß, F.: When machine learning meets Network Management and Orchestration in edge-based networking paradigms. J. Netw. Comput. Appl. 212, 103558 (2023). https://doi.org/10.1016/j.jnca.2022.103558
Stahl, R., Hoffman, A., Mueller-Gritschneder, D., Gerstlauer, A., Schlichtmann, U.: DeeperThings: fully distributed CNN inference on resource-constrained edge devices. Int. J. Parallel Prog. 49(4), 600–624 (2021). https://doi.org/10.1007/s10766-021-00712-3
Zhao, Z., Barijough, K.M., Gerstlauer, A.: DeepThings: Distributed adaptive deep learning inference on resource-constrained IoT edge clusters. In: IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, vol. 37, no. 11, pp. 2348–2359. Institute of Electrical and Electronics Engineers Inc. (2018). https://doi.org/10.1109/TCAD.2018.2858384. ISSN: 02780070
Acknowledgements
This work was supported by German Aerospace Center (DLR), by Bayerisches Staatsministerium für Wirtschaft, Landesentwicklung und Energie (StMWi) and Bundesministerium für Wirtschaft und Klimaschutz (BMWK).
Author information
Authors and Affiliations
Corresponding authors
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Ohlenforst, T., Schreiber, M., Kreyß, F., Schrauth, M. (2023). Enabling Distributed Inference of Large Neural Networks on Resource Constrained Edge Devices using Ad Hoc Networks. In: Ossowski, S., Sitek, P., Analide, C., Marreiros, G., Chamoso, P., Rodríguez, S. (eds) Distributed Computing and Artificial Intelligence, 20th International Conference. DCAI 2023. Lecture Notes in Networks and Systems, vol 740. Springer, Cham. https://doi.org/10.1007/978-3-031-38333-5_15
Download citation
DOI: https://doi.org/10.1007/978-3-031-38333-5_15
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-38332-8
Online ISBN: 978-3-031-38333-5
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)