CNNX: A Low Cost, CNN Accelerator for Embedded System in Vision at Edge

  • Research Article - Computer Engineering and Computer Science
  • Published in: Arabian Journal for Science and Engineering

Abstract

Convolutional neural networks (CNNs) are widely deployed in artificial intelligence applications such as computer vision and pattern recognition, and in these applications the CNN is the most computationally intensive part. Recently, many researchers have adopted depthwise convolution to reduce the computational load of CNN execution; at the same time, CNNs keep growing larger and therefore demand an ever larger computational budget. The problem is more severe when such applications run on embedded systems, especially on edge devices, whose processors can hardly handle these heavy computational loads. This paper proposes a lightweight, low-power, and efficient CNN hardware accelerator for edge computing devices, designed explicitly for depthwise CNNs. The proposed accelerator can be configured and programmed to run any lightweight CNN from a wide range of AI networks, such as MobileNet, Xception, and ShuffleNet. Our experimental results show that the accelerator can run MobileNet 70 times per second in a remote sensing AI application on a \(224\times 224\) pixel image from the ImageNet dataset.
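The cost reduction that the abstract attributes to depthwise convolution can be made concrete with a simple multiply-accumulate (MAC) count. The sketch below is illustrative only and is not taken from the paper: the layer shape is a hypothetical MobileNet-style example, and the counts assume unit stride with an unchanged output resolution.

```python
# Illustrative sketch (not from the paper): compare the MAC count of a
# standard k x k convolution with a depthwise-separable one, the layer
# style used by MobileNet, Xception, and ShuffleNet.

def standard_conv_macs(h, w, c_in, c_out, k):
    """MACs for a standard k x k convolution producing an h x w output."""
    return h * w * c_in * c_out * k * k

def depthwise_separable_macs(h, w, c_in, c_out, k):
    """MACs for a depthwise k x k convolution followed by a 1 x 1 pointwise one."""
    depthwise = h * w * c_in * k * k   # one k x k filter per input channel
    pointwise = h * w * c_in * c_out   # 1 x 1 convolution mixes channels
    return depthwise + pointwise

if __name__ == "__main__":
    # Hypothetical layer shape loosely matching an early MobileNet stage
    # for a 224 x 224 input (values chosen for illustration only).
    h, w, c_in, c_out, k = 112, 112, 32, 64, 3
    std = standard_conv_macs(h, w, c_in, c_out, k)
    dws = depthwise_separable_macs(h, w, c_in, c_out, k)
    print(f"standard:            {std:,} MACs")
    print(f"depthwise separable: {dws:,} MACs ({dws / std:.1%} of standard)")
```

For this shape the separable form needs roughly \(1/c_{out} + 1/k^2 \approx 13\%\) of the standard MACs, which is the kind of arithmetic saving that lightweight depthwise CNNs rely on and that the proposed accelerator targets.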



Author information


Corresponding author

Correspondence to Hakem Beithollahi.


About this article


Cite this article

Farahani, A., Beithollahi, H., Fathi, M. et al. CNNX: A Low Cost, CNN Accelerator for Embedded System in Vision at Edge. Arab J Sci Eng 48, 1537–1545 (2023). https://doi.org/10.1007/s13369-022-06931-1
