Abstract
Moving from homogeneous to heterogeneous architectures offers performance, size, weight, and power advantages, which are especially important for embedded solutions. However, the development of parallel software for heterogeneous computing systems is a rather difficult task due to the requirements of high performance, easy programmability, and scalability. The paper investigates the efficiency of parallel-pipeline processing of video information in multicore heterogeneous systems on a chip (SoC). A mathematical model of the used structures is described and the criteria for efficiently processing video data are formulated, based on which the target function is built. The typical scheme for parallel-pipeline processing of video data using various streaming video processing accelerators (DSP, GPU, ISP, VDP, VPU, etc.) is presented. A scheme for parallel-pipeline processing of video data on a heterogeneous 1892VM248 SoC is developed. A technique for efficient parallel-pipeline video information processing in heterogeneous computing SoCs, including a software stack consisting of an operating system level, a programming technology level, and an application level, is proposed. A comparative analysis of the most common programming technologies for heterogeneous SoCs, such as OpenCL, OpenMP, MPI, and OpenAMP, is carried out. The results of the analysis show that, depending on the purpose of the end device, two programming paradigms should be used: based on OpenCL technology (for embedded systems) and MPI technology (for intercore and interprocessor interactions). The results of parallel-pipeline processing in the face recognition problem confirms the effectiveness of the chosen solutions.
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1134%2FS1063739722070113/MediaObjects/11180_2022_7350_Fig1_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1134%2FS1063739722070113/MediaObjects/11180_2022_7350_Fig2_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1134%2FS1063739722070113/MediaObjects/11180_2022_7350_Fig3_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1134%2FS1063739722070113/MediaObjects/11180_2022_7350_Fig4_HTML.png)
Similar content being viewed by others
REFERENCES
Belyaev, A.A., Gavrilov, S.V., Kuznetsov, D.A., et al., Evolution of multi-core heterogeneous computing systems in the field of video data processing, in Tr. V Vseros. nauch.-tekhn. konf. Problemy razrabotki perspektivnykh mikro- i nanoelektronnykh sistem – 2014 (Proceedings of the 5th All-Russia Conference on Problems of Developing Promising Micro- and Nanoelectronic Systems MES-2014), Stempkovskii, A.L., Ed., Moscow: IPPM RAN, 2014, part 4, pp. 129–134.
Laude, T., Adhisantoso, Y.G., Voges, J., et al., Comprehensive video codec comparison, APSIPA Trans. Signal Inform. Process., 2019, vol. 8, p. e30. https://doi.org/10.1017/ATSIP.2019.23
ELVIS. https://multicore.ru. Accessed July 19, 2020.
Buffer Sharing and Synchronization. www.kernel.org/ doc/html/v4.16/driver-api/dma-buf.html. Accessed July 19, 2020.
OpenMP. https://www.openmp.org/. Accessed July 19, 2020.
MPI. https://computing.llnl.gov/tutorials/mpi/. Accessed July 19, 2020.
OpenCL. https://www.khronos.org/opencl/. Accessed July 19, 2020.
OpenAMP. https://www.openampproject.org/. Accessed July 19, 2020.
Yanakova, E.S., Macharadze, G.T., Kostulin, N.V., and Tyurin, A.A., Solving large SLE on heterogeneous SOC of ‘multicore’ series, Probl. Razrab. Persp. Mikro- Nanoelektron. Sist., 2020, no. 1, pp. 70–75. https://doi.org/10.31114/2078-7707-2020-1-70-75
FaceScrub Database. http://megaface.cs.washington.edu/ results/facescrub.html. Accessed July 19, 2020.
Intel Xeon Gold 6140 Processor. https://www.intel.in/ content/www/in/en/products/processors/xeon/scalable/ gold-processors/gold-6140.html. Accessed July 19, 2020.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Yanakova, E.S., Macharadze, G.T., Gagarina, L.G. et al. Parallel-Pipeline Processing of Video Information in Multiprocessing Heterogeneous Systems on a Chip. Russ Microelectron 51, 619–626 (2022). https://doi.org/10.1134/S1063739722070113
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1134/S1063739722070113