Skip to main content

Implementation of the Beamformer Algorithm for the NVIDIA Jetson

Part of the Lecture Notes in Computer Science book series (LNTCS,volume 10049)


Nowadays, the aim of the technology industry is intensively shifting to improve the ratio Gflop/watt of computation. Many processors implement the low power design of ARM architecture like, e.g. the NVIDIA TK1, a chip which also includes a GPU embedded in the same die to improve performance at a low energy consumption. This type of devices are very suitable target machines to be used on applications that require mobility like, e.g. those that manage and reproduce real acoustics environments. One of the most used algorithms in these reproduction environments is the Beamformer Algorithm. We have implemented the variant called Beamformer QR-LCMV, based on the QR decomposition, which is a very computationally demanding operation. We have explored different options differing basically in the high performance computing library used. Also we have built our own version with the aim of approaching the real-time processing goal when working on this type of low power devices.


  • Audio processing
  • Beamformer
  • GPU-CPU Processing
  • Heterogeneous QR Factorization

This work has been supported by projects TEC2015-67387-C4-1-R of the Spanish Ministerio de Economía y Competitividad and PROMETEOII/2014/003 of the Generalitat Valenciana.

This is a preview of subscription content, access via your institution.

Buying options

USD   29.95
Price excludes VAT (USA)
  • DOI: 10.1007/978-3-319-49956-7_16
  • Chapter length: 11 pages
  • Instant PDF download
  • Readable on all devices
  • Own it forever
  • Exclusive offer for individuals only
  • Tax calculation will be finalised during checkout
USD   59.99
Price excludes VAT (USA)
  • ISBN: 978-3-319-49956-7
  • Instant PDF download
  • Readable on all devices
  • Own it forever
  • Exclusive offer for individuals only
  • Tax calculation will be finalised during checkout
Softcover Book
USD   79.99
Price excludes VAT (USA)
Fig. 1.
Fig. 2.
Fig. 3.


  1. ARM. (2016) ARM processors.

  2. Dongarra, J., et al.: “PLASMA users’ guide”, Electrical Engineering, Computer Science Department, Univesity of Tennessee, Knoxville, Tennessee 37996, Technical report (2015).

  3. Benesty, Y.H.J., Chen, J., Dmochowski, J.: On microphone-array beamforming from a mimo acoustic signal processing perspective. IEEE Trans. Audio Speech Lenguage Process. 15, 1053–1065 (2007)

    CrossRef  Google Scholar 

  4. Lorente, J., Piñero, G., Vidal, A., Belloch, J., González, A.: Parallel implementations of beamforming design and filtering for microphone array applications. In: Proceedings of the 19th European Signal Processing Conference (EUSIPCO), Barcelona, Spain, pp. 501–505 (2011)

    Google Scholar 

  5. NVIDIA: NVIDIA CUDA Basic Linear Algebra Subroutines.

  6. NVIDIA. (2015) NVIDIA Jetson TK1.

  7. NVIDIA. (2016) NVIDIA Kepler.

  8. OpenBLAS: An optimized BLAS library.

  9. Tomov, S., Dongarra, J., Baboulin, M.: Towards dense linear algebra for hybrid GPU accelerated manycore systems. Parallel Comput. 36(5–6), 232–240 (2010).

    CrossRef  MATH  Google Scholar 

Download references

Author information

Authors and Affiliations


Corresponding author

Correspondence to Pedro Alonso .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and Permissions

Copyright information

© 2016 Springer International Publishing AG

About this paper

Cite this paper

Alventosa, F.J., Alonso, P., Piñero, G., Vidal, A.M. (2016). Implementation of the Beamformer Algorithm for the NVIDIA Jetson. In: , et al. Algorithms and Architectures for Parallel Processing. ICA3PP 2016. Lecture Notes in Computer Science(), vol 10049. Springer, Cham.

Download citation

  • DOI:

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-49955-0

  • Online ISBN: 978-3-319-49956-7

  • eBook Packages: Computer ScienceComputer Science (R0)