Skip to main content

Low-Power Processor Array Design Strategy for Solving Computationally Intensive 2D Topographic Problems

  • Chapter
  • First Online:
  • 584 Accesses

Abstract

2D wave type topographic operators are distributed into six classes, based on their implementation methods on different low-power many-core architectures. The following architectures are considered: (1) pipe-line architecture, (2) coarse-grain cellular parallel architecture, (3) fine-grain fully parallel cellular architecture with discrete time processing, (4) fine-grain fully parallel cellular architecture with continuous time processing, and (5) DSP-memory architecture as a reference. Efficient implementation methods of the classes are shown on each architecture. The processor utilization efficiencies, as well as the execution times, and the major constrains are calculated. On the basis of the calculated parameters, an optimal architecture can be selected for a given algorithm.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   129.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD   169.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Notes

  1. 1.

    The minimal value of the pixels clock is equivalent to the product of the frame-rate and the number of pixels (resolution). If the image source is a sensor, the pixel clock is defined by the sensor readout speed. In balanced sensor applications (high-speed applications are usually balanced), the integration time and the readout time are roughly the same. Since there are short blank periods in the sensor readout protocol for synchronization purposes, the pixel clock is slightly higher then the minimal pixel clock in balanced application. However, in low light applications, the sensor integration time takes much longer than the readout time. In these cases, the sensor pixel clock can be orders of magnitude higher than the minimal pixel clock.

References

  • Chua L O and Yang L, “Cellular neural networks: theory and applications”, IEEE Transactions on Circuits and Systems, vol. 35, no. 10, October 1988, pp. 1257–1290.

    Google Scholar 

  • Chua L O, Roska T, Kozek T, Zarándy Á, “CNN universal chips crank up the computing power”, IEEE Circuits and Devices, July 1996, pp. 18–28.

    Google Scholar 

  • Cruz J M, Chua L O, Roska T, “A fast, complex and efficient test implementation of the CNN universal machine”, Proc. of the third IEEE Int. Workshop on Cellular Neural Networks and their Application (CNNA-94), pp. 61-66, Rome Dec. 1994.

    Google Scholar 

  • Dudek P, and Carey S J, “A general-purpose 128 ×128 SIMD processor array with integrated image sensor”, Electronics Letters, vol. 42, no. 12, June 2006, pp. 678–679.

    Google Scholar 

  • Dudek P, “An asynchronous cellular logic network for trigger-wave image processing on fine-grain massively parallel arrays”, IEEE Transactions on Circuits and Systems II: Analog and Digital Signal Processing, vol. 53 no. 5, 2006 pp. 354–358.

    Google Scholar 

  • Espejo S, Carmona R, Domínguez-Castro R, Rodríguez-Vázquez, A, “A VLSI-oriented continuous-time CNN model”, International Journal of Circuit Theory and Applications, vol. 24, May–June 1996, pp. 341–356.

    Google Scholar 

  • Espejo S, Carmona R, Domingúez-Castro R, Rodrigúez-Vázquez A, “CNN universal chip in CMOS technology”, Int. J. of Circuit Theory & Appl., vol. 24, 1996, pp. 93–111.

    Google Scholar 

  • Foldesy P, Zarándy Á, Rekeczky Cs, Roska T, “Configurable 3D integrated focal-plane sensor-processor array architecture”, Int. J. Circuit Theory and Applications (CTA), 2008, pp. 573–588.

    Google Scholar 

  • Harrer H, Nossek J A, Roska T, Chua L O, “A Current-mode DTCNN Universal Chip”, Proc. of IEEE Intl. Symposium on Circuits and Systems, 1994, pp. 135–138.

    Google Scholar 

  • Kahle J A, Day M N, Hofstee H P, Johns C R, Maeurer T R, Shippy D, “Introduction to the Cell multiprocessor”, IBM J. Res. & Dev., vol. 49, no. 4/5, July/September 2005.

    Google Scholar 

  • Keresztes P, Zarándy Á, Roska T, Szolgay P, Bezák T, Hídvégi T, Jónás P, Katona A, “An emulated digital CNN implementation”, Journal of VLSI Signal Processing Special Issue: Spatiotemporal Signal Processing with Analogic CNN Visual Microprocessors, (JVSP Special Issue), Kluwer, 1999 November–December.

    Google Scholar 

  • Liñan-Cembrano G, Rodríguez-Vázquez A, Espejo-Meana S, Domínguez-Castro R, “ACE16k: A 128 ×128 focal plane analog processor with digital I/O”, International Journal of Neural System, vol. 13, no. 6, 2003, pp. 427–434.

    Google Scholar 

  • Lopich, Dudek P, “Implementation of an asynchronous cellular logic network as a co-processor for a general-purpose massively parallel array”, ECCTD 2007, Seville, Spain.

    Google Scholar 

  • Nagy Z, and Szolgay P, “Configurable Multi-Layer CNN-UM Emulator on FPGA”, IEEE Transactions on Circuits and Systems I: Fundamental Theory and Applications, vol. 50, 2003, pp. 774–778.

    Google Scholar 

  • Paasio A, Dawindzuk, Halonen K, Porra V, “Minimum size 0.5 micron CMOS programmable 48 ×48 CNN test chip”, European Conference on Circuit Theory and Design, Budapest, 1997, pp. 154–115.

    Google Scholar 

  • Rekeczky C S, and Chua L O, “Computing with front propagation: active contour and skeleton models in continuous-time CNN”, Journal of VLSI Signal Processing Systems, Vol. 23, No. 2/3, November-December 1999, pp. 373–402.

    Google Scholar 

  • Roska T and Chua L O, “The CNN universal machine: an analogic array computer”, IEEE Transactions on Circuits and Systems - II, vol. 40, March 1993, pp. 163–173.

    Google Scholar 

  • Roska T, Kék L, Nemes L, Zarándy Á, Brendel M, Szolgay P, “CNN Software Library (Templates and Algorithms) Version 7.2”, (DNS-1-1998), Budapest, MTA SZTAKI, 1998, http://cnn-technology.itk.ppke.hu/Library_v2.1b.pdf

  • Zarándy Á, “The art of CNN template design”, Int. J. Circuit Theory and Applications - Special Issue: Theory, Design and Applications of Cellular Neural Networks: Part II: Design and Applications, (CTA Special Issue - II), Vol. 17, No. 1, 1999, pp. 5–24.

    Google Scholar 

  • Zarándy Á, Keresztes P, Roska T, Szolgay P, “CASTLE: An emulated digital architecture; design issues, new results”, Proceedings of 5th IEEE International Conference on Electronics, Circuits and Systems, (ICECS’98), Vol. 1, 1998, pp. 199–202, Lisboa.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Ákos Zarándy .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2010 Springer Science+Business Media, LLC

About this chapter

Cite this chapter

Zarándy, Á., Rekeczky, C. (2010). Low-Power Processor Array Design Strategy for Solving Computationally Intensive 2D Topographic Problems. In: Baatar, C., Porod, W., Roska, T. (eds) Cellular Nanoscale Sensory Wave Computing. Springer, Boston, MA. https://doi.org/10.1007/978-1-4419-1011-0_10

Download citation

  • DOI: https://doi.org/10.1007/978-1-4419-1011-0_10

  • Published:

  • Publisher Name: Springer, Boston, MA

  • Print ISBN: 978-1-4419-1010-3

  • Online ISBN: 978-1-4419-1011-0

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics