Advertisement

Towards Energy Efficient Parallel Computing on Consumer Electronic Devices

  • Karl Fürlinger
  • Christof Klausecker
  • Dieter Kranzlmüller
Part of the Lecture Notes in Computer Science book series (LNCS, volume 6868)

Abstract

In the last two decades supercomputers have sustained a remarkable growth in performance that even out-performed the predictions of Moore’s law, primarily due to increased levels of parallelism [19]. As industry and academia try to come up with viable approaches for exascale systems, attention turns to energy efficiency as the primary design consideration. At the same time, energy efficiency has always been the main concern in the mobile computing area. Additionally, mobile and consumer electronic devices are becoming ever more powerful as the use cases (e.g., Web 2.0 applications, video encoding, virtual and augmented reality) become more computationally demanding. It is therefore an interesting question to ask if these devices are the possible building blocks of future HPC systems. It was the workstation and server market in the past that provided the CPUs that power supercomputers and it might be the consumer electronic market that provides the underlying technology in the future.

In this paper we try to analyze the current state of energy efficient parallel and distributed computing on mobile and consumer electronic devices. We provide an overview of performance characteristics of some current and announced future devices for scientific computation and we build a small proof-of-concept cluster from Apple’s second generation “Apple TV” devices and evaluate its performance on standard benchmark applications. We discuss the limiting factors, and analyze the industry trajectory that we believe could make consumer electronic-based design a feasible technology basis for future HPC system designs.

Keywords

Augmented Reality Double Precision Cache Size Consumer Electronic Device Advance Risc Machine 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
  2. 2.
  3. 3.
  4. 4.
  5. 5.
    STREAM: Sustainable memory bandwidth in high performance computers, http://www.cs.virginia.edu/stream/
  6. 6.
  7. 7.
    Adiga, N. R., et al.: An overview of the BlueGene/ L supercomputer (2002)Google Scholar
  8. 8.
    Arpaci, R.H., Culler, D.E., Krishnamurthy, A., Steinberg, S.G., Yelick, K.: Empirical evaluation of the CRAY-T3D: a compiler perspective. SIGARCH Comput. Archit. News 23, 320–331 (1995)CrossRefGoogle Scholar
  9. 9.
    Buttari, A., Dongarra, J., Langou, J., Langou, J., Luszczek, P., Kurzak, J.: Mixed precision iterative refinement techniques for the solution of dense linear systems. Int. J. High Perform. Comput. Appl. 21, 457–466 (2007)CrossRefzbMATHGoogle Scholar
  10. 10.
    Buttari, A., Luszczek, P., Kurzak, J., Dongarra, J., Bosilca, G.: SCOP3: A rough guide to scientific computing on the PlayStation 3. version 0.1. Technical Report UT-CS-07-595, Innovative Computing Laboratory, University of Tennessee Knoxville (April 2007)Google Scholar
  11. 11.
  12. 12.
    The Green500 List, http://www.green500.org
  13. 13.
    Kogge, P.M., et al.: Exascale computing study: Technology challenges in achieving exascale systems. DARPA Information Processing Techniques Office (IPTO) Sponsored Study (2008)Google Scholar
  14. 14.
    Neill, R., Shabarshin, A., Carloni, L.P.: A heterogeneous parallel system running OpenMPI on a broadband network of embedded set-top devices. In: Proceedings of the 7th ACM International Conference on Computing Frontiers, CF 2010, pp. 187–196. ACM, New York (2010)Google Scholar
  15. 15.
    NVIDIA. The benefits of multiple CPU cores in mobile devices, whitepaper (2010), http://goo.gl/g3MXo
  16. 16.
  17. 17.
  18. 18.
    Katie, R.-H., Hedge, P.: ARM Cortex-A8 vs. Intel Atom: Architectural and benchmark comparisons (2009)Google Scholar
  19. 19.
    The Top 500 Supercomputer Sites, http://www.top500.org
  20. 20.
    Unmodified Xbox Cluster, http://www.bgfax.com/xbox/home.html

Copyright information

© Springer-Verlag Berlin Heidelberg 2011

Authors and Affiliations

  • Karl Fürlinger
    • 1
  • Christof Klausecker
    • 1
  • Dieter Kranzlmüller
    • 1
  1. 1.Department of Computer Science, MNM-TeamLudwig-Maximilians-Universität (LMU)MünchenGermany

Personalised recommendations