Characterizing the Impact of Using Spare-Cores on Application Performance

Sancho, José Carlos; Kerbyson, Darren J.; Lang, Michael

doi:10.1007/978-3-642-15277-1_8

Characterizing the Impact of Using Spare-Cores on Application Performance

José Carlos Sancho¹⁹,
Darren J. Kerbyson²⁰ &
Michael Lang²¹

Conference paper

1239 Accesses
2 Citations

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 6271))

Abstract

Increased parallelism on a single processor is driving improvements in peak-performance at both the node and system levels. However achievable performance, in particular from production scientific applications, is not always directly proportional to the core count. Performance is often limited by constraints in the memory hierarchy and also by a node inter-connectivity. Even on state-of-the-art processors, containing between four and eight cores, many applications cannot take full advantage of the compute-performance of all cores. This trend is expected to increase on future processors as the core count per processor increases. In this work we characterize the use of spare-cores, cores that do not provide any improvements in application performance, on current multi-core processors. By using a pulse-width modulation method, we examine the possible performance profile of using a spare-core and quantify under what situations its use will not impact application performance. We show that, for current AMD and Intel multi-core processors, spare-cores can be used for substantial computational tasks but can impact application performance when using shared caches or when significantly accessing main memory.

Download to read the full chapter text

Chapter PDF

References

Intel: Futuristic Intel Chip Could Reshape How Computers are Built, Consumers Interact with Their PCs and Personal Devices (2009) Press released at, http://www.intel.com/pressroom/archive/releases/2009/20091202comp_sm.html
Barker, K., Davis, K., Hoisie, A., Kerbyson, D., Lang, M., Pakin, S., Sancho, J.: A Performance Evaluation of the Nehalem Quad-core Processor for Scientific Computing. Parallel Processing Letters 18(4), 453–469 (2008)
Article MathSciNet Google Scholar
Sakuma, K., Andry, P.S., Tsang, C.K., Wright, S.L., Dang, B., Patel, C.S., Webb, B.C., Maria, J., Sprogis, E.J., Kang, S.K., Polastre, R.J., Horton, R.R., Knickerbocker, J.U.: 3D Chip-stacking Technology with Through-silicon Vias and Low-volume Lead-free Interconnections. IBM Journal of Research and Development 52(6), 611–622 (2008)
Article Google Scholar
Wells, P.M., Chakraborty, K., Sohi, G.S.: Adapting to Intermittent Faults in Multicore Systems. In: Proc. ACM ASPLOS, Seattle, WA, pp. 255–264 (March 2008)
Google Scholar
Joseph, R.: Exploring Salvage Techniques for Multi-core Architectures. In: Proc. Workshop on High Performance Computing Reliability in Conjunction with HPCA-11, San Francisco, CA (February 2005)
Google Scholar
Zhou, H.: Dual-Core Execution: Building a Highly Scalable Single-Thread Instruction Window. In: International Conference on Parallel Architecture and Compilation Techniques, St. Louis, MO, pp. 231–242 (2005)
Google Scholar
Ganusov, I., Burtscher, M.: Future Execution: A Hardware Prefetching Technique for Chip Multiprocessors. In: International Conference on Parallel Architecture and Compilation Techniques, St. Louis, MO, pp. 350–360 (September 2005)
Google Scholar
Porterfield, A., Fowler, R., Neyer, M.: MAESTRO: Dynamic Runtime Power and Concurrency. In: Workshop on Managed Many-Core Systems Colocated with the ACM International Symposium on High Performance Distributed Computing, Boston, MA (June 2008)
Google Scholar
Chow, J., Garfinkel, T., Chen, P.: Decoupling Dynamic Program Analysis from Execution in Virtual Environment. In: Proc. Usenix Annual Technical Conference, Boston, MA, pp. 1–14 (June 2008)
Google Scholar
Kerbyson, D.J., Alme, H.J., Hoisie, A., Petrini, F., Wasserman, H.J., Gittings, M.: Predictive Performance and Scalability Modeling of a Large-Scale Application. In: Supercomputing Conference, Denver, Colorado, p. 39 (November 2001)
Google Scholar
Koch, K.R., Baker, R.S., Alcouffe, R.E.: Solution of the First-order Form of the 3-D Discrete Ordinates Equation on a Massively Parallel Processor. Transactions of the American Nuclear Society 65, 192–198 (1992)
Google Scholar

Download references

Author information

Authors and Affiliations

Barcelona Supercomputing Center, Barcelona, 08034, Spain
José Carlos Sancho
Pacific Northwest National Laboratory, Richland, WA, 99352, USA
Darren J. Kerbyson
Los Alamos National Laboratory, Los Alamos, NM, 87545, USA
Michael Lang

Authors

José Carlos Sancho
View author publications
You can also search for this author in PubMed Google Scholar
Darren J. Kerbyson
View author publications
You can also search for this author in PubMed Google Scholar
Michael Lang
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

ICAR-CNR, Via P. Castellino, 111, 80131, Napoli,, Italy
Pasqua D’Ambra
ICAR-CNR, Via P. Castellino, 111, 80131, Napoli, Italy
Mario Guarracino
ICAR-CNR, Via P. Bucci 41c, 87036, Rende, Italy
Domenico Talia

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Sancho, J.C., Kerbyson, D.J., Lang, M. (2010). Characterizing the Impact of Using Spare-Cores on Application Performance. In: D’Ambra, P., Guarracino, M., Talia, D. (eds) Euro-Par 2010 - Parallel Processing. Euro-Par 2010. Lecture Notes in Computer Science, vol 6271. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-15277-1_8

Download citation

DOI: https://doi.org/10.1007/978-3-642-15277-1_8
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-15276-4
Online ISBN: 978-3-642-15277-1
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics