Optimizing Excited-State Electronic-Structure Codes for Intel Knights Landing: A Case Study on the BerkeleyGW Software

  • Jack Deslippe
  • Felipe H. da Jornada
  • Derek Vigil-Fowler
  • Taylor Barnes
  • Nathan Wichmann
  • Karthik Raman
  • Ruchira Sasanka
  • Steven G. Louie
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 9945)

Abstract

We profile and optimize calculations performed with the BerkeleyGW [2, 3] code on the Xeon-Phi architecture. BerkeleyGW depends both on hand-tuned critical kernels as well as on BLAS and FFT libraries. We describe the optimization process and performance improvements achieved. We discuss a layered parallelization strategy to take advantage of vector, thread and node-level parallelism. We discuss locality changes (including the consequence of the lack of L3 cache) and effective use of the on-package high-bandwidth memory. We show preliminary results on Knights-Landing including a roofline study of code performance before and after a number of optimizations. We find that the GW method is particularly well-suited for many-core architectures due to the ability to exploit a large amount of parallelism over plane-wave components, band-pairs, and frequencies.

Copyright information

© Springer International Publishing AG 2016

Authors and Affiliations

  • Jack Deslippe
    • 1
  • Felipe H. da Jornada
    • 2
  • Derek Vigil-Fowler
    • 3
  • Taylor Barnes
    • 1
  • Nathan Wichmann
    • 4
  • Karthik Raman
    • 5
  • Ruchira Sasanka
    • 5
  • Steven G. Louie
    • 2
  1. 1.NERSCLawrence Berkeley National LaboratoryBerkeleyUSA
  2. 2.Department of PhysicsUniversity of California at Berkeley, and Materials Sciences Division, Lawrence Berkeley National LaboratoryBerkeleyUSA
  3. 3.National Renewable Energy LaboratoryGoldenUSA
  4. 4.CraySaint PaulUSA
  5. 5.IntelHillsboroUSA

Personalised recommendations