A robust, simple, and efficient convergence workflow for GW calculations

Großmann, Max; Grunert, Malte; Runge, Erich

doi:10.1038/s41524-024-01311-9

A robust, simple, and efficient convergence workflow for GW calculations

Article
Open access
Published: 27 June 2024

Volume 10, article number 135, (2024)
Cite this article

Download PDF

You have full access to this open access article

npj Computational Materials

A robust, simple, and efficient convergence workflow for GW calculations

Download PDF

522 Accesses
Explore all metrics

Abstract

A robust, simple, and efficient convergence workflow for GW calculations in plane-wave-based codes is derived from more than 7000 GW calculations on a diverse dataset of 70 semiconducting and insulating solids divided into 60 bulk and 10 2D materials. The workflow can significantly accelerate material screening projects and high-precision single-system studies. Our method is based on two main results: The convergence of the two interdependent parameters in the numerical implementation of the dynamically screened Coulomb interaction W in a plane-wave basis set is accelerated by a ‘cheap first, expensive later’ coordinate search that maintains the same accuracy as a state-of-the-art convergence algorithm, but converges faster. In addition, we empirically establish the practical independence of the k-point grid and the aforementioned parameterization of W. Incorporating both results into one workflow dramatically speeds up convergence.

Speeding up GW Calculations to Meet the Challenge of Large Scale Quasiparticle Predictions

Article Open access 11 November 2016

Towards fully automated GW band structure calculations: What we can learn from 60.000 self-energy evaluations

Article Open access 29 January 2021

Towards high-throughput many-body perturbation theory: efficient algorithms and automated workflows

Article Open access 18 May 2023

Introduction

The demand for efficient high-throughput ab initio computations for materials screening^1,2,3 is increasing due to, among other things, the availability of more and more structural data for theoretically stable solid compounds through the work of, e.g., Merchant et al.⁴ and the possibility of autonomous laboratories as shown by Szymanski et al.⁵. Furthermore, automated workflows for methods going beyond Kohn-Sham (KS) density functional theory (DFT), used in the aforementioned and other work^{6,7,8,9,10,11}, such as many-body perturbation theory (MBPT), are becoming increasingly more relevant^{12,13,14,15,16} due to the rapidly increasing computational resources and the growing demand for more accurate data. Among these approaches, GW calculations provide a state-of-the-art method for accurately predicting band structures and quasiparticle energy levels of 2D/3D materials^12,14,17 for which DFT (with common approximations) often fails. GW calculations have been performed by us and others for, e.g., complex molecules^18,19,20,21, solar cells²², solar water splitting^23,24, batteries²⁵, piezoelectrics²⁶, and general high-performance electronics. In addition, GW provides a well-established starting point for the Bethe-Salpeter equation (BSE), which is necessary to accurately predict the optical properties of semiconductors and insulators²⁷. Furthermore, quasiparticle energies obtained from GW calculations are typically used in linear-response time-dependent density functional theory (TDDFT)^28,29 calculations with approximations for the exchange-correlation kernel such as the long-range contribution³⁰ or bootstrap kernels^31,32,33. These in turn are necessary ingredients for the development of optical materials tailored for, e.g., photovoltaic applications.

Multiple flavors of GW calculations exist, ranging from simple one-shot G₀W₀³⁴ to various self-consistent GW implementations³⁵ going so far as to quasiparticle self-consistent frameworks³⁶ that include static vertex corrections^37,38 or even further to a self-consistent solution of Hedin’s equations that includes more advanced vertex corrections^39,40. However, all of these suffer from rather poor scaling with system size as measured by the number N of included electrons. In its simplest plane-wave implementation, a G₀W₀ calculation has a computational complexity scaling of ${{{\mathcal{O}}}}({N}^{4})$ (cf. KS-DFT calculations, which scale with ${{{\mathcal{O}}}}({N}^{3})$ or even better) and a scaling of ${{{\mathcal{O}}}}({N}_{{{{\bf{k}}}}}^{2})$ in regards to the number of k-points N_k. Recent developments have led to better-scaling GW algorithms, see e.g. refs. ^{41,42,43,44,45,46,47}. However, these often suffer from large prefactors in their computation time, making them more suitable for very large systems⁴⁷.

The fact that GW calculations require much more computational resources, such as memory and runtime, than DFT calculations for the same system implies that the choice of computational parameters for GW calculations is a much more important decision. Moreover, GW methods have more convergence parameters (as discussed below) than DFT calculations, and—to make things worse—GW methods generally converge rather slowly.

Just recently, the reproducibility of GW calculations in solids⁴⁸ has been investigated, validating the precision of different GW implementations and comparing them between various MBPT codes. Such comparative studies of different codes and the ever-increasing computational power are slowly opening up the possibility of high-throughput GW calculations. To support this progress, the present paper investigates how to converge GW calculations in a robust, simple, and efficient way, using the simplest implementation, a G₀W₀ calculation, as example. The corresponding workflow is presented in detail.

In many codes, quasiparticle (QP) energies ${\epsilon }_{n{{{\bf{k}}}}}^{{{{\rm{QP}}}}}$ are commonly approximated using first-order perturbation theory with respect to KS eigenvalues ${\epsilon }_{n{{{\bf{k}}}}}^{{{{\rm{KS}}}}}$ for a given k-point at band n, resulting in a linearized QP equation:

$${\epsilon }_{n{{{\bf{k}}}}}^{{{{\rm{QP}}}}}={\epsilon }_{n{{{\bf{k}}}}}^{{{{\rm{KS}}}}}+{Z}_{n{{{\bf{k}}}}}\,\left\langle n{{{\bf{k}}}}\right\vert \,\left({{\Sigma }}({\epsilon }_{n{{{\bf{k}}}}}^{{{{\rm{KS}}}}})-{V}_{xc}^{{{{\rm{KS}}}}}\right)\,\left\vert n{{{\bf{k}}}}\right\rangle$$

(1)

where ${Z}_{n{{{\bf{k}}}}}^{-1}=1-\left\langle n{{{\bf{k}}}}\right\vert \,d{{\Sigma }}/{{{\rm{d}}}}\epsilon {| }_{{\epsilon }_{n{{{\bf{k}}}}}^{{{{\rm{KS}}}}}}\,\left\vert n{{{\bf{k}}}}\right\rangle$ stands for the renormalization factor⁴⁹, ${V}_{xc}^{{{{\rm{KS}}}}}$ for the KS exchange correlation potential, and $\left\vert n{{{\bf{k}}}}\right\rangle$ for the respective KS state. The self-energy operator Σ can be split up into an exchange contribution Σ^x and a frequency-dependent correlation part Σ^c, i.e. Σ = Σ^x + Σ^c. In plane-wave codes, these are defined in terms of the unit cell volume Ω, the bare Coulomb interaction v(q), the generalized dipole matrix element ${F}_{nm{{{\bf{k}}}}}({{{\bf{q}}}})=\left\langle n{{{\bf{k}}}}\right\vert {{{{\rm{e}}}}}^{{{{\rm{i}}}}{{{\bf{q}}}}\cdot {{{\bf{r}}}}}\left\vert m({{{\bf{k}}}}-{{{\bf{q}}}})\right\rangle$, the Fermi occupation function f_mk, and the number of empty states N_b to be included as:

$${\Sigma}^{\mathrm{x}}_{n{\mathbf{k}}} = - \sum\limits_m^{{\mathrm{occ}}} \int_{\mathrm{BZ}} \frac{d{\mathbf{q}}}{(2\pi)^3} \sum\limits_{\mathbf{G}}^{G_{\mathrm{cut}}} \,v({\mathbf{q}}+{\mathbf{G}}) \,\vert F_{nm{\mathbf{k}}}({\mathbf{q}}+{\mathbf{G}}) \vert^2 \, f_{m({\mathbf{k}}-{\mathbf{q}})}$$

(2)

$$\begin{array}{ll}{{{\Sigma }}}_{n{{{\bf{k}}}}}^{{{{\rm{c}}}}}\,=\,{{{\rm{i}}}}\sum\limits_{m}^{{N}_{{{{\rm{b}}}}}}\displaystyle{\int}_{{{{\rm{BZ}}}}}\frac{d{{{\bf{q}}}}}{{(2\pi )}^{3}}\sum\limits_{{{{\bf{G}}}}{{{\bf{G}}}}{^\prime} }^{{G}_{{{{\rm{cut}}}}}}{F}_{nm{{{\bf{k}}}}}({{{\bf{q}}}}+{{{\bf{G}}}}){F}_{nm{{{\bf{k}}}}}^{* }({{{\bf{q}}}}+{{{\bf{G}}}}{^\prime} )\\\qquad\quad \times\, \displaystyle\int\frac{d\omega {^\prime} }{2\pi }{G}_{m({{{\bf{k}}}}-{{{\bf{q}}}})}^{0}(\omega -\omega {^\prime} ){W}_{{{{\bf{G}}}}{{{\bf{G}}}}{^\prime} }({{{\bf{q}}}},\omega {^\prime} ).\end{array}$$

(3)

Here, G⁰ is the non-interacting Green’s function

$${G}_{m{{{\bf{k}}}}}^{0}(\omega )=\frac{{f}_{m{{{\bf{k}}}}}}{\omega -{\epsilon }_{m{{{\bf{k}}}}}-{{{\rm{i}}}}\eta }+\frac{1-{f}_{m{{{\bf{k}}}}}}{\omega -{\epsilon }_{m{{{\bf{k}}}}}+{{{\rm{i}}}}\eta }$$

(4)

involving the KS bandstructure ϵ_mk and an infinitesimal complex shift iη (η > 0). The sums over reciprocal lattice vectors G are usually defined through an energy cutoff G_cut, which restricts the summations in Eqs. (2) and (3) to G with kinetic energy ℏ²∣G∣/(2m₀) ≤ G_cut.

It is known that the sum over states in Eq. (3) converges extremely slowly with respect to the number of included empty states N_b. This can be mended through the technique introduced by Bruneval and Gonze⁵⁰ and is by and large not an obstacle when converging GW calculations.

The screened Coulomb interaction is usually expressed in terms of the dielectric matrix ε and polarizability χ through ${W}_{{{{\bf{G}}}}{{{\bf{G}}}}{\prime} }({{{\bf{q}}}},\omega )=v({{{\bf{q}}}}+{{{\bf{G}}}})\ {\varepsilon }_{{{{\bf{G}}}}{{{\bf{G}}}}{\prime} }^{-1}({{{\bf{q}}}},\omega )$ and ${\varepsilon }_{{{{\bf{G}}}}{{{\bf{G}}}}{\prime} }^{-1}({{{\bf{q}}}},\omega )={\delta }_{{{{\bf{G}}}}{{{\bf{G}}}}{\prime} }+v({{{\bf{q}}}}+{{{\bf{G}}}}){\chi }_{{{{\bf{G}}}}{{{\bf{G}}}}{\prime} }({{{\bf{q}}}},\omega )$. The polarizability χ is then evaluated within the random phase approximation (RPA) as solution of a Dyson equation²⁷

$${\chi }_{{{{\bf{G}}}}{{{\bf{G}}}}{\prime} }({{{\bf{q}}}},\omega )=\sum\limits_{{{{\bf{G}}}}{''} }^{{G}_{{{{\rm{cut}}}}}}{[{\delta }_{{{{\bf{G}}}}{{{\bf{G}}}}{''} }-v({{{\bf{q}}}}+{{{\bf{G}}}}{''} ){\chi }_{{{{\bf{G}}}}{{{\bf{G}}}}{''} }^{0}({{{\bf{q}}}},\omega )]}^{-1}{\chi }_{{{{\bf{G}}}}{''} {{{\bf{G}}}}{\prime} }^{0}({{{\bf{q}}}},\omega )$$

(5)

involving the independent-(quasi-)particle approximation to the polarizability:

$$\begin{array}{ll}{\chi }_{{{{\bf{G}}}}{{{\bf{G}}}}{\prime} }^{0}({{{\bf{q}}}},\omega )\,=\,\displaystyle\frac{2}{{{\Omega }}}\sum\limits_{{{{\bf{k}}}}}\sum\limits_{nm}^{{N}_{{{{\rm{b}}}}}}{f}_{n,{{{\bf{k}}}}-{{{\bf{q}}}}}\,(1-{f}_{m{{{\bf{k}}}}})\,{F}_{nm{{{\bf{k}}}}}^{* }({{{\bf{q}}}}+{{{\bf{G}}}}){F}_{mn{{{\bf{k}}}}}({{{\bf{q}}}}+{{{\bf{G}}}}{\prime} )\\ \qquad\qquad\qquad\times\, \left[\,\sum\limits_{\beta =\pm 1}\,\frac{\beta }{\omega +\beta ({\epsilon }_{n,{{{\bf{k}}}}-{{{\bf{q}}}}}-{\epsilon }_{m{{{\bf{k}}}}}+{{{\rm{i}}}}\eta )}\,\right].\end{array}$$

(6)

The evaluation of Eqs. (3), (5) and (6) is usually the most time-consuming part of a GW calculation⁵¹, since they involve multiple large sums, matrix multiplications and inversions as well as a frequency integration⁵². A significant reduction in computational effort is often achieved by replacing the frequency integration in Eq. (3) with the plasmon-pole model (PPM)⁵³. We note that the PPM is known to be problematic for at least some materials⁵⁴. A further development of this approach is the multipole model, which however is not yet widely adopted^55,56.

As mentioned above, high-throughput calculations using the described G₀W₀ framework or any other GW variant are still challenging even with modern resources. The main reason for this is that the sums over the bands and the G-vectors in Eqs. (2), (5), and (6) converge slowly when the parameters N_b and G_cut are increased. In addition, the number of k-points also needs to be converged. This is further complicated by the interdependence of both N_b and G_cut, which requires a simultaneous convergence of both parameters^57,58. Rewriting this as a standard optimization problem with the gap convergence as the target function and an estimation of the computational time as the penalty is not as straightforward and recommendable as it seems because standard derivative-based optimization methods tend to perform poorly on integer variables, such as N_b and the number of G-vectors.

It is heuristically known that the number of k-points and the parameters (N_b, G_cut) used to compute the dynamical screening W are somewhat decoupled, i.e. their convergence can be studied more or less independently. This was investigated to some extent by van Setten et al.¹³ by converging N_b and G_cut via fitting functions with a predefined asymptotic behavior to data for the band gap obtained through G₀W₀ calculations on a low-density Γ-centered 2 × 2 × 2 k-point grid for 80 semiconducting or insulating solids. They then compared the derivative of the band gap with respect to N_b and G_cut calculated through finite differences on the low-density grid (LDG) with those obtained on a converged high-density grid (HDG). The results show that for about 90% of materials, the derivative of the gap energy with respect to the convergence parameters on the HDG is lower than on the LDG¹³. Their results suggest that it is in principle efficient to converge (N_b, G_cut) on a LDG, but a more thorough investigation is warranted to study this behavior more closely, for example by investigating this relation on more extreme, i.e. a Γ-only calculation, or denser intermediate k-point grids than van Setten et al.¹³ considered.

The main question investigated in the present paper, namely ’How to best converge GW calculations for high-throughput applications?’ can be reduced to two sub-problems using the knowledge described above: (1) find a robust, simple, and effective way to converge the interdependent parameters (N_b, G_cut). (2) Check to what extent the practical independence of the choice of the k-point grid when converging (N_b, G_cut) and the inverse, i.e. the choice of (N_b, G_cut) when investigating the k-point grid convergence can be used to create an efficient convergence scheme for all three parameters.

Recently, automated MBPT workflows including the GW method have been proposed and tested for the G₀W₀ case^13,16, focusing on the convergence of the parameters N_b and G_cut, i.e. subproblem (1). Current state-of-the-art (SOTA) methods^13,16 rely on the extrapolation of functions with a predefined asymptotic behavior fitted to the band gap calculated at the Γ-point ${{{{\rm{E}}}}}_{{{{\rm{gap}}}}}^{{{{\bf{\Gamma }}}}-{{{\bf{\Gamma }}}}}$ using multiple G₀W₀ calculations at different (N_b, G_cut)-points in the parameter space to find suitable parameters N_b and G_cut to be used in the actual production runs. One problem with these methods lies in the fact that the exponents α and β in the trial function

$$F({N}_{{{{\rm{b}}}}},{G}_{{{{\rm{cut}}}}})=\left(\frac{A}{{N}_{{{{\rm{b}}}}}^{\alpha }}+B\right)\left(\frac{C}{{G}_{{{{\rm{cut}}}}}^{\beta }}+D\right)$$

(7)

commonly used for the asymptotic fit to the 2D convergence surface are not known and also seem to be material-dependent. Furthermore, in most applications, e.g.^13,16, the exponents are not fitted, but instead are limited to very few integers, e.g., α, β ∈ {1, 2}¹⁶, in order to reduce the number of initial GW calculations required for the fit and to improve the stability of the fit. We observed that this can then lead to errors in the extrapolation and suboptimal parameter choices for (N_b, G_cut). In addition, the initial GW calculations for the fitting procedure are mostly done in a grid-like fashion^13,16, which automatically implies many, as we shall see, unnecessary GW calculations. To obtain accurate fit results, a grid spanning points with small and large parameters, i.e. inexpensive and expensive GW calculations, is commonly used. However, the expensive grid points can represent over-converged points in the parameter space, while the computationally optimal point may be at a lower (N_b, G_cut)-point, thereby increasing the computation time unnecessarily.

To overcome the described difficulty of the convergence of the two integer-valued parameters N_b and G_cut, we propose and benchmark a simpler, more robust and time-saving Coordinate Search (CS) algorithm to converge (N_b, G_cut). It follows the heuristic ‘cheap first, expensive later’ and reduces the total computational effort needed.

We note that the proposed CS algorithm is similar to most straightforward convergence routines, which many practitioners have long since settled on for pragmatic reasons. This makes the systematic comparison with more sophisticated convergence workflows^13,16 particularly relevant. Furthermore, the CS algorithm is easy to implement manually or automatically, without having to rely on an estimated extrapolation of the convergence surface.

Our coordinate search algorithm can be summarized in four steps, as described in Algorithm 1.

Algorithm 1

Coordinate Search

1: Calculate a reference ${{{{\rm{E}}}}}_{{{{\rm{gap}}}}}^{{{{\bf{\Gamma }}}}-{{{\bf{\Gamma }}}}}({N}_{{{{\rm{b}}}}},{G}_{{{{\rm{cut}}}}})$ at a starting point (N_b, G_cut).

2: Take steps of length ΔN_b until the band gap converges for an a-priori fixed δ, i.e.

$$| {{{{\rm{E}}}}}_{{{{\rm{gap}}}}}^{{{{\bf{\Gamma }}}}-{{{\bf{\Gamma }}}}}({N}_{{{{\rm{b}}}}}+(i+1){{\Delta }}{N}_{{{{\rm{b}}}}},{G}_{{{{\rm{cut}}}}})-{{{{\rm{E}}}}}_{{{{\rm{gap}}}}}^{{{{\bf{\Gamma }}}}-{{{\bf{\Gamma }}}}}({N}_{{{{\rm{b}}}}}+i{{\Delta }}{N}_{{{{\rm{b}}}}},{G}_{{{{\rm{cut}}}}})| \le \delta$$

with i ∈ {0, 1, 2, 3, . . . }.

3: Take steps of length ΔG_cut until the band gap converges, i.e.

$$\begin{array}{l}|{{{{\rm{E}}}}}_{{{{\rm{gap}}}}}^{{{{\bf{\Gamma }}}}-{{{\bf{\Gamma }}}}}({N}_{{{{\rm{b}}}}}+(i+1){{\Delta }}{N}_{{{{\rm{b}}}}},{G}_{{{{\rm{cut}}}}}+(j+1){{\Delta }}{G}_{{{{\rm{cut}}}}})\\\;\; -{{{{\rm{E}}}}}_{{{{\rm{gap}}}}}^{{{{\bf{\Gamma }}}}-{{{\bf{\Gamma }}}}}({N}_{{{{\rm{b}}}}}+(i+1){{\Delta }}{N}_{{{{\rm{b}}}}},{G}_{{{{\rm{cut}}}}}+j{{\Delta }}{G}_{{{{\rm{cut}}}}})| \le \delta \end{array}$$

with fixed i obtained from step 2 and j ∈ {0, 1, 2, 3 . . . }.

4: Check if

$$| {{{{\rm{E}}}}}_{{{{\rm{gap}}}}}^{{{{\bf{\Gamma }}}}-{{{\bf{\Gamma }}}}}({N}_{{{{\rm{b}}}}}+(i+1){{\Delta }}{N}_{{{{\rm{b}}}}},{G}_{{{{\rm{cut}}}}}+(j+1){{\Delta }}{G}_{{{{\rm{cut}}}}})-{{{{\rm{E}}}}}_{{{{\rm{gap}}}}}^{{{{\bf{\Gamma }}}}-{{{\bf{\Gamma }}}}}({N}_{{{{\rm{b}}}}},{G}_{{{{\rm{cut}}}}})| \le \delta$$

holds, i.e. the convergence surface is sufficiently flat along the diagonal direction to minimize the parameter interdependence. If this is not the case, go back to step 2 and use the already calculated ${{{{\rm{E}}}}}_{{{{\rm{gap}}}}}^{{{{\bf{\Gamma }}}}-{{{\bf{\Gamma }}}}}({N}_{{{{\rm{b}}}}}+(i+1){{\Delta }}{N}_{{{{\rm{b}}}}},{G}_{{{{\rm{cut}}}}}+(j+1){{\Delta }}{G}_{{{{\rm{cut}}}}})$ as a reference/starting point.

In the formulation chosen for illustration and presented in Algorithm 1, the direct band gap ${{{{\rm{E}}}}}_{{{{\rm{gap}}}}}^{{{{\bf{\Gamma }}}}-{{{\bf{\Gamma }}}}}({N}_{{{{\rm{b}}}}},{G}_{{{{\rm{cut}}}}})={{{{\rm{E}}}}}_{{{{\rm{CBM}}}}}^{{{{\bf{\Gamma }}}}}({N}_{{{{\rm{b}}}}},{G}_{{{{\rm{cut}}}}})-{{{{\rm{E}}}}}_{{{{\rm{VBM}}}}}^{{{{\bf{\Gamma }}}}}({N}_{{{{\rm{b}}}}},{G}_{{{{\rm{cut}}}}})$ is defined as the difference between the conduction band minimum (CBM) and the valence band maximum (VBM) at the Γ point. Instead of converging the direct band gap, which is the quantity of primary interest for predicting optical properties of semiconductors and insulators, one could in principle also converge the absolute energies of the band edges simultaneously, i.e. using the criterion $| {{\Delta }}{{{{\rm{E}}}}}_{{{{\rm{VBM}}}}}^{{{{\bf{\Gamma }}}}}({N}_{{{{\rm{b}}}}},{G}_{{{{\rm{cut}}}}})| < \delta$ and $| {{\Delta }}{{{{\rm{E}}}}}_{{{{\rm{CBM}}}}}^{{{{\bf{\Gamma }}}}}({N}_{{{{\rm{b}}}}},{G}_{{{{\rm{cut}}}}})| < \delta$, where ΔE represents the difference conditions formulated in steps 2-4 of Algorithm 1. This can be relevant, e.g., for catalytic applications²⁴ and is furthermore a sound way to address the convergence of metals⁴⁸. In that case, it is important to emphasize that both conditions must be satisfied simultaneously, since convergence can occur at very different parameter values. A notorious example of such a system is ZnO, where the very different nature of the VBM being derived from the localized O 2p state and the CBM being derived from the delocalized Zn 4s state leads to a drastically different convergence behavior of the two⁵⁷.

As a test for our CS workflow algorithm and for comparison with a SOTA workflow, we present results for 60 bulk solids and 10 2D materials. A complete material list can be found in the Supplementary Information, i.e. Supplementary Note 1. To investigate the subproblems (1) and (2), we analyze the convergence behavior of the SOTA and CS algorithms on five Γ-centered k-point grids of increasing density, starting with a Γ-only calculation. As SOTA algorithm, we implemented the method introduced by Bonacci et al.¹⁶ in our in-house workflow package and tested it by reproducing their results using the provided DFT input files, see Supplementary Notes 2 and 3. On each k-point grid, convergence in (N_b, G_cut) is obtained using both algorithms with a convergence threshold δ = 25 meV for the bulk and δ = 50 meV for the 2D materials. The convergence criterion for the SOTA algorithm involving δ can be found in Ref. ¹⁶, where it is written as Δ^Γ. The chosen convergence thresholds are a good compromise between speed and accuracy and are appropriate to mimic a high-throughput environment. The starting grid and step size for the SOTA algorithm are set to the values used in Ref. ¹⁶. As the starting point for our CS algorithm, we use the smallest grid point from the SOTA algorithm, i.e. (N_b = 200, G_cut = 4 Ry). Preliminary analysis shows that a higher starting point does not significantly affect the band gaps obtained by the CS algorithm, see Supplementary Note 4. The step sizes ΔN_b = 100 and ΔG_cut = 4 Ry are based on experience. An optimization of these hyperparameters can be considered in future. In order to assess how well converged the CS and SOTA results are, an expensive, high-quality reference G₀W₀ calculation with N_b = 1200 and G_cut = 46 Ry is carried out for all k-point grids of each material.

Results

To explain the data analysis in a simple but well-defined manner, we first define the quantities ${{{{\mathcal{G}}}}}_{{{{\mathcal{M}}}},{{{\bf{k}}}}}^{A}$ and ${{{{\mathcal{P}}}}}_{{{{\mathcal{M}}}},{{{\bf{k}}}}}^{A}$ which represent the calculated final band gap ${{{{\rm{E}}}}}_{{{{\rm{gap}}}}}^{{{{\bf{\Gamma }}}}-{{{\bf{\Gamma }}}}}({N}_{{{{\rm{b}}}}},{G}_{{{{\rm{cut}}}}})$ and final parameter vector (N_b, G_cut) for each material ${{{\mathcal{M}}}}$ using algorithm A ∈ {S for ’starting point’, CS, SOTA, R for ’reference’} on k-point grid k, respectively. The collected data for all k-point grids and all materials is visualized in Supplementary Notes 7 and 8.

In the following, we provide a detailed statistical analysis of the accuracy and performance of both convergence algorithms and an investigation of the parameter independence of the k-point grid and two interdependent parameters (N_b, G_cut) in the numerical implementation of the dynamically screened Coulomb interaction W using the data obtained for the 60 bulk materials. We will then briefly discuss how the results of the 2D materials compare to their bulk counterparts.

Convergence benchmark

To address subproblem (1), as defined above, we compare both algorithms with respect to how close the converged gaps are to those of the reference calculations and how fast each algorithm obtains its results. The accuracy is evaluated by calculating the absolute deviation of the band gap at the convergence point to the reference value for each k-point grid of every material ${{{\mathcal{M}}}}$, i.e. $| {{{\Delta }}}_{{{{\rm{R}}}}}{{{{\rm{E}}}}}_{{{{\rm{gap}}}}}^{{{{\bf{\Gamma }}}}-{{{\bf{\Gamma }}}}}| =| {{{{\mathcal{G}}}}}_{{{{\mathcal{M}}}},{{{\bf{k}}}}}^{X}-{{{{\mathcal{G}}}}}_{{{{\mathcal{M}}}},{{{\bf{k}}}}}^{{{{\rm{R}}}}}|$ where X ∈ {CS, SOTA}, while the computation speed is analyzed through the number of GW calculations N_GW needed to find ${{{{\mathcal{P}}}}}_{{{{\mathcal{M}}}},{{{\bf{k}}}}}^{X}$.

Figure 1 a,b show a comparison of the accuracy of the two algorithms. The mean accuracy for both algorithms, considering all materials and k-point grids, is approximately 25 meV, in line with the set convergence threshold δ = 25 meV. The SOTA algorithm demonstrates slightly better performance compared to the CS algorithm in terms of the median (16 meV vs. 20 meV). A few outliers with deviations larger than 100 meV exist for both algorithms. Interestingly, these are not always the same for both cases. In some cases, outliers are calculations on an LDG, or even a Γ-only calculation, as is the case for BaF₂ highlighted in Fig. 1a and ZnO in Fig. 1b. This may indicate that the gap is more sensitive to the choice of (N_b, G_cut) on low-density k-point grids. Materials like KF, AgI and As₂Os cause problems for both algorithms, independent of the k-point grid. It is worth noting that KF does not appear in Fig. 1b because the SOTA algorithm was only able to converge for two of the five k-grids and failed in the other cases. These outliers raise interesting questions and provide possible avenues for improvement in the algorithms. Some outliers for the SOTA algorithm require an inordinate amount of calculations before reaching a (N_b, G_cut)-grid with an adequate fit. On the other hand, some outliers for the CS algorithm terminate unusually early. If such a case is detected in real-world applications, a calculation with increased parameters could be automatically performed to ensure convergence. Finally, the source of such outliers may not be related specifically to the convergence algorithms, but to interesting physical properties arising from special aspects of their band structures.

**Fig. 1: Benchmark results visualized by histograms.**

Figure 1 c,d provides a comprehensive comparison of the performance of the algorithms. One striking difference is that the CS algorithm reaches convergence with a maximum of 12 GW computations, whereas the SOTA algorithm can require up to 70. Specifically, both the mean and median number of GW calculations required for the CS algorithm are around 7, while the mean and median for the SOTA algorithm are around 14 and 8, respectively. We note in passing that for the aforementioned ‘problematic’ compound As₂Os the CS algorithm converged extremely quickly, using only three GW calculations, see Fig. 1c. Possibly, the convergence surface of the W parameters is already flat enough at low parameter values for the convergence algorithms to stop, but it does not become flatter with increasing parameters and the second derivative of the convergence surface with respect to the parameters N_b and G_cut remains small. This explains the large discrepancy with the reference calculation and illustrates the somewhat pathological behavior of the sums in Eqs. (2) and (3) with respect to N_b and G_cut in materials such as As₂Os.

We observe that the CS algorithm has a broader distribution of $| {{{\Delta }}}_{{{{\rm{R}}}}}{{{{\rm{E}}}}}_{{{{\rm{gap}}}}}^{{{{\bf{\Gamma }}}}-{{{\bf{\Gamma }}}}}|$ around δ (black line) in Fig. 1a compared to the SOTA algorithm in Fig. 1b. The opposite is true for the number of GW calculations needed to find the convergence point, as shown in Fig. 1c,d. This trade-off between accuracy and computation time can be understood in the sense that there is “no free lunch”. However, we want to emphasize that reducing the number of GW calculations by about 50% on average by far outweighs the minor loss in accuracy for most applications.

Obviously, besides the number of GW calculations, one also has to take the runtime of each calculation into account, as calculations with increased parameters can often take significantly longer to run. For example, a Γ-only GW calculation for AgI with N_b = 200 and G_cut = 4 Ry takes around 20 s, whereas the Γ-only reference calculation with N_b = 1200 and G_cut = 46 Ry took 30 min, both using 8 cores of an Intel® Xeon® Processor E5-2650. For this reason, we analyzed the ratio of the computation times T for the CS and SOTA algorithms, i.e. the convergence speedup achieved by the CS algorithm for each material and k-point grid individually, shown in Fig. 2. Values greater than one indicate that the CS algorithm is faster, while values less than one indicate that the SOTA algorithm is faster. The average speedup achieved by the CS algorithm is 4.5, while the median speedup is about 2.4. Note that the large discrepancy between mean and median is caused by cases where the CS algorithm is more than ten times faster than the SOTA algorithm (not shown in Fig. 2 due to axis truncation for better visibility). In these extreme cases, the CS algorithm requires up to two orders of magnitude less computational time and thus resources, while maintaining a similar level of accuracy. The SOTA algorithm performs better only in very few cases, one of which is AgI, where the SOTA algorithm required fewer GW calculations to reach a convergence point than the CS algorithm. However, for AgI, neither algorithm found adequate convergence parameters due to the aforementioned pathologies. This highlights the challenges that both simple and more complex convergence algorithms face when dealing with certain materials. At this point, we recognize that computation times may not be directly comparable, as it is possible that some computations may be performed on slower nodes than others due to varying loads, but overall this effect should average out.

**Fig. 2: CS Speedup visualized through a histogram.**

To show how the two algorithms work in practice and to explain the considerable difference in computation time, we have visualized the convergence paths of each algorithm for two materials in Fig. 3, inspired by the presentation shown in Ref. ¹⁶. In the case of Si (Fig. 3a), both algorithms require a similar number of GW calculations to converge. While the SOTA algorithm finds a cheaper convergence point with N_b = 200 and G_cut = 12 Ry compared to the CS algorithm with N_b = 400 and G_cut = 16 Ry, the CS algorithm used 35% less computation time to reach convergence. This can be easily explained by the required grid calculation for the SOTA fit of the convergence surface, as points with higher parameters are needed for a good fit, which in turn takes significantly longer to run. For LiF, shown in Fig. 3b, the reason for the performance difference is obvious: SOTA had to compute multiple grids to get an adequate fit of the convergence surface, while the CS algorithm simply “just walks” to the convergence point.

**Fig. 3: Convergence path visualization.**

Independence of the k-point grid and W

This section deals with subproblem (2), i.e. the practical independence of the choice of the k-point grid from the choice of (N_b, G_cut) and vice versa.

First, we investigate how the convergence properties of the interdependent parameters (N_b, G_cut) in W depend on the k-point grid. For this, we calculate the parameter change ${C}_{{{{\mathcal{P}}}}}^{{{{\bf{\Gamma }}}}}\,=\,({{{{\mathcal{P}}}}}_{{{{\mathcal{M}}}},{{{\bf{\Gamma }}}}}^{X}\,-\,{{{{\mathcal{P}}}}}_{{{{\mathcal{M}}}},{{{{\bf{k}}}}}_{f}}^{X})/{{\Delta }}{{{\mathcal{P}}}}$ (${{{\mathcal{P}}}}\in \{{N}_{{{{\rm{b}}}}},{G}_{{{{\rm{cut}}}}}\}$) for all materials ${{{\mathcal{M}}}}$ and X ∈ {CS, SOTA}. This quantity compares converged parameters for the coarsest possible grid, i.e. a Γ-only calculation, and for the finest considered k-point grid k_f with the highest density. The algorithm-specific parameter step sizes ${{\Delta }}{{{\mathcal{P}}}}$ were already defined above. ${C}_{{{{\mathcal{P}}}}}^{{{{\bf{\Gamma }}}}}\,$ values of order 1, i.e. { − 1, 0, 1}, imply that the converged parameters ${{{\mathcal{P}}}}$ are more or less independent of the k-point grid. Figure 4 visualizes ${C}_{{{{\mathcal{P}}}}}^{{{{\bf{\Gamma }}}}}$ for both algorithms and both convergence parameters in W. In all cases, the mean and median are small positive values or very close to zero. Evidently, ${{{\mathcal{P}}}}$ and k are in general more or less independent, but a few outliers exist outside of { − 1, 0, 1}. It is worthwhile to have a more detailed look from the point of view of convergence algorithms for high-throughput calculations. The sign of ${C}_{{{{\mathcal{P}}}}}^{{{{\bf{\Gamma }}}}}$ distinguishes between two different cases: If ${C}_{{{{\mathcal{P}}}}}^{{{{\bf{\Gamma }}}}}\ge 0$, the convergence for the Γ-only grid is as flat or flatter than for the HDG. If ${C}_{{{{\mathcal{P}}}}}^{{{{\bf{\Gamma }}}}} < 0$ the opposite is true, meaning that the convergence algorithm would underconverge W on the Γ-only grid compared to the HDG. So in general, positive ${C}_{{{{\mathcal{P}}}}}^{{{{\bf{\Gamma }}}}}$ values indicate “overconvergence”, i.e. “being on the safe side” when using a LDG, in this case a Γ-only calculation. Therefore, we calculate the percentage of materials where ${C}_{{{{\mathcal{P}}}}}^{{{{\bf{\Gamma }}}}}\ge 0$ for both N_b and G_cut, and get a value of 92% (85%) for the CS (SOTA) algorithm. We also compute the ${C}_{{{{\mathcal{P}}}}}^{{{{{\bf{k}}}}}_{2}}$, where k₂ represents a Γ-centered 2 × 2 × 2 k-point grid. Using the same analysis as in the Γ-only case, we observe that the convergence surface is as flat or flatter than on the HDG for 92% (89%) of the computed materials for the CS (SOTA) algorithm. This result is in agreement with a previous study by van Setten et al.¹³. The corresponding figure for ${C}_{{{{\mathcal{P}}}}}^{{{{{\bf{k}}}}}_{2}}$ and an analysis of ${C}_{{{{\mathcal{P}}}}}^{{{{{\bf{k}}}}}_{3}}$ and ${C}_{{{{\mathcal{P}}}}}^{{{{{\bf{k}}}}}_{4}}$ are shown in Supplementary Note 5.

**Fig. 4: Histograms showing that the convergence of the parameters of W is practically independent of the k-point grid.**

Now that is has been established that N_b and G_cut as converged on a LDG are in all but a few cases well-converged parameters on a HDG, we check whether a similar conclusion can be reached for the necessary density of the k-point grid for low and high values of N_b and G_cut.

To analyze the convergence surface with respect to the k-point grid, we check for all materials ${{{\mathcal{M}}}}$ for which index i_Y the band gap converges with respect to the k-point grid, i.e. $| {{{{\mathcal{G}}}}}_{{{{\mathcal{M}}}},{{{{\bf{k}}}}}_{i}}^{Y}-{{{{\mathcal{G}}}}}_{{{{\mathcal{M}}}},{{{{\bf{k}}}}}_{i-1}}^{Y}| \le \delta$. Here Y represents the starting point calculation S (N_b = 200, G_cut = 4 Ry) or the reference calculation R (N_b = 1200, G_cut = 46 Ry). Then, we evaluated ${{{\mathcal{K}}}}={i}_{{{{\rm{S}}}}}-{i}_{{{{\rm{R}}}}}$ for all materials ${{{\mathcal{M}}}}$ where the band gap converged with respect to the k-point grid within the five increasingly dense k-point grids used. Similar to ${C}_{{{{\mathcal{P}}}}}^{{{{\bf{\Gamma }}}}}$, if ${{{\mathcal{K}}}}\ge 0$ than the k-point grid convergence surface is as flat or flatter for small parameters in W as it is for larger parameters in W. On the other hand, if ${{{\mathcal{K}}}} < 0$ holds, the k-point grid would be underconverged at small parameters in W. In total, only six materials do not converge on the same k-point grid, and only two of them have ${{{\mathcal{K}}}}=-1$. Thus, only 4% of the time the k-point grid would be underconverged if a convergence were performed with small parameters (N_b, G_cut) in dynamically screened Coulomb interaction W.

2D materials

For the ten 2D materials studied, we repeated all the analyses shown above. All corresponding figures can be found in Supplementary Note 8.

Both the CS and SOTA convergence algorithms achieve a similar average (median) distance to reference band gaps of 26 meV (15 meV) and 28 meV (25 meV), respectively, while requiring on average (median) 6 (6) and 7 (7) GW calculations, respectively, to converge the band gap. Looking at these results, both algorithms seem surprisingly equal in performance, but the CS algorithm is on average (median) more than 3 (2) times faster than the SOTA algorithm. The reason for this is simply that the initial computation of a grid in the (N_b, G_cut) parameter space requires GW calculation with high parameters. Therefore, the average number of GW calculations required to converge the band gap can be similar, while the associated cost of each GW calculation can be drastically different.

To analyze how the band gap convergence with respect to (N_b, G_cut) depends on the choice of k-point grid, we again evaluated ${C}_{{{{\mathcal{P}}}}}^{{{{\bf{\Gamma }}}}}$ as defined above. For 9 (7) of the 10 2D materials, ${C}_{{{{\mathcal{P}}}}}^{{{{\bf{\Gamma }}}}}\ge 0$ holds for both N_b and G_cut for the CS (SOTA) algorithm. To analyze how the band gap convergence with respect to the k-point grid depends on the choice of (N_b, G_cut), we evaluated ${{{\mathcal{K}}}}$ as defined above. Leaving out 2D-hBN, since here no k-point convergence was achieved on the last k-point grid, we find that for 7 of the 9 2D materials ${{{\mathcal{K}}}}=0$ holds. The other materials (CdI₂ and MoSe₂) had ${{{\mathcal{K}}}} < 0$, i.e. the k-point grid would be underconverged when using small parameters in W. These results indicate that, similar to the bulk materials, the convergence of the band gap with respect to the parameters (N_b, G_cut) is practically independent of the k-point grid and vice versa.

The results of the 2D materials presented here are analogous to those of the bulk materials, but two important points need to be addressed. First, ten materials is a small sample to study the convergence properties of G₀W₀ calculations. Therefore, we suggest to treat the results of the 2D materials with care and note that a follow-up work with a larger dataset is warranted. Second, we strongly recommend using the technique introduced by Guandalini et al.⁵⁹ to improve the convergence properties of 2D materials with respect to the k-point grid, since e.g. a Γ-only convergence of the band gap with respect to (N_b, G_cut) is otherwise unstable and extremely slow.

Discussion

The presented results show that the interdependent parameters N_b and G_cut in the dynamically screened Coulomb interaction W can be converged on a Γ-only k-point grid, where the convergence should be performed using the more efficient CS algorithm. Additionally, the band gap can be converged with respect to the k-point grid while using low values of (N_b, G_cut) in W. Taking all of this into account, we recommend the following strategy for converging high-throughput GW calculations:

(i) Converge the interdependent parameters N_b and G_cut in W using the presented ‘cheap first, expensive later’ CS algorithm on a Γ-only k-point grid.

(ii) In parallel, the band gap can be converged with respect to the k-point grid using low values for the parameters in W, e.g., N_b = 200 and G_cut = 4 Ry.

(iii) A final GW calculation is then performed on the converged k-point grid using the converged N_b and G_cut.

For the bulk materials, we roughly estimate that the suggested Γ-only convergence of N_b and G_cut speeds up the convergence on average (median) by a factor of 2.3 (1.7) when compared to the suggestion of van Setten et al.¹³, i.e. performing a convergence on a Γ-centered 2 × 2 × 2 k-point grid. In addition, we observed that the convergence of the k-point grid with low N_b and G_cut in W further speeds up the convergence on average (median) by 9.8 (5.8), when compared to a k-point grid convergence with converged W parameters. Details concerning the speedup estimates and analogous results for the 2D materials can be found in Supplementary Notes 6 and 8, respectively.

The presented workflow is therefore highly effective for future high-throughput GW material screening projects where a speed reduction is more important than absolute convergence. The CS-based workflow is readily available in the provided code base. We would also like to point out that for projects requiring very accurate GW calculations, e.g., for high-precision single-system studies for catalysis, the convergence parameters obtained using our workflow can be used as good starting point for further convergence investigations on denser k-point grids, again saving valuable computational time. The basic principle of ‘cheap first, expensive later’ can also be applied to these more complex MBPT calculations, although the interdependence between different convergence parameters would likely have to be verified again for non-GW approaches. Furthermore, based on the results of Zein et al.⁶⁰, we believe that the proposed workflow can also be applied to vertex-corrected GW variants, as vertex functions tend to be more localized in real space and therefore can be converged with smaller k-point grids.

As a final note, we would like to emphasize that the convergence workflow described here has been tested using an implementation of the G₀W₀ method in a plane-wave basis set. We suppose that a similar convergence workflow is practical for GW implementations utilizing alternative basis sets, such as linearized augmented-plane-waves (LAPW)^{61,62,63,64,65} or linear muffin-tin-orbitals (LMTO)⁶⁶. Since for these basis sets parameters similar to G_cut exist which control the basis set size, it is expected that similar conclusions as for G_cut hold.

In summary, a robust, simple, and efficient convergence workflow for GW calculations is presented, based on the results of more than 7000 GW calculations on a diverse dataset of 70 semiconducting and insulating materials divided into 60 bulk and 10 2D materials. The workflow is based on two main results: Firstly, we showed that a ‘cheap first, expensive later’ coordinate search algorithm is able to converge the two interdependent parameters in the dynamically screened Coulomb interaction W with the same accuracy as the current state-of-the-art method of Bonacci et al.¹⁶, while being more than twice as fast. Secondly, we empirically demonstrated the practical independence of the parameters in W and the density of the k-point grid. These two insights have been integrated into our workflow, dramatically improving computational efficiency. The final convergence workflow is extremely efficient and well suited for high-throughput GW calculations, paving the way for the use of many-body perturbation theory in large-scale materials screening projects to discover materials for various applications with high technological impact. Furthermore, it can also be used to accelerate high-precision single-system GW calculations.

Methods

Ab initio calculations

Computationally relaxed structure files were obtained from the Materials Project^67,68 for all bulk materials and from the Materials Cloud two-dimensional crystals database (MC2D)^69,70 for the 2D materials. For 2D-MoS₂ and 2D-hBN, we used the structures provided by Bonacci et al.¹⁶. All structures were reduced to their primitive standard structure using pymatgen^71,72 to mimic real high-throughput calculations. The corresponding material identifiers, determined convergence parameters, direct gaps at the Γ-point, and other associated metadata can be found in Supplementary Note 1. The DFT calculations were performed with the plane-wave code Quantum ESPRESSO^73,74 with PBE⁷⁵ as exchange-correlation functional and optimized norm-conserving Vanderbilt pseudopotentials from the SG15 library (version 1.2)⁷⁶. Γ-centered k-point grids with an even number of subdivisions defined by a structure-independent reciprocal density ρ_k as defined in pymatgen⁷⁷ were used. All DFT calculations were converged with respect to the k-point grid and plane-wave cutoff until a convergence threshold of 1 kcal mol⁻¹ was reached. The GW corrections were calculated using the YAMBO code^51,78 on the G₀W₀ level. The frequency dependence of the dynamical screening W was approximated through the Godby-Needs plasmon-pole approximation⁵³. To accelerate the convergence of the correlation self-energy Σ_c with respect to the number of empty bands N_b, the Bruneval-Gonze technique⁵⁰ was used. The q → 0 divergence of the Coulomb potential v(q) was treated with the random integration method⁵¹ as implemented in YAMBO. The same number of G-vectors as used for the converged DFT energies was used to expand the KS wavefunctions in the transition matrix elements and the plane-wave expansion. To improve the convergence properties of the 2D materials with respect to the k-point grid, we used the technique introduced by Guandalini et al.⁵⁹ based on a stochastic averaging and interpolation of the screened potential.

Data availability

The data supporting the findings of this study are openly available on Zenodo at https://doi.org/10.5281/zenodo.11125747.

Code availability

The used third-party codes YAMBO and Quantum ESPRESSO are available at the time of publication of this work at https://www.yambo-code.eu/ and https://www.quantum-espresso.org/, respectively. All workflows used to produce the results presented here, as well as scripts for analysis and visualization of all results, are available at https://github.com/MaxGrossmann/FastGWConvergence.

References

Ludwig, A. Discovery of new materials using combinatorial synthesis and high-throughput characterization of thin-film materials libraries combined with computational methods. Npj Comput. Mater. 5, 70 (2019).
Article Google Scholar
Kulik, H. J. et al. Roadmap on machine learning in electronic structure. Electron. Struct. 4, 023004 (2022).
Article CAS Google Scholar
Pyzer-Knapp, E. O. et al. Accelerating materials discovery using artificial intelligence, high performance computing and robotics. Npj Comput. Mater. 8, 84 (2022).
Article Google Scholar
Merchant, A. et al. Scaling deep learning for materials discovery. Nature 624, 80–85 (2023).
Article CAS PubMed PubMed Central Google Scholar
Szymanski, N. J. et al. An autonomous laboratory for the accelerated synthesis of novel materials. Nature 624, 86–91 (2023).
Article CAS PubMed PubMed Central Google Scholar
Greeley, J., Jaramillo, T. F., Bonde, J., Chorkendorff, I. & Nørskov, J. K. Computational high-throughput screening of electrocatalytic materials for hydrogen evolution. Nat. Mater. 5, 909–913 (2006).
Article CAS PubMed Google Scholar
Yim, K. et al. Novel high-κ dielectrics for next-generation electronic devices screened by automated ab initio calculations. NPG Asia Mater. 7, e190 (2015).
Article CAS Google Scholar
Montoya, J. H. & Persson, K. A. A high-throughput framework for determining adsorption energies on solid surfaces. Npj Comput. Mater. 3, 14 (2017).
Article Google Scholar
Schmidt, J. et al. Predicting the thermodynamic stability of solids combining density functional theory and machine learning. Chem. Mater. 29, 5090–5103 (2017).
Article CAS Google Scholar
Brunin, G., Ricci, F., Ha, V.-A., Rignanese, G.-M. & Hautier, G. Transparent conducting materials discovery using high-throughput computing. Npj Comput. Mater. 5, 63 (2019).
Article Google Scholar
Gao, Z. et al. High-throughput screening of 2D van der Waals crystals with plastic deformability. Nat. Commun. 13, 63 (2022).
Article Google Scholar
Hüser, F., Olsen, T. & Thygesen, K. S. Quasiparticle GW calculations for solids, molecules, and two-dimensional materials. Phys. Rev. B 87, 235132 (2013).
Article Google Scholar
van Setten, M. J., Giantomassi, M., Gonze, X., Rignanese, G.-M. & Hautier, G. Automation methodologies and large-scale validation for GW: Towards high-throughput GW calculations. Phys. Rev. B 96, 155207 (2017).
Article Google Scholar
Rasmussen, A., Deilmann, T. & Thygesen, K. S. Towards fully automated GW band structure calculations: What we can learn from 60.000 self-energy evaluations. Npj Comput. Mater. 7, 22 (2021).
Article Google Scholar
Biswas, T. & Singh, A. K. pyGWBSE: a high throughput workflow package for GW-BSE calculations. Npj Comput. Mater. 9, 22 (2023).
Article Google Scholar
Bonacci, M. et al. Towards high-throughput many-body perturbation theory: efficient algorithms and automated workflows. Npj Comput. Mater. 9, 74 (2023).
Article Google Scholar
Rodrigues Pela, R. et al. Critical assessment of G₀W₀ calculations for 2D materials: the example of monolayer MoS2. Npj Comput. Mater. 10, 44 (2024).
Article Google Scholar
Faber, C., Attaccalite, C., Olevano, V., Runge, E. & Blase, X. First-principles GW calculations for DNA and RNA nucleobases. Phys. Rev. B 83, 115123 (2011).
Article Google Scholar
Faber, C., Janssen, J. L., Côté, M., Runge, E. & Blase, X. Electron-phonon coupling in the C₆₀ fullerene within the many-body GW approach. Phys. Rev. B 84, 155104 (2011).
Article Google Scholar
Blase, X., Attaccalite, C. & Olevano, V. First-principles GW calculations for fullerenes, porphyrins, phtalocyanine, and other molecules of interest for organic photovoltaic applications. Phys. Rev. B 83, 115103 (2011).
Article Google Scholar
Förster, A. & Visscher, L. Quasiparticle self-consistent gw-bethe-salpeter equation calculations for large chromophoric systems. J. Chem. Theory Comput. 18, 6779–6793 (2022).
Article PubMed PubMed Central Google Scholar
Umari, P., Mosconi, E. & De Angelis, F. Relativistic GW calculations on CH₃NH₃PbI₃ and CH₃NH₃SnI₃ perovskites for solar cell applications. Sci. Rep. 4, 4467 (2014).
Article PubMed PubMed Central Google Scholar
Pham, T. A., Ping, Y. & Galli, G. Modelling heterogeneous interfaces for solar water splitting. Nat. Mater. 16, 401–408 (2017).
Article CAS PubMed Google Scholar
Guo, Z., Ambrosio, F. & Pasquarello, A. Evaluation of photocatalysts for water splitting through combined analysis of surface coverage and energy-level alignment. ACS Catal. 10, 13186–13195 (2020).
Article CAS Google Scholar
Radin, M. D. & Siegel, D. J. Charge transport in lithium peroxide: relevance for rechargeable metal-air batteries. Energy Environ. Sci. 6, 2370 (2013).
Article CAS Google Scholar
Seo, H., Govoni, M. & Galli, G. Design of defect spins in piezoelectric aluminum nitride for solid-state hybrid quantum technologies. Sci. Rep. 6, 20803 (2016).
Article CAS PubMed PubMed Central Google Scholar
Bechstedt, F. Many-body Approach to Electronic Excitations: Concepts and Applications. Springer Series in Solid-State Sciences (Springer, New York, 2014).
Runge, E. & Gross, E. K. U. Density-functional theory for time-dependent systems. Phys. Rev. Lett. 52, 997–1000 (1984).
Article CAS Google Scholar
Gross, E. K. U. & Kohn, W. Local density-functional theory of frequency-dependent linear response. Phys. Rev. Lett. 55, 2850–2852 (1985).
Article CAS PubMed Google Scholar
Botti, S. et al. Long-range contribution to the exchange-correlation kernel of time-dependent density functional theory. Phys. Rev. B 69, 155112 (2004).
Article Google Scholar
Sharma, S., Dewhurst, J. K., Sanna, A. & Gross, E. K. U. Bootstrap approximation for the exchange-correlation kernel of time-dependent density-functional theory. Phys. Rev. Lett. 107, 186401 (2011).
Article CAS PubMed Google Scholar
Rigamonti, S. et al. Estimating excitonic effects in the absorption spectra of solids: problems and insight from a guided iteration scheme. Phys. Rev. Lett. 114, 146402 (2015).
Article PubMed Google Scholar
Byun, Y.-M., Sun, J. & Ullrich, C. A. Time-dependent density-functional theory for periodic solids: assessment of excitonic exchange-correlation kernels. Electron. Struct. 2, 023002 (2020).
Article CAS Google Scholar
Shishkin, M. & Kresse, G. Implementation and performance of the frequency-dependent GW method within the PAW framework. Phys. Rev. B 74, 035101 (2006).
Article Google Scholar
Shishkin, M. & Kresse, G. Self-consistent GW calculations for semiconductors and insulators. Phys. Rev. B 75, 235102 (2007).
Article Google Scholar
Kotani, T., van Schilfgaarde, M. & Faleev, S. V. Quasiparticle self-consistent GW method: a basis for the independent-particle approximation. Phys. Rev. B 76, 165106 (2007).
Article Google Scholar
Shishkin, M., Marsman, M. & Kresse, G. Accurate quasiparticle spectra from self-consistent GW calculations with vertex corrections. Phys. Rev. Lett. 99, 246403 (2007).
Article CAS PubMed Google Scholar
Cunningham, B., Grüning, M., Pashov, D. & van Schilfgaarde, M. $QSG\hat{W}$: Quasiparticle self-consistent GW with ladder diagrams in W. Phys. Rev. B 108, 165104 (2023).
Kutepov, A. L. Electronic structure of Na, K, Si, and LiF from self-consistent solution of Hedin's equations including vertex corrections. Phys. Rev. B 94, 155101 (2016).
Article Google Scholar
Kutepov, A. L. & Kotliar, G. One-electron spectra and susceptibilities of the three-dimensional electron gas from self-consistent solutions of Hedin’s equations. Phys. Rev. B 96, 035108 (2017).
Article Google Scholar
Neuhauser, D. et al. Breaking the theoretical scaling limit for predicting quasiparticle energies: the stochastic GW approach. Phys. Rev. Lett. 113, 076402 (2014).
Article CAS PubMed Google Scholar
Liu, P., Kaltak, M., Klimeš, J. & Kresse, G. Cubic scaling GW: Towards fast quasiparticle calculations. Phys. Rev. B 94, 165109 (2016).
Article Google Scholar
Grumet, M., Liu, P., Kaltak, M., Klimeš, J. & Kresse, G. Beyond the quasiparticle approximation: fully self-consistent GW calculations. Phys. Rev. B 98, 155143 (2018).
Article CAS Google Scholar
Kutepov, A. L. Self-consistent GW method: O(N) algorithm for polarizability and self energy. Comput. Phys. Commun. 257, 107502 (2020).
Article CAS Google Scholar
Duchemin, I. & Blase, X. Cubic-scaling all-electron GW calculations with a separable density-fitting space-time approach. J. Chem. Theory Comput. 17, 2383–2393 (2021).
Article CAS PubMed Google Scholar
Graml, M., Zollner, K., Hernangómez-Pérez, D., Faria Junior, P. E. & Wilhelm, J. Low-scaling GW algorithm applied to twisted transition-metal dichalcogenide heterobilayers. J. Chem. Theory Comput. 20, 2202–2208 (2024).
Article PubMed PubMed Central Google Scholar
Shi, R., Lin, P., Zhang, M.-Y., He, L. & Ren, X. Subquadratic-scaling real-space random phase approximation correlation energy calculations for periodic systems with numerical atomic orbitals. Phys. Rev. B 109, 035103 (2024).
Article CAS Google Scholar
Rangel, T. et al. Reproducibility in GW calculations for solids. Comput. Phys. Commun. 255, 107242 (2020).
Article CAS Google Scholar
Onida, G., Reining, L. & Rubio, A. Electronic excitations: density-functional versus many-body Green’s-function approaches. Rev. Mod. Phys. 74, 601–659 (2002).
Article CAS Google Scholar
Bruneval, F. & Gonze, X. Accurate GW self-energies in a plane-wave basis using only a few empty states: towards large systems. Phys. Rev. B 78, 085125 (2008).
Article Google Scholar
Marini, A., Hogan, C., Grüning, M. & Varsano, D. yambo: An ab initio tool for excited state calculations. Comput. Phys. Commun. 180, 1392–1403 (2009).
Article CAS Google Scholar
Godby, R. W., Schlüter, M. & Sham, L. J. Self-energy operators and exchange-correlation potentials in semiconductors. Phys. Rev. B 37, 10159–10175 (1988).
Article CAS Google Scholar
Godby, R. W. & Needs, R. J. Metal-insulator transition in Kohn-Sham theory and quasiparticle theory. Phys. Rev. Lett. 62, 1169–1172 (1989).
Article CAS PubMed Google Scholar
Stankovski, M. et al. G⁰W⁰-band gap of ZnO: effects of plasmon-pole models. Phys. Rev. B 84, 241201 (2011).
Article Google Scholar
Leon, D. A. et al. Frequency dependence in GW made simple using a multipole approximation. Phys. Rev. B 104, 115157 (2021).
Article CAS Google Scholar
Leon, D. A., Ferretti, A., Varsano, D., Molinari, E. & Cardoso, C. Efficient full frequency GW for metals using a multipole approach for the dielectric screening. Phys. Rev. B 107, 155130 (2023).
Article CAS Google Scholar
Shih, B.-C., Xue, Y., Zhang, P., Cohen, M. L. & Louie, S. G. Quasiparticle band gap of ZnO: high accuracy from the conventional G⁰W⁰ approach. Phys. Rev. Lett. 105, 146401 (2010).
Article PubMed Google Scholar
Ergönenc, Z., Kim, B., Liu, P., Kresse, G. & Franchini, C. Converged GW quasiparticle energies for transition metal oxide perovskites. Phys. Rev. Mater. 2, 024601 (2018).
Article Google Scholar
Guandalini, A., D’Amico, P., Ferretti, A. & Varsano, D. Efficient GW calculations in two dimensional materials through a stochastic integration of the screened potential. Npj Comput. Mater. 9, 44 (2023).
Article CAS Google Scholar
Zein, N. E., Savrasov, S. Y. & Kotliar, G. Local self-energy approach for electronic structure calculations. Phys. Rev. Lett. 96, 226403 (2006).
Article CAS PubMed Google Scholar
Usuda, M., Hamada, N., Kotani, T. & van Schilfgaarde, M. All-electron GW calculation based on the LAPW method: application to wurtzite ZnO. Phys. Rev. B 66, 125101 (2002).
Article Google Scholar
Friedrich, C., Blügel, S. & Schindlmayr, A. Efficient implementation of the GW approximation within the all-electron FLAPW method. Phys. Rev. B 81, 125102 (2010).
Article Google Scholar
Friedrich, C., Betzinger, M., Schlipf, M., Blügel, S. & Schindlmayr, A. Hybrid functionals and GW approximation in the FLAPW method. J. Phys.: Condens. Matter 24, 293201 (2012).
PubMed Google Scholar
Gulans, A. et al. exciting: a full-potential all-electron package implementing density-functional theory and many-body perturbation theory. J. Phys.: Condens. Matter 26, 363202 (2014).
PubMed Google Scholar
Haule, K. & Mandal, S. All electron GW with linearized augmented plane waves for metals and semiconductors. Comput. Phys. Commun. 295, 108986 (2024).
Article CAS Google Scholar
Kotani, T. & van Schilfgaarde, M. All-electron GW approximation with the mixed basis expansion based on the full-potential LMTO method. Solid State Commun. 121, 461–465 (2002).
Article CAS Google Scholar
Jain, A. et al. Commentary: The Materials Project: A materials genome approach to accelerating materials innovation. APL Mater. 1, 011002 (2013).
Article Google Scholar
Ong, S. P. et al. The Materials Application Programming Interface (API): A simple, flexible and efficient API for materials data based on REpresentational State Transfer (REST) principles. Comput. Mater. Sci. 97, 209–215 (2015).
Article Google Scholar
Mounet, N. et al. Two-dimensional materials from high-throughput computational exfoliation of experimentally known compounds. Nat. Nanotechnol. 13, 246–252 (2018).
Article CAS PubMed Google Scholar
Campi, D., Mounet, N., Gibertini, M., Pizzi, G. & Marzari, N. Expansion of the materials cloud 2D database. ACS Nano 17, 11268–11278 (2023).
Article CAS PubMed PubMed Central Google Scholar
Togo, A. and Tanaka, I. Spglib: a software library for crystal symmetry search. Preprint at https://arxiv.org/abs/1808.01590 (2018).
Ong, S. P. et al. Python Materials Genomics (pymatgen): a robust, open-source python library for materials analysis. Comput. Mater. Sci. 68, 314–319 (2013).
Article CAS Google Scholar
Giannozzi, P. et al. QUANTUM ESPRESSO: a modular and open-source software project for quantum simulations of materials. J. Phys.: Condens. Matter 21, 395502 (2009).
PubMed Google Scholar
Giannozzi, P. et al. Advanced capabilities for materials modelling with Quantum ESPRESSO. J. Phys.: Condens. Matter 29, 465901 (2017).
CAS PubMed Google Scholar
Perdew, J. P., Burke, K. & Ernzerhof, M. Generalized Gradient Approximation Made Simple. Phys. Rev. Lett. 77, 3865–3868 (1996).
Article CAS PubMed Google Scholar
Hamann, D. R. Optimized norm-conserving Vanderbilt pseudopotentials. Phys. Rev. B 88, 085117 (2013).
Article Google Scholar
Jain, A. et al. A high-throughput infrastructure for density functional theory calculations. Comput. Mater. Sci. 50, 2295–2310 (2011).
Article CAS Google Scholar
Sangalli, D. et al. Many-body perturbation theory calculations using the yambo code. J. Phys.: Condens. Matter 31, 325902 (2019).
CAS PubMed Google Scholar

Download references

Acknowledgements

The authors thank the staff of the Compute Center of the Technische Universität Ilmenau and especially Mr. Henning Schwanbeck for providing an excellent research environment. Additionally we would also like to thank Miguel A. L. Marques, Bochum, Germany, for the inspiring discussions and the provision of the automated symmetry detection aiding the Quantum ESPRESSO workflows. This work is supported by the Deutsche Forschungsgemeinschaft DFG (Project 537033066).

Funding

Open Access funding enabled and organized by Projekt DEAL.

Author information

These authors contributed equally: Max Großmann, Malte Grunert.

Authors and Affiliations

Institute of Physics and Institute of Micro- and Nanotechnologies, Technische Universität Ilmenau, 98693, Ilmenau, Germany
Max Großmann, Malte Grunert & Erich Runge

Authors

Max Großmann
View author publications
You can also search for this author in PubMed Google Scholar
Malte Grunert
View author publications
You can also search for this author in PubMed Google Scholar
Erich Runge
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

M.G. and M.G. conceived the idea; M. Großmann wrote the workflows based on input from M. Grunert and E.R.; M. Großmann ran the calculations; M.G. and M.G. analyzed the data; M. Großmann visualized the results; M.G. and M.G. wrote the manuscript; E.R. supervised the work; all authors modified and approved the manuscript. Max Großmann and Malte Grunert contributed equally to this work.

Corresponding authors

Correspondence to Max Großmann or Malte Grunert.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Großmann, M., Grunert, M. & Runge, E. A robust, simple, and efficient convergence workflow for GW calculations. npj Comput Mater 10, 135 (2024). https://doi.org/10.1038/s41524-024-01311-9

Download citation

Received: 27 February 2024
Accepted: 05 June 2024
Published: 27 June 2024
DOI: https://doi.org/10.1038/s41524-024-01311-9
Springer Nature Limited

A robust, simple, and efficient convergence workflow for GW calculations

Abstract

Similar content being viewed by others

Speeding up GW Calculations to Meet the Challenge of Large Scale Quasiparticle Predictions

Towards fully automated GW band structure calculations: What we can learn from 60.000 self-energy evaluations

Towards high-throughput many-body perturbation theory: efficient algorithms and automated workflows