Machine-learning structural reconstructions for accelerated point defect calculations

Mosquera-Lois, Irea; Kavanagh, Seán R.; Ganose, Alex M.; Walsh, Aron

doi:10.1038/s41524-024-01303-9

Machine-learning structural reconstructions for accelerated point defect calculations

Article
Open access
Published: 06 June 2024

Volume 10, article number 121, (2024)
Cite this article

Download PDF

You have full access to this open access article

npj Computational Materials

Machine-learning structural reconstructions for accelerated point defect calculations

Download PDF

4449 Accesses
2 Citations
15 Altmetric
Explore all metrics

Abstract

Defects dictate the properties of many functional materials. To understand the behaviour of defects and their impact on physical properties, it is necessary to identify the most stable defect geometries. However, global structure searching is computationally challenging for high-throughput defect studies or materials with complex defect landscapes, like alloys or disordered solids. Here, we tackle this limitation by harnessing a machine-learning surrogate model to qualitatively explore the structural landscape of neutral point defects. By learning defect motifs in a family of related metal chalcogenide and mixed anion crystals, the model successfully predicts favourable reconstructions for unseen defects in unseen compositions for 90% of cases, thereby reducing the number of first-principles calculations by 73%. Using CdSe_xTe_1−x alloys as an exemplar, we train a model on the end member compositions and apply it to find the stable geometries of all inequivalent vacancies for a range of mixing concentrations, thus enabling more accurate and faster defect studies for configurationally complex systems.

Machine-learning potentials for crystal defects

Article Open access 12 August 2022

Predicting defect behavior in B2 intermetallics by merging ab initio modeling and machine learning

Article Open access 09 December 2016

Machine learning potential assisted exploration of complex defect potential energy surfaces

Article Open access 24 January 2024

Introduction

Defects control the properties of many functional materials and devices¹, like solar cells^2,3, batteries^4,5, catalysts^6,7,8, and quantum computers^9,10,11,12. To discover better materials for these applications it is thus necessary to predict how their defects behave. However, defect calculations are computationally demanding. The large supercells and high level of theory required to obtain robust predictions typically limit point defect analysis to in-depth studies of specific materials. In a move towards data-driven defect workflows¹³, defect databases^{14,15,16,17,18,19,20} and surrogate models have been developed to predict defect properties, like the dominant defect type¹⁸, formation^{19,21,22,23,24,25,26,27,28,29,30,31,32,33,34,35} and migration³⁵ energies, and charge transition levels^19,25,36. By learning the relationship between defect structure and properties, these models enable high-throughput studies that quickly evaluate and screen a group of materials based on their defect behaviour.^27,28,30,37

Despite progress in accelerating defect predictions, most high-throughput studies are limited in scope. Typically, their training datasets are generated assuming the ideal defect structure inherited from the crystal host, which often lies within a local minimum, thereby trapping a gradient-based optimisation algorithm in a metastable arrangement^38,39,40,41. By yielding incorrect geometries, the predicted defect properties, such as equilibrium concentrations^39,41,42,43, charge transition levels^39,41,42 and recombination rates³⁹, are rendered inaccurate^{8,44,45,46,47}. However, defect structure searching is often too expensive for high-throughput studies that target thousands of defects³⁰ or materials with complex (defect) energy landscapes, like alloys, disordered solids, and low-symmetry crystals.

In this study, we aim to reduce the computational burden of defect structure searching by introducing a machine-learning surrogate model. We build a dataset containing a set of point defect structures, energies, forces and stresses from first-principles, and use it to fine-tune a universal machine-learning force field (MLFF) and qualitatively explore the energy landscape across 132 defects. Defect reconstructions often follow common motifs⁴¹, especially when comparing similar defects in families of related compounds. By learning the plausible reconstructions undergone by defects in similar hosts, a surrogate model can be used to optimise the initial sampling structures and thus identify the promising, low-energy configurations (Fig. 1), as previously shown for surface adsorbates^48,49 and transition state searches⁵⁰.

**Fig. 1: Schematic of a machine-learning surrogate model used to accelerate defect structure searching.**

Results

To assess the ability of a surrogate model to learn defect reconstructions, we will focus on one of the most common — and often strongest in terms of energy-lowering — reconstruction motifs: dimerisation^{41,51,52,53,54,55,56,57,58,59,60,61,62,63,64,65,66,67,68,69,70,71,72,73}. Dimers/trimers have been previously reported for numerous vacancies and interstitials, including \({{V}_{{{{\rm{Se}}}}}}^{0}\) in ZnSe, CuInSe₂ and CuGaSe₂⁵¹, \({{V}_{{{{\rm{S}}}}}}^{0}\) in ZnS⁵¹, \({{V}_{{{{\rm{Cd}}}}}}^{0}\) in CdTe³⁹, \({{V}_{{{{\rm{Sb}}}}}}^{0,+1,+2}\) in Sb₂S/Se₃^40,42, \({{V}_{{{{\rm{Ti}}}}}}^{0,-1}\) and \({{V}_{{{{\rm{Zr}}}}}}^{0}\) in CaZrTi₂O₇⁴⁵, \({{V}_{{{{\rm{Sb}}}}}}^{0}\) in Sb₂O₅⁷⁴, \({{{{{\rm{O}}}}}_{{{{\rm{i}}}}}}^{0}\) in In₂O₃⁵⁵, ZnO⁵⁸, Al₂O₃⁵⁹, MgO^60,61, CdO⁶², SnO₂^63,64, PbO₂⁶⁵, CeO₂⁶⁶, BaSnO₃⁷⁵, In₂ZnO₄⁶⁷ and LiNi_0.5Mn_1.5O₄⁷⁶, \({{{{\rm{Ag}}}}}_{{{{\rm{i}}}}}^{0}\) in AgCl and AgBr⁵³, \({{V}_{{{{\rm{I}}}}}}^{-}\), \({{{{{\rm{I}}}}}_{{{{\rm{i}}}}}}^{0}\), \({{{{{\rm{Pb}}}}}_{{{{\rm{i}}}}}}^{0}\), \({{{{\rm{Pb}}}}}_{{{{{\rm{CH}}}}}_{3}{{{{\rm{NH}}}}}_{3}}^{0}\) and \({{{{\rm{I}}}}}_{{{{{\rm{CH}}}}}_{3}{{{{\rm{NH}}}}}_{3}}^{0}\) in CH₃NH₃PbI₃^68,69,70,71, \({{{{\rm{Pb}}}}}_{i}^{0}\) in CsPbBr₃⁵², (CH₃NH₃)₃Pb₂I₇⁵⁴, (CH₃NH₃)₂Pb(SCN)₂I₂⁷² and \({{{{{\rm{Sn}}}}}_{{{{\rm{i}}}}}}^{0}\) in CH₃NH₃SnI₃⁵⁷. While cation dimerisation has been reported in several hosts (AgCl/Br, CuInSe₂, CuGaSe₂, ZnS/Se, CdTe, Sb₂S/Se₃, CH₃NH₃PbI₃, CsPbBr₃, (CH₃NH₃)₃Pb₂I₇, CH₃NH₃SnI₃)^{41,51,52,53,54}, anion dimers are more common and will be the focus of our study.

To target dimerisation, we consider cation vacancies in low-symmetry metal sulfides/selenides, where their covalent character and soft structures favour dimer formation^41,42,56. Our first-principles dataset spans 50 hosts (exemplified in Fig. 2a) and 132 neutral cation vacancies, covering 25 elements (Fig. 3b) and 6 space groups. The configurational landscape of each vacancy was explored with the ShakeNBreak method^41,77 by applying 15 chemically-guided distortions to the unperturbed defect structure, followed by geometry optimisation with DFT (see Methods)—resulting in a diverse set of trajectories for each defect and the dataset shown in Fig. 2c.

**Fig. 2: Distribution of the defect dataset generated with first-principles calculations.**

**Fig. 3: Analysis of the point defect dataset.**

Defect reconstructions

By analysing our first-principles dataset, we find that 29.9% of the neutral defects undergo symmetry-breaking reconstructions missed by both the standard modelling approach but also when applying a rattle distortion (with energy differences between the identified ground state and the relaxed ideal configuration greater than 0.5 eV; Supplementary Table 1, Supplementary Fig. 1). Rattle distortions (i.e. randomised displacements) have been used in recent studies³⁷ as the prevalence of defect reconstructions have become more recognised. While rattling helps to break the symmetry of the initial defect configuration and escape PES saddle points, it often fails to identify reconstructions with significant energy barriers (i.e. bond formation), highlighting the need for structure searching.

The identified reconstructions are driven by anion–anion bond formation, with the number of new bonds determined by the number of valence electrons lost upon defect formation (Supplementary Fig. 2). In general, energy-lowering structural reconstructions at defects tend to be driven by the localisation of excess charge introduced by the defect formation, through various bonding (re-)arrangements. Here, excess charge refers to the change in valence electrons available for bonding, which is determined by the oxidation state of the original defect atom and the defect charge state—and in fact is the chemical guiding principle used in ShakeNBreak to target likely distortion pathways. For instance, upon forming a neutral antimony vacancy (\({{V}_{{{{\rm{Sb}}}}}}^{0}\)) in Sb₂(S/Se)₃ (where Sb is in the +3 oxidation state), we have removed three bonding electrons and so we have three excess holes. Further changes in the defect charge state will then alter this excess charge (e.g. 2 excess holes in the −1 charge state, or zero excess charge in the ‘fully-ionised’ -3 charge state). Similarly, for a neutral Li vacancy in Li₄SnS₄, we have removed 1 bonding electron and so we have 1 excess hole (and zero excess charge for the fully ionised -1 state). Analogously for an anion vacancy behaving as a donor defect (as in most semiconductors), it would contribute x excess electrons where -x is the oxidation state of the anion in that compound. Defects resulting in one hole (e.g. \({{V}_{{{{\rm{Li}}}}}}^{0}\) in Li₄SnS₄) can easily accommodate the missing charge without strong reconstructions, while defects with two or more holes (e.g. \({{V}_{{{{\rm{Bi}}}}}}^{0}\) in BiSI) tend to form anion dimers or trimers, as shown in Fig. 3b. As a result, anion–anion bonds are more favourable for more positive defect charge states, and can stabilise unexpected defect oxidation states, as observed previously for \({{V}_{{{{\rm{Sb}}}}}}^{+1}\) in Sb₂(S/Se)₃^41,42 and \({{{{{\rm{O}}}}}_{{{{\rm{i}}}}}}^{+1,+2}\) in several metal oxides^41,55,58.

There are some exceptions to this trend, where systems are able to accommodate three or more holes without undergoing strong reconstructions. One example is hosts with d/f metals that adopt multiple stable oxidation states (e.g. Fe, Co, Cu), which can accommodate a hole by adopting a higher oxidation state⁸. To verify this trend, we compared two isostructural A^IIIB^IS₂ systems which only differ in the identity of the B cation: \({{V}_{{{{\rm{Ga}}}}}}^{0}\) in GaCuS₂ and GaAgS₂; and \({{V}_{{{{\rm{In}}}}}}^{0}\) in InCuS₂ and InAgS₂ (Supplementary Fig. 3). In (Ga/In)AgS₂, two of the holes localise in a S–S bond formed by the vacancy nearest neighbours (NN), while the third hole is split between the remaining two NNs. In contrast, in (Ga/In)CuS₂, no dimer forms since three holes are localised in three of the vacancy NNs and five of the Cu ions closer to the vacancy — with these Cu ions showing shorter Cu–S bonds. The different behaviour of Cu and Ag can be rationalised by considering their second ionisation energies (I2(Cu): 20.3 eV, I2(Ag): 21.5 eV)⁷⁸, where the low I2 (and thus higher d states) of Cu(I) favours cation oxidation, while the higher I2 of Ag(I) results in a sulfur dimer accommodating two of the holes (Supplementary Fig. 3).

In addition to systems with d/f elements, defects with nearby anion–anion bonds can localise the positive charge in these bonds and thus avoid forming new ones. This behaviour is exemplified by RhSe₂, where the two symmetry-inequivalent Rh vacancies show different reconstructions. The first vacancy site is surrounded by four Se–Se bonds (Supplementary Fig. 4b), and thus can accommodate the four holes by depopulating the anti-bonding orbitals of these bonds. In contrast, the second site has only one Se–Se bond neighbouring the vacancy (Supplementary Fig. 4c), and thus has to form an additional Se dimer to accommodate the positive charge.

Beyond chalcogenide dimers, other rearrangements to accommodate positive charge involve chalco-halide (e.g. S–Cl formed by \({{V}_{{{{\rm{Bi}}}}}}^{0}\) in AgBiSCl₂) and halide-halide bond formation (e.g. Cl dimers formed by \({{V}_{{{{\rm{Sb}}}}}}^{0}\) in SbSCl₉) (Supplementary Fig. 5). Here we note that the zero-dimensional character of SbSCl₉ enables this defect to undergo strong distortions forming two Cl dimers (Supplementary Fig. 5). Overall, we highlight the common reconstruction motifs exhibited by different defects in various host structures (Supplementary Fig. 2), facilitating the requisite diversity for a model to learn the plausible reconstructions for a group of related defects.

Model training

To develop a model that can be applied for defect structure searching in unseen compositions, we first split our dataset by composition into training, validation and test sets (Supplementary Fig. 6), amounting to 68%, 5% and 27% of defects, respectively. The validation set is then augmented with 5% of the configurations selected for the systems in the training set, to ensure that the structural diversity of the training set is also included for validation (thus evaluating how the model performs for a large diversity of defects and compositions and also how it extrapolates to unseen compositions). This results in training, validation and test sets of 11,955 (63%), 2,100 (11%), and 4830 (26%) configurations, respectively, where configuration denotes a point defect structure with its associated energy, forces and stresses.

To sample the training data, we compared two approaches: (i) a manual method where we sample 10 evenly spaced frames from each relaxation (MS) and (ii) the Dimensionality-Reduced Encoded Clusters approach (DIRECT)⁷⁹, which aims to select a robust training set from a complex configurational space. Surprisingly, we find that, when using datasets of similar sizes, the MS approach performs better—with the DIRECT approach only outperforming MS when the final DIRECT dataset is larger than the MS one (Supplementary Table 3). This is because the DIRECT approach mainly samples structures from the initial ionic steps (Supplementary Fig. 9), which correspond to high distortions and thus lead to larger errors for the low energy structures (Supplementary Fig. 10).

As a surrogate model, we aim for a method that takes an initial defect structure and outputs the energy and structure of the locally relaxed configuration. Machine-learning force fields are ideal for this task since they can map regions of the potential energy surface (PES) by learning the energies, forces, and, optionally stresses of a set of training structures. Specifically, we focus on universal graph-based MLFFs, which are trained on relaxation data from diverse databases of bulk crystals^80,81,82,83, and thus already incorporate general chemical behaviour. Accordingly, we use a universal MLFF as a base model and fine-tune it with a training set of defect configurations. We have compared different model architectures (M3GNet⁸⁰, CHGNet⁸¹ and MACE⁸⁴), elemental reference energies, structure featurisation parameters (graph cutoffs, readout layers) and fine-tuning strategies, which are discussed in detail in the Supporting Information (SI) (Supplementary Notes 1.B). In addition, we compared a model trained on just defect structures, and both defect and bulk structures, with the second case improving performance (Supplementary Table 10 and Supplementary Fig. 12). From these benchmarks, the optimal model architecture and parameters were selected: a M3GNet model⁸⁰ with radial and 3-body cutoffs of 5 Å and 4 Å, respectively, and the weighted atom readout function^80,85 (further details in Methods).

Overall, we note that the mean absolute errors for the absolute energies in the validation and test sets are significant (MAE_E,test = 31.2 meV atom⁻¹, Table 1), but comparable to those obtained in MLFFs used for bulk structure searching of carbon (MAE_E,test = 64.8 meV atom⁻¹)⁸⁶. However, a more meaningful metric for our purpose is the error for the relative energies of each defect configuration relative to its ground state structure (MAE_E,rel,test = 11.3 meV atom⁻¹). Further, we mostly care about the low-energy region of the potential energy surface, which can be measured by calculating the relative energy errors for configurations less than ≈ 5 eV above the global minimum, resulting in MAEs of 3.6 meV atom⁻¹ ≈ 0.29 eV for an 80 atom supercell.

Beyond these metrics, we calculate the Spearman correlation coefficient (ρ) to measure how well the MLFF and DFT energies are monotonically related (i.e. if greater DFT energies correspond to greater MLFF energies⁸⁷). While the value of ρ for the test set is significantly lower than those obtained with MLFFs developed for bulk structure searching for a single composition (0.72 versus 0.98–0.999⁸⁷), this was expected considering that our dataset spans a diverse range of compositions and a wide range of energies. While the errors are high, we note that this does not prevent the model from being used as a qualitative surrogate of the DFT PES for structure searching (i.e. identification of local minima), as previously observed for surface adsorbates^88,89.

Model performance

To evaluate the model performance, we apply the trained model to a robust test set, which includes 13 unseen compositions and 32 defects (accounting for 26% and 26.5% of the total number of compositions and defects in our dataset, respectively; Supplementary Fig. 6). For each defect, the MLFF is used to relax the 15 distorted structures generated with ShakeNBreak⁷⁷ to sample the defect PES. The MLFF-relaxed structures are then compared to identify the different local minima in the MLFF PES using the SOAP fingerprint⁹⁰ of the defect site. These local minima are then further relaxed with DFT. By comparing the ground state identified from the MLFF+DFT approach and full DFT search, we find the former to correctly identify the DFT ground state for 88% of test defects, while simultaneously reducing the number of DFT calculations required by 73% (Table 2) and accelerating structure searching by a factor of 13 (Supplementary Notes 1.C4). In addition, it identifies a more favourable structure than the ones found in the DFT search for V_Ge,9 in TlGeS₂, with an energy lowering of 0.5 eV (Supplementary Fig. 15). The 12% of failed cases, where the MLFF ground state structure differed from the DFT one, mostly involve complex hosts. For instance, \({V}_{{{{\rm{Sn}}}}}\) in Li₄SnS₄ has a complex DFT energy surface, which traps most of the relaxations in very high energy basins (Supplementary Fig. 19). PESs of similar complexity are displayed by the iso-structural systems that were included in the training set (Li₄GeS₄ and Li₄TiS₄; Supplementary Fig. 19), which biases our training data to the high energy region of the PES for these compositions and thus hinders learning the low-energy region. Accordingly, the training data for these systems can be improved by reducing the magnitude of the distortion used by ShakeNBreak to generate their sampling structures; which would improve model performance. Other defects for which the surrogate model misses the most stable structure are V_Tl,0 in TlGeS₂ and V_Bi in BiSeI — yet in both cases the DFT and MLFF+DFT structures are very similar and differ by small energy differences (0.1 and 0.2 eV, respectively) (Supplementary Figures 16 and 17). In all failed cases, while the model misses the full DFT ground state, it still correctly predicts a favourable reconstruction, that lowers the energy compared to the relaxed ideal configuration.

Beyond identifying the correct ground state in the majority of cases, the model has indirectly learned the correlation between the number of holes and the number of formed dimers. For defects with 1 missing electron, the candidate structures generated by the surrogate model rarely contained anion–anion bonds; while for defects with more missing electrons, the model often identifies at least one local minima with a dimer.

The decreased performance observed for out-of-sample compositions less similar to the training set posed the question of what performance could be achieved if targeting a family of more related systems. To consider more similar host compositions, we select the chalcohalide systems from our dataset and split them composition-wise into training, validation and test sets as described in Supplementary Notes 1.C5. After training the model on the training set and applying it to the unseen test defects (details in Supplementary Notes 1.C5), we find that the model identifies the correct ground state for all test cases, and achieves lower mean absolute errors compared to the full model. This suggests that higher accuracy can be achieved when targeting more similar host structures, which is likely the case in most high-throughput defect studies.

Our current trained model is limited to neutral cation vacancies in metal sulphides/selenides. However, the approach can be extended to a different compositional space or defect type by first generating a custom training set through first-principles calculations and using it to fine-tune the universal bulk MLFF.

Application to alloys

Beyond high-throughput studies of many single-phase materials, the surrogate model can also be used to accelerate structure searching in alloys or disordered solids, which is computationally challenging due to the high number of local host compositions and inequivalent defects to consider⁹¹. The distinct local or site environments of a given defect can significantly affect its properties^{34,35,92,93,94,95,96,97,98,99,100,101,102,103}, altering formation and migration energies by up to 1.5 eV^{34,35,93,100,101,102,103}. Properly sampling various site environments is key to characterise the defect behaviour in such cases.

We consider the case of cadmium vacancies in the CdSe_xTe_(1−x) (x = 0, 0.2, 0.3, 0.5, 0.6, 0.8, 0.9, 1) pseudo-binary alloy. For each composition, a supercell is generated through random substitution of Te sites, and the Cd sites with a unique nearest neighbour chemical environment are considered (e.g. Cd surrounded by 4 Te; by 3 Te and 1 Se; by 2 Se and 2 Te, etc) (Fig. 4a). The configurational landscape of each vacancy is explored with the ShakeNBreak method (14 sampling structures), using the relaxations from the pure compositions as the training and validation data while the mixed systems (0 < x < 1) are reserved as the test set.

**Fig. 4: Structures for V_Cd in the CdSe_xTe_1−x alloys.**

After fine-tuning the surrogate model (MLFF) on the training configurations (details in Methods), it is applied to all alloys to perform the structure searching calculations, allowing a more extensive sampling than for the DFT search (31 sampling structures). From the MLFF-relaxed structures, the unique configurations are selected for further relaxation with DFT and compared with the results from the DFT-only search. This comparison shows that the model successfully identifies the ground state for all defects, even in cases where the defects form Te-Se bonds not seen in the training set – which only included the Te-Te and Se-Se bonds formed by V_Cd(CdTe) and V_Cd(CdSe), respectively. Although Te-Se bonds were not present in our defect training set, they were included in the Materials Project (bulk) training set, thus suggesting the benefit of transfer learning for model generalisability.

More significantly, for 70% of the defects, the model identifies a more favourable ground state missed in the coarser DFT search (with a mean energy lowering of −0.4 eV, Supplementary Notes 1.E). These reconstructions are driven by forming a more favourable anion–anion bond (Fig. 4b) and missed in the DFT-only search due to the coarser sampling performed. This illustrates the advantage of the faster surrogate model to tackle defects with complex configurational landscapes, that require a more exhaustive exploration than a DFT-based search would allow, like alloys, compositionally disordered materials^{104,105,106,107}, and low-symmetry or multinary systems with many degrees of freedom in their PES.

Discussion

By building a dataset for defect structure searching, we have demonstrated the prevalence of defect reconstructions missed by the standard modelling approach – and thus the need to perform structure searching in high-throughput defect studies. To reduce the associated computational burden, we have developed a surrogate model by fine-tuning a universal machine-learning force field on defect configurations. By qualitatively learning the defect configurational landscapes, the trained model successfully predicts low-energy defect structures for unseen defect environments in unseen compositions, thereby reducing the number of DFT calculations by 73%. While our current model is limited to neutral cation vacancies in metal chalcogenides, the methodology can be applied to different defect types or compositional spaces. In addition, our openly-available dataset could be used to measure the out-of-distribution performance of universal MLFFs¹⁰⁸ by testing the ability to extrapolate from learned bulk motifs to defect environments.

Beyond accelerating structure searching in high-throughput studies, this approach is ideal for systems with a complex defect landscape, like alloys, disordered, or low-symmetry materials where their many inequivalent defects make it intractable to explicitly calculate all of them with accurate DFT methods. By using a surrogate model, we can consider a range of alloy compositions and all inequivalent defects, while performing a more exhaustive sampling of the PES — thereby identifying more favourable reconstructions missed in the (coarser) DFT-based search. Beyond (pseudo-)binary alloys, this approach could be extended to model more chemically complex systems, like high-entropy alloys, where the MLFF could be trained on defects of the constituent binary systems and applied to the ternary, quaternary, or high-entropy alloys.

A current limitation of this strategy is the handling of defects in distinct charge states, which have different energy landscapes and structural configurations (e.g. a defect in two different charge states can have a common local structure with different energies). The approach could handle the potential energy landscape for each charge state independently (e.g. training a separate model for defects in the -1 charge state). To consider different charge states simultaneously, the net charge state can be encoded as a graph global attribute¹⁰⁹. However, a more descriptive encoding could be achieved by using fourth-generation MLFFs that include atomic charges¹¹⁰. Beyond accounting for the defect charge, another improvement could be MLFFs that are fine-tuned on-the-fly during geometry optimisations. As shown for surface absorbates^88,89, this strategy would accelerate the defect geometry optimisation by skipping many ionic steps that are performed with the surrogate model. Overall, we note the promise of surrogate models to accelerate and increase the accuracy of defect modelling, whether this is by improving structure searching, accounting for metastable configurations^111,112, enabling the calculation of defect formation entropies^109,112, accelerating defect migration studies¹¹³ or going beyond the dilute limit¹⁰⁷.

Methods

High-throughput vacancies in chalcogenide hosts

The conventional supercell approach for modelling defects in periodic solids was used¹¹⁴. To reduce periodic image interactions, supercell dimensions of at least 10 Å in each direction¹¹⁵ were employed. To explore the configurational landscape of each defect, we used the ShakeNBreak code⁷⁷, with a distortion increment of 0.1 and the default rattle standard deviation (10% of the nearest neighbour distance in the bulk supercell). This strategy results in 14 sampling structures. In addition to these, to ensure that dimerisation was properly sampled, we also generated a sampling structure where two defect neighbours were pushed towards each other with a separation of 2 Å, resulting in a total of 15 initial configurations. Due to the limitation of universal MLFFs to describe charge, we only considered one charge state of the defects. We chose the neutral state for several reasons. First, it is often stable for cation vacancies in metal chalcogenides. Secondly, it is usually included within the potential charge states to be calculated for a given defect (e.g. generally ranging from the fully ionised state to (at least) the neutral one), and it often has a complex potential energy landscape. However, we note that we did not check whether it was the thermodynamically stable state for each defect.

All reference calculations were performed with Density Functional Theory using the exchange-correlation functional HSE06¹¹⁶ and the projector augmented wave method¹¹⁷, as implemented in the Vienna Ab initio Simulation Package^118,119. Calculations for the pristine unit cells were performed using a plane wave energy cutoff of 585 eV and sampling reciprocal space with a Monkhorst-Pack mesh of density 900 k-points/site. The convergence thresholds for the geometry optimisations were set to 10⁻⁶ eV and 10⁻⁵eV Å⁻¹ for energy and forces, respectively. Defect relaxations were performed with the Γ-point approximation, which is accurate enough for defect structure searching⁴¹, and with a plane wave energy cutoff of 350 eV. The energy and force thresholds for defect relaxations were set to 10⁻⁴ eV and 10⁻²eV Å⁻¹, respectively. We note that these settings were selected for an efficient exploration of the defect configurational landscape due to the high number of relaxations required for structure searching. In a full defect study aiming for high accuracy, once the ground state configuration is identified with these settings, it should be further relaxed with tighter convergence thresholds and account for spin-orbit coupling when necessary (elements from period five/six and below).

To automate the generation of input files, we designed a workflow using aiida^120,121,122, pymatgen^123,124,125, pymatgen-analysis-defects^126,127, ASE¹²⁸, doped¹²⁹ and ShakeNBreak⁷⁷. This code is available from https://github.com/ireaml/defects_workflow.git. The datasets and trained models are available from the Zenodo repository with https://doi.org/10.5281/zenodo.10579527.

To generate the training and test set for the machine learning model, we processed the DFT defect relaxation data by removing unreasonably high-energy configurations (e.g. structures with positive energies), as they decreased model performance. After cleaning the data, 10 evenly-spaced ionic steps were selected from each relaxation. We used the M3GNet model⁸⁰, as implemented in ref.⁸⁵, with radial and 3 body cutoffs of 5 Å and 4 Å, respectively, and the weighted atom readout function. The loss function was defined as a combination of the mean squared errors for the energies, forces and stresses, with respective weights of 1, 1 and 0.1^80,85. For fine-tuning, the model was initialised with the weights from the trained bulk crystal model⁸⁵ and then trained on the defect training set (see Supplementary Notes 1.B5 for further details). A batch size of 4 and an exponential learning rate scheduler with an initial rate of 5 × 10⁻⁴ were used. The model was trained on a Quadro RTX6000 GPU until the validation errors were converged (30 epochs, 5.3 hours) (Supplementary Fig. 13).

MLFF geometry optimisation was performed with the FIRE algorithm¹³⁰, as implemented in the ASE package¹²⁸, until the mean force was lower than 10⁻⁵eV Å⁻¹ or the number of ionic steps exceeded 1500, which were found to be reasonable thresholds. After relaxing the sampling structures with the model, we identified the different local minima or configurations by calculating the cosine distance between the SOAP descriptor⁹⁰ for the defect site of each configuration, which was found to be an effective metric for identifying different defect motifs. We note that using the SOAP fingerprint of the defect site was more robust than considering the energies or the root mean squared displacement between the structures. The first case can miss local minima if these have similar energies in the MLFF PES, while the second was more sensitive to structural differences far from the defect site. The parameters used to generate the SOAP descriptor were: r = 5 Å, n_max = 10, l_max = 10, σ = 1.0 Å, for the local cutoff, number of radial basis functions, maximum degree of spherical harmonics, and the standard deviation of the Gaussian functions used to expand the atomic density, respectively. To evaluate the correlation between DFT and MLFF energies, the Spearman coefficient was calculated for each defect independently, and then averaged across defects.

Application to the CdSe_xTe_(1−x) alloy

To generate the supercells for the mixed compositions in CdSe_xTe_(1−x) (x = 0.2, 0.3, 0.5, 0.6, 0.8, 0.9), we used random substitution of Te sites with Se. For each supercell, we consider the Cd sites with a unique nearest neighbour chemical environment as vacancy sites (e.g. Cd surrounded by 4 Te; by 3 Te and 1 Se; by 2 Se and 2 Te etc), and generate the vacancy high-symmetry structures with pymatgen^123,124,125. For the DFT-based exploration of the PES, we apply ShakeNBreak with default parameters, generating 14 sampling structures, which were relaxed with DFT as previously described.

To generate the dataset, we again processed the defect relaxation data by removing unreasonably high-energy configurations (>15 eV above the defect ground state configuration). As the training set, we used a combination of defect and bulk configurations: 45 evenly-spaced frames from the 14 relaxations of V_Cd in CdTe and CdSe, and 20 frames from the relaxations of each pristine system, resulting in a total of 1420 frames. For validation, we selected 5 unseen configurations from the 14 relaxations of V_Cd in CdTe and CdSe (total of 140 frames). The M3GNet surrogate model was trained with similar parameters as previously described and until the errors were converged (80 epochs, see Supplementary Fig. 26).

To perform a finer exploration of the PES with the surrogate model, we applied a set of bond distortions generated by ShakeNBreak (−0.6, −0.5, −0.4, −0.3, −0.2) to all unique pair combinations of nearest neighbours (e.g. for a V_Cd surrounded by two Te and two Se anions, Te(1), Te(2), Se(1) and Se(2), we considered the pairs Te(1)-Te(2), Te(1)-Se(1), Te(1)-Se(2), Te(2)-Se(1), Te(2)-Se(2), and Se(1)-Se(2)). By default, for a defect with two missing electrons like \({{V}_{{{{\rm{Cd}}}}}}^{0}\), ShakeNBreak only applies the bond distortions to the two atoms closest to the defect. This is typically a reliable approach for most pure systems, but can miss reconstructions for alloys with complex defect environments (e.g. V_Cd surrounded by a mix of Te and Se anions). The model application and analysis were performed as described in the previous section.

Data availability

The datasets and trained models are available from the Zenodo repository with https://doi.org/10.5281/zenodo.10579527.

Code availability

The code used to generate the defect dataset is available from https://github.com/ireaml/defects_workflow.git.

References

Sambur, J. & Brgoch, J. Unveiling the hidden influence of defects via experiment and data science. Chem. Mater. 35, 7351–7354 (2023).
Article CAS Google Scholar
Shockley, W. & Read, W. T. Statistics of the recombinations of holes and electrons. Phys. Rev. 87, 835–842 (1952).
Article CAS Google Scholar
Kim, S., Márquez, J. A., Unold, T. & Walsh, A. Upper limit to the photovoltaic efficiency of imperfect crystals from first principles. Energy Environ. Sci. 13, 1481–1491 (2020).
Article CAS Google Scholar
Maier, J. Thermodynamics of electrochemical lithium storage. Angew. Chem. Int. Ed. 52, 4998–5026 (2013).
Article CAS Google Scholar
Squires, A. G. et al. Low electronic conductivity of Li₇La₃Zr₂O₁₂ solid electrolytes from first principles. Phys. Rev. Mater. 6, 085401 (2022).
Article CAS Google Scholar
Li, W. et al. Defect engineering for fuel cell electrocatalysts. Adv. Mater. 32, 1907879 (2020).
Article CAS Google Scholar
Pastor, E. et al. Electronic defects in metal oxide photocatalysts. Nat. Rev. Mater. 7, 503–521 (2022).
Article CAS Google Scholar
Kehoe, A. B., Scanlon, D. O. & Watson, G. W. Role of lattice distortions in the oxygen storage capacity of divalently doped CeO₂. Chem. Mater. 23, 4464–4468 (2011).
Article CAS Google Scholar
Ivády, V., Abrikosov, I. A. & Gali, A. First principles calculation of spin-related quantities for point defect qubit research. npj Comput Mater. 4, 76 (2018).
Article Google Scholar
Weber, J. R. et al. Quantum computing with defects. Proc. Natl Acad. Sci. USA 107, 8513–8518 (2010).
Article CAS PubMed PubMed Central Google Scholar
Thomas, J. et al. A substitutional quantum defect in WS₂ discovered by high-throughput computational screening and fabricated by site-selective STM manipulation. Nat. Commun. 15, 3556 (2024).
Article CAS PubMed Google Scholar
Dreyer, C. E., Alkauskas, A., Lyons, J. L., Janotti, A. & Van de Walle, C. G. First-principles calculations of point defects for quantum technologies. Annu. Rev. Mater. Res. 48, 1–26 (2018).
Article CAS Google Scholar
Yan, Q., Kar, S., Chowdhury, S. & Bansil, A. The case for a defect genome initiative. Adv. Mater. 36, 2303098 (2024).
Article CAS Google Scholar
Davidsson, J., Bertoldo, F., Thygesen, K. S. & Armiento, R. Absorption versus adsorption: in 2D materials. npj 2D Mater. Appl. 7, 26 (2023).
Article Google Scholar
Sluydts, M., Pieters, M., Vanhellemont, J., Speybroeck, V. V. & Cottenier, S. High-throughput screening of extrinsic point defect properties in Si and Ge: database and applications. Chem. Mater. 29, 975–984 (2016).
Article Google Scholar
Bertoldo, F., Ali, S., Manti, S. & Thygesen, K. S. Quantum point defects in 2d materials—the QPOD database. npj Comput. Mater. 8, 56 (2022).
Article CAS Google Scholar
Huang, P. et al. Unveiling the complex structure-property correlation of defects in 2D materials based on high throughput datasets. npj 2D Mater. Appl 7, 1–10 (2023).
Google Scholar
Medasani, B. et al. Predicting defect behavior in B2 intermetallics by merging ab initio modeling and machine learning. npj Comput Mater. 2, 1–10 (2016).
Article CAS Google Scholar
Rahman, M. H. et al. Accelerating defect predictions in semiconductors using graph neural networks. APL Mach. Learn. 2, 016122 (2024).
Article Google Scholar
Ivanov, V. et al. Database of semiconductor point-defect properties for applications in quantum technologies. Preprint at https://arxiv.org/abs/2303.16283 (2023).
Kumagai, Y., Tsunoda, N., Takahashi, A. & Oba, F. Insights into oxygen vacancies from high-throughput first-principles calculations. Phys. Rev. Mater. 5, 123803 (2021).
Article CAS Google Scholar
Deml, A. M., Holder, A. M., O’Hayre, R. P., Musgrave, C. B. & Stevanović, V. Intrinsic material properties dictating oxygen vacancy formation energetics in metal oxides. J. Phys. Chem. Lett. 6, 1948–1953 (2015).
Article CAS PubMed Google Scholar
Broberg, D. et al. High-throughput calculations of charged point defect properties with semi-local density functional theory—performance benchmarks for materials screening applications. npj Comput. Mater. 9, 72 (2023).
Article CAS Google Scholar
Mannodi-Kanakkithodi, A. et al. Universal machine learning framework for defect predictions in zinc blende semiconductors. Patterns 3, 100450 (2022).
Article CAS PubMed PubMed Central Google Scholar
Varley, J. B., Samanta, A. & Lordi, V. Descriptor-based approach for the prediction of cation vacancy formation energies and transition levels. J. Phys. Chem. Lett. 8, 5059–5063 (2017).
Article CAS PubMed Google Scholar
Wan, Z., Wang, Q.-D., Liu, D. & Liang, J. Data-driven machine learning model for the prediction of oxygen vacancy formation energy of metal oxide materials. Phys. Chem. Chem. Phys. 23, 15675–15684 (2021).
Article CAS PubMed Google Scholar
Wexler, R. B., Gautam, G. S., Stechel, E. B. & Carter, E. A. Factors governing oxygen vacancy formation in oxide perovskites. J. Am. Chem. Soc. 143, 13212–13227 (2021).
Article CAS PubMed Google Scholar
Frey, N. C., Akinwande, D., Jariwala, D. & Shenoy, V. B. Machine learning-enabled design of point defects in 2d materials for quantum and neuromorphic information processing. ACS Nano 14, 13406–13417 (2020).
Article CAS PubMed Google Scholar
Sharma, V., Kumar, P., Dev, P. & Pilania, G. Machine learning substitutional defect formation energies in ABO₃ perovskites. J. Appl. Phys. 128, 034902 (2020).
Article CAS Google Scholar
Baldassarri, B. et al. Oxygen vacancy formation energy in metal oxides: High-throughput computational studies and machine-learning predictions. Chem. Mater. 35, 10619–10634 (2023).
Article CAS Google Scholar
Park, S. et al. Exploring the latent chemical space of oxygen vacancy formation energy by a machine learning ensemble. ACS Mater. Lett. 6, 66–72 (2024).
Article CAS Google Scholar
Kazeev, N. et al. Sparse representation for machine learning the properties of defects in 2D materials. npj Comput. Mater. 9, 113 (2023).
Article Google Scholar
Choudhary, K. & Sumpter, B. G. Can a deep-learning model make fast predictions of vacancy formation in diverse materials? AIP Adv. 13, 095109 (2023).
Article CAS Google Scholar
Zhao, X., Yu, S., Zheng, J., Reece, M. J. & Zhang, R.-Z. Machine learning of carbon vacancy formation energy in high-entropy carbides. J. Eur. Ceram. Soc. 43, 1315–1321 (2023).
Article CAS Google Scholar
Manzoor, A. et al. Machine learning based methodology to predict point defect energies in multi-principal element alloys. Front. Mater. 8, 673574 (2021).
Article Google Scholar
Polak, M. P., Jacobs, R., Mannodi-Kanakkithodi, A., Chan, M. K. Y. & Morgan, D. Machine learning for impurity charge-state transition levels in semiconductors from elemental properties using multi-fidelity datasets. J. Chem. Phys. 156, 114110 (2022).
Article CAS PubMed Google Scholar
Witman, M. D., Goyal, A., Ogitsu, T., McDaniel, A. H. & Lany, S. Defect graph neural networks for materials discovery in high-temperature clean-energy applications. Nat. Comput. Sci. 3, 675–686 (2023).
Article PubMed Google Scholar
Arrigoni, M. & Madsen, G. K. H. Evolutionary computing and machine learning for discovering of low-energy defect configurations. npj Comput. Mater. 7, 1–13 (2021).
Article Google Scholar
Kavanagh, S. R., Walsh, A. & Scanlon, D. O. Rapid recombination by cadmium vacancies in CdTe. ACS Energy Lett. 6, 1392–1398 (2021).
Article CAS PubMed PubMed Central Google Scholar
Mosquera-Lois, I. & Kavanagh, S. R. In search of hidden defects. Matter 4, 2602–2605 (2021).
Article Google Scholar
Mosquera-Lois, I., Kavanagh, S. R., Walsh, A. & Scanlon, D. O. Identifying the ground state structures of point defects in solids. npj Comput. Mater. 9, 1–11 (2023).
Article Google Scholar
Wang, X., Kavanagh, S. R., Scanlon, D. O. & Walsh, A. Four-electron negative-U vacancy defects in antimony selenide. Phys. Rev. B 108, 134102 (2023).
Article CAS Google Scholar
Wang, X., Kavanagh, S. R., Scanlon, D. O. & Walsh, A. Upper efficiency limit of Sb₂Se₃ solar cells. Joule 8, 1–18 (2024).
Google Scholar
Morris, A. J., Pickard, C. J. & Needs, R. J. Hydrogen/nitrogen/oxygen defect complexes in silicon from computational searches. Phys. Rev. B 80, 144112 (2009).
Article Google Scholar
Mulroue, J., Morris, A. J. & Duffy, D. M. Ab initio study of intrinsic defects in zirconolite. Phys. Rev. B 84, 094118 (2011).
Article Google Scholar
Al-Mushadani, O. K. & Needs, R. J. Free-energy calculations of intrinsic point defects in silicon. Phys. Rev. B 68, 235205 (2003).
Article Google Scholar
Kononov, A., Lee, C.-W., Shapera, E. & Schleife, A. Identifying native point defect configurations in α-alumina. J. Phys. Condens. 35, 334002 (2023).
Article Google Scholar
Schaarschmidt, M. et al. Learned force fields are ready for ground state catalyst discovery. Preprint at https://arxiv.org/abs/2209.12466 (2022).
Lan, J. et al. AdsorbML: a leap in efficiency for adsorption energy calculations using generalizable machine learning potentials. npj Comput. Mater. 9, 172 (2023).
Article CAS Google Scholar
Heinen, S., von Rudorff, G. F. & von Lilienfeld, O. A. Transition state search and geometry relaxation throughout chemical compound space with quantum machine learning. J. Chem. Phys. 157, 221102 (2022).
Article CAS PubMed Google Scholar
Lany, S. & Zunger, A. Metal-dimer atomic reconstruction leading to deep donor states of the anion vacancy in II-VI and chalcopyrite semiconductors. Phys. Rev. Lett. 93, 156404 (2004).
Article PubMed Google Scholar
Kang, J. & Wang, L.-W. High defect tolerance in lead halide perovskite CsPbBr₃. J. Phys. Chem. Lett. 8, 489–493 (2017).
Article CAS PubMed Google Scholar
Wilson, D. J., Sokol, A. A., French, S. A. & Catlow, C. R. A. Defect structures in the silver halides. Phys. Rev. B 77, 064115 (2008).
Article Google Scholar
Zhao, Y. et al. Correlations between immobilizing ions and suppressing hysteresis in perovskite solar cells. ACS Energy Lett. 1, 266–272 (2016).
Article CAS Google Scholar
Ágoston, P., Erhart, P., Klein, A. & Albe, K. Geometry, electronic structure and thermodynamic stability of intrinsic point defects in indium oxide. J. Phys. Condens. Matter 21, 455801 (2009).
Article PubMed Google Scholar
Han, D., Du, M.-H., Dai, C.-M., Sun, D. & Chen, S. Influence of defects and dopants on the photovoltaic performance of Bi₂S₃: first-principles insights. J. Mater. Chem. A 5, 6200–6210 (2017).
Article CAS Google Scholar
Meggiolaro, D., Ricciarelli, D., Alasmari, A. A., Alasmary, F. A. S. & De Angelis, F. Tin versus lead redox chemistry modulates charge trapping and self-doping in tin/lead iodide perovskites. J. Phys. Chem. Lett. 11, 3546–3556 (2020).
Article CAS PubMed Google Scholar
Erhart, P., Klein, A. & Albe, K. First-principles study of the structure and stability of oxygen defects in zinc oxide. Phys. Rev. B 72, 085213 (2005).
Article Google Scholar
Sokol, A. A., Walsh, A. & Catlow, C. R. A. Oxygen interstitial structures in close-packed metal oxides. Chem. Phys. Lett. 492, 44–48 (2010).
Article CAS Google Scholar
Evarestov, R. A., Jacobs, P. W. M. & Leko, A. V. Oxygen interstitials in magnesium oxide: a band-model study. Phys. Rev. B 54, 8969–8972 (1996).
Article CAS Google Scholar
Kotomin, E. A. & Popov, A. I. Radiation-induced point defects in simple oxides. Nucl. Instrum. Methods Phys. Res. B 141, 1–15 (1998).
Article CAS Google Scholar
Burbano, M., Scanlon, D. O. & Watson, G. W. Sources of conductivity and doping limits in CdO from hybrid density functional theory. J. Am. Chem. Soc. 133, 15065–15072 (2011).
Article CAS PubMed Google Scholar
Scanlon, D. O. & Watson, G. W. On the possibility of p-type SnO₂. J. Mater. Chem. 22, 25236–25245 (2012).
Article CAS Google Scholar
Godinho, K. G., Walsh, A. & Watson, G. W. Energetic and electronic structure analysis of intrinsic defects in SnO₂. J. Phys. Chem. C 113, 439–448 (2009).
Article CAS Google Scholar
Scanlon, D. O. et al. Nature of the band gap and origin of the conductivity of PbO₂ revealed by theory and experiment. Phys. Rev. Lett. 107, 246402 (2011).
Article PubMed Google Scholar
Keating, P. R. L., Scanlon, D. O., Morgan, B. J., Galea, N. M. & Watson, G. W. Analysis of intrinsic defects in CeO₂ using a Koopmans-like GGA+U approach. J. Phys. Chem. C. 116, 2443–2452 (2012).
Article CAS Google Scholar
Walsh, A., Da Silva, J. L. F. & Wei, S.-H. Interplay between order and disorder in the high performance of amorphous transparent conducting oxides. Chem. Mater. 21, 5119–5124 (2009).
Article CAS Google Scholar
Whalley, L. D., Crespo-Otero, R. & Walsh, A. H-center and V-center defects in hybrid halide perovskites. ACS Energy Lett. 2, 2713–2714 (2017).
Article CAS Google Scholar
Agiorgousis, M. L., Sun, Y.-Y., Zeng, H. & Zhang, S. Strong covalency-induced recombination centers in perovskite solar cell material CH₃NH₃PbI₃. J. Am. Chem. Soc. 136, 14570–14575 (2014).
Article CAS PubMed Google Scholar
Whalley, L. D. et al. Giant Huang-Rhys factor for electron capture by the iodine intersitial in perovskite solar cells. J. Am. Chem. Soc. 143, 9123–9128 (2021).
Article CAS PubMed PubMed Central Google Scholar
Motti, S. G. et al. Defect activity in lead halide perovskites. Adv. Mater. 31, 1901183 (2019).
Article CAS Google Scholar
Xiao, Z., Meng, W., Wang, J. & Yan, Y. Defect properties of the two-dimensional (CH₃NH₃)₂Pb(SCN)₂I₂ perovskite: a density-functional theory study. Phys. Chem. Chem. Phys. 18, 25786–25790 (2016).
Article CAS PubMed Google Scholar
Na-Phattalung, S. et al. First-principles study of native defects in anatase TiO₂. Phys. Rev. B 73, 125205 (2006).
Article Google Scholar
Li, K., Willis, J., Kavanagh, S. R. & Scanlon, D. O. Computational prediction of an antimony-based n-type transparent conducting oxide: F-doped Sb₂O₅. Chem. Mater. 36, 2907–2916 (2024).
Article CAS PubMed PubMed Central Google Scholar
Scanlon, D. O. Defect engineering of basno₃ for high-performance transparent conducting oxide applications. Phys. Rev. B 87, 161201 (2013).
Article Google Scholar
Cen, J., Zhu, B., Kavanagh, S. R., Squires, A. G. & Scanlon, D. O. Cation disorder dominates the defect chemistry of high-voltage LiMn_1.5Ni_0.5O₄ (LMNO) spinel cathodes. J. Mater. Chem. A 11, 13353–13370 (2023).
Article CAS Google Scholar
Mosquera-Lois, I., Kavanagh, S. R., Walsh, A. & Scanlon, D. O. ShakeNBreak: navigating the defect configurational landscape. J. Open Source Softw. 7, 4817 (2022).
Article Google Scholar
NIST Chemistry WebBook. https://doi.org/10.18434/M32147 (Accessed May 2023).
Qi, J., Ko, T. W., Wood, B. C., Pham, T. A. & Ong, S. P. Robust training of machine learning interatomic potentials with dimensionality reduction and stratified sampling. npj Comput. Mater. 10, 1–11 (2024).
Article Google Scholar
Chen, C. & Ong, S. P. A universal graph deep learning interatomic potential for the periodic table. Nat. Comput. Sci. 2, 718–728 (2022).
Article PubMed Google Scholar
Deng, B. et al. Chgnet as a pretrained universal neural network potential for charge-informed atomistic modelling. Nat. Mach. Intell. 5, 1031–1041 (2023).
Article Google Scholar
Merchant, A. et al. Scaling deep learning for materials discovery. Nature 624, 80–85 (2023).
Article CAS PubMed PubMed Central Google Scholar
Batatia, I. et al. A foundation model for atomistic materials chemistry. Preprint at https://arxiv.org/abs/2401.00096 (2024).
Batatia, I., Kovacs, D. P., Simm, G., Ortner, C. & Csanyi, G. MACE: higher order equivariant message passing neural networks for fast and accurate force fields. Adv. Neural Inf. Process Syst. 35, 11423–11436 (2022).
Google Scholar
Chen, C. & Ong, S. P. M3GNet (version 0.2.4). GitHub https://github.com/materialsvirtuallab/m3gnet (2023).
Salzbrenner, P. T. et al. Developments and further applications of ephemeral data derived potentials. J. Chem. Phys. 159, 144801 (2023).
Article CAS PubMed Google Scholar
Pickard, C. J. Ephemeral data derived potentials for random structure search. Phys. Rev. B 106, 014102 (2022).
Article CAS Google Scholar
Musielewicz, J., Wang, X., Tian, T. & Ulissi, Z. FINETUNA: fine-tuning accelerated molecular simulations. Mach. Learn. Technol. 3, 03LT01 (2022).
Article Google Scholar
Jung, H., Sauerland, L., Stocker, S., Reuter, K. & Margraf, J. T. Machine-learning driven global optimization of surface adsorbate geometries. npj Comput Mater. 9, 114 (2023).
Article Google Scholar
Bartók, A. P., Kondor, R. & Csányi, G. On representing chemical environments. Phys. Rev. B 87, 184115 (2013).
Article Google Scholar
Hu, Y.-J. First-principles approaches and models for crystal defect energetics in metallic alloys. Comput. Mater. Sci. 216, 111831 (2023).
Article CAS Google Scholar
Piochaud, J. B. et al. First-principles study of point defects in an fcc Fe-10Ni-20Cr model alloy. Phys. Rev. B 89, 024101 (2014).
Article Google Scholar
Guan, H. et al. Chemical environment and magnetic moment effects on point defect formations in CoCrNi-based concentrated solid-solution alloys. Acta Mater. 187, 122–134 (2020).
Article CAS Google Scholar
Rio, E. D. et al. Formation energy of vacancies in FeCr alloys: dependence on Cr concentration. J. Nucl. Mater. 408, 18–24 (2011).
Article Google Scholar
Zhang, Y. et al. Influence of chemical disorder on energy dissipation and defect evolution in concentrated solid solution alloys. Nat. Commun. 6, 8736 (2015).
Article CAS PubMed Google Scholar
Zhang, Y. et al. Atomic-level heterogeneity and defect dynamics in concentrated solid-solution alloys. Curr. Opin. Solid State Mater. Sci. 21, 221–237 (2017).
Article CAS Google Scholar
Arora, G., Bonny, G., Castin, N. & Aidhy, D. S. Effect of different point-defect energetics in Ni₈₀X₂₀ (X=Fe, Pd) on contrasting vacancy cluster formation from atomistic simulations. Acta Mater. 15, 100974 (2021).
CAS Google Scholar
Zhao, S., Stocks, G. M. & Zhang, Y. Defect energetics of concentrated solid-solution alloys from ab initio calculations: Ni_0.5Co_0.5, Ni_0.5Fe_0.5, Ni_0.8Fe_0.2 and Ni_0.8Cr_0.2. Phys. Chem. Chem. Phys. 18, 24043–24056 (2016).
Article CAS PubMed Google Scholar
Manzoor, A. & Zhang, Y. Influence of defect thermodynamics on self-diffusion in complex concentrated alloys with chemical ordering. JOM 74, 4107–4120 (2022).
Article CAS Google Scholar
Zhao, S., Egami, T., Stocks, G. M. & Zhang, Y. Effect of d electrons on defect properties in equiatomic NiCoCr and NiCoFeCr concentrated solid solution alloys. Phys. Rev. Mater. 2, 013602 (2018).
Article CAS Google Scholar
Li, C. et al. First principle study of magnetism and vacancy energetics in a near equimolar NiFeMnCr high entropy alloy. J. Appl. Phys. 125, 155103 (2019).
Article Google Scholar
Manzoor, A., Zhang, Y. & Aidhy, D. S. Factors affecting the vacancy formation energy in Fe₇₀Ni₁₀Cr₂₀ random concentrated alloy. Comput. Mater. Sci. 198, 110669 (2021).
Article CAS Google Scholar
Muzyk, M., Nguyen-Manh, D., Kurzydłowski, K. J., Baluc, N. L. & Dudarev, S. L. Phase stability, point defects, and elastic properties of W-V and W-Ta alloys. Phys. Rev. B 84, 104115 (2011).
Article Google Scholar
Wang, Y. et al. Cation disorder engineering yields AgBiS₂ nanocrystals with enhanced optical absorption for efficient ultrathin solar cells. Nat. Photon. 16, 235–241 (2022).
Article CAS Google Scholar
Williford, R., Weber, W., Devanathan, R. & Gale, J. Effects of cation disorder on oxygen vacancy migration in Gd₂Ti₂O₇. J. Electroceram. 3, 409–424 (1999).
Article CAS Google Scholar
Quadir, S. et al. Short- and long-range cation disorder in (Ag_xCu_1−x)₂ZnSnSe₄ kesterites. Chem. Mater. 34, 7058–7068 (2022).
Article CAS Google Scholar
Morrow, J. D. et al. Understanding defects in amorphous silicon with million-atom simulations and machine learning. Angew. Chem. Int. Ed. 63, e202403842 (2024).
Riebesell, J. et al. Matbench discovery—an evaluation framework for machine learning crystal stability prediction. Preprint at https://arxiv.org/html/2308.14920v2 (2023).
Shimizu, K. et al. Using neural network potentials to study defect formation and phonon properties of nitrogen vacancies with multiple charge states in GaN. Phys. Rev. B 106, 054108 (2022).
Article CAS Google Scholar
Ko, T. W., Finkler, J. A., Goedecker, S. & Behler, J. General-purpose machine learning potentials capturing nonlocal charge transfer. Acc. Chem. Res. 54, 808–817 (2021).
Article CAS PubMed Google Scholar
Kavanagh, S. R., Scanlon, D. O., Walsh, A. & Freysoldt, C. Impact of metastable defect structures on carrier recombination in solar cells. Faraday Discuss. 239, 339–356 (2022).
Article CAS PubMed PubMed Central Google Scholar
Mosquera-Lois, I., Kavanagh, S. R., Klarbring, J., Tolborg, K. & Walsh, A. Imperfections are not 0 K: free energy of point defects in crystals. Chem. Soc. Rev. 52, 5812–5826 (2023).
Article CAS PubMed Google Scholar
Pols, M., Brouwers, V., Calero, S. & Tao, S. How fast do defects migrate in halide perovskites: insights from on-the-fly machine-learned force fields. Chem. Commun. 59, 4660–4663 (2023).
Article CAS Google Scholar
Freysoldt, C. et al. First-principles calculations for point defects in solids. Rev. Mod. Phys. 86, 253–305 (2014).
Article Google Scholar
Lany, S. & Zunger, A. Assessment of correction methods for the band-gap problem and for finite-size effects in supercell defect calculations: Case studies for ZnO and GaAs. Phys. Rev. B 78, 235104 (2008).
Article Google Scholar
Heyd, J., Scuseria, G. E. & Ernzerhof, M. Hybrid functionals based on a screened coulomb potential. J. Chem. Phys. 118, 8207–8215 (2003).
Article CAS Google Scholar
Kresse, G. & Furthmüller, J. Efficiency of ab-initio total energy calculations for metals and semiconductors using a plane-wave basis set. Comput. Mater. Sci. 6, 15–50 (1996).
Article CAS Google Scholar
Kresse, G. & Hafner, J. Ab initio molecular dynamics for liquid metals. Phys. Rev. B 47, 558–561 (1993).
Article CAS Google Scholar
Kresse, G. & Hafner, J. Ab initio molecular-dynamics simulation of the liquid-metal–amorphous-semiconductor transition in germanium. Phys. Rev. B 49, 14251–14269 (1994).
Article CAS Google Scholar
Pizzi, G., Cepellotti, A., Sabatini, R., Marzari, N. & Kozinsky, B. AiiDA: automated interactive infrastructure and database for computational science. Comput. Mater. Sci. 111, 218–230 (2016).
Article Google Scholar
Uhrin, M., Huber, S. P., Yu, J., Marzari, N. & Pizzi, G. Workflows in AiiDA: engineering a high-throughput, event-based engine for robust and modular computational workflows. Comput. Mater. Sci. 187, 110086 (2021).
Article Google Scholar
Huber, S. P. et al. AiiDA 1.0, a scalable computational infrastructure for automated reproducible workflows and data provenance. Sci. Data 7, 300 (2020).
Article PubMed PubMed Central Google Scholar
Ong, S. P. et al. Python Materials Genomics (pymatgen): A robust, open-source python library for materials analysis. Comput. Mater. Sci. 68, 314–319 (2013).
Article CAS Google Scholar
Jain, A. et al. Commentary: The materials project: A materials genome approach to accelerating materials innovation. APL Mater. 1, 011002 (2013).
Article Google Scholar
Ong, S. P. et al. The materials application programming interface (API): a simple, flexible and efficient API for materials data based on REpresentational state transfer (REST) principles. Comput. Mater. Sci. 97, 209–215 (2015).
Article Google Scholar
Shen, J.-X. & Varley, J. pymatgen-analysis-defects: A python package for analyzing point defects in crystalline materials. J. Open Source Softw. 9, 5941 (2024).
Article Google Scholar
Shen, J.-X., Voss, L. F. & Varley, J. B. Simulating charged defects at database scale. J. Appl. Phys. 135, 145102 (2024).
Article CAS Google Scholar
Larsen, A. H. et al. The atomic simulation environment-a python library for working with atoms. J. Condens. Matter Phys. 29, 273002 (2017).
Article Google Scholar
Kavanagh, S. R. et al. doped: Python toolkit for robust and repeatable charged defect supercell calculations. J. Open Source Softw. 9, 6433 (2024).
Article Google Scholar
Bitzek, E., Koskinen, P., Gähler, F., Moseler, M. & Gumbsch, P. Structural relaxation made simple. Phys. Rev. Lett. 97, 170201 (2006).
Article PubMed Google Scholar
van der Maaten, L. & Hinton, G. Visualizing data using t-SNE. J. Mach. Learn. Res. 9, 2579–2605 (2008).
Google Scholar
Hinton, G. E. & Roweis, S. Stochastic neighbor embedding. Adv. Neural Inf. Process Syst. 15, 857–864 (2002).
Google Scholar
Chen, C., Ye, W., Zuo, Y., Zheng, C. & Ong, S. P. Graph networks as a universal machine learning framework for molecules and crystals. Chem. Mater. 31, 3564–3572 (2019).
Article CAS Google Scholar

Download references

Acknowledgements

The authors thank David O. Scanlon for discussions on defect symmetry breaking. I.M.L. acknowledges Imperial College London for funding a President’s PhD scholarship. S.R.K. acknowledges the EPSRC Centre for Doctoral Training in the Advanced Characterisation of Materials (CDT-ACM)(EP/S023259/1) for funding a PhD studentship. A.M.G. is supported by EPSRC Fellowship EP/T033231/1. A.W. is supported by EPSRC project EP/X037754/1. We are grateful to the UK Materials and Molecular Modelling Hub for computational resources, which are partially funded by EPSRC (EP/P020194/1 and EP/T022213/1). This work used the ARCHER2 UK National Supercomputing Service (https://www.archer2.ac.uk) via our membership of the UK’s HEC Materials Chemistry Consortium, which is funded by EPSRC (EP/L000202). We acknowledge the Imperial College London’s High Performance Computing services for computational resources.

Author information

Authors and Affiliations

Thomas Young Centre and Department of Materials, Imperial College London, Exhibition Road, London, SW7 2AZ, UK
Irea Mosquera-Lois, Seán R. Kavanagh & Aron Walsh
Thomas Young Centre and Department of Chemistry, Imperial College London, 80 Wood Ln, London, W12 7TA, UK
Alex M. Ganose
Department of Physics, Ewha Womans University, 52 Ewhayeodae-gil, Seodaemun-gu, Seoul, 03760, Korea
Aron Walsh

Authors

Irea Mosquera-Lois
View author publications
You can also search for this author in PubMed Google Scholar
Seán R. Kavanagh
View author publications
You can also search for this author in PubMed Google Scholar
Alex M. Ganose
View author publications
You can also search for this author in PubMed Google Scholar
Aron Walsh
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Conceptualisation & Project Administration: All authors. Investigation and methodology: I.M.-L. Supervision: S.R.K., A.M.G., A.W. Writing—original draft: I.M.-L. Writing—review & editing: All authors. Resources and funding acquisition: A.M.G., A.W. These author contributions are defined according to the CRediT contributor roles taxonomy.

Corresponding author

Correspondence to Aron Walsh.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Mosquera-Lois, I., Kavanagh, S.R., Ganose, A.M. et al. Machine-learning structural reconstructions for accelerated point defect calculations. npj Comput Mater 10, 121 (2024). https://doi.org/10.1038/s41524-024-01303-9

Download citation

Received: 19 January 2024
Accepted: 24 May 2024
Published: 06 June 2024
DOI: https://doi.org/10.1038/s41524-024-01303-9
Springer Nature Limited

Associated content

Machine Learning of Defects in Crystals

Collection 21 November 2023

Machine-learning structural reconstructions for accelerated point defect calculations

Abstract

Similar content being viewed by others

Machine-learning potentials for crystal defects

Predicting defect behavior in B2 intermetallics by merging ab initio modeling and machine learning

Machine learning potential assisted exploration of complex defect potential energy surfaces

Introduction

Results

Defect reconstructions

Model training

Model performance

Application to alloys

Discussion

Methods

High-throughput vacancies in chalcogenide hosts

Application to the CdSe_xTe_(1−x) alloy

Data availability

Code availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Supplementary information

Supplementary Information

Rights and permissions

About this article

Cite this article

Machine Learning of Defects in Crystals

Navigation

Machine-learning structural reconstructions for accelerated point defect calculations

Abstract

Similar content being viewed by others

Machine-learning potentials for crystal defects

Predicting defect behavior in B2 intermetallics by merging ab initio modeling and machine learning

Machine learning potential assisted exploration of complex defect potential energy surfaces

Introduction

Results

Defect reconstructions

Model training

Model performance

Application to alloys

Discussion

Methods

High-throughput vacancies in chalcogenide hosts

Application to the CdSexTe(1−x) alloy

Data availability

Code availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Supplementary information

Supplementary Information

Rights and permissions

About this article

Cite this article

Share this article

Search

Navigation

Application to the CdSe_xTe_(1−x) alloy