Machine learning and atomistic origin of high dielectric permittivity in oxides

Shimano, Yuho; Kutana, Alex; Asahi, Ryoji

doi:10.1038/s41598-023-49603-2

Machine learning and atomistic origin of high dielectric permittivity in oxides

Article
Open access
Published: 14 December 2023

Volume 13, article number 22236, (2023)
Cite this article

Download PDF

You have full access to this open access article

Scientific Reports

Machine learning and atomistic origin of high dielectric permittivity in oxides

Download PDF

Yuho Shimano¹,
Alex Kutana¹ &
Ryoji Asahi¹

809 Accesses
Explore all metrics

Abstract

Discovering new stable materials with large dielectric permittivity is important for future energy storage and electronics applications. Theoretical and computational approaches help design new materials by elucidating microscopic mechanisms and establishing structure–property relations. Ab initio methods can be used to reliably predict the dielectric response, but for fast materials screening, machine learning (ML) approaches, which can directly infer properties from the structural information, are needed. Here, random forest and graph convolutional neural network models are trained and tested to predict the dielectric constant from the structural information. We create a database of the dielectric properties of oxides and design, train, and test the two ML models. Both approaches show similar performance and can successfully predict response based on the structure. The analysis of the feature importance allows identification of local geometric features leading to the high dielectric permittivity of the crystal. Dimensionality reduction and clustering further confirms the relevance of descriptors and compositional features for obtaining high dielectric permittivity.

High dielectric ternary oxides from crystal structure prediction and high-throughput screening

Article Open access 06 March 2020

Computational screening of materials with extreme gap deformation potentials

Article Open access 20 July 2022

High-throughput ab initio calculations on dielectric constant and band gap of non-oxide dielectrics

Article Open access 04 October 2018

Introduction

High dielectric permittivity materials are the key component in capacitive electronic and high electric power density applications and devices¹. Besides the required high relative dielectric permittivity, the desired properties for such materials include temperature and electric field stability, low dielectric losses, and high breakdown voltage. Most of the presently known materials with highest permittivities do not meet all of these conditions, limiting applications. For example, the high permittivity of ferroelectrics near the phase transition shows large variations with respect to temperature and external electric field. In another kind of materials, CaCu₃Ti₄O₁₂ being a typical example², the apparent high permittivity arises due to extrinsic effects, e.g. barrier layer capacitance at the grain boundaries^3,4,5, making these materials impractical due to the high dielectric loss.

Insulating paraelectrics are free from these shortcomings, and are therefore considered the best candidates for dielectric applications. As the permittivity of the known paraelectrics is only moderate (10–10²), the present challenge is to find stable and loss-free paraelectrics with large permittivity > 10². One of the most successful and systematic approaches towards this goal is to increase the intrinsic permittivity of the host paraelectric by permittivity “boosting”^6,7,8 through impurity doping. In In-Nb co-doped rutile TiO₂, permittivity boosting to > 10⁴ was originally reported⁹. Subsequent studies revealed that the major part of this apparent permittivity increase stems from the grain boundary^10,11,12 and contact¹³ barrier layer capacitances, just as in CaCu₃Ti₄O₁₂. More detailed studies^6,7,13 show however, that at low temperatures where all thermally excited carriers are frozen out and insulating state restored, the rutile permittivity is indeed boosted by co-doping, although the effect is smaller than originally reported. Our previous theoretical analysis⁸ confirmed intrinsic permittivity boosting in co-doped rutile and other substituted paraelectric titanates, and also showed that the effect can be accounted for by the lattice mechanism. It was found that the boosting is due to the softening of the active phonon mode by the local strain¹⁴ from impurities, and a simple descriptor in the form of the maximum Ti–O bond length was proposed⁸. The descriptor was found heuristically; finding such descriptors is generally challenging and largely depends on luck and intuition. The descriptor is of limited utility as it only applied to titanates; correlation between more general structural features and permittivity should be pursued to apply to the other metal oxides.

In this regard, machine learning (ML) approaches offer a more systematic way of finding relevant descriptors and features in materials, which can also be utilized for property predictions. Here, we use two ML approaches, random forest (RF)^15,16 and graph convolutional neural network (GCNN)^17,18, to predict dielectric constants, dynamic stability, and identify features that are relevant to high permittivity. While differing in approaches, both methods can be used for classification and regression tasks. RF is an ensemble method that operates by polling decision trees, while GCNNs can learn complex dependencies on the graph through neighborhood aggregation schemes. RF has been previously employed¹⁹ for predicting the dielectric constants of oxides found in the Materials Project²⁰ database. Being one of the most robust and best performing universal techniques for regression and classification, RF can serve as a benchmark for other ML techniques, such as GCNNs, for dielectric constant prediction. Here we extend our training set of metal oxides to include non-titanates: Hf- and Zr-based perovskites and cubic double perovskites, to explore local strain sensitivity in other materials, as well as other high permittivity mechanisms besides the strain tuning of Ti–O interactions. We train and test the two classes of models using the dataset, describe their performances, and discuss their similarities and differences.

Dataset

Derivative structure enumeration

For ML model training, we generate an in-house ab initio dataset with optimized geometries and static electronic and ionic dielectric tensors of candidate large dielectric constants materials. Oxides composed of alkaline earth and transition metal elements, with rutile, perovskite, Ruddlesden-Popper, and orthorhombic Cmcm structures, were used as prototypes for co-doping and isovalent substitutions. Co-doping in rutile TiO₂ and rutile phases²¹ of SiO₂ (stishovite) and SnO₂ was explored for boosting the dielectric permittivity in these materials^7,9,13,22. In rutile prototypes, aliovalent co-doping with III-V (Al³⁺, Ga³⁺, In³⁺, Sc³⁺, Y³⁺, La³⁺–V⁵⁺, Nb⁵⁺, Ta⁵⁺) and II-VI (Mg²⁺, Ca²⁺, Sr²⁺, Ba²⁺–Cr⁶⁺, Mo⁶⁺, W⁶⁺) ions was performed at the X = Ti⁴⁺/Si⁴⁺/Sn⁴⁺ cation sites, with the general formula A^(4-δ)+B^(4+δ)+X₂O₈, with δ = 1,2. Here the ionic valencies were presumably assigned; however, the charge transfer may result in zero band-gap metallic states after self-consistent electronic structure calculations. Such metallic states were excluded in the present study. The co-doping motifs are shown in Fig. 1a. The prototypes for isovalent substitutions were Pnma perovskite CaTiO₃, Ruddlesden-Popper phases^23,24 Sr₂TiO₄ and Sr₃Ti₂O₇, cubic perovskite barium zirconate²⁵ BaZrO₃, and Cmcm strontium zirconate²⁶ SrZrO₃. Ca²⁺, Sr²⁺, Ba²⁺, and Pb²⁺ isovalent substitutions were performed on the alkaline earth metal site and Ti⁴⁺, Zr⁴⁺, and Hf⁴⁺ on the transition metal site. An example BaPbSr₂Ti₄O₁₂ structure, obtained by the Ba, Pb, Sr substitutions on the A-site of the Pnma CaTiO₃, is shown in Fig. 1b. Finally, substitutions in double perovskites, with a general formula A₂B’B’’O₆, were performed. We employed the formula A²⁺₂B’^(4-δ)+B’’^(4+δ)+O₆, with A = Ca²⁺, Sr²⁺, Ba²⁺, and δ = 1, 2, 3, constraining the charge states of B’ and B’’ to maintain neutrality. Here, high symmetry cubic structures were explored, Fig. 1c, with B’ and B’’ ions selected from across the periodic table.

Symmetry unique derivative supercell structures^27,28 were generated by substitutions on the sites of the primitive cell lattice, as implemented in the ICET code²⁹. The properties of 6808 structures with isovalent substitutions and 453 co-doped structures were calculated. The structures were first optimized, and then DFPT response was obtained. Out of 6808 substituted structures, 1991 structures were dynamically stable, and 4817 had imaginary phonon frequencies. In co-doped rutiles, 159 structures were dynamically stable, and remaining 294 structures unstable. Relaxations along the displacements of the unstable phonon modes were not pursued, and the dynamically unstable structures were excluded from regression model training. The dataset consisting of the 1991 substituted and 159 co-doped dynamically stable structures was used in both classification and regression training.

For the above dataset, we evaluated the optimized geometry, Born charges, phonon vibrational modes, and dielectric permittivity by using density functional theory and density functional perturbation theory (DFPT)³⁰ implemented in the VASP package³¹. The calculated data is available and top 20 materials which have large dielectric permittivity are listed in Table S1.

Predicting the dielectric constant and dynamic stability with random forest

In this section we employed the RF machine learning method. First, we constructed machine learning regression models to predict the dielectric constant. Because the enhancement of dielectric permittivity is closely related to softening of the optical phonon modes^32,33, we made another machine model, namely, a classification model in terms of the minimum optical mode. One of the features of RF is accessibility of importance of descriptors as will be discussed in detail.

The RF machine learning method employs the bootstrap aggregating for sampling the decision/prediction trees to improve the stability and accuracy, and shows reasonably good performance in most cases for classification and regression. The input variables included elemental properties and structural features encoded by using the pymatgen³⁴ and matminer³⁵ packages. The list of the features used is given in Table S2 of the Supplementary Information; the choice of the 45 descriptors was similar to that in a previous study¹⁹.

The calculated dielectric constants for the given structures in the dataset were used to train the RF regression³⁶, as implemented in the scikit-learn³⁷ code. The ionic and electronic contributions were treated separately, and the decimal logarithm of the dielectric constant was taken as the target variable for the model. The logarithm value was used to mitigate the disproportional effect of the systems with large dielectric constants on the model. Tables S3–S5 provide the results of the hyperparameter space search. The number of decision trees was set to 150, and the maximum tree depth was not constrained, by keeping the minimum number of samples required to split internal nodes at the default value of 2. In evaluation, fourfold cross-validation was used. RF is robust against the variation of hyperparameters, with the default hyperparameters values resulting in nearly same performance as those obtained by search using grid or Bayesian optimizations. The root-mean-squared errors (RMSE) and coefficients of determination (R²) were used as the metrics for model performance.

The parity plots for the calculated and RF predicted dielectric constant are shown in Fig. 2. The calculated permittivities vary in a wide range and noticeably reach ɛ_ion > 100. The large dielectric permittivities are obtained with doped and substituted titania as previously reported⁸. The electronic contribution is reasonably small, ɛ_el > 10, as in the intrinsic paraelectric materials. Thus, a large dielectric permittivity can be achieved by realizing a large ɛ_ion. After optimizing the hyperparameters of the RF regression model as described above, the RMSE = 0.174 and R² = 0.887 for the ionic permittivity, and RMSE = 0.032 and R² = 0.921 for the electronic permittivity were obtained for the test data in the cross-validation. These values are better than the previous work using the RF model constructed for the metal oxides in the Materials Project²⁰, RMSE = 0.148 and R² = 0.73 for the ionic permittivity¹⁹. It is important to note that our model successfully predicts the dielectric constant even with a significant structural distortion by doping and the corresponding quite high permittivity.

We also created a classification model using the phonon frequency of the smallest optical mode as the supervised data. All materials with a band gap greater than 0.2 eV, including those with imaginary phonon frequencies, were used. A classification model was generated by labeling three classes of phonon frequencies ω: ω_opt² ≤ 0, 0 < ω_opt ≤ 2 THz, and ω_opt > 2 THz. The first class, ω_opt² ≤ 0, indicates dynamical instability. The instability may include a phase transition to a ferroelectric phase. The second class, 0 < ω_opt ≤ 2 THz, is of interest in terms of possibility of colossal dielectric permittivity, as it is inversely proportional to ω_opt²^30,32,38. The third class, ω_opt > 2 THz, is categorized into normal paraelectric materials. For the classification model, we used the RF classification of the scikit-learn code. The same descriptors and the optimization procedure were used as in the regression model.

Prediction results of the classification model are shown in Fig. 3. Note that the imaginary part of the phonon frequency was expressed as a negative number in the plot. Regarding the prediction accuracy, the accuracy and F₁ scores were 0.885 and 0.880, respectively.

Because the above machine learning models predict the dielectric constant with a reasonable accuracy, it is interesting to investigate which descriptors are important for the prediction. The importance of descriptors in the RF regression of ionic permittivity and their correlation coefficients are shown in Fig. 4a. We observe that local differences in atomic properties and structures turn out to be important. For instance, the most important descriptor for the regression is the minimum value of the absolute local differences between the number of unfilled electrons in each atom and its neighbors (local difference in Nunfilled (min)) defined as min_i(δ(N_{unfilled, i})). The local difference for each atom or site i is calculated using the following equation³⁹:

$$\delta \left({p}_{i}\right)= \frac{{\sum }_{n}{A}_{ni}\left|{p}_{n}-{p}_{i}\right|}{{\sum }_{n}{A}_{ni}}.$$

Here, the sum is taken over all nearest neighbor sites n as determined by the Voronoi tessellation, p_n is the property (e.g. atomic number) of site n, and A_ni is a weight that corresponds to the area of the facet on the tessellation corresponding to that neighbor. The second most important descriptor is the maximum of the differences in the atomic numbers Z among neighbors (local difference in Number (max)) defined as max_i(δ(Z_i)). The third most important descriptor is the mean of the standard deviations of the Voronoi areas around each atom (Voro_area_std_dev (mean)). The fourth most important descriptor is the mean of the neighbor distance variation, MNDV, which indicates the degree to which the atoms are displaced from the high-symmetry positions, and the degree to which the lattice is distorted with respect to the high-symmetry structure, and is calculated from the following equation^19,39:

$$MNDV=\frac{1}{{N}_{\rm atom}}{\sum }_{i}\frac{{\sum }_{n}{A}_{ni}|{r}_{ni}-{\bar r}_{i}|}{{\bar r}_{i}{\sum }_{n}{A}_{ni}}.$$

Here, N_atom is the number of atoms in the unit cell, and $\bar{r}_i$ is the average nearest neighbor distance for atom i, i.e. the sum over n of r_ni, distances between atoms i and n, weighted by A_ni/Σ_nA_ni. The fifth important descriptor is the transition metal fraction in the material (TMetalFraction)⁴⁰, the ratio of the number of transition metal atoms to the total number of atoms in the unit cell. This descriptor does not require any structural information, and is based on the compositional information only. In Figs. 4b–f, we plot correlations of some important descriptors with the ionic dielectric constant. There is a high correlation coefficient of “local difference in Nunfilled (min)” with ɛ_ion; however, most of the high ɛ_ion oxides are found around the intermediate descriptor value ~ 2. Around this value, there are many co-doped materials with both large and small ɛ_ion due to variation of their local configurations. We found similar correlations between ε_ion and both “Voro_area_std_dev (mean)” and MNDV. Both of these descriptors are related to geometrical symmetry, with smaller values indicating higher symmetry. These results suggest that just the right amount of asymmetry is needed for permittivity boosting. The trend is consistent with previous studies¹⁹, and our finding of the optimal value of the max(d_Ti-O) descriptor⁸. It was also reported that the Born effective charge of BaTiO₃ decreases with displacement from the cubic symmetry⁴¹ and that some polycrystalline perovskite oxides tend to have larger dielectric constants with increasing symmetry in the DFPT calculations⁴². The fact that such a trend was also observed in co-doping suggests that co-doping may control the symmetry of the local structure and increase the dielectric constant.

The importance of descriptors in the RF classification for phonon frequencies was also investigated. The descriptor importance for the RF classification of phonon frequencies is shown in Fig. 5a, and their correlations with the frequency in Fig. 5b–f. The most important descriptor is the mean of the linear coordination number, (linear CN_2(mean)), indicating how similar the atomic environment is to that of a linearly 2-coordinated atom. Coordination numbers are order parameters assuming values between 0 and 1 quantifying coordination patterns of atoms⁴³. Similarity to 2-coordinated linear geometry may reflect chain instability^44,45 driving ferroelectric transitions in some oxides. Three of the remaining top descriptors are related to the shape of the Voronoi partitions, while another is min_i(δ(N_{unfilled, i})), so that all of the 5 top descriptors are related to the local structure. There is an unusual dispersion in the plot of the minimum phonon frequency and the average value of Voronoi volume maxima around each atom (Voro_vol_maximum (mean)), as shown in Fig. 5e. Here we show that most of the samples have 0 < ω_opt ≤ 2 THz when this descriptor takes more than three. It is suggested that controlling the local structure may reduce the phonon frequency and increase the dielectric constant.

Figure 6a visualizes the data distribution in a two-dimensional descriptor space. The t-distributed stochastic neighbor embedding method (t-SNE)⁴⁶ was used to reduce from 45-dimensional vectors of matminer descriptors into two dimensions. The distances among the data points represent similarity of the descriptor vectors. We observed clustering of the data points where each cluster has overall similar dielectric constant. This means that the descriptors used properly express the magnitude of the dielectric constant. In order to make clustering in the descriptor space, we employed the density-based spatial clustering of applications with noise (DBSCAN)⁴⁷. The DBSCAN is a density-based clustering algorithm. Different from the k-means method, DBSCAN does not require to specify the number of clusters in advance. The data points which do not belong to any clusters are regarded as noise. With DBSCAN, we were able to divide the data set into 40 clusters (Fig. 6b). The average of the dielectric constant in each cluster is clearly distinct. For the two clusters, C-1 and C-2, which show the highest dielectric constant, we counted appearance of the elements in composition of the materials included in each cluster as shown in Fig. 7. The C-1 cluster consists of Pb and alkaline earth metals indicating doping systems for perovskite CaTiO₃. On the other hand, the C-2 cluster mainly includes co-doped rutile TiO₂. Both types of modification turn out to be effective for boosting the dielectric permittivity.

Graph convolutional neural networks

We also utilized graph convolutional neural networks (GCNNs) with SchNet¹⁸ architecture for the regression and classification tasks. GCNNs^48,49 are well suited for learning properties of molecules and crystals^17,50,51,52, whose structure can be naturally represented by graphs. The target property is learned through local message passing⁵³ among the neighboring nodes. Crystal GCNNs operate with the node (atom) and edge (bond) attributes of the graph representing the spatial connectivity of atoms in a crystal¹⁷. In SchNet, atomic numbers are used for initial node embeddings, while a trainable edge filter W(r): ℝ⁺ → ℝ^m is created by passing the Gaussian expansions of interatomic distances r through a multilayer perceptron. Here, m is the number of edge filter attributes. The convolution¹⁸ on each node i is a sum of element-wise products over all neighbors j of node i, i.e. $\Sigma_{j \in \mathcal{N}(i)}$ x_j⊙W(|r_j-r_i|). The complete message passing/interaction layer has additional perceptrons before and after the convolution, and after several interactions, node attributes are pooled into the target value.

Figure 8 shows the parity plot for a fivefold cross-validation SchNet training with log₁₀(ε_ion), yielding the coefficients of determination R² = 0.852, and RMSE = 0.14. The performance is somewhat better than that of RF, with R² = 0.887 and RMSE = 0.174. Here, log₁₀(ε_ion) is taken as the target value, as using the actual ε_ion for the target value results in large absolute errors, Fig. S1. We also compared the performance of SchNet and CGCNN¹⁷, consistently obtaining better results with SchNet. SchNet was found to be rather robust against hyperparameter variation, with rather wide regions of stable performance, as seen in Fig. S2. In particular, 3 interaction layers were sufficient to achieve close to optimal performance.

We benchmarked the improvement of the performance of SchNet and CGCNN with the size of the training dataset. The dataset with dynamically stable structures was split at different ratios, using one part for training, and the remaining part for validation. Both SchNet and CGCNN show significant improvements with increasing dataset size, as seen in Fig. S3, with SchNet performing better than CGCNN. The overall SchNet performance shows a large benefit of using a simple network architecture for high quality prediction of the dielectric constant of complex materials.

One can obtain spatial information about the dielectric constant contribution by querying the node (atom) attributes in the last interaction layer before average pooling. In Fig. 9, individual atomic contributions in In-Nb co-doped rutile TiO₂, which shows one of the largest dielectric constants in the dataset, are shown. The GCNN assigns the largest contribution to O atoms, while Ti is assigned the smallest contribution. This is probably because the GCNN can noticeably detect a difference in local configuration around O atom to assign a wide range of dielectric constants. Significantly large dielectric contributions are seen at O atoms whose Ti–O bond lengths are around 2.02 Å as shown in Fig. 9b. The observation is indeed consistent with the results obtained for doped TiO₂ and pristine rutile TiO₂, in which the softening of Ti–O mode occurs at max(d_Ti-O) ≃ 2.02 Å as the result of the strain induced by doping⁸. This unique feature of the trained GCNN model gives visualization of the atomic contributions to dielectric constant in any unit cell. Figure 10 shows the atomic contributions to dielectric constant in an unrelaxed large rutile TiO₂ unit cell including a pair of In-Nb co-doping. While the local strain is not introduced because each atomic position is not optimized, relatively large contributions around the In-Nb co-doping site are predicted by the trained SchNet model. This is interpreted as contributions to dielectric constant from chemical coordination without changing bond lengths. One can expect additional boosting of dielectric constant in the fully relaxed structure where local stain around doping is introduced. Further performance improvements can be achieved with models utilizing not only information about bond distances, but also bond angles⁵⁴.

A computationally efficient workflow must utilize ML-only steps. Thus, additional ML models for structural optimization or band gap prediction are needed for a closed-cycle screening and discovery of dielectric materials, and our dielectric constant models, which were trained with optimized geometries, must be coupled with another ML model for structural relaxations. Alternatively, one could train a dielectric constant model directly with unrelaxed structures. The comparison, evaluation, benchmarking, and analysis of such models trained with relaxed vs. unrelaxed geometries has not been performed here and is deferred to future work. We also trained a SchNet GCNN using DFT electronic band gaps as target values and relaxed geometries as inputs. The model shows rather good performance, as seen from the parity plot, Fig. S4, and can be used for screening for dielectrics with large gaps. Eventually, it is desirable to search for completely new structures beyond the derivatives by doping. While proposing new structures including their synthesizability with ML-only approaches is still quite challenging, we believe that our prediction models hold promise for use in new materials discovery.

In conclusion, we carried out ML modeling to predict the dielectric constants of oxides and classify their dynamic stability based on the structural information. Two classes of ML models, random forest and graph convolutional neural networks, are used for classification and regression. Both models show similar cross-validation performance with the coefficient of determination R² ~ 0.8–0.9 for predicting the dielectric constants in a wide range 10–10³. Feature importance analysis shows that the local differences of atomic and geometric features play a large role in determining the value of the dielectric constant of the material. Both approaches show fast performance and are suitable for high throughput screening and evaluation of high dielectric constant materials.

Methods

The ionic dielectric response was calculated using a density functional perturbation theory (DFPT) approach³⁰, as implemented in the VASP package³¹. PBEsol functional⁵⁵ with Hubbard U correction⁵⁶ was used. U = 3 eV was applied to d electrons in transition metals, as employed in previous work¹⁹, except for Ti and Sc, where U = 0 was used¹⁴, and U = 5 eV was applied to f electrons in rare earths. Table S6 compares the calculated values of the dielectric constant obtained with different U values with experimental ones, showing U = 3 eV to be a reasonable choice except for the titanates and scandium oxide, where U = 0 is the best choice. Projector-augmented wave (PAW) method was used to treat core electrons. Recommended PAW potentials were employed for all elements except Ti, where valence 4 “Ti” potential was found to better represent the vibrational and optical properties of titanium dioxide¹⁴. The k point grid density of 3000 k points·atom (approximately corresponding to a distance of 0.2 Å⁻¹ between the k points) was used for the integration over the Brillouin zone. The cell shape and atomic position were fully optimized prior to DFPT calculations. The norms of all forces acting on atoms were minimized to within 0.005 eV/Å. We employed this criterion for computational efficiency, although we observed that it led to errors in some cases, in particular for the dynamically unstable structures.

The ionic part of the dielectric tensor ε_{ion αβ} is obtained from DFPT as a sum over phonon modes, according to^30,38,57

$${\varepsilon }_{\rm{ion} \,\alpha \beta }=\frac{{e}^{2}}{{{\varepsilon }_{0}M}_{0}V}{\sum }_{\nu \,}\frac{{\bar{Z}*}_{\nu \alpha }{\bar{Z}*}_{\nu \beta }}{{\omega }_{\nu }^{2}}.$$

(1)

Here, e is an elementary charge, M₀ is a mass reference, V a unit cell volume, ${\bar{Z}*}_{\nu \alpha }={\sum }_{\kappa \beta \,}{Z*}_{\kappa ,\alpha \beta }{\left({M}_{0}/{M}_{\kappa }\right)}^{1/2}{\xi }_{\nu ,\kappa \beta }$ is the αth Cartesian component of the unnormalized effective charge vector for phonon mode ν, and ω_ν is mode frequency. ξ_ν,κβ are the eigenvectors of the dynamical matrix, normalized according to Σ_κβξ_ν,κβ ξ_ν’,κβ = δ_νν’, and Z*_κ,αβ are the atomic Born charges. Note that only modes with nonzero effective charges contribute to the static tensor Eq. (1). The interatomic force constants are used to construct the dynamical matrix. The phonon frequencies were obtained by diagonalizing the dynamical matrix. The mode must be polar, which also makes it a candidate for ferroelectric transition when soft. Note that the large ionic epsilon can be achieved by either increasing Z or decreasing ω. The latter means softening of the phonon mode, which leads to a ferroelectric instability.

The importance I(j) of feature j in a random forest in scikit-learn is defined using Gini importance as

$$I\left(j\right)={\sum }_{i=1}^{n\in F\left(j\right)}\left({w}_{i}{C}_{i}-{w}_{left\left(i\right)}{C}_{left\left(i\right)}-{w}_{right\left(i\right)}{C}_{right\left(i\right)}\right),$$

where F(j) is the importance of feature j, w_j is a weighted number of samples reaching node i, and C_i is an impurity value of node i. Left(i) and right(i) are the right and left child nodes on node i, respectively.

The impurity value C in the classification problem is

$$C\left(i\right)= {\sum }_{k=1}^{N}{f}_{k}(1-{f}_{k}),$$

where N is the number of unique labels and f_k is the frequency of label k.

For regression problems, the impurity value C is

$$C\left(i\right)=\frac{1}{M}{\sum }_{k=1}^{M}{\left({y}_{k} -\mu \right)}^{2},$$

where M is the number of instances, y_k is label for an instance and μ is the mean value given by $\frac{1}{M}{\sum }_{k=1}^{M}{y}_{k}^{2}$. The final value is output after normalizing the Gini importance of each feature.

Data availability

The datasets generated during the current study will be available at https://doi.org/10.17632/m5jhkc3p9d.1 and the other data used in the study will be available on reasonable request from the corresponding author.

References

Yang, Z., Du, H., Jin, L. & Poelman, D. High-performance lead-free bulk ceramics for electrical energy storage applications: Design strategies and challenges. J. Mater. Chem. A 9, 18026–18085 (2021).
Article CAS Google Scholar
Subramanian, M. A., Li, D., Duan, N., Reisner, B. A. & Sleight, A. W. High dielectric constant in ACu₃Ti₄O₁₂ and ACu₃Ti₃FeO₁₂ Phases. J. Solid State Chem. 151, 323–325 (2000).
Article CAS ADS Google Scholar
Sinclair, D. C., Adams, T. B., Morrison, F. D. & West, A. R. CaCu₃Ti₄O₁₂: One-step internal barrier layer capacitor. Appl. Phys. Lett. 80, 2153–2155 (2002).
Article CAS ADS Google Scholar
Lunkenheimer, P. et al. Origin of apparent colossal dielectric constants. Phys. Rev. B 66, 052105 (2002).
Article ADS Google Scholar
Cohen, M. H., Neaton, J. B., He, L. & Vanderbilt, D. Extrinsic models for the dielectric response of CaCu₃Ti₄O₁₂. J. Appl. Phys. 94, 3299–3306 (2003).
Article CAS ADS Google Scholar
Taniguchi, H., Ando, K. & Terasaki, I. Enhancement of the dielectric permittivity of (Nb1/2In1/2)0.02Ti0.98O2 single crystals at low temperatures due to (Nb + In) codoping. Jpn. J. Appl. Phys. 56, 1002 (2017).
Article Google Scholar
Taniguchi, H., Sato, D., Nakano, A. & Terasaki, I. Permittivity boosting in “yellow” (Nb + In) co-doped TiO₂. J. Mater. Chem. C 8, 13627–13631 (2020).
Article CAS Google Scholar
Kutana, A., Shimano, Y. & Asahi, R. Permittivity boosting by induced strain from local doping in titanates from first principles. Sci. Rep. 13, 3761 (2023).
Article CAS PubMed PubMed Central ADS Google Scholar
Hu, W. et al. Electron-pinned defect-dipoles for high-performance colossal permittivity materials. Nat. Mater. 12, 821–826 (2013).
Article CAS PubMed ADS Google Scholar
Li, J. et al. Microstructure and dielectric properties of (Nb + In) co-doped rutile TiO₂ ceramics. J. Appl. Phys. 116, 074105 (2014).
Article ADS Google Scholar
Li, J. et al. Evidences of grain boundary capacitance effect on the colossal dielectric permittivity in (Nb + In) co-doped TiO₂ ceramics. Sci. Rep. 5, 8295 (2015).
Article CAS PubMed PubMed Central ADS Google Scholar
Bovtun, V. et al. Wide range dielectric and infrared spectroscopy of (Nb+In) co-doped rutile ceramics. Phys. Rev. Mater. 2, 075002 (2018).
Article CAS ADS Google Scholar
Kawarasaki, M., Tanabe, K., Terasaki, I., Fujii, Y. & Taniguchi, H. Intrinsic enhancement of dielectric permittivity in (Nb + In) co-doped TiO₂ single crystals. Sci. Rep. 7, 5351 (2017).
Article PubMed PubMed Central ADS Google Scholar
Varadwaj, P. R., Dinh, V. A., Morikawa, Y. & Asahi, R. Polymorphs of titanium dioxide: An assessment of the variants of projector augmented wave potential of titanium on their geometric and dielectric properties. ACS Omega 8, 22003–22017 (2023).
Article CAS PubMed PubMed Central Google Scholar
Ho, T. K. Random decision forests. Proc. 3rd Int. Conf. Doc. Anal Recognit 1, 278–282 (1995).
Article Google Scholar
Breiman, L. Random forests. Mach. Learn. 45, 5–32 (2001).
Article Google Scholar
Xie, T. & Grossman, J. C. Crystal graph convolutional neural networks for an accurate and interpretable prediction of material properties. Phys. Rev. Lett. 120, 145301 (2018).
Article CAS PubMed ADS Google Scholar
Schütt, K. T., Sauceda, H. E., Kindermans, P.-J., Tkatchenko, A. & Müller, K.-R. SchNet—A deep learning architecture for molecules and materials. J. Chem. Phys. 148, 241722 (2018).
Article PubMed ADS Google Scholar
Takahashi, A., Kumagai, Y., Miyamoto, J., Mochizuki, Y. & Oba, F. Machine learning models for predicting the dielectric constants of oxides based on high-throughput first-principles calculations. Phys. Rev. Mater. 4, 103801 (2020).
Article CAS Google Scholar
Jain, A. et al. Commentary: The materials project: A materials genome approach to accelerating materials innovation. APL Mater. 1, 011002 (2013).
Article ADS Google Scholar
Yamanaka, T., Kurashima, R. & Mimaki, J. X-ray diffraction study of bond character of rutile-type SiO₂, GeO₂ and SnO₂. Z. Kristallogr. Cryst. Mater. 215, 424–428 (2000).
Article CAS Google Scholar
Kakimoto, S. et al. Controlling dielectric properties of Nb + X (X = Al, Ga, In) co-doped and Nb-doped rutile-type TiO2 single crystals. J. Mater. Chem. C 11, 1304–1310 (2023).
Article CAS Google Scholar
Ruddlesden, S. N. & Popper, P. New compounds of the K₂NiF₄ type. Acta Crystallogr. 10, 538–539 (1957).
Article CAS Google Scholar
Ruddlesden, S. N. & Popper, P. The compound Sr₃Ti₂O₇ and its structure. Acta Crystallogr. 11, 54–55 (1958).
Article CAS Google Scholar
Megaw, H. D. Crystal structure of double oxides of the perovskite type. Proc. Phys. Soc. 58, 133 (1946).
Article CAS ADS Google Scholar
Kennedy, B. J., Howard, C. J. & Chakoumakos, B. C. High-temperature phase transitions in SrZrO₃. Phys. Rev. B 59, 4023–4027 (1999).
Article CAS ADS Google Scholar
Hart, G. L. W. & Forcade, R. W. Algorithm for generating derivative structures. Phys. Rev. B 77, 224115 (2008).
Article ADS Google Scholar
Hart, G. L. W. & Forcade, R. W. Generating derivative structures from multilattices: Algorithm and application to HCP alloys. Phys. Rev. B 80, 014120 (2009).
Article ADS Google Scholar
Ångqvist, M. et al. ICET—A python library for constructing and sampling alloy cluster expansions. Adv. Theory Simul. 2, 1900015 (2019).
Article Google Scholar
Gonze, X. & Lee, C. Dynamical matrices, Born effective charges, dielectric permittivity tensors, and interatomic force constants from density-functional perturbation theory. Phys. Rev. B 55, 10355–10368 (1997).
Article CAS ADS Google Scholar
Gajdos, M., Hummer, K., Kresse, G., Furthmüller, J. & Bechstedt, F. Linear optical properties in the projector-augmented wave methodology. Phys. Rev. B 73, 045112 (2006).
Article ADS Google Scholar
Maradudin, A. A., Montroll, E. W., Weiss, G. H. & Ipatova, I. P. Theory of Lattice Dynamics in the Harmonic Approximation (Academic Press, 1971).
Google Scholar
Lee, C., Ghosez, P. & Gonze, X. Lattice dynamics and dielectric properties of incipient ferroelectric TiO₂ rutile. Phys. Rev. B 50, 13379–13387 (1994).
Article CAS ADS Google Scholar
Ong, S. P. et al. Python materials genomics (pymatgen): A robust, open-source python library for materials analysis. Comput. Mater. Sci. 68, 314–319 (2013).
Article CAS Google Scholar
Ward, L. et al. Matminer: An open source toolkit for materials data mining. Comput. Mater. Sci. 152, 60–69 (2018).
Article Google Scholar
Louppe, G. Understanding random forests: From theory to practice (2015).
Pedregosa, F. et al. Scikit-learn: Machine learning in Python. Mach. Learn. Python 12, 2825–2830 (2011).
MathSciNet Google Scholar
Zhao, X. & Vanderbilt, D. First-principles study of structural, vibrational, and lattice dielectric properties of hafnium oxide. Phys. Rev. B 65, 233106 (2002).
Article ADS Google Scholar
Ward, L. et al. Including crystal structure attributes in machine learning models of formation energies via Voronoi tessellations. Phys. Rev. B 96, 024104 (2017).
Article ADS Google Scholar
Deml, A. M., O’Hayre, R., Wolverton, C. & Stevanović, V. Predicting density functional theory total energies and enthalpies of formation of metal-nonmetal compounds by linear regression. Phys. Rev. B 93, 085142 (2016).
Article ADS Google Scholar
Ghosez, Ph., Gonze, X., Lambin, Ph. & Michenaud, J.-P. Born effective charges of barium titanate: Band-by-band decomposition and sensitivity to structural features. Phys. Rev. B 51, 6765–6768 (1995).
Article CAS ADS Google Scholar
Kersch, A. & Fischer, D. Phase stability and dielectric constant of ABO₃ perovskites from first principles. J. Appl. Phys. 106, 014105 (2009).
Article ADS Google Scholar
Zimmermann, N. E. R., Horton, M. K., Jain, A. & Haranczyk, M. Assessing local structure motifs using order parameters for motif recognition, interstitial identification, and diffusion path characterization. Front. Mater. 4, 34 (2017).
Article ADS Google Scholar
Yu, R. & Krakauer, H. First-principles determination of chain-structure instability in KNbO₃. Phys. Rev. Lett. 74, 4067–4070 (1995).
Article CAS PubMed ADS Google Scholar
Ghosez, P. S. H., Gonze, X. & Michenaud, J. P. Ab initio phonon dispersion curves and interatomic force constants of barium titanate. Ferroelectrics 206, 205–217 (1998).
Article ADS Google Scholar
van der Maaten, L. & Hinton, G. Visualizing data using t-SNE. J. Mach. Learn. Res. 9, 2579–2605 (2008).
Google Scholar
Ester, M., Kriegel, H.-P., Sander, J. & Xu, X. A density-based algorithm for discovering clusters in large spatial databases with noise. In Proceedings of the Second International Conference on Knowledge Discovery and Data Mining (eds Ester, M. et al.) 226–231 (AAAI Press, 1996).
Google Scholar
Scarselli, F., Gori, M., Tsoi, A. C., Hagenbuchner, M. & Monfardini, G. The graph neural network model. IEEE Trans. Neural Netw. 20, 61–80 (2009).
Article PubMed Google Scholar
Niepert, M., Ahmed, M. & Kutzkov, K. Learning convolutional neural networks for graphs. In Proceedings of The 33rd International Conference on Machine Learning Vol. 48 (eds Balcan, M. F. & Weinberger, K. Q.) 2014–2023 (PMLR, 2016).
Google Scholar
Duvenaud, D. K. et al. Convolutional networks on graphs for learning molecular fingerprints. In Advances in Neural Information Processing Systems Vol. 28 (eds Cortes, C. et al.) (Curran Associates Inc, 2015).
Google Scholar
Kearnes, S., McCloskey, K., Berndl, M., Pande, V. & Riley, P. Molecular graph convolutions: Moving beyond fingerprints. J. Comput. Aided Mol. Des. 30, 595–608 (2016).
Article CAS PubMed PubMed Central ADS Google Scholar
Schmidt, J., Marques, M. R. G., Botti, S. & Marques, M. A. L. Recent advances and applications of machine learning in solid-state materials science. NPJ Comput. Mater. 5, 83 (2019).
Article ADS Google Scholar
Gilmer, J., Schoenholz, S. S., Riley, P. F., Vinyals, O. & Dahl, G. E. Neural message passing for quantum chemistry. In Proceedings of the 34th International Conference on Machine Learning Vol. 70 (eds Precup, D. & Teh, Y. W.) 1263–1272 (PMLR, 2017).
Google Scholar
Choudhary, K. & DeCost, B. Atomistic line graph neural network for improved materials property predictions. NPJ Comput. Mater. 7, 185 (2021).
Article ADS Google Scholar
Perdew, J. P. et al. Restoring the density-gradient expansion for exchange in solids and surfaces. Phys. Rev. Lett. 100, 136406 (2008).
Article PubMed ADS Google Scholar
Anisimov, V. I., Zaanen, J. & Andersen, O. K. Band theory and Mott insulators: Hubbard U instead of Stoner I. Phys. Rev. B 44, 943–954 (1991).
Article CAS ADS Google Scholar
Wu, X., Vanderbilt, D. & Hamann, D. R. Systematic treatment of displacements, strains, and electric fields in density-functional perturbation theory. Phys. Rev. B 72, 035105 (2005).
Article ADS Google Scholar

Download references

Acknowledgements

We thank Drs. Pradeep R. Varadwaj, Hiroki Taniguchi, Van An Dinh, Yoshitada Morikawa, and Koichi Hayashi for their fruitful discussions. The work was also supported by the JSPS Grant-in-Aid for Transformative Research Areas (A) (21H05560, 23H04105).

Author information

Authors and Affiliations

Nagoya University, Furo-cho, Chikusa-ku, Nagoya, Japan
Yuho Shimano, Alex Kutana & Ryoji Asahi

Authors

Yuho Shimano
View author publications
You can also search for this author in PubMed Google Scholar
Alex Kutana
View author publications
You can also search for this author in PubMed Google Scholar
Ryoji Asahi
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

All authors contributed equally to the manuscript.

Corresponding author

Correspondence to Ryoji Asahi.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Shimano, Y., Kutana, A. & Asahi, R. Machine learning and atomistic origin of high dielectric permittivity in oxides. Sci Rep 13, 22236 (2023). https://doi.org/10.1038/s41598-023-49603-2

Download citation

Received: 22 September 2023
Accepted: 10 December 2023
Published: 14 December 2023
DOI: https://doi.org/10.1038/s41598-023-49603-2
Springer Nature Limited

Machine learning and atomistic origin of high dielectric permittivity in oxides

Abstract

Similar content being viewed by others

High dielectric ternary oxides from crystal structure prediction and high-throughput screening

Computational screening of materials with extreme gap deformation potentials

High-throughput ab initio calculations on dielectric constant and band gap of non-oxide dielectrics

Introduction