Abstract
Since the proposal of the “Materials Genome Initiative”, several material databases have emerged and advanced many materials fields. In this work, we present the Materials Informatics Platform with Three-Dimensional Structures (MIP-3d). More than 80,000 structural entries, mainly from the inorganic crystal structural database, are included in MIP-3d. Density functional theory calculations are carried out for over 30,000 entries in the database, which contain the relaxed crystal structures, density of states, and band structures. The calculation of the equations of state and sound velocities is performed for over 12,000 entries. Notably, for entries with band gap values larger than 0.3 eV, the band degeneracies for the valence band maxima and the conduction band minima are analysed. The electrical transport properties for approximately 4,400 entries are also calculated and presented in MIP-3d under the constant electron-phonon coupling approximation. The calculations of the band degeneracies and electrical transport properties make MIP-3d a database specifically designed for thermoelectric applications.
Measurement(s) | structural entity • relaxed crystal structure • density of states • band structure • electrical transport property • equation of states |
Technology Type(s) | density functional theory • computational modeling technique |
Factor Type(s) | material |
Machine-accessible metadata file describing the reported data: https://doi.org/10.6084/m9.figshare.15164577
Similar content being viewed by others
Background & Summary
Accelerating the process of materials research and development has become a common pursuit in all countries around the world1,2. How to quickly obtain new materials based on low-cost and highly reliable prediction methods to guide experiments is an important issue. Traditional materials research and development mainly rely on trial and error, which consumes considerable time and resources. In 2011, the White House proposed the “Materials Genome Initiative” (MGI)3. The project consists of three parts: high-throughput (HTP) calculations, experimental methods, and digital databases. Its goal is to reduce the cost of materials research and shorten the development cycle4,5.
The MGI focuses on the combination of calculations, experiments, and databases. Among them, big data for materials have been of vital importance. After the proposal of the MGI, several HTP digital material databases based on first-principles calculations have emerged, such as the Materials Project (MP, https://materialsproject.org)6,7, the Automatic Flow for Materials Discovery (AFLOW, http://aflow.org)8,9,10, the Open Quantum Materials Database (OQMD, http://www.oqmd.org)11, and Novel Materials Discovery (NOMAD, https://www.nomad-coe.eu/)12. These platforms all provide basic information such as the formation energy, phase diagram, and electronic structure. They also provide extended functions, including searching for the elastic properties of compounds13, piezoelectric materials discovery14, online analysis and the design of an algorithm for other material properties. Most of these material databases are open and can be accessed by all researchers seeking to obtain the required material information. These databases have greatly promoted the development of the materials field, for example, thermoelectric materials. Chen et al.15 studied more than 48,000 materials from MP platform, and calculated the electrical transport properties of approximately 25,000 semiconductor materials to form the MP electrical transport database. Then Ricci et al.16 made a more detailed summary of the overall distribution of the electrical transport properties in MP. Based on the AFLOW database, Wang et al.17 calculated the power factors for sintered materials. Toher et al.18 tested 75 materials based on AFLOW and proposed several low thermal conductivity materials, such as AgI and CuI, which could be used for thermoelectric application.
In recent years, we established our own materials data repository, i.e., the Materials Informatics Platform with Three-Dimensional Structures (MIP-3d). Our initial purpose was to apply big data technology to functional materials (such as thermoelectric materials)19,20 of interest. The transport data calculated by home-made packages such as TransOpt21 have been integrated into MIP-3d, where electronic relaxation times are computed by the constant electron-phonon coupling approximation (see below). To date, MIP-3d has recorded over 30,000 electronic structures, 4,400 electrical transport properties, and 12,000 equations of state and sound velocities. For entries with finite band gaps, the band degeneracy for the band-edge states has been analysed. Band degeneracy serves as a convenient search criterion for good thermoelectric materials22. In the rest of this paper, we present the details of the computational methodology, data record and technical validation of the data in MIP-3d (http://www.mip3d.org).
Methods
The thermoelectric performance is governed by the dimensionless figure of merit, ZT = (S2σT)/κ, where S, σ, T, and κ are the Seebeck coefficient, electrical conductivity, absolute temperature, and thermal conductivity, respectively. In the Boltzmann transport theory, the electrical conductivity σ and Seebeck coefficient S are expressed as follows:
Here, \({\varepsilon }_{n{\bf{k}}}\) and vnk are the electronic energy and group velocity, respectively, corresponding to band index n and reciprocal coordinate k, and \({\tau }_{n{\bf{k}}}\) is the electronic relaxation time. \(T,\mu ,V,{f}_{\mu },\)and e are respectively the absolute temperature, the Fermi level, the volume of the unit cell, the Fermi-Dirac distribution, and the electron charge. Identifying high-performance thermoelectric materials by optimizing the individual parameters of ZT is a difficult task23. To cope with this challenge, Xing et al.24 proposed the electronic fitness function t = (σ/τ)S2/N2/3, where N is the volumetric density of states (DOS) and τ is the relaxation time. Usually, valley anisotropy25,26, band convergence27, heavy-light band combinations28,29, reduced dimensionality30, and nonparabolic bands31,32 will complicate the electronic structures and enlarge the fitness function. Good thermoelectric materials usually possess complex electronic structures, and thus, with the help of the electronic fitness function, one can efficiently identify materials with complex band characteristics.
Herein, the electronic relaxation time is the important parameter for determining the electrical transport coefficients. By the full evaluation of the electron-phonon coupling matrix, one can obtain the relaxation time accurately33,34, but the computational cost is too high to be applicable in high-throughput calculations. The constant relaxation time approximation can predict the Seebeck coefficient reasonably35. However, because of the undetermined relaxation times, the calculations of electrical conductivity are less accurate, which limit the prediction power. Thanks to the constant electron-phonon coupling approximation, the computational cost is moderate and the electrical transport coefficients have been predicted well, such as the studies in diamond-like chalcogenides20. The electronic relaxation time in our work is written as:
Here C is the constant electron-phonon coupling. Equation 3 demonstrates that the electronic scattering phase space is treated explicitly in our method, which is more accurate than the constant relaxation time approximation. The C constant can be expressed as follows under the deformation potential approximation:
where Edef is the deformation potential constant of the band edge, and G is the Young’s modulus.
Besides the calculations of electrical transport properties, MIP-3d also contains several other quantities suitable for thermoelectric study, such as the band degeneracy and sound velocity. All these calculations make MIP-3d a repository for the HTP study in thermoelectrics. The rest of the work will present the overall workflow and the modules in MIP-3d, as well as the data for thermoelectric-related quantities.
Workflow
The calculation method of MIP-3d mainly includes two modules: an initial structure check and HTP calculations. The overall processes are shown in Fig. 1, and each step is explained in detail below.
Initial structure check
Most of the materials structure information in MIP-3d came from the Inorganic Crystal Structure Database36,37 and the MP6. All structures with partially occupied atomic sites had been ignored. With the help of the phonopy code38,39, we obtained the primitive cells of all compounds, as well as their space groups and the Wyckoff symbols on atomic sites. Duplicated entries were screened out based on the chemical formulas, space groups, and atomic Wyckoff symbols, and we obtained 84,908 entries out of 139,257 initial structures containing 60,628 entries from MP and 78,629 from ICSD.
High-throughput calculations
We performed first-principles HTP calculations for portions of the 84,908 entries on several of the properties, including structural optimization calculations, self-consistency and DOS calculations, band structure calculations, electrical transport calculations and equation of state calculations. The number of entries for the respective properties is shown in Fig. 1. All the calculations in the present work were performed using the Vienna ab initio simulation package (VASP)40,41 based on density functional theory with the projector-augmented wave method42,43. The Perdew–Burke–Ernzerhof generalized gradient approximation was used as the exchange-correlation functional44. The Hubbard U values from ref. 45 were applied46. In our high-throughput calculation, the same U values were adopted for the same elements in different entries. Recently, Timrov et al.47 developed a new framework based on the density functional perturbation theory to calculate the U more accurately, but it is not within the scope of this paper. A Gaussian-type smearing with the smearing factor of 0.05 eV was adopted throughout the work. A plane-wave cut-off energy of 520 eV and an energy convergence criterion of 10−4 eV for self-consistency were adopted. In this work, most of the pseudopotential files recommended by the VASP (https://cms.mpi.univie.ac.at/vasp/vasp) were adopted, except for W (W instead of W_pv) and Re (Re_pv instead of Re), since some abnormal horizontal lines appeared in the band structures when pseudopotential files of W_pv/Re were used (The comparison of two band structures can be found in the supplemental Fig. S1). The computational parameters and statistics of the respective results are shown below.
Magnetism precheck
This module determined whether to set spin-polarization-related tags in the following calculations based on a simple self-consistent calculation with ISPIN=2. The default magnetic moments were 1.0 per atom for ISPIN=2 in VASP. The k-point mesh setting in this module was set as (30/|a|+1, 30/|b|+1, 30/|c|+1), where a, b, and c are the lattice parameter values. If the absolute value of the magnetic moment after convergence for the material investigated was greater than 0.02 μB, we tagged this material as spin-polarized and added the line “ISPIN=2” to all the INCAR files for the following calculations. Based on the current statistics, 16,611 compounds were magnetic, and 14,441 compounds were non-magnetic.
Structural optimization
The atomic positions, the cell shape, and the volume were relaxed in this module. The k-point mesh was set as (30/|a|+1, 30/|b|+1, 30/|c|+1). The convergence criterion of the Hellmann–Feynman force on each atom was less than 10−2 eV/Å. For each compound, we initially performed up to 5 VASP rounds of structural optimization with both the atomic positions and cell freely relaxed (ISIF = 3 & IBRION = 2, NSW = 40). If the convergence criterion was not reached, up to 5 more rounds of structural optimization with only the atomic positions relaxed (ISIF = 0 & IBRION = 1, NSW = 40) were conducted. If the compound did not converge after the above ten rounds, it was tagged “relaxation not converged”. Based on the current statistics, 31,052 out of around 33,000 compounds reached the convergence criterion.
Self-consistent calculations and density of states
If the structural optimization was completed with the “converged” tag, the self-consistent calculation was triggered to obtain the charge density, total energy, and magnetic moments (if the material was tagged “spin-polarized” in the magnetism precheck step). The k-point mesh used in the self-consistent calculations was (60/|a|+1, 60/|b|+1, 60/|c|+1). Moreover, the projected DOS (as shown in Fig. 2) for the material was also obtained based on the self-consistent calculations, and four plots with different levels of smearing factors are displayed online. In MIP-3d, 31,052 self-consistent calculations, as well as their electronic DOSs, were completed. In some of the subsequent calculations, such as those for the band structures and electrical transport properties, the charge density obtained in this step was adopted.
Equations of state
The optimized non-magnetic entries were taken for the equations of state calculations. Nine different volumes, including the optimized volume, were taken into account (Fig. 3a). The structure was scaled to the required volume, and the total energy was subsequently calculated with a self-consistent calculation. The volume-energy potential surfaces was fit by the Vinet-type equation of state to obtain the bulk modulus K48,49. The 12,400 entries with the fitting determination coefficient R2> 0.98 were stored in the MIP-3d database. As shown in Fig. 3(b), for the statistics of K, the bulk moduli of most compounds are between 40~120 GPa, which accounts for approximately 50% of the total entries, and approximately 2,000 entries possess K values less than 40 GPa. According to the formula \({{\rm{V}}}_{{\rm{S}}}={({\bf{K}}/\rho )}^{1/2}\) (VS is the sound velocity and ρ is the density of the compound), a small bulk modulus will result in a low sound velocity of the compound and thus low thermal conductivity50. As shown in Fig. 3(c), for the statistics of the sound velocity, 10,000 compounds exist with sound velocities lower than 2,000 m/s, which may be promising in thermoelectric applications.
Band structure
The high-symmetry k-points of the three-dimensional Brillouin zone used in our band structure calculations referred to refs. 8,51. Forty points existed between each pair of high-symmetry k-points. Band structure calculations were performed for all the relaxed materials, i.e., 31,052 entries. The band gap values in MIP-3d for all the materials were obtained in this step. As shown in Fig. 4(d), most of the band gaps are below 0.03 eV, for which, in principle, good thermoelectric properties are impossible to achieve. From Table 1, most of the unary (83%), binary (78%), and ternary (60%) compounds in MIP-3d are metallic, while approximately 54% of the quaternary compounds are wider-band-gap (gap > 1 eV) materials. This result suggests that quaternary compounds are more likely to have wide band gaps than unary and binary compounds and shows that the band gap of compounds tends to widen as the number of constituent elements increases.
For all the compounds, the elemental projected band structures are displayed, as shown in Fig. 4 for MIP3D-17744-Fe1Nb1Sb1. The bands around the conduction band minima (CBM) for MIP3D-17744-Fe1Nb1Sb1 are typical two-band diagrams, i.e., the CBM at the point X is mainly contributed by Nb, and the second conduction band at the same point is from Fe. The projected DOS plot also reveals the Nb-contributed CBM (Fig. 2); however, the projected band structures are more distinct to demonstrate the band-resolved information. This fact is useful for thermoelectric applications due to the clear presentation of this information, which is lacking in other HTP repositories.
The band degeneracy Nv is another useful band-related feature, especially for thermoelectrics22. Nv consists of two parts: k-point degeneracy and energy degeneracy. The k-point degeneracy represents the number of equivalent k-points corresponding to one irreducible k-point. Within each energy pocket, the number of bands with sufficiently close eigenvalues (0.05 eV from the band edge, either the valence band minima (VBM) or CBM) was defined as the energy degeneracy. A schematic plot of Nv for MIP3D-17744-Fe1Nb1Sb1 is shown in Fig. 5(a). For MIP3D-17744-Fe1Nb1Sb1, the k-point degeneracy at VBM (L point) is 4, and the energy degeneracy is 2; thus, the Nv at the VBM of this compound is 8. Band degeneracy is useful for the quick screening of TE materials since a large Nv will result in a large quality factor. We proceeded with Nv analyses for all the materials with band gaps greater than 0.3 eV in MIP-3d. The statistical results of the VBM and CBM are shown in Fig. 5(b). The plot demonstrates the existence of 894 systems with a VBM Nv greater than four. Note that the statistics of Nv are based on the current 0.05 eV criterion for energy degeneracy. If the criterion is set to 0.1 eV, 1,067 entries will have a VBM Nv greater than four.
Electrical transport
In MIP-3d, for some materials with band gaps > 0.03 eV (more than 4,400), we calculated the electronic transport properties by using TransOpt21. A high-density k-point mesh (240/|a|+1, 240/|b|+1, 240/|c|+1) was adopted. The electronic group velocity was obtained by the momentum matrix method, as implemented in TransOpt package. The constant electron-phonon coupling approximation was adopted, with the Edef = 3 eV and G = 100 GPa for all the materials investigated (Eq. 4). The Seebeck coefficient is independent with the choices of Edef and G under the constant electron-phonon coupling approximation, while the electrical conductivity and power factor are relevant to these values. More accurate power factors can be obtained if the HTP deformation potential calculations are to be solved, which will be done in our future work. Fig. 6 shows the calculated electrical transport properties at 700 K for MIP3D-17744-Fe1Nb1Sb1, including the carrier-concentration-dependent Seebeck coefficients and power factors. The choice of temperature 700 K was due to the potential high temperature thermoelectric applications, as also discussed in our previous works20,21,52. Based on Fig. 6, the maximum power factors (PFmax) for both n-type and p-type transport, as well as the corresponding carrier concentrations and Seebeck coefficients, can be obtained.
According to the calculated PFmax, we took the top 5% of the entries as compounds with promising electrical transport properties. Moreover, a low sound velocity (<2,000 m/s) was taken as the indicator of low thermal conductivity. As shown in Fig. 7(a),(b) and supplemental Table S1, 85 n-type compounds and 90 p-type compounds are screened out. It is a simple screen of the materials with good thermoelectric performance. Although the listed compounds have unusually high absolute values of PFmax due to the uniform deformation potential and Young’s modulus, further studies are still worthy due to their good band-related properties. As shown in supplemental Table S1, many chalcogenides and compounds with heavy elements, such as Bi, Pb, are screened out. Furthermore, the maximum electronic fitness functions tmax are shown in Fig. 7(c),(d). Due to the fact that the electronic fitness functions have the volumetric DOS in the denominator, the electronic scattering phase space are also considered. By comparing the material suggestions in Fig. 7 (a)–(d), around 40% of the materials screened out by PFmax are also recommended by tmax, implying the similarity of the two methods in proposing new thermoelectric candidates.
Data Records
Our MIP-3d can be found at http://www.mip3d.org. We have provided output files for all the calculated compounds, which could be found in figshare and our website. A JSON file is available on the web interface (http://mip3d.org/materials/download) and also in a figshare repository53,54. Table 2 shows the key variables of the materials database, which include the name, the data type and a short description. ‘id’ is the number of each material in the database. ‘formula’ is the chemical formula, and ‘volume’ is the volume of the unit cell. ‘natoms’ is the total number of atoms in the unit cell, and ‘space_group’ is the space group number of the unit cell. ‘energy’ is the total energy of the system obtained by the static calculation, and ‘is_magnetic’ indicates whether the system is magnetic. ‘total_magnetic_moment’ is the magnetic moment value. In addition, the bulk modulus (‘bulk-modulus’), band gap (‘gap’), and system degeneracy (‘degeneracy_vbm’ and ‘degeneracy_cbm’) are given.
Technical Validation
In this work, most of the recommended pseudopotentials from the VASP were adopted, except for W(W) and Re(Re_pv). At each step of the workflow, we set reliable convergence criteria, and the calculation of each step was based on the previous step to achieve convergence. The calculation details were given in the method introduction section above. We performed the following validations for our results in MIP-3d. The Seebeck coefficient values were computed with constant relaxation time (10−14 s), 700 K and a doping level of 1020 cm−3. We benchmarked the volumes (6,000), band gaps (gap > 0.03 eV, 1,100), bulk moduli (1,500) and Seebeck coefficients (739) against the data in an existing 3D material database, MP, as shown in Fig. 8. The Pearson correlation coefficients (the average of the absolute relative errors) between MIP-3d and MP for the volume, band gap, bulk modulus, and Seebeck coefficient are 0.998 (1.71%), 0.991 (6.39%), 0.993 (4.73%), 0.953 (4.59%), and 0.981 (5.39%), respectively, implying high uniformity between this work and MP. Furthermore, we compared the entries with and without U-elements, as shown in the supplemental Fig. S2. The corresponding Pearson correlation coefficients and the average of the absolute relative errors are listed in supplemental Table S2. The Pearson correlation coefficient of band gaps slightly improves from 0.991 (entries with U-elements) to 0.997 (entries without U-elements).
Usage Notes
In this work, we provided a high-throughput electronic structure database for the prediction and discovery of new materials. Our data can be accessed at www.mip3d.org. In addition, the database is growing rapidly.
Code availability
The calculations of the electrical transport properties in this work rely heavily on TransOpt21. The code of TransOpt is available at https://github.com/yangjio4849/TransOpt. In the initial structure checking part, we used the phonopy (http://phonopy.github.io/phonopy/). All the home-made codes used to generate the data is available at https://github.com/yangjio4849/MIP.
References
Greeley, J., Jaramillo, T. F., Bonde, J., Chorkendorff, I. & Norskov, J. K. Computational high-throughput screening of electrocatalytic materials for hydrogen evolution. Nat. Mater. 5, 909–913 (2006).
Bhattacharya, S., Chmielowski, R., Dennler, G. & Madsen, G. K. H. Novel ternary sulfide thermoelectric materials from high throughput transport and defect calculations. J. Mater. Chem. A. 4, 11086–11093 (2016).
Ward, C. Materials Genome Initiative for Global Competitiveness. (2012)
Christodoulou, J. A. Integrated computational materials engineering and materials genome initiative: accelerating materials innovation. Adv. Mater. Processes 171, 28–31 (2013).
Juan, D. P., Barbara, J., Cora, L.-K., Vidvuds, O. & Arthur, P. R. The Materials Genome Initiative, the interplay of experiment, theory and computation. Curr. Opin. Solid State Mater. Sci. 18, 99–117 (2014).
Jain, A., Ong, S. P., Hautier, G., Chen, W. & Persson, K. A. Commentary: The materials project: A materials genome approach to accelerating materials innovation. APL Mater 1, 011002 (2013).
Jain, A. et al. A high-throughput infrastructure for density functional theory calculations. Comput. Mater. Sci. 50, 2295–2310 (2011).
Setyawan, W. & Curtarolo, S. High-throughput electronic band structure calculations: Challenges and tools. Comput. Mater. Sci. 49, 299–312 (2010).
Curtarolo, S. et al. AFLOWLIB.ORG: A distributed materials properties repository from high-throughput ab initio calculations. Comput. Mater. Sci. 58, 227–235 (2012).
Taylor, R. H. et al. A RESTful API for exchanging materials data in the AFLOWLIB.org consortium. Comput. Mater. Sci. 93, 178–192 (2014).
Saal, J. E., Kirklin, S., Aykol, M., Meredig, B. & Wolverton, C. Materials design and discovery with high-throughput density functional theory: The Open Quantum Materials Database (OQMD). JOM 65, 1501–1509 (2013).
Draxl, C. & Scheffler, M. NOMAD: The FAIR concept for big-data-driven materials science. MRS Bull. 43, 676–682 (2018).
Jong, M. D. et al. Charting the complete elastic properties of inorganic crystalline compounds. Sci. Data 2, 150009 (2015).
Jong, M. D., Chen, W., Geerlings, H., Asta, M. & Persson, K. A. A database to enable discovery and design of piezoelectric materials. Sci. Data 2, 150053 (2005).
Chen, W. et al. Understanding thermoelectric properties from high-throughput calculations: trends, insights, and comparisons with experiment. J. Mater. Chem. C 4, 4414–4426 (2016).
Ricci, F. et al. An ab initio electronic transport database for inorganic materials. Sci. Data 4, 170085 (2017).
Wang, S., Wang, Z., Setyawan, W., Mingo, N. & Curtarolo, S. Assessing the thermoelectric properties of sintered compounds via high-throughput ab-initio calculations. Phys. Rev. X 1, 021012 (2011).
Toher, C. et al. High-throughput computational screening of thermal conductivity, Debye temperature, and Gruneisen parameter using a quasiharmonic Debye model. Phys. Rev. B 90, 174107 (2014).
Li, R. X. et al. High-throughput screening for advanced thermoelectric materials: diamond-Like ABX 2 compounds. ACS Appl. Mater. Interfaces 28, 24859–24866 (2019).
Xi, L. L. et al. Discovery of high-performance thermoelectric chalcogenides through reliable high-throughput material screening. J. Am. Chem. Soc. 140, 10785–10793 (2018).
Li, X. et al. TransOpt. A code to solve electrical transport properties of semiconductors in constant electron–phonon coupling approximation. Comput. Mater. Sci. 186, 110074 (2021).
Yan, J. et al. Material descriptors for predicting thermoelectric performance. Energ. Environ. Sci. 8, 983–994 (2015).
Sarikurt, S., Kocaba, T. & Sevik, C. High-throughput computational screening of 2D materials for thermoelectrics. J. Mater. Chem. A 8, 19674–19683 (2020).
Xing, G. et al. Electronic fitness function for screening semiconductors as thermoelectric materials. Phys. Rev. Mater. 1, 065405 (2017).
Sun, J. & Singh, D. J. Thermoelectric properties of AMg2X2, AZn2Sb2 (A=Ca, Sr, Ba; X=Sb, Bi), and Ba2ZnX2 (X=Sb, Bi) Zintl compounds. J. Mater. Chem. A 5, 8499–8509 (2017).
Parker, D. S., May, A. F. & Singh, D. J. Benefits of carrier-pocket anisotropy to thermoelectric performance: The case of p-type AgBiSe2. Phys. Rev. Appl. 3, 064003 (2015).
Pei, Y. et al. Convergence of electronic bands for high performance bulk thermoelectrics. Nature (London) 473, 66 (2011).
Singh, D. J. & Mazin, I. I. Calculated thermoelectric properties of La-filled skutterudites. Phys. Rev. B 56, R1650 (1997).
May, A. F., Singh, D. J. & Snyder, G. J. Influence of band structure on the large thermoelectric performance of lanthanum telluride. Phys. Rev. B 79, 153101 (2009).
Parker, D., Chen, X. & Singh, D. J. High three-dimensional thermoelectric performance from low-dimensional bands. Phys. Rev. Lett. 110, 146601 (2013).
Shi, H., Parker, D., Du, M.-H. & Singh, D. J. Connecting thermoelectric performance and topological-insulator behavior: Bi2Te3 and Bi2Te2Se from first principles. Phys. Rev. Appl. 3, 014004 (2015).
Mecholsky, N. A., Resca, L., Pegg, I. L. & Fornari, M. Theory of band warping and its effects on thermoelectronic transport properties. Phys. Rev. B 89, 155131 (2014).
Xi, J. Y., Wang, D., Yi, Y. P. & Shuai, Z. G. Electron-phonon couplings and carrier mobility in graphynes sheet calculated using the Wannier-interpolation approach. JCP 141, 407 (2014).
Xi, J. Y., Wang, D. & Shuai, Z. G. Electronic properties and charge carrier mobilities of graphynes and graphdiynes from first principles. Wires. Comput. Mol. Sci. 5, 215–227 (2015).
Yang et al. Evaluation of half-heusler compounds as thermoelectric materials based on the calculated electrical transport properties. Adv. Funct. Mater. 19, 2880–2888 (2008).
Belsky, A., Hellenbrandt, M., Karen, V. L. & Luksch, P. New developments in the Inorganic Crystal Structure Database (ICSD): accessibility in support of materials research and design. Acta Crystallogr. 58, 364–369 (2010).
Bergerhoff, G., Hundt, R., Sievers, R. & Brown, I. D. The inorganic crystal structure database. J. Chem. Inf, Comp. Sci. 23, 66–69 (1983).
Atsushi, T. & Isao, T. First principles phonon calculations in materials science. Scr. Mater. 108, 1–5 (2015).
Togo, A. & Tanaka, I. Spglib: a software library for crystal symmetry search. Preprint at https://arxiv.org/abs/1808.01590 (2018).
Kresse, G. & Hafne, J. Ab initio molecular dynamics for liquid metals. Phys. Rev. B 47, 558–561 (1993).
Kresse, G. & Furthmüller, J. Efficient iterative schemes for ab initio total-energy calculations using a plane-wave basis set. Phys. Rev. B 54, 11169–11186 (1996).
Kresse, G. & Joubert, D. From ultrasoft pseudopotentials to the projector augmented-wave method. Phys. Rev. B 59, 1758–1775 (1999).
Blöchl, P. E. Projector augmented-wave method. Phys. Rev. B 50, 17953–17979 (1994).
Perdew, J. P., Burke, K. & Ernzerhof, M. Generalized gradient approximation made simple. Phys. Rev. Lett. 77, 3865–3868 (1996).
Wang, S. D., Wang, Z., Setyawan, W., Mingo, N. & Curtarolo, S. Assessing the thermoelectric properties of sintered compounds via high-throughput ab-initio calculations. Phys. Rev. X 1, 021012 (2011).
Dudarev, S. L., Botton, G. A., Savrasov, S. Y., Humphreys, C. J. & Sutton, A. P. Electron-energy-loss spectra and the structural stability of nickel oxide: An LSDA+U study. Phys. Rev. B 57, 1505 (1998).
Timrov, I., Marzari, N. & Cococcioni, M. Self-consistent Hubbard parameters from density-functional perturbation theory in the ultrasoft and projector-augmented wave formulations. Phys. Rev. B 103, 045141 (2021).
Sun, J. X., Wu, Q., Cai, L. C. & Jing, F. Q. Thermal Vinet Equation of State and Its Applications. Chin. J. High Pressure Phys. 18, 109–115 (2004).
Vinet, P., Ferrante, J., Smith, J. R. & Rose, J. H. A universal equation of state for solids. J. Phys. C 19, L467–L473 (1986).
LI, W. et al. Low sound velocity contributing to the high thermoelectric performance of Ag8SnSe6. Adv. Sci. 3, 1600196 (2016).
Hinuma, Y., Pizzi, G., Kumagai, Y., Oba, F. & Tanaka, I. Band structure diagram paths based on crystallography. Comput. Mater. Sci. 128, 140–184 (2017).
Sheng, Y. et al. Active learning for the power factor prediction in diamond-like thermoelectric materials. npj Comput Mater. 6, 171 (2020).
Yao, M. J. et al. Materials informatics platform with three dimensional structures (MIP-3d). figshare https://doi.org/10.6084/m9.figshare.13655276.v7 (2021).
Yao, M. J. et al. Materials informatics platform with three dimensional structures, workflow and thermoelectric applications. figshare https://doi.org/10.6084/m9.figshare.c.5396844 (2021).
Acknowledgements
This work was supported by the National Key Research and Development Program of China (Nos. 2018YFB0703600 and 2017YFB0701600), the National Natural Science Foundation of China (Grant Nos. 11674211, 21703136, 51632005, and 51761135127), and the 111 Project D16002. W. Z. acknowledges support from the Guangdong Innovation Research Team Project (No. 2017ZT07C062), Guangdong Provincial Key-Lab program (No. 2019B030301001), Shenzhen Municipal Key-Lab program (ZDSYS20190902092905285), and the Centers for Mechanical Engineering Research and Education at MIT and SUSTech. Part of the calculations were supported by the Center for Computational Science and Engineering at Southern University of Science and Technology and the National Supercomputing Center in Guangzhou.
Author information
Authors and Affiliations
Contributions
Yao, M.J. developed the web UI. Wang, Y.X. performed the high-throughput calculations and produced the dataset. The data processing algorithms were developed and tested by Li, X. Sheng, Y, Huo, H.Y. and Xi, L.L. coordinated the project. Yang, J. and Zhang, W.Q. were the leaders of this project. All authors commented on the results and reviewed the manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
The Creative Commons Public Domain Dedication waiver http://creativecommons.org/publicdomain/zero/1.0/ applies to the metadata files associated with this article.
About this article
Cite this article
Yao, M., Wang, Y., Li, X. et al. Materials informatics platform with three dimensional structures, workflow and thermoelectric applications. Sci Data 8, 236 (2021). https://doi.org/10.1038/s41597-021-01022-6
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41597-021-01022-6
- Springer Nature Limited
This article is cited by
-
Dealing with the big data challenges in AI for thermoelectric materials
Science China Materials (2024)
-
High throughput calculations for a dataset of bilayer materials
Scientific Data (2023)
-
Semiclassical electron and phonon transport from first principles: application to layered thermoelectrics
Journal of Computational Electronics (2023)
-
MatHub-2d: A database for transport in 2D materials and a demonstration of high-throughput computational screening for high-mobility 2D semiconducting materials
Science China Materials (2023)
-
High-throughput screening of room temperature active Peltier cooling materials in Heusler compounds
npj Computational Materials (2022)