Application of the PM6 method to modeling the solid state

Stewart, James J. P.

doi:10.1007/s00894-008-0299-7

Application of the PM6 method to modeling the solid state

Original Paper
Open access
Published: 01 May 2008

Volume 14, pages 499–535, (2008)
Cite this article

Download PDF

You have full access to this open access article

Journal of Molecular Modeling Aims and scope Submit manuscript

Application of the PM6 method to modeling the solid state

Download PDF

James J. P. Stewart¹

7199 Accesses
139 Citations
Explore all metrics

Abstract

The applicability of the recently developed PM6 method for modeling various properties of a wide range of organic and inorganic crystalline solids has been investigated. Although the geometries of most systems examined were reproduced with good accuracy, severe errors were found in the predicted structures of a small number of solids. The origin of these errors was investigated, and a strategy for improving the method proposed.

Symmetry-adapted formulation of the hybrid treatment resulting from the G-particle-hole Hypervirial equation and equations of motion methods: a procedure for modeling solids

Article 02 January 2021

Investigation of G4(MP2)-XK theory for antimony compounds’ thermochemistry

Article 15 November 2022

Physical description of the monoclinic phase of zirconia based on the bond-order characteristic of the Tersoff potential

Article 05 February 2021

Introduction

The semiempirical method PM6 [1] was designed primarily for the investigation of molecular species of biochemical interest. That is, the objective of parameter optimization was to reproduce the properties of molecules. When other semiempirical methods, e.g., MNDO [2, 3], AM1 [4], and PM3 [5, 6], were developed, initial reports indicated that they were significantly more accurate than earlier methods. But later, when each new method was used for modeling species that were significantly different from those used in the training set, average errors typically increased quite significantly. This unfortunate result was a natural consequence of the way in which semiempirical method development was done: if, during a survey, a systematic fault was identified, then the training set would be modified in such a way as to correct the fault. The close relationship between the survey set and the training set meant that, by its nature, properties of species in the survey set were reproduced with a higher accuracy than those of species not in the survey set.

During the development of PM6, efforts were made to minimize the potential for this increase in error. Among these were the construction and use of very large survey and training sets. In contrast to previous methods in which the training set was a subset of the survey set, during the development of PM6 the training set was a superset of the survey set.

No solids were used in either the training set or the survey set while PM6 was being developed because inclusion of even one solid in the parameter optimization would have made the whole process extremely slow, which in turn would have precluded optimization of the parameters in any reasonable time. Because of this, solids were excluded from the parameterization, and therefore they form an ideal, clearly defined set of systems for testing the applicability of PM6 to species that were not used in the development of the method.

Theory

There are several problems associated with solid-state calculations that do not exist when discrete molecules are modeled, all of which are related to the fact that there are an infinite number of interacting atoms. The most obvious consequence of this is that the electric potential experienced by each of these atoms is the result of the contributions of an infinitely large number of electrostatic terms arising from the partial charges of all the other atoms. Another implication is that the number of one-electron wavefunctions contributing to the density matrix during the solution of the self-consistent field (SCF) equations is also infinite. Various techniques have been developed for solving these problems. Thus, in all solid-state methods, the assumption is made that the wavefunctions exhibit a perfect periodicity; this assumption is formalized in the Born–von Kármán [7] periodic boundary conditions.

The electrostatic contribution or Madelung integral can be solved analytically using the Ewald sum [8]. In this procedure, an elegant mixture of real and reciprocal space contributions is used in the evaluation of the potential. To assist in the solution of the SCF equations, the near infinite number of occupied wavefunctions contributing to the density matrix is replaced by an integration over the Brillouin zone. In turn, this integration is approximated by a Simpson’s rule technique involving a weighted sampling of points within the zone.

Several complete procedures have been developed for modeling solids using semiempirical methods. One of these, the MOSOL program [9], used sampling of the Brillouin Zone but, because it used complex mathematics, it was impractical for application to anything more complicated than simple binary solids. If the unit cell used is sufficiently large, then, instead of sampling the Brillouin Zone using a regular mesh of points, only one point need be used and, if the point chosen is at the origin of k-space, i.e. the Γ point, then complex mathematics can be avoided entirely. This is the basis for the large unit cell [10] or cluster [11] approximation. More recently, Gale [12] has addressed the problem of solving the Ewald sum when neglect of diatomic differential overlap (NDDO) methods are used, and developed a technique that would allow the Madelung integral to be solved more rapidly. In turn, this has allowed the structures and energies of some crystalline oxides, such as corundum and some of the polymorphs of silica, and of ice, to be modeled.

During the development of a procedure to allow PM6 to be applied to solids, various deficiencies and limitations were found in earlier procedures. Some of these, and the resulting modifications that had to be made in order to allow PM6 to be used for modeling solids, will now be described.

NDDO error

The NDDO methods pioneered by Dewar and Thiel use the Dewar-Sabelli [13–15]-Klopman [16] (DSK) approximation, Eq. 1, which is equivalent to the Ohno approximation [17], for the two-electron two center integral γ _AB involving atoms A and B separated by a distance R _AB.

$$\gamma _{AB} = \frac{1}{{\sqrt {R_{AB}^2 + \frac{1}{4}\left( {\frac{1}{{G_A }} + \frac{1}{{G_B }}} \right)^2 } }}$$

(1)

The DSK approximation has the correct behavior at the extremes. That is, it converges to the exact point-charge expression as the interatomic distance becomes very large, and also converges to the exact two-electron, one-center term, G _A, when the interatomic separation becomes zero. Additionally, it has good behavior at chemical bonding distances. Over the past 40 years, the DSK approximation has proven very successful in NDDO models when applied to both discrete species and to polymers.

Surprisingly, in its unmodified form, the DSK approximation gives rise to an infinite error when applied correctly to any non-elemental crystalline solid. This error arises from the fact that the one-center two-electron integrals differ from element to element. The origin of the error can be understood by considering the potential at an atom A in a simple binary solid, AB, arising from all atoms on the surface of a spherical shell of radius R; in such a solid, if the charge on atom A is Q, then the charge on atoms of type B would be −Q. When R becomes large enough, the fraction of atoms of type A and B at that distance will be the same, and the resulting electric potential, V, at atom A could then be represented by Eq. 2.

$$ V \propto 4\pi R^2 \left( {\frac{Q} {{\sqrt {R^2 + \frac{1} {4}\left( {\frac{1} {{G_A }} + \frac{1} {{G_A }}} \right)^2 } }} - \frac{Q} {{\sqrt {R^2 + \frac{1} {4}\left( {\frac{1} {{G_A }} + \frac{1} {{G_B }}} \right)^2 } }}} \right) $$

(2)

A Taylor series expansion of this function shows that V is proportional to the reciprocal of the distance. For all values of R greater than about 10 Å this potential is clearly very small. In solids, however, the potential at an atom is the result of the summed electric fields of all such shells, out to infinity. For convenience, this sum can be replaced by an integral, Eq. 3.

$$V \propto \int\limits_{R = 10}^\infty {\frac{1}{R}} $$

(3)

The value of this integral is infinity, which means that, if the DSK approximation is used and the integration is done correctly, the potential experienced by an atom of type A arising from the electrostatic contributions of all other atoms would then be either plus infinity or minus infinity, depending on the sign of its partial charge. This is an obviously unphysical result.

This catastrophe can readily be avoided by modifying the DSK approximation to ensure that it converges to the point charge expression for large values of R. The simplest modification would be to ensure that, as the interatomic separation increased, a smooth transition is made from the DSK equation to the exact point charge equation. Several trial functions were examined and, from these, a Gaussian function was selected as having the best characteristics; this function is shown in Eq. 4. Below 5 Å, the unmodified DSK equation would be used; at larger distances, Eq. 4 would be used. This function is well-behaved in that it is single-valued and has finite first and second derivatives.

$$\gamma _{AB} = \frac{1}{4}\left( {1 - e^{ - 0.05\left( {R - 5} \right)^2 } } \right) e^{ - 0.05\left( {R - 5} \right)^2 } \left( {R^2 \frac{1}{4}\left( {\frac{1}{{G_A }} \frac{1}{{G_B }}} \right)^2 } \right)^{{{ - 1} \mathord{\left/{\vphantom {{ - 1} 2}} \right.\kern-\nulldelimiterspace} 2}} $$

(4)

Electrostatic interaction

Evaluating the electrostatic potential at an atom in a crystal involves summing interactions from all surrounding atoms, and since there are, for all practical purposes, an infinite number of these, the direct sum must be replaced by a tractable alternative. The simplest and most efficient method of evaluating the electrostatic potential arising from an infinite lattice of point charges is the Ewald sum [8]. In this summation, the contribution to the potential is divided into two terms, a real-space and a reciprocal- or Fourier-space term. When an appropriate error function is used, the Ewald sum is both accurate and readily evaluated, and is the method of choice when the model used represents the electrostatic potential as the sum of contributions from classical point charges. Gale successfully applied a modified version of the Ewald sum [12] in evaluating the electrostatic potential used in MNDO, AM1, and PM3 solid state calculations.

In all NDDO [18, 19] methods, including PM6, the electrostatic contribution to the potential at an atom arising from the charges on distant atoms can be represented by the classical point charge equation, at small distances by the DSK, and at intermediate distances by the modified DSK approximation. Gale [12] noted that a modification must be made to the potential in order for the Ewald summation to be used in an NDDO method. This change requires the point-charge contribution to the potential of each atom that arises from all nearby atoms to be replaced by the exact NDDO contribution. Derivatives of the energy with respect to geometry require all potential functions to be continuous, but if corrections of the type just described were made, the resulting function would obviously be discontinuous, and further corrections would be needed. So, although the Ewald sum is aesthetically attractive, its practical implementation would necessarily involve aesthetically unattractive corrections.

An alternative to the Ewald sum would be to modify the way in which the electrostatic sum is evaluated. In this approach, use is made of the fact that an integer number of interacting unit cells are used in a solid state calculation. If the DSK equation, either unmodified or modified as in Eq. 4, is used, then the potential at any given atom arising from the direct summation of the NDDO electrostatic terms from all the other atoms would contain artifacts reflecting the asymmetric environment. In other words, the presence of boundary effects introduces spurious terms into the potential. If these terms were not eliminated, they would have a perturbative effect on the optimized geometry that would severely compromise the validity of the results. A method for removing these spurious effects was developed that involves modifying the distance term in the DSK approximation.

The potential experienced by each atom in a solid that arises from the partial charges on other atoms falls off rapidly with increasing distance. This is a natural result of the fact that the net charge arising from all atoms in a spherical shell must rapidly converge to zero as the radius increases. An implication of this is that, for large radii, the precise value of the radius used in evaluating the potential is unimportant. Conversely, when the radius is small, and there are relatively few atoms, the potential arising from the associated partial charges is large. In that case, the value of the interatomic separation used is of great importance. This behavior can be used as the basis for modifying the electrostatic sum. At large distances, because the electrostatic effect of the distant atoms is small, the value of the interatomic distance used in calculating the potential can be different from the actual value, and, in fact, can be set to any arbitrary large fixed value. That is, all potentials arising from distant atoms can be treated as if their partial charges were moved in to the surface of a sphere of fixed radius. A result of this is that the gradient or force arising from a charge that was initially outside the sphere would be exactly zero: any potential motion of the central atom in response to the presence of a charge on the surface of the sphere would be accompanied by a simultaneous motion of that charge. A consequence of this is that the gradient of the potential arising from a charge on the surface of the sphere is precisely zero. This modification of the effective interatomic distance (EID) used in evaluating the electrostatic potential completely eliminates all directional effects, in particular all artifacts arising from the use of a finite number of interacting unit cells.

If no further modifications to the EID were made, then there would be a discontinuity in the gradient arising from the presence of the sphere. The gradient arising from a partial charge just inside the sphere would be finite, but if that charge were to move just outside the sphere, its gradient would now become zero, and there would be a discontinuity. The presence of such discontinuities would then preclude the gradients being used in subsequent operations such as geometry optimization and calculation of vibrational frequencies. To avoid them, the EID must be further modified to ensure that the gradient arising from an atom near the surface of the sphere drops smoothly to zero as the atom approaches the surface of the sphere. This is most simply accomplished by reducing the EID of an atom as it approaches the surface of the sphere.

To summarize: the value of the EID is set to a constant for all atoms separated by a large distance, is set less than the actual distance for intermediate distances, and is equal to the actual distance when the interatomic separation is small. A simple function that satisfies these criteria can be defined using three domains, as shown in Fig. 1.

For atoms that are at a distance less than some predefined value, 2/3C, the exact DSK approximation is used. Between 2/3C and 4/3C, the EID to be used in Eq. 4 would be reduced as shown in Eq. 5.

$$R_{AB}^\prime = 2R_{AB} - C/3 - \frac{{3R_{AB}^2 }}{{4C}}$$

(5)

, and at distances greater than 4/3C the value of the EID would be a constant C. The effect of these changes when applied to an example set of charges is illustrated in Fig. 2, with the original charges shown in black, and the locations of the charges that would be used in evaluating the electrostatic potential shown in green.

Provided enough unit cells are used to ensure that all atoms within the sphere of radius 4/3C are present, the effect of this modification is to remove any directional influence, specifically surface effects, arising from the presence or absence of distant atoms. As with the unmodified DSK equation, the potential arising from an atom at any distance is single-valued and its first derivative is finite. An integer number of unit cells is always used in the evaluation of the electrostatic potential; therefore, the net charge on the surface of the sphere of radius 4/3C precisely counterbalances the sum of all the charges within the sphere, regardless of how many unit cells are outside the sphere. This is a natural and necessary consequence of the requirement that unit cells in a solid must have a zero charge.

The electrostatic potential is, of course, dependent on the value of C. With increasing values of C, the potential converges rapidly to a constant, but also as C increases the number of unit cells that need to be used increases rapidly. The value of C was set to 30 Å, this being the best compromise between computational effort and numerical stability.

Unlike the Ewald summation, this modified DSK approximation can be used directly in evaluating the electrostatic potential. The new approximation is relatively simple in that the use of error functions and reciprocal space terms are avoided.

Solids with unpaired electrons

Many solids, particularly those containing transition metals, have unpaired electrons. Of the two standard methods available for modeling such systems when only molecules are involved, unrestricted Hartree Fock (UHF) and restricted Hartree Fock followed by configuration interaction (RHF-CI), only UHF is suitable for modeling solids. The use of RHF-CI methods is precluded because of the very large active space involved. For example, consider the garnet uvarovite, calcium chromium silicate, Ca₃Cr^III ₂Si₃O_12.. Each chromium ion in this mineral has three unpaired d electrons. The unit cell contains eight formula units, so, if the RHF-CI procedure was used, at a minimum the active space would need to include all 80 molecular orbitals of predominantly d character. Even if the reasonable assumption was made that all the unpaired electrons had the same spin, the number of microstates involved would still be very large, ${{80!} \mathord{\left/ {\vphantom {{80!} {\left( {32!48!} \right) \approx 2.2 \times 10^{22} }}} \right. \kern-\nulldelimiterspace} {\left( {32!48!} \right) \approx 2.2 \times 10^{22} }}$, and evaluation of the gradients for the resulting non-degenerate state would be prohibitively slow.

For solids in which the ions with unpaired electrons are well separated, that is, where the ions are electronically isolated from each other, the assumption can be made that the spin-state of one ion will not interact significantly with the spin-state of any adjacent ion. In addition, if the atom in question is a transition metal ion, then the spin state can usually be inferred from its environment. In the case of uvarovite, each chromium ion is in an almost octahedral environment (the exact symmetry is S₆), being surrounded by six oxygen atoms, so the three d electrons would be in a t_2g manifold, and would therefore be unpaired. If the Hund’s rule assumption is made that the spin state is a maximum, then each chromium atom would be in a local ⁴A_2g state, and any Jahn-Teller tendency to geometric distortion to a lower symmetry would be avoided. This assumption can be formalized in the calculation when a UHF method is used by defining the difference between the number of electrons of α and β spin to correspond to the maximum possible spin state of the entire system. For uvarovite, the unit cell would then be defined as having a spin of M_s = 24, and therefore would have 48 more α than β electrons.

Applications

Organic compounds

Data sets were constructed for each organic compound, with, in each case, the starting geometry being the X-ray structure: i.e., the observed geometry. In contrast to molecular calculations, where internal coordinates are normally used, in the work reported here Cartesian coordinates were used exclusively. An attempt was made initially to use internal coordinates, but the numerical instabilities associated with the geometric gradients at the interfaces of the unit cells rendered their use impractical; no such difficulties were encountered when Cartesian coordinates were used. Each cluster consisted of between 100 and 200 atoms, and geometries were converged until the gradient norm had dropped below 5 kcal mol⁻¹ Å⁻¹, this corresponding to an uncertainty in the optimized geometry of about 0.001 Å. All unit cell parameters were optimized, as were the coordinates of all atoms within the unit cell. Unless indicated otherwise, symmetry was not used to accelerate the optimization. All calculations were done using MOPAC2007 [20] on a 3.6 GHz Pentium computer, and each geometry optimization took between 20 min and 1 day, with most taking about 1 h. No problems were encountered in any of the optimizations.

With the possible exception of polymers, crystalline organic compounds consist of discrete molecules held together by relatively weak forces. As PM6 has been shown [1] to reproduce bond lengths and angles of simple organic compounds with useful accuracy, in this work attention was focused on the prediction of the structures of entire molecules and on the forces and energies arising from intermolecular interactions. A useful measure of accuracy of prediction of molecular structure is the root-mean-square (RMS) difference between the calculated and reference geometries of a single molecule or ion in a crystal. This quantity differs from the geometric quantities reported earlier [1] in that it measures the accuracy of prediction of the overall structure of a molecule, not just the accuracy of prediction of individual bond lengths and angles. It is possible for only relatively small distortions to exist in individual angles and, at the same time, for the overall structure to be severely in error. The RMS error is therefore complementary to the errors in individual bond lengths and angles. In order to probe the suitability of PM6 for modeling organic solids, compounds were selected that illustrate a wide range of common intermolecular interactions, the most important of these being, in order of the energies involved: ionic, hydrogen bonding, and π-stacking.

Densities

Another useful measure of the accuracy of prediction of organic and inorganic solids is the density. In most cases when the density is accurately reproduced the internal structure of the unit cell is also accurately predicted. This is not an infallible rule, in that it is possible for the density to be predicted with good accuracy and, at the same time, the unit cell structure to be significantly distorted. This rare occurrence can usually be detected by distortions of the unit cell parameters. No cases were found where the unit cell parameters were predicted with good accuracy and, at the same time, significant errors existed in the internal structure of the unit cell. A comparison of PM6 and X-ray unit cell parameters for 124 organic solids is presented in Table 1. In this table, the unit cell used was often different from that reported in the literature, particularly so in hexagonal crystals, that is, crystals in which the interface angles are 90°, 90°, and 60°. Unit cells were chosen that would maximize the size of sphere that could be contained in a given cluster; to this end, most hexagonal unit cells were replaced by equivalent orthorhombic unit cells. Predicted densities were reproduced with good accuracy, the average unsigned error in density being 6.9%, with the bulk of this error arising from errors in the calculated intermolecular distances. Although most systems optimized with only small changes in the geometry, in three instances quantitative changes occurred.

Table 1 Calculated and X-ray structural parameters for organic compounds

Application of the PM6 method to modeling the solid state

Abstract

Similar content being viewed by others

Symmetry-adapted formulation of the hybrid treatment resulting from the G-particle-hole Hypervirial equation and equations of motion methods: a procedure for modeling solids

Investigation of G4(MP2)-XK theory for antimony compounds’ thermochemistry

Physical description of the monoclinic phase of zirconia based on the bond-order characteristic of the Tersoff potential

Introduction

Theory

NDDO error

Electrostatic interaction

Solids with unpaired electrons

Applications

Organic compounds

Densities

Heats of formation

Heats of sublimation

Biomolecules

Oligopeptides

Acetylcholine

Adenosine diphosphate

Adenosine triphosphate

Nicotinamide adenine dinucleotide

Hydrogen bonding

Individual types of hydrogen bonds

O–H–O

N–H–N

N–H–O

π−π stacking

Very weak interactions

Polymorphs

Co-crystals

Metal-containing species

Inorganic compounds

Elements

Halides

Oxides

SiO2

H2O

Al2O3

TiO2

B(OH)3

Other AB-type solids

Carbonates, nitrates, and borates

Nitrates

Borates

Molybdates, tungstates chromates vanadates, sulfates, and phosphates

Silicates

Isolated SiO4 tetrahedra: nesosilicates

Double and triple tetrahedra: sorosilicates

Chains: inosilicates

Cyclosilicates

Sheets: phyllosilicates

Frameworks: tectosilicates

Discussion

Systems that are badly predicted

Accuracy of geometry vs hardness

Crystal packing automatically considered

Conclusions

References

Acknowledgments

Open Access

Author information

Authors and Affiliations

Corresponding author

Electronic supplementary material

ESM 1

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation

SiO₂

H₂O

Al₂O₃

TiO₂

B(OH)₃

Isolated SiO₄ tetrahedra: nesosilicates