Optimization of parameters for semiempirical methods V: Modification of NDDO approximations and application to 70 elements

Stewart, James J. P.

doi:10.1007/s00894-007-0233-4

Optimization of parameters for semiempirical methods V: Modification of NDDO approximations and application to 70 elements

Original Paper
Open access
Published: 09 September 2007

Volume 13, pages 1173–1213, (2007)
Cite this article

Download PDF

You have full access to this open access article

Journal of Molecular Modeling Aims and scope Submit manuscript

Optimization of parameters for semiempirical methods V: Modification of NDDO approximations and application to 70 elements

Download PDF

James J. P. Stewart¹

29k Accesses
2913 Citations
12 Altmetric
6 Mentions
Explore all metrics

Abstract

Several modifications that have been made to the NDDO core-core interaction term and to the method of parameter optimization are described. These changes have resulted in a more complete parameter optimization, called PM6, which has, in turn, allowed 70 elements to be parameterized. The average unsigned error (AUE) between calculated and reference heats of formation for 4,492 species was 8.0 kcal mol⁻¹. For the subset of 1,373 compounds involving only the elements H, C, N, O, F, P, S, Cl, and Br, the PM6 AUE was 4.4 kcal mol⁻¹. The equivalent AUE for other methods were: RM1: 5.0, B3LYP 6–31G*: 5.2, PM5: 5.7, PM3: 6.3, HF 6–31G*: 7.4, and AM1: 10.0 kcal mol⁻¹. Several long-standing faults in AM1 and PM3 have been corrected and significant improvements have been made in the prediction of geometries.

A semiempirical method optimized for modeling proteins

Article Open access 22 August 2023

An Introduction and Overview of Basis Sets for Molecular and Solid-State Calculations

Ab Initio, Density Functional Theory, and Semi-Empirical Calculations

Introduction

Over the past 30 years, NDDO-type [1, 2] semiempirical methods have evolved steadily. The earliest of these methods was MNDO [3, 4], which itself was a major advance over even earlier non-NDDO methods such as MINDO/3 [5]. The main advantage of MNDO over earlier methods was that the values of the parameters were optimized to reproduce molecular rather than atomic properties. When it first appeared, MNDO was immediately popular because of its increased accuracy, but, with the passage of time, various limitations were found, among the most important of which was the almost total absence of a hydrogen bond. As hydrogen bonding is essential to life, this particular fault essentially precluded MNDO being used in modeling biochemistry.

In 1985 an attempt, AM1 [6], was made to improve MNDO by adding a stabilizing Gaussian function to the core-core interaction to represent the hydrogen bond. Despite the fact that this was an over-simplification of a very complicated phenomenon, the overall effect was similar, and for the first time NDDO methods gave a good, albeit limited, model of hydrogen bonding.

In the course of the next several years, improvements were made to the method of parameter optimization. The result of this was the PM3 method [7–10], which culminated in the parameterization of all the elements in the main group in 2004 [11]. At the same time, various changes to the original set of approximations used in MNDO were proposed, the most important of which were the addition of d-orbitals to main-group elements [12, 13] and the introduction of diatomic parameters. Work started on the transition metals, and parameters for some of these have been reported [14, 15]. More recently, parameter sets tailored to reproduce specific phenomena such as the binding energy of nucleic acid base pairs [16], iron complex catalyzed hydrogen abstraction [17], phosphatase-catalyzed reaction barriers [18], and the redox properties of iron containing proteins [19] have been developed.

Because of the way advances in NDDO developments occurred, in terms of the modifications of the approximations and the extensions to specific elements or groups of elements, there has been an inevitable lack of consistency. The aim of the current work was three-fold: to investigate the incorporation of some of the reported modifications to the core-core approximations into the NDDO methodology; to carry out a systematic global parameter optimization of all the main group elements, with emphasis on compounds of interest in biochemistry; and to extend the methodology by performing a restricted optimization of parameters for the transition metals. This resulted in the development of a new method, consisting of the final set of approximations used and the optimized parameters. This method will be referred to as parametric method number 6, or PM6. The name PM6 was chosen to avoid any confusion with two other unpublished methods, PM4 and PM5.

Theory

Despite the apparent complexity of semiempirical methods, there are only three possible sources of error: reference data may be inaccurate or inadequate, the set of approximations may include unrealistic assumptions or be too inflexible, and the parameter optimization process may be incomplete. In order for a method to be accurate, all three potential sources of error must be carefully examined, and, where faults are found, appropriate corrective action taken.

Reference data

In contrast to earlier methods, in which reference data was assembled by painstakingly searching the original literature, the current work relies heavily on the large compendia of data that have been developed in recent years. The most important of these are the WebBook [20], for thermochemistry, and the Cambridge Structural Database [21] (CSD), for molecular geometries.

During the early stages of the current work, consistency checks were performed to ensure that erroneous data were not used. These checks revealed many cases in which the calculated heats of formation were inconsistent with the reference heats of formation reported in the NIST database. On further checking, many of these reference data were also found [22, 23] to be inconsistent with other data in the WebBook. In those cases where there was strong evidence of error in the reference data, the offending data were deleted, and the webbook updated [24].

For molecular geometries, gas phase reference data are preferred, but in many instances such data were unavailable, and recourse was made to condensed-phase data. Provided that care was taken to exclude those species whose geometries were likely to be significantly distorted by crystal forces, or which carried a large formal charge, condensed-phase data of the type found in the CSD were regarded as being suitable as reference data.

Because earlier methods used only a limited number of reference data, most of the cases where the method gave bad results were not discovered until after the method was published. In an attempt to minimize the occurrence of such unpleasant surprises, the set of reference data used was made as large as practical. To this end, where there was a dearth or even a complete absence of experimental reference data, recourse was made to high level calculations. Thus, for the Group VIII elements, there are relatively few stable compounds, and the main phenomena of interest involve rare gas atoms colliding with other atoms or molecules, so reference data representing the mechanics of rare gas atoms colliding with other atoms was generated from the results of ab-initio calculations. Additionally, there is an almost complete lack of thermochemical data for many types of complexes involving transition metals, so augmenting what little data there was with the results of ab-initio calculations was essential.

Use of Ab-Initio results

Ab-initio calculations provide a convenient source of reference data; for this work, extensive use has been made of results of Hartree Fock and B3LYP density functional [25, 26] methods (DFT), both with the 6–31G(d) basis set for elements in the periodic table up to argon. For systems involving heavier elements, the B88–PW91 functional [27, 28] was used with the DZVP basis set. Within the spectrum of ab-initio methods these methods are not particularly accurate; many methods with larger basis sets and with post-Hartree-Fock corrections are more accurate. However, the methods used in this work were chosen because they were regarded as robust, practical methods, allowing many systems to be modeled in a reasonable amount of time, a condition that could not be achieved with the more sophisticated ab-initio methods.

Procedure used in deriving ΔH_f

Reference heats of formation, ΔH_f, for compounds and ions of elements for which there was a paucity of data were derived from DFT total energies in two stages. In the first stage, a basic set of ∼1,400 well-behaved compounds, for which reliable reference values of experimental ΔH_f were available, was assembled. Only compounds containing one or more of the elements H, C, N, O, F, P, S, Cl, Br, and I were used. For this set, a root-mean-square fit was made to the reference ΔH_f using the calculated total energies, E _tot and the atom counts. Thus, the error function, S, in Eq. (1) was minimized.

$$ S = {\sum\limits_j {{\left( {\Delta H_{j} {\left( {\operatorname{Re} {\text{f}}{\text{.}}} \right)} - 627.51{\left( {E_{{{\text{Tot}}}} + {\sum\limits_i {C_{i} n_{i} } }} \right)}} \right)}^{2}_{j} } }$$

(1)

In this expression, the C _i are constants for each atom of type i, and the n _i are the number of atoms of that type.

In the second stage, the contribution to the total energy of compounds containing element X arising from the elements in the first stage was removed using the coefficients from Equation (1). A second RMS fit was then performed. In this, the function minimized, S, was the RMS difference between the reference ΔH_f of compound X and the values predicted from the DFT energy, Eq. (2).

$$S = {\sum\limits_j {{\left( {\Delta H_{j} {\left( {\operatorname{Re} {\text{f}}{\text{.}}} \right)} - 627.51{\left( {E_{{{\text{Tot}}}} + {\sum\limits_i {C_{i} n_{i} } } + C_{x} n_{x} } \right)}} \right)}^{2}_{j} } }$$

(2)

In this expression, the only unknown is the multiplier coefficient C _x. After solving for C _x, the ΔH_f of any compound of X could then be predicted as soon as its DFT total energy was evaluated.

Training set reference data

The training set of reference data used was considerably larger than that used in parameterizing PM3 [7, 8], where approximately 800 discrete species were used. In optimizing the parameters for PM6, somewhat over 9,000 separate species were used, of which about 7,500 were well-behaved stable molecules. The remainder consisted of reference data that were tailored to help define the values of individual parameters or sets of parameters.

Use of rules in parameter optimization

Most reference data can be expressed as simple facts. Indeed, all the earlier NDDO methods were parameterized using precisely four types of reference data: ΔH_f, molecular geometries, dipole moments, and ionization potentials. During the development of PM6, however, the use of other types of reference data was found to be necessary. Because of their behavior, these new data are best described as “rules.” In this context, a rule can therefore be regarded as a reference datum that is a function of one or more other data. To illustrate the use of a rule, consider the binding energy of a hydrogen bond in the water dimer. By default, the weighting factor for ΔH_f for normal compounds is 1.0 kcal mol⁻¹. With this weighting factor, average unsigned errors in the predicted ΔH_f of the order of 3–5 kcal mol⁻¹ would be acceptable, particularly as the spectrum of values of ΔH_f spans several hundreds of kilocalories per mole. However, the binding energy of a hydrogen bond in a water dimer is only 5 kcal mol⁻¹. To have an average unsigned error (AUE) of 4 kcal mol⁻¹ in the prediction of hydrogen bond energies would render such a method almost useless for modeling such phenomena.

One way to increase the importance of the hydrogen bond in water would be to increase the weight for the ΔH_f of the water molecule, −57.8 kcal mol⁻¹, and the water dimer system, ca. −120.6 kcal mol⁻¹. While this would have the intended effect of increasing the weight of the hydrogen bond energy, it would also have the undesired effect of increasing the weight of the ΔH_f of water.

An alternative would be to express the ΔH_f of the water dimer in terms of the ΔH_f of two individual water molecules. The difference between the two ΔH_f, that of water dimer and that of two isolated water molecules, would be the energy of the hydrogen bond. If the weight assigned to this quantity were then increased, it would increase the weight for the hydrogen bond energy without also increasing the weight for the ΔH_f of water. Such a reference datum is referred to here as a rule. That is, rules relate the ΔH_f of a moiety to that of one or more other moieties. Thus, in the above example, the simple reference datum H, representing the ΔH_f of an isolated water molecule, could be expressed as:

$${\text{H}} = - 57.8$$

Using a rule-based reference datum to represent the strength of the hydrogen bond, and giving a weight of 10 to the hydrogen bond energy, the ΔH_f of the water dimer would then be defined as

$${\text{H = 10}}{\left( {{\text{ - 5 + H}}_{{{\text{H2O}}}} {\text{ + H}}_{{{\text{H2O}}}} } \right)}$$

In this expression, H_H2O was the calculated ΔH_f, in kcal mol⁻¹, of an isolated water molecule. This rule could be interpreted as “The calculated strength of the hydrogen bond formed when two water molecules form the dimer should be 5 kcal mol⁻¹, and the importance should be 100 times that of ordinary heats of formation.”

Rules are very useful in defining the parameter hypersurface. Examples of such tailoring are as follows:

Correcting qualitatively incorrect predictions

During the parameterization of transition metals, some systems were predicted to have qualitatively the wrong structure. For example, [Cu^IICl₄]²⁻ was initially predicted to have a tetrahedral structure, instead of the D_2d geometry observed. To induce the parameters to change so as to make the D_2d geometry more stable than the T_d geometry, a rule was added to the set of reference data for copper compounds. This rule was constructed using the results of B3LYP calculations on [Cu^IICl₄]²⁻. First, the total energies of the optimized B3LYP structure and that of the structure resulting from the semiempirical calculation were evaluated. The difference between these energies was then used in constructing the rule. In this case, the rule was that “The ΔH_f of the geometry predicted by the faulty semiempirical method should be n.n kcal mol⁻¹ more than that of the B3LYP geometry.” When such a rule was included in the parameter optimization, with an appropriate large weight, any tendency of the parameters to predict the incorrect geometry resulted in a large contribution to the error function. That is, with the new rule in place, there was a strong disincentive to prediction of the incorrect structure. Usually one rule was sufficient to correct most qualitative errors, but for a few complicated structures more than one rule was needed. The commonest need for multiple rules occurred when, initially, one rule was used to correct a faulty prediction and, after re-optimizing the parameters, the geometry optimized to a new structure that was distinctly different from either the correct structure or the incorrect structure covered by the rule. When that happened, the procedure just described was repeated, and a new rule added to the set of reference data to address the new incorrect structure. In extreme cases, several such rules might be needed, each one defining a geometry that was incorrect and should therefore be avoided.

Rare gas atoms at sub-equilibrium distances

For some elements, specifically those of Group VIII, there is an understandable shortage of useful experimental reference data. In addition, most simulations involving these elements are likely to involve a rare-gas atom dynamically interacting with another atom or with a molecule at distances significantly less than the equilibrium distance. This makes determining the potential energy surface at sub-equilibrium distances important. As with hydrogen bond energies, the energies involved in this domain are likely to be in the order of a few kcal mol⁻¹. The shape of the potential energy surface (PES) can readily be mapped using DFT methods. By selecting two or three representative points on this PES, reference data rules can be constructed that describe the mechanical properties of the interactions. As with hydrogen bonding, a large weight can be assigned to these rules.

Use of rules to restrain parameter values

In general, uncharged atoms that are separated by a distance sufficiently large so that all overlaps between orbitals on the two atoms are vanishingly small will not interact significantly, and what interaction energy exists would arise from VDW terms: of their nature, these are mildly stabilizing. Although statements of this type are obviously true, when they are expressed as rules and added to the training set of reference data they can help define the parameter values. For a pair of atoms, A and B, a simple diatomic system would be constructed in which the interatomic separation was the minimum distance at which any overlaps of the atomic orbitals would still be insignificant. The electronic state of such a system would then be the sum of the states of the two isolated atoms. Thus, if both A and B were silicon, then, since the ground state of an isolated silicon atom is a triplet, the combined state would be a quintet. Because the two atoms do not interact significantly, a rule could then be constructed that said “The energy of the diatomic system is equal to the addition of energies of the two individual systems.” By giving this rule a large weight, any tendency of the method to generate a spurious attraction or repulsion between the atoms would be prevented.

Atomic energy levels

In keeping with the philosophy that a large amount of reference data should be used in the parameter optimization, spin-free atomic energy levels were used for most elements. The exceptions were carbon, nitrogen, and oxygen, where there were enough conventional reference data that the addition of atomic energy levels would not significantly improve the definition of the parameter surface.

NDDO approximations do not allow for spin-orbit coupling. Therefore, spin-free levels were needed. For a few elements, there were insufficient spin states to allow the spin-free energy levels to be calculated. For all the remaining elements, spin-free energy levels were calculated.

In Moore’s compendia [29–31] of atomic energy levels, observed emission spectra were used in determining the energy levels of the various states of neutral and ionized atoms. Most of these energy levels were characterized by three quantum numbers: the spin and orbital angular momenta, and the “J” or spin-orbit quantum number. The starting point for determining the spin-free atomic energy levels for a given element consisted of identifying each complete manifold of atomic energy levels for that element, that is, each set of levels split by spin-orbit coupling. If all members of the set were present, i.e., all energy levels from L+S to |L−S|, then the weighted barycenter of energy could be calculated. The spin-free energy level, E, was derived from the spin-split levels E(S,L,J) using Eq. (3).

$$ E = \frac{1} {{{\left( {2S + 1} \right)}{\left( {2L + 1} \right)}}}{\sum\limits_{J = {\left| {L - S} \right|}}^{L + S} {{\left( {2J + 1} \right)}E{\left( {S,L,J} \right)}} } $$

(3)

In those cases where the ground state of an atom was itself a member of a spin-split manifold, the barycenter of the ground state manifold was calculated and used in re-defining the spin-free ground state. For all elements except tungsten, this change in definition was benign. There is a ⁷S₃ level present in tungsten that is located only 8.4 kcal mol⁻¹ above the ground state. This puts it inside the ⁵D_J, manifold, which has a barycenter at 12.7 kcal mol⁻¹. The effect of this was that, on going from a spin-split to a spin-free ground state, the ground state changed from 6d ²5d ⁴ or ⁵D to 6d ¹5d ⁵ or ⁷S, and the ⁵D state now became an excited state with an energy of 4.4 kcal mol⁻¹. To allow for this, a corresponding change was made to the ground state configuration in the PM6 definition of tungsten.

Where there were relatively few other reference data, the singly-ionized, and, in rare cases, the doubly-ionized, spin-free states were also evaluated and used as reference data.

Each energy level contributed one reference datum to the training set. Most atoms have a large number of atomic energy levels, so in order to minimize the probability that a level might be incorrectly assigned, each level was labeled with three quantum numbers: the total spin momentum, the total angular momentum, and the principal quantum number for these two quantum numbers. These were compared with the corresponding values calculated from the state functions. Since each set of three quantum numbers is unique, the potential for miss-assignment was minimized. In rare cases, particularly during the early stages of parameter optimization, two states with the same total spin and angular quantum numbers would be interchanged, with the result that the calculated principal quantum number would also be interchanged. All such cases always involved the ground state, and were quickly identified and corrected.

Approximations

Most of the approximations used in PM6 are identical to those in AM1 and PM3. The differences are:

Core-core interactions

In the original MNDO set of approximations, two changes were made to the simple point-charge expression for the core-core repulsion term. Beyond about five Ångstroms, there should be no significant interaction of two neutral atoms. However, in MNDO, the two-electron, two-center $\left\langle {\left. {s_{A} s_{A} } \right|\left. {s_{B} s_{B} } \right\rangle } \right.$ integrals and the electron-core interactions do not converge to the exact point charge expression; instead, they are always slightly smaller. To prevent there being a small net repulsion between two uncharged atoms, the core-core expression is modified by the exact 1/R_AB term being replaced by the term used in the $\left\langle {\left. {s_{A} s_{A} } \right|\left. {s_{B} s_{B} } \right\rangle } \right.$ integrals. An additional term is needed to represent the increased core-core repulsion at small distances due to the unpolarizable core. These two changes can be expressed as the MNDO core-core repulsion term as shown in Eq. (4).

$$E_{n} {\left( {A,B} \right)} = Z_{A} Z_{B} \left\langle {\left. {s_{A} s_{A} } \right|\left. {s_{B} s_{B} } \right\rangle } \right.{\left( {1 + e^{{ - \alpha _{A} R_{{AB}} }} + e^{{ - \alpha _{B} R_{{AB}} }} } \right)}$$

(4)

This approximation works well for most main-group elements, but when molybdenum was being parameterized, Voityuk [14] found that the errors in heats of formation and geometries were unacceptably large, and good results were achieved only when a diatomic term was added to the core-core approximation, as shown in Eq. (5).

$$E_{n} {\left( {A,B} \right)} = Z_{A} Z_{B} \left\langle {\left. {s_{A} s_{A} } \right|\left. {s_{B} s_{B} } \right\rangle } \right.{\left( {1 + x_{{AB}} e^{{ - \alpha _{{AB}} R_{{AB}} }} } \right)}$$

(5)

When PM3 parameters for elements of Groups IA were being optimized, the MNDO approximation to the core-core expression was found to be unsuitable. In these elements there is only one valence electron so the core charge is the same as that of hydrogen. A consequence of this was that the apparent size of these elements was also approximately that of a hydrogen atom, in marked contrast with observation. For these elements, diatomic core-core parameters were also found to be essential.

Further examination showed that when diatomic parameters were used, there was always an increase in accuracy; therefore, in the current work, Eq. (4) was replaced systematically by Eq. (5).

As the interatomic separation increased, Voityuk’s equation converged to the exact point-charge interaction, as expected. However, for rare gas interactions, an increase in accuracy was found when the rate of convergence was increased by the addition of a small perturbation. Subsequently, the perturbed function was found to be generally beneficial. Because of this, the general form of the core-core interaction used in PM6 is that given in Eq. (6).

$$E_{n} {\left( {A,B} \right)} = Z_{A} Z_{B} \left\langle {\left. {s_{A} s_{A} } \right|\left. {s_{B} s_{B} } \right\rangle } \right.{\left( {1 + x_{{AB}} e^{{ - \alpha _{{AB}} {\left( {R_{{AB}} + 0.0003R^{6}_{{AB}} } \right)}}} } \right)}$$

(6)

At normal chemical bonding distances, Eqs. (5) and (6) have essentially similar behavior, but at distances of greater than about 3 Å the effect of the perturbation is to make the PM6 function significantly smaller than the Voityuk approximation.

d-orbitals on main-group elements

Thiel and Voityuk have shown [13] that a large increase in accuracy results when d-orbitals are added to main-group elements that have the potential to be hypervalent. During preliminary stages of this work, d-orbitals were excluded from main-group elements, and the parameters were optimized. This work was then repeated but with d-orbitals on various main-group elements. The results were in accordance with Thiel’s observation: the accuracy of the method increased significantly. Because of this, d-orbitals were added to several main-group elements: the value of the increased accuracy far outweighs the extra computational cost.

The effect of the addition of d-orbitals was fundamentally different between main-group elements and transition metals. For main-group elements, the effect of d-orbitals is merely a perturbation: to a large degree the chemistry of these elements is determined by the s and p atomic orbitals. This is not the case with transition metals, where the d-orbitals are of paramount importance and the s and p orbitals are of only very minor significance. In recognition of the importance of the s and p shells in main-group chemistry, specific parameters are used for the five one-center two-electron integrals. Conversely, for the transition metals, the values of these integrals are derived directly from the internal orbital exponents.

Unpolarizable core

As noted earlier, the NDDO core-core interaction is a function of the number of valence electrons. For elements on the left of the periodic table these numbers are small and can cause the elements to appear to be too small. This was part of the rationale behind the adoption of Voityuk’s diatomic core-core parameters. However, even the Voityuk approximation failed during parameter optimization when, in rare cases, a pair of atoms would approach each other very closely. Examination of these catastrophes indicated that the cause was the complete neglect of the unpolarizable core of the atoms involved. To allow for its presence, the core-core interaction for all element pairs was modified by the addition of a simple function, f _AB, based on the first term of the Lennard-Jones potential [32]. A candidate function was constructed, Eq. (7), using the fact that, to a first approximation, the size of an atom increases as the third power of its atomic number.

$$ f_{{AB}} = c{\left( {\frac{{{\left( {Z^{{\raise0.7ex\hbox{$1$} \!\mathord{\left/ {\vphantom {1 3}}\right.\kern-\nulldelimiterspace} \!\lower0.7ex\hbox{$3$}}}_{A} + Z^{{\raise0.7ex\hbox{$1$} \!\mathord{\left/ {\vphantom {1 3}}\right.\kern-\nulldelimiterspace} \!\lower0.7ex\hbox{$3$}}}_{B} } \right)}}} {{R_{{AB}} }}} \right)}^{{12}} $$

(7)

The value of c was set to 10⁻⁸, this being the best compromise between the requirements that the function should have a vanishingly small value at normal chemical distances. That is, under normal conditions the value of the function should be negligible, and at small interatomic separations the function should be highly repulsive, i.e., that it should represent the unpolarizable core.

Individual core-core corrections

For a small number of diatomic interactions, the general expression for the core-core interaction was modified in order to correct a specific fault. Because it is desirable to keep the methodology as simple as possible, modifications of the approximations were made only after determining that the existing approximations were inadequate. The diatomic specific modifications were:

O–H and N–H

In the original MNDO formalism, the general core-core interaction, Eq. (4), was replaced in the cases of O–H and N–H pairs with Eq. (8).

$$E_{n} {\left( {A,B} \right)} = Z_{A} Z_{B} \left\langle {\left. {s_{A} s_{A} } \right|} \right.\left. {s_{B} s_{B} } \right\rangle {\left( {1 + R_{{AB}} e^{{ - \alpha _{A} R_{{AB}} }} + R_{{AB}} e^{{ - \alpha _{B} R_{{AB}} }} } \right)}$$

(8)

An unintended effect of this change was that at distances where hydrogen-bonding interactions are important, the diatomic contribution to the ΔH_f is greater than if the general approximation, Eq. (4), had been used. This contributed to a reduced hydrogen-bonding interaction in MNDO, and was a contributor to the need for modified core-core interactions in AM1 and PM3.

In PM6, the MNDO core-core approximation is replaced by Voityuk’s diatomic expression, but even with that modification, the resulting hydrogen bond interaction energy was too small. In an attempt to increase it, the Voityuk approximation was replaced by Eq. (9).

$$E_{n} {\left( {A,B} \right)} = Z_{A} Z_{B} \left\langle {\left. {s_{A} s_{A} } \right|\left. {s_{B} s_{B} } \right\rangle } \right.{\left( {1 + x_{{AB}} e^{{ - \alpha _{{AB}} R^{2}_{{AB}} }} } \right)}$$

(9)

At normal O–H and N–H separations, approximately 1 Å, Eqs. (5) and (9) have similar values, but at hydrogen bonding distances, ∼2 Å, the contribution arising from the exponential term is significantly reduced, resulting in a corresponding increased hydrogen bond interaction energy.

C–C

After optimizing all parameters, it was found that compounds containing yne groups, -C≡C-, were predicted to be too stable by about 10 kcal mol⁻¹ per yne group. This error was unique to compounds with extremely short C–C distances, and in light of the increased emphasis on accurately reproducing the properties of organic compounds, the C–C core-core term was perturbed by the addition of a repulsive term. This term was optimized to correct the error in the yne groups and to have a negligible effect on all other C–C interactions. The optimized form of the C–C core-core interaction is given in Eq. (10).

$$E_{n} {\left( {A,B} \right)} = Z_{A} Z_{B} \left\langle {\left. {s_{A} s_{A} } \right|} \right.\left. {s_{B} s_{B} } \right\rangle {\left( {1 + x_{{AB}} e^{{ - \alpha _{{AB}} {\left( {R_{{AB}} + 0.0003R^{6}_{{AB}} } \right)}}} + 9.28e^{{ - 5.98R_{{AB}} }} } \right)}$$

(10)

Si–O

During testing of PM6, neutral silicate layers of the type found in talc, H₂Mg₃Si₄O₁₂, were found to be slightly repulsive instead of being slightly bound. An attempt was made to correct for this error by adding a weak perturbation to the Si–O interaction, illustrated by Eq. (11).

$$E_{n} {\left( {A,B} \right)} = Z_{A} Z_{B} \left\langle {\left. {s_{A} s_{A} } \right|} \right.\left. {s_{B} s_{B} } \right\rangle {\left( {1 + x_{{AB}} e^{{ - \alpha _{{AB}} {\left( {R_{{AB}} + 0.0003R^{6}_{{AB}} } \right)}}} - 0.0007e^{{ - {\left( {R_{{AB}} - 2.9} \right)}^{2} }} } \right)}$$

(11)

Nitrogen sp ² pyramidalization

Although PM6 predicted the degree of pyramidalization of primary amines correctly, it overestimated the pyramidalization of secondary and tertiary amines. The degree of pyramidalization of these amines was decreased by adding a function to make the calculated ΔH_f more negative as the nitrogen became more planar, as shown in Eq. (12).

$$ \Delta {H}\ifmmode{'}\else$'$\fi_{f} = \Delta H_{f} - 0.5e^{{ - 10\phi }} $$

(12)

In this equation, the angle ϕ is a measure of the non-planarity of the nitrogen environment, and is given by 2π minus the sum of the three contained angles about the nitrogen atom. For planar sp ² secondary and tertiary amines, this correction amounted to 0.5 kcal mol⁻¹ per nitrogen atom.

More elements

The NDDO basis sets of many of the elements parameterized in PM6 have not previously been described. For all elements except hydrogen, which has only an s orbital, the basis set consists of an s orbital, three p orbitals, and, for most elements, a set of five d orbitals. Slater atomic orbitals are used exclusively; these are of form:

$$ \varphi = \frac{{{\left( {2\xi } \right)}^{{n + \raise0.7ex\hbox{$1$} \!\mathord{\left/ {\vphantom {1 2}}\right.\kern-\nulldelimiterspace} \!\lower0.7ex\hbox{$2$}}} }} {{{\left( {{\left( {2n} \right)}!} \right)}^{{\raise0.7ex\hbox{$1$} \!\mathord{\left/ {\vphantom {1 2}}\right.\kern-\nulldelimiterspace} \!\lower0.7ex\hbox{$2$}}} }}r^{{n - 1}} e^{{ - \xi r}} Y^{m}_{l} {\left( {\theta ,\phi } \right)} $$

Where ξ is the orbital exponent, n is the principal quantum number (PQN), and the Y _l ^m(θ, ϕ) are the normalized real spherical harmonics. The PQN are those of the valence shell, i.e., the set of atomic orbitals most important in forming chemical bonds. For PM6, the PQN used are shown in Table 1. For most main-group elements, the s and p PQN are the same, and, when d orbitals are present, all three PQN are the same: that is, the PQN are (ns, np, nd). For transition metals, the d PQN is one less than that of the s and p shells, i.e., (ns, np, (n–1)d). An exception to this generalization occurs in the elements of Group VIII. Here, the valence shell is completely filled, so in all chemical interactions that could occur between an atom of a Group VIII element and any other atom, electron density could only migrate from the Group VIII element to the other atom. That is, when a rare gas element forms any type of chemical bond it would necessarily become slightly positive. This is an unrealistic result. In order to allow rare gas atoms to have the potential of being slightly negative, the set of valence orbitals was changed from (ns, np) to (np, (n+1)s), for the elements Ne, Ar, Kr, and Xe. Helium is the only exception to this change, because it does not have a “1p” valence shell. For helium, the valence shell used was (1s, 2p), this being considered the best compromise.

Table 1 Principal quantum numbers for atomic orbitals

Optimization of parameters for semiempirical methods V: Modification of NDDO approximations and application to 70 elements

Abstract

Similar content being viewed by others

A semiempirical method optimized for modeling proteins

An Introduction and Overview of Basis Sets for Molecular and Solid-State Calculations

Ab Initio, Density Functional Theory, and Semi-Empirical Calculations

Introduction

Theory

Reference data

Use of Ab-Initio results

Procedure used in deriving ΔHf

Training set reference data

Use of rules in parameter optimization

Correcting qualitatively incorrect predictions

Rare gas atoms at sub-equilibrium distances

Use of rules to restrain parameter values

Atomic energy levels

Approximations

Core-core interactions

d-orbitals on main-group elements

Unpolarizable core

Individual core-core corrections

O–H and N–H

C–C

Si–O

Nitrogen sp 2 pyramidalization

More elements

Parameter optimization

Background

Sequence of optimization of parameters

Parameters that determine atomic electronic properties

Parameters that determine molecular electronic properties

Parameters that determine geometries

Results

Parameters for PM6

Accuracy

Comparison with other semiempirical methods

Comparison with AM1*

Comparison with RM1

Comparison with high-level methods

Hydrogen bonding

Nitrogen pyramidalization

Transition metals

Sets of transition metals

Group IIIA: Scandium, Yttrium, Lanthanum, and Lutetium

Group IVA: Titanium, Zirconium, and Hafnium

Group VA: Vanadium, Niobium, and Tantalum

Group VIA: Chromium, Molybdenum, and Tungsten

Group VIIA: Manganese, Technetium, and Rhenium

Group VIIIA: Iron, Cobalt, Nickel, Ruthenium, Rhodium, Palladium, Osmium, Iridium, and Platinum

Group IB: Copper, Silver, and Gold

Group IIB Zinc, Cadmium, and Mercury

Discussion

Methodological changes

Elimination of computational artifacts

Accuracy

Permanent errors

Conclusions

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Electronic supplementary material

ESM

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation

Procedure used in deriving ΔH_f

Nitrogen sp ² pyramidalization