Avoiding pitfalls of a theoretical approach: the harmonic oscillator measure of aromaticity index from quantum chemistry calculations

Andrzejak, Marcin; Kubisiak, Piotr; Zborowski, Krzysztof K.

doi:10.1007/s11224-012-0148-2

Original Research
Open access
Published: 14 October 2012

Structural Chemistry, volume 24, pages 1171–1184 (2013)

Abstract

The concept of the harmonic oscillator measure of aromaticity (HOMA) is based on comparing the geometrical parameters of a studied molecule with the parameters for an ideal aromatic system derived from bond lengths of the reference molecules. Nowadays, HOMA is routinely computed by combining the geometries from quantum chemistry calculations with the experimentally based parameterization. The values of HOMA thus obtained, however, are bound to suffer from inaccuracies of the theoretical methods and strongly depend on computational details. This could be avoided by obtaining both the input geometries and the parameters with the same theoretical method, but the efficiency of the error compensation achieved in this way has not yet been probed. In our work, we have prepared a benchmark set of HOMA values for 25 cyclic hydrocarbons, based on the all core CCSD(T)/cc-pCVQ(T)Z geometries, and used it to investigate the impact of different choices of the exchange–correlation functionals and basis sets on HOMA, calculated against the experimentally based (HOMA^EP) or the consistently calculated (HOMA^CCP) parameters. We show that using HOMA^EP leads to large and unsystematic errors, and strong sensitivity to the choice of XC functional, basis set, and the experimental data for the reference geometry. This sensitivity is largely, although not completely, attenuated in the consistent approach. We recommend the most suitable functionals for calculating HOMA in both approaches (HOMA^EP and HOMA^CCP), and provide the HOMA parameters for 25 studied exchange–correlation functionals and two popular basis sets.

Introduction

The concept of aromaticity, introduced in 1855 by Hofmann [1], has been one of the most momentous ideas in organic chemistry. Geometric indices quantify aromaticity by exploiting the fact that in non-aromatic systems the single and double bonds are clearly defined and have distinctly different lengths, whereas in aromatic systems the lengths of the nominally single and double bonds are similar or even equal to one another. Probably the most popular of the geometric indices of aromaticity is the harmonic oscillator measure of aromaticity (HOMA) index, introduced and developed by Krygowski et al. [2–7]. The value of HOMA for an n-membered unsaturated ring is based on the lengths of the individual bonds l_i, according to the formula:

$$ {\text{HOMA}} = 1 - \frac{1}{n}\sum\limits_{i = 1}^{n} \alpha_{i} \left( l_{i} - l_{i,opt} \right)^{2}, \tag{1} $$

in which the proportionality constants α_i and the optimum aromatic bond lengths $ l_{i,opt} $ are parameters that have to be determined independently for each pair of atoms (e.g., CC, CN, CO, NO) that form the bonds within the ring. Thus, the HOMA parameterization is based on carefully selected reference systems [3]. The optimum bond length between a given pair of atoms was originally defined as $ l_{opt} = (2l_{2} + l_{1} )/3 $, and the constant as $ \alpha = 2/\left[ (l_{1} - l_{opt} )^{2} + (l_{2} - l_{opt} )^{2} \right] $. Here l₁ and l₂ are the lengths of a nominally single and a nominally double bond, respectively, present in the reference molecule(s). The constant α is designed to give HOMA = 0 for the Kekulé structure of a typical aromatic system and HOMA = 1 for a system with all bond lengths equal to the optimum value $ l_{opt} $. The formula for the optimum bond length was first derived under the assumption that the force constant for l₂ is twice the force constant for l₁. This assumption, however, is satisfied only approximately, and an improved optimum bond length can be calculated from the formula $ l_{opt} = (\omega l_{2} + l_{1} )/(1 + \omega ) $, in which ω = w₂/w₁ denotes the ratio of the force constants for the shorter and longer reference bonds, respectively [6]. The improved optimum bond lengths then lead to modified values of the constants α.
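As a minimal sketch of Eq. 1 and of the parameter definitions above for an all-carbon ring (a single bond type), the following Python functions derive l_opt and α from a pair of reference bond lengths and then evaluate HOMA. The function names are illustrative and not taken from the paper; the reference values are the electron diffraction bond lengths of butadiene quoted later in the text (l₁ = 1.467 Å, l₂ = 1.349 Å), and it is assumed that α keeps the same normalization for any ω.

```python
from typing import Sequence, Tuple


def homa_parameters(l1: float, l2: float, omega: float = 2.0) -> Tuple[float, float]:
    """Return (l_opt, alpha) from reference single (l1) and double (l2) bond lengths.

    omega is the force constant ratio w2/w1; omega = 2 reproduces the
    original definition l_opt = (2*l2 + l1)/3. Lengths are in angstrom.
    """
    l_opt = (omega * l2 + l1) / (1.0 + omega)
    alpha = 2.0 / ((l1 - l_opt) ** 2 + (l2 - l_opt) ** 2)
    return l_opt, alpha


def homa(bonds: Sequence[float], l_opt: float, alpha: float) -> float:
    """Eq. 1 for a ring whose bonds all share one (l_opt, alpha) pair."""
    n = len(bonds)
    return 1.0 - alpha * sum((l - l_opt) ** 2 for l in bonds) / n


# Reference bond lengths of trans-1,3-butadiene (electron diffraction values).
L1_REF, L2_REF = 1.467, 1.349
l_opt, alpha = homa_parameters(L1_REF, L2_REF)

kekule_ring = [L1_REF, L2_REF] * 3  # alternating single/double bonds
ideal_ring = [l_opt] * 6            # all bonds at the optimum length

print(f"l_opt = {l_opt:.4f} angstrom, alpha = {alpha:.1f} angstrom^-2")
print(f"HOMA(Kekule ring) = {homa(kekule_ring, l_opt, alpha):.3f}")
print(f"HOMA(ideal ring)  = {homa(ideal_ring, l_opt, alpha):.3f}")
```

By construction, the alternating ring evaluates to HOMA = 0 and the fully equalized ring to HOMA = 1, whatever value of ω is used.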

One of the main advantages of HOMA is that apart from using just the bond length differences present in the molecule of interest, it also accounts for the differences between the average bond length for this molecule, and the optimum bond length for the ideally aromatic system. This is best observed when the definition of HOMA is rewritten as [5, 8]:

$$ \begin{aligned} {\text{HOMA}} &= 1 - {\text{EN}} - {\text{GEO}} \\ {\text{EN}} &= \alpha \left( l_{opt} - l_{ave} \right)^{2} \\ {\text{GEO}} &= \frac{\alpha}{n}\sum\limits_{i = 1}^{n} \left( l_{i} - l_{ave} \right)^{2} \end{aligned} \tag{2} $$

The GEO component reflects the impact of the bond length differences (BLD) within the ring on the aromaticity, whereas the EN component is sensitive to changes in the average bond length. Thus, HOMA correctly predicts the antiaromaticity of, e.g., cyclohexanehexone, whereas other popular geometry-based aromaticity descriptors like the Julg-François index [9] or the Bird index [10] fail spectacularly by classifying this system as 100 % aromatic. Note that the above formulas for HOMA are strictly equivalent to the original one (Eq. 1) only for hydrocarbons. For heterocyclic rings, the lengths of bonds involving atoms other than carbon have to be transformed to mimic the CC bond lengths of the same order [5]. This procedure, however, recovers the values of HOMA from the original formulation only if the force constant ratios ω are identical for all pairs of atoms. When they are estimated independently for different pairs of atoms, the HOMA values obtained from Eq. 2 are somewhat different from the original (Eq. 1) values. The discrepancies are nonetheless small and can usually be ignored, as decomposition (2) is needed mostly for specific interpretational purposes. It introduced, however, an intriguing novelty: the EN part is to be taken with the negative sign whenever the average bond length is shorter than l_opt [8]. This may lead to HOMA > 1 provided that the GEO part is small (e.g., for symmetry reasons). This behavior is rather counterintuitive, and we will discuss it briefly while commenting on our results.
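The decomposition can be checked numerically for a hydrocarbon ring. The sketch below is illustrative only: the ring bond lengths are hypothetical, the parameters are derived from the electron diffraction bond lengths of butadiene quoted later in the text (ω = 2), and the `signed_en` flag is one possible reading of the sign convention of ref. [8] described above.

```python
from typing import Sequence, Tuple

# CC parameters from l1 = 1.467 and l2 = 1.349 angstrom with omega = 2.
L_OPT = (2.0 * 1.349 + 1.467) / 3.0
ALPHA = 2.0 / ((1.467 - L_OPT) ** 2 + (1.349 - L_OPT) ** 2)


def homa_direct(bonds: Sequence[float]) -> float:
    """Eq. 1 for an all-carbon ring."""
    return 1.0 - ALPHA * sum((l - L_OPT) ** 2 for l in bonds) / len(bonds)


def homa_en_geo(bonds: Sequence[float], signed_en: bool = False) -> Tuple[float, float, float]:
    """Eq. 2: return (HOMA, EN, GEO).

    With signed_en=True, EN is taken with a negative sign when the
    average bond length is shorter than l_opt.
    """
    n = len(bonds)
    l_ave = sum(bonds) / n
    en = ALPHA * (L_OPT - l_ave) ** 2
    if signed_en and l_ave < L_OPT:
        en = -en
    geo = ALPHA * sum((l - l_ave) ** 2 for l in bonds) / n
    return 1.0 - en - geo, en, geo


alternating_ring = [1.36, 1.44] * 3  # hypothetical bond lengths (angstrom)
short_ring = [1.375] * 6             # hypothetical ring with l_ave < l_opt

h, en, geo = homa_en_geo(alternating_ring)
print(f"HOMA = {h:.4f} (EN = {en:.4f}, GEO = {geo:.4f}), Eq. 1: {homa_direct(alternating_ring):.4f}")
print(f"short ring, unsigned EN: {homa_en_geo(short_ring)[0]:.4f}")
print(f"short ring, signed EN:   {homa_en_geo(short_ring, signed_en=True)[0]:.4f}")
```

With the unsigned EN term, Eq. 2 reproduces Eq. 1 exactly; with the signed convention, a symmetric ring whose bonds are shorter than l_opt ends up above unity.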

Originally, the HOMA index was designed to estimate the aromaticity of molecules based on geometries usually taken from crystallographic experiments. The values of HOMA were thus directly linked to measurements. This, however, made them vulnerable to errors inherent in the applied experimental techniques and to errors related to interactions with the environment. Moreover, the errors for the studied systems were likely to be different from the errors for the reference molecules. The natural question in this context is what the values of HOMA would be, were they free from the environmental and experimental bias. Besides, the geometries of many systems cannot be determined experimentally, especially if one is interested not only in the ground state properties, but also, e.g., in the reactivity of a molecule in its excited state. In such cases, one usually resorts to quantum chemical calculations, which have become a standard way to determine molecular properties, including equilibrium geometries for both the ground state and the excited states. The rapidly growing computational power and the advent of new efficient theories and algorithms, led by the methods based on density functional theory (DFT), have allowed for studying large and complex systems containing as many as several hundred atoms. However, the necessarily simplified treatments of electron correlation, as well as other approximations routinely used in quantum chemistry, are bound to affect the theoretical results. In many situations (e.g., the energies of reactions, activation barriers, or excitation energies), the theoretical results are surprisingly accurate, because most of the errors fortuitously cancel out. However, when the calculated quantities (e.g., bond lengths) are mixed with experimental ones, the shortcomings of the quantum chemical treatment are bound to resurface.
Unfortunately, HOMA is routinely calculated in just such a way: the theoretically obtained bond lengths for a studied system are combined with the parameterization based on experimental geometries of the reference molecules [7, 11–18]. One may have justified suspicions that HOMA computed in such a way would undergo strong changes with a change of the basis set, computational method, or even the exchange–correlation functional of DFT (a great variety of which have been recently developed and presented for general use). Such behavior of any quantitative descriptor of aromaticity is, of course, highly undesirable. One may expect, however, that this sensitivity to details of computational schemes would be reduced if a consistent theoretical treatment of both the studied system and the reference molecules is used. This approach offers a chance for systematic cancelation of errors of the quantum chemical calculations. The HOMA obtained in this way will be further referred to as HOMA^CCP (consistently calculated parameters) as opposed to HOMA^EP, obtained with parameters based on the experimental geometries.

The sensitivity of HOMA^EP to computational details of the geometry optimization can thus be anticipated, as well as its reduction for HOMA^CCP. The magnitude of these effects, however, cannot be easily predicted. In this paper, we determine quantitatively the impact of different choices of computational methods of geometry optimization on HOMA calculated in both outlined ways (HOMA^CCP and HOMA^EP) for a group of compounds containing all-carbon unsaturated rings of varying sizes and degrees of aromaticity. The paper is divided into two main sections. First, we present the benchmark HOMA values for the selected unsaturated hydrocarbons obtained by means of the CCSD(T) computational scheme, which was reported to provide accuracy comparable to that of the best experiments [19–21]. Subsequently, the benchmark is used to test the performance of the DFT method, with a choice of 25 exchange–correlation functionals designed for various fields of chemistry, and two basis sets of different sizes. We examine the consequences of using the original parameterization of HOMA, propose a new parameterization derived from the recent experimental geometry of trans-1,3-butadiene, and demonstrate the changes brought about by using the consistent approach. The paper concludes by recommending the best functionals for studying the aromaticity of organic systems based on their geometries, and by providing the list of HOMA parameters (for the CC bonds) for all the studied DFT functionals and basis sets.

Results and discussion

The CCSD(T) study of trans-1,3-butadiene and selected hydrocarbons

In this section, we will focus initially on trans-1,3-butadiene, which is the original source of HOMA parameters for the CC bonds [3], and for which the experimental equilibrium bond lengths are known to a very good accuracy [22]. We will investigate the performance of the CCSD(T) method in predicting equilibrium geometries for this molecule, comparing the quantum chemical results with the experimental data. A similar analysis for benzene will be performed in the “DFT calculations” section, where the ab initio results will be directly compared with the outcome of the DFT calculations.

Judging from studies of the accuracy of ab initio methods for predicting molecular equilibrium structures [19–21, 23], the best method that is feasible for medium-size molecules (up to 6–9 heavy atoms, depending on the symmetry of the system) appears to be coupled-cluster singles and doubles with perturbative inclusion of triples, CCSD(T), especially when combined with the cc-pVTZ or, preferably, the cc-pVQZ basis set. The mean error of this computational scheme ($ \bar{\Delta} $), determined for a set of molecules containing first and second row atoms, is less than 1 pm, with the maximum absolute deviation $ \left| \Delta_{\max} \right| $ = 1.511 pm. When the core electrons are also correlated (all core CCSD(T)/cc-pCVQZ), the errors are further reduced ($ \bar{\Delta} $ = 0.026 pm and $ \left| \Delta_{\max} \right| $ = 0.706 pm). The latter computational scheme, however, is much more costly than the standard, frozen-core one, owing to both the higher number of active orbitals and the enlarged basis set, which contains additional tight polarization functions for a more flexible description of the core electrons.

For our ab initio calculations, we have selected the Dunning cc-pVXZ (X = D, T, and Q) basis sets [24], the three consecutive members of the popular sequence of basis sets that allow for approaching the complete basis set limit by going to higher levels in the sequence. For the all core calculations, we have also used their dedicated counterparts: the cc-pCVXZ basis sets [24]. All the ab initio calculations have been carried out using the MOLPRO 2010.1 [25] and CFOUR [26] program packages.

Trans-1,3-butadiene and the HOMA parameters

1,3-butadiene was selected by Krygowski et al. [3] as the reference molecule to parameterize HOMA for the CC bonds. Since the trans isomer of the butadiene is more stable than the cis one, the experimental data refer to the former. One may argue that it is the cis isomer that should be used as the reference system for HOMA, as it more closely resembles a part of the benzene ring. The geometry of the cis isomer is difficult to determine experimentally, but as it is easily accessible theoretically, it could be used to parameterize HOMA^CCP. Such a parameterization, however, would lead to the values of HOMA that could not be directly compared with HOMA^EP, the parameters for which are necessarily based on the geometry of the trans-1,3-butadiene. Since the main goal of this study is comparing the behaviors of HOMA^EP and HOMA^CCP, we have decided to use the geometry of trans-1,3-butadiene throughout the whole study. It is interesting to study the changes of HOMA introduced by switching from the trans to the cis isomer of butadiene, but such an analysis is beyond the scope of the present paper, and will be addressed in the future.

The optimized CC bond lengths, the bond length difference (BLD), and the HOMA parameters l_opt and α are displayed in Fig. 1 and collected in Table 1. For every combination of the computational scheme and basis set, we have also estimated the force constant ratio ω in the way outlined by Cyrański et al. [6]. The energies were calculated for a series of geometries obtained from the optimized equilibrium structure by changing either l₁ or l₂ by ±0.005, ±0.010, and ±0.050 Å. Second order polynomial fits provided the approximate relations E(Δl₁) and E(Δl₂) and yielded the force constants w for both types of bonds. Owing to cancelation of errors, this simple procedure can be expected to provide good estimates for the ratios w₂/w₁. The values of ω thus obtained are also included in Table 1.
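The fitting step can be sketched as follows. This is not the authors' script: the energies below come from a hypothetical harmonic model with made-up force constants in arbitrary units, standing in for the single-point energies that a quantum chemistry program would supply, and the least-squares parabola is solved directly from the normal equations.

```python
from typing import Sequence

# Bond length displacements (angstrom) applied to l1 or l2, as in the text.
DISPLACEMENTS = [-0.050, -0.010, -0.005, 0.005, 0.010, 0.050]


def quadratic_coefficient(xs: Sequence[float], es: Sequence[float]) -> float:
    """Least-squares coefficient c of E(x) ~ a + b*x + c*x**2."""
    n = float(len(xs))
    s1 = sum(xs)
    s2 = sum(x ** 2 for x in xs)
    s3 = sum(x ** 3 for x in xs)
    s4 = sum(x ** 4 for x in xs)
    t0 = sum(es)
    t1 = sum(x * e for x, e in zip(xs, es))
    t2 = sum(x * x * e for x, e in zip(xs, es))

    def det3(m):
        return (m[0][0] * (m[1][1] * m[2][2] - m[1][2] * m[2][1])
                - m[0][1] * (m[1][0] * m[2][2] - m[1][2] * m[2][0])
                + m[0][2] * (m[1][0] * m[2][1] - m[1][1] * m[2][0]))

    normal = [[n, s1, s2], [s1, s2, s3], [s2, s3, s4]]
    # Cramer's rule: replace the third column by the right-hand side.
    with_rhs = [[n, s1, t0], [s1, s2, t1], [s2, s3, t2]]
    return det3(with_rhs) / det3(normal)


def force_constant(xs: Sequence[float], es: Sequence[float]) -> float:
    """E = E0 + (1/2) * w * x**2, hence w = 2c."""
    return 2.0 * quadratic_coefficient(xs, es)


# Hypothetical model force constants (arbitrary units), for illustration only.
W1_MODEL, W2_MODEL = 5.0, 9.0
e_single = [0.5 * W1_MODEL * dx ** 2 for dx in DISPLACEMENTS]
e_double = [0.5 * W2_MODEL * dx ** 2 for dx in DISPLACEMENTS]

w1 = force_constant(DISPLACEMENTS, e_single)
w2 = force_constant(DISPLACEMENTS, e_double)
omega = w2 / w1
print(f"omega = w2/w1 = {omega:.3f}")
```

Because only the ratio w₂/w₁ enters l_opt, the energy unit cancels, which is one reason why such a simple fit can be adequate.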

Table 1 The HOMA parameters (l _opt, α) derived from the lengths of the CC bonds (l ₁, l ₂) and the related force constant ratios (ω) for trans-1,3-butadiene, as calculated at the CCSD(T) level of theory

The calculated bond lengths are compared with two sets of experimental ones. The first set comes from the electron diffraction experiment [27] and was selected by Krygowski [3] as the reference to parameterize HOMA for the CC bonds. The other set comes from a recent paper by Craig et al. [22], in which the authors reported the equilibrium bond lengths (r_e) with an accuracy of 0.001 Å. The equilibrium bond lengths were obtained from the measured rotational constants of various deuterated butadienes, corrected for the influence of zero point vibrations. These bond lengths (l₂ = 1.338 Å and l₁ = 1.454 Å) are shorter by over 0.01 Å than those used by Krygowski (l₂ = 1.349 Å and l₁ = 1.467 Å). We do not suggest that the new bond lengths should be used in the classical calculations of HOMA, in which the index is based on experimental bond lengths of the studied molecules (typically obtained in crystallographic studies). The original parameterization may in such cases give better results, owing to favorable compensation of experimental or environmental errors. Quantum chemical calculations, however, yield the equilibrium bond lengths directly, so the new experimental data should be more appropriate for assessing the quality of the theoretical results. Analysis of the calculated CC bond lengths shows that this is indeed the case. The “old” bond lengths are best reproduced in the least accurate calculations, which employ the cc-pVDZ basis set. The calculated bond lengths, however, decrease significantly as the basis set is improved, eventually converging to the “new” experimental values. The agreement is nearly perfect for the all core CCSD(T)/cc-pCVQZ results.

The CCSD(T)/cc-pVQZ calculations have yielded lengths of both kinds of CC bonds that are almost uniformly overestimated by approximately 0.004 Å, which results in a similarly overestimated l_opt, but the BLD is still of the same quality as the all core CCSD(T)/cc-pCVQZ value. Since the error in the CC bond lengths seems to be nearly independent of the bond order, analogous behavior of CCSD(T)/cc-pVQZ (similar overestimation of the average CC bond length, good reproduction of the BLD) can be expected also for other unsaturated hydrocarbons. In such a case, almost complete cancelation of errors can be expected while calculating the HOMA^CCP values, and so they can be regarded as equivalent to HOMA based on the accurate experimental equilibrium bond lengths. The same argument holds for the CCSD(T)/cc-pVTZ and the all core CCSD(T)/cc-pCVTZ values: again, the errors for l₁ and l₂ are very similar (somewhat larger than for the QZ basis sets), and the BLD is almost as accurate as that obtained with the quadruple-ζ basis set.
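The cancelation argument can be checked numerically. The sketch below assumes an idealized case in which every CC bond, both in the ring and in the reference butadiene, is overestimated by the same 0.004 Å. The ring bond lengths are hypothetical, the reference values are the equilibrium bond lengths of butadiene quoted above, and ω = 2 is used for simplicity.

```python
from typing import Sequence, Tuple


def homa_parameters(l1: float, l2: float, omega: float = 2.0) -> Tuple[float, float]:
    """(l_opt, alpha) from reference single (l1) and double (l2) bond lengths."""
    l_opt = (omega * l2 + l1) / (1.0 + omega)
    alpha = 2.0 / ((l1 - l_opt) ** 2 + (l2 - l_opt) ** 2)
    return l_opt, alpha


def homa(bonds: Sequence[float], l_opt: float, alpha: float) -> float:
    """Eq. 1 for an all-carbon ring."""
    return 1.0 - alpha * sum((l - l_opt) ** 2 for l in bonds) / len(bonds)


REF_L1, REF_L2 = 1.454, 1.338  # equilibrium bond lengths of butadiene (angstrom)
DELTA = 0.004                  # assumed uniform overestimation (angstrom)
true_ring = [1.36, 1.43] * 3   # hypothetical "accurate" ring geometry

computed_ring = [l + DELTA for l in true_ring]

homa_true = homa(true_ring, *homa_parameters(REF_L1, REF_L2))
# HOMA^EP: computed ring, experimental reference parameters.
homa_ep = homa(computed_ring, *homa_parameters(REF_L1, REF_L2))
# HOMA^CCP: computed ring, reference parameters carrying the same error.
homa_ccp = homa(computed_ring, *homa_parameters(REF_L1 + DELTA, REF_L2 + DELTA))

print(f"error-free HOMA: {homa_true:.4f}")
print(f"HOMA^EP:         {homa_ep:.4f}")
print(f"HOMA^CCP:        {homa_ccp:.4f}")
```

A uniform shift moves l_opt by the same amount and leaves α unchanged, so HOMA^CCP coincides with the error-free value in this idealized case, whereas HOMA^EP does not.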

Model compounds—benchmark results

In this section, we will analyze the values of HOMA computed for the test set of 25 cyclic hydrocarbons of varying aromaticity, displayed in Chart 1. The molecules were assumed to be planar by imposing symmetry constraints (C_2v or C_s), with obvious exceptions for compounds 1, 8, 24, and 25, for which planarization would be highly unfavorable energetically. The input bond lengths have been obtained with the CCSD(T) method in both the frozen-core and all core versions. The quadruple-ζ correlation consistent basis sets were used. They were replaced with their smaller, triple-ζ counterparts whenever the molecule proved too large. The accuracy of the cc-pVDZ results was deemed insufficient and they were excluded from further use in this study. The values of HOMA^CCP (obtained with the consistently calculated reference parameters) are listed in Table 2. A glance at the results allows one to conclude that HOMA^CCP depends only weakly on the choice of the basis set. In particular, the CCSD(T)/cc-pVXZ and the all core CCSD(T)/cc-pCVXZ results are very close to each other for both values of X (T or Q). The benchmark values will be chosen in the following fashion: the quadruple-ζ results will be preferred whenever available, and of those the potentially more accurate all core ones. Otherwise, we will select the all core CCSD(T)/cc-pCVTZ values. The benchmark set thus created will be applied to assess the performance of the selected XC functionals used in DFT calculations.

Table 2 HOMA^CCP for the selected unsaturated hydrocarbons (labeled according to Chart 1) obtained from geometries optimized at the CCSD(T) level of theory

Note that the results seem to be well saturated with respect to the basis set already at the triple-ζ level, even though the bond lengths are not. This shows that substantial compensation of errors does take place while computing HOMA^CCP, as envisaged in the preceding section. It also indicates that the errors in the CC bond lengths calculated at the CCSD(T) level of theory are approximately transferable, regardless of the bond order and of the size of the molecule. That the largest differences occur for the least aromatic systems is easily understandable, as HOMA is based on the squared differences between bond lengths. The impact of the errors in bond lengths is thus the more severe, the farther a bond length deviates from the optimum value (l_opt), and the larger the BLDs are in a studied molecule.
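A first-order estimate makes this explicit. As a sketch, assume small errors δl_i in the ring bond lengths and a common error δl_opt in the optimum bond length, and neglect the accompanying change in α (the symbols δl_i and δl_opt are introduced here for illustration only). Differentiating Eq. 1 for a hydrocarbon ring then gives:

$$ \delta {\text{HOMA}} \approx - \frac{2\alpha}{n}\sum\limits_{i = 1}^{n} \left( l_{i} - l_{opt} \right)\left( \delta l_{i} - \delta l_{opt} \right) $$

A uniform error (δl_i = δl_opt for every bond) therefore drops out, while any residual, bond-dependent error is weighted by the distance of that bond from l_opt, which is largest in the least aromatic rings.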

DFT calculations

Choice of the exchange–correlation functionals

In the Kohn–Sham formulation of density functional theory (KS-DFT), the computationally demanding direct solution of the electronic Schrödinger equation is replaced by solving a system of equations for non-interacting electrons defined to have the same one-electron density as the true system. Such calculations are much shorter than the traditional direct approach, and thus the boundaries of applicability of (non-semiempirical) quantum chemical calculations have been moved from several tens to several hundreds of heavy atoms. KS-DFT provides a way to incorporate dynamic electron correlation into the one-electron model (or single-determinant wavefunction), previously characteristic of the Hartree–Fock scheme, in which all the Coulomb correlation of electrons was neglected. However, all the subtleties of the correlated motion of electrons have to be introduced in KS-DFT through a complicated exchange–correlation (XC) functional, the exact form of which is unknown. Much of modern DFT research is therefore devoted to developing approximations to the XC functional, which are intended to give more and more accurate results. Unfortunately, no single systematic approach for developing the exact functional currently exists, and so hundreds of different functionals have been proposed, leaving the potential user at a loss as to which one would be most suitable for a particular task. An ideal functional would, of course, be well suited to all applications in chemistry and physics. Such functionals, however, are not likely to be discovered in the foreseeable future. Most of the existing functionals are more or less directed toward increased accuracy in a particular field (e.g., main group thermochemistry, barrier heights, or electronic spectroscopy) at the expense of deteriorated performance in calculating other properties.

For our study, we have selected functionals representing each of the levels of approximation (or rungs of the Jacob’s ladder [28]). The SVWN [29, 30] functional was chosen mostly for comparative purposes to emphasize the improvements introduced at the higher rungs. For GGA functionals (the second rung), we selected BLYP [31–33], PBE [34, 35], and HCTH [36–38]. From the third rung (the meta-GGA functionals), we include TPSS [39], τ-HCTH [40], and M06-L [41]. The fourth rung (the hybrid functionals) is most strongly represented, as the functionals here may contain different admixtures of non-local exchange, which significantly modifies their performance. Here, we have chosen the following functionals: TPSSh [39] (10 % of non-local exchange), B97-1 [38] (19 %), B3LYP [30, 32, 42] (21 %), PBE0 [43], PBEh [44], and ωPBEh [45] (25 % each), M06 [46] (27 %), BMK [47] (42 %), BHandHLYP [48] (50 %), M06-2X [46] (54 %), and M06-HF [46] (100 %). We also consider the recently developed range-separated functionals, for which the admixture of non-local exchange varies with the interelectronic distance r. This feature is intended to improve the incorrect long-distance behavior of the approximate XC functionals. Thus, we have also included CAM-B3LYP [49] (19–60 % of non-local exchange), LC-PBE [50] (0–100 %), LC-ωPBE [51] (25–100 %), ωB97 [52] (0–100 %), and ωB97X [52] (19–100 %). Finally, we include two double-hybrid functionals (the fifth rung): B2PLYP [53] and its improved version mPW2PLYP [54], which were reported to provide considerably higher accuracy with respect to BLYP, TPSS, and B3LYP when tested on the extensive G3 set of molecules [54]. All the DFT calculations have been performed using the popular Dunning DZP basis set [55, 56] and the def2-TZVPP basis set of the Karlsruhe group [57]. The former provides a reasonable compromise between accuracy and computational cost, whereas the latter gives results that for DFT calculations can be regarded as close to the complete basis set limit. The DFT calculations were performed using GAUSSIAN’09 [58]. The selected functionals are listed in Table 3, together with the respective HOMA parameters for the CC bonds obtained with both basis sets chosen for the DFT calculations. These parameters have been used in this study to compute HOMA^CCP for the test set of molecules. The parameterizations derived in the simplified way (ω = 2) are available in the supplementary material.

Table 3 Selected exchange–correlation functionals and the respective HOMA parameters obtained for both basis sets used in our DFT calculations

HOMA for benzene

Before we embark on the statistical analysis of the performance of the DFT functionals for the model compounds, we would like to focus briefly on benzene, for which a high-quality experimental equilibrium geometry is available [23]. The equilibrium CC bond length was established to be r_e = 1.391 ± 0.001 Å (the same accuracy as that for butadiene [22]). This value is in excellent agreement with the all core CCSD(T)/cc-pCVQZ value of 1.3918 Å, whereas the frozen-core version of CCSD(T) slightly overestimates the experimental bond length, yielding r_e = 1.3949 Å. Nonetheless, HOMA^CCP is the same for both versions of CCSD(T): 0.973, owing to the compensation of errors discussed above for butadiene. This value is also practically the same as the HOMA based solely on the experimental equilibrium bond lengths for both benzene and butadiene: 0.970 ± 0.012 (the uncertainty being estimated from the maximum experimental errors for the CC bond lengths in both molecules).

The accuracy of DFT is considerably lower. The errors due to approximations in the XC functionals and limited basis sets are especially noticeable when the HOMAs are calculated using the experimental parameterizations (HOMA^EP). Figure 2 panel a shows HOMA^EP for benzene computed with three sets of the experimentally based parameters: the original one taken from Krygowski [3], and two sets of parameters obtained from the equilibrium bond lengths of butadiene [22] using either the force constant ratio ω = 2 or ω = 1.684 (the all core CCSD(T)/cc-pCVQZ value). A glance at the values of HOMA^EP thus obtained makes it obvious that they strongly depend both on the computational parameters (XC functional, basis set) selected for the geometry optimization of benzene and on the choice of experimental geometry of the reference molecule. They are also sensitive to whether the simple (ω = 2) or improved (ω = 1.684) parameterization was used. The values of HOMA^EP based on the original parameters proposed by Krygowski et al. [3] are generally too high with respect to the HOMA based solely on the experimental equilibrium bond lengths, as well as to the HOMA^CCP from the CCSD(T) calculations. When computed from geometries optimized with the def2-TZVPP basis set, they closely approach or even exceed unity, going as high as 1.012 for BHandHLYP and 1.049 for LC-PBE. The values based on bond lengths optimized with the DZP basis set are slightly lower, varying between 0.827 (BLYP) and 0.998 (LC-PBE). On the other hand, the combination of the DZP geometries and the parameterization based on the equilibrium bond lengths of butadiene [22] leads to extremely low values of HOMA (going down to 0.619 for BLYP). Analogous results based on the def2-TZVPP geometries are much more reasonable, especially when the improved parameterization (derived with ω = 1.682) is used, which also helps to somewhat reduce the dependence of HOMA on the choice of the XC functional. The HOMAs calculated in this way vary between 0.89 (BLYP) and 1.011 (LC-PBE).
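To illustrate how strongly the parameterization alone affects the index, the following sketch evaluates HOMA for a benzene ring at the experimental equilibrium bond length (1.391 Å) with the three experimentally based parameter sets discussed above. It assumes that α is renormalized with the same formula for every ω; the function and variable names are illustrative, and the resulting numbers are not meant to reproduce Fig. 2, which is based on DFT geometries.

```python
from typing import Dict, Tuple


def homa_parameters(l1: float, l2: float, omega: float) -> Tuple[float, float]:
    """(l_opt, alpha) from reference single (l1) and double (l2) bond lengths."""
    l_opt = (omega * l2 + l1) / (1.0 + omega)
    alpha = 2.0 / ((l1 - l_opt) ** 2 + (l2 - l_opt) ** 2)
    return l_opt, alpha


def homa_equal_bonds(bond: float, l_opt: float, alpha: float) -> float:
    """Eq. 1 for a ring in which all bonds have the same length."""
    return 1.0 - alpha * (bond - l_opt) ** 2


BENZENE_RE = 1.391  # experimental equilibrium CC bond length (angstrom)

# (l1, l2, omega) for the three experimentally based parameterizations.
PARAMETER_SETS: Dict[str, Tuple[float, float, float]] = {
    "original (electron diffraction, omega = 2)": (1.467, 1.349, 2.0),
    "equilibrium, omega = 2": (1.454, 1.338, 2.0),
    "equilibrium, omega = 1.684": (1.454, 1.338, 1.684),
}

results = {}
for label, (l1, l2, omega) in PARAMETER_SETS.items():
    l_opt, alpha = homa_parameters(l1, l2, omega)
    results[label] = homa_equal_bonds(BENZENE_RE, l_opt, alpha)
    print(f"{label}: l_opt = {l_opt:.4f}, alpha = {alpha:.1f}, HOMA = {results[label]:.3f}")
```

Even with the ring geometry held fixed, the three parameter sets give noticeably different values, with the original parameterization yielding the highest one.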

The real stabilization of the results, however, is achieved by switching to the parameterizations based on the consistently optimized CC bond lengths of butadiene (the HOMA^CCP, as displayed in Fig. 2 panel b). First of all, variations of HOMA with respect to the choice of the XC functional are further attenuated. Here, the importance of using the calculated values of the force constant ratios ω must be emphasized, as they show a surprisingly strong dependence on the functional, ranging from 1.608 (SVWN) to 1.880 (M06-HF). Using these values has reduced the dependence of HOMA on the functional almost threefold with respect to HOMA calculated in the simplified approach (ω = 2). The sensitivity of HOMA^CCP to the size of the basis set is also very small: going from the moderate DZP basis set to the large def2-TZVPP basis set brings about a uniform (for all functionals) lowering of HOMA by less than 0.01. Moreover, the values of HOMA^CCP calculated in this way are quite accurate regardless of the choice of the functional, the errors being attenuated to within the margin of 0.04 with respect to the experimentally based value of 0.970.

It is also worthwhile to look closer at the cases of HOMA^EP > 1. For benzene, this means that the length of the CC bonds in the ring is smaller than the optimum length of the CC bond for the aromatic system (l_opt). In energy terms, systems with shorter bonds can be expected to be more stable, or more aromatic. Therefore, for such cases, HOMA was defined to be greater than one [8]. Such a situation was difficult, however, to understand on physical grounds. Therefore, HOMA > 1 was rather attributed to an imperfect choice of the reference systems, or to inaccuracies of the experimental bond lengths for the studied systems and the reference molecules. In our study, it may also stem from incompatibility of the quantum chemistry results and the experimental data used to obtain the HOMA parameters: when the original parameterization has been used, HOMA^EP > 1 has been observed for six XC functionals (SVWN, M06, BHandHLYP, CAM-B3LYP, LC-PBE, LC-ωPBE), and for HF, combined with the def2-TZVPP basis set. The values of HOMA^EP computed with the new experimental parameterization (based on the equilibrium bond lengths) have exceeded unity only for the LC-PBE/def2-TZVPP method. HOMA^CCP, on the other hand, has not exceeded unity for any of the XC functionals or either of the basis sets used in our study. We may thus conclude that while butadiene itself seems to be an appropriate choice for the source of the HOMA parameters, the peculiarities of HOMA^EP being larger than one are brought about by combining the experimental bond lengths for the reference system with the theoretically obtained bond lengths for the studied molecules.

Performance of DFT functionals—statistical analysis

Figures 3, 4 and 5 contain the statistical data for the selected DFT functionals. We have also included the Hartree–Fock results in the analysis, as this method is computationally inexpensive and provides a reference point for the functionals with a high content of non-local exchange (even though in DFT functionals the non-local exchange is computed using the Kohn–Sham orbitals). Another non-DFT method that we have included is MP2, because it offers a considerable increase in accuracy with respect to the HF scheme at a reasonable cost. In fact, it is somewhat less computationally demanding than the double-hybrid functionals (B2PLYP and mPW2PLYP). It is thus prudent to compare the accuracy of MP2 with that of DFT. The mean signed errors (MSE), the mean absolute errors (MAE), and the maximum absolute errors (MaxAE) have been calculated with respect to the benchmark values of HOMA, collected in Table 2. Three sets of data have been analyzed, corresponding to three choices of parameterization: (a) the original parameterization of Krygowski et al.; (b) the parameterization based on the new experimental equilibrium bond lengths of butadiene and the force constant ratio ω = 1.682, as calculated at the all core CCSD(T)/cc-pCVQZ level of theory; (c) the parameters derived from the consistently calculated bond lengths and force constant ratios ω for trans-1,3-butadiene (Table 3). The results obtained using the original parameters seem to confirm the conclusions drawn in the case of benzene. The values of HOMA are overestimated for most of the functionals, and the errors are significantly larger for the larger basis set. These findings are not surprising, as the original parameterization is based on the bond lengths from the electron diffraction experiment (r_a), and not on the equilibrium geometry of butadiene (r_e). It appears that quantum chemical results should not be used in combination with these parameters.

Using the experimental equilibrium bond lengths [22] as the source for the HOMA parameters has led to a reduction of the errors of HOMA^EP, which are no longer systematically overestimated. The values are generally a little too low if based on the DZP geometries. For HOMA^EP calculated using the def2-TZVPP geometries, however, no systematic errors can be observed: the MSE of HOMA^EP are randomly positive or negative, while the absolute deviations for most functionals are noticeably smaller in comparison with the DZP results.
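
For reference, the three error measures used in this analysis can be sketched as follows. The function name and the HOMA values are hypothetical and do not reproduce Table 2; errors are taken as computed minus benchmark, so a positive MSE means overestimated aromaticity.

```python
def error_statistics(computed, benchmark):
    """Return (MSE, MAE, MaxAE) of computed values against benchmark values."""
    if len(computed) != len(benchmark) or not computed:
        raise ValueError("need two non-empty sequences of equal length")
    errors = [c - b for c, b in zip(computed, benchmark)]
    n = len(errors)
    mse = sum(errors) / n                      # mean signed error
    mae = sum(abs(e) for e in errors) / n      # mean absolute error
    max_ae = max(abs(e) for e in errors)       # maximum absolute error
    return mse, mae, max_ae


# Hypothetical HOMA values for four molecules (illustration only).
benchmark = [0.99, 0.80, 0.45, -0.30]
computed = [0.97, 0.85, 0.40, -0.20]

mse, mae, max_ae = error_statistics(computed, benchmark)
print(f"MSE = {mse:+.3f}, MAE = {mae:.3f}, MaxAE = {max_ae:.3f}")
```

An MSE close to zero combined with a sizeable MAE, as reported above for the def2-TZVPP geometries, indicates errors that are unsystematic rather than small.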

The HF values of HOMA are substantially underestimated (too low aromaticities), which is in accordance with the well-known tendency of the HF method to over-localize the π-electrons and thus yield too large bond length differences (BLD) and too low polarizabilities [59–61]. On the other hand, the MP2 values of HOMA are too high, which again corresponds to the overestimated delocalization of the π-electrons frequently observed for this method, resulting in too small BLDs and overestimated polarizabilities, especially in extended π-conjugated systems (e.g., oligoenes, oligothiophenes) [59–61]. Among the DFT functionals, only the local functional (SVWN) and M06-HF yielded worse results than HF. All the other functionals have outperformed HF by far, and are also better than, or at least comparable with, MP2.

The best functionals are TPSSh, B3LYP, BHandHLYP, CAM-B3LYP, and the two fifth rung functionals (B2PLYP and mPW2PLYP). For all of them, the maximum errors do not exceed 0.15, the mean absolute errors are less than 0.05, and the mean signed errors are in the range of –0.035 to 0.035. It appears that for good performance in geometry optimization the functional has to contain a moderate to medium content of non-local exchange.

Interestingly, among the functionals from the first three rungs, the best performance (MSE ≈ 0, relatively low values of MAE and MaxAE) has been observed for PBE and TPSS, the two functionals that were constructed using the exact constraint satisfaction method, without any empirical fitting procedure [62].

Further improvement of the DFT results has been achieved using the consistently calculated parameters (listed in Table 3), which leads to the HOMA^CCP values. A distinct trend can be observed here, much as in the case of benzene. The non-hybrid (local, GGA, and meta-GGA) functionals yield the lowest values of HOMA and generally underestimate the aromaticity (MSE <0). This systematic error is reduced when some admixture of the non-local exchange appears in the functional. The best performance is observed for the PBE hybrids and the M06 functional (25 and 27 % of the non-local exchange, respectively). Further increase of the non-local contribution to exchange brings about an increase of HOMA, leading to positive values of MSE, and to elevated values of MAE. This trend does not hold for the double-hybrid functionals, however, owing to the presence of the non-local correlation component, which reduces the errors associated with the high content (over 50 %) of non-local exchange. Nonetheless, for all of the DFT functionals studied here, and the two ab initio methods included in the analysis, the values of MAE are lower than 0.13. Note that for HOMA^EP (obtained using the new experimental parameters), the MAE of 0.13 was exceeded for six DFT functionals, as well as for HF and MP2. The effect of favorable compensation of errors is thus evident.

This is also the reason for the much reduced sensitivity of HOMA^CCP to the size of the basis set. The results obtained from geometries optimized with the moderate DZP basis set agree to within a few percent with the results based on geometries optimized with the larger, more accurate, and more computationally demanding def2-TZVPP basis set. The changes are moderate even for the double-hybrid functionals, which are potentially the most sensitive to the size of the basis set, as they require not only the occupied orbitals (in the exchange part), but also make use of all the virtual orbitals (in the correlation part).

Detailed analysis of the errors for HOMA^CCP is not as straightforward as for HOMA^EP, since they originate from differences in the performance of a given theoretical method for the studied molecules and the reference ones. In the ideal case, in which the errors in bond lengths are independent of the bond order and the size of the molecule (which would result in exact BLDs, even if l_ave were imperfect), the values of HOMA^CCP would be completely free from the errors of the quantum chemical treatment. The CCSD(T)/cc-pVQ(T)Z results are close to fitting that picture. DFT geometries, however, satisfy neither of the above conditions: the BLD is usually underestimated, and the deviations depend on the size of the π-conjugated system. As a result, the compensation of errors in HOMA^CCP based on DFT geometries is incomplete and functional dependent.
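
The ideal case described above can be checked numerically with a short sketch. It assumes the standard HOMA definitions (l_opt from the reference single and double bond lengths and ω, α normalized to HOMA = 0 for the alternating reference structure, signed EN term); all bond lengths and the uniform error DELTA are hypothetical.

```python
def homa_parameters(l_single, l_double, omega=2.0):
    """Assumed standard definitions of l_opt and alpha (lengths in Å)."""
    l_opt = (l_single + omega * l_double) / (1.0 + omega)
    alpha = 2.0 / ((l_single - l_opt) ** 2 + (l_double - l_opt) ** 2)
    return l_opt, alpha


def homa(bond_lengths, l_opt, alpha):
    """HOMA = 1 - EN - GEO with a signed EN term (assumed convention)."""
    n = len(bond_lengths)
    l_ave = sum(bond_lengths) / n
    sign = 1.0 if l_ave > l_opt else -1.0
    en = sign * alpha * (l_opt - l_ave) ** 2
    geo = alpha / n * sum((l_ave - l) ** 2 for l in bond_lengths)
    return 1.0 - en - geo


# Hypothetical "exact" geometries: reference (single, double) and a six-membered ring.
REF_EXACT = (1.467, 1.349)
RING_EXACT = [1.40, 1.38] * 3

# A method whose only error is a uniform shift of every CC bond length.
DELTA = -0.01
ref_calc = tuple(l + DELTA for l in REF_EXACT)
ring_calc = [l + DELTA for l in RING_EXACT]

homa_exact = homa(RING_EXACT, *homa_parameters(*REF_EXACT))
homa_ccp = homa(ring_calc, *homa_parameters(*ref_calc))    # consistent parameters
homa_ep = homa(ring_calc, *homa_parameters(*REF_EXACT))    # mixed with "exact" reference

print(f"exact: {homa_exact:.4f}  consistent: {homa_ccp:.4f}  mixed: {homa_ep:.4f}")
```

A uniform shift moves l_opt and l_ave by the same amount and leaves α and all bond length differences unchanged, so the consistently parameterized value coincides with the exact one, while the mixed value does not. Any error that depends on bond order or system size breaks this cancellation.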

Conclusions

In our study, we have investigated the sensitivity of HOMA to the choice of computational methods used for optimizing molecular geometries. The values of HOMA have been computed using either the experimentally based parameterizations (HOMA^EP), namely the original one of Krygowski et al. [3] and a new one based on the recently reported equilibrium geometry of trans-1,3-butadiene [22], or using the parameters derived from the geometry of trans-1,3-butadiene optimized in the same way as the studied molecules (HOMA^CCP).

We have found that the consistent approach strongly reduces the dependence of HOMA on the choice of computational method and basis set used for geometry optimization. The compensation of errors has been particularly good for the CCSD(T) method, for which the values of HOMA^CCP can be regarded as nearly error-free. For DFT calculations, the error cancellation is less complete, as the errors in the CC bond lengths of the unsaturated hydrocarbons (and especially the bond length difference between the nominally single and double bonds) depend on the size of the π-conjugated system in a way that is unique for every XC functional. Consequently, the errors of HOMA^CCP are still functional dependent. In particular, the MSE changes from negative to positive values with increasing content of non-local exchange in the hybrid functionals. The absolute errors are nevertheless small: even though PBE0 has been found to perform better than other functionals in computing HOMA^CCP (MAE < 0.04, MaxAE < 0.1), the MAE is below 0.05 for a wide range of XC functionals with a small to medium admixture of non-local exchange (from TPSSh to CAM-B3LYP). This observation is of practical importance, as it facilitates direct comparisons of HOMA^CCP obtained with different XC functionals that belong to this group. Moreover, since the errors seem to depend mostly on the content of exact exchange, one may speculate that any hybrid functional containing between 20 and 50 % of exact exchange should yield rather accurate values of HOMA^CCP. Another advantage of the consistent approach is a very strong reduction of the sensitivity of HOMA to the choice of the basis set. The values of HOMA^CCP obtained using the DZP basis set are of accuracy comparable to that of their counterparts based on much more computationally demanding calculations with the def2-TZVPP basis set.

Using the experimentally based parameters has resulted in considerable variations of the HOMA^EP values, depending strongly on the choice of both the XC functional and the basis set used for geometry optimizations of the studied molecules. In addition, the HOMA^EP values necessarily depend on the selection of experimental data concerning the geometry of 1,3-butadiene. We have shown that the original parameterization, successfully used for computing HOMA based on crystallographic data, is rather ill suited for use in combination with quantum chemical results. Not only are the HOMA^EP values obtained with this parameterization considerably overestimated for nearly all of the studied XC functionals, but the errors are also larger for the larger basis set (def2-TZVPP). These systematic, positive errors of HOMA^EP have been eliminated using the parameterization based on the experimental equilibrium geometry of the reference system, which is by definition directly comparable with the quantum chemistry results. Using the new parameterization brought about a considerable reduction of the errors of HOMA^EP, especially for the results obtained with the def2-TZVPP geometries. Several hybrid functionals (TPSSh, B3LYP, BHandHLYP, CAM-B3LYP) and both double-hybrid ones (B2PLYP, mPW2PLYP) have yielded MAE below 0.05. The errors, however, have increased twofold or more when the triple-ζ basis set was replaced by the DZP one. Among the GGA and meta-GGA functionals, PBE and TPSS showed the best performance, with errors only slightly exceeding those for the hybrid and double-hybrid functionals.

In view of the above findings, we suggest using HOMA^CCP when the input geometries are to be obtained by means of quantum chemistry calculations. We are aware that aromaticity is not a simple, rigorously quantifiable property. On the other hand, the increased consistency and comparability of results within the framework of one aromaticity index, achieved through using HOMA^CCP, is a desirable quality. For convenience, we have included the ready-to-use parameters for the CC bonds for all the studied XC functionals and the two basis sets. In the following paper, the analogous sets of parameters will be given for other bonds frequently encountered in organic systems (CN, CO, NN, CP, CS, NO).

References

Hofmann AW (1855) Proc R Soc 8:1–3
Kruszewski J, Krygowski TM (1972) Tetrahedron Lett 28:3839–3842
Krygowski TM (1993) J Chem Inf Comput Sci 33(1):70–78
Krygowski TM, Cyrański MK, Ciesielski A, Świrska B, Leszczyński P (1993) J Chem Inf Comput Sci 36:1135–1141
Krygowski TM, Cyrański MK (1996) Tetrahedron 52:10255–10264
Madura ID, Krygowski TM, Cyrański MK (1998) Tetrahedron 54:14913–14918
Krygowski TM, Cyrański MK (1999) Tetrahedron 55:11143–11148
Krygowski TM, Cyrański MK (1996) Tetrahedron 52:1713–1722
Julg A, Francois P (1967) Theor Chim Acta 7:249–259
Bird CW (1985) Tetrahedron 41:1409–1414
Stępień BT, Cyrański MK, Krygowski TM (2001) Chem Phys Lett 350:537–542
Stępień BT, Krygowski TM, Cyrański MK (2002) J Org Chem 67:5987–5992
Stępień BT, Krygowski TM, Cyrański MK (2003) J Phys Org Chem 12:426–430
Mollerstedt H, Piqueras MC, Crespo R, Ottosson H (2004) J Am Chem Soc 126:13928–13939
Portella G, Poater J, Bofill JM, Alemany P, Sola M (2005) J Org Chem 70:2509–2521
Alonso M, Poater J, Sola M (2007) Struct Chem 18:773–783
Krygowski TM, Zachara-Horeglad JE (2007) J Phys Org Chem 20:594–599
Feixas F, Matito E, Poater J, Sola M (2007) J Comput Chem 29:1543–1554
Coriani S, Marchesan D, Gauss J, Hattig C, Helgaker T, Jorgensen P (2005) J Chem Phys 123(1–12):184107
Bak KL, Gauss J, Jorgensen P, Olsen J, Helgaker T, Stanton JF (2001) J Chem Phys 114:6548–6556
Helgaker T, Gauss J, Jorgensen P, Olsen J (1997) J Chem Phys 106:6430–6440
Craig NC, Groner P, McKean DC (2006) J Phys Chem A 110:7461–7469
Gauss J, Stanton JF (2000) J Phys Chem A 104:2865–2867
Dunning TH Jr (1989) J Chem Phys 90:1007
Werner HJ, Knowles PJ, Knizia G, Manby FR, Schütz M, Celani P, Korona T, Lindh R, Mitrushenkov A, Rauhut G, Shamasundar KR, Adler TB, Amos RD, Bernhardsson A, Bernig A, Cooper DL (2010) MOLPRO, version 2010.1, a package of ab initio programs. Elsevier, Amsterdam
Stanton JF, Gauss J, Harding ME, Szalay PG (2010) CFOUR, coupled-cluster techniques for computational chemistry. CFOUR, Austin
Kveseth K, Ragnhild S, Kohl DA (1980) Acta Chem Scand A 34:31–42
Perdew JP, Schmidt K (2000) AIP Conf Proc 577:1–20
Slater JC (1974) The self-consistent field for molecules and solids. Quantum theory of molecules and solids, vol 4. McGraw-Hill, New York
Vosko SH, Wilk L, Nusair M (1980) Can J Phys 58:1200–1211
Miehlich B, Savin A, Stoll H, Preuss H (1989) Chem Phys Lett 157:200–206
Lee C, Yang W, Parr RG (1988) Phys Rev B 37:785–789
Becke AD (1988) Phys Rev A 38:3098–3100
Perdew JP, Burke K, Ernzerhof M (1997) Phys Rev Lett 78:1396
Perdew JP, Burke K, Ernzerhof M (1996) Phys Rev Lett 77:3865–3868
Boese AD, Handy NC (2001) J Chem Phys 114:5497–5503
Boese AD, Doltsinis NL, Handy NC, Sprik M (2000) J Chem Phys 112:1670–1678
Hamprecht FA, Cohen AJ, Tozer DJ, Handy NC (1998) J Chem Phys 109:6264–6271
Tao J, Perdew JP, Staroverov VN, Scuseria GE (2003) Phys Rev Lett 91(1–4):146401
Boese AD, Handy NC (2002) J Chem Phys 116:9559–9569
Zhao Y, Truhlar DG (2006) J Chem Phys 125(1–18):194101
Becke AD (1993) J Chem Phys 98:5648–5652
Adamo C, Barone V (1999) J Chem Phys 110:6158–6169
Ernzerhof M, Perdew JP (1998) J Chem Phys 109:3313–3320
Heyd J, Scuseria GE, Ernzerhof M (2003) J Chem Phys 118:8207–8215
Zhao Y, Truhlar DG (2008) Theor Chem Acc 120:215–241
Boese AD, Martin JML (2004) J Chem Phys 121:3405–3416
Becke AD (1993) J Chem Phys 98:1372–1377
Yanai T, Tew D, Handy NC (2004) Chem Phys Lett 393:51–57
Iikura H, Tsuneda T, Yanai T, Hirao K (2001) J Chem Phys 115:3540–3544
Vydrov OA, Scuseria GE (2006) J Chem Phys 125(1–9):234109
Chai JD, Head-Gordon M (2008) J Chem Phys 128(1–15):084106
Grimme S (2006) J Chem Phys 124(1–16):034108
Schwabe T, Grimme S (2006) Phys Chem Chem Phys 8:4398–4401
Dunning TH Jr, Hay PJ (1977) Methods of electronic structure theory, vol 2. In: Schaefer HF (ed) Modern theoretical chemistry. Plenum Press, New York
Dunning TH Jr (1970) J Chem Phys 53:2823–2833
Weigend F, Ahlrichs R (2005) Phys Chem Chem Phys 7:3297–3305
Frisch MJ, Trucks GW, Schlegel HB, Scuseria GE, Robb MA, Cheeseman JR, Scalmani G, Barone V, Mennucci B, Petersson GA, Nakatsuji H, Caricato M, Li X, Hratchian HP, Izmaylov AF, Bloino J, Zheng G, Sonnenberg JL, Hada M, Ehara M, Toyota K, Fukuda R, Hasegawa J, Ishida M, Nakajima T, Honda Y, Kitao O, Nakai H, Vreven T, Montgomery JA Jr, Peralta JE, Ogliaro F, Bearpark M, Heyd JJ, Brothers E, Kudin KN, Staroverov VN, Kobayashi R, Normand J, Raghavachari K, Rendell A, Burant JC, Iyengar SS, Tomasi J, Cossi M, Rega N, Millam JM, Klene M, Knox JE, Cross JB, Bakken V, Adamo C, Jaramillo J, Gomperts R, Stratmann RE, Yazyev O, Austin AJ, Cammi R, Pomelli C, Ochterski JW, Martin RL, Morokuma K, Zakrzewski VG, Voth GA, Salvador P, Dannenberg JJ, Dapprich S, Daniels AD, Farkas Ö, Foresman JB, Ortiz JV, Cioslowski J, Fox DJ (2009) Gaussian 09, revision A.1. Gaussian, Inc., Wallingford
Kertesz M, Choi CH, Yang S (2005) Chem Rev 105:3448–3481
Chou CP, Li WF, Witek HA, Andrzejak M (2010) Vibrational spectroscopy of linear carbon chains. In: Nemes L, Irle S (eds) Spectroscopy, dynamics and molecular theory of carbon plasmas and vapors: advances in the understanding of the most complex high-temperature elemental. World Scientific Publishing Company, Singapore
Song JW, Watson MA, Sekino H, Hirao K (2008) J Chem Phys 129(1–8):024117
Perdew JP, Ruzsinszky A, Tao J, Staroverov VN, Scuseria GE, Csonka GI (2005) J Chem Phys 123(1–9):062201

Acknowledgments

This research was supported in part by PL-Grid Infrastructure. The calculations were performed on Zeus: HP Cluster Platform of the Academic Computer Centre CYFRONET and on Supernova Cluster of the Wroclaw Centre for Networking and Supercomputing.

Open Access

This article is distributed under the terms of the Creative Commons Attribution License which permits any use, distribution, and reproduction in any medium, provided the original author(s) and the source are credited.

Author information

Authors and Affiliations

K. Gumiński Department of Theoretical Chemistry, Faculty of Chemistry, Jagiellonian University, Ingardena 3, 30-060, Kraków, Poland
Marcin Andrzejak & Piotr Kubisiak
Department of Chemical Physics, Faculty of Chemistry, Jagiellonian University, Ingardena 3, 30-060, Kraków, Poland
Krzysztof K. Zborowski

Corresponding author

Correspondence to Marcin Andrzejak.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary material 1 (DOC 85 kb)

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 2.0 International License (https://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

About this article

Cite this article

Andrzejak, M., Kubisiak, P. & Zborowski, K.K. Avoiding pitfalls of a theoretical approach: the harmonic oscillator measure of aromaticity index from quantum chemistry calculations. Struct Chem 24, 1171–1184 (2013). https://doi.org/10.1007/s11224-012-0148-2

Received: 19 September 2012
Accepted: 28 September 2012
Published: 14 October 2012
Issue Date: August 2013
DOI: https://doi.org/10.1007/s11224-012-0148-2


Abstract