Abstract
An outline is given of how to split the n-dimensional space of torsion angles occurring in flexible (bio-)polymers (like alkanes, nucleic acids, or proteins, for instance) into n one-dimensional potential curves. Forthcoming applications will focus on the “protein folding problem,” beginning with polyglycine.
Context
In accordance with Euler’s topology rules, molecules are considered to be composed of “vertices” (atoms, ligands, bonding sites, functional groups, and bigger fragments). Following Hückel, each vertex is represented by only one basis function. Starting from the “monofocal” hydrids CH\(_4\), NH\(_3\), OH\(_2\), FH, and SiH\(_4\), PH\(_3\), SH\(_2\), ClH as anchor units, “chemionic” Hamiltonians (of individual “chemion ensembles” and proportional nuclear charges) are constructed recursively, together with an appropriate basis set for the first five (normal) alkanes and some related oligomers like primary alcohols, alkyl amines, and alkyl chlorides.
Methods
Standard methods (“Restricted Hartree-Fock RHF” and “Full Configuration Interaction FCI”) are used to solve the various stationary Schrödinger equations. Two software packages are indispensable: “SMILES” for integral evaluations over Slater-type orbitals (STO), and “Numerical Recipes” for matrix diagonalizations and inversions. While managing with only two-center repulsion integrals, “implicit multi-center integrations” lead us to the non-empirical fundament of Hoffmann’s “Extended-Hückel Theory.”
Similar content being viewed by others
Avoid common mistakes on your manuscript.
Introduction
Different from the physical view with electrons and nuclei as basic particles, chemists consider molecules to be built-up of \(\#vch\) “valence chemions” localized at \(\#vtx\) “vertices” [1,2,3,4]. Because vertex (sub)sets mostly are sharing four “chemion pairs” (\(\#chp=\#vch/2\)), they locally show a tetrahedral geometry (van’t Hoff [5] and Le Bel [6]). Keeping in mind this “octet rule” [7], bonding topologies of singly bound molecules can be graphically illustrated by Lewis’ [8,9,10] and Langmuir’s [11] formula language, commonly used in textbooks of organic and biochemistry.
Instead of saying “electron,” we are going to use the term “chemion” for mainly two reasons:
- \(\circ \):
-
Due to their localization, chemions are characterized by an “ensemble individuality,” which severely violates the indistinguishability postulate of Schrödinger [12, 13] and Pauli [14].
- \(\circ \):
-
Speaking of “valence chemions,” we claim the separability of the “valence shell” from its “atomic core” [15,16,17]. Later, we will even dare to distinguish “valence ligand chemions” and “valence bond chemions,” which dominantly govern the molecular shape due to their responsibility for interfragmental torsions.
Such classifications emphasize the fact that physicists and chemists sometimes look at the same molecular object from different points of view. The questions “Was sind Elektronen?” and “Kann Chemie auf Physik reduziert werden?” are thoroughly discussed in articles and books of Hans Primas and Ulrich Müller-Herold [18,19,20,21,22,23,24,25]. In this context, let us just mention another example of an extraordinary view: Topological properties of the charge density lead Bader to his quantum theoretical picture entitled “Atoms in Molecules” [26,27,28,29,30].
A priori predictions of molecular bond lengths, bond angles, dihedral and torsion angles are rather sophisticated [31,32,33,34]. “In ab initio electronic-structure calculations, approximate solutions are obtained to the molecular Schrödinger equation. The errors made in such calculations arise from the truncation of the one-electron space... and from the approximate treatment of the N-electron space...” [35]. If this is so, we cannot expect high-quality predictions of geometric parameters like bond lengths and bond angles. Focussing on highly flexible organic polymers (like alkane chains, proteins, or nucleic acids), their “secondary structure” is mostly determined by low-barrier rotations around single bonds [36, 37].
The quantum chemical picture to be outlined below is irreducible [38, 39]: being based on standard theories (like “Restricted Hartree-Fock” [40, 41] and “Full Configuration Interaction” [43]) to solve the time-independent Schrödinger equation [44]
it manages with an irreducible basis set of spherical “Slater-type orbitals” (STO) [45] of only the valence shell [46, 47]. Hence, it draws a more symbolic than quantitative sketch in the spirit of Leonhard Euler (1707–1783) [2,3,4] and Erich Hückel (1896–1980) [48,49,50,51,52]. Using standard “vertex”-geometries, we hope to get an idea of pure topological effects, which arise apart from a more sophisticated quantum theoretical treatment [53].
Next, we shall try to establish some “Quantum Chemical Aufbau Principles” [54, 55]. Introducing “vertex condensations” of Roothaan’s Fock-matrix expression, we are prepared to recursively construct growing diagonal elements from precomputed fragments. A successively augmented construction set will serve us to approximately build-up higher elements from lower units. Precomputed quantities thus can be readily taken from an external storage device, yielding a “Recursively Extendable Chemion Picture” [38, 39].
Similar to the atomic “Aufbau Principles” of Niels Bohr [56, 57] and Friedrich Hund [58, 59], we want to bring some order into quantum chemical descriptions of the molecular world. Therefore, we close the present paper with an outlook to straightforward generalizations, which might help us to estimate the shape of even macromolecular chains, particularly those of biochemical interest. If it becomes possible one day, to substitute our model assumptions by more realistic geometries, appropriate orbital exponents, and above all more complex fragments (including doubly bound groups like phenyl, carbonyl, and the peptide bond), this might be an important step towards a quantum chemical description of the “protein folding problem” [60].
It is one of the basic insights of chemical taxonomy, that molecules can be ordered into compound classes. Due to the recursive nature of our “Chemionic Hamiltonians,” classifications of molecular families become evident [61], this time also quantum chemically [62, 63].
During the recent decades, considerable progress has been made in our science. However, the topic “explicitly correlated coupled cluster calculations,” for instance, is beyond the scope of our article. Focusing on standard orbital theories with linear expansions, our main tools are “Restricted Hartree-Fock (RHF)” and “Full Configuration Interaction (FCI).”
The article has four chapters: 1. The Chemical Model; 2. Solution Methods; 3. Compound Classes; 4. Outlook and Conclusions. Appendices: Anchor Data and First Results.
The chemical model
To begin with less complex examples, our paper focuses on polymerized organic chain molecules (normal, iso- and neo-alkanes; primary, secondary, and tertiary amines or alcohols, for instance). From an even more exposed viewpoint, our goal is the geometry prediction of biopolymers (like nucleic acids or proteins) with all their sensitive interactions (among them interfragmental torsions and hydrogen bonds). Always keeping in mind these background intentions, our present paper starts with some general considerations.
Based on the Born-Oppenheimer assumption [44], molecular orbital theories, in general, have to specify mainly three items:
- \(\circ \):
-
The number of chemions under consideration
- \(\circ \):
-
Position and charge of each nucleus
- \(\circ \):
-
The basis set to be used
The quantum chemical picture to be drawn in this paper is summarized through the following headlines:
-
1.1.
Euler Topologies
-
1.2.
Recursive Polymerizations
-
1.3.
Proportional Nuclear Charges
-
1.4.
Chemionic Hamiltonians
-
1.5.
Vertex Orbitals of Euler-Hückel-Type
-
1.5.1.
Anchor Orbitals, their Exponents and Locations
-
1.5.2.
Anchor Integrals
-
1.5.3.
Recursion Orbitals and Integrals
Euler topologies
“Vertices” can be synonymously understood as “bonding sites”: central atoms, hydrogen ligands, and “lone pair positions (love).” Beginning with the eight “monofocals” methane, ammonia, water, and hydrogen fluoride, as well as silane, phosphine, hydrogen sulfide, and hydrogen chloride
we can easily see, that they all confirm the Eulerian topology rule [2,3,4]:
Inspecting the Lewis-Langmuir formulae of more complex molecular frameworks, we find that the validity of this rule will always hold for “bifocals” (like ethane, methylamine, methanol, for instance), “trifocals” (like propane, ethylamine, ethanol \(\dots \)), and all the other “polyfocals” of such kind:
It even remains valid for a chain of “functional groups” like H, CH\(_2\), or CH\(_3\), NH\(_2\), OH, F, SiH\(_3\), PH\(_3\), SH\(_2\), and Cl, the formulae of which do not take notice any more of their internal chemionic structure:
The geometry of such a polymer mainly depends on the n torsion angles \(\tau \) [36, 37]. A non-gradient search for the global minimum of the corresponding n-dimensional “Born-Oppenheimer Energy Function (BOEF)” practically can only be done for n=1 or n=2 [64]. Therefore, we first have to answer the question of how to split a higher n-dimensional BOEF into n one-dimensional “potential curves.”
Recursive polymerizations
Similar to a continuous polymerization process, which stepwise adds another monomer to its precursor, we consider a recursive chain growth. Taking the normal alkanes and primary amines and alcohols as an example, and using the convenient formula language (with X \(=\) CH\(_3\), NH\(_2\), OH) [8,9,10,11], it can be written down as in Table 1.
For each n, the numbers \(\#chp\), \(\#vtx\), and \(\#nuc(1)\) remain constant; only the number \(\#nuc(2)\) (i.e., the nuclei of the second vertex) augments steadily. Due to successive “vertex condensations” \(\textrm{X}\rightarrow \mathrm{CH_2X}\rightarrow \mathrm{C_2H_4X}\rightarrow \mathrm{C_3H_6X}\rightarrow \dots \), only one torsion angle \(\tau _\textrm{n}\) has to be varied. For symmetry reasons, few of such variations may show some “degeneracy” [65,66,67,68].
Proportional nuclear charges
In an electrically neutral molecule of Eqs. 2 and 3, the total chemion charge \(2\#chp\) corresponds to the sum of “valence charges” \(\textrm{Z}_C^{val}\) of all the nuclei (i.e., the column index of Mendelejew’s and Meyer’s “Periodic Table” [69,70,71]). Inspecting Table 1, however, \(2\#chp{=}2\) chemion charges must be distributed among the \(\#nuc\) nuclei under consideration.
In order to guarantee a proportional set of positive point charges for any chemionic ensemble \(\epsilon \), we deal with “proportional valence charges” of the different nuclei [72, 73]:
The nuclear repulsion energy with respect to any chemionic ensemble \(\epsilon \) then reads as follows:
Chemionic Hamiltonians
For a given chemionic ensemble \(\epsilon \), the Born-Oppenheimer Hamiltonian reads as follows:
where \(\Delta (\textbf{r}_i)\) is the Laplace operator in Cartesian coordinates.
The components of the ground state energy
-
\(\circ \) are the nuclear Born-Oppenheimer repulsion \(\mathcal {E}^{nucl}_\epsilon \) of Eq. 6,
-
\(\circ \) and the chemionic part \(\mathcal {E}^{chem}_\epsilon \), which will be identified with the lowest “Full Configuration Interaction” energy (FCI) of singlet or triplet multiplicity:
$$\begin{aligned} \mathcal {E}^{chem}_\epsilon = {\left\{ \begin{array}{ll} {^1} \textbf{E}_1 ^\textrm{FCI} \quad \textrm{if} \ {^1} \textbf{E}_1 ^\textrm{FCI} \le {^3} \textbf{E}_1 ^\textrm{FCI}\\ {^3} \textbf{E}_1 ^\textrm{FCI} \quad \textrm{else} \end{array}\right. } \end{aligned}$$(9)
Vertex orbitals of Euler-Hückel-type
From Hückel, we adopt the idea that each Eulerian “anchor vertex” V is represented by only one basis orbital \(\textrm{n}_V^{val}\textrm{s}\) of spherical symmetry [48,49,50,51,52], also for any “lone vertex (love).” The orbital type is specified through the following [70, 71]:
which is the row index of the “Periodic Table” [70, 71].
Anchor orbitals, their exponents and locations
Orbital exponents are considered to be given by the ratio
Alternatively, we might follow the recommendations of Slater’s rules, which also incorporate an estimated “screening” [74,75,76]. The best thing to do, however, would be an exponent optimization by means of a non-linear variation procedure [77] (see Appendix A.3). Nevertheless, we like to stress that we aim at a more symbolic approach in the spirit of Erich Hückel [48,49,50,51,52].
In the tradition of van’t Hoff [5] and Le Bel [6], we consider mono-, bi-, and poly-tetrahedral model geometries (see Appendix A.1) with the following features:
- \(\circ \):
-
Bond lengths \(\lambda \) according to the Appendix A.2 [72, 73]
- \(\circ \):
-
One unique tetrahedron bond angle \( \theta := \arccos (-{1\over 3})~;\) \(\theta ^\circ \) \(= \theta \cdot {180^\circ \over \pi } \approx 109.471^\circ \)
- \(\circ \):
-
One unique dihedral angle \(\delta :={2\pi \over 3}~;\) \(\delta ^\circ = \delta \cdot {180^\circ \over \pi } = 120^\circ \)
- \(\circ \):
-
The full range of torsion angles \(\tau \): \(0\le \tau < 2\pi ~;\) \(0^\circ \le \tau ^\circ =\tau \cdot {180^\circ \over \pi } < 360^\circ \)
Due to the indistinguishability of the \(2\#chp\) chemions within any “chemionic ensemble” \(\epsilon \), we can take \(\textbf{r}_1\) as a dummy position. \(\phi _{m}\equiv \phi (\zeta _{m},\textbf{r}_1-\textbf{R}_m)\) then stands for the mth “Euler-Hückel Vertex Orbital (EHVO)” located at \(\textbf{R}_m\) with orbital exponent \(\zeta _{m}\). Remember that all EHVO are of s-type [46, 47].
In our model, hence, “lone vertices (love)” are represented by a n\(^{val}\)s-function, localized at one of the peripheric tetrahedron positions; for the distance from their center, we adopt the Bohr radius \((\textrm{B} \approx 0.529177249~{\text{\AA }})\). As an artifact, EHVO bases produce “rotational barriers” also for axial symmetric halogen compounds like F\(_2\), Cl\(_2\), and FCl.
Anchor integrals
Quantum chemical calculations, in general, have to deal with at least four types of integrals:
- \(\circ \):
-
“Overlap integrals” \(S_{(m.n)}^{(0)}\)
- \(\circ \):
-
“Kinetic energy integrals” \(K_{(m.n)}^{(0)}\)
- \(\circ \):
-
“Attraction energy integrals” \(U_{(m.n.C)}^{(0)}\)
- \(\circ \):
-
The two-chemion “repulsion energy integrals” \(R_{(m.n|p.q)}^{(0)}\)
For the one- and two-chemion anchor integrals EHVI over EHVO, we use the following notations:
Numerical evaluations are available by the SMILES package of FORTRAN routines, for instance [78,79,80].
Summing up the charge-weighted “attraction energy integrals,” we arrive at the \(\epsilon \)-dependent “attraction energy matrix” \( {V}^{(0)}_{\epsilon (m.n)} \); subtracted from \(K_{(m.n)}^{(0)}\), it yields the matrix \(H_{\epsilon (m.n)}^{(0)}\), which represents the “one-chemion energy”:
Recursion orbitals and integrals
Now let us return to the simplified Lewis/Langmuir-formulas of Table 1, which promise the successive “Aufbau” of macromolecular chains from smaller units:
As we can see, each prolongation step n is composed of two “diagonal” and one “off-diagonal” contribution (\(\alpha \alpha \), \(\beta \beta ^{(\text {n})}\), and \(\alpha \beta ^{(\text {n})}\), respectively).
In order to construct two basis functions for all the chain molecules with \(\#chp=1\), we define the following set of normalized “Recursion Orbitals”
With them, we arrive at
- \(\circ \):
-
“recursive overlap integrals” \( {S}^{(\mathrm n) }_\mathrm{(\alpha .\beta )} \)
- \(\circ \):
-
“recursive kinetic integrals” \( {K}^{(\mathrm n) }_\mathrm{(\alpha .\beta )} \)
- \(\circ \):
-
“recursive attraction integrals” \( {U}^{(\mathrm n) }_{(\alpha .\beta .C{})} \)
- \(\circ \):
-
and “recursive repulsion integrals” \( {R}^{(\mathrm n) }_{(\alpha .\beta |\gamma .\delta )} \)
They read as follows:
The “recursive attraction matrix” and the “recursive one-chemion matrix” then read as follows:
Let us illustrate some recursion relations between the diagonal \(\textbf{M}\)-matrix elements of ethane, propane, normal butane, normal pentane etc., where \(\textbf{M}\in \{\textbf{S},\textbf{K}\}\).
Starting from \( \mathrm{HCH_2}\rightarrow {M}^{(0)}_{(\alpha ,\alpha )} \) and \( \mathrm{XV_3}\rightarrow {M}^{(0)}_{(\beta ,\beta )} \)
Recursion relations for the \(\epsilon \)-dependent matrices \(\textbf{M}_\epsilon \in \{\textbf{V}_\epsilon ,\textbf{H}_\epsilon \}\) will be discussed below, in the context of Roothaan’s Fock matrix expressions.
Solution methods
In order to approximately solve the time-independent Schrödinger Eq. 1, we start from an expansion of the molecular orbitals in terms of the vertex basis. In spite of the linear nature of this expansion, such “Roothaan-Hartree-Fock Orbitals (RHFO)” have to be determined iteratively. Expanding the wave function in terms of all the constructable singlet and triplet configurations, corresponding ground state energies are available by linear variations. Non-linear variations come in through an optimization of a few orbital exponents. The methodical chapter is subdivided into three sections:
-
2.1.
Linear combinations of vertex orbitals
-
2.2.
Self-consistent Roothaan-Hartree-Fock orbitals
-
2.3.
Full configuration interaction
Linear combinations of vertex orbitals
Molecular RHFO are linear combinations of the given EHVO:
For convenience, we skipped the chain length (n) on the left side.
With this expansion, molecular “Roothaan-Hartree-Fock Integrals (RHFI)” relate to the “Euler-Hückel Vertex Integrals (EHVI)” as follows:
For large basis sets, the transformation of Eq. 21, in particular, is quite time-consuming. Due to the irreducible Euler-Hückel expansion, however, its expense is moderate.
Self-consistent Roothaan-Hartree-Fock orbitals
Roothaan’s expression of the Fock-matrix [40, 41] now reads as follows:
\(\textbf{D}_{\epsilon }^\mathrm{(n)}\) is the density matrix [40, 41] of the chemionic ensemble \(\epsilon \).
Recursion relations for the matrices \(\textbf{M}_\epsilon \in \{\textbf{H}_\epsilon ,\textbf{F}_\epsilon \}\) essentially are those of Eq. 18. This time, however, they additionly depend on the “diagonal” or “off-diagonal” ensemble parameter \(\epsilon \), as indicated above.
Given both Fock and overlap matrices \(\textbf{F}_\epsilon ^{(\mathrm n)}\) and \(\textbf{S}^{(\mathrm n)}\), respectively, the Roothaan equation
can be solved by means of standard techniques [40,41,42]. Due to the density dependence of \(\textbf{F}_\epsilon \), however, the coefficient matrix \(\textbf{C}^\textrm{RHF}\) and the diagonal matrix \(\textbf{E}^\textrm{RHF}\) of orbital energies have to be determined iteratively. Self-consistence of the density matrix has been achieved, if for successive iterations (i) and \((i-1)\) the following convergence criterion \(\sigma {(i)}\) falls below a predefined threshold:
Full configuration interaction
Seeking for solutions of the time-independent Schrödinger Eq. 1, “Full Configuration Interaction (FCI)” “is the best we can do” [43]. Although being sparse, however, the so-called “minimal basis set” [40, 41] representations \(\textbf{H}^\textrm{FCI}\) of many-chemion Hamiltonians are truly gigantic, even for rather small molecules [43].
Due to the irreducible nature of the chemionic vertex picture (with \(\#vtx=\#chp+1\)), on the other hand, the matrix dimensions \(\#sing\) and \(\#trip\) of its singlet block \({^1}{} \textbf{H}^\textrm{FCI}_\epsilon \) and triplet block \({^3}{} \textbf{H}^\textrm{FCI}_\epsilon \), respectively, are quite moderate:
The corresponding eigenvalue equations now read as follows:
Matrix elements may become zero due to group theoretical reasons, with respect to the spatial symmetry of the molecule [65,66,67,68]. Regarding the “spin operators” \(\mathcal {S}^2\) and \(\mathcal {S}_z\), matrix elements between states \({^{(2{S}+1)}_{M_S} \Psi }\) with different quantum numbers \(S\in \{0,1\}\) and \(M_S\) (with integer values \(-S\le M_S\le +S\)) vanish [44]: triplet states thus neither mix with singlet states nor with other triplet states of different \(M_S\).
For intermediately normalized wave functions [43], the singlet-type matrix elements (upper triangle) with non-vanishing expressions read as follows:
where we used the following relations known from RHF theory [40, 41]:
The triplet-type matrix elements (upper triangle) with non-vanishing expressions read as follows:
Triplet states are triply degenerate:
Compound classes
For a given “chemionic ensemble” \(\epsilon \), the Hamiltonian of Eq. 7 is mainly specified through the quantities \(\#chp(\epsilon )\) and \(\#vtx(\epsilon )=\#chp(\epsilon )+1\), which enter into a parameter chart like Table 1. Parameter charts of “iso-chemionic” molecules (such as methane, ammonia, water, and hydrogen fluoride, for instance) thus refer to some topological relationship.
If the nuclear composition is even the same, family relations become apparently stronger. Table 2 indicates that (due to identical entries) normal alkanes, iso-alkanes, and neo-alkanes, primary, secondary, and tertiary alcohols, etc. form compound classes: their members only differ through \(\{\textrm{Z}_{\epsilon C}^{prop},{C=1,\dots ,\#nuc}\}\) (i.e., the number and kind of nuclei of ensemble \(\epsilon \)) and another nuclear charge distribution, hence.
Considering chemionic intra-valence separations, on the other hand, distinctions of “functional groups” like CH\(_3\) and CH\(_2\) become evident. Most important, however, is the “chain bond” or “backbone bond” like \(\mathrm CH_3{-}CH_2\), which keeps its parameter chart constant as in Table 2. Parameter tables of identical form thus indicate a chemical relationship: molecular families or compound classes essentially show the same parameter table.
According to Primas and Müller-Herold, those quantum theoretical descriptions are preferable, which bring in some chemical evidence. For our purposes, Lorentz-relativistic kinematics, for instance, are useless; although being “better,” such a picture does not offer the desired systematic abstractions [81].
Outlook and concluding remarks
There are at least two items, which could improve our model:
- \(\circ \):
-
Making the vertex geometry more realistic through empirical bond lengths and angles [82]
- \(\circ \):
-
Optimizing the few orbital exponents by nonlinear variation [77]
Its applicability could be enlarged by adding an Euler-Hückel treatment of:
- \(\circ \):
-
Multiple bonds
- \(\circ \):
-
Aromatic rings
- \(\circ \):
-
Hydrogen bridges
In order to estimate the shape of proteins, condensed quantities for the 20 naturally occurring amino acids should enter into a data bank of fragments, together with the “peptide bond” [83]:
The side-chain R is a fragment of the naturally occurring \(\alpha \)-amino acids glycine (with R=H), alanine, serine, threonine, methionine, valine, leucine, isoleucine, phenylalanine, tyrosine, cysteine, aspartic acid, glutamic acid, arginine, lysine, histidine, tryptophan, asparagine, and glutamine. Proline has a five-membered ring instead of the \(-\underline{\textrm{N}}\mathrm{H-CH(-R)}-\) moiety.
Similar considerations will be necessary for the conformation analysis of nucleic acids, which are alternant chains of phosphoric acids and pentose sugars, each substituted by a purin or pyrimidine base [84, 85]:
- \(\circ \):
-
Deoxyribonucleic acids (DNA) have the deoxiribose (\(\mathrm C_5H_{10}O_4\)) as pentose-component, substituted by the purin bases adenine and guanine or the pyrimidines thymine and cytosine.
- \(\circ \):
-
In ribonucleic acids (RNA), the deoxyribose is replaced by ribose (\(\mathrm C_5H_{10}O_5\)), and the pyrimidine base thymine is replaced by uracil.
In contrast to polypeptides and proteins, DNA structures are better known. In 1953, after Chargaff had found the complementary pairing of adenine-thymine and guanine-cytosine, Watson and Crick proposed their famous “double helix model” [84, 85].
Protein geometries, on the other hand, are very hard to predict. For a given sequence of amino acids (called its “primary structure”), a couple of “secondary structures” may appear: \(\alpha \)-helices and \(\beta \)-strands, forming either parallel or anti-parallel sheets [83]. How such distinct domains fold into a more compact geometry is still an open question of biochemical sciences. “The general difficulties in obtaining protein structures using experimental techniques means that there is considerable interest in theoretical methods for predicting the three-dimensional structure of proteins from the amino acid sequence: this is often referred to as the protein folding problem” [60, 86,87,88,89,90,91,92,93,94,95].
While matrix dimensions can be kept constantly low during the described polymerization process, the evaluation of anchor integrals for each new torsion angle \(\tau _{\textrm{n}}\) is still a “computational bottleneck” [40, 41] − especially of the “repulsions.” Due to the recursive nature of our Euler-Hückel picture, however, some of them were already stored during the precursor step of torsion optimization (see Table 3 for \(\mathrm X=C,N,O,F,Si,P,S,Cl\) and their three vertices V):
Since
and the difference \((a-b)\) always is constant (namely 4), the task of “anchor integration” is considerably accelerated. “Linear scaling” [96] can be achieved, if we restrict ourselves to one- and two-center integrations, only.
Let us close this article with some fundamental considerations on quantum mechanics, in general. Physical theories can be characterized by an \(*\)-algebra of observables. Algebras of classical theories (like Newton’s mechanics, Maxwell’s electrodynamics, Clausius’ thermodynamics, and Einstein’s theory of relativity) are “commutative” (i.e., any two observables \(\hat{A}\) an \(\hat{B}\) interchange pairwise: \(\hat{A}\hat{B}= \hat{B}\hat{A}\)). The algebra of quantum mechanics, however, is “non-commutative” with incompatible observables: \(\hat{A}\hat{B}\ne \hat{B}\hat{A}\). Different from the sharp values of classical observables, quantum theory deals with potential properties, which cannot be actualized simultaneously. For a thorough discussion of this topic, please consult the lectures and writings of Hans Primas, particularly his famous book of 1983, entitled “Chemistry, Quantum Mechanics and Reductionism” [18,19,20,21,22,23,24,25].
Following Ulrich Hoyer, on the other hand, Schrödinger’s (stationary and time-dependent) equations, for instance, can be derived by applying the statistical rules of Boltzmann’s distribution law. Alternatively, such a “Synthetic Quantum Theory” also emerges from Liouville’s theorem. It seems, that non-compatible properties of the molecular world can be readily explained by simple probability arguments. Let us just mention that such an “Ansatz” already shows the non-existence of \(\mathrm He_2, Ne_2\) and \(\mathrm Be_2\); for the diatomics \(\mathrm H_2, Li_2, B_2, C_2, N_2, O_2\) and \(\mathrm F_2\), it also leads to an estimation of dissociation energies; the bond length of \(\mathrm{H_2}\) is predicted as \(\root 3 \of {4}\)B, where B is the Bohr radius. According to Hoyer, pioneer quantum mechanics mainly suffers from a “hýsteron-próteron”: instead of proceeding with a belated probabilistic interpretation of the postulated wave function, one should start with purely statistical axioms from the very beginning [97,98,99,100,101,102,103,104,105,106,107].
Finally, we want to acknowledge for help and friendship. Moreover, we like to add three aphorisms of a home-attached philosopher, a cosmopolitan discoverer, and an expert for “life as work of art” [108], that might be encouraging for others, too.
“Ich stehe in der Einbildung, es sei zuweilen nicht unnütze, ein gewisses edles Vertrauen in seine eigenen Kräfte zu setzen. Eine Zuversicht von der Art belebet alle unsere Bemühungen, und erteilet ihnen einen gewissen Schwung, der der Untersuchung der Wahrheit sehr beförderlich ist.”
Immanuel Kant (1746) [109]
“Die Naturphilosophie kann den Fortschritten der empirischen Wissenschaften nie schädlich sein. Im Gegenteil, sie führt das Entdeckte auf Principien zurück, wie sie zugleich neue Entdeckungen begründet. Steht dabei eine Menschenklasse auf, welche es für bequemer hält, die Chemie durch die Kraft des Hirnes zu treiben, als sich die Hände zu benetzen, so ist das weder Ihre Schuld noch die der Naturphilosophie überhaupt. Darf man die Analysis verschreien, weil unsere Müller oft bessere Maschinen bauen als die, welche der Mathematiker berechnet hat ?”
Alexander von Humboldt (1. Februar 1805) an Friedrich Wilhelm Joseph Schelling [110]
“Wer will was Lebendigs erkennen und beschreiben, Sucht erst den Geist heraus zu treiben, Dann hat er die Teile in seiner Hand, Fehlt, leider ! nur das geistige Band. Encheiresin naturae nennt’s die Chemie, Spottet ihrer selbst und weiß nicht wie.”
Johann Wolfgang Goethe: “Faust I” (Ausgabe letzter Hand 1828) [111,112,113]
Availability of data and materials
Not applicable
Change history
23 February 2024
A Correction to this paper has been published: https://doi.org/10.1007/s00894-024-05864-w
References
“Vertex” is a technical term of Eulerian topology and graph theory. Most prominent is Euler’s formula for convex polyhedra: \(V+F=E+2\), where \(V, F, E\) stand for the numbers of vertices (or corners), faces, and edges, respectively [2–4]
Spanier EH (1966) Algebraic topology. McGraw-Hill, New York & London, p 205
Flegg HG (2001) From geometry to topology. Dover, Mineola N.Y., p 40
Richeson DS (2008) Euler’s gem: the polyhedron formula and the birth of topology. Princeton University Press
van’t Hoff JH (1874) Sur les formules de structure dans l’espace. Arch Néerl Sci Ex Nat 9:445–454
Le Bel JA (1874) Sur les relations qui existent entre les formules atomiques des corps organiques et le pouvoir rotatoire de leurs dissolutions. Bull Soc Chim France 22:337–347
Jensen WB (1984) Abegg, Lewis, Langmuir and the Octet Rule. J Chem Educ 61:191–200
Lewis GN (1916) The atom and the molecule. J Am Chem Soc 38:762–785
Lewis GN (1923) Valence and the structure of atoms and molecules. Chemical Catalog Company, New York
Kossel W (1916) Über Molekülbildung als Frage des Atombaus. Ann Phys 49:229–363
Langmuir I (1919) The arrangement of electrons in atoms and molecules. J Am Chem Soc 41:868–934
“Man kann die Elektronen nicht kennzeichnen, nicht ‘rot anstreichen’, und nicht nur das, man darf sie sich nicht einmal gekennzeichnet denken, sonst erhält man durch ‘falsche Abzählung’ auf Schritt und Tritt falsche Ergebnisse” [13]
Schrödinger E (1962) Was ist ein Naturgesetz? Oldenburg, München, p 118
Pauli W (1933) Die allgemeinen Prinzipien der Wellenmechanik. In: Geiger H, Scheel K (eds) Handbuch der Physik Vol 24/1, 2nd edn. Springer, Berlin
Segal GA (ed) (1977) Semiempirical methods of electronic structure calculation, part A: techniques. Plenum Press, New York
Segal GA (ed) (1977) Semiempirical methods of electronic structure calculation, part B: applications. Plenum Press, New York
Scholz M, Köhler HJ (1981) Quantenchemie 3: Quantenchemische Näherungsverfahren. Hüthig, Heidelberg
Primas H (1964) Was sind Elektronen? Helv Chim Acta 47:1840–1851
Primas H (1983) Chemistry, quantum mechanics and reductionism. 2nd corrected edition. Springer, Berlin
Primas H (1985) Kann Chemie auf Physik reduziert werden? I. Das molekulare Programm. Chemie in unserer Zeit 19:109–119
Primas H (1985) Kann Chemie auf Physik reduziert werden? II. Die Chemie der Makrowelt. Chemie in unserer Zeit 19:160–166
Primas H (1988) Can we reduce chemistry to physics? In: Radnitzky G (ed) Centripetal Forces in the Sciences, vol II. Paragon House, New York, pp 119–133
Müller-Herold U (1985) Vom Mythos der Zahl - Anmerkungen zum Stand der Theoretischen Chemie. Chimia 39:3–10
Primas H, Müller-Herold U (1984) Elementare Quantenchemie. Teubner, Stuttgart, pp 244–258
Primas H, Müller-Herold U (1984) Elementare Quantenchemie. Teubner, Stuttgart :Kapitel 3
Bader RFW (1994) Atoms in Molecules: a Quantum Theory. Clarendon Press, Oxford
Bader RFW (1985) Atoms in molecules. Acc Chem Res 18:9–15
Bader RFW (2005) The quantum mechanical basis of conceptual chemistry. Monatshefte für Chemie 136:819–854
Bader RFW (2008) Nearsightedness of electronic matter as seen by a physicist and a chemist. J Phys Chem A 112:13717–13728
Bader RFW, Popelier PLA, Keith TA (1994) Theoretical definition of a functional group and the molecular orbital paradigm. Angew Chem Int Ed Engl 33:620–631
McWeeny R (1984) Coulsons chemische Bindung, 2nd edn. Hirzel, Stuttgart, pp 122–128
Historically, experimental \(R_{e^{-}}\) and \(D_{e^{-}}\)values of the hydrogen molecule could be correctly reproduced only through elliptic coordinate descriptions with an explicit \(r_{12^{-}}\) dependent term. Best agreements could be achieved by [33,34]
James HM, Coolidge AL (1933) The ground state of the hydrogen molecule. J Chem Phys 1:825
Kołos W, Wolniewicz L (1968) Improved theoretical ground-state energy of the hydrogen molecule. J Chem Phys 49:404
Helgaker T, Jørgensen P, Olsen J (2000) Molecular electronic-structure theory. Wiley, Chichester, p 817
Leach AR (2001) Molecular modelling \(-\) principles and applications, 2nd edn. Pearson, Harlow (England)
Höltje HD, Sippl W, Rognan D, Folkers G (2003) Molecular modeling \(-\) basic principles and applications, 2nd edn. Wiley-VCH, Weinheim
Following the invitation “Invent your own Approximate MO Method” [39]
Cook DB (1978) Structures and approximations for electrons in molecules. Ellis Horwood, Chichester
Szabo A, Ostlund NS (1996) Modern quantum chemistry. Dover, Mineola N.Y.: Chapter 3
Wagnière GH (1976) Elementary molecular orbital theory. Springer, Berlin, Heidelberg, New York
Press WH, Teukolsky SA, Vetterling WT, Flannery BP (1992) Numerical recipes in FORTRAN, the art of scientific computing, 2nd edn. Cambridge University Press
Szabo A, Ostlund NS (1996): Chapter 4
Szabo A, Ostlund NS (1996): Chapter 2
Slater JC (1930) Atomic shielding constants. Phys Rev 36:57–64
Due to their spherical symmetry, s-functions keep the approximation “rotationally invariant” [47]
Pople JA, Beveridge DL (1970) Approximate molecular orbital theory. McGraw-Hill, New York, pp 22–30
Hückel E (1930) Zur Quantentheorie der Doppelbindung. Z Phys 60:423–456
Hückel E (1931) Die Elektronenkonfiguration des Benzols und verwandter Verbindungen. Z Phys 70:204–286
Hückel E (1932) Quantentheoretische Beiträge zum Problem der aromatischen und ungesättigten Verbindungen. Z Phys 76:628–648
Streitwieser A (1961) Molecular orbital theory for organic chemists. Wiley, New York
Heilbronner E, Bock H (1968-70) Das HMO-modell und seine Anwendung I, II, III. Verlag Chemie, Weinheim
Günthard HsH, Primas H (1956) Zusammenhang von Graphentheorie und MO-Theorie von Molekeln mit Systemen konjugierter Bindungen. Helv Chim Acta 34:1645–1653
Primas and Müller-Herold call it “Baukastenprinzip” [55]
Primas H, Müller-Herold U (1984) Elementare Quantenchemie. Teubner, Stuttgart, p 227
Bohr N (1922) Drei Aufsätze über Spektren und Atombau. Braunschweig
Hoyer U (1974) Die Geschichte der Bohrschen Atomtheorie. Physik-Verlag, Weinheim
Hund F (1927) Linienspektren und periodisches System der Elemente. Berlin
Hund F (1955) Geschichte der Quantentheorie. Bibliographisches Institut, Zürich
Leach AR (2001) Molecular modelling – principles and applications. 2nd edn. Pearson, Harlow (England), p 512
Primas H (1998) Emergence in exact natural sciences. Acta Polytech Scand Ma91:83–98
“Die unbedachte Bemerkung, dass die ‘Quantentheorie die Einheit von Chemie und Physik’ hergestellt habe, verleitet leicht dazu, das Begriffssystem der Chemie als ein System minderen Ranges anzusehen oder gar zum Ableugnen all dessen, was nicht quantenphysikalisch erklärt werden kann. Damit sollte man vorsichtig sein. So ist es etwa bis heute nicht gelungen, den Substanzbegriff der Chemiker in eine strenge Beziehung zur molekularen Quantenmechanik zu setzen. Beispielsweise spielt in der chemischen Taxonomie der Begriff ‘Keton’ eine wichtige und sicherlich sinnvolle Rolle. Selbst wenn wir für den Moment einmal eine ein-eindeutige Beziehung zwischen chemisch reinen Substanzen und Einzelmolekülen akzeptieren, was eine zwar übliche, aber theoretisch zur Zeit nicht durchschaubare Hypothese ist, dann kennen wir in der Quantenchemie zwar das Propanon, das Butanon, das Acetophenon, das Benzophenon, das Diacetyl usw., aber der mit dem Begriff ‘Keton’ gemeinte strukturelle Klassenzusammenhang kann in der Quantenmechanik nicht in Evidenz gesetzt werden. Die Hamiltonoperatoren der erwähnten speziellen Ketone sind grundverschieden, Nach dem heutigen Stand der Kunst ist nicht zu sehen, wie man in der Quantentheorie Äquivalenzklassen von Molekülen einführen könnte, die cum grano salis den Stoffklassen der Chemiker entsprechen. Der Begriff der Stoffklasse konnte bis heute nicht auf die Quantenmechanik reduziert werden. Das Bedürfnis nach einer quantentheoretisch orientierten Theorie der Stoffklassen ist jedoch legitim.” [63]
Primas H, Müller-Herold U (1984) Elementare Quantenchemie. Teubner, Stuttgart, pp 310–311
Zerner MC (1996) Analytic derivative methods and geometry optimization. In: Szabo A, Ostlund NS: Appendix C
Hamermesh M (1962) Group theory and its applications to physical problems. Addison-Wesley, Reading Mass
Cotton FA (1963) Chemical applications of group theory. Interscience Publishers, New York & London
Tinkham M (1964) Group theory and quantum mechanics. McGraw-Hill, New York
Jaffé HH, Orchin M (1973) Symmetrie in der Chemie. Hüthig, Heidelberg
Atomic “valence charges” are identical with the column index of Dmitri Mendelejev’s and Lothar Meyer’s “Periodic Table” [70,71]
Zülicke L (1985) Quantenchemie 2: Atombau, chemische Bindung und molekulare Wechselwirkungen. Hüthig, Heidelberg, pp 94–180
Brock WH (1997) Viewegs Geschichte der Chemie. Vieweg, Braunschweig / Wiesbaden
If not declared otherwise, “atomic units” are used throughout in this paper [73]
Szabo A, Ostlund NS (1996) Modern quantum chemistry. Chapter 2
Besides Slater’s original recommendations [45], also other sets of orbital exponents are in common use; among them [75,76]
Clementi E, Raimondi DL (1963) Atomic screening constants from SCF functions. J Chem Phys 38:2686–2689
Burns G (1964) Atomic shielding parameters. J Chem Phys 41:1521–1522
Szabo A, Ostlund NS (1996): Chapter 1
Bouferguene A, Fares M, Hoggan PE (1996) STOP: a Slater-type orbital package for molecular electronic structure determination. Int J Quantum Chem 57:801–810
Fernández Rico J, López R, Aguado A, Ema I, Ramírez G (1998) Reference program for molecular calculations with Slater-type orbitals. J Comput Chem 19:1284–1293
Fernández Rico J, López R, Ema I, Ramírez G (2004) Efficiency of the algorithms for the calculation of Slater molecular integrals in polyatomic molecules. J Comput Chem 25:1987–1994
Primas H, Müller-Herold U (1984) Elementare Quantenchemie. Teubner, Stuttgart, p 75
Hehre WJ, Radom L, vR Schleyer P, Pople JA, (1986) Ab initio molecular orbital theory. Wiley, New York, pp 138–139
Leach AR (2001) Molecular modelling \(-\) principles and applications. 2nd edn. Pearson, Harlow (England): Chapter 10
Pauling L (1970) General chemistry. Dover, New York. Section 24–4, 781–788
Karlson P, Doenecke D, Koolman J (1994) Kurzes Lehrbuch der Biochemie für Mediziner und Naturwissenschaftler, 14th edn. Thieme, Stuttgart & New York
Future will show, if the “Recursively Extendable Euler-Hückel picture” might ever compete with latest approaches like [87–95]
Hou J, Wu T, Cao R, Cheng J (2019) Protein tertiary structure modeling driven by deep learning and contact distance prediction in CASP13. Proteins 87:1165–1178. https://doi.org/10.1002/prot.25697
Senior AW, Evans R, Jumper J, Kirkpatrick J, Sifre L, Green T, Qin C, Žídek A, Nelson AWR, Bridgland A, Penedones H, Petersen S, Simonyan K, Crossan S, Kohli P, Jones DT, Silver D, Kavukcuoglu K, Hassabis D (2019) Protein structure prediction using multiple deep neural networks in the 13th Critical Assessment of Protein Structure Prediction (CASP13). Proteins 87:1141–1148. https://doi.org/10.1002/prot.25834
Hou J, Wu T, Guo Z, Quadir F, Cheng J (2020) The MULTICOM protein structure prediction server empowered by deep learning and contact distance prediction. Methods Mol Biol 2165:13–26. https://doi.org/10.1007/978-1-0716-0708-4
Jisna VA, Jayaraj PB (2021) Protein structure prediction: conventional and deep learning perspectives. Protein J 40:522–544. https://doi.org/10.1007/s10930-021-10003-y
Pakhrin SC, Shrestha B, Adhikari B, Kc DB (2021) Deep learning-based advances in protein structure prediction. Int J Mol Sci 22:5553
Pearce R, Zhang Y (2021) Toward the solution of the protein structure prediction problem. J Biol Chem 297:100870. https://doi.org/10.1016/j.jbc.2021.100870
Jumper J, Evans R, Pritzel A, Green T, Figurnov M, Ronneberger O, Tunyasuvunakool K, Bates R, Žídek A, Potapenko A, Bridgland A, Meyer C, Kohl SAA, Ballard AJ, Cowie A, Romera-Paredes B, Nikolov S, Jain R, Adler J, Back T, Petersen S, Reiman D, Clancy E, Zielinski M, Steinegger M, Pacholska M, Berghammer T, Bodenstein S, Silver D, Vinyals O, Senior AW, Kavukcuoglu K, Kohli P, Hassabis D (2021) Highly accurate protein structure prediction with AlphaFold. Nature 596:583–589. https://doi.org/10.1038/s41586-021-03819-2
Zhou G et al (2022) Deep learning of dynamically responsive chemical Hamiltonians with semiempirical quantum mechanics. Proc Natl Acad Sci. https://doi.org/10.1073/pnas.2120333119
Ravisetti M (2022) Google’s DeepMind AI predicts 3D structure of nearly every protein known to science. https://www.cnet.com/science/biology/googles-deepmind-ai-predicts-3d-structure-of-nearly-every-protein-known-to-science/
Zaleśny R, Papadopoulos MG, Mezey PG, Leszcynski J (eds) (2011) Linear-scaling techniques in computational chemistry and physics. Springer, Berlin, Heidelberg
Hoyer U (2002) Synthetische Quantentheorie. Georg Olms Verlag, Hildesheim, Zürich, New York
Hoyer U, Sallhofer HH (2007) Fallstrick Kopenhagener Deutung. Universitas, Wien
Hoyer U (2008) Kann man die Quantentheorie verstehen? In: Ferge G (ed) Existentia, vol XVIII. Szeged. Budapest, Frankfurt/Main, Münster, Miami, pp 109–122
Hoyer U (2010) Über die statistische Bedeutung der de Broglie-Wellenlänge? In: Ferge G (ed) Existentia, vol XX. Szeged. Budapest, Frankfurt/Main, Münster, Miami, pp 35–44
Hoyer U (2012) Die statistischen Grundlagen der Quantentheorie. In: Ferge G (ed) Existentia, vol XXII. Szeged. Budapest, Frankfurt/Main, Münster, Miami, pp 89–99
Hoyer U (2014) Über die statistische Bedeutung der Heisenbergschen Unschärferelationen. In: Ferge G (ed) Existentia, vol XXIV. Szeged. Budapest, Frankfurt/Main, Münster, Miami, pp 29–36
Hoyer U (2014) Die statistischen Grundlagen der Quantenchemie. In: Ferge G (ed) Existentia, vol XXIV. Szeged. Budapest, Frankfurt/Main, Münster, Miami, pp 37–51
Born M (1927) Quantenmechanik und Statistik. Naturwissenschaften 15:238–242
Born M (1955) Die statistische Deutung der Quantenmechanik. Physikalische Blätter 11:193–202
Born M (1961) Bemerkungen zur statistischen Deutung der Quantenmechanik. In: Bopp F (ed) Werner Heisenberg und die Physik unserer Zeit. Vieweg, Braunschweig, pp 103–118
Aristoteles (1968) Metaphysik. Fünftes Buch, Abschnitt 11, Rowohlt, Hamburg
Safranski R (2013) Goethe, Kunstwerk des Lebens. Carl Hanser Verlag, München
Kant I (1746) Gedanken von der wahren Schätzung der lebendigen Kräfte und Beurteilung der Beweise derer sich Herr von Leibniz und andere Mechaniker in dieser Streitsache bedienet haben, nebst einigen vorhergehenden Betrachtungen welche die Kraft der Körper überhaupt betreffen. In: Weischädel W, (eds) (1960) Werke I, Vorkritische Schriften bis 1768. Insel Verlag, Wiesbaden, pp 18–19
Plessner H (1928) Die Stufen des Organischen und der Mensch. Suhrkamp, Frankfurt am Main (2003)
Goethe JW (1828) Faust - Der Tragödie erster Teil. Reclam, Stuttgart (1986):54
For a thorough interpretation see: Friedrich T, Scheithauer LJ (1998) Kommentar zu Goethes Faust. Reclam, Stuttgart, p 296
Understood as an anti-reductionistic comment the quotation says: through the radical splitting of a complex molecule into atomic nuclei and electrons, chemical insights get completely lost
Preuss H (1963) Quantentheoretische Chemie I. Mannheim, Bibliographisches Institut, p 74
Koch WHG (2013) Explicit and implicit multi-center integrations. Int J Quantum Chem 113:1573–1583
Koch WHG (2013) On the ‘Wolfsberg-Helmholz’ conjecture of ‘extended-Hückel theory’. Int J Quantum Chem 113:1568–1572
Wolfsberg M, Helmholz L (1952) The spectra and electronic structure of the tetrahedral ions MnO\(_4^-\), CrO\(_4^{2-}\), and ClO\(_4^-\). J Chem Phys 20:837–843
Hoffmann R (1963) An extended Hückel theory. I. hydrocarbons. J Chem Phys 39:1397–1412
Acknowledgements
Many thanks to my Mexican and German families—specially to Silvia: in memoriam of my dear parents Traudel and Walter. In addition I would like to thank the following friends and teachers, who were particularly involved in different periods of this work or who accompanied it with sympathy: Dr. Claus Kast\(^{~\dagger }\), Werner Kuetgens\(^{~\dagger }\), Reiner-Gert Nickel (Rottweil, Germany), Hans Herb\(^{~\dagger }\), Usch Büchtemann, Brigitte Wandel\(^{~\dagger }\) & Frieder Hoeckh, María de Basauri\(^{~\dagger }\), Klaus Strecker\(^{~\dagger }\), Dr. Bastian Frey, Dr. Dietrich Hoffmann, Prof. Friedrich F. Seelig\(^{~\dagger }\), Prof. Claus Kahlert, Prof. Otto E. Rössler, Dr. Frank Kuske, Prof. Günter Häfelinger, Prof. Michael Hanack\(^{~\dagger }\), Dr. Wolfgang Langer, Dra. Christine Stadler, Prof. Alfred J. Meixner, Prof. Wolfgang Maier (Tübingen, Germany), Dra. Zerrin Akkan, Prof. Heinrich R. Karfunkel, Dr. Martin Ebeling (Basel, Switzerland), Dr. Walter Nadler\(^{~\dagger }\) (Jülich, Germany), Dr. Georg A. Becker (Leipzig, Germany), Prof. Michael C. Böhm\(^{~\dagger }\) (Darmstadt, Germany), Klaus Ellwanger (Oldenburg, Germany), Prof. Manfred Fahle\(^{~\dagger }\) (Nice, France), Prof. Gerd Folkers (Zürich, Switzerland), Dr. Thomas Scior (Puebla, Mexico), Mtro. Juan Francisco Sánchez Ruiz (Ciudad de México), Prof. Per-Olov Löwdin\(^{~\dagger }\), Prof. Jean-Louis Calais\(^{~\dagger }\) (Uppsala, Sweden), and Wolfgang Bürkle (Feinmetall GmbH Herrenberg, Germany). Also, many thanks to the authors of the SMILES package (a collection of FORTRAN routines for the evaluation of quantum chemical integrals over atomic “Slater Type Orbitals” [79, 80], 1998). Their work was, is, and will be indispensable for numerical studies of the past, at present, and in future research.
Funding
Open Access funding enabled and organized by Projekt DEAL.
Author information
Authors and Affiliations
Contributions
The complete article has been written by the only author.
Corresponding author
Ethics declarations
Ethical approval
Not applicable
Conflict of interest
The authors declare no competing interests.
Additional information
Dedicated to Prof. Hans Primas $$^{\dagger }$$ † (Zürich, Switzerland), Prof. Ulrich Hoyer $$^{\dagger }$$ † (Münster, Germany), and Dr. F.-W. E.
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
The original online version of this article was revised due to many of the author’s amendments and corrections, which was sent to the Production team have not been taken into consideration.
Appendix: Anchor data and first results
Appendix: Anchor data and first results
A.1 Standard vertex geometries
Given the bond lengths \(\lambda _\mathrm{(AV_i)}\), the dihedral angle \(\delta =2\pi /3\) and with \(\vartheta \overset{\textrm{def}}{= }\pi -\arccos \textstyle (-{1\over 3})\), the five vertex position vectors of the eight monofocal hydrids AV\(_4\) read as in Table 4:
For A\(=\)C, reflecting the \(\textbf{a}_0\), \(\textbf{a}_1\), \(\textbf{a}_2\), \(\textbf{a}_3\) at the xy-plane and shifting them to the right (i.e., along the positive z-axis) by \(\lambda _\mathrm{(CC)}\), we get another four vectors \(\textbf{b}_{0}\), \(\textbf{b}_{1}\), \(\textbf{b}_{2}\), \(\textbf{b}_{3}\), which all together define the “eclipsed” conformation of ethane \(\mathrm CH_3{-}CH_3\). For a “bifocal” methyl compound \(\mathrm CH_3{-}XV_3\), in general (where X = C, N, O, F, Si, P, S, Cl), we equivalently adopt the additional coordinates of Table 5.
The extended chain \(\mathrm H_3C{\overset{\tau _\textrm{n}}{-}}(CH_2)_nXV_3\) \((\mathrm{n=1,2,3,4,5,}\) \(\dots )\) is also composed of two parts:
- \(\circ \):
-
The constant methyl unit with four vertices \(\{\textbf{a}_i|i=0,1,2,3\}\)
- \(\circ \):
-
The recursively extendable \(\mathrm (CH_2)_nXV_3\) block with 3n+4 vertices \(\{\textbf{b}^\textrm{n}_i|i=0,\dots ,3(\textrm{n}{+}1)\}\).
Altogether, hence, we deal with \( \{\textbf{c}^\textrm{n}_i|i=1,2,\dots ,3\textrm{n}{+}8\} \) vertices.
Defining a shift vector (with \(\lambda \overset{\textrm{def}}{= } \lambda _\mathrm{(CC)}\)), a rotation and a torsion matrix, respectively:
the bifocal geometries (i.e., of the \(\mathrm{H_3C}\overset{\tau _0}{-}\textrm{XV}_3\)) and the polyfocal chain structures (i.e., of the \(\mathrm{H_3C}\overset{\tau _\textrm{n}}{-}(\textrm{CH}_2)_\textrm{n}\mathrm{XV_3}\)) respectfully, are mainly defined through the flexible set of \(\textbf{b}_{i}^\textrm{n}\)-vectors:
The required torsion angle \(\tau _{\textrm{n}}\) refers to the minimum of the potential curve \( \mathcal {E}_0(\tau )\). In highly symmetric cases (as for “eclipsed” ethane (point group \(D_{3v}\)) and “staggered” ethane (point group \(D_{3d}\))) energetic “degeneracies” will occur.
A.2 Bond lengths
See Tables 6 and 7. For the heteroatomic distances \( \lambda _\mathrm{(AB)}\), we assume the following [114]:
A.3 Optimized orbital exponents
See Table 8.
A.4 Proportional nuclear charges
Due to “chemionic separations” and “vertex condensations,” different nuclear charges enter into the diagonal and off-diagonal elements of attraction energy and their Fock matrices. Corresponding considerations hold for the nuclear repulsion energies. While in “diagonal” contributions \(\textrm{V}_\textrm{CC}\) (and the corresponding \(\mathcal {E}_{\epsilon \mathrm (dia)}^{nucl}\)), \(2\#chp\) chemions have to be proportionally distributed on a subset of vertices, “off-diagonal” terms \(\textrm{V}_\textrm{AB}\) (and the corresponding \(\mathcal {E}_{\epsilon \mathrm (off)}^{nucl}\)) contribute very low nuclear charges—namely the proportional weight of only two chemions. See Tables 9 and 10.
A.5 Approximate three- and four-center repulsion integrals
In order to accelerate the described recursive procedure, we consider two additional techniques for the purpose of further simplification: an explicit and an implicit pathway.
A.5.1 Explicit integrations
For the “Coulomb-type three-center repulsions,” we write:
For the “exchange-type three-center repulsions,” we write:
With respect to the four-center integrals, in general, such distinctions cannot be made. Nevertheless, we consider four different approximations:
For the “Coulomb-type four-center repulsions,” we write:
For the “exchange-type four-center repulsions,” we write:
With the following substitutions, we get from Eq. (36\(_1\)):
With the following substitutions, we get from Eq. (37\(_1\)):
With the substitution \(c.d=a.b\), we get from Eq. (39\(_1\)):
With the substitution \(c.d=a.b\), we get from Eq. (40\(_1\)):
A.5.2 Implicit integrations
In Roothaan’s “Restricted Hartree-Fock” theory (RHF) [40, 41], a couple of two-chemion repulsion integrals \(R_{(a.b|c.d)}\) occur. Recall that
where we separated the off-diagonal two-chemion part into a three-index and a four-index contribution:
Using the formalism of the previous section, we get for the off-diagonal terms the following:
Having the same form as the Wolfsberg-Helmholz formula [115,116,117,118], the Eqs. 48–51 lay the non-empirical foundations for Roald Hoffmann’s “Extended Hückel Theory” [115,116,117,118].
The diagonal elements \(W_{\epsilon (m.m)}^\mathrm{cou(II)}\) and \(W_{\epsilon (m.m)}^\mathrm{exc(II)}\), which also contain two-chemion three-index interactions, reduce to a compact form, too:
Due to their ability of leading back repulsive multi-center interactions on only one- and two-center integrations, the above expressions yield considerable accelerations. Their use will be another important step towards a “linear scaling” of our recursive algorithm [96].
A.6 Polyfocal ground state energies, torsion angles, and rotational barriers
All the “Born-Oppenheimer Energy Functions (BOEF)” discussed here are one-dimensional singlet and triplet state energies (i.e., potential curves). Mainly interesting
- \(\circ \):
-
are their minimum \(\mathcal {E}_0^\textrm{min}\) and the corresponding torsion angle \(\tau ^\textrm{min}\)
- \(\circ \):
-
are their maximum \(\mathcal {E}_0^\textrm{max}\) and the corresponding torsion angle \(\tau ^\textrm{max}\)
- \(\circ \):
-
and the difference \(\mathcal {E}_0^{\Delta }\), which corresponds to a “barrier of internal rotation.”
A.6.1 Singlet ground state energies
See Tables 11 and 12. Note, for instance, that the “eclipsed” conformation of ethane (with \(^1\tau _0^\textrm{min}=0^\circ ,120^\circ ,240^\circ \)) is predicted to be 0.011989 a.u. (\(\approx \) 7.523 kcal/mole) more stable than the “staggered” one (with \(^1\tau _0^\textrm{max}=60^\circ ,180^\circ ,300^\circ \)). The same observation holds for \(\mathrm{H_3C}{\overset{^1\tau _0}{-}}\mathrm (CH_2)_0SiH_3 \), \(\mathrm{H_3C}{\overset{^1\tau _0}{-}}\mathrm (CH_2)_0NH_2 \) and \(\mathrm{H_3C}{\overset{^1\tau _0}{-}}\mathrm (CH_2)_0PH_2 \), \(\mathrm{H_3C}{\overset{^1\tau _0}{-}}\mathrm (CH_2)_0OH \) and \(\mathrm{H_3C}{\overset{^1\tau _0}{-}} \mathrm (CH_2)_0\) \(\mathrm SH \). Rotational barriers of methylfluoride and -chloride, however, are an obvious arfifact of our model.
A.6.2 Triplet ground state energies
See Tables 13 and 14. Looking at the energies of the triplet ground state, sometimes, the observation can be made, that the “eclipsed” conformation (with \(^3\tau _0^\textrm{min}=0^\circ ,120^\circ ,240^\circ \)) is predicted to be less stable than the “staggered” one (with \(^3\tau _0^\textrm{max}=60^\circ ,180^\circ ,300^\circ \)); namely for \(\mathrm{H_3C}{\overset{^3\tau _0}{-}}\mathrm{(CH_2)_0SiH_3} \), \(\mathrm{H_3C}{\overset{^3\tau _0}{-}}\mathrm{(CH_2)_0PH_2} \), \(\mathrm{H_3C}{\overset{^3\tau _0}{-}}\) \(\mathrm{(CH_2)_0SH} \), as well as for methylfluoride and -chloride. Rotational barriers of all these cases, however, are very low (less than 0.1 kcal/mole).
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Koch, W.H.G. Quantum chemical “Aufbau” principles: how to estimate the shape of highly flexible (bio-)polymers? A recursively extendable “chemion picture” of Euler-Hückel-type. J Mol Model 30, 47 (2024). https://doi.org/10.1007/s00894-023-05807-x
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s00894-023-05807-x