1 Introduction

In the last decade, a series of functionals has been developed [1, 2] using a reconstruction proposed by Piris [3] of the two-particle reduced density matrix (2-RDM) in terms of the one-particle RDM (1-RDM). In particular, the Piris natural orbital functional 5 (PNOF5) [4, 5] has proved to be comparable to accurate quantum chemistry methods in many cases [418]. So far this is the only natural orbital functional (NOF) that has been obtained by top-down and bottom-up methods [19]. In the bottom-up method, the functional was generated by progressive inclusion of known necessary N-representability conditions on the 2-RDM, whereas the top-down method was used through reducing the energy expression generated from an N-particle wavefunction to a functional of the occupation numbers and natural orbitals [5]. In the case of PNOF5, this wavefunction is an antisymmetrized product of strongly orthogonal geminals (APSGs), with the expansion coefficients explicitly expressed by the occupation numbers [20, 21].

The idea of a function of this type dates back to the early fifties [22, 23] and is inspired by the valence bond theory [24, 25]; in fact, PNOF5 can also be considered as a type of GVB-PP method with fixed signs for the expansion coefficients of the corresponding determinants. Many scientists have worked actively in the field of strongly orthogonal geminals, and one of them is Professor Péter R. Surján to whom is dedicated this Festschrift. Indeed, an excellent review summarizing the evolution of the geminal theory up to 1999 can be found at his work [26]. An overview of geminal-based perturbative techniques for describing electron correlation was given recently [27].

Consequently, PNOF5 is an orbital-pairing approach that takes into account most of the non-dynamical effects, but also an important part of the dynamical electron correlation corresponding to the intrapair (intrageminal) interactions. The existence of a generating wavefunction confirms that PNOF5 is strictly N-representable, i.e., the 2-RDM is derived from a function that is antisymmetric in N-particles [28]. Moreover, it demonstrates the size extensivity and size consistency of PNOF5, which is an inherent property to the generating singlet-type APSG wavefunction [29, 30].

In NOF theory [31], the solution is established optimizing the energy functional with respect to the occupation numbers and to the natural orbitals, separately. If PNOF5 is employed, the occupancies can be expressed through auxiliary variables in order to enforce automatically the N-representability bounds on the 1-RDM, so the variation can be performed without constraints. A self-consistent procedure proposed in Ref. [32] yields the natural orbitals. This scheme requires computational times that scale as n 4, n being the number of basis functions, like in the Hartree–Fock (HF) approximation. However, our implementation in the molecular basis set requires also four-index transformation of the electron repulsion integrals, which is the time-consuming step though a parallel implementation of this part of the code has substantially improved the performance of our program. Besides the matrix scaling carried out in [32], the direct inversion in the iterative subspace (DIIS) method has recently been implemented in our code, but new techniques should be considered to reduce the number of iterations required to achieve the convergence.

In spite of its promising performance, the computational cost of PNOF5 calculations prevents them for a wider use. In this sense, any strategy to reduce calculation times is highly interesting, as it would expand the range of applicability of NOF theory. The fragmentation technique has allowed to span the range of systems attainable by wavefunction-based and density-functional theories, by means of a dividing approach that allows the calculation of the whole system fragmented in subsystems. In the present paper, we use such an approach, the so-called thermodynamic fragment energy method (FEM) [33], to assess the performance of PNOF5 in the context of fragment energy calculations.

The aim of this work is to determine how PNOF5-FEM energies converge to the exact PNOF5 values for selected oligomers, namely the polyalkene chains C n H2n+2 and planar zigzag (FH) n clusters. The size-consistency property, and the fact that the functional tends to localize spatially the natural orbitals, makes this functional an exceptional candidate for fragment calculations. We demonstrate that the convergence of fragment energy calculations is very fast, especially in those cases where the interaction between fragments is small.

Table 1 PNOF5/6-31G energies E(P n ) and fragment energies \(E_{l}^{F}(P_{n})\) (Eq. 10), in a.u., for the C n H2n+2 oligomers, and the corresponding error, δ E, in kcal/mol

2 Methods

2.1 The functional

At the beginning [4], PNOF5 was formulated as an orbital-pairing approach that involves coupling each orbital g, below the Fermi level (g ≤ F = N/2), with only one orbital above F (N g  = 1). This model was further improved by a better description of the electron pair in the so-called extended PNOF5 [5], in which each orbital g was coupled with N g  > 1 orbitals above F. This pairing condition is reflected in the following sum rule for the occupation numbers:

$$\sum _{p\in \Omega _{g}}n_{p}=1;\quad g=\overline{1,F}$$
(1)

where p is the running index referring to the spatial part of natural spin orbitals and n p their occupation numbers. Notice that for spin-compensated systems the spatial part of α and β natural spin orbitals are the same, so that the total occupation number for a given natural spatial orbital is \(n_{p}^{occ}=n_{p}^{\alpha }+n_{p}^{\beta }=2\times n_{p}\) and therefore can take values between [0, 2].

In Eq. (1), \(\Omega _{g}\) is the subspace containing the orbital g and its N g coupled orbitals. It is worth to note that these subspaces are mutually disjoint \(\left( \Omega _{g_{1}}\cap \Omega _{g_{2}}=\varnothing \right)\), i.e., each orbital belongs only to one subspace \(\Omega _{g}\). The PNOF5 energy for a singlet state of an N-electron system can be cast as

$$E=\sum \limits _{g=1}^{F}E_{g}+\sum \limits _{g_{1}\ne g_{2}}^{F}\sum \limits _{p\in \Omega _{g_{1}}}\sum \limits _{q\in \Omega _{g_{2}}}E_{pq}^{int}$$
(2)

The first term of the energy (2) draws the system as independent F electron pairs described by

$$E_{g}= \sum \limits _{p\in \Omega _{g}}n_{p}\left( 2\mathcal {H}_{pp}+\mathcal {J}_{pp}\right) +\sum \limits _{p,q\in \Omega _{g},p\ne q}\Pi \left( n_{p},n_{q}\right) \mathcal {L}_{qp}$$
(3)
$$\Pi \left( n_{p},n_{q}\right)= \left\{ \begin{array}{ll} -\sqrt{n_{p}n_{q}}, \quad _{p\,=\,g\,or\,}{}_{q\,=\,g}\\ \;\sqrt{n_{p}n_{q}}, \quad _{p,\,q\,>\,F} \end{array}\right.$$
(4)

where \(\mathcal {H}_{pp}\) denotes for the one-particle matrix elements of the core Hamiltonian. \(\mathcal {J}_{pq}=\left\langle pq|pq\right\rangle\) and \(\mathcal {K}_{pq}=\left\langle pq|qp\right\rangle\) are the usual direct and exchange integrals, respectively. \(\mathcal {L}_{pq}=\left\langle pp|qq\right\rangle\) is the exchange and time-inversion integral [34], which only differs in phases of the natural orbitals with respect to the exchange integrals, so \(\mathcal {L}_{pq}=\mathcal {K}_{pq}\) for real orbitals. The interaction energy \(E_{pq}^{int}\) is given by

$$E_{pq}^{int}=n_{q}n_{p}\left( 2\mathcal {J}_{pq}-\mathcal {K}_{pq}\right)$$
(5)

Accordingly, the last term of Eq. (2) contains the contribution to the HF mean field of the electrons belonging to different pairs. It is clear that the weaknesses of this Ansatz is the absence of the interpair electron correlation. Recently [35], a new functional PNOF6, which includes interpair correlations, has been developed. The latter is able to treat orbital delocalization in aromatic systems such as in benzene, a key aspect in radical stabilization. In this work, we study systems in which such effects are not present, so it seems proper to use the PNOF5 approach. Moreover, we will limit ourselves to the simplest formulation of the functional, i.e., we consider N g  = 1. Besides, in the present paper we will fix the n associated with core orbitals to 1.

2.2 The “fragment energy method” (FEM)

We follow the “fragment energy method” (FEM) proposed by Suárez et al. [33], in which the total energy of a molecule \(\mathbf {P}\) composed by a linear chain of M interconnected fragments A i (\(A_{1}-A_{2}-_{...}-A_{M}\)) is estimated as the sum of the energy of the fragments. Thus,

$$E\left( \mathbf {P}\right)= E_{R}^{F}\left( \mathbf {P}\right) +\delta E$$
(6)

where δ E is the error committed by the approximation.

The fragment energy \(E_{R}^{F}(\mathbf {P})\) is defined according to

$$\begin{aligned} E_{R}^{F}\left( \mathbf {P}\right) &= E\left( A_{1}-B_{1}^{R}\right) -{\displaystyle \sum _{i=1}^{M-1}}E\left( Y_{i}-B_{i}^{R}\right) \\ &\quad +\,{\displaystyle \sum _{i=2}^{M-1}}E\left( Y_{i-1}-A_{i}-B_{i}^{R}\right) +E\left( Y_{M-1}-A_{M}\right) \end{aligned}$$
(7)

where \(B_{i}^{R}\) are buffer regions that include a number of atoms around the fragments according to a well-defined R-dependent criterium (i.e., a distance and a number of monomer units), and Y i are atoms or functional groups introduced to cap the boundaries upon fragmentation of covalent bonds. For details, we refer to the work of Suarez et al. [33]. The evaluation of fragment energies is less demanding computationally than the evaluation of the energy of the whole molecule. As the size of the buffer region increases, more accurate calculations are performed and smaller δ E errors are obtained, but at higher computational cost. We have to take into account that PNOF5 scales as f(m) × N 4 (where f(m) is a prefactor related to the m number of iterations), due to the required four-index transformation of the J and K integrals in the molecular orbital basis (N 4 scaling). At present, the code is time-consuming since the required consistency on the minimization over occupations and coefficients of the natural orbital lead to the need of performing several cycles (large m ). Moreover, this scaling prefactor increases with the size of the system. Therefore, there is a big advantage of using fragment energies. The aim of this work is to analyze the convergence of these errors with the fragment sizes for PNOF5 method.

Table 2 PNOF5/cc-pVDZ energies, E(P n ) and fragment energies \(E_{l}^{F}(P_{n})\) (Eq. 11), in a.u., for the (FH) n clusters, and the corresponding error divided by the number of FH units, \(\delta E/N_{FH}\), in kcal/mol

3 Results and discussion

Geometries were optimized at the B3LYP/6-31+G(d) [36] level of theory using the GAUSSIAN09 program package [37]. For the (FH) n clusters, we first optimized the (FH)8 cluster, and the rest of geometries were taken by deleting FH units, to prevent the collapse of the cluster of lower size to geometries other than zigzag ones. All PNOF5 calculations have been carried out using our computational code DoNOF with the 6-31G and 6-31G(d,p) basis set [38] and the correlation-consistent valence double-ζ (cc-pVDZ) developed by Dunning et al. [39]. The matrix element of the kinetic energy and nuclear attraction terms, as well as the electron repulsion integrals, are inputs to our computational code. In the current implementation, we have used the GAMESS program [40, 41] for this task. The convergence criteria applied for PNOF solutions is 10−8 a.u. in the energy and 10−5 for the tolerance of the hermiticity of the matrix of Lagrange multipliers λ (see reference [32] for details on the iterative diagonalization procedure).

Fig. 1
figure 1

Difference in energy between oligomer energy (\(E(\mathbf {P}_{n})\)) and fragment energies (\(E_{l}^{F}(\mathbf {P}_{n})\)) per oligomer size, namely \(\delta E/n=\left[ E(\mathbf {P}_{n})-E_{l}^{F}(\mathbf {P}_{n})\right] /n\) , in kcal/mol. Calculations done at the PNOF5/6-31G level of theory for oligomers of size n = 1, 10

3.1 Polyalkene chain C n H2n+2

PNOF5/6-31G energies for oligomers of size n = 1, 10 and PNOF5/cc-pVDZ energies for n = 1, 8 can be found in Table 1. In order to calculate fragment energies, we will particularize Eq. (7) for the C n H2n+2 oligomers. In this case, the fragments are constituted by \(A_{i}=-({\text {CH}}_{2})_{i}-\) units and the terminal –CH3. H atoms are used to cap the boundaries upon fragmentation (Y i  = H). Finally, the buffer region is constituted by a number of –CH2– units, namely \(B_{i}^{R}=-({\text {CH}}_{2})_{R}-{\text {H}}\). For this particular case, Eq. (7) can be cast as

$$E_{R}^{F}\left( \mathbf {P}_{n}\right) =E\left( \mathbf {P}_{R+1}\right) +(n-R-1)\left[ E\left( \mathbf {P}_{R}\right) -E\left( \mathbf {P}_{R-1}\right) \right]$$
(8)

where n is the order of the oligomer. Equation (8) implies that the evaluation of fragment energy of a n-size oligomer with a R-buffer region requires the calculation of three oligomers of size R + 1, R and R − 1. Therefore, the size of the buffer region determines the size of the largest oligomer in the fragment energy calculation. Obviously, the larger the R-size, the more accurate the results, but also the higher computational cost. The formula above can be further simplified by taking into account that E(P R ) − E(P R−1) is a measure of the bond energy between fragments, and this bond energy can also be estimated more accurately by E(P R+1 ) − E(P R ). Hence,

$$E_{R}^{F}\left( \mathbf {P}_{n}\right) =(n-R)E\left( \mathbf {P}_{R+1}\right) -(n-R-1)E\left( \mathbf {P}_{R}\right)$$
(9)

For convenience, we will use the index l for the largest oligomer size in fragment calculations, namely

$$E_{l}^{F}\left( \mathbf {P}_{n}\right)= (n-l+1)E\left( \mathbf {P}_{l}\right) -(n-l)E\left( \mathbf {P}_{l-1}\right)$$
(10)

In summary, the evaluation of a fragment energy requires the evaluation of the energy for two oligomers of size l and l − 1. The size of the largest oligomer in the fragment calculations is determined by the size of the buffer region according to l = R + 1.

As mentioned above, the aim of this work is to determine how fragment energies converge to the exact PNOF5 energy for a given size of the oligomer as function of l. Fragment energies, \(E_{l}^{F}(\mathbf {P}_{n})\) , for the oligomers of size (n = 3, 10) calculated with the 6-31G basis set can be found in Table 1, along with the difference between the two energies per oligomer unit, δ E/n, in kcal/mol. In Fig. 1, we represent δ E/n as a function of the largest oligomer size (l) used in the fragment calculation. Calculations were repeated with the cc-pVDZ basis set for n = (3, 8) and 6-31G(d,p) basis set for (n = 3, 6), obtaining similar results. Therefore, we will analyze the PNOF5/6-31G results for which a larger dataset of oligomers was calculated.

There is a very good convergence of the fragment calculations for any of the oligomer size. It is remarkable, that in all cases, we are within chemical accuracy (as defined by an error of <1 kcal/mol per oligomer unit) for any oligomer and fragment size calculation. For instance, if we consider the largest oligomer, namely C10H22, the δ E/n error of the fragment energies for l = 2, 9 is −0.604, 0.051, 0.021, −0.004, 0.000, −0.001, 0.000, 0.000 kcal/mol, respectively. Thus, the errors in the energy converge very fast and gradually between l = 2 to l = 6. For higher fragment sizes, we can say that the results are converged within the accuracy of the method.

As expected for a given l, the errors get bigger as the size of the oligomer increases, but it is very relevant that these increases are very low. For instance, if we consider the smallest fragment calculation (l = 2), the error obtained for the different oligomer sizes is all within the same order of magnitude. Thus, for l = 2, we obtain a δ E/n of only −0.273 kcal/mol (C3H8), −0.399 (C4H10), −0.466 (C5H12), −0.512 (C6H14), −0.545 (C7H16), −0.570 (C9H20), −0.589, −0.604 (C10H22).

As one can see from Table 1, the improvement on the basis set implies no dramatic changes, even an slight amelioration of the convergence is observed in fragment energies. There is a very fast and good convergence of PNOF5 fragment energies with fragment size. In all cases when considering \(l\approx n/2\), results are very accurate with errors within the chemical accuracy.

It is clear that linear C n H2n+2 oligomers are a very favorable case for fragment energy calculations. This is not surprising due to the nonpolar nature of the bonds between fragments, and the small interaction expected among the monomers of the chain. Therefore, we decided to investigate a less favorable case: a chain of hydrogen bonds among units with polar bonds: (FH) n .

3.2 Hydrogen-bonded chain (FH) n

This case is a prototypical system bound by a chain of hydrogen bonds. We have constructed a planar zigzag (FH)8 cluster and employed only the cc-pVDZ basis set. The incremental energies per cluster unit are depicted in Fig. 2. As the size of the cluster increases, the energy per FH unit decreases indicating some cooperativity among the whole hydrogen-bonded chain in the hydrogen bond interaction between two neighbor FH units. Notice that as the size of the clusters increases, one should reach a limiting value, still not attained by the size of the clusters of the present work.

Contrary to the alkane series, in this case, the higher the cluster size, the lower the energy per cluster unit. This is expected for a system bound by hydrogen bonds, due to the cooperative nature of the hydrogen bonding network. As the cluster size increases, the increments in energy per cluster unit tend to increase linearly. For instance, at n = 2, we obtain a value of 1.9 kcal/mol, and at n = 8, a value of 4.9 kcal/mol, at the PNOF5/cc-pVDZ level of theory. Remind that these structures are frozen.

Fig. 2
figure 2

Energy per oligomer unit (E/n) of the (FH) n clusters with respect to the energy of the hydrogen fluoride (E FH). Calculations done at the PNOF5/cc-pVDZ level of theory

Note that in this case there is no need to introduce cap atoms, since the fragments are constituted by each FH unit. The buffer region will be formed of R-number of FHs. It is straightforward to demonstrate that the fragment energies can be calculated according to a formula analogous to the one used in the previous section, namely

$$E_{l}^{F}\left( P_{n}\right) =(n-l+1)E(l)-(n-l)E(l-1)$$
(11)

with l = R + 1. Results can be found in Table 2 and Fig. 3. The first aspect to remark is that the errors in fragment energies are bigger than for the alkane series. For instance, the error \(\delta E/n\) for the (FH)8 cluster with l = 4 is −0.7739 kcal/mol, whereas for \({\text {C}}_{8}{\text {H}}_{16}\) is one order of magnitude less, 0.0239 kcal/mol. The convergence of fragment energies with l is significantly slower than in the case of C n H2n+2. The decay in \(\delta E/n\) for (FH)8 with \(l=\overline{2,7}\) is −1.678, −1.229, −0.774, −0.532, −0.309, −0.129 kcal/mol, respectively. This is a significant reduction in error but at a much lower rate than for C n H2n+2 system. On the other hand, notice that these errors show a systematic behavior, approaching the total energy of the cluster from above, due to the gradual recovery of the cooperative effects in the hydrogen bonding network as the size of the fragments increases and approaches the total size of the cluster. For a given size l, the errors also increase when the cluster grows at a higher rate than in the alkane series; for instance, for l = 2 the error in evaluating the energy for \(({\text {FH}})_{n}\) increases with the value of \(n=\overline{3,8}\) as −0.247, −0.510, −0.764, −1.033, −1.328, −1.678 kcal/mol, respectively.

Fig. 3
figure 3

Error in PNOF5/cc-pVDZ fragment energies, \(E_{l}^{F}(P_{n})\), with respect to the energy of the cluster \(E(P_{n})\) for the \(({\text {FH}})_{n}\) clusters, divided by the number of FH units in the cluster, \(\delta E/N_{{\text {FH}}}\), in kcal/mol

The origin of the worst performance of fragment energies for \(({\text {FH}})_{n}\) cluster is the interaction between fragments. The interaction between hydrogen-bonded species is of long range type. Accordingly, the interaction is extended over large number of clusters and gives rise to cooperative effects among the hydrogen bonding network. The use of fragment energies will therefore have a sizable effect. However, in the case of alkane, even though we fragment the molecule through covalent bonds, the apolar nature of the fragments makes them more amenable for fragment calculations. However, one has to highlight that the convergence is still quite good, and considering fragment calculations of size \(l\approx n/2\) , chemical accuracy is obtained for all the clusters.

4 Conclusions

We can conclude that the size-consistent and localized orbital nature of PNOF5 allows for a good performance of fragment calculations. For fragment calculations of size \(l\approx n/2\) , the chemical accuracy is obtained for all the clusters, even in the case of hydrogen bond interactions between the units. This leads to very significant computational gains, at least of one order of magnitude. Therefore, the PNOF5-FEM method could be a promising tool to extend the size of systems amenable for PNOF-type calculations.