Keywords

1 Introduction

Astronomical observations tell us that the Interstellar Medium (ISM) is rich in molecules that may become relevant in the prebiotic context.

These molecules can be produced by diverse chemical processes inside the gas phase of the ISM, but some molecules like \(\mathrm {H}_2\) (the most abundant molecule in the ISM [15, 17]) cannot be formed by simple gas-phase process [15, 17]. However, the presence inside the ISM of micron-sized particles called Interstellar Grains (IGs), made of a silicate core covered by several layers of amorphous ice mantles [3, 12, 18], are the place where H atoms get adsorbed and the excess energy dispersed into the grain when \(\mathrm {H}_2\) is finally formed. Indeed, these IGs host chemical reactions producing interstellar Organic Molecules (iCOMs) by absorbing the excess of energy due to the bond formation of the iCOMs [13], while also acting, in specific cases, as a catalyst by reducing the activation energy [7, 14]. Thus, chemical reactions at the surfaces of IGs could explain the presence in the ISM of molecules not feasible through gas phase processes.

The information about binding energies (BEs) of interstellar molecules on the grain is, therefore, essential for understanding the adsorption, movement, and desorption of iCOMs. All this is essential in order to understand the amount and distribution of iCOM in the gas-phase, as the BE controls both the diffusivity and the velocity of desorption from the grain towards the gas-phase where further processing of the iCOM will occur. Numerical models are, indeed, based on these data to provide the chemical composition of the ISM.

The BEs can be worked out in terrestrial laboratory by temperature programmed desorption (TPD), but the extreme conditions of the ISM (low temperature, low pressure, and very long time scale) hinders the real possibility to mimic the actual interstellar conditions. Quantum computation can be adopted as a useful complementary tool to the experimental data, provided that proper comparison between TPD extracted BE and those computed by quantum mechanical methods are properly made [8].

2 Methodology

2.1 Computational Method

In this work we aim at benchmarking a recently developed semiempirical quantum mechanical method (SQM) called GFN-xTB and more precisely the GFN2 method [1, 2]. This method is being developed by the Grimme’s group at the Bonn University and can be almost three orders of magnitude faster than conventional DFT methods; it can be used efficiently on simple desk computers and, indeed, every result showed in this paper was computed using a laptop computer equipped with an Intel i5 CPU and 8 GB of RAM.

We chose to use GFN-xTB not only because it is inexpensive in time and computational power, but also because it is particularly well suited for the system we want to study (IG made mostly of water ice), owning to its excellent performance in treating systems dominated by non-covalent interactions (H-bonds included), as it is the case of iCOMs adsorbed on water ice.

In a previous work [10], we showed that the method was accurate to deal with small water clusters for both energy and structures; here, we extended the applicability of the GFN2 method to model the interaction between a large number of relevant iCOMs adsorbed on a crystalline model of water ice. Our target is to establish whether GFN2 can compete with more robust and accurate DFT methods, as recently adopted by some of us [8], to model realistic and amorphous water clusters close to the real grains compared to the crystalline ice adopted here.

2.2 Computational Dataset

The set of data used to benchmark the GFN2 method for the computation of BEs was taken from Ferrero et al. 2020 work [8].

In that work Ferrero et al. [8] computed the BEs of 21 interstellar molecules, four of which were radicals. The BEs were computed using DFT/A-VTZ* method (M06-2X for radicals and B3LYP-D3 for closed shell ones).

In contrast to other computational chemistry work on BEs which use grain models composed of a few water molecules [6, 16], Ferrero et al. [8] used a crystalline (010) slab model of proton-ordered ice (P-ice) [4] to simulate the grain structure, within the paradigm of the periodic boundary conditions. This crystalline model does not properly represent IGs, known to be amorphous. Nonetheless, the periodic boundaries used in the work by Ferrero et al. [8] indicated that the hydrogen bond cooperativity, too often underestimated in small water clusters mimicking the icy grains, is accurately represented.

Ferrero et al. [8] also adopted an amorphous periodic water model to study the BEs. This amorphous water slab is closer to the expected structure of IGs compared to a crystalline structure, but is not taken into account here as it is too difficult to simulate by a finite cluster (vide infra) as we did for the crystalline ice model.

Fig. 1.
figure 1

Slab cut from the proton-ordered P-ice crystal used by Ferrero et al. 2020 [8]. Top: Slab seen from the ab plane with the dO and dH binding sites indicated inside the blue rectangle. The net dipole moment is indicated by a black arrow. Center: Slab seen from the ac plane. Bottom: Electrostatic potential mapped on the electron density of the slab. Negative regions are in red and positive regions are in blue. (Color figure online)

2.3 Crystalline Slab Model

We cannot perform a one by one comparison with the Ferrero et al. [8] results as the periodic boundary condition (PBC) implementation in GFN-xTB does not include the GFN2 hamiltonian. Furthermore, the PBC coding in the xTB program is still under development. Therefore, we prefer to use a large molecular cube envisaging 84 water molecules (see Fig. 1) cut out from the periodic crystalline slab by Ferrero et al. [8].

This slab ensures that the dangling hydrogen (dH) and oxygen (dO) shown in Fig. 1, and acting as a binding sites, are properly embedded within surrounding layers of water molecules.

To simulate geometric constraints due to the long-range effects of the periodic boundaries applied to the a and b axis, we fixed the H atoms of the water molecules at the border of the cluster to the periodic conditions, while leaving free to move the H atoms within the core of the cluster. This ensures that the dangling hydrogen dH, used in partnership with the dangling oxygen dO as a binding site for the different chemical species, is unconstrained and free to move. These constrains will also avoid the collapse of the structure during geometry optimisation due to the missing water molecules at the frontiers of the cluster.

2.4 Binding Energies and Role of the ZPE Correction

For each chemical species to be studied we performed a GFN2 optimisation using the “extreme” level of optimisation of GFN-xTB. For each molecule, three optimisations were performed, one for the molecule alone, one for the slab alone, and one for both the slab and the molecule in interaction. The starting positions of the molecules above the binding site were as similar as possible to that adopted by Ferrero et al. 2020 [8] and optimised using GFN2 to check if the GFN-xTB positions after optimisation matched that of the DFT/A-VTZ* calculations. If the final position were significantly different, the starting position of the molecule was changed by trying to reproduce the final position of Ferrero et al. [8] instead of the initial one. The computation was then restarted (in the remainder of this article results for which the final position has been used will be followed by the suffix “-end”. For example \(\mathrm {N}_2\) indicates results using the starting position and \(\mathrm {N}_2\text {-}\mathrm {end}\) indicates results using the final position). This way, we were able to check if the method could maintain this position as a stable one.

The BEs of these chemical species were computed as follow:

$$\begin{aligned} BE = E_{mol} + E_{slab} - E_{mol+slab} \end{aligned}$$
(1)

Where \(E_{mol}\), \(E_{slab}\), and \(E_{slab+mol}\) are the GFN2 energies after optimisation of the molecule, the 84 water molecules slab with the constrains applied, and the molecule interacting with the slab, respectively.

To confirm that the optimised geometries are minima on the potential energy surface we computed the harmonic vibrational frequencies and checked that all frequencies were real. In case of imaginary frequencies, xTB provides a perturbed geometry which, after optimisation and further vibrational frequency calculation, may end up in a minimum.

From the set of frequencies, also the zero point energy (\( ZPE \)) is provided for each component of the BE as:

$$\begin{aligned} \varDelta ZPE = ZPE _{slab+mol} - ZPE _{mol} - ZPE _{slab} \end{aligned}$$
(2)

We can then obtain the zero point energy corrected binding energies \(\mathrm {BE(0)}\) by subtracting \(\varDelta ZPE \) to the original BE:

$$\begin{aligned} BE \left( 0\right) = BE - \varDelta ZPE \end{aligned}$$
(3)

In general, the \(\varDelta ZPE \) is a positive quantity, therefore the \( BE \left( 0\right) \) is always smaller than the uncorrected one.

3 Results and Discussion

The adoption of a finite water cluster to mimic the PBC is not drawback-free. The first one, is the presence of a dipole moment of 14.4 Debye in the a-b plane of the slab and directed along the a cell axis (see Fig. 1). This dipole moment is exactly cancelled out when rigorous PBC are adopted, as in the work by Ferrero et al. [8]. In our case it could alter the behaviour of the most dipolar molecules of the set. The second one, is the fact that even if we try to mimic the long range effects imposed by the periodic boundaries conditions by applying constrains on the slab, this does not include the infinite hydrogen bond cooperativity of the real crystalline material. As a result, the final positions of these molecules could also be impacted.

Table 1. GFN2 BE, zero point energy corrected BE(0) and \(\Delta \mathrm {ZPE}\) values (in kJ/mol) for all considered species. For \(\mathrm {H}_{2}\mathrm {CO}\) and \(\mathrm {HCONH}_2\) “SC1” and “SC2” refer to different initial positions used for the molecules on the binding sites by Ferrero et al. 2020 [8].

The binding energy values are given in Table 1.

GFN2 results are in good agreement with the DFT ones except for some unusual cases, mainly: \(\mathrm {CH}_3\), CO, HCO, OH, and \(\mathrm {O}_2\). All of them, except CO, are radicals (the ground state of \(\mathrm {O}_2\) is a triplet) making them highly reactive. Thus, they demand a level of theory that GFN2 cannot provide.

Fig. 2.
figure 2

Left: DFT//DFT results for CO. From Ferrero et al. 2020 [8]. Centre: GFN2 results for CO. Right: \(\mathrm {H}_2{\mathrm {O-CO}}\) dimer with the starting geometry from B3LYP-D3/aug-cc-pVTZ on top and the GFN2 optimised geometry at the bottom. Dotted lines were added to emphasise the H--C and C--O interactions.

The particularly intriguing electron configuration of CO gives to the molecule a very weak dipole moment (with the negative pole located at the C atom) and a large quadrupole. The experimental dipole moment is 0.11 D [11], with carbon atom carrying the negative end despite being less electronegative than oxygen (see Ref. [9]). GFN2 overestimates the dipole moment (0.615 D) and swaps the charges (+0.0049 e on carbon) with respect to accurate calculations. This affect the results as, usually, CO interacts preferentially from the C-end when involved in H-bond interactions (see Fig. 2 Left) or with net charges like \(\mathrm {Na}^+\). Unfortunately, GFN2 is unable to treat such a delicate bonding situation for CO interacting with the grain (see Fig. 2 Centre). Even when CO is put in interaction with a single water molecule, the optimised structure differs dramatically from the optimum B3LYP-D3/aug-cc-pvTZ one taken as internal reference, due to the bad description of the electronic structure of CO (see Fig. 2 Right). This is very unfortunate, owning to the general good performance of GFN2, and due to the extreme relevance of CO in the astrochemical context (the second most abundant gas-phase molecule in the ISM).

Figure 3 shows the comparison between the structures of a subset of the considered molecules computed at GFN2 with that resulting by the DFT optimization by Ferrero et al. The agreement is, in general, very good considering the speedup due to GFN2. Also, the critical case of the methyl radical is well reproduced.

Fig. 3.
figure 3

Comparison between DFT structures (in parenthesis) from Ferrero et al. 2020 [8] and GFN2 results for selected molecules. Distances in unit of Å.

Fig. 4.
figure 4

Up: Linear fit (black line) between BEs using DFT//DFT results available in Ferrero et al. 2020 [8] and BEs using GFN2. The grey line represents the “ideal line”. Down: Same as up except the BEs are ZPE corrected in both cases.

Figure 4 compares the GFN2 BEs against the DFT ones. We exclude from the correlation these unusual cases: \(\mathrm {O}_2,\mathrm {CO}, \mathrm {OH}, \mathrm {CH}_3\), simply showing the more common molecules of the set. For both the \(\mathrm {BE}\) and \(\mathrm {BE(0)}\) the linear correlation is close to the “ideal line”, i.e. the trend line we would have obtained if the GFN2 results were exactly equal to the DFT ones computed by Ferrero et al. [8]. The good agreement is also evidence that the relative high dipole moment in the crystalline ice plane does not affect the GFN2 BE to a significative extent.

Fine analysis of data in Fig. 4 allows to detect specific cases with larger deviations, namely:

  • HCl: +12.2 kJ/mol (22.6%) for the BE (when compared to DFT//DFT results) and +10.8 kJ/mol (22.7%) for the ZPE corrected \(\mathrm {BE(0)}\) (when compared to ZPE corrected DFT//DFT results)

  • HCN: −7.14 (16.8%) for the BE and −10.4 (24.9%) for the \(\mathrm {BE(0)}\)

  • \(\mathrm {CH}_3{\mathrm {CN}}\): −9.85 (15.7%) for the BE and −9.64 (17.0%) for the \(\mathrm {BE(0)}\)

  • \(\mathrm {H}_2{\mathrm {CO}}\text {-}\mathrm {SC2}\): −9.22 (17.4%) for the BE and −7.95 (18.1%) for the \(\mathrm {BE(0)}\)

  • HCOOH: +7.86 (10.0%) for the BE and +9.76 (13.6%) for the \(\mathrm {BE(0)}\)

Fig. 5.
figure 5

Left: DFT//DFT results for HCN. From Ferrero et al. 2020 [8]. Right: GFN2 results for HCN.

In this list, only HCN shows a significant difference in the final molecule position compared to Ferrero et al. 2020 [8] (as can be seen on Fig. 5). The nitrogen is still accepting the hydrogen bond from the dH, but the HCN hydrogen does not make a H-bond with the dO going toward an adjacent dangling oxygen. This is due to the delicate balance between the H-bond donor/acceptor geometrical features which is handled differently by GFN2 compared to DFT. Therefore, HCN is probably a critical molecule for GFN2 for the same reasons explained for the CO case, as the triple bond electronic features are difficult to describe.

Fig. 6.
figure 6

Top: Linear fit between GFN2 BE and \(\mathrm {BE(0)}\). Bottom: Linear fit between GFN2 \(\Delta \mathrm {ZPE}\) and DFT//DFT \(\Delta \mathrm {ZPE}\).

Correcting the BE for ZPE is important in the astrochemical context of the cold (10K) molecular clouds. Unfortunately, the frequency calculations can be quite demanding, due to the calculation of the expensive Hessian matrix. Therefore, it is important to establish how much the BE are affected by the ZPE correction. Figure 6 shows, indeed, that a very robust linear correlation exists between the two set of \(\mathrm {BE}\) and \(\mathrm {BE(0)}\) values. This implies that \(\mathrm {BE(0)}\) values can be arrived at without actually computing the vibrational frequencies, by a simple re-scaling of the BE values. This has important consequences for the future calculations in which the BE of iCOMs will be computed with respect to large amorphous icy grains than the present crystalline model. Even more important is to establish whether the \(\Delta \mathrm {ZPE}\) computed by GFN2 (very cheap) correlates with the corresponding DFT ones reported by Ferrero et al. [8] (very expensive). The graph in the bottom part of Fig. 6 reveals that the correlation is only coarse: unfortunately, one cannot adopt the GFN2 \(\Delta \mathrm {ZPE}\) to correct the DFT BE.

4 Conclusion

In this work we benchmarked the GFN2 method provided by the GFN-xTB program developed by the Grimme’s group at the Bohn university, against DFT results from the work by Ferrero et al. 2020 [8]. The benchmark consisted in comparing structures and binding energies (BE) of a large set of interstellar molecules adsorbed at the surface of a crystalline, proton ordered ice cluster grain model. This cluster was cut out from the periodic P-ice adopted by Ferrero et al. [8] as a crystalline model for the interstellar ice. We also extended the comparison to study the role of zero point energy correction to the electronic BEs. The comparison is biased by the adoption of a molecular cluster of 84 water molecules to represent an otherwise periodic crystalline model as reported by Ferrero et al. 2020 [8]. Nevertheless, the results showed excellent performance of the GFN2 for both structures and BE, with some important exceptions, namely radical species and the CO molecule. As many radical species are also challenging for DFT functionals, we did not expect good GFN2 performance. The bad behaviour for CO is, instead, much more serious, due to the relevance of this molecule in the ISM. Unfortunately, GFN2 badly fails in predicting even the corrected structure, in which CO is H-bonded through its carbon end with the dH of the ice surface.

We also showed that the zero point energy corrected \(\mathrm {BE(0)}\) nicely correlates with the uncorrected \(\mathrm {BE}\) ones, allowing to skip the very expensive calculation of the Hessian matrix in future applications of GFN2 for the larger icy grain models.

In conclusion, we are optimistic about the application of the GFN2 method to model large icy grain models closer in size to the real grain than what has been simulated up to now. The excellent performance for structures and BE also allows to use GFN2 as a promising low model method in a refined ONIOM approach [5], in which high DFT/MP2/CCSD(T) level of theory for a limited portion of the icy grain where the adsorption takes place are adopted to compute very accurate BE values. This is the subject of future work by our group.