Introduction

Amino acids form a numerous class of compounds that counts over 500 substances existing in nature. Of them, the most important are 22 proteinogenic alpha-amino acids occurring in the genetic code. Hereafter, we are focussing to a series of linear (catena) aliphatic amino acids starting from the simplest member–glycine, β-alanine, γ-aminobutyric acid and δ-aminovaleric acid.

Glycine (aminoacetic acid, C2H5NO2) is a white solid, solubility in water 25 g/100 cm3, octanol/water partition coefficient logPow = − 3.21, acidity constants pKa = 2.34 (carboxyl) and 9.60 (amino). β-alanine (3-aminopropanoic acid, C3H7NO2) is an isomer to the α-alanine; it is a white solid, solubility in water 54 g/100 cm3, logPow = − 3. 05, pKa = 3.55 (carboxyl) and 10.24 (amino). GABA (4-aminobutanoic acid, γ-aminobutyric acid, C4H9NO2) is a white solid, solubility in water 150 g/100 cm3, logPow = − 3.17, pKa = 4.02 (carboxyl) and 10.56 (amino). It contains 3 rotatable bonds. A continuation of this series is DAVA (δ-aminovaleric acid, 5-aminopentanoic acid, C5H11NO2), solubility in water 100 g/100 cm3, logPow = − 2.63, pKa = 4.27, 4 rotatable bonds.

All species under study can exist in the canonical amino acid forms, or in the forms of zwitterions with spatially separated -NH3+ and -COO groups (Fig. 1). In solutions, complex equilibria between them depending upon pH exist. Solid-state zwitterionic forms are obtained by crystallisation from the neutral water solutions. The situation is much more complex: for instance, in the solid state, three anhydrous polymorphs of glycine were identified along with the dihydrate form (Drebushchak et al. 2002; Iitaka 1960, 1959, 1961; Jönsson and Kvick 1972; Kvick et al. 1980; Xu et al. 2017).

Fig. 1
figure 1

Structural formulas of linear amino acids including their zwitterionic forms. Colour code: white–H, black–C, blue–N, red–O

Glycine is one of the proteinogenic amino acids and is integral to the formation of alpha-helices in secondary protein structure due to its compact form. Glycine is the most abundant amino acid in collagen forming triple-helices. In the nervous system, glycine is also an inhibitory neurotransmitter. β-Alanine is a part of several key components of the intermediary metabolism, such as carnosine and of pantothenic acid (vitamin B5), which itself is a component of coenzyme A. The highest concentration of β-alanine is in skeletal muscles, heart and brain. Its pharmacological administration increases performance of muscle and reduces fatigue in athletes. GABA is the main inhibitory neurotransmitter in the developmentally mature mammalian central nervous system. Its principal role is reducing neuronal excitability throughout the nervous system (Boonstra et al. 2015). GABA deficiency is associated with several neurological and psychiatric diseases, such as epilepsy, schizophrenia and anxiety. DAVA is a delta-amino acid with a weak GABA agonist activity (Kristiansen et al. 1992). DAVA is also important in haem synthesis, and its increased level is significant in diagnosis of porphyria.

Amino acids are non-innocent with respect to redox processes. They reduce the iron(II) salts under anaerobic conditions to yellow nanoparticles of Fe(0) which, exposed to air, are readily oxidised to a mixture of dark-coloured iron oxides (Matelková et al. 2013; Klačanová et al. 2013; Vatrál et al. 2014).

An inspection of the databases (PubChem, DrugBank, HMDB—Human Metabolome Database) confirms that there a gap in the physicochemical properties of amino acids. That is our main motivation to fill this gap.

High-quality ab initio calculations that include a part of the correlation energy are very effective in studying the electronic structure and molecular properties of small bioactive molecules including amino acids (Bohórquez et al. 2003; Chipot et al. 1993; Flaig et al. 2002; Head-Gordon et al. 1997; Matta and Bader 2003; Matta 2014; Popelier and Aicken 2003). The aim of the present work is to calculate an extended set of non-local molecular descriptors associated with the electron transfer that involves the adiabatic ionisation energy, electron affinity, molecular electronegativity, chemical hardness and electrophilicity index using a consistent methodology. As these species are redox active, the absolute redox potentials on reduction and/or oxidation were enumerated based upon the reaction Gibbs energy. Which molecular descriptor correlates with the absolute reduction and oxidation potentials serves for a hypothesis. The non-local non-additive molecular descriptors were compared along the series showing which of them behave as extensive, varying in match with the molar mass and/or separation of the carboxyl and amino groups. In total, 24 species were investigated by B3LYP, 12 by MP2 and 24 by DLPNO-CCSD(T) methods including neutral molecules, molecular cations and molecular anions for canonical and zwitterion forms of linear amino acids in water as a solvent.

There is an obstacle associated with the molecular geometry–a huge number N3 of possible rotamers for N rotatable C–C bonds. Along the studied series, N increases as 1, 2, 3 and 4. The number of conformers is even higher owing to a various attachment of the hydrogen atoms. One “wrapped” structure of DAVA is visualised in Fig. 2. However, nowadays there are automatic tools such as CREST (Conformer-Rotamer Ensemble Sampling Tool) based on the XTB Semiempirical Extended Tight-Binding for a fast sampling of possible rotamers (Precht et al. 2020).

Fig. 2
figure 2

A wrapped structure of DAVA

Methods

ORCA software, ver. 5.0.4 (Neese et al. 2012, 2020, 2022) has been used for ab initio and DFT calculations of molecular properties and electronic structure of amino acids under study. Actually, the Density Functional Theory with B3LYP hybrid functional was applied (method M1). Alternatively, the HF-MP2 (Hartree–Fock + 2nd-order Many-Body Perturbative Theory with Moller–Plesset partitioning) calculations were exploited (method M2). The basis set def2-TZVPD (triple-zeta-valence with polarisation and diffuse functions) has been applied with the contraction scheme of primitive Gaussians C-, N-, O-{621111/4111/111/1} and H-{311/11} where s-, p-, d- and f-shells are separated by a slash (Weigend and Ahlrichs 2005). The solvent effect has been simulated by Conductor-like Polarizable Continuum Model (CPCM) (Takano and Houk 2005) where the relative permittivity of water was εr  = 80.4. The complete vibrational analysis has been performed after the full geometry optimization where no imaginary frequencies were detected.

As a first step, the molecular structures of the amino acids were retrieved from PubChem database; they were used as a starting point for the full geometry optimization until the gradient criteria indicated the global energy minimum. In the case of zwitterionic forms, the crystallographic CCDC data served as an input for the geometry optimization. Then the molecular properties were evaluated: the energies of the HOMO (the highest occupied molecular orbital) and LUMO (the lowest unoccupied molecular orbital), the permanent dipole moment p, the quadrupole moment Q and the dipole polarizability α. The complete vibrational analysis (indicating no imaginary frequencies) facilitated evaluating the electronic, vibration, rotation and translations partition functions allowing to evaluate the zero-point vibration energy, inner energy U, enthalpy H, entropy S and Gibbs energy G. Absolute redox potential is calculated using the reaction Gibbs energy on oxidation and/or reduction Eabsø(L0/Lq) [V] = – ∆rGø[J mol−1]/zF, for F = 96,485 C mol−1 and z = 1.

The positively and/or negatively charged open-shell systems in the unrestricted variant (UKS or UHF) at the optimised geometries of the neutral species and molecular ions allow evaluating the properties which characterise the redox processes forming ionised forms: (i) adiabatic ionisation energy Ei = E+E0 (for electron withdrawal); (ii) electron affinity Eeg = EE0 (for electron attachment); (iii) the chemical potential expressed by the molecular electronegativity \(- \mu = \chi = (E_{{\text{i}}} {-}E_{{{\text{eg}}}} ) \, /{2}\) (electronic gradient, a driving force for electron transfer); (iv) chemical hardness \(\eta = (E_{{\text{i}}} + E_{{{\text{eg}}}} ) \, /{2}\) (electronic force constant, resistivity against change of electron density); (v) the electrophilicity index ω = χ2/2η (electrophilic power), (Sen 1993; Pearson 1977; Parr 1999). The present methodology follows detailed studies on solvation of the glycine molecule, its molecular cation and molecular anion using B3LYP as well as HF-MP2 methods (Boča et al. 2023).

More advanced quantum chemical methods are at the disposal, as implemented in ORCA package. However, the target quantity–the redox potential requires the reaction Gibbs energy that is evaluated after a complete vibrational analysis preceded by the full geometry optimization. DLPNO-CCSD(T) method (Domain-based Local Pair Natural Orbitals–Coupled Cluster Singlets + Doublets + Triplets) is slow when the geometry optimization is activated: for numerical evaluation of gradient in each step of the geometry optimization (dozens to hundreds steps), the energies for a huge number of displacements need to be calculated (for instance 114 displacements for DAVA). Then the time requirement is ca 105 times higher than for a single point calculation that is comparable to more approximate methods like B3LYP (Liakos and Neese 2015; Sandler et al. 2021; Mallick et al. 2020; Garcia-Rates et al. 2021; Paiva et al. 2020). DLPNO-CCSD(T) calculations (method M3) were done in the aug-cc-pVTZ basis set for O-, N-, C-19s6p3d2f contracted to 5s4p3d2f pattern {88111/3111/111/11}, and H-6s3p2d → 4s3p2d{3111/111/11}; auxiliary basis set was aug-cc-pVTZ/C for O-, N-, C- 9s7p6d4f2g {111111111/1111111/111111/1111/11} and H-5s4p3d2f {11111/1111/111/11} (Kendall et al. 1992; Weigend et al. 2002). Augmented basis set is essential for a reliable description of molecular anions.

Results and discussion

As a result of the geometry optimization, the calculated energies of amino acids in water are listed in Table S1. They include 24 molecules and/or molecular ions calculated by B3LYP/def2-TZVPD method, 12 entries by MP2/def2-TZVPD method and 24 ones by DLPNO-CCSD(T)/aug-cc-pVTZ. The optimised geometries by B3LYP method are drawn in Tables S2 and S3.

Based upon the B3LYP total electronic energy, glycine and β-alanine are more stable in their zwitterionic forms relative to their canonical counterparts (in water): ∆E = E(Z0)–E(A0) = − 3.9 and − 1.1 kcal mol−1, respectively. For GABA and DAVA, on the contrary, a more stable are the canonical amino acid forms: ∆E = 0.2 and 0.4 kcal mol−1. This finding cannot be overemphasised since we are comparing only two of a huge number of isomers. The MP2/def2-TZVPD calculation gave ∆E = − 4.6 and − 2.0 kcal mol−1 for 5 and 6; ∆E =− 2.9, 0.1, 1.3, and 1.7 kcal mol−1 for 710 calculated by DLPNO-CCSD(T)/aug-cc-pVTZ.

The calculated total energies of the neutral molecule and its cations and anions allow determining the adiabatic ionisation energy Ei and adiabatic electron affinity Eeg. Notice, these quantities have little to do with the energies of HOMO and LUMO for the neutral molecule (the Koopmans’ theorem could be valid for the vertical ionisation process). The quantities referring to the adiabatic oxidation and/or reduction processes are listed in Table 1. A comparison of data calculated for the canonical forms brings the following findings. (i) The highest ionisation energy possesses the glycine molecule (147 kcal mol−1 for 1A), whereas three remaining molecules have a bit lower values (139, 137 and 136 kcal mol−1 for 2A –4A). (ii) The MP2 values of ionisation energies are a bit higher relative to B3LYP data (151 and 146 kcal mol−1 for 5A and 6A). (iii) All values of electron affinities are negative (in water as a solvent). (iv) High ionisation energy manifests itself in high molecular electronegativity (χ = 86 kcal mol−1 is highest for glycine). (v) More negative electron affinity causes the drop of the chemical hardness (η = 54–55 kcal mol−1) showing that 2A through 4A molecules are “softer” than glycine. (vi) The electrophilicity index reflects the ability of the species to undergo reduction; it is lowest for glycine (ω = 61 kcal mol−1) which matches the lowest reduction potential along the series (Erø = 1.21 V).

Table 1 Molecular descriptors calculated by DFT-B3LYP (1–4), HF-MP2 (5, 6) and DLPNO-CCSD(T) (7–10) methods using adiabatic ionisation/affinity processes in watera

The molecular properties related to the electroneutral molecules vary systematically with the bulkiness of the species: the isotropic value of the quadrupole moment Q = − 23, − 27, − 34 and − 37 ea02, the isotropic value of the dipole polarizability α = 57, 74, 91 and 108 a03, the solvated surface S and solvated volume V, the zero-point vibrational energy Ezpe = 49.6, 67.2, 85.1 and 102.7 kcal mol−1, and the total entropic terms S·Tø = 22, 25, 27 and 29 kcal mol−1. The dipole moment p = 1.75, 1.93, 1.70 and 1.82 D does not vary systematically because it depends upon the delicate spatial separation of the barycenters of opposite charges.

A comparison of data calculated by MP2 method with those obtained by B3LYP shows some differences which, however, are not principal and do not alter the conclusions: the calculated bulk properties (Q, α, S, V, Ezpe, S·Tø) are almost the same. The main difference lies in the energies of one-electron orbitals HOMO and LUMO which have no physical impact (e.g. for glycine: − 163 vs − 260 for HOMO, and − 3 vs + 38 kcal mol−1 for LUMO); these, however, have no impact to observables, such as ionisation energies and electron affinities. The adiabatic ionisation energy and the electron affinity are little affected when comparing B3LYP and MP2 data (for glycine: Ei = 147 vs 151, Eeg = − 25.1 vs − 25.0 kcal mol−1). The derived electronic properties, such as molecular electronegativity, chemical hardness and electrophilicity index, are very similar (for glycine: χ = 86 vs 88, η = 61 vs 63, ω = 61 vs 61 kcal mol−1); the same hold true for the absolute reduction potential (Erø = 1.21 vs 1.20 V).

The molecular descriptors calculated by DLPNO-CCSD(T) method are restricted to single-point energy calculations (including molecular ions), i.e. the dipole moment, quadrupole moment, ionisation energy and electron affinity along with derived electronic parameters, such as the molecular electronegativity, chemical hardness and the electrophilicity index. These data are almost the same as obtained by MP2 (for glycine: Ei = 151 vs 149, Eeg = − 25.0 vs − 27.3 kcal mol−1; for β-alanine: Ei = 146 vs 144, Eeg = − 24.6 vs − 26.7 kcal mol−1). Then the derived electronic properties are also analogous (for glycine: χ = 88 vs 88, η = 63 vs 61, ω = 61 vs 63; for β-alanine: χ = 85 vs 85, η = 61 vs 59, ω = 60 vs 62 kcal mol−1).

When comparing all four members calculated by DLPNO-CCSD(T) method (7 through 10), some systematic trends are recognised (all energy data in kcal mol−1): (i) the ionisation energy decreases as Ei = 149, 144, 143, 141; (ii) the electron affinity varies as Eeg = − 27.3, − 26.7, − 26.1 − 25.9; (iii) the molecular electronegativity decreases as χ = 88, 85, 84, 83; (iv) the chemical hardness decreases as η = 61, 59, 58, 57; (v) the electrophilicity index decreases as ω = 63, 62, 62, 60. The quadrupole moment varies as Q = − 23, − 27, − 34 and − 37 ea02; however, the dipole moment varies non-systematically as p = 1.89, 2.12, 1.77 and 1.98 D owing to specific separations of barycenters of the positive–negative charges.

The same set of molecular properties has been calculated for the zwitterionic (Z) forms of the amino acids, after full geometry optimization and complete vibrational analysis for the neutral and ionised species using B3LYP/def2-TZVPD and MP2/def2-TZVPD methods (Table 2). One can see: (i) the values of the adiabatic electron affinities are much lower (in absolute values) relative to canonical forms; (ii) consequently the electrophilicity index is much lower; (iii) the reduction potential drops substantially; (iv) according to the chemical hardness, the zwitterionic forms are more hard.

Table 2 Comparison of calculated and experimental dataa,b

DLPNO-CCSD(T) method within the series 7Z through 10Z gave the following systematic trends: (i) the ionisation energy decreases as Ei = 156, 148, 144, 142; (ii) the electron affinity varies as Eeg = − 18.1, − 18.6, − 19.9, − 20.1; (iii) the molecular electronegativity decreases as χ = 87, 83, 82, 81; (iv) the chemical hardness decreases as η = 69, 65, 62, 61; (v) the electrophilicity index varies as ω = 55, 53, 54, 54 kcal mol−1; (vi) the dipole moment increases as p = 14, 22, 28 and 34 D; and (vii) the quadrupole moment varies as Q = − 21, − 24, − 28, − 32 ea02.

The absolute reduction potential correlates with the electrophilicity index ω: such a correlation is displayed in Fig. 3. Note that ω is a property derived from the electronic energy of L0, L+ and L species whereas the reduction potential is a thermodynamic quantity derived from the reaction Gibbs energy between L and L0.

Fig. 3
figure 3

A correlation of the absolute reduction potential with the electrophilicity index. Yellow circles–full data set, solid points–MP2 data. Regression line: Eredø [V] = − 1.05 [V] + 0.038 ω [kcal mol−1]; regression coefficient r2 = 0.94

A comparison of the calculated data in solution with experimental ones is rather problematic since the available data refer mostly to the proteinogenic α-amino acids in vacuo. The same holds for the ab initio calculations usually done without a solvent effect. A collection of calculated and experimental data is presented in Table 2. This table contains data about ionisation energies as obtained by the Ultra-Violet Photoelectron Spectroscopy (UPS) in the gaseous phase. The vertical ionisation energies were also calculated by various methods in vacuo for testing their reliability (energy of the neutral species refers to the optimised geometry, for molecular cation to the frozen geometry of neutral molecule). This process is assigned as an ionisation of the nitrogen lone pair (abbr. n).

The vertical ionisation energy in the gaseous phase of glycine, Ei(v) = 230 kcal mol−1, is well reproduced by all methods: M1–224, M2–237 and M3–235 kcal mol−1. For β-alanine, Ei(v) = 223 kcal mol−1 is compared by M1–216, M2–229 and M3–227 kcal mol−1; for GABA, Ei(v) = 221 kcal mol−1 is reproduced by M1–210 and M3–220 kcal mol−1; for DAVA, Ei(v) = 217 kcal mol−1 is compared by M1–207 and M3–221 kcal mol−1. It can be concluded that B3LYP method offers vertical ionisation energies that are a bit lower (− 5%) relative to the gas-phase UPS data; the DLPNO-CCSD(T) method, on the contrary a bit higher (+ 2%). Their trends are correctly reproduced by all applied methods.

The dipole moment measured in an aqueous solution with dominating zwitterionic form does not reflect a single molecular unit because of existence various rotamers in a complex equilibrium. The agreement with M1, M2 and M3 methods in not perfect, however, it is acceptable.

The calculated molecular descriptors served as a worksheet for advanced statistical methods (Statgraphics 1982). The Cluster Analysis (CA) is a kind of the classification statistical methods that forms similar objects or observables into groups according to their “distance”. A squared Euclidean distance metric has been applied along with the Wards method. Results are displayed in Fig. 4 for objects (molecules) and observables (molecular properties). It is seen that 12 molecules can be classified into three groups. The bulk properties form own group: polarizability (a), solvated surface (S) and volume (V), zero-point energy (Z) and total entropic term (ST); high degree of similarity (low distance) show electrophilicity index (o) and the reduction potential (Er).

Fig. 4
figure 4

Results of the Cluster Analysis (top and centre) and the Principal Component Analysis (bottom) using dataset from Table 2

The Principal Component Analysis (PCA) belongs to the multivariate statistical methods making “principal components” formed of linear combination of variables that contain the greatest variance. PCA biplot is drawn in Fig. 4 (bottom) showing which molecular properties correlate/anti-correlate/non-correlate. It matches the classification by CA: variables located in adjacent rays mutually correlate, e.g. {O, Er}, {(al, Z, S, V, ST), (Eo, Ho)}.

The calculated pair correlation coefficients are listed in Table 3; high values ρ(A, B) > 0.95 point to significant correlations. Amongst them, the absolute oxidation potential (calculated from the reaction Gibbs energy) correlates with the adiabatic ionisation energy (calculated from the electronic energies only). Analogously, the absolute reduction potential correlates with the adiabatic electron affinity (this has nothing to do with LUMO) and also with the electrophilicity index (Fig. 2). A matrix of scatter points referring to the molecular descriptors of amino acids is deposited in Supplementary Information.

Table 3 Pair correlation coefficients amongst molecular descriptors

The calculated vibrational frequencies are compared in Table 4; they form the vibrational spectra displayed in Figure S2. A comparison of scaled MP2 with experimental data taken in vacuo is possible; an agreement is satisfactory. This data can be compared with those calculated by other versions of DFT (Barone et al. 2013).

Table 4 Calculated (unscaled) harmonic vibrational frequencies [cm−1] for glycine in the canonical form 1 and zwitterionic form Z by MP2 and B3LYP methodsa

Conclusions

In conclusion, three quantum chemical methods were used to calculate the electronic properties of linear amino acids (glycine, β-alanine, GABA and DAVA) in water as a solvent: B3LYP/def2-TZVPD (a hybrid variant of the Density Functional Theory; method M1), HF-MP2/def2-TZVPD (the 2nd-order many-body perturbation theory to the correlation energy; method M2), and DLPNO-CCSD(T)/aug-cc-pVTZ (a variant of the coupled-cluster method with excitations of singlets, doublets and triplets included in the correlation energy; method M3). Each species was considered in the canonical amino acid (A) form as well as in the zwitterionic (Z) form as an electroneutral molecule L0, its molecular cation L+ and molecular anion L. A total of 60 systems were examined. Full geometry optimization followed by complete vibrational analysis was performed with M1 for all four amino acids, and by M2 for glycine and β-alanine. M3 calculations were performed in geometries optimised by M1 or M2 methods.

The present work has certain advantages compared to the fragmentary quantum chemical studies found in literature.

  1. 1.

    The selected objects (linear ω-amino acids) are members of a series along which the carboxyl group is monotonously separated from the terminal amino group. It would be beneficial if this series will continue with branched-chain α-amino acids, sulphur containing amino acids, etc., to accumulate extensive data set on the molecular properties of different types of amino acids generated by a consistent methodology.

  2. 2.

    All studies were done in water as a solvent, where experimental data is very lacking.

  3. 3.

    Before the conclusions, the ability to reproduce experimental data in vacuum was tested using the B3LYP method, MP2 method, and the DLPNO-CCSD(T) method, taking into account vertical and adiabatic ionisation energies from UPS spectra, and vibrational spectra.

  4. 4.

    The calculated properties include two classes: (i) energetic and electrical properties of neutral species, such as dipole moment p (measure of polarity, or charge separation), quadrupole moment Q (measure of eccentricity of the charge cloud), dipole polarizability α (the ability of the electron cloud to be distorted in an electric field), solvated surface S and solvated volume V, zero-point vibrational energy Ezpe, and total entropic term Sø·Tø; (ii) properties that characterise the redox processes forming ionised forms, such as adiabatic ionisation energy Ei (for electron withdrawal), electron affinity Eeg (for electron attachment), chemical hardness η (electronic force constant, resistance to change in electron density), molecular electronegativity χ (electronic gradient, driving force of electron transfer), electrophilicity index ω (electrophilic power), absolute oxidation Eoxø and reduction Eredø potentials (thermodynamic driving forces for oxidation/reduction).

  5. 5.

    According to B3LYP data for the canonical forms of amino acids, the set of bulk properties shows a monotonic increase with the (molar) mass of the molecule: |Q| (23–37 ea02), α (57–108 a03), S (401–629 a02), V (623–1040 a03), Ezpe (50–103 kcal mol−1) and Sø·Tø (22–29 kcal mol−1). The dipole moment varies unsystematically because it depends on the spatial separation of the opposite charges, p (1.75, 1.93, 1.70 and 1.82 debye). For zwitterionic forms of amino acids, these properties are very similar: |Q| (21–32 ea02), α (59–111 a03), S (393–627 a02), V (628–1040 a03), Ezpe (52–105 kcal mol−1) and Sø·Tø (22–28 kcal mol−1). However, the dipole moment increases dramatically: p (14–34 debye).

  6. 6.

    For canonical forms, the electronic properties are independent of molar mass and they can vary either systematically, such as Ei (147–136 kcal mol−1) and Eoxø (− 6.3 to − 5.8 V), or less systematically, such as Eeg (− 25 to − 30 kcal mol−1), Eredø (1.21–1.39 V), χ (86–82 kcal mol−1), η (61–54 kcal mol−1) and ω (61–64 kcal mol−1). For zwitterionic forms: Ei (150–135 kcal mol−1) and Eoxø (− 6.4 to − 5.7 V), Eeg (− 19 to − 11 kcal mol−1), Eredø (1.11–0.52 V), χ (84–72 kcal mol−1), η (66–61 kcal mol−1) and ω (53–42 kcal mol−1). There is a significant difference between the first pair (glycine, β-alanine) and the second pair (GABA, DAVA).

  7. 7.

    Calculations confirm that in water as a solvent: (i) glycine zwitterion is the most stable forms by 4–2 kcal mol−1 relative to the canonical form, whilst for GABA and DAVA, it is less stable by 0.5–2 kcal mol−1; (ii) the dipole moment is extremely high, 14–34 debye; (iii) the absolute reduction potential is moderate, 1.5 V for glycine and β-alanine (stronger oxidising agents), and 0.5 V for GABA and DAVA (weaker oxidising agents); (iv) the absolute reduction potential correlates with the electrophilicity index.

  8. 8.

    With a representative set of objects and their properties, advanced statistical methods (Cluster analysis, Principal component analysis) were used to unhide latent correlations between molecular properties.

Based on similarity of the results obtained with B3LYP and the more exact DLPNO-CCSD(T), it can be concluded that B3LYP cannot currently be discriminated as a method of low confidence, at least for a set of linear amino acids. Its advantage lies in rapid geometry optimization and fast vibrational analysis allowing to evaluate the absolute redox potentials on the basis of the reaction Gibbs energies.