Introduction

There are different generations of cephalosporin ß-lactam antibiotics which show a broad activity against Gram-positive and Gram-negative bacteria as well as different kinds of resistance to hydrolysis by ß-lactamase [1,2,3]. The cephalosporins have similar central structures, but a wide range of additional functional groups can vary their properties due to the combination of the “two-side” modifications. In this work, three cephalosporins were studied: both first-generation cefapirin (CEFA) and third-generation ceftiofur (CEF) have one side which is as same as in the third-generation cefotaxime (CEFO), see Fig. 1. CEFA and CEF are used as antibiotics in veterinary medicine, while CEFO has a broader range of usage, especially in the agricultural industry regarding its more complex structure and functionalities [4,5,6]. Their neutral form is known as a zwitter ion since the proton can be stabilized by the heterocyclic rings [7, 8].

Fig. 1
figure 1

The zwitter ion form of the cephalosporin antibiotics which were studied

The ionic forms depending on the pH can be different; therefore, it is necessary to obtain their pKa values which show a wide range dependent on the method that was used for determining them [9,10,11,12]. Some predictors like Marvin developed by ChemAxon or ACD/Percepta by Advanced Chemistry Development packages result in accurate values based on a wide range of molecular structures using empirical statistic algorithms derived from experimental pKa values [13, 14].

One of the possible ways to determine pKa values is the empirical pKa calculation method [15, 16], where from the reaction free energies of the acid/base dissociations (Eqs. 1, 2, 3, and 4), the dissociation constant can be calculated (Eq. 5) by an additional experimental term (− 1.74) that corrects the standard state of liquid water of 55.5 mol L−1 (− log55.5) where the * denotes the standard state of 1 mol L−1.

$$\text{HA}(\text{aq, 1M})+ \text{H}_{2}\text{O}\;(\text{1, 1M})\rightarrow \text{H}_{3}\text{O}^{+}(\text{aq, 1M})+\text{A}^{-} (\text{aq, 1M})$$
(1)
$$\Delta\text{G}^{*}_{\;\text{Diss}}= \text{G}(\text{A}^{-})-\text{G(HA)}+\text{G}(\text{H}_{3}\text{O}^{+})-\text{G}(\text{H}_{2}\text{O})$$
(2)
$$\text{HB}^{+}(\text{aq, 1M})+\text{H}_{2}\text{O}(\text{1, 1M})\rightarrow \text{H}_{3} \text{O}^{+} (\text{aq, 1M})+\text{B}(\text{aq, 1M})$$
(3)
$$\Delta\text{G}^{*}_{\;\text{Diss}}= \text{G}(\text{B})-\text{G}(\text{HB}^{+})+\text{G}(\text{H}_{3}\text{O}^{+})-\text{G}(\text{H}_{2}\text{O})$$
(4)
$$\text{pK}_{\text{a}}=\Delta\text{G}^{*}_{\;\text{Diss}}/(\text{RT ln(10)})-1.74$$
(5)

This study is aimed at calculating the pKa values of three cephalosporin antibiotics (CE), cefapirin (CEFA), cefotaxime (CEFO), and ceftiofur (CEF) by cellmetry method where Eq. 5 including the experimental term was used to calculate the pK values [17,18,19,20]. Cellmetry is a newly developed method whereby choosing relevant cell sizes, dissociation, and stability constants can be calculated by plane wave DFT method [21, 22].

Computational method

Plane-wave density functional theory calculations were carried out using the Quantum Espresso software packages [23,24,25]. The geometry optimizations were performed with the PBE (Perdew–Burke-Ernzerhof) functional [26, 27], and the Ultrasoft RRKJ (Rabe-Rappe-Kaxiras-Joanoupoulos algorithm) pseudopotential (PP) pseudopotential [https://pseudopotentials.quantum-espresso.org/legacy_tables]. If there was no available _psl.0.1 version of the rrkjus PP, then other _psl.0.2 or _psl.1.0.0 versions were used instead. The Bravais lattice index was 1 (simple or primitive cubic cell), where the kx, ky, and kz points of the Brillouin zone were 1 1 1. The kinetic energy cutoff for wavefunctions (ecutwfc) was 30.0 Rydberg, and the kinetic energy cutoff for charge density and potential for norm-conserving pseudopotential (ecutrho) was 300 Ry. The implicit ENVIRON module described the solvent effect that uses a revised self-consistent continuum solvation model (SCCS) [28,29,30]. The absolute energies are in Rydberg (1 Rydberg = 0.5; Hartree = 1312.75 kJ mol−1), and the distances are in Bohr (1 Bohr = 0.5292 Ånsgström). Gabedit and Avogadro programs were used to build molecules. The calculated F = E – TS energies are noted in the article as E or energy. The structures of the molecules are represented by the Molekel program.

Results

As a reference, two clusters were defined: the water and hydronium ion together with 3 explicit water molecules optimized at cell size 20 Bohr (Fig. 2). The energies of the acid–base reactions (Eqs. 6 and 7) of the cephalosporin antibiotics (CE) were calculated, where HCE+ is the protonated form of the species, CEzw is the neutral zwitter ion (zw) form, and CE is the deprotonated form of the molecules, and the pK values were derived from these energy values according to Eq. 5. For all molecules, the zwitter ion form had significantly lower energy than the non-zw form by ~ 35–45 kJ mol−1 in the investigated cell range which would cause ~ 6–8 units pK shift. It also means the minor species had a very small population; therefore, the macroscopic pK only depends on the one derived from zw form.

Fig. 2
figure 2

The structure of the reference clusters optimized at cell size 20 Bohr

The size of the three antibiotics is quite different: the distance between the hydrogen atoms which are the furthest from each other is 37.8 Bohr (20 Å) for ceftiofur (CEF), 35 Bohr (18.5 Å) for cefapirin (CEFA), and 33.1 Bohr (17.5 Å) for cefotaxime (CEFO). It was found that the most valuable data were derived in the call range 31–36 Bohr for all species which is in general smaller than the longest distance of the furthest atoms, see above, however, it is worth mentioning that the inside the cell cube the molecule can rotate (turn); therefore, the body (solid) diagonal of the cube is more relevant than its side length. For instance, if cell size was 30 Bohr, then the face diagonal would be 42.4 Bohr as well as the body diagonal would be 52 Bohr. It means that the interstitial region was not too large compared to the atomic core region where the molecule was located.

$$\text{HCE}^{+}+[4 \;\text{H}_{2}\text{O}]=\text{CE}^\text{zw}+[\text{H}_{3}\text{O}^{+}+3 \;\text{H}_{2}\text{O}]$$
(6)
$$\text{CE}^{\text{zw}}+[4\;\text{H}_{2}\text{O}]=\text{CE}^{-}+[\text{H}_{3}\text{O}^{+}+3 \text{H}_{2}\text{O}]$$
(7)

In some cases, the molecules were optimized at certain cell sizes (CS), i.e., in a certain cell range but the cell ranges were also scanned by SCF single-point calculations and the derived pK values were compared to the ones derived from the energies of the optimized geometries. The geometry optimizations were performed using the implicit solvent model.

First, the cefapirin was studied and discussed. Due to the different values obtained by experimental measurements (potentiometry, capillary zone electrophoresis, and UV–Vis spectroscopy), there was a relatively wide range which could be a result of accurate pK value [2, 3, 7, 19,20,21,22]. For pK2 (zwitter ion – deprotonated acid–base pair), there were full geometry optimizations performed at cell size 31, 33–35 Bohr (CS31, CS33–CS35); then, single point calculations were made in a wide cell range using the structures optimized at CS33 and CS34, and the values were compared to each other. We found that at these cell sizes, the values showed accurate values that were smaller than the length of the molecule but significantly shorter than the body diagonal length of the cube. The trendlines derived from single point calculations were almost the same in the cell range 32–42 Bohr, and there was a crossing point of the trendlines around cell size 33 Bohr, but < 32 Bohr, the pK values were different, and this difference was increasing (see Fig. 3). A fine scan was also made where the fractured cell sizes had Δ = 0.2 Bohr, and some characteristic breakpoints were found around 33 and 34 Bohr (see Fig. 3). In the previous study, the calculated pK values of the chosen systems belonged to a breakpoint or a middle point of a characteristic motif of the trendlines. Based on these results, similar calculations were made to determine the pK1 value, and accurate values were found again at CS33 and CS34; however, the trendlines run together at small cell sizes, and > 33 Bohr, they showed larger differences.

Fig. 3
figure 3

Trendlines of the equilibria of the CEFA species. The shaded part shows the region of the experimental data. The blue trendlines represent the pK values derived from SCF single point calculations of the optimized geometry at cell size 33 Bohr, while the red points were derived from the optimized structure at 34 Bohr

For the larger (longer) ceftiofur (CEF), the trendlines showed some similarities but differences as well (Fig. S1). One would predict that for a larger species, somewhat larger cell size would be reliable according to the differences. If we believe that both pK1 and pK2 can be determined at the same call size, then some inconsistencies help us to decide which cell size can be relevant. At CS33 and CS34, the calculated pK values were almost the same, while at CS35, the pK2 was smaller than pK1 which could not be correct; however, the calculated pK1 at CS33 was 1 unit larger than the experimental value. Furthermore, for CEFA, it could be seen that there was a shift, namely, the calc. pK1 at CS34 was a middle (average) value of the exp. pK1 range, while the calc. pK2 was slightly larger than the lowest exp. pK2 value. According to this phenomenon, the pK1 can be more accurately estimated at a (slightly) larger cell size.

At this point, the breakpoints could help us to decide, while in the case of CEFA, the characteristic breakpoints appeared for pK1 at CS34.2 where the lower exp. values were found, while for pK2, they belonged to the upper exp. values at CS33.2. For pK1 of CEF, a fine scan was made where the characteristic breakpoint appeared at CS35 (pK = 3.12), the values decreased significantly in the range 35.2–35.6, and the exp. value was in the middle of this section at CS 35.4. Since the breakpoint appeared at CS34 for pK2 (pK = 3.18), the correct value could be between CS33 and CS34. These values were similar to the ones predicted by Marvin (3.28) and ACD/Percepta (3.60) predictors.

For the smallest (shortest) cefotaxime, without performing calculations, one could assume that for pK1 around CS33, while pK2 around CS32 appears, and these assumptions were confirmed by the calculations. At CS33, 2.24 was calculated for pK1, and at CS32, 2.97 was calculated for pK2. A combination of half-unit smaller or larger CS showed also acceptable values: for both pK1 and pK2, a slightly underestimated value was calculated at CS 31.5 and a slightly overestimated one at CS 32.5, see Table 1. Interestingly, the curves of the fine scans (Δ = 0.2) made from the optimized structures at 31.5 and 32 Bohr showed a cross point at pK = 3.15 for the equilibrium CEFOzw – CEFO which was in the range of the experimental data (see Fig. S2). The HCEFO+ – CEFOzw system showed very different curves derived from the energies of the optimized structures and from the SCF energies of the optimized structures at 32 and 33 Bohr. Obviously, in the pK of the experimental values, breakpoints and a decreasing section of the curves occurred (Fig. S2).

Table 1 The experimental and the calculated acid dissociation constants of the cephalosporins

Correction factors were also defined as the length of the molecule/the cell size resulting in accurate values where no fractional CS was used except one case. The derived correction values were 0.94 for CEF (pK1), 0.97 for CEFA (pK1), 1.0 for CEFO (pK1), 0.90 for CEF (pK2), 0.94 for CEFA (pK2), and 0.97 for CEFO (pK2), see in Table 1. It can be seen that there was an almost linear trend for the species, and the larger the molecule was, the smaller the correction factor was.

Conclusion

The acid dissociation constants of three cephalosporin antibiotics were calculated by the cellmetry methodology. Correlations were found between the size of the molecules and the cell size of the cube which was used for the geometry optimizations, and correction factors were defined which showed a linear trend. It was found that for the major neutral species, the zwitter ion form, and the pK2, the equilibrium between neutral and deprotonated species needed ~ 1 unit smaller cell sizes than the equilibrium of protonated–neutral species (pK1). Cellmetry methodology was useful for larger molecules (~ 45–50 atoms) to calculate the acid dissociation constants.