Introduction

Tandem mass spectrometry is one of the essential techniques for biomolecular structure determination by means of analysis of the molecular fragmentation pattern. Established activation methods can be classified as either collisional, electronic, or photonic. In collision-based methods, excitation of the molecules is achieved in collisions with atoms (collision-induced dissociation, CID [1]), surfaces (surface-induced dissociation, SID [2]), or ions [3,4,5]. Electron-based methods employ the attachment of electrons (e.g., electron capture-induced dissociation, ECD [6]) or on their detachment (e.g., charge transfer dissociation, CTD [7]). The third class of activation methods is photon-based and includes established techniques such as infrared multiphoton dissociation (IRMPD [8]) or ultraviolet photodissociation (UVPD [9]). Over the last few years, the potential of more energetic vacuum ultraviolet photons for mass spectrometry was explored, by interfacing tandem mass spectrometers with synchrotron beamlines. In pioneering experiments, for the protonated peptide leucine enkephalin, the resulting photofragmentation spectra were found to be dominated by immonium ions and small sequence ions [10], whereas for the much larger multiply protonated protein cytochrome c, non-dissociative photoionization was found to dominate [11].

It was a straightforward next step to increase the photon energy into the soft X-ray regime. Soft X-ray spectroscopy is based on excitation or ionization of inner-shell electrons and therefore element specific and sensitive to the local electronic structure. For amino acids, peptides, and proteins, soft X-ray spectroscopy has mostly been performed using thin films [12,13,14]. Soft X-ray spectra obtained for different proteins were found to share a common structure with some characteristic differences, most of which are indicative of the particular amino acid composition of the protein [14].

First gas-phase experiments on the C K-edge of protonated leucine enkephalin [15] and on the C, N, and O K-edges of multiply protonated cytochrome c both revealed similar spectral features as observed in the condensed phase, such as for instance the main resonance being due to 1s-π* excitations in the amide C=O bonds. However, the mass spectrometric approach adds the dimension of fragmentation channel to the soft X-ray spectroscopic data. For leucine enkephalin, it was for example shown that soft X-ray absorption in the sidechains of aromatic amino acids leads to formation of fewer small immonium ion fragments than does absorption in the amide groups [15]. Milosavljević and co-workers found that 1s ionization energies of multiply protonated ubiquitin do not systematically increase with protein protonation state. Instead, over a wide range of protonation states, protein unfolding due to Coulomb repulsion was found to compensate the increase in protonation state, resulting in a near-constant 1s ionization energy [16]. Very recently, in two complementary studies, the interplay between protein size and photoexcitation energy was investigated. Egorov et al. [17, 18] systematically investigated soft X-ray- and EUV-induced protein fragmentation in the 0.5–12-kDa range and found fragmentation yields and patterns to largely scale with protein temperature with large proteins being mainly subject to non-dissociative ionization. On the other hand, when multiple 90-eV photons are absorbed simultaneously by a large protonated protein in a free-electron laser pulse, localized and non-ergodic fragmentation channels seem to dominate and the system behaves like an ensemble of small isolated peptides/amino acids [19].

The focus of this article is an in-depth X-ray spectrometry study on the small multiply protonated protein melittin. Melittin has an average mass of 2846.5 Da and with this intermediate size, it is expected to show a very rich fragmentation pattern, featuring immonium ions, sequence ions, and non-dissociative ionization.

For protonation states 2 and 3, melittin is known to exhibit a primarily helical gas-phase secondary structure, which is lost upon increasing protonation to 4 [20]. This renders melittin an ideal system to study the interplay of structure and fragmentation upon soft X-ray absorption. In a previous work on soft X-ray spectroscopy of cryogenic (T ~ 10 K) multiply protonated melittin at the C K-edge, we have recently found a correlation between melittin structure and electron impact ionization from within [21], i.e., inelastic scattering of photoelectrons along their trajectories through the photoabsorbing molecule. For low protonation, i.e., a compact structure, resonant \( 1\mathrm{s}-{\uppi}_{\mathrm{C}=\mathrm{O}}^{\ast } \) excitation in an amide bond is significantly more likely to be followed by two-electron emission than for high protonation and elongated conformations. This is due to the differences in the integral cross section for electron impact ionization “from within,” i.e., ionization by the Auger electron emitted after photoexcitation. Here, we will further investigate this effect for protonated melittin close to room temperature and at all three relevant K-edges of carbon, nitrogen, and oxygen. Furthermore, it is the goal of the article to investigate soft X-ray-induced protein fragmentation in greater detail in order to assess the potential of the technique for analytical mass spectrometry.

Experimental

All experiments were performed at the BESSY II synchrotron (Helmholtz-Zentrum Berlin für Materialien und Energie, Germany). Two similar experimental setups were used, both of which are based on the same concept: The gas-phase protonated protein of interest is first mass selected and subsequently trapped in a radiofrequency (RF) ion trap. The trapped ions are then exposed to a beam of soft X-ray photons. Eventually, the photoproducts are extracted into a time-of-flight mass spectrometer. The details, however, are different, allowing for complementary experiments.

The Groningen Tandem Mass Spectrometer

This apparatus has been described in detail before [10, 18]. Briefly, the protonated peptides are brought into the gas-phase by means of a home-built high-fluence electrospray ionization (ESI) source, phase space compressed by an RF ion funnel, collected in an RF octopole ion trap/guide, mass selected by an RF quadrupole mass filter, and collected in a 3D ion trap, of classical Paul trap geometry (Jordan TOF Products, CA). The ions are pre-trapped in the octopole for several 100 ms and bunched into the ion trap during less than 50 ms. He buffer gas at T=300 K is injected into the Paul trap only during the 50-ms filling period. For the experiments presented here, the Paul trap was interfaced with the beamline to expose the trap content to monochromatic soft X-ray photons at the C K-edge (E=280–300 eV, 350 meV bandwidth, typically 2×1013 photons/s). Typical photoexposure times of several 100 ms lead to photoabsorption in 5–10% of the trapped protonated proteins. This way, more than 90% of the photoproduct ions are due to single-photon absorption events. The photoproducts are then extracted into a linear time-of-flight (TOF) mass spectrometer to record a mass spectrum. Mass spectra are recorded in (photons on–photons on–photons off) cycles, to record photoproduct and precursor spectra simultaneously and minimize the influence of fluctuations in ESI signal intensity (for a stable ESI signal, a single “photons off” spectrum at the beginning of an acquisition would be sufficient). The eventual net photoabsorption effect is determined by subtraction of the precursor spectra from the photoproduct spectra, yielding a negative peak at m/z of the precursor and positive peaks for the photoproducts. Key features of the experiment are quantitatively reliable mass spectra over the entire relevant m/z range as well as absence of He buffer gas during and after the photoexposure. The latter implies that the photoproducts are not cooled.

All mass spectra presented in this article have been obtained with the Groningen tandem mass spectrometer.

The NanoClusterTrap at HZB

This apparatus is permanently installed at the UE52-PGM beamline. It is a mass spectrometer custom-built to record partial ion yield spectra of trapped cluster and molecular ions [22, 23]. We have combined the NanoClusterTrap with the Groningen high-fluence ESI source. RF phase space compression, ion guiding, and mass selection are very similar to the Groningen apparatus. The ions then pass an electrostatic quadrupole deflector and are continuously transferred into a linear RF quadrupole ion trap for storage. The trap content is continuously exposed to the collinear soft X-ray beam, whose intensity is monitored by means of a GaAsP photodiode. The trap is filled with He buffer gas (~ 1×10−4 mbar) to facilitate collisional cooling of the photoproducts, with typical temperature equilibration times of less than 1 μs. The trap can be operated down to T=3 K ion trap wall temperature, but is operated at T ~ 200 K for the experiments presented here. Bunches of ions are extracted into a reflectron-type TOF mass spectrometer with a repetition rate of the order of 100 Hz. The timing scheme of the experiment only allows for acquisition of a mass range of several hundred daltons at a time. The nominal mass resolution of the instrument is 3000 Da, but during the experiments reported here, significantly lower resolutions are employed. The soft X-ray energy is varied with a step size of 100 meV (C and N K-edge) and 250 meV (O K-edge). The UE52-PGM beamline delivers left or right circularly polarized photons. In general, pairs of mass spectra are recorded for the two polarizations. No polarization effect is observed for melittin within the noise level of the soft X-ray spectra recorded. X-ray absorption spectra are obtained by plotting yields of photoproducts against the photon energy. The data are normalized with respect to the photon flux, measured simultaneously with every absorption spectrum with a GaAsP photodiode at the end of the apparatus. Due to carbon-containing contaminations on the optical elements along the beamline, there is an increased uncertainty of the normalization in the region of the carbon K-edge. The photon energy was calibrated using the nitrogen K-edge of an N2 gas target. The calibration had an error bar of ± 125 meV.

Key features of the experiments are the large trap capacity, allowing for efficient recording of soft X-ray absorption spectra and the presence of a buffer gas, quenching slow fragmentation channels. No full range and quantitative mass spectra can be obtained.

All soft X-ray spectra presented in this article have been obtained with the NanoClusterTrap.

Electrospray Ionization

Melittin from honey bee venom was purchased from Sigma-Aldrich at a > 85% purity. For the experiments with the Groningen apparatus, a 40-μM melittin solution in HPLC-grade methanol is used; 0.005% formic acid is added to facilitate protonation. For the NanoClusterTrap experiments, a 30-μM melittin solution in 44% HPLC-grade water, 55% HPLC-grade methanol, and 1% formic acid is employed. The ESI solution is then pumped through the ESI-emitting needle using a syringe pump with a flow rate of typically less than 10 μl/min. After phase space compression in the RF ion funnel, ion currents typically exceed 1 nA and more than 100 pA of mass-selected [melittin+3H]3+ ions can be generated.

Results and Discussion

Soft X-ray Spectra for Non-dissociative Single and Double Ionization

From a soft X-ray spectroscopy point of view, it is most straightforward to look into the data for non-dissociative single and double photoionization, i.e., the partial ion yields of [melittin+qH](q+1)+ and [melittin+qH](q+2)+ as a function of photon energy. Figure 1 shows the respective spectra for q=2–4 at the K-edges of carbon, nitrogen, and oxygen. At all the three edges, the non-dissociative single ionization (NDSI) spectra (Figure 1, top row) exhibit a dominating peak that is due to excitations from the respective 1s orbital to the π* orbital localized at the C=ONH amide group of the protein.

Figure 1
figure 1

Partial ion yields of [melittin+qH](q+1)+ (non-dissociative single ionization, top row) and [melittin+qH](q+2)+ (non-dissociative double ionization, bottom row) for [melittin+qH]q+, with q=2–4. The three photon energy intervals correspond to the K-edges of carbon (left column), nitrogen (middle column), and oxygen (right column). The single ionization spectra were normalized to the dominating 1s-π* peak and the double ionization spectra were normalized to the maximum of the photoionization continuum. Offsets of 30% and 60% were added to the q=3 and q=4 spectra for visualization

The C \( 1\mathrm{s}-{\uppi}_{\mathrm{C}=\mathrm{O}}^{\ast } \) transition is observed at 288.14 eV, at a FWHM of about 0.7 eV. Soft X-ray bandwidths of 100 meV (q=2) and 50 meV (q=3, 4) were used at the C K-edge. The FWHM is thus mostly due to the variation of the C \( 1\mathrm{s}-{\uppi}_{\mathrm{CONH}}^{\ast } \) transition energies in the different amino acid residues of melittin. The smaller peak at 285.1 eV results from C 1s-π* transitions in the tryptophan (W) indole sidechain, which is the only aromatic amino acid contained in melittin. The relative intensity of this transition increases strongly with q. We have recently shown [21] that this increase is unlikely to be a consequence of the dramatic change in molecular structure upon increasing protonation [20]. It rather reflects a general decrease of protein stability with increasing charge state. The non-dissociative channels discussed here are clearly related to photoabsorption processes that neither deposit sufficient energy for swift thermal fragmentation nor induce fast localized fragmentation. For the most stable protonation case (q=2), NDSI appears not very site specific given the rather broad structureless yield distribution. For the intrinsically less stable cases (q=3, 4), absorption in the tryptophan indole group is relatively increased, as 1s-π* transitions in aromatic sidechains are known to be generally less destructive to protein/peptide integrity [15].

The shoulder at 287.3 eV and the peak at 289.4 eV stem from the C 1s-σ* (CH) and C 1s-σ* (CC) transitions in various sidechains, respectively. All energies are in good agreement with literature data for gas-phase protonated cytochrome c and neutral amino acids (see Table 1). The broad structure between roughly 290 and 294 eV is due to a superposition of various transitions to σ-orbitals, shape resonances, and Rydberg states. With increasing protonation state q, this structure gets suppressed, in particular on the high-energy side.

Table 1 Peak assignments and energies for the C, N, and O K-edges from this work, compared to data for gas-phase [cytochrome c+qH]q+ and for several neutral gas-phase amino acids. The low-energy peak assigned to tryptophan is a superposition of transitions in the indole sidechain of which only the lowest energy transition can be clearly identified and is reported

The N \( 1\mathrm{s}-{\uppi}_{\mathrm{CONH}}^{\ast } \) transition is observed at 401.5 eV. The N \( 1\mathrm{s}-{\uppi}_{\mathrm{C}-\mathrm{N}}^{\ast } \) peak at 402.6 eV is the only additional transition that can be clearly identified. Both energies are close to literature data for protonated cytochrome c (see Table 1). As for C, the broad feature at high energies, starting at 403.5 eV is due to a superposition of various transitions to σ-orbitals, shape resonances, and Rydberg states. A similar suppression with q as for C is observed. Comparing the C, N, and O K-edge spectra, the broad structure is relatively strongest for N and its peak intensity reaches about 70% of the N \( 1\mathrm{s}-{\uppi}_{\mathrm{CONH}}^{\ast } \) peak intensity for q=2.

The O \( 1\mathrm{s}-{\uppi}_{\mathrm{C}=\mathrm{O}}^{\ast } \) transition is observed at 532.4 eV which is in good agreement with the values obtained in several studies on gas-phase amino acids and substantially higher than the 531.4 eV reported for cytochrome c (see Table 1). No other peaks are observed. Again, a broad feature is observed which starts at 534 eV and peaks around 538 eV. The intensity of this broad feature is weaker than for N and C and it only exhibits a slight shift to smaller photon energies with increasing q.

The non-dissociative double ionization (NDDI) spectra (Figure 1, bottom row) have an entirely different shape. The main feature of the NDDI spectra, the broad structure at the highest energies is due to direct 1s ionization followed by Auger decay. For carbon, nitrogen, and oxygen, this manifests as an almost linear increase from the 1s excitation background, followed by a broad ionization continuum. The 1s ionization energy can be determined from the onset of the linear section [16]. All 1s ionization energies exhibit a pronounced increase with protonation state (see Table 2). 1s ionization energies of many neutral amino acids have been determined by X-ray photoemission spectroscopy (XPS) in the Prince group and the lowest values are systematically lower than the ionization energies presented here [24]. For the typical example of alanine, the values are added in Table 1.

Table 2 [Melittin+qH]q+ detachment energies for the 1s levels of carbon, nitrogen, and oxygen

For ubiquitin in protonation states ranging from 4 to 11, Milosavljević et al. have recently linked the increase of C 1s ionization energy to the protein tertiary structure [16]. For q=5–8, no clear increase in 1s ionization energy (294.3 eV) was observed which was explained by unfolding of the protein in this range of protonation states that strongly influences the effective Coulomb field of the molecule. In contrast, the 3D structure of gas-phase protonated melittin is mostly a secondary structure. For this system, hydrogen/deuterium exchange and ion mobility studies provide evidence for strong structural relaxation in the range of q=2–4, with melittin being predominantly helical at q=2 and relaxed at q=4 [20, 28]. Our data shows a systematic increase in C, N, and O 1s ionization energies for q=2–4, which accompanies the loss in secondary structure.

For photon energies below the ionization threshold, for all the three K-edges, the dominant NDSI peak (\( 1\mathrm{s}-{\uppi}_{\mathrm{C}=\mathrm{O}}^{\ast } \)) is observed but at reduced relative intensity, as are the other features of the NDSI spectra. This is in line with the notion that the most probable process following 1s excitation is a resonant Auger decay, in which a valence electron fills the 1s vacancy. The simultaneous emission of an Auger electron from the same site leads to a singly ionized molecule. Two-electron (or more) Auger processes typically occur with much larger probability [29]. These processes lead to double ionization upon resonant 1s excitation, as visible in the bottom panels of Figure 1. Two-electron Auger processes do not, however, explain the dependence of the latter channel on protein protonation state and therefore on 3D structure.

In our recent study on melittin C K-edge photoionization at cryogenic temperatures [21], we have already shown that double ionization following 1s excitation can also be due to electron impact ionization “from within.” In this process, the Auger electron emitted during the de-excitation of the initial photo-induced 1s vacancy induces an electron impact ionization event elsewhere in the molecule. The cross section for this process can be estimated from atomic cross sections for electron impact ionization. To this end, we have used a Monte Carlo approach in which for each K-edge (C, N, and O), a typical Auger electron energy is chosen (250 eV, 375 eV, and 500 eV, respectively). Clearly, this is a strong simplification, as the actual Auger electron spectra are likely very different for the different K-edges. For each K-edge, we have then used the tabulated electron atomic impact cross sections for the H, C, N, and O constituents of melittin [30, 31]. For a given 1s photoexcited atomic site in the CONH groups along the protein backbone, we have then computed 2×105 random straight line trajectories. For each trajectory we computed whether or not it leads to an ionization event before the Auger electron leaves the protein. This approach is loosely based on the independent atom model (IAM), often employed in electron scattering from molecular systems [32, 33]. Garcia and co-workers have shown that for an electron energy of 300 eV, corrections to IAM scattering cross sections for CO2 are smaller than 10% [32, 33]. For higher energies, atomic electron impact ionization cross sections decrease and the accuracy of the IAM should be even better. As a last step, we then average the ionization probability for all carbon, nitrogen, and oxygen amide sites, respectively (under the approximation that the 1s-π* (CONH/amide) photoexcitation cross section is the same everywhere along the protein backbone.

To show the influence of protein conformation on this channel, we have performed the Monte Carlo calculations for the melittin structure in solution that features a very high α-helical content and for a hypothetical linear configuration. Table 3 shows the results. For all the three edges, the simulations yield a substantially lower contribution of ionization “from within” for the linear conformation (33%, 26%, and 20% for carbon, nitrogen, and oxygen, respectively) as compared to the helical conformation (44%, 39%, and 27% for carbon, nitrogen, and oxygen, respectively). Table 3 also shows the relative contribution of \( 1\mathrm{s}-{\uppi}_{\mathrm{CONH}}^{\ast } \) excitation with respect to the maximum of the ionization continuum for NDDI from Figure 3. The experimentally observed ratio for maximum helicity (q=2) and minimum helicity (q=4) is barely larger than 1 for carbon (1.04), while for nitrogen and oxygen, the ratios are much larger than 1, 1.67, and 2.29 respectively. For a medium-sized protein such as melittin, the quantitative difference between simulation and experimental data is most likely due to the fact that the non-dissociative ionization channels are competing with various fragmentation channels. The respective branching ratios depend for instance on protonation state, photoabsorption site (carbon, nitrogen, or oxygen), and internal temperature. In our previous study on soft X-ray spectroscopy on melittin at cryogenic temperatures (T=10 K), much stronger non-dissociative ionization from within was observed for the melittin C K-edge [21]. At a temperature of 10 K, the photoabsorption occurs in a molecule with almost zero internal energy, whereas at room temperature, the thermal energy amounts to about 5 eV [18]. This implies a significantly smaller relative yield of non-dissociative processes for the experiments at room temperature, in particular for the ionization—from-within events, whose contribution to non-dissociative double ionization will be quenched. In the same context, Milosavljević and co-workers investigated the influence of the protonation state on C K-edge soft X-ray absorption for the much larger protonated protein ubiquitin and observed a substantial reduction of the C \( 1\mathrm{s}-{\uppi}_{\mathrm{CONH}}^{\ast } \) resonance in NDSI with decreasing protonation state but did not assign a mechanism [16].

Table 3 Monte Carlo results for the probabilities for a typical Auger electron emitted from amide carbon, nitrogen, and oxygen, to ionize an atomic site within the protein. Linear refers to an idealized linear configuration, and helical refers to the solution geometry obtained from the protein database. The right columns give the relative intensities of the 1s−π*CONH resonance with respect to the maximum of the 1s ionization continuum for q=2–4 and the ratio between the values for q=2 (helical) and q=4 (non-helical)

Small Neutral Losses

Depending on the final electronic state of the inner-shell excitation process and even more so, on the molecular orbitals involved in the subsequent Auger de-excitation, the electronic excitation of the melittin cations can cover a range of 50 eV or more, with an average of almost 20 eV [18, 34].

Even when neglecting possible direct fragmentation channels, for instance due to population of repulsive states, inner-shell excitation processes will thus ultimately lead to fragmentation of the cationic protein, for example losses of small neutral groups. In the mass spectra, losses of a small neutral group manifest as peaks on the low mass side of the NDSI and NDDI peaks. This is illustrated for NDSI of [melittin+3H]3+ in the mass spectrum in Figure 2(a)). On the low mass side of the [melittin+3H]4+ peak, additional peaks corresponding to the loss of neutrals with m=17, 29, 44, and 59 Da are observed:

  • m=17 Da: loss of an ammonia (NH3) group, either from residues whose sidechain contains an N atom (here Q, K, R, and W) or from the amidated C-terminal. For CID, it was shown that both types of losses can compete [35]. For VUV photofragmentation of protonated substance P, Canon et al. have assigned NH3 loss to originate primarily from the K residue [36].

  • m=29 Da: loss of C2H5 from I residues has been observed in CID [37]. Another possibility would be CH3N loss from R.

  • m=44 Da: loss of CH4N2 from the R residue has been observed in ECD [38] and VUV photoionization [36].

  • m=59 Da: loss of C2H5NO from Q and CH5N3 from R did show up in ECD [38]. For VUV photons, the latter channel was observed [36].

Figure 2
figure 2

(a) Zoom into the soft X-ray photoionization mass spectrum at 288.14 eV. The NDSI peak is observed at 712.6 Da. The m/z region of the neutral loss peaks (m=692.5–720 Da) is multiplied by 3. (bd) Normalized partial ion yields of [melittin+qH](q+1)+ (NDSI, gray symbols) and [melittin+qH-44](q+1)+ (single ionization with loss of 44 Da, red line) for [melittin+qH]q+, with q=2–4. (eg) Relative differences between the normalized spectra

While all four neutral loss peaks are quite intense in the absence of a buffer gas in the ion trap, the presence of helium buffer gas substantially quenches all but the m=44 (CH4N2) loss peak. This implies that CH4N2 loss from R is associated either to the lowest activation energy or to the fastest dissociation rate, allowing for the process to occur before the protein excess energy thermalizes with the buffer gas. For the case of double ionization without the presence of buffer gas, both NDDI and double ionization with small neutral loss are fully quenched. In the presence of He buffer gas, no neutral loss channels besides m=44 loss can be clearly distinguished. The following paragraphs on neutral losses will thus focus on the m=44 loss channel.

Figure 2(b–d) shows the partial ions yields for the m=44 Da loss channel [melittin+qH-44](q+1)+ with the NDSI yield [melittin+qH](q+1)+ at the C K-edge for q=2–4. All spectra are normalized to the peak at 288.14 eV. Figure 2(e–g) displays the respective difference spectra of both channels. For q=2–4, the broad spectral feature starting at 290 eV is systematically higher for the m=44 loss channel. This is intuitively clear, as 1s excitation to higher excited states implies higher excitation energies. The 1s-πw,indole transition is the lowest energy feature at the C K-edge. As discussed in the previous section, this transition is particularly strong in the NDSI spectra for q=3 and 4. This extraordinary strength of the transition is lifted for the m=44 loss and similar relative intensities of the 1s-πw,indole transition are observed for all three protonation states. Furthermore, the 1s-σCC transition is reduced for the m=44 loss channel and q=4.

The soft X-ray partial ion yield spectra for m=44 (CH4N2) loss from doubly ionized [melittin+qH](q+2)+ are shown in Figure 3. The left column compares the NDDI channel (gray symbols) with the respective m=44 loss (red solid line, all spectra are normalized to the double ionization maximum), while the right column displays the difference between both spectra. Clearly, for all protonation states q=2–4, the differences are moderate and limited to energies below the ionization threshold. For double protonation, the melittin is most stable and almost identical soft X-ray spectra are observed whereas for higher protonation states (q=3, 4), the spectrum for the m=44 loss channel exhibits higher intensities between the main 1s-π* resonance and the onset of double ionization. As mentioned before, this energy region represents 1s excitation into higher excited σ-states and Rydberg states, whose population seem to favor neutral losses.

Figure 3
figure 3

(ac) Normalized partial ion yields of [melittin+qH](q+2)+ (NDDI, gray symbols) and [melittin+qH-44](q+2)+ (double ionization with loss of 44 Da, red line) for [melittin+qH]q+, with q=2–4. (df) Relative differences between the normalized spectra

Formation of Sequence Ions

The mass spectrometric most interesting features of the spectra are the peaks due to protein backbone scission. These so-called sequence ions are dictated by the primary structure of the molecule. Canon et al. have already used synchrotron radiation to study the dependence of sequence ion formation on the vacuum ultraviolet photon energy, for a number of proteins [41].

The data for doubly protonated [melittin+2H]2+ at the C K-edge are displayed in Figure 4. The panel on the bottom shows two mass spectra obtained with the Groningen tandem mass spectrometer without helium buffer gas. The photon energies 288.3 eV and 296 eV correspond to the C 1s-π* excitation in the amide C=O group and to direct 1s ionization, respectively. A large number of fragment peaks is apparent but only a few of these peaks have high intensity when measured with the NanoClusterTrap in the presence of buffer gas. For instance, the peaks at 812 and 826 are not observed with buffer gas and most likely product ions from slow fragmentation processes.

Figure 4
figure 4

Bottom: mass spectra for [melittin+2H]2+ photoionization at 288.3 eV (C 1s-π* excitation) and 296 eV (C inner-shell ionization) recorded without buffer gas. Soft X-ray spectra: partial ion yields (gray circles) as a function of C K-edge photon energy for the four sequence ions, labeled in the mass spectra. The red filled spectrum is the fitted NDSI data and the blue filled spectrum is the fitted NDDI data. The black solid line is the sum of NDSI and NDDI. On each panel, the ratio between the peak maxima of NDSI and NDDI contributions is given. Top: melittin sequence with highest proton affinity residues marked red. The protonation sites were identified in [39, 40]

The four panels in Figure 4 show the normalized partial ion yields for four unambigously identified singly charged fragments as a function of photon energy at the C K-edge (for additional fragments, see the supplementary information). Each spectrum was fitted by the sum of the NDSI (red) and NDDI (blue) spectra from Figure 1. The total fit is given as a solid black line. On a first glance, it is obvious that all four fragments (as well as the other fragments in the mass spectrum) have strong contributions from both, single and double ionization, i.e., no strong final charge-state selectivity is observed. This is in line with the expectations, as charge is not a key issue for the stability of the doubly protonated precursor. Deviations between fit and data points are generally limited to the 290–295-eV region, where experimental data is often markedly higher than the fit. The following discussion is based on the assumption that these deviations are due to larger contributions of 1s excitation into energetically higher unoccupied orbitals, i.e., higher electronic excitation. The electronic excitation upon direct 1s photoionization is a priori independent of photon energy.

For two fragments (a6+; z4+), the fit is excellent and deviations are negligible. Partial ion yields that are well reproduced by a superposition of non-dissociative single and double ionization yields most likely correspond to activation energies as low as those of the non-dissociative channels. The activation energies are likely lower than for m=44 loss following single ionization, where higher excited states are favored (Figure 2(b, e)). This can be explained with the help of the melittin sequence depicted on the top of Figure 4, where the five K and R residues, the residues with the highest proton affinity, are marked in red. The first two protonation sites in melittin have been determined by electron capture dissociation [39] to lie between A4 and V5 and between K23 and R24, with the first site likely stabilized by the helical secondary structure of the protein. As expected, the low activation energy fragments a6+ and z4+ both involve scissions that are close to the protonation sites. A deeper look into the data (see Figure S1) indicates that the protonation site is actually K7 rather than A4/V5.

Is there a similar relationship between activation energy for protein backbone scission and characteristics of the respective fragment soft X-ray spectrum also present at higher protonation states? Figure 5 shows the respective data for triply protonated [melittin+3H]3+. Again, only a selection of unambiguously assigned fragments is shown (for a more extensive list, see the supplementary information). A first glance at the spectra reveals that there are still sequence ions such as a6+, which display strong contributions, here with an NDSI/NDDI ratio of 0.74. a6+ is observed for [melittin+2H]2+ as well. However, the NDSI/NDDI sum fit is not as perfect as observed for the [melittin+2H]2+ case (see Figure 4). Deviations are again starting at 289 eV and now extending even beyond 295 eV, probably because of the higher ionization energy.

Figure 5
figure 5

Bottom: mass spectra for [melittin+3H]3+ photoionization at 288.3 eV (C 1s-π* excitation) and 298 eV (C inner-shell ionization) recorded without buffer gas. Soft X-ray spectra: partial ions yields as a function of C K-edge photon energy for the six sequence ions, as labeled in the mass spectra (gray circles). The red filled spectrum always is the fitted NDSI data and the blue filled spectrum is the fitted NDDI data. The black solid line is the sum of NDSI and NDDI. On each panel, the ratio between the peak maxima of NDSI and NDDI contributions is given. Top: melittin sequence with highest proton affinity residues in red. The protonation sites were identified in [39, 40]

Other sequence ions result primarily from single ionization (x203+ and x213+) with NDSI/NDDI ratios (determined from the peak maxima) of 3.2 and 4.25. x213+ is observed in the mass spectra for doubly protonated melittin in Figure 4, which are obtained without a buffer gas but not in the presence of a buffer gas. For [melittin+3H]3+, this channel has a much faster dissociation rates than for [melittin+2H]2+, possibly due to the additional Coulomb repulsion between x213+ and the complementary fragment which is now singly charged. Surprisingly, for these channels, the deviations between fit and actual data are largest, indicating high activation energies.

A third class of sequence ions (x133+, z214+, and z224+) are dominated by double ionization with NDSI/NDDI ratios between 0.16 and 0.39. Here, deviations are smallest and accordingly, activation energies are lowest. For [melittin+3H]3+, protonation sites were determined by Morrison et al. using UV photodissociation as A4, K21, and R24 [40]. The fragments z214+ and z224+ are close to the A4 site. As these two fragments are predominantly formed after double ionization, the remaining N-terminal fragment carries a protonation site and has to be singly charged. Fragment x133+ is not close to one of the protonation sites. Instead, the scission is on the N-side of the only proline (P14) residue. Kjeldsen et al. have identified the fourth protonation site in this region [39]. Also, an enhancement of scission probability close to P residues is often observed [42].

Figure 6 displays the dataset for [melittin+4H]4+. The protonation states are difficult to assign on the basis of existing literature [39, 40]. Most likely, two protons are located in the high proton affinity KRKR region, a third proton sits in the vicinity of the A4 residue and the fourth is located in the general region of the P14 residue. Again, three groups of peaks can be identified. Fragments a6+ and x134+ feature comparable contributions from single and double ionization. Their NDSI/NDDI ratio is between 0.71 and 1.1, respectively. Deviations between the fit and the experimental data are generally even larger than for [melittin+3H]3+.

Figure 6
figure 6

Bottom: mass spectra for [melittin+4H]4+ photoionization at 288.3 eV (C 1s-π* excitation) and 300 eV (C inner-shell ionization) recorded without buffer gas. Soft X-ray spectra: partial ions yields as a function of C K-edge photon energy for the six sequence ions, as labeled in the mass spectra (gray circles). The red filled spectrum always is the fitted NDSI data and the blue filled spectrum is the fitted NDDI data. The black solid line is the sum of NDSI and NDDI. On each panel, the ratio between the peak maxima of NDSI and NDDI contributions is given. Top: melittin sequence with highest proton affinity residues in red. The protonation sites were identified in [39, 40]

The fragment a6+ has been discussed for the two lower protonation states and it is likely related to a multiple scission scenario.

z224+ is dominated by single ionization with a ratio of 5.9. Note, that for the triply protonated precursor, z224+ was almost solely formed by double ionization (see Figure 5). Also, this fragments exhibits large differences between fit and experimental data and probably involves high activation energies.

The last set of fragments (b72+, y134+, and z225+) is dominated either by double ionization or for b72+ even solely due to double ionization. For these fragments, the agreement between fit and experimental data is remarkably good and it is almost perfect for fragment b72+. This hints at very low activation energies for the fragments. Double ionization of [melittin+4H]4+ yields a sixfold charged ion, which probably is the maximum charge this protein can stabilize [43]. Accordingly, activation energies for this system are most likely very low.

Conclusions

We have performed a soft X-ray absorption spectroscopy case study of melittin. Three main classes of reaction channels were investigated: non-dissociative ionization, loss of small neutral molecules, and formation of backbone scission ions.

The analysis of the non-dissociative ionization yields shows clear evidence for significant double ionization in melittin below the ionization threshold. This double ionization is to a large extent due to electron impact ionization by Auger electrons emitted after soft X-ray absorption. We observe a clear decline of the respective yield with decreasing melittin protonation state. There are two major contributions to this trend. First of all, the less compact structure for higher charge states entails lower yields of secondary ionization events triggered by the emitted Auger electrons. This is in line with Monte Carlo simulations of secondary electron ionization in the framework of the independent atom model (IAM).

The second class of reaction channels under investigation is small neutral molecule losses. We observe −17, −29, −44, and −59 loss peaks which can be assigned to loss of NH3, C2H5 CH4N2, and C2H5NO groups from amino acid residues in the peptide. −17 can be assigned to lysine (K), −29 to isoleucine (I), and −44 and −59 to arginine (R). Interestingly, the presence of buffer gas during photoexposure leads to quenching of all these channels, with the exception of −44 loss. Most likely, this channel has the lowest activation energy or a fast dissociation rate.

For sequence ions stemming from backbone scission, the respective partial ion yields were fitted with a superposition of non-dissociative single (NDSI) and double ionization (NDDI) spectra. The ratio between NDSI and NDDI in the fit is defining the relative importance of single and double ionization for fragment formation. The deviations between the NDSI+NDDI fit and fragment yield dependencies from photon energy are mainly observed in the 290–295-eV energy range, where experimental data was exceeding the fit for some fragments. This deviation can be explained by the higher activation energy required for formation of these fragments.

With sufficient mass resolution, the combination of soft X-ray spectroscopy and mass spectrometry allows to assign a specific X-ray absorption spectrum to each ion in the mass spectrum. This spectrum can be used to study the energy deposition for the respective fragmentation and determine the number of removed electrons.