Scientists working in the field of catalysis are fascinated with the prospect of producing a movie portraying a catalytic transformation. The enthrallment with movie making has been enticed both by the technical challenges that need to be overcome and the unprecedented access to the fundamental traits of a catalytic reaction. The importance in accessing fundamental understanding of catalysis was masterfully highlighted on the 2007 report from the US Department of Energy Basic Energy Sciences Workshop [1] in ‘to realize the full potential of catalysis for energy applications, scientists must develop a profound understanding of catalytic transformations so that they can design and build effective catalysts with atom-by-atom precision and convert reactants to products with molecular precision. Moreover, they must build tools to make real-time, spatially resolved measurements of operating catalysts. Ultimately, scientists must use these tools to achieve a fundamental understanding of catalytic processes occurring in multiscale, multiphase environments’. As reminder catalysis is a trillion-dollar industry [2], involve in 90 % of all commercially produced chemical products [3].

Catalysis is driven by the electronic structure of the valence shell. The availability of valence orbitals to form chemical bonds, and thus taking part in the catalytic reaction depends on their electron occupancy and energy [48]. Hard X-rays are ideal probes since they possess high penetration depth and are element specific, which enables the identification of chemical states under working conditions [9, 10]. X-ray photon-in photon-out core level spectroscopy is a powerful tool to understand catalytic reactions because it enables us to map the entire electronic structure of the catalyst under catalytic relevant conditions.

The development of hard X-ray free electron lasers (XFELs) revolutionized the field of ultrafast time-resolved X-ray measurements [1113]. FLASH VUV-soft XFEL located at DESY in Hamburg [14, 15] was the prototype for this kind of facilities that was shortly followed by the linac coherent light source (LCLS) (Stanford, USA), the first operational hard X-ray free electron laser [16]. LCLS operates both in the soft X-ray and hard X-ray regimes, with experimental stations dedicated to various fields of research [17]. SACLA XFEL facility at SPring-8 in Japan started operation recently [18, 19], and several XFEL projects are underway worldwide, including machines in Germany [20], South Korea [21], and Switzerland [22].

X-ray free electron lasers sources consist of high-energy (GeV) electron bunches injected into a series of undulators that are hundreds to thousands of meters long. The oscillation of the electrons in the initial part of the undulators causes radiation to be emitted, as in a synchrotron. As the radiation and electrons co-propagate the radiation field builds up and the electrons start to interact with the radiation, causing a micro-bunch structure to appear with the wavelength of the radiation. This micro-bunch structure radiates coherent X-ray photons, the intensity of which builds up exponentially in a process called self-amplified spontaneous emission (SASE). The result is an intense, spatially coherent beam of femtosecond X-ray pulses, which can be used for experiments.

The spontaneous nature of the SASE process generates XFELs radiation with a large variance in pulse energy (photon flux), photon energy (spectrum), and pulse arrival time. The result is large pulse-to-pulse parameter fluctuation. There are several approaches to reduce XFEL beam instability at the sample position, the most promising approach being to seed the XFEL, preferentially selecting a portion of the photon spectrum to initiate the lasing process [23], resulting in a significant improvement in the spectral stability with a small cost in photon flux (factor of approximately 5–10) [24]. Though the seeded X-ray spectrum is not properly monochromatic, as it has a tail that extends a few eV to lower photon energies, it greatly enhances the pulse-to-pulse energy stability of the photon beam through a monochromator. The recent introduction of a ‘timing tool’ [25, 26], which measures the timing jitter between the optical laser and the XFEL, enabled measurements with 10 fs time resolution [27]. This development has catapulted XFEL pump-probe measurements into time domains previously only accessible with ultrafast optical laser system.

The principal feature of a XFEL is the mammoth number of photons (1,011–1,012 photons/pulse) in a pulse with durations of 10–100 fs, or even sub-fs [28], rendering their X-ray peak brilliance unrivaled. Both SACLA and LCLS operate at low repetition rates of 100–120 Hz, which makes their average X-ray flux similar to that of 3rd-generation synchrotron facilities. The experiments that have benefitted the most from the development of XFEL facilities have been those that either take advantage of the large number of photons per pulse to perform single-shot or nonlinear X-ray experiments, or those that take advantage of the ultrashort pulse durations to perform measurements on the femtosecond timescale. For example, crystallography takes advantage of the fact that ultrashort X-ray pulse can outrun crystal damage, the so called diffraction before destruction approach [29, 30].

Proof-of-concept femtosecond time-resolved X-ray spectroscopy measurements have been performed, in which the authors detected the first steps in hot electron-mediated catalysis [31] and surface bond breaking [32]. Resonant X-ray emission spectroscopy (RXES) monitors the occupied electronic states involved in a chemical reaction [33]. The technique can be carried out on a shot-to-shot basis if one uses a dispersive spectrometer, such as von Hamos [34]. Unoccupied density of states can be determined by means of X-ray absorption spectroscopy (XAS). The advent of high resolution XAS (HR–XAS) enabled determination of changes in the unoccupied density states caused not only by metal oxidation state but also due to the presence, identity and adsorption strength of chemisorbed species [3537]. However HR–XAS has a serious limitation arising from the fact that XAS measurements require scanning of the incoming energy, limiting the time resolution at synchrotrons to the speed at which the monochromator can be moved, i.e., HR–XAS cannot be performed on a shot-to-shot, which is XFELs desired mode.

This limitation was mitigated by Szlachetko et al. [38], which demonstrated that high-energy resolution off-resonant spectroscopy (HEROS) could provide element-specific information about the unoccupied density of states. The concept of HEROS is depicted in Fig. 1. Due to the scanning free arrangement, the HEROS spectra can be recorded on a shot-to-shot basis, while maintaining an energy resolution independent of the initial lifetime broadening [39]. Moreover, the self-absorption process does not affect HEROS spectra, thus making it a powerful technique to identify and quantify the desired structural changes during a catalytic reaction.

Fig. 1
figure 1

Schematic representation (a) and energy level drawing (b) for an off-resonant scattering process. Reproduced from Szlachetko et al. [38]

The HEROS spectroscopy is based on the second-order photon-atom interaction [40, 41]. As noted by Tulkki and Aberg [40] in their theoretical work describing the resonant X-ray Raman scattering process, for incident beam energies tuned far below the absorption edge (off-resonant regime), the shape of the X-ray emission spectrum (XES) is proportional to the unoccupied electronic states multiplied by a Lorentz function resulting from core–hole broadening. In other words, for off-resonant spectroscopy the XAS spectrum is directly reflected in XES at one excitation energy. Until nowadays, the potential for extracting the electronic structure from a single X-ray emission spectrum recorded at off-resonant excitations was not explored because of the extremely weak scattering cross section as compared to X-ray absorption spectroscopy [4245]. With advent of XFELs, the low scattering cross sections are compensated by large intensity of femtosecond X-ray pulses. Moreover, in comparison to synchrotron sources where typically scanning-type spectrometers are employed for measurements, the application of dispersive-type spectrometers at XFELs is essential. The dispersion provides a unique possibility to record broadband energies in a single acquisition and thus facilitate experiments in terms of signal normalization.

As an example, we show the HEROS spectra recorded for nano-Pt/Al2O3 in different gas environment (i.e. H2 and CO at 200 °C). The sample consisted of 3–4 nm Pt nanoparticles supported on alumina (surface area 150 m2/g). The metal loading is 1 wt% and the catalyst was reduced at 200 °C in pure H2 for 1 h before the experiments. The experimental spectra are compared to the theoretical ones calculated using a combination of FEFF code with scattering formulas developed by Tulkki and Aberg. Figure 2 demonstrates the sensitivity of HEROS spectroscopy on catalytic system under in situ conditions. Because of experimental resolution being independent on initial core–hole broadening, the HEROS provide enhanced spectral features, as HR–XAS, however without necessity of scanning the incident beam energy. Thus the metal-site electronic structure changes induced by binding molecules can be detected on a shot-to-shot basis measurement. We would like to stress here, that the chemical speciation by HEROS might be performed using the same strategies as for XAS measurements. On one side a set of reference spectra can be measured that can later be used to identify chemical composition of matter under working conditions. Complementary to reference data, calculations using common XAS codes like FEFF [4649], Orca [5052] or FDMNES [5355] can be used to support the experimental data.

Fig. 2
figure 2

In-situ HEROS spectra of nanoPt/Al2O3 at 200 °C in H2 (red) and CO (blue)

As aforementioned, X-ray emission spectroscopy (XES) is an element-specific technique that provides access to the occupied electronic structure of scattering atom [5659]. XES relies on a second-order scattering process, where the core-electron is ejected into the continuum and an electron from higher electronic levels subsequently fills the remaining core–hole with simultaneous X-ray emission. The emitted X-ray energy is specific for the scattering atom as well as for the involved atomic levels. In general, XES may be employed for detection of core-to-core (c2c) or valence-to-core (v2c) transitions, which involve deep lying electronic states or outermost valence electrons, respectively. Figure 3 depicts the principal of c2c–XES and v2c–XES. The c2c–XES is commonly employed in combination with X-ray absorption spectroscopy because of relatively high decay rates [6063]. The core electronic states are usually weakly interacting with the outermost electrons leading to a simple description of the initial and final configurations. The c2c–XES can be used as a probe of inner shell electron correlations, multi-electron scattering and electron rearrangement processes [6468]. The v2c-transitions are much weaker but on the other hand provide sensitivity to the chemical surrounding of the scattering atom, and in particular allow probing the ligand orbitals. The v2c transition involves the valence electrons decay into the core–hole and therefore for mapping of the highest occupied electronic states in the system. Comparing to the electron based techniques commonly used for determination of low energy electronic states, v2c–XES uses a few keV X-ray energies for incoming and emitted photons. Thanks to the penetrating properties of hard X-rays, the occupied states and ligand environment may be thus probed in the bulk materials, or matter under working conditions.

Fig. 3
figure 3

Atomic process of XAS, v2c–XES and c2c–XES

As an example, the XES spectrum of CuO powder is presented in Fig. 4. The CuO was purchased from Aldrich and used as received. The main emission lines, involving L and M final core states as well as v2c transitions, are plotted in black, red and blue lines respectively. As shown the transition yields vary depending on the final electronic state. For core-to-core transitions, strong X-ray emission lines are detected, however those are usually insensitive to the chemical environment of scattering atom. On the other hand, the v2c transitions are used as a direct probe of the valence electron configuration with enhanced sensitivity to ligands [69], however with yields of few hundreds lower than the main XES lines. We should note here, that in contrast to the HEROS technique, XES does not require monochromatic beams, thus may be simply applied with SASE incident beams delivered by the XFELs.

Fig. 4
figure 4

The c2c and v2c–XES spectrum of CuO

It worth mentioning that HEROS and RXES are bulk techniques, however there are several strategies to retrieve information exclusively from the surface. The most successful approach is to apply phase or frequency modulations methodologies, which changes the surface chemistry while the bulk remains unalterable [7072].

The arrival of XFELs offered researchers sufficient time resolution to follow a catalytic reaction in real time, i.e., produce ‘movies’ of a catalytic transformation. This is conceptually achieved using pump-probe methodology [73]. Briefly, the reaction is initiated with an external trigger (time zero), and the changes are monitored with X-ray probe pulses delayed in time in respect to the trigger. The methodology enables acquisition of ‘snapshots’ during a catalytic transformation, which are then stitched together to make the movie. Despite the strides made towards ensuring a successful XFEL experiment, there are a couple of hurdles yet to be taken, concerning selective triggering and sample stability. Fine-grained control over all experiments parameters is paramount since XFELs experiments are dedicated to a single user at a time with limited access to measurement time at these facilities.

The trigger, used to initiate/excite the system, should create a unique excited-state potential surface characteristic of a single catalytic transformation. Conventionally, optical femtosecond pulses are used to initiate and/or excite a particular system. There are a few catalytic systems that may be triggered using optical pulses, among them the charge separation on photo-catalysis. However the vast majority of heterogeneous catalytic systems cannot be triggered by means of a simple optical excitation primarily due to reaction complexity and/or absence of specific modes on a heterogeneous catalyst or reactant that can be optically excited. A possible strategy is to use multi-pulses or shaped femtosecond pulses, which have to be optimized prior to experiment at the XFEL. Nuernberger et al. [74] demonstrated that the catalytic bond formation could be affected with a shaped femtosecond but the strategy has not been further pursued. Other possible triggers are laser-induced temperature jumps or the strong electric field of THz radiation [75]. However, it needs to be demonstrated that those triggers possess the required specificity and selectivity. Furthermore these are inherently picosecond triggers that will decrease the overall time resolution. As aforementioned, HEROS and RXES are bulk techniques, however the ability to trigger selectively a catalytic process converts them into surface techniques since excited data is subtracted from non-excited in pump-probe methodology.

Samples stability is another major issue. Figure 5 shows what happens to Cu foil during XFEL measurement at LCLS. Within the measurement, the Cu foil gets sputtered, requiring the sample position to be refreshed every minute. The use of short probe pulses enables the acquisition of desired data before sample damage [7678]. However, this also means that the sample needs to be refreshed continuously. For liquid phase transformations, the solution seems to be the use of micro/nano liquid jets [7981]. This might work for some of the liquid phase transformations but cannot be adapted for gas phase transformations, which accounts for a large number of heterogeneous catalytic transformations. The problem is further exacerbated with the decrease of incoming energy since the penetration depth decreases, i.e., more photons get absorbed.

Fig. 5
figure 5

Image depicting sputtering of Cu foil at LCLS/SLAC, experiment L618, CIX FEL beam line, 2012-12-15

A promising technology is a freestanding particles delivery system using aerodynamic focusing methods. It has been demonstrated that such a set up is able to deliver aerosols, nanoparticles, viruses, cells, and biomolecules from ambient conditions into vacuum [8284]. Liu et al. [85, 86] designed an apparatus that uses in-line thin plate orifices to manipulate the particle lateral spatial distribution prior to them passing through the nozzle and subsequently undergoing supersonic expansion into vacuum. An axi-symmetric stack of these thin plate orifices, or aerodynamics lenses, provides successive contractions of a flowing particle beam cross section and enables focusing of a wide range of particles (1 nm to 10 μm) [87]. Equipped with a pressure reducer, the inlet of the aerodynamic lenses stack samples aerosolized particles from atmospheric pressure at a rate of ca. 1 L/min and injects them into vacuum (< 10−6 mbar). If the laser diameter is similar to the one used by single particle aerosol mass spectroscopy [88, 89], a particle moving at 150 m/s will have an effective interaction time of a few microseconds, thus enabling single catalytic events to take place. Bogan et al. [90] used this technology to acquire the diffraction of iron oxide ellipsoid nanoparticles with dimensions of 200 nm by 50 nm, thus confirming the applicability of this technology to deliver nano-catalysts into a measuring chamber. The iron oxide nanoparticles were dispersed in water and injected into vacuum.

Cryogenic cooling has been used at synchrotrons to minimize/prevent beam damage, and in principle can be used at XFELs. However this is not a universal solution since most catalytic systems are only active at high temperatures (above liquid N2 temperatures). Another option is to use absorbers that decrease the number of photons at the sample per pulse, leading to a decrease of XFEL pulse brilliance. This is not desirable for X-ray diffraction or scattering experiments in which every photon counts, but might be a valid option for X-ray spectroscopy since the X-ray photons absorption is roughly an order of magnitude higher than photon scattering [78].

Though 4th-generation light sources are in its infancy, it is clear that their effect on the field of time-resolved X-ray science will be profound and far-reaching. In the case of heterogeneous catalysis, XFELs may enable us to follow reaction in real time, however in order to do so, problems with selective triggering and sample stability have to be solved, in particular for gas phase reactions characterized by multiple reaction steps. In a cinematographic analogy, we have the camera, the set, the script and the actors; what is missing is the director to shout ‘action’ and direct the scenes, and that the actors do not fall ill.

We would like finish with a word of caution to scientists aiming to take advantage of XFELs capabilities. Experiments at XFELs are inherently different from experiments at synchrotrons. While test experiments at synchrotrons are useful and in some cases necessary to confirm experimental aspects such as feasibility, the experiments at synchrotrons cannot be seen as a pre-run to the XFELs experiments because there is a plethora of parameters that are unique to XFELs. These parameters should be carefully considered before any experiment, follow-up proposals are often rejected due to the limited operational beam time.