Abstract
We image the structure of the complex chiral molecule Fenchone (C10H16O) from laser-induced electron diffraction data by applying a machine learning algorithm with a convolutional neural network (CNN).
You have full access to this open access chapter, Download conference paper PDF
Similar content being viewed by others
Keywords
1 Introduction
Imaging ultrafast molecular dynamics addresses fundamental science, which helps us understand chemical reactions’ basics. Laser-induced electron diffraction (LIED) [1] is a powerful laser-based imaging method that has the ability to image the three-dimensional structure of a single gas-phase molecule with combined sub-atomic picometre and femtosecond spatiotemporal resolution [2]. Taking snapshots of molecular dynamics via the LIED technique gives an inside into the intertwining of molecules, how they react, change, break or bend.
However, retrieving complex molecular structures from diffraction patterns is challenging. As the structural complexity increases, it becomes more challenging to identify the extremum with current retrieval algorithms, and therefore they are still limited to few-atom molecular systems [3].
A machine learning (ML) algorithm is fully capable of solving these difficulties based on its ability to consider multiple degrees of freedom simultaneously. Based on an ML-LIED framework, we demonstrate the accurate retrieval of a large and complex molecule’s three-dimensional (3D) structure.
2 ML Algorithm
Our ML algorithm utilizes a convolutional neural network (CNN). It will be trained to find the relationship between a molecular structure and its molecular interference signal from the two-dimensional differential cross-section maps 2D-DCS. The DCS contains the fingerprints of the internuclear distance of atoms of the molecule and is used as the algorithm’s input data. Using a convolutional neural network, we take advantage of its ability for image recognition. Here, the convolution of the 2D-DCS maps with different filters enhances at first subtle features of the maps providing a collection of feature maps (Fig. 1a). Subsequently, the feature maps pass through the fully connected neural network by multiplying the weights between each neuron to predict the atomic position in the molecule (Fig. 2b).
For training the ML algorithm, we first generated a database containing thousands of molecular structures spanning possible geometries. We calculated the corresponding 2D-DCS map for each structure by simulating the elastic scattering of electrons on the molecule using the independent atom model (IAM).
The database is split into training, validation, and test sets to validate the ML model. To evaluate the model’s accuracy during training, we define the absolute difference between the predicted and actual atom position (mean absolute error MAE) as our prediction error.
Once the ML model is satisfactorily trained, the experimental 2D-DCS map is used as an input to generate the predicted molecular structure that most likely contributes to the measured interference signal [4].
3 Extracting Molecular Structure
Before we use our ML framework to retrieve the molecular structure of a complex molecule, we firstly exam the ML model accuracy by revisiting published experimental LIED data of a small linear 1D molecule acetylene (C2H2) [5] and a planar 2D molecule carbon disulfide (CS2) [6]. Table 1 summarizes the ML predicted structural parameters of the C2H2 and CS2 molecules. The predicted structures agree nicely with previous publications where the structure was extracted from the LIED data by a standard fitting routine.
Then, we use our ML framework to study the configuration of a (+)-fenchone (C10H16O; 27 atoms) molecule which experimental 2D-DCS map is also determined with LIED. Retrieving the structure of such a complex molecule by using a standard fitting routine would require an unrealistic calculation time. For example, a calculation time of 1.4e+9 h would be needed to calculate only five variations of its possible structures. ML has the decisive advantage of interpolating and learning between the course grids of precalculated molecular geometries. Thus, we can sufficiently create an interpolated database that only considers the variation of four groups of atoms and a molecule-wide global change in structure. Let the machine itself interpolate the relationship between the molecular structures and corresponding 2D-DCSs with a reduced database, drastically minimizing computational time. Observing the MAE converges to a constant value of ~0.02 involved with the training and validation data sets, we verify that the model is not over/underfitting and is satisfactorily trained. Furthermore, the Pearson correlation coefficient is found to be 0.94, confirming the strong correlation between the experimental and predicted theoretical 2D-DCS.
Figure 2 shows the predicted 3D Cartesian coordinates (x, y, z) for seven atoms in (+)-fenchone that are retrieved from the experimental data by the ML model (green circles). The error bars included the predicted model error as well as the experimental statistical error. The equilibrium ground-state 3D positions of neutral (+)-fenchone are presented by red triangles. The slight deviation of the ML-LIED measured, and the equilibrium ground-state molecular structure is caused involuntarily by the LIED laser field. In addition, the schematic of the predicted 3D (+)-fenchone molecule is also shown. Here, the green circles indicate the degree of uncertainty.
4 Summary
We implement ML-LIED to retrieve atomic positions of 1D, 2D, and complex 3D molecules with picometer and attosecond resolution. The ML-based framework achieves high-accuracy pattern matching in complex solution spaces while overcoming scaling limitations in a standard fitting routine. Not just LIED, the problem of unfavorable scaling also arises with other diffraction methods. Combining ML with LIED offers a new general solution to overcome long-standing problems and opens up new opportunities to image the structure of large, complex molecules.
References
Zuo, T., Bandrauk, A.D., P.B.: Corkum “laser-induced electron diffraction: a new tool for probing ultrafast molecular dynamics”. Chem. Phys. Lett. 259, 313 (1996)
Sanchez, A., et al.: Molecular structure retrieval directly from laboratory-frame photoelectron spectra in laser-induced electron diffraction. Nat. Commun. 12, 1520 (2021)
Sanchez-Gonzalez, A., et al.: Accurate prediction of X-ray pulse properties from a free-electron laser using machine learning. Nat. Commun. 8, 15461 (2017)
Liu, X., et al.: Machine learning for laser-induced electron diffraction imaging of molecular structures. Comm. Chem. 4, 154 (2021)
Pullen, M.G., et al.: Imaging an aligned polyatomic molecule with laserinduced electron diffraction. Nat. Commun. 6, 7262 (2015)
Amini, K., et al.: Imaging the Renner–Teller effect using laser-induced electron diffraction. Proc. Natl. Acad. Sci. U. S. A. 116, 8173–8177 (2019)
Dalton, D.R.: Foundations of Organic Chemistry: Unity and Diversity of Structures, Pathways, and Reactions. Wiley (2020)
Herzberg, G.: Molecular Spectra and Molecular Structure III: Electronic Spectra and Electronic Structure of Polyatomic Molecules. D. Van Nostrand, New York (1966)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Open Access This chapter is licensed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license and indicate if changes were made.
The images or other third party material in this chapter are included in the chapter's Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the chapter's Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder.
Copyright information
© 2024 The Author(s)
About this paper
Cite this paper
Liu, X. et al. (2024). Retrieving a Complex Molecular Structure from Laser-Induced Electron Diffraction Using Machine Learning. In: Argenti, L., Chini, M., Fang, L. (eds) Proceedings of the 8th International Conference on Attosecond Science and Technology. ATTO 2023. Springer Proceedings in Physics, vol 300. Springer, Cham. https://doi.org/10.1007/978-3-031-47938-0_7
Download citation
DOI: https://doi.org/10.1007/978-3-031-47938-0_7
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-47937-3
Online ISBN: 978-3-031-47938-0
eBook Packages: Physics and AstronomyPhysics and Astronomy (R0)