Retrieving a Complex Molecular Structure from Laser-Induced Electron Diffraction Using Machine Learning

Liu, Xinyao; Amini, Kasra; Sanchez, Aurelien; Belsa, Blanca; Steinle, Tobias; Chirvi, Katharina; Biegert, Jens

doi:10.1007/978-3-031-47938-0_7

Xinyao Liu⁴,
Kasra Amini⁴,
Aurelien Sanchez⁴,
Blanca Belsa⁴,
Tobias Steinle⁴,
Katharina Chirvi⁴ &
…
Jens Biegert^4,5

Part of the book series: Springer Proceedings in Physics ((volume 300))

Included in the following conference series:

International Conference on Attosecond Science and Technology

1139 Accesses

Abstract

We image the structure of the complex chiral molecule Fenchone (C₁₀H₁₆O) from laser-induced electron diffraction data by applying a machine learning algorithm with a convolutional neural network (CNN).

You have full access to this open access chapter, Download conference paper PDF

Learning physical properties of liquid crystals with deep convolutional neural networks

Article Open access 06 May 2020

A review of optical chemical structure recognition tools

Article Open access 07 October 2020

A Deep Learning Approach for Molecular Crystallinity Prediction

Keywords

1 Introduction

Imaging ultrafast molecular dynamics addresses fundamental science, which helps us understand chemical reactions’ basics. Laser-induced electron diffraction (LIED) [1] is a powerful laser-based imaging method that has the ability to image the three-dimensional structure of a single gas-phase molecule with combined sub-atomic picometre and femtosecond spatiotemporal resolution [2]. Taking snapshots of molecular dynamics via the LIED technique gives an inside into the intertwining of molecules, how they react, change, break or bend.

However, retrieving complex molecular structures from diffraction patterns is challenging. As the structural complexity increases, it becomes more challenging to identify the extremum with current retrieval algorithms, and therefore they are still limited to few-atom molecular systems [3].

A machine learning (ML) algorithm is fully capable of solving these difficulties based on its ability to consider multiple degrees of freedom simultaneously. Based on an ML-LIED framework, we demonstrate the accurate retrieval of a large and complex molecule’s three-dimensional (3D) structure.

2 ML Algorithm

Our ML algorithm utilizes a convolutional neural network (CNN). It will be trained to find the relationship between a molecular structure and its molecular interference signal from the two-dimensional differential cross-section maps 2D-DCS. The DCS contains the fingerprints of the internuclear distance of atoms of the molecule and is used as the algorithm’s input data. Using a convolutional neural network, we take advantage of its ability for image recognition. Here, the convolution of the 2D-DCS maps with different filters enhances at first subtle features of the maps providing a collection of feature maps (Fig. 1a). Subsequently, the feature maps pass through the fully connected neural network by multiplying the weights between each neuron to predict the atomic position in the molecule (Fig. 2b).

A flow diagram of the 2 D D C S with different filters and feature maps in a leads to be flattened and passed through the fully-connected neural network to predict the 3 D position of each atom in b. Which leads to a cost function to minimize the difference and optimize all filters and weights. — **Fig. 1**

A scatterplot of atom position versus atom numbers. It plots vertical lines of x, y, and z at atom number 1, and the data points of equilibrium and C N N plot a fluctuating trend between the 7 atoms. A schematic of 3 D molecular structure has 6 areas of uncertainty labeled 2, 3, 4, 5, 6, and 7. — **Fig. 2**

For training the ML algorithm, we first generated a database containing thousands of molecular structures spanning possible geometries. We calculated the corresponding 2D-DCS map for each structure by simulating the elastic scattering of electrons on the molecule using the independent atom model (IAM).

The database is split into training, validation, and test sets to validate the ML model. To evaluate the model’s accuracy during training, we define the absolute difference between the predicted and actual atom position (mean absolute error MAE) as our prediction error.

Once the ML model is satisfactorily trained, the experimental 2D-DCS map is used as an input to generate the predicted molecular structure that most likely contributes to the measured interference signal [4].

3 Extracting Molecular Structure

Before we use our ML framework to retrieve the molecular structure of a complex molecule, we firstly exam the ML model accuracy by revisiting published experimental LIED data of a small linear 1D molecule acetylene (C₂H₂) [5] and a planar 2D molecule carbon disulfide (CS₂) [6]. Table 1 summarizes the ML predicted structural parameters of the C₂H₂ and CS₂ molecules. The predicted structures agree nicely with previous publications where the structure was extracted from the LIED data by a standard fitting routine.

Table 1 Summary of C2H2 and CS2 structures predicted by machine learning (ML)

Full size table

Then, we use our ML framework to study the configuration of a (+)-fenchone (C₁₀H₁₆O; 27 atoms) molecule which experimental 2D-DCS map is also determined with LIED. Retrieving the structure of such a complex molecule by using a standard fitting routine would require an unrealistic calculation time. For example, a calculation time of 1.4e+9 h would be needed to calculate only five variations of its possible structures. ML has the decisive advantage of interpolating and learning between the course grids of precalculated molecular geometries. Thus, we can sufficiently create an interpolated database that only considers the variation of four groups of atoms and a molecule-wide global change in structure. Let the machine itself interpolate the relationship between the molecular structures and corresponding 2D-DCSs with a reduced database, drastically minimizing computational time. Observing the MAE converges to a constant value of ~0.02 involved with the training and validation data sets, we verify that the model is not over/underfitting and is satisfactorily trained. Furthermore, the Pearson correlation coefficient is found to be 0.94, confirming the strong correlation between the experimental and predicted theoretical 2D-DCS.

Figure 2 shows the predicted 3D Cartesian coordinates (x, y, z) for seven atoms in (+)-fenchone that are retrieved from the experimental data by the ML model (green circles). The error bars included the predicted model error as well as the experimental statistical error. The equilibrium ground-state 3D positions of neutral (+)-fenchone are presented by red triangles. The slight deviation of the ML-LIED measured, and the equilibrium ground-state molecular structure is caused involuntarily by the LIED laser field. In addition, the schematic of the predicted 3D (+)-fenchone molecule is also shown. Here, the green circles indicate the degree of uncertainty.

4 Summary

We implement ML-LIED to retrieve atomic positions of 1D, 2D, and complex 3D molecules with picometer and attosecond resolution. The ML-based framework achieves high-accuracy pattern matching in complex solution spaces while overcoming scaling limitations in a standard fitting routine. Not just LIED, the problem of unfavorable scaling also arises with other diffraction methods. Combining ML with LIED offers a new general solution to overcome long-standing problems and opens up new opportunities to image the structure of large, complex molecules.

References

Zuo, T., Bandrauk, A.D., P.B.: Corkum “laser-induced electron diffraction: a new tool for probing ultrafast molecular dynamics”. Chem. Phys. Lett. 259, 313 (1996)
Article ADS Google Scholar
Sanchez, A., et al.: Molecular structure retrieval directly from laboratory-frame photoelectron spectra in laser-induced electron diffraction. Nat. Commun. 12, 1520 (2021)
Article ADS Google Scholar
Sanchez-Gonzalez, A., et al.: Accurate prediction of X-ray pulse properties from a free-electron laser using machine learning. Nat. Commun. 8, 15461 (2017)
Article ADS Google Scholar
Liu, X., et al.: Machine learning for laser-induced electron diffraction imaging of molecular structures. Comm. Chem. 4, 154 (2021)
Article Google Scholar
Pullen, M.G., et al.: Imaging an aligned polyatomic molecule with laserinduced electron diffraction. Nat. Commun. 6, 7262 (2015)
Article ADS Google Scholar
Amini, K., et al.: Imaging the Renner–Teller effect using laser-induced electron diffraction. Proc. Natl. Acad. Sci. U. S. A. 116, 8173–8177 (2019)
Article ADS Google Scholar
Dalton, D.R.: Foundations of Organic Chemistry: Unity and Diversity of Structures, Pathways, and Reactions. Wiley (2020)
Google Scholar
Herzberg, G.: Molecular Spectra and Molecular Structure III: Electronic Spectra and Electronic Structure of Polyatomic Molecules. D. Van Nostrand, New York (1966)
Google Scholar

Download references

Author information

Authors and Affiliations

ICFO – Institut de Ciencies Fotoniques, The Barcelona Institute of Science and Technology, Barcelona, Spain
Xinyao Liu, Kasra Amini, Aurelien Sanchez, Blanca Belsa, Tobias Steinle, Katharina Chirvi & Jens Biegert
ICREA, Barcelona, Spain
Jens Biegert

Authors

Xinyao Liu
View author publications
You can also search for this author in PubMed Google Scholar
Kasra Amini
View author publications
You can also search for this author in PubMed Google Scholar
Aurelien Sanchez
View author publications
You can also search for this author in PubMed Google Scholar
Blanca Belsa
View author publications
You can also search for this author in PubMed Google Scholar
Tobias Steinle
View author publications
You can also search for this author in PubMed Google Scholar
Katharina Chirvi
View author publications
You can also search for this author in PubMed Google Scholar
Jens Biegert
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Jens Biegert .

Editor information

Editors and Affiliations

Department of Physics & CREOL, The College of Optics and Photonics, University of Central Florida, Orlando, FL, USA
Luca Argenti
Department of Physics & CREOL, The College of Optics and Photonics, University of Central Florida, Orlando, FL, USA
Michael Chini
Department of Physics & CREOL, The College of Optics and Photonics, University of Central Florida, Orlando, FL, USA
Li Fang

Rights and permissions

Open Access This chapter is licensed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license and indicate if changes were made.

The images or other third party material in this chapter are included in the chapter's Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the chapter's Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder.

Reprints and permissions

Copyright information

About this paper

Cite this paper

Liu, X. et al. (2024). Retrieving a Complex Molecular Structure from Laser-Induced Electron Diffraction Using Machine Learning. In: Argenti, L., Chini, M., Fang, L. (eds) Proceedings of the 8th International Conference on Attosecond Science and Technology. ATTO 2023. Springer Proceedings in Physics, vol 300. Springer, Cham. https://doi.org/10.1007/978-3-031-47938-0_7

Download citation

DOI: https://doi.org/10.1007/978-3-031-47938-0_7
Published: 01 March 2024
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-47937-3
Online ISBN: 978-3-031-47938-0
eBook Packages: Physics and AstronomyPhysics and Astronomy (R0)

Publish with us

Policies and ethics

Retrieving a Complex Molecular Structure from Laser-Induced Electron Diffraction Using Machine Learning

Abstract

Similar content being viewed by others

Learning physical properties of liquid crystals with deep convolutional neural networks

A review of optical chemical structure recognition tools

A Deep Learning Approach for Molecular Crystallinity Prediction

Keywords

1 Introduction

2 ML Algorithm

3 Extracting Molecular Structure

4 Summary

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Retrieving a Complex Molecular Structure from Laser-Induced Electron Diffraction Using Machine Learning

Abstract

Similar content being viewed by others

Learning physical properties of liquid crystals with deep convolutional neural networks

A review of optical chemical structure recognition tools

A Deep Learning Approach for Molecular Crystallinity Prediction

Keywords

1 Introduction

2 ML Algorithm

3 Extracting Molecular Structure

4 Summary

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation