scholarly journals XAS Data Preprocessing of Nanocatalysts for Machine Learning Applications

Materials ◽  
2021 ◽  
Vol 14 (24) ◽  
pp. 7884
Author(s):  
Oleg O. Kartashov ◽  
Andrey V. Chernov ◽  
Dmitry S. Polyanichenko ◽  
Maria A. Butakova

Innovative development in the energy and chemical industries is mainly dependent on advances in the accelerated design and development of new functional materials. The success of research in new nanocatalysts mainly relies on modern techniques and approaches for their precise characterization. The existing methods of experimental characterization of nanocatalysts, which make it possible to assess the possibility of using these materials in specific chemical reactions or applications, generate significant amounts of heterogeneous data. The acceleration of new functional materials, including nanocatalysts, directly depends on the speed and quality of extracting hidden dependencies and knowledge from the obtained experimental data. Usually, such experiments involve different characterization techniques and different types of X-ray absorption spectroscopy (XAS) too. Using the machine learning (ML) methods based on XAS data, we can study and predict the atomic-scale structure and another bunch of parameters for the nanocatalyst efficiently. However, before using any ML model, it is necessary to make sure that the XAS raw experimental data is properly pre-processed, cleared, and prepared for ML application. Usually, the XAS preprocessing stage is vaguely presented in scientific studies, and the main efforts of researchers are devoted to the ML description and implementation stage. However, the quality of the input data influences the quality of ML analysis and the prediction results used in the future. This paper fills the gap between the stage of obtaining XAS data from synchrotron facilities and the stage of using and customizing various ML analysis and prediction models. We aimed this study to develop automated tools for the preprocessing and presentation of data from physical experiments and the creation of deposited datasets on the basis of the example of studying palladium-based nanocatalysts using synchrotron radiation facilities. During the study, methods of preliminary processing of XAS data were considered, which can be conditionally divided into X-ray absorption near edge structure (XANES) and extended X-ray absorption fine structure (EXAFS). This paper proposes a software toolkit that implements data preprocessing scenarios in the form of a single pipeline. The main preprocessing methods used in this study proposed are principal component analysis (PCA); z-score normalization; the interquartile method for eliminating outliers in the data; as well as the k-means machine learning method, which makes it possible to clarify the phase of the studied material sample by clustering feature vectors of experiments. Among the results of this study, one should also highlight the obtained deposited datasets of physical experiments on palladium-based nanocatalysts using synchrotron radiation. This will allow for further high-quality data mining to extract new knowledge about materials using artificial intelligence methods and machine learning models, and will ensure the smooth dissemination of these datasets to researchers and their reuse.

Author(s):  
Andrea Martini ◽  
Alexander A. Guda ◽  
Sergey A. Guda ◽  
Aram L. Bugaev ◽  
Olga V. Safonova ◽  
...  

Modern synchrotron radiation sources and free electron laser made X-ray absorption spectroscopy (XAS) an analytical tool for the structural analysis of materials under in situ or operando conditions. Fourier approach...


2020 ◽  
Author(s):  
Steven Torrisi ◽  
Matthew Carbone ◽  
Brian Rohr ◽  
Joseph H. Montoya ◽  
Yang Ha ◽  
...  

X-ray absorption spectroscopy (XAS) produces a wealth of information about the local structure of materials, but interpretation of spectra often relies on easily accessible trends and prior assumptions about the structure. Recently, researchers have demonstrated that machine learning models can automate this process to predict the coordinating environments of absorbing atoms from their XAS spectra. However, machine learning models are often difficult to interpret, making it challenging to determine when they are valid and whether they are consistent with physical theories. In this work, we present three main advances to the data-driven analysis of XAS spectra: we demonstrate the efficacy of random forests in solving two new property determination tasks (predicting Bader charge and mean nearest neighbor distance), we show that multiscale featurization can elucidate the regions and trends in spectra that encode various local properties, and we address the effect of normalization on model interpretability. The multiscale featurization transforms the spectrum into a vector of polynomial-fit features, and is contrasted with the commonly-used "pointwise" featurization that directly uses the entire spectrum as input. We find that across thousands of transition metal oxide spectra, the relative importance of features describing the curvature of the spectrum can be localized to individual energy ranges, and we can separate the importance of constant, linear, quadratic, and cubic trends, as well as the white line energy. This work has the potential to assist rigorous theoretical interpretations, expedite experimental data collection, and automate analysis of XAS spectra, thus accelerating discovery of new functional materials.


2020 ◽  
Author(s):  
Steven Torrisi ◽  
Matthew Carbone ◽  
Brian Rohr ◽  
Joseph H. Montoya ◽  
Yang Ha ◽  
...  

X-ray absorption spectroscopy (XAS) produces a wealth of information about the local structure of materials, but interpretation of spectra often relies on easily accessible trends and prior assumptions about the structure. Recently, researchers have demonstrated that machine learning models can automate this process to predict the coordinating environments of absorbing atoms from their XAS spectra. However, machine learning models are often difficult to interpret, making it challenging to determine when they are valid and whether they are consistent with physical theories. In this work, we present three main advances to the data-driven analysis of XAS spectra: we demonstrate the efficacy of random forests in solving two new property determination tasks (predicting Bader charge and mean nearest neighbor distance), we show that multiscale featurization can elucidate the regions and trends in spectra that encode various local properties, and we address the effect of normalization on model interpretability. The multiscale featurization transforms the spectrum into a vector of polynomial-fit features, and is contrasted with the commonly-used "pointwise" featurization that directly uses the entire spectrum as input. We find that across thousands of transition metal oxide spectra, the relative importance of features describing the curvature of the spectrum can be localized to individual energy ranges, and we can separate the importance of constant, linear, quadratic, and cubic trends, as well as the white line energy. This work has the potential to assist rigorous theoretical interpretations, expedite experimental data collection, and automate analysis of XAS spectra, thus accelerating discovery of new functional materials.


2021 ◽  
Vol 7 (1) ◽  
Author(s):  
A. A. Guda ◽  
S. A. Guda ◽  
A. Martini ◽  
A. N. Kravtsova ◽  
A. Algasov ◽  
...  

AbstractX-ray absorption near-edge structure (XANES) spectra are the fingerprint of the local atomic and electronic structures around the absorbing atom. However, the quantitative analysis of these spectra is not straightforward. Even with the most recent advances in this area, for a given spectrum, it is not clear a priori which structural parameters can be refined and how uncertainties should be estimated. Here, we present an alternative concept for the analysis of XANES spectra, which is based on machine learning algorithms and establishes the relationship between intuitive descriptors of spectra, such as edge position, intensities, positions, and curvatures of minima and maxima on the one hand, and those related to the local atomic and electronic structure which are the coordination numbers, bond distances and angles and oxidation state on the other hand. This approach overcoms the problem of the systematic difference between theoretical and experimental spectra. Furthermore, the numerical relations can be expressed in analytical formulas providing a simple and fast tool to extract structural parameters based on the spectral shape. The methodology was successfully applied to experimental data for the multicomponent Fe:SiO2 system and reference iron compounds, demonstrating the high prediction quality for both the theoretical validation sets and experimental data.


Symmetry ◽  
2021 ◽  
Vol 13 (8) ◽  
pp. 1315
Author(s):  
Takafumi Miyanaga

X-ray absorption fine structure (XAFS) is a powerful technique used to analyze a local electronic structure, local atomic structure, and structural dynamics. In this review, I present examples of XAFS that apply to the local structure and dynamics of functional materials: (1) structure phase transition in perovskite PbTiO3 and magnetic FeRhPd alloys; (2) nano-scaled fluctuations related to their magnetic properties in Ni–Mn alloys and Fe/Cr thin films; and (3) the Debye–Waller factors related to the chemical reactivity for catalysis in polyanions and ligand exchange reaction. This study shows that the local structure and dynamics are related to the characteristic function of the materials.


1981 ◽  
Vol 20 (8) ◽  
pp. L612-L614 ◽  
Author(s):  
Hirohiko Adachi ◽  
Kazumi Fujima ◽  
Kazuo Taniguchi ◽  
Chie Miyake ◽  
Shosuke Imoto

IUCrJ ◽  
2017 ◽  
Vol 4 (5) ◽  
pp. 529-539 ◽  
Author(s):  
Masaki Yamamoto ◽  
Kunio Hirata ◽  
Keitaro Yamashita ◽  
Kazuya Hasegawa ◽  
Go Ueno ◽  
...  

The progress in X-ray microbeam applications using synchrotron radiation is beneficial to structure determination from macromolecular microcrystals such as smallin mesocrystals. However, the high intensity of microbeams causes severe radiation damage, which worsens both the statistical quality of diffraction data and their resolution, and in the worst cases results in the failure of structure determination. Even in the event of successful structure determination, site-specific damage can lead to the misinterpretation of structural features. In order to overcome this issue, technological developments in sample handling and delivery, data-collection strategy and data processing have been made. For a few crystals with dimensions of the order of 10 µm, an elegant two-step scanning strategy works well. For smaller samples, the development of a novel method to analyze multiple isomorphous microcrystals was motivated by the success of serial femtosecond crystallography with X-ray free-electron lasers. This method overcame the radiation-dose limit in diffraction data collection by using a sufficient number of crystals. Here, important technologies and the future prospects for microcrystallography are discussed.


Sign in / Sign up

Export Citation Format

Share Document