scholarly journals Unsupervised learning approaches to characterizing heterogeneous samples using X-ray single-particle imaging

IUCrJ ◽  
2022 ◽  
Vol 9 (2) ◽  
Author(s):  
Yulong Zhuang ◽  
Salah Awel ◽  
Anton Barty ◽  
Richard Bean ◽  
Johan Bielecki ◽  
...  

One of the outstanding analytical problems in X-ray single-particle imaging (SPI) is the classification of structural heterogeneity, which is especially difficult given the low signal-to-noise ratios of individual patterns and the fact that even identical objects can yield patterns that vary greatly when orientation is taken into consideration. Proposed here are two methods which explicitly account for this orientation-induced variation and can robustly determine the structural landscape of a sample ensemble. The first, termed common-line principal component analysis (PCA), provides a rough classification which is essentially parameter free and can be run automatically on any SPI dataset. The second method, utilizing variation auto-encoders (VAEs), can generate 3D structures of the objects at any point in the structural landscape. Both these methods are implemented in combination with the noise-tolerant expand–maximize–compress (EMC) algorithm and its utility is demonstrated by applying it to an experimental dataset from gold nanoparticles with only a few thousand photons per pattern. Both discrete structural classes and continuous deformations are recovered. These developments diverge from previous approaches of extracting reproducible subsets of patterns from a dataset and open up the possibility of moving beyond the study of homogeneous sample sets to addressing open questions on topics such as nanocrystal growth and dynamics, as well as phase transitions which have not been externally triggered.

2020 ◽  
Vol 11 (15) ◽  
pp. 6077-6083
Author(s):  
Thomas Mandl ◽  
Christofer Östlin ◽  
Ibrahim E. Dawod ◽  
Maxim N. Brodmerkel ◽  
Erik G. Marklund ◽  
...  

2015 ◽  
Vol 22 (6) ◽  
pp. 1345-1352 ◽  
Author(s):  
S. A. Bobkov ◽  
A. B. Teslyuk ◽  
R. P. Kurta ◽  
O. Yu. Gorobtsov ◽  
O. M. Yefanov ◽  
...  

Modern X-ray free-electron lasers (XFELs) operating at high repetition rates produce a tremendous amount of data. It is a great challenge to classify this information and reduce the initial data set to a manageable size for further analysis. Here an approach for classification of diffraction patterns measured in prototypical diffract-and-destroy single-particle imaging experiments at XFELs is presented. It is proposed that the data are classified on the basis of a set of parameters that take into account the underlying diffraction physics and specific relations between the real-space structure of a particle and its reciprocal-space intensity distribution. The approach is demonstrated by applying principal component analysis and support vector machine algorithms to the simulated and measured X-ray data sets.


IUCrJ ◽  
2021 ◽  
Vol 8 (6) ◽  
Author(s):  
Miklós Tegze ◽  
Gábor Bortel

In single-particle imaging (SPI) experiments, diffraction patterns of identical particles are recorded. The particles are injected into the X-ray free-electron laser (XFEL) beam in random orientations. The crucial step of the data processing of SPI is finding the orientations of the recorded diffraction patterns in reciprocal space and reconstructing the 3D intensity distribution. Here, two orientation methods are compared: the expansion maximization compression (EMC) algorithm and the correlation maximization (CM) algorithm. To investigate the efficiency, reliability and accuracy of the methods at various XFEL pulse fluences, simulated diffraction patterns of biological molecules are used.


2018 ◽  
Vol 8 (1) ◽  
pp. 132 ◽  
Author(s):  
Zhibin Sun ◽  
Jiadong Fan ◽  
Haoyuan Li ◽  
Huaidong Jiang

IUCrJ ◽  
2017 ◽  
Vol 4 (5) ◽  
pp. 560-568 ◽  
Author(s):  
Carsten Fortmann-Grote ◽  
Alexey Buzmakov ◽  
Zoltan Jurek ◽  
Ne-Te Duane Loh ◽  
Liubov Samoylova ◽  
...  

Single-particle imaging with X-ray free-electron lasers (XFELs) has the potential to provide structural information at atomic resolution for non-crystalline biomolecules. This potential exists because ultra-short intense pulses can produce interpretable diffraction data notwithstanding radiation damage. This paper explores the impact of pulse duration on the interpretability of diffraction data using comprehensive and realistic simulations of an imaging experiment at the European X-ray Free-Electron Laser. It is found that the optimal pulse duration for molecules with a few thousand atoms at 5 keV lies between 3 and 9 fs.


2021 ◽  
Vol 11 (Suppl_1) ◽  
pp. S11-S11
Author(s):  
Grigoriy Armeev ◽  
Alexey Shaytan ◽  
Mikhail Vorovich ◽  
Alexey Egorov ◽  
Aydar Ishmukhametov ◽  
...  

Background: Tick-borne encephalitis virus (TBEV) is a dangerous human pathogen which envelope structure is already known from cryoEM study. TBEV mature viral particle size (~50 nm in diameter) makes it suitable for single-particle imaging (SPI) on X-ray free-electron laser (XFEL). XFEL SPI studies are at the early stages of development; thus, a well-described and conformationally homogeneous sample is required to develop approaches for experimental setup and data analysis. Here we present the image analysis results of data collected in October 2019 during the European XFEL experiment #2316. Methods: The detector was placed at 1.62 m from the injector; photon energy was around 6 keV, pulse energy 4 mJ, beam diameter ~ 500 nm. All runs were processed to detect hits with threshold filter (5th percentile of lit pixels) and further filtered to omit low-intensity images and images that lack detector modules. Filtered hits were background and geometry corrected with SPImage library and custom python scripts. Then hits were azimuthally integrated using PyFAI library. Scattering profiles were further clustered using the affinity propagation algorithm with cosine similarity metric in log space. Extracted classes were used to build averaged images. All hit profiles were fitted with model scattering to estimate the diameter of the particle. Simulated diffraction patterns were prepared using Condor from the cryoEM electron density map (EMDB ID 3752). Results: During the analysis after the filtering, only 276 clean and bright hits were collected per 135 min of injection (from 27287 hits detected via lit pixels threshold). Thus the hit rate was around ~ 2 hits/min, which is expected to rise in the future. The majority of hits correspond to the 40-50 nm particles (Fig. 1a), which is expected for TBEV. However, the exact size may vary due to solvent evaporation, ion condensation, and possible variability in the sample. Conclusion: The averaged images and their scattering profiles correlate with the simulated scattering patterns, though not ideally (Fig. 1 bc). Such discrepancy is expected due to the absence of electron density in the center of modeled viral structures.


Sign in / Sign up

Export Citation Format

Share Document