Recovery of conformational continuum from single-particle cryo-EM data: Optimization of ManifoldEM informed by ground-truth studies
This work is based on the manifold-embedding approach to the study of biological molecules exhibiting conformational changes in a continuum. Previous studies established a workflow capable of reconstructing atomic-level structures in the conformational continuum from cryo-EM images so as to reveal the latent space of macromolecules undergoing multiple degrees of freedom. Here, we introduce a new approach that is informed by detailed heuristic analysis of manifolds formed by simulated heterogeneous cryo-EM datasets. These simulated models were generated with increasing complexity to account for multiple motions, state occupancies and CTF in a wide range of signal-to-noise ratios. Using these datasets as ground-truth, we provide detailed exposition of our findings using several conformational motions while exploring the available parameter space. Guided by these insights, we build a framework to leverage the high-dimensional geometric information obtained towards reconstituting the quasi-continuum of conformational states in the form of an energy landscape and respective 3D maps for all states therein. This framework offers substantial enhancements relative to previous work, for which a direct comparison of outputs has been provided.