J Struct Biol. 2024 Mar 01. pii: S1047-8477(24)00013-3. [Epub ahead of print] 108073
Cryo-electron microscopy has become a powerful tool to determine three-dimensional (3D) structures of rigid biological macromolecules from noisy micrographs with single-particle reconstruction. Recently, deep neural networks, e.g., CryoDRGN, have demonstrated conformational and compositional heterogeneity of complexes. However, the lack of ground-truth conformations poses a challenge to assess the performance of heterogeneity analysis methods. In this work, variational autoencoders (VAE) with three types of deep generative priors were learned for latent variable inference and heterogeneous 3D reconstruction via Bayesian inference. More specifically, VAEs with "Variational Mixture of Posteriors" priors (VampPrior-SPR), non-parametric exemplar-based priors (ExemplarPrior-SPR) and priors from latent score-based generative models (LSGM-SPR) were quantitatively compared with CryoDRGN. We built four simulated datasets composed of hypothetical continuous conformation or discrete states of the hERG K+ channel. Empirical and quantitative comparisons of inferred latent representations were performed with affine-transformation-based metrics. These models with more informative priors gave better regularized, interpretable factorized latent representations with better conserved pairwise distances, less deformed latent distributions and lower within-cluster variances. They were also tested on experimental datasets to resolve compositional and conformational heterogeneity (50S ribosome assembly, cowpea chlorotic mottle virus, and pre-catalytic spliceosome) with comparable high resolution. Codes and data are available: https://github.com/benjamin3344/DGP-SPR.
Keywords: Cryogenic electron microscopy; Deep generative models; Protein conformational variation; Single-particle analysis