bims-lances Biomed News
on Landscapes from Cryo-EM and Simulations
Issue of 2024–10–06
three papers selected by
James M. Krieger, National Centre for Biotechnology



  1. bioRxiv. 2024 Sep 19. pii: 2024.09.16.613176. [Epub ahead of print]
      Single-particle analysis by Cryo-electron microscopy (CryoEM) provides direct access to the conformation of each macromolecule. However, the image's signal-to-noise ratio is low, and some form of classification is usually performed at the image processing level to allow structural modeling. Classical classification methods imply the existence of a discrete number of structural conformations. However, new heterogeneity algorithms introduce a novel reconstruction paradigm, where every state is represented by a lower number of particles, potentially just one, allowing the estimation of conformational landscapes representing the different structural states a biomolecule explores. In this work, we present a novel deep learning-based method called HetSIREN. HetSIREN can fully reconstruct or refine a CryoEM volume in real space based on the structural information summarized in a conformational latent space. The unique characteristics that set HetSIREN apart start with the definition of the approach as a real space-based only method, a fact that allows spatially focused analysis, but also the introduction of a novel network architecture specifically designed to make use of meta-sinusoidal activations, with proven high analytics capacities. Continuing with innovations, HetSIREN can also refine the pose parameters of the images at the same time that it conditions the network with prior information/constraints on the maps, such as Total Variation and L1 denoising, ultimately yielding cleaner volumes with high-quality structural features. Finally, but very importantly, HetSIREN addresses one of the most confusing issues in heterogeneity analysis, as it is the fact that real structural heterogeneity estimation is entangled with pose estimation (and to a lesser extent with CTF estimation), in this way, HetSIREN introduces a novel encoding architecture able to decouple pose and CTF information from the conformational landscape, resulting in more accurate and interpretable conformational latent spaces. We present results on computer-simulated data, public data from EMPIAR, and data from experimental systems currently being studied in our laboratories. An important finding is the sensitivity of the structure and dynamics of the SARS-CoV-2 Spike protein on the storage temperature.
    Keywords:  Cryo-Electron Microscopy (CryoEM); Deep Learning; Heterogenous Reconstruction; Principal Component Analysis (PCA); SARS-CoV-2 Spike protein; Sinusoidal Representation Network (SIREN); Uniform Manifold Approximation and Projection (UMAP); temperature dependence
    DOI:  https://doi.org/10.1101/2024.09.16.613176
  2. bioRxiv. 2024 Sep 21. pii: 2024.09.20.610514. [Epub ahead of print]
      Herpesviridae infect nearly all humans for life, causing diseases that range from painful to life-threatening 1 . These viruses penetrate cells by employing a complex apparatus composed of separate receptor-binding, signal-transmitting, and membrane-fusing components 2 . But how these components coordinate their functions is unknown. Here, we determined the 4.19-angstrom cryoEM reconstruction of the central signal-transmitting component from herpes simplex virus 2, the gH/gL complex, in its elusive pre-activation state. Analysis of the continuum of conformational ensembles observed in cryoEM data revealed a series of structural rearrangements in gH/gL that allosterically transmit the fusion-triggering signal from the receptor-binding glycoprotein gD to the membrane fusogen gB. Furthermore, we identified a structural "switch" element in gH/gL that refolds and flips 180 degrees during the transition from pre-activation to activated form. Conservation of this "switch" in gH/gL homologs suggests that the proposed fusion triggering mechanism may apply to all Herpesviridae and points to a new target for subunit-based vaccines and treatment efforts.
    DOI:  https://doi.org/10.1101/2024.09.20.610514
  3. J Chem Theory Comput. 2024 Oct 02.
      Integrative structural biology synergizes experimental data with computational methods to elucidate the structures and interactions within biomolecules, a task that becomes critical in the absence of high-resolution structural data. A challenging step for integrating the data is knowing the expected accuracy or belief in the dataset. We previously showed that the Modeling Employing Limited Data (MELD) approach succeeds at predicting structures and finding the best interpretation of the data when the initial belief is equal to or slightly lower than the real value. However, the initial belief might be unknown to the user, as it depends on both the technique and the system of study. Here we introduce MELD-Adapt, designed to dynamically evaluate and infer the reliability of input data while at the same time finding the best interpretation of the data and the structures compatible with it. We demonstrate the utility of this method across different systems, particularly emphasizing its capability to correct initial assumptions and identify the correct fraction of data to produce reliable structural models. The approach is tested with two benchmark sets: the folding of 12 proteins with coarse physical insights and the binding of peptides with varying affinities to the extraterminal domain using chemical shift perturbation data. We find that subtle differences in data structure (e.g., locally clustered or globally distributed), starting belief, and force field preferences can have an impact on the predictions, limiting the possibility of a transferable protocol across all systems and data types. Nonetheless, we find a wide range of initial setup conditions that will lead to successful sampling and identification of native states, leading to a robust pipeline. Furthermore, disagreements about how much data is enforced and satisfied rapidly serve to identify incorrect setup conditions.
    DOI:  https://doi.org/10.1021/acs.jctc.4c00690