Massively parallel unsupervised single-particle cryo-EM data clustering via statistical manifold learning.

PLoS One

State Key Laboratory for Artificial Microstructure and Mesoscopic Physics, Institute of Condensed Matter Physics, School of Physics, Center for Quantitative Biology, Peking University, Beijing, China.

Published: October 2017

Structural heterogeneity in single-particle cryo-electron microscopy (cryo-EM) data is a major challenge for high-resolution structure determination. Unsupervised classification can serve as the first step in assessing structural heterogeneity. However, traditional algorithms for unsupervised classification, such as K-means clustering and maximum likelihood optimization, tend to assign images to the wrong classes as the signal-to-noise ratio (SNR) of the image data decreases, while also demanding greater computational cost. Overcoming these limitations requires further development of clustering algorithms for high-performance cryo-EM data processing. Here we introduce an unsupervised single-particle clustering algorithm derived from a statistical manifold learning framework called generative topographic mapping (GTM). We show that unsupervised GTM clustering improves classification accuracy by about 40% in the absence of input references for data with lower SNRs. Applications to several experimental datasets suggest that our algorithm can detect subtle structural differences among classes via a hierarchical clustering strategy. After code optimization for a high-performance computing (HPC) environment, our software implementation was able to generate thousands of reference-free class averages within hours in a massively parallel fashion, which allows a significant improvement in ab initio 3D reconstruction and assists in the computational purification of homogeneous datasets for high-resolution visualization.
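To make the GTM idea in the abstract concrete, the following is a minimal sketch of textbook generative topographic mapping fitted by expectation-maximization and used for clustering. It is not the authors' software: it assumes images have already been flattened into feature vectors, uses a one-dimensional latent grid, and ignores image alignment, CTF correction, hierarchical refinement and parallel execution. All names (`fit_gtm`, `rbf_design`, `n_nodes`, `n_basis`, `width`, `reg`) are hypothetical and chosen for illustration only.

```python
import numpy as np

def rbf_design(latent, centers, width):
    """Gaussian RBF design matrix with a trailing bias column."""
    d2 = ((latent[:, None, :] - centers[None, :, :]) ** 2).sum(-1)
    phi = np.exp(-d2 / (2.0 * width ** 2))
    return np.hstack([phi, np.ones((latent.shape[0], 1))])

def sq_dists(Y, X):
    """Squared Euclidean distances between nodes Y (K, D) and data X (N, D)."""
    d2 = (Y ** 2).sum(1)[:, None] - 2.0 * Y @ X.T + (X ** 2).sum(1)[None, :]
    return np.maximum(d2, 0.0)  # guard against tiny negative round-off

def responsibilities(Y, X, beta):
    """E-step: posterior probability of each latent node for each data point."""
    logp = -0.5 * beta * sq_dists(Y, X)
    logp -= logp.max(axis=0, keepdims=True)  # numerical stability
    R = np.exp(logp)
    return R / R.sum(axis=0, keepdims=True)

def fit_gtm(X, n_nodes=10, n_basis=4, width=1.0, reg=1e-3, n_iter=30):
    """Fit a 1-D GTM by EM; returns node centers, noise precision, responsibilities."""
    N, D = X.shape
    latent = np.linspace(-1.0, 1.0, n_nodes)[:, None]
    centers = np.linspace(-1.0, 1.0, n_basis)[:, None]
    Phi = rbf_design(latent, centers, width)

    # Initialize the mapping so the nodes lie along the first principal axis.
    mean = X.mean(axis=0)
    _, s, Vt = np.linalg.svd(X - mean, full_matrices=False)
    Y0 = mean + latent * (s[0] / np.sqrt(N)) * Vt[0]
    W = np.linalg.lstsq(Phi, Y0, rcond=None)[0]
    Y = Phi @ W
    beta = 1.0 / max(sq_dists(Y, X).min(axis=0).mean() / D, 1e-12)

    for _ in range(n_iter):
        R = responsibilities(Y, X, beta)
        # M-step: regularized weighted least squares for the mapping weights.
        G = np.diag(R.sum(axis=1))
        A = Phi.T @ G @ Phi + (reg / beta) * np.eye(Phi.shape[1])
        W = np.linalg.solve(A, Phi.T @ (R @ X))
        Y = Phi @ W
        # M-step: update the inverse noise variance.
        beta = N * D / (R * sq_dists(Y, X)).sum()

    return Y, beta, responsibilities(Y, X, beta)

# Toy usage: two well-separated synthetic groups of 5-D "images".
rng = np.random.default_rng(0)
group_a = rng.normal(0.0, 0.5, size=(100, 5))
group_b = rng.normal(0.0, 0.5, size=(100, 5))
group_b[:, 0] += 10.0
X_demo = np.vstack([group_a, group_b])
Y_demo, beta_demo, R_demo = fit_gtm(X_demo)
labels_demo = R_demo.argmax(axis=0)  # cluster label = most responsible node
```

In this sketch each data point is assigned to the latent node with the highest posterior responsibility, and a class average would be the mean of the images sharing a node. Because every node shares one noise precision and the nodes are tied together by a smooth mapping, neighbouring classes vary gradually along the latent grid, which is the property that distinguishes GTM from plain K-means.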

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC5546606
PLOS: http://journals.plos.org/plosone/article?id=10.1371/journal.pone.0182130

Publication Analysis

Top Keywords

cryo-em data: 12
massively parallel: 8
unsupervised single-particle: 8
statistical manifold: 8
manifold learning: 8
structural heterogeneity: 8
unsupervised classification: 8
clustering: 6
unsupervised: 5
data: 5

Similar Publications

The pathogenesis of Thyroid Eye Disease (TED) has been attributed to signal enhancement in orbital fibroblasts resulting from an autoantibody-induced, synergistic interaction between the TSH receptor (TSHR) and the IGF-1 receptor (IGF-1R). This interaction has been explained by "receptor cross talk" mediated via β-arrestin binding. Here, we have examined whether this interaction can be mediated via direct receptor contact, using modeling and experimental approaches.

2D template matching (2DTM) can be used to detect molecules and their assemblies in cellular cryo-EM images with high positional and orientational accuracy. While 2DTM successfully detects spherical targets such as large ribosomal subunits, challenges remain in detecting smaller and more aspherical targets in various environments. In this work, a novel 2DTM metric, referred to as the 2DTM p-value, is developed to extend the 2DTM framework to more complex applications.

The structural organisation of pentraxin-3 and its interactions with heavy chains of inter-α-inhibitor regulate crosslinking of the hyaluronan matrix.

Matrix Biol

January 2025

Manchester Cell-Matrix Centre, Division of Cell-Matrix Biology and Regenerative Medicine, School of Biological Sciences, Faculty of Biology, Medicine and Health, University of Manchester, Manchester Academic Health Science Centre, Manchester M13 9PT, UK; Lydia Becker Institute of Immunology and Inflammation, University of Manchester, Manchester, M13 9PL, United Kingdom.

Pentraxin-3 (PTX3) is an octameric protein, composed of eight identical protomers, that has diverse functions in reproductive biology, innate immunity and cancer. PTX3 interacts with the large polysaccharide hyaluronan (HA), to which heavy chains (HCs) of the inter-α-inhibitor (IαI) family of proteoglycans are covalently attached, and plays a key role in the (non-covalent) crosslinking of HC•HA complexes. These interactions stabilise the cumulus matrix, essential for ovulation and fertilisation in mammals, and are also implicated in the formation of pathogenic matrices in the context of viral lung infections.

Transmembrane AMPA receptor regulatory proteins (TARPs) are claudin-like proteins that tightly regulate AMPA receptors (AMPARs) and are fundamental for excitatory neurotransmission. With cryo-electron microscopy (cryo-EM) we reconstruct the 36 kDa TARP subunit γ2 to 2.3 Å, which points to structural diversity among TARPs.

ABCB1 is a broad-spectrum efflux pump central to cellular drug handling and multidrug resistance in humans. However, how it is able to recognize and transport a wide range of diverse substrates remains poorly understood. Here we present cryo-EM structures of lipid-embedded human ABCB1 in conformationally distinct apo-, substrate-bound, inhibitor-bound, and nucleotide-trapped states at 3.
