Structural biology

Visualizing heterogeneous structural ensembles

We are actively working on new computational frameworks to extract ensembles of heterogeneous structures from single particle cryo-EM datasets. Our first application of these ideas produced cryo-DRGN (Deep Reconstructing Generative Networks), which has garnered great interested from the cryo-EM community, and was recently published in Nature Methods. CryoDRGN employs a series of neural networks arranged as a variational autoencoder to 1) embed single particle images in a low-dimensional latent space, and 2) reconstruct 3D density maps from data-occupied regions of this latent space. Together, these networks efficiently classify the particles in a continuous space, which allows us to visualize continuous motions as well as discrete compositional changes. Critically, in contrast to many approaches, one need not specify the number of classes expected nor the types of motion expected.

We can use the maps cryoDRGN generates to identify new structural states that help us understand how molecular machines work. Moreover, we can generate movies of the molecules transitioning from one state to another by traversing along the latent space data manifold, generating structures along the way. Such movies help us understand how structural dynamics facilitate assembly or function.

In related work, we are developing analysis tools to interrogate and interpret the overwhelming number of distinct 3D maps that tools like cryoDRGN can generate.

More importantly, cryoDRGN has it’s own emoji - ❄🐉!!

You can download and try the software yourself here.

 

in-situ structural biology

We were recently awarded a Sloan Research fellowship to extend our machine-learning based methods to complexes imaged directly in cells. If successful, this technique would allow us to visualize the full ensemble of conformational changes these machines utilize in their native cellular environment. Additionally, users could then spatially catalog how the structures vary throughout the cell. For example, we expect to see ribosomes undergoing assembly near the nucleolus and, with this approach, we could quantify how the distribution and maturity of ribosomes assembly intermediates varies throughout the cell. Learn more here.

 

model COMPLEXES FOR STRUCTURAL STUDIES

To guide our development of the computational tools described above, we are also building software to simulate highly heterogeneous single particle cryoEM datasets. This tool, which is implemented in Python, is based a seminal description of how one could model noise in single particle cryo-EM datasets by Baxter et al.

Our tool allows users to 1) convert an atomic model to a 3D density map; 2) generate noiseless 2D projections of this 3D density map from desired projection angles; 3) corrupt the particle image with white Gaussian noise and the contrast transfer function using desired noise and CTF parameters. The resulting images can then be used to test the efficacy of 3D reconstruction algorithms.

simulationed_noise-01.png

In related work, we are also building physical complexes with defined structural heterogeneity, which we can use to benchmark our 3D reconstruction tools.

 

AFFINITY-EM

In collaboration with Dave Thompson’s group at Purdue University, we are working on new methods to affinity purify protein complexes of interest directly on cryo-EM grids. These reagents would allow users to directly isolate an appropriately tagged protein complex from a cell lysate and and to determine the ensemble of conformations it adopts. In preliminary experiments, we have successfully isolated an intact complex from a E. coli.

In related work, we are adapting methods from others to 1) rapidly freeze grids using self-wicking grids and home-built automated plungers, and 2) depositing graphene and graphene oxide monolayers on cryoEM grids to aid in concentrating and supporting our protein complexes during the vitrification process.

 
f6_affinity-01.png

Mass spectrometry

Complex compositional analysis by mass spectrometry

Using quantitative mass spectrometry, we routinely determine the composition of macromolecular complexes we’ve isolated or assembled in vitro.

This work relies on application of a standard complex of known composition that was purified from media bearing mass-distinguishable isotopes. We typically collect “DIA/SWATH” data, and use a series of in-house scripts to carefully (and quantitatively) determine the relative abundance of each subunit in the complex.

When looking at many (10s-100s) of subunits, it sure beats a western blot!

ms_schematic.png
 

Pulse labeling mass spectrometry

We have continued to develop and apply methods to couple pulse-labeling to quantitative mass spectrometry, with the goal of carefully monitoring synthesis and degradation rates of individual proteins across the proteome.

In recent studies, we have applied these methods to bacterial, yeast, and human cells cultured in vitro to understand the relative contributions of various proteostasis pathways, and to identify novel targets for this protestasis machinery.

Additionally, we build software tools to analyze these pulse-labeling timecourses.