Protein sample preparation for solid-state NMR investigations
Graphical abstract
Introduction
Solid-state NMR is one of the more recent arrivals in the field of biophysical structure analysis techniques. Solid-state and solution-state NMR were actually developed about at the same time, but the application to solid biomolecules was for long hampered by the presence of broad resonance lines, which restricted the information attainable for large multi-spin systems like proteins. Solution-state NMR was thus the first approach developed in this context [1], and protein solid-state NMR emerged only when NMR technology had advanced to provide static fields high enough and magic-angle spinning fast enough to obtain sufficient resolution and sensitivity to detect and resolve the many spins in a protein. Another concomitant factor was that the cost of recombinant production and isotope labeling of proteins was coming down to affordable ranges, enabling production of the several tens of milligrams 13C/15N labeled samples the method required.
Solid-state NMR sample preparation is of central importance in protein studies, as also for solution-state NMR, x-ray crystallography, and cryo-electron microscopy (EM). In contrast to EM, which also can analyze samples extracted from organisms, for NMR the proteins to be studied most often need to be produced recombinantly, since isotope labeling is of high importance. In solid-state NMR, labeling may be even more important, as solid-state homonuclear 1H spectroscopy remains difficult with current proton linewidths, even under fast MAS. Also, probe and rotor background signals can add difficulties to the detection of the desired protein signals.
Solid-state NMR has the strength to go beyond the characterization of relatively small and soluble biomolecules, and can analyze large assemblies and complexes thereof. Sample preparation thus often goes beyond the expression and purification of the protein, and includes further steps like membrane reconstitution, assembly, or addition of interacting partners. The approaches thus share aspects with sample preparation for electron microscopy [2], [3], [4]. However, unlike for EM, the sample is often hydrated and at room temperature during the NMR experiment, conditions which might be considered closer to physiological.
Solid-state NMR needs homogenous protein samples, as measured linewidths are directly related to the local order the proteins show in the samples. Highest quality NMR spectra are still obtained on highly ordered assemblies, like small, rigid crystalline proteins, or highly rigid assemblies like bacterial needles or virus capsids. Understanding of how exactly the linewidth is related to order and (residual) dynamics is slowly emerging, but more work is needed to gain deeper insight.
Solid-state NMR was previously infamous for the high sample amounts it needed. The field took off twenty years ago with the use of sample containers (rotors) which needed about 40–50 mg starting material to be optimally filled. The latest developments today reduce this amount to below 0.5 mg, i.e. by a factor of 100. This opens perspectives which were unthought-of before, mainly in terms of types of proteins that can be analyzed. In terms of sample requirement, this actually puts solid-state NMR in one of the best positions in the panel of available high-resolution biophysical structure analysis methods. If EM can in principle analyze samples extracted from organisms, purity is there also of concern, and sorting of polymorphs, for example, is not always straightforward with current resolution, even if progress is highly promising. It is particularly interesting to combine solid-state NMR with cell-free expression, as there only the protein to be analyzed is “visible” as it represents the only isotope labeled material, and any additives/interacting partners will remain invisible in the spectra. The important drop in the required sample amount opens the possibility for this combination today.
We will review here sample preparation methods which have been used for solid-state NMR, but which often are derived from other biophysical methods, or which can be used by other methods as well. We will start from protein production by cellular methods for the introduction of selective labels, and cell-free methods; describe membrane protein reconstitution into lipids, as well as sample preparation for DNP and paramagnetic NMR. We finally review the basics of protein sedimentation which is today the main principle of sample preparation for solid-state NMR.
Section snippets
Residue-selective isotope labeling
Bacterial expression remains the workhorse for NMR sample preparation, and has been described extensively in the literature [5]. Selective labeling has been used in solution NMR, with a focus recently on methyl-group labeling in the context of the study of large proteins and the application of TROSY-based NMR approaches [6]. In contrast to solution NMR, where linewidths are dependent on rotational correlation times and thus protein size, which hampers analysis of large proteins or complexes,
Segmental labeling
Segmental isotope labeling of individual domains in a full-length construct allows the study of domains of a large protein separately. This is particularly useful to simplify the resonance assignments of large proteins in the solid state or to study the interaction of an individual domain within a protein-protein complex, with RNA and DNA or other small molecules. In such an approach, one of the two domains is made visible for NMR by 13C and 15N isotope labeling. This domain is ligated with a
Cell-free protein synthesis
Bacterial expression as described above, and more generally cell-based expression, is not always the method of choice for sample preparation for NMR studies, especially in the case of eukaryotic membrane proteins. Indeed, although some have been successfully expressed in cell-based expression systems, e.g. in Lactococcus lactis [23], in human [24], [25] or in insect cells [26], the production of such proteins is still challenging. Overexpression is often impaired by toxicity of the target
Membrane protein reconstitution
One of the major advantages of solid-state NMR is the possibility to conduct structural investigations on membrane proteins inserted in a lipid environment. This feature is important because many membrane proteins display their full activity only when embedded in a lipid bilayer. Therefore, the reconstitution in lipids procedure can be considered as a cornerstone in the sample preparation. The reconstitution process is based on the co-micellisation of purified membrane protein in detergent with
Dynamic nuclear polarization (DNP)
DNP is a sensitivity-enhancement method based on transferring the much larger electron polarization to surrounding nuclei [111], [112], in particular via electron-proton two- or three-spin processes and followed by 1H–1H spin-diffusion [113]. This is achieved by irradiating the sample with a microwave source and saturating the EPR lines of added organic radicals. This approach has been used in NMR, in biology and materials [114], [115], [116], [117], [118]. Protein DNP requires specific sample
Paramagnetic NMR
Paramagnetic NMR is a well-established tool in solution-state NMR, where for example it is used to identify interaction sites like nucleotide binding sites and to derive structural restraints. For reviews see Refs. [135], [136]. Paramagnetic relaxation enhancements (PREs) and Pseudo Contact Shifts (PCS) are used in protein structure determination [135]. The use of paramagnetic effects in biomolecular solid-state NMR developed over the last years [137], [138], [139], [140], and has been used to
Sedimentation and rotor filling
An important step in sample preparation is the filling of the protein into the NMR rotor. As the protein is subject to high centrifugal forces in the rotor, the best approach is to use ultracentrifugation to fill the rotor, as this avoids void space once the rotor is spun in the NMR probe. For this, special filling tools have been developed over recent years by different groups [109], [151], [152] (see also Fig. 9), and commercial filling tools are now also available. Concomitant with rotor
Summary and outlook
Sample preparation is progressing from being the compulsory first (sometimes difficult and, for many NMR spectroscopists, still mysterious and annoying) step to becoming a real tool box, capable of adapting proteins to spectroscopic needs, potentials and limitations. Singling out certain protein domains, amino acid types or pairs thereof by specific labeling, playing with deuteration and protonation patterns, simply sedimenting proteins of interest, and most importantly synthesizing proteins
Acknowledgements
This work was supported by the French Agence Nationale de Recherche (ANR-14-CE09-0024B), the LABEX ECOFECT (ANR-11-LABX-0048) within the Université de Lyon program Investissements d’Avenir (ANR-11-IDEX-0007), and by the Swiss National Science Foundation (Grant 200020_159707). T.W. acknowledges support by the ETH Career SEED-69 16-1.
Glossary
- ABC
- ATP-binding cassette
- AMP-PNP
- Adenylyl-imidodiphosphate
- cDNA
- complementary DNA
- CECF
- continuous-exchange CF system
- CF
- cell-free
- CFCF
- continuous flow CF
- CMC
- critical micelle concentration
- CTD
- C-terminal domain
- Cryo-EM
- cryo-electron microscopy
- DARR
- dipolar-assisted rotational recoupling
- DNA
- deoxyribonucleic acid
- DNP
- dynamic nuclear polarization
- E. coli
- Escherichia coli
- FUS
- fused in sarcoma
- GRecon
- Gradient reconstitution
- HpDnaB
- Helicobacter pylori DnaB
- LPR
- lipid-to-protein ratio
- NMR
- nuclear magnetic resonance
- NTD
- N-terminal domain
References (160)
- et al.
New developments in isotope labeling strategies for protein solution NMR spectroscopy
Curr. Opin. Struct. Biol.
(2000) - et al.
Architecture of a dodecameric bacterial replicative helicase
Structure
(2012) - et al.
Lactococcus lactis as host for overproduction of functional membrane proteins
Biochim. Biophys. Acta
(2003) - et al.
Semliki Forest virus vectors for overexpression of 101 G protein-coupled receptors in mammalian host cells
Protein Expr Purif.
(2006) - et al.
BacMam system for high-level expression of recombinant soluble and membrane glycoproteins for structural studies
Protein Expr Purif.
(2008) - et al.
Rapid screening of membrane protein expression in transiently transfected insect cells
Protein Expr Purif.
(2013) - et al.
Reconstitution of membrane proteins into liposomes
Meth. Enzymol.
(2003) - et al.
Membrane proteins, lipids and detergents: not just a soap opera
Biochim. Biophys. Acta (BBA) - Biomemb.
(2004) - et al.
Functional expression, purification, characterization, and membrane reconstitution of non-structural protein 2 from hepatitis C virus
Protein Expr Purif.
(2015) - et al.
Cell-free translation of integral membrane proteins into unilamelar liposomes
Meth. Enzymol.
(2009)
Cell-free synthesis and functional characterization of sphingolipid synthases from parasitic trypanosomatid protozoa
J. Biol. Chem.
High-throughput cell-free systems for synthesis of functionally active proteins
Trends Biotechnol.
Automated system for high-throughput protein production using the dialysis cell-free method
Protein Expr. Purif.
Reconstitution of the Cpx signaling system from cell-free synthesized proteins
N. Biotechnol.
Cell-free expression systems for eukaryotic protein production
Curr. Opin. Biotechnol.
Wheat germ systems for cell-free protein expression
FEBS Lett.
A bilayer cell-free protein synthesis system for high-throughput screening of gene products
FEBS Lett.
The past, present and future of cell-free protein synthesis
Trends Biotechnol.
Possible role of phosphorylation in the function of chicken MyoD1
J. Biol. Chem.
Phosphorylation of plant eukaryotic initiation factor-2 by the plant-encoded double-stranded RNA-dependent protein kinase, pPKR, and inhibition of protein synthesis in vitro
J. Biol. Chem.
Sequence specificity and efficiency of protein N-terminal methionine elimination in wheat-embryo cell-free system
Protein Expr Purif.
11-(Ethylthio)undecanoic acid. A myristic acid analogue of altered hydrophobicity which is functional for peptide N-myristoylation with wheat germ and yeast acyltransferase
J. Biol. Chem.
Cloning and characterization of a 72-kDa inositol-polyphosphate 5-phosphatase localized to the Golgi network
J. Biol. Chem.
Protein labeling strategies for liquid-state NMR spectroscopy using cell-free synthesis
Progr. NMR Spectr.
Wheat germ cell-free expression: Two detergents with a low critical micelle concentration allow for production of soluble HCV membrane proteins
Protein Expr Purif.
Advances in NMR structures of integral membrane proteins
Curr. Opin. Struct. Biol.
Labeling of membrane proteins by cell-free expression
Meth. Enzymol.
Cell-free expression–making a mark
Curr. Opin. Struct. Biol.
Reconstitution of membrane proteins into liposomes: application to energy-transducing membrane proteins
Biochim. Biophys. Acta
NMR Studies of structure and function of biological macromolecules (Nobel Lecture)
Angew. Chem. Int. Ed.
Cryo-EM structure of alpha-synuclein fibrils
Elife
GRecon: a method for the lipid reconstitution of membrane proteins
Angew. Chem. Int. Ed.
Gradient reconstitution of membrane proteins for solid-state NMR studies
J. Biomol. NMR
Recombinant protein expression in E. coli: a historical perspective
Methods Mol. Biol.
Bacterial metabolism
Annu. Rev. Microbiol.
Selective labeling and unlabeling strategies in protein solid-state NMR spectroscopy
J. Biomol. NMR
Synthesis of proteins by native chemical ligation
Science
Expressed protein ligation: a general method for protein engineering
Proc. Natl. Acad. Sci. USA
Protein trans-splicing by a split intein encoded in a split DnaE gene of Synechocystis sp. PCC6803
Proc. Natl. Acad. Sci. USA
Protein trans-splicing and its use in structural biology: opportunities and limitations
Mol. BioSyst.
Sortase-mediated protein ligation: a new method for protein engineering
JACS
Untangling a repetitive amyloid sequence: correlating biofilm-derived and segmentally labeled Curli Fimbriae by solid-state NMR spectroscopy
Angew. Chem. Int. Ed.
Unambiguous assignment of short- and long-range structural restraints by solid-state NMR spectroscopy with segmental isotope labeling
Chembiochem.
Segmental isotopic labeling of HIV-1 capsid protein assemblies for solid state NMR
J. Biomol. NMR
Combining DNP NMR with segmental and specific labeling to study a yeast prion protein strain that is not parallel in-register
Proc. Natl. Acad. Sci. USA
Segmental isotope labelling and solid-state NMR of a 12 × 59 kDa motor protein: identification of structural variability
J. Biomol. NMR
Structure and primase-mediated activation of a bacterial dodecameric replicative helicase
Nucl. Acids Res.
[sup 13]C–[sup 1]H dipolar-driven [sup 13]C–[sup 13]C recoupling without [sup 13]C rf irradiation in nuclear magnetic resonance of rotating solids
J. Chem. Phys.
Cell-free synthesis of zinc-binding proteins
J. Struct. Funct. Genom.
Cited by (22)
Solid-state NMR methods for the characterization of bioconjugations and protein-material interactions
2022, Solid State Nuclear Magnetic ResonanceRecent advances in solid-state relaxation dispersion techniques
2020, Solid State Nuclear Magnetic ResonanceCitation Excerpt :Over the last decades, magic-angle-spinning (MAS) solid-state NMR (ssNMR) spectroscopy emerged as the primary technique to elucidate the structure, interaction, and site-specific dynamics of a broad range of biomolecules, including membrane proteins [13–15], amyloids [16,17], virus capsides [18,19], silk proteins [20], and large protein complexes [21–25]. Advanced isotopic-labeling schemes [24,26], high magnetic field fields (up to 1.2 GHz), fast and ultra-fast MAS probes (up to 140 kHz spinning speeds) and associated method developments [27–30], proton-detected, multi-dimensional data acquisition [31,32], and sensitivity enhancement by dynamic nuclear polarization [33,34] ensure that increasingly more complex molecules can now be analyzed with sufficient sensitivity and resolution. These essential developments opened up the way to quantitative motional analysis of biomolecules and macromolecular complexes.
Solution structure and oligomeric state of the E. coliglycerol facilitator
2020, Biochimica et Biophysica Acta - BiomembranesCitation Excerpt :Besides all these possible sources of dynamic exchange, it's also possible that the NMR resonances in the protein dissolved in LMNG at high concentrations are broadened owing to static heterogeneity. In bilayer-dissolved IMP's the lipid:protein ratio can affect the static protein conformational heterogeneity and lead to line-broadened spectra [48,57]. This problem has been overcome in some cases by careful exploration of lipid:protein ratios to discover the right combination that gives rise to one predominant conformation.
A solid-state NMR tool box for the investigation of ATP-fueled protein engines
2020, Progress in Nuclear Magnetic Resonance SpectroscopyCitation Excerpt :Protein-nucleotide contacts in the motor protein DnaB were probed by MAS-DNP experiments even in the absence of glycerol allowing more protein to be filled in the MAS rotor. While the DNP enhancement factors ε are reduced compared to a sample made with glycerol, the overall sensitivity of the experiment increases because of the larger amount of protein in the MAS rotor [246] (for more details see also a recent review about sample preparation for NMR [78]). In that vein, DNP-enhanced PHHC experiments were implemented.
Use of paramagnetic systems to speed-up NMR data acquisition and for structural and dynamic studies
2019, Solid State Nuclear Magnetic ResonanceCitation Excerpt :Nuclear magnetic resonance (NMR) spectroscopy is a well-established technique that allows the study of a variety of systems ranging from small molecules and macromolecules in solution to crystals, powders, and a variety of supramolecular aggregates, including fibrils and bone, in the solid state [1–15].
- 1
Equal contributions.