Introduction

The Human Genome and ENCODE Projects have shown that the protein-coding potential of the mammalian genome is <2% and that >90% of the genome is transcribed1,2. While a growing number of noncoding RNA (ncRNA) is being shown to have important cellular roles during development and disease pathogenesis3,4,5,6,7,8, the purpose of the pervasive transcription and the function of its RNA output have remained under active debate. With the possibility of performing systematic CRISPR-Cas9 genome editing and advances in structural characterization9,10,11,12, a functional dissection of ncRNA is now more realistic than ever. However, mechanistic insight will ultimately depend on understanding the interplay between the ncRNA and associated regulatory factors. An understanding of the ncRNA mechanism has lagged behind because of challenges presented by a dearth of approachable methods to map RNA-centered interactomes and to identify high-confidence interacting protein partners.

Several in vivo RNA proteomic methods have recently been developed, including identification of direct RNA interacting proteins (iDRiP)13,14, RNA antisense purification (RAP)15, chromatin isolation by RNA purification (ChIRP)16, capture hybridization analysis of RNA targets (CHART)17,18, biotinylated RNA binding proteins (BioRBP)19 and CRISPR-assisted RNA–protein interaction detection (CARPID)20. Many were initially tested on the RNA gold standard, Xist—the long ncRNA responsible for establishing global silencing of the inactive X-chromosome5,21 (Table 1). The methods have produced an overlapping set of interacting proteins, in addition to factors that are unique to each method14,15,16,20. RNA-proteomic methods are still far from routine, and whether each can be generally applied to all ncRNAs remains unclear. Also lacking is a comparative analysis between methods, as well as between various RNA species to determine the applicability of the in vivo method to RNAs not involved in dosage compensation.

Table 1 Comparison of methods for RNA-based capture of protein targets in vivo

Here we describe a detailed protocol for iDRiP14 demonstrated to be successful for two other ncRNAs: (i) TERRA, a universal telomeric transcript that scaffolds factors critical for gene expression and for protection of chromosomal ends13,22,23,24,25; and (ii) U1, a splicing factor with well-established interacting proteins that thereby serves as a positive control. We then conduct a performance analysis relative to another method, ChIRP16. The effectiveness of iDRiP for the highly repetitive and tightly chromatin-bound TERRA RNA is of interest, and modifications we incorporated could be useful for the capturing of protein interactomes for other repetitive RNAs.

Applications of the method

iDRiP is an RNA-centric method to systematically identify proteins that directly interact with an RNA of interest from cultured cells13,14 or tissues26. It does not rely on an antibody-based immunoprecipitation procedure to capture RNA–protein complexes. In principle, the protocol can be applied to short or long RNAs, chromatin-bound or soluble RNAs, and nuclear or cytoplasmic RNAs. Previously, we had successfully applied this protocol to ncRNAs including the splicing factor U1 (164 nt), the chromatin-bound ncRNA Xist (17 kb) and the telomeric transcript TERRA (heterogeneous size, hundreds to >9 kb). The iDRiP protocol enabled us to isolate and characterize a large number of interacting proteins13,14. Because iDRiP employs shorter oligos (20–25 mer) than other methods (Table 1), it more easily accesses RNA–protein complexes than a method such as RAP15, which uses considerably longer oligo probes.

Limitations and alternative modifications

Our method has been tested mostly on nuclear ncRNAs, including the soluble U1 RNA and the chromatin-bound Xist and TERRA RNAs13,14. It should in principle also be applicable to cytoplasmic RNAs, though we have not explicitly tested them. We have also not tested iDRiP for RNAs <100 nt such as microRNAs and PIWI-interacting RNAs (piRNAs) (21–35 nt). Because iDRiP probe sizes are typically ~20–25 mer, it is theoretically possible to design probes for small RNAs as well. Modified DNA oligo probes such as locked nucleic acid chemistry with phosphorothioate backbones could be used, as locked nucleic acids generally have a higher affinity for nucleic acids, and phosphorothioate backbones lend greater stability than unmodified oligonucleotide probes. Additionally, iDRiP may be best suited to RNAs of moderate-to-high abundance. Xist RNA is present at only 100–300 copies per cell27, while TERRA and U1 are some of the most abundantly expressed transcripts in embryonic stem (ES) cells13. Capturing very low abundance RNAs may be more challenging, as more cells would have to be used and the signal-to-noise ratio may be diminished for very-low-abundance RNAs (<100 copies/cell) (see Troubleshooting). Enzymatic methods, such as the recently reported BioRBP19 and CARPID20, may therefore be more suitable for low-abundance transcripts, though one disadvantage of these alternatives is that they identify both directly as well as indirectly interacting proteins (see ‘Considerations for selecting an RNA proteomics method’).

Comparison with other methods

The current technologies using RNA-centric methodologies to identify RNA interacting proteins in vivo include iDRiP13,14, RAP15, ChIRP16, CHART17,18, BioRBP19 and CARPID20 (Table 1). Many methods were initially tested on Xist RNA, with the methods generating a partially overlapping set of interacting proteins. Here we focus on a comparative analysis between iDRiP and ChIRP, as the two methods both yielded dozens of proteins for Xist, whereas RAP yielded only a dozen or so. iDRiP and ChIRP differ in a number of fundamental ways (Fig. 1a). Although both techniques probe in vivo interactions under physiological conditions, substantial differences exist in how RNA–protein interactions are fixed, how chromatin is solubilized, how capture probes are annealed and, finally, how captured RNA–protein complexes are eluted. At the beginning of both protocols, ES cells are cross-linked to preserve in situ RNA–protein interactions. iDRiP utilizes UV light (at 254 nm) to photo-cross-link RNA to proteins, whereas ChIRP utilizes formaldehyde to chemically cross-link the macromolecules. Because UV cross-linking is more efficient for capturing RNA–protein interactions than DNA–protein or protein–protein interactions, this photo-cross-linking preferentially identifies direct RNA–protein interactions28. Furthermore, because UV light at 254 nm is a short-range cross-linker (‘zero-length’ scale), the RNA–protein interactions tend to be direct interactions. By contrast, the formaldehyde cross-linking employed during ChIRP fixes much larger macromolecular networks and typically results in identification of RNA–protein interactions that are both direct and indirect, potentially brought together by multiple intermediary factors. They could, for instance, include RNAs and proteins brought together by other protein subunits, RNA–DNA hybrids that interact with proteins, or RNA–chromatin–protein interactions.

Fig. 1: Overview of iDRiP and ChIRP.
figure 1

a, Workflows of iDRiP and ChIRP. b, The efficiency and specificity of TERRA recovery by iDRiP versus ChIRP using mES cells, expressed as % of input after RT-qPCR to detect PAR-TERRA, a TERRA transcript13 derived from sex chromosome, and U1 RNA. Capture probes for TERRA (TER) and the control probes (sense and luciferase) are indicated. TERRA-iDRiP and U1-iDRiP are from published data13. Mean ± SD are shown. c, Specificity as determined by fold enrichment of TERRA after normalization to various negative controls as indicated. d, Elution efficiencies of iDRiP and ChIRP expressed as % of input. Mean ± SD are shown. n/a, nondetectable.

Next, ES nuclei are isolated, and chromatin is solubilized prior to isolation of RNA–protein complexes (Fig. 1a). In iDRiP, nuclei are treated with DNase I (Extended Data Fig. 1a) to release the large networks of chromatin that would be prohibitive to the isolation of discrete RNA–protein–chromatin complexes. In ChIRP, vigorous sonication is used to achieve the same end. iDRiP was designed to avoid sonication, due to concern about the physically disruptive action that could potentially fragment and destroy large RNAs, such as Xist and TERRA, and their associated protein complexes. Later, biotinylated antisense oligonucleotides (ASO) of 20–25 DNA-based nucleotides are used to capture RNA-specific complexes. In ChIRP, a formamide-based buffer is applied to enable ASO hybridization at a lower temperature of 37 °C, as formamide lowers the melting temperature (Tm) for RNA–DNA duplexes. To elute, excess biotin is used to compete away and thereby release bound RNA–protein complexes. By contrast, iDRiP applies a nondenaturant hybridization buffer, so that ASO annealing has to be performed at a higher temperature of 65 °C, followed by slow cooling to promote ASO–RNA hybridization. To elute, iDRiP employs heated washes to 70 °C and disruptive detergents to denature the hybridized ASO–RNA interaction and wash away RNA–protein interactions that have not been photo-cross-linked.

Capture efficiency and specificity, as well as signal-to-noise ratio, are the most important factors that determine whether the protocol will successfully identify proteins directly interacting with the RNA. Using iDRiP, we recently identified an extensive proteome for TERRA13, a long ncRNA transcribed across telomeric ends and comprising hundreds to thousands of repeats of the telomeric sequence UUAGGG22,24,25. At the same time that we performed iDRiP, we also performed ChIRP for TERRA in mouse ES (mES) cells. The two methods were carried out under similar conditions with regard to cell culture and mES cell passage numbers. Most importantly, the capture probes used were identical. (Note: although ChIRP was performed around the same time, we only published the TERRA-iDRiP results in the 2017 paper13 because the performance characteristics of TERRA-iDRiP were more desirable and the method yielded more directly interacting proteins.)

Here we carried out a comparative analysis of the two resulting proteomes. We had designed capture probes targeting UUAGGG repeats and several negative control probes, including sense (TERRA-reverse complement), luciferase (nonmammalian) and the mammalian splicing RNA U1. After TERRA capture, qRT-PCR showed that recovery of TERRA RNA using TERRA capture probes was good for both iDRiP (>20% of input) and ChIRP (~13% of input) (Fig. 1b). However, TERRA-capture iDRiP yielded a more favorable signal-to-noise ratio, with a 64-fold enrichment of TERRA over U1 RNA, in comparison with the 16-fold enrichment using ChIRP (Fig. 1b). This difference suggested that iDRiP capture gives a slightly greater yield and specificity for TERRA. Greater specificity of iDRiP was further supported by examining the relative enrichment of TERRA probe capture over control probe capture (Fig. 1c). TERRA was enriched over 1,000-fold in TERRA-iDRiP compared with Luc-iDRiP capture (Fig. 1c). On the other hand, elution methods for ChIRP (excess biotin) and iDRiP (heat denaturation) were similarly efficient (Fig. 1d).

To obtain a quantitative proteomic analysis, we performed quantitative mass spectrometry (MS) using isobaric labeling methods29 and compared the enrichments over negative controls—TERRA-sense and luciferase—for ChIRP relative to our previously published iDRiP profiles13. Captured proteins were quantitatively compared by multiplexed MS-based proteomics using tandem-mass-tag (TMT) reagents and MS3-based quantification30,31. We averaged two biological replicates and reported proteins that are twofold enriched over luciferase controls. The correlation of two biological replicates of ChIRP shows a higher correlation in the global dataset than that of iDRiP (Extended Data Fig. 1a,b). Gene ontology (GO) analysis for enriched hits was performed using the Database for Annotation, Visualization and Integrated Discovery (DAVID) bioinformatic resources32. The number of twofold enriched targets is higher in ChIRP than in iDRiP, whereas the number of DAVID significant Biological Process gene sets (false discovery rate (FDR) <25%) is higher in iDRiP than in ChIRP (Extended Data Fig. 1c). There are 134 enriched proteins for iDRiP-MS13 and 205 enriched proteins for ChIRP-MS (Fig. 2a; Supplementary Tables 1,2). Among them, 27 proteins were shared (Fig. 2a, left panel). Among the top 30 hits, 11 proteins were shared (Fig. 2a, right panel). Common hits included shelterin proteins and factors involved in alternative lengthening of telomeres, chromatin modification and cell cycle regulation (Fig. 2b,c). The overall correlation between the iDRiP and ChIRP datasets was low (Pearson’s r = 0.13, Fig. 2d, left panel). However, when considering only the enriched subsets (log2 > 1), the correlation was high (Pearson’s r = 0.63, Fig. 2d, right panel). Thus, in spite of fundamental methodological differences between iDRiP and ChIRP, the top hits were shared.

Fig. 2: Characterization of the TERRA proteome as obtained by iDRiP versus ChIRP.
figure 2

a, Venn diagrams indicating the number of enriched proteins by iDRiP and ChIRP, along with the number of proteins that overlap. Candidates shown are enriched by log2 (TERRA/luciferase) >1 (left). The top 30 candidates are shown (right). b, Shelterin complex and proteins related to alternative lengthening of telomeres (ALT) are enriched within the TERRA proteome by both methods. c, The shared TERRA proteome as identified by iDRiP and ChIRP. d, Scatterplots showing correlation between TERRA hits by iDRiP versus ChIRP. Proteomic counts were normalized using luciferase. Log2 enrichment ratios are shown. Pearson’s r is shown. Left: correlation using data of all proteins obtained from MS. Right: correlation using data of proteins that are enriched over twofold obtained from MS.

There were nevertheless subsets of proteins unique to each dataset. For example, some telomere-associated proteins were only enriched in the iDRiP dataset, including CTC1 and STN1 of the CST complex, as well as RTEL1 (Fig. 2b, Supplementary Tables 1, 2). Notably, the sense control returned far fewer enriched proteins by iDRiP than by ChIRP (Fig. 3a). A number of telomere-associated proteins were enriched in the ChIRP sense control but not in the iDRiP counterpart (Fig. 3b, Extended Data Fig. 2), possibly because ChIRP pulled down indirect interactions mediated via the telomeric chromatin. Significantly, GO enrichment analysis revealed that mitochondrial proteins were major contaminants in the TERRA-ChIRP experiment, but not in the TERRA-iDRiP experiment (Fig. 3c). Consistent with their being contaminants rather than physiological interactors, the same mitochondrial proteins were also enriched in the U1-ChIRP samples (Extended Data Fig. 3). Thus, the ChIRP methodology isolates more false positives originating outside of the nucleus such as mitochondria, which should not come in contact with telomeres under physiological circumstances.

Fig. 3: iDRiP shows overall greater specificity for TERRA RNA.
figure 3

a, Scatterplot comparing log2 enrichment of Sense-iDRiP and Sense-ChIRP results normalized to luciferase. Red circle indicates that many telomere-associated proteins are enriched by Sense-ChIRP but not by Sense-iDRiP. Pearson’s r is shown. b, Captures by sense probes reveal differences between the iDRiP and ChIRP methods. c, GO cellular component enrichment analysis (generated using DAVID tools, GO_CC_term) for iDRiP and ChIRP datasets. Multiple mitochondrial contaminants are seen in the ChIRP dataset.

We previously biochemically and functionally validated the iDRiP candidate, ATRX, as a major TERRA-interacting protein13. By ChIRP, ATRX was also enriched with TERRA. However, it was not enriched relative to the sense control and would have likely been eliminated as an interacting factor from the ChIRP TERRA proteome. Promyelocytic leukemia protein (PML)—a tumor suppressor protein frequently mutated in promyelocytic leukemia and that was recently shown to interact with telomeres33—was also identified as a hit by both iDRiP and ChIRP (Fig. 2b, Supplementary Table 1). However, by ChIRP, PML was not enriched relative to the sense control and would not have been considered an enriched factor had we performed ChIRP alone (Supplementary Table 2). Using immuno-RNA fluorescent in situ hybridization (FISH) to visualize the localization of RNA and proteins, we observed that TERRA RNA showed 60–70% colocalization with both ATRX and PML (Fig. 4a), consistent with a TERRA–PML interaction. We further validated TERRA interacting candidates by UV light cross-linking RNA immunoprecipitation (UV-RIP) (Fig. 4b,c). EZH2, EHMT1, POT1 and aurora B are highly associated with TERRA, as shown in RNA slot blot (Fig. 4b) and qRT-PCR (Fig. 4c). For TERRA, iDRiP generally yielded greater specificity and a superior signal-to-noise ratio relative to sense, luciferase and U1 controls (Fig. 1c, Extended Data Fig. 2). Further analysis using scatterplots demonstrated a high degree of correlation between iDRiP TERRA hits normalized to sense control versus luciferase control (Fig. 4d, left panel; Pearson’s r = 0.55), but the same was not the case for ChIRP TERRA hits (Fig. 4d, right panels). Together, these data suggest that, at least for TERRA, iDRiP provides greater specificity for RNA proteomic analysis.

Fig. 4: Validation of the TERRA proteome as obtained by iDRiP.
figure 4

a, Representative immuno-RNA-FISH staining showing that TERRA (green) is colocalized with ATRX (red, top panel) and PML (red, bottom panel) in mES cells. b, Slot blot shows the TERRA RNA enrichment after UV-RIP for EZH2, POT1, CHD8, EHMT1, CENP-C and Aurora B. Control: GAPDH RNA. See Supplementary Methods for experimental details. c, RT-qPCR shows that PAR-TERRA, a TERRA transcript13 derived from sex chromosome, was enriched after UV-RIP using antibodies against EZH2, POT1, CHD8, EHMT1, CENP-C and Aurora B. PAR-TERRA enrichment was normalized to the IgG capture. Error bars, SD. d, Scatterplots comparing log2 enrichment of hits normalized to either Sense, U1 or luciferase control. iDRiP, left panels. ChIRP, right panels. Pearson’s r is shown.

In comparing various normalization methods, we observed that while normalization of iDRiP data to sense and luciferase controls showed high correlation (Fig. 4d, left panel), normalization to U1 displayed a different pattern that suggested potential commonalities between TERRA and U1 proteomes and that therefore suggested that U1 would not be an appropriate negative control (Fig. 4d, right panel). U1 RNA is a component of the spliceosome, which comprises a large number of well-established small nuclear RNAs and protein complexes, including numerous U1 small nuclear ribonucleoproteins (snRNPs), known as U1 RNA direct interacting proteins34,35.

Thus, to further test iDRiP and ChIRP methodologies, we performed RNA proteomics using the two methods in parallel to compare the recovery of known U1-interacting proteins (Fig. 5a,b). Both methods recovered U1 snRNPs, but with vastly different efficiencies. Specific U1 snRNPs were highly enriched by iDRiP (20- to 35-fold), with nearly all ranking among the top 33 hits among a total of 180 enriched proteins (Fig. 5b, Extended Data Fig. 4c, Supplementary Table 3). By contrast, the same proteins were enriched only several folds by ChIRP and were ranked low on the enrichment list (rank 48 to 1,133) (Fig. 5b, Extended Data Fig. 4d, Supplementary Table 4). ChIRP yielded a much larger enriched list of proteins (436 total). Nevertheless, iDRiP and ChIRP datasets showed 98 proteins in common (Fig. 5c). GO analysis revealed that proteins involved in mRNA processing, RNA splicing and spliceosomal snRNP assembly are enriched in both U1-iDRIP and U1-ChIRP (Fig. 5d and Extended Data Fig. 3a). To further analyze the specificity of the RNA proteomics, we examined the representation of other snRNPs—associated with the functionally related U2 (which exists in a particle with U1), and the biochemically and functionally separate U4, U5 and U636,37. Significantly, by ChIRP, almost all spliceosomal proteins are enriched, including snRNPs known to be associated with the U4/U6/U5 particle (Fig. 5e, red stars). In contrast, U1-iDRiP shows enrichment of most U1/U2 snRNPs but not of U4/U6/U5 snRNPs (Fig. 5d). Thus, iDRiP yielded greater specificity for snRNPs associated with the U1/U2 particle, whereas ChIRP appeared to isolate general splicing factors of both the U1/U2 and the U4/U6/U5 complexes.

Fig. 5: Differential U1 proteomes obtained by iDRiP versus ChIRP.
figure 5

a, Scatterplots comparing log2 enrichment of U1-iDRiP and U1-ChIRP proteomics normalized to luciferase control from mES cells. Pearson’s r is shown. b, U1 snRNPs were enriched by both methods. The fold-enrichment and relative rankings are shown. c, Venn diagram indicating the number of enriched proteins by iDRiP and ChIRP, along with the number of proteins that overlap. Candidates shown are enriched by log2 (U1/luciferase) >1. The total number of enriched proteins is shown in parentheses. d, GO biological process (GO_BP_term) enrichment analysis for U1 interactors identified by iDRiP. e, Components of the spliceosome enriched by U1-ChIRP versus U1-iDRiP. Red stars indicate enriched components.

Considerations for selecting an RNA proteomics method

Here, by investigating two well-established ncRNAs, TERRA and U1, we have performed a side-by-side analysis of two RNA proteomic methods. Our results show that both methods are able to capture bona fide RNA-interacting proteins. However, at least for TERRA and U1, the iDRiP method yielded a greater number of specific proteins and fewer contaminants (e.g., from mitochondria). The enrichment of mitochondrial proteins also suggests that nuclear isolation/extraction is less efficient in ChIRP. An advantage of ChIRP, however, is that indirectly interacting proteins could be identified. Because it employs a strong chemical cross-linking step, ChIRP could in principle be performed with less starting material. Analysis of the expression of genes encoding protein targets (twofold enriched) from ChIRP and iDRiP shows that the targets are not biased toward highly expressed RNA transcripts. In fact, the targets are clustered significantly near the low end of the transcriptome (Extended Data Fig. 3c), suggesting that both methods are able to capture low-abundance proteins. Our experience with TERRA and U1 here is similar to what has been reported for Xist, the RNA responsible for X-chromosome silencing14,16. iDRiP identified a range of nuclear factors, including both low- (CTC1, OBFC1) and high-abundance epigenetic factors (SPEN, CTCF, cohesin, ATRX, HNRNPK).

Current limitations for iDRiP that would require further optimization are small RNA targets, such as microRNAs and piRNAs, in the 20–35 nt size range. Because capture probes are in the 20–25 nt size range, it is theoretically possible to design one to two probes to pull down small RNAs. Another current limitation is the capture of very-low-abundance RNAs. Enzymatic methods may be more suitable for low-abundance, sub-stoichiometric interactions. Such methods include the recently developed BioRBP19 and CARPID20, both based on proximity protein labeling using biotinylation. Indeed, these methods have been tested on two dosage compensation RNAs, Tsix and Xist, respectively, both of moderate abundance27,38. CARPID analysis of Xist revealed a large set of proteins (73) (ref. 20), among which a fraction overlapped with targets identified by our iDRiP analysis. Shared proteins include the experimentally validated cohesin subunits, RAD21, SMC1A14, ATRX39 and BRG140. CARPID additionally identified ISWI chromatin remodeler, SNF2L, and the transcription coactivator, TAF1520—supporting the idea that Xist RNA recruits repressors as well as repels activators14,40. On the other hand, CARPID did not identify SPEN, which was identified by iDRiP, RAP and ChIRP. Technical differences between the CARPID and iDRiP methods likely account for proteins that are unique to each dataset. Using Tsix RNA as bait, BioRBP also identified a large number of high-confidence interactors, including the mRNA decapping enzyme DCP1A and the architectural glue protein CTCF19. The study then validated the proteins through functional characterization, demonstrating a role for both in the formation of X–X pairs and breaking the epigenetic symmetry of two X chromosomes to determine allelic choice.

Given the pros and cons of various RNA proteomic methods, we do not suggest adopting a single method alone. Rather, we advocate for using various orthogonal methods to arrive at a complete view of the interacting proteome for any given RNA. Among the available methods discussed, only iDRiP (and RAP) specifically targets direct interactors—made possible by the short-length cross-linker, UV. Whereas iDRiP enriches for these directly interacting proteins and is best used on moderate-to-high-abundance RNAs, the enzymatic methods (BioRBP and CARPID) may be used for lower-abundance transcripts, substoichiometric or transient interactors, and when proteins that do not directly interact and/or are within a large protein scaffold may be of interest. CHART and ChIRP would also offer alternative methods based on formaldehyde cross-linking and yield proteins that do not directly interact with RNA. Information gained from orthogonal methods could then be evaluated, and overlapping hits might be considered the highest-confidence hits, though nonoverlapping hits could arguably also be of interest owing to the unique experimental conditions of each method. Ultimately, a combination of a large family of RNA-proteomic methods including iDRiP, BioRBP, CARPID, CHART17,18, ChIRP and RAP is beneficial to have in the RNA biologist’s armamentarium.

Experimental design

Overview

To characterize RNA direct-interacting proteins in vivo, UV irradiation is applied to cross-link RNA and protein complexes. To minimize background due to DNA-bound proteins, iDRiP includes a critical DNase I treatment before elution of complexes, for purposes of fragmenting chromatin and eliminating proteins bound to single-stranded DNA that is complementary to the ASO. The washing steps are under a denaturation condition containing 1% sodium dodecyl sulfate (SDS) to eliminate noncovalent interaction through protein–protein interaction. This protocol utilizes biotinylated ASO of 20–25 DNA-based nucleotides to capture RNA of interest. The DNA probes are more stable than RNA probes and can be synthesized easily and cost-effectively. Both a negative control (luciferase) and a positive control (U1) are strongly recommended in the same run for proper normalization and ensuring the quality of iDRiP. Eluted proteins are analyzed by quantitative MS (using isobaric tags for relative and absolute quantitation)29, followed by normalization with the negative control to obtain the enrichment of RNA-interacting proteins. The workflow of iDRiP-MS is shown in Fig. 1a.

Cell types and numbers

We suggest using cell lines or cell types that express high levels of the RNA target of interest. The abundance could be tested by qRT-PCR using 1–5 µg total RNA for cDNA synthesis. In general, when PCR efficiency is optimized, a mean Cq >30 for the RNA target would suggest a low-abundance RNA. We suggest choosing cell lines or cell types in which the RNA’s Cq <26. The Cq of Xist is ~23 from mouse embryonic fibroblasts (MEFs). The Cq of PAR-TERRA, which is TERRA RNA derived from sex chromosomes, is 24–26 from mES cells. The Cq of U1 is ~21 from mES cells. This protocol has been shown to work for mES cells, mouse germ cells26, immortalized MEFs and human U2OS cancer cells.

UV energy

UV is used to directly cross-link RNA to interacting proteins. The optimal UV energy is expected to vary slightly from cell line to cell line and RNA to RNA. When cells are grown in a monolayer, 200–400 mJ/cm2 of 254 nm UV is recommended. Cells grown in multiple layers or tissues may require higher UV irradiation regimens 0.8–4.0 J/cm2 41. Higher UV energy could damage RNA and reduce the number of accessible sites for probe targeting. For protein-enriched RNA complexes or RNAs with enriched secondary structures, substantially lower UV energy is recommended.

Modifications

There are some modifications of TERRA iDRiP compared with the original iDRiP that might be helpful for users investigating other RNAs with repetitive content and tight chromatin binding, as well as applying the method to different types of cells. First, UV cross-linking efficiency could differ depending on the sample. We recommend a slightly lower dose for cells grown in monolayer and a higher dose for multilayer tissues or cells prepared in suspension. For Xist-iDRiP, 200 mJ/cm2 UV was used for MEFs grown in monolayer. For TERRA-iDRiP, 400 mJ/cm2 UV was used for mES cells that are usually grown in a colony. Although ES cells are generally trypsinized prior to irradiation, cells may adhere together in suspension, which would require adjustments in irradiation parameters. Second, for TERRA iDRiP, we heated lysates to 65 °C prior to probe hybridization to increase solubility of chromatin-bound complexes, and insoluble cellular material was removed by centrifugation prior to RNA capture by probe, and then the lysate was pre-cleared using beads to reduce the background noise. This is critical because insoluble material may adhere to beads and increase the background noise. When lysates were heated to 65 °C, TERRA recovery was increased twofold, as determined by RT-qPCR comparing heated with unheated samples. Third, the most critical step is the change in hybridization conditions for TERRA. We anticipated that the highly repetitive nature of TERRA and its high abundance of telomeric DNA could be an issue for probe specificity. Thus, whereas Xist capture hybridization was carried out at 37 °C overnight, TERRA was first denatured at 65 °C and then gradually cooled to 37 °C over 1–2 h to enhance capture specificity. We also avoided overnight incubations for TERRA to minimize background that could arise from longer incubations. Finally, we added an extra wash step for TERRA iDRiP, starting with a 300 mM LiCl wash (as for Xist iDRiP) and following with a lower-salt wash in 150 mM NaCl. For all capture experiments, we generally recommend using U1 as a positive control in side-by-side comparisons. A detailed protocol with RNA-specific modifications is provided in the Procedure.

Probe design

iDRiP utilizes 20–25 mer antisense oligos to capture a specific RNA. Tiling probes across the full-length RNA could be used to capture RNA as is done for RAP15, though we recommend against this approach owing to the likelihood of nonspecific RNA pulldown and decreasing the signal-to-noise ratio, resulting in pulldown of nonspecific interacting proteins. For example, we successfully captured a comprehensive interactome of >100 proteins for 17 kb Xist using nine oligo probes14. For U1, we only use one antisense oligo for iDRiP. In contrast, a tiling probe approach taken using RAP produced only a dozen abundant proteins15. Key to using fewer capture probes, however, is the ability to maintain an intact target RNA during the protocol. We recommend designing one probe per 500 nt and testing at least two sets of probes for capture efficiency before doing a large-scale iDRiP. For RNA >10 kb, one probe for 2–3 kb is recommended. We found the ideal probe length to be 20–25 nt. Optimal melting temperature for oligo probes is ~55–60 °C using the National Center for Biotechnology Information (NCBI) Primer-Blast tool. It is best to avoid T-rich sequences (>4 consecutive Ts) and GC-rich sequences (>5 continuous GCs). NCBI-Blastn should be applied to avoid high homology to other expressed sequences, which would be detrimental to specificity.

Quantitative proteomics to identify RNA-specific protein interactors

An accurate comparison of the amount of RNA co-isolated proteins with proteins in control samples is essential in identifying RNA-specific protein interactors. MS is the most important tool for unbiased quantitative mapping of proteins, and several MS methods are enabling such an analysis. The methods range from label-free quantification42 to the use of stable heavy isotopes for generating internal standards by either metabolically43 or chemically43 incorporating the stable isotopes into the proteins. An elegant way of using chemically incorporated stable isotopes for MS-based protein quantification is the use of isobaric reagents. These probes allow multiplexed quantifications through simultaneous quantification of multiple samples44. Currently, up to 16 samples can be simultaneously quantified using this strategy45, and further extensions of the multiplexing capacity are in development46.

We have used multiplexed proteomics with isobaric labeling to identify RNA-specific proteins isolated using the iDRiP and ChIRP methods, but it should be noted that other quantitative proteomics methods are also suitable to do this analysis. In brief, we have used TMT10-plex reagents for barcoding the samples47 and an MS3-based data acquisition method on an Orbitrap Fusion mass spectrometer (Thermo Fisher Scientific) for data acquisition. MS3-based data acquisition enables the generation of high-accuracy and high-precision quantitative data when using isobaric labels for quantitative proteomics30,31,48. A full description of the proteomics sample preparation would go beyond the scope of this protocol, and we, therefore, point to two other protocol papers describing the steps from the protein digestions to the MS data acquisition in full detail29,49.

Materials

Biological materials

  • UV-cross-linked 1.5 × 108 cells: mES cells (Mus musculus × Mus castaneus hybrid ES cell line EL 16.7, female 40,XX50, RRID: CVCL_A6UI)

    Caution

    All cell lines should be regularly checked for mycoplasma contamination

    Critical

    We recommend using at least 10 million cells for iDRiP capture. The number of cells may be adjusted for different RNA targets.

Reagents

  • Isopropanol (Sigma-Aldrich, cat. no. 278475)

  • Ethanol (Sigma-Aldrich, cat. no. E7023)

  • Protease inhibitors, cOmplete, EDTA-free protease inhibitor cocktail (Merck, cat. no. 4693132001)

  • MyOne streptavidin C1 beads (Thermo Fisher Scientific, cat. no. 65001)

  • RNase H (New England Biolabs, cat. no. M0297L)

  • RNase A (Thermo Fisher Scientific, cat. no. 12091021)

  • Proteinase K (Sigma-Aldrich, cat. no. 03115844001)

  • SuperScript IV reverse transcriptase (Thermo Fisher Scientific, cat. no. 18090200)

  • TRIzol (Thermo Fisher Scientific, cat. no. 15596018)

  • Yeast RNA (Thermo Fisher Scientific, cat. no. AM7118)

  • SUPERase•In RNase Inhibitor (Thermo Fisher Scientific, cat. no. AM2694)

  • TURBO DNase (Thermo Fisher Scientific, cat. no. AM2239)

  • Diethyl pyrocarbonate (DEPC; Sigma-Aldrich, cat. no. D5758)

  • Dithiothreitol (DTT, BioShop, cat. no. DTT001)

  • Phenylmethyl sulfonyl fluoride (PMSF, Sigma-Aldrich, cat. no.10837091001)

  • Nonidet P-40 substitute (BioShop, cat. no. NON505)

  • Dulbecco’s phosphate-buffered saline (DPBS, Thermo Fisher Scientific, cat. no. 141902350)

  • 1,4-Piperazinediethanesulfonic acid (PIPES; Sigma-Aldrich, cat. no. P6757)

  • Sodium lauroyl sarcosine (Sigma-Aldrich, cat. no. L9150)

  • Sodium deoxycholate (Sigma-Aldrich, cat. no. D6750)

  • Ethylene glycol tetraacetic acid (EGTA; Sigma-Aldrich, cat. no. E4378)

  • EDTA (Amresco, cat. no. 0105-1KG)

  • LiCl (Sigma-Aldrich, cat. no. L9650)

  • Triton X-100 (Acros Organics, cat. no. 9002-93-1)

  • SDS (BioShop, cat. no. SDS001)

  • Chloroform (Sigma-Aldrich, cat. no. C0549-1QT)

  • Acid phenol/chloroform (Thermo Fisher Scientific, cat. no. AM9722)

  • GlycoBlue (Thermo Fisher Scientific, cat. no. AM9516)

  • 3′ Biotin-TEG luciferase probe: 5′-GTATCCCTGGAAGATGGAAG-3′/3biotinTEG/ (Integrated DNA Technologies)

  • 3′ Biotin-TEG U1 probe sequence: 5′-CTCCCCTGCCAGGTAAGTAT-3′/3biotinTEG/ (Integrated DNA Technologies)

  • 3′ Biotin-TEG TERRA probe sequence: 5′-TAACCCTAACCCTAACCCTA-3′/3biotinTEG/ (Integrated DNA Technologies)

  • 3′ Biotin-TEG sense probe sequence: 5′-TTAGGGTTAGGGTTAGGGTT-3′/3biotinTEG/ (Integrated DNA Technologies)

  • TMT10plex Isobaric Label Reagent Set (Thermo Fisher Scientific, cat. no. 90111). Alternatively, TMTduplex Isobaric Label Reagent Set (cat. no. 90065), TMTsixplex Isobaric Label Reagent Set (cat. no. 90061), TMT10plex Isobaric Label Reagent Set plus TMT11-131C Label Reagent (cat. no. A34808) or TMTpro 16plex Label Reagent Set (cat. no. A44521) can be used according to the number of samples

Equipment

  • DNA LoBind microcentrifuge tubes, 1.5 ml, PCR clean, colorless (Eppendorf, cat. no. 022431021)

  • Pipettes (Rainin, cat. no. 17014393/ 17014391/ 17014382, or equivalent)

  • Pipette filter tip (SSIbio, cat. no. 4137NSF/4237NAF/4237NSF/4347NSF, or equivalent)

  • 15 ml centrifuge tubes (Thermo Fisher Scientific, cat. no. NUC339650)

  • Benchtop microcentrifuge (Thermo Fisher Scientific, Heraeus Fresco 21, cat. no. 75002555, or equivalent)

  • Heat block, for 1.5 ml microcentrifuge tubes (VWR, cat. no. 12621-088, or equivalent)

  • Water bath (YIHDER, model no. BU-420, or equivalent)

  • Ice bucket (VWR, cat. no. 10146-290, or equivalent)

  • Vortex mixer, Vortex-Genie 2 G560 (Scientific Industries, cat. no. SI-0236)

  • DynaMag-2 magnetic separation stand, for 1.5 ml tubes (Thermo Fisher Scientific, cat. no. 12321D)

  • Hybridization oven with a rotating wheel (Hybaid, Mini Oven MK II, cat. no. E4639, or equivalent, temperature could reach to 65 °C)

  • Real-time PCR detection system (Bio-Rad, model no. 1855195, or equivalent)

  • qPCR plates, 96-wel (Bio-Rad, model no. HSP9601 or equivalent suitable for real-time PCR detection system)

  • Tandem mass spectrometer (Thermo Fisher Scientific, Orbitrap Fusion, Q-Exactive, Bruker Impact II or similar models)

Software

Reagent setup

CSKT buffer

Component

Amount

Final concentration

PIPES

3.02 g

10 mM

NaCl

5.84 g

100 mM

MgCl2•6H2O

0.61 g

3 mM

Sucrose

102.7 g

0.3 M

Triton X-100

5 ml

0.5% (vol/vol)

DEPC-treated water

to 1 L

 

Total

1 L

N/A

  • Prepare 500 ml of DEPC-treated water in a bottle, and add the powders listed above to the solution.

  • Adjust solution to pH 6.8–7.0 using 10 N NaOH

  • Add DEPC-treated water to 1 L

  • Add 5 ml of Triton X-100 to the solution

  • Filter through a 0.2 μm sterile filter

  • Store at 4 °C in dark bottles (or covered with foil) up to several months

  • Freshly add 1× protease inhibitor cocktail before use

DNase I digestion buffer

Make up fresh as outlined below.

Component

Amount

Final concentration

1 M Tris pH7.5

500 μl

50 mM

10% of NP40

500 μl

0.5% (vol/vol)

10% sodium lauroyl sarcosinate

100 μl

0.1% (wt/vol)

DEPC-treated water

8.9 ml

 

Add fresh:

 Protease inhibitor (EDTA-free)

1 tablet

 SUPERase•In (20 μl per 1 ml of lysate)

200 μl

 Turbo DNase I (100 μl per 1 ml of lysate)

1 ml

 

Total

10 ml

N/A

0.5 M LiCl lysis buffer

Make up fresh as outlined below.

Component

Amount

Final concentration

1 M Tris pH 7.5

500 μl

50 mM

10% of NP40

500 μl

0.5% (vol/vol)

5 M LiCl

1 ml

0.5 M

0.5 M EDTA

400 μl

20 mM

0.5 M EGTA

400 μl

20 mM

DEPC-treated water

6.1 ml

 

Add fresh:

 10% sodium lauroyl sarcosinate

1 ml

1% (wt/vol)

 10% sodium deoxycholate

100 μl

0.1% (wt/vol)

Total

10 ml

N/A

2× binding buffer

Prepare as outlined below. Can be stored at room temperature (RT, 20–27°C) for months.

Component

Amount

Final concentration

1 M Tris-HCl (pH 7.5)

100 μl

10 mM

500 mM EDTA

20 μl

1 mM

5 M NaCl

4 ml

2 M

DEPC-treated water

5.9 ml

 

Total

10 ml

N/A

Wash buffer 1

Make up fresh as outlined below.

Component

Amount

Final concentration

1 M Tris-HCl pH 7.5

2.5 ml

50 mM

5 M LiCl

3 ml

0.3 M

0.5 M EDTA

500 μl

5 mM

10% SDS

5 ml

1% (wt/vol)

10% NP40

2.5 ml

0.5% (vol/vol)

DEPC-treated water

36.2 ml

 

Add fresh:

 1 M DTT

50 μl

1 mM

 200 mM PMSF

250 μl

1 mM

 Protease inhibitors, EDTA-free

1 tablet

Total

50 ml

N/A

Wash buffer 2

Make up fresh as outlined below.

Component

Amount

Final concentration

10% of SDS

1 ml

1% (wt/vol)

5 M NaCl

300 μl

150 mM

0.5 M EDTA

100 μl

5 mM

DEPC-treated water

8.6 ml

N/A

Add fresh:

 1 M DTT

10 μl

1 mM

 200 mM PMSF

50 μl

1 mM

Total

10 ml

N/A

Elution buffer

Prepare as outlined below. Can be stored at RT for months.

Component

Amount

Final concentration

1 M Tris-HCl pH 7.5

0.1 ml

10 mM

0.5 M EDTA pH 8.0 (optional)

20 μl

1 mM

10% Triton X-100

50 μl

0.05% (vol/vol)

DEPC water

9.83 ml

 

Total

10 ml

N/A

PK buffer

Prepare as outlined below. Can be stored at RT for months.

Component

Amount

Final concentration

5 M NaCl

200 μl

100 mM

1 M Tris-HCl pH 7.0

100 μl

10 mM

0.5 M EDTA pH 8.0

20 μl

1 mM

10% SDS

500 μl

0.5% (wt/vol)

DEPC-treated water

9.18 ml

 

Total

10 ml

N/A

DEPC water

Component

Amount

Final concentration

Distilled water

1,000 ml

 

DEPC

1 ml

 

Total

1,000 ml

N/A

Autoclave distilled water first, then mix thoroughly with DEPC and leave it in chemical hood overnight. Autoclave again. Can be stored at RT for months.

Procedure

UV-cross-linking and preparation of the nuclei

Timing ~2 h

  1. 1

    Starting with cells grown to 80% confluence in a 15 cm2 plate, rinse cells with cold DPBS once and irradiate cells (with 5 ml of cold (0–8 °C) DPBS covering the 15 cm2 plate) at 200–400 mJ/cm2. For MEFs, we use 200 mJ/cm2. For mES cells, we use 400 mJ/cm2. Irradiate cells at 254 nm without a lid.

    Critical step

    Trypsinize cells when using ES cells, cells growing in spherical colonies or tissue before UV irradiation. For mES cells, trypsinize the cells grown in the 15 cm2 plate, spin at 800g at RT for 5 min and replace the medium with 5 ml of cold PBS, and plate cells onto the plate for irradiation. Determine cell number before irradiation.

  2. 2

    Collect cells in a 15 ml centrifuge tube. Add 5 ml of cold DPBS to wash the plate, and collect the remaining cells from the plate. Spin at 800g for 3 min at 4 °C. Remove and discard the supernatant. Note: when preparing a large-scale experiment, pool 1.5 × 108 cells in a 50 ml tube for CSKT treatment.

    Critical step

    If RNA of interest is in the cytoplasm, omit the CSKT treatment for nuclear isolation (Step 3).

  3. 3

    Resuspend the pellet in 10 ml of ice-cold CSKT (for 1.5 × 108 cell pellet) buffer with protease inhibitors for nuclear extraction, and keep it on the rocker for 3–10 min at 4 °C. Spin at 800g at 4 °C for 3 min. Remove and discard the supernatant. Note: add more CSKT as needed to accommodate the number of cells.

    Critical step

    This step is to reduce background noise from the cytoplasm. The incubation time of CSKT treatment is dependent on the cell type. We recommend 3 min for mES cells, and 10 min for MEFs. For testing the time of CSKT treatment, the efficiency of nuclei extraction could be checked by looking at cell morphology under a light microscope. Place some CSKT-treated cells onto a slide to check the efficiency. The CSKT treatment should be stopped when the plasma membrane is disrupted but the nuclear membrane remains intact.

  4. 4

    Resuspend the pellet (for 1.5–5 × 107 cells) with 1 ml of cold DPBS, and transfer to 1.5 ml tube. Spin at 800g at 4 °C for 3 min.

  5. 5

    Aspirate all the liquid on top of the pellet. Flash freeze cell pellets in liquid nitrogen, and store at −80 °C until use.

    Pause point

    Cells could be stored at −80 °C for several months.

    Troubleshooting

Preparation of lysate

Timing ~1.5 h

  1. 6

    Thaw nuclear pellets on ice. For 1.5 × 108 cells, resuspend the nuclear pellets in 3 ml of DNase I digestion buffer supplemented with 300 µl of Turbo DNaseI enzyme (2 U/µl), 60 µl SUPERase•In and 1× protease inhibitors (the total volume will reach to 3.5 ml). Separate the 3.5 ml lysate equally into 4× 1.5 ml microtubes. Mix well, and incubate at 37 °C for 20 min in a water bath. Note: longer incubations may help the solubility but may also increase the degradation of RNA and proteins. Pipette the lysates and invert microtubes several times during the incubation. Be careful to avoid water contamination.

    Critical step

    This step is required for capturing nuclear RNAs. If DNase does not work well, the background noise will increase. To check the efficiency of DNase treatment, DNA can be purified using cell lysates from Step 7 by phenol/chloroform extraction. DNA size after DNase treatment should be ~100–200 bp, the size of mononucleosomes (Extended Data Fig. 1d).

    Critical step

    Cell (nucleus) clumping is usually a problem. Separating 3.5 ml lysate into 4× 1.5 ml microtubes will avoid cell clumping during this step and increase the efficiency of DNase treatment. Incomplete digestion of genomic DNA may result in large RNA–chromatin complexes that could be nonspecifically pulled down by beads. The background noise such as abundant nuclear RNAs could be detected after iDRiP by RT-qPCR.

    Critical

    This step is optional for cytosolic RNAs.

    Troubleshooting

  2. 7

    Combine 3.5 ml lysate from Step 6 with the reagents listed below in a 15 ml tube. Mix well, vortex and continue incubation at 37 °C for 5 min in the water bath, then put lysate on ice.

    For 3.5 ml of initial lysate, combine reagents as follows to reach the final concentrations listed in the table below (the total volume will reach 5 ml).

    Component

    Amount

    Final concentration

    0.5 M EGTA

    200 μl

    20 mM

    0.5 M EDTA

    200 μl

    20 mM

    5 M LiCl

    500 μl

    0.5 M

    10% sodium lauroyl sarcosinate

    470 μl

    1%

    10% sodium deoxycholate

    50 μl

    0.1%

    DEPC-treated water

    70 μl

     

    Total

    1,490 μl

     
  3. 8

    After mixing, separate the 5 ml of lysate into five microtubes (1.5 ml). (Note: at this point, the lysate may still have small clumps. It is okay to continue to the following steps.) Spin down at highest speed (15,000g) at 4 °C for 2 min. Save the supernatant on ice.

  4. 9

    Resuspend the pellet in each tube in 600 μl of 0.5 M LiCl lysis buffer (Note: 600 μl are used for the pellet from 1 ml lysate. Total volume will be 600 μl X 5 tubes = 3 ml), leave on ice for 10 min and then incubate at 65°C for 5 min (mix frequently by inverting the tubes) in a heat block. Immediately spin down (15,000g at RT, 1 min). Save the supernatant, and leave it on ice for several minutes. Combine with the previous supernatant (from Step 8). The final volume of lysate after this step will reach ~8 ml.

    Critical step

    Insoluble cellular material is removed by these two centrifugation steps prior to RNA capture by probes. Insoluble material may adhere to beads and increase the background noise. We also found that heating the insoluble materials from the pellet to 65 °C can increase the solubility of chromatin-bound RNA–protein complexes.

    Caution

    When collecting supernatant, it is important to avoid collecting the pellet (insoluble materials), otherwise the background noise will increase.

  5. 10

    Collect 1% (vol/vol) input. (It can vary from 1% to 10%). Freeze samples at −80 °C.

  6. 11

    Use 3–5 ml of lysate per capture for the following steps. Separate lysate into several 1.5 ml microtubes. Perform the hybridization in 1.5 ml microtubes.

    Critical step

    To test the capture efficiency of iDRiP, a small-scale (300 μl lysate) iDRiP could be done prior to a large-scale experiment (3 ml lysate).

Preclear lysate

Timing ~0.5 h

  1. 12

    Wash 50 μl Dynabeads (MyOne streptavidin C1 beads) with 1 ml of DEPC-treated water twice in a 1.5 ml microtube by placing on a magnet to collect the beads. Suspend beads in an equal volume of 0.5 M LiCl lysis buffer (i.e., take 50 μl beads from the original vial and resuspend 50 μl of lysis buffer).

  2. 13

    Add 50 μl of beads (50 μl for 3 ml of lysate, ~16.3 μl beads for 1 ml of lysate) for precleaning, and incubate beads with lysates at RT for 20 min with gentle rotation. Collect beads on a magnet, transfer the lysate to new tubes and discard beads.

Conjugate oligo probes to beads

Timing ~0.5 h

Critical

DNA Oligos should be 20–25 mers. See ‘Reagents’ for validated sequences.

  1. 14

    Wash with 300 μl of of Dynabeads with 1 ml of DEPC water twice.

  2. 15

    Resuspend beads in 300 μl of 1× binding buffer.

  3. 16

    Add 150 pmoles of probes (the total amount) to beads, and incubate at RT for 20 min with gentle rotation.

  4. 17

    Wash beads with 1 ml of 1× binding buffer twice.

  5. 18

    Resuspend beads in 300 μl of 0.5 M LiCl lysis buffer.

Hybridization

Timing ~3 h

Critical

One of the most critical factors for a successful iDRiP experiment is the hybridization condition used to capture the ncRNA of interest. We anticipated that the highly repetitive nature of TERRA and its high abundance in mES cells could be an issue for probe specificity. Thus, whereas Xist capture hybridization is carried out at 37 °C overnight, TERRA is first denatured at 65 °C and then gradually cooled to 37 °C over 1–2 h to enhance capture specificity.

Critical

Hybridization of capture probe to RNA target is done in a hybridization chamber/oven (used for Southern or northern blotting), where the temperature can be gradually decreased from 65 °C to 37 °C over 30–60 min. This aids in optimizing the hybridization temperature of different RNA-targeting probes between 65 °C and 37 °C.

  1. 19

    Prewarm lysates (from Step 13) and 300 μl of probe-conjugated beads (from Step 18) to 65 °C for 5 min. Mix lysates with the beads, and immediately incubate the mixture at 65 °C (~100 μl beads for 1 ml of lysate in a 1.5 ml microtube).

    Critical step

    It is important to bring up the temperatures of probes and lysates individually before hybridization to prevent nonspecific binding at a lower temperature.

  2. 20

    Incubate at 65 °C for 15 min, then reduce the temperature slowly to 37 °C and continue the hybridization for 1 h in the hybridization oven.

    Critical step

    It is important to reduce the temperature to 37 °C slowly to obtain the optimal hybridization efficiency. Suddenly dropping the temperature is not recommended. The easy way to slowly reduce the temperature is to set the oven temperature to 37 °C (the original temperature is 65 °C) and keep the hybridization oven closed.

    Caution

    Do not let the hybridization temperature exceed 65 °C; higher temperatures will disrupt the biotin–streptavidin interaction.

    Troubleshooting

Washing

Timing ~1.5 h

  1. 21

    Prewarm wash buffer 1 (see ‘Reagent setup’) to 37 °C.

  2. 22

    Capture the beads at 37 °C on the magnet. Remove the supernatant containing unbound materials, and add wash buffer 1 to resuspend the beads. Each wash constitutes 1 ml wash buffer/1.5 ml tube under rotation (on wheel, or 3 ml wash buffer for 3 ml of lysate in 15 ml tube) each for 5 min at 37 ºC

  3. 23

    Wash beads three times.

  4. 24

    Rinse beads once at RT in DNase I buffer (plus 0.2 M LiCl) (1 ml DNase I buffer/1.5 ml tube) to clean out the SDS and EDTA and EGTA, which can inhibit the DNase activity. Resuspend the beads in 300 µl DNase I buffer (for 3 ml of lysate) with 0.2 M LiCl + 10 µl TURBO DNase + 4 µl SUPERase•In + protease inhibitor. Incubate at 37 °C for 10 min in the water bath.

  5. 25

    Add wash buffer 1, and mix briefly before magnetic separation.

  6. 26

    Perform two more wash (total five washes), each for 5 min at 37 °C.

  7. 27

    Perform an additional wash with 1 ml of 150 mM of NaCl wash buffer/1.5 ml tube (wash buffer 2) at 37 °C for 5 min.

    Critical step

    The lower-salt washing step (150 mM NaCl) is important for improving the signal-to-noise ratio. However, it may reduce the capture efficiency if incubation at 37 °C exceeds 5 min.

Elution

Timing ~0.5 h

Critical

We have tried several methods for elution, such as excess biotin, 95 °C formamide and 70 °C low-salt solution. The RNA recovery rates did not change. We found that 0.05% (vol/vol) Triton X-100 is required for the low-salt solution method at 70 °C to efficiently elute complexes from beads. Other detergents such as 0.05% (vol/vol) NP40 may also work.

  1. 28

    Resuspend the beads in 600 µl elution buffer (for 3 ml of original lysate), and incubate the sample at 70 °C for 5 min, then immediately place it on the magnet (~200 µl elution buffer is used for 1 ml of lysate).

  2. 29

    Collect the supernatant in a separate tube. Put the supernatant on the magnet to remove residual beads by transferring supernatant again to a new tube. Keep the elution on ice.

  3. 30

    For every MS experiment, quantification of the RNA of interest is necessary. Set aside 10% (vol/vol) elute for quantification of RNA pulldown by RT-qPCR.

  4. 31

    For submission of samples for MS, snap freeze the elution and store at −80 °C.

  5. 32

    For RNA extraction, take 2–10% (vol/vol) of elution (usually 6–12 μl is enough), and add 90 μl of PK buffer (with 5 μl of 20 mg/ml proteinase K in 100 μl buffer) for 15 min at 55 °C (preheat PK buffer at 55 °C for 10 min in advance). Then, add 1 ml of Trizol. Process the input sample from Step 10 (use between 0.2% and 2% of input; usually 6 μl is enough) the same way by treating with proteinase K followed by Trizol extraction.

    Pause point

    The eluted proteins and RNA (in Trizol) can be stored at −80 °C for several months.

Trizol RNA extraction

Timing ~1 d

  1. 33

    Thaw out RNA in Trizol, and pipette the solution several times at RT. Add 200 µl of chloroform, and shake tubes by hand. Incubate samples at RT for 5 min.

    Caution

    Please conduct the procedures using Trizol and chloroform under a chemical hood.

  2. 34

    Spin at 14,000g for 15 min at 4 °C.

  3. 35

    Take the upper (aqueous) phase, and transfer to a new 1.5 ml tube.

  4. 36

    Add 500 µl of acid phenol/chloroform to the aqueous phase, shake and incubate at RT for 5 min.

  5. 37

    Spin down at 14,000g for 10 min at 4 °C.

  6. 38

    Transfer the upper phase to a new 1.5 ml tube.

  7. 39

    Add 3 µl of glycoBlue (15 mg/ml). Mix well.

  8. 40

    Add 0.5 mL of isopropyl alcohol, and mix well to precipitate RNA. Incubate samples at −20 °C overnight. (Note: RNA can be stored in isopropyl alcohol at −20 °C for several months.)

  9. 41

    Spin at 14,000g for 15 min at 4 °C.

  10. 42

    Wash with 1 ml of 75% (vol/vol) EtOH.

  11. 43

    Spin at 14,000g for 5 min. Carefully remove supernatant.

  12. 44

    Air dry for 5–10 min.

  13. 45

    Dissolve RNA in 30 µl of DEPC-treated water, and immediately put RNA samples on ice.

cDNA synthesis

  1. 46

    Prepare RNA-primer mix by combining the following components with each RNA sample in PCR reaction tubes. Add an equal volume of RNA from each sample for cDNA synthesis.

    Component

    Volume

    50 μM random hexamers

    1 μl

    10 mM dNTP mix (10 mM each)

    1 μl

    RNA sample (10 pg to 5 μg total RNA)

    5 μl (or up to 11 μl)

    DEPC-treated or nuclease-free water

    to 13 μl

  2. 47

    Mix and briefly centrifuge the components.

  3. 48

    Heat the RNA-primer mix at 65 °C for 5 min, and keep on ice for at least 1 min.

  4. 49

    Prepare reverse transcription reaction mix with the following components in a reaction tube (per RNA sample).

    Component

    Volume

    5× SSIV buffer

    4 μl

    100 mM DTT

    1 μl

    RNaseOUT recombinant RNase inhibitor

    1 μl

    SuperScript IV reverse transcriptase (200 U/μl)

    1 μl

  5. 50

    Mix well and briefly centrifuge the contents.

  6. 51

    Add the reverse transcription reaction mix to the annealed RNA sample from Step 49.

  7. 52

    Incubate the mixture at the following temperatures using a PCR machine:

    Temperature

    Time

    25 °C

    10 min

    50 °C

    50 min

    85 °C

    5 min

  8. 53

    (Optional) Add 1 μl E. coli RNase H to remove RNA, and incubate at 37 °C for 20 min.

  9. 54

    The samples are ready for RT-qPCR immediately, or they can be stored at −20 °C.

Calculating enrichment of the target RNA by RT-qPCR

  1. 55

    Before the RT-qPCR experiment, prepare tenfold serial dilutions from the input sample. The serial dilutions of cDNA samples are used for generating a standard curve for calculating the concentrations of RNA-captured samples.

  2. 56

    To define the RNA capture efficiency of iDRiP by RT-qPCR experiment, we use ‘percentage of the input’ as the unit of measurement. This analysis represents the amount of captured RNA relative to the amount of starting material from the input sample.

  3. 57

    The following table is an example for setting up an RT-qPCR standard curve. The concentration of captured RNA can be calculated using the CFX Manager for qPCR data analysis provided by Bio-Rad. Both the target RNA and the reference RNA need a standard curve. The reference RNA such as U1, U6 or GAPDH can be used. Set up qPCR reactions according to the manufacturer’s manual.

    Sample

    Cq

    Starting quantity (percent of input)

    100% input

     

    100

    10% input

     

    10

    1% input

     

    1

    0.1% input

     

    0.1

    0.01% input

     

    0.01

    0.001% input

     

    0.001

  4. 58

    Calculate capture efficiency, enrichment of the target RNA and signal-to-noise ratio using the formulas listed below:

    The capture efficiency (presented as % of input) = the amount of captured RNA/total starting RNA.

    The enrichment of target RNA (specificity) = the amount of target RNA after target probe capture (% of input)/the amount of target RNA after control probe capture (% of input).

    The fold enrichment of target RNA compared with the reference RNA (signal-to-noise ratio) = the amount of target RNA after capture (% of input)/the amount of reference RNA after capture (% of input).

Troubleshooting

Troubleshooting advice is provided in Table 2.

Table 2 Troubleshooting table

Timing

  • Steps 1–5, UV cross-linking: 2 h

  • Steps 6–11, preparation of lysate and first DNase treatment: 1.5 h

  • Steps 12–18, preclean lysate and conjugate oligo probes to beads: 1 h

  • Steps 19–20, hybridization: 3 h

  • Steps 21–27, washing and second DNase treatment: 1.5 h

  • Steps 28–32, elution: 0.5 h

  • Steps 33–45, RNA extraction 1.5 d

  • Steps 46–54, cDNA synthesis: 1.5 h

  • Steps 55–58, calculating enrichment of the target RNA by RT-qPCR: 3 h

Anticipated results

Calculating enrichment of the target RNA

After iDRiP capture, quantitative RT-PCR is used to check the recovery and specificity. Above 10% of target RNA can be recovered compared with the input. The signal-to-noise ratio, which compares the target RNA to other abundant RNAs such as U1 RNA or GAPDH, is very important. We usually obtain at least 50-fold enrichment (Fig. 1b). For the specificity of capture, the relative enrichment of the target RNA is calculated by comparing the amount of target RNA between the specific capture and the negative control captures. As shown in Fig. 1c, TERRA iDRiP yielded an over 1,000-fold enrichment compared with Luc-iDRiP capture. The high specificity is critical because proteins captured by the control probe are used for the normalization of the specific protein enrichment after quantitative MS.

The interaction of RNA and protein can be further validated by UV-RIP14 or immuno-RNA-FISH staining13 (Fig. 4a,b).

Statistical analysis

The intensity of each protein analyzed by quantitative MS (TMT) is considered as protein abundance after capture. After normalization with luciferase control, the log2 value of each protein abundance (target/control) represents the enrichment over control. Probability density using log2 values was plotted by fitting the data to a normal distribution (Extended Data Fig. 4a,b). We provide two different methods to obtain P values. (1) if only one data set is available, P values could be calculated by the fitting curve method. P value and FDR were calculated in R 3.6.2 for U1-ChIRP and U1-iDRiP (Supplementary Tables 5 and 6). A histogram of probability density was plotted using hist(x = abundance ratio, probability = TRUE), and datapoints on the histogram were extracted using the density() function and were fitted to a normal distribution by the nonlinear least squares method using the nls() function in R. The Gaussian formula used in this step was \(f\left( x \right) = {\mathrm{a}} \cdot e^{\left( - \frac{{(x - b)^2}}{{2c^2}}\right)}\), and the parameters a, b and c were determined by fitting the datapoints when finding a model. P values were calculated using the original log2 abundance ratio as x, and the FDR was obtained using P values adjusted by Benjamini–Hochberg method. (2) If there are biological replicates, P values can be calculated using the rank product method51. RankProd 2.052, an R package, was applied to U1-iDRiP and U1-ChIRP analysis (Extended Data Fig. 4c,d, Supplementary Table 7, 8)