当前位置: X-MOL 学术Nucleic Acids Res. › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
Transposable element sequence fragments incorporated into coding and noncoding transcripts modulate the transcriptome of human pluripotent stem cells
Nucleic Acids Research ( IF 16.6 ) Pub Date : 2021-08-03 , DOI: 10.1093/nar/gkab710
Isaac A Babarinde 1, 2 , Gang Ma 1, 2 , Yuhao Li 1, 2 , Boping Deng 2, 3 , Zhiwei Luo 4, 5 , Hao Liu 4, 5 , Mazid Md Abdul 4, 5 , Carl Ward 4, 5 , Minchun Chen 2 , Xiuling Fu 1, 2 , Liyang Shi 1, 2 , Martha Duttlinger 2 , Jiangping He 6 , Li Sun 1, 2 , Wenjuan Li 4, 5 , Qiang Zhuang 2 , Guoqing Tong 7 , Jon Frampton 3 , Jean-Baptiste Cazier 3, 8 , Jiekai Chen 5, 6, 9 , Ralf Jauch 10 , Miguel A Esteban 4, 5, 11 , Andrew P Hutchins 1, 2
Affiliation  

Transposable elements (TEs) occupy nearly 40% of mammalian genomes and, whilst most are fragmentary and no longer capable of transposition, they can nevertheless contribute to cell function. TEs within genes transcribed by RNA polymerase II can be copied as parts of primary transcripts; however, their full contribution to mature transcript sequences remains unresolved. Here, using long and short read (LR and SR) RNA sequencing data, we show that 26% of coding and 65% of noncoding transcripts in human pluripotent stem cells (hPSCs) contain TE-derived sequences. Different TE families are incorporated into RNAs in unique patterns, with consequences to transcript structure and function. The presence of TE sequences within a transcript is correlated with TE-type specific changes in its subcellular distribution, alterations in steady-state levels and half-life, and differential association with RNA Binding Proteins (RBPs). We identify hPSC-specific incorporation of endogenous retroviruses (ERVs) and LINE:L1 into protein-coding mRNAs, which generate TE sequence-derived peptides. Finally, single cell RNA-seq reveals that hPSCs express ERV-containing transcripts, whilst differentiating subpopulations lack ERVs and express SINE and LINE-containing transcripts. Overall, our comprehensive analysis demonstrates that the incorporation of TE sequences into the RNAs of hPSCs is more widespread and has a greater impact than previously appreciated.

中文翻译:

纳入编码和非编码转录本的转座因子序列片段调节人类多能干细胞的转录组

转座因子 (TE) 占哺乳动物基因组的近 40%,虽然大多数是片段的并且不再能够转座,但它们仍然可以促进细胞功能。由 RNA 聚合酶 II 转录的基因中的 TE 可以作为初级转录物的一部分进行复制;然而,它们对成熟转录序列的全部贡献仍未得到解决。在这里,使用长读长和短读长(LR 和 SR)RNA 测序数据,我们显示人类多能干细胞 (hPSC) 中 26% 的编码转录本和 65% 的非编码转录本包含 TE 衍生序列。不同的 TE 家族以独特的模式整合到 RNA 中,对转录本结构和功能产生影响。转录本中 TE 序列的存在与其亚细胞分布的 TE 型特异性变化相关,稳态水平和半衰期的改变,以及与 RNA 结合蛋白 (RBP) 的差异关联。我们确定 hPSC 特异性掺入内源性逆转录病毒 (ERV) 和 LINE:L1 到蛋白质编码 mRNAs 中,这些 mRNAs 产生 TE 序列衍生肽。最后,单细胞 RNA-seq 显示 hPSC 表达含有 ERV 的转录本,而分化的亚群缺乏 ERV 并表达含有 SINE 和 LINE 的转录本。总体而言,我们的综合分析表明,将 TE 序列整合到 hPSC 的 RNA 中更为普遍,并且比以前所认为的影响更大。产生 TE 序列衍生的肽。最后,单细胞 RNA-seq 显示 hPSC 表达含有 ERV 的转录本,而分化的亚群缺乏 ERV 并表达含有 SINE 和 LINE 的转录本。总体而言,我们的综合分析表明,将 TE 序列整合到 hPSC 的 RNA 中更为普遍,并且比以前所认为的影响更大。产生 TE 序列衍生的肽。最后,单细胞 RNA-seq 显示 hPSC 表达含有 ERV 的转录本,而分化的亚群缺乏 ERV 并表达含有 SINE 和 LINE 的转录本。总体而言,我们的综合分析表明,将 TE 序列整合到 hPSC 的 RNA 中更为普遍,并且比以前所认为的影响更大。
更新日期:2021-08-03
down
wechat
bug