当前位置: X-MOL 学术Genome Biol. › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
Robustness and applicability of transcription factor and pathway analysis tools on single-cell RNA-seq data
Genome Biology ( IF 10.1 ) Pub Date : 2020-02-12 , DOI: 10.1186/s13059-020-1949-z
Christian H Holland 1, 2 , Jovan Tanevski 1, 3 , Javier Perales-Patón 1 , Jan Gleixner 4, 5 , Manu P Kumar 6 , Elisabetta Mereu 7 , Brian A Joughin 6, 8 , Oliver Stegle 4, 5, 9 , Douglas A Lauffenburger 6 , Holger Heyn 7, 10 , Bence Szalai 11 , Julio Saez-Rodriguez 1, 2
Affiliation  

Background Many functional analysis tools have been developed to extract functional and mechanistic insight from bulk transcriptome data. With the advent of single-cell RNA sequencing (scRNA-seq), it is in principle possible to do such an analysis for single cells. However, scRNA-seq data has characteristics such as drop-out events and low library sizes. It is thus not clear if functional TF and pathway analysis tools established for bulk sequencing can be applied to scRNA-seq in a meaningful way. Results To address this question, we perform benchmark studies on simulated and real scRNA-seq data. We include the bulk-RNA tools PROGENy, GO enrichment, and DoRothEA that estimate pathway and transcription factor (TF) activities, respectively, and compare them against the tools SCENIC/AUCell and metaVIPER, designed for scRNA-seq. For the in silico study, we simulate single cells from TF/pathway perturbation bulk RNA-seq experiments. We complement the simulated data with real scRNA-seq data upon CRISPR-mediated knock-out. Our benchmarks on simulated and real data reveal comparable performance to the original bulk data. Additionally, we show that the TF and pathway activities preserve cell type-specific variability by analyzing a mixture sample sequenced with 13 scRNA-seq protocols. We also provide the benchmark data for further use by the community. Conclusions Our analyses suggest that bulk-based functional analysis tools that use manually curated footprint gene sets can be applied to scRNA-seq data, partially outperforming dedicated single-cell tools. Furthermore, we find that the performance of functional analysis tools is more sensitive to the gene sets than to the statistic used.

中文翻译:


转录因子和通路分析工具对单细胞 RNA-seq 数据的稳健性和适用性



背景 已经开发了许多功能分析工具来从大量转录组数据中提取功能和机制见解。随着单细胞 RNA 测序 (scRNA-seq) 的出现,原则上可以对单细胞进行此类分析。然而,scRNA-seq 数据具有丢失事件和文库大小较低等特征。因此,尚不清楚为批量测序建立的功能性 TF 和通路分析工具是否可以以有意义的方式应用于 scRNA-seq。结果为了解决这个问题,我们对模拟和真实的 scRNA-seq 数据进行基准研究。我们包括分别估计通路和转录因子 (TF) 活性的批量 RNA 工具 PROGENy、GO 富集和 DoRothEA,并将它们与专为 scRNA-seq 设计的工具 SCENIC/AUCell 和 metaVIPER 进行比较。对于计算机研究,我们模拟来自 TF/通路扰动批量 RNA 测序实验的单细胞。我们用 CRISPR 介导的敲除后的真实 scRNA-seq 数据来补充模拟数据。我们对模拟和真实数据的基准测试揭示了与原始批量数据相当的性能。此外,我们通过分析使用 13 个 scRNA-seq 方案测序的混合物样本,表明 TF 和通路活性保留了细胞类型特异性的变异性。我们还提供基准数据供社区进一步使用。结论我们的分析表明,使用手动策划的足迹基因集的基于批量的功能分析工具可以应用于 scRNA-seq 数据,部分优于专用的单细胞工具。此外,我们发现功能分析工具的性能对基因集比对所使用的统计更敏感。
更新日期:2020-02-12
down
wechat
bug