Demystifying “drop-outs” in single-cell UMI data,Genome Biology

当前位置： X-MOL 学术 › Genome Biol. › 论文详情

Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)

Demystifying “drop-outs” in single-cell UMI data
Genome Biology ( IF 10.1 ) Pub Date : 2020-08-06 , DOI: 10.1186/s13059-020-02096-y
Tae Hyun Kim ₁ , Xiang Zhou ₂ , Mengjie Chen ₃

Affiliation

Many existing pipelines for scRNA-seq data apply pre-processing steps such as normalization or imputation to account for excessive zeros or “drop-outs." Here, we extensively analyze diverse UMI data sets to show that clustering should be the foremost step of the workflow. We observe that most drop-outs disappear once cell-type heterogeneity is resolved, while imputing or normalizing heterogeneous data can introduce unwanted noise. We propose a novel framework HIPPO (Heterogeneity-Inspired Pre-Processing tOol) that leverages zero proportions to explain cellular heterogeneity and integrates feature selection with iterative clustering. HIPPO leads to downstream analysis with greater flexibility and interpretability compared to alternatives.

中文翻译：

揭秘单细胞 UMI 数据中的“丢失”

许多现有的 scRNA-seq 数据流程应用标准化或插补等预处理步骤来解决过多的零或“丢失”问题。在这里，我们广泛分析了不同的 UMI 数据集，以表明聚类应该是最重要的步骤。我们观察到，一旦细胞类型异质性得到解决，大多数丢失就会消失，而输入或标准化异质数据可能会引入不需要的噪声，该框架利用零比例来解释。与替代方案相比，HIPPO 能够利用细胞异质性并将特征选择与迭代聚类相结合，从而使下游分析具有更大的灵活性和可解释性。

更新日期：2020-08-06

点击分享查看原文

点击收藏

公开下载

阅读更多本刊最新论文本刊介绍/投稿指南11