当前位置: X-MOL 学术Brief. Bioinform. › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
DeepMotifSyn: a deep learning approach to synthesize heterodimeric DNA motifs
Briefings in Bioinformatics ( IF 9.5 ) Pub Date : 2021-07-29 , DOI: 10.1093/bib/bbab334
Jiecong Lin 1 , Lei Huang 2 , Xingjian Chen 3 , Shixiong Zhang 3 , Ka-Chun Wong 1
Affiliation  

The cooperativity of transcription factors (TFs) is a widespread phenomenon in the gene regulation system. However, the interaction patterns between TF binding motifs remain elusive. The recent high-throughput assays, CAP-SELEX, have identified over 600 composite DNA sites (i.e. heterodimeric motifs) bound by cooperative TF pairs. However, there are over 25 000 inferentially effective heterodimeric TFs in the human cells. It is not practically feasible to validate all heterodimeric motifs due to cost and labor. We introduce DeepMotifSyn, a deep learning-based tool for synthesizing heterodimeric motifs from monomeric motif pairs. Specifically, DeepMotifSyn is composed of heterodimeric motif generator and evaluator. The generator is a U-Net-based neural network that can synthesize heterodimeric motifs from aligned motif pairs. The evaluator is a machine learning-based model that can score the generated heterodimeric motif candidates based on the motif sequence features. Systematic evaluations on CAP-SELEX data illustrate that DeepMotifSyn significantly outperforms the current state-of-the-art predictors. In addition, DeepMotifSyn can synthesize multiple heterodimeric motifs with different orientation and spacing settings. Such a feature can address the shortcomings of previous models. We believe DeepMotifSyn is a more practical and reliable model than current predictors on heterodimeric motif synthesis. Contact:kc.w@cityu.edu.hk

中文翻译:

DeepMotifSyn:一种合成异二聚体 DNA 基序的深度学习方法

转录因子(TFs)的协同作用是基因调控系统中普遍存在的现象。然而,TF 结合基序之间的相互作用模式仍然难以捉摸。最近的高通量分析 CAP-SELEX 已经确定了 600 多个由协同 TF 对结合的复合 DNA 位点(即异二聚体基序)。然而,人类细胞中有超过 25 000 个推断有效的异二聚体 TF。由于成本和劳动力的原因,验证所有异二聚体基序实际上是不可行的。我们介绍了 DeepMotifSyn,这是一种基于深度学习的工具,用于从单体基序对合成异二聚基序。具体来说,DeepMotifSyn 由异二聚基序生成器和评估器组成。生成器是一个基于 U-Net 的神经网络,可以从对齐的基序对合成异二聚基序。评估器是一个基于机器学习的模型,可以根据基序序列特征对生成的异二聚基序候选进行评分。对 CAP-SELEX 数据的系统评估表明,DeepMotifSyn 显着优于当前最先进的预测器。此外,DeepMotifSyn 可以合成具有不同方向和间距设置的多个异二聚体基序。这样的功能可以解决以前模型的缺点。我们相信 DeepMotifSyn 是一种比当前异二聚体基序合成预测指标更实用和可靠的模型。联络:kc.w@cityu.edu.hk 对 CAP-SELEX 数据的系统评估表明,DeepMotifSyn 显着优于当前最先进的预测器。此外,DeepMotifSyn 可以合成具有不同方向和间距设置的多个异二聚体基序。这样的功能可以解决以前模型的缺点。我们相信 DeepMotifSyn 是一种比当前异二聚体基序合成预测指标更实用和可靠的模型。联络:kc.w@cityu.edu.hk 对 CAP-SELEX 数据的系统评估表明,DeepMotifSyn 显着优于当前最先进的预测器。此外,DeepMotifSyn 可以合成具有不同方向和间距设置的多个异二聚体基序。这样的功能可以解决以前模型的缺点。我们相信 DeepMotifSyn 是一种比当前异二聚体基序合成预测指标更实用和可靠的模型。联络:kc.w@cityu.edu.hk
更新日期:2021-07-29
down
wechat
bug