当前位置: X-MOL 学术arXiv.cs.NE › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
Ranking labs-of-origin for genetically engineered DNA using Metric Learning
arXiv - CS - Neural and Evolutionary Computing Pub Date : 2021-07-16 , DOI: arxiv-2107.07878
I. Muniz, F. H. F. Camargo, A. Marques

With the constant advancements of genetic engineering, a common concern is to be able to identify the lab-of-origin of genetically engineered DNA sequences. For that reason, AltLabs has hosted the genetic Engineering Attribution Challenge to gather many teams to propose new tools to solve this problem. Here we show our proposed method to rank the most likely labs-of-origin and generate embeddings for DNA sequences and labs. These embeddings can also perform various other tasks, like clustering both DNA sequences and labs and using them as features for Machine Learning models applied to solve other problems. This work demonstrates that our method outperforms the classic training method for this task while generating other helpful information.

中文翻译:

使用度量学习对基因工程 DNA 的起源实验室进行排名

随着基因工程的不断进步,人们普遍关心的是能够识别基因工程 DNA 序列的来源实验室。为此,AltLabs 举办了基因工程归因挑战赛,聚集众多团队,提出解决这一问题的新工具。在这里,我们展示了我们提出的方法来对最可能的实验室进行排名,并为 DNA 序列和实验室生成嵌入。这些嵌入还可以执行各种其他任务,例如对 DNA 序列和实验室进行聚类,并将它们用作机器学习模型的特征,用于解决其他问题。这项工作表明,我们的方法在生成其他有用信息的同时优于此任务的经典训练方法。
更新日期:2021-07-19
down
wechat
bug