Cognitive analysis of metabolomics data for systems biology,Nature Protocols

当前位置： X-MOL 学术 › Nat. Protoc. › 论文详情

Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)

Cognitive analysis of metabolomics data for systems biology
Nature Protocols ( IF 13.1 ) Pub Date : 2021-01-22 , DOI: 10.1038/s41596-020-00455-4
Erica L-W Majumder ₁ , Elizabeth M Billings ₁ , H Paul Benton ₁ , Richard L Martin ₂ , Amelia Palermo ₁ , Carlos Guijas ₁ , Markus M Rinschen ₁ , Xavier Domingo-Almenara ₁ , J Rafael Montenegro-Burke ₁ , Bradley A Tagtow ₂ , Robert S Plumb ₃ , Gary Siuzdak ₁

Affiliation

Cognitive computing is revolutionizing the way big data are processed and integrated, with artificial intelligence (AI) natural language processing (NLP) platforms helping researchers to efficiently search and digest the vast scientific literature. Most available platforms have been developed for biomedical researchers, but new NLP tools are emerging for biologists in other fields and an important example is metabolomics. NLP provides literature-based contextualization of metabolic features that decreases the time and expert-level subject knowledge required during the prioritization, identification and interpretation steps in the metabolomics data analysis pipeline. Here, we describe and demonstrate four workflows that combine metabolomics data with NLP-based literature searches of scientific databases to aid in the analysis of metabolomics data and their biological interpretation. The four procedures can be used in isolation or consecutively, depending on the research questions. The first, used for initial metabolite annotation and prioritization, creates a list of metabolites that would be interesting for follow-up. The second workflow finds literature evidence of the activity of metabolites and metabolic pathways in governing the biological condition on a systems biology level. The third is used to identify candidate biomarkers, and the fourth looks for metabolic conditions or drug-repurposing targets that the two diseases have in common. The protocol can take 1–4 h or more to complete, depending on the processing time of the various software used.

中文翻译：

系统生物学代谢组学数据的认知分析

认知计算正在彻底改变大数据的处理和集成方式，人工智能 (AI) 自然语言处理 (NLP) 平台可帮助研究人员有效搜索和消化大量科学文献。大多数可用平台都是为生物医学研究人员开发的，但其他领域的生物学家正在出现新的 NLP 工具，一个重要的例子是代谢组学。NLP 提供基于文献的代谢特征情境化，可减少代谢组学数据分析流程中优先级排序、识别和解释步骤所需的时间和专家级主题知识。在这里，我们描述并演示了四种工作流程，将代谢组学数据与基于 NLP 的科学数据库文献检索相结合，以帮助分析代谢组学数据及其生物学解释。这四个程序可以单独使用，也可以连续使用，具体取决于研究问题。第一个用于初始代谢物注释和优先级排序，创建后续感兴趣的代谢物列表。第二个工作流程找到代谢物和代谢途径在系统生物学水平上控制生物条件的活性的文献证据。第三个用于识别候选生物标志物，第四个用于寻找两种疾病共有的代谢状况或药物再利用靶点。该协议可能需要 1-4 小时或更长时间才能完成，具体取决于所使用的各种软件的处理时间。

更新日期：2021-01-22

点击分享查看原文

点击收藏

阅读更多本刊最新论文本刊介绍/投稿指南11