当前位置: X-MOL 学术J. Cheminfom. › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
Enhancing chemical synthesis: a two-stage deep neural network for predicting feasible reaction conditions
Journal of Cheminformatics ( IF 8.6 ) Pub Date : 2024-01-24 , DOI: 10.1186/s13321-024-00805-4
Lung-Yi Chen , Yi-Pei Li

In the field of chemical synthesis planning, the accurate recommendation of reaction conditions is essential for achieving successful outcomes. This work introduces an innovative deep learning approach designed to address the complex task of predicting appropriate reagents, solvents, and reaction temperatures for chemical reactions. Our proposed methodology combines a multi-label classification model with a ranking model to offer tailored reaction condition recommendations based on relevance scores derived from anticipated product yields. To tackle the challenge of limited data for unfavorable reaction contexts, we employed the technique of hard negative sampling to generate reaction conditions that might be mistakenly classified as suitable, forcing the model to refine its decision boundaries, especially in challenging cases. Our developed model excels in proposing conditions where an exact match to the recorded solvents and reagents is found within the top-10 predictions 73% of the time. It also predicts temperatures within ± 20 $$^{\circ }{} {\textbf {C}}$$ of the recorded temperature in 89% of test cases. Notably, the model demonstrates its capacity to recommend multiple viable reaction conditions, with accuracy varying based on the availability of condition records associated with each reaction. What sets this model apart is its ability to suggest alternative reaction conditions beyond the constraints of the dataset. This underscores its potential to inspire innovative approaches in chemical research, presenting a compelling opportunity for advancing chemical synthesis planning and elevating the field of reaction engineering. Scientific contribution: The combination of multi-label classification and ranking models provides tailored recommendations for reaction conditions based on the reaction yields. A novel approach is presented to address the issue of data scarcity in negative reaction conditions through data augmentation.

中文翻译:

增强化学合成:用于预测可行反应条件的两级深度神经网络

在化学合成规划领域,准确推荐反应条件对于取得成功结果至关重要。这项工作引入了一种创新的深度学习方法,旨在解决预测化学反应的适当试剂、溶剂和反应温度的复杂任务。我们提出的方法将多标签分类模型与排名模型相结合,根据预期产品产量得出的相关性分数提供定制的反应条件建议。为了解决不利反应环境中数据有限的挑战,我们采用了硬负采样技术来生成可能被错误分类为合适的反应条件,迫使模型细化其决策边界,尤其是在具有挑战性的情况下。我们开发的模型擅长提出条件,其中 73% 的时间在前 10 个预测中找到与记录的溶剂和试剂完全匹配的条件。它还在 89% 的测试用例中预测温度在记录温度的 ± 20 $$^{\circ }{} {\textbf {C}}$$ 范围内。值得注意的是,该模型展示了其推荐多种可行反应条件的能力,其准确性根据与每个反应相关的条件记录的可用性而变化。该模型的独特之处在于它能够提出超出数据集限制的替代反应条件。这凸显了其激发化学研究创新方法的潜力,为推进化学合成规划和提升反应工程领域提供了绝佳的机会。科学贡献:多标签分类和排序模型的结合为基于反应产率的反应条件提供了量身定制的建议。提出了一种新方法,通过数据增强来解决负反应条件下的数据稀缺问题。
更新日期:2024-01-25
down
wechat
bug