Machine learning in physics: the pitfalls of poisoned training sets,Machine Learning: Science and Technology

当前位置： X-MOL 学术 › Mach. Learn. Sci. Technol. › 论文详情

Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)

Machine learning in physics: the pitfalls of poisoned training sets
Machine Learning: Science and Technology ( IF 6.3 ) Pub Date : 2020-09-10 , DOI: 10.1088/2632-2153/aba821
Chao Fang _{1,

2} , Amin Barzeger _{1,

3} , Helmut G Katzgraber ₃

Affiliation

Known for their ability to identify hidden patterns in data, artificial neural networks are among the most powerful machine learning tools. Most notably, neural networks have played a central role in identifying states of matter and phase transitions across condensed matter physics. To date, most studies have focused on systems where different phases of matter and their phase transitions are known, and thus the performance of neural networks is well controlled. While neural networks present an exciting new tool to detect new phases of matter, here we demonstrate that when the training sets are poisoned (i.e. poor training data or mislabeled data) it is easy for neural networks to make misleading predictions.

中文翻译：

物理学中的机器学习：有毒训练集的陷阱

人工神经网络以识别数据中隐藏模式的能力而闻名，是最强大的机器学习工具之一。最值得注意的是，神经网络在确定凝聚态物理学中的物质状态和相变方面发挥了核心作用。迄今为止，大多数研究都集中在已知物质的不同相及其相变的系统上，因此神经网络的性能得到了很好的控制。虽然神经网络提供了一种令人兴奋的新工具来检测物质的新阶段，但在这里我们证明，当训练集被毒化（即训练数据不正确或数据标签错误）时，神经网络很容易做出误导性的预测。

更新日期：2020-09-12

点击分享查看原文

点击收藏

公开下载

阅读更多本刊最新论文