Interpreting and Boosting Dropout from a Game-Theoretic View,arXiv - CS - Computer Vision and Pattern Recognition

当前位置： X-MOL 学术 › arXiv.cs.CV › 论文详情

Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)

Interpreting and Boosting Dropout from a Game-Theoretic View
arXiv - CS - Computer Vision and Pattern Recognition Pub Date : 2020-09-24 , DOI: arxiv-2009.11729
Hao Zhang, Sen Li, Yinchao Ma, Mingjie Li, Yichen Xie, Quanshi Zhang

This paper aims to understand and improve the utility of the dropout operation from the perspective of game-theoretic interactions. We prove that dropout can suppress the strength of interactions between input variables of deep neural networks (DNNs). The theoretic proof is also verified by various experiments. Furthermore, we find that such interactions were strongly related to the over-fitting problem in deep learning. Thus, the utility of dropout can be regarded as decreasing interactions to alleviate the significance of over-fitting. Based on this understanding, we propose an interaction loss to further improve the utility of dropout. Experimental results have shown that the interaction loss can effectively improve the utility of dropout and boost the performance of DNNs.

中文翻译：

从博弈论的角度解释和促进辍学

本文旨在从博弈论交互的角度理解和改进 dropout 操作的效用。我们证明 dropout 可以抑制深度神经网络 (DNN) 的输入变量之间的相互作用强度。理论证明也得到了各种实验的验证。此外，我们发现这种相互作用与深度学习中的过度拟合问题密切相关。因此，可以将 dropout 的效用视为减少交互以减轻过拟合的重要性。基于这种理解，我们提出了一种交互损失，以进一步提高 dropout 的效用。实验结果表明，交互损失可以有效提高 dropout 的效用并提升 DNN 的性能。

更新日期：2020-10-13

点击分享查看原文

点击收藏

阅读更多本刊最新论文