ArmorAll
ACM Transactions on Architecture and Code Optimization (IF 1.6), Pub Date: 2020-05-30, DOI: 10.1145/3382132
Charu Kalra, Fritz Previlon, Norm Rubin, David Kaeli

The vulnerability of GPUs to soft errors has become a first-class design concern as GPUs are increasingly used in accuracy-sensitive and safety-critical domains. Existing solutions for enhancing GPU reliability come with significant overhead in area, power, and/or performance. In this article, we propose ArmorAll, a lightweight, adaptive, selective, and portable software solution to protect GPUs against soft errors. ArmorAll consists of a set of purely compiler-based redundancy schemes designed to optimize instruction duplication on GPUs, thereby enabling much more reliable execution. The choice of scheme determines the subset of instructions that must be duplicated in an application, allowing fault coverage to be adapted to different applications. ArmorAll can automatically select the redundancy scheme that provides the best coverage for a given application, with a selection accuracy of 91.7%. With the selected redundancy scheme, ArmorAll delivers this high coverage while improving runtime by 64.5% on average compared to the state of the art.
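
To make the general idea of instruction duplication concrete, the following is a minimal Python sketch, not ArmorAll's actual schemes or compiler passes, which the abstract does not detail. It assumes a single-bit flip as the fault model and compares a "primary" computation against a recomputed "shadow" copy. All names (`flip_bit`, `saxpy_unprotected`, `saxpy_duplicated`, `SoftErrorDetected`) are hypothetical and chosen only for illustration.

```python
import struct


class SoftErrorDetected(Exception):
    """Raised when the primary and shadow results disagree."""


def flip_bit(x: float, bit: int) -> float:
    """Simulate a soft error by flipping one bit of a 64-bit float."""
    (as_int,) = struct.unpack("<Q", struct.pack("<d", x))
    (flipped,) = struct.unpack("<d", struct.pack("<Q", as_int ^ (1 << bit)))
    return flipped


def saxpy_unprotected(a: float, x: float, y: float, fault_bit=None) -> float:
    """Compute a*x + y once; an injected fault silently corrupts the result."""
    result = a * x + y
    if fault_bit is not None:
        result = flip_bit(result, fault_bit)  # assumed fault model: one bit flip
    return result


def saxpy_duplicated(a: float, x: float, y: float, fault_bit=None) -> float:
    """Compute a*x + y twice and compare, so a single fault is detected."""
    primary = a * x + y
    if fault_bit is not None:
        primary = flip_bit(primary, fault_bit)  # fault hits only the primary copy
    shadow = a * x + y  # duplicated instruction
    if primary != shadow:
        raise SoftErrorDetected(f"primary={primary!r} shadow={shadow!r}")
    return primary


if __name__ == "__main__":
    print(saxpy_unprotected(2.0, 3.0, 1.0, fault_bit=52))  # silently wrong value
    try:
        saxpy_duplicated(2.0, 3.0, 1.0, fault_bit=52)
    except SoftErrorDetected as err:
        print("detected:", err)
```

The duplicated version pays for a second computation and a comparison on every protected operation. That cost is why choosing which subset of instructions to duplicate, as the abstract describes, matters for runtime.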

Updated: 2020-05-30