当前位置: X-MOL 学术Computing › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
Sensitivity of computational fluid dynamics simulations against soft errors
Computing ( IF 3.3 ) Pub Date : 2021-07-13 , DOI: 10.1007/s00607-021-00976-0
E. Fatih Yetkin 1 , Şenol Pişkin 2, 3
Affiliation  

Abstract

Computational capabilities of the largest high performance computing systems have increased by more than 100 folds in the last 10 years and keep increasing substantially every year. This increase is made possible mostly by multi-core technology besides the increase in clock speed of CPUs. Nowadays, there are systems with more than 100 thousand cores installed and available for processing simultaneously. Computational simulation tools are always in need of more than available computational sources. This is the case for especially complex, large scale flow problems. For these large scale problems, the soft error tolerance of the simulation codes should also be encountered where it is not an issue in relatively small scale problems due to the low occurrence probabilities. In this study, we analyzed the reaction of an incompressible flow solver to randomly generated soft errors at several levels of computation. Soft errors are induced into the final global assembly matrix of the solver by manipulating predetermined bit-flip operations. Behaviour of the computational fluid dynamics (CFD) solver is observed after iterative matrix solver, flow convergence and CFD iterations. Results show that the iterative solvers of CFD matrices are highly sensitive to customized soft errors while the final solutions seem more intact to bit-flip operations. But, the solutions might still differ from the real physical results depending on the bit-flip location and iteration number. So, the next generation computing platforms and codes should be designed to be able to detect bit-flip operations and be designed bit-flip resistant.

Graphic abstract



中文翻译:

计算流体动力学模拟对软错误的敏感性

摘要

在过去的 10 年中,最大的高性能计算系统的计算能力增加了 100 多倍,并且每年都在大幅增加。除了 CPU 时钟速度的增加之外,这种增加主要是通过多核技术实现的。如今,已安装了超过 10 万个内核并可同时处理的系统。计算仿真工具需要的不仅仅是可用的计算资源。这是特别复杂的大规模流动问题的情况。对于这些大规模问题,模拟代码的软错误容限也应该遇到,因为它在较小规模的问题中不是问题,因为发生概率较低。在这项研究中,我们分析了不可压缩的流动求解器在多个计算级别对随机生成的软错误的反应。通过操纵预定的位翻转操作,软错误被引入到求解器的最终全局组装矩阵中。在迭代矩阵求解器、流动收敛和 CFD 迭代之后观察计算流体动力学 (CFD) 求解器的行为。结果表明,CFD 矩阵的迭代求解器对自定义软错误高度敏感,而最终解决方案似乎对位翻转操作更完整。但是,根据位翻转位置和迭代次数,解决方案可能仍与实际物理结果有所不同。因此,下一代计算平台和代码应设计为能够检测位翻转操作并设计为抗位翻转。

图形摘要

更新日期:2021-07-13
down
wechat
bug