当前位置:
X-MOL 学术
›
arXiv.cs.MS
›
论文详情
Our official English website, www.x-mol.net, welcomes your
feedback! (Note: you will need to create a separate account there.)
Enabling GPU Accelerated Computing in the SUNDIALS Time Integration Library
arXiv - CS - Mathematical Software Pub Date : 2020-11-25 , DOI: arxiv-2011.12984 Cody J. Balos, David J. Gardner, Carol S. Woodward, Daniel R. Reynolds
arXiv - CS - Mathematical Software Pub Date : 2020-11-25 , DOI: arxiv-2011.12984 Cody J. Balos, David J. Gardner, Carol S. Woodward, Daniel R. Reynolds
As part of the Exascale Computing Project (ECP), a recent focus of
development efforts for the SUite of Nonlinear and DIfferential/ALgebraic
equation Solvers (SUNDIALS) has been to enable GPU-accelerated time integration
in scientific applications at extreme scales. This effort has resulted in
several new GPU-enabled implementations of core SUNDIALS data structures,
support for programming paradigms which are aware of the heterogeneous
architectures, and the introduction of utilities to provide new points of
flexibility. In this paper, we discuss our considerations, both internal and
external, when designing these new features and present the features
themselves. We also present performance results for several of the features on
the Summit supercomputer and early access hardware for the Frontier
supercomputer, which demonstrate negligible performance overhead resulting from
the additional infrastructure and significant speedups when using both NVIDIA
and AMD GPUs.
中文翻译:
在SUNDIALS时间集成库中启用GPU加速计算
作为Exascale计算项目(ECP)的一部分,非线性和微分/代数方程求解器(SUNDIALS)SUite的开发工作最近的重点是使GPU加速的时间积分可以在极端规模的科学应用中使用。这项工作已导致核心SUNDIALS数据结构的几种新的启用GPU的实现,对了解异构体系结构的编程范例的支持以及引入实用程序以提供新的灵活性。在本文中,我们将在设计这些新功能时讨论内部和外部的考虑因素,并介绍这些功能本身。我们还展示了Summit超级计算机上的一些功能以及Frontier超级计算机的早期访问硬件的性能结果,
更新日期:2020-12-01
中文翻译:
在SUNDIALS时间集成库中启用GPU加速计算
作为Exascale计算项目(ECP)的一部分,非线性和微分/代数方程求解器(SUNDIALS)SUite的开发工作最近的重点是使GPU加速的时间积分可以在极端规模的科学应用中使用。这项工作已导致核心SUNDIALS数据结构的几种新的启用GPU的实现,对了解异构体系结构的编程范例的支持以及引入实用程序以提供新的灵活性。在本文中,我们将在设计这些新功能时讨论内部和外部的考虑因素,并介绍这些功能本身。我们还展示了Summit超级计算机上的一些功能以及Frontier超级计算机的早期访问硬件的性能结果,