Kokkos implementation of an Ewald Coulomb solver and analysis of performance portability
Journal of Parallel and Distributed Computing (IF 3.8), Pub Date: 2019-12-17, DOI: 10.1016/j.jpdc.2019.12.003
Rene Halver, Jan H. Meinke, Godehard Sutmann

We have implemented the computation of Coulomb interactions in particle systems using the performance-portable C++ framework Kokkos. The electrostatic interactions are computed with an Ewald summation. We use this implementation as the basis for a performance portability study. The target architectures are Intel CPUs, including Intel Xeon Phi, and Nvidia GPUs. As a measure of performance portability, we count the operations the computation requires, derive from them the number of cycles and hence the expected runtime, and compare this estimate with the measured runtime. The results indicate a similar quality of performance portability on all investigated architectures.




Updated: 2020-01-04