Journal of Parallel and Distributed Computing (IF 3.8), Pub Date: 2019-12-17, DOI: 10.1016/j.jpdc.2019.12.003
Rene Halver, Jan H. Meinke, Godehard Sutmann
We have implemented the computation of Coulomb interactions in particle systems with the performance-portable C++ framework Kokkos, using Ewald summation to evaluate the electrostatic interactions. We use this implementation as the basis for a performance portability study. The target architectures are Intel CPUs, including Intel Xeon Phi, and Nvidia GPUs. As a measure of performance portability, we count the required operations and cycles to estimate a runtime and compare this estimate with the measured runtime. The results indicate a similar quality of performance portability on all investigated architectures.
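For readers unfamiliar with Ewald summation, the following is a minimal serial sketch in plain C++, not the Kokkos implementation described in the abstract. It assumes a charge-neutral cubic box, Gaussian units, and plain loop truncation instead of tuned cutoffs; the names `Particle`, `ewald_energy`, `nacl_cell`, and all parameters are hypothetical and chosen only for illustration.

```cpp
#include <array>
#include <cassert>
#include <cmath>
#include <cstddef>
#include <vector>

// Hypothetical types and names for illustration; not taken from the paper.
struct Particle {
    std::array<double, 3> r;  // position inside the cubic box [0, L)^3
    double q;                 // charge
};

// Ewald energy (Gaussian units) of a charge-neutral periodic cubic box of edge L.
// alpha: splitting parameter, n_real: number of real-space image shells,
// k_max: largest integer component of the reciprocal lattice vectors.
double ewald_energy(const std::vector<Particle>& p, double L, double alpha,
                    int n_real, int k_max) {
    assert(L > 0.0 && alpha > 0.0);
    const double pi = std::acos(-1.0);
    const std::size_t N = p.size();

    // Real-space part: erfc-screened pair sum over periodic images.
    double e_real = 0.0;
    for (std::size_t i = 0; i < N; ++i)
        for (std::size_t j = 0; j < N; ++j)
            for (int nx = -n_real; nx <= n_real; ++nx)
                for (int ny = -n_real; ny <= n_real; ++ny)
                    for (int nz = -n_real; nz <= n_real; ++nz) {
                        // Skip only the interaction of a particle with itself
                        // in the central cell; its images do contribute.
                        if (i == j && nx == 0 && ny == 0 && nz == 0) continue;
                        const double dx = p[i].r[0] - p[j].r[0] + nx * L;
                        const double dy = p[i].r[1] - p[j].r[1] + ny * L;
                        const double dz = p[i].r[2] - p[j].r[2] + nz * L;
                        const double r = std::sqrt(dx * dx + dy * dy + dz * dz);
                        e_real += 0.5 * p[i].q * p[j].q * std::erfc(alpha * r) / r;
                    }

    // Reciprocal-space part: Gaussian-damped sum over k != 0 of |S(k)|^2 / k^2.
    double e_recip = 0.0;
    const double volume = L * L * L;
    for (int kx = -k_max; kx <= k_max; ++kx)
        for (int ky = -k_max; ky <= k_max; ++ky)
            for (int kz = -k_max; kz <= k_max; ++kz) {
                if (kx == 0 && ky == 0 && kz == 0) continue;
                const double gx = 2.0 * pi * kx / L;
                const double gy = 2.0 * pi * ky / L;
                const double gz = 2.0 * pi * kz / L;
                const double k2 = gx * gx + gy * gy + gz * gz;
                double s_re = 0.0, s_im = 0.0;  // structure factor S(k)
                for (const Particle& a : p) {
                    const double phase = gx * a.r[0] + gy * a.r[1] + gz * a.r[2];
                    s_re += a.q * std::cos(phase);
                    s_im += a.q * std::sin(phase);
                }
                e_recip += (2.0 * pi / volume) * std::exp(-k2 / (4.0 * alpha * alpha)) / k2 *
                           (s_re * s_re + s_im * s_im);
            }

    // Self-energy correction: removes each Gaussian cloud's interaction with itself.
    double e_self = 0.0;
    for (const Particle& a : p) e_self -= alpha / std::sqrt(pi) * a.q * a.q;

    return e_real + e_recip + e_self;
}

// Rock-salt test cell: 8 unit charges on a simple cubic grid with spacing 1 (L = 2).
std::vector<Particle> nacl_cell() {
    std::vector<Particle> cell;
    for (int ix = 0; ix < 2; ++ix)
        for (int iy = 0; iy < 2; ++iy)
            for (int iz = 0; iz < 2; ++iz) {
                const double q = ((ix + iy + iz) % 2 == 0) ? 1.0 : -1.0;
                cell.push_back({{double(ix), double(iy), double(iz)}, q});
            }
    return cell;
}
```

As a sanity check, `ewald_energy(nacl_cell(), 2.0, 2.0, 2, 6)` should come out close to -4 times the NaCl Madelung constant (about 1.7476), and the result should not depend on `alpha` once both sums are converged. In a Kokkos port, the particle and k-vector loops would presumably be expressed as parallel reductions, but how the authors actually structure them is not stated here.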
Title: Kokkos implementation of an Ewald Coulomb solver and analysis of performance portability