当前位置:
X-MOL 学术
›
arXiv.cs.DC
›
论文详情
Our official English website, www.x-mol.net, welcomes your
feedback! (Note: you will need to create a separate account there.)
HALO 1.0: A Hardware-agnostic Accelerator Orchestration Framework for Enabling Hardware-agnostic Programming with True Performance Portability for Heterogeneous HPC
arXiv - CS - Distributed, Parallel, and Cluster Computing Pub Date : 2020-11-22 , DOI: arxiv-2011.10896 Michael Riera, Erfan Bank Tavakoli, Masudul Hassan Quraishi, Fengbo Ren
arXiv - CS - Distributed, Parallel, and Cluster Computing Pub Date : 2020-11-22 , DOI: arxiv-2011.10896 Michael Riera, Erfan Bank Tavakoli, Masudul Hassan Quraishi, Fengbo Ren
Hardware-agnostic programming with high performance portability will be the
bedrock for realizing the ubiquitous adoption of emerging accelerator
technologies in future heterogeneous high-performance computing (HPC) systems,
which is the key to achieving the next level of HPC performance on an expanding
accelerator landscape. In this paper, we present HALO 1.0, an open-ended
extensible multi-agent software framework, that implements a set of proposed
hardware-agnostic accelerator orchestration (HALO) principles and a novel
compute-centric message passing interface (C^2MPI) specification for enabling
the portable and performance-optimized execution of hardware-agnostic
application codes across heterogeneous accelerator resources. The experiment
results of evaluating eight widely used HPC subroutines based on Intel Xeon
E5-2620 v4 CPUs, Intel Arria 10 GX FPGAs, and NVIDIA GeForce RTX 2080 Ti GPUs
show that HALO 1.0 allows the same hardware-agnostic application codes of the
HPC kernels, without any change, to run across all the computing devices with a
consistently maximum performance portability score of 1.0, which is 2x-861,883x
higher than the OpenCL-based solution that suffers from an unstably low
performance portability score.
中文翻译:
HALO 1.0:硬件无关的加速器编排框架,可为异构HPC启用具有真正性能可移植性的硬件无关编程
与硬件无关的编程和高性能的可移植性将成为在未来的异构高性能计算(HPC)系统中普遍采用新兴加速器技术的基石,这是在不断扩展的加速器格局上实现更高级别的HPC性能的关键。在本文中,我们介绍了HALO 1.0,这是一个开放式的可扩展多代理软件框架,该框架实现了一组建议的与硬件无关的加速器编排(HALO)原理和一种新颖的以计算为中心的消息传递接口(C ^ 2MPI)规范用于跨异构加速器资源实现与硬件无关的应用程序代码的可移植且性能优化的执行。实验结果评估了基于Intel Xeon E5-2620 v4 CPU的八个广泛使用的HPC子例程,
更新日期:2020-11-25
中文翻译:
HALO 1.0:硬件无关的加速器编排框架,可为异构HPC启用具有真正性能可移植性的硬件无关编程
与硬件无关的编程和高性能的可移植性将成为在未来的异构高性能计算(HPC)系统中普遍采用新兴加速器技术的基石,这是在不断扩展的加速器格局上实现更高级别的HPC性能的关键。在本文中,我们介绍了HALO 1.0,这是一个开放式的可扩展多代理软件框架,该框架实现了一组建议的与硬件无关的加速器编排(HALO)原理和一种新颖的以计算为中心的消息传递接口(C ^ 2MPI)规范用于跨异构加速器资源实现与硬件无关的应用程序代码的可移植且性能优化的执行。实验结果评估了基于Intel Xeon E5-2620 v4 CPU的八个广泛使用的HPC子例程,