当前位置: X-MOL 学术arXiv.cs.DC › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
The PetscSF Scalable Communication Layer
arXiv - CS - Distributed, Parallel, and Cluster Computing Pub Date : 2021-02-25 , DOI: arxiv-2102.13018
Junchao Zhang, Jed Brown, Satish Balay, Jacob Faibussowitsch, Matthew Knepley, Oana Marin, Richard Tran Mills, Todd Munson, Barry F. Smith, Stefano Zampini

PetscSF, the communication component of the Portable, Extensible Toolkit for Scientific Computation (PETSc), is being used to gradually replace the direct MPI calls in the PETSc library. PetscSF provides a simple application programming interface (API) for managing common communication patterns in scientific computations by using a star-forest graph representation. PetscSF supports several implementations whose selection is based on the characteristics of the application or the target architecture. An efficient and portable model for network and intra-node communication is essential for implementing large-scale applications. The Message Passing Interface, which has been the de facto standard for distributed memory systems, has developed into a large complex API that does not yet provide high performance on the emerging heterogeneous CPU-GPU-based exascale systems. In this paper, we discuss the design of PetscSF, how it can overcome some difficulties of working directly with MPI with GPUs, and we demonstrate its performance, scalability, and novel features.

中文翻译:

PetscSF可扩展通信层

PetscSF是便携式可扩展科学计算工具(PETSc)的通信组件,用于逐步替换PETSc库中的直接MPI调用。PetscSF提供了一个简单的应用程序编程接口(API),用于通过使用星状图表示法来管理科学计算中的常见通信模式。PetscSF支持几种实现,这些实现的选择基于应用程序或目标体系结构的特征。用于网络和节点内通信的有效且可移植的模型对于实现大规模应用程序至关重要。消息传递接口已成为分布式存储系统的事实上的标准,已发展成为一个大型复杂的API,该API尚未在新兴的基于异构CPU-GPU的亿亿级系统上提供高性能。在本文中,我们讨论了PetscSF的设计,如何克服直接与MPI配合使用GPU的一些困难,并展示了其性能,可伸缩性和新颖的功能。
更新日期:2021-02-26
down
wechat
bug