TaskTorrent: a Lightweight Distributed Task-Based Runtime System in C++
arXiv - CS - Distributed, Parallel, and Cluster Computing. Pub Date: 2020-09-22, DOI: arxiv-2009.10697
Léopold Cambier, Yizhou Qian, Eric Darve

We present TaskTorrent, a lightweight distributed task-based runtime in C++. TaskTorrent uses a parametrized task graph to express the task DAG and one-sided active messages to trigger remote tasks asynchronously. As a result, the task DAG is completely distributed and discovered in parallel. TaskTorrent is a C++14 library that depends only on MPI. We explain the API and the implementation. We perform a series of benchmarks against StarPU and ScaLAPACK. Microbenchmarks show that it has minimal overhead compared to other solutions. We then apply it to two large linear algebra problems. TaskTorrent scales very well to thousands of cores, exhibiting good weak and strong scaling.
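
To make the idea of a parametrized task graph concrete, here is a minimal single-process sketch in C++14. It is not TaskTorrent's API: the class `MiniTaskGraph`, its members `indegree`, `body`, `fulfill`, and `run`, and the helper `run_example` are all hypothetical names chosen for illustration. The sketch assumes tasks are identified by an integer index, that each task's dependency count is given by a function of that index rather than stored up front, and that a task is discovered only when its first dependency is fulfilled. In a distributed runtime, the call to `fulfill` for a task owned by another rank would presumably be carried by an active message; here everything runs in one process and one thread.

```cpp
#include <cassert>
#include <functional>
#include <queue>
#include <unordered_map>
#include <vector>

// Hypothetical illustration of a parametrized task graph (PTG).
// None of these names are taken from TaskTorrent.
class MiniTaskGraph {
public:
    std::function<int(int)> indegree;  // how many dependencies task k waits for
    std::function<void(int)> body;     // the work performed by task k

    // Signal that one dependency of task k has been satisfied.
    void fulfill(int k) {
        auto it = remaining_.find(k);
        if (it == remaining_.end()) {
            // First mention of task k: discover it lazily from its index.
            it = remaining_.emplace(k, indegree(k)).first;
        }
        assert(it->second > 0 && "task fulfilled more often than its in-degree");
        if (--(it->second) == 0) {
            ready_.push(k);  // all dependencies satisfied
        }
    }

    // Execute ready tasks until none are left.
    void run() {
        while (!ready_.empty()) {
            int k = ready_.front();
            ready_.pop();
            body(k);
        }
    }

private:
    std::unordered_map<int, int> remaining_;  // unsatisfied dependencies per task
    std::queue<int> ready_;                   // tasks whose dependencies are all met
};

// Example DAG: task k fulfills tasks k+1 and k+2 (when they exist).
// Tasks 0 and 1 wait for one dependency, all later tasks for two.
std::vector<int> run_example(int n) {
    MiniTaskGraph g;
    std::vector<int> order;  // records the order in which tasks executed
    g.indegree = [](int k) { return k < 2 ? 1 : 2; };
    g.body = [&](int k) {
        order.push_back(k);
        if (k + 1 < n) g.fulfill(k + 1);
        if (k + 2 < n) g.fulfill(k + 2);
    };
    if (n > 0) g.fulfill(0);  // seed the graph by fulfilling the root task
    g.run();
    return order;
}
```

Calling `run_example(5)` executes all five tasks, each one only after the tasks it depends on have finished. The graph is never stored as a whole: the dependency counter for a task is created on demand, which is what allows a DAG of this kind to be discovered piece by piece.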

Updated: 2020-09-23