当前位置: X-MOL 学术J. Comput. Sci. Tech. › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
Gfarm/BB — Gfarm File System for Node-Local Burst Buffer
Journal of Computer Science and Technology ( IF 1.9 ) Pub Date : 2020-01-01 , DOI: 10.1007/s11390-020-9803-z
Osamu Tatebe , Shukuko Moriwake , Yoshihiro Oyama

Burst buffer has become a major component to meet the I/O performance requirement of HPC bursty traffic. This paper proposes Gfarm/BB that is a file system for a burst buffer efficiently exploiting node-local storage systems. Although node-local storages improve storage performance, they are only available during the job allocation. Gfarm/BB should have better access and metadata performance while it should be constructed on-demand before the job execution. To improve the read and write performance, it exploits the file descriptor passing and remote direct memory access (RDMA). It improves the metadata performance by omitting the persistency and the redundancy since it is a temporal file system. Using RDMA, writes and reads bandwidth are improved by 1.7x and 2.2x compared with IP over InfiniBand (IPoIB), respectively. It achieves 14 700 operations per second in the directory creation performance, which is 13.4x faster than the fully persistent and redundant case. The construction of Gfarm/BB takes 0.31 seconds using 2 nodes. IOR benchmark and ARGOT-IO application I/O benchmark show the scalable performance improvement by exploiting the locality of node-local storages. Compared with BeeOND, Gfarm/BB shows 2.6x and 2.4x better performance in IOR write and read benchmarks, respectively, and it shows 2.5x better performance in ARGOT-IO.

中文翻译:

Gfarm/BB — 用于节点本地突发缓冲区的 Gfarm 文件系统

突发缓冲区已成为满足 HPC 突发流量 I/O 性能要求的主要组件。本文提出了 Gfarm/BB,它是一种用于有效利用节点本地存储系统的突发缓冲区的文件系统。虽然节点本地存储提高了存储性能,但它们仅在作业分配期间可用。Gfarm/BB 应该有更好的访问和元数据性能,同时它应该在作业执行之前按需构建。为了提高读写性能,它利用了文件描述符传递和远程直接内存访问(RDMA)。由于它是一个临时文件系统,它通过省略持久性和冗余来提高元数据性能。使用 RDMA,与 IP over InfiniBand (IPoIB) 相比,写入和读取带宽分别提高了 1.7 倍和 2.2 倍。它在目录创建性能上实现了每秒 14 700 次操作,比完全持久和冗余的情况快 13.4 倍。使用 2 个节点构建 Gfarm/BB 需要 0.31 秒。IOR 基准测试和 ARGOT-IO 应用程序 I/O 基准测试通过利用节点本地存储的局部性显示了可扩展的性能改进。与 BeeOND 相比,Gfarm/BB 在 IOR 写入和读取基准测试中的性能分别提高了 2.6 倍和 2.4 倍,在 ARGOT-IO 中的性能提高了 2.5 倍。
更新日期:2020-01-01
down
wechat
bug