当前位置: X-MOL 学术Comput. Netw. › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
On the performance investigation of a fast optical switches based optical high performance computing infrastructure
Computer Networks ( IF 5.6 ) Pub Date : 2021-08-06 , DOI: 10.1016/j.comnet.2021.108349
Fulong Yan 1, 2 , Hugo Meyer 3 , Changshun Yuan 4 , Xuwei Xue 2 , Bitao Pan 2 , Xiaotao Guo 2 , Nicola Calabretta 2 , Chongjin Xie 5
Affiliation  

Optical networks based on fast optical switches (FOSes) could potentially solve the latency, bandwidth, cost and power consumption challenges in current electrical switches (ESes) based high performance computing (HPC) networks. In this work, we present a novel HPC network which employs distributed FOS interconnecting by removing ES (Firefly). In Firefly, Dragonfly topology is adopted for the inter-group connection of blades, while the intra-group connection of blades is implemented by FOS with fast optical flow control. The Firefly exploits the wavelength, space, and time switching domain with nanoseconds reconfiguration time of the FOS to achieve efficient statistical multiplexing operation. We numerically investigate the Firefly performance with real HPC traffic traces collected by running multiple computing applications in MareNostrum 3 HPC infrastructure with Leaf-Spine architecture. Compared with Leaf Spine architecture, results show that Firefly performs 62.4%, 54%, 68.6%, and 71.8% less latency for the applications Conjugate Gradient (CG), Multi-Grid (MG), Multiple Instruction Lattice Computation (MILC), and Miniature Molecular Dynamics (MINI_MD), respectively. Moreover, Firefly can save 56.4% cost and 65.7% power consumption, respectively, with respect to Leaf-Spine when both support around 10,000 blades.



中文翻译:

在一个快速的光开关的性能研究基于光学的高性能计算基础设施

基于快速光学开关(FOSes)光网络可能解决在基于高性能计算(HPC)网络的电流的电气开关(ESES)的等待时间,带宽,成本和功耗的挑战。在这项工作中,我们提出了一个新颖的HPC网络,其采用分布式FOS通过去除ES(萤火虫)互连。在萤火虫,蜻蜓拓扑采用叶片的组间连接,而叶片的组内连接是由FOS具有快速光流控制来实现。萤火虫利用了波长,空间和时间切换与FOS的纳秒重构时域实现高效统计复用操作。我们调查数字与运行在计算机MareNostrum 3 HPC基础设施与叶脊柱架构多个计算应用程序收集实时交通HPC痕迹萤火虫性能。与叶脊柱架构相比,结果表明,萤火虫的应用共轭梯度(CG),多重网格(MG),多指令格计算(MILC)进行62.4%,54%,68.6%,和71.8%以下的延迟,和微型分子动力学(MINI_MD),分别。此外,萤火虫可以节省56.4%的成本和65.7%的功耗,相对于叶脊柱约10,000刀片都支持的时候。多指令格计算(MILC),分别微型分子动力学(MINI_MD)。此外,萤火虫可以节省56.4%的成本和65.7%的功耗,相对于叶脊柱约10,000刀片都支持的时候。多指令格计算(MILC),分别微型分子动力学(MINI_MD)。此外,萤火虫可以节省56.4%的成本和65.7%的功耗,相对于叶脊柱约10,000刀片都支持的时候。

更新日期:2021-08-15
down
wechat
bug