当前位置: X-MOL 学术J. Comput. Phys. › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
Pass-efficient methods for compression of high-dimensional turbulent flow data
Journal of Computational Physics ( IF 3.8 ) Pub Date : 2020-07-09 , DOI: 10.1016/j.jcp.2020.109704
Alec M. Dunton , Lluís Jofre , Gianluca Iaccarino , Alireza Doostan

The future of high-performance computing, specifically on future Exascale computers, will presumably see memory capacity and bandwidth fail to keep pace with data generated, for instance, from massively parallel partial differential equation (PDE) systems. Current strategies proposed to address this bottleneck entail the omission of large fractions of data, as well as the incorporation of in situ compression algorithms to avoid overuse of memory. To ensure that post-processing operations are successful, this must be done in a way that a sufficiently accurate representation of the solution is stored. Moreover, in situations where the input/output system becomes a bottleneck in analysis, visualization, etc., or the execution of the PDE solver is expensive, the number of passes made over the data must be minimized. In the interest of addressing this problem, this work focuses on the utility of pass-efficient, parallelizable, low-rank, matrix decomposition methods in compressing high-dimensional simulation data from turbulent flows. A particular emphasis is placed on using coarse representation of the data – compatible with the PDE discretization grid – to accelerate the construction of the low-rank factorization. This includes the presentation of a novel single-pass matrix decomposition algorithm for computing the so-called interpolative decomposition. The methods are described extensively and numerical experiments on two turbulent channel flow data are performed. In the first (unladen) channel flow case, compression factors exceeding 400 are achieved while maintaining accuracy with respect to first- and second-order flow statistics. In the particle-laden case, compression factors of 100 are achieved and the compressed data is used to recover particle velocities. These results show that these compression methods can enable efficient computation of various quantities of interest in both the carrier and disperse phases.



中文翻译:

高通量湍流数据压缩的传递有效方法

高性能计算的未来,特别是在未来的Exascale计算机上,可能会看到内存容量和带宽无法跟上例如从大规模并行偏微分方程(PDE)系统生成的数据的步伐。为解决此瓶颈而提出的当前策略需要遗漏大量数据,以及原位整合压缩算法,以避免过度使用内存。为了确保后处理操作成功,必须以存储解决方案足够准确的表示的方式进行。此外,在输入/输出系统成为分析,可视化等方面的瓶颈,或者PDE求解器的执行成本很高的情况下,必须最小化对数据的遍历次数。为了解决这个问题,这项工作着眼于通过效率高,可并行化,低秩的矩阵分解方法在从湍流压缩高维模拟数据中的实用性。特别强调的是使用数据的粗略表示法(与PDE离散化网格兼容)来加快低秩分解的构建。这包括介绍一种新颖的单遍矩阵分解算法,用于计算所谓的内插分解。对该方法进行了详尽的描述,并对两个湍流通道流量数据进行了数值实验。在第一(无载)通道流动情况下,在保持一阶和二阶流动统计信息的准确性的同时,获得了超过400的压缩系数。在充满粒子的情况下,压缩系数达到100,并且压缩后的数据用于恢复粒子速度。这些结果表明,这些压缩方法可以有效地计算出载体相和分散相中各种感兴趣的量。对该方法进行了详尽的描述,并对两个湍流通道流量数据进行了数值实验。在第一(无载)通道流动情况下,在保持一阶和二阶流动统计信息的准确性的同时,获得了超过400的压缩系数。在充满粒子的情况下,压缩系数达到100,并且压缩后的数据用于恢复粒子速度。这些结果表明,这些压缩方法可以有效地计算载体和分散相中各种感兴趣的量。对该方法进行了详细描述,并对两个湍流通道流量数据进行了数值实验。在第一(无载)通道流动情况下,在保持一阶和二阶流动统计信息的准确性的同时,获得了超过400的压缩系数。在充满粒子的情况下,压缩系数达到100,并且压缩后的数据用于恢复粒子速度。这些结果表明,这些压缩方法可以有效地计算载体和分散相中各种感兴趣的量。在充满粒子的情况下,压缩系数达到100,并且压缩后的数据用于恢复粒子速度。这些结果表明,这些压缩方法可以有效地计算出载体相和分散相中各种感兴趣的量。在充满粒子的情况下,压缩系数达到100,并且压缩后的数据用于恢复粒子速度。这些结果表明,这些压缩方法可以有效地计算出载体相和分散相中各种感兴趣的量。

更新日期:2020-07-10
down
wechat
bug