SW-LZMA: Parallel Implementation of LZMA Based on SW26010 Many-Core Processor,Wireless Communications and Mobile Computing

当前位置： X-MOL 学术 › Wirel. Commun. Mob. Comput. › 论文详情

Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)

SW-LZMA: Parallel Implementation of LZMA Based on SW26010 Many-Core Processor
Wireless Communications and Mobile Computing ( IF 2.146 ) Pub Date : 2021-09-18 , DOI: 10.1155/2021/4486494
Bingzheng Li ₁ , Jinchen Xu ₁ , Zijing Liu ₁

Affiliation

With the development of high-performance computing and big data applications, the scale of data transmitted, stored, and processed by high-performance computing cluster systems is increasing explosively. Efficient compression of large-scale data and reducing the space required for data storage and transmission is one of the keys to improving the performance of high-performance computing cluster systems. In this paper, we present SW-LZMA, a parallel design and optimization of LZMA based on the Sunway 26010 heterogeneous many-core processor. Combined with the characteristics of SW26010 processors, we analyse the storage space requirements, memory access characteristics, and hotspot functions of the LZMA algorithm and implement the thread-level parallelism of the LZMA algorithm based on Athread interface. Furthermore, we make a fine-grained layout of LDM address space to achieve DMA double buffer cyclic sliding window algorithm, which optimizes the performance of SW-LZMA. The experimental results show that compared with the serial baseline implementation of LZMA, the parallel LZMA algorithm obtains a maximum speedup ratio of 4.1 times using the Silesia corpus benchmark, while on the large-scale data set, speedup is 5.3 times.

中文翻译：

SW-LZMA：基于SW26010众核处理器的LZMA并行实现

随着高性能计算和大数据应用的发展，高性能计算集群系统传输、存储和处理的数据规模呈爆炸式增长。对大规模数据进行高效压缩，减少数据存储和传输所需的空间，是提高高性能计算集群系统性能的关键之一。在本文中，我们介绍了 SW-LZMA，这是一种基于 Sunway 26010 异构众核处理器的 LZMA 的并行设计和优化。结合SW26010处理器的特点，分析了LZMA算法的存储空间需求、内存访问特性、热点功能，并基于Athread接口实现了LZMA算法的线程级并行。此外，我们对 LDM 地址空间进行了细粒度布局，实现了 DMA 双缓冲区循环滑动窗口算法，优化了 SW-LZMA 的性能。实验结果表明，与 LZMA 的串行基线实现相比，并行 LZMA 算法使用 Silesia 语料库基准获得最大加速比为 4.1 倍，而在大规模数据集上，加速比为 5.3 倍。

更新日期：2021-09-20

点击分享查看原文

点击收藏

公开下载

阅读更多本刊最新论文

全部期刊列表>>