The Accuracy and Efficiency of Posit Arithmetic,arXiv - CS - Performance

当前位置： X-MOL 学术 › arXiv.cs.PF › 论文详情

Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)

The Accuracy and Efficiency of Posit Arithmetic
arXiv - CS - Performance Pub Date : 2021-09-16 , DOI: arxiv-2109.08225
Stefan Dan Ciocirlan, Dumitrel Loghin, Lavanya Ramapantulu, Nicolae Tapus, Yong Meng Teo

Motivated by the increasing interest in the posit numeric format, in this paper we evaluate the accuracy and efficiency of posit arithmetic in contrast to the traditional IEEE 754 32-bit floating-point (FP32) arithmetic. We first design and implement a Posit Arithmetic Unit (PAU), called POSAR, with flexible bit-sized arithmetic suitable for applications that can trade accuracy for savings in chip area. Next, we analyze the accuracy and efficiency of POSAR with a series of benchmarks including mathematical computations, ML kernels, NAS Parallel Benchmarks (NPB), and Cifar-10 CNN. This analysis is done on our implementation of POSAR integrated into a RISC-V Rocket Chip core in comparison with the IEEE 754-based Floting Point Unit (FPU) of Rocket Chip. Our analysis shows that POSAR can outperform the FPU, but the results are not spectacular. For NPB, 32-bit posit achieves better accuracy than FP32 and improves the execution by up to 2%. However, POSAR with 32-bit posit needs 30% more FPGA resources compared to the FPU. For classic ML algorithms, we find that 8-bit posits are not suitable to replace FP32 because they exhibit low accuracy leading to wrong results. Instead, 16-bit posit offers the best option in terms of accuracy and efficiency. For example, 16-bit posit achieves the same Top-1 accuracy as FP32 on a Cifar-10 CNN with a speedup of 18%.

中文翻译：

Posit 算法的准确性和效率

由于对 posit 数字格式的兴趣日益增加，在本文中，我们评估了 posit 算法与传统 IEEE 754 32 位浮点 (FP32) 算法相比的准确性和效率。我们首先设计并实现了一个名为 POSAR 的 Posit Arithmetic Unit (PAU)，它具有灵活的位大小算法，适用于可以牺牲准确性以节省芯片面积的应用。接下来，我们使用一系列基准分析 POSAR 的准确性和效率，包括数学计算、ML 内核、NAS 并行基准 (NPB) 和 Cifar-10 CNN。与基于 IEEE 754 的火箭芯片浮点单元 (FPU) 相比，我们对集成到 RISC-V 火箭芯片内核中的 POSAR 实现进行了分析。我们的分析表明，POSAR 可以胜过 FPU，但结果并不出色。对于 NPB，32 位 posit 实现了比 FP32 更好的准确度，并提高了高达 2% 的执行力。但是，与 FPU 相比，具有 32 位 posit 的 POSAR 需要多 30% 的 FPGA 资源。对于经典 ML 算法，我们发现 8 位位置不适合替代 FP32，因为它们表现出低精度导致错误结果。相反，16 位定位提供了准确性和效率方面的最佳选择。例如，16 位 posit 在 Cifar-10 CNN 上实现了与 FP32 相同的 Top-1 精度，速度提升了 18%。16 位定位提供了准确性和效率方面的最佳选择。例如，16 位 posit 在 Cifar-10 CNN 上实现了与 FP32 相同的 Top-1 精度，速度提升了 18%。16 位定位提供了准确性和效率方面的最佳选择。例如，16 位 posit 在 Cifar-10 CNN 上实现了与 FP32 相同的 Top-1 精度，速度提升了 18%。

更新日期：2021-09-20

点击分享查看原文

点击收藏

阅读更多本刊最新论文