当前位置: X-MOL 学术Distrib. Parallel. Databases › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
On the necessity of explicit cross-layer data formats in near-data processing systems
Distributed and Parallel Databases ( IF 1.2 ) Pub Date : 2021-03-16 , DOI: 10.1007/s10619-021-07328-z
Lukas Weber , Tobias Vinçon , Christian Knödler , Leonardo Solis-Vasquez , Arthur Bernhardt , Ilia Petrov , Andreas Koch

Massive data transfers in modern data-intensive systems resulting from low data-locality and data-to-code system design hurt their performance and scalability. Near-Data processing (NDP) and a shift to code-to-data designs may represent a viable solution as packaging combinations of storage and compute elements on the same device has become feasible. The shift towards NDP system architectures calls for revision of established principles. Abstractions such as data formats and layouts typically spread multiple layers in traditional DBMS, the way they are processed is encapsulated within these layers of abstraction. The NDP-style processing requires an explicit definition of cross-layer data formats and accessors to ensure in-situ executions optimally utilizing the properties of the underlying NDP storage and compute elements. In this paper, we make the case for such data format definitions and investigate the performance benefits under RocksDB and the COSMOS hardware platform.



中文翻译:

关于近数据处理系统中显式跨层数据格式的必要性

低数据局部性和数据编码系统设计导致的现代数据密集型系统中的大量数据传输损害了它们的性能和可伸缩性。随着同一设备上存储和计算元素的打包组合变得可行,近数据处理(NDP)和向代码到数据设计的转变可能代表了可行的解决方案。向NDP系统架构的转变要求修订既定原则。数据格式和布局等抽象通常在传统DBMS中分散多个层,它们的处理方式封装在这些抽象层中。NDP样式的处理需要对跨层数据格式和访问器进行明确定义,以确保最佳地利用基础NDP存储和计算元素的属性来进行原位执行。在本文中,我们对这种数据格式定义进行了论证,并研究了RocksDB和COSMOS硬件平台下的性能优势。

更新日期:2021-03-16
down
wechat
bug