BBoxDB: a distributed and highly available key-bounding-box-value store
Distributed and Parallel Databases (IF 1.2), Pub Date: 2019-11-15, DOI: 10.1007/s10619-019-07275-w
Jan Kristof Nidzwetzki, Ralf Hartmut Güting

BBoxDB is a distributed and highly available key-bounding-box-value store designed to handle multi-dimensional big data. To handle large amounts of data, the software splits the stored data into multi-dimensional shards and spreads them across a cluster of nodes. Unlike existing key-value stores, BBoxDB stores each value together with an n-dimensional, axis-parallel bounding box. The bounding box describes the spatial location of the value in an n-dimensional space. Multi-dimensional data can be retrieved using range queries, which are efficiently supported by indices. A space partitioner (e.g., a K-D Tree, a Quad-Tree or a Grid) splits the n-dimensional space into disjoint regions (distribution regions). Distribution regions are created dynamically, based on the stored data, so BBoxDB can handle growing and shrinking datasets. Data redistribution is performed in the background and does not affect the availability of the system; read and write access remains possible at any time. BBoxDB works with distribution groups: the data of all tables in a distribution group are distributed in the same way (co-partitioned). Spatial joins on co-partitioned tables can therefore be executed efficiently without data shuffling between nodes. BBoxDB supports spatial joins out of the box using the bounding boxes of the stored data. The joins are supported by a spatial index and executed in a distributed and parallel manner on the nodes of the cluster.
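
The data model described in the abstract can be illustrated with a small, single-node sketch. This is not BBoxDB's API: the names `BoundingBox`, `ToyBoxStore`, `range_query` and `spatial_join` are hypothetical, and a linear scan stands in for the spatial index and the distributed execution. It only shows how a key, an axis-parallel bounding box and a value travel together, and how range queries and joins can be answered from the boxes alone.

```python
from dataclasses import dataclass
from typing import Dict, Iterator, List, Tuple


@dataclass(frozen=True)
class BoundingBox:
    """Axis-parallel box: one (low, high) interval per dimension."""
    intervals: Tuple[Tuple[float, float], ...]

    def intersects(self, other: "BoundingBox") -> bool:
        # Boxes overlap only if their intervals overlap in every dimension.
        # Assumes both boxes have the same number of dimensions.
        return all(
            a_lo <= b_hi and b_lo <= a_hi
            for (a_lo, a_hi), (b_lo, b_hi) in zip(self.intervals, other.intervals)
        )


class ToyBoxStore:
    """Hypothetical in-memory table of (key, bounding box, value) tuples."""

    def __init__(self) -> None:
        self._tuples: Dict[str, Tuple[BoundingBox, bytes]] = {}

    def put(self, key: str, box: BoundingBox, value: bytes) -> None:
        self._tuples[key] = (box, value)

    def items(self) -> Iterator[Tuple[str, BoundingBox]]:
        for key, (box, _value) in self._tuples.items():
            yield key, box

    def range_query(self, query: BoundingBox) -> List[str]:
        # Linear scan; a real system would consult a spatial index instead.
        return sorted(key for key, box in self.items() if box.intersects(query))


def spatial_join(left: ToyBoxStore, right: ToyBoxStore) -> List[Tuple[str, str]]:
    """Pair up keys whose bounding boxes overlap (nested range queries)."""
    pairs = []
    for left_key, left_box in left.items():
        for right_key in right.range_query(left_box):
            pairs.append((left_key, right_key))
    return sorted(pairs)


if __name__ == "__main__":
    roads, forests = ToyBoxStore(), ToyBoxStore()
    roads.put("road_1", BoundingBox(((0.0, 5.0), (0.0, 1.0))), b"road geometry")
    roads.put("road_2", BoundingBox(((10.0, 12.0), (10.0, 11.0))), b"road geometry")
    forests.put("forest_a", BoundingBox(((4.0, 6.0), (0.5, 3.0))), b"forest geometry")

    print(roads.range_query(BoundingBox(((0.0, 1.0), (0.0, 1.0)))))
    print(spatial_join(roads, forests))
```

In this toy version the join simply runs one range query per tuple of the left table. The co-partitioning described above matters in the distributed setting, where both tables' tuples for a given region are already on the same node, so each node could run such a local join without fetching data from its peers.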

Updated: 2019-11-15