当前位置: X-MOL 学术arXiv.cs.DB › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
Dynamic Interleaving of Content and Structure for Robust Indexing of Semi-Structured Hierarchical Data (Extended Version)
arXiv - CS - Databases Pub Date : 2020-06-09 , DOI: arxiv-2006.05134
Kevin Wellenzohn, Michael H. B\"ohlen, Sven Helmer

We propose a robust index for semi-structured hierarchical data that supports content-and-structure (CAS) queries specified by path and value predicates. At the heart of our approach is a novel dynamic interleaving scheme that merges the path and value dimensions of composite keys in a balanced way. We store these keys in our trie-based Robust Content-And-Structure index, which efficiently supports a wide range of CAS queries, including queries with wildcards and descendant axes. Additionally, we show important properties of our scheme, such as robustness against varying selectivities, and demonstrate improvements of up to two orders of magnitude over existing approaches in our experimental evaluation.

中文翻译:

内容和结构的动态交织,用于半结构化分层数据的稳健索引(扩展版)

我们为半结构化分层数据提出了一个强大的索引,该索引支持由路径和值谓词指定的内容和结构 (CAS) 查询。我们方法的核心是一种新颖的动态交织方案,它以平衡的方式合并复合键的路径和值维度。我们将这些键存储在我们基于 trie 的 Robust Content-And-Structure 索引中,该索引有效地支持广泛的 CAS 查询,包括带有通配符和后代轴的查询。此外,我们展示了我们方案的重要特性,例如对不同选择性的鲁棒性,并在我们的实验评估中展示了比现有方法最多两个数量级的改进。
更新日期:2020-06-11
down
wechat
bug