Provenance-based Data Skipping (TechReport)
arXiv - CS - Databases Pub Date : 2021-04-26 , DOI: arxiv-2104.12815
Xing Niu, Ziyu Liu, Pengyuan Li, Boris Glavic

Database systems analyze queries to determine up front which data is needed to answer them, and use indexes and other physical design techniques to speed up access to that data. However, for important classes of queries, e.g., HAVING and top-k queries, it is impossible to determine up front what data is relevant. To overcome this limitation, we develop provenance-based data skipping (PBDS), a novel approach that generates provenance sketches to concisely encode what data is relevant for a query. Once a provenance sketch has been captured, it is used to speed up subsequent queries. PBDS can exploit physical design artifacts such as indexes and zone maps. Our approach significantly improves performance for both disk-based and main-memory database systems.
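
The capture-then-reuse idea in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: the table, the range boundaries, and every function name below are hypothetical, and the sketch assumes the table is range-partitioned on the GROUP BY attribute so that skipping whole fragments cannot change any group's aggregate.

```python
from bisect import bisect_right
from collections import defaultdict

# Hypothetical toy table of (customer_id, amount) rows.
ROWS = [(1, 50), (2, 400), (3, 20), (7, 900), (7, 300),
        (12, 10), (15, 30), (18, 5)]

# Hypothetical range partitioning on customer_id:
# fragment 0 = [0, 5), 1 = [5, 10), 2 = [10, 15), 3 = [15, 20).
BOUNDS = [0, 5, 10, 15, 20]


def fragment_of(customer_id):
    """Return the index of the range fragment that contains customer_id."""
    return bisect_right(BOUNDS, customer_id) - 1


def having_query(rows, threshold):
    """Emulate: SELECT customer_id, SUM(amount) FROM rows
    GROUP BY customer_id HAVING SUM(amount) > threshold."""
    totals = defaultdict(int)
    for customer_id, amount in rows:
        totals[customer_id] += amount
    return {cid: total for cid, total in totals.items() if total > threshold}


def capture_sketch(rows, threshold):
    """Run the query once and record the fragments that contain at least
    one row contributing to the result (the rows in its provenance)."""
    result = having_query(rows, threshold)
    return {fragment_of(cid) for cid, _ in rows if cid in result}


def query_with_sketch(rows, threshold, sketch):
    """Answer the query using only rows from fragments in the sketch.
    This toy filter still scans the list; a real system would instead turn
    the sketch into range predicates that an index or zone map can use."""
    relevant = [row for row in rows if fragment_of(row[0]) in sketch]
    return having_query(relevant, threshold), len(relevant)


sketch = capture_sketch(ROWS, threshold=300)
answer, rows_read = query_with_sketch(ROWS, 300, sketch)
print(sorted(sketch), answer, rows_read)
```

Which rows pass the HAVING condition is only known after aggregation, so no predicate on `customer_id` can be derived from the query text alone. The captured sketch supplies that missing filter for later runs of the same query.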

Updated: 2021-04-29