Selectivity Estimation with Attribute Value Dependencies using Linked Bayesian Networks

Halford, Max; Saint-Pierre, Philippe; Morvan, Franck

Computer Science > Databases

arXiv:2009.09883 (cs)

[Submitted on 21 Sep 2020]

Title:Selectivity Estimation with Attribute Value Dependencies using Linked Bayesian Networks

Authors:Max Halford, Philippe Saint-Pierre, Franck Morvan

View PDF

Abstract:Relational query optimisers rely on cost models to choose between different query execution plans. Selectivity estimates are known to be a crucial input to the cost model. In practice, standard selectivity estimation procedures are prone to large errors. This is mostly because they rely on the so-called attribute value independence and join uniformity assumptions. Therefore, multidimensional methods have been proposed to capture dependencies between two or more attributes both within and across relations. However, these methods require a large computational cost which makes them unusable in practice. We propose a method based on Bayesian networks that is able to capture cross-relation attribute value dependencies with little overhead. Our proposal is based on the assumption that dependencies between attributes are preserved when joins are involved. Furthermore, we introduce a parameter for trading between estimation accuracy and computational cost. We validate our work by comparing it with other relevant methods on a large workload derived from the JOB and TPC-DS benchmarks. Our results show that our method is an order of magnitude more efficient than existing methods, whilst maintaining a high level of accuracy.

Subjects:	Databases (cs.DB)
Cite as:	arXiv:2009.09883 [cs.DB]
	(or arXiv:2009.09883v1 [cs.DB] for this version)
	https://doi.org/10.48550/arXiv.2009.09883

Submission history

From: Max Halford [view email]
[v1] Mon, 21 Sep 2020 14:05:05 UTC (829 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.DB

< prev | next >

new | recent | 2009

Change to browse by:

References & Citations

DBLP - CS Bibliography

listing | bibtex

Max Halford
Philippe Saint-Pierre
Franck Morvan

export BibTeX citation

Computer Science > Databases

Title:Selectivity Estimation with Attribute Value Dependencies using Linked Bayesian Networks

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Databases

Title:Selectivity Estimation with Attribute Value Dependencies using Linked Bayesian Networks

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators