当前位置: X-MOL 学术arXiv.cs.DB › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
Data Validation
arXiv - CS - Databases Pub Date : 2020-12-21 , DOI: arxiv-2012.12028
Mark P. J. van der Loo, Edwin de Jonge

Data validation is the activity where one decides whether or not a particular data set is fit for a given purpose. Formalizing the requirements that drive this decision process allows for unambiguous communication of the requirements, automation of the decision process, and opens up ways to maintain and investigate the decision process itself. The purpose of this article is to formalize the definition of data validation and to demonstrate some of the properties that can be derived from this definition. In particular, it is shown how a formal view of the concept permits a classification of data quality requirements, allowing them to be ordered in increasing levels of complexity. Some subtleties arising from combining possibly many such requirements are pointed out as well.

中文翻译:

资料验证

数据验证是一项活动,在该活动中,人们将确定特定的数据集是否适合给定的目的。正式定义驱动此决策过程的需求,可以明确传达需求,实现决策过程的自动化,并开辟维护和调查决策过程本身的方式。本文的目的是形式化数据验证的定义,并演示可以从该定义派生的某些属性。特别地,示出了该概念的形式视图如何允许对数据质量要求进行分类,从而允许以越来越高的复杂度对它们进行排序。还指出了可能组合许多这样的要求而引起的一些微妙之处。
更新日期:2020-12-23
down
wechat
bug