当前位置: X-MOL 学术IEEE Comput. Graph. Appl. › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
Capturing and Visualizing Provenance From Data Wrangling.
IEEE Computer Graphics and Applications ( IF 1.7 ) Pub Date : 2019-10-04 , DOI: 10.1109/mcg.2019.2941856
Christian Bors 1 , Theresia Gschwandtner 1 , Silvia Miksch 1
Affiliation  

Data quality management and assessment play a vital role for ensuring the trust in the data and its fitness-of-use for subsequent analysis. The transformation history of a data wrangling system is often insufficient for determining the usability of a dataset, lacking information how changes affected the dataset. Capturing workflow provenance along the wrangling process and combining it with descriptive information as data provenance can enable users to comprehend how these changes affected the dataset, and if they benefited data quality. We present DQProv Explorer, a system that captures and visualizes provenance from data wrangling operations. It features three visualization components: allowing the user to explore the provenance graph of operations and the data stream, the development of quality over time for a sequence of wrangling operations applied to the dataset, and the distribution of issues across the entirety of the dataset to determine error patterns.

中文翻译:

从数据整理中捕获和可视化来源。

数据质量管理和评估对于确保对数据的信任及其对后续分析的适用性起着至关重要的作用。数据处理系统的转换历史通常不足以确定数据集的可用性,缺乏有关更改如何影响数据集的信息。在争用过程中捕获工作流的来源并将其与描述性信息结合起来作为数据来源,可以使用户了解这些更改如何影响数据集以及它们是否有益于数据质量。我们介绍了DQProv Explorer,它是一个捕获和可视化数据整理操作中出处的系统。它具有三个可视化组件:允许用户浏览操作和数据流的出处图,
更新日期:2019-11-01
down
wechat
bug