PSpec-SQL: Enabling Fine-Grained Control for Distributed Data Analytics
IEEE Transactions on Dependable and Secure Computing (IF 7.3), Pub Date: 2019-01-01, DOI: 10.1109/tdsc.2019.2914209
Chen Luo, Fei He, Fei Peng, Dong Yan, Dan Zhang, Xin Zhou

Business organizations regularly collect customer data to improve their services. They may want to share these data internally, or even with third parties, to maximize data utility. Because business data contain a large amount of customer information, organizations must respect customers' privacy as required by privacy laws. In this paper, we present PSpec-SQL, a distributed data analytics system that automatically enforces privacy compliance for SQL queries. Our system provides a high-level language, PSpec, in which the data owner specifies her data usage policy. The data analyst queries the data as usual to perform data analysis, but our system checks each query to ensure that only policy-compliant queries are executed. We have implemented a prototype of PSpec-SQL on top of Spark-SQL and carried out a case study on the TPC benchmarks. The results show that the system is practical and adds negligible overhead to query processing.
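
The idea of checking each query against the owner's policy before execution can be illustrated with a small Python sketch. It is not PSpec syntax and not the PSpec-SQL implementation: the `Rule` type, the role names, and the functions `check_query` and `run_if_compliant` are hypothetical, and the query is assumed to be already reduced to the set of columns it reads.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Rule:
    """Hypothetical policy rule: only the listed roles may read the column."""
    column: str
    allowed_roles: frozenset


def check_query(columns, role, rules):
    """Return the sorted list of columns the role may not read.

    An empty list means the query is compliant under this toy policy.
    Columns without a rule are treated as unrestricted (an assumption).
    """
    restricted = {rule.column: rule.allowed_roles for rule in rules}
    return sorted(
        col for col in columns
        if col in restricted and role not in restricted[col]
    )


def run_if_compliant(columns, role, rules, execute):
    """Call execute() only if the query passes the policy check."""
    violations = check_query(columns, role, rules)
    if violations:
        raise PermissionError(f"policy violation on: {', '.join(violations)}")
    return execute()


if __name__ == "__main__":
    rules = [Rule("customer.phone", frozenset({"auditor"}))]
    # An analyst reading an unrestricted column is allowed to run.
    print(run_if_compliant({"orders.total"}, "analyst", rules, lambda: "executed"))
    # The same analyst reading a restricted column is rejected before execution.
    print(check_query({"customer.phone", "orders.total"}, "analyst", rules))
```

In this sketch the check runs before `execute` is called, so a rejected query never touches the data, which mirrors the gatekeeping behaviour described in the abstract.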

Updated: 2019-01-01