当前位置: X-MOL 学术arXiv.cs.CR › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
Automatic de-identification of Data Download Packages
arXiv - CS - Cryptography and Security Pub Date : 2021-05-04 , DOI: arxiv-2105.02175
Laura Boeschoten, Roos Voorvaart, Casper Kaandorp, Ruben van den Goorbergh, Martine de Vos

The General Data Protection Regulation (GDPR) grants all natural persons the right of access to their personal data if this is being processed by data controllers. The data controllers are obliged to share the data in an electronic format and often provide the data in a so called Data Download Package (DDP). These DDPs contain all data collected by public and private entities during the course of citizens' digital life and form a treasure trove for social scientists. However, the data can be deeply private. To protect the privacy of research participants while using their DDPs for scientific research, we developed de-identification software that is able to handle typical characteristics of DDPs such as regularly changing file structures, visual and textual content, different file formats, different file structures and accounting for usernames. We investigate the performance of the software and illustrate how the software can be tailored towards specific DDP structures.

中文翻译:

自动取消识别数据下载包

通用数据保护条例(GDPR)授予所有自然人访问其个人数据的权利,前提是该数据由数据控制者处理。数据控制器必须以电子格式共享数据,并且通常以所谓的数据下载包(DDP)提供数据。这些DDP包含了公民数字生活过程中公共和私人实体收集的所有数据,并构成了社会科学家的宝库。但是,数据可以是高度私有的。为了在使用DDP进行科学研究时保护研究参与者的隐私,我们开发了能够处理DDP典型特征的去识别软件,例如定期更改文件结构,视觉和文本内容,不同文件格式,不同文件结构以及考虑用户名。
更新日期:2021-05-06
down
wechat
bug