当前位置: X-MOL 学术Annu. Rev. Stat. Appl. › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
Synthetic Data
Annual Review of Statistics and Its Application ( IF 7.9 ) Pub Date : 2021-03-08 , DOI: 10.1146/annurev-statistics-040720-031848
Trivellore E. Raghunathan 1
Affiliation  

Demand for access to data, especially data collected using public funds, is ever growing. At the same time, concerns about the disclosure of the identities of and sensitive information about the respondents providing the data are making the data collectors limit the access to data. Synthetic data sets, generated to emulate certain key information found in the actual data and provide the ability to draw valid statistical inferences, are an attractive framework to afford widespread access to data for analysis while mitigating privacy and confidentiality concerns. The goal of this article is to provide a review of various approaches for generating and analyzing synthetic data sets, inferential justification, limitations of the approaches, and directions for future research.

中文翻译:


综合数据

对数据访问的需求,尤其是使用公共资金收集的数据的需求在不断增长。同时,由于担心提供数据的受访者的身份和敏感信息的泄露,使得数据收集者限制了对数据的访问。合成数据集是用来模拟实际数据中发现的某些关键信息并提供得出有效统计推断的能力的,它是一个吸引人的框架,可以广泛地访问数据进行分析,同时还可以减少隐私和机密性问题。本文的目的是概述各种用于生成和分析综合数据集的方法,推理依据,方法的局限性以及未来研究的方向。

更新日期:2021-03-09
down
wechat
bug