当前位置: X-MOL 学术J. Equal. Technol. › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
Optimal design subsampling from Big Datasets
Journal of Quality Technology ( IF 2.5 ) Pub Date : 2021-03-04 , DOI: 10.1080/00224065.2021.1889418
Laura Deldossi 1 , Chiara Tommasi 2
Affiliation  

Abstract

Big Data are huge amounts of digital information that rarely result from properly planned surveys; as a consequence they often contain redundant observations. When the aim is to answer particular questions of interest, we suggest selecting a subsample of units that contains the majority of the information to achieve this goal. Selection methods driven by the theory of optimal design incorporate the inferential purposes and thus perform better than standard sampling schemes.



中文翻译:

大数据集的优化设计子采样

摘要

大数据是大量的数字信息,很少通过适当计划的调查产生;因此,它们通常包含多余的观察结果。当目的是回答感兴趣的特定问题时,我们建议选择包含大部分信息的单元子样本以实现此目标。由优化设计理论驱动的选择方法结合了推理目的,因此比标准抽样方案表现更好。

更新日期:2021-03-04
down
wechat
bug