Communication-efficient distributed M-estimation with missing data, Computational Statistics & Data Analysis



Communication-efficient distributed M-estimation with missing data
Computational Statistics & Data Analysis (IF 1.5), Pub Date: 2021-04-16, DOI: 10.1016/j.csda.2021.107251
Jianwei Shi, Guoyou Qin, Huichen Zhu, Zhongyi Zhu

In the big data era, practical applications often involve incomplete data. Existing distributed methods that ignore missingness may yield inconsistent estimates. Motivated by this, a distributed algorithm is developed for M-estimation with missing data. The proposed algorithm is communication-efficient: only gradient information is transferred to the central machine. The parameters of interest and the nuisance parameters are updated simultaneously. Theoretically, it is shown that the proposed algorithm achieves full-sample performance after a moderate number of iterations. The influence of nuisance parameters on distributed M-estimation is also investigated. Simulations on synthetic data illustrate the effectiveness of the algorithm. Finally, the algorithm is applied to a real data set.
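The abstract does not spell out the update rule, so the following is only a rough sketch of the general idea of gradient-only communication with simultaneous updates, not the authors' algorithm. It assumes a linear model as the M-estimation target, a logistic propensity model as the nuisance part, inverse-probability weighting to handle missing responses, and plain averaged-gradient descent at the central machine. All names (`local_gradients`, `distributed_fit`, `simulate_shards`) and all tuning values are hypothetical.

```python
import numpy as np

def sigmoid(t):
    return 1.0 / (1.0 + np.exp(-t))

def local_gradients(X, y, observed, beta, gamma):
    """Run on one worker; only the two returned gradient vectors are communicated."""
    n = len(y)
    pi = sigmoid(X @ gamma)  # estimated probability that y is observed
    # Nuisance part: gradient of the average logistic negative log-likelihood.
    grad_gamma = X.T @ (pi - observed) / n
    # Parameter of interest: gradient of the inverse-probability-weighted
    # squared loss. Rows with missing y (stored as NaN) contribute zero.
    weights = observed / pi
    resid = np.where(observed == 1, X @ beta - y, 0.0)
    grad_beta = X.T @ (weights * resid) / n
    return grad_beta, grad_gamma

def distributed_fit(shards, dim, steps=1000, lr=0.2):
    """Central machine: average worker gradients, update beta and gamma together."""
    beta = np.zeros(dim)
    gamma = np.zeros(dim)
    for _ in range(steps):
        grads = [local_gradients(X, y, r, beta, gamma) for X, y, r in shards]
        # Shards are assumed equal-sized, so a plain mean equals the
        # full-sample gradient.
        g_beta = np.mean([g[0] for g in grads], axis=0)
        g_gamma = np.mean([g[1] for g in grads], axis=0)
        beta -= lr * g_beta
        gamma -= lr * g_gamma
    return beta, gamma

def simulate_shards(n_machines=4, n_per_machine=2000, seed=0):
    """Synthetic data (assumed setup): y is missing with probability depending on x."""
    rng = np.random.default_rng(seed)
    beta_true = np.array([1.0, 2.0])
    gamma_true = np.array([1.0, 0.5])
    shards = []
    for _ in range(n_machines):
        x = rng.normal(size=n_per_machine)
        X = np.column_stack([np.ones(n_per_machine), x])
        y = X @ beta_true + rng.normal(size=n_per_machine)
        observed = (rng.random(n_per_machine) < sigmoid(X @ gamma_true)).astype(float)
        y[observed == 0] = np.nan  # mark missing responses
        shards.append((X, y, observed))
    return shards, beta_true, gamma_true

shards, beta_true, gamma_true = simulate_shards()
beta_hat, gamma_hat = distributed_fit(shards, dim=2)
print("beta_hat:", beta_hat, "gamma_hat:", gamma_hat)
```

In this sketch each round costs every worker two short vectors of communication, regardless of its local sample size. The paper's actual procedure and its iteration-count guarantees may differ substantially from this plain gradient scheme.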


Updated: 2021-04-27
