当前位置: X-MOL 学术IEEE J. Biomed. Health Inform. › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
Multicenter privacy-preserving Cox analysis based on homomorphic encryption.
IEEE Journal of Biomedical and Health Informatics ( IF 7.7 ) Pub Date : 2021-04-06 , DOI: 10.1109/jbhi.2021.3071270
Yao Lu , Yu Tian , Tianshu Zhou , Shiqiang Zhu , Jingsong Li

The Cox proportional hazards model is one of the most widely used methods for analyzing survival data. Data from multiple data providers are required to improve the generalizability and confidence of the results of Cox analysis; however, such data sharing may result in leakage of sensitive information, leading to financial fraud, social discrimination or unauthorized data abuse. Some privacy-preserving Cox regression protocols have been proposed in past years, but they lack either security or functionality. In this paper, we propose a privacy-preserving Cox regression protocol for multiple data providers and researchers. The proposed protocol allows researchers to train models on horizontally or vertically partitioned datasets while providing privacy protection for both the sensitive data and the trained models. Our protocol utilizes threshold homomorphic encryption to guarantee security. Experimental results demonstrate that with the proposed protocol, Cox regression model training over 9 variables in a dataset of 113,035 samples takes approximately 44 min, and the trained model is almost the same as that obtained with the original nonsecure Cox regression protocol; therefore, our protocol is a potential candidate for practical real-world applications in multicenter medical research.

中文翻译:

基于同态加密的多中心隐私保护Cox分析。

Cox比例风险模型是用于分析生存数据的最广泛使用的方法之一。需要来自多个数据提供者的数据以提高Cox分析结果的通用性和可信度;但是,此类数据共享可能会导致敏感信息泄漏,从而导致财务欺诈,社会歧视或未经授权的数据滥用。过去几年中已经提出了一些保护隐私的Cox回归协议,但是它们缺乏安全性或功能。在本文中,我们为多个数据提供者和研究人员提出了一种保护隐私的Cox回归协议。提出的协议允许研究人员在水平或垂直划分的数据集上训练模型,同时为敏感数据和训练后的模型提供隐私保护。我们的协议利用阈值同态加密来保证安全性。实验结果表明,采用所提出的协议,对113,035个样本的数据集中的9个变量进行Cox回归模型训练大约需要44分钟,并且训练后的模型与原始的非安全Cox回归协议所获得的模型几乎相同。因此,我们的协议是多中心医学研究中实际应用中的潜在候选对象。
更新日期:2021-04-06
down
wechat
bug