Multicenter Privacy-Preserving Cox Analysis Based on Homomorphic Encryption, IEEE Journal of Biomedical and Health Informatics


IEEE Journal of Biomedical and Health Informatics (IF 7.7), Pub Date: 2021-04-06, DOI: 10.1109/jbhi.2021.3071270
Yao Lu, Yu Tian, Tianshu Zhou, Shiqiang Zhu, Jingsong Li


The Cox proportional hazards model is one of the most widely used methods for analyzing survival data. Data from multiple data providers are required to improve the generalizability and confidence of the results of Cox analysis; however, such data sharing may result in leakage of sensitive information, leading to financial fraud, social discrimination or unauthorized data abuse. Some privacy-preserving Cox regression protocols have been proposed in past years, but they lack either security or functionality. In this paper, we propose a privacy-preserving Cox regression protocol for multiple data providers and researchers. The proposed protocol allows researchers to train models on horizontally or vertically partitioned datasets while providing privacy protection for both the sensitive data and the trained models. Our protocol utilizes threshold homomorphic encryption to guarantee security. Experimental results demonstrate that with the proposed protocol, Cox regression model training over 9 variables in a dataset of 113,035 samples takes approximately 44 min, and the trained model is almost the same as that obtained with the original nonsecure Cox regression protocol; therefore, our protocol is a potential candidate for practical real-world applications in multicenter medical research.
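To make the horizontally partitioned setting concrete, the sketch below shows why a Cox partial log-likelihood can be assembled from per-site sums alone. It is an illustration under stated assumptions, not the protocol from the paper: ordinary addition stands in for the homomorphic addition of ciphertexts, ties are handled with the Breslow approximation, the set of distinct event times is assumed to be known to every site, and all function names and the toy data are hypothetical.

```python
import math


def linear_predictor(beta, x):
    """Dot product beta . x for one subject."""
    return sum(b * v for b, v in zip(beta, x))


def pooled_log_partial_likelihood(beta, rows):
    """Reference value computed on the pooled data.

    rows: list of (time, event, covariates); Breslow handling of ties.
    """
    total = 0.0
    for t_i, event_i, x_i in rows:
        if not event_i:
            continue
        # Risk set: every subject still under observation at time t_i.
        risk_sum = sum(math.exp(linear_predictor(beta, x_j))
                       for t_j, _, x_j in rows if t_j >= t_i)
        total += linear_predictor(beta, x_i) - math.log(risk_sum)
    return total


def site_summaries(beta, rows, event_times):
    """Aggregates one site would contribute for each global event time.

    In a secure protocol these values would be encrypted before leaving
    the site (assumption: an additively homomorphic scheme).
    """
    out = {}
    for t in event_times:
        events = [x for (t_j, e_j, x) in rows if e_j and t_j == t]
        d = len(events)                                    # local event count
        s = sum(linear_predictor(beta, x) for x in events)  # local sum of beta . x
        r = sum(math.exp(linear_predictor(beta, x))
                for (t_j, _, x) in rows if t_j >= t)       # local risk-set sum
        out[t] = (d, s, r)
    return out


def aggregated_log_partial_likelihood(beta, sites):
    """Combine per-site aggregates using additions only."""
    event_times = sorted({t for rows in sites for (t, e, _) in rows if e})
    summaries = [site_summaries(beta, rows, event_times) for rows in sites]
    total = 0.0
    for t in event_times:
        d = sum(s[t][0] for s in summaries)
        lin = sum(s[t][1] for s in summaries)
        risk = sum(s[t][2] for s in summaries)
        total += lin - d * math.log(risk)
    return total


# Hypothetical toy data: (time, event indicator, covariates) per subject.
site_a = [(5.0, 1, (0.5, 1.0)), (8.0, 0, (1.2, 0.0)), (3.0, 1, (-0.3, 1.0))]
site_b = [(5.0, 1, (0.1, 0.0)), (9.0, 1, (0.7, 1.0)), (2.0, 0, (-1.0, 0.0))]
beta = (0.4, -0.2)

pooled = pooled_log_partial_likelihood(beta, site_a + site_b)
aggregated = aggregated_log_partial_likelihood(beta, [site_a, site_b])
```

Here `pooled` and `aggregated` agree up to floating-point rounding, because every cross-site quantity enters the likelihood only through a sum. The logarithm of the combined risk-set sum is not an addition, so it is one example of a step that a real encrypted protocol has to handle in some other way. Sharing the event times in the clear, as this sketch does, may itself leak information in a real deployment.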


Updated: 2021-04-06
