当前位置: X-MOL 学术BMC Med. Genomics › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
Privacy-preserving approximate GWAS computation based on homomorphic encryption.
BMC Medical Genomics ( IF 2.1 ) Pub Date : 2020-07-21 , DOI: 10.1186/s12920-020-0722-1
Duhyeong Kim 1 , Yongha Son 1 , Dongwoo Kim 1 , Andrey Kim 1 , Seungwan Hong 1 , Jung Hee Cheon 1
Affiliation  

One of three tasks in a secure genome analysis competition called iDASH 2018 was to develop a solution for privacy-preserving GWAS computation based on homomorphic encryption. The scenario is that a data holder encrypts a number of individual records, each of which consists of several phenotype and genotype data, and provide the encrypted data to an untrusted server. Then, the server performs a GWAS algorithm based on homomorphic encryption without the decryption key and outputs the result in encrypted state so that there is no information leakage on the sensitive data to the server. We develop a privacy-preserving semi-parallel GWAS algorithm by applying an approximate homomorphic encryption scheme HEAAN. Fisher scoring and semi-parallel GWAS algorithms are modified to be efficiently computed over homomorphically encrypted data with several optimization methodologies; substitute matrix inversion by an adjoint matrix, avoid computing a superfluous matrix of super-large size, and transform the algorithm into an approximate version. Our modified semi-parallel GWAS algorithm based on homomorphic encryption which achieves 128-bit security takes 30–40 minutes for 245 samples containing 10,000–15,000 SNPs. Compared to the true p-value from the original semi-parallel GWAS algorithm, the F1 score of our p-value result is over 0.99. Privacy-preserving semi-parallel GWAS computation can be efficiently done based on homomorphic encryption with sufficiently high accuracy compared to the semi-parallel GWAS computation in unencrypted state.

中文翻译:

基于同态加密的保护隐私的近似GWAS计算。

在名为iDASH 2018的安全基因组分析比赛中,三项任务之一是开发一种基于同态加密的,用于保护隐私的GWAS计算的解决方案。场景是数据持有者对许多单独的记录进行加密,每个记录由几个表型和基因型数据组成,并将加密的数据提供给不受信任的服务器。然后,服务器在没有解密密钥的情况下基于同态加密执行GWAS算法,并以加密状态输出结果,从而不会将敏感数据上的信息泄漏到服务器。通过应用近似同态加密方案HEAAN,我们开发了一种保护隐私的半并行GWAS算法。改进了Fisher评分和半并行GWAS算法,可以使用几种优化方法对同态加密数据进行有效计算;用伴随矩阵代替矩阵求逆,避免计算超大尺寸的多余矩阵,并将算法转换为近似版本。我们基于同态加密的改良半并行GWAS算法可实现128位安全性,对包含10,000-15,000个SNP的245个样本需要30-40分钟。与原始半并行GWAS算法的真实p值相比,我们的p值结果的F1得分超过0.99。与未加密状态下的半并行GWAS计算相比,基于同态加密的保密性高的半并行GWAS计算可以高效地完成。用伴随矩阵代替矩阵求逆,避免计算超大尺寸的多余矩阵,并将算法转换为近似版本。我们基于同态加密的改良半并行GWAS算法可实现128位安全性,对包含10,000-15,000个SNP的245个样本需要30-40分钟。与原始半并行GWAS算法的真实p值相比,我们的p值结果的F1得分超过0.99。与未加密状态下的半并行GWAS计算相比,基于同态加密的保密性高的半并行GWAS计算可以高效地完成。用伴随矩阵代替矩阵求逆,避免计算超大尺寸的多余矩阵,并将算法转换为近似版本。我们基于同态加密的改良半并行GWAS算法可实现128位安全性,对包含10,000-15,000个SNP的245个样本需要30-40分钟。与原始半并行GWAS算法的真实p值相比,我们的p值结果的F1得分超过0.99。与未加密状态下的半并行GWAS计算相比,基于同态加密的保密性高的半并行GWAS计算可以高效地完成。我们基于同态加密的改良半并行GWAS算法可实现128位安全性,对包含10,000-15,000个SNP的245个样本需要30-40分钟。与原始半并行GWAS算法的真实p值相比,我们的p值结果的F1得分超过0.99。与未加密状态下的半并行GWAS计算相比,基于同态加密的保密性高的半并行GWAS计算可以高效地完成。我们基于同态加密的改良半并行GWAS算法(可实现128位安全性)对包含10,000-15,000个SNP的245个样本花费30-40分钟。与原始半并行GWAS算法的真实p值相比,我们的p值结果的F1得分超过0.99。与未加密状态下的半并行GWAS计算相比,基于同态加密的保密性高的半并行GWAS计算可以高效地完成。
更新日期:2020-07-21
down
wechat
bug