Latent-variable Private Information Retrieval,arXiv - CS - Information Retrieval

当前位置： X-MOL 学术 › arXiv.cs.IR › 论文详情

Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)

Latent-variable Private Information Retrieval
arXiv - CS - Information Retrieval Pub Date : 2020-01-16 , DOI: arxiv-2001.05998
Islam Samy, Mohamed A. Attia, Ravi Tandon, Loukas Lazos

In many applications, content accessed by users (movies, videos, news articles, etc.) can leak sensitive latent attributes, such as religious and political views, sexual orientation, ethnicity, gender, and others. To prevent such information leakage, the goal of classical PIR is to hide the identity of the content/message being accessed, which subsequently also hides the latent attributes. This solution, while private, can be too costly, particularly, when perfect (information-theoretic) privacy constraints are imposed. For instance, for a single database holding $K$ messages, privately retrieving one message is possible if and only if the user downloads the entire database of $K$ messages. Retrieving content privately, however, may not be necessary to perfectly hide the latent attributes. Motivated by the above, we formulate and study the problem of latent-variable private information retrieval (LV-PIR), which aims at allowing the user efficiently retrieve one out of $K$ messages (indexed by $\theta$) without revealing any information about the latent variable (modeled by $S$). We focus on the practically relevant setting of a single database and show that one can significantly reduce the download cost of LV-PIR (compared to the classical PIR) based on the correlation between $\theta$ and $S$. We present a general scheme for LV-PIR as a function of the statistical relationship between $\theta$ and $S$, and also provide new results on the capacity/download cost of LV-PIR. Several open problems and new directions are also discussed.

中文翻译：

隐变量隐私信息检索

在许多应用程序中，用户访问的内容（电影、视频、新闻文章等）可能会泄露敏感的潜在属性，例如宗教和政治观点、性取向、种族、性别等。为了防止此类信息泄露，经典 PIR 的目标是隐藏正在访问的内容/消息的身份，随后也隐藏了潜在属性。这种解决方案虽然是私有的，但成本可能太高，尤其是在强加了完美（信息论）隐私约束的情况下。例如，对于保存 $K$ 消息的单个数据库，当且仅当用户下载 $K$ 消息的整个数据库时，才可以私下检索一条消息。然而，私下检索内容可能不是完美隐藏潜在属性的必要条件。受上述激励，我们制定并研究了隐变量私人信息检索 (LV-PIR) 的问题，其目的是让用户有效地检索 $K$ 消息中的一个（由 $\theta$ 索引），而不会泄露有关隐变量的任何信息（由 $S$ 建模）。我们专注于单个数据库的实际相关设置，并表明基于 $\theta$ 和 $S$ 之间的相关性，可以显着降低 LV-PIR（与经典 PIR 相比）的下载成本。我们提出了 LV-PIR 的一般方案，作为 $\theta$ 和 $S$ 之间的统计关系的函数，并且还提供了关于 LV-PIR 的容量/下载成本的新结果。还讨论了几个未解决的问题和新的方向。它旨在允许用户有效地检索 $K$ 消息中的一个（由 $\theta$ 索引），而无需透露有关潜在变量的任何信息（由 $S$ 建模）。我们专注于单个数据库的实际相关设置，并表明基于 $\theta$ 和 $S$ 之间的相关性，可以显着降低 LV-PIR（与经典 PIR 相比）的下载成本。我们提出了 LV-PIR 的一般方案，作为 $\theta$ 和 $S$ 之间的统计关系的函数，并且还提供了关于 LV-PIR 的容量/下载成本的新结果。还讨论了几个未解决的问题和新的方向。它旨在允许用户有效地检索 $K$ 消息中的一个（由 $\theta$ 索引），而无需透露有关潜在变量的任何信息（由 $S$ 建模）。我们专注于单个数据库的实际相关设置，并表明基于 $\theta$ 和 $S$ 之间的相关性，可以显着降低 LV-PIR（与经典 PIR 相比）的下载成本。我们提出了 LV-PIR 的一般方案，作为 $\theta$ 和 $S$ 之间的统计关系的函数，并且还提供了关于 LV-PIR 的容量/下载成本的新结果。还讨论了几个未解决的问题和新的方向。我们专注于单个数据库的实际相关设置，并表明基于 $\theta$ 和 $S$ 之间的相关性，可以显着降低 LV-PIR（与经典 PIR 相比）的下载成本。我们提出了 LV-PIR 的一般方案，作为 $\theta$ 和 $S$ 之间的统计关系的函数，并且还提供了关于 LV-PIR 的容量/下载成本的新结果。还讨论了几个未解决的问题和新的方向。我们专注于单个数据库的实际相关设置，并表明基于 $\theta$ 和 $S$ 之间的相关性，可以显着降低 LV-PIR（与经典 PIR 相比）的下载成本。我们提出了 LV-PIR 的一般方案，作为 $\theta$ 和 $S$ 之间的统计关系的函数，并且还提供了关于 LV-PIR 的容量/下载成本的新结果。还讨论了几个未解决的问题和新的方向。

更新日期：2020-05-15

点击分享查看原文

点击收藏

阅读更多本刊最新论文

全部期刊列表>>