当前位置: X-MOL 学术Biometrics › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
Testing for association in multiview network data
Biometrics ( IF 1.4 ) Pub Date : 2021-04-01 , DOI: 10.1111/biom.13464
Lucy L Gao 1 , Daniela Witten 2 , Jacob Bien 3
Affiliation  

In this paper, we consider data consisting of multiple networks, each composed of a different edge set on a common set of nodes. Many models have been proposed for the analysis of such multiview network data under the assumption that the data views are closely related. In this paper, we provide tools for evaluating this assumption. In particular, we ask: given two networks that each follow a stochastic block model, is there an association between the latent community memberships of the nodes in the two networks? To answer this question, we extend the stochastic block model for a single network view to the two-view setting, and develop a new hypothesis test for the null hypothesis that the latent community memberships in the two data views are independent. We apply our test to protein–protein interaction data from the HINT database. We find evidence of a weak association between the latent community memberships of proteins defined with respect to binary interaction data and the latent community memberships of proteins defined with respect to cocomplex association data. We also extend this proposal to the setting of a network with node covariates. The proposed methods extend readily to three or more network/multivariate data views.

中文翻译:

测试多视图网络数据中的关联

在本文中,我们考虑由多个网络组成的数据,每个网络由一组公共节点上的不同边集组成。已经提出了许多模型来分析这种多视图假设数据视图密切相关的网络数据。在本文中,我们提供了评估此假设的工具。特别是,我们会问:给定两个网络,每个网络都遵循随机块模型,这两个网络中节点的潜在社区成员资格之间是否存在关联?为了回答这个问题,我们将单一网络视图的随机块模型扩展到双视图设置,并针对两个数据视图中的潜在社区成员资格独立的零假设开发了一个新的假设检验。我们将测试应用于来自 HINT 数据库的蛋白质-蛋白质相互作用数据。我们发现关于二元相互作用数据定义的蛋白质的潜在社区成员与关于复合物关联数据定义的蛋白质的潜在社区成员之间弱关联的证据。我们还将此提议扩展到具有节点协变量的网络设置。所提出的方法很容易扩展到三个或更多网络/多变量数据视图。
更新日期:2021-04-01
down
wechat
bug