...
首页> 外文期刊>Data & Knowledge Engineering >From biological to social networks: Link prediction based on multi-way spectral clustering
【24h】

From biological to social networks: Link prediction based on multi-way spectral clustering

机译:从生物网络到社交网络:基于多路谱聚类的链接预测

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Link prediction in protein-protein interaction networks (PPINs) is an important task in biology, since the vast majority of biological functions involve such protein interactions. Link prediction is also important for online social networks (OSNs), which provide predictions about who is a friend of whom. Many link prediction methods for PPINs/OSNs are local-based and do not exploit all network structure, which limits prediction accuracy. On the other hand, there are global approaches to detect the overall path structure in a network, being computationally prohibitive for huge-size PPINs/OSNs. In this paper, we enhance a previously proposed multi-way spectral clustering method by introducing new ways to capture node proximity in both PPINs/OSNs. Our new enhanced method uses information obtained from the top few eigenvectors of the normalized Laplacian matrix. As a result, it produces a less noisy matrix, which is smaller and more compact than the original one. In this way, we are able to provide faster and more accurate link predictions. Moreover, our new spectral clustering model is based on the well-known Bray-Curtis coefficient to measure proximity between two nodes. Compared to traditional clustering algorithms, such as k-means and DBSCAN, which assume globular (convex) regions in Euclidean space, our approach is more flexible in capturing the non-connected components of a social graph and a wider range of cluster geometries. We perform an extensive experimental comparison of the proposed method against existing link prediction algorithms and k-means algorithm, using two synthetic data sets, three real social networks and three real human protein data sets. Our experimental results show that our SpectralLink algorithm outperforms the local approaches, the k-means algorithm and another spectral clustering method in terms of effectiveness, whereas it is more efficient than the global approaches.
机译:蛋白质-蛋白质相互作用网络(PPIN)中的链接预测是生物学中的重要任务,因为绝大多数生物学功能都涉及此类蛋白质相互作用。链接预测对于在线社交网络(OSN)也很重要,在线社交网络提供有关谁是谁的朋友的预测。用于PPIN / OSN的许多链接预测方法都是基于本地的,并未利用所有网络结构,这限制了预测的准确性。另一方面,存在检测网络中整体路径结构的全局方法,这些方法在计算上禁止用于大型PPIN / OSN。在本文中,我们通过引入在两个PPIN / OSN中捕获节点邻近性的新方法,增强了先前提出的多路谱聚类方法。我们的新增强方法使用了从归一化拉普拉斯矩阵的前几个特征向量获得的信息。结果,它产生了一个噪声较小的矩阵,该矩阵比原始矩阵更小,更紧凑。这样,我们能够提供更快,更准确的链接预测。此外,我们新的光谱聚类模型基于众所周知的Bray-Curtis系数来测量两个节点之间的接近度。与传统的聚类算法(例如k均值和DBSCAN)假设的欧几里得空间中存在球形(凸)区域相比,我们的方法在捕获社交图的非连接部分和更大范围的聚类几何形状时更加灵活。我们使用两个合成数据集,三个真实的社交网络和三个真实的人类蛋白质数据集,对提出的方法与现有的链接预测算法和k-means算法进行了广泛的实验比较。我们的实验结果表明,在有效性方面,我们的SpectralLink算法优于本地方法,k-means算法和其他频谱聚类方法,而它比全局方法更有效。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号