首页> 外文会议>IEEE International Conference on Data Mining Workshops >Link Prediction in Large Networks by Comparing the Global View of Nodes in the Network
【24h】

Link Prediction in Large Networks by Comparing the Global View of Nodes in the Network

机译:通过比较网络中节点的全局视图来预测大型网络中的链路

获取原文

摘要

Link prediction is an important and well-studiedproblem in network analysis, with a broad range of applicationsincluding recommender systems, anomaly detection, and denoising. The general principle in link prediction is to use thetopological characteristics of the nodes in the network to predictedges that might be added to or removed from the network. While early research utilized local network neighborhood tocharacterize the topological relationship between pairs of nodes, recent studies increasingly show that use of global networkinformation improves prediction performance. Meanwhile, in thecontext of disease gene prioritization and functional annotationin computational biology, "global topological similarity" basedmethods are shown to be effective and robust to noise andascertainment bias. These methods compute topological profilesthat represent the global view of the network from the perspectiveof each node and compare these topological profiles to assess thetopological similarity between nodes. Here, we show that, in thecontext of link prediction in large networks, the performance ofthese global-view based methods can be adversely affected byhigh dimensionality. Motivated by this observation, we proposetwo dimensionality reduction techniques that exploit the sparsityand modularity of networks that are encountered in practicalapplications. Our experimental results on predicting futurecollaborations based on a comprehensive co-authorship networkshows that dimensionality reduction renders global-view basedlink prediction highly effective, and the resulting algorithmssignificantly outperform state-of-the-art link prediction methods.
机译:链接预测是网络分析中一个重要且经过充分研究的问题,具有广泛的应用,包括推荐系统,异常检测和去噪。链路预测的一般原理是使用网络中节点的拓扑特征来预测可能会添加到网络或从网络中删除的预测。尽管早期的研究利用本地网络邻域来表征节点对之间的拓扑关系,但最近的研究越来越表明,使用全局网络信息可以提高预测性能。同时,在计算生物学中疾病基因优先级划分和功能注释的背景下,基于“全局拓扑相似性”的方法被证明对噪声和确定性偏倚是有效且鲁棒的。这些方法从每个节点的角度计算代表网络全局视图的拓扑配置文件,并比较这些拓扑配置文件以评估节点之间的拓扑相似性。在这里,我们表明,在大型网络的链接预测的背景下,这些基于全局视图的方法的性能可能会受到高维度的不利影响。基于这种观察,我们提出了二维降维技术,该技术利用了在实际应用中遇到的网络的稀疏性和模块化。我们基于全面合著网络对未来合作进行预测的实验结果表明,降维处理使基于全局视图的链接预测非常有效,并且所产生的算法明显优于最新的链接预测方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号