首页> 外文会议>IEEE International Conference on Data Mining Workshops >Link Prediction in Large Networks by Comparing the Global View of Nodes in the Network
【24h】

Link Prediction in Large Networks by Comparing the Global View of Nodes in the Network

机译:通过比较网络中节点的全局视图,在大型网络中链路预测

获取原文

摘要

Link prediction is an important and well-studiedproblem in network analysis, with a broad range of applicationsincluding recommender systems, anomaly detection, and denoising. The general principle in link prediction is to use thetopological characteristics of the nodes in the network to predictedges that might be added to or removed from the network. While early research utilized local network neighborhood tocharacterize the topological relationship between pairs of nodes, recent studies increasingly show that use of global networkinformation improves prediction performance. Meanwhile, in thecontext of disease gene prioritization and functional annotationin computational biology, "global topological similarity" basedmethods are shown to be effective and robust to noise andascertainment bias. These methods compute topological profilesthat represent the global view of the network from the perspectiveof each node and compare these topological profiles to assess thetopological similarity between nodes. Here, we show that, in thecontext of link prediction in large networks, the performance ofthese global-view based methods can be adversely affected byhigh dimensionality. Motivated by this observation, we proposetwo dimensionality reduction techniques that exploit the sparsityand modularity of networks that are encountered in practicalapplications. Our experimental results on predicting futurecollaborations based on a comprehensive co-authorship networkshows that dimensionality reduction renders global-view basedlink prediction highly effective, and the resulting algorithmssignificantly outperform state-of-the-art link prediction methods.
机译:链路预测是网络分析中的一个重要且良好的问题,具有广泛的应用程序限制推荐系统,异常检测和去噪。链路预测中的一般原理是使用网络中的节点的图文特征来预测可能添加到网络或从网络中删除的预测。虽然早期研究利用当地网络社区进行特征,但是近对节点对之间的拓扑关系,近期的研究越来越越来越多地表明,使用全局网络信息提高了预测性能。同时,在疾病基因优先级排序和功能附带素计算生物学中,“全球拓扑相似性”的基于疾病的方法被证明是有效和稳健的噪声偏离偏差。这些方法从每个节点的角度来计算拓扑Profilesthat代表网络的全局视图,并比较这些拓扑轮廓以评估节点之间的图术例相似性。在这里,我们表明,在大型网络中的链路预测中,基于全局视图的方法的性能可能受到高维度的不利影响。通过该观察,我们预防何种程度的减少技术,这些技术利用了在实际应用中遇到的网络的突出性和模块化。我们对基于全面的共同作者网络智能预测FutureCollat​​ions的实验结果,该网络表的维度降低了全球视图的基于全局 - 视图高效,以及所产生的算法显着优异的最先进的链路预测方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号