首页> 外文会议>International Workshop on Complex Networks and Their Applications >Experimental Evaluation of Train and Test Split Strategies in Link Prediction
【24h】

Experimental Evaluation of Train and Test Split Strategies in Link Prediction

机译:线路预测中火车和试验分裂策略的实验评价

获取原文

摘要

In link prediction, the goal is to predict which links will appear in the future of an evolving network. To estimate the performance of these models in a supervised machine learning model, disjoint and independent train and test sets are needed. However, objects in a real-world network are inherently related to each other. Therefore, it is far from trivial to separate candidate links into these disjoint sets. Here we characterize and empirically investigate the two dominant approaches from the literature for creating separate train and test sets in link prediction, referred to as random and temporal splits. Comparing the performance of these two approaches on several large temporal network datasets, we find evidence that random splits may result in too optimistic results, whereas a temporal split may give a more fair and realistic indication of performance. Results appear robust to the selection of temporal intervals. These findings will be of interest to researchers that employ link prediction or other machine learning tasks in networks.
机译:在链接预测中,目标是预测在不断发展的网络的未来将出现哪些链接。为了估算在监督机器学习模型中的这些模型的性能,需要不相交和独立的列车和测试集。然而,实际网络中的对象本身与彼此相关。因此,它远远远远地将候选链接分离到这些不相交的集合中。在这里,我们的特征和经验研究了从文献中创建单独列车和链路预测测试集的两个主导方法,称为随机和时间分裂。比较这两种方法的性能在几个大型时间网络数据集上,我们发现随机分裂可能导致过于乐观的结果,而时间拆分可能会提供更公平和现实的性能指示。结果看起来稳健地选择时间间隔。这些发现对于在网络中使用链接预测或其他机器学习任务的研究人员感兴趣。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号