...
首页> 外文期刊>JMLR: Workshop and Conference Proceedings >Popularity Agnostic Evaluation of Knowledge Graph Embeddings
【24h】

Popularity Agnostic Evaluation of Knowledge Graph Embeddings

机译:知识图形嵌入的人气不可行评估

获取原文
           

摘要

In this paper, we show that the distribution of entities and relations in common knowledge graphs is highly skewed, with some entities and relations being much more popular than the rest. We show that while knowledge graph embedding models give state-of-the-art performance in many relational learning tasks such as link prediction, current evaluation metrics like hits@k and mrr are biased towards popular entities and relations. We propose two new evaluation metrics, strat-hits@k and strat-mrr, which are unbiased estimators of the true hits@k and mrr when the items follow a power-law distribution. Our new metrics are generalizations of hits@k and mrr that take into account the popularity of the entities and relations in the data, with a tuning parameter determining how much emphasis the metric places on popular vs. unpopular items. Using our metrics, we run experiments on benchmark datasets to show that the performance of embedding models degrades as the popularity of the entities and relations decreases, and that current reported results overestimate the performance of these models by magnifying their accuracy on popular items.
机译:在本文中,我们表明,共同知识图中的实体和关系的分布非常偏向,有些实体和关系比其他实体和关系更受欢迎。我们表明,虽然知识图形嵌入模型在许多关系学习任务中提供最先进的性能,例如链路预测,当前评估度量像Hits @ K和MRR等当前的评估度量朝向流行的实体和关系偏见。我们提出了两个新的评估指标,Strat-Hits @ K和Strat-MRR,这是当项目遵循幂律分布时真正的命中@ K和MRR的无偏见估计。我们的新指标是HITS @ K和MRR的概括,考虑到数据中的实体和关系的普及,并通过调整参数确定高分地区的度量标准场所的重点。我们使用我们的指标,我们在基准数据集中运行实验,以表明嵌入模型的性能降低,因为实体和关系的普及减少,并且当前报告的结果通过放大他们对流行项目的准确性来估计这些模型的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号