Source: Journal of Supercomputing

A collective entity linking algorithm with parallel computing on large-scale knowledge base

Abstract

Entity linking is a central concern of automatic knowledge-based question answering and knowledge base population. Traditional collective entity linking approaches consider only one of two signals: the context of an entity or the semantic relations between entities. As a result, they perform poorly on Web documents, and their efficiency also needs improvement. This paper proposes a collective entity linking algorithm based on a topic model and a graph. The topic model represents mentions and candidate entities by their topic distributions, making full use of the context in documents. Semantic relations between entities are represented by document similarities computed through the topic model. Parallel computing is used to reduce the long running time caused by topic model construction. An entity graph is constructed according to the relations between entities in the knowledge graph. Hypertext-Induced Topic Search (HITS) is run on the entity graph to compute the hub and authority values of candidate entities, and the authority value is the basis for entity linking. Experimental results on the open-domain corpus NLPCC2014 demonstrate the validity of the proposed method: the approach improves the F1-measure by 5.2% over AGDISTIS.
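
The abstract does not give the details of the graph construction or the scoring, so the following is only a minimal sketch of the final step it describes: computing hub and authority values over an entity graph and linking each mention to its highest-authority candidate. The function names `hits` and `link_mentions`, the unweighted edges, the fixed iteration count, and the toy entities are all assumptions made for illustration, not the authors' implementation (which also uses topic-model similarities that are omitted here).

```python
from math import sqrt


def hits(edges, iterations=20):
    """Plain HITS on a directed graph given as (source, target) pairs.

    Returns two dicts: hub scores and authority scores, each L2-normalized.
    Hypothetical helper; edge weights are ignored in this sketch.
    """
    nodes = sorted({node for edge in edges for node in edge})
    hub = {n: 1.0 for n in nodes}
    auth = {n: 1.0 for n in nodes}
    for _ in range(iterations):
        # Authority of a node: sum of hub scores of nodes linking to it.
        new_auth = {n: 0.0 for n in nodes}
        for src, dst in edges:
            new_auth[dst] += hub[src]
        # Hub of a node: sum of (updated) authority scores of nodes it links to.
        new_hub = {n: 0.0 for n in nodes}
        for src, dst in edges:
            new_hub[src] += new_auth[dst]
        # Normalize so the scores do not grow without bound.
        a_norm = sqrt(sum(v * v for v in new_auth.values())) or 1.0
        h_norm = sqrt(sum(v * v for v in new_hub.values())) or 1.0
        auth = {n: v / a_norm for n, v in new_auth.items()}
        hub = {n: v / h_norm for n, v in new_hub.items()}
    return hub, auth


def link_mentions(candidates, authority):
    """Pick, for each mention, the candidate entity with the highest authority."""
    return {
        mention: max(cands, key=lambda c: authority.get(c, 0.0))
        for mention, cands in candidates.items()
    }


# Toy entity graph (invented for illustration): edges stand for
# knowledge-graph relations between candidate entities.
edges = [
    ("Steve_Jobs", "Apple_Inc"),
    ("Apple_Inc", "Steve_Jobs"),
    ("Jobs_film", "Steve_Jobs"),
    ("Jobs_film", "Apple_Inc"),
    ("Apple_fruit", "Fruit"),
]
candidates = {
    "Apple": ["Apple_Inc", "Apple_fruit"],
    "Jobs": ["Steve_Jobs", "Jobs_film"],
}

hub, auth = hits(edges)
linked = link_mentions(candidates, auth)
print(linked)
```

In this toy graph the candidates that are mutually connected reinforce each other's authority, which is the intuition behind collective linking: mentions in the same document are resolved jointly rather than one at a time.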
