...
首页> 外文期刊>Expert Systems with Application >Document-specific keyphrase candidate search and ranking
【24h】

Document-specific keyphrase candidate search and ranking

机译:特定于文档的关键词候选搜索和排名

获取原文
获取原文并翻译 | 示例
           

摘要

This paper proposes an approach KeyRank to extract proper keyphrases from a document in English. It first searches all keyphrase candidates from the document, and then ranks them for selecting top-N ones as final keyphrases. Existing studies show that extracting a complete keyphrase candidate set that includes semantic relations in context, and evaluating the effectiveness of each candidate are crucial to extract high quality keyphrases from documents. Based on that words do not repeatedly appear in an effective keyphrase in English, a novel keyphrase candidate search algorithm using sequential pattern mining with gap constraints (called KCSP) is proposed to extract keyphrase candidates for KeyRank. And then an effectiveness evaluation measure pattern frequency with entropy (called PF-H) is proposed for KeyRank to rank these keyphrase candidates. Our experimental results show that KeyRank has better performance. Its first component KCSP is much more efficient than a closely related approach SPMW, and its second component PF-H is an effective evaluation mechanism for ranking keyphrase candidates.(1) (C) 2017 Elsevier Ltd. All rights reserved.
机译:本文提出了一种KeyRank方法,用于从英文文档中提取适当的关键词短语。它首先从文档中搜索所有候选关键字,然后对它们进行排名,以选择前N个关键字作为最终关键字。现有研究表明,提取包括上下文中的语义关系的完整关键字短语候选集,并评估每个候选关键字的有效性对于从文档中提取高质量关键字短语至关重要。鉴于单词不会在英语中的有效关键字短语中反复出现,提出了一种新的关键字候选候选搜索算法,该算法使用具有间隙约束的顺序模式挖掘(称为KCSP)来提取KeyRank的关键字候选。然后提出了一种具有熵的有效性评估度量模式频率(称为PF-H),用于KeyRank对这些候选关键短语进行排名。我们的实验结果表明KeyRank具有更好的性能。它的第一个组件KCSP比紧密相关的方法SPMW效率要高得多,而第二个组件PF-H是对候选关键短语进行排名的有效评估机制。(1)(C)2017 Elsevier Ltd.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号