
Subject-Keyphrase Extraction Based on Definition-Use Chain



Abstract

In this paper, we propose a new concept called the subject-keyphrase and introduce a method to extract subject-keyphrases from a document. Subject-keyphrases are the words or phrases that represent the sentence subjects of a document; the premise is that the content of a document is organized around its subjects. Each sentence of a document can be expected to consist of a subject and an object, where the subject is defined in relation to its object. Using these "definition" and "use" relations, we can identify the subjects of a given document by looking for keyphrases that appear frequently as subjects of its sentences. We present a subject-keyphrase extraction (SKE) algorithm based on the notion of the definition-use chain (DU chain) to identify subject-keyphrases. Experimental results show that SKE successfully identifies subject-keyphrases that capture the main idea of a document.
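
The abstract does not spell out the SKE algorithm, so the following is only a rough Python sketch of the general idea it describes: count how often a phrase appears as a sentence subject (treated here as a "definition") and how often it recurs elsewhere (treated here as a "use"). Everything in it is an assumption made for illustration, not the authors' method: the names `KNOWN_VERBS`, `STOPWORDS`, `subject_of`, and `rank_subject_phrases` are hypothetical, the subject is crudely approximated as the words before the first verb found in a tiny hand-written verb list, and the score is a simple sum of the two counts.

```python
import re
from collections import Counter

# Hypothetical, tiny verb list used only to find where the subject ends.
# A real system would use a part-of-speech tagger or parser instead.
KNOWN_VERBS = {"is", "are", "was", "were", "use", "uses",
               "define", "defines", "extract", "extracts"}
STOPWORDS = {"the", "a", "an", "this", "these", "that"}


def split_sentences(text):
    """Split on sentence-ending punctuation and drop empty pieces."""
    return [s.strip() for s in re.split(r"[.!?]+", text) if s.strip()]


def subject_of(sentence):
    """Return the words before the first known verb (minus stopwords), or None."""
    words = re.findall(r"[a-z][a-z\-]*", sentence.lower())
    for i, word in enumerate(words):
        if word in KNOWN_VERBS:
            subject = [w for w in words[:i] if w not in STOPWORDS]
            return " ".join(subject) if subject else None
    return None  # no known verb, so no subject can be identified


def rank_subject_phrases(text):
    """Rank phrases by (times seen as subject) + (times seen elsewhere)."""
    sentences = split_sentences(text)

    # "Definition" count (assumption): the phrase is the subject of a sentence.
    definitions = Counter()
    for sentence in sentences:
        subject = subject_of(sentence)
        if subject:
            definitions[subject] += 1

    # "Use" count (assumption): the phrase occurs in a sentence
    # whose subject is something else.
    uses = Counter()
    for phrase in definitions:
        for sentence in sentences:
            if subject_of(sentence) != phrase and phrase in sentence.lower():
                uses[phrase] += 1

    scores = {p: definitions[p] + uses[p] for p in definitions}
    # Highest score first; ties broken alphabetically for a stable order.
    return sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))


if __name__ == "__main__":
    sample = ("A compiler is a program. The compiler uses a parser. "
              "A parser is a component. Users like the compiler.")
    for phrase, score in rank_subject_phrases(sample):
        print(phrase, score)
```

On the sample text, "compiler" is the subject of two sentences and recurs in one more, so it ranks above "parser". The substring match used for "uses" is deliberately naive and would need proper phrase matching on real documents.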
