首页> 外文会议>International conference on computational linguistics >Single Document Keyphrase Extraction Using Label Information
【24h】

Single Document Keyphrase Extraction Using Label Information

机译:使用标签信息提取单文档关键词

获取原文

摘要

Keyphrases have found wide ranging application in NLP and IR tasks such as document summarization, indexing, labeling, clustering and classification. In this paper we pose the problem of extracting label specific keyphrases from a document which has document level metadata associated with it namely labels or tags (i.e. multi-labeled document). Unlike other, supervised or unsupervised, methods for keyphrase extraction our proposed methods utilizes both the document's text and label information for the task of extracting label specific keyphrases. We propose two models for this purpose both of which model the problem of extracting label specific keyphrases as a random walk on the document's text graph. We evaluate and report the quality of the extracted keyphrases on a popular multi-label text corpus.
机译:关键短语已在NLP和IR任务中得到广泛应用,例如文档摘要,索引,标记,聚类和分类。在本文中,我们提出了从具有与之相关联的文档级元数据(即标签或标签)(即多标签文档)的文档中提取标签特定关键字短语的问题。与其他有监督或无监督的关键词提取方法不同,我们提出的方法利用文档的文本和标签信息来提取特定于标签的关键词。为此,我们提出了两个模型,这两个模型都将提取特定于标签的关键字短语作为在文档的文本图上随机游走的问题建模。我们评估并报告了流行的多标签文本语料库上提取的关键词的质量。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号