首页> 外文会议>Conference on the North American Chapter of the Association for Computational Linguistics: Human Language Technologies >An Integrated Approach for Keyphrase Generation via Exploring the Power of Retrieval and Extraction
【24h】

An Integrated Approach for Keyphrase Generation via Exploring the Power of Retrieval and Extraction

机译:通过探索检索和提取能力生成关键字的集成方法

获取原文

摘要

In this paper, we present a novel integrated approach for keyphrase generation (KG). Unlike previous works which are purely extractive or generative, we first propose a new multitask learning framework that jointly learns an extractive model and a generative model. Besides extracting keyphrases, the output of the extractive model is also employed to rectify the copy probability distribution of the generative model, such that the generative model can better identify important contents from the given document. Moreover, we retrieve similar documents with the given document from training data and use their associated keyphrases as external knowledge for the generative model to produce more accurate keyphrases. For further exploiting the power of extraction and retrieval, we propose a neural-based merging module to combine and re-rank the predicted keyphrases from the enhanced generative model, the extractive model, and the retrieved keyphrases. Experiments on the five KG benchmarks demonstrate that our integrated approach outperforms the state-of-the-art methods.
机译:在本文中,我们提出了一种用于密钥短语生成(KG)的新颖集成方法。与以前的纯粹是抽取式或生成式的作品不同,我们首先提出了一个新的多任务学习框架,该框架可以共同学习抽取式模型和生成式模型。除了提取关键短语外,提取模型的输出还用于纠正生成模型的复制概率分布,从而使生成模型可以更好地识别给定文档中的重要内容。此外,我们从训练数据中检索与给定文档相似的文档,并将其相关的关键短语用作生成模型的外部知识以生成更准确的关键短语。为了进一步利用提取和检索的功能,我们提出了一个基于神经的合并模块,用于合并和重新排序来自增强型生成模型,提取模型和检索到的关键词的预测关键词。对五个KG基准进行的实验表明,我们的集成方法优于最新方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号