首页> 外文会议>Pacific-Asia Conference on Knowledge Discovery and Data Mining >Extracting Keyphrases from Research Papers Using Word Embeddings
【24h】

Extracting Keyphrases from Research Papers Using Word Embeddings

机译:使用词嵌入从研究论文中提取关键词

获取原文

摘要

Unsupervised random-walk keyphrase extraction models mainly rely on global structural information of the word graph, with nodes representing candidate words and edges capturing the cooccurrence information between candidate words. However, integrating different types of useful information into the representation learning process to help better extract keyphrases is relatively unexplored. In this paper, we propose a random-walk method to extract keyphrases using word embeddings. Specifically, we first design a new word embedding learning model to integrate local context information of the word graph (i.e., the local word collocation patterns) with some crucial features of candidate words and edges. Then, a novel random-walk ranking model is designed to extract keyphrases by leveraging such word embeddings. Experimental results show that our approach outperforms 8 state-of-the-art unsupervised methods on two real datasets consistently for keyphrase extraction.
机译:无监督的随机游动关键词提取模型主要依赖于词图的整体结构信息,代表候选词的节点和捕获候选词之间的共现信息的边。但是,将相对不同类型的有用信息集成到表示学习过程中以帮助更好地提取关键短语是相对未开发的。在本文中,我们提出了一种随机游走法来使用词嵌入来提取关键词。具体来说,我们首先设计一个新的词嵌入学习模型,以将词图的局部上下文信息(即局部词搭配模式)与候选词和边缘的一些关键特征集成在一起。然后,设计了一种新颖的随机游走排名模型,以利用此类单词嵌入来提取关键短语。实验结果表明,我们的方法在两个真实数据集上始终优于8种最新的无监督方法,可以进行关键短语提取。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号