首页> 外文会议>Conference on Computational Linguistics and Speech Processing >Collaborative Ranking between Supervised and Unsupervised Approaches for Keyphrase Extraction
【24h】

Collaborative Ranking between Supervised and Unsupervised Approaches for Keyphrase Extraction

机译:有监督和无监督方法之间的协作排名

获取原文

摘要

Automatic keyphrase extraction methods have generally taken either supervised or unsupervised approaches. Supervised methods extract keyphrases by using a training document set, thus acquiring knowledge from a global collection of texts. Conversely, unsupervised methods extract keyphrases by determining their relevance in a single-document context, without prior learning. We present a hybrid keyphrase extraction method for short articles, HybridRank, which leverages the benefits of both approaches. Our system implements modified versions of the TextRank (Mihalcea and Tarau, 2004)-unsupervised-and KEA (Witten et al., 1999)-supervised-methods, and applies a merging algorithm to produce an overall list of keyphrases. We have tested HybridRank on more than 900 abstracts belonging to a wide variety of subjects, and show its superior effectiveness. We conclude that knowledge collaboration between supervised and unsupervised methods can produce higher-quality keyphrases than applying these methods individually.
机译:自动关键短语提取方法通常采用有监督或无监督的方法。有监督的方法通过使用培训文档集来提取关键短语,从而从全局文本集中获取知识。相反,无监督方法通过在单文档上下文中确定其相关性来提取关键短语,而无需事先学习。我们提出了一种针对短文章的混合关键字短语提取方法HybridRank,它利用了两种方法的优势。我们的系统实现了TextRank(Mihalcea和Tarau,2004)-无监督和KEA(Witten等人,1999)-监督方法的修改版本,并应用了合并算法以生成关键字的整体列表。我们已经对900多种涉及广泛主题的摘要进行了HybridRank测试,并显示了其优越的有效性。我们得出的结论是,与单独应用这些方法相比,有监督方法和无监督方法之间的知识协作可以产生更高质量的关键词。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号