Venue: Conference on Computational Linguistics and Speech Processing

Collaborative Ranking between Supervised and Unsupervised Approaches for Keyphrase Extraction

Abstract

Automatic keyphrase extraction methods have generally taken either supervised or unsupervised approaches. Supervised methods extract keyphrases by using a training document set, thus acquiring knowledge from a global collection of texts. Conversely, unsupervised methods extract keyphrases by determining their relevance in a single-document context, without prior learning. We present HybridRank, a hybrid keyphrase extraction method for short articles that leverages the benefits of both approaches. Our system implements modified versions of the unsupervised TextRank (Mihalcea and Tarau, 2004) and supervised KEA (Witten et al., 1999) methods, and applies a merging algorithm to produce an overall list of keyphrases. We have tested HybridRank on more than 900 abstracts covering a wide variety of subjects, and show its superior effectiveness. We conclude that knowledge collaboration between supervised and unsupervised methods can produce higher-quality keyphrases than applying these methods individually.
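
The abstract does not specify how the merging algorithm combines the two ranked lists. As a rough illustration only, the sketch below merges two keyphrase rankings with reciprocal rank fusion, a generic rank-aggregation technique swapped in here for the purpose of the example; it is not HybridRank's actual merging step. The function name `merge_rankings`, the constant `k`, and the sample phrases are all hypothetical.

```python
from collections import defaultdict


def merge_rankings(rankings, k=60, top_n=10):
    """Merge ranked keyphrase lists with reciprocal rank fusion (RRF).

    NOTE: RRF is an assumption made for illustration; the abstract does
    not describe the merging algorithm that HybridRank uses.
    """
    scores = defaultdict(float)
    for ranking in rankings:
        for rank, phrase in enumerate(ranking, start=1):
            # A phrase ranked highly by either method earns a larger share.
            scores[phrase.lower()] += 1.0 / (k + rank)
    # Sort by descending fused score; break ties alphabetically.
    merged = sorted(scores, key=lambda p: (-scores[p], p))
    return merged[:top_n]


# Hypothetical outputs of an unsupervised and a supervised ranker.
textrank_output = ["keyphrase extraction", "ranking", "hybrid method"]
kea_output = ["keyphrase extraction", "hybrid method", "training documents"]

merged = merge_rankings([textrank_output, kea_output], top_n=3)
print(merged)
```

Phrases that both rankers place near the top accumulate the highest fused score, which is one simple way for single-document and collection-level evidence to reinforce each other.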