首页> 外文会议>International conference on natural language processing >Unsupervised Keyword Extraction from Polish Legal Texts
【24h】

Unsupervised Keyword Extraction from Polish Legal Texts

机译:波兰法律文本中的无监督关键字提取

获取原文

摘要

In this work, we present an application of the recently proposed unsupervised keyword extraction algorithm RAKE to a corpus of Polish legal texts from the field of public procurement. RAKE is essentially a language and domain independent method. Its only language-specific input is a stoplist containing a set of non-content words. The performance of the method heavily depends on the choice of such a stoplist, which should be domain adopted. Therefore, we complement RAKE algorithm with an automatic approach to selecting non-content words, which is based on the statistical properties of term distribution.
机译:在这项工作中,我们介绍了最近提出的无监督关键字提取算法RAKE在公共采购领域的波兰法律文本语料库中的应用。 RAKE本质上是一种与语言和领域无关的方法。它唯一的特定于语言的输入是包含一组非内容词的非索引字表。该方法的性能在很大程度上取决于对这样一个非索引字表的选择,该非索引字表应该被采用。因此,我们根据词条分布的统计特性,以自动选择非内容词的方式对RAKE算法进行补充。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号