首页> 外文期刊>Journal of Information Science >Supervised learning for building stemmers
【24h】

Supervised learning for building stemmers

机译:建立词干的监督学习

获取原文
获取原文并翻译 | 示例
           

摘要

This work is part of a project aiming to define a methodology for building simple but robust stemmers, having primitive knowledge of the stemmer's target language. The methodology starts with a very simple primary stemmer that simply removes the longest suffix (using the primitive knowledge - the list of available suffixes) that matches the ending of the examined word. Information retrieval (IR) experts express their arguments against the results of the primary stemmer. These (the experts' arguments) are valuable knowledge that offer us the ability to apply supervised learning in order to automatically produce better stemmers (that conform to the arguments expressed by the IR experts). We also conduct an evaluation of our supervised learning-based methodology that builds stemmers for languages that the experts do not have knowledge on.
机译:这项工作是一个项目的一部分,该项目旨在定义一种构建简单但强大的词干分析器的方法,并具有对词干分析器目标语言的原始知识。该方法从一个非常简单的主词干开始,该词干简单地删除了与所检查单词的末尾匹配的最长后缀(使用原始知识-可用后缀的列表)。信息检索(IR)专家对主要词干的结果表示反对。这些(专家的论据)是宝贵的知识,这些知识使我们能够应用监督学习,以自动生成更好的词干(与IR专家表达的论点一致)。我们还对基于监督的基于学习的方法进行了评估,该方法为专家不了解的语言构建了词干分析器。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号