首页> 外文会议>Workshop of the Cross-Language Evaluation Forum >Statistical vs. Rule-Based Stemming for Monolingual French Retrieval
【24h】

Statistical vs. Rule-Based Stemming for Monolingual French Retrieval

机译:基于统计的与规则为基于规则的单机法国检索

获取原文

摘要

This paper describes our approach to the 2006 Adhoc Mo-nolingual Information Retrieval run for French. The goal of our experiment was to compare the performance of a proposed statistical stemmer with that of a rule-based stemmer, specifically the French version of Porter’s stemmer. The statistical stemming approach is based on lexicon clustering, using a novel string distance measure. We submitted three official runs, besides a baseline run that uses no stemming. The results show that stemming significantly improves retrieval performance (as expected) by about 9-10%, and the performance of the statistical stemmer is comparable with that of the rule-based stemmer.
机译:本文介绍了我们对2006年Adhoc Mo-Nolingual信息检索运行的方法。我们的实验的目标是比较拟议的统计终结器的性能,以基于规则的终结器,特别是Formber的Porter Sewer。统计干预方法基于Lexicon聚类,使用新颖的串距离测量。除了使用没有源的基线运行,我们提交了三次官方运行。结果表明,令人源性显着提高了检索性能(按预期)约9-10%,统计终止器的性能与基于规则的终止器的性能相当。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号