Statistical vs. Rule-Based Stemming for Monolingual French Retrieval

机译：基于统计的与规则为基于规则的单机法国检索

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper describes our approach to the 2006 Adhoc Mo-nolingual Information Retrieval run for French. The goal of our experiment was to compare the performance of a proposed statistical stemmer with that of a rule-based stemmer, specifically the French version of Porter’s stemmer. The statistical stemming approach is based on lexicon clustering, using a novel string distance measure. We submitted three official runs, besides a baseline run that uses no stemming. The results show that stemming significantly improves retrieval performance (as expected) by about 9-10%, and the performance of the statistical stemmer is comparable with that of the rule-based stemmer.

机译：本文介绍了我们对2006年Adhoc Mo-Nolingual信息检索运行的方法。我们的实验的目标是比较拟议的统计终结器的性能，以基于规则的终结器，特别是Formber的Porter Sewer。统计干预方法基于Lexicon聚类，使用新颖的串距离测量。除了使用没有源的基线运行，我们提交了三次官方运行。结果表明，令人源性显着提高了检索性能（按预期）约9-10％，统计终止器的性能与基于规则的终止器的性能相当。

著录项

来源
《Workshop of the Cross-Language Evaluation Forum》|2007年||共4页
会议地点
作者
Prasenjit Majumder; Mandar Mitra; Kalyankumar Datta;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类文字和语言;
关键词

相似文献

外文文献
中文文献
专利

1. Statistical Models for Monolingual and Bilingual Information Retrieval [J] . NICOLA BERTOLDI, MARCELLO FEDERICO Information retrieval . 2004,第1a2期

机译：单语和双语信息检索的统计模型
2. A Rule-Based Extensible Stemmer for Information Retrieval with Application to Arabic [J] . Maryam Madani, Shadpour Mallakpour The international arab journal of information technology . 2005,第3期

机译：基于规则的可扩展词干在阿拉伯语中的信息检索
3. Prediction of genotoxic potential of cosmetic ingredients by an in silico battery system consisting of a combination of an expert rule-based system and a statistics-based system [J] . Kaneko Maki Aiba Nee, Hirota Morihiko, Kouzuki Hirokazu, The Journal of toxicological sciences . 2015,第1期

机译：通过硅电池系统预测化妆品成分的遗传毒性潜力，该系统由基于专家规则的系统和基于统计的系统组成
4. Statistical vs. Rule-Based Stemming for Monolingual French Retrieval [C] . Prasenjit Majumder, Mandar Mitra, Kalyankumar Datta Workshop of the Cross-Language Evaluation Forum . 2007

机译：基于统计的与规则为基于规则的单机法国检索
5. Statistical pattern recognition approaches for retrieval-based machine translation systems. [D] . Mansjur, Dwi Sianto. 2011

机译：基于检索的机器翻译系统的统计模式识别方法。
6. An Information Retrieval System for Computerized Patient Records in the Context of a Daily Hospital Practice: the Example of the Léon Bérard Cancer Center (France) [O] . P. Biron, M.H. Metzger, C. Pezet, 2014

机译：日常医院实践中用于计算机病历的信息检索系统：以莱昂贝拉德癌症中心为例（法国）
7. Across the Bridge: CLEF 2001 – Non-English Monolingual Retrieval. The French task. [O] . Eugenia Matoyo, Tony Valsamidis 2008

机译：过桥：CLEF 2001 –非英语单语检索。法国的任务。

Statistical vs. Rule-Based Stemming for Monolingual French Retrieval

摘要

著录项

相似文献

相关主题

期刊订阅