International Conference on Information Retrieval and Knowledge Management

Analyzing Malay Stemmer Performance Towards Fuzzy Logic Ranking Function on Malay Text Corpus

Abstract

To make Information Retrieval (IR) results more accurate, a stemmer is needed to reduce words to their stems when searching for useful information. This research analyzes the processing speed and accuracy of two Malay language stemmers, the Fatimah Stemmer and the UniSZA Stemmer. It also compares the performance of the Fuzzy Logic Ranking Function when used with each stemmer. Recall and Precision are evaluated against a relevance judgement list prepared by an expert. The results show that the UniSZA Stemmer is clearly faster than the Fatimah Stemmer, recording shorter processing times in every experiment set. In terms of accuracy, however, the Fatimah Stemmer clearly outperforms the UniSZA Stemmer, producing many more correctly stemmed words in every experiment set. The results also show that Fuzzy Logic Ranking with the Fatimah Stemmer outperforms Fuzzy Logic Ranking with the UniSZA Stemmer and with the English Porter Stemmer on 5 of the 8 topic sets of query results, as measured by Mean Average Precision. On 'Umum', the topic with the most queries (11 in total), Fuzzy Logic Ranking with the Fatimah Stemmer also achieves the best result on Precision at Rank 10, Mean Average Precision, and the percentage of queries with no relevant document among the top ten retrieved.
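
The abstract names three evaluation measures without defining them. As a rough illustration only, the sketch below computes them using the standard textbook IR definitions, which are assumed here and are not taken from the paper. The function names, the `runs`/`qrels` dictionaries, and the document identifiers in the demo are all hypothetical and do not represent the paper's data or code.

```python
from typing import Dict, List, Set


def precision_at_k(ranked: List[str], relevant: Set[str], k: int = 10) -> float:
    """Fraction of the top-k ranks that hold a relevant document."""
    return sum(1 for doc in ranked[:k] if doc in relevant) / k


def average_precision(ranked: List[str], relevant: Set[str]) -> float:
    """Mean of the precision values at each rank where a relevant document appears."""
    if not relevant:
        return 0.0
    hits = 0
    total = 0.0
    for rank, doc in enumerate(ranked, start=1):
        if doc in relevant:
            hits += 1
            total += hits / rank
    # Relevant documents that were never retrieved contribute zero.
    return total / len(relevant)


def evaluate(runs: Dict[str, List[str]],
             qrels: Dict[str, Set[str]],
             k: int = 10) -> Dict[str, float]:
    """Average the per-query measures over all queries in one topic set."""
    n = len(runs)
    p_at_k = 0.0
    ap_sum = 0.0
    no_relevant = 0
    for query, ranked in runs.items():
        relevant = qrels.get(query, set())
        p_at_k += precision_at_k(ranked, relevant, k)
        ap_sum += average_precision(ranked, relevant)
        if not any(doc in relevant for doc in ranked[:k]):
            no_relevant += 1
    return {
        "precision_at_k": p_at_k / n,
        "map": ap_sum / n,
        "pct_no_relevant_in_top_k": 100.0 * no_relevant / n,
    }


if __name__ == "__main__":
    # Made-up rankings and judgements, for illustration only.
    demo_runs = {"q1": ["d1", "d2", "d3", "d4"], "q2": ["d5", "d6"]}
    demo_qrels = {"q1": {"d1", "d3"}, "q2": {"d9"}}
    print(evaluate(demo_runs, demo_qrels))
```

Under these assumed definitions, a ranking function paired with a given stemmer would be scored once per topic set, and the resulting values compared across stemmers.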