Journal: International Journal of Data Mining & Knowledge Management Process

Arabic Words Stemming Approach Using Arabic Wordnet


Abstract

The rapid growth of Arabic internet content in recent years has raised the need for effective stemming techniques for the Arabic language. Arabic stemming algorithms fall into three categories: root-based approaches (e.g., Khoja), stem-based approaches (e.g., Larkey), and statistical approaches (e.g., N-gram). However, no stemmer for this language is perfect: the existing stemmers have low efficiency. In this paper, we introduce a new stemming technique for Arabic words that also solves the problem of the plural form of irregular nouns in Arabic, known as the broken plural. The proposed stem extractor provides very accurate results in comparison with other algorithms. Consequently, search effectiveness is improved.
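To make the broken-plural problem concrete, here is a minimal sketch of a stem-based (light) stemmer that only strips affixes. It is not the technique proposed in the paper, and it is not Larkey's exact algorithm: the function name `light_stem`, the `min_len` threshold, and the short affix lists are assumptions chosen for illustration. It suggests why affix stripping can conflate a regular (sound) plural with its singular but leaves a broken plural untouched, since a broken plural changes the internal pattern of the word rather than adding a suffix.

```python
# Illustrative affix lists (assumed for this sketch, not taken from the paper).
# Longer prefixes come first so that the longest match is tried first.
PREFIXES = ("وال", "بال", "كال", "فال", "ال", "لل", "و")
SUFFIXES = ("ها", "ان", "ات", "ون", "ين", "ية", "ه", "ة", "ي")


def light_stem(word: str, min_len: int = 3) -> str:
    """Strip at most one prefix and one suffix, keeping at least min_len letters."""
    for prefix in PREFIXES:
        if word.startswith(prefix) and len(word) - len(prefix) >= min_len:
            word = word[len(prefix):]
            break
    for suffix in SUFFIXES:
        if word.endswith(suffix) and len(word) - len(suffix) >= min_len:
            word = word[: -len(suffix)]
            break
    return word


if __name__ == "__main__":
    # Sound plural: "teachers" and "teacher" reduce to the same stem.
    print(light_stem("معلمون") == light_stem("معلم"))
    # Broken plural: "books" and "the book" do not reduce to the same stem,
    # because the plural is formed by changing the word's internal pattern.
    print(light_stem("كتب") == light_stem("الكتاب"))
```

Under these assumptions, the sound plural pair maps to one stem while the broken plural pair does not, which is the gap a dictionary-backed resource such as Arabic WordNet is meant to help close.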
