Journal: Information and Knowledge Management

Implemented Stemming Algorithms for Information Retrieval Applications

Abstract

Text documents are increasingly distributed over the internet through e-mail and web pages. As internet use grows exponentially, so does the need for large-scale data storage. Many documents contain morphological variants of words, so stemming, a preprocessing technique, maps the different morphological variants of a word to a common base form called the stem. Information retrieval applications use stemming to improve retrieval performance, on the assumption that terms with the same stem usually have similar meanings. Stemming large documents normally requires more computation time and power to support searching for a particular word in the data. This paper analyzes various stemming algorithms, along with the benefits and limitations of recent stemming methods.
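
To make the idea of mapping morphological variants to a shared stem concrete, here is a minimal Python sketch. It is not one of the algorithms the paper analyzes: the suffix list, the minimum stem length of three characters, and the names toy_stem and build_index are all assumptions chosen for illustration. It only shows how conflating variants lets a single query term match several documents.

```python
# Toy suffix-stripping stemmer (illustrative only; not Porter's or any
# published algorithm). The rule list below is a hypothetical assumption.
SUFFIXES = ("ing", "ed", "es", "s")  # checked in this order


def toy_stem(word):
    """Strip the first matching suffix, keeping at least 3 characters of stem."""
    w = word.lower()
    for suffix in SUFFIXES:
        if w.endswith(suffix) and len(w) - len(suffix) >= 3:
            return w[: -len(suffix)]
    return w


def build_index(docs):
    """Map each stem to the set of document ids that contain a matching word."""
    index = {}
    for doc_id, text in docs.items():
        for token in text.split():
            index.setdefault(toy_stem(token), set()).add(doc_id)
    return index


# Hypothetical three-document collection.
docs = {
    "d1": "connected networks",
    "d2": "connecting a network",
    "d3": "connects",
}
index = build_index(docs)

# The query term is stemmed the same way as the indexed text, so one lookup
# reaches every document that contains a variant of "connect".
print(sorted(index[toy_stem("connect")]))
```

A rule list this crude will over-stem and under-stem many words, which is the kind of trade-off a comparison of stemming algorithms has to weigh.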
