首页> 外文会议>Balkan Conference in Informatics >Bootstrapping the Albanian Information Retrieval
【24h】

Bootstrapping the Albanian Information Retrieval

机译:引导阿尔巴尼亚信息检索

获取原文

摘要

In this paper we investigate the Albanian language and try to uncover the characteristics of the language that will permit the Information Retrieval (IR) community to develop IR systems adapted for the specific language. As a consequence of our study (investigation) we provide a naive-single-step (rudimentary) stemming algorithm for the Albanian language. A stopword list is also created. Human experts are contacted for the evaluation of the provided stemming algorithm. The evaluation method used and the observation of the method's results uncover more rules, which could improve the capabilities of the rudimentary stemming algorithm. We believe that our approach for this specific language could become a standard way for building Information Retrieval functionalities (tools, functions, etc) for languages less perused, as is the language studied in this paper.
机译:在本文中,我们调查阿尔巴尼亚语,并尝试揭示将允许信息检索(IR)社区的语言的特征来开发适用于特定语言的IR系统。由于我们的研究(调查),我们为阿尔巴尼亚语言提供了一个天真的单步(基本)的算法。还创建了一个停止名单。联系人体专家,用于评估所提供的估算算法。使用的评估方法和对方法的观察结果揭示了更多的规则,这可以提高基本催眠算法的能力。我们认为,我们对这种特定语言的方法可能成为建立更少抄袭的语言的信息检索功能(工具,功能等)的标准方法,就像本文所研究的语言一样。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号