首页> 外文会议>Advances in Information Systems >Information Retrieval Effectiveness of Turkish Search Engines
【24h】

Information Retrieval Effectiveness of Turkish Search Engines

机译:土耳其搜索引擎的信息检索效率

获取原文

摘要

This is an investigation of information retrieval performance of Turkish search engines with respect to precision, normalized recall, coverage and novelty ratios. We defined seventeen query topics for Arabul, Arama, Netbul and Superonline. These queries were carefully selected to assess the capability of a search engine for handling broad or narrow topic subjects, exclusion of particular information, identifying and indexing Turkish characters, retrieval of hub/authoritative pages, stemming of Turkish words, correct interpretation of Boolean operators. We classified each document in a retrieval output as being "relevant" or "nonrelevant" to calculate precision and normalized recall ratios at various cut-off points for each pair of query topic and search engine. We found the coverage and novelty ratios for each search engine. We also tested how search engines handle meta-tags and dead links. Arama appears to be the best Turkish search engine in terms of average precision and normalized recall ratios, and the coverage of Turkish sites. Turkish characters (and stemming as well) still cause bottlenecks for Turkish search engines. Superonline and Netbul make use of the indexing information in metatag fields to improve retrieval results.
机译:这是对土耳其搜索引擎在准确性,归一化召回率,覆盖率和新颖性比率方面的信息检索性能的调查。我们为Arabul,Arama,Netbul和Superonline定义了17个查询主题。这些查询经过精心选择,以评估搜索引擎处理宽泛或狭窄主题主题,排除特定信息,识别和索引土耳其语字符,检索中心/权威页面,土耳其语词干,布尔运算符的正确解释的能力。我们将检索输出中的每个文档分类为“相关”或“不相关”,以计算每对查询主题和搜索引擎在各个截止点的精确度和归一化的查全率。我们找到了每个搜索引擎的覆盖率和新颖性比率。我们还测试了搜索引擎如何处理元标记和无效链接。就平均精确度和标准化召回率以及土耳其站点的覆盖范围而言,Arama似乎是最好的土耳其搜索引擎。土耳其语字符(以及词干)也仍然是土耳其搜索引擎的瓶颈。 Superonline和Netbul利用元标记字段中的索引信息来改善检索结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号