
Merging Multiple Search Results Approach for Meta-Search Engines


Abstract

Meta-search engines are search tools developed to improve search performance by submitting user queries to multiple search engines and combining the results into a single ranked list. They use data fusion, which requires three major steps: database selection, results combination, and results merging. This study builds a framework for merging the search results retrieved from any set of search engines. The framework is based on answering three major questions:

1. How can meta-search developers define the optimal rank order for the selected engines?
2. How can meta-search developers choose the best combination of search engines?
3. What is the optimal heuristic merging function for aggregating the rank order of documents retrieved from incomparable search engines?

Data were collected by running 40 general queries on three major search engines (Google, AltaVista, and Alltheweb). Real users took part in the relevance judgment process, using a five-point relevance scale. The performance of the three search engines, their different combinations, and different merging algorithms was compared in order to rank the databases, choose the best combination, and define the optimal merging function.

The major findings of this study are: (1) ranking the databases in the merging process should depend on their overall performance, not on their popularity or size; (2) larger databases tend to perform better than smaller databases; (3) the combination of search engines should depend on the database ranking and on choosing the appropriate combination function; (4) search engines tend to retrieve more overlapping relevant documents than overlapping irrelevant documents; and (5) merging functions that take overlapped documents into account tend to perform better than the interleave and rank similarity functions. In addition to these findings, the study developed a set of requirements for a successful merging process. This procedure includes database selection, combination, and merging based on heuristic solutions.
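The abstract does not define its merging functions, so the following Python sketch is only an illustration of the contrast drawn in finding (5). It compares a plain interleave (round-robin) merge with a merge that rewards documents returned by more than one engine. The function names, the Borda-style scoring rule, and the sample document IDs are all assumptions made for this example, not the study's actual method.

```python
from collections import defaultdict


def interleave_merge(ranked_lists):
    """Round-robin merge: take rank 1 from each engine, then rank 2, and so on.

    Engines are visited in the order given (assumed here to be their
    performance order). A duplicate is kept only at its first position,
    so overlap between engines earns a document no extra credit.
    """
    merged, seen = [], set()
    longest = max((len(lst) for lst in ranked_lists), default=0)
    for rank in range(longest):
        for lst in ranked_lists:
            if rank < len(lst) and lst[rank] not in seen:
                seen.add(lst[rank])
                merged.append(lst[rank])
    return merged


def overlap_aware_merge(ranked_lists):
    """Hypothetical Borda-style merge that takes overlap into account.

    Each engine gives a document a score in (0, 1] based on its rank;
    scores are summed, so documents found by several engines rise.
    Ties are broken by the order in which documents were first seen.
    """
    scores = defaultdict(float)
    first_seen = {}
    for lst in ranked_lists:
        n = len(lst)
        for rank, doc in enumerate(lst):
            scores[doc] += (n - rank) / n
            first_seen.setdefault(doc, len(first_seen))
    return sorted(scores, key=lambda d: (-scores[d], first_seen[d]))


if __name__ == "__main__":
    # Made-up result lists from three engines, best engine first.
    engine_a = ["d1", "d2", "d3"]
    engine_b = ["d4", "d2", "d5"]
    engine_c = ["d2", "d6", "d1"]
    lists = [engine_a, engine_b, engine_c]
    print(interleave_merge(lists))     # ['d1', 'd4', 'd2', 'd6', 'd3', 'd5']
    print(overlap_aware_merge(lists))  # ['d2', 'd1', 'd4', 'd6', 'd3', 'd5']
```

In this toy input, d2 is returned by all three engines. The interleave merge places it third, while the overlap-aware merge moves it to the top, which is the kind of behavior finding (5) associates with better performance.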
