...
首页> 外文期刊>ACM SIGIR FORUM >Scalability and Efficiency Challenges inrnLarge-Scale Web Search Engines
【24h】

Scalability and Efficiency Challenges inrnLarge-Scale Web Search Engines

机译:大型Web搜索引擎的可伸缩性和效率挑战

获取原文
获取原文并翻译 | 示例
           

摘要

Commercial web search engines need to process thousands ofrnqueries every second and provide responses to user queriesrnwithin a few hundred milliseconds. As a consequence ofrnthese tight performance constraints, search engines constructrnand maintain very large computing infrastructuresrnfor crawling the Web, indexing discovered pages, and processingrnuser queries. The scalability and eu000eciency of theserninfrastructures require careful performance optimizations inrnevery major component of the search engine.rnThis tutorial aims to provide a fairly comprehensivernoverview of the scalability and eu000eciency challenges in largescalernweb search engines. In particular, the tutorial providesrnan in-depth architectural overview of a web search engine,rnmainly focusing on the web crawling, indexing, and queryrnprocessing components. The scalability and eu000eciency issuesrnencountered in these components are presented at four diu000berentrngranularities: at the level of a single computer, a clusterrnof computers, a single data center, and a multi-center searchrnengine. The tutorial also points out some open researchrnproblems and provides recommendations to researchers whornare new to the feld.
机译:商业网络搜索引擎需要每秒处理数千个查询,并在几百毫秒内提供对用户查询的响应。由于这些严格的性能约束,搜索引擎构建并维护了非常大的计算基础结构,以用于爬网,索引发现的页面以及处理用户查询。网络基础架构的可扩展性和可扩展性要求在搜索引擎的每个主要组件中都进行仔细的性能优化。本教程旨在对大型网络搜索引擎的可扩展性和可扩展性挑战提供一个相当全面的概述。特别是,本教程对Web搜索引擎进行了更深入的体系结构概述,主要侧重于Web爬网,索引和查询处理组件。这些组件中遇到的可伸缩性和效率问题以四个不同的粒度呈现:在单台计算机,集群计算机,单个数据中心和多中心搜索引擎的级别。本教程还指出了一些开放的研究问题,并向刚接触该领域的研究人员提供了建议。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号