Conference: International Conference on computer science education

A distributed search engine based on a re-ranking algorithm model

Abstract

With the rapid increase in the number of websites and the explosive growth of Internet information, centralized search engines face great challenges in processing and storing massive amounts of data. Distributed search engines can address this problem effectively. In this paper, we describe the design and implementation of a distributed search engine based on Apache Nutch, Solr and Hadoop. Taking users' click logs into account, we propose a re-ranking algorithm built on Lucene scoring. Our experimental results show that the approach meets users' demand for searching massive data while maintaining high reliability and scalability.
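
The abstract does not give the re-ranking formula, so the following is only a minimal sketch of the general idea: take a base relevance score (standing in for the Lucene score) and adjust it with evidence from click logs. The function name `rerank`, the `weight` parameter, and the logarithmic damping of click counts are all assumptions made for illustration and are not taken from the paper.

```python
import math
from collections import Counter


def rerank(results, clicked_doc_ids, weight=0.5):
    """Re-rank search results using click counts (illustrative sketch only).

    results:         list of (doc_id, base_score) pairs, where base_score
                     stands in for the score returned by Lucene/Solr.
    clicked_doc_ids: iterable of doc ids that users clicked for this query,
                     one entry per click (hypothetical click-log format).
    weight:          assumed tuning knob for how much clicks matter.
    """
    clicks = Counter(clicked_doc_ids)

    rescored = []
    for doc_id, base_score in results:
        # log1p damps the click count so a few very popular documents
        # cannot completely overwhelm the text-relevance score.
        boost = 1.0 + weight * math.log1p(clicks[doc_id])
        rescored.append((doc_id, base_score * boost))

    # sorted() is stable, so documents with equal final scores keep
    # their original relative order.
    return sorted(rescored, key=lambda pair: pair[1], reverse=True)


if __name__ == "__main__":
    hits = [("a", 2.0), ("b", 1.8), ("c", 0.5)]
    log = ["b", "b", "b", "c"]
    for doc_id, score in rerank(hits, log):
        print(doc_id, round(score, 3))
```

With no clicks recorded, every boost is 1.0 and the original ordering is preserved, so in this sketch the click signal only ever refines the base ranking.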