首页> 外文会议>International conference on web-age information management >NBLucene: Flexible and Efficient Open Source Search Engine
【24h】

NBLucene: Flexible and Efficient Open Source Search Engine

机译:NBLucene:灵活高效的开源搜索引擎

获取原文

摘要

The most popular open source projects for text searching have been designed to support many features. These projects are well-written in Java for cross-platform using. But when conducting research, the execution efficiency of program should be more essential, which is a problem for applications written in Java. It is also difficult for Java to use parallel mechanisms in the modern computer system like SIMD and GPUs. To this end, we expand an open source text searching project written in C++ for research purpose. Our approach is to define a flexible and efficient search engine architecture which consists of extensible application programming interfaces. We aim to provide a flexible architecture to enable researchers to readily implement and modify search engine algorithms and strategies. Moreover, we integrate one generic mathematical encoding library which can be used to compress inverted index. We also implement an integral framework for result summarization, including snippet generation and cache strategies. Experiment results show that the new architecture makes a significant improvement versus original work.
机译:最受欢迎的用于文本搜索的开源项目已被设计为支持许多功能。这些项目用Java编写得很好,可以跨平台使用。但是,在进行研究时,程序的执行效率应该更为重要,这对于用Java编写的应用程序来说是个问题。 Java也很难在现代计算机系统(如SIMD和GPU)中使用并行机制。为此,我们扩展了一个用C ++编写的开源文本搜索项目,以供研究之用。我们的方法是定义一个灵活且高效的搜索引擎体系结构,该体系结构由可扩展的应用程序编程接口组成。我们旨在提供一种灵活的体系结构,以使研究人员能够轻松实现和修改搜索引擎算法和策略。此外,我们集成了一个通用的数学编码库,可用于压缩倒排索引。我们还为结果汇总实现了一个完整的框架,包括代码段生成和缓存策略。实验结果表明,新架构与原始工作相比有了显着改进。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号