首页> 外文会议>Advances in focused retrieval >TopX 2.0 at the INEX 2008 Efficiency Track A (Very) Fast Object-Store for Top-k-Style XML Full-Text Search
【24h】

TopX 2.0 at the INEX 2008 Efficiency Track A (Very) Fast Object-Store for Top-k-Style XML Full-Text Search

机译:用于IN-K风格XML全文搜索的INEX 2008效率轨道A(非常)快速对象存储中的TopX 2.0

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

For the INEX Efficiency Track 2008, we were just on time to finish and evaluate our brand-new TopX 2.0 prototype. Complementing our long-running effort on efficient top-k query processing on top of a relational back-end, we now switched to a compressed object-oriented storage for text-centric XML data with direct access to customized inverted files, along with a complete reim-plementation of the engine in C++. Our INEX 2008 experiments demonstrate efficiency gains of up to a factor of 30 compared to the previous Java/JDBC-based TopX 1.0 implementation over a relational back-end. TopX 2.0 achieves overall runtimes of less than 51 seconds for the entire batch of 568 Efficiency Track topics in their content-and-structure (CAS) version and less than 29 seconds for the content-only (CO) version, respectively, using a top-15, focused (i.e., non-overlapping) retrieval mode-an average of merely 89 ms per CAS query and 49 ms per CO query.
机译:对于2008年INEX效率追踪,我们正好按时完成并评估了我们全新的TopX 2.0原型。长期以来我们一直致力于在关系后端之上进行高效的top-k查询处理,这是对我们长期努力的一种补充,现在,我们转向了压缩的面向对象的存储,以文本为中心的XML数据,可以直接访问自定义的反向文件以及完整的文件。 C ++中引擎的补充。我们的INEX 2008实验证明,与关系后端相比,以前的基于Java / JDBC的TopX 1.0实现与之相比,效率提高了30倍。使用topX,TopX 2.0的内容和结构(CAS)版本的全部568个效率跟踪主题的总运行时间分别不到51秒和仅内容(CO)版本的不到29秒。如图-15所示,聚焦(即非重叠)检索模式-每个CAS查询平均只有89毫秒,每个CO查询平均只有49毫秒。

著录项

相似文献

  • 外文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号