首页> 外文期刊>International journal of computers, communications & control >Efficient Historical Query in HBase for Spatio-Temporal Decision Support
【24h】

Efficient Historical Query in HBase for Spatio-Temporal Decision Support

机译:HBase中的有效历史查询,用于时空决策支持

获取原文
获取原文并翻译 | 示例
           

摘要

Comparing to last decade, technologies to gather spatio-temporal data are more and more developed and easy to use or deploy, thus tens of billions, even trillions of sensed data are accumulated, which poses a challenge to spatio-temporal Decision Support System (stDSS). Traditional database hardly supports such huge volume, and tends to bring performance bottleneck to the analysis platform. Hence in this paper, we argue to use NoSQL database, HBase, to replace traditional back-end storage system. Under such context, the well-studied spatio-temporal querying techniques in traditional database should be shifted to HBase system parallel. However, this problem is not solved well in HBase, as many previous works tackle the problem only by designing schema, i.e., designing row key and column key formation for HBase, which we don't believe is an effective solution. In this paper, we address this problem from nature level of HBase, and propose an index structure as a built-in component for HBase. STEHIX (Spatio-TEmporal Hbase IndeX) is adapted to two-level architecture of HBase and suitable for HBase to process spatio-temporal queries. It is composed of index in the meta table (the first level) and region index (the second level) for indexing inner structure of HBase regions. Base on this structure, three queries, range query, kNN query and GNN query are solved by proposing algorithms, respectively. For achieving load balancing and scalable kNN query, two optimizations are also presented. We implement STEHIX and conduct experiments on real dataset, and the results show our design outperforms a previous work in many aspects.
机译:与过去的十年相比,收集时空数据的技术越来越发达,易于使用或部署,因此积累了数百亿甚至数万亿的感知数据,这对时空决策支持系统(stDSS)构成了挑战)。传统数据库几乎无法支持如此庞大的容量,并且往往会给分析平台带来性能瓶颈。因此,在本文中,我们主张使用NoSQL数据库HBase代替传统的后端存储系统。在这种情况下,传统数据库中经过充分研究的时空查询技术应该转移到HBase系统上。但是,这个问题在HBase中不能很好地解决,因为许多先前的工作仅通过设计方案来解决该问题,即为HBase设计行键和列键的格式,我们认为这不是有效的解决方案。在本文中,我们从HBase的本质层面解决了这个问题,并提出了索引结构作为HBase的内置组件。 STEHIX(时空Hbase IndeX)适用于HBase的两级体系结构,适用于HBase处理时空查询。它由元表中的索引(第一级)和区域索引(第二级)组成,用于为HBase区域的内部结构建立索引。在此结构的基础上,通过提出算法分别解决了范围查询,kNN查询和GNN查询三个查询。为了实现负载平衡和可扩展的kNN查询,还提出了两种优化方法。我们实现了STEHIX并在真实数据集上进行了实验,结果表明我们的设计在许多方面都优于以前的工作。

著录项

  • 来源
  • 作者单位

    Science and Technology on Information Systems Engineering Laboratory National University of Defense Technology Changsha 410073, P.R.China;

    Science and Technology on Information Systems Engineering Laboratory National University of Defense Technology Changsha 410073, P.R.China;

    Science and Technology on Information Systems Engineering Laboratory National University of Defense Technology Changsha 410073, P.R.China;

    Science and Technology on Information Systems Engineering Laboratory National University of Defense Technology Changsha 410073, P.R.China;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    spatio-temporal query; HBase; range query; kNN query; GNN query; load balancing;

    机译:时空查询HBase;范围查询;kNN查询;GNN查询;负载均衡;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号