首页> 外文期刊>Journal of VLSI signal processing systems >A Query-oriented Adaptive Indexing Technique for Smart Grid Big Data Analytics
【24h】

A Query-oriented Adaptive Indexing Technique for Smart Grid Big Data Analytics

机译:智能网格大数据分析的面向查询的自适应索引技术

获取原文
获取原文并翻译 | 示例
           

摘要

IoT (Internet of Things) based Smart Grid (SG) is defined as a power grid integrated with a large network of smart objects portrayed by information and communication technology. The data sources of IoT-based SG, as well as their correlations, are usually perplexing, which necessitate indexing techniques for complex queries over the SG dataset to efficiently exploit the rich connotations of data to enable characteristic analytics and fault prediction. As part of popular big data platform, HBase is replacing classic relational data- bases to host huge heterogeneous data records in the form of key-value storage. However, most existing secondary index schemes on HBase are managed and retrieved by corresponding data columns instead of queries to incur inefficiency in answering a complex data query. In this paper, we propose an adaptive indexing technique to speed up a complex data query on HBase for IoT-based SG big data. Our proposed method is based on the observation that most analyses over big power grid data focus on data subsets related to specific power grid events or monitoring data instead of the whole dataset. Theoretical analysis and experimental test show that the proposed query-oriented secondary indexing scheme is feasible in improving the query performance. For a join operation, when compared with a query scheme without secondary indexing, our proposed indexing scheme outperforms from a minimum 6.54 × speedup to a maximum 860 × speedup; when compared with a classic secondary indexing scheme implemented on HBase, our indexing scheme outperforms from a minimum 1.20 × speedup to a maximum 8.68 × speedup. Our indexing technique would be a useful reference for other industrial big data practices.
机译:基于物联网(IoT)的智能电网(SG)被定义为与信息和通信技术所描绘的大型智能对象网络集成的电网。基于物联网的SG的数据源及其相关性通常令人困惑,这需要对SG数据集进行复杂查询的索引技术才能有效利用丰富的数据内涵,以进行特征分析和故障预测。作为流行的大数据平台的一部分,HBase正在取代传统的关系数据库,以键值存储的形式托管庞大的异构数据记录。但是,HBase上大多数现有的二级索引方案都是由相应的数据列而不是查询来管理和检索的,从而导致在回答复杂数据查询时效率低下。在本文中,我们提出了一种自适应索引技术,以加快基于IoT的SG大数据在HBase上的复杂数据查询。我们提出的方法基于以下观察结果:对大型电网数据的大多数分析都将重点放在与特定电网事件或监视数据相关的数据子集上,而不是整个数据集。理论分析和实验测试表明,提出的面向查询的二级索引方案在提高查询性能方面是可行的。对于联接操作,与没有二级索引的查询方案相比,我们提出的索引方案的性能要好于最低的6.54×加速到最高的860×加速。与在HBase上实现的经典二级索引方案相比,我们的索引方案的性能从最低的1.20×加速到最高的8.68×加速。我们的索引技术将为其他工业大数据实践提供有用的参考。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号