...
首页> 外文期刊>Cloud Computing, IEEE Transactions on >Fast, Ad Hoc Query Evaluations over Multidimensional Geospatial Datasets
【24h】

Fast, Ad Hoc Query Evaluations over Multidimensional Geospatial Datasets

机译:对多维地理空间数据集进行快速的临时查询评估

获取原文
获取原文并翻译 | 示例
           

摘要

Networked observational devices and remote sensing equipment continue to proliferate and contribute to the accumulation of extreme-scale datasets. Both the rate and resolution of the readings produced by these devices have grown over time, exacerbating the issues surrounding their storage and management. In many cases, the sheer scale of the information being maintained makes timely analysis infeasible due to the computational workloads required to process the data. While distributed solutions provide a scalable way to cope with data volumes, the communication and latency involved when inspecting large portions of an overall dataset limit applications that require frequent or rapid responses to incoming queries. This study investigates the challenges associated with providing approximate or exploratory answers to distributed queries. In many situations, this requires striking a balance between response times and error rates to produce meaningful results. To enable these use cases, we outline several expressive query constructs and describe their implementation; rather than relying on summary tables or pre-computed samples, our solution involves a coarse-grained global index that maintains statistics and models the relationships across dimensions in the dataset. To illustrate the benefits of these techniques, we include performance benchmarks on a real-world dataset in a production environment.
机译:联网的观察设备和遥感设备继续激增,并促进了超大规模数据集的积累。这些设备产生的读数的速率和分辨率都随着时间的推移而增长,加剧了围绕其存储和管理的问题。在许多情况下,由于处理数据需要大量的计算工作量,因此要保留的信息绝对规模使及时分析变得不可行。尽管分布式解决方案提供了一种可扩展的方式来处理数据量,但是在检查整个数据集的大部分时所涉及的通信和延迟限制了需要频繁或快速响应传入查询的应用程序。这项研究调查了与为分布式查询提供近似或探索性答案相关的挑战。在许多情况下,这需要在响应时间和错误率之间取得平衡,以产生有意义的结果。为了启用这些用例,我们概述了几种表达性查询构造并描述了它们的实现。我们的解决方案不依赖于汇总表或预先计算的样本,而是涉及一个粗粒度的全局索引,该索引可以维护统计数据并为数据集中各个维度之间的关系建模。为了说明这些技术的好处,我们在生产环境中的真实数据集上包含了性能基准。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号