【24h】

FAST: Near Real-Time Searchable Data Analytics for the Cloud

机译:快速:云的近实时可搜索数据分析

获取原文

摘要

With the explosive growth in data volume and complexity and the increasing need for highly efficient searchable data analytics, existing cloud storage systems have largely failed to offer an adequate capability for real-time data analytics. Since the true value or worth of data heavily depends on how efficiently data analytics can be carried out on the data in (near-) real-time, large fractions of data end up with their values being lost or significantly reduced due to the data staleness. To address this problem, we propose a near-real-time and cost-effective searchable data analytics methodology, called FAST. The idea behind FAST is to explore and exploit the semantic correlation within and among datasets via correlation-aware hashing and manageable flat-structured addressing to significantly reduce the processing latency, while incurring acceptably small loss of data-search accuracy. The near-real-time property of FAST enables rapid identification of correlated files and the significant narrowing of the scope of data to be processed. FAST supports several types of data analytics, which can be implemented in existing searchable storage systems. We conduct a real-world use case in which children reported missing in an extremely crowded environment (e.g., A highly popular scenic spot on a peak tourist day) are identified in a timely fashion by analyzing 60 million images using FAST. Extensive experimental results demonstrate the efficiency and efficacy of FAST in the performance improvements and energy savings.
机译:随着数据量和复杂性的爆炸性增长以及对高效可搜索数据分析的日益增长的需求,现有的云存储系统在很大程度上未能为实时数据分析提供足够的功能。由于数据的真实价值或价值在很大程度上取决于(近)实时对数据进行高效数据分析的能力,因此大部分数据最终会由于数据陈旧而丢失或显着降低其价值。为了解决此问题,我们提出了一种称为FAST的近实时且经济高效的可搜索数据分析方法。 FAST背后的想法是通过感知关联的哈希和可管理的平面结构化寻址来探索和利用数据集中的语义相关性,以显着减少处理延迟,同时损失很小的数据搜索精度。 FAST的近实时属性可以快速识别相关文件,并显着缩小要处理的数据范围。 FAST支持多种类型的数据分析,可以在现有的可搜索存储系统中实施。我们进行了一个真实的用例,其中使用FAST分析了6000万张图像,从而及时识别出在非常拥挤的环境中报告失踪的儿童(例如,在旅游高峰日一个非常受欢迎的景点)的情况。大量的实验结果证明了FAST在性能改进和节能方面的效率和功效。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号