首页> 外文期刊>Future generation computer systems >Bloom filter based optimization scheme for massive data handling in IoT environment
【24h】

Bloom filter based optimization scheme for massive data handling in IoT environment

机译:基于Bloom过滤器的优化方案,用于IoT环境中的海量数据处理

获取原文
获取原文并翻译 | 示例
       

摘要

With the widespread popularity of big data usage across various applications, need for efficient storage, processing, and retrieval of massive datasets generated from different applications has become inevitable. Further, handling of these datasets has become one of the biggest challenges for the research community due to the involved heterogeneity in their formats. This can be attributed to their diverse sources of generation ranging from sensors to on-line transactions data and social media access. In this direction, probabilistic data structures (PDS) are suitable for large-scale data processing, approximate predictions, fast retrieval and unstructured data storage. In conventional databases, entire data needs to be stored in memory for efficient processing, but applications involving real time in-stream data demand time-bound query output in a single pass. Hence, this paper proposes Accommodative Bloom filter (ABF), a variant of scalable bloom filter, where insertion of bulk data is done using the addition of new filters vertically. Array of m bits is divided into b buckets oflbits each and new filters of size‘m∕k′are added to each bucket to accommodate the incoming data. Data generated from various sensors has been considered for experimental purposes where query processing is done at two levels to improve the accuracy and reduce the search time. It has been found that insertion and search time complexity of ABF does not increase with increase in number of elements. Further, results indicate that ABF outperforms the existing variants of Bloom filters in terms of false positive rates and query complexity, especially when dealing with in-stream data.
机译:随着大数据在各种应用程序中的广泛普及,高效存储,处理和检索从不同应用程序生成的海量数据集的需求已成为必然。此外,由于其格式涉及异质性,处理这些数据集已成为研究界面临的最大挑战之一。这可以归因于其多样化的生成来源,从传感器到在线交易数据以及社交媒体访问。在这个方向上,概率数据结构(PDS)适用于大规模数据处理,近似预测,快速检索和非结构化数据存储。在常规数据库中,需要将整个数据存储在内存中以进行有效处理,但是涉及实时流内数据的应用程序需要在一次通过中进行限时查询输出。因此,本文提出了可调节布隆过滤器(ABF),它是可伸缩布隆过滤器的一种变体,其中批量数据的插入使用垂直添加的新过滤器完成。 m位数组分为b个桶,每个桶,并且为每个桶添加大小为“ m ∕ k”的新过滤器,以容纳传入的数据。已经考虑了从各种传感器生成的数据用于实验目的,其中在两个级别上执行查询处理以提高准确性并减少搜索时间。已经发现,ABF的插入和搜索时间复杂度不会随着元素数量的增加而增加。此外,结果表明,在误报率和查询复杂性方面,ABF优于Bloom Bloom过滤器的现有变体,尤其是在处理流内数据时。

著录项

  • 来源
    《Future generation computer systems》 |2018年第5期|440-449|共10页
  • 作者单位

    Computer Science & Engineering Department, Thapar Institute of Engineering and Technology (Deemed University);

    Computer Science & Engineering Department, Thapar Institute of Engineering and Technology (Deemed University);

    Computer Science & Engineering Department, Thapar Institute of Engineering and Technology (Deemed University);

    Computer Science & Engineering Department, Thapar Institute of Engineering and Technology (Deemed University);

    National Institute of Telecommunications (Inatel),Instituto de Telecomunicações,ITMO University,University of Fortaleza (UNIFOR);

  • 收录信息 美国《科学引文索引》(SCI);美国《工程索引》(EI);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Internet of Things; Big data analytics; Probabilistic data structures; Bloom filter; In-stream data processing;

    机译:物联网;大数据分析;概率数据结构;布隆过滤器;流内数据处理;
  • 入库时间 2022-08-18 02:16:14

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号