【24h】

Big Data Quality: A Survey

机译:大数据质量:调查

获取原文

摘要

With the advances in communication technologies and the high amount of data generated, collected, and stored, it becomes crucial to manage the quality of this data deluge in an efficient and cost-effective way. The storage, processing, privacy and analytics are the main keys challenging aspects of Big Data that require quality evaluation and monitoring. Quality has been recognized by the Big Data community as an essential facet of its maturity. Yet, it is a crucial practice that should be implemented at the earlier stages of its lifecycle and progressively applied across the other key processes. The earlier we incorporate quality the full benefit we can get from insights. In this paper, we first identify the key challenges that necessitates quality evaluation. We then survey, classify and discuss the most recent work on Big Data management. Consequently, we propose an across-the-board quality management framework describing the key quality evaluation practices to be conducted through the different Big Data stages. The framework can be used to leverage the quality management and to provide a roadmap for Data scientists to better understand quality practices and highlight the importance of managing the quality. We finally, conclude the paper and point to some future research directions on quality of Big Data.
机译:随着通信技术的进步和生成的大量数据,收集和存储,以高效且经济高效的方式管理该数据洪水的质量变得至关重要。存储,处理,隐私和分析是主要键具有挑战性的大数据方面,需要质量评估和监控。大数据社区的质量得到了认可,作为其成熟度的基本方面。然而,这是一个关键的实践,应该在其生命周期的早期阶段实施,并逐步应用于其他关键过程。我们曾提前纳入质量,我们可以从洞察力中获得的全部好处。在本文中,我们首先确定了需要质量评估的关键挑战。然后,我们调查,分类和讨论大数据管理的最新工作。因此,我们提出了一种跨电路板质量管理框架,描述了通过不同大数据阶段进行的关键质量评估实践。该框架可用于利用质量管理,并为数据科学家提供路线图,以更好地了解质量实践,并突出管理质量的重要性。我们终于结束了论文并指出了一些关于大数据质量的未来研究方向。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号