【24h】

A Theoretical Study of Anomaly Detection in Big Data Distributed Static and Stream Analytics

机译:大数据分布式静态和流分析中异常检测的理论研究

获取原文

摘要

A high volume of data including log records, sensors, monitoring systems, manufacturing processes, call detail records, blogs, emails, and social media streams are generated around the clock by diverse applications. Thus, as the volume of data is growing rapidly, detecting anomaly from high volume big data becomes a critical and difficult task, due to the theatrical (research) and practical (technical) limitations. This paper aims to investigate anomaly detection and provide global understanding of anomaly concepts in the big data mining perspective. In this paper we demonstrate how existing methods of anomaly detection can be adopted with high volumes of data, specifically providing in depth understanding of the anomaly concept in streaming data. The key contribution of this study is an attempt to answer the following questions: 1) What is the concept of big data and what are big data analytic approaches? 2) What is the relationship between big data and anomaly detection? 3) What is the main characteristic of anomaly in big batch and streaming data? 4) What is the appropriate state of the art infrastructure to process and detect large-scale batch and streaming data?.
机译:各种各样的应用程序全天候生成大量数据,包括日志记录,传感器,监视系统,制造过程,呼叫详细记录,博客,电子邮件和社交媒体流。因此,随着数据量的快速增长,由于剧院(研究)和实际(技术)的局限性,从大量大数据中检测异常成为一项艰巨而艰巨的任务。本文旨在研究异常检测,并从大数据挖掘的角度提供对异常概念的全局理解。在本文中,我们演示了如何对大量数据采用现有的异常检测方法,特别是深入了解流数据中的异常概念。这项研究的主要贡献是试图回答以下问题:1)大数据的概念是什么,什么是大数据分析方法? 2)大数据与异常检测之间有什么关系? 3)大批量和流数据中异常的主要特征是什么? 4)处理和检测大规模批处理和流数据的最先进的基础架构是什么?

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号