【24h】

Outlier detection in data streams using MCOD algorithm

机译:使用MCOD算法检测数据流中的异常值

获取原文

摘要

Data mining is one of the most exciting fields of research for a researcher. In data mining, outlier detection is one of the important area where similar kind of data objects are grouped together and the objects that does not belong to the group are termed as outliers. This helps in finding objects that have different behavior with respect to other objects. Due to the presence of outliers overall nature of the data may be compromised. So it is a challenging task to find outliers present in the data. Every day huge amount data is flowing around us which belong to different streams, so our main is to find the objects that does not belong to the particular stream. In this paper, different outlier detection algorithms are described and implemented and the best algorithm among them is found based on their performance with the help of MOA tool. Performance issues like memory consumption, domain queries, time are shown. MOA tool contains prescribed algorithms where one can be used as a base algorithm to compare remaining algorithms. Each algorithm is an increasing and adaptive to concept extension. Finally the performance of each algorithm is tabled.
机译:数据挖掘是研究人员最令人兴奋的研究领域之一。在数据挖掘中,离群值检测是将相似类型的数据对象分组在一起并且不属于该组的对象称为离群值的重要领域之一。这有助于找到相对于其他对象具有不同行为的对象。由于存在异常值,因此可能会损害数据的整体性质。因此,找到数据中存在的异常值是一项艰巨的任务。每天都有大量数据流向我们,这些数据属于不同的流,因此我们的主要目的是查找不属于特定流的对象。本文描述并实现了不同的离群值检测算法,并借助MOA工具根据其性能找到了最佳算法。显示了诸如内存消耗,域查询,时间之类的性能问题。 MOA工具包含规定的算法,可以将其中一种算法用作比较其余算法的基础算法。每种算法都是一种不断增长的且适应于概念扩展的算法。最后列出了每种算法的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号