首页> 外文会议>IEEE International Conference on Fuzzy Systems >A Fuzzy Multiclass Novelty Detector for Data Streams
【24h】

A Fuzzy Multiclass Novelty Detector for Data Streams

机译:一种用于数据流的模糊多条新颖性探测器

获取原文

摘要

In many real-world applications data arrive continuously, in the form of streams. Such data can be used for the acquisition of knowledge by machine learning methods. In data streams learning, novelty detection is a relevant topic, which aims to identify the emergence of a new concept or a drift in the known concept in real time. Most approaches in the literature that focus on the novelty detection problem, make assumptions that limit the method usefulness. For instance, some methods are designed lying on the supposition that labeled data will be available at some time in the stream, while others restrict the proposed algorithm to one-class problems. Some recent approaches aim to overcome the limitations mentioned, considering multiclass problems and unlabeled datasets. In addition, there are also proposals that explore concepts of fuzzy set theory to add more flexibility to the learning process, although restricted to labeled datasets. In this paper, we propose a fuzzy multiclass novelty detector for data streams called FuzzND, as a fuzzy extension of the MINAS algorithm. Our algorithm generates a model based on fuzzy micro-clusters that provides flexible class boundaries. Allowing the identification of different types of novel information, i.e, novel classes, extension of classes or outliers more efficiently. Experiments show that our approach is promising in dealing with the changes in data streams and presents improvements in comparison to the non-fuzzy version.
机译:在许多现实世界应用中,数据以流的形式连续到达。这些数据可用于通过机器学习方法获取知识。在数据流学习中,新颖性检测是一个相关主题,旨在实时识别新概念的出现或在已知概念中漂移。在文献中的大多数方法,重点关注新颖性的检测问题,使假设限制了该方法的用途。例如,某些方法符合符合标记数据在流中的某个时间可用的假设,而其他方法将限制所提出的算法到一类问题。考虑到多种多组问题和未标记的数据集,一些最近的方法旨在克服提到的限制。此外,还有建议探讨模糊集理论的概念,以增加对学习过程的更多灵活性,尽管仅限于标记的数据集。在本文中,我们提出了一种用于数据流的模糊多条新颖性探测器,称为FUZZND,作为MINAS算法的模糊延伸。我们的算法基于基于模糊的微集群生成模型,提供灵活的类边界。允许识别不同类型的新颖信息,即新颖类别,更有效地扩展课程或异常值。实验表明,我们的方法在处理数据流的变化并与非模糊版本相比,提出改进。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号