首页> 外文会议>3rd international workshop on knowledge discovery from sensor data 2009 >OcVFDT: One-class Very Fast Decision Tree for One-class Classification of Data Streams
【24h】

OcVFDT: One-class Very Fast Decision Tree for One-class Classification of Data Streams

机译:OcVFDT:一类非常快速的决策树,用于数据流的一类分类

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

Current research on data stream classification mainly focuses on supervised learning, in which a fully labeled data stream is needed for training. However, fully labeled data streams are expensive to obtain, which make the supervised learning approach difficult to be applied to real-life applications. In this paper, we model applications, such as credit fraud detection and intrusion detection, as a one-class data stream classification problem. The cost of fully labeling the data stream is reduced as users only need to provide some positive samples together with the unlabeled samples to the learner. Based on VFDT and POSC4.5, we propose our OcVFDT (One-class Very Fast Decision Tree) algorithm. Experimental study on both synthetic and real-life datasets shows that the OcVFDT has excellent classification performance. Even 80% of the samples in data stream are unlabeled, the classification performance of OcVFDT is still very close to that of VFDT, which is trained on fully labeled stream.
机译:当前对数据流分类的研究主要集中在有监督的学习上,其中需要完全标记的数据流进行训练。但是,获得完全标记的数据流非常昂贵,这使得监督学习方法很难应用于现实生活中。在本文中,我们将诸如信用欺诈检测和入侵检测之类的应用程序建模为一类数据流分类问题。完全标记数据流的成本降低了,因为用户只需要向学习者提供一些肯定的样本以及未标记的样本即可。基于VFDT和POSC4.5,我们提出了OcVFDT(一类超快速决策树)算法。对合成和真实数据集的实验研究表明,OcVFDT具有出色的分类性能。数据流中甚至80%的样本都没有标记,OcVFDT的分类性能仍然非常接近VFDT,后者是在完全标记的流上训练的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号