首页> 外文会议>Pacific-Asia conference on knowledge discovery and data mining >Multiple Distribution Data Description Learning Algorithm for Novelty Detection
【24h】

Multiple Distribution Data Description Learning Algorithm for Novelty Detection

机译:多分布数据描述新型检测算法

获取原文

摘要

Current data description learning methods for novelty detection such as support vector data description and small sphere with large margin construct a spherically shaped boundary around a normal data set to separate this set from abnormal data. The volume of this sphere is minimized to reduce the chance of accepting abnormal data. However those learning methods do not guarantee that the single spherically shaped boundary can best describe the normal data set if there exist some distinctive data distributions in this set. We propose in this paper a new data description learning method that constructs a set of spherically shaped boundaries to provide a better data description to the normal data set. An optimisation problem is proposed and solving this problem results in an iterative learning algorithm to determine the set of spherically shaped boundaries. We prove that the classification error will be reduced after each iteration in our learning method. Experimental results on 28 well-known data sets show that the proposed method provides lower classification error rates.
机译:目前的数据描述新颖性检测的学习方法,例如支持向量数据描述和具有大边缘的小球构成正常数据集周围的球形边界,以将该集合与异常数据分开。该球体的体积最小化以减少接受异常数据的可能性。然而,这些学习方法不能保证单个球形边界可以最好地描述如果存在该组中的一些独特的数据分布,则可以最好地描述正常数据集。我们在本文中提出了一种新的数据描述学习方法,其构造一组球形边界,以提供更好的数据描述到正常数据集。提出并解决该问题的优化问题导致迭代学习算法来确定球形边界集。我们证明,在我们的学习方法中,在每次迭代后将减少分类错误。 28个众所周知的数据集的实验结果表明,该方法提供了较低的分类误差率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号