首页> 外文会议>International Conference on Soft Computing and Data Mining >A Comparison of Weighted Support Vector Machine (WSVM), One-Step WSVM (OWSVM) and Iteratively WSVM (IWSVM) for Mislabeled Data
【24h】

A Comparison of Weighted Support Vector Machine (WSVM), One-Step WSVM (OWSVM) and Iteratively WSVM (IWSVM) for Mislabeled Data

机译:加权支持向量机(WSVM),单步WSVM(OWSVM)和迭代WSVM(IWSVM)的比较,用于误标记数据

获取原文

摘要

Labeling error can occur for various reasons such as the subjective nature of the labeling task, the lack of information to determine the true label of a given example and data entry error. Labeling errors were categorized as mislabeled, unlabeled, partially labeled, incompletely labeled and illegible label. In this study, the focus will be on mislabeled data. The problem of dealing with mislabeled data and in particular of constructing a classifier from such data has been approached from a number of different directions. Therefore, developing learning algorithms that effectively and efficiently deal with mislabeled data is a great practical importance and key aspect in machine learning. Support Vector Machine (SVM) has been widely accepted to be one of the most effective techniques in machine learning algorithms. One of the main drawbacks of SVM is it depends on only a small part of the data points (support vectors) and it treats all training data of a given class equally. To address this problem, one of the solution is the Weighted Support Vector Machines (WSVM). Wu & Liu proposed two different WSVM namely one-step WSVM (OWSVM) and iteratively WSVM (IWSVM). In this paper, a comparison of Weighted Support Vector Machine (WSVM), One-step WSVM (OWSVM) and Iteratively WSVM (IWSVM) for mislabeled data has been done to see the classification accuracy of each of the method. The three methods were compared based on correctly labeled, mislabeled data, data within margin, mislabeled data within margin and classification accuracy for eight KEEL repository datasets using 20% noise in training data. Based on the experimental results, the performance of OWSVM is better than both WSVM and IWSVM based on the correctly labeled, mislabeled data, data within margin, mislabeled data within margin and classification accuracy.
机译:标注错误可能因各种原因发生,如标签制作任务的主观性,缺乏信息,以确定一个给定的例子和数据录入错误的正确标签。被归类标签错误的错误标记,未标记,标记部分,不完全标记和标签字迹模糊。在这项研究中,重点将是贴错标签的数据。处理错误标记的数据,特别是构建从这种数据中的分类器的的问题已经从许多不同的方向的接近。因此,开发学习算法,有效地处理数据贴错标签是机器学习的一个重要的现实意义和重要方面。支持向量机(SVM)已被广泛接受是在机器的最有效的技术学习算法之一。一个SVM的主要缺点是它仅仅依赖于数据点(支持向量)的一小部分,它平等地对待给定类的所有训练数据。为了解决这个问题,该解决方案中的一个是所述加权支持向量机(WSVM)。吴和刘提出了两种不同的WSVM即一步法WSVM(OWSVM)和迭代WSVM(IWSVM)。在本文中,加权支持向量机(WSVM)的比较,为贴错标签数据一步法WSVM(OWSVM)并且迭代WSVM(IWSVM)已经完成,见各方法的分类精度。这三种方法基于正确地标注,贴错标签数据,裕度内的数据,余量和分类精度之内的贴错标签数据用于训练数据,用20%的噪声8个KEEL存储库的数据集进行比较。基于该实验结果,OWSVM的性能比既WSVM和IWSVM更好基于该正确地标注,贴错标签数据,裕度内的数据,余量和分类精度之内的错误标记的数据。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号