【24h】

Nearest neighbor-based instance selection for classification

机译:基于最近邻居的实例选择以进行分类

获取原文
获取原文并翻译 | 示例

摘要

With the increasing size of big data, classifiers usually suffer from intractable computing and storage issues. Moreover, decision boundaries in complex classification problems are usually complicated and circuitous. Modeling on too many instances can sometimes cause oversensitivity to noise and degrade the learning accuracies. Instance selection offers an effective way to improve classification performance based on partial but significant data. This paper presents a novel instance selection algorithm based on nearest enemy information. The dataset is divided into several partitions corresponding to instances' nearest enemies. In every partition, representative instances are selected based on the distribution information to represent both sides of decision boundary. A support vector machine (SVM) is then adopted to conduct the classification model based on these representative instances. Experimental results illustrate that the proposed algorithm outperforms some conventional instance selection methods with higher classification accuracy and smaller size of selected instances.
机译:随着大数据规模的增长,分类器通常会遇到棘手的计算和存储问题。而且,复杂分类问题中的决策边界通常是复杂而circuit回的。在太多实例上建模有时会导致对噪声过度敏感,并降低学习准确性。实例选择提供了一种基于部分但重要的数据来提高分类性能的有效方法。本文提出了一种基于最近敌人信息的新颖实例选择算法。数据集分为几个与实例最近的敌人相对应的分区。在每个分区中,根据分布信息选择代表实例,以代表决策边界的两侧。然后,采用支持向量机(SVM)来基于这些代表性实例进行分类模型。实验结果表明,该算法优于传统的实例选择方法,具有较高的分类精度和较小的实例选择量。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号