Fuzzy Analysis and Classification of Mislabeled and Noisy Data

机译：贴错标签的数据的模糊分析和分类

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

A classifier learns from a "training" data set so it can later correctly classify a new pattern from the same population as the training set. However, when the examples for a learning algorithm consist of real world data then they are usually tainted with noise, ambiguity, uncertainty, imprecision, vagueness or incompleteness. Noise may be introduced by outliers; they are the result of some bad measurements or pattern mislabeling. Clearly, classification of such noisy data must be highly efficient and accurate. In this paper, we overcome this problem by introducing an efficient tool for feature selection where "bad" (non-discriminating) features are dropped and "good" features are weighted according to how well they separate classes in a data set. Good features are responsible for "electing" a class that the feature vector under test should naturally belong to. Thus, we call our new method "EFCLASS" denoting Election Fuzzy Classification. The proposed method is simple, fast and accurate. Various data sets that are known to be good examples for a classification algorithm are used to test the performance of the proposed method for the fuzzy classifier.

机译：分类器从“训练”数据集中学习，以便稍后可以从与训练集相同的总体中正确分类新模式。但是，当学习算法的示例包含现实世界的数据时，它们通常会被噪声，歧义，不确定性，不精确性，模糊性或不完整性所污染。离群值可能会引入噪声;它们是某些不良测量结果或图案错误贴标签的结果。显然，此类噪声数据的分类必须高效且准确。在本文中，我们通过引入一种有效的特征选择工具来克服此问题，在该工具中，将“不良”（无区别）特征丢弃，并根据“好”特征对数据集中的类进行区分的程度进行加权。好的特征负责“选择”被测特征向量自然应属于的类。因此，我们将表示选举模糊分类的新方法称为“ EFCLASS”。该方法简单，快速，准确。已知作为分类算法很好例子的各种数据集可用于测试所提出的模糊分类器方法的性能。

著录项

来源
《Computer Applications in Industry and Engineering》|2001年|p.146-150|共5页
会议地点
作者
Tony M. Abouhaidar; Carl G. Looney;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类自动化技术、计算机技术;
关键词
pattern recognition; classification; fuzzy logic; feature selection; feature weighting;

机译：模式识别;分类;模糊逻辑;特征选择;特征权重;

相似文献

外文文献
中文文献
专利

1. A Vectorization-Optimization-Method-Based Type-2 Fuzzy Neural Network for Noisy Data Classification [J] . Wu G.-D., Huang P.-H. Fuzzy Systems, IEEE Transactions on . 2013,第1期

机译：基于矢量化优化方法的2类模糊神经网络的噪声数据分类
2. A robust multi-class AdaBoost algorithm for mislabeled noisy data [J] . Sun Bo, Chen Songcan, Wang Jiandong, Knowledge-Based Systems . 2016,第juna15期

机译：健壮的多类AdaBoost算法，用于标签错误的嘈杂数据
3. An intuitionistic fuzzy set based (SVM)-V-3 model for binary classification with mislabeled information [J] . Tian Ye, Deng Zhibin, Luo Jian, Fuzzy Optimization and Decision Making: A Journal of Modeling and Computation Under Uncertainty . 2018,第4期

机译：基于（SVM）-V-3模型的直觉模糊集，用于二进制分类，误标记信息
4. Fuzzy Analysis and Classification of Mislabeled and Noisy Data [C] . Tony M. Abouhaidar, Carl G. Looney International conference on computer applications in industry and engineering . 2001

机译：模糊分析与误标标签和嘈杂数据的分类
5. Robust mote-scale classification of noisy data via machine learning [D] . He, Jin. 2015

机译：通过机器学习对噪声数据进行鲁棒的微粒级分类
6. Building Diversified Multiple Trees for classification in high dimensional noisy biomedical data [O] . Jiuyong Li, Lin Liu, Jixue Liu, 2017

机译：构建多棵多棵树以在高维嘈杂的生物医学数据中进行分类
7. Feature Analysis, Evaluation and Comparisons of Classification Algorithms Based on Noisy Intrusion Dataset [O] . Hussain Jamal, Lalmuanawma Samuel 2016

机译：基于噪声入侵数据集的分类算法的特征分析，评估与比较
8. Fuzzy Logic Multisensor Association Algorithm: Multiple Emitters, Computational Complexity, and Noisy Data [R] . Smith, J. F. 1999

机译：模糊逻辑多传感器关联算法：多发射器，计算复杂度和噪声数据

Fuzzy Analysis and Classification of Mislabeled and Noisy Data

摘要

著录项

相似文献

相关主题

期刊订阅