【24h】

Simultaneous Feature Selection and Classification Using Fuzzy Rules

机译:同时使用模糊规则进行特征选择和分类

获取原文
获取原文并翻译 | 示例

摘要

Classification of data plays a key role in the present conditions as most of the real world applications deal with large volumes of data. The problem of classification is defined as assigning a class to each data object in a way that is consistent with some observed data i.e., the Features, which we have about the problem. In reality these Features are of high dimensionality and designing a classifier for such data lead to more computational overhead thereby degrading the performance of the classifier and hence that leads to a problem called Curse of Dimensionality. In order to solve such problem different methods of data reduction have been used and managed to eliminate the redundancy and non-important features present in the data sets. Among them feature selection is a powerful approach of dealing with high dimensional data by selecting relevant features from data set. A rule-based system that is used to solve such problem should be designed such a way that the rules are generated from the Features that are extracted from the large volumes of data. The system can be designed by using a concept called Fuzzy logic. One of the main attractions of a fuzzy rule-based system is its interpretability which stops the increase in the dimensionality of the data. For high-dimensional data, the identification of fuzzy rules is also a big challenge. This work has described a flexible Feature Selection method based on Modulator Learning Algorithm and shows that the Modulator learning algorithm is capable of identifying better-quality feature subsets for most data sets than correlation feature selection (cfs) subset evaluator and a comparative study was carried using K-nearest neighbor, J48 and few existing classifiers of Weka. The effectiveness of the proposed method is demonstrated by carrying out experimental studies on benchmark datasets from the UCIML repository and one synthetic dataset.
机译:数据分类在当前条件下起着关键作用,因为大多数现实世界中的应用程序都处理大量数据。分类问题定义为以与某些观察到的数据(即特征)一致的方式为每个数据对象分配一个类别。实际上,这些特征具有较高的维数,并且针对此类数据设计分类器会导致更多的计算开销,从而降低分类器的性能,从而导致称为维数诅咒的问题。为了解决这样的问题,已经使用和设法使用不同的数据缩减方法来消除数据集中存在的冗余和非重要特征。其中,特征选择是一种通过从数据集中选择相关特征来处理高维数据的有效方法。应该设计一种用于解决此类问题的基于规则的系统,以便从大量数据中提取的要素中生成规则。可以使用称为模糊逻辑的概念来设计系统。基于模糊规则的系统的主要吸引力之一是其可解释性,可解释性阻止了数据维数的增加。对于高维数据,模糊规则的识别也是一个很大的挑战。这项工作描述了一种基于调制器学习算法的灵活的特征选择方法,并表明与相关特征选择(cfs)子集评估器相比,调制器学习算法能够为大多数数据集识别质量更好的特征子集,并进行了比较研究。 K近邻,J48和Weka现有的几个分类器。通过对UCIML存储库中的基准数据集和一个综合数据集进行实验研究,证明了该方法的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号