首页> 外文会议>International Conference on Hybrid Information Technology >Data Mining Exploration System for Feature Selection Tasks
【24h】

Data Mining Exploration System for Feature Selection Tasks

机译:数据挖掘探索系统,用于特征选择任务

获取原文
获取外文期刊封面目录资料

摘要

The area of data mining and knowledge discovery is inherently associated with databases. Data mining methods are used in the process of knowledge discovery to reveal new pieces of knowledge from large databases. One of the stages in that process is a feature selection. A feature selection is usually meant as a process of finding a subset of features from the original set of features forming patterns in a given data set, optimal according to the defined goal and criterion of feature selection. The aim of this paper is to present the main functionalities of the Data Mining Exploration System (DMES) [4] and to explain its usefulness to the feature selection tasks. The DMES is an integrated software system that incorporates many algorithms which can be used in data mining. It is currently being developed in the University of Rzeszow. We describe in short its recent version (1.2) and the algorithms that had already been implemented. The DMES allows to visualize, split, preprocess, analyze, classify and reduce decision tables. To show one of the possibilities provided by the DMES system, we used it for the feature selection [3], [7], [8], [10], [12] task. Feature selection is performed by the most recent addition to the DMES system which are RBFS1, RBFS2 and ARS algorithms [14]. These feature selection methods are designed mainly for the multiple classifiers systems with homogeneous classifiers [2], [5], [6]. Homogeneous classifiers require many different subsets of the data set. The problem of finding the best subsets of a given feature set is exponentially complex. We use RBFS (Reduct Based Feature Selection) algorithm [15] and its two modifications called RBFS1 and RBFS2 to select optimal subsets of the feature set for multiple classifiers. RBFS algorithms are quite complex computationally because they use all decision-relative reducts [11] of a given decision table.
机译:数据挖掘和知识发现的区域本质上与数据库相关联。数据挖掘方法用于知识发现过程,以揭示大型数据库的新知识。该过程中的一个阶段是特征选择。特征选择通常是指在给定数据集中从原始特征集形成图案的特征的子集的过程,根据特征选择的定义目标和标准。本文的目的是介绍数据挖掘勘探系统(DMES)[4]的主要功能,并向特征选择任务解释其有用性。 DMES是一个集成的软件系统,它包含了许多可用于数据挖掘的算法。它目前正在Rzeszow大学开发。我们在最近的近期版本(1.2)和已经实施的算法中描述。 DME允许可视化,拆分,预处理,分析,分类和减少决策表。要显示DMES系统提供的可能性之一,我们将其用于特征选择[3],[7],[8],[10],[12]任务。特征选择是通过最新的DMES系统的添加,即RBFS1,RBFS2和ARS算法[14]。这些特征选择方法主要针对具有均相分类器的多个分类器系统[2],[5],[6]。同质分类器需要数据集的许多不同子集。找到给定功能集的最佳子集的问题是指数复杂的。我们使用RBFS(基于的特征选择)算法[15]及其两个名为RBFS1和RBFS2的两个修改,以选择用于多个分类器的功能集的最佳子集。 RBFS算法计算得非常复杂,因为它们使用给定决策表的所有决策相对减少[11]。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号