Decision tree rule-based feature selection for large-scale imbalanced data

Abstract

The class imbalance problem appears in many real-world applications, e.g., fault diagnosis, text categorization, and fraud detection. When dealing with a large-scale imbalanced dataset, feature selection becomes a great challenge. To address it, this work proposes a feature selection approach based on a decision tree rule. The effectiveness of the proposed approach is verified by classifying a large-scale dataset from Santander Bank. The results show that the approach achieves a higher Area Under the Curve (AUC) with less computational time. We also compare it with filter-based feature selection approaches, i.e., Chi-Square and F-statistic. The results show that it outperforms both filters but requires slightly more computational effort.
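
The abstract does not spell out the decision tree rule itself, so the following is only a minimal sketch of the general idea under stated assumptions, not the authors' method. It assumes that each feature is scored by the largest Gini impurity decrease that a single split rule `feature <= t` can achieve (a depth-one decision tree), and that the top-scoring features are kept. The data are synthetic, and the names `auc`, `gini`, `best_split_gain`, and `select_features` are hypothetical helpers introduced for illustration.

```python
import random

def auc(scores, labels):
    """Area under the ROC curve: P(positive scores above negative), ties count half."""
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))

def gini(n_pos, n):
    """Gini impurity of a node holding n samples, n_pos of them positive."""
    p = n_pos / n
    return 2.0 * p * (1.0 - p)

def best_split_gain(column, labels):
    """Largest Gini impurity decrease achievable by one rule `feature <= t`."""
    pairs = sorted(zip(column, labels))
    n, total_pos = len(pairs), sum(labels)
    parent = gini(total_pos, n)
    best, left_pos = 0.0, 0
    for i in range(1, n):
        left_pos += pairs[i - 1][1]
        if pairs[i - 1][0] == pairs[i][0]:
            continue  # no threshold separates equal values
        children = (i * gini(left_pos, i)
                    + (n - i) * gini(total_pos - left_pos, n - i)) / n
        best = max(best, parent - children)
    return best

def select_features(X, y, k):
    """Indices of the k features with the highest single-split Gini gain."""
    gains = [best_split_gain([row[j] for row in X], y) for j in range(len(X[0]))]
    return sorted(range(len(gains)), key=lambda j: gains[j], reverse=True)[:k]

# Synthetic imbalanced data (assumption): 100 positives vs. 1900 negatives,
# 6 features, of which only features 0 and 1 carry signal.
rng = random.Random(0)
y = [1] * 100 + [0] * 1900
X = [[rng.gauss(2.0 * label if j < 2 else 0.0, 1.0) for j in range(6)] for label in y]

selected = select_features(X, y, k=2)
print("selected features:", sorted(selected))
print("AUC of feature 0 used alone as a score:", round(auc([row[0] for row in X], y), 3))
```

AUC is used here because, unlike accuracy, it does not reward a classifier for simply predicting the majority class. A fuller comparison along the lines of the abstract would train a classifier on the selected features and repeat the evaluation with Chi-Square and F-statistic filters.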

Bibliographic details

Source: Wireless and Optical Communication Conference | 2017 | 1 v. | 6 pages
Authors: Haoyue Liu; MengChu Zhou
Original format: PDF
Chinese Library Classification: Communications
Keywords: Decision trees; Classification algorithms; Support vector machines; Indexes; Testing; Correlation; Text categorization

