首页> 外文会议>2014 International Conference on Computer and Communication Technologies >An empirical experimental evaluation on imbalanced data sets with varied imbalance ratio
【24h】

An empirical experimental evaluation on imbalanced data sets with varied imbalance ratio

机译:具有不平衡比变化的不平衡数据集的经验实验评估

获取原文
获取原文并翻译 | 示例

摘要

Class imbalance presents a problem when traditional Classification algorithms are applied .In the previous years there are most important substitution and change has been carried out on data classification. Classification of data becomes difficult because of its unbalanced nature. The problem of imbalance class has developed into significant data mining issue. The class imbalance situation arises when one class is rare compared to the other, take place frequently in machine learning applications. Dataset of unbalanced learning is a new concept of machine learning which has applicability in real time, since all the datasets of real time are of unbalanced in nature. Researchers have rigorously studied several techniques to alleviate the problem of class imbalance, including resampling algorithms, ensemble learning and algorithmic modification for transforming vast amounts of skewed data efficiently into information and knowledge representation. In this paper, we conducted an empirical study on imbalance datasets. Experimental Results shows conclusion of some findings using Area Under Curve (AUC), precision, F-Measure, TN-rate TP-rate evaluation metrics.
机译:当应用传统的分类算法时,类的不平衡成为一个问题。近年来,最重要的替代方法是数据分类。数据的分类由于其不平衡的性质而变得困难。不平衡类的问题已发展成为重要的数据挖掘问题。当一个班级比另一个班级少见时,就会出现班级不平衡的情况,这种情况经常发生在机器学习应用程序中。不平衡学习的数据集是机器学习的新概念,具有实时性,因为所有实时数据集本质上都是不平衡的。研究人员已经严格研究了几种缓解类不平衡问题的技术,包括重采样算法,集成学习和算法修改,这些算法可将大量偏斜的数据有效地转换为信息和知识表示。在本文中,我们对不平衡数据集进行了实证研究。实验结果显示了使用曲线下面积(AUC),精度,F度量,TN速率TP速率评估指标得出的一些发现的结论。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号