首页> 外文会议>International Conference on Data Mining >An Approach for learning from small and unbalanced data sets using Gaussian noise during artificial neural network training
【24h】

An Approach for learning from small and unbalanced data sets using Gaussian noise during artificial neural network training

机译:在人工神经网络训练期间使用高斯噪声了解小型和不平衡数据集的方法

获取原文

摘要

Classifiers have been widely applied to scientific and industrial domains, in general, attaining good performance. However, when applied to problems with small and unbalanced data sets, most classifiers have much worse performance, mainly due to the lack of learning ability to cope with these data constraints. The small size of the data set reduces the generalization power of the classifier and the unbalance in class representation makes the classifiers favor the larger and less important classes. Inherent to several real-world problems, small and unbalanced data sets are the reality to be tackled by learning algorithms for producing accurate and reliable classifiers to solve these problems. This paper proposes an approach to address these data constraints based on the addition of Gaussian noise to the input patterns' variables during the training process. The paper focuses on this noise addition to the multi layer perceptron (MLP) neural network (NN). Experimental results indicate that the proposed approach achieves a performance statistically better than the traditional methods (95% confidence) together with lower variability.
机译:分类器已被广泛应用于科学和工业领域,一般来说,达到了良好的性能。但是,当应用于小型和不平衡数据集的问题时,大多数分类器的性能都有更差的性能,主要是由于缺乏应对这些数据约束的学习能力。数据集的小尺寸降低了分类器的泛化功率,而类表示中的不平衡使得分类器利用更大且不太重要的类。到几个真实世界问题,小而不平衡的数据集是通过学习算法来解决的现实,用于制作准确和可靠的分类器来解决这些问题。本文提出了一种方法,可以根据在训练过程中添加高斯噪声对输入模式的变量来解决这些数据约束的方法。该纸张专注于多层Perceptron(MLP)神经网络(NN)的这种噪声。实验结果表明,该拟议方法统计上的性能比传统方法(95%的置信度)在一起,与较低的可变性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号