首页> 外文期刊>International journal of intelligent systems in accounting, finance & management >ASSESSING THE PREDICTIVE PERFORMANCE OF ARTIFICIAL NEURAL NETWORK-BASED CLASSIFIERS BASED ON DIFFERENT DATA PREPROCESSING METHODS, DISTRIBUTIONS AND TRAINING MECHANISMS
【24h】

ASSESSING THE PREDICTIVE PERFORMANCE OF ARTIFICIAL NEURAL NETWORK-BASED CLASSIFIERS BASED ON DIFFERENT DATA PREPROCESSING METHODS, DISTRIBUTIONS AND TRAINING MECHANISMS

机译:基于不同数据预处理方法,分布和训练机制的基于人工神经网络分类器的预测性能评估

获取原文
获取原文并翻译 | 示例
       

摘要

We analyse the implications of three different factors (preprocessing method, data distribution and training mechanism) on the classification performance of artificial neural networks (ANNs). We use three preprocessing approaches: no preprocessing, division by the maximum absolute values and normalization. We study the implications of input data distributions by using five datasets with different distributions: the real data, uniform, normal, logistic and Laplace distributions. We test two training mechanisms: one belonging to the gradient-descent techniques, improved by a retraining procedure, and the other is a genetic algorithm (GA), which is based on the principles of natural evolution. The results show statistically significant influences of all individual and combined factors on both training and testing performances. A major difference with other related studies is the fact that for both training mechanisms we train the network using as starting solution the one obtained when constructing the network architecture. In other words we use a hybrid approach by refining a previously obtained solution. We found that when the starting solution has relatively low accuracy rates (80-90%) the GA clearly outperformed the retraining procedure, whereas the difference was smaller to non-existent when the starting solution had relatively high accuracy rates (95-98%). As reported in other studies, we found little to no evidence of crossover operator influence on the GA performance.
机译:我们分析了三种不同因素(预处理方法,数据分布和训练机制)对人工神经网络(ANN)分类性能的影响。我们使用三种预处理方法:不进行预处理,除以最大绝对值和标准化。我们通过使用五个具有不同分布的数据集来研究输入数据分布的含义:真实数据,均匀分布,正态分布,逻辑分布和拉普拉斯分布。我们测试了两种训练机制:一种属于梯度下降技术,通过重新训练程序得到了改进,另一种是基于自然进化原理的遗传算法(GA)。结果显示,所有个体因素和综合因素对训练和测试成绩的统计学影响显着。与其他相关研究的主要区别在于以下事实:对于这两种训练机制,我们都使用构建网络体系结构时获得的一种作为初始解决方案来训练网络。换句话说,我们通过改进先前获得的解决方案来使用混合方法。我们发现,当起始解决方案的准确率相对较低(80-90%)时,GA明显优于再培训程序,而当起始解决方案的准确率较高(95-98%)时,差异较小甚至不存在。正如其他研究报道的那样,我们发现很少或没有证据表明交叉算子会影响GA的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号