首页> 外文期刊>Journal of Intelligent Learning Systems and Applications >Training with Input Selection and Testing (TWIST) Algorithm: A Significant Advance in Pattern Recognition Performance of Machine Learning
【24h】

Training with Input Selection and Testing (TWIST) Algorithm: A Significant Advance in Pattern Recognition Performance of Machine Learning

机译:使用输入选择和测试(TWIST)算法进行训练:机器学习模式识别性能的重大进步

获取原文
           

摘要

This article shows the efficacy of TWIST, a methodology for the design of training and testing data subsets extracted from given dataset associated with a problem to be solved via ANNs. The methodology we present is embedded in algorithms and actualized in computer software. Our methodology as implemented in software is compared to the current standard methods of random cross validation: 10-Fold CV, random split into two subsets and the more advanced T&T. For each strategy, 13 learning machines, representing different families of the main algorithms, have been trained and tested. All algorithms were implemented using the well-known WEKA software package. On one hand a falsification test with randomly distributed dependent variable has been used to show how T&T and TWIST behaves as the other two strategies: when there is no information available on the datasets they are equivalent. On the other hand, using the real Statlog (Heart) dataset, a strong difference in accuracy is experimentally proved. Our results show that TWIST is superior to current methods. Pairs of subsets with similar probability density functions are generated, without coding noise, according to an optimal strategy that extracts the most useful information for pattern classification.
机译:本文展示了TWIST的功效,TWIST是一种设计方法,用于设计训练和测试从给定数据集中提取的与要通过ANN解决的问题相关的数据子集。我们提出的方法已嵌入算法中,并已在计算机软件中实现。我们将在软件中实施的方法与当前的随机交叉验证标准方法进行了比较:10折CV,随机分为两个子集和更高级的T&T。对于每种策略,已经培训和测试了代表不同主要算法家族的13种学习机。所有算法均使用著名的WEKA软件包实施。一方面,已经使用具有随机分布因变量的伪造测试来显示T&T和TWIST如何像其他两种策略一样表现:当数据集上没有可用信息时,它们是等效的。另一方面,使用真实的Statlog(Heart)数据集,通过实验证明了准确性的巨大差异。我们的结果表明,TWIST优于当前方法。根据提取用于模式分类的最有用信息的最佳策略,生成具有相似概率密度函数的子集对,而无需编码噪声。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号