...
首页> 外文期刊>Procedia Computer Science >Standard vs. non-standard cross-validation: evaluation of performance in a space with structured distribution of datapoints
【24h】

Standard vs. non-standard cross-validation: evaluation of performance in a space with structured distribution of datapoints

机译:标准与非标准交叉验证:评估具有DataPoints的结构化分布的空间中的性能

获取原文
   

获取外文期刊封面封底 >>

       

摘要

Cross-validation is a popularly used approach to evaluation of performance for classifiers. It relies on random selection of independent samples for training and testing, and assumes that if any similarities among samples exist, they do not lead to known grouping of datapoints in the input space. If these conditions are violated, as it may happen for datasets with some structure of samples included, standard cross-validation can return biased results even for many folds. In the paper the research on cross-validation was reported for application to stylometric datasets, describing a task of authorship attribution. The comparison of standard and non-standard processing was presented. In the latter case, selected subsets of examples were swapped over between training and test sets several times. The experiments with three popular classifiers showed that standard cross-validation tended to give over-optimistic results, whereas non-standard processing was more guarded, and by that more reliable. To avoid high computational costs involved, evaluation based on averaged predictions for limited numbers of test sets can be considered as a reasonable compromise.
机译:交叉验证是一种普遍使用的评估分类器性能的方法。它依赖于随机选择独立样本进行培训和测试,并假设如果存在样本之间的任何相似性,则它们不会导致已知输入空间中的数据点分组。如果违反了这些条件,则可能发生具有一些样本结构的数据集,即使许多折叠也可以返回偏置结果的标准交叉验证。在论文中,据报道了对唱片统计数据集的应用程序,描述了作者归因的任务。提出了标准和非标准加工的比较。在后一种情况下,在训练和测试组之间交换了所选子集几次。三种流行分类器的实验表明,标准交叉验证趋于过度乐观的结果,而非标准加工更加守卫,并且通过更可靠。为避免涉及的高计算成本,基于有限数量的测试集的平均预测的评估可以被视为合理的折衷。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号