Standard vs. non-standard cross-validation: evaluation of performance in a space with structured distribution of datapoints

Grzegorz Baron; Urszula Stańczyk

首页> 外文期刊>Procedia Computer Science >Standard vs. non-standard cross-validation: evaluation of performance in a space with structured distribution of datapoints

【24h】

Standard vs. non-standard cross-validation: evaluation of performance in a space with structured distribution of datapoints

机译：标准与非标准交叉验证：评估具有DataPoints的结构化分布的空间中的性能

获取原文

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Cross-validation is a popularly used approach to evaluation of performance for classifiers. It relies on random selection of independent samples for training and testing, and assumes that if any similarities among samples exist, they do not lead to known grouping of datapoints in the input space. If these conditions are violated, as it may happen for datasets with some structure of samples included, standard cross-validation can return biased results even for many folds. In the paper the research on cross-validation was reported for application to stylometric datasets, describing a task of authorship attribution. The comparison of standard and non-standard processing was presented. In the latter case, selected subsets of examples were swapped over between training and test sets several times. The experiments with three popular classifiers showed that standard cross-validation tended to give over-optimistic results, whereas non-standard processing was more guarded, and by that more reliable. To avoid high computational costs involved, evaluation based on averaged predictions for limited numbers of test sets can be considered as a reasonable compromise.

机译：交叉验证是一种普遍使用的评估分类器性能的方法。它依赖于随机选择独立样本进行培训和测试，并假设如果存在样本之间的任何相似性，则它们不会导致已知输入空间中的数据点分组。如果违反了这些条件，则可能发生具有一些样本结构的数据集，即使许多折叠也可以返回偏置结果的标准交叉验证。在论文中，据报道了对唱片统计数据集的应用程序，描述了作者归因的任务。提出了标准和非标准加工的比较。在后一种情况下，在训练和测试组之间交换了所选子集几次。三种流行分类器的实验表明，标准交叉验证趋于过度乐观的结果，而非标准加工更加守卫，并且通过更可靠。为避免涉及的高计算成本，基于有限数量的测试集的平均预测的评估可以被视为合理的折衷。

著录项

来源
《Procedia Computer Science》 |2021年第a期|共10页
作者
Grzegorz Baron; Urszula Stańczyk;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类
关键词
Cross-validationFoldEvaluation of PerformanceClassificationDistributionStylometry;

机译：ProfileCtecclassificdistributionStyryromary的交叉验证尺寸;

相似文献

外文文献
中文文献
专利

1. Evaluation of breakdown characteristics of N₂ gas for non-standard lightning impulse waveforms - method for converting non-standard lightning impulse waveforms into standard lightning impulse waveforms - [J] . Wada J., Ueta G., Okabe S. Dielectrics and Electrical Insulation, IEEE Transactions on . 2013,第2期

机译：非标准雷电冲击波形的N _{2 气体击穿特性评估-将非标准雷电冲击波形转换为标准雷电冲击波形的方法-}
2. Evaluation of breakdown characteristics of CO2 gas for non-standard lightning impulse waveforms - Method for converting non-standard lightning impulse waveforms into standard lightning impulse waveforms - [J] . Ueta G., Wada J., Okabe S. Dielectrics and Electrical Insulation, IEEE Transactions on . 2011,第5期

机译：非标准雷电脉冲波形的CO2气体击穿特性评估-将非标准雷电脉冲波形转换为标准雷电脉冲波形的方法-
3. Evaluation of breakdown characteristics of gas insulated switchgears for non-standard lightning impulse waveforms - method for converting non-standard lightning impulse waveforms into standard lightning impulse waveforms - [J] . Okabe S., Yuasa S., Kaneko S., Dielectrics and Electrical Insulation, IEEE Transactions on . 2009,第1期

机译：评估非标准雷电冲击波形的气体绝缘开关设备的击穿特性-将非标准雷电冲击波形转换为标准雷电冲击波形的方法-
4. Simulation from non-standard distributions using envelope methods [C] . Evans, M.J., Swartz, . 2000

机译：使用包络法从非标准分布进行仿真
5. Dissembling disability: Performances of the non-standard body in early modern England. [D] . Row-Heyveld, Lindsey Dawn. 2011

机译：分解残疾：近代英格兰非标准身体的表现。
6. Study of the Self-Healing Performance of Semi-Flexible Pavement Materials Grouted with Engineered Cementitious Composites Mortar based on a Non-Standard Test [O] . Xu Cai, Wenke Huang, Kuanghuai Wu 2019

机译：基于非标准试验的工程胶凝复合砂浆灌浆半柔性路面材料的自愈性能研究
7. Phase space structure of Chern-Simons theory with a non-standard puncture [O] . Meusburger, C., Schroers, Bernd J. 2006

机译：具有非标准穿刺的Chern-Simons理论的相空间结构
8. Metric and Topology on a Non-Standard Real Line and Non-Standard Space-Time [R] . Tahir Shah, K. 1981

机译：非标准实线和非标准时空的度量和拓扑

Standard vs. non-standard cross-validation: evaluation of performance in a space with structured distribution of datapoints

摘要

著录项

相似文献

相关主题

期刊订阅