Comparison of validation variants by sum of ranking differences and ANOVA

Heberger Karoly; Kollar-Hunek Klara

首页> 外文期刊>Journal of Chemometrics >Comparison of validation variants by sum of ranking differences and ANOVA

【24h】

Comparison of validation variants by sum of ranking differences and ANOVA

机译：排名差异和ANOVA的验证变体的比较

获取原文

获取原文并翻译 | 示例

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

The old debate is revived: Definite differences can be observed in suggestions of estimation for prediction performances of models and for validation variants according to the various scientific disciplines. However, the best and/or recommended practice for the same data set cannot be dependent on the field of usage. Fortunately, there is a method comparison algorithm, which can rank and group the validation variants; its combination with variance analysis will reveal whether the differences are significant or merely the play of random errors. Therefore, three case studies have been selected carefully to reveal similarities and differences in validation variants. The case studies illustrate the different significance of these variants well. In special circumstances, any of the influential factors for validation variants can exert significant influence on evaluation by sums of (absolute) ranking differences (SRDs): stratified (contiguous block) or repeated Monte Carlo resampling and how many times the data set is split (5-7-10). The optimal validation variant should be determined individually again and again. A random resampling with sevenfold cross-validations seems to be a good compromise to diminish the bias and variance alike. If the data structure is unknown, a randomization of rows is suggested before SRD analysis. On the other hand, the differences in classifiers, validation schemes, and models proved to be always significant, and even subtle differences can be detected reliably using SRD and analysis of variance (ANOVA).

机译：恢复旧辩论：根据各种科学学科的预测性能的估计和验证变体的估计，可以观察到明确的差异。但是，相同数据集的最佳和/或推荐的做法不能依赖于使用领域。幸运的是，有一种方法比较算法，可以排名和分组验证变体;它与方差分析的结合将揭示差异是否是显着的或仅仅是随机误差的播放。因此，仔细选择了三种案例研究，以揭示验证变体中的相似性和差异。案例研究说明了这些变体的不同意义。在特殊情况下，任何用于验证变体的影响因素都可以通过（绝对）排名差异（SRD）的总和对评估产生重大影响：分层（连续块）或重复的蒙特卡罗重新采样以及数据集分为多次（ 5-7-10）。应当再次又一次地单独确定最佳验证变量。随机重新采样，具有七倍交叉验证似乎是一种良好的折衷，以减少偏差和方差相似。如果数据结构未知，则在SRD分析之前建议行的随机化。另一方面，使用SRD和方差分析（ANOVA）可以可靠地检测分类器，验证方案和模型中的分类器，验证方案和模型的差异，甚至可以可靠地检测到细微差异（ANOVA）。

著录项

来源
《Journal of Chemometrics》 |2019年第6期|共14页
作者
Heberger Karoly; Kollar-Hunek Klara;
展开▼
作者单位

Hungarian Acad Sci Res Ctr Nat Sci Inst Mat &

Environm Chem Plasma Chem Res Grp Magyar Tudosok Krt 2 H-1117 Budapest Hungary;

Budapest Univ Technol &

Econ Dept Inorgan &

Analyt Chem Budapest Hungary;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类化学;
关键词
cross-validation; method comparison; model validation; ranking; resampling;

机译：交叉验证;方法比较;模型验证;排名;重新采样;

相似文献

外文文献
中文文献
专利

1. Comparison of validation variants by sum of ranking differences and ANOVA [J] . Heberger Karoly, Kollar-Hunek Klara Journal of Chemometrics . 2019,第6期

机译：排名差异和ANOVA的验证变体的比较
2. Beer microfiltration with static turbulence promoter: Sum of ranking differences comparison [J] . Varga Aron, Gaspar Igor, Juhasz Reka, Journal of food process engineering . 2019,第1期

机译：带有静态湍流促进剂的啤酒微滤：排名差异总和比较
3. Sum of ranking differences in comparison of nickel-coated carbon nanofibers adsorbents in capacity and randomness of 1-butanethiol (1-butyl mercaptan) adsorption [J] . Karami Farshad, Khanmohammadi Mohammadreza, Garmarudi Amir Bagheri Journal of the Iranian Chemical Society . 2016,第12期

机译：镍包覆碳纳米纤维吸附剂在1-丁烷硫醇（1-丁基硫醇）吸附能力和无规性方面比较的等级差异总和
4. System Effect Estimation by Sharding: A Comparison Between ANOVA Approaches to Detect Significant Differences [C] . Guglielmo Faggioli, Nicola Ferro European Conference on Information Retrieval . 2021

机译：通过分片进行系统效应估计：ANOVA方法之间的比较检测显着差异
5. Heterogeneity Within Primary Progressive Aphasia (PPA): Differences Between Variants in Functional Communication, and a New Sub-Variant of Logopenic Variant PPA [D] . Gallée, Jeanne. 2021

机译：原发性渐进性失血病（PPA）内的异质性：功能性通信变体之间的差异，以及令人肺脑变异PPA的新次变体
6. Apportionment and districting by Sum of Ranking Differences [O] . Balázs R. Sziklai, Károly Héberger 2020

机译：按排名差异的分配和分区
7. Comparison of multianalyte proficiency test results by sum of ranking differences, principal component analysis, and hierarchical cluster analysis [O] . Škrbić Biljana, Héberger Károly, Durišić-Mladenović Nataša 2013

机译：通过排名差异，主成分分析和层次聚类分析的总和来比较多分析物能力验证结果

Comparison of validation variants by sum of ranking differences and ANOVA

摘要

著录项

相似文献

相关主题

期刊订阅