(Psycho-)analysis of benchmark experiments: A formal framework for investigating the relationship between data sets and learning algorithms

Manuel J.A. Eugster; Friedrich Leisch; Carolin Strobl

首页> 外文期刊>Computational statistics & data analysis >(Psycho-)analysis of benchmark experiments: A formal framework for investigating the relationship between data sets and learning algorithms

【24h】

(Psycho-)analysis of benchmark experiments: A formal framework for investigating the relationship between data sets and learning algorithms

机译：基准实验的（心理）分析：调查数据集和学习算法之间关系的正式框架

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

It is common knowledge that the performance of different learning algorithms depends on certain characteristics of the data-such as dimensionality, linear separability or sample size. However, formally investigating this relationship in an objective and reproducible way is not trivial. A new formal framework for describing the relationship between data set characteristics and the performance of different learning algorithms is proposed. The framework combines the advantages of benchmark experiments with the formal description of data set characteristics by means of statistical and information-theoretic measures and with the recursive partitioning of Bradley-Terry models for comparing the algorithms’ performances. The formal aspects of each component are introduced and illustrated by means of an artificial example. Its real-world usage is demonstrated with an application example consisting of thirteen widely-used data sets and six common learning algorithms. The Appendix provides information on the implementation and the usage of the framework within the R language.

机译：众所周知，不同学习算法的性能取决于数据的某些特征，例如维数，线性可分离性或样本大小。但是，以客观和可重复的方式正式研究这种关系并非易事。提出了一个新的形式化框架，用于描述数据集特征和不同学习算法的性能之间的关系。该框架结合了基准实验的优势，通过统计和信息理论方法对数据集特征的形式描述以及与Bradley-Terry模型的递归划分相比较的算法性能。通过一个人工示例介绍并说明了每个组件的形式方面。通过一个由13个广泛使用的数据集和6种常见学习算法组成的应用示例演示了它的实际用法。附录提供有关R语言中框架的实现和用法的信息。

著录项

来源
《Computational statistics & data analysis》 |2014年第null期|共15页
作者
Manuel J.A. Eugster; Friedrich Leisch; Carolin Strobl;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类概率论与数理统计;数据处理、数据处理系统;
关键词
Benchmark experiments; Data set characterization; Recursive partitioning; Preference scaling; Bradley-Terry model;

机译：基准实验;数据集表征;递归分区;偏好缩放;Bradley-Terry模型;

相似文献

外文文献
中文文献
专利

1. (Psycho-)analysis of benchmark experiments: A formal framework for investigating the relationship between data sets and learning algorithms [J] . Manuel J.A. Eugster, Friedrich Leisch, Carolin Strobl Computational statistics & data analysis . 2014,第Null期

机译：基准实验的（心理）分析：调查数据集和学习算法之间关系的正式框架
2. Investigation of an Immunoassay with Broad Specificity to Quinolone Drugs by Genetic Algorithm with Linear Assignment of Hypermolecular Alignment of Data Sets and Advanced Quantitative Structure-Activity Relationship Analysis [J] . Chen Jiahong, Lu Ning, Shen Xing, Journal of Agricultural and Food Chemistry . 2016,第13期

机译：数据集超分子比对线性分配和高级定量构效关系分析的遗传算法研究对喹诺酮类药物具有广泛特异性的免疫分析方法
3. KEEL Data-Mining Software Tool: Data Set Repository, Integration of Algorithms and Experimental Analysis Framework [J] . J. Alcala-fdez, A. Fernandez, J. Luengo, Journal of multiple-valued logic and soft computing . 2011,第2a3期

机译：KEEL数据挖掘软件工具：数据集存储库，算法和实验分析框架的集成
4. Place Recognition in Gardens by Learning Visual Representations: Data Set and Benchmark Analysis [C] . Maria Leyva-Vallina, Nicola Strisciuglio, Nicolai Petkov International Conference on Computer Analysis of Images and Patterns . 2019

机译：通过学习视觉表示法识别花园中的位置：数据集和基准分析
5. Data Collection Framework and Machine Learning Algorithms for the Analysis of Cyber Security Attacks [D] . Calvert, Chad. 2019

机译：数据收集框架和机器学习算法，用于分析网络安全攻击
6. Formal Medical Knowledge Representation Supports Deep Learning Algorithms Bioinformatics Pipelines Genomics Data Analysis and Big Data Processes [O] . Ferdinand Dhombres, Jean Charlet 2019

机译：正式的医学知识表示支持深度学习算法生物信息学管道基因组学数据分析和大数据过程
7. (Psycho-)Analysis of Benchmark Experiments A Formal Framework for Investigating the Relationship between Data Sets and Learning Algorithms [O] . Manuel J. A. Eugster, Friedrich Leisch, Carolin Strobl 2015

机译：（心理 - ）基准实验分析研究数据集与学习算法之间关系的形式框架

(Psycho-)analysis of benchmark experiments: A formal framework for investigating the relationship between data sets and learning algorithms

摘要

著录项

相似文献

相关主题

期刊订阅