首页> 外文期刊>Frontiers in Psychology >Supervised classification in the presence of misclassified training data: a Monte Carlo simulation study in the three group case
【24h】

Supervised classification in the presence of misclassified training data: a Monte Carlo simulation study in the three group case

机译:在存在分类错误的训练数据的情况下进行监督分类:三组情况下的蒙特卡洛模拟研究

获取原文
           

摘要

Statistical classification of phenomena into observed groups is very common in the social and behavioral sciences. Statistical classification methods, however, are affected by the characteristics of the data under study. Statistical classification can be further complicated by initial misclassification of the observed groups. The purpose of this study is to investigate the impact of initial training data misclassification on several statistical classification and data mining techniques. Misclassification conditions in the three group case will be simulated and results will be presented in terms of overall as well as subgroup classification accuracy. Results show decreased classification accuracy as sample size, group separation and group size ratio decrease and as misclassification percentage increases with random forests demonstrating the highest accuracy across conditions.
机译:在社会和行为科学中,将现象统计划分为观察到的组非常普遍。但是,统计分类方法受所研究数据的特征影响。最初对观察组进行错误分类会使统计分类更加复杂。这项研究的目的是调查初始训练数据分类错误对几种统计分类和数据挖掘技术的影响。将对三组情况下的错误分类条件进行模拟,并以整体以及子组分类的准确性显示结果。结果表明,随机森林在不同条件下显示出最高的准确性,这是因为样本大小,组别和组大小比率的降低以及分类错误百分比的增加而导致分类精度下降。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号