...
首页> 外文期刊>Neural Networks: The Official Journal of the International Neural Network Society >Training neural network classifiers for medical decision making: the effects of imbalanced datasets on classification performance.
【24h】

Training neural network classifiers for medical decision making: the effects of imbalanced datasets on classification performance.

机译:训练用于医疗决策的神经网络分类器:不平衡数据集对分类性能的影响。

获取原文
获取原文并翻译 | 示例
           

摘要

This study investigates the effect of class imbalance in training data when developing neural network classifiers for computer-aided medical diagnosis. The investigation is performed in the presence of other characteristics that are typical among medical data, namely small training sample size, large number of features, and correlations between features. Two methods of neural network training are explored: classical backpropagation (BP) and particle swarm optimization (PSO) with clinically relevant training criteria. An experimental study is performed using simulated data and the conclusions are further validated on real clinical data for breast cancer diagnosis. The results show that classifier performance deteriorates with even modest class imbalance in the training data. Further, it is shown that BP is generally preferable over PSO for imbalanced training data especially with small data sample and large number of features. Finally, it is shown that there is no clear preference between oversampling and no compensation approach and some guidance is provided regarding a proper selection.
机译:这项研究调查了在开发用于计算机辅助医学诊断的神经网络分类器时训练数据中类别不平衡的影响。在存在医学数据中典型的其他特征的情况下进行研究,即其他特征包括小的训练样本大小,大量特征以及特征之间的相关性。探索了两种神经网络训练方法:经典后向传播(BP)和具有临床相关训练标准的粒子群优化(PSO)。使用模拟数据进行了一项实验研究,并在用于乳腺癌诊断的实际临床数据上进一步验证了结论。结果表明,即使训练数据中的班级失衡程度适中,分类器性能也会下降。此外,还表明,对于失衡的训练数据,尤其是在数据样本少,特征多的情况下,BP通常比PSO更可取。最后,表明在过采样和不采用补偿方法之间没有明显的偏爱,并且提供了有关正确选择的一些指导。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号