【24h】

Learning from imbalanced data: a comparative study for Colon CAD

机译:从不平衡数据中学习:Colon CAD的比较研究

获取原文
获取原文并翻译 | 示例

摘要

Classification plays an important role in the reduction of false positives in many computer aided detection and diagnosis methods. The difficulty of classifying polyps lies in the variation of possible polyp shapes and sizes and the imbalance between the number of polyp and non-polyp regions available in the training data. CAD schemes for medical applications demand high levels of sensitivity even at the expense of keeping a certain number of false positives. In this paper, we investigate some state-of-the-art solutions to the imbalanced data problem: Synthetic Minority Over-sampling Technique (SMOTE) and weighted Support Vector Machines (SVM). We tested these methods using a diverse database of CT colonography, which included a wide spectrum of difficult cases to detect polyps. We performed several experiments with different combinations of over-sampling techniques on training data. The results demonstrated that SVMs have achieved much better performance over C4.5 with different over-sampling techniques. Also, the results show that weighted SVM without over-sampling can achieve comparable performance in terms of sensitivity and specificity to conventional SVM combined with the over-sampling approach.
机译:在许多计算机辅助检测和诊断方法中,分类在减少误报中起着重要作用。息肉分类的困难在于可能的息肉形状和大小的变化以及训练数据中可用的息肉和非息肉区域的数量之间的不平衡。即使在保留一定数量的误报的代价下,用于医疗应用的CAD方案也需要很高的灵敏度。在本文中,我们研究了一些不平衡数据问题的最新解决方案:综合少数族裔过采样技术(SMOTE)和加权支持向量机(SVM)。我们使用多样化的CT结肠成像数据库测试了这些方法,该数据库包括范围广泛的难以检测息肉的病例。我们对训练数据使用过采样技术的不同组合进行了几次实验。结果表明,使用不同的过采样技术,SVM的性能优于C4.5。同样,结果表明,在不进行过度采样的情况下,加权SVM在灵敏度和特异性方面可以达到与传统SVM结合过采样方法相媲美的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号