Learning from imbalanced data: a comparative study for Colon CAD

机译：从不平衡数据中学习：Colon CAD的比较研究

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

Classification plays an important role in the reduction of false positives in many computer aided detection and diagnosis methods. The difficulty of classifying polyps lies in the variation of possible polyp shapes and sizes and the imbalance between the number of polyp and non-polyp regions available in the training data. CAD schemes for medical applications demand high levels of sensitivity even at the expense of keeping a certain number of false positives. In this paper, we investigate some state-of-the-art solutions to the imbalanced data problem: Synthetic Minority Over-sampling Technique (SMOTE) and weighted Support Vector Machines (SVM). We tested these methods using a diverse database of CT colonography, which included a wide spectrum of difficult cases to detect polyps. We performed several experiments with different combinations of over-sampling techniques on training data. The results demonstrated that SVMs have achieved much better performance over C4.5 with different over-sampling techniques. Also, the results show that weighted SVM without over-sampling can achieve comparable performance in terms of sensitivity and specificity to conventional SVM combined with the over-sampling approach.

机译：在许多计算机辅助检测和诊断方法中，分类在减少误报中起着重要作用。息肉分类的困难在于可能的息肉形状和大小的变化以及训练数据中可用的息肉和非息肉区域的数量之间的不平衡。即使在保留一定数量的误报的代价下，用于医疗应用的CAD方案也需要很高的灵敏度。在本文中，我们研究了一些不平衡数据问题的最新解决方案：综合少数族裔过采样技术（SMOTE）和加权支持向量机（SVM）。我们使用多样化的CT结肠成像数据库测试了这些方法，该数据库包括范围广泛的难以检测息肉的病例。我们对训练数据使用过采样技术的不同组合进行了几次实验。结果表明，使用不同的过采样技术，SVM的性能优于C4.5。同样，结果表明，在不进行过度采样的情况下，加权SVM在灵敏度和特异性方面可以达到与传统SVM结合过采样方法相媲美的性能。

著录项

来源
《Conference on Medical Imaging 2008: Computer-Aided Diagnosis; 20080219-21; San Diego,CA(US)》|2008年|P.iix|共2页
会议地点 San DiegoCA(US)
作者
Xiaoyun Yang; Yalin Zheng; Musib Siddique; Gareth Beddoe;
展开▼
作者单位

Medicsight PLC, Kensington Centre, 66 Hammersmith Road, London, W14 8UD, UK;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类医用物理学;
关键词
colonic polyp detection; support vector machine; C4.5; over-sampling;

机译：结肠息肉检测；支持向量机； C4.5；过采样;

相似文献

外文文献
中文文献
专利

1. Learning With Imbalanced Data in Smart Manufacturing: A Comparative Analysis [J] . Yasmin Fathy, Mona Jaber, Alexandra Brintrup Quality Control, Transactions . 2021,第1期

机译：学习智能制造中的不平衡数据：比较分析
2. Prediction of secondary testosterone deficiency using machine learning: A comparative analysis of ensemble and base classifiers, probability calibration, and sampling strategies in a slightly imbalanced dataset [J] . Monique Tonani Novaes, Osmar Luiz Ferreira de Carvalho, Pedro Henrique Guimar?es Ferreira, Informatics in Medicine Unlocked . 2021,第a期

机译：使用机器学习预测次级睾酮缺乏：略微不平衡数据集中的集合和基础分类器，概率校准和采样策略的比较分析
3. Classifying imbalanced data using BalanceCascade-based kernelized extreme learning machine [J] . Raghuwanshi Bhagat Singh, Shukla Sanyam Pattern Analysis and Applications . 2020,第3期

机译：使用基于BalanceCascade的内灵极限学习机进行分类的分类数据
4. Learning from imbalanced data: a comparative study for Colon CAD [C] . Xiaoyun Yang, Yalin Zheng, Musib Siddique, Conference on Medical Imaging: Computer-Aided Diagnosis . 2008

机译：从不平衡数据学习：冒号CAD的比较研究
5. Learning in extreme conditions: Online and active learning with massive, imbalanced and noisy data. [D] . Ertekin, Seyda. 2009

机译：极端条件下的学习：具有大量，不平衡且嘈杂的数据的在线和主动学习。
6. An empirical study of ensemble-based semi-supervised learning approaches for imbalanced splice site datasets [O] . Ana Stanescu, Doina Caragea 2015

机译：基于整体的不平衡拼接位点数据集半监督学习方法的实证研究
7. Comparative Performance of Deep Learning and Machine Learning Algorithms on Imbalanced Handwritten Data [O] . A’inur A’fifah, Amelia Ritahani, Abdullah Ahmad 2018

机译：基于手写数据的深度学习和机器学习算法的比较表现

Learning from imbalanced data: a comparative study for Colon CAD

摘要

著录项

相似文献

相关主题

期刊订阅