...
首页> 外文期刊>BMC Medical Informatics and Decision Making >AdaBoost-based multiple SVM-RFE for classification of mammograms in DDSM
【24h】

AdaBoost-based multiple SVM-RFE for classification of mammograms in DDSM

机译:基于AdaBoost的多个SVM-RFE用于DDSM中的乳房X线照片分类

获取原文
           

摘要

BackgroundDigital mammography is one of the most promising options to diagnose breast cancer which is the most common cancer in women. However, its effectiveness is enfeebled due to the difficulty in distinguishing actual cancer lesions from benign abnormalities, which results in unnecessary biopsy referrals. To overcome this issue, computer aided diagnosis (CADx) using machine learning techniques have been studied worldwide. Since this is a classification problem and the number of features obtainable from a mammogram image is infinite, a feature selection method that is tailored for use in the CADx systems is needed.MethodsWe propose a feature selection method based on multiple support vector machine recursive feature elimination (MSVM-RFE). We compared our method with four previously proposed feature selection methods which use support vector machine as the base classifier. Experiments were performed on lesions extracted from the Digital Database of Screening Mammography, the largest public digital mammography database available. We measured average accuracy over 5-fold cross validation on the 8 datasets we extracted.ResultsSelecting from 8 features, conventional algorithms like SVM-RFE and multiple SVM-RFE showed slightly better performance than others. However, when selecting from 22 features, our proposed modified multiple SVM-RFE using boosting outperformed or was at least competitive to all others.ConclusionOur modified method may be a possible alternative to SVM-RFE or the original MSVM-RFE in many cases of interest. In the future, we need a specific method to effectively combine models trained during the feature selection process and a way to combine feature subsets generated from individual SVM-RFE instances.
机译:背景技术乳腺X线摄影是诊断乳腺癌的最有前途的选择之一,乳腺癌是女性最常见的癌症。但是,由于难以区分实际的癌病灶和良性异常,因此其有效性减弱,从而导致不必要的活检转诊。为了克服这个问题,已经在全球范围内研究了使用机器学习技术的计算机辅助诊断(CADx)。由于这是一个分类问题,并且可以从乳房X线照片上获得的特征数量是无限的,因此需要针对CADx系统量身定制的特征选择方法。方法我们提出了一种基于多支持向量机递归特征消除的特征选择方法(MSVM-RFE)。我们将我们的方法与之前使用支持向量机作为基础分类器的四种特征选择方法进行了比较。对从筛查性X线摄影术的数字数据库(最大的公共数字化X线摄影术数据库)中提取的病变进行了实验。我们对提取的8个数据集进行了5倍交叉验证,测量了平均准确性。结果从8个功能中进行选择后,常规算法(例如SVM-RFE和多个SVM-RFE)显示出比其他算法稍好的性能。但是,当从22种功能中进行选择时,我们建议的使用升压的改进多SVM-RFE表现优于或至少与所有其他竞争产品一样。结论在许多感兴趣的情况下,我们的改进方法可能是SVM-RFE或原始MSVM-RFE的替代方法。将来,我们需要一种特定的方法来有效地组合在特征选择过程中训练的模型,以及一种组合从各个SVM-RFE实例生成的特征子集的方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号