...
首页> 外文期刊>Academic radiology >Computer-aided detection; the effect of training databases on detection of subtle breast masses.
【24h】

Computer-aided detection; the effect of training databases on detection of subtle breast masses.

机译:计算机辅助检测;培训数据库对检测细微乳腺肿块的影响。

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

RATIONALE AND OBJECTIVES: Lesion conspicuity is typically highly correlated with visual difficulty for lesion detection, and computer-aided detection (CAD) has been widely used as a "second reader" in mammography. Hence, increasing CAD sensitivity in detecting subtle cancers without increasing false-positive rates is important. The aim of this study was to investigate the effect of training database case selection on CAD performance in detecting low-conspicuity breast masses. MATERIALS AND METHODS: A full-field digital mammographic image database that included 525 cases depicting malignant masses was randomly partitioned into three subsets. A CAD scheme was applied to detect all initially suspected mass regions and compute region conspicuity. Training samples were iteratively selected from two of the subsets. Four types of training data sets-(1) one including all available true-positive mass regions in the two subsets ("all"), (2) one including 350 randomly selected mass regions ("diverse"), (3) one including 350 high-conspicuity mass regions ("easy"), and (4) one including 350 low-conspicuity mass regions ("difficult")-were assembled. In each training data set, the same number of randomly selected false-positive regions as the true-positives were also included. Two classifiers, an artificial neural network (ANN) and a k-nearest neighbor (KNN) algorithm, were trained using each of the four training data sets and tested on all suspected regions in the remaining data set. Using a threefold cross-validation method, the performance changes of the CAD schemes trained using one of the four training data sets were computed and compared. RESULTS: CAD initially detected 1025 true-positive mass regions depicted on 507 cases (97% case-based sensitivity) and 9569 false-positive regions (3.5 per image) in the entire database. Using the all training data set, CAD achieved the highest overall performance on the entire testing database. However, CAD detected the highest number of low-conspicuity masses when the difficult training data set was used for training. Results did agree for both ANN-based and KNN-based classifiers in all tests. Compared to the use of the all training data set, the sensitivity of the schemes trained using the difficult data set decreased by 8.6% and 8.4% for the ANN and KNN algorithm on the entire database, respectively, but the detection of low-conspicuity masses increased by 7.1% and 15.1% for the ANN and KNN algorithm at a false-positive rate of 0.3 per image. CONCLUSIONS: CAD performance depends on the size, diversity, and difficulty level of the training database. To increase CAD sensitivity in detecting subtle cancer, one should increase the fraction of difficult cases in the training database rather than simply increasing the training data set size.
机译:理由和目标:病变的明显程度通常与病变检测的视觉困难高度相关,计算机辅助检测(CAD)已被广泛用作乳房X线照相术中的“第二阅读器”。因此,在不增加假阳性率的情况下提高检测微小癌症的CAD敏感性非常重要。这项研究的目的是调查培训数据库案例选择对检测低显著性乳腺肿块的CAD性能的影响。材料与方法:将包括525个描述恶性肿块的病例的全视野数字化乳房X线图像数据库随机分为三个子集。应用CAD方案检测所有最初怀疑的质量区域并计算区域显着性。从两个子集中迭代地选择训练样本。四种类型的训练数据集-(1)一种包括两个子集中的所有可用的真阳性质量区域(“全部”),(2)一种包括350个随机选择的质量区域(“不同”),(3)一种包括组装了350个高显着质量区域(“容易”)和(4)包括350个低显着质量区域(“困难”)的区域。在每个训练数据集中,还包括与真实阳性相同数量的随机选择的阴性阳性区域。使用四个训练数据集中的每一个训练了两个分类器,分别是人工神经网络(ANN)和k最近邻(KNN)算法,并在其余数据集中的所有可疑区域上进行了测试。使用三重交叉验证方法,计算并比较了使用四个训练数据集之一训练的CAD方案的性能变化。结果:CAD最初在整个数据库中检测到507个案例(基于案例的敏感度为97%)上描绘的1025个真阳性质量区域和9569个假阳性区域(每个图像3.5个)。使用所有培训数据集,CAD在整个测试数据库上均实现了最高的整体性能。但是,当使用困难的训练数据集进行训练时,CAD会检测到数量最多的低显著质量。在所有测试中,基于ANN和基于KNN的分类器的结果均一致。与使用所有训练数据集相比,使用困难数据集训练的方案对整个数据库的ANN和KNN算法的敏感性分别降低了8.6%和8.4%,但是对低显着质量的检测对于ANN和KNN算法,每幅图像的假阳性率为0.3,分别增加了7.1%和15.1%。结论:CAD性能取决于培训数据库的大小,多样性和难度。为了提高CAD在检测细微癌症中的敏感性,应该增加培训数据库中困难病例的比例,而不是简单地增加培训数据集的大小。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号