首页> 美国卫生研究院文献>Journal of Medical Imaging >Evaluation of data augmentation via synthetic images for improved breast mass detection on mammograms using deep learning
【2h】

Evaluation of data augmentation via synthetic images for improved breast mass detection on mammograms using deep learning

机译:利用深度学习评估通过合成图像评估通过合成图像改善乳房X线图的乳房谱系检测

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

We evaluated whether using synthetic mammograms for training data augmentation may reduce the effects of overfitting and increase the performance of a deep learning algorithm for breast mass detection. Synthetic mammograms were generated using in silico procedural analytic breast and breast mass modeling algorithms followed by simulated x-ray projections of the breast models into mammographic images. In silico breast phantoms containing masses were modeled across the four BI-RADS breast density categories, and the masses were modeled with different sizes, shapes, and margins. A Monte Carlo-based x-ray transport simulation code, MC-GPU, was used to project the three-dimensional phantoms into realistic synthetic mammograms. 2000 mammograms with 2522 masses were generated to augment a real data set during training. From the Curated Breast Imaging Subset of the Digital Database for Screening Mammography (CBIS-DDSM) data set, we used 1111 mammograms (1198 masses) for training, 120 mammograms (120 masses) for validation, and 361 mammograms (378 masses) for testing. We used faster R-CNN for our deep learning network with pretraining from ImageNet using the Resnet-101 architecture. We compared the detection performance when the network was trained using different percentages of the real CBIS-DDSM training set (100%, 50%, and 25%), and when these subsets of the training set were augmented with 250, 500, 1000, and 2000 synthetic mammograms. Free-response receiver operating characteristic (FROC) analysis was performed to compare performance with and without the synthetic mammograms. We generally observed an improved test FROC curve when training with the synthetic images compared to training without them, and the amount of improvement depended on the number of real and synthetic images used in training. Our study shows that enlarging the training data with synthetic samples can increase the performance of deep learning systems.
机译:我们评估了是否使用合成乳房X线图来训练数据增强可以降低过度拟合的影响,并提高深度学习算法进行乳房谱系检测的性能。在Silico程序分析乳房和乳房质量建模算法中产生合成乳房X线照片,然后在乳房模型的模拟X射线投影到乳房X光图像中。在硅乳母脊髓中,含有群体的含量越过四个Bi-Rads乳房密度类别,并且群众用不同的尺寸,形状和边缘进行建模。用于基于蒙特卡罗的X射线传输仿真代码MC-GPU,用于将三维幻影投影成现实的合成乳房X乳网显示器。生成2000年乳房X线照片,并产生了2522群众的乳房X线照片,以增加培训期间的真实数据。从数字数据库的策划乳房成像子集进行筛选乳房X线照相术(CBIS-DDSM)数据集,我们使用1111乳房X线照片(1198块)进行训练,120个乳房X线照片(120群)进行验证,以及361次乳房X线照片(378批)进行测试。我们使用reset-101架构的预先预先从Imagenet使用更快的R-CNN。我们比较了网络通过不同百分比的真实CBIS-DDSM培训集(100%,50%和25%)培训网络培训的检测性能,并且当培训集的这些子集增强250,500,000时,和2000年的合成乳房X线照片。进行自由响应接收器操作特性(FROC)分析以比较具有和没有合成乳房X光图的性能。我们通常观察到与合成图像的培训与没有它们的训练相比训练时改进的测试FROC曲线,并且改善量取决于训练中使用的真实和合成图像的数量。我们的研究表明,通过合成样品扩大培训数据可以提高深度学习系统的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号