首页> 外文期刊>Journal of digital imaging: the official journal of the Society for Computer Applications in Radiology >A Multi-million Mammography Image Dataset and Population-Based Screening Cohort for the Training and Evaluation of Deep Neural Networks-the Cohort of Screen-Aged Women (CSAW)
【24h】

A Multi-million Mammography Image Dataset and Population-Based Screening Cohort for the Training and Evaluation of Deep Neural Networks-the Cohort of Screen-Aged Women (CSAW)

机译:一百万乳房X线摄影图像数据集和基于人口的筛选队列,用于深神经网络的培训和评估 - 屏幕老年女性的队列(CSAW)

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

For AI researchers, access to a large and well-curated dataset is crucial. Working in the field of breast radiology, our aim was to develop a high-quality platform that can be used for evaluation of networks aiming to predict breast cancer risk, estimate mammographic sensitivity, and detect tumors. Our dataset, Cohort of Screen-Aged Women (CSAW), is a population-based cohort of all women 40 to 74 years of age invited to screening in the Stockholm region, Sweden, between 2008 and 2015. All women were invited to mammography screening every 18 to 24 months free of charge. Images were collected from the PACS of the three breast centers that completely cover the region. DICOM metadata were collected together with the images. Screening decisions and clinical outcome data were collected by linkage to the regional cancer center registers. Incident cancer cases, from one center, were pixel-level annotated by a radiologist. A separate subset for efficient evaluation of external networks was defined for the uptake area of one center. The collection and use of the dataset for the purpose of AI research has been approved by the Ethical Review Board. CSAW included 499,807 women invited to screening between 2008 and 2015 with a total of 1,182,733 completed screening examinations. Around 2 million mammography images have currently been collected, including all images for women who developed breast cancer. There were 10,582 women diagnosed with breast cancer; for 8463, it was their first breast cancer. Clinical data include biopsy-verified breast cancer diagnoses, histological origin, tumor size, lymph node status, Elston grade, and receptor status. One thousand eight hundred ninety-one images of 898 women had tumors pixel level annotated including any tumor signs in the prior negative screening mammogram. Our dataset has already been used for evaluation by several research groups. We have defined a high-volume platform for training and evaluation of deep neural networks in the domain of mammographic imaging.
机译:对于AI研究人员来说,访问大型和策划的数据集是至关重要的。在乳房放射学领域工作,我们的目的是开发一种高质量的平台,可用于评估旨在预测乳腺癌风险,估计乳腺敏感性和检测肿瘤的网络。我们的数据集是屏幕老年妇女(CSAW)的队列,是一个基于人口的所有女性群体,所有妇女40至74岁,邀请瑞典斯德哥尔摩区筛选,2008年至2015年。所有妇女都被邀请到乳房X线摄影筛查每18至24个月免费。从完全覆盖该地区的三个乳房中心的PACS收集图像。 DICOM元数据与图像一起收集。通过与区域癌症中心登记册的联系收集筛选决策和临床结果数据。从一个中心的事件癌症病例是放射科医师注释的像素水平。为一个中心的摄取区域定义了一个单独的有效评估外部网络的子集。 DataSet的收集和使用用于AI研究的目的已被道德审查委员会批准。 CSAW包括499,807名妇女,邀请2008年至2015年间筛选,共1,182,733次完成筛查考试。目前收集了大约200万乳房X线摄影图像,包括开发乳腺癌的女性的所有图像。有10,582名患有乳腺癌的女性;对于8463,这是他们的第一个乳腺癌。临床资料包括活检验证的乳腺癌诊断,组织学,肿瘤大小,淋巴结状态,ELSTON等级和受体状态。一千八百九十一体的898名女性的图像具有肿瘤像素水平,包括在前阴性筛查乳房X线图中的任何肿瘤症状。我们的数据集已被几个研究组用于评估。我们已经确定了乳房X光学域领域深神经网络的培训和评估的大批量平台。

著录项

相似文献

  • 外文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号