首页> 外文会议>Pacific Symposium on Biocomputing >CROWDSOURCING IMAGE ANNOTATION FOR NUCLEUS DETECTION AND SEGMENTATION IN COMPUTATIONAL PATHOLOGY: EVALUATING EXPERTS, AUTOMATED METHODS, AND THE CROWD
【24h】

CROWDSOURCING IMAGE ANNOTATION FOR NUCLEUS DETECTION AND SEGMENTATION IN COMPUTATIONAL PATHOLOGY: EVALUATING EXPERTS, AUTOMATED METHODS, AND THE CROWD

机译:计算病理中核检测和分割的众包图像注释:评估专家,自动化方法和人群

获取原文

摘要

The development of tools in computational pathology to assist physicians and biomedical scientists in the diagnosis of disease requires access to high-quality annotated images for algorithm learning and evaluation. Generating high-quality expert-derived annotations is time-consuming and expensive. We explore the use of crowdsourcing for rapidly obtaining annotations for two core tasks in computational pathology: nucleus detection and nucleus segmentation. We designed and implemented crowdsourcing experiments using the CrowdFlower platform, which provides access to a large set of labor channel partners that accesses and manages millions of contributors worldwide. We obtained annotations from four types of annotators and compared concordance across these groups. We obtained: crowdsourced annotations for nucleus detection and segmentation on a total of 810 images; annotations using automated methods on 810 images; annotations from research fellows for detection and segmentation on 477 and 455 images, respectively; and expert pathologist-derived annotations for detection and segmentation on 80 and 63 images, respectively. For the crowdsourced annotations, we evaluated performance across a range of contributor skill levels (1,2,or 3). The crowdsourced annotations (4,860 images in total) were completed in only a fraction of the time and cost required for obtaining annotations using traditional methods. For the nucleus detection task, the research fellow-derived annotations showed the strongest concordance with the expert pathologist-derived annotations (F-M =93.68%), followed by the crowd-sourced contributor levels 1,2, and 3 and the automated method, which showed relatively similar performance (F-M = 87.84%, 88.49%, 87.26%, and 86.99%, respectively). For the nucleus segmentation task, the crowdsourced contributor level 3-derived annotations, research fellow-derived annotations, and automated method showed the strongest concordance with the expert pathologist-derived annotations (F-M = 66.41%, 65.93%, and 65.36%, respectively), followed by the contributor levels 2 and 1 (60.89% and 60.87%, respectively). When the research fellows were used as a gold-standard for the segmentation task, all three contributor levels of the crowdsourced annotations significantly outperformed the automated method (F-M = 62.21%, 62.47%, and 65.15% vs. 51.92%). Aggregating multiple annotations from the crowd to obtain a consensus annotation resulted in the strongest performance for the crowd-sourced segmentation. For both detection and segmentation, crowd-sourced performance is strongest with small images (400 x 400 pixels) and degrades significantly with the use of larger images (600 x 600 and 800 x 800 pixels). We conclude that crowdsourcing to non-experts can be used for large-scale labeling microtasks in computational pathology and offers a new approach for the rapid generation of labeled images for algorithm development and evaluation.
机译:工具在计算病理发展,以协助医生和生物医学科学家在疾病的诊断需要获得高品质的注释的图像进行算法的学习和评估。生成高品质的专家衍生注释是耗时且昂贵的。我们探索利用众包的快速获得注释在计算病理学两大核心任务:核探测和细胞核分段。我们设计并使用CrowdFlower平台,它提供了访问大量的劳工渠道合作伙伴的访问和管理全球数百万的贡献者实现众包实验。我们从四个类型的注释获得注释和比较这几组的一致性。我们得到:对于核检测与分割上共有810个图像的众包注释;使用注解810倍上的图像的自动方法;从研究人员的检测和分割上477倍455的图像,分别注释;和专家分别病理学家衍生的检测和分割注解80倍63的图像,。对于众包注解,我们评估了在一定范围的贡献者的技能水平(1,2,或3)的性能。的众包注解(4860个图像中总)均仅在用于获得使用传统方法的注释所需的时间和成本的一小部分完成。对于细胞核检测任务,在研究员衍生的注释显示出与病理学专家衍生注解(FM = 93.68%)的最强的一致性,接着人群来源贡献者水平1,2和3和自动化方法,该方法表现出相对相似的性能(FM = 87.84%,88.49分别%,87.26%,86.99和%)。对于细胞核分割任务,所述众包贡献者级别3衍生的注解,研究员衍生的注解,和自动化的方法显示出与病理学专家衍生注解最强一致性(FM = 66.41%,65.93%,和65.36%,分别地) ,接着(分别为60.89%和60.87%,)贡献者等级2和1。当研究人员使用作为黄金标准的分割任务,所述众包注解的所有三个贡献者水平显著优于自动化方法(F-M = 62.21%,62.47%,65.15和%对51.92%)。从人群聚集多个注释,以获取导致人群来源的分割性能最强的共识注解。对于这两种检测与分割,人群来源性能最强具有小的图像(400×400像素)和劣化显著与使用较大的图像(600 * 600和800个* 800像素)的。我们得出结论,众包,以非专业人士可以用于计算病理大型标签microtasks并提供了快速生成用于算法开发和评估标记图像的新方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号