Source: BMC Bioinformatics

The gene normalization task in BioCreative III

Abstract

Background: We report on the Gene Normalization (GN) challenge in BioCreative III, in which participating teams were asked to return a ranked list of identifiers for the genes detected in full-text articles. For training, 32 fully annotated and 500 partially annotated articles were prepared. A total of 507 articles were selected as the test set. Because of the high annotation cost, it was not feasible to obtain gold-standard human annotations for all test articles. Instead, we developed an Expectation Maximization (EM) approach for choosing a small number of test articles for manual annotation, namely those most capable of differentiating team performance. The same algorithm was subsequently used to infer ground truth based solely on team submissions. We report team performance on both the gold standard and the inferred ground truth using a newly proposed metric, Threshold Average Precision (TAP-k).
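The abstract does not spell out the EM model, so the following is only a rough illustration of how ground truth might be inferred from team submissions alone. It assumes a generic Dawid-Skene-style latent-class model with one sensitivity and one specificity per team, which is not necessarily the authors' algorithm. The function name `em_infer_truth` and the binary `votes` matrix (candidate genes by teams) are hypothetical, and rankings are ignored for simplicity.

```python
import math


def _clamp(x, eps):
    """Keep a probability strictly inside (0, 1) so logs stay finite."""
    return min(max(x, eps), 1.0 - eps)


def em_infer_truth(votes, n_iter=50, eps=1e-6):
    """Estimate P(candidate is a true gene) from binary team votes.

    votes[i][j] is 1 if team j returned candidate i, else 0.
    Returns (posteriors, sensitivities, specificities).
    Assumption: teams err independently given the hidden truth.
    """
    n_items = len(votes)
    n_teams = len(votes[0])
    # Initialise each posterior with the fraction of teams that voted yes.
    post = [sum(row) / n_teams for row in votes]
    sens = [0.5] * n_teams
    spec = [0.5] * n_teams

    for _ in range(n_iter):
        # M-step: re-estimate the prior and each team's error rates.
        total_pos = sum(post)
        total_neg = n_items - total_pos
        prior = _clamp(total_pos / n_items, eps)
        for j in range(n_teams):
            tp = sum(post[i] * votes[i][j] for i in range(n_items))
            tn = sum((1 - post[i]) * (1 - votes[i][j]) for i in range(n_items))
            sens[j] = _clamp(tp / max(total_pos, eps), eps)
            spec[j] = _clamp(tn / max(total_neg, eps), eps)

        # E-step: update the posterior of each candidate in log space.
        new_post = []
        for i in range(n_items):
            log_pos = math.log(prior)
            log_neg = math.log(1 - prior)
            for j in range(n_teams):
                if votes[i][j]:
                    log_pos += math.log(sens[j])
                    log_neg += math.log(1 - spec[j])
                else:
                    log_pos += math.log(1 - sens[j])
                    log_neg += math.log(spec[j])
            m = max(log_pos, log_neg)
            e_pos = math.exp(log_pos - m)
            e_neg = math.exp(log_neg - m)
            new_post.append(e_pos / (e_pos + e_neg))
        post = new_post

    return post, sens, spec


# Toy data (made up): 6 candidate genes, 4 teams; teams 0 and 1 agree closely.
toy_votes = [
    [1, 1, 1, 0],
    [1, 1, 1, 1],
    [0, 0, 0, 1],
    [0, 0, 0, 0],
    [1, 1, 0, 1],
    [0, 0, 1, 0],
]
posteriors, sensitivities, specificities = em_infer_truth(toy_votes)
```

In this sketch a candidate with a posterior above 0.5 would be treated as inferred ground truth, and teams whose votes agree with the consensus end up with higher estimated sensitivity and specificity, so their votes carry more weight in later iterations.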
