首页> 外文期刊>Bioinformatics >GeneTUKit: a software for document-level gene normalization
【24h】

GeneTUKit: a software for document-level gene normalization

机译:GeneTUKit:用于文档级基因标准化的软件

获取原文
获取原文并翻译 | 示例
       

摘要

Motivation: Linking gene mentions in an article to entries of biological databases can facilitate indexing and querying biological literature greatly. Due to the high ambiguity of gene names, this task is particularly challenging. Manual annotation for this task is cost expensive, time consuming and labor intensive. Therefore, providing assistive tools to facilitate the task is of high value.Results: We developed GeneTUKit, a document-level gene normalization software for full-text articles. This software employs both local context surrounding gene mentions and global context from the whole full-text document. It can normalize genes of different species simultaneously. When participating in BioCreAtIvE III, the system obtained good results among 37 runs: the system was ranked first, fourth and seventh in terms of TAP-20, TAP-10 and TAP-5, respectively on the 507 full-text test articles.
机译:动机:将文章中提及的基因链接到生物学数据库的条目可以极大地方便索引和查询生物学文献。由于基因名称的歧义性很高,因此这项任务特别具有挑战性。手动注释此任务成本高昂,费时且劳动密集。因此,提供辅助工具来完成这项任务具有很高的价值。结果:我们开发了GeneTUKit,这是一种用于全文文章的文档级基因标准化软件。该软件采用围绕基因提及的局部上下文和整个全文文档中的全局上下文。它可以同时标准化不同物种的基因。当参加BioCreAtIvE III时,该系统在37次运行中获得了良好的结果:在507篇全文测试文章中,该系统分别在TAP-20,TAP-10和TAP-5方面排名第一,第四和第七。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号