Source: Database: The Journal of Biological Databases and Curation
The curation paradigm and application tool used for manual curation of the scientific literature at the Comparative Toxicogenomics Database

Abstract

The Comparative Toxicogenomics Database (CTD) is a public resource that promotes understanding of the effects of environmental chemicals on human health. CTD biocurators read the scientific literature and convert free-text information into a structured format using official nomenclature, integrating third-party controlled vocabularies for chemicals, genes, diseases and organisms, and a novel controlled vocabulary for molecular interactions. Manual curation produces a robust, richly annotated dataset of highly accurate and detailed information. Currently, CTD describes over 349 000 molecular interactions between 6800 chemicals, 20 900 genes (for 330 organisms) and 4300 diseases that have been manually curated from over 25 400 peer-reviewed articles. These manually curated data are further integrated with other third-party data (e.g. Gene Ontology, KEGG and Reactome annotations) to generate a wealth of toxicogenomic relationships. Here, we describe our approach to manual curation, which uses a powerful and efficient paradigm involving mnemonic codes. This strategy allows biocurators to quickly capture detailed information from articles by generating simple statements that use codes to represent the relationships between data types. The paradigm is versatile, expandable and able to accommodate new data challenges as they arise. We have incorporated this strategy into a web-based curation tool to further increase efficiency and productivity, implement quality control in real time and accommodate biocurators working remotely.

Database URL:
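The idea of capturing an interaction as a short coded statement can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the abstract does not list CTD's actual mnemonic codes, statement syntax or validation rules, so the code tables, the four-token statement layout, the `Interaction` record and the names `chemicalX` and `GENE1` are all hypothetical. The sketch only shows how a compact statement might be expanded into a structured record and rejected at entry time if it contains an unknown code.

```python
from dataclasses import dataclass

# Hypothetical mnemonic codes (NOT CTD's real vocabulary).
DEGREE_CODES = {"inc": "increases", "dec": "decreases", "aff": "affects"}
ACTION_CODES = {"exp": "expression", "act": "activity", "pho": "phosphorylation"}


@dataclass(frozen=True)
class Interaction:
    """Structured form of one curated chemical-gene statement."""
    chemical: str
    degree: str
    action: str
    gene: str

    def sentence(self) -> str:
        # Render the record as a readable sentence.
        return f"{self.chemical} {self.degree} {self.action} of {self.gene}"


def parse_statement(statement: str) -> Interaction:
    """Expand 'CHEMICAL DEGREE ACTION GENE' into an Interaction.

    Assumes single-token identifiers for the chemical and the gene.
    Raises ValueError for malformed statements or unknown codes,
    which stands in for a quality-control check at entry time.
    """
    parts = statement.split()
    if len(parts) != 4:
        raise ValueError(f"expected 4 tokens, got {len(parts)}: {statement!r}")
    chemical, degree, action, gene = parts
    if degree not in DEGREE_CODES:
        raise ValueError(f"unknown degree code: {degree!r}")
    if action not in ACTION_CODES:
        raise ValueError(f"unknown action code: {action!r}")
    return Interaction(chemical, DEGREE_CODES[degree], ACTION_CODES[action], gene)


if __name__ == "__main__":
    # Placeholder identifiers, not real curated data.
    record = parse_statement("chemicalX inc exp GENE1")
    print(record.sentence())
```

Keeping the code tables as plain dictionaries means new codes can be added without touching the parser, which loosely mirrors the expandability the abstract claims for the paradigm.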
