...
首页> 外文期刊>Semantic web >Publishing DisGeNET as nanopublications
【24h】

Publishing DisGeNET as nanopublications

机译:将DisGeNET发布为纳米出版物

获取原文
获取原文并翻译 | 示例
           

摘要

The increasing and unprecedented publication rate in the biomedical field is a major bottleneck for knowledge discovery in the Life Sciences. The manual curation of facts from published scientific papers is slow and inefficient, and therefore new approaches are needed that can enable the automatic, scalable and reliable extraction of assertions. While the publication of scientific assertions and datasets on the Semantic Web is gaining traction, it also creates new challenges such as the proper representation of provenance and versioning. Here, we address these issues and describe our efforts to represent the DisGeNET database of human gene-disease associations as permanent, immutable, and provenance rich digital objects called nanopublications. Our nanopublications are the first instance of a Linked Data model that ensures stable interlinking of the assertion and its metadata by Trusty URIs. As DisGeNET integrates manually curated as well as text-mined data of different origins, the semantic description of the evidence for each assertion is important to provide trust and allow evidence-based hypothesis generation. Here, we describe our steps to ensure high quality and demonstrate the utility of linking our data to other datasets on the emerging Semantic Web.
机译:在生物医学领域,不断增加和空前的出版率是生命科学领域知识发现的主要瓶颈。从已发表的科学论文中手动整理事实是缓慢且效率低下的,因此需要新的方法来实现对断言的自动,可伸缩和可靠的提取。虽然在语义Web上发布科学断言和数据集越来越受到欢迎,但它也带来了新的挑战,例如适当地表示出处和版本控制。在这里,我们解决了这些问题,并描述了我们将人类基因疾病协会的DisGeNET数据库表示为永久,不可变和起源丰富的数字对象(称为纳米出版物)的努力。我们的纳米出版物是链接数据模型的第一个实例,该模型可确保断言及其元数据通过Trusty URI进行稳定的互连。由于DisGeNET集成了人工策划的以及不同来源的文本挖掘的数据,因此每个断言的证据的语义描述对于提供信任并允许生成基于证据的假设非常重要。在这里,我们描述了确保高质量的步骤,并演示了将数据链接到新兴语义Web上的其他数据集的实用性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号