首页> 外文期刊>BMC Bioinformatics >CEDAR OnDemand: a browser extension to generate ontology-based scientific metadata
【24h】

CEDAR OnDemand: a browser extension to generate ontology-based scientific metadata

机译:CEDAR OnDemand:一种浏览器扩展,用于生成基于本体的科学元数据

获取原文
           

摘要

Public biomedical data repositories often provide web-based interfaces to collect experimental metadata. However, these interfaces typically reflect the ad hoc metadata specification practices of the associated repositories, leading to a lack of standardization in the collected metadata. This lack of standardization limits the ability of the source datasets to be broadly discovered, reused, and integrated with other datasets. To increase reuse, discoverability, and reproducibility of the described experiments, datasets should be appropriately annotated by using agreed-upon terms, ideally from ontologies or other controlled term sources. This work presents “CEDAR OnDemand”, a browser extension powered by the NCBO (National Center for Biomedical Ontology) BioPortal that enables users to seamlessly enter ontology-based metadata through existing web forms native to individual repositories. CEDAR OnDemand analyzes the web page contents to identify the text input fields and associate them with relevant ontologies which are recommended automatically based upon input fields’ labels (using the NCBO ontology recommender) and a pre-defined list of ontologies. These field-specific ontologies are used for controlling metadata entry. CEDAR OnDemand works for any web form designed in the HTML format. We demonstrate how CEDAR OnDemand works through the NCBI (National Center for Biotechnology Information) BioSample web-based metadata entry. CEDAR OnDemand helps lower the barrier of incorporating ontologies into standardized metadata entry for public data repositories. CEDAR OnDemand is available freely on the Google Chrome store https://chrome.google.com/webstore/search/CEDAROnDemand.
机译:公共生物医学数据存储库通常提供基于Web的界面来收集实验性元数据。但是,这些接口通常反映关联存储库的临时元数据规范实践,从而导致所收集的元数据缺乏标准化。缺乏标准化限制了源数据集被广泛发现,重用以及与其他数据集集成的能力。为了增加所描述实验的重用性,可发现性和可重复性,应该使用协议术语对数据集进行适当注释,理想情况下应使用本体或其他受控术语来源。这项工作提出了“ CEDAR OnDemand”,这是一种由NCBO(国家生物医学本体中心)BioPortal提供支持的浏览器扩展程序,它使用户可以通过各个存储库固有的现有Web表单无缝输入基于本体的元数据。 CEDAR OnDemand分析网页内容以识别文本输入字段,并将它们与相关的本体相关联,这些本体根据输入字段的标签(使用NCBO本体推荐器)和预先定义的本体列表自动推荐。这些特定于字段的本体用于控制元数据输入。 CEDAR OnDemand适用于以HTML格式设计的任何Web表单。我们将通过NCBI(国家生物技术信息中心)BioSample基于Web的元数据条目演示CEDAR OnDemand的工作方式。 CEDAR OnDemand有助于降低将本体集成到公共数据存储库的标准化元数据条目中的障碍。 CEDAR OnDemand可在Google Chrome浏览器商店https://chrome.google.com/webstore/search/CEDAROnDemand上免费获得。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号