首页> 外文期刊>Journal of Cheminformatics >QSAR DataBank - an approach for the digital organization and archiving of QSAR model information
【24h】

QSAR DataBank - an approach for the digital organization and archiving of QSAR model information

机译:QSAR数据库-QSAR模型信息的数字化组织和归档方法

获取原文
       

摘要

Background Research efforts in the field of descriptive and predictive Quantitative Structure-Activity Relationships or Quantitative Structure–Property Relationships produce around one thousand scientific publications annually. All the materials and results are mainly communicated using printed media. The printed media in its present form have obvious limitations when they come to effectively representing mathematical models, including complex and non-linear, and large bodies of associated numerical chemical data. It is not supportive of secondary information extraction or reuse efforts while in silico studies poses additional requirements for accessibility, transparency and reproducibility of the research. This gap can and should be bridged by introducing domain-specific digital data exchange standards and tools. The current publication presents a formal specification of the quantitative structure-activity relationship data organization and archival format called the QSAR DataBank (QsarDB for shorter, or QDB for shortest). Results The article describes QsarDB data schema, which formalizes QSAR concepts (objects and relationships between them) and QsarDB data format, which formalizes their presentation for computer systems. The utility and benefits of QsarDB have been thoroughly tested by solving everyday QSAR and predictive modeling problems, with examples in the field of predictive toxicology, and can be applied for a wide variety of other endpoints. The work is accompanied with open source reference implementation and tools. Conclusions The proposed open data, open source, and open standards design is open to public and proprietary extensions on many levels. Selected use cases exemplify the benefits of the proposed QsarDB data format. General ideas for future development are discussed.
机译:背景技术在描述性和预测性定量结构-活动关系或定量结构-属性关系领域的研究工作每年产生约一千篇科学出版物。所有材料和结果主要通过印刷媒体进行交流。当前形式的打印介质在有效表示数学模型(包括复杂的和非线性的以及大量关联的数字化学数据)时存在明显的局限性。它不支持辅助信息的提取或重用,而计算机研究对研​​究的可访问性,透明度和可重复性提出了额外的要求。可以并且应该通过引入特定于域的数字数据交换标准和工具来弥合这种差距。当前出版物提供了定量结构与活动关系数据组织和档案格式的正式规范,称为QSAR数据库(简称QSARDB,简称QSAR)。结果本文介绍了QsarDB数据模式,该数据模式将QSAR概念(对象及其之间的关系)形式化,而QsarDB数据格式则将其形式化表示为计算机系统。 QsarDB的实用性和优势已通过解决日常QSAR和预测建模问题(包括预测毒理学领域的示例)得到了全面测试,并且可以应用于多种其他端点。这项工作随附了开源参考实现和工具。结论提议的开放数据,开放源代码和开放标准设计对许多级别的公共和专有扩展开放。选定的用例证明了建议的QsarDB数据格式的好处。讨论了未来发展的一般思路。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号