首页> 外文期刊>Journal of chemical information and modeling >. Developing Collaborative QSAR Models Without Sharing Structures
【24h】

. Developing Collaborative QSAR Models Without Sharing Structures

机译:。 在没有共享结构的情况下开发协作QSAR模型

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

It is widely understood that QSAR models greatly improve if more data are used. However, irrespective of model quality, once chemical structures diverge too far from the initial data set, the predictive performance of a model degrades quickly. To increase the applicability domain we need to increase the diversity of the training set. This can be achieved by combining data from diverse sources. Public data can be easily included; however, proprietary data may be more difficult to add due to intellectual property concerns. In this contribution, we will present a method for the collaborative development of linear regression models that addresses this problem. The method differs from other past approaches, because data are only shared in an aggregated form. This prohibits access to individual data points and therefore avoids the disclosure of confidential structural information. The final models are equivalent to models that were built with combined data sets.
机译:众所周知,如果使用更多数据,QSAR模型会大大提高。 然而,无论模型质量如何,一旦化学结构往远离初始数据集太远,模型的预测性能就会迅速降低。 为了增加适用性域,我们需要增加培训集的多样性。 这可以通过组合来自不同来源的数据来实现的。 公共数据可以很容易地包括; 然而,由于知识产权问题,专有数据可能更难以添加。 在这一贡献中,我们将介绍一种解决此问题的线性回归模型的协作开发方法。 该方法与其他过去的方法不同,因为数据仅以聚合的形式共享。 这禁止访问各个数据点,因此避免披露机密结构信息。 最终模型等同于使用组合数据集构建的模型。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号