首页> 美国卫生研究院文献>Molecules >Predictive Capability of QSAR Models Based on the CompTox Zebrafish Embryo Assays: An Imbalanced Classification Problem
【2h】

Predictive Capability of QSAR Models Based on the CompTox Zebrafish Embryo Assays: An Imbalanced Classification Problem

机译:基于Comptox斑马鱼胚胎测定的QSAR模型的预测能力:一个不平衡的分类问题

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

The CompTox Chemistry Dashboard (ToxCast) contains one of the largest public databases on Zebrafish (Danio rerio) developmental toxicity. The data consists of 19 toxicological endpoints on unique 1018 compounds measured in relatively low concentration ranges. The endpoints are related to developmental effects occurring in dechorionated zebrafish embryos for 120 hours post fertilization and monitored via gross malformations and mortality. We report the predictive capability of 209 quantitative structure–activity relationship (QSAR) models developed by machine learning methods using penalization techniques and diverse model quality metrics to cope with the imbalanced endpoints. All these QSAR models were generated to test how the imbalanced classification (toxic or non-toxic) endpoints could be predicted regardless which of three algorithms is used: logistic regression, multi-layer perceptron, or random forests. Additionally, QSAR toxicity models are developed starting from sets of classical molecular descriptors, structural fingerprints and their combinations. Only 8 out of 209 models passed the 0.20 Matthew’s correlation coefficient value defined a priori as a threshold for acceptable model quality on the test sets. The best models were obtained for endpoints mortality (MORT), ActivityScore and JAW (deformation). The low predictability of the QSAR model developed from the zebrafish embryotoxicity data in the database is mainly due to a higher sensitivity of 19 measurements of endpoints carried out on dechorionated embryos at low concentrations.
机译:Comptox化学仪表板(Toxcast)包含斑马鱼(Danio Rerio)发育毒性最大的公共数据库之一。数据由19个在相对低浓度范围内测量的独特1018化合物上的19个毒理学终点组成。终点与在剥离斑马鱼胚胎中发生的发育效果120小时,受精120小时,并通过畸形和死亡率监测。我们报告了209种定量结构 - 活动关系(QSAR)模型的预测能力,采用惩罚技术和不同的模型质量指标来应对不平衡的终点。生成所有这些QSAR模型以测试如何预测不合使用的三种算法中的不平衡分类(有毒或无毒)端点:Logistic回归,多层感知或随机林。另外,QSAR毒性模型是从经典分子描述符,结构指纹及其组合的组开发的。 209型号中只有8个通过了0.20 Matthew的相关系数值定义了一个先验的作为测试集上可接受的模型质量的阈值。获得最佳模型,用于终点死亡率(Mort),活动谱和颌骨(变形)。从数据库中斑马鱼胚胎毒性数据开发的QSAR模型的低可预测性主要是由于在低浓度下在剥离胚胎上进行的19次终点测量的敏感性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号