首页> 外文期刊>SAR and QSAR in Environmental Research >Selection of data sets for QSARS: analyses of Tetrahymena toxicity from aromatic compounds
【24h】

Selection of data sets for QSARS: analyses of Tetrahymena toxicity from aromatic compounds

机译:选择QSARS数据集:分析芳香族化合物对四膜虫的毒性

获取原文
获取原文并翻译 | 示例
       

摘要

The aim of this investigation was to develop a strategy for the formulation of a valid ecotoxicological-based QSAR while, at the same time, minimizing the required number of toxicological data points. Two chemical selection approaches--distance-based optimality and K Nearest Neighbor (KNN), were used to examine the impact of the number of compounds used in the training and testing phases of QSAR development (i.e. diversity and representivity, respectively) on the predictivity (i.e. external validation) of the QSAR. Regression-based QSARs for the ectotoxic potency for population growth impairment of aromatic compounds (benzenes) to the aquatic ciliate Tetrahymena pyriformis were developed based on descriptors for chemical hydrophobicity and electrophilicity. A ratio of one compound in the training set to three in the test set was applied. The results indicate that from a known chemical universe, in this case 385 derivatives, robust QSARs of equal quality may be developed from a small number of diverse compounds, validated by a representative test set. As a conservative recommendation it is suggested that there should be a minimum of 10 observations for each variable in a QSAR.
机译:这项研究的目的是为制定有效的基于生态毒理学的QSAR制定策略,同时最大程度地减少所需的毒理学数据点数量。两种化学选择方法-基于距离的最优性和K最近邻(KNN),用于检验QSAR开发的训练和测试阶段所用化合物数量(即分别为多样性和代表性)对预测性的影响(即外部验证)的QSAR。基于化学疏水性和亲电性的描述符,开发了基于回归的QSARs,用于抑制芳香族化合物(苯)对水生纤毛四膜虫的种群增长损害的外部毒性。应用训练组中一种化合物与测试组中三种化合物的比率。结果表明,从一个已知的化学宇宙中(在这种情况下为385个衍生物),可以通过少量代表性化合物验证的少量多样化合物来开发出具有同等质量的稳健QSAR。作为保守的建议,建议对于QSAR中的每个变量至少应有10个观测值。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号