首页> 外文期刊>Journal of chemical information and modeling >Application of Random Forest Approach to QSAR Prediction of Aquatic Toxicity
【24h】

Application of Random Forest Approach to QSAR Prediction of Aquatic Toxicity

机译:随机森林法在水生毒性QSAR预测中的应用

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

This work is devoted to the application of the random forest approach to QSAR analysis of aquatic toxicity of chemical compounds tested on Tetrahymena pyriformis. The simplex representation of the molecular structure approach implemented in HiT QSAR Software was used for descriptors generation on a two-dimensional level. Adequate models based on simplex descriptors and the RF statistical approach were obtained on a modeling set of 644 compounds. Model predictivity was validated on two external test sets of 339 and 110 compounds. The high impact of lipophilicity and polarizability of investigated compounds on toxicity was determined. It was shown that RF models were tolerant for insertion of irrelevant descriptors as well as for randomization of some part of toxicity values that were representing a "noise". The fast procedure of optimization of the number of trees in the random forest has been proposed. The discussed RF model had comparable or better statistical characteristics than the corresponding PLS or KNN models.
机译:这项工作致力于将随机森林方法应用于在梨形四膜虫中测试的化合物的水生毒性的QSAR分析。 HiT QSAR软件中实现的分子结构方法的单纯形表示用于二维级别的描述符生成。在644种化合物的建模集中获得了基于单纯形描述符和RF统计方法的适当模型。在339个和110个化合物的两个外部测试集上验证了模型的可预测性。确定了所研究化合物的亲脂性和极化性对毒性的强烈影响。结果表明,RF模型可以容忍插入无关的描述子,并且可以将代表“噪声”的部分毒性值随机化。提出了优化随机森林中树木数量的快速程序。与相应的PLS或KNN模型相比,所讨论的RF模型具有可比或更好的统计特性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号