...
首页> 外文期刊>Journal of chemical information and modeling >Application of the Random Forest Method in Studies of Local Lymph Node Assay Based Skin Sensitization Data
【24h】

Application of the Random Forest Method in Studies of Local Lymph Node Assay Based Skin Sensitization Data

机译:随机森林法在基于皮肤敏化数据的局部淋巴结分析研究中的应用

获取原文
获取原文并翻译 | 示例
           

摘要

The random forest and classification tree modeling methods are used to build predictive models of the skin sensitization activity of a chemical.A new two-stage backward elimination algorithm for descriptor selection in the random forest method is introduced.The predictive performance of the random forest model was maximized by tuning voting thresholds to reflect the unbalanced size of classification groups in available data.Our results show that random forest with a proposed backward elimination procedure outperforms a single classification tree and the standard random forest method in predicting Local Lymph Node Assay based skin sensitization activity.The proximity measure obtained from the random forest is a natural similarity measure that can be used for clustering of chemicals.Based on this measure,the clustering analysis partitioned the chemicals into several groups sharing similar molecular patterns.The improved random forest method demonstrates the potential for future QSAR studies based on a large number of descriptors or when the number of available data points is limited.
机译:随机森林和分类树建模方法用于建立化学品皮肤敏化活性的预测模型。介绍了一种新的两阶段反向消除算法,用于随机森林方法中的描述符选择。随机森林模型的预测性能通过调整投票阈值以在可用数据中反映分类组的不平衡大小来最大程度地发挥作用。从随机森林中获得的接近度度量是可以用于化学物质聚类的自然相似性度量。在此度量的基础上,聚类分析将化学物质分为几个共享相似分子模式的组。改进的随机森林方法证明了未来QSAR钉的潜力基于大量描述符或当可用数据点的数量受到限制时。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号