In silico prediction of toxic action mechanisms of phenols for imbalanced data with random forest learner

Chen J.; Tang Y.Y.; Fang B.; Guo C.

首页> 外文期刊>Journal of molecular graphics & modelling >In silico prediction of toxic action mechanisms of phenols for imbalanced data with random forest learner

【24h】

In silico prediction of toxic action mechanisms of phenols for imbalanced data with random forest learner

机译：随机森林学习者对不平衡数据进行酚类毒理作用机理的计算机模拟预测

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

With an increasing need for the rapid and effective safety assessment of compounds in industrial and civil-use products, in silico toxicity exploration techniques provide an economic way for environmental hazard assessment. The previous in silico researches have developed many quantitative structure-activity relationships models to predict toxicity mechanisms for last decade. Most of these methods benefit from data analysis and machine learning techniques, which rely heavily on the characteristics of data sets. For Tetrahymena pyriformis toxicity data sets, there is a great technical challenge - data imbalance. The skewness of data class distribution would greatly deteriorate the prediction performance on rare classes. Most of the previous researches for phenol mechanisms of toxic action prediction did not consider this practical problem. In this work, we dealt with the problem by considering the difference between the two types of misclassifications. Random Forest learner was employed in cost-sensitive learning framework to construct prediction models based on selected molecular descriptors. In computational experiments, both the global and local models obtained appreciable overall prediction accuracies. Particularly, the performance on rare classes was indeed promoted. Moreover, for practical usage of these models, the balance of the two misclassifications can be adjusted by using different cost matrices according to the application goals.

机译：随着对快速有效的工业和民用产品中化合物安全性评估的需求，计算机毒性探索技术为环境危害评估提供了一种经济途径。先前的计算机研究已经开发出许多定量的结构-活性关系模型来预测最近十年的毒性机理。这些方法大多数都受益于数据分析和机器学习技术，这些技术严重依赖于数据集的特征。对于梨形四膜虫毒性数据集，存在巨大的技术挑战-数据不平衡。数据类别分布的偏斜将大大降低对稀有类别的预测性能。以前关于苯酚毒性作用预测机理的大多数研究都没有考虑到这一实际问题。在这项工作中，我们通过考虑两种类型的错误分类之间的差异来处理该问题。随机森林学习器用于成本敏感的学习框架中，以基于选定的分子描述符构建预测模型。在计算实验中，全局模型和局部模型均获得了可观的总体预测精度。特别是，确实提高了稀有班级的表现。此外，对于这些模型的实际使用，可以根据应用目标通过使用不同的成本矩阵来调整两个错误分类的平衡。

著录项

来源
《Journal of molecular graphics & modelling》 |2012年第null期|共7页
作者
Chen J.; Tang Y.Y.; Fang B.; Guo C.;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类分子生物学;
关键词
Cost-sensitive; Phenols; QSAR; Random Forest; Toxic action mechanisms;

机译：成本敏感;酚;QSAR;随机森林;毒性作用机理;

相似文献

外文文献
中文文献
专利

1. In silico prediction of toxic action mechanisms of phenols for imbalanced data with random forest learner [J] . Chen J., Tang Y.Y., Fang B., Journal of molecular graphics & modelling . 2012,第Null期

机译：随机森林学习者对不平衡数据进行酚类毒理作用机理的计算机模拟预测
2. Connecting gene expression data from connectivity map and in silico target predictions for small molecule mechanism-of-action analysis [J] . Aakash Chavan Ravindranath, Nolen Perualila-Tan, Adetayo Kasim, Molecular BioSystems . 2015,第1期

机译：连接图谱中的基因表达数据和计算机模拟目标预测，以进行小分子作用机理分析
3. In Silico Prediction of Adverse Drug Reactions and Toxicities Based on Structural, Biological and Clinical Data [J] . Xin Liu, Zhe Shi, Ying Xue, Current drug safety . 2012,第3期

机译：基于结构，生物学和临床数据的药物不良反应和毒性的计算机模拟预测
4. Early Prediction of Sepsis Using Random Forest Classification for Imbalanced Clinical Data [C] . Simon Lyra, Steffen Leonhardt, Christoph Hoog Antink Computing in Cardiology Conference . 2019

机译：临床数据不平衡的随机森林分类对脓毒症的早期预测
5. An empirical study of random forests for mining imbalanced data. [D] . Golawala, Moiz M. 2007

机译：随机森林挖掘不平衡数据的实证研究。
6. Low-Quality Structural and Interaction Data Improves Binding Affinity Prediction via Random Forest [O] . Hongjian Li, Kwong-Sak Leung, Man-Hon Wong, 2015

机译：低质量的结构和相互作用数据可通过随机森林改善结合亲和力预测
7. Connecting gene expression data from connectivity map and in silico target predictions for small molecule mechanism-of-action analysis [O] . Chavan Ravindranath A, Perualila-Tan N, Kasim A, 2015

机译：连接图谱中的基因表达数据和计算机模拟目标预测，以进行小分子作用机理分析

In silico prediction of toxic action mechanisms of phenols for imbalanced data with random forest learner

摘要

著录项

相似文献

相关主题

期刊订阅