首页> 外文会议>International Conference on Algorithms for Computational Biology >BESTox: A Convolutional Neural Network Regression Model Based on Binary-Encoded SMILES for Acute Oral Toxicity Prediction of Chemical Compounds
【24h】

BESTox: A Convolutional Neural Network Regression Model Based on Binary-Encoded SMILES for Acute Oral Toxicity Prediction of Chemical Compounds

机译:BESTox:基于二进制编码SMILES的卷积神经网络回归模型,用于化合物的急性口服毒性预测

获取原文

摘要

Compound toxicity prediction is a very challenging and critical task in the drug discovery and design field. Traditionally, cell or animal-based experiments are required to confirm the acute oral toxicity of chemical compounds. However, these methods are often restricted by availability of experimental facilities, long experimentation time, and high cost. In this paper, we propose a novel convolutional neural network regression model, named BESTox, to predict the acute oral toxicity (LD_(50)) of chemical compounds. This model learns the compositional and chemical properties of compounds from their two-dimensional binary matrices. Each matrix encodes the occurrences of certain atom types, number of bonded hydrogens, atom charge, valence, ring, degree, aromaticity, chirality, and hybridization along the SMILES string of a given compound. In a benchmark experiment using a dataset of 7413 observations (train/test 5931/1482), BESTox achieved a squared correlation coefficient (R~2) of 0.619, root-mean-squared error (RMSE) of 0.603, and mean absolute error {MAE) of 0.433. Despite of the use of a shallow model architecture and simple molecular descriptors, our method performs comparably against two recently published models.
机译:在药物发现和设计领域,化合物毒性预测是一项非常具有挑战性和关键性的任务。传统上,需要基于细胞或动物的实验来确认化合物的急性口服毒性。但是,这些方法通常受到实验设备的可用性,实验时间长和成本高的限制。在本文中,我们提出了一种新的卷积神经网络回归模型,称为BESTox,用于预测化合物的急性口服毒性(LD_(50))。该模型从其二维二元矩阵中了解化合物的组成和化学性质。每个矩阵沿着给定化合物的SMILES字符串编码某些原子类型,键合氢的数量,原子电荷,化合价,环,度,芳香性,手性和杂化的出现。在使用7413个观测数据集(训练/测试5931/1482)的基准实验中,BESTox获得了0.619的平方相关系数(R〜2),0.603的均方根误差(RMSE)和平均绝对误差{ MAE)为0.433。尽管使用了浅层模型体系结构和简单的分子描述符,但我们的方法在两个最新发表的模型上的性能却相当。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号