首页> 美国卫生研究院文献>Toxicology Research >Modelling compound cytotoxicity using conformal prediction and PubChem HTS data
【2h】

Modelling compound cytotoxicity using conformal prediction and PubChem HTS data

机译:使用保形预测和PubChem HTS数据模拟化合物的细胞毒性

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

The assessment of compound cytotoxicity is an important part of the drug discovery process. Accurate predictions of cytotoxicity have the potential to expedite decision making and save considerable time and effort. In this work we apply class conditional conformal prediction to model the cytotoxicity of compounds based on 16 high throughput cytotoxicity assays from PubChem. The data span 16 cell lines and comprise more than 440 000 unique compounds. The data sets are heavily imbalanced with only 0.8% of the tested compounds being cytotoxic. We trained one classification model for each cell line and validated the performance with respect to validity and accuracy. The generated models deliver high quality predictions for both toxic and non-toxic compounds despite the imbalance between the two classes. On external data collected from the same assay provider as one of the investigated cell lines the model had a sensitivity of 74% and a specificity of 65% at the 80% confidence level among the compounds assigned to a single class. Compared to previous approaches for large scale cytotoxicity modelling, this represents a balanced performance in the prediction of the toxic and non-toxic classes. The conformal prediction framework also allows the modeller to control the error frequency of the predictions, allowing predictions of cytotoxicity outcomes with confidence.
机译:化合物细胞毒性的评估是药物发现过程的重要组成部分。对细胞毒性的准确预测有可能加快决策速度,并节省大量时间和精力。在这项工作中,我们基于16种来自PubChem的高通量细胞毒性试验,应用了类别条件保形预测来模拟化合物的细胞毒性。数据跨越16个细胞系,包含440-000多种独特的化合物。数据集严重失衡,只有0.8%的测试化合物具有细胞毒性。我们为每种细胞系训练了一个分类模型,并验证了有效性和准确性方面的表现。尽管两类之间不平衡,生成的模型仍可以对有毒和无毒化合物提供高质量的预测。从与所研究细胞系之一相同的分析提供者处收集的外部数据,该模型在分配给单个类别的化合物中的80%置信度下,灵敏度为74%,特异性为65%。与以前的大规模细胞毒性建模方法相比,这代表了有毒和无毒类别预测中的平衡性能。保形预测框架还允许建模者控制预测的错误频率,从而可以放心地预测细胞毒性结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号