首页> 外文期刊>Systems biomedicine. >Statistical models for predicting liver toxicity from genomic data
【24h】

Statistical models for predicting liver toxicity from genomic data

机译:从基因组数据预测肝毒性的统计模型

获取原文
       

摘要

This paper outlines the construction of statistical models for liver pathology in rats and for drug induced liver injury. The envisioned purpose for these models would be to improve the cost of discovering compound toxicity in order to improve the overall cost of drug discovery. The size and breadth of the CAMDA liver toxicity data set presents unique opportunity to test whether statistical toxicity models can serve this purpose. The paper develops models for predicting toxicity from gene expression data. These models purposely exclude physiology and pathology data available in the CAMDA data. Physiology and pathology data require live rats and expensive time-consuming processing that are antithetical to the goal of reducing the time and cost required to determine compound toxicity. Two models are described. One employs Lasso regression and glmnet algorithm to extract models for rat liver pathology. The other employs stochastic gradient boosting to extract models for drug induced liver injury. This paper demonstrates that, given a data set of the size and quality of the CAMDA data, modern machine learning algorithms can extract high quality models—models with sufficient accuracy and specificity to serve the goal of reducing the costs of discovering compound toxicity.
机译:本文概述了大鼠肝脏病理和药物性肝损伤统计模型的构建。这些模型的预期目的将是提高发现化合物毒性的成本,以便提高药物发现的总体成本。 CAMDA肝毒性数据集的大小和广度为检验统计毒性模型是否可以达到此目的提供了独特的机会。本文开发了从基因表达数据预测毒性的模型。这些模型特意排除了CAMDA数据中可用的生理和病理数据。生理和病理学数据需要活的大鼠和昂贵的耗时的处理,这与减少确定化合物毒性所需的时间和成本的目标相反。描述了两个模型。一种方法是使用Lasso回归和glmnet算法提取大鼠肝脏病理模型。另一种采用随机梯度增强来提取药物引起的肝损伤的模型。本文证明,给定具有CAMDA数据大小和质量的数据集,现代机器学习算法可以提取高质量的模型-具有足够准确性和特异性的模型,以达到降低发现复合毒性的成本的目的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号