首页> 美国卫生研究院文献>Toxicological Sciences >A Novel Framework for Predicting In Vivo Toxicities from In Vitro Data Using Optimal Methods for Dense and Sparse Matrix Reordering and Logistic Regression
【2h】

A Novel Framework for Predicting In Vivo Toxicities from In Vitro Data Using Optimal Methods for Dense and Sparse Matrix Reordering and Logistic Regression

机译:使用密集和稀疏矩阵重排序和逻辑回归的最佳方法从体外数据预测体内毒性的新框架

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

In this work, we combine the strengths of mixed-integer linear optimization (MILP) and logistic regression for predicting the in vivo toxicity of chemicals using only their measured in vitro assay data. The proposed approach utilizes a biclustering method based on iterative optimal reordering (DiMaggio, P. A., McAllister, S. R., Floudas, C. A., Feng, X. J., Rabinowitz, J. D., and Rabitz, H. A. (2008). Biclustering via optimal re-ordering of data matrices in systems biology: rigorous methods and comparative studies. BMC Bioinformatics 9, 458–474.; DiMaggio, P. A., McAllister, S. R., Floudas, C. A., Feng, X. J., Rabinowitz, J. D., and Rabitz, H. A. (2010b). A network flow model for biclustering via optimal re-ordering of data matrices. J. Global. Optim. 47, 343–354.) to identify biclusters corresponding to subsets of chemicals that have similar responses over distinct subsets of the in vitro assays. The biclustering of the in vitro assays is shown to result in significant clustering based on assay target (e.g., cytochrome P450 [CYP] and nuclear receptors) and type (e.g., downregulated BioMAP and biochemical high-throughput screening protein kinase activity assays). An optimal method based on mixed-integer linear optimization for reordering sparse data matrices (DiMaggio, P. A., McAllister, S. R., Floudas, C. A., Feng, X. J., Li, G. Y., Rabinowitz, J. D., and Rabitz, H. A. (2010a). Enhancing molecular discovery using descriptor-free rearrangement clustering techniques for sparse data sets. AIChE J. 56, 405–418.; McAllister, S. R., DiMaggio, P. A., and Floudas, C. A. (2009). Mathematical modeling and efficient optimization methods for the distance-dependent rearrangement clustering problem. J. Global. Optim. 45, 111–129) is then applied to the in vivo data set (21.7% sparse) in order to cluster end points that have similar lowest effect level (LEL) values, where it is observed that the end points are effectively clustered according to (1) animal species (i.e., the chronic mouse and chronic rat end points were clearly separated) and (2) similar physiological attributes (i.e., liver- and reproductive-related end points were found to separately cluster together). As the liver and reproductive end points exhibited the largest degree of correlation, we further analyzed them using regularized logistic regression in a rank-and-drop framework to identify which subset of in vitro features could be utilized for in vivo toxicity prediction. It was observed that the in vivo end points that had similar LEL responses over the 309 chemicals (as determined by the sparse clustering results) also shared a significant subset of selected in vitro descriptors. Comparing the significant descriptors between the two different categories of end points revealed a specificity of the CYP assays for the liver end points and preferential selection of the estrogen/androgen nuclear receptors by the reproductive end points.
机译:在这项工作中,我们结合了混合整数线性优化(MILP)和逻辑回归的优势,仅使用化学计量的体外测定数据即可预测化学物质的体内毒性。所提出的方法利用基于迭代最优重排序的双聚类方法(DiMaggio,PA,McAllister,SR,Floudas,CA,Feng,XJ,Rabinowitz,JD和Rabitz,HA(2008)。系统生物学研究:严格的方法和比较研究。BMC Bioinformatics 9,458-474 .; DiMaggio,PA,McAllister,SR,Floudas,CA,Feng,XJ,Rabinowitz,JD和Rabitz,HA(2010b)。网络流动通过对数据矩阵进行最佳重新排序来建立双簇模型(J. Global。Optim。47,343–354。),以识别与在体外测定的不同子集上具有相似响应的化学物质子集相对应的双簇。体外试验的二聚化显示基于试验目标(例如,细胞色素P450 [CYP]和核受体)和类型(例如,下调的BioMAP和生化高通量筛选蛋白激酶活性试验)导致明显的聚类。一种基于混合整数线性优化的稀疏数据矩阵重新排序的最佳方法(DiMaggio,PA,McAllister,SR,Floudas,CA,Feng,XJ,Li,GY,Rabinowitz,JD和Rabitz,HA(2010a)。 AIChE J. 56,405–418 .; McAllister,SR,DiMaggio,PA和Floudas,CA(2009)。使用无描述符的重排聚类技术发现稀疏数据集的发现。然后将重排聚类问题(J. Global。Optim。45,111–129)应用于体内数据集(稀疏21.7%),以便聚类具有相似最低效果水平(LEL)值的端点,观察到根据(1)动物种类(即,将慢性小鼠和慢性大鼠的终点明确分开)和(2)相似的生理属性(即,发现了与肝脏和生殖有关的终点)有效地对这些终点进行了聚类分开r一起)。由于肝脏和生殖终点之间显示出最大程度的相关性,因此我们在正负逻辑框架中使用正则逻辑回归对它们进行了进一步分析,以确定哪些子集的体外特征可用于体内毒性预测。观察到,在309种化学物质上具有相似的LEL反应的体内终点(由稀疏聚类结果确定)也共享了选定的体外描述子的重要子集。比较两种不同类别的终点之间的显着描述子,发现CYP分析对肝脏终点的特异性以及生殖终点对雌激素/雄激素核受体的优先选择。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号